JP2019153160A

JP2019153160A - Digital signage device and program

Info

Publication number: JP2019153160A
Application number: JP2018038861A
Authority: JP
Inventors: 松本　征二; Seiji Matsumoto; 征二松本
Original assignee: Dai Nippon Printing Co Ltd
Current assignee: Dai Nippon Printing Co Ltd
Priority date: 2018-03-05
Filing date: 2018-03-05
Publication date: 2019-09-12

Abstract

To provide a digital signage device and the like for displaying information to meet needs of a person in the vicinity.SOLUTION: A digital signage device comprises: a display part 35 covered by a half mirror; a microphone 36 for acquiring a voice from the vicinity of the display part 35; a text acquisition part for acquiring a text based on the voice acquired by the microphone 36; and an output part for outputting a content based on the text acquired by the text acquisition part to the display part 35. The output part outputs a search result using the text as a key.SELECTED DRAWING: Figure 1

Description

本発明は、デジタルサイネージ装置およびプログラムに関する。 The present invention relates to a digital signage apparatus and a program.

工事現場に設置される仮囲い用の壁にディスプレイを設けたデジタルサイネージ装置が提案されている（特許文献１）。工事現場周辺の歩行者および近隣住民等に、工事情報、地域情報および広告等の情報を提供できる。 There has been proposed a digital signage device in which a display is provided on a temporary enclosure wall installed at a construction site (Patent Document 1). Information such as construction information, regional information and advertisements can be provided to pedestrians and neighbors around the construction site.

特開２０１７−２２７１１３号公報JP 2017-227113 A

しかしながら、特許文献１のデジタルサイネージ装置においては、装置の近くにいる人のニーズに合わせた情報を表示することはできない。 However, the digital signage device of Patent Document 1 cannot display information that meets the needs of people near the device.

一つの側面では、近くにいる人のニーズに合わせた情報を表示するデジタルサイネージ装置等を提供することを目的とする。 In one aspect, an object of the present invention is to provide a digital signage device that displays information according to the needs of people nearby.

デジタルサイネージ装置は、ハーフミラーにより覆われた表示部と、前記表示部の近傍から音声を取得するマイクと、前記マイクが取得した音声に基づくテキストを取得するテキスト取得部と、前記テキスト取得部が取得した前記テキストに基づくコンテンツを前記表示部に出力する出力部とを備える。 The digital signage device includes a display unit covered with a half mirror, a microphone that acquires voice from the vicinity of the display unit, a text acquisition unit that acquires text based on the voice acquired by the microphone, and the text acquisition unit And an output unit that outputs content based on the acquired text to the display unit.

一つの側面では、近くにいる人のニーズに合わせた情報を表示するデジタルサイネージ装置等を提供できる。 In one aspect, it is possible to provide a digital signage device that displays information according to the needs of people nearby.

デジタルサイネージシステムの概要を説明する説明図である。It is explanatory drawing explaining the outline | summary of a digital signage system. デジタルサイネージシステムの構成を示す説明図である。It is explanatory drawing which shows the structure of a digital signage system. 問合先ＤＢのレコードレイアウトを説明する説明図である。It is explanatory drawing explaining the record layout of inquiry DB. プログラムの処理の流れを説明するフローチャートである。It is a flowchart explaining the flow of a process of a program. コンテンツ取得のサブルーチンの処理の流れを説明するフローチャートである。It is a flowchart explaining the flow of processing of a subroutine for content acquisition. 実施の形態２のデジタルサイネージシステムの概要を説明する説明図である。It is explanatory drawing explaining the outline | summary of the digital signage system of Embodiment 2. FIG. 実施の形態２のデジタルサイネージシステムの概要を説明する説明図である。It is explanatory drawing explaining the outline | summary of the digital signage system of Embodiment 2. FIG. 実施の形態２のプログラムの処理の流れを説明するフローチャートである。10 is a flowchart for explaining a processing flow of a program according to the second embodiment. 議事録表示のサブルーチンの処理の流れを説明するフローチャートである。It is a flowchart explaining the flow of processing of a subroutine for displaying minutes. 実施の形態３のデジタルサイネージ装置の機能ブロック図である。6 is a functional block diagram of a digital signage apparatus according to Embodiment 3. FIG. 実施の形態４のデジタルサイネージシステムの構成を示す説明図である。FIG. 10 is an explanatory diagram illustrating a configuration of a digital signage system according to a fourth embodiment.

［実施の形態１］
図１は、デジタルサイネージシステム１０の概要を説明する説明図である。デジタルサイネージシステム１０は、壁に取り付けられた表示部３５、マイク３６およびスピーカ３７を含む。表示部３５は、たとえば液晶表示パネルまたは有機ＥＬ（electro-luminescence）表示パネルである。 [Embodiment 1]
FIG. 1 is an explanatory diagram for explaining the outline of the digital signage system 10. The digital signage system 10 includes a display unit 35, a microphone 36, and a speaker 37 attached to a wall. The display unit 35 is, for example, a liquid crystal display panel or an organic EL (electro-luminescence) display panel.

表示部３５の表面は、アクリル板またはガラス板等の、透光性板により覆われている。透光性板は、一面に薄い反射膜を有するハーフミラーである。表示部３５の表示面と、透光性板とは、いわゆるダイレクトボンディングにより接合しても良い。表示面と透光性板とを、空気層を介さずに直接接合することにより、透光性板と表示面との間の光の反射による画質低下を防止できる。 The surface of the display unit 35 is covered with a translucent plate such as an acrylic plate or a glass plate. The translucent plate is a half mirror having a thin reflective film on one surface. The display surface of the display unit 35 and the translucent plate may be joined by so-called direct bonding. By directly joining the display surface and the translucent plate without interposing an air layer, it is possible to prevent deterioration in image quality due to light reflection between the translucent plate and the display surface.

マイク３６は、複数のマイクロフォンを含むアレイマイクであり、感度および音声検出領域を適宜調整できる。スピーカ３７の音量も適宜調整できる。マイク３６およびスピーカ３７は、表示部３５に一体に構成されていても良い。 The microphone 36 is an array microphone including a plurality of microphones, and can adjust sensitivity and sound detection area as appropriate. The volume of the speaker 37 can also be adjusted as appropriate. The microphone 36 and the speaker 37 may be configured integrally with the display unit 35.

表示部３５を覆う透光性板に、表示部３５よりも大きい板を使用し、透光性板の裏面の表示部３５で覆われていない部分にエキサイターを固定してスピーカ３７を構成しても良い。表示部３５全体から、音を発生するスピーカ３７を実現できる。マイク３６およびスピーカ３７は、天井等に取り付けられていても良い。 A plate larger than the display unit 35 is used as the translucent plate that covers the display unit 35, and an exciter is fixed to a portion of the rear surface of the translucent plate that is not covered with the display unit 35 to configure the speaker 37. Also good. A speaker 37 that generates sound can be realized from the entire display unit 35. The microphone 36 and the speaker 37 may be attached to a ceiling or the like.

本実施の形態のデジタルサイネージシステム１０は、たとえば駅のコンコース、バス停、公共施設の内壁、または、道路に面した建物の外壁等の、様々な場所に設置して使用される。 The digital signage system 10 according to the present embodiment is installed and used in various places such as a concourse of a station, a bus stop, an inner wall of a public facility, or an outer wall of a building facing a road.

図１Ａは、動作していない場合のデジタルサイネージシステム１０を示す。表示部３５を覆う透光性板により光が反射され、表示部３５は鏡の機能を果たす。これにより、空間を広く見せる演出が可能である。 FIG. 1A shows the digital signage system 10 when not in operation. Light is reflected by the translucent plate covering the display unit 35, and the display unit 35 functions as a mirror. Thereby, it is possible to produce a wide space.

図１Ｂは、動作している場合のデジタルサイネージシステム１０を示す。図１Ｂにおいては表示部３５の前に立ったユーザが「天気」と発声した場合を示す。マイク３６がユーザの声を検出して、天気情報を取得し、表示部３５に表示する。さらにスピーカ３７から「現在の天気は曇りです」という音声が出力される。 FIG. 1B shows the digital signage system 10 when operating. FIG. 1B shows a case where a user standing in front of the display unit 35 utters “weather”. The microphone 36 detects the voice of the user, acquires weather information, and displays it on the display unit 35. Further, a sound “Current weather is cloudy” is output from the speaker 37.

たとえばユーザが「札幌の天気」と発声した場合には、札幌の天気が表示部３５およびスピーカ３７から出力される。ユーザが「明日の大阪の天気」と発声した場合には、翌日の大阪の天気が表示部３５およびスピーカ３７から出力される。同様に、ユーザの発声に応じて、たとえば株価、交通機関の運行情報、地図、ニュース等の、様々なコンテンツが表示部３５およびスピーカから出力される。 For example, when the user utters “Sapporo weather”, the Sapporo weather is output from the display unit 35 and the speaker 37. When the user utters “Tomorrow's Osaka weather”, the next day's Osaka weather is output from the display unit 35 and the speaker 37. Similarly, various contents such as stock prices, transportation operation information, maps, news, and the like are output from the display unit 35 and the speaker according to the user's utterance.

以上のように、本実施の形態のデジタルサイネージシステム１０は、個々のユーザに対してニーズに応じた情報を提供し、非使用時には鏡になる。 As described above, the digital signage system 10 according to the present embodiment provides information according to needs to individual users and becomes a mirror when not in use.

図２は、デジタルサイネージシステム１０の構成を示す説明図である。デジタルサイネージシステム１０は、インターネット、公衆通信回線またはＬＡＮ（Local Area Network）等のネットワークを介して接続されたデジタルサイネージ装置１１、音声認識サーバ３０および分野サーバ４２を備える。 FIG. 2 is an explanatory diagram showing the configuration of the digital signage system 10. The digital signage system 10 includes a digital signage device 11, a voice recognition server 30, and a field server 42 connected via a network such as the Internet, a public communication line, or a LAN (Local Area Network).

デジタルサイネージ装置１１は、前述の表示部３５、マイク３６およびスピーカ３７に加えて情報処理装置２０を備える。本実施の形態の情報処理装置２０は、表示部３５の背面に固定可能な小型コンピュータである。情報処理装置２０は、汎用のパソコン、タブレットまたはスマートフォン等であっても良い。 The digital signage device 11 includes an information processing device 20 in addition to the display unit 35, the microphone 36, and the speaker 37 described above. The information processing apparatus 20 according to the present embodiment is a small computer that can be fixed to the back surface of the display unit 35. The information processing apparatus 20 may be a general-purpose personal computer, a tablet, a smartphone, or the like.

情報処理装置２０は、第１ＣＰＵ（Central Processing Unit）２１、主記憶装置２２、補助記憶装置２３、通信部２４、表示Ｉ／Ｆ（Interface）２５、マイクＩ／Ｆ２６、スピーカＩ／Ｆ２７およびバスを備える。 The information processing device 20 includes a first CPU (Central Processing Unit) 21, a main storage device 22, an auxiliary storage device 23, a communication unit 24, a display I / F (Interface) 25, a microphone I / F 26, a speaker I / F 27, and a bus. Prepare.

第１ＣＰＵ２１は、本実施の形態にかかるプログラムを実行する演算制御装置である。第１ＣＰＵ２１には、一または複数のＣＰＵまたはマルチコアＣＰＵ等が使用される。第１ＣＰＵ２１は、バスを介して情報処理装置２０を構成するハードウェア各部と接続されている。 The first CPU 21 is an arithmetic control device that executes a program according to the present embodiment. As the first CPU 21, one or a plurality of CPUs, a multi-core CPU, or the like is used. The first CPU 21 is connected to each hardware unit constituting the information processing apparatus 20 via a bus.

主記憶装置２２は、ＳＲＡＭ（Static Random Access Memory）、ＤＲＡＭ（Dynamic Random Access Memory）、フラッシュメモリ等の記憶装置である。主記憶装置２２には、第１ＣＰＵ２１が行なう処理の途中で必要な情報および第１ＣＰＵ２１で実行中のプログラムが一時的に保存される。 The main storage device 22 is a storage device such as an SRAM (Static Random Access Memory), a DRAM (Dynamic Random Access Memory), or a flash memory. The main storage device 22 temporarily stores information necessary during the processing performed by the first CPU 21 and a program being executed by the first CPU 21.

補助記憶装置２３は、ＳＲＡＭ、フラッシュメモリ、ハードディスクまたは磁気テープ等の記憶装置である。補助記憶装置２３には、第１ＣＰＵ２１に実行させるプログラム、問合先ＤＢ５１およびプログラムの実行に必要な各種情報が保存される。 The auxiliary storage device 23 is a storage device such as an SRAM, a flash memory, a hard disk, or a magnetic tape. The auxiliary storage device 23 stores a program to be executed by the first CPU 21, an inquiry DB 51, and various information necessary for executing the program.

問合先ＤＢ５１は、ネットワーク等を介して情報処理装置２０に接続された別の記憶装置に記憶されても良い。問合先ＤＢ５１の詳細については後述する。通信部２４は、ネットワークとの通信を行なうインターフェイスである。 The inquiry DB 51 may be stored in another storage device connected to the information processing apparatus 20 via a network or the like. Details of the contact DB 51 will be described later. The communication unit 24 is an interface that performs communication with a network.

表示Ｉ／Ｆ２５は、表示部３５とバスとを接続するインターフェイスである。マイクＩ／Ｆ２６は、マイク３６とバスとを接続するインターフェイスである。スピーカＩ／Ｆ２７は、スピーカ３７とバスとを接続するＩ／Ｆである。 The display I / F 25 is an interface that connects the display unit 35 and the bus. The microphone I / F 26 is an interface that connects the microphone 36 and the bus. The speaker I / F 27 is an I / F that connects the speaker 37 and the bus.

表示部３５と表示Ｉ／Ｆ２５との間、マイク３６とマイクＩ／Ｆ２６との間、および、スピーカ３７とスピーカＩ／Ｆ２７との間は、ケーブル、または、Ｂｌｕｅｔｏｏｔｈ（登録商標）等の無線通信により接続される。 Wireless communication such as a cable or Bluetooth (registered trademark) between the display unit 35 and the display I / F 25, between the microphone 36 and the microphone I / F 26, and between the speaker 37 and the speaker I / F 27. Connected by.

音声認識サーバ３０は、ネットワークを介して音声データを取得し、音声認識によりテキストデータに変換するサーバである。音声認識サーバ３０は、第２ＣＰＵ３１、主記憶装置３２、補助記憶装置３３および通信部３４を有する。 The voice recognition server 30 is a server that acquires voice data via a network and converts it into text data by voice recognition. The voice recognition server 30 includes a second CPU 31, a main storage device 32, an auxiliary storage device 33, and a communication unit 34.

第２ＣＰＵ３１は、本実施の形態にかかるプログラムを実行する演算制御装置である。第２ＣＰＵ３１には、一または複数のＣＰＵまたはマルチコアＣＰＵ等が使用される。第２ＣＰＵ３１は、バスを介して音声認識サーバ３０を構成するハードウェア各部と接続されている。 The second CPU 31 is an arithmetic control device that executes a program according to the present embodiment. As the second CPU 31, one or a plurality of CPUs, a multi-core CPU, or the like is used. The second CPU 31 is connected to each hardware part constituting the voice recognition server 30 via a bus.

主記憶装置３２は、ＳＲＡＭ、ＤＲＡＭ、フラッシュメモリ等の記憶装置である。主記憶装置３２には、第２ＣＰＵ３１が行なう処理の途中で必要な情報および第２ＣＰＵ３１で実行中のプログラムが一時的に保存される。 The main storage device 32 is a storage device such as SRAM, DRAM, or flash memory. The main storage device 32 temporarily stores information necessary during the processing performed by the second CPU 31 and a program being executed by the second CPU 31.

補助記憶装置３３は、ＳＲＡＭ、フラッシュメモリ、ハードディスクまたは磁気テープ等の記憶装置である。補助記憶装置３３には、第２ＣＰＵ３１に実行させるプログラム、およびプログラムの実行に必要な各種情報が保存される。 The auxiliary storage device 33 is a storage device such as an SRAM, a flash memory, a hard disk, or a magnetic tape. The auxiliary storage device 33 stores a program to be executed by the second CPU 31 and various information necessary for executing the program.

本実施の形態の音声認識サーバ３０は汎用のパーソナルコンピュータ、サーバマシン等の情報処理装置である。また、本実施の形態の音声認識サーバ３０は、大型計算機上で動作する仮想マシンでも良い。 The voice recognition server 30 of this embodiment is an information processing apparatus such as a general-purpose personal computer or server machine. Further, the voice recognition server 30 according to the present embodiment may be a virtual machine that operates on a large computer.

分野サーバ４２は、たとえば天気サーバ４２１、株価サーバ４２２、図示を省略する、交通機関の運行情報サーバ、地図サーバ、ニュースサーバ等の、様々なコンテンツ提供サーバを含む。それぞれの分野サーバ４２は、インターネットを介して一般ユーザに公開されたサーバであっても、本実施の形態のデジタルサイネージシステム１０の運営事業者が管理するサーバであっても良い。 The field server 42 includes various content providing servers such as a weather server 421, a stock price server 422, a transportation operation information server, a map server, a news server, etc. (not shown). Each field server 42 may be a server opened to general users via the Internet or a server managed by an operator of the digital signage system 10 according to the present embodiment.

図３は、問合先ＤＢ５１のレコードレイアウトを説明する説明図である。問合先ＤＢ５１は、デジタルサイネージシステム１０が出力するコンテンツの分野と、コンテンツを取得する取得元とを関連づけて記録するデータベースである。 FIG. 3 is an explanatory diagram illustrating the record layout of the inquiry destination DB 51. The inquiry DB 51 is a database that records the field of content output by the digital signage system 10 in association with the acquisition source from which the content is acquired.

問合先ＤＢ５１は、分野フィールド、取得元フィールドおよび書式フィールドを有する。分野フィールドにはコンテンツの分野が記録されている。取得元フィールドには、コンテンツを取得する取得元の分野サーバ４２のＵＲＬ（Uniform Resource Locator）が記録されている。書式フィールドには、分野サーバから必要な情報を取得する際に用いるパラメータ等の書式が記録されている。 The inquiry DB 51 has a field field, an acquisition source field, and a format field. The field of the content is recorded in the field of field. In the acquisition source field, the URL (Uniform Resource Locator) of the field server 42 from which the content is acquired is recorded. In the format field, a format such as a parameter used when acquiring necessary information from the field server is recorded.

なお、問合先ＤＢ５１には、コンテンツを取得する際に用いるＡＰＩ（Application Programming Interface）とパラメータ等が記録されていても良い。 The inquiry destination DB 51 may record an API (Application Programming Interface) and parameters used when acquiring content.

図４は、プログラムの処理の流れを説明するフローチャートである。なお、プログラムを開始する段階では、表示部３５にはコンテンツが出力されておらず、鏡にみえる状態である。 FIG. 4 is a flowchart for explaining the flow of processing of the program. Note that at the stage of starting the program, no content is output on the display unit 35, and the screen appears to be a mirror.

第１ＣＰＵ２１は、マイク３６を介して音声データを取得する（ステップＳ５０１）。なお、前述のとおりマイク３６は、複数のマイクロフォンを含むアレイマイクである。マイク３６は、音声をＡ／Ｄ（Analog/Digital）変換した音声データをバスに出力する。マイク３６は、それぞれのマイクロフォンが受信した音声の波形を所定の時間差で合成することにより、表示部３５の正面の所定の領域で発声された音声を増幅して取得できる。マイク３６は、それぞれのマイクロフォンが受信した音声をＡ／Ｄ変換した音声データをバスに出力し、第１ＣＰＵが所定の時間差で音声データを合成しても良い。 The first CPU 21 acquires audio data through the microphone 36 (step S501). As described above, the microphone 36 is an array microphone including a plurality of microphones. The microphone 36 outputs audio data obtained by A / D (Analog / Digital) conversion of audio to the bus. The microphones 36 can amplify and acquire the voice uttered in a predetermined area in front of the display unit 35 by synthesizing the waveforms of the sounds received by the respective microphones with a predetermined time difference. The microphone 36 may output audio data obtained by A / D converting audio received by each microphone to the bus, and the first CPU may synthesize the audio data with a predetermined time difference.

第１ＣＰＵ２１は取得した音声データを主記憶装置２２または補助記憶装置２３に一時的に記憶する。第１ＣＰＵ２１は、音声データを解析して日本語、英語、または、中国語等、どの言語の音声であるかを判定する言語判定を行なう（ステップＳ５０２）。言語判定を行なう方法は、従来から知られているため、詳細な説明については省略する。 The first CPU 21 temporarily stores the acquired audio data in the main storage device 22 or the auxiliary storage device 23. The first CPU 21 analyzes the audio data and performs language determination for determining which language, such as Japanese, English, or Chinese, is used (step S502). Since the method for performing language determination has been conventionally known, detailed description thereof is omitted.

第１ＣＰＵ２１は、言語判定が成功したか否かを判定する（ステップＳ５０３）。たとえば、取得した音声の音量が小さい場合、または、車の騒音等言語以外の音声である場合には、どの言語であるかの判定を行なえないため、第１ＣＰＵ２１は、言語判定は失敗したと判定する。言語判定が失敗したと判定した場合（ステップＳ５０３でＮＯ）、第１ＣＰＵ２１はステップＳ５０１に戻る。 The first CPU 21 determines whether or not the language determination is successful (step S503). For example, when the volume of the acquired voice is low, or when the voice is a language other than a language such as a car noise, the first CPU 21 determines that the language determination has failed because the language cannot be determined. To do. If it is determined that the language determination has failed (NO in step S503), the first CPU 21 returns to step S501.

言語判定に成功したと判定した場合（ステップＳ５０３でＹＥＳ）、第１ＣＰＵ２１は主記憶装置２２または補助記憶装置２３に一時的に記憶した音声データ、および、ステップＳ５０２で判定した言語の種類を音声認識サーバ３０に送信する（ステップＳ５０４）。 If it is determined that the language determination is successful (YES in step S503), the first CPU 21 recognizes the voice data temporarily stored in the main storage device 22 or the auxiliary storage device 23 and the language type determined in step S502. It transmits to the server 30 (step S504).

第２ＣＰＵ３１は、送信された音声データおよび言語を受信する（ステップＳ６０１）。第２ＣＰＵ３１は、送信された音声データの言語認識を行い、テキストデータに変換する（ステップＳ６０２）。言語認識は従来から広く行なわれているため、詳細な説明については省略する。なお、音声データと共に取得した言語種類に基づいて言語認識を行なうことにより、第２ＣＰＵ３１は短い処理時間で、正確が言語認識を行なえる。第２ＣＰＵ３１は、ステップＳ６０１で受信した音声データに対応するテキストデータを、情報処理装置２０に送信する（ステップＳ６０３）。 The second CPU 31 receives the transmitted voice data and language (step S601). The second CPU 31 performs language recognition of the transmitted voice data and converts it into text data (step S602). Since language recognition has been widely performed in the past, detailed description is omitted. In addition, by performing language recognition based on the language type acquired together with the voice data, the second CPU 31 can accurately perform language recognition in a short processing time. The second CPU 31 transmits text data corresponding to the voice data received in step S601 to the information processing apparatus 20 (step S603).

第１ＣＰＵ２１は、テキストデータを受信する（ステップＳ５１０）。第１ＣＰＵ２１は、テキストデータの意味を解析する（ステップＳ５１１）。意味の解析は、たとえば形態素解析等の自然言語処理手法により行なわれる。テキストの意味解析を行なう方法は、従来から知られているため、詳細な説明については省略する。 The first CPU 21 receives text data (step S510). The first CPU 21 analyzes the meaning of the text data (step S511). The semantic analysis is performed by a natural language processing technique such as morphological analysis. Since a method for analyzing the meaning of text has been conventionally known, a detailed description thereof will be omitted.

第１ＣＰＵ２１は、意味解析に成功したか否かを判定する（ステップＳ５１２）。たとえば、ステップＳ５１０で受信したテキストに、ステップＳ５０２で判定した言語の単語が含まれていない場合に、第１ＣＰＵ２１は意味解析に成功していないと判定する。意味解析に成功していないと判定した場合（ステップＳ５１２でＮＯ）、第１ＣＰＵ２１はステップＳ５０１に戻る。 The first CPU 21 determines whether or not the semantic analysis has succeeded (step S512). For example, if the text received in step S510 does not include the word in the language determined in step S502, the first CPU 21 determines that the semantic analysis has not been successful. If it is determined that the semantic analysis has not been successful (NO in step S512), the first CPU 21 returns to step S501.

意味解析に成功したと判定した場合（ステップＳ５１２でＹＥＳ）、第１ＣＰＵ２１はコンテンツ取得のサブルーチンを起動する（ステップＳ５１３）。コンテンツ取得のサブルーチンは、テキストデータに基づいて表示部３５およびスピーカ３７から出力するコンテンツを取得するサブルーチンである。コンテンツ取得のサブルーチンの処理の流れは後述する。 If it is determined that the semantic analysis is successful (YES in step S512), the first CPU 21 starts a content acquisition subroutine (step S513). The content acquisition subroutine is a subroutine for acquiring content output from the display unit 35 and the speaker 37 based on text data. The processing flow of the content acquisition subroutine will be described later.

第１ＣＰＵ２１は、取得したコンテンツを表示部３５およびスピーカ３７から出力する（ステップＳ５１４）。第１ＣＰＵ２１は、処理を終了するか否かを判定する（ステップＳ５１５）。たとえば、あらかじめ設定された運用終了時刻になった場合、または、ネットワークを介して管理者から運用終了の指示を受け付けた場合に、第１ＣＰＵ２１は処理を終了すると判定する。 The first CPU 21 outputs the acquired content from the display unit 35 and the speaker 37 (step S514). The first CPU 21 determines whether or not to end the process (step S515). For example, when the operation end time set in advance is reached, or when an operation end instruction is received from the administrator via the network, the first CPU 21 determines to end the process.

処理を終了すると判定した場合（ステップＳ５１５でＹＥＳ）、第１ＣＰＵ２１は処理を終了する。処理を終了しないと判定した場（ステップＳ５１５でＮＯ）、第１ＣＰＵ２１はステップＳ５０１に戻る。 If it is determined that the process is to be ended (YES in step S515), the first CPU 21 ends the process. If it is determined not to end the process (NO in step S515), the first CPU 21 returns to step S501.

図５は、コンテンツ取得のサブルーチンの処理の流れを説明するフローチャートである。コンテンツ取得のサブルーチンは、テキストデータに基づいて表示部３５およびスピーカ３７から出力するコンテンツを取得するサブルーチンである。 FIG. 5 is a flowchart for explaining the flow of processing of the content acquisition subroutine. The content acquisition subroutine is a subroutine for acquiring content output from the display unit 35 and the speaker 37 based on text data.

第１ＣＰＵ２１は、テキストを意味解析した結果に基づいて表示するコンテンツの分野を判定する（ステップＳ５２１）。たとえば、天気に関する質問であると判定した場合に、第１ＣＰＵ２１は分野は天気であると判定する。 The first CPU 21 determines the field of the content to be displayed based on the result of the semantic analysis of the text (step S521). For example, when it is determined that the question is related to the weather, the first CPU 21 determines that the field is weather.

第１ＣＰＵ２１は、判定した分野をキーとして問合先ＤＢ５１を検索して、レコードを抽出する。第１ＣＰＵ２１は、抽出したレコードの書式フィールドに基づいて、コンテンツを取得するコマンドを生成する（ステップＳ５２２）。なお第１ＣＰＵ２１は、ユーザが使用した言語を用いたコンテンツを取得するコマンドを生成することが望ましい。 The first CPU 21 searches the inquiry DB 51 using the determined field as a key, and extracts a record. The first CPU 21 generates a command for acquiring content based on the format field of the extracted record (step S522). Note that the first CPU 21 desirably generates a command for acquiring content using the language used by the user.

第１ＣＰＵ２１は、抽出したレコードの取得元フィールドに記録されたＵＲＬ宛にコマンドを送信する。第１ＣＰＵ２１は、コマンドを受信した分野サーバ４２は、コマンドに基づいて検索処理を行い、検索結果であるコンテンツを送信する。第１ＣＰＵ２１は、コンテンツを取得する（ステップＳ５２３）。第１ＣＰＵ２１は、処理を終了する。 The first CPU 21 transmits a command to the URL recorded in the acquisition source field of the extracted record. The first CPU 21 receives the command, the field server 42 performs search processing based on the command, and transmits the content that is the search result. The first CPU 21 acquires content (step S523). The first CPU 21 ends the process.

第１ＣＰＵ２１が取得するコンテンツは、「現在の天気は曇りです」のようなテキスト情報、天気図のような静止画、過去２４時間の天気図の移り変わり等の動画等、任意の形式のコンテンツを使用できる。どのような形式のコンテンツを取得するかの指定を、ステップＳ５２２で作成するコマンドに含めても良い。 The content acquired by the first CPU 21 uses content in an arbitrary format such as text information such as “current weather is cloudy”, a still image such as a weather map, and a moving image such as a transition of the weather map for the past 24 hours. it can. Specification of what type of content is to be acquired may be included in the command created in step S522.

スピーカ３７は、鋭い指向性を持ち特定の位置にいる人だけに音が聞こえるようにするパラメトリック・スピーカであっても良い。たとえば図書館または美術館等の、静粛な場所に設置可能なデジタルサイネージシステム１０を提供できる。 The speaker 37 may be a parametric speaker that has a sharp directivity and allows only a person at a specific position to hear a sound. For example, the digital signage system 10 that can be installed in a quiet place such as a library or an art museum can be provided.

デジタルサイネージ装置１１は、人感センサを備えても良い。たとえば、一定時間以上表示部の前で立ち止まっているユーザを検知した場合、第１ＣＰＵ２１は「御質問があれば、話して下さい」等の文字および音声を出力して、ユーザに入力を促しても良い。 The digital signage device 11 may include a human sensor. For example, when the first CPU 21 detects a user who has stopped in front of the display unit for a certain period of time, the first CPU 21 may output characters and voices such as “Please speak if there is a question” to prompt the user to input. good.

表示部３５は、表面にタッチセンサを備えるタッチパネルであっても良い。音声に加えて、タッチ操作によりユーザが入力を行なえるデジタルサイネージ装置１１を提供できる。 The display unit 35 may be a touch panel including a touch sensor on the surface. In addition to voice, the digital signage device 11 that allows the user to input by touch operation can be provided.

本実施の形態によると、近くにいる人のニーズに合わせた情報を表示するデジタルサイネージ装置を提供できる。本実施の形態によると、非使用時には鏡にみえて、空間を広く見せる効果を有するデジタルサイネージ装置１１を提供できる。 According to this embodiment, it is possible to provide a digital signage device that displays information according to the needs of people nearby. According to the present embodiment, it is possible to provide the digital signage device 11 having an effect of making the space look wide when not in use.

本実施の形態によると、図４を使用して説明したステップＳ５０２においてどの言語の音声であるかを判定した後に、音声認識サーバ３０において言語認識を行なう。これにより、言語以外の音を処理するため、および、様々な言語である場合の処理を行なうため音声認識サーバ３０のリソースが消費されることを防ぐことができる。以上により、ユーザの発声から、コンテンツの表示までの時間が短く、ユーザが快適に使用できるデジタルサイネージシステム１０を提供できる。 According to the present embodiment, the speech recognition server 30 performs language recognition after determining which language is the speech in step S502 described with reference to FIG. Accordingly, it is possible to prevent the resources of the speech recognition server 30 from being consumed for processing sounds other than languages and for processing in the case of various languages. As described above, it is possible to provide the digital signage system 10 that can be used comfortably by the user with a short time from the user's voice to the display of the content.

本実施の形態によると、ネットワークを介して様々な分野サーバ４２からコンテンツを取得するため、ユーザの使用する言語に応じたコンテンツを表示できる。マイク３６にアレイマイクを使用することにより、騒音のある環境であっても表示部３５の前にいるユーザの声を増幅して取得するデジタルサイネージシステム１０を提供できる。 According to the present embodiment, since content is acquired from various field servers 42 via a network, it is possible to display content according to the language used by the user. By using an array microphone as the microphone 36, it is possible to provide the digital signage system 10 that amplifies and acquires the voice of the user in front of the display unit 35 even in a noisy environment.

［実施の形態２］
本実施の形態は、ユーザの声をテキストに変換して表示部３５に表示する議事録モードを有するデジタルサイネージシステム１０に関する。実施の形態１と共通する部分については、説明を省略する。 [Embodiment 2]
The present embodiment relates to a digital signage system 10 having a minutes mode in which a user's voice is converted into text and displayed on a display unit 35. Description of portions common to the first embodiment is omitted.

図６および図７は、実施の形態２のデジタルサイネージシステムの概要を説明する説明図である。図６は、二人のユーザが表示部３５の前に立って日本語で会話をしている。会話の内容がテキストに変換されて、表示部３５に表示される。 6 and 7 are explanatory diagrams for explaining the outline of the digital signage system according to the second embodiment. In FIG. 6, two users are standing in front of the display unit 35 and talking in Japanese. The contents of the conversation are converted into text and displayed on the display unit 35.

図７においては、ユーザは英語で会話をしている。表示部３５には、英語のテキストが表示されている。なお、表示部３５に表示されたテキストは、議事録モードの終了後、メール等によりユーザに送信される。 In FIG. 7, the user has a conversation in English. The display unit 35 displays English text. Note that the text displayed on the display unit 35 is transmitted to the user by e-mail or the like after the minutes mode ends.

図８は、実施の形態２のプログラムの処理の流れを説明するフローチャートである。ステップＳ５１２までは、図１を使用して説明した実施の形態１のプログラムの処理の流れと同一であるため、説明を省略する。 FIG. 8 is a flowchart for explaining the processing flow of the program according to the second embodiment. Steps up to step S512 are the same as the process flow of the program according to the first embodiment described with reference to FIG.

意味解析に成功していないと判定した場合（ステップＳ５１２でＮＯ）、第１ＣＰＵ２１はステップＳ５０１に戻る。意味解析に成功したと判定した場合（ステップＳ５１２でＹＥＳ）、第１ＣＰＵ２１は、テキストが議事録モードの起動を意味するか否かを判定する（ステップＳ５３１）。 If it is determined that the semantic analysis has not been successful (NO in step S512), the first CPU 21 returns to step S501. If it is determined that the semantic analysis is successful (YES in step S512), the first CPU 21 determines whether or not the text means activation of the minutes mode (step S531).

議事録モードの起動を意味すると判定した場合（ステップＳ５３１でＹＥＳ）、第１ＣＰＵ２１は議事録表示のサブルーチンを起動する（ステップＳ５３２）。議事録表示のサブルーチンは、ユーザの音声をテキスト変換して、表示部３５に表示するサブルーチンである。議事録表示のサブルーチンの処理の流れについては、後述する。 If it is determined that it means the start of the minutes mode (YES in step S531), the first CPU 21 starts a minutes display subroutine (step S532). The minutes display subroutine is a subroutine for converting the user's voice into text and displaying it on the display unit 35. The process flow of the minutes display subroutine will be described later.

議事録モードの起動を意味しないと判定した場合（ステップＳ５３１でＮＯ）、第１ＣＰＵ２１はコンテンツ取得のサブルーチンを起動する（ステップＳ５１３）。コンテンツ取得のサブルーチンは、図５を使用して説明したサブルーチンと同一のサブルーチンである。 If it is determined that it does not mean that the minutes mode is activated (NO in step S531), the first CPU 21 activates a content acquisition subroutine (step S513). The content acquisition subroutine is the same as the subroutine described with reference to FIG.

第１ＣＰＵ２１は、取得したコンテンツを表示部３５およびスピーカ３７から出力する（ステップＳ５１４）。ステップＳ５１４またはステップＳ５３２の終了後、第１ＣＰＵ２１は、処理を終了するか否かを判定する（ステップＳ５１５）。 The first CPU 21 outputs the acquired content from the display unit 35 and the speaker 37 (step S514). After step S514 or step S532, the first CPU 21 determines whether or not to end the process (step S515).

図９は、議事録表示のサブルーチンの処理の流れを説明するフローチャートである。議事録表示のサブルーチンは、ユーザの音声をテキスト変換して、表示部３５に表示するサブルーチンである。 FIG. 9 is a flowchart for explaining the processing flow of the minutes display subroutine. The minutes display subroutine is a subroutine for converting the user's voice into text and displaying it on the display unit 35.

第１ＣＰＵ２１は、マイク３６を介して音声データを取得する（ステップＳ５４１）。第１ＣＰＵ２１は取得した音声データを主記憶装置２２または補助記憶装置２３に一時的に記憶する。第１ＣＰＵ２１は、音声データを解析して日本語、英語、または、中国語等、どの言語の音声であるかを判定する言語判定を行なう（ステップＳ５４２）。 The first CPU 21 acquires audio data through the microphone 36 (step S541). The first CPU 21 temporarily stores the acquired audio data in the main storage device 22 or the auxiliary storage device 23. The first CPU 21 analyzes the audio data and performs language determination for determining which language, such as Japanese, English, Chinese, or the like (step S542).

第１ＣＰＵ２１は、言語判定が成功したか否かを判定する（ステップＳ５４３）。言語判定が失敗したと判定した場合（ステップＳ５４３でＮＯ）、第１ＣＰＵ２１はステップＳ５４１に戻る。 The first CPU 21 determines whether or not the language determination is successful (step S543). If it is determined that the language determination has failed (NO in step S543), the first CPU 21 returns to step S541.

言語判定に成功したと判定した場合（ステップＳ５４３でＹＥＳ）、第１ＣＰＵ２１は音声の話者を識別する（ステップＳ５４４）。ここで識別とは、表示部３５の前にいる数名程度の話者のうち、どの話者が発声したかを識別することを意味する。 If it is determined that the language determination has succeeded (YES in step S543), the first CPU 21 identifies a speaker of speech (step S544). Here, the identification means identifying which speaker has spoken out of about several speakers in front of the display unit 35.

たとえば、第１ＣＰＵ２１は声紋解析等により話者を識別できる。また、第１ＣＰＵ２１はマイク３６のマイクアレイを構成する各マイクロフォンの出力に基づいて、話者の位置を推定することにより、話者を識別してもよい。 For example, the first CPU 21 can identify the speaker by voiceprint analysis or the like. Further, the first CPU 21 may identify the speaker by estimating the position of the speaker based on the output of each microphone constituting the microphone array of the microphone 36.

第１ＣＰＵ２１は主記憶装置２２または補助記憶装置２３に一時的に記憶した音声データ、および、ステップＳ５０２で判定した言語の種類を音声認識サーバ３０に送信する（ステップＳ５４５）。 The first CPU 21 transmits the voice data temporarily stored in the main storage device 22 or the auxiliary storage device 23 and the language type determined in step S502 to the voice recognition server 30 (step S545).

第２ＣＰＵ３１は、送信された音声データおよび言語を受信する（ステップＳ６２１）。第２ＣＰＵ３１は、送信された音声データの言語認識を行い、テキストデータに変換する（ステップＳ６２２）。第２ＣＰＵ３１は、テキストデータを情報処理装置２０に送信する（ステップＳ６２３）。 The second CPU 31 receives the transmitted voice data and language (step S621). The second CPU 31 performs language recognition of the transmitted voice data and converts it into text data (step S622). The second CPU 31 transmits text data to the information processing apparatus 20 (step S623).

第１ＣＰＵ２１は、テキストデータを受信する（ステップＳ５５０）。第１ＣＰＵ２１は、受信したテキストデータを、主記憶装置２２または補助記憶装置２３に一時的に記憶する。第１ＣＰＵ５５０は、表示部３５にテキストデータを表示する（ステップＳ５５１）。 The first CPU 21 receives text data (step S550). The first CPU 21 temporarily stores the received text data in the main storage device 22 or the auxiliary storage device 23. The first CPU 550 displays text data on the display unit 35 (step S551).

なお、ステップＳ５５１において第１ＣＰＵは、図６および図７に例示するように吹き出しの向きで区別して表示する。第１ＣＰＵは、吹き出しの向きの代わりに、吹き出しの色、文字の色またはフォント等、任意の形式で話者を区別して表示しても良い。第１ＣＰＵ２１は、話者を区別せずにテキストデータを表示しても良い。 In step S551, the first CPU distinguishes and displays the directions according to the direction of the balloon as illustrated in FIGS. The first CPU may distinguish and display the speaker in an arbitrary format such as a balloon color, a character color, or a font instead of the balloon direction. The first CPU 21 may display text data without distinguishing speakers.

第１ＣＰＵ２１は、処理を終了するか否かを判定する（ステップＳ５５５）。たとえば、所定の時間音声データが検出されない場合に、第１ＣＰＵ２１は処理を終了すると判定する。処理を終了しないと判定した場合（ステップＳ５５５でＮＯ）、第１ＣＰＵ２１はステップＳ５４１に戻る。 The first CPU 21 determines whether or not to end the process (step S555). For example, if no audio data is detected for a predetermined time, the first CPU 21 determines to end the process. When it determines with not complete | finishing a process (it is NO at step S555), 1st CPU21 returns to step S541.

処理を終了すると判定した場合（ステップＳ５５５でＹＥＳ）、第１ＣＰＵ２１は処理を終了する。なお第１ＣＰＵ２１は、処理を終了する前に主記憶装置２２または補助記憶装置２３に記憶したテキストをユーザにより指定されたメールアドレス等に送信しても良い。第１ＣＰＵ２１は、処理を終了する前に主記憶装置２２または補助記憶装置２３に記憶したテキストを所定のサーバに保存し、アクセス用のＵＲＬを表示部３５に表示しても良い。 If it is determined that the process is to be ended (YES in step S555), the first CPU 21 ends the process. Note that the first CPU 21 may transmit the text stored in the main storage device 22 or the auxiliary storage device 23 to an e-mail address or the like designated by the user before finishing the process. The first CPU 21 may store the text stored in the main storage device 22 or the auxiliary storage device 23 in a predetermined server before displaying the process, and display the access URL on the display unit 35.

本実施の形態によると、自動的に議事録を作成するデジタルサイネージシステム１０を提供できる。ユーザは、議事録モードを一人で使用しても良い。いわゆる口述筆記を行なうデジタルサイネージシステム１０を提供できる。 According to the present embodiment, it is possible to provide a digital signage system 10 that automatically creates minutes. The user may use the minutes mode alone. A digital signage system 10 that performs so-called dictation can be provided.

なお、図９を使用して説明した議事録表示のサブルーチンにおいて、ステップＳ５５０で受信したテキストデータを機械翻訳した後に、ステップＳ５５１で表示しても良い。たとえば、日本語の話者による発言を図７に示すように英語で、英語の話者による発言を図６に示すように日本語で表示することにより、異なる言語を使用するユーザ同士のコミュニュケーションを支援するデジタルサイネージシステム１０を提供できる。 In the minutes display subroutine described with reference to FIG. 9, the text data received in step S550 may be machine translated and then displayed in step S551. For example, by displaying a speech by a Japanese speaker in English as shown in FIG. 7 and a speech by an English speaker in Japanese as shown in FIG. 6, communication between users who use different languages can be performed. The digital signage system 10 that supports the application can be provided.

［実施の形態３］
図１０は、実施の形態３のデジタルサイネージ装置１１の機能ブロック図である。デジタルサイネージ装置１１は、前述のマイク３６と表示部３５とに加えて、テキスト取得部８１および出力部８２を備える。 [Embodiment 3]
FIG. 10 is a functional block diagram of the digital signage apparatus 11 according to the third embodiment. The digital signage apparatus 11 includes a text acquisition unit 81 and an output unit 82 in addition to the microphone 36 and the display unit 35 described above.

表示部３５は、ハーフミラーにより覆われている。マイク３６は、表示部３５の近傍から音声を取得する。テキスト取得部８１は、マイク３６が取得した音声に基づくテキストを取得する。出力部８２は、テキスト取得部８１が取得したテキストに基づくコンテンツを表示部３５に出力する。 The display unit 35 is covered with a half mirror. The microphone 36 acquires sound from the vicinity of the display unit 35. The text acquisition unit 81 acquires text based on the voice acquired by the microphone 36. The output unit 82 outputs content based on the text acquired by the text acquisition unit 81 to the display unit 35.

［実施の形態４］
本実施の形態は、汎用のコンピュータとプログラム９７とを組み合わせて動作させることにより、本実施の形態のデジタルサイネージシステム１０を実現する形態に関する。図１１は、実施の形態４のデジタルサイネージシステム１０の構成を示す説明図である。なお、実施の形態１と共通する部分の説明は省略する。 [Embodiment 4]
The present embodiment relates to a mode for realizing the digital signage system 10 of the present embodiment by operating a general-purpose computer and a program 97 in combination. FIG. 11 is an explanatory diagram illustrating a configuration of the digital signage system 10 according to the fourth embodiment. Note that description of portions common to the first embodiment is omitted.

本実施の形態のデジタルサイネージシステム１０は、ネットワークを介して接続されたデジタルサイネージ装置１１、音声認識サーバ３０および分野サーバ４２を備える。デジタルサイネージ装置１１は、コンピュータ９０、表示部３５、マイク３６およびスピーカ３７を備える。 The digital signage system 10 of this embodiment includes a digital signage device 11, a voice recognition server 30, and a field server 42 connected via a network. The digital signage apparatus 11 includes a computer 90, a display unit 35, a microphone 36, and a speaker 37.

コンピュータ９０は、第１ＣＰＵ２１、主記憶装置２２、補助記憶装置２３、通信部２４、表示Ｉ／Ｆ２５、マイクＩ／Ｆ２６、スピーカＩ／Ｆ２７、読取部９５およびバスを備える。 The computer 90 includes a first CPU 21, a main storage device 22, an auxiliary storage device 23, a communication unit 24, a display I / F 25, a microphone I / F 26, a speaker I / F 27, a reading unit 95, and a bus.

プログラム９７は、可搬型記録媒体９６に記録されている。第１ＣＰＵ２１は、読取部９５を介してプログラム９７を読み込み、補助記憶装置２３に保存する。また第１ＣＰＵ２１は、コンピュータ９０に実装されたフラッシュメモリ等の半導体メモリ９８に記憶されたプログラム９７を読出しても良い。さらに、第１ＣＰＵ２１は、通信部２４および図示しないネットワークを介して接続される図示しない他のサーバコンピュータからプログラム９７をダウンロードして補助記憶装置２３に保存しても良い。 The program 97 is recorded on a portable recording medium 96. The first CPU 21 reads the program 97 via the reading unit 95 and stores it in the auxiliary storage device 23. The first CPU 21 may read the program 97 stored in the semiconductor memory 98 such as a flash memory mounted on the computer 90. Furthermore, the first CPU 21 may download the program 97 from another server computer (not shown) connected via the communication unit 24 and a network (not shown) and store the program 97 in the auxiliary storage device 23.

プログラム９７は、コンピュータ９０の制御プログラムとしてインストールされ、主記憶装置２２にロードして実行される。これにより、コンピュータ９０は上述した情報処理装置２０として機能する。 The program 97 is installed as a control program for the computer 90, loaded into the main storage device 22, and executed. Thereby, the computer 90 functions as the information processing apparatus 20 described above.

各実施例で記載されている技術的特徴（構成要件）はお互いに組合せ可能であり、組み合わせすることにより、新しい技術的特徴を形成することができる。
今回開示された実施の形態はすべての点で例示であって、制限的なものでは無いと考えられるべきである。本発明の範囲は、上記した意味では無く、特許請求の範囲によって示され、特許請求の範囲と均等の意味および範囲内でのすべての変更が含まれることが意図される。 The technical features (components) described in each embodiment can be combined with each other, and new technical features can be formed by combining them.
The embodiments disclosed herein are illustrative in all respects and should not be considered as restrictive. The scope of the present invention is defined not by the above-described meaning but by the scope of the claims, and is intended to include all modifications within the meaning and scope equivalent to the scope of the claims.

１０デジタルサイネージシステム
１１デジタルサイネージ装置
２０情報処理装置
２１第１ＣＰＵ
２２主記憶装置
２３補助記憶装置
２４通信部
２５表示Ｉ／Ｆ
２６マイクＩ／Ｆ
２７スピーカＩ／Ｆ
３０音声認識サーバ
３１第２ＣＰＵ
３２主記憶装置
３３補助記憶装置
３４通信部
３５表示部
３６マイク
３７スピーカ
４２分野サーバ
４２１天気サーバ
４２２株価サーバ
５１問合先ＤＢ
８１テキスト取得部
８２出力部
９０コンピュータ
９５読取部
９６可搬型記録媒体
９７プログラム
９８半導体メモリ DESCRIPTION OF SYMBOLS 10 Digital signage system 11 Digital signage apparatus 20 Information processing apparatus 21 1st CPU
22 Main storage device 23 Auxiliary storage device 24 Communication unit 25 Display I / F
26 Microphone I / F
27 Speaker I / F
30 Voice recognition server 31 2nd CPU
32 Main storage device 33 Auxiliary storage device 34 Communication unit 35 Display unit 36 Microphone 37 Speaker 42 Field server 421 Weather server 422 Stock price server 51 Inquiry DB
81 Text Acquisition Unit 82 Output Unit 90 Computer 95 Reading Unit 96 Portable Recording Medium 97 Program 98 Semiconductor Memory

Claims

A display unit covered with a half mirror;
A microphone for acquiring sound from the vicinity of the display unit;
A text acquisition unit for acquiring text based on the voice acquired by the microphone;
A digital signage apparatus comprising: an output unit that outputs content based on the text acquired by the text acquisition unit to the display unit.

The digital signage apparatus according to claim 1, wherein the output unit outputs a search result using the text as a key.

A language determination unit for determining a language based on the voice acquired by the microphone;
The digital signage apparatus according to claim 1, wherein the text acquisition unit acquires the text corresponding to the language specified by the language determination unit.

Get audio through the microphone,
Get text based on the acquired voice,
A program that causes a computer to execute a process of outputting content based on the acquired text to a display unit that is disposed in the vicinity of the microphone and covered with a half mirror.