JP2016164791A

JP2016164791A - Server device and search method

Info

Publication number: JP2016164791A
Application number: JP2016075003A
Authority: JP
Inventors: Takeshi Honma (本間　健); Koichiro Fukunaga (福永　功一郎); Norio Watarai (則男度會); Masataka Motohashi (将敬本橋); Yasunari Obuchi (康成大淵)
Original assignee: Clarion Co Ltd
Current assignee: Faurecia Clarion Electronics Co Ltd
Priority date: 2016-04-04
Filing date: 2016-04-04
Publication date: 2016-09-08
Anticipated expiration: 2031-09-22
Also published as: JP6109373B2

Abstract

PROBLEM TO BE SOLVED: To provide a technique for an information terminal that enables a user to easily utilize a high-performance search function. SOLUTION: An information terminal comprises: voice input reception means which receives input of voice; communication means which performs communication with a predetermined server device via a network; output means; Point Of Interest (POI) identification means which transmits voice information received by the voice input reception means to the server device and receives information for identifying POI candidates related to the voice information; POI candidate output means which outputs the information for identifying the POI candidates received by the POI identification means to the output means; and route search means which, after receiving input of selection from the information for identifying the POI candidates, searches for a route which leads to the selected POI. SELECTED DRAWING: Figure 6

Description

The present invention relates to information terminal technology.

Conventionally, information terminals such as navigation devices receive voice input through a microphone or the like and search for geographical names that can serve as a destination or a waypoint. Patent Document 1 describes a technique relating to such a navigation device.

Patent Document 1: JP 2006-349427 A

In a navigation device of this kind, the navigation device itself recognizes the speech and performs a dictionary search for candidate facilities and the like. To provide a more capable search function, the processing load on the navigation device must be taken into account, so a very high-performance device is required.

An object of the present invention is to provide a technique for an information terminal that makes a highly capable search function easier to use.

To solve the above problem, an information terminal according to the present invention comprises: voice input receiving means that receives voice input; communication means that communicates with a predetermined server device via a network; output means; POI specifying means that transmits the voice information received by the voice input receiving means to the server device and receives information specifying POI (Point Of Interest) candidates related to the voice information; POI candidate output means that outputs the information specifying the POI candidates received by the POI specifying means to the output means; and route search means that receives a selection input for the information specifying the POI candidates and searches for a route to the selected POI.

A server device according to the present invention comprises: voice information receiving means that receives voice information from a predetermined information terminal via a network; noise removing means that removes noise information from the voice information; voice transmitting means that transmits the noise-removed voice information to a predetermined voice recognition device via the network; POI information receiving means that receives, via the network, POI (Point Of Interest) information related to the character string; and POI information transmitting means that transmits the POI information to the information terminal.

A search system according to the present invention has an information terminal and a server device that communicates with the information terminal via a network. The information terminal comprises: voice input receiving means that receives voice input; communication means that communicates with the server device via the network; output means; POI specifying means that transmits the voice information received by the voice input receiving means to the server device and receives information specifying POI (Point Of Interest) candidates related to the voice information; POI candidate output means that outputs the information specifying the POI candidates received by the POI specifying means to the output means; and route search means that receives a selection input for the information specifying the POI candidates and searches for a route to the selected POI. The server device comprises: voice information receiving means that receives voice information from the information terminal via the network; noise removing means that removes noise information from the voice information; voice transmitting means that transmits the noise-removed voice information to a predetermined voice recognition device via the network; POI information receiving means that receives, via the network, POI (Point Of Interest) information related to the character string; and POI information transmitting means that transmits the POI information to the information terminal.

A search method according to the present invention is a search method for a search system having an information terminal and a server device that communicates with the information terminal via a network. The search system comprises voice input receiving means that receives voice input, and output means. The method performs: a transmission step of transmitting the voice information received by the voice input receiving means to the server device; a voice information receiving step of receiving the voice information from the information terminal via the network; a noise removal step of removing noise information from the voice information; a voice transmission step of transmitting the noise-removed voice information to a predetermined voice recognition device via the network; a POI information receiving step of receiving POI (Point Of Interest) information related to the character string from the voice recognition device via the network; a POI information transmitting step of transmitting the POI information to the information terminal; a POI specifying step of receiving the POI information; and a POI candidate output step of outputting the POI information received in the POI specifying step to the output means.

According to the present invention, it is possible to provide a technique for an information terminal that makes a highly capable search function easier to use.

FIG. 1 is a schematic diagram of the search system.
FIG. 2 is a hardware configuration diagram of the relay server device.
FIG. 3 is a schematic configuration diagram of the navigation device.
FIG. 4 is a diagram showing the structure of the link table.
FIG. 5 is a functional configuration diagram of the arithmetic processing unit.
FIG. 6 is a diagram showing the flow of the information search process.
FIG. 7 is a sequence diagram of the information search process.
FIG. 8 is a flowchart of the POI search result integration process.
FIG. 9 is a flowchart of the microphone selection process.
FIG. 10 is a flowchart of the selection process at the time of microphone utterance.
FIG. 11 is a diagram showing the flow of a modification of the information search process.
FIG. 12 is a sequence diagram of another modification of the information search process.

A navigation device and a search system to which a first embodiment of the present invention is applied are described below with reference to the drawings.

FIG. 1 shows the overall configuration of the search system 1000. In the search system 1000, a navigation device 100 mounted on a vehicle, a relay server device 500, a voice identification server device 900, and a POI providing server device 950 can be connected to one another via a network 30, such as a wide area network like the Internet, a LAN (Local Area Network), a WAN (Wide Area Network), or a mobile phone network.

The voice identification server device 900 is a device that implements a voice recognition service provided by a predetermined operator or the like via the network 30. In the present embodiment, when the voice identification server device 900 receives transmitted voice information (waveform information specifying the characteristics of the voice), it performs voice recognition and transmits the recognized words as character strings. Because recognized words are usually ambiguous, it performs recognition that tolerates ambiguity, such as an N-best search, and transmits one or more character strings that may apply, according to the recognition accuracy.

The POI providing server device 950 is a device that implements a POI (Point Of Interest) search service provided by a predetermined operator or the like via the network 30. In the present embodiment, when the POI providing server device 950 receives a transmitted character string, it searches for and identifies the POIs corresponding to that character string, that is, one or more POIs whose data contains the character string or a character string similar to it, and transmits a list of POIs according to how likely each match is. In the POI list, one or more POIs are associated with each transmitted character string in descending order of accuracy, and each POI includes the POI name, coordinate information such as the latitude and longitude specifying the POI position, the POI address, a telephone number for the POI, and the like.
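The two result shapes described above (N-best character strings from the voice identification server device 900, and a per-string POI list from the POI providing server device 950) can be sketched as plain data structures. This is a minimal illustration only: the class names, the `confidence` and `score` fields, and the `search` callable are assumptions made for the example and do not appear in the source.

```python
from dataclasses import dataclass


@dataclass
class Hypothesis:
    text: str          # recognized character string
    confidence: float  # hypothetical recognition score, higher is better


@dataclass
class Poi:
    name: str
    latitude: float
    longitude: float
    address: str
    phone: str
    score: float       # hypothetical match score, higher is better


def n_best(hypotheses, n):
    """Return the n most confident recognition hypotheses, best first."""
    return sorted(hypotheses, key=lambda h: h.confidence, reverse=True)[:n]


def build_poi_list(strings, search):
    """Associate each character string with its POIs, best match first.

    `search` is a hypothetical callable: string -> iterable of Poi.
    """
    return {
        s: sorted(search(s), key=lambda p: p.score, reverse=True)
        for s in strings
    }
```

Keeping the POIs keyed by the string that produced them mirrors the description that the list associates POIs with each transmitted character string.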

The navigation device 100 is a so-called navigation device that can display map information and show both a point indicating the current location of the navigation device 100 and information guiding the user along a route to a set destination.

When the relay server device 500 receives a POI search request and voice information from the navigation device 100, it removes noise from the voice information and transmits the result to the voice identification server device 900. It then transmits the character strings returned by the voice identification server device 900 to the POI providing server device 950, and transmits the POI list it receives to the navigation device 100.
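The relay sequence above can be sketched as a single function that chains the three stages. This is a sketch under stated assumptions: `denoise`, `recognize`, and `search_pois` are hypothetical callables standing in for the noise removal step, the voice identification server device 900, and the POI providing server device 950; the source does not define any such interface, and the network transport is omitted.

```python
def relay_poi_search(audio, denoise, recognize, search_pois):
    """Relay one POI search request from the terminal.

    All three callables are hypothetical stand-ins:
      denoise(audio)      -> cleaned audio
      recognize(audio)    -> list of candidate character strings
      search_pois(string) -> list of POIs for that string
    """
    cleaned = denoise(audio)            # noise removal on the relay server
    strings = recognize(cleaned)        # one or more recognized strings
    # The POI list, keyed by recognized string, goes back to the terminal.
    return {s: search_pois(s) for s in strings}
```

Passing the stages in as parameters keeps the sketch independent of any particular recognition or POI service.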

The configuration of the relay server device 500 is now described in more detail. The relay server device 500 includes a storage unit 510, a control unit 520, and a transmission/reception unit 530. The storage unit 510 stores a server information table 511, which holds the setting information used to identify the voice identification server device 900 that performs voice identification and the POI providing server device 950 that provides POIs.

The control unit 520 includes a noise removal processing unit 521 and a POI presentation unit 522. The noise removal processing unit 521 applies one or more noise removal algorithms to the voice information received from the navigation device 100 and performs the noise removal corresponding to each algorithm. For example, if the noise removal processing unit 521 can execute four noise removal algorithms, it applies each of them to the voice information received from the navigation device 100 and outputs four versions of noise-removed voice information. Such algorithms include algorithms that remove noise by adaptive filtering, spectral subtraction, which removes the noise spectrum in the frequency domain, and the Running Spectrum Filter, which removes noise by passing the time-varying short-time spectrum (running spectrum) in the frequency domain through a digital filter along the time axis for each frequency.
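Of the algorithms named above, spectral subtraction is the simplest to illustrate. The following is a textbook magnitude spectral subtraction on a single frame, written with a naive DFT so that it needs only the standard library; it is a sketch, not the implementation used by the noise removal processing unit 521. The function names, the `floor` parameter, and the assumption that a noise magnitude estimate is already available are all introduced for this example.

```python
import cmath
import math


def dft(x):
    """Naive discrete Fourier transform of a real or complex sequence."""
    n = len(x)
    return [
        sum(x[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
        for k in range(n)
    ]


def idft(spectrum):
    """Naive inverse DFT, returning the real part of each sample."""
    n = len(spectrum)
    return [
        sum(spectrum[k] * cmath.exp(2j * math.pi * k * t / n) for k in range(n)).real / n
        for t in range(n)
    ]


def spectral_subtraction(frame, noise_mag, floor=0.0):
    """Subtract an estimated noise magnitude from each frequency bin.

    frame     : list of samples for one analysis frame
    noise_mag : estimated noise magnitude per bin (same length as frame)
    floor     : lower bound for the cleaned magnitude (assumed parameter)
    The noisy phase is kept unchanged, as in the basic textbook method.
    """
    cleaned = []
    for coeff, n_mag in zip(dft(frame), noise_mag):
        magnitude = max(abs(coeff) - n_mag, floor)
        cleaned.append(cmath.rect(magnitude, cmath.phase(coeff)))
    return idft(cleaned)
```

In practice the noise magnitude would be estimated from frames that contain no speech, and an FFT would replace the quadratic-time DFT used here for self-containment.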

The POI presentation unit 522 receives voice information from the navigation device 100, has the noise removal processing unit 521 remove the noise, and transmits the one or more pieces of noise-removed voice information to the voice identification server device 900 based on the setting information stored in the server information table 511. When it receives one or more recognized character strings from the voice identification server device 900, the POI presentation unit 522 transmits them to the navigation device 100. When it receives the character string selected by the user of the navigation device 100, it transmits a POI search request containing that character string to the POI providing server device 950. It then transmits the POI list returned by the POI providing server device 950 to the navigation device 100.

The transmission/reception unit 530 transmits information to, and receives information from, other devices via the network 30. In the present embodiment, the transmission/reception unit 530 communicates with the navigation device 100, the voice identification server device 900, and the POI providing server device 950.

FIG. 2 is a hardware configuration diagram of the relay server device 500. The relay server device 500 has an input device 551, an output device 552, a communication device 553, an arithmetic device 554, a main storage device 555, and an external storage device 556. These devices are connected by a bus 557. The input device 551 and the output device 552 are not essential and may be provided as needed.

The input device 551 is a device that accepts input, such as a keyboard, a mouse, a touch pen, or another pointing device. The output device 552 is a device that performs display, such as a display. The communication device 553 is a device that communicates with other devices via a network such as the network 30. Through the network 30, the communication device 553 of the relay server device 500 can communicate with the voice identification server device 900, the POI providing server device 950, the communication device 12 of the navigation device 100, and the like. The arithmetic device 554 is an arithmetic device such as a CPU (Central Processing Unit). The main storage device 555 is a memory device such as a RAM (Random Access Memory). The external storage device 556 is a non-volatile storage device such as a hard disk device or an SSD (Solid State Drive).

The instruction code loaded into the main storage device 555 may be code stored in the external storage device 556, or may be obtained via the communication device 553 from another device (not shown) on the network 30 or from a device on a network such as the Internet. The main storage device 555 has an area into which the instruction code executed by the arithmetic device 554 is loaded. The external storage device 556 is an ordinary storage device and stores in advance the software that operates the relay server device 500, the initial values of the data that software requires, and other data.

The noise removal processing unit 521 and the POI presentation unit 522 of the control unit 520 of the relay server device 500 described above are constructed by the arithmetic device 554 reading and executing a predetermined program. For this purpose, the main storage device 555 stores the programs that implement the processing of each functional unit.

The components of the relay server device 500 described above are classified according to their main processing content to make the configuration easier to understand. The present invention is therefore not limited by how the components are classified or by their names. The configuration of the relay server device 500 can also be divided into more components according to the processing content, or classified so that a single component performs more processing.

The control unit 520 of the relay server device 500 may also be constructed from hardware (an ASIC, a GPU, or the like). The processing of each functional unit may be executed by a single piece of hardware or by multiple pieces of hardware.

FIG. 3 shows the overall configuration of the navigation device 100. The navigation device 100 includes an arithmetic processing unit 1, a display 2, a storage device 3, a voice input/output device 4 (which includes a microphone 41 as a voice input device and a speaker 42 as a voice output device), an input device 5, a ROM device 6, a vehicle speed sensor 7, a gyro sensor 8, a GPS (Global Positioning System) receiver 9, an FM multiplex broadcast receiver 10, a beacon receiver 11, and a communication device 12.

The arithmetic processing unit 1 is the central unit that performs various processes. For example, it calculates the current location based on information output from the sensors 7 and 8, the GPS receiver 9, the FM multiplex broadcast receiver 10, and the like. Based on the current location information obtained, it reads the map data needed for display from the storage device 3 or the ROM device 6.

The arithmetic processing unit 1 also renders the map data it has read as graphics, superimposes a mark indicating the current location, and displays the result on the display 2. Using the map data and other data stored in the storage device 3 or the ROM device 6, it searches for the optimal route (recommended route) connecting the departure point or current location specified by the user with the destination (or a waypoint or stopover). It also guides the user using the speaker 42 and the display 2.

The arithmetic processing unit 1 of the navigation device 100 is configured with its devices connected by a bus 25. The arithmetic processing unit 1 has a CPU (Central Processing Unit) 21 that executes various processes such as numerical computation and control of each device, a RAM (Random Access Memory) 22 that stores map data, computation data, and the like read from the storage device 3, a ROM (Read Only Memory) 23 that stores programs and data, and an I/F (interface) 24 for connecting various hardware to the arithmetic processing unit 1.

The display 2 is a unit that displays graphics information generated by the arithmetic processing unit 1 and the like. The display 2 is a liquid crystal display, an organic EL display, or the like.

The storage device 3 is composed of a storage medium that is at least readable and writable, such as an HDD (Hard Disk Drive) or a non-volatile memory card.

This storage medium stores a link table 200, which is the map data (including link data for the links that make up the roads on the map) required by an ordinary route search device.

FIG. 4 shows the structure of the link table 200. For each mesh identification code (mesh ID) 201, where a mesh is a partitioned area on the map, the link table 200 contains link data 202 for each link that makes up the roads in that mesh area.

For each link ID 211, which is the identifier of a link, the link data 202 includes: coordinate information 222 for the two nodes (start node and end node) that make up the link; a road type 223 indicating the type of road that contains the link; a link length 224 indicating the length of the link; a link travel time 225 stored in advance; a start connection link and end connection link 226, which identify the link connected to the start node of the link and the link connected to its end node, respectively; and a speed limit 227 indicating the speed limit of the road that contains the link.

Here, the two nodes that make up a link are distinguished as a start node and an end node, so that the up direction and the down direction of the same road are managed as separate links.
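The link table described above can be sketched as a mapping from mesh ID to directed link records, and the stored link travel time can then drive a shortest-path search. The source does not say which route search algorithm the navigation device uses; Dijkstra's algorithm is chosen here purely as an illustration. The field names, units, and the `fastest_route` function are assumptions, and the start and end connection link fields are omitted because connectivity is derived from the node coordinates instead.

```python
import heapq
from dataclasses import dataclass


@dataclass(frozen=True)
class Link:
    link_id: int
    start_node: tuple        # (x, y) coordinates of the start node
    end_node: tuple          # (x, y) coordinates of the end node
    road_type: str
    length_m: float          # link length (unit assumed)
    travel_time_s: float     # pre-stored link travel time (unit assumed)
    speed_limit_kmh: float   # speed limit (unit assumed)


def fastest_route(link_table, origin, destination):
    """Dijkstra over directed links, minimizing total travel time.

    link_table maps mesh ID -> list of Link.
    Returns (total_time_s, [link_id, ...]) or None if unreachable.
    """
    outgoing = {}
    for links in link_table.values():
        for link in links:
            outgoing.setdefault(link.start_node, []).append(link)

    queue = [(0.0, origin, [])]
    settled = set()
    while queue:
        time_s, node, path = heapq.heappop(queue)
        if node == destination:
            return time_s, path
        if node in settled:
            continue
        settled.add(node)
        for link in outgoing.get(node, []):
            if link.end_node not in settled:
                heapq.heappush(
                    queue,
                    (time_s + link.travel_time_s, link.end_node, path + [link.link_id]),
                )
    return None
```

Because each direction of a road is its own link, a one-way street is simply a road with only one of the two links present.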

Returning to FIG. 3, the voice input/output device 4 includes a built-in microphone 41 as a voice input device and a speaker 42 as a voice output device. The microphone 41 picks up sound from outside the navigation device 100, such as the voice of the user or another passenger.

The voice input/output device 4 also has a connection unit that accepts the connection of an extension microphone 43. Because the voice input/output device 4 can accept an extension microphone 43 such as a headset, which generally has better sound collection performance, it can receive voice information with higher accuracy.

The speaker 42 outputs messages for the user generated by the arithmetic processing unit 1 as audio. The microphone 41 and the speaker 42 are placed separately at predetermined locations in the vehicle, although they may also be housed in a single enclosure. The navigation device 100 can include multiple microphones 41 and multiple speakers 42.

The input device 5 is a device that receives instructions from the user through user operation. The input device 5 consists of a touch panel 51, a dial switch 52, and other hard switches (not shown) such as scroll keys and scale change keys. The input device 5 also includes a remote controller that can issue operation instructions to the navigation device 100 remotely. The remote controller has a dial switch, scroll keys, scale change keys, and the like, and can send information about each key or switch operation to the navigation device 100.

The touch panel 51 is mounted on the display surface side of the display 2, and the display screen can be seen through it. The touch panel 51 identifies the touch position corresponding to the XY coordinates of the image shown on the display 2, converts the touch position into coordinates, and outputs them. The touch panel 51 is built from pressure-sensitive or capacitive input detection elements or the like.

The dial switch 52 can rotate clockwise and counterclockwise, generates a pulse signal for each rotation through a predetermined angle, and outputs it to the arithmetic processing unit 1. The arithmetic processing unit 1 determines the rotation angle from the number of pulse signals.
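The pulse-counting arithmetic above can be sketched as follows. The class name, the 15-degree step in the usage example, and the sign convention (clockwise positive) are assumptions made for illustration; the source only states that the rotation angle is obtained from the number of pulses.

```python
class DialCounter:
    """Accumulates dial pulses and converts them to a rotation angle."""

    def __init__(self, degrees_per_pulse):
        # Angle of rotation that produces one pulse (assumed parameter).
        self.degrees_per_pulse = degrees_per_pulse
        self.pulses = 0

    def on_pulse(self, clockwise):
        # Assumed convention: clockwise counts up, counterclockwise down.
        self.pulses += 1 if clockwise else -1

    @property
    def angle(self):
        """Net rotation angle in degrees."""
        return self.pulses * self.degrees_per_pulse
```

For example, with a step of 15 degrees per pulse, three clockwise pulses followed by one counterclockwise pulse give a net angle of 30 degrees.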

The ROM device 6 is composed of a storage medium that is at least readable, such as a ROM (Read Only Memory) like a CD-ROM or DVD-ROM, or an IC (Integrated Circuit) card. This storage medium stores, for example, video data and audio data.

The vehicle speed sensor 7, the gyro sensor 8, and the GPS receiver 9 are used by the navigation device 100 to detect the current location (own vehicle position). The vehicle speed sensor 7 outputs a value used to calculate the vehicle speed. The gyro sensor 8 is an optical fiber gyro, a vibration gyro, or the like, and detects the angular velocity caused by rotation of the moving body. The GPS receiver 9 receives signals from GPS satellites and measures the distance between the moving body and each satellite, and the rate of change of that distance, for three or more satellites, thereby measuring the current location, travel speed, and travel direction of the moving body.

The FM multiplex broadcast receiver 10 receives FM multiplex broadcast signals sent from FM broadcast stations. FM multiplex broadcasts carry VICS (Vehicle Information Communication System: registered trademark) information such as approximate current traffic information, regulation information, SA/PA (service area/parking area) information, parking lot information, and weather information, as well as text information provided by radio stations as general FM multiplex information.

ビーコン受信装置１１は、ＶＩＣＳ情報などの概略現況交通情報、規制情報、ＳＡ／ＰＡ（サービスエリア／パーキングエリア）情報、駐車場情報、天気情報や緊急警報などを受信する。例えば、光により通信する光ビーコン、電波により通信する電波ビーコン等の受信装置である。 The beacon receiving device 11 receives rough current traffic information such as VICS information, regulation information, SA / PA (service area / parking area) information, parking lot information, weather information, emergency alerts, and the like. For example, it is a receiving device such as an optical beacon that communicates by light and a radio beacon that communicates by radio waves.

通信装置１２は、ナビゲーション装置１００を、ネットワーク３０等に接続させ、ネットワークに接続された中継サーバー装置５００等の他の装置と通信を行う装置である。なお、通信装置１２は、ナビゲーション装置１００に内蔵されるものであってもよいし、例えば携帯電話網を利用する通信モジュールや携帯電話等、外部機器として取り付け可能に搭載されるものであってもよい。また、ナビゲーション装置１００と通信装置１２との間は、ＵＳＢ（Universal Serial Bus）やＢｌｕｅｔｏｏｔｈ（登録商標）等の所定の通信規格により情報の送受信を行うものである。 The communication device 12 is a device that connects the navigation device 100 to the network 30 or the like and communicates with other devices connected to the network, such as the relay server device 500. The communication device 12 may be built into the navigation device 100, or may be an attachable external device such as a communication module that uses a mobile phone network or a mobile phone. Information is exchanged between the navigation device 100 and the communication device 12 according to a predetermined communication standard such as USB (Universal Serial Bus) or Bluetooth (registered trademark).

図５は、演算処理部１の機能ブロック図である。図示するように、演算処理部１は、基本制御部１０１と、入力受付部１０２と、出力処理部１０３と、雑音レベル判定部１０４と、中継サーバー通信部１０５と、ＰＯＩ提示情報作成部１０６と、マイク認識部１０７と、を有する。 FIG. 5 is a functional block diagram of the arithmetic processing unit 1. As illustrated, the arithmetic processing unit 1 includes a basic control unit 101, an input receiving unit 102, an output processing unit 103, a noise level determination unit 104, a relay server communication unit 105, a POI presentation information creation unit 106, and a microphone recognition unit 107.

基本制御部１０１は、様々な処理を行う中心的な機能部であり、処理内容に応じて、他の処理部を制御する。また、各種センサ、ＧＰＳ受信装置９等の情報を取得し、マップマッチング処理等を行って現在地を特定する。また、随時、走行した日付および時刻と、位置と、を対応付けて、リンクごとに走行履歴を記憶装置３に記憶する。さらに、各処理部からの要求に応じて、現在時刻を出力する。 The basic control unit 101 is a central functional unit that performs various processes, and controls other processing units according to the processing content. In addition, information on various sensors, the GPS receiver 9 and the like is acquired, and a map matching process is performed to identify the current location. In addition, the travel history is stored in the storage device 3 for each link by associating the travel date and time with the position as needed. Further, the current time is output in response to a request from each processing unit.

入力受付部１０２は、入力装置５またはマイクロフォン４１を介して入力された使用者からの指示を受け付け、その要求内容に対応する処理を実行するように演算処理部１の各部を制御する。例えば、使用者が推奨経路の探索を要求したときは、目的地を設定するため、地図をディスプレイ２に表示する処理を出力処理部１０３に要求する。 The input receiving unit 102 receives an instruction from the user input via the input device 5 or the microphone 41, and controls each unit of the arithmetic processing unit 1 so as to execute processing corresponding to the requested content. For example, when the user requests a search for a recommended route, the output processing unit 103 is requested to display a map on the display 2 in order to set a destination.

出力処理部１０３は、例えばポリゴン情報等の表示させる画面情報を受け取り、ディスプレイ２に描画するための信号に変換してディスプレイ２に対して描画する指示を行う。 The output processing unit 103 receives screen information to be displayed such as polygon information, for example, converts it into a signal for drawing on the display 2, and instructs the display 2 to draw.

雑音レベル判定部１０４は、音声入出力装置４のマイクロフォン４１または拡張マイクロフォン４３から入力される音声情報について、雑音レベルを判定する。具体的には雑音レベル判定部１０４は、受け付けた音声情報の所定の無音部分の波形、望ましくは音声情報の最初の１００ｍｓに相当する無音部分の波形に含まれるノイズ成分を抽出し、当該ノイズ量の多寡に応じて雑音レベルを所定のレベル以上であるか否かを判定する。 The noise level determination unit 104 determines the noise level of the voice information input from the microphone 41 or the extension microphone 43 of the voice input/output device 4. Specifically, the noise level determination unit 104 extracts the noise component contained in the waveform of a predetermined silent portion of the received voice information, preferably the silent portion corresponding to the first 100 ms of the voice information, and determines from the amount of that noise whether the noise level is equal to or higher than a predetermined level.
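
The noise level check described above can be sketched as follows. This is only an illustration, not the implementation in the patent: the function names, the float sample format in [-1, 1], the use of RMS level in dBFS as the measure of the amount of noise, and the -40 dB threshold are all assumptions made for the example.

```python
import math

def noise_level_db(samples, sample_rate, window_ms=100):
    """RMS level in dBFS of the leading window, which is assumed to be silence."""
    n = max(1, int(sample_rate * window_ms / 1000))
    head = samples[:n]
    if not head:
        return float("-inf")
    rms = math.sqrt(sum(s * s for s in head) / len(head))
    return 20 * math.log10(rms) if rms > 0 else float("-inf")

def is_noisy(samples, sample_rate, threshold_db=-40.0):
    """True when the leading-window noise level is at or above the threshold."""
    return noise_level_db(samples, sample_rate) >= threshold_db
```

Measuring only the leading window relies on the speaker not having started talking yet, which is the same assumption the description makes about the first 100 ms.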

中継サーバー通信部１０５は、入力された音声情報を、中継サーバー装置５００へ送信する。また、中継サーバー通信部１０５は、音声認識の結果得られた文字列を中継サーバー装置５００から受信する。また、中継サーバー通信部１０５は、選択されたＰＯＩの情報を中継サーバー装置５００へ送信し、ＰＯＩリスト情報を受信する。 The relay server communication unit 105 transmits the input voice information to the relay server device 500. Further, the relay server communication unit 105 receives a character string obtained as a result of speech recognition from the relay server device 500. Also, the relay server communication unit 105 transmits the selected POI information to the relay server device 500 and receives the POI list information.

ＰＯＩ提示情報作成部１０６は、受信したＰＯＩリスト情報を統合して、ＰＯＩリストとして選択可能に使用者へ提示するための画面情報等を作成し、出力処理部１０３へ出力を依頼する。 The POI presentation information creation unit 106 integrates the received POI list information to create screen information or the like for presentation to the user so that it can be selected as a POI list, and requests the output processing unit 103 to output it.

マイク認識部１０７は、ナビゲーション装置１００に接続されたマイクロフォンの認識を行う。具体的には、マイク認識部１０７は、拡張マイクロフォン４３が接続されたことを検知して、内蔵のマイクロフォン４１との間でいずれのマイクを使用するかについての使用者の選択に応じて使用するマイクロフォンを特定する。 The microphone recognition unit 107 recognizes microphones connected to the navigation device 100. Specifically, the microphone recognition unit 107 detects that the extension microphone 43 has been connected, and identifies the microphone to be used according to the user's choice between the extension microphone 43 and the built-in microphone 41.

上記した演算処理部１の各機能部、すなわち基本制御部１０１、入力受付部１０２、出力処理部１０３、雑音レベル判定部１０４、中継サーバー通信部１０５、ＰＯＩ提示情報作成部１０６、マイク認識部１０７は、ＣＰＵ２１が所定のプログラムを読み込み実行することにより構築される。そのため、ＲＡＭ２２には、各機能部の処理を実現するためのプログラムが記憶されている。 Each functional unit of the arithmetic processing unit 1 described above, that is, the basic control unit 101, the input receiving unit 102, the output processing unit 103, the noise level determination unit 104, the relay server communication unit 105, the POI presentation information creation unit 106, and the microphone recognition unit 107, is constructed by the CPU 21 reading and executing a predetermined program. For this purpose, the RAM 22 stores the programs that implement the processing of each functional unit.

なお、上記した各構成要素は、ナビゲーション装置１００の構成を、理解を容易にするために、主な処理内容に応じて分類したものである。そのため、構成要素の分類の仕方やその名称によって、本願発明が制限されることはない。ナビゲーション装置１００の構成は、処理内容に応じて、さらに多くの構成要素に分類することもできる。また、１つの構成要素がさらに多くの処理を実行するように分類することもできる。 The components described above classify the configuration of the navigation device 100 according to the main processing content in order to make it easier to understand. The present invention is therefore not limited by the way the components are classified or by their names. The configuration of the navigation device 100 can be divided into more components depending on the processing content. It can also be divided so that a single component performs more processes.

また、各機能部は、ハードウェア（ＡＳＩＣ、ＧＰＵなど）により構築されてもよい。また、各機能部の処理が一つのハードウェアで実行されてもよいし、複数のハードウェアで実行されてもよい。 Each functional unit may be constructed by hardware (ASIC, GPU, etc.). Further, the processing of each functional unit may be executed by one hardware or may be executed by a plurality of hardware.

［動作の説明］次に、ナビゲーション装置１００、中継サーバー装置５００、音声特定サーバー装置９００およびＰＯＩ提供サーバー装置９５０を含む検索システム１０００において実施されるＰＯＩ検索処理の動作について説明する。図６は、ＰＯＩ検索処理を示すフロー図である。このフローは、ナビゲーション装置１００が起動している状態において、所定のＰＴＴ（ＰｕｓｈＴｏＴａｌｋ）ボタン等による音声入力の開始指示を受け付けることで開始される。 [Description of Operation] Next, the operation of the POI search process performed in the search system 1000 including the navigation device 100, the relay server device 500, the voice specifying server device 900, and the POI providing server device 950 will be described. FIG. 6 is a flowchart showing the POI search process. This flow is started by receiving a voice input start instruction using a predetermined PTT (Push To Talk) button or the like while the navigation device 100 is activated.

まず、入力受付部１０２は、音声入力の待ち受けを開始する（ステップＳ００１）。そして、入力受付部１０２は、ＰＴＴボタンの開放等により音声待ち受けを終了する（ステップＳ００３）まで、音声区間を検出し、入力された音声情報を圧縮して音声情報を作成する（ステップＳ００２）。なお、ここで、雑音レベル判定部１０４が、入力された音声情報の雑音レベルを判定する。そして、雑音レベルが所定よりも高い場合、すなわち雑音が多い環境では、入力受付部１０２は、圧縮率を低く設定して圧縮することとし、圧縮による音質劣化を最小限に留めるようにしてもよい。また、雑音レベルがさらに高く所定の閾値を超える場合、すなわち雑音が大きすぎて到底正常に音声認識を行うことができない程度に大きい環境においては、入力受付部１０２は、音声情報の作成を行わず、以降の処理を実施しないようにしてもよい。 First, the input receiving unit 102 starts waiting for voice input (step S001). Then, until voice standby ends, for example when the PTT button is released (step S003), the input receiving unit 102 detects voice sections and compresses the input audio to create voice information (step S002). At this point, the noise level determination unit 104 determines the noise level of the input voice information. When the noise level is higher than a predetermined level, that is, in a noisy environment, the input receiving unit 102 may set a low compression ratio so that sound quality degradation due to compression is kept to a minimum. When the noise level is higher still and exceeds a predetermined threshold, that is, in an environment so noisy that speech recognition cannot possibly be performed normally, the input receiving unit 102 may refrain from creating the voice information and skip the subsequent processing.
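
The two-threshold behaviour in step S002 can be illustrated with a small sketch. The function name, the string labels, and the threshold values in dB are hypothetical; the description only says that a low compression ratio may be used above one level and that voice information may not be created at all above a higher threshold.

```python
from typing import Optional

def choose_compression(noise_db: float,
                       noisy_db: float = -40.0,
                       abort_db: float = -15.0) -> Optional[str]:
    """Pick a compression setting from the measured noise level (assumed in dB)."""
    if noise_db > abort_db:
        return None    # too noisy for recognition: do not create voice information
    if noise_db > noisy_db:
        return "low"   # noisy: low compression ratio to limit quality loss
    return "high"      # quiet: a higher compression ratio is acceptable
```

A caller would skip the transmission to the relay server when the result is `None`.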

そして、入力受付部１０２は、中継サーバー通信部１０５を介して、中継サーバー装置５００へ音声情報を送信する。そして、中継サーバー装置５００の雑音除去処理部５２１は受信した音声情報に対して、所定のアルゴリズムを実現する雑音除去処理を実施する（ステップＳ００４）。具体的には、雑音除去処理部５２１は、雑音除去処理時に適用することをあらかじめ定められた一または複数のアルゴリズムにより、受信した音声情報に対して雑音除去処理を実施して、雑音を除去された一または複数の音声情報を生成する。 Then, the input receiving unit 102 transmits the voice information to the relay server device 500 via the relay server communication unit 105. The noise removal processing unit 521 of the relay server device 500 then performs noise removal processing implementing a predetermined algorithm on the received voice information (step S004). Specifically, the noise removal processing unit 521 applies one or more algorithms predetermined for use in noise removal to the received voice information, and generates one or more pieces of noise-removed voice information.

そして、ＰＯＩ提示部５２２は、雑音を除去された一または複数の音声情報を音声特定サーバー装置９００へ送信する。そして、音声特定サーバー装置９００は、各音声情報に所定の音声認識処理を実施して認識した結果である候補の一または複数の文字列情報を中継サーバー装置５００へ送信する（ステップＳ００５）。なお、当該音声認識処理においては、既存の音声認識等の処理が行われ、Ｎ−ｂｅｓｔ検索等により一または複数の認識結果の候補となる文字列がその確度とともに出力される。例えば、使用者が発話した音声情報が「ピザ」に該当するものである場合、「ピザ」、「Pizza」、「膝」、「いか」等の候補となる文字列が音声情報ごとに出力される。 Then, the POI presentation unit 522 transmits the one or more pieces of noise-removed voice information to the voice identification server device 900. The voice identification server device 900 performs predetermined speech recognition processing on each piece of voice information and transmits the resulting one or more candidate character strings to the relay server device 500 (step S005). In this speech recognition processing, existing speech recognition techniques are used, and one or more character strings that are candidate recognition results are output together with their accuracy by N-best search or the like. For example, when the voice information spoken by the user corresponds to "ピザ" (piza, "pizza"), candidate character strings such as "ピザ", "Pizza", "膝" (hiza, "knee"), and "いか" (ika) are output for each piece of voice information.

そして、ＰＯＩ提示部５２２は、出力された認識結果の文字列情報を受け取ると、認識結果に対して重みづけを行う（ステップＳ００６）。具体的には、出力された認識結果の文字列情報は、雑音除去のアルゴリズムに応じて一または複数の候補が挙げられており、その中で重複する候補があれば一つに統合し、統合された候補についてはその確度をより高いもの（例えば、確度に所定の割合を上乗せする）として補正し、確度の順に候補の文字列を順位付けする。なお、ＰＯＩ提示部５２２は、当該重みづけ処理において、音声情報に適用された雑音除去のアルゴリズムに応じて重みづけを行うようにしてもよい。すなわち、適切な雑音除去のアルゴリズムが適用された音声情報は認識精度が高いものと考えられるため、認識精度が高いと考えられる候補を重視するようにしてもよい。また、施設に該当しない可能性が高いＰＯＩについての候補があれば、これを除去してもよい。 Upon receiving the output character string information of the recognition result, the POI presentation unit 522 weights the recognition result (step S006). Specifically, the output recognition result string information includes one or more candidates according to the noise removal algorithm, and if there are duplicate candidates among them, they are integrated into one. The candidates are corrected with higher accuracy (for example, a predetermined ratio is added to the accuracy), and the candidate character strings are ranked in the order of accuracy. Note that the POI presenting unit 522 may perform weighting in the weighting process according to a noise removal algorithm applied to the voice information. That is, since speech information to which an appropriate noise removal algorithm is applied is considered to have high recognition accuracy, priority may be given to candidates that are considered to have high recognition accuracy. In addition, if there is a candidate for a POI that is highly unlikely to be a facility, it may be removed.
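
The weighting in step S006 can be sketched roughly as below. The names are hypothetical, and several details are assumptions for the example: merged duplicates keep their highest accuracy, the boost is a 10% multiplicative bonus, and the boosted value is capped at 1.0. The description only states that duplicates are merged, that their accuracy is raised by a predetermined proportion, and that candidates are ranked by accuracy.

```python
def merge_and_rank(results, bonus=0.1):
    """Merge duplicate candidates and rank them by accuracy.

    results: list of (text, accuracy) pairs gathered from every
    noise-removed variant of the same utterance.
    """
    best = {}
    counts = {}
    for text, acc in results:
        best[text] = max(acc, best.get(text, 0.0))
        counts[text] = counts.get(text, 0) + 1
    merged = []
    for text, acc in best.items():
        if counts[text] > 1:
            # The candidate was recognized in more than one variant: boost it.
            acc = min(1.0, acc * (1 + bonus))
        merged.append((text, acc))
    return sorted(merged, key=lambda pair: pair[1], reverse=True)
```

For instance, a candidate seen twice with accuracies 0.8 and 0.7 would be boosted to about 0.88 and would then outrank a candidate seen once with 0.85.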

ＰＯＩ提示部５２２は、順位付けした候補の文字列をナビゲーション装置１００へ送信する。そして、ナビゲーション装置１００のＰＯＩ提示情報作成部１０６は、受信した順位付けされた認識結果の候補の文字列を、選択可能にリスト出力する画面情報を作成し、出力処理部１０３へ指示してディスプレイ２へ表示させる（ステップＳ００７）。なお、ここで、順位付けした文字列の候補の数が所定の数に満たない場合、あるいは、ステップＳ００２にて受け付けた音声情報に含まれる雑音のレベルが所定よりも低い場合、すなわち音声の認識結果にあいまいさが少ない場合には、後述するステップＳ００８の処理を省略して、ステップＳ００９のＰＯＩの検索依頼を送信する処理を実施するようにしてもよい。 The POI presentation unit 522 transmits the ranked candidate character strings to the navigation device 100. The POI presentation information creation unit 106 of the navigation device 100 then creates screen information that lists the received ranked candidate character strings in a selectable manner, and instructs the output processing unit 103 to display it on the display 2 (step S007). Here, when the number of ranked candidate character strings is less than a predetermined number, or when the level of noise included in the voice information received in step S002 is lower than a predetermined level, that is, when there is little ambiguity in the speech recognition result, the process of step S008 described later may be omitted and the process of transmitting the POI search request in step S009 may be performed directly.

そして、入力受付部１０２は、表示された画面において使用者が指定した候補の選択入力を受け付けて、中継サーバー通信部１０５を介して中継サーバー装置５００へ送信する（ステップＳ００８）。 Then, the input receiving unit 102 receives a candidate selection input designated by the user on the displayed screen, and transmits it to the relay server device 500 via the relay server communication unit 105 (step S008).

ＰＯＩ提示部５２２は、送信された候補の文字列について、ＰＯＩ提供サーバー装置９５０へ送信し、ＰＯＩの検索依頼を送信する（Ｓ００９）。 The POI presentation unit 522 transmits the transmitted candidate character string to the POI providing server device 950, and transmits a POI search request (S009).

ＰＯＩ提供サーバー装置９５０は、送信された候補の文字列をその施設名あるいは住所等に含むか、送信された候補の文字列に類似する文字列をその施設名あるいは住所等に含むＰＯＩをゆらぎ検索し、複数のＰＯＩの候補を確度別に検索し、当該ＰＯＩの名称、座標、電話番号、住所等を含む情報を含むＰＯＩリストを中継サーバー装置５００へ送信する（ステップＳ０１０）。 The POI providing server device 950 performs a fuzzy search for POIs whose facility name, address, or the like contains the transmitted candidate character string or a character string similar to it, retrieves a plurality of POI candidates by accuracy, and transmits a POI list containing the name, coordinates, telephone number, address, and the like of each POI to the relay server device 500 (step S010).
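
A fuzzy search of this kind might look like the sketch below. The patent does not say how similarity is computed; this example substitutes Python's standard `difflib.SequenceMatcher` as a stand-in, and the function name, the dictionary fields, and the 0.6 cut-off are assumptions.

```python
from difflib import SequenceMatcher

def fuzzy_search(query, pois, min_score=0.6):
    """Return (score, poi) pairs whose name or address contains or resembles query.

    pois: list of dicts with at least "name" and "address" keys.
    """
    hits = []
    for poi in pois:
        score = 0.0
        for field in (poi["name"], poi["address"]):
            if query in field:
                field_score = 1.0  # exact containment counts as a full match
            else:
                field_score = SequenceMatcher(None, query, field).ratio()
            score = max(score, field_score)
        if score >= min_score:
            hits.append((score, poi))
    # Highest score first, so the list is ordered by accuracy.
    return sorted(hits, key=lambda hit: hit[0], reverse=True)
```

Comparing the query against the whole field penalizes long names, so a real service would more likely compare against substrings or use an index built for approximate matching.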

ＰＯＩ提示部５２２は、送信されたＰＯＩリストに対して、後述するＰＯＩ検索結果統合処理を実施して統合する（ステップＳ０１１）。そして、ＰＯＩ提示部５２２は、ナビゲーション装置１００に対して、統合したＰＯＩ検索結果を送信する。 The POI presentation unit 522 integrates the transmitted POI list by performing POI search result integration processing described later (step S011). Then, the POI presentation unit 522 transmits the integrated POI search result to the navigation device 100.

ＰＯＩ提示情報作成部１０６は、受信したＰＯＩ検索結果を用いて、各ＰＯＩを選択可能に表示する表示画面を作成し、出力処理部１０３に対してディスプレイ２に表示するよう指示する（ステップＳ０１２）。例えば、ＰＯＩ提示情報作成部１０６は、選択された候補が「ピザ」である場合には、ピザ食を提供するレストランのリスト等を選択可能に表示するとともに、当該レストランの座標位置に応じて、地図上の該当する位置に当該レストランのアイコンを表示する画面を作成する。 The POI presentation information creation unit 106 creates a display screen for selectively displaying each POI using the received POI search result, and instructs the output processing unit 103 to display it on the display 2 (step S012). . For example, when the selected candidate is “pizza”, the POI presentation information creation unit 106 displays a list of restaurants that provide pizza foods in a selectable manner, and according to the coordinate position of the restaurant, Create a screen that displays the icon of the restaurant at the appropriate location on the map.

そして、基本制御部１０１は、表示したＰＯＩの選択を入力受付部１０２を介して受け付け、選択されたＰＯＩを目的地あるいは経由地とする経路探索を行う（ステップＳ０１３）。当該経路探索時には、基本制御部１０１は、選択されたＰＯＩの名称を含むルート探索メッセージを表示する。例えば、基本制御部１０１は、選択されたＰＯＩの名称が「東京ピザ」であれば、「東京ピザへのルートを探索します」というメッセージを表示して、当該ＰＯＩへの経路探索を実施する。 Then, the basic control unit 101 receives the selection of a displayed POI via the input receiving unit 102, and performs a route search with the selected POI as the destination or a waypoint (step S013). During the route search, the basic control unit 101 displays a route search message that includes the name of the selected POI. For example, if the name of the selected POI is "Tokyo Pizza", the basic control unit 101 displays the message "Searching for a route to Tokyo Pizza" and performs the route search to that POI.

以上が、ＰＯＩ検索処理のフローである。ＰＯＩ検索処理によると、より手軽に高い検索機能を利用できる。具体的には、ナビゲーション装置の処理能力は特別に高くなくとも、精度の高い音声認識および高機能なＰＯＩの検索機能を利用することができるといえる。 The above is the flow of the POI search process. With the POI search process, a high-performance search function can be used more easily. Specifically, highly accurate speech recognition and a highly functional POI search can be used even if the processing capability of the navigation device is not particularly high.

図７は、図６に示したＰＯＩ検索処理におけるステップＳ００２〜ステップＳ００７におけるプロセス間の関連を表したシーケンス図である。 FIG. 7 is a sequence diagram showing the relationship between processes in steps S002 to S007 in the POI search process shown in FIG.

まず、ナビゲーション装置１００の中継サーバー通信部１０５は、中継サーバー装置５００の送受信プロセス（ＰＯＩ提示部５２２により制御される）に対して音声情報の送信を開始する（ステップＳ１０１）。 First, the relay server communication unit 105 of the navigation device 100 starts transmitting voice information to the transmission / reception process (controlled by the POI presenting unit 522) of the relay server device 500 (step S101).

そして、中継サーバー通信部１０５は、音声情報の送信をすべて終える（ステップＳ１０２）まで、音声情報の送信を継続する。 Then, the relay server communication unit 105 continues to transmit the audio information until the transmission of all the audio information is completed (step S102).

中継サーバー通信部１０５からの音声情報の送信が終わるのを待って、中継サーバー装置５００の送受信プロセスでは、雑音除去処理部５２１により制御される雑音除去プロセスへ音声情報の送信が開始される（ステップＳ１０３）。そして、すべての音声情報の送信が終わると、中継サーバー装置５００の送受信プロセスでは、雑音除去プロセスへの音声情報の送信が終了する（ステップＳ１０４）。 In the transmission / reception process of the relay server device 500, after the transmission of the audio information from the relay server communication unit 105 is finished, the transmission of the audio information is started to the noise removal process controlled by the noise removal processing unit 521 (step S103). When the transmission of all audio information is completed, the transmission / reception process of the relay server device 500 ends the transmission of the audio information to the noise removal process (step S104).

雑音除去プロセスでは、雑音除去処理部５２１により、送信された音声情報に対して所定の雑音除去処理が行われる（ステップＳ１０５）。中継サーバー装置５００の雑音除去処理部５２１は、雑音が除去された音声情報を、ＰＯＩ提示部５２２により制御されるサーバ通信プロセスへ送信開始する（ステップＳ１０６）。そして、雑音が除去された音声情報の送信が終わると、中継サーバー装置５００の雑音除去プロセスでは、サーバ通信プロセスへの音声情報の送信が終了する（ステップＳ１０７）。 In the noise removal process, the noise removal processing unit 521 performs a predetermined noise removal process on the transmitted voice information (step S105). The noise removal processing unit 521 of the relay server device 500 starts transmitting the voice information from which noise has been removed to the server communication process controlled by the POI presenting unit 522 (step S106). When the transmission of the voice information from which noise has been removed is completed, the transmission of the voice information to the server communication process is completed in the noise removal process of the relay server device 500 (step S107).

雑音が除去された音声情報をすべて受信すると、サーバ通信プロセスでは、ＰＯＩ提示部５２２により、雑音除去プロセスから送られた雑音が除去された音声情報について、音声特定サーバー装置９００への送信が開始される（ステップＳ１０８）。なお、ここでは、雑音除去のアルゴリズム別に音声情報が存在する場合には、複数の音声情報がすべて送信される。 When all the speech information from which noise has been removed is received, in the server communication process, the POI presenting unit 522 starts transmission of the speech information from which noise has been removed from the noise removal process to the speech identification server device 900. (Step S108). Here, when there is audio information for each noise removal algorithm, a plurality of audio information are all transmitted.

そして、雑音が除去された音声情報の送信が終わると、中継サーバー装置５００のサーバ通信プロセスでは、音声特定サーバー装置９００への音声情報の送信が終了する（ステップＳ１０９）。 When the transmission of the voice information from which noise has been removed is completed, the transmission of the voice information to the voice specifying server apparatus 900 is completed in the server communication process of the relay server apparatus 500 (step S109).

そして、音声特定サーバー装置９００は、受信した雑音除去された音声情報に対して、所定の音声認識処理を行い、認識した結果得られた候補となる文字列をＮ−ｂｅｓｔ検索により一または複数特定する（ステップＳ１１０）。 Then, the voice identification server device 900 performs a predetermined voice recognition process on the received noise-removed voice information, and specifies one or a plurality of candidate character strings obtained as a result of the recognition by N-best search. (Step S110).

そして、中継サーバー装置５００のサーバ通信プロセスでは、ＰＯＩ提示部５２２が、音声特定サーバー装置９００から送信された候補となる文字列をすべて受信する（ステップＳ１１１）。 Then, in the server communication process of the relay server device 500, the POI presentation unit 522 receives all candidate character strings transmitted from the voice identification server device 900 (step S111).

中継サーバー装置５００のサーバ通信プロセスでは、ＰＯＩ提示部５２２が、送受信プロセスにおいて受信した文字列をすべて送受信プロセスに送る（ステップＳ１１２）。 In the server communication process of the relay server device 500, the POI presentation unit 522 sends all the character strings received in the transmission / reception process to the transmission / reception process (step S112).

中継サーバー装置５００の送受信プロセスでは、ＰＯＩ提示部５２２により、ナビゲーション装置１００の中継サーバー通信部１０５へ認識結果の文字列が送信される（ステップＳ１１３）。 In the transmission / reception process of the relay server device 500, the POI presentation unit 522 transmits the character string of the recognition result to the relay server communication unit 105 of the navigation device 100 (step S113).

以上が、図６に示したＰＯＩ検索処理におけるステップＳ００２〜ステップＳ００７におけるプロセス間の関連である。なお、雑音除去（ステップＳ００４）や認識結果の重みづけ（ステップＳ００６）等の処理については、当該プロセス間の関係の説明では詳細な説明を省略している。 The above is the relationship between processes in steps S002 to S007 in the POI search process shown in FIG. Note that detailed explanations of processes such as noise removal (step S004) and recognition result weighting (step S006) are omitted in the description of the relationship between the processes.

図８は、ＰＯＩ検索結果統合処理のフローを示す図である。ＰＯＩ検索結果統合処理は、図６のＰＯＩ検索処理のステップＳ０１１において、中継サーバー装置５００によって実施される処理である。 FIG. 8 is a diagram showing a flow of POI search result integration processing. The POI search result integration process is a process executed by the relay server device 500 in step S011 of the POI search process of FIG.

まず、ＰＯＩ提示部５２２は、認識文字列間でＰＯＩリストが同一のものがあれば、確度の低い認識文字列とＰＯＩリストとを削除する（ステップＳ２０１）。具体的には、ＰＯＩ提示部５２２は、ＰＯＩ提供サーバー装置９５０から受信した一または複数の認識文字列とその確度、および対応するＰＯＩリストについて、各ＰＯＩリスト同士を比較して、ＰＯＩリストを構成するＰＯＩが完全に一致するＰＯＩリストがある場合には、確度の低い認識文字列についてのＰＯＩリストを削除し、合わせてその認識文字列と確度の情報も削除する。これをすべてのＰＯＩリスト間の重複がなくなるまで繰り返す。 First, if any recognized character strings have identical POI lists, the POI presentation unit 522 deletes the recognized character string with the lower accuracy together with its POI list (step S201). Specifically, for the one or more recognized character strings, their accuracies, and the corresponding POI lists received from the POI providing server device 950, the POI presentation unit 522 compares the POI lists with one another. When there are POI lists whose constituent POIs match completely, it deletes the POI list of the recognized character string with the lower accuracy, along with that recognized character string and its accuracy information. This is repeated until no duplicate POI lists remain.

そして、ＰＯＩ提示部５２２は、ＰＯＩリスト内で重複するＰＯＩがある場合には、確度の低いＰＯＩを当該リスト内から削除する（ステップＳ２０２）。ここで、ＰＯＩリストには、ＰＯＩの情報と、認識文字列に対する当該ＰＯＩの確度の情報が含まれているものとする。ＰＯＩ提示部５２２は、一の認識文字列についてのＰＯＩリスト内に、同一のＰＯＩ名称を有するＰＯＩが複数含まれる場合には、確度の低いＰＯＩをＰＯＩリストから削除し、重複を排除する。 Then, if there are overlapping POIs in the POI list, the POI presenting unit 522 deletes the POI with low accuracy from the list (step S202). Here, it is assumed that the POI list includes POI information and information on the accuracy of the POI with respect to the recognized character string. When a plurality of POIs having the same POI name are included in the POI list for one recognized character string, the POI presenting unit 522 deletes the POI with low accuracy from the POI list and eliminates duplication.

そして、ＰＯＩ提示部５２２は、認識文字列間で共通するＰＯＩがあれば、確度の低い認識文字列のＰＯＩを削除する（ステップＳ２０３）。具体的には、ＰＯＩ提示部５２２は、認識文字列に対応するＰＯＩリストを認識文字列間で比較し、互いのＰＯＩリストに同一のＰＯＩ名称を有するＰＯＩが含まれる場合には、確度の低い認識文字列に対応付けられたＰＯＩリストに含まれるＰＯＩをＰＯＩリストから削除して、重複を排除する。 Then, if there is a POI common to the recognized character strings, the POI presenting unit 522 deletes the POI of the recognized character string with low accuracy (step S203). Specifically, the POI presenting unit 522 compares the POI lists corresponding to the recognized character strings between the recognized character strings, and when the POIs having the same POI name are included in each POI list, the accuracy is low. The POI included in the POI list associated with the recognized character string is deleted from the POI list to eliminate duplication.

次に、ＰＯＩ提示部５２２は、類似する認識文字列間において、確度の低い認識文字列のＰＯＩを確度の高い認識文字列のＰＯＩリストに移動させる（ステップＳ２０４）。具体的には、ＰＯＩ提示部５２２は、認識文字列間において、文字列同士の類似する度合いが所定以上となる組み合わせを特定し、当該組み合わせにおいて、確度の低い認識文字列に対応付けられたＰＯＩリストに含まれるＰＯＩの情報を、確度の高い認識文字列に対応付けられたＰＯＩリストの下位に移動させて、ＰＯＩリストを統合する。 Next, the POI presenting unit 522 moves the POI of the recognized character string with low accuracy to the POI list of the recognized character string with high accuracy between similar recognized character strings (step S204). Specifically, the POI presenting unit 522 specifies a combination in which the degree of similarity between character strings is greater than or equal to a predetermined value between the recognized character strings, and the POI associated with the recognized character string with low accuracy in the combination. The POI list is integrated by moving the POI information included in the list to the lower level of the POI list associated with the recognized character string with high accuracy.

以上が、ＰＯＩ検索結果統合処理のフローである。ＰＯＩ検索結果統合処理によると、認識された文字列が類似する関係にある検索文字列により検索されたＰＯＩリストを統合しつつ、重複を排除したＰＯＩリストを得ることができる。 The above is the flow of the POI search result integration process. With the POI search result integration process, POI lists retrieved using recognized character strings that are similar to one another can be integrated, yielding POI lists from which duplicates have been eliminated.
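
Steps S201 to S204 can be sketched in one function, under stated assumptions. The data layout (a dict per recognized string holding `text`, `accuracy`, and a `pois` list of `(name, accuracy)` pairs), the function name, the use of `difflib.SequenceMatcher` for string similarity, and the 0.8 similarity threshold are all hypothetical choices for illustration.

```python
from difflib import SequenceMatcher

def integrate_poi_lists(entries, similarity=0.8):
    """Integrate per-string POI lists, following steps S201 to S204."""
    ranked = sorted(entries, key=lambda e: e["accuracy"], reverse=True)

    # S201: drop a lower-accuracy entry whose POI list matches a kept one exactly.
    kept = []
    for e in ranked:
        names = {name for name, _ in e["pois"]}
        if all(names != {n for n, _ in k["pois"]} for k in kept):
            kept.append({"text": e["text"], "accuracy": e["accuracy"],
                         "pois": list(e["pois"])})

    # S202: within one list, keep only the highest-accuracy POI per name.
    for e in kept:
        best = {}
        for name, acc in e["pois"]:
            if name not in best or acc > best[name]:
                best[name] = acc
        e["pois"] = list(best.items())

    # S203: a POI shared between lists stays only with the higher-accuracy string.
    seen = set()
    for e in kept:
        e["pois"] = [(n, a) for n, a in e["pois"] if n not in seen]
        seen.update(n for n, _ in e["pois"])

    # S204: append the POIs of a similar, lower-accuracy string to the
    # bottom of the higher-accuracy string's list.
    merged = []
    for e in kept:
        target = next((m for m in merged if SequenceMatcher(
            None, m["text"], e["text"]).ratio() >= similarity), None)
        if target is None:
            merged.append(e)
        else:
            target["pois"].extend(e["pois"])
    return merged
```

Sorting by accuracy first means every later step can treat an earlier entry as the one to keep, which avoids pairwise comparisons in both directions.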

図９は、ナビゲーション装置１００にて実施される、マイク選択処理の処理フローである。マイク選択処理は、ナビゲーション装置１００において、基本制御部１０１等が拡張マイクロフォン４３の接続を新たに検知した場合に実施される FIG. 9 is a processing flow of microphone selection processing performed in the navigation device 100. The microphone selection process is performed when the basic control unit 101 or the like newly detects the connection of the extension microphone 43 in the navigation device 100.

まず、マイク認識部１０７は、新たに接続されたマイクロフォン（以降、新規マイクと称呼）は既接続のマイクよりも近接するか否かを判定する（ステップＳ３０１）。具体的には、マイク認識部１０７は、新規マイクが発話者の口元付近に位置することが想定されるヘッドセットマイクであれば、当該新規マイクを最も「近接する」と判定する。新規マイクが外付けマイクであれば、内蔵のマイクロフォン４１より「近接する」と判定する。ただし、マイク認識部１０７は、ヘッドセットマイクが既接続であり、新規マイクが外付けマイクである場合には、新規マイクを「近接する」とは判定しない。 First, the microphone recognition unit 107 determines whether the newly connected microphone (hereinafter referred to as the new microphone) is closer to the speaker than the already connected microphone (step S301). Specifically, if the new microphone is a headset microphone, which is assumed to be positioned near the speaker's mouth, the microphone recognition unit 107 determines that the new microphone is the "closest". If the new microphone is an external microphone, it is determined to be "closer" than the built-in microphone 41. However, when a headset microphone is already connected and the new microphone is an external microphone, the microphone recognition unit 107 does not determine that the new microphone is "closer".

新規マイクが既接続のマイクよりも近接しない場合（ステップＳ３０１にて「Ｎｏ」の場合）には、マイク認識部１０７は、マイク選択処理を終了させる。 When the new microphone is not closer than the already connected microphone (in the case of “No” in step S301), the microphone recognizing unit 107 ends the microphone selection process.

新規マイクが既接続のマイクよりも近接する場合（ステップＳ３０１にて「Ｙｅｓ」の場合）には、マイク認識部１０７は、使用するマイクの変更を問い合わせる表示を行う（ステップＳ３０２）。具体的には、マイク認識部１０７は、「新しいマイクを通常使用するマイクに設定しますか？」等のメッセージとともに、メッセージに対する肯定／否定等の応答となる変更指示を受け付けるダイアログボックス等を出力するよう出力処理部１０３に指示する。 When the new microphone is closer than the already connected microphone ("Yes" in step S301), the microphone recognition unit 107 displays an inquiry about changing the microphone to be used (step S302). Specifically, the microphone recognition unit 107 instructs the output processing unit 103 to output a message such as "Do you want to set the new microphone as the default microphone?" together with a dialog box or the like that accepts a change instruction in the form of an affirmative or negative response to the message.

マイク認識部１０７は、表示させた問い合わせに対する肯定／否定等の応答となる変更指示を受け付けると、受け付けた指示が肯定の内容であるか否かを判定する（ステップＳ３０３）。肯定する内容ではない場合には、マイク認識部１０７は、マイク選択処理を終了させる。 When the microphone recognizing unit 107 receives a change instruction that becomes a response such as affirmative / negative for the displayed inquiry, the microphone recognizing unit 107 determines whether the received instruction is affirmative (step S303). If the content is not affirmative, the microphone recognition unit 107 ends the microphone selection process.

肯定する内容を受け付けた場合（ステップＳ３０３にて「Ｙｅｓ」の場合）には、マイク認識部１０７は、音声認識に使用するマイクを新規マイクへと変更する（ステップＳ３０４）。具体的には、マイク認識部１０７は、音声認識処理時に入力を受け付けるマイクロフォンの設定を、新規マイクに関連付ける。 When the content to be affirmed is received (in the case of “Yes” in step S303), the microphone recognizing unit 107 changes the microphone used for voice recognition to a new microphone (step S304). Specifically, the microphone recognizing unit 107 associates the setting of the microphone that receives input during the voice recognition processing with the new microphone.

以上が、マイク選択処理のフローである。マイク選択処理によると、新規マイクが認識されると、当該マイクが既接続のマイクよりも近接するものである場合には、当該マイクを音声認識に使用するか否か、使用者の指示に応じて設定することができる。なお、上記マイク選択処理は、使用者によりあらかじめ指定を受け付けたマイクがある場合には、当該マイクを優先して設定するようにしてもよい。 The above is the flow of the microphone selection process. With the microphone selection process, when a new microphone is recognized and it is closer than the already connected microphone, whether to use it for speech recognition can be set according to the user's instruction. If the user has designated a microphone in advance, the microphone selection process may give priority to that microphone.
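
The decision in steps S301 to S304 can be summarized in a short sketch. The proximity ranking table, the microphone kind labels, and the `confirm` callback standing in for the dialog box are hypothetical names introduced for the example.

```python
# Larger value = assumed to sit closer to the speaker's mouth.
PROXIMITY = {"builtin": 0, "external": 1, "headset": 2}

def select_microphone(current, new, confirm):
    """Return the microphone kind to use after `new` has been connected.

    confirm: callable taking the new microphone kind and returning True
    when the user answers the change inquiry affirmatively.
    """
    if PROXIMITY[new] <= PROXIMITY[current]:
        return current          # S301 "No": the new microphone is not closer
    if not confirm(new):
        return current          # S303 "No": the user declined the change
    return new                  # S304: switch speech recognition to the new microphone
```

Passing the dialog as a callback keeps the decision logic testable without a display.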

なお、マイク選択処理において新規マイクとして認識されるマイクは、ヘッドセットマイク等のマイクロフォンに限らず、例えば携帯電話等とブルートゥース（Ｂｌｕｅｔｏｏｔｈ：登録商標）接続されるマイクロフォンや、ＦＭトランスミッタ等によりナビゲーション装置１００と通信を行うマイクロフォン等、音声入力を受け付けることができるデバイスであればよい。 The microphone recognized as a new microphone in the microphone selection process is not limited to a microphone such as a headset microphone. Any device that can accept voice input may be used, for example a microphone connected via Bluetooth (registered trademark) to a mobile phone or the like, or a microphone that communicates with the navigation device 100 through an FM transmitter or the like.

次に、使用者が発話する際にマイク認識部１０７が実施するマイク発話時選択処理について、図１０を用いて説明する。マイク発話時選択処理は、複数のマイクにより入力を受け付けることができる場合に、音質の良好なマイクからの入力を選択的に採用して入力音質を高く維持する処理である。 Next, the microphone utterance selection process performed by the microphone recognition unit 107 when the user speaks will be described with reference to FIG. 10. When input can be received from a plurality of microphones, the microphone utterance selection process selectively adopts the input from a microphone with good sound quality, thereby keeping the input sound quality high.

まず、基本制御部１０１は、入力されているすべてのマイクにおいて音声を受け付ける（ステップＳ４０１）。なお、音声を受け付けるマイクは、すべてのマイクではなく、あらかじめ指定された複数のマイク、あるいはゲインの大きい順に選択された所定数のマイクであってもよい。 First, the basic control unit 101 receives sound in all input microphones (step S401). Note that the microphones that receive audio may not be all microphones, but may be a plurality of microphones designated in advance, or a predetermined number of microphones selected in descending order of gain.

次に、マイク認識部１０７は、マイクごとのノイズレベルを特定し、低レベルのマイクで受け付けた音声を採用する（ステップＳ４０２）。具体的には、マイク認識部１０７は、マイクごとに、入力された音声情報について、雑音レベル判定部１０４を介してノイズのレベル（Ｓ／Ｎ比）を特定し、ノイズ比が最も低い音声情報を採用して入力された音声として特定する。 Next, the microphone recognition unit 107 identifies the noise level for each microphone and adopts the voice received by the microphone with the lowest noise level (step S402). Specifically, for the voice information input from each microphone, the microphone recognition unit 107 identifies the noise level (S/N ratio) via the noise level determination unit 104, and adopts the voice information with the lowest proportion of noise as the input voice.

以上が、マイク発話時選択処理の処理フローである。マイク発話時選択処理によると、実際に発話された音声情報のうち、音質が良い音声情報を採用することができるため、車両等音響環境が随時変化する場合等において、発話ごとに最も良い音質で音声入力を行うことができるといえる。 The above is the processing flow of the microphone utterance selection process. According to this process, the voice information with good sound quality can be adopted from among the voice information actually spoken, so that voice input can be performed with the best available sound quality for each utterance even when the acoustic environment changes from time to time, as in a vehicle.
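Steps S401 and S402 amount to a per-utterance minimum search over the microphones. The following Python sketch shows one way this could look. The noise estimate used here (RMS of the quietest frame) is an assumption made only for the example; the description does not specify how the noise level determination unit 104 computes the noise level, and the names `noise_level` and `pick_input` are hypothetical.

```python
import math
from typing import Dict, Sequence


def rms(samples: Sequence[float]) -> float:
    """Root mean square of a non-empty block of samples."""
    return math.sqrt(sum(s * s for s in samples) / len(samples))


def noise_level(samples: Sequence[float], frame: int = 4) -> float:
    """Hypothetical noise estimate: RMS of the quietest frame."""
    if not samples:
        return float("inf")  # no signal: never preferred
    frames = [samples[i:i + frame]
              for i in range(0, len(samples) - frame + 1, frame)]
    if not frames:
        frames = [samples]  # input shorter than one frame
    return min(rms(f) for f in frames)


def pick_input(inputs: Dict[str, Sequence[float]]) -> str:
    """Step S402: adopt the microphone whose input has the lowest noise level."""
    return min(inputs, key=lambda name: noise_level(inputs[name]))
```

Because the choice is made again for every utterance, a microphone that was best a moment ago is not kept if the acoustic environment has since changed.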

以上、本発明の第一の実施形態について説明した。本発明の第一の実施形態によると、ナビゲーション装置１００は、より手軽に高い検索機能を利用することができる。なお、上記マイク発話時選択処理は、使用者による指定を受け付けたマイクがある場合には、当該マイクを優先して採用するようにしてもよい。 The first embodiment of the present invention has been described above. According to the first embodiment, the navigation device 100 can use a high-performance search function more easily. Note that, if there is a microphone designated by the user, the microphone utterance selection process may preferentially adopt that microphone.

本発明は、上記の実施形態に制限されない。上記の実施形態は、本発明の技術的思想の範囲内で様々な変形が可能である。例えば、図１１に示すように、音声認識された結果の文字列について、使用者の指示を待たずにＰＯＩ探索を行うよう、ＰＯＩ検索処理のフローを変更した第二の実施形態としてもよい。 The present invention is not limited to the above embodiment. The above embodiment can be variously modified within the scope of the technical idea of the present invention. For example, as shown in FIG. 11, a second embodiment may be adopted in which the flow of POI search processing is changed so that a POI search is performed without waiting for a user's instruction for a character string obtained as a result of speech recognition.

第二の実施形態について、以下に説明する。第二の実施形態は、基本的に第一の実施形態とほぼ同様の構成を備える検索システムであり、ＰＯＩ検索処理に相違がある。当該相違を中心に、以下に説明する。 A second embodiment will be described below. The second embodiment is a search system basically having substantially the same configuration as the first embodiment, and there is a difference in POI search processing. The following description will be focused on the difference.

まず、入力受付部１０２は、音声入力の待ち受けを開始する（ステップＳ５０１）。そして、入力受付部１０２は、ＰＴＴボタンの開放等により音声待ち受けを終了する（ステップＳ５０３）まで、音声区間を検出し、入力された音声情報を圧縮して音声情報を作成する（ステップＳ５０２）。なお、ここで、雑音レベル判定部１０４が、入力された音声情報の雑音レベルを判定する。そして、雑音レベルが所定よりも高い場合、すなわち雑音が多い環境では、入力受付部１０２は、圧縮率を低く設定して圧縮することとし、圧縮による音質劣化を最小限に留めるようにしてもよい。また、雑音レベルがさらに高く所定の閾値を超える場合、すなわち雑音が大きすぎて到底正常に音声認識を行うことができない程度に大きい環境においては、入力受付部１０２は、音声情報の作成を行わず、以降の処理を実施しないようにしてもよい。 First, the input reception unit 102 starts waiting for voice input (step S501). Then, until the voice standby ends, for example when the PTT button is released (step S503), the input reception unit 102 detects voice sections and compresses the input voice to create voice information (step S502). Here, the noise level determination unit 104 determines the noise level of the input voice information. When the noise level is higher than a predetermined level, that is, in a noisy environment, the input reception unit 102 may set a low compression rate so that sound quality deterioration due to compression is kept to a minimum. When the noise level is still higher and exceeds a predetermined threshold, that is, when the noise is so loud that voice recognition cannot be performed normally, the input reception unit 102 may refrain from creating voice information and skip the subsequent processing.
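The two-threshold policy described for step S502 can be written as a small decision function. The sketch below is illustrative only: the threshold values, the compression-rate constants, and the name `choose_compression` are assumptions, since the description says only "predetermined" values.

```python
from typing import Optional

# Hypothetical compression rates (higher = stronger compression).
LOW_COMPRESSION = 0.2
NORMAL_COMPRESSION = 0.6


def choose_compression(noise_level: float,
                       noisy: float = 0.3,
                       too_noisy: float = 0.7) -> Optional[float]:
    """Return the compression rate to use, or None to skip the utterance.

    `noisy` and `too_noisy` are illustrative thresholds, not values
    taken from the description.
    """
    if noise_level > too_noisy:
        # Recognition cannot succeed: do not create voice information.
        return None
    if noise_level > noisy:
        # Noisy environment: compress less to limit quality loss.
        return LOW_COMPRESSION
    return NORMAL_COMPRESSION
```

Returning `None` models the case in which no voice information is created and the subsequent steps are skipped.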

そして、入力受付部１０２は、中継サーバー通信部１０５を介して、中継サーバー装置５００へ音声情報を送信する。そして、中継サーバー装置５００の雑音除去処理部５２１は、受信した音声情報に対して、所定のアルゴリズムを実現する雑音除去処理を実施する（ステップＳ５０４）。具体的には、雑音除去処理部５２１は、雑音除去処理時に適用することをあらかじめ定められた一または複数のアルゴリズムにより、受信した音声情報に対して雑音除去処理を実施して、雑音を除去された一または複数の音声情報を生成する。 Then, the input reception unit 102 transmits the voice information to the relay server device 500 via the relay server communication unit 105. The noise removal processing unit 521 of the relay server device 500 then performs noise removal processing implementing a predetermined algorithm on the received voice information (step S504). Specifically, the noise removal processing unit 521 applies one or more algorithms determined in advance for noise removal to the received voice information, and generates one or more pieces of noise-removed voice information.
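Step S504 produces one noise-removed signal per applied algorithm. A minimal Python sketch of that fan-out follows. The two toy algorithms (`noise_gate` and `moving_average`) are stand-ins chosen for brevity; the description does not name the algorithms actually used, and `denoise_all` is a hypothetical helper.

```python
from typing import Callable, Dict, List

Signal = List[float]


def noise_gate(signal: Signal, threshold: float = 0.05) -> Signal:
    """Toy algorithm: zero out samples below an amplitude threshold."""
    return [s if abs(s) >= threshold else 0.0 for s in signal]


def moving_average(signal: Signal, width: int = 3) -> Signal:
    """Toy algorithm: smooth the signal with a centered moving average."""
    half = width // 2
    out = []
    for i in range(len(signal)):
        window = signal[max(0, i - half):i + half + 1]
        out.append(sum(window) / len(window))
    return out


# Hypothetical registry of predetermined algorithms.
ALGORITHMS: Dict[str, Callable[[Signal], Signal]] = {
    "gate": noise_gate,
    "smooth": moving_average,
}


def denoise_all(signal: Signal,
                algorithms: Dict[str, Callable[[Signal], Signal]] = ALGORITHMS
                ) -> Dict[str, Signal]:
    """Apply every algorithm and keep one output per algorithm."""
    return {name: fn(signal) for name, fn in algorithms.items()}
```

Keeping the outputs keyed by algorithm name makes it possible to treat the recognition results for each variant separately in the later weighting step.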

そして、ＰＯＩ提示部５２２は、雑音を除去された一または複数の音声情報を音声特定サーバー装置９００へ送信する。音声特定サーバー装置９００は、各音声情報に所定の音声認識処理を実施して認識した結果である候補である一または複数の文字列情報を中継サーバー装置５００へ送信する（ステップＳ５０５）。なお、当該音声認識処理においては、既存の音声認識等の処理が行われ、Ｎ−ｂｅｓｔ検索等により一または複数の認識結果の候補となる文字列がその確度とともに出力される。例えば、使用者が発話した音声情報が「ピザ」に該当するものである場合、「ピザ」、「Pizza」、「膝」、「いか」等の候補となる文字列が音声情報ごとに出力される。 Then, the POI presentation unit 522 transmits the one or more pieces of noise-removed voice information to the voice identification server device 900. The voice identification server device 900 performs predetermined voice recognition processing on each piece of voice information and transmits one or more pieces of character string information, which are the candidate recognition results, to the relay server device 500 (step S505). In this voice recognition processing, existing voice recognition techniques are used, and one or more character strings that are candidate recognition results are output together with their accuracy by N-best search or the like. For example, if the voice information spoken by the user corresponds to “ピザ” (pizza), candidate character strings such as “ピザ” (pizza), “Pizza”, “膝” (hiza, knee), and “いか” (ika, squid) are output for each piece of voice information.

そして、ＰＯＩ提示部５２２は、出力された認識結果の文字列情報を受け取り、認識結果に対して重みづけを行う（ステップＳ５０６）。具体的には、出力された認識結果の文字列情報は、雑音除去のアルゴリズムに応じて一または複数の候補が挙げられており、ＰＯＩ提示部５２２は、その中で重複する候補があれば一つに統合し、統合された候補についてはその確度をより高いもの（例えば、確度に所定の割合を上乗せする）として補正し、確度の順に候補の文字列を順位付けする。 Then, the POI presentation unit 522 receives the output character string information of the recognition results and weights the recognition results (step S506). Specifically, the output character string information contains one or more candidates for each noise removal algorithm. The POI presentation unit 522 merges any duplicate candidates into one, corrects the accuracy of a merged candidate upward (for example, by adding a predetermined ratio to the accuracy), and ranks the candidate character strings in order of accuracy.
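The weighting of step S506 can be sketched as a merge over the N-best lists returned for each noise removal algorithm. The sketch below makes two assumptions that the description leaves open: a merged candidate keeps the highest accuracy among its duplicates, and the "predetermined ratio" is applied multiplicatively (here 10%). The function name `rank_candidates` is hypothetical.

```python
from typing import Dict, Iterable, List, Tuple

Candidate = Tuple[str, float]  # (character string, accuracy)


def rank_candidates(results: Iterable[List[Candidate]],
                    bonus: float = 0.1) -> List[Candidate]:
    """Merge duplicate candidates across N-best lists and rank them."""
    best: Dict[str, float] = {}   # highest accuracy seen per string
    lists_with: Dict[str, int] = {}  # number of lists containing the string
    for n_best in results:
        seen = set()
        for text, accuracy in n_best:
            best[text] = max(best.get(text, 0.0), accuracy)
            if text not in seen:
                seen.add(text)
                lists_with[text] = lists_with.get(text, 0) + 1
    # Candidates found for more than one algorithm get the bonus.
    scored = [(text, acc * (1 + bonus) if lists_with[text] > 1 else acc)
              for text, acc in best.items()]
    return sorted(scored, key=lambda c: c[1], reverse=True)
```

A string recognized from several differently denoised signals is thus ranked above a string with similar accuracy that appears for only one of them.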

ＰＯＩ提示部５２２は、順位付けした候補の文字列（複数）を、ＰＯＩ提供サーバー装置９５０へ送信し、ＰＯＩの検索依頼を送信する（Ｓ０５０７）。 The POI presentation unit 522 transmits the ranked candidate character strings to the POI providing server device 950 together with a POI search request (step S507).

ＰＯＩ提供サーバー装置９５０は、送信された候補の文字列のそれぞれについて、施設名あるいは住所等に含むか、送信された候補の文字列のそれぞれに類似する文字列をその施設名あるいは住所等に含むＰＯＩをゆらぎ検索し、一または複数のＰＯＩの候補を確度別に検索し、当該ＰＯＩの名称、座標、電話番号、住所等を含む情報を含むＰＯＩリストを候補の文字列に対応付けて中継サーバー装置５００へ送信する（ステップＳ５０８）。 For each of the transmitted candidate character strings, the POI providing server device 950 performs a fuzzy search for POIs whose facility name, address, or the like contains the candidate character string or a character string similar to it, and retrieves one or more POI candidates by accuracy. It then transmits a POI list containing information such as the name, coordinates, telephone number, and address of each POI to the relay server device 500 in association with the candidate character string (step S508).
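The fuzzy search of step S508 can be approximated with Python's standard `difflib.SequenceMatcher`. This is a sketch under stated assumptions, not the search actually used by the POI providing server device 950: the POI records are plain dictionaries with hypothetical `name` and `address` keys, similarity is measured word by word (which suits space-separated text but not unsegmented Japanese), and the 0.8 threshold is arbitrary.

```python
from difflib import SequenceMatcher
from typing import Dict, List


def similarity(a: str, b: str) -> float:
    """Case-insensitive similarity ratio in [0, 1]."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def fuzzy_search(candidate: str,
                 pois: List[Dict[str, str]],
                 threshold: float = 0.8) -> List[Dict[str, str]]:
    """Return POIs whose name or address contains, or nearly contains,
    the candidate string, best match first."""
    hits = []
    for poi in pois:
        fields = (poi["name"], poi["address"])
        if any(candidate.lower() in f.lower() for f in fields):
            score = 1.0  # exact containment
        else:
            words = [w for f in fields for w in f.split()]
            score = max((similarity(candidate, w) for w in words),
                        default=0.0)
        if score >= threshold:
            hits.append((score, poi))
    hits.sort(key=lambda h: h[0], reverse=True)
    return [poi for _, poi in hits]
```

Running the function once per candidate character string and keeping the results keyed by that string gives the per-candidate POI lists that are returned to the relay server device 500.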

ＰＯＩ提示部５２２は、送信された候補の文字列ごとのＰＯＩリストを、ナビゲーション装置１００に対して送信する（ステップＳ５０９）。 The POI presentation unit 522 transmits the transmitted POI list for each candidate character string to the navigation device 100 (step S509).

ＰＯＩ提示情報作成部１０６は、受信したＰＯＩ検索結果を用いて、候補の文字列ごとに、各ＰＯＩを選択可能に表示する表示画面を作成し、出力処理部１０３に対してディスプレイ２に表示するよう指示する（ステップＳ５１０）。例えば、ＰＯＩ提示情報作成部１０６は、候補の文字列が「ピザ」、「Pizza」、「膝」である場合には、ピザ食を提供するレストランのリスト等を、候補の文字列ごとに選択可能に表示するとともに、当該レストランの座標位置に応じて、地図上の該当する位置に当該レストランのアイコンを表示する画面を作成する。 Using the received POI search results, the POI presentation information creation unit 106 creates a display screen on which each POI is selectably displayed for each candidate character string, and instructs the output processing unit 103 to display it on the display 2 (step S510). For example, when the candidate character strings are “ピザ” (pizza), “Pizza”, and “膝” (knee), the POI presentation information creation unit 106 creates a screen that selectably displays, for each candidate character string, a list of restaurants serving pizza or the like, and that displays an icon of each restaurant at the corresponding position on the map according to its coordinates.

そして、基本制御部１０１は、表示したＰＯＩの選択を、入力受付部１０２を介して受け付け、選択されたＰＯＩを目的地あるいは経由地とする経路探索を行う（ステップＳ５１１）。当該経路探索時には、基本制御部１０１は、選択されたＰＯＩの名称を含むルート探索メッセージを表示する。例えば、基本制御部１０１は、選択されたＰＯＩの名称が「東京ピザ」であれば、「東京ピザへのルートを探索します」というメッセージを表示して、当該ＰＯＩへの経路探索を実施する。 Then, the basic control unit 101 accepts a selection of a displayed POI via the input reception unit 102 and performs a route search with the selected POI as the destination or a waypoint (step S511). During the route search, the basic control unit 101 displays a route search message that includes the name of the selected POI. For example, if the name of the selected POI is “Tokyo Pizza”, the basic control unit 101 displays the message “Searching for a route to Tokyo Pizza” and performs the route search to that POI.

以上が、第二の実施形態に係るＰＯＩ検索処理のフローである。第二の実施形態に係るＰＯＩ検索処理によると、音声情報にもとづき認識された文字列の一または複数の候補それぞれについてＰＯＩを検索し提示することができるため、使用者の入力操作をより減らし、手軽に高い検索機能を利用できる。具体的には、ナビゲーション装置の処理能力は特別に高くなくとも、精度の高い音声認識および高機能なＰＯＩの検索機能を利用することができるといえる。 The above is the flow of the POI search process according to the second embodiment. According to the POI search process according to the second embodiment, the POI can be searched and presented for each of one or more candidates of the character string recognized based on the voice information, so that the user's input operation is further reduced, You can easily use the high search function. Specifically, even if the processing capability of the navigation device is not particularly high, it can be said that highly accurate voice recognition and a highly functional POI search function can be used.

また、例えば、第一の実施形態においては、図７においてプロセス間の関連を表したように、音声情報を発話終了後にまとめてナビゲーション装置１００から中継サーバー装置５００等へ送信しているが、これに限られない。すなわち、音声情報を発話中にナビゲーション装置１００から中継サーバー装置５００等へ送信して、雑音除去を順次実施するようにしてもよい。このような変形処理について、図１２を用いて説明する。 Further, for example, in the first embodiment, voice information is transmitted collectively from the navigation device 100 to the relay server device 500 and the like after the utterance ends, as shown by the relationship between processes in FIG. 7, but the invention is not limited to this. That is, voice information may be transmitted from the navigation device 100 to the relay server device 500 and the like during the utterance, and noise removal may be performed sequentially. Such a modification will be described with reference to FIG. 12.

まず、ナビゲーション装置１００の中継サーバー通信部１０５は、中継サーバー装置５００の送受信プロセス（ＰＯＩ提示部５２２により制御される）に対して音声情報の送信を開始する（ステップＳ６０１）。 First, the relay server communication unit 105 of the navigation device 100 starts transmitting voice information to the transmission / reception process (controlled by the POI presenting unit 522) of the relay server device 500 (step S601).

なお、中継サーバー通信部１０５は、音声情報の送信をすべて終える（ステップＳ６０７）まで、音声情報の送信を継続する。 Note that the relay server communication unit 105 continues to transmit the audio information until the transmission of all the audio information is completed (step S607).

中継サーバー装置５００の送受信プロセスでは、ＰＯＩ提示部５２２は、音声情報の送信が開始されると、中継サーバー通信部１０５からの音声情報の送信が終わるのを待たずに、雑音除去処理部５２１により制御される雑音除去プロセスへ音声情報の送信を開始する（ステップＳ６０２）。なお、すべての音声情報の送信を終えると、中継サーバー装置５００の送受信プロセスでは、雑音除去プロセスへの音声情報の送信が終了する（ステップＳ６０８）。 In the transmission/reception process of the relay server device 500, when transmission of the voice information starts, the POI presentation unit 522 starts transmitting the voice information to the noise removal process controlled by the noise removal processing unit 521, without waiting for the transmission from the relay server communication unit 105 to finish (step S602). When all the voice information has been transmitted, the transmission/reception process of the relay server device 500 ends the transmission of voice information to the noise removal process (step S608).

雑音除去プロセスでは、雑音除去処理部５２１により、送信された音声情報に対して所定の雑音除去処理が行われる。中継サーバー装置５００の雑音除去処理部５２１は、まず、受け取った音声情報の先頭の所定の時間（例えば、１００ミリ秒間）の無音部分について、雑音レベルの判定を行い、雑音レベルに応じて適切な雑音除去アルゴリズムを一つまたは複数決定する（ステップＳ６０３）。そして、中継サーバー装置５００の雑音除去処理部５２１は、決定した雑音除去アルゴリズムを適用して、受信している音声情報に対して雑音除去を行い（ステップＳ６０４）、雑音が除去された部分から順次、ＰＯＩ提示部５２２により制御されるサーバ通信プロセスへ送信開始する（ステップＳ６０５）。なお、雑音が除去されたすべての音声情報の送信が終わると、中継サーバー装置５００の雑音除去プロセスでは、サーバ通信プロセスへの音声情報の送信が終了する（ステップＳ６０９）。 In the noise removal process, the noise removal processing unit 521 performs predetermined noise removal processing on the transmitted voice information. First, the noise removal processing unit 521 of the relay server device 500 determines the noise level of the silent part at the beginning of the received voice information for a predetermined time (for example, 100 milliseconds), and determines one or more appropriate noise removal algorithms according to the noise level (step S603). Then, the noise removal processing unit 521 applies the determined noise removal algorithms to remove noise from the voice information being received (step S604), and starts transmitting the result, sequentially from the part from which noise has been removed, to the server communication process controlled by the POI presentation unit 522 (step S605). When all the noise-removed voice information has been transmitted, the noise removal process of the relay server device 500 ends the transmission of voice information to the server communication process (step S609).
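Step S603 estimates the noise level from the leading silent portion and picks algorithms accordingly. The Python sketch below illustrates the idea. The RMS measure, the threshold values, and the algorithm labels (`"spectral_subtraction"`, `"wiener_filter"`) are assumptions for illustration only; the description states just that one or more appropriate algorithms are determined from the noise level.

```python
import math
from typing import List, Sequence


def leading_noise_level(samples: Sequence[float],
                        sample_rate: int,
                        window_ms: int = 100) -> float:
    """RMS of the first `window_ms` milliseconds, assumed to be silence."""
    n = max(1, sample_rate * window_ms // 1000)
    head = samples[:n]
    if not head:
        return 0.0
    return math.sqrt(sum(s * s for s in head) / len(head))


def choose_algorithms(level: float) -> List[str]:
    """Map a noise level to algorithm labels (illustrative thresholds)."""
    if level < 0.01:
        return ["none"]
    if level < 0.1:
        return ["spectral_subtraction"]
    # Heavy noise: try more than one algorithm and compare results later.
    return ["spectral_subtraction", "wiener_filter"]
```

Since only the first 100 milliseconds are needed, the decision can be made while the rest of the utterance is still arriving.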

雑音が除去された音声情報を受信すると、サーバ通信プロセスでは、ＰＯＩ提示部５２２は、雑音除去プロセスから送られた雑音が除去された音声情報を順次、音声特定サーバー装置９００へ送信開始する（ステップＳ６０６）。なお、ここでは、雑音除去のアルゴリズム別に音声情報が存在する場合には、ＰＯＩ提示部５２２は、異なる雑音除去のアルゴリズムが適用された複数の音声情報をすべて送信する。 When the noise-removed voice information is received, in the server communication process, the POI presentation unit 522 starts sequentially transmitting the noise-removed voice information sent from the noise removal process to the voice identification server device 900 (step S606). Here, when voice information exists for each noise removal algorithm, the POI presentation unit 522 transmits all of the pieces of voice information to which the different noise removal algorithms have been applied.

そして、雑音が除去された音声情報の送信が終わると、中継サーバー装置５００のサーバ通信プロセスでは、音声特定サーバー装置９００への音声情報の送信が終了する（ステップＳ６１０）。 When the transmission of the noise-removed voice information is finished, the server communication process of the relay server device 500 ends the transmission of voice information to the voice identification server device 900 (step S610).

そして、音声特定サーバー装置９００は、受信した雑音除去された一または複数の音声情報に対して、所定の音声認識処理を行い、認識した結果得られた候補となる文字列をＮ−ｂｅｓｔ検索により一または複数特定する（ステップＳ６１１）。 Then, the voice identification server device 900 performs predetermined voice recognition processing on the one or more pieces of noise-removed voice information it has received, and identifies one or more candidate character strings obtained as recognition results by N-best search (step S611).

そして、中継サーバー装置５００のサーバ通信プロセスでは、ＰＯＩ提示部５２２は、音声特定サーバー装置９００から送信された候補となる文字列をすべて受信する（ステップＳ６１２）。 Then, in the server communication process of the relay server device 500, the POI presentation unit 522 receives all candidate character strings transmitted from the voice identification server device 900 (step S612).

中継サーバー装置５００のサーバ通信プロセスでは、ＰＯＩ提示部５２２は、送受信プロセスに対して、受信した文字列をすべて送る（ステップＳ６１３）。 In the server communication process of relay server device 500, POI presentation unit 522 sends all received character strings to the transmission / reception process (step S613).

中継サーバー装置５００の送受信プロセスでは、ＰＯＩ提示部５２２により、ナビゲーション装置１００の中継サーバー通信部１０５に認識結果の文字列が送信される（ステップＳ６１４）。 In the transmission / reception process of the relay server device 500, the POI presentation unit 522 transmits the character string of the recognition result to the relay server communication unit 105 of the navigation device 100 (step S614).

以上が、ＰＯＩ検索処理におけるステップＳ００２〜ステップＳ００７におけるプロセス間の関連の変形例である。なお、雑音除去（ステップＳ００４）や認識結果の重みづけ（ステップＳ００６）等の処理については、当該プロセス間の関係の説明では詳細な説明を省略している。 The above is a modified example of the relationship between processes in steps S002 to S007 in the POI search process. Note that detailed explanations of processes such as noise removal (step S004) and recognition result weighting (step S006) are omitted in the description of the relationship between the processes.

このように変形することで、音声情報の発話から音声認識の開始までをリアルタイムに行うことができるため、音声認識処理の開始タイミングを早めることができ、応答性を高くすることができる。 With this modification, the steps from the utterance to the start of voice recognition can be performed in real time, so that voice recognition processing can start earlier and responsiveness can be improved.
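The chunk-by-chunk forwarding of steps S601 to S610 behaves like a chain of lazy stages. The Python sketch below models the three hops with generators so that the order of events can be observed. It is a single-process illustration only: the real system runs separate processes on separate devices, and the function names, the noise gate used as the denoiser, and the event log are all hypothetical.

```python
from typing import Iterable, Iterator, List

Chunk = List[float]


def terminal_send(chunks: Iterable[Chunk], log: List[str]) -> Iterator[Chunk]:
    """Navigation device side: send chunks as they are captured (S601)."""
    for i, chunk in enumerate(chunks):
        log.append(f"send {i}")
        yield chunk


def relay_denoise(stream: Iterable[Chunk], log: List[str],
                  threshold: float = 0.05) -> Iterator[Chunk]:
    """Relay server side: denoise each chunk on arrival (S604, S605)."""
    for i, chunk in enumerate(stream):
        log.append(f"denoise {i}")
        # Toy noise gate standing in for the real algorithm.
        yield [s if abs(s) >= threshold else 0.0 for s in chunk]


def forward_to_recognizer(stream: Iterable[Chunk], log: List[str]) -> Chunk:
    """Relay server side: forward each denoised chunk at once (S606)."""
    received: Chunk = []
    for i, chunk in enumerate(stream):
        log.append(f"forward {i}")
        received.extend(chunk)
    return received
```

With two chunks, the log reads send 0, denoise 0, forward 0, send 1, denoise 1, forward 1: the first chunk reaches the recognizer before the second has even been sent, which is the source of the improved responsiveness.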

以上、本発明について、実施形態を中心に説明した。なお、上記の各実施形態では、本発明をナビゲーション装置等に適用した例について説明したが、本発明はナビゲーション装置に限らず、情報端末全般に適用することができる。また、各実施形態においてナビゲーション装置１００で実施する処理および当該処理を実施するのに用いられる処理部は、他の端末装置（例えば、通信装置１２を介した携帯電話、スマートフォン等）に設けられ、当該処理の一部をナビゲーション装置１００と他の端末装置との間で分散させて処理するようにしてもよい。 The present invention has been described above with a focus on the embodiments. In each of the above embodiments, examples in which the present invention is applied to a navigation device or the like have been described, but the present invention is not limited to navigation devices and can be applied to information terminals in general. In each embodiment, the processing performed by the navigation device 100 and the processing units used to perform it may be provided in another terminal device (for example, a mobile phone or smartphone connected via the communication device 12), and part of the processing may be distributed between the navigation device 100 and the other terminal device.

１・・・演算処理部、２・・・ディスプレイ、３・・・記憶装置、４・・・音声出入力装置、５・・・入力装置、６・・・ＲＯＭ装置、７・・・車速センサ、８・・・ジャイロセンサ、９・・・ＧＰＳ受信装置、１０・・・ＦＭ多重放送受信装置、１１・・・ビーコン受信装置、１２・・・通信装置、２１・・・ＣＰＵ、２２・・・ＲＡＭ、２３・・・ＲＯＭ、２４・・・Ｉ／Ｆ、２５・・・バス、３０・・・ネットワーク、４１・・・マイクロフォン、４２・・・スピーカ、４３・・・拡張マイクロフォン、５１・・・タッチパネル、５２・・・ダイヤルスイッチ、１００・・・ナビゲーション装置、１０１・・・基本制御部、１０２・・・入力受付部、１０３・・・出力処理部、１０４・・・雑音レベル判定部、１０５・・・中継サーバー通信部、１０６・・・ＰＯＩ提示情報作成部、１０７・・・マイク認識部、２００・・・リンクテーブル、５００・・・中継サーバー装置、５１０・・・記憶部、５２０・・・制御部、５３０・・・送受信部、１０００・・・検索システム DESCRIPTION OF SYMBOLS 1 ... Arithmetic processing unit, 2 ... Display, 3 ... Storage device, 4 ... Voice input/output device, 5 ... Input device, 6 ... ROM device, 7 ... Vehicle speed sensor, 8 ... Gyro sensor, 9 ... GPS receiver, 10 ... FM multiplex broadcast receiver, 11 ... Beacon receiver, 12 ... Communication device, 21 ... CPU, 22 ... RAM, 23 ... ROM, 24 ... I/F, 25 ... Bus, 30 ... Network, 41 ... Microphone, 42 ... Speaker, 43 ... Extension microphone, 51 ... Touch panel, 52 ... Dial switch, 100 ... Navigation device, 101 ... Basic control unit, 102 ... Input reception unit, 103 ... Output processing unit, 104 ... Noise level determination unit, 105 ... Relay server communication unit, 106 ... POI presentation information creation unit, 107 ... Microphone recognition unit, 200 ... Link table, 500 ... Relay server device, 510 ... Storage unit, 520 ... Control unit, 530 ... Transmission/reception unit, 1000 ... Search system

上記課題を解決すべく、本発明に係るサーバー装置は、ネットワークを介して所定の情報端末から音声情報を受信する音声情報受信手段と、複数の雑音除去アルゴリズムを適用して上記音声情報から雑音情報を除去し、適用された上記雑音除去アルゴリズムに応じて、雑音を除去された音声情報を複数生成する雑音除去手段と、雑音を除去された上記音声情報を、所定の音声認識装置へ上記ネットワークを介して送信する音声送信手段と、上記音声認識装置による上記音声情報の認識結果である複数の文字列候補を、上記ネットワークを介して受信する文字列受信手段と、適用された雑音除去のアルゴリズムに応じて重み付けが行われた一又は複数の上記文字列候補を、ＰＯＩ（ＰｏｉｎｔＯｆＩｎｔｅｒｅｓｔ）の情報を提供する所定のＰＯＩ提供装置へ上記ネットワークを介して送信する文字列送信手段と、上記文字列候補に関連するＰＯＩの情報を、上記ネットワークを介して受信するＰＯＩ情報受信手段と、上記ＰＯＩの情報を上記情報端末へ送信するＰＯＩ情報送信手段と、を備えることを特徴とする。
In order to solve the above problem, a server device according to the present invention comprises: voice information receiving means for receiving voice information from a predetermined information terminal via a network; noise removing means for applying a plurality of noise removal algorithms to remove noise information from the voice information and generating a plurality of pieces of noise-removed voice information, one for each applied noise removal algorithm; voice transmission means for transmitting the noise-removed voice information to a predetermined voice recognition device via the network; character string receiving means for receiving, via the network, a plurality of character string candidates that are the results of recognition of the voice information by the voice recognition device; character string transmitting means for transmitting, via the network, one or more of the character string candidates, weighted according to the applied noise removal algorithm, to a predetermined POI providing device that provides POI (Point Of Interest) information; POI information receiving means for receiving, via the network, POI information related to the character string candidates; and POI information transmitting means for transmitting the POI information to the information terminal.

Claims

Voice input receiving means for receiving voice input;
Communication means for communicating with a predetermined server device via a network;
Output means;
POI specifying means for transmitting the voice information received by the voice input receiving means to the server device and receiving information specifying POI (Point Of Interest) candidates related to the voice information;
POI candidate output means for outputting information specifying the POI candidates received by the POI specifying means to the output means;
Route search means for receiving a selection input of information for identifying the POI candidate and searching for a route to the POI;
An information terminal comprising:

The information terminal according to claim 1,
The POI specifying means further,
when a plurality of character string candidates obtained as a result of recognizing the voice information are received from the server device, outputs the received character string candidates to the output means in a selectable manner and accepts a selection input of one of the character string candidates,
An information terminal characterized by that.

The information terminal according to claim 1 or 2,
The POI specifying means is:
converts the voice received by the voice input receiving means into information compressed at a predetermined compression rate and transmits it to the server device, and,
if the quality of the voice received by the voice input receiving means is less than or equal to a predetermined value, sets the predetermined compression rate so as to reduce deterioration,
An information terminal characterized by that.

The information terminal according to any one of claims 1 to 3,
The POI specifying means does not transmit the voice information to the server device when the quality of the voice received by the voice input receiving means is below a predetermined level.
An information terminal characterized by that.

The information terminal according to any one of claims 1 to 4, wherein
The POI specifying means sequentially transmits the voice received by the voice input receiving means to the server device.
An information terminal characterized by that.

The information terminal according to any one of claims 1 to 5,
The POI candidate output means includes:
In the process of outputting the information specifying the POI candidates received by the POI specifying means to the output means, when there are a plurality of character strings obtained as a result of recognizing the voice information, outputs the POI candidates to the output means in association with each character string,
An information terminal characterized by that.

Comprising voice input receiving means for receiving one or more voice inputs;
The voice input receiving means identifies and adopts a voice input with better sound quality when receiving a plurality of voice inputs;
An information terminal characterized by that.

Control means;
One or a plurality of voice input receiving means,
The voice input receiving means receives one or a plurality of voice inputs;
When the control means includes a plurality of the voice input receiving means, the control means adopts a voice input receiving means close to the user.
An information terminal characterized by that.

Control means;
One or a plurality of voice input receiving means,
The voice input receiving means receives one or a plurality of voice inputs;
When the control means includes a plurality of voice input reception means, if there is a voice input reception means designated in advance by the user, the designated voice input reception means is adopted.
An information terminal characterized by that.

Voice information receiving means for receiving voice information from a predetermined information terminal via a network;
Noise removing means for removing noise information from the voice information;
Voice transmission means for transmitting the voice information from which noise has been removed to a predetermined voice recognition device via the network;
POI information receiving means for receiving POI (Point Of Interest) information related to the character string via the network;
POI information transmitting means for transmitting the POI information to the information terminal;
A server apparatus comprising:

The server device according to claim 10, further comprising:
A character string receiving means for receiving a character string, which is a recognition result of the voice information by the voice recognition device, via the network;
A character string transmitting means for transmitting the character string via the network to a predetermined POI providing device that provides POI (Point Of Interest) information related to the character string;
A server apparatus comprising:

The server device according to claim 11, further comprising:
If the character string received by the character string receiving means includes a plurality of character string candidates, the character string candidate transmitting means for transmitting the character string candidates to the information terminal;
A selection character string receiving means for receiving, from the information terminal, one candidate character string selected from the character string candidates;
The character string transmitting unit transmits the one candidate character string via the network to the POI providing device when the selected character string receiving unit receives the one candidate character string.
A server device characterized by that.

The server device according to claim 11, further comprising:
When the character string received by the character string receiving means includes a plurality of character string candidates, the character string transmitting means transmits the plurality of character strings via the network to the POI providing device,
The POI information receiving means receives information on a plurality of POIs associated with the plurality of character strings.
A server device characterized by that.

The server device according to any one of claims 10 to 13, further comprising:
POI deduplication means for eliminating duplication when the POI information received by the POI information receiving means includes a plurality of POIs;
A server apparatus comprising:

The server device according to any one of claims 10 to 14,
The noise removing means generates a plurality of speech information from which noise has been removed using a plurality of noise removing algorithms.
A server device characterized by that.

The server device according to any one of claims 10 to 15, wherein
The noise removing means includes
When the voice information receiving means starts receiving voice information from a predetermined information terminal via a network, noise information is sequentially removed from the received information,
The voice transmission means sequentially transmits voice information from which noise has been removed by the noise removal means to a predetermined voice recognition device via the network.
A server device characterized by that.

A search system comprising an information terminal and a server device that communicates with the information terminal via a network,
The information terminal
Voice input receiving means for receiving voice input;
Communication means for communicating with the server device via a network;
Output means;
POI specifying means for transmitting the voice information received by the voice input receiving means to the server device and receiving information specifying POI (Point Of Interest) candidates related to the voice information;
POI candidate output means for outputting information specifying the POI candidates received by the POI specifying means to the output means;
Route search means for receiving a selection input of information for identifying the POI candidate and searching for a route to the POI;
With
The server device is
Voice information receiving means for receiving voice information from the information terminal via a network;
Noise removing means for removing noise information from the voice information;
Voice transmission means for transmitting the voice information from which noise has been removed to a predetermined voice recognition device via the network;
POI information receiving means for receiving POI (Point Of Interest) information related to the character string via the network;
POI information transmitting means for transmitting the POI information to the information terminal;
A search system comprising:

A search method for a search system, comprising: an information terminal; and a server device that communicates with the information terminal via a network,
The search system includes:
Voice input receiving means for receiving voice input;
An output means,
A transmitting step of transmitting information of the sound received by the sound input receiving means to the server device;
A voice information receiving step for receiving voice information from the information terminal via the network;
A noise removing step for removing noise information from the voice information;
A voice transmission step of transmitting the voice information from which noise has been removed to a predetermined voice recognition device via the network;
A POI information receiving step of receiving POI (Point Of Interest) information related to the character string from the voice recognition device via the network;
A POI information transmission step of transmitting the POI information to the information terminal;
A POI specifying step for receiving the POI information;
A POI candidate output step of outputting the POI information received in the POI specifying step to the output means;
The search method characterized by implementing.