JP5300576B2

JP5300576B2 - SEARCH DEVICE, SEARCH METHOD, AND SEARCH PROGRAM

Info

Publication number: JP5300576B2
Application number: JP2009104474A
Authority: JP
Inventors: 聡士田原; 優網谷
Original assignee: NTT Communications Corp
Current assignee: NTT Communications Corp
Priority date: 2009-04-22
Filing date: 2009-04-22
Publication date: 2013-09-25
Anticipated expiration: 2029-04-22
Also published as: JP2010257085A

Description

The present invention relates to a search device, a search method, and a search program that perform a search based on a search sentence input as text or voice.

Search systems and search engines exist that search a predetermined database or information on a network based on search conditions input by a user. One service that provides such a search engine is described in Non-Patent Document 1.

“goo Search”, [online], [retrieved April 1, 2009], Internet <URL: http://search.goo.ne.jp/web.jsp?MT=>

When search conditions are entered as text using an input device such as a keyboard, a user who is unfamiliar with keyboard operation needs time to enter the text and is prone to input errors, so the input burden on the user is large.

When search conditions are entered by voice using an input device such as a microphone, misrecognition is likely to occur during speech recognition, and searching with the misrecognized search conditions presents the user with unintended search results.

The present invention has been made in view of the above circumstances. Its object is to reduce the user's input burden, to let the user enter the intended search conditions more easily, and thereby to further improve user convenience.

To solve the above problem, the present invention provides a search device comprising: text input receiving means for receiving input of a text search sentence; morphological analysis means for performing morphological analysis on the text search sentence, assigning a common meaning ID to each morpheme of the analysis result, and generating text data with common meaning IDs; voice input receiving means for receiving input of a voice search sentence; speech recognition means for recognizing the voice search sentence and converting it into text, assigning a common meaning ID to each morpheme of the converted text, and generating text data with common meaning IDs; search condition analysis means for analyzing the text data with common meaning IDs generated by the morphological analysis means or the speech recognition means and extracting a search condition and the common meaning ID corresponding to the search condition; search condition management means for storing the search condition and the corresponding common meaning ID in search condition storage means; presenting means for presenting the search condition stored in the search condition storage means; and search means for searching using the search condition and outputting a search result.

The present invention also provides a search method performed by a computer, comprising: an input receiving step of receiving input of a text search sentence or input of a voice search sentence; a generation step of, when the text search sentence is received, performing morphological analysis on the text search sentence, assigning a common meaning ID to each morpheme of the analysis result, and generating text data with common meaning IDs, and, when the voice search sentence is received, recognizing the voice search sentence and converting it into text, assigning a common meaning ID to each morpheme of the converted text, and generating text data with common meaning IDs; a search condition analysis step of analyzing the text data with common meaning IDs generated in the generation step and extracting a search condition and the common meaning ID corresponding to the search condition; a search condition management step of storing the search condition and the corresponding common meaning ID in a search condition storage unit; a presenting step of presenting the search condition stored in the search condition storage unit; and a search step of searching using the search condition and outputting a search result.

The present invention also provides a search program executed by a computer, the program causing the computer to execute: an input receiving step of receiving input of a text search sentence or input of a voice search sentence; a generation step of, when the text search sentence is received, performing morphological analysis on the text search sentence, assigning a common meaning ID to each morpheme of the analysis result, and generating text data with common meaning IDs, and, when the voice search sentence is received, recognizing the voice search sentence and converting it into text, assigning a common meaning ID to each morpheme of the converted text, and generating text data with common meaning IDs; a search condition analysis step of analyzing the text data with common meaning IDs generated in the generation step and extracting a search condition and the common meaning ID corresponding to the search condition; a search condition management step of storing the search condition and the corresponding common meaning ID in a search condition storage unit; a presenting step of presenting the search condition stored in the search condition storage unit; and a search step of searching using the search condition and outputting a search result.

According to the present invention, the user's input burden can be reduced, the user can enter the intended search conditions more easily, and user convenience can be further improved.

FIG. 1 is a block diagram showing the configuration of a search device to which an embodiment of the present invention is applied.
FIG. 2 is a flowchart of the search process of this embodiment.
FIG. 3 is a diagram showing an example of the common dictionary.

Embodiments of the present invention are described below.

FIG. 1 is a block diagram showing the configuration of a search device according to an embodiment of the present invention. The search device 1 of this embodiment analyzes input information (a search sentence) entered by voice or text from a PC (personal computer) 3 via a network such as the Internet, generates search conditions, and transmits to the PC 3 the results of a search performed with those conditions.

The illustrated search device 1 includes a text input receiving unit 11, a morphological analysis unit 12, a voice input receiving unit 13, a speech recognition unit 14, a search condition analysis unit 15, a search condition management unit 16, a search condition storage unit 17, a search condition display unit 18, a search unit 19, a search database (search DB) 20, a search result output unit 21, and a common dictionary 22.

The text input receiving unit 11 receives a search sentence input as text from the PC 3, and the voice input receiving unit 13 receives a search sentence input by voice from the PC 3. The morphological analysis unit 12 performs morphological analysis on the text search sentence received by the text input receiving unit 11, assigns a common meaning ID to each morpheme of the analysis result by referring to the common dictionary 22, and generates text data with common meaning IDs.

The speech recognition unit 14 recognizes the voice search sentence received by the voice input receiving unit 13 and converts it into text, assigns a common meaning ID to each morpheme of the converted text, and generates text data with common meaning IDs.

The search condition analysis unit 15 analyzes the text data with common meaning IDs generated by the morphological analysis unit 12 or the speech recognition unit 14, and extracts search conditions and the common meaning IDs corresponding to those search conditions. The search condition management unit 16 stores the search conditions analyzed by the search condition analysis unit 15 and the corresponding common meaning IDs in the search condition storage unit 17.

The search condition display unit 18 presents the search conditions stored in the search condition storage unit 17 to the user by transmitting them to the PC 3 and displaying them on the display of the PC 3. The search unit 19 uses the search conditions stored in the search condition storage unit 17 to search the search DB 20, which stores various kinds of information, or a database on the network, and outputs the search results to the search result output unit 21. The search result output unit 21 presents the search results of the search unit 19 to the user by transmitting them to the PC 3 and displaying them on the display of the PC 3. The common dictionary 22 stores words (morphemes) to which common meaning IDs are assigned, in association with the reading and notation of each word.

The search device 1 described above can be implemented on a general-purpose computer system that includes a CPU, a memory, an external storage device, an input device, an output device, and a communication control device for connecting to a network. In this computer system, each function of the search device 1 is realized by the CPU executing a predetermined program loaded into the memory.

Next, the search process of this embodiment is described.

FIG. 2 is a flowchart showing the search process of this embodiment. First, the search device 1 receives a search request from the PC 3 and starts accepting input of a voice or text search sentence (S11). The user either speaks the search sentence into a voice input device such as a microphone provided on the PC 3, or types the search sentence as text (characters) using a text input device such as a keyboard. In this embodiment, the user can thus enter the search sentence by whichever method (voice input or text input) the user prefers.

When a text search sentence is input, the text input receiving unit 11 receives it; when a voice search sentence is input, the voice input receiving unit 13 receives it (S12).

To prevent simultaneous voice and text input, when a text search sentence is input (S13: text), the voice input receiving unit 13 stops accepting voice input of the search sentence (S14); when a voice search sentence is input (S13: voice), the text input receiving unit 11 stops accepting text input of the search sentence (S15). In this way, voice input and text input are mutually exclusive.
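The exclusive control of S13 to S15 can be pictured as a small arbiter that gives the turn to whichever input mode arrives first. The following is only a sketch under assumptions: the names `InputMode` and `InputArbiter` are hypothetical and do not appear in the specification, and a real device would also have to handle concurrency and timeouts.

```python
from enum import Enum
from typing import Optional


class InputMode(Enum):
    TEXT = "text"
    VOICE = "voice"


class InputArbiter:
    """Hypothetical helper: the first input mode wins and the other is stopped."""

    def __init__(self) -> None:
        # No mode is active until the first input arrives (S11).
        self.active: Optional[InputMode] = None

    def begin(self, mode: InputMode) -> bool:
        """Return True if this mode may proceed, False if the other mode holds the turn."""
        if self.active is None:
            self.active = mode
            return True
        return self.active is mode

    def reset(self) -> None:
        """Re-open both inputs, for example when the flow returns to S11."""
        self.active = None


if __name__ == "__main__":
    arbiter = InputArbiter()
    print(arbiter.begin(InputMode.TEXT))   # text arrives first
    print(arbiter.begin(InputMode.VOICE))  # voice is refused while text is active
```

Calling `reset()` corresponds to returning to S11, where both input methods are accepted again for an additional correction instruction.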

When a voice search sentence is input (S13: voice), the speech recognition unit 14 uses speech recognition technology to analyze the speech content and convert it into text, assigns a common meaning ID to each morpheme (word) of the converted text by referring to the common dictionary 22, and generates text data with common meaning IDs (S16). That is, the speech recognition unit 14 converts the speech into text by speech recognition processing and adds a common meaning ID to each morpheme recognized during that processing. Morphemes are described later.

Speech recognition technology is processing in which a computer analyzes human spoken language and extracts the spoken content as character data; various techniques exist, such as hidden Markov models and acoustic models. The speech recognition unit 14 may, for example, use a speech recognition engine such as “VoiceRex” to recognize the speech morpheme by morpheme (word by word) and convert it into text.

“VoiceRex” is described at “http://www.ntt.co.jp/RD/OFIS/active/2007pdf/pdf/g_ct07.pdf#search='VoiceRex'”.

Specifically, suppose for example that the voice search sentence “あー、平成２０年に起きた火事の情報を知りたいです” (“Uh, I want to know about the fire that occurred in Heisei 20”) is input in S12. In this case, the speech recognition unit 14 splits it into morphemes as “平成２０年／に／起き／た／火事／の／情報／を／知り／たい” and adds to each morpheme the common meaning ID obtained from the common dictionary 22.

FIG. 3 shows an example of the common dictionary 22. The common dictionary 22 is used for both voice search sentences and text search sentences. For each group of morphemes having the same meaning, it holds a common meaning ID, a meaning ID, a reading, and a notation. As a concrete example, FIG. 3 shows “２００８年” (2008) and “平成２０年” (Heisei 20) as words expressing a year.

In voice input there is no notion of “notation”: whether the user says “ニセンハチネン” (nisen hachi nen) or “ヘイセイニジュウネン” (heisei nijū nen), it suffices to recognize the input as “２００８年”, so for voice the dictionary only needs the voice data 221 portion (two records). Likewise, for text input the dictionary only needs the text data 222 portion (two records).

The common dictionary 22 of this embodiment contains three records, as illustrated, forming a dictionary that includes and integrates the voice data 221 and the text data 222. A common meaning ID shared by voice and text (Y2008) is assigned so that a search sentence input by voice can be corrected by text, and a search sentence input by text can be corrected by voice. Using such a common dictionary makes it possible to handle variation in input (multiple different inputs that have the same meaning).
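One way to picture such a common dictionary is as a list of records that can be looked up either by reading (for recognized speech) or by notation (for typed text), with both paths returning the same common meaning ID. The sketch below is an assumption-laden illustration: the exact three records of FIG. 3 are not reproduced in the text, so the entries are hypothetical, the meaning ID column is omitted because its values are not given, and the NFKC normalization of full-width digits is an added convenience rather than something the specification describes.

```python
import unicodedata
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class DictEntry:
    common_id: str  # ID shared by voice and text input, e.g. "Y2008"
    reading: str    # katakana reading matched against recognized speech
    notation: str   # written form matched against typed text


# Hypothetical records: two readings and two notations share one common meaning ID.
COMMON_DICTIONARY: List[DictEntry] = [
    DictEntry("Y2008", "ニセンハチネン", "2008年"),
    DictEntry("Y2008", "ヘイセイニジュウネン", "2008年"),
    DictEntry("Y2008", "ヘイセイニジュウネン", "平成20年"),
]


def _norm(s: str) -> str:
    # NFKC folds full-width digits such as "２" into ASCII "2".
    return unicodedata.normalize("NFKC", s)


def lookup_by_reading(reading: str) -> Optional[str]:
    """Voice path: find the common meaning ID for a recognized reading."""
    for entry in COMMON_DICTIONARY:
        if entry.reading == reading:
            return entry.common_id
    return None


def lookup_by_notation(notation: str) -> Optional[str]:
    """Text path: find the common meaning ID for a typed notation."""
    key = _norm(notation)
    for entry in COMMON_DICTIONARY:
        if _norm(entry.notation) == key:
            return entry.common_id
    return None
```

Because both lookups return the same ID, a condition entered by voice as “ヘイセイニジュウネン” and a correction typed as “２００８年” refer to the same stored item.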

The speech recognition unit 14 refers to the common dictionary 22. When the recognized morpheme is “平成２０年” (Heisei 20), it obtains the common meaning ID of that word (Y2008) using the voice data 221, and obtains common meaning IDs for the other morphemes in the same way. It then generates text data with common meaning IDs by adding the common meaning ID of each morpheme to the text data split into morphemes.

A concrete example of text data with common meaning IDs is “平成２０年／に／起き／た／火事／の／情報／を／知り／たい：Ｙ２００８／ＩＤＸＸ／ＩＤＹＹ／・・・・”.
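The format of this ID-tagged text data can be sketched as two slash-separated sequences joined by a colon: the morphemes, then their common meaning IDs in the same order. The code below is a minimal illustration that takes already segmented morphemes as input. The `ID_TABLE` contents, the mapping of “火事” to `FIRE`, and the `-` placeholder for morphemes without an entry are assumptions made for the example; the specification only shows Y2008 and elided IDs such as IDXX.

```python
from typing import Dict, List, Tuple

# Hypothetical lookup table from morpheme surface form to common meaning ID.
ID_TABLE: Dict[str, str] = {"平成20年": "Y2008", "2008年": "Y2008", "火事": "FIRE"}
UNKNOWN_ID = "-"  # assumed placeholder for morphemes that have no entry


def tag_morphemes(morphemes: List[str]) -> List[Tuple[str, str]]:
    """Pair each morpheme with its common meaning ID."""
    return [(m, ID_TABLE.get(m, UNKNOWN_ID)) for m in morphemes]


def serialize(tagged: List[Tuple[str, str]]) -> str:
    """Render tagged morphemes as 'm1/m2/...:id1/id2/...'."""
    surface = "/".join(m for m, _ in tagged)
    ids = "/".join(i for _, i in tagged)
    return f"{surface}:{ids}"


if __name__ == "__main__":
    morphemes = ["平成20年", "に", "起き", "た", "火事", "の", "情報", "を", "知り", "たい"]
    print(serialize(tag_morphemes(morphemes)))
```

Keeping the IDs aligned by position means later stages can work on the IDs alone, without caring whether the surface text came from speech recognition or from typed input.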

On the other hand, when a text search sentence is input (S13: text), the morphological analysis unit 12 performs morphological analysis on the text search sentence input in S12 (S17). Morphological analysis uses the grammar rules of the target language and a dictionary (a word list with information such as parts of speech) to split a sentence written in natural language into morphemes (the smallest meaningful units of the language). The morphological analysis unit 12 then refers to the common dictionary 22 (text data 222), assigns a common meaning ID to each of the resulting morphemes, and generates text data with common meaning IDs (S18).

The search condition analysis unit 15 then uses syntax analysis and natural language processing techniques to analyze the text data with common meaning IDs generated in S16 or S18, and extracts the search conditions contained in the text data and the corresponding common meaning IDs (S19). Natural language processing is technology in which natural language is input to a computer and information processing is performed according to a predetermined purpose.

In the specific example shown in FIG. 2, the process searches for information about disasters, so the search condition analysis unit 15 extracts from the text data with common meaning IDs search conditions such as the time period, the disaster type, and the region.

In this case, the search condition analysis unit 15 extracts the following search conditions and the common meaning IDs corresponding to them.

<Search conditions>
Time: 2008 [Y2008]
Disaster type: Fire [FIRE]

Note that although the speech (reading) input in S12 is “平成２０年” (heisei nijū nen, Heisei 20), the notation in the common dictionary 22 is “２００８年” (2008), so the search condition becomes “2008”.
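The extraction in S19 can be illustrated, in a much simplified form, as mapping each common meaning ID to a condition category and a canonical notation. The specification describes syntax analysis and natural language processing for this step; the table lookup below is only a stand-in for that analysis, and `CONDITION_TABLE` and its entries are hypothetical.

```python
from typing import Dict, List, Tuple

# Hypothetical table: common meaning ID -> (condition category, canonical notation).
CONDITION_TABLE: Dict[str, Tuple[str, str]] = {
    "Y2008": ("time", "2008年"),
    "Y2007": ("time", "2007年"),
    "FIRE": ("disaster type", "火災"),
}


def extract_conditions(tagged: List[Tuple[str, str]]) -> Dict[str, Tuple[str, str]]:
    """Return {category: (notation, common meaning ID)} for morphemes that carry a condition."""
    conditions: Dict[str, Tuple[str, str]] = {}
    for _morpheme, common_id in tagged:
        if common_id in CONDITION_TABLE:
            category, notation = CONDITION_TABLE[common_id]
            conditions[category] = (notation, common_id)
    return conditions


if __name__ == "__main__":
    tagged = [("平成20年", "Y2008"), ("に", "-"), ("起き", "-"), ("火事", "FIRE")]
    for category, (notation, common_id) in extract_conditions(tagged).items():
        print(f"{category}: {notation} [{common_id}]")
```

The stored notation comes from the table rather than from the input, which mirrors the point above that speech read as Heisei 20 yields the condition “2008”.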

The search condition management unit 16 registers or updates, in the search condition storage unit 17, the search conditions and common meaning IDs analyzed by the search condition analysis unit 15 (S20). The search condition display unit 18 then transmits the search conditions registered or updated in the search condition storage unit 17 to the PC 3 and displays them on the display device of the PC 3 (S21).

The user of the PC 3 reviews the search conditions shown on the display device and, if they are not what was intended or if the user wants to add a new search condition, enters an additional correction instruction on the PC 3. An additional correction instruction for the search conditions is then present (S22: yes), and the process returns to S11 and starts accepting input of a voice or text search sentence again.

In this embodiment, the common meaning ID is registered in the search condition storage unit 17 together with the search condition. As a result, after first entering a search sentence by voice, the user can enter an additional correction instruction not only by voice but also by text; and after first entering a search sentence by text, the user can enter an additional correction instruction not only by text but also by voice (S12).

Suppose here that the user uses the PC 3 to enter an additional correction instruction as text (for example, “２００８年ではなく２００７年の情報が知りたい”, “I want to know information for 2007, not 2008”). In this case, voice input of the search sentence is stopped (S14), and the morphological analysis unit 12 performs morphological analysis (S17), refers to the common dictionary 22 to assign a common meaning ID to each morpheme, and generates text data with common meaning IDs (for example, “２００８年／では／なく／２００７／・・・・：Ｙ２００８／ＩＤａａ／ＩＤｂｂ／・・・・”) (S18).

The search condition analysis unit 15 then analyzes the text data with common meaning IDs of the additional correction instruction and generates update information for updating the search conditions and common meaning IDs registered in the search condition storage unit 17 (for example, “Time (change): 2008 [Y2008] → 2007 [Y2007]”) (S19).

Based on the update information generated by the search condition analysis unit 15, the search condition management unit 16 updates the search conditions and common meaning IDs registered in the search condition storage unit 17 (S20). As a result, the following search conditions and common meaning IDs are stored in the search condition storage unit 17.

<Search conditions>
Time: 2007 [Y2007]
Disaster type: Fire [FIRE]

The search condition display unit 18 then transmits the updated search conditions to the PC 3 and displays them on the display device of the PC 3 (S21). When there is no additional correction instruction for the search conditions (S22: no), the search unit 19 searches the predetermined search DB 20 with those search conditions (S23), and the search result output unit 21 transmits the search results to the PC 3 (S24). The search DB 20 need not be part of the search device 1; it may be a database on the network.
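The update of S19 and S20 can be sketched as replacing a stored condition when its common meaning ID matches the one named in the correction. This is an illustrative sketch only: the function name `apply_update` and the dictionary-based store are assumptions, not the structure of the search condition storage unit 17.

```python
from typing import Dict, Tuple

Condition = Tuple[str, str]  # (notation, common meaning ID)


def apply_update(store: Dict[str, Condition], category: str,
                 old: Condition, new: Condition) -> bool:
    """Replace store[category] with `new` if it currently holds the common meaning ID of `old`.

    Matching is done on the common meaning ID rather than the notation, so a
    condition entered by voice can be corrected by text and vice versa.
    """
    current = store.get(category)
    if current is None or current[1] != old[1]:
        return False
    store[category] = new
    return True


if __name__ == "__main__":
    store: Dict[str, Condition] = {
        "time": ("2008年", "Y2008"),
        "disaster type": ("火災", "FIRE"),
    }
    # Update information: Time (change): 2008 [Y2008] -> 2007 [Y2007]
    apply_update(store, "time", ("2008年", "Y2008"), ("2007年", "Y2007"))
    for category, (notation, common_id) in store.items():
        print(f"{category}: {notation} [{common_id}]")
```

Returning `False` when the IDs do not match leaves the stored conditions untouched, so a misparsed correction does not silently overwrite an unrelated condition.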

In the embodiment described above, the user can choose either input method (text input or voice input) to enter a search sentence. A user who is unfamiliar with keyboard operation can enter it by voice, and a user who is comfortable with a keyboard can enter it as text; the user is free to choose the input method that suits his or her own characteristics and situation. This reduces the burden of entering search sentences and improves user convenience.

For example, where the PC 3 is located in a place with no space for a keyboard, the user may enter speech with a microphone, and in a noisy place the user may enter text. The user may also first enter the search sentence by voice and, after the search conditions are presented, enter an additional correction instruction as text only for the parts that became unintended search conditions because of misrecognition. Alternatively, the user may normally enter search sentences by voice, which is easy, and switch to text when voice input is difficult, for example for a place name whose reading the user does not know.

In this embodiment, the search conditions are transmitted to the PC 3 and presented to the user before the search unit 19 performs the search. The user can therefore check the search conditions and correct or add to them as necessary. Because the search is performed with the search conditions the user intended, user convenience is improved and a more efficient search process is achieved.

In this embodiment, the search conditions presented to the user are not the search sentence as entered but the search conditions as analyzed by the search device, so the user can tell at a glance whether they are the intended search conditions.

In this embodiment, the common dictionary is used to assign common meaning IDs to the morphemes of a search sentence input as text and to the morphemes of a search sentence input by voice. Morphemes (words) with the same meaning therefore receive the same common meaning ID whether they were entered as text or by voice, and that ID is stored in the search condition storage unit. This makes it possible to add to or correct, by text input, the search conditions of a search sentence entered by voice, and to add to or correct, by voice input, the search conditions of a search sentence entered as text.

The present invention is not limited to the embodiment described above, and various modifications are possible within the scope of its gist.

DESCRIPTION OF SYMBOLS
1: Search device
11: Text input receiving unit
12: Morphological analysis unit
13: Voice input receiving unit
14: Speech recognition unit
15: Search condition analysis unit
16: Search condition management unit
17: Search condition storage unit
18: Search condition display unit
19: Search unit
20: Search DB
21: Search result output unit
22: Common dictionary
3: PC

Claims

A search device comprising:
text input receiving means for receiving input of a text search sentence;
morphological analysis means for performing morphological analysis on the text search sentence, assigning a common meaning ID to each morpheme of the analysis result, and generating text data with common meaning IDs;
voice input receiving means for receiving input of a voice search sentence;
speech recognition means for recognizing the voice search sentence and converting it into text, assigning a common meaning ID to each morpheme of the converted text, and generating text data with common meaning IDs;
search condition analysis means for analyzing the text data with common meaning IDs generated by the morphological analysis means or the speech recognition means, and extracting a search condition and the common meaning ID corresponding to the search condition;
search condition management means for storing the search condition and the corresponding common meaning ID in search condition storage means;
presenting means for presenting the search condition stored in the search condition storage means; and
search means for searching using the search condition and outputting a search result.

The search device according to claim 1,
The text input receiving means and the voice input receiving means receive, in text or voice, a correction instruction for the search condition presented by the presenting means,
The morpheme analysis means performs morphological analysis on the correction instruction input as text, assigns a common meaning ID to each morpheme of the analysis result, and generates text data with a common meaning ID,
The speech recognition means recognizes the correction instruction input by voice and converts it into text, assigns a common meaning ID to each morpheme of the converted text, and generates text data with a common meaning ID,
The search condition analysis means analyzes the text data with the common meaning ID of the correction instruction generated by the morpheme analysis means or the speech recognition means, and extracts the search condition of the correction instruction and the common meaning ID corresponding to that search condition,
The search condition management means updates the search condition and the corresponding common meaning ID stored in the search condition storage means to the search condition of the correction instruction and its corresponding common meaning ID.
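As an illustration only, the update described in claim 2 can be sketched in Python as follows. The `SearchConditionStore` class, the slot names (`place`, `date`), and the ID values are hypothetical; the claims do not state how a correction is matched to a stored condition, so matching by slot name is an assumption of this sketch.

```python
from dataclasses import dataclass, field


@dataclass
class Condition:
    value: str       # surface form of the search condition
    meaning_id: int  # common meaning ID assigned to that condition


@dataclass
class SearchConditionStore:
    # Hypothetical storage layout: one condition per slot name (e.g. "place").
    conditions: dict = field(default_factory=dict)

    def store(self, slot, value, meaning_id):
        self.conditions[slot] = Condition(value, meaning_id)

    def apply_correction(self, slot, value, meaning_id):
        # Overwrite the stored condition and its ID with those extracted
        # from the correction instruction.
        if slot not in self.conditions:
            raise KeyError(f"no stored condition for slot {slot!r}")
        self.conditions[slot] = Condition(value, meaning_id)

    def present(self):
        # What a presenting means might show back to the user.
        return [f"{slot}: {c.value}" for slot, c in self.conditions.items()]


store = SearchConditionStore()
store.store("place", "Kyoto", 101)             # e.g. a misrecognized place
store.store("date", "tomorrow", 202)
store.apply_correction("place", "Tokyo", 102)  # correction given by text or voice
```

Because the stored entry carries the common meaning ID, the same update path applies whether the correction was typed or spoken.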

The search device according to claim 1 or 2, wherein
The search device further has a common dictionary that associates a reading, a notation, and a common meaning ID for morphemes having the same meaning,
The morpheme analysis means and the speech recognition means assign a common meaning ID to each morpheme using the common dictionary.
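A minimal Python sketch of such a common dictionary is shown below. The entries, ID values, and the `assign_ids` helper are hypothetical. Looking up typed morphemes by notation and recognized speech by reading is an assumption made for illustration; the claim only requires that reading, notation, and common meaning ID be associated.

```python
# Hypothetical entries: (reading, notation, common meaning ID).
COMMON_DICTIONARY = [
    ("とうきょう", "東京", 102),
    ("とうきょう", "トウキョウ", 102),
    ("ほてる", "ホテル", 301),
]

# Two lookup tables built from the same entries.
BY_NOTATION = {notation: mid for _, notation, mid in COMMON_DICTIONARY}
BY_READING = {reading: mid for reading, _, mid in COMMON_DICTIONARY}


def assign_ids(morphemes, key="notation"):
    """Pair each morpheme with its common meaning ID, or None if unknown."""
    table = BY_NOTATION if key == "notation" else BY_READING
    return [(m, table.get(m)) for m in morphemes]


text_side = assign_ids(["東京", "の", "ホテル"])                    # typed input
voice_side = assign_ids(["とうきょう", "の", "ほてる"], key="reading")  # recognized speech
```

Both input paths end up with the same IDs for the same meaning, which is what lets the later analysis steps treat text and voice input uniformly. A real dictionary would also need to resolve homophones, which this sketch ignores.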

A search method performed by a computer,
An input receiving step for receiving input of a text search sentence or input of a voice search sentence;
A generation step of, when input of the text search sentence is accepted, performing morphological analysis on the text search sentence, assigning a common meaning ID to each morpheme of the analysis result, and generating text data with a common meaning ID, and, when input of the voice search sentence is accepted, recognizing the voice search sentence and converting it into text, assigning a common meaning ID to each morpheme of the converted text, and generating text data with a common meaning ID;
A search condition analysis step of analyzing the text data with common meaning ID generated in the generation step and extracting a search condition and a common meaning ID corresponding to the search condition;
A search condition management step of storing the search condition and the corresponding common meaning ID in a search condition storage unit;
A presenting step of presenting search conditions stored in the search condition storage unit;
And a search step of performing a search using the search condition and outputting a search result.
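The sequence of steps in the method claim can be sketched end to end as below. This is a toy illustration under stated assumptions, not the claimed implementation: whitespace splitting stands in for morphological analysis or speech recognition output, and `DICTIONARY`, `DOCUMENTS`, the slot names, and all function names are hypothetical.

```python
# Hypothetical dictionary: notation -> (common meaning ID, condition slot).
DICTIONARY = {"tokyo": (102, "place"), "hotel": (301, "category")}

# Hypothetical search target.
DOCUMENTS = ["Tokyo hotel guide", "Osaka hotel list", "Tokyo ramen map"]


def generate(sentence):
    # Generation step (stand-in): split into tokens and attach a common
    # meaning ID to each token, or None when the token is not in the dictionary.
    tokens = sentence.lower().split()
    return [(tok, DICTIONARY.get(tok, (None, None))[0]) for tok in tokens]


def analyze(tagged):
    # Search condition analysis step: keep tokens that carry an ID and
    # file each one under its condition slot together with that ID.
    return {DICTIONARY[tok][1]: (tok, mid) for tok, mid in tagged if mid is not None}


def present(conditions):
    # Presenting step: show the stored conditions back to the user.
    return ", ".join(f"{slot}={tok}" for slot, (tok, _) in conditions.items())


def search(conditions):
    # Search step: return documents containing every condition word.
    words = [tok for tok, _ in conditions.values()]
    return [d for d in DOCUMENTS if all(w in d.lower() for w in words)]


# Management step: here the "storage unit" is simply this dictionary.
stored = analyze(generate("hotel in Tokyo"))
summary = present(stored)
results = search(stored)
```

Presenting the stored conditions before searching gives the user a chance to spot a misrecognized condition and correct it.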

The search method according to claim 4,
The input receiving step receives, in text or voice, a correction instruction for the search condition presented in the presenting step,
In the generation step, when the correction instruction is input as text, the correction instruction is morphologically analyzed, a common meaning ID is assigned to each morpheme of the analysis result, and text data with a common meaning ID is generated, and when the correction instruction is input by voice, the correction instruction is recognized and converted into text, a common meaning ID is assigned to each morpheme of the converted text, and text data with a common meaning ID is generated,
The search condition analysis step analyzes the text data with the common meaning ID of the correction instruction generated in the generation step, and extracts the search condition of the correction instruction and the common meaning ID corresponding to that search condition,
The search condition management step updates the search condition and the corresponding common meaning ID stored in the search condition storage unit to the search condition of the correction instruction and its corresponding common meaning ID.

The search method according to claim 4 or 5, wherein:
The computer has a common dictionary in which readings, notations, and common meaning IDs are associated with morphemes having the same meaning,
The generation step assigns a common meaning ID to each morpheme using the common dictionary.

A search program executed by a computer,
The search program causing the computer to execute:
An input receiving step for receiving input of a text search sentence or input of a voice search sentence;
A generation step of, when input of the text search sentence is accepted, performing morphological analysis on the text search sentence, assigning a common meaning ID to each morpheme of the analysis result, and generating text data with a common meaning ID, and, when input of the voice search sentence is accepted, recognizing the voice search sentence and converting it into text, assigning a common meaning ID to each morpheme of the converted text, and generating text data with a common meaning ID;
A search condition analysis step of analyzing the text data with common meaning ID generated in the generation step and extracting a search condition and a common meaning ID corresponding to the search condition;
A search condition management step of storing the search condition and the corresponding common meaning ID in a search condition storage unit;
A presenting step of presenting search conditions stored in the search condition storage unit;
And a search step of performing a search using the search condition and outputting a search result.

The search program according to claim 7,
The input receiving step receives, in text or voice, a correction instruction for the search condition presented in the presenting step,
In the generation step, when the correction instruction is input as text, the correction instruction is morphologically analyzed, a common meaning ID is assigned to each morpheme of the analysis result, and text data with a common meaning ID is generated, and when the correction instruction is input by voice, the correction instruction is recognized and converted into text, a common meaning ID is assigned to each morpheme of the converted text, and text data with a common meaning ID is generated,
The search condition analysis step analyzes the text data with the common meaning ID of the correction instruction generated in the generation step, and extracts the search condition of the correction instruction and the common meaning ID corresponding to that search condition,
The search condition management step updates the search condition and the corresponding common meaning ID stored in the search condition storage unit to the search condition of the correction instruction and its corresponding common meaning ID.

The search program according to claim 7 or 8, wherein
The computer has a common dictionary in which readings, notations, and common meaning IDs are associated with morphemes having the same meaning,
The generation step assigns a common meaning ID to each morpheme using the common dictionary.