JP2004085826A

JP2004085826A - Retrieval apparatus and retrieval method for speech inputted compound noun

Info

Publication number: JP2004085826A
Application number: JP2002245783A
Authority: JP
Inventors: Kumiko Omori; 大森　久美子; Masanobu Higashida; 東田　正信
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 2002-08-26
Filing date: 2002-08-26
Publication date: 2004-03-18

Abstract

<P>PROBLEM TO BE SOLVED: To enable users to quickly and efficiently retrieve the retrieval words speech inputted by the users when the compound nouns of the number exceeding the number of the retrieval words processable within the interactive processing execution time are registered in a retrieval database and the compound nouns present with a number of candidates resembling each other are registered in the retrieval database. <P>SOLUTION: The retrieval system is so constituted that the compound nouns constituted of a plurality of single nouns are registered as the retrieval words, that the retrieval words are punctuated to the respective single nouns, and are registered in the retrieval database, that the single nouns constituting the total retrieval words registered in the database are registered in order of the high frequency, that a plurality of the single nouns registered in order of the higher frequency, are divided by each of the prescribed numbers successively from the higher frequency, that a plurality of the single noun set group are formed in a retrieval auxiliary database, and that the users retrieve the compound nouns speech inputted in series without punctuation on the way. <P>COPYRIGHT: (C)2004,JPO

Description

【０００１】
【発明の属する技術分野】
本発明は、複数の単名詞で構成される複合名詞を、利用者が音声入力し、この音声入力された複合名詞を特定する複合名詞の検索装置および検索方法に関するものである。
【０００２】
つまり、本発明において、検索対象となる検索語は、複数の単名詞の羅列で構成されている複合名詞であり、多数の検索語が、検索データベースに保持されている場合、音声認識処理と検索とに要する時間を利用者に感じさせることなく、しかも、音声認識処理の欠陥等による不自然な対話を利用者に生じさせることなく、検索語を特定する装置および方法に関するものである。
【０００３】
【従来の技術】
音声認識装置を利用した従来のデータベース検索処理では、ユーザが入力した音声と認識対象データベースとを照合し、認識対象データベース中の全語彙と入力音声との類似度を、認識尤度として算出する。
【０００４】
現在の認識技術では、対話処理実時間（検索装置と利用者とがやりとりする場合、上記利用者にストレスまたは不自然さを感じさせない時間）内に認識できる語彙数が限られ、この対話処理実時間を超える語彙数を認識対象にすると、処理時間が長くなり、利用者を待機させることになる。一方、認識対象語彙数が多ければ、また、認識対象が非常に似通った語彙で構成されていれば、人間同士の対話でさえも、聞き間違え、聞き損じが起こるように、認識精度の低下を避けることができない。
【０００５】
さらに、認識精度は、発話者や発話環境に大きく依存し、周囲の雑音等の影響によっては、常に１００％の精度が得られるという保証はない。
【０００６】
したがって、音声認識技術を利用する従来の検索装置では、利用者による入力音声を音声認識した結果の候補が、正しく認識できているか否かを判断する場合、利用者に上記候補の正誤確認を行い、認識尤度の高い順に正解が確認できるまで、上記候補の提示を繰り返す。
【０００７】
【発明が解決しようとする課題】
音声認識装置を用い、所定のデータベースの中から、利用者が意図する検索語を検索し、確定する検索装置において、上記データベースを構成する検索語の数が、対話処理実時間内に処理可能な数よりも多ければ、認識処理の間、利用者を待機させ、この待機後に、認識装置が出力した候補を利用者に提示し、正誤確認を行うことによって、正しく認識できたか否かを判断する。
【０００８】
利用者にとって、発話後に待たされ、この待たされた後に、正誤の確認を強いられることは、対話の不自然さにつながり、大きなストレスになるという問題がある。
【０００９】
一方、オペレータ対応のシステムでは、聞き間違え、聞き損じが生じたとしても、自然な対話の流れの中で、自らの意図をオペレータに伝えることができるので、対話の不自然さがなく、大きなストレスは生じない。
【００１０】
音声入力された検索語を検索する検索装置において、利用者満足度を獲得するためには、オペレータ対応のように、リアルタイムに応答できること、しかも、利用者の意図を正確に把握できること、何よりも、利用者との間における自然な対話の流れの中で、利用者意図を確定することが必要である。
【００１１】
本発明は、対話処理実時間内で処理可能な検索語数を超える数の複合名詞が検索データベースに登録され、しかも、互いに似通った候補が数多く存在する複合名詞が検索データベースに登録されている場合、利用者が音声入力した検索語を、短時間で効率良く検索することができる音声入力された複合名詞の検索装置、検索方法およびデータベースを提供することを目的とするものである。
【００１２】
【課題を解決するための手段】
本発明は、複数の単名詞で構成されている複合名詞が検索語として登録され、しかも、上記検索語が各単名詞に区切られて登録されている検索データベースと、上記検索データベースに登録されている全検索語を構成する単名詞が、その頻度の高い順に登録され、しかも、この頻度の高い順に登録されている複数の単名詞が、その頻度の高い方から順に、所定の数毎に分割されて、複数の単名詞集合群が形成されている検索補助データベースとを有するデータベースを利用して、利用者が途中で区切らずに一連で音声入力した複合名詞を検索するものである。
【００１３】
【発明の実施の形態および実施例】
図１は、本発明の第１の実施例である音声入力された複合名詞の検索装置１を示すブロック図である。
【００１４】
音声入力された複合名詞の検索装置１は、音声入力部２と、音声認識部３と、音声認識用ソフトウェア３Ｓと、音声認識結果出力部４と、実在検索語候補リスト作成部５と、対話制御部６と、音声出力部７と、音声出力用ソフトウェア７Ｓと、システムデータベース８とを有する。
【００１５】
音声入力された複合名詞の検索装置１において、音声入力部２を介して入力された利用者Ｐの音声が音声認識部３へ送られ、音声認識部３は、入力音声を音声認識処理する際に、システムデータベース８を利用する。また、音声認識部３は、利用者Ｐによる入力音声について、音声認識用ソフトウェア３Ｓを利用して、認識処理を実行し、また、音声認識部３は、利用者Ｐが一連で入力した複合名詞を、単名詞毎に区切ることができる認識部である。
【００１６】
システムデータベース８は、検索データベース８１と、検索補助データベース８２と、ＹＥＳ／ＮＯデータベース８３とによって構成されている。
【００１７】
検索データベース８１は、複数の単名詞で構成されている複合名詞が検索語として登録され、しかも、上記検索語が各単名詞に区切られて登録されているデータベースである。
【００１８】
検索補助データベース８２は、検索データベース８１に登録されている全検索語を構成する単名詞が、その頻度の高い順に登録され、しかも、この頻度の高い順に登録されている単名詞が、その頻度の高い方から順に、所定の数（検索装置１では５００）毎に分割されて、複数の単名詞集合群が形成されているデータベースである。
【００１９】
ＹＥＳ／ＮＯデータベース８３は、利用者Ｐが応答した内容（たとえば、はい／いいえ、ＹＥＳ／ＮＯ）を認識するデータベースである。
【００２０】
音声認識用ソフトウェア３Ｓは、検索装置１の処理の場面に合わせて、検索データベース８１または検索補助データベース８２を、システムデータベース８から選択するものである。
【００２１】
検索語が音声入力されると、検索補助データベース８２を参照し、また、利用者Ｐへの正誤確認に対する応答を認識する場合は、ＹＥＳ／ＮＯデータベース８３が参照される。
【００２２】
また、音声認識部３は、音声認識処理の際に、音声認識用ソフトウェア３Ｓを使用し、音声出力部７は、音声出力の際に、音声出力用ソフトウェア７Ｓを使用する。
【００２３】
次に、検索装置１について、より具体的に説明する。
【００２４】
なお、以下の説明では、企業名等、法人名義を確定するサービスの入力インタフェース部分に、音声入力された複合名詞の検索装置１が設けられている場合について説明する。
【００２５】
日本全国の電話帳に掲載されている法人名義は２２００万件も存在し、２２００万件の中から１つの法人名義を特定するには相当の時間がかかり、したがって、現行の音声認識技術では、対話処理実時間（検索装置と利用者Ｐとがやりとりする場合、上記利用者Ｐにストレスまたは不自然さを感じさせない時間）内で法人名義を認識処理することが不可能であり、しかも、この場合、非常に似通ったデータが多いので、認識精度が非常に低い。
【００２６】
図２は、検索データベース８１中の法人名義が複合名詞で構成され、この複合名詞が、それを構成する単名詞毎に区切って登録されている具体例を示す図である。
【００２７】
検索データベース８１に登録されている法人名義を構成している各名詞の頻度を調べ、単名詞を頻度順に記録し、検索補助データベース８２を作成する。つまり、検索補助データベース８２は、検索データベース８１に登録されている法人名義を構成している各単名詞について、その頻度をカウントし、頻度順に登録してあるデータベースである。
【００２８】
なお、上記２２００万件の法人名義を構成する単名詞の総数は、６５０万種類である。
【００２９】
図３は、検索装置１において、検索補助データベース８２の具体例を示す図である。
【００３０】
検索補助データベース８２中の単名詞は、検索データベース８１２登録されている複合名詞を構成する単名詞であり、その使用頻度の高い順に並べられ、使用頻度が高い方から５００個づつ取り出し、この５００個の名詞を１つの単名詞集合群とする。
【００３１】
使用頻度が最も高い単名詞を含む集合群を、第１の単名詞集合群Ｇ１（他の単名詞集合群よりも優先して認識する対象である単名詞集合群）とし、その次に使用頻度が高い単名詞を含む単名詞集合群を、第２の単名詞集合群Ｇ２とし、以下、使用頻度が高い順に、第３の単名詞集合群Ｇ３、第４の単名詞集合群Ｇ４、……とする。
【００３２】
１つの集合群を構成する名詞の数をｎとすると、検索装置１では、ｎ＝５００であり、この数は、次のようにして決められる。つまり、対話処理実時間（検索装置と利用者Ｐとがやりとりする場合、上記利用者Ｐにストレスまたは不自然さを感じさせない時間である）内に音声認識用ソフトウェア３Ｓが処理可能な単名詞の数をＴとし、検索データベース８１に登録されている全検索語の平均単名詞数をＭとすると、ｎ＝Ｔ／Ｍである。
【００３３】
具体的には、対話処理実時間内に音声認識用ソフトウェア３Ｓが処理可能な単名詞の数Ｔが１５００であるとし、検索データベース８１に登録されている全検索語の平均単名詞数Ｍが３であるとすると、ｎ＝Ｔ／Ｍ＝１５００／３＝５００である。
【００３４】
なお、単名詞集合群Ｇ１、Ｇ２、Ｇ３、Ｇ４、……のそれぞれを構成する単名詞の数ｎは、５００以外の数でもよいが、上記のように、対話処理実時間内に処理可能な単名詞の数Ｔ（音声認識用ソフトウェア３Ｓの性能）と、検索データベース８１に登録されている検索語が有する単名詞の数の平均Ｍとによって定められる。
【００３５】
図４は、検索装置１において、検索語を確定する処理プロセスを具体的に示すフローチャートである。
【００３６】
この前提として、検索データベース８１、検索補助データベース８２が作成されているとする。
【００３７】
まず、検索したい法人名義（複合名詞）について音声入力することを、利用者Ｐに要求する（Ｓ０）。この場合、利用者Ｐは、複合名詞を構成する単名詞毎に区切らずに、複合名詞を一連で入力したとする。音声認識部３は、利用者Ｐが一連で入力した複合名詞を、単名詞毎に区切る。そして、単名詞集合群の順位を示す関数ｋを１とする。
【００３８】
ここで、複合名詞「横須賀／市民／病院」が検索語として音声入力された場合について考える。まず、音声入力された単名詞「横須賀」、「市民」、「病院」のそれぞれについて、第１の単名詞集合群Ｇ１を使用し、音声認識部３が認識処理を行う（Ｓ１）。つまり、第１の単名詞集合群Ｇ１の範囲内で、音声認識部３が認識処理を行う。そして、音声認識結果出力部４が、その認識結果を出力する。
【００３９】
音声入力された単名詞「横須賀」、「市民」、「病院」のうちで、単名詞「横須賀」、「病院」は、第１の単名詞集合群Ｇ１に含まれているが、「市民」の頻度順位は、図３に示すように、頻度順位７９０位であるので、第１の単名詞集合群Ｇ１には含まれず、第２の単名詞集合群Ｇ２に含まれている。
【００４０】
図５は、検索装置１において、音声入力された検索語（複合名詞）を構成する各単名詞に対する認識結果と、その認識尤度とが対応している認識結果リストの具体例を示す図である。
【００４１】
１つの単名詞が音声入力されると、この音声入力された単名詞に対する複数の構成名詞候補と、これら複数の構成名詞候補のそれぞれに対する認識尤度とを、音声認識結果出力部４が出力し、この出力された構成名詞候補と認識尤度とを対応させて、認識結果リストを作成する（Ｓ１）。このようにして作成された認識結果リストが、図５に示されている。
【００４２】
たとえば、単名詞「横須賀」を音声入力すると、この入力された音声を、音声認識部３が、図５に示すように、「横須賀」、「横浜」、「須加浜」、「横横」、……と認識し、「横須賀」の認識尤度が９７であり、「横浜」の認識尤度が９０であり、「須加浜」の認識尤度が８９であり、「横横」の認識尤度が８０である。
【００４３】
なお、構成名詞候補についての認識尤度は、検索データベース８１を作るときに、予め定められている。
【００４４】
ここで、認識尤度が所定の閾値以上である構成名詞候補を、「有力構成名詞候補」とする。なお、検索装置１では、上記所定の閾値を８０とする。つまり、認識尤度が８０以上である構成名詞候補が、有力構成名詞候補である。
【００４５】
そして、図５に示す各認識結果から、認識尤度が８０以上である単名詞を、有力構成名詞候補として選択する（Ｓ２）。
【００４６】
図５に示すように、音声入力された複合名詞を構成する単名詞「横須賀」については、「横須賀」、「横浜」、「須加浜」、「横々」の４候補が有力構成名詞候補として選択され、音声入力された単名詞「市民」ついては、「市立」の１つの候補が有力構成名詞候補として選択され、音声入力された単名詞「病院」については、「病院」、「美容院」の２候補が有力構成名詞候補として選択される。これら有力構成名詞候補を利用し、実在検索語候補リストを作成する（Ｓ３）。なお、実在検索語候補リストは、検索データベース８１に実在する検索語のうちで、利用者Ｐに提示し、確認を求める検索語の候補を集めたリストである。
【００４７】
ところで、補助データベース８２における単名詞「市民」の頻度順位は、図２に示すように、７９０位であるので、第１の単名詞集合群Ｇ１には、単名詞「市民」が含まれていない。したがって、図５に示す「市民」に対する認識結果には、単名詞「市民」が含まれていない。
【００４８】
図６は、検索装置１において、実在検索語候補リスト作成の様子を示す図である。
【００４９】
図５に示す認識結果のうちで認識尤度が８０以上である単名詞を組み合わせた場合、この組み合わせのうちで、検索データベース８１に登録されている単名詞の組合わせは、「横須賀／市立／病院」と「横浜／市立／病院」との２つの組み合わせであり、この２つの組み合わせが、検索データベース８１に実在する検索語候補である「実在検索語候補」として選択される。このようにして選択された実在検索語候補をリスト化すると、図６の下部に示す実在検索語候補リストが作成される（Ｓ３）。
【００５０】
検索装置１において、検索語認識尤度は、図６に示すように、各単名詞の認識尤度を加算したものである。なお、検索語認識尤度を算出する場合、各単名詞の認識尤度を乗算するようにしてもよい。
【００５１】
そして、実在検索語候補リストの中に、利用者Ｐによる確認処理のみで検索語を確定することができる条件を満たす候補が存在するか否かを、判断する（Ｓ２１）。
【００５２】
ここで、検索語認識尤度の閾値が予め定められ、所定の検索語候補の検索語認識尤度が、検索語認識尤度の閾値を越えていれば、その検索語候補は、利用者Ｐによる確認処理のみで検索語を確定することができる条件を満たしていると判断する。なお、検索装置１においては、上記検索語認識尤度の閾値が２７０であると定められているとする。
【００５３】
そして、検索語候補を利用者Ｐにディスプレイ等で提示し、確認を求め（Ｓ２２）、利用者ＰがＹＥＳの応答をすれば（Ｓ２３）、検索処理を終了する。
【００５４】
図６に示す実在検索語候補リスト中の検索語候補は、それらのどれも、検索語認識尤度の閾値２７０を越えないので、図６に示す実在検索語候補リスト中の検索語候補は、全て、利用者Ｐによる確認処理のみでは検索語を確定することができない。
【００５５】
このように検索語を確定することができない場合に備えて、第２の単名詞集合群Ｇ２の範囲で単名詞の認識、認識尤度の算出等の処理を、予め実行しておく。すなわち、検索語候補を利用者Ｐにディスプレイ等で提示し、確認を求める処理（Ｓ２２）および利用者Ｐによる応答（Ｓ２３）と並行して、第２の単名詞集合群Ｇ２の範囲で単名詞の認識、認識尤度の算出（Ｓ１２）、有力構成名詞候補の再選出（Ｓ１３）、実在検索語候補リストの更新（Ｓ１４）を実行する。
【００５６】
つまり、まず、単名詞集合群の順位を示す関数ｋを１インクリメントし（Ｓ１１）、ｋ＝２とし、認識処理が終了した第１の単名詞集合群Ｇ１の次に使用頻度が高い第２の単名詞集合群Ｇ２（頻度順位が５００位〜１，０００位である単名詞によって構成されている単名詞集合群）の範囲で、認識し、この認識結果を利用し、有力構成名詞候補の再抽出を試みる（Ｓ１２、Ｓ１３、Ｓ１４）。
【００５７】
図４に示すフローチャートにおいて、ステップＳ３の後に、ステップ２１に進むルートとは別に、ステップＳ１１に進むルートが破線で示されている。これは、ステップＳ２１〜Ｓ２３のルーチンと、ステップＳ１１〜Ｓ１５のルーチンとが並行処理されるという意味である。つまり、第１の単名詞集合群Ｇ１について利用者Ｐとの対話処理（Ｓ２１〜Ｓ２３）が実行されている間に、第２の単名詞集合群Ｇ２について、有力構成名詞候補の再選出、実在検索語候補リストの更新等（Ｓ１１〜Ｓ１５）が実行される。
【００５８】
また、その後に、必要であれば、第２の単名詞集合群Ｇ２について利用者Ｐとの対話処理（Ｓ２１〜Ｓ２３）が実行されている間に、第３の単名詞集合群Ｇ３について、有力構成名詞候補の再選出、実在検索語候補リストの更新等（Ｓ１１〜Ｓ１５）が実行され、さらに、第３の単名詞集合群Ｇ３について利用者Ｐとの対話処理（Ｓ２１〜Ｓ２３）が実行されている間に、第４の単名詞集合群Ｇ４について、有力構成名詞候補の再選出、実在検索語候補リストの更新等（Ｓ１１〜Ｓ１５）が実行される。これらの並行処理が、必要に応じて、さらに繰り返される。
【００５９】
図７は、検索装置１において、更新された認識結果リストの具体例を示す図である。
【００６０】
図７では、第２の単名詞集合群Ｇ２の範囲で認識された認識結果が追加され、有力構成名詞候補が更新されたリストが示されている。
【００６１】
なお、図２に示すように、補助データベース８２における単名詞「市民」の頻度順位は、７９０位であるので、第２の単名詞集合群Ｇ２に単名詞「市民」が含まれ、したがって、図７に示す更新された認識結果リストには、音声入力された複合名詞を構成する単名詞「市民」に対する認識結果の中に、有力構成名詞候補として、「市民」が含まれている。
【００６２】
図８は、検索装置１において、実在検索語候補リストを作成する処理を実行した結果、得られる実在検索語候補リストを示す図である。
【００６３】
実在検索語候補リストを作成する処理を実行した結果、図８に示す実在検索語候補リストが得られ、上記閾値２７０を越える検索語認識尤度を有している検索語候補は、「横須賀／市民／病院」、「横浜／市民／病院」であり、これらのうちの「横須賀／市民／病院」について、利用者Ｐに提示し、その正誤確認を行う（Ｓ２２）。
【００６４】
正誤確認において、利用者Ｐから肯定を示す応答（ＹＥＳ）が得られると（Ｓ２３）、検索語が確定されたものとして処理を終了する。
【００６５】
ところで、音声認識対象語彙数が大規模であり、しかも、似通った語彙が多い場合、従来の音声認識技術では認識処理時間が長くなるので、従来例では、利用者Ｐが音声入力した検索語を、その利用者Ｐにストレスを与えずに一定時間内で検索することは、非常に困難であり、また、認識装置の精度の低下が避けられない。
【００６６】
そこで、検索装置１では、検索対象（検索語）を、単名詞の羅列からなる複合名詞だけに限定し、（単名詞で１つづつ区切って、検索語を入力するように利用者Ｐに指示し、）利用者Ｐが一連で入力した複合名詞を音声認識部３が単名詞に区切りこの区切られた各単名詞について、検索語を構成する単名詞を使用頻度順に保持している検索補助データベース８２を認識対象として認識処理する。この点が、検索装置１の特徴である。
【００６７】
検索データベース８１に登録されている検索語の総数よりも、検索補助データベース８２に登録されている単名詞の総数が少ないことが期待されるが、検索補助データベース８２に登録されている単名詞の総数は、対話処理実時間内で処理不可能な数である場合が多い。このように、検索補助データベース８２に登録されている単名詞の総数が、対話処理実時間内で処理不可能な数であり、しかも、検索補助データベース８２に登録されている単名詞を一度に処理しようとすれば、利用者Ｐを待機させざるを得ない。
【００６８】
そこで、検索装置１では、検索補助データベース８２中の単名詞を、使用頻度順に並べ、これを、対話処理実時間内で処理可能な一定の所定数づつに分割して、複数の単名詞集合群を構成し、使用頻度の高い単名詞順に認識処理を行い、各単名詞集合群に対する認識結果を検索語候補選定対話する時間に、残りの単名詞集合群に対する認識処理を行う。これによって、１つの検索語について１回目の検索語候補選定対話が終わると、直ちに、２回目の検索語候補選定対話が行われ、必要ならば、３回目、４回目、……の検索語候補選定対話が引き続いておこなわれるので、利用者Ｐを待機させる暇がない。
【００６９】
すわなち、検索装置１の内部の処理状況を利用者Ｐが一切見ることができないので、あたかも、検索データベース８１に登録されている検索語を一括して認識処理を行っているかのように見える。
【００７０】
また、検索装置１によれば、各単名詞に対する認識結果から、有力構成名詞候補を選択し、これら選択された有力構成名詞候補の全組み合わせの中から、検索データベース８１に実在する検索語候補のみを選択し、利用者Ｐによる確認処理のみで確定できる条件を満たすと判断できる検索語候補を、利用者Ｐに提示をする。したがって、検索装置１では、存在確率が高い検索語順に、認識結果を抽出することができる。
【００７１】
すなわち、検索装置１では、１回で認識処理する単名詞の数を絞ることによって、利用者Ｐを待機させることがなく、しかも、使用される頻度の高い単名詞から順に、認識処理することによって、存在確率が高い検索語順に、認識結果を抽出することができる。
【００７２】
換言すれば、検索装置１では、音声認識を利用して複合名詞を検索する場合、認識対象が大多数であることから生じる認識処理時間の長さを利用者Ｐに感じさせず、誤認識によって正誤確認の繰り返しを余儀なくさせることによるストレスを、利用者Ｐに感じさせず、しかも、高精度な自然性を備えている。
【００７３】
図９は、検索装置１における検索データベース８１を一般的に示す図である。
【００７４】
検索データベース８１は、対話処理実時間内で音声認識処理が不可能な数の検索語を保持し、各検索語は、単名詞に分割された形で登録されている。
【００７５】
つまり、たとえば検索語である複合名詞Ｎ_１は、単名詞ｎ_１、ｎ_２、ｎ_３によって構成され、検索データベース８１には、複合名詞Ｎ_１として、単名詞ｎ_１、ｎ_２、ｎ_３が、単名詞毎に区切られた状態で登録されている。
【００７６】
図１０は、検索補助データベース８２を一般的に示す図である。
【００７７】
検索補助データベース８２は、検索データベース８１に登録されている全検索語を構成する各単名詞の使用頻度をそれぞれ算出し、単名詞を頻度順に並べた状態で登録しているデータベースである。
【００７８】
検索補助データベース８２が登録している単名詞の数は、検索データベース８１の検索語数よりも少ないことが予想されるが、対話処理実時間内には処理不可能な数で構成される。
【００７９】
検索補助データベース８２では、頻度が高い順に単名詞が並べられ（序列化し）、予め定められた対話処理実時間内に処理可能な単名詞の数づつ、単名詞を分割し（単名詞の上記序列を分割し）、単名詞集合群を形成した状態で登録されている。最も頻度の高い単名詞を含む単名詞集合群を、第１の単名詞集合群Ｇ１という。
【００８０】
また、検索装置１を、プログラムの発明として把握することができる。
【００８１】
つまり、検索装置１は、複数の単名詞で構成されている複合名詞が検索語として登録され、しかも、上記検索語が各単名詞に区切られて登録されている検索データベースを作成する検索データベース作成手順と、上記検索データベースに登録されている全検索語を構成する単名詞が、その頻度の高い順に登録され、しかも、この頻度の高い順に登録されている単名詞が、その頻度の高い方から順に、所定の数毎に分割されて、複数の単名詞集合群が形成されている検索補助データベースを作成する検索補助データベース作成手順と、利用者が上記検索語を単名詞毎に区切らずに、一連で音声入力すると、上記音声入力された検索語を単名詞毎に区切る検索語区切り手順と、上記区切られた単名詞毎に、上記複数の単名詞集合群のうちで、最も頻度が高い単名詞を含む第１の単名詞集合群の範囲内で認識処理し、認識尤度を対応させて、認識結果リストを作成する認識結果リスト作成手順と、上記認識処理された単名詞である構成名詞候補と、上記構成名詞候補についての認識尤度との組が認識尤度順に並べられている認識結果リストを、上記音声入力された単名詞のそれぞれついて作成し、上記認識結果リストに記載されている構成名詞候補のうちで、所定の閾値を超える認識尤度を具備する構成名詞候補を、有力構成名詞候補として選出する有力構成名詞候補選出手順と、上記有力構成名詞候補の全組合わせの中から、上記検索データベースに実在する検索語のみを選択し、各有力構成名詞候補の認識尤度に所定の演算を行い、検索語認識尤度を算出し、この算出された検索語認識尤度の大きい順に並べた実在検索語候補リストを作成する実在検索語候補リスト作成手順と、上記実在検索語候補リスト中の検索語候補が、利用者による確認処理のみで検索語を特定することができる条件を満たす場合は、上記利用者との間で必要な対話を実行し、検索語を確定する検索語候補選定対話手順とをコンピュータに実行させるプログラムの例である。
【００８２】
なお、上記プログラムを、ＦＤ、ＣＤ、ＤＶＤ、ＨＤ、半導体メモリ等の記録媒体に記録するようにしてもよい。
【００８３】
なお、商品配送サービスや、電話番号検索や郵便番号検索等における入力インタフェース部分において、検索装置１で行った法人名義の確定作業を、幅広く適用することができる。
【００８４】
図１１は、本発明の第２の実施例である音声入力された複合名詞の検索装置１０１を示すブロック図である。
【００８５】
音声入力された複合名詞の検索装置１０１は、音声入力部１０２と、音声認識部１０３と、音声認識用ソフトウェア１０３Ｓと、音声認識結果出力部１０４と、実在検索語候補リスト作成部１０５と、対話制御部１０６と、音声出力部１０７と、音声出力用ソフトウェア１０７Ｓと、システムデータベース１０８とを有する。
【００８６】
音声入力された複合名詞の検索装置１０１において、音声入力部１０２を介して入力された利用者Ｐの音声が音声認識部１０３へ送られ、音声認識部１０３は、入力音声を音声認識処理する際に、システムデータベース１０８を利用する。また、音声認識部１０３は、利用者Ｐによる入力音声について、音声認識用ソフトウェア３Ｓを利用して、認識処理を実行し、さらに、音声認識部１０３は、利用者Ｐが一連で音声入力した複合名詞を、単名詞毎に区切ることができる認識部である。
【００８７】
システムデータベース１０８は、検索データベース１８１と、検索補助データベース１８２と、ＹＥＳ／ＮＯデータベース１８３とによって構成されている。
【００８８】
検索データベース１８１は、複数の単名詞で構成されている複合名詞が検索語として登録され、しかも、上記検索語が各単名詞に区切られて登録されているデータベースである。
【００８９】
検索補助データベース１８２は、登録されている各複合名詞のｎ番目（ｎは整数値）に表記されている単名詞を集めた群を、ｎ番目表記の単名詞集合群と呼び、上記ｎ番目表記の単名詞集合群が、その頻度の高い順に登録され、しかも、この頻度の高い順に登録されている単名詞が、その頻度の高い方から順に、所定の数（第２の実施例では５００個）毎にまとめられ、複数のサブ集合群が形成されている検索補助データベースである。
【００９０】
ＹＥＳ／ＮＯデータベース１８３は、利用者Ｐが応答した内容（たとえば、はい／いいえ、ＹＥＳ／ＮＯ）を認識するデータベースである。
【００９１】
音声認識用ソフトウェア１０３Ｓは、検索装置１０１の処理の場面に合わせて、検索データベース１８１または検索補助データベース１８２を、システムデータベース１０８から選択するものである。
【００９２】
検索語が音声入力されると、検索補助データベース１８２を参照し、また、利用者Ｐへの正誤確認に対する応答を認識する場合は、ＹＥＳ／ＮＯデータベース１８３が参照される。
【００９３】
また、音声認識部１０３は、音声認識処理の際に、音声認識用ソフトウェア１０３Ｓを使用し、音声出力部１０７は、音声出力の際に、音声出力用ソフトウェア１０７Ｓを使用する。
【００９４】
次に、第２の実施例について、より具体的に説明する。
【００９５】
なお、以下の説明では、企業名等、法人名義を確定するサービスの入力インタフェース部分に、音声入力された複合名詞の検索装置１０１が設けられている場合について説明する。
【００９６】
図１２は、第２の実施例において、検索データベース１８１中の法人名義が複合名詞で構成され、この複合名詞が、それを構成する単名詞毎に区切って登録されている具体例を示す図である。
【００９７】
検索データベース１８１に登録されている法人名義を構成している各名詞の頻度を調べ、単名詞を頻度順に記録し、検索補助データベース１８２を作成する。
【００９８】
なお、上記２２００万件の法人名義を構成する単名詞の総数は、６５０万種類である。また、法人名義２，２００万件を構成する名詞のうち、１番目表記の単名詞（法人名義中の第１単語目に位置する単名詞、つまり、法人名義である複合名詞を構成する複数の単名詞のうちで、１番目に位置する単名詞）の総数は、約３６０万種類であり、２番目表記の単名詞（法人名義中の第２単語目に位置する単名詞、つまり、法人名義である複合名詞を構成する複数の単名詞のうちで、２番目に位置する単名詞）の総数は、約２５０万種類であり、３番目表記の単名詞（法人名義中の第３単語目に位置する単名詞、つまり、法人名義である複合名詞を構成する複数の単名詞のうちで、３番目に位置する単名詞）の総数は、約２７０万種類であり、４番目表記の単名詞（法人名義中の第４単語目に位置する単名詞、つまり、法人名義である複合名詞を構成する複数の単名詞のうちで、４番目に位置する単名詞）の総数は、約１００万種類、…（最長構成単語数７）であり、対話処理実時間内では、１つの複合名詞に対する各単名詞の認識処理は不可能であり、精度も低いことが予想される。
【００９９】
図１３は、第２の実施例において、検索補助データベース１８２に格納されているデータの具体例を示す図である。
【０１００】
検索補助データベース１８２中の単名詞は、検索データベース１８１に登録されている複合名詞を構成する単名詞であり、複合名詞における表記の順番毎に、単名詞集合群が形成されている。複合名詞における表記の順番が最初である単名詞だけを集めて、１番目表記の単名詞集合群Ｇａ１が形成され、１番目表記の単名詞集合群Ｇａ１において、その使用頻度の高い順に単名詞が登録され、しかも、これら登録されている単名詞が、その頻度の高い方から順に、５００個の単名詞毎にまとめられ、複数のサブ集合群が形成され、頻度の高い順に、第１サブ集合群Ｇａ１−１、第２サブ集合群Ｇａ１−２、……、第ｍサブ集合群Ｇａ１−ｍが形成されている。
【０１０１】
なお、使用頻度が最も高い単名詞を含む第１サブ集合群Ｇａ１−１は、他のサブ集合群Ｇａ１−２〜Ｇａ１−ｍよりも優先して認識する対象である。
【０１０２】
また、複合名詞における表記の順番が２番目である単名詞だけを集めて、２番目表記の単名詞集合群Ｇａ２が形成され、２番目表記の単名詞集合群Ｇａ２において、その使用頻度の高い順に単名詞が登録され、しかも、これら登録されている単名詞が、その頻度の高い方から順に、５００個の単名詞毎にまとめられ、複数のサブ集合群が形成され、頻度の高い順に、第１サブ集合群Ｇａ２−１、第２サブ集合群Ｇａ２−２、……、第ｍサブ集合群Ｇａ２−ｍが形成されている。
【０１０３】
なお、使用頻度が最も高い単名詞を含む第１サブ集合群Ｇａ２−１は、他のサブ集合群Ｇａ２−２〜Ｇａ２−ｍよりも優先して認識する対象である。
【０１０４】
以下、上記と同様に、複合名詞における表記の順番がｎ番目である単名詞だけを集めて、ｎ番目表記の単名詞集合群Ｇａｎが形成され、ｎ番目表記の単名詞集合群Ｇａｎにおいて、その使用頻度の高い順に単名詞が登録され、しかも、これら登録されている単名詞が、その頻度の高い方から順に、５００個の単名詞毎にまとめられ、複数のサブ集合群が形成され、頻度の高い順に、第１サブ集合群Ｇａｎ１、第２サブ集合群Ｇａｎ２、……、第ｎサブ集合群Ｇａｎ−ｍが形成されている。
【０１０５】
なお、使用頻度が最も高い単名詞を含む第１サブ集合群Ｇａｎ−１は、他のサブ集合群Ｇａｎ−２〜Ｇｎ−ｍよりも優先して認識する対象である。
【０１０６】
ところで、１つのサブ集合群を構成する名詞の数をＮとすると、第２の実施例では、Ｎ＝５００であり、この数は、次のようにして決められる。つまり、対話処理実時間（検索装置と利用者Ｐとがやりとりする場合、上記利用者Ｐにストレスまたは不自然さを感じさせない時間である）内に音声認識用ソフトウェア１０３Ｓが処理可能な単名詞の数をＴとし、検索データベース１８１に登録されている全検索語の平均単名詞数をＭとすると、Ｎ＝Ｔ／Ｍである。
【０１０７】
具体的には、対話処理実時間内に音声認識用ソフトウェア１０３Ｓが処理可能な単名詞の数Ｔが１５００であるとし、検索データベース１８１に登録されている全検索語の平均単名詞数Ｍが３であるとすると、Ｎ＝Ｔ／Ｍ＝１５００／３＝５００である。
【０１０８】
なお、各サブ集合群Ｇａ１−１〜Ｇａｎ−ｍのそれぞれを構成する単名詞の数Ｎは、５００以外の数でもよいが、上記のように、対話処理実時間内に処理可能な単名詞の数Ｔ（音声認識用ソフトウェア１０３Ｓの性能）と、検索データベース１８１に登録されている検索語が有する単名詞の数の平均Ｍとによって定められる。
【０１０９】
図１４は、第２の実施例において、検索語を確定する処理プロセスを具体的に示すフローチャートである。
【０１１０】
この前提として、検索データベース１８１、検索補助データベース１８２が作成されているとする。
【０１１１】
まず、検索したい法人名義（複合名詞）について音声入力することを、利用者Ｐに要求する（Ｓ１００）。この場合、利用者Ｐは、複合名詞を構成する単名詞毎に区切らずに、複合名詞を一連で入力したとする。音声認識部３は、利用者Ｐが一連で入力した複合名詞を、単名詞毎に区切る。そして、単名詞集合群の順位を示す関数ｋを１とする。
【０１１２】
そして、検索語である複合名詞に関する各単名詞集合群におけるサブ集合群の関数（サブ集合群の順位を示す関数）ｋを１とする。つまり、サブ集合群の関数ｋ＝１であれば、複数のサブ集合群のうちで、サブ集合群Ｇａ１−１、Ｇａ２−１、Ｇａ３−１、……を使用して、各単名詞が認識される。
【０１１３】
ここで、複合名詞「横浜／グランド／ホテル」が検索語として音声入力された場合について考える。
【０１１４】
まず、音声入力された複合名詞が、音声認識部３によって、単名詞「横浜」、「グランド」、「ホテル」に区切られ、これら区切られた「横浜」、「グランド」、「ホテル」のそれぞれについて、１番目表記の単名詞集合群Ｇａ１中の第１サブ集合群Ｇａ１−１、２番目表記の単名詞集合群Ｇａ２中の第１サブ集合群Ｇａ２−１、３番目表記の単名詞集合群Ｇａ３中の第１サブ集合群Ｇａ３−１を使用し、音声認識部１０３が認識処理を行う（Ｓ１０１）。つまり、音声入力された複合名詞を構成する単名詞「横浜」について、第１サブ集合群Ｇａ１−１の範囲内で認識処理し、音声入力された単名詞「グランド」について、第１サブ集合群Ｇａ２−１の範囲内で認識処理し、音声入力された単名詞「ホテル」について、第１サブ集合群Ｇａ３−１の範囲内で認識処理する。そして、音声認識結果出力部１０４が、その認識結果を出力する。
【０１１５】
音声入力された複合名詞を構成する単名詞「横浜」、「グランド」、「ホテル」のうちで、単名詞「横浜」は、図１３に示すように、第１サブ集合群Ｇａ１−１に含まれ、単名詞「ホテル」も、第１サブ集合群Ｇａ３−１に含まれているとする。しかし、単名詞「グランド」の頻度順位は、図１３に示すように、頻度順位９５１位であるので、第１サブ集合群Ｇａ２−１には含まれず、第２サブ集合群Ｇａ２−２に含まれている。
【０１１６】
図１５は、第２の実施例において、音声入力された検索語（複合名詞）を構成する各単名詞に対する認識結果と、その認識尤度とが対応している認識結果リストの具体例を示す図である。
【０１１７】
１つの単名詞が音声入力されると、この音声入力された複合名詞を構成する単名詞に対する複数の構成名詞候補と、これら複数の構成名詞候補のそれぞれに対する認識尤度とを、音声認識結果出力部１０４が出力し、この出力された構成名詞候補と認識尤度とを対応させて、認識結果リストを作成する（Ｓ１０１）。このようにして作成された認識結果リストが、図１５に示されている。
【０１１８】
たとえば、音声入力された複合名詞から音声認識部１０３が単名詞「横浜」を切り出すと、この切り出された音声を、音声認識部１０３が、図１５に示すように、「横浜」、「横須賀」、「横山」、「横山」、……と認識し、「横浜」の認識尤度が９５であり、「横須賀」の認識尤度が９０であり、「横溝」の認識尤度が８１であり、「横山」の認識尤度が７５である。
【０１１９】
なお、各単名詞についての認識尤度は、音声入力された複合名詞から音声認識部１０３が単名詞を切り出したときに、音声認識用ソフトウェア１０３Ｓが、個々に判断する。
【０１２０】
ここで、認識尤度が所定の閾値以上である構成名詞候補を、有力構成名詞候補とする。なお、第２の実施例では、上記所定の閾値を８０とする。つまり、認識尤度が８０以上である構成名詞候補が、有力構成名詞候補である。上記所定の閾値が８０であることは、データベース８１、８２を作るときに、予め設定されている。
【０１２１】
そして、図１５に示す各認識結果から、認識尤度が８０以上である単名詞を、有力構成名詞候補として選択する（Ｓ１０２）。
【０１２２】
図１５に示すように、音声入力された複合名詞を構成する単名詞「横浜」については、「横浜」、「横須賀」、「横溝」の３候補が有力構成名詞候補として選択され、音声入力された複合名詞を構成する単名詞「グランド」ついては、有力構成名詞候補として選択されたものがなく、音声入力された複合名詞を構成する単名詞「ホテル」については、「ホテル」、「ホール」の２候補が有力構成名詞候補として選択される。
【０１２３】
そして、これら有力構成名詞候補を利用し、実在検索語候補リストを作成する（Ｓ１０３）。なお、実在検索語候補リストは、検索データベース１８１に実在する検索語のうちで、利用者Ｐに提示し、確認を求める検索語の候補を集めたリストである。
【０１２４】
ところで、２番目表記の単名詞「グランド」の頻度順位は、補助データベース８２に格納されている２番目表記の単名詞集合群Ｇａ２において、図１２に示すように、９５１位であるので、２番目表記の単名詞集合群Ｇａ２の第１サブ集合群Ｇａ２−１には、単名詞「グランド」が含まれていない。よって、図１５に示す音声入力された単名詞「グランド」に対する認識結果には、単名詞「グランド」が含まれていない。
【０１２５】
したがって、実在検索語候補リストを作成するステップ（Ｓ１０３）では、実在検索語候補リストが作成されない。
【０１２６】
そして、確認処理のみで検索語特定可能な条件を満たさないので（Ｓ１２１）、サブ集合群の関数ｋを１インクリメントし（Ｓ１１１）、検索対象のサブ集合群が、Ｇａ１−２、Ｇａ２−２、Ｇａ３−１になり、音声入力された複合名詞を構成する各単名詞について、新たな集合群Ｇａ２−２を使用して、認識し、認識尤度を算出する（Ｓ１１２）。そして、新たに認識した認識結果を追加した状態で（更新された認識結果リストを使用して）、有力構成名詞候補を再選出し（Ｓ１１３）、実在検索語候補リストを更新する（Ｓ１１４）。
【０１２７】
図１６は、第２の実施例において、更新された認識結果リストの具体例を示す図である。
【０１２８】
図１６では、第２サブ集合群Ｇａ１−２、Ｇａ２−２、Ｇａ３−２の範囲で認識された認識結果が追加され、有力構成名詞候補が更新されたリストが示されている。
【０１２９】
なお、図１３に示すように、補助データベース８２において、２番目表記の単名詞集合群Ｇａ２における単名詞「グランド」の頻度順位は、９５１位であるので、第２サブ集合群Ｇａ２−２に単名詞「グランド」が含まれ、したがって、図１６に示す更新された認識結果リストには、音声入力された複合名詞を構成する単名詞「グランド」に対する認識結果の中に、有力構成名詞候補として、「グランド」が含まれている。
【０１３０】
図１７は、第２の実施例において、実在検索語候補リスト作成の様子を示す図である。
【０１３１】
有力構成名詞候補を選出できなかった音声入力単名詞「グランド」におけるサブ集合群を新たにして、認識し、この認識結果のうちで認識尤度が８０以上である単名詞を組み合わせ、この組み合わせのうちで、検索データベース１８１に登録されている単名詞の組合わせは、図１７に示すように、「横浜／グランド／ホテル」と「横浜／ゴールド／ホール」との２つの組み合わせであり、この２つの組み合わせが、検索データベース１８１に実在する検索語候補である実在検索語候補として選択される。このようにして選択された実在検索語候補について、実在検索語候補リストを作成すると、図１７の下部に示す実在検索語候補リストが更新される（Ｓ１１４）。
【０１３２】
第２の実施例において、検索語認識尤度は、図１７に示すように、各単名詞の認識尤度を加算したものである。なお、検索語認識尤度を算出する場合、各単名詞の認識尤度を乗算するようにしてもよい。
【０１３３】
そして、実在検索語候補リストの中に、利用者Ｐによる確認処理のみで検索語を確定することができる条件を満たす候補が存在する否かを、判断する（Ｓ１２１）。
【０１３４】
ここで、検索語認識尤度の閾値が予め定められ、所定の検索語候補の検索語認識尤度が、検索語認識尤度の閾値を越えていれば、その検索語候補は、利用者Ｐによる確認処理のみで検索語を確定することができる条件を満たしていると判断する。なお、第２の実施例においては、上記検索語認識尤度の閾値が２８０であると定められているとする。
【０１３５】
そして、検索語候補を利用者Ｐにディスプレイ等で提示し、確認を求め（Ｓ１２２）、利用者ＰがＹＥＳの応答をすれば（Ｓ１２３）、検索処理を終了する。
【０１３６】
図１７に示す実在検索語候補リスト中の検索語候補のうちで１つの候補が、検索語認識尤度の閾値２８０以上であるという条件を満たすので、図１７に示す実在検索語候補リスト中の検索語候補は、利用者Ｐによる確認処理のみで検索語を確定することができる。
【０１３７】
もし、実在検索語候補リスト中の検索語候補のいずれも、検索語認識尤度の閾値２８０以上であるという条件を満たしていなければ、それら実在検索語候補リスト中の検索語候補は、全て、利用者Ｐによる確認処理のみでは検索語を確定することができない。
【０１３８】
このように検索語を確定することができない場合に備えて、第３サブ集合群Ｇａ１−３、Ｇ２−３、Ｇ３−３の範囲で単名詞の認識、認識尤度の算出等の処理を、予め実行しておく。すなわち、検索語候補を利用者Ｐにディスプレイ等で提示し、確認を求める処理（Ｓ１２２）および利用者Ｐによる応答（Ｓ１２３）と並行して、第３サブ集合群Ｇａ１−３、Ｇａ２−３、Ｇａ３−３の範囲で単名詞の認識、認識尤度の算出（Ｓ１１２）、有力構成名詞候補の再選出（Ｓ１１３）、実在検索語候補リストの更新（Ｓ１１４）を実行する。
【０１３９】
つまり、サブ集合群の順位を示す関数ｋをさらに１インクリメントし（Ｓ１１１）、ｋ＝３とし、認識処理が終了した第２サブ集合群の次に使用頻度が高い第３サブ集合群Ｇａ１−３、Ｇａ２−３、Ｇａ３−３（頻度順位が１００１位〜１，５００位である単名詞によって構成されているサブ集合群Ｇａ１−３、Ｇａ２−３、Ｇａ３−３の範囲で、認識し、この認識結果を利用し、有力構成名詞候補の再抽出を試みる（Ｓ１１２、Ｓ１１３、Ｓ１１４）。
【０１４０】
図１４に示すフローチャートにおいて、ステップＳ３の後に、ステップ２１に進むルートとは別に、ステップＳ１１１に進むルートが破線で示されている。これは、ステップＳ１２１〜Ｓ１２３のルーチンと、ステップＳ１１１〜Ｓ１１５のルーチンとが並行処理されるという意味である。
【０１４１】
つまり、第１サブ集合群Ｇａ１−１、Ｇａ２−１、Ｇａ３−１について利用者Ｐとの対話処理（Ｓ１２１〜Ｓ１２３）が実行されている間に、第２サブ集合群Ｇａ１−２、Ｇａ２−２、Ｇａ３−２について、有力構成名詞候補の再選出、実在検索語候補リストの更新等（Ｓ１１１〜Ｓ１１５）が実行される。
【０１４２】
また、その後に、必要であれば、第２サブ集合群Ｇａ１−２、Ｇａ２−２、Ｇａ３−２について利用者Ｐとの対話処理（Ｓ１２１〜Ｓ１２３）が実行されている間に、第３サブ集合群Ｇａ１−３、Ｇａ２−３、Ｇａ３−３について、有力構成名詞候補の再選出、実在検索語候補リストの更新等（Ｓ１１１〜Ｓ１１５）が実行され、さらに、第３サブ集合群Ｇａ１−３、Ｇａ２−３、Ｇａ３−３について利用者Ｐとの対話処理（Ｓ１２１〜Ｓ１２３）が実行されている間に、第４サブ集合群Ｇａ１−４、Ｇａ２−４、Ｇａ３−４について、有力構成名詞候補の再選出、実在検索語候補リストの更新等（Ｓ１１１〜Ｓ１１５）が実行される。これらの並行処理が、必要に応じて、さらに繰り返される。
【０１４３】
ところで、音声認識対象語彙数が大規模であり、しかも、似通った語彙が多い場合、従来の音声認識技術では認識処理時間が長くなるので、従来例では、利用者Ｐが音声入力した検索語を、その利用者Ｐにストレスを与えずに一定時間内で検索することは、非常に困難であり、また、認識装置の精度の低下が避けられない。
【０１４４】
そこで、第２の実施例では、検索対象（検索語）を、単名詞の羅列からなる複合名詞だけに限定し、検索語を入力するように利用者Ｐに指示し、利用者Ｐが入力した複合名詞を音声認識部が単名詞に区切り、これら区切られた各単名詞について、上記複合名詞において表記されている位置毎に、単名詞集合群が形成され、しかも、上記単名詞集合群毎に、その頻度の高い順に登録され、この頻度の高い順に登録されている複数の単名詞が、その頻度の高い方から順に、所定の数毎にまとめられ、複数のサブ集合群が形成されている検索補助データベースとを有するデータベースを利用して、利用者が音声入力した複合名詞を検索するものである。
【０１４５】
上記のように、複合名詞における単名詞の位置に応じて、単名詞毎に、単名詞集合群が形成され、この単名詞集合群が、複数のサブ集合群に分割され、そのうちで、使用頻度が高い単名詞が含まれているものから、認識する点が、第２の実施例の特徴である。
【０１４６】
検索データベース１８１に登録されている検索語の総数よりも、検索補助データベース１８２に登録されている単名詞の総数が少ないことが期待されるが、検索補助データベース１８２に登録されている単名詞の総数は、対話処理実時間内で処理不可能な数である場合が多い。このように、検索補助データベース１８２に登録されている単名詞の総数が、対話処理実時間内で処理不可能な数であり、しかも、検索補助データベース１８２に登録されている単名詞を一度に処理しようとすれば、利用者Ｐを待機させざるを得ない。
【０１４７】
そこで、第２の実施例では、検索補助データベース１８２中の単名詞を、使用頻度順に並べ、これを、対話処理実時間内で処理可能な一定の所定数づつに分割して、複数のサブ集合群を構成し、使用頻度の高い単名詞順に認識処理を行い、各サブ集合群に対する認識結果を検索語候補選定対話する時間に、残りのサブ集合群に対する認識処理を行う。これによって、１つの検索語について１回目の検索語候補選定対話が終わると、直ちに、２回目の検索語候補選定対話が行われ、必要ならば、３回目、４回目、……の検索語候補選定対話が引き続いて行われるので、利用者Ｐを待機させる暇がない。
【０１４８】
すなわち、検索装置１０１の内部の処理状況を利用者Ｐが一切見ることができないので、あたかも、検索データベース１８１に登録されている検索語を一括して認識処理を行っているかのように見える。
【０１４９】
また、第２の実施例によれば、各単名詞に対する認識結果から、有力構成名詞候補を選択し、これら選択された有力構成名詞候補の全組み合わせの中から、検索データベース１８１に実在する検索語候補のみを選択し、利用者Ｐによる確認処理のみで確定できる条件を満たすと判断できる検索語候補を、利用者Ｐに提示する。したがって、第２の実施例では、存在確率が高い検索語順に、認識結果を抽出することができる。
【０１５０】
すなわち、第２の実施例では、１回で認識処理する単名詞の数を絞ることによって、利用者Ｐを待機させることがなく、しかも、使用される頻度の高い単名詞から順に、認識処理することによって、存在確率が高い検索語順に、認識結果を抽出することができる。
【０１５１】
換言すれば、第２の実施例では、音声認識を利用して複合名詞を検索する場合、認識対象が大多数であることから生じる認識処理時間の長さを利用者Ｐに感じさせず、誤認識によって正誤確認の繰り返しを余儀なくさせることによるストレスを、利用者Ｐに感じさせず、しかも、高精度な自然性を備えている。
【０１５２】
ところで、複合名詞「横浜／グランド／ホテル」の２番目表記の単名詞（複合名詞における２番目の単名詞）である「グランド」は、全検索語を構成する単名詞の全てを頻度順に並べた場合、２，４５０番目に位置するので、検索データベース１８１に格納されている全ての単名詞をその使用頻度順に並べて、５００単語ずつの単名詞について認識処理すると、上位から５つ目の５００単語の組に初めて出現することになる。この場合、少なくとも、認識処理と実在検索語リスト作成処理とを５回繰返すことになり、その過程において、検索語認識尤度が規定閾値を超える実在検索語候補が存在した場合は、利用者への提示が行われることになり、利用者Ｐが違和感を抱く。ところが、第２の実施例では、「横浜／グランド／ホテル」の２番目表記の単名詞（複合名詞における２番目の単名詞）である「グランド」は、２番目表記の単名詞集合群Ｇａ２では、頻度順位が９５１番目であるので、認識処理と実在検索語リスト作成処理とを２回実行すれば、検索語を確定することができ、利用者Ｐは違和感を抱かない。
【０１５３】
また、第２の実施例を、プログラムの発明として把握することができる。
【０１５４】
つまり、第２の実施例は、複数の単名詞で構成されている複合名詞が検索語として登録され、しかも、上記検索語が各単名詞に区切られて登録されている検索データベースを作成する検索データベース作成手順と、上記登録されている各複合名詞のｎ番目（ｎは整数値）に表記されている単名詞の群を、ｎ番目表記の単名詞集合群と呼び、上記ｎ番目表記の単名詞集合群が、その頻度の高い順に登録され、しかも、この頻度の高い順に登録されている単名詞が、その頻度の高い方から順に、所定の数毎にまとめられ、複数のサブ集合群が形成されている検索補助データベースを作成する検索補助データベース作成手順と、利用者が上記検索語を単名詞毎に区切らずに、一連で音声入力すると、上記音声入力された検索語を単名詞毎に区切る検索語区切り手順と、上記利用者が上記検索語を一連で音声入力すると、上記複合名詞のｎ番目表記の単名詞については、上記ｎ番目表記の単名詞集合群で認識し、しかも上記ｎ番目表記の単名詞集合群のうちで、最も頻度が高い単名詞を含む第１サブ集合群の範囲内で認識処理し、認識尤度を対応させて、認識結果リストを作成する認識結果リスト作成手順と、上記認識処理された単名詞である構成名詞候補と、上記構成名詞候補についての認識尤度との組が認識尤度順に並べられている認識結果リストを、上記音声入力された単名詞のそれぞれついて作成し、上記認識結果リストに記載されている構成名詞候補のうちで、所定の閾値を超える認識尤度を具備する構成名詞候補を、有力構成名詞候補として選出する有力構成名詞候補選出手順と、上記有力構成名詞候補の全組合わせの中から、上記検索データベースに実在する検索語のみを選択し、各有力構成名詞候補の認識尤度に所定の演算を行い、検索語認識尤度を算出し、この算出された検索語認識尤度の大きい順に並べた実在検索語候補リストを作成する実在検索語候補リスト作成手順と、上記実在検索語候補リスト中の検索語候補が、利用者による確認処理のみで検索語を特定することができる条件を満たす場合は、上記利用者との間で必要な対話を実行し、検索語を確定する検索語候補選定対話手順とをコンピュータに実行させるプログラムの例である。
【０１５５】
なお、上記プログラムを、ＦＤ、ＣＤ、ＤＶＤ、ＨＤ、半導体メモリ等の記録媒体に記録するようにしてもよい。
【０１５６】
なお、商品配送サービスや、電話番号検索や郵便番号検索等における入力インタフェース部分において、第２の実施例で行った法人名義の確定作業を、幅広く適用することができる。
【０１５７】
図１８は、本発明の第３の実施例である音声入力された複合名詞の検索装置２０１を示すブロック図である。
【０１５８】
音声入力された複合名詞の検索装置２０１は、音声入力部２０２と、音声認識部２０３と、音声認識用ソフトウェアを使用する音声認識装置２０３Ｓと、音声認識結果出力部２０４と、音声認識結果リスト作成部２０４ａと、実在検索語候補リスト作成部２０５と、対話制御部２０６と、音声出力部２０７と、音声出力用ソフトウェアを使用する音声出力装置２０７Ｓと、システムデータベース２０８とを有する。
【０１５９】
音声入力された複合名詞の検索装置２０１において、音声入力部２０２を介して入力された利用者２１０の音声が音声認識部２０３へ送られ、音声認識部２０３は、入力音声を音声認識処理する際に、システムデータベース２０８を利用する。また、音声認識部２０３は、利用者２１０による入力音声について、音声認識装置２０３Ｓにおける音声認識用ソフトウェアを利用して、認識処理を実行し、さらに、音声認識部２０３は、利用者Ｐが一連で音声入力した複合名詞を、単名詞毎に区切ることができる認識部である。
【０１６０】
システムデータベース２０８は、検索データベース２８１と、順序順序検索補助データベース２８２と、逆順検索補助データベース２８３と、ＹＥＳ／ＮＯデータベース２８４とによって構成されている。
【０１６１】
検索データベース２８１は、複数の単名詞で構成されている複合名詞が検索語として登録され、しかも、上記検索語が各単名詞に区切られて登録されているデータベースである。
【０１６２】
順序検索補助データベース２８２は、検索データベース２８１に登録されている各複合名詞のｎ番目（ｎは整数値）に位置する単名詞の集合であるｎ単語目に位置する単名詞集合が、登録されているデータベースであり、上記ｎ単語目に位置する単名詞集合に属する単名詞が、使用頻度の高い単名詞順にしかも所定数（たとえば５００）毎に分割されることによって、複数のｎ番目の単名詞サブ集合が構成されている。
【０１６３】
逆順検索補助データベース２８３は、検索データベース２８１に登録されている各複合名詞の逆順ｍ単語目（ｍは整数値）に位置する単名詞の集合である逆順ｍ単語目に位置する単名詞集合が、登録されているデータベースであり、上記逆順ｍ単語目に位置する単名詞集合に属する単名詞が、使用頻度の高い単名詞順にしかも所定数毎に分割されることによって、複数の逆順ｍ単語目の単名詞サブ集合が構成されている。
【０１６４】
ＹＥＳ／ＮＯデータベース２８４は、利用者２１０が応答した内容（たとえば、はい／いいえ、ＹＥＳ／ＮＯ）を認識するデータベースである。
【０１６５】
音声認識装置２０３Ｓで使用される音声認識用ソフトウェアは、検索装置２０１の処理の場面に合わせて、検索データベース２８１、順序検索補助データベース２８２または逆順検索補助データベース２８３を、システムデータベース２０８から選択するものである。
【０１６６】
検索語が音声入力されると、順序検索補助データベース２８２または逆順検索補助データベース２８３を参照し、また、利用者２１０への正誤確認に対する応答を認識する場合は、ＹＥＳ／ＮＯデータベース２８４が参照される。
【０１６７】
音声認識部２０３は、音声認識処理する際に、音声認識用ソフトウェア２０３Ｓを使用し、音声出力部２０７は、音声出力する際に、音声出力用ソフトウェア７Ｓを使用する。
【０１６８】
なお、以下の説明では、企業名等、法人名義を確定するサービスの入力インタフェース部分に、音声入力された複合名詞の検索装置２０１が設けられている場合について説明する。
【０１６９】
図１９は、複合名詞の検索装置２０１において、検索データベース２８１中の法人名義が複合名詞で構成され、この複合名詞が、それを構成する単名詞毎に区切って登録されている具体例を示す図である。
【０１７０】
検索データベース２８１に登録されている法人名義を構成している各名詞の頻度を調べ、単名詞を頻度順に記録し、順序検索補助データベース２８２を作成する。つまり、順序検索補助データベース２８２は、検索データベース２８１に登録されている法人名義を構成している各単名詞について、その頻度をカウントし、頻度順に登録してあるデータベースである。
【０１７１】
なお、上記２２００万件の法人名義を構成する単名詞の総数は、６５０万種類である。
【０１７２】
利用者２１０が、検索語を単名詞毎に区切らずに、一連で音声入力すると、音声認識部２０３は、音声入力された複合名詞を単名詞毎に区切り、音声入力された複合名詞における１単語目に位置する単名詞（最初から１つ目の単名詞）、２単語目に位置する単名詞（最初から２つ目の単名詞）、……の順で、認識処理を実行するとともに、音声入力された複合名詞における逆順１単語目に位置する単名詞（最後から１つ目の単名詞）、逆順２単語目に位置する単名詞（最後から２つ目の単名詞）、……の順で、認識処理を実行する部分である。
【０１７３】
音声認識結果出力部２０４は、認識結果を音声認識結果リスト作成部２０４ａへ送る部分である。
【０１７４】
音声認識結果リスト作成部２０４ａは、順序検索補助データベース２８２を使用した場合における認識結果と、逆順検索補助データベース２８３を使用した場合における認識結果とを、単名詞毎に、音声認識装置２０３Ｓが算出した認識尤度を利用してマージし、認識結果リストを作成し、実在検索語候補リスト作成部２０５へ送る部分である。
【０１７５】
この時点で、順序検索補助データベース２８２、逆順検索補助データベース２８３の中に、まだ認識処理が終了していない単名詞が存在する場合には、音声認識部２０３は、使用頻度が次に高い単名詞によって構成される単名詞の集合の範囲で認識処理を再びスタートさせ、音声認識結果出力部２０４は、認識結果を出力し、音声認識結果リスト作成部２０４ａへ送り、認識結果リストを作成し、実在検索語候補リスト作成部２０５へ送る処理を繰返す。
【０１７６】
実在検索語候補リスト作成部２０５は、認識結果に基づいて、有力構成名詞候補を選択し、有力構成名詞候補の全ての組合わせを作成し、検索データベース２８１を参照しながら、実在する検索語を抽出した実在検索語候補リストを作成し、対話制御部２０６へ送る部分である。
【０１７７】
対話制御部２０６は、実在検索語候補リストを参照し、実在検索語候補リスト中の検索語候補が、利用者２１０との確認処理のみで検索語特定可能な所定の条件を満たす場合は、確認処理ガイダンスの出力命令を音声出力部２０７へ送るものである。
【０１７８】
実在検索語候補リスト中の検索語候補が、利用者２１０との確認処理のみでは検索語特定可能な条件を満たさない場合は、使用頻度が次に高い単名詞の集合について認識処理し、この認識処理結果から得られた有力構成名詞候補を利用して更新実在検索語候補リストを利用し、検索語候補選定対話の実行を繰返す。
【０１７９】
音声出力部２０７は、確認処理ガイダンス出力命令を受けると、検索語候補と指定された候補の正誤確認を、利用者２１０に行うガイダンスを出力する。
【０１８０】
そして、確認処理ガイダンスに対する応答が、音声入力部２０２に再び入力されると、音声認識部２０３は、ＹＥＳ／ＮＯデータベース２８４を参照し、利用者２１０の応答を認識し、音声認識結果出力部１１が認識結果を出力し、利用者２１０から肯定を表す応答が得られると、検索語特定が完了した旨を利用者２１０ヘガイダンスする命令を、対話制御部２０６が音声出力部２０７へ送る。
【０１８１】
検索語が特定できるまで、実在検索語候補リストの更新と、検索語候補選定対話とを繰返し、順序検索補助データベース２８２と逆順検案補助データベース２８３とに格納されている全ての単名詞の集合に対する認識処理が終了し、実在検索語候補リストの更新が、これ以上不可能な状態になった時点で、対話制御部２０６は、実在検索語候補リストの上位から、検索語候補を利用者２１０に提示するガイダンス出力命令を音声出力部２０７へ送る。
【０１８２】
検索装置２０１は、実在検索語候補リストの更新と検索語候補選定対話とが行われている間に、優先認識対象以外の単名詞のうちの少なくとも１集合に対して、認識処理が終了しているように、上記１集合を構成する単名詞の数を規定する。
【０１８３】
次に、複合名詞の検索装置２０１における検索語確定の処理プロセスについて具体的に説明する。
【０１８４】
図２０は、複合名詞の検索装置２０１における順序検索補助データベース２８２の具体例を示す図である。
【０１８５】
順序検索補助データベース２８２に格納されている複数の単名詞は、ｎ単語目に位置する単名詞集合Ｇｂｎに分割され、つまり、１単語目に位置する単名詞集合Ｇｂ１と、２単語目に位置する単名詞集合Ｇｂ２と、３単語目に位置する単名詞集合Ｇｂ３と、……に分類されている。
【０１８６】
また、ｎ単語目に位置する単名詞集合に属する単名詞が、使用頻度の高い単名詞順に５００個毎に分割されることによって、複数のｎ番目の単名詞サブ集合が構成され、つまり、１単語目に位置する単名詞集合Ｇｂ１は、使用頻度の高い単名詞順に５００個毎に、単名詞サブ集合Ｇｂ１−１と、単名詞サブ集合Ｇｂ１−２と、単名詞サブ集合Ｇｂ１−３と、……に分割され、２単語目に位置する単名詞集合Ｇｂ２は、使用頻度の高い単名詞順に５００個毎に、単名詞サブ集合Ｇｂ２−１と、単名詞サブ集合Ｇｂ２−２と、単名詞サブ集合Ｇｂ２−３と、……に分割されている。
【０１８７】
なお、上記分割する場合における数ｃとして、複合名詞の検索装置２０１では、５００を設定しているが、上記分割する場合の数ｃは、使用する音声認識装置２０３Ｓの性能によって規定される。
【０１８８】
図２１は、複合名詞の検索装置２０１における逆順検索補助データベース２８３の具体例を示す図である。
【０１８９】
逆順検索補助データベース２８３に格納されている複数の単名詞は、逆順ｍ単語目に位置する単名詞集合Ｇｂｍに分割されている。つまり、音声入力された複合名詞における逆順１単語目に位置する単名詞（最後から１つ目の単名詞）の集合である逆順単名詞集合Ｈ１と、音声入力された複合名詞における逆順２単語目に位置する単名詞（最後から２つ目の単名詞）の集合である逆順単名詞集合Ｈ２と、音声入力された複合名詞における逆順３単語目に位置する単名詞（最後から３つ目の単名詞）の集合である逆順単名詞集合Ｈ３と、……に分類されている。
【０１９０】
また、逆順ｍ単語目に位置する単名詞集合に属する単名詞が、使用頻度の高い単名詞順に５００個毎に分割されることによって、複数の逆順ｍ単語目に位置する逆順単名詞サブ集合が構成され、つまり、逆順１単語目に位置する逆順単名詞集合Ｈ１は、使用頻度の高い単名詞順に５００個毎に、逆順単名詞サブ集合Ｈ１−１と、逆順単名詞サブ集合Ｈ１−２と、逆順単名詞サブ集合Ｈ１−３と、……に分割され、逆順２単語目に位置する逆順単名詞集合Ｈ２は、使用頻度の高い単名詞順に５００個毎に、逆順単名詞サブ集合Ｈ２−１と、逆順単名詞サブ集合Ｈ２−２と、逆順単名詞サブ集合Ｈ２−３と、……に分割されている。
【０１９１】
なお、上記分割する場合における数ｃとして、複合名詞の検索装置２０１では、５００を設定しているが、上記分割する場合の数ｃは、使用する音声認識装置２０３Ｓの性能によって規定される。
【０１９２】
図２２は、複合名詞の検索装置２０１における検索語確定の処理プロセスを示すフローチャートである。
【０１９３】
まず、検索したい法人名義（複合名詞）について音声入力することを、利用者Ｐに要求する（Ｓ２００）。この場合、利用者Ｐは、複合名詞を構成する単名詞毎に区切らずに、複合名詞を一連で入力したとする。音声認識部３は、利用者Ｐが一連で入力した複合名詞を、単名詞毎に区切る。
【０１９４】
まず、利用者２１０は、検索目的である法人名義を、単名詞に区切って音声入力する（Ｓ２００）。複合名詞の検索装置２０１では、「横須賀／セントラル／ホテル」が検索語として入力された場合について考える。
【０１９５】
ここで、複合名詞を構成する単名詞が、その複合名詞において何単語目に位置するかを示す変数ｎとして、「１」を設定し、そのｎ単語目に位置する単名詞集合Ｇｂｎを構成する単名詞サブ集合Ｇｂｎ−ｐにおける順位（使用頻度の高さを示す順位）の変数ｐとして、「１」を設定する（Ｓ２０１）。
【０１９６】
そして、「横須賀」、「セントラル」、「ホテル」のそれぞれに対して、単名詞サブ集合Ｇｂ１−１、単名詞サブ集合Ｇｂ２−１、単名詞サブ集合Ｇｂ３−１について、認識処理を行なう（Ｓ２０２、Ｓ２０３）。続いて、単名詞毎に、認識尤度を対応させて、順序検索した場合における認識結果リストを作成する（Ｓ２０４）。続いて、認識尤度が８０以上である単名詞を、有力構成名詞候補として選出する（Ｓ２０５）。
【０１９７】
上記「有力構成名詞候補」は、認識尤度が所定の閾値以上である構成名詞候補であり、複合名詞の検索装置２０１では、上記閾値を８０とする。つまり、認識尤度が８０以上である構成名詞候補が、有力構成名詞候補である。なお、有力構成名詞候補を選択する場合に使用する認識尤度は、検索データベース２８１を作るときに、予め定められている。
【０１９８】
図２３は、複合名詞の検索装置２０１において、順序検索した場合における認識結果リストの具体例を示す図である。
【０１９９】
なお、「順序検索」は、順序検索補助データベース２８２を使用して、音声入力された複合名詞におけるｎ単語目に位置する単名詞を、ｎ単語目に位置する単名詞集合Ｇｂｎの範囲で認識し、検索する動作である。
【０２００】
また、後述の「逆順検索」は、逆順検索補助データベース２８３を使用して、音声入力された複合名詞における逆順ｍ単語目に位置する単名詞を、逆順ｍ単語目に位置する単名詞集合Ｇｂｍの範囲で認識し、検索する動作である。
【０２０１】
一方、上記順序検索における認識動作と並行して、逆順検索における認識動作が行われる。
【０２０２】
つまり、複合名詞を構成する単名詞が、その複合名詞における逆順で何番目であるかを示す変数ｍとして、「１」を設定し、その逆順ｍ単語目に位置する単名詞集合Ｇｂｍを構成する単名詞サブ集合Ｇｂｍ−ｑにおける順位（使用頻度の高さを示す順位）の変数ｑとして、「１」を設定する（Ｓ２１１）。
【０２０３】
そして、「横須賀」、「セントラル」、「ホテル」のそれぞれに対して、逆順単名詞サブ集合Ｈ１−１、逆順単名詞サブ集合Ｈ２−１、逆順単名詞サブ集合Ｈ３−１について、認識処理を行なう（Ｓ２１２、Ｓ２１３）。続いて、単名詞毎に、認識尤度を対応させて、逆順検索した場合における認識結果リストを作成する（Ｓ２１４）。そして、認識尤度が８０以上である単名詞を、有力構成名詞候補として選出する（Ｓ２１５）。
【０２０４】
複合名詞の検索装置２０１において、上記のように、順序検索における認識動作と並行して、逆順検索における認識動作が行われるが、この場合、順序検索（Ｓ２０１〜Ｓ２０５）を１つのＣＰＵで実行し、逆順検索（Ｓ２１１〜Ｓ２１５）を別のＣＰＵで実行するようにしてもよく、または、１つのＣＰＵで、順序検索（Ｓ２０１〜Ｓ２０５）と逆順検索（Ｓ２１１〜Ｓ２１５）とを実行するようにしてもよい。
【０２０５】
図２４は、複合名詞の検索装置２０１において、逆順検索した場合における認識結果リストの具体例を示す図である。
【０２０６】
順序検索補助データベース２８２における使用頻度順位は、音声入力した複合名詞における１単語目に位置する単名詞「横須賀」は、図２０に示すように、４２０位であり、同２単語目に位置する単名詞「セントラル」は、１２５０位であり、同３単語目に位置する単名詞「ホテル」は、８９２位である。「セントラル」、「ホテル」は、５００位以下であるので、優先認識対象である単名詞サブ集合Ｇｂ２−１、Ｇｂ３−１には含まれず、最初に認識する対象ではない。
【０２０７】
ところが、逆順検索補助データベース２８３における使用頻度順位は、音声入力した複合名詞における逆順２単語目に位置する単名詞「セントラル」は、図２１に示すように、９位であり、同逆順１単語目の単名詞「ホテル」は、１位であるので、逆順単名詞サブ集合Ｈ２−１、Ｈ１−１に含まれ、優先認識対象であり、順序検索する場合よりも、早く認識される。
【０２０８】
図２５は、複合名詞の検索装置２０１において、順序検索した場合における認識結果と、逆順検索した場合における認識結果とをマージした場合における認識結果リストの具体例を示す図である。
【０２０９】
図２３に示す順序検索した場合における認識結果と、図２４に示す逆順検索した場合における認識結果とをマージする（Ｓ２２１）と、図２５に示すように、統合された認識結果リストができ上がり、有力構成名詞候補を選択する。
【０２１０】
複合名詞の検索装置２０１では、認識尤度が８０以上である単名詞を、有力構成名詞候補と定め、これによって、図２５に示すように、音声入力された単名詞「横須賀」に対して、「横浜」、「横須賀」の２候補が有力構成名詞候補として選択され、音声入力された単名詞「セントラル」に対して、「セントラル」が有力構成名詞候補として選択され、音声入力された単名詞「ホテル」に対して、「ホテル」、「ホール」の２候補が有力構成名詞候補として選択される。
【０２１１】
図２６は、複合名詞の検索装置２０１において、実在検索語候補リスト作成直前の状態を示す図である。
【０２１２】
検索装置２０１は、図２５に示すマージされた認識結果における有力構成名詞候補を利用し、実在検索語リスト作成処理を行う（Ｓ２２２）。
【０２１３】
検索装置２０１は、有力構成名詞候補の全組み合わせと、検索データベース２８１に格納されている複合名詞とに基づいて、図２６に示すように、実在する検索語候補をリストアップする。
【０２１４】
複合名詞の検索装置２０１では、「横須賀／セントラル／ホテル」（検索語認識尤度９０＋９５＋９５＝２８０）が、実在検索語候補としてリストアップされる。複合名詞の検索装置２０１において、検索語認識尤度は、図２６に示すように、各単名詞の認識尤度を加算する方式で計算する。
【０２１５】
複合名詞の検索装置２０１において、利用者２１０との確認処理のみで検索語確定可能か否かの判断（Ｓ２２３）の基準は、複合名詞の検索装置２０１が予め規定した閾値と上記検索語認識尤度とを比較し、上記規定した閾値を越える検索語認識尤度を持つ検索語候補は、「確認処理のみで検索語確定可能な条件を満たす」と判断する。
【０２１６】
なお、実在検索語候補リストは、検索データベース２８１に実在する検索語のうちで、利用者２１０に提示し、確認を求める検索語の候補を集めたリストである。
【０２１７】
複合名詞の検索装置２０１では、検索語認識尤度に対する規定閾値を２８０と定める。複合名詞の検索装置２０１において、検索語認識尤度が２８０以上の検索語候補である「横須賀／セントラル／ホテル」が実在検索語リストから選択され、利用者２１０に提示される（Ｓ２２４）。正誤確認に対して利用者２１０から肯定を示す応答が得られると（Ｓ２２５）、検索語を確定することができたとして処理を終了する。
【０２１８】
ところで、検索語である複合名詞を構成する単名詞の数が３、４、５、……、８であると仮定した場合、つまり、最多単名詞数が８であると仮定した場合、複合名詞の逆順１番目の単名詞（末尾の単名詞）として使用される可能性が高い「ホテル」、「銀行」、「センター」等の単名詞は、複合名詞が３つの単名詞で構成されていれば、３単語目に位置する単名詞としてカウントされ、複合名詞が４つの単名詞で構成されていれば、４単語目に位置する単名詞としてカウントされるので、使用されている「ホテル」、「銀行」、「センター」の単名詞の９９％が、逆順１番目の単名詞（末尾の単名詞）として使用されているとしても、頻度統計は分散する。したがって、図２０に示すように、「ホテル」、「銀行」、「センター」は、使用頻度の高い優先認識対象にはならず、このために、迅速な検索が行なわれないことが多い。
【０２１９】
ところが、複合名詞の検索装置２０１において、末尾から数えた単名詞の位置（逆順番目）毎に頻度統計をカウントして、逆順検索補助データベース２８２に格納されるので、図２１に示すように、末尾から１番目に使用される頻度が高い単名詞の集合である逆順１単語目に位置する逆順単名詞集合Ｈ１として、「ホテル」、「銀行」、「センター」等が登録され、優先認識対象となる。また、末尾から２番目に使用される頻度が高い単名詞の集合である逆順２単語目に位置する逆順単名詞集合Ｈ２として、「セントラル」「グランド」等が登録される。したがって、複合名詞の検索装置２０１では、短時間で認識動作が実行される。
【０２２０】
また、複合名詞の検索装置２０１において、実在検索語リスト作成処理（Ｓ２２２）と同時に、単名詞サブ集合Ｇｂｎ−２、逆順単名詞サブ集合Ｈｍ−２について、認識処理をスタートするようにしてもよい。つまり、検索語候補を利用者２１０に提示し、確認を求めている（Ｓ２２４）間に、使用頻度が次に高い単名詞の範囲で認識を行うと、利用者２１０から見れば、認識する動作の合間に、何もせずに待つ必要がなく、認識動作をしないで、検索語候補が次々と提示されているように思える。
【０２２１】
検索装置２０１からの質問と、この質問に対する利用者２１０からの応答とを、１ターンとした場合、複合名詞の検索装置２０１では、利用者２１０との間で、できる限り少ない対話ターン数で、検索語特定を実現することができ、これによって、対話時間の減少を図り、より迅速に検索処理を行うことができる。
【０２２２】
複合名詞の検索装置２０１における検索の基本方針は、一連で入力された複合語（検索語）を音声認識部２０３が単名詞毎に区切り、この区切られた単名詞の先頭からの位置を考慮し、単名詞毎に頻度統計の大きい順に、優先的に認識処理を行い、各単名詞に対する認識結果を組合わせ、実在する検索語のみを選択することによって、認識装置の処理時間と精度の不完全性とを補い、利用者２１０にストレスを与えずに検索語を特定することである。
【０２２３】
上記基本方針に加えて、複合名詞の検索装置２０１の大きな特徴は、単名詞毎に行なう認識処理と並行して、検索語である複合名詞の末尾から数えて、１単語目、２単語目、３単語目、……毎に、頻度統計を調べ、頻度統計順に並べた逆順検索補助データベース２８３について、音声入力が終了した時点で、優先的に認識処理を行い、各単名詞に対する認識結果を組合わせ、実在する検索語のみを選択することによって、認識装置の処理時間と精度の不完全性とを補い、利用者２１０にストレスを与えずに検索語を特定する点である。
【０２２４】
また、順序検索補助データベース２８２を利用した認識結果と、逆順検索補助データベース２８３を利用した認識結果とをマージするようにし、これによって、認識処理の精度をより向上させることができ、検索時間をさらに短縮することができる。
【０２２５】
図２７は、本発明の第４の実施例の動作を示すフローチャートである。
【０２２６】
第４の実施例は、順序検索を実行せずに、逆順検索のみを実行する実施例である。
【０２２７】
図２７において、まず、検索したい法人名義（複合名詞）について音声入力することを、利用者Ｐに要求する（Ｓ２３０）。この場合、利用者Ｐは、複合名詞を構成する単名詞毎に区切らずに、複合名詞を一連で入力したとする。音声認識部３は、利用者Ｐが一連で入力した複合名詞を、単名詞毎に区切る。
【０２２８】
複合名詞を構成する単名詞が、その複合名詞における逆順で何単語目に位置するかを示す変数ｍとして、「１」を設定し、その逆順ｍ単語目に位置する逆順単名詞集合Ｇｂｍを構成する単名詞サブ集合Ｇｂｍ−ｑにおける順番ｑとして、「１」を設定する（Ｓ２３１）。そして、「横須賀」、「セントラル」、「ホテル」のそれぞれに対して、逆順単名詞サブ集合Ｈ１−１、逆順単名詞サブ集合Ｈ２−１、逆順単名詞サブ集合Ｈ３−１について、認識処理を行なう（Ｓ２３２、Ｓ２３３）。続いて、単名詞毎に、認識尤度を対応させて、逆順検索した場合における認識結果リストを作成する（Ｓ２３４）。そして、有力構成名詞候補を選出し（Ｓ２３５）、実在検索語候補リストを作成し（Ｓ２３６）、確認処理のみで検索語を特定可能な条件を満たせば（Ｓ２３７）、検索語候補を利用者に提示し（Ｓ２３８）、利用者による応答がＹＥＳであれば（Ｓ２３９）、検索を終了し、一方、確認処理のみで検索語を特定可能な条件を満たす候補が存在しなければ（Ｓ２３７）、使用頻度が次に高い逆順単名詞サブ集合の範囲で認識を行う。
【０２２９】
つまり、上記第４の実施例は、音声入力された複合名詞における逆順ｍ単語目に位置する単名詞については、データベースに登録されている各複合名詞の逆順ｍ単語目（ｍは整数値）に位置する単名詞の集合である逆順ｍ単語目に位置する単名詞集合の範囲で認識する音声入力された複合名詞の検索装置である。
【０２３０】
具体的には、上記第４の実施例は、複数の単名詞で構成されている複合名詞が検索語として登録され、しかも、上記検索語が各単名詞に区切られて登録されている検索データベースと、上記登録されている各複合名詞の逆順ｍ単語目（ｍは整数値）に位置する単名詞の集合である逆順ｍ単語目に位置する単名詞集合が、登録されている逆順検索補助データベースと、利用者が上記検索語である複合名詞を単名詞毎に区切らずに一連で音声入力すると、音声認識部が上記複合名詞を単名詞に区切り、上記音声入力された複合名詞の逆順ｍ単語目に位置する単名詞については、上記逆順ｍ単語目に位置する単名詞集合の範囲で認識し、上記認識された単名詞である構成名詞候補と、上記構成名詞候補についての認識尤度との組が認識尤度順に並べられている逆順認識結果リストを作成する逆順認識結果リスト作成手段と、上記逆順認識結果リストを、上記音声入力された単名詞のそれぞれについて作成し、上記逆順認識結果リストに記載されている構成名詞候補のうちで、所定の閾値を超える認識尤度を具備する構成名詞候補を、逆順有力構成名詞候補として選出する逆順有力構成名詞候補選出手段と、上記逆順有力構成名詞候補の全組合わせのそれぞれについて、上記有力構成名詞候補の認識尤度に所定の演算を行なうことによって、検索語認識尤度を算出し、この演算された検索語認識尤度の大きい順に並べた実在検索語候補リストを作成する実在検索語候補リスト作成手段と、上記実在検索語候補のうちで、上記検索語認識尤度が所定の閾値以上である実在検索語について、上記利用者との間で必要な対話を実行し、検索語を確定する検索語候補選定対話手段とを有する音声入力された複合名詞の検索装置である。
【０２３１】
なお、上記実施例を、プログラムの実施例として把握することができる。つまり、上記実施例は、複数の単名詞で構成されている複合名詞が検索語として登録され、しかも、上記検索語が各単名詞に区切られて登録されている検索データベースと、上記登録されている各複合名詞の逆順ｍ単語目（ｍは整数値）に位置する単名詞の集合である逆順ｍ単語目に位置する単名詞集合が、登録されている逆順検索補助データベースとを使用し、音声入力された複合名詞を検索する手順と、利用者が上記検索語である複合名詞を単名詞毎に区切らずに、一連で音声入力すると、上記音声入力された検索語を単名詞毎に区切る検索語区切り手順と、上記利用者が上記検索語である複合名詞を一連で音声入力すると、音声入力された複合名詞の逆順ｍ単語目に位置する単名詞については、上記逆順ｍ単語目に位置する単名詞集合の範囲で認識し、上記認識された単名詞である構成名詞候補と、上記構成名詞候補についての認識尤度との組が認識尤度順に並べられている逆順認識結果リストを作成する逆順認識結果リスト作成手順と、上記逆順認識結果リストを、上記音声入力された単名詞のそれぞれについて作成し、上記逆順認識結果リストに記載されている構成名詞候補のうちで、所定の閾値を超える認識尤度を具備する構成名詞候補を、逆順有力構成名詞候補として選出する逆順有力構成名詞候補選出手順と、上記逆順有力構成名詞候補の全組合わせのそれぞれについて、上記有力構成名詞候補の認識尤度に所定の演算を行なうことによって、検索語認識尤度を算出し、この演算された検索語認識尤度の大きい順に並べた実在検索語候補リストを作成する実在検索語候補リスト作成手順と、上記実在検索語候補のうちで、上記検索語認識尤度が所定の閾値以上である実在検索語について、上記利用者との間で必要な対話を実行し、検索語を確定する検索語候補選定対話手順とをコンピュータに実行させるプログラムの例である。
【０２３２】
また、上記プログラムを、ＦＤ、ＣＤ、ＤＶＤ、ＨＤ、半導体メモリ等の記録媒体に記録するようにしてもよい。
【０２３３】
なお、商品配送サービスや、電話番号検索や郵便番号検索等における入力インタフェース部分において、上記実施例で行った法人名義の確定作業を、幅広く適用することができる。
【０２３４】
上記実施例によれば、先頭からの構成順序のみを考慮するのではなく、末尾に使用される頻度の高い構成名詞、末尾から２単語目に使用される頻度の高い構成名詞というように、頻度をカウントすることによって、同じ末尾に使われていても、構成単語数によって、先頭から数えると異なる単語目にカウントされ、使用頻度上位単語とは認識されない単語を、優先認識対象と捕らえることができる。
【０２３５】
なお、上記実施例において、検索語認識尤度は、図２６に示すように、各単名詞の認識尤度を加算したものであるが、検索語認識尤度を算出する場合、各単名詞の認識尤度を乗算するようにしてもよい。
【０２３６】
また、実在検索語候補リスト中の検索語候補における検索語認識尤度が、その閾値（たとえばその閾値が２８０であるとする）以上であれば、利用者２１０による確認処理のみで検索語を確定することができ、逆に、検索語認識尤度が上記閾値未満であれば、利用者２１０による確認処理のみで検索語を確定することができない。
【０２３７】
上記のように検索語を確定することができない場合に備えて、単名詞サブ集合Ｇｂ２−ｐ、逆順単名詞サブ集合Ｈ２−ｑの範囲で単名詞の認識、認識尤度の算出等の処理を、予め実行しておくようにしてもよい。すなわち、検索語候補を利用者２１０にディスプレイ等で提示し、確認を求める処理（Ｓ２２４）および利用者２１０による応答（Ｓ２２５）と並行して、使用頻度が次に高い単名詞サブ集合、逆順単名詞サブ集合の範囲で単名詞の認識、認識尤度の算出（Ｓ２０２、Ｓ２１２）、有力構成名詞候補の再選出（Ｓ２０４、Ｓ２１４）、実在検索語候補リストの更新（Ｓ２２２）を実行する。
【０２３８】
このようにすれば、検索装置２０１の内部の処理状況を利用者２１０が一切見ることができないので、利用者２１０には、あたかも、検索データベース２８１に登録されている検索語を一括してしかも非常に高速で認識処理を行っているかのように見える。
【０２３９】
図２８は、本発明の第５の実施例である音声対話型複合名詞の検索装置３０１を示すブロック図である。
【０２４０】
この音声対話型複合名詞の検索装置３０１は、音声入力部３０２と、音声認識部３０３と、音声認識用ソフトウェア３０３Ｓと、音声認識結果出力部３０４と、実在検索語候補リスト作成部３０５と、対話制御部３０６と、音声出力部３０７と、音声出力用ソフトウェア３０７Ｓとによって構成されている。
【０２４１】
音声入力された複合名詞の検索装置３０１において、音声入力部３０２を介して入力された利用者Ｐの音声が音声認識部３０３へ送られ、音声認識部３０３は、入力音声を音声認識処理する際に、システムデータベース３０８を利用する。また、音声認識部３０３は、利用者Ｐによる入力音声について、音声認識用ソフトウェア３０３Ｓを利用して、認識処理を実行し、さらに、音声認識部３０３は、利用者Ｐが一連で入力した複合名詞を、単名詞毎に区切ることができる認識部である。
【０２４２】
システムデータベース３０８は、検索データベース３８１と、検索補助データベース３８２と、ＹＥＳ／ＮＯデータベース３８３とを有する。
【０２４３】
検索データベース３８１は、複数の単名詞で構成されている複合名詞が検索語として登録され、しかも、上記検索語が各単名詞に区切られて登録されているデータベースである。
【０２４４】
検索補助データベース３８２は、登録されている各複合名詞のｎ番目（ｎは整数値）に表記されている単名詞の群を、ｎ番目表記の単名詞集合群と呼び、上記ｎ番目表記の単名詞集合群が、その頻度の高い順に登録され、しかも、この頻度の高い順に登録されている単名詞が、その頻度の高い方から順に、所定の数（検索装置３０１では５００個）毎にまとめられ、複数のサブ集合群が形成されているデータベースである。
【０２４５】
ＹＥＳ／ＮＯデータベース３８３は、利用者Ｐが応答した内容（たとえば、はい／いいえ、ＹＥＳ／ＮＯ）を認識するデータベースである。
【０２４６】
音声認識用ソフトウェア３０３Ｓは、検索装置３０１の処理の場面に合わせて、検索データベース３８１または検索補助データベース３８２を、システムデータベース３０８から選択するものである。
【０２４７】
検索語が音声入力されると、検索補助データベース３８２を参照し、また、利用者Ｐへの正誤確認に対する応答を認識する場合は、ＹＥＳ／ＮＯデータベース３８３が参照される。
【０２４８】
また、音声認識部３０３は、音声認識処理の際に、音声認識用ソフトウェア３０３Ｓを使用し、音声出力部３０７は、音声出力の際に、音声出力用ソフトウェア３０７Ｓを使用する。
【０２４９】
音声認識結果出力部３０４は、利用者が検索語である複合語を単名詞毎に入力すると、上記複合名詞のｎ番目表記の単名詞については、上記ｎ番目表記の単名詞集合群で認識し、しかも上記ｎ番目表記の単名詞集合群のうちで、最も頻度が高い単名詞を含む第１サブ集合群の範囲内で認識処理し、認識尤度を対応させて、認識結果リストを作成する認識結果リスト作成手段の例である。
【０２５０】
また、音声認識結果出力部３０４は、上記認識処理された単名詞である構成名詞候補と、上記構成名詞候補についての認識尤度との組が認識尤度順に並べられている認識結果リストを、上記音声入力された単名詞のそれぞれについて作成し、上記認識結果リストに記載されている構成名詞候補のうちで、所定の第１の閾値を超える認識尤度を具備する構成名詞候補を、有力構成名詞候補として選出する有力構成名詞候補選出手段の例である。
【０２５１】
実在検索語候補リスト作成部３０５は、実在検索語リスト作成部３５１と、部分一致検索語リスト作成部３５２とによって構成されている。
【０２５２】
実在検索語リスト作成部３５１は、音声認識結果出力部３０４が出力した音声認識結果に基づいて、検索データベース３８１格納されている検索語から、実在検索語を抽出する部分である。
【０２５３】
部分一致検索語リスト作成部３５２は、上記検索語を構成する複数の単名詞のうちで一部の単名詞のみの尤度が上記第１の所定の閾値を超える場合、上記第１の閾値を超える尤度を具備する上記単名詞を備え、上記検索語を構成する単名詞と同じ数の単名詞を備えている部分一致検索語候補を、上記検索データベースから抽出する部分一致検索語候補抽出手段の例である。
【０２５４】
また、対話制御部３０６は、検索語候補選定対話部３６１と、部分一致検索語候補選定対話部３６２とによって構成されている。
【０２５５】
部分一致検索語候補選定対話部３６２は、上記抽出された部分一致検索語候補を構成する各単名詞の尤度を所定の演算方法で演算して部分一致尤度を演算する部分一致尤度演算手段の例である。
【０２５６】
なお、上記部分一致尤度を演算する上記所定の演算方法は、検索装置３０１では、各単名詞の尤度を加算する方法であるが、上記加算の代わりに、乗算するようにしてもよい。
【０２５７】
音声出力部３０７は、音声出力用ソフトウェア３０７Ｓを使用して、音声出力するものである。
【０２５８】
なお、音声認識部３０３は、上記演算された部分一致尤度が、所定の第２の閾値を超えている上記部分一致検索語候補について、音声認識処理を行う部分一致検索語候補用音声認識処理手段の例である。
【０２５９】
次に、検索装置３０１について、より具体的に説明する。
【０２６０】
なお、以下の説明では、企業名等、法人名義を確定するサービスの入力インタフェース部分に、音声入力された複合名詞の検索装置３０１が設けられている場合について説明する。
【０２６１】
図２９は、検索装置３０１において、検索データベース３８１中の法人名義が複合名詞で構成され、この複合名詞が、それを構成する単名詞毎に区切って登録されている具体例を示す図である。
【０２６２】
検索データベース３８１に登録されている法人名義を構成している各名詞の頻度を調べ、単名詞を頻度順に記録し、検索補助データベース３８２を作成する。
【０２６３】
なお、上記２２００万件の法人名義を構成する単名詞の総数は、６５０万種類である。また、法人名義２，２００万件を構成する名詞のうち、１番目表記の単名詞（法人名義中の第１単語目に位置する単名詞）の総数は、約３６０万種類であり、２番目表記の単名詞（法人名義中の第２単語目に位置する単名詞）の総数は、約２５０万種類であり、３番目表記の単名詞（法人名義中の第３単語目に位置する単名詞）の総数は、約２７０万種類であり、４番目表記の単名詞（法人名義中の第４単語目に位置する単名詞）の総数は、約１００万種類、…（最長構成単語数７）であり、対話処理実時間内では、１つの複合名詞に対する各単名詞の認識処理は不可能であり、精度も低いことが予想される。
【０２６４】
図３０は、検索装置３０１において、検索補助データベース３８２に格納されているデータの具体例を示す図である。
【０２６５】
検索補助データベース３８２中の単名詞は、検索データベース３８１に登録されている複合名詞を構成する単名詞であり、複合名詞における表記の順番毎に、単名詞集合群が形成されている。複合名詞における表記の順番が最初である単名詞だけを集めて、１番目表記の単名詞集合群Ｇｃ１が形成され、１番目表記の単名詞集合群Ｇｃ１において、その使用頻度の高い順に単名詞が登録され、しかも、これら登録されている単名詞が、その頻度の高い方から順に、５００個の単名詞毎にまとめられ、複数のサブ集合群が形成され、頻度の高い順に、第１サブ集合群Ｇｃ１−１、第２サブ集合群Ｇｃ１−２、……、第ｍサブ集合群Ｇｃ１−ｍが形成されている。
【０２６６】
なお、使用頻度が最も高い単名詞を含む第１サブ集合群Ｇｃ１−１は、他のサブ集合群Ｇｃ１−２、……、Ｇｃ１−ｍよりも優先して認識する対象である。
【０２６７】
また、複合名詞における表記の順番が２番目である単名詞だけを集めて、２番目表記の単名詞集合群Ｇｃ２が形成され、２番目表記の単名詞集合群Ｇｃ２において、その使用頻度の高い順に単名詞が登録され、しかも、これら登録されている単名詞が、その頻度の高い方から順に、５００個の単名詞毎にまとめられ、複数のサブ集合群が形成され、頻度の高い順に、第１サブ集合群Ｇｃ２−１、第２サブ集合群Ｇｃ２−２、……、第ｍサブ集合群Ｇｃ２−ｍが形成されている。
【０２６８】
なお、使用頻度が最も高い単名詞を含む第１サブ集合群Ｇｃ２−１は、他のサブ集合群Ｇｃ２−２、……、Ｇｃ２−ｍよりも優先して認識する対象である。
【０２６９】
以下、上記と同様に、複合名詞における表記の順番がｎ番目である単名詞だけを集めて、ｎ番目表記の単名詞集合群Ｇｃｎが形成され、ｎ番目表記の単名詞集合群Ｇｃｎにおいて、その使用頻度の高い順に単名詞が登録され、しかも、これら登録されている単名詞が、その頻度の高い方から順に、５００個の単名詞毎にまとめられ、複数のサブ集合群が形成され、頻度の高い順に、第１サブ集合群Ｇｃｎ１、第２サブ集合群Ｇｃｎ２、……、第ｎサブ集合群Ｇｃｎ−ｍが形成されている。
【０２７０】
なお、使用頻度が最も高い単名詞を含む第１サブ集合群Ｇｃｎ−１は、他のサブ集合群Ｇｃｎ−２、……、Ｇｃｎ−ｍよりも優先して認識する対象である。
【０２７１】
ところで、１つのサブ集合群を構成する名詞の数をＮとすると、検索装置３０１では、Ｎ＝５００であり、この数は、次のようにして決められる。つまり、対話処理実時間（検索装置と利用者Ｐとがやりとりする場合、上記利用者Ｐにストレスまたは不自然さを感じさせない時間である）内に音声認識用ソフトウェア３０３Ｓが処理可能な単名詞の数をＴとし、検索データベース３８１に登録されている全検索語の平均単名詞数をＭとすると、Ｎ＝Ｔ／Ｍである。
【０２７２】
具体的には、対話処理実時間内に音声認識用ソフトウェア３０３Ｓが処理可能な単名詞の数Ｔが１５００であるとし、検索データベース３８１に登録されている全検索語の平均単名詞数Ｍが３であるとすると、Ｎ＝Ｔ／Ｍ＝１５００／３＝５００である。
【０２７３】
なお、各サブ集合群Ｇｃ１−１、……、Ｇｃｎ−ｍのそれぞれを構成する単名詞の数Ｎは、５００以外の数でもよいが、上記のように、対話処理実時間内に処理可能な単名詞の数Ｔ（音声認識用ソフトウェア３０３Ｓの性能）と、検索データベース３８１に登録されている検索語が有する単名詞の数の平均Ｍとによって定められる。
【０２７４】
次に、検索装置３０１の動作について説明する。
【０２７５】
図３１は、検索装置３０１において、検索語を確定する処理プロセスを具体的に示すフローチャートである。
【０２７６】
この前提として、検索データベース３８１、検索補助データベース３８２が作成されているとする。
【０２７７】
まず、検索したい法人名義（複合名詞）について音声入力することを、利用者Ｐに要求する（Ｓ３００）。この場合、利用者Ｐは、複合名詞を構成する単名詞毎に区切らずに、複合名詞を一連で入力したとする。音声認識部３は、利用者Ｐが一連で入力した複合名詞を、単名詞毎に区切る。
【０２７８】
そして、検索語である複合名詞に関する各単名詞集合群におけるサブ集合群の関数（サブ集合群の順位を示す関数）ｋを１とする。つまり、サブ集合群の関数ｋ＝１であれば、複数のサブ集合群のうちで、サブ集合群Ｇｃ１−１、Ｇｃ２−１、Ｇｃ３−１、……を使用して、各単名詞が認識される。
【０２７９】
ここで、複合名詞「横浜／東急／ホテル」が検索語として音声入力された場合について考える。
【０２８０】
まず、音声入力された複合名詞を、音声認識部３０３が複数の単名詞に区切り、この区切られた単名詞「横浜」、「東急」、「ホテル」について、それぞれ、１番目表記の単名詞集合群Ｇｃ１中の第１サブ集合群Ｇｃ１−１、２番目表記の単名詞集合群Ｇｃ２中の第１サブ集合群Ｇｃ２−１、３番目表記の単名詞集合群Ｇｃ３中の第１サブ集合群Ｇｃ３−１を、使用し、音声認識部３０３が認識処理を行う（Ｓ３０１）。つまり、音声入力された単名詞「横浜」について、第１サブ集合群Ｇｃ１−１の範囲内で認識処理し、音声入力された単名詞「東急」について、第１サブ集合群Ｇｃ２−１の範囲内で認識処理し、音声入力された単名詞「ホテル」について、第１サブ集合群Ｇｃ３−１の範囲内で認識処理する。そして、音声認識結果出力部３０４が、その認識結果を出力する。
【０２８１】
音声入力された複合名詞が区切られた単名詞「横浜」、「東急」、「ホテル」のうちで、単名詞「横浜」は、図３０に示すように、第１サブ集合群Ｇｃ１−１に含まれており、単名詞「ホテル」が、第１サブ集合群Ｇｃ３−１に含まれているとする。ところが、単名詞「東急」の頻度順位は、図３０に示すように、頻度順位９５１位であるので、第１サブ集合群Ｇｃ２−１には含まれず、第２サブ集合群Ｇｃ２−２に含まれている。
【０２８２】
図３２は、検索装置３０１において、音声入力された検索語（複合名詞）を構成する各単名詞に対する認識結果と、その認識尤度とが対応している認識結果リストの具体例を示す図である。
【０２８３】
１つの単名詞が音声入力されると、この音声入力された単名詞に対する複数の構成名詞候補と、これら複数の構成名詞候補のそれぞれに対する認識尤度とを、音声認識結果出力部３０４が出力し、この出力された構成名詞候補と認識尤度とを対応させて、認識結果リストを作成する（Ｓ３０１）。このようにして作成された認識結果リストの具体例を、図３２に示してある。
【０２８４】
たとえば、単名詞「横浜」を音声入力すると、この入力された音声を、音声認識部３０３が、図３２に示すように、「横浜」、「横須賀」、「横溝」、「横山」、……と認識し、「横浜」の認識尤度が９５であり、「横須賀」の認識尤度が９０であり、「横溝」の認識尤度が８１であり、「横山」の認識尤度が７５である。
【０２８５】
なお、各単名詞についての認識尤度は、その単名詞が音声入力されたときに、音声認識用ソフトウェア３０３Ｓが、個々に判断する。
【０２８６】
ここで、認識尤度が所定の閾値以上である構成名詞候補を、有力構成名詞候補とする。なお、検索装置３０１では、上記所定の閾値を８０とする。つまり、認識尤度が８０以上である構成名詞候補が、有力構成名詞候補である。上記所定の閾値が８０であることは、データベース８１、８２を作るときに、予め設定されている。
【０２８７】
そして、図３２に示す各認識結果から、認識尤度が８０以上である単名詞を、有力構成名詞候補として選択する（Ｓ３０２）。
【０２８８】
この時点で、各検索補助データベース３８２中に、認識処理がまだ終了していない第１サブ集合群が存在する場合、音声認識部３０３は、第１サブ集合群に対する認識処理を再びスタートさせ、音声認識結果出力部３０４は、認識結果を出力し、検索語候補リスト作成部３０５へ送る処理を繰り返す。
【０２８９】
検索語候補リスト作成部３０５において、実在検索語候補リスト作成部３０５１は、認識結果に基づいて、有力構成名詞候補を選択し、有力構成名詞候補の全ての組み合わせを作成し、検索データベース３８１を参照しながら、実在する検索語を抽出した実在検索語候補リストを作成し（Ｓ３０３）、対話制御部３０６へ送る。
【０２９０】
なお、実在検索語候補リストは、検索データベース３８１に実在している検索語のうちで、利用者Ｐｃに提示し、確認を求める検索語の候補を集めたリストである。
【０２９１】
ところで、２番目表記の単名詞「東急」の頻度順位は、補助データベース８２に格納されている２番目表記の単名詞集合群Ｇｃ２において、図２９に示すように、９５１位であるので、２番目表記の単名詞集合群Ｇｃ２の第１サブ集合群Ｇｃ２−１には、単名詞「東急」が含まれていない。よって、図３２に示す音声入力された単名詞「東急」に対する認識結果には、単名詞「東急」が含まれていない。
【０２９２】
したがって、実在検索語候補リストを作成するステップ（Ｓ３０３）では、実在検索語候補リストが作成されない。
【０２９３】
これと同時に、部分一致検索語リスト作成部３５２は、部分一致検索語候補における部分一致尤度を計算し（Ｓ３３１）、部分一致検索語候補を、検索データベース３８１から抽出し、対話制御部３０６へ送る。
【０２９４】
なお、「部分一致検索語候補」は、検索語を構成する複数の単名詞のうちで一部の単名詞のみの尤度が第１の所定の閾値を超える場合、上記第１の閾値を超える尤度を具備する上記単名詞を備え、上記検索語を構成する単名詞と同じ数の単名詞を備えている検索語候補である。
【０２９５】
また、「部分一致尤度」は、上記抽出された部分一致検索語候補を構成する各単名詞の尤度を所定の演算方法で演算して求めた尤度である。
【０２９６】
対話制御部３０６は、実在検索語候補と、上記部分一致検索語候補とを利用して、検索語候補選定対話を実行する。
【０２９７】
すなわち、実在検索語候補リスト中の検索語候補が、利用者との確認処理のみで、検索語特定可能な検索装置の規定条件を満たす場合は、確認処理ガイダンスの出力命令を、音声出力部３０７へ送る。
【０２９８】
逆に、実在検索語候補リスト中の検索語候補が、利用者との確認処理のみでは、検索語特定可能な条件を満たさない場合（Ｓ３２１）、または実在検索語候補が抽出されない場合は、それら部分一致検索語候補を利用して、音声入力部３０２に最初に入力された音声を、音声認識部３０３が、再度、音声認識処理し（Ｓ３３２）、認識尤度の算出を行う。
【０２９９】
そして、この算出された認識尤度が、利用者との確認処理のみで検索語特定可能な検索装置の規定状態を満たす場合（Ｓ３２１）は、確認処理ガイダンスの出力命令を音声出力部３０７へ送り、検索語候補を提示し、利用者Ｐｃに確認を求める（Ｓ３２２）。
【０３００】
確認処理のみで特定可能な検索語候補が、実在検索語候補からも、部分一致検索語候補を利用した部分一致検索語処理からも抽出されない場合（Ｓ３２１）は、この時点で、認識処理を終了し、使用頻度が次に高い構成名詞集合群について（Ｓ３１１）、認識処理の結果得られた有力構成名詞候補を利用し（Ｓ３１２、Ｓ３１３）、実在検索語候補リストと部分一致検索語リストとを更新し（Ｓ３１４）、検索語候補選定対話と、部分一致検索語処理とを繰り返す（Ｓ３１５）。
【０３０１】
音声出力部３０７は、確認処理ガイダンス出力命令を受けた場合は、利用者に検索語候補と指定された候補について正誤確認を行うガイダンスを出力する。
【０３０２】
そして、確認処理ガイダンスに対する応答が、音声入力部３０２から再び入力されると、音声認識部３０３は、ＹＥＳ／ＮＯデータベース３８３を参照し、利用者の応答を認識し、音声認識結果出力部３０４から認識結果を出力し、利用者から、肯定を表す応答を得ることができた場合（Ｓ３１５）は、検索語特定が完了した旨を利用者へガイダンスする命令を、対話制御部３０６が音声出力部３０７へ送る。
【０３０３】
検索語が特定できるまで、検索語候補リストの更新（Ｓ３１４）と、検索語候補選定対話と、部分一致検索語処理（部分一致検索語候補を検索データベース３８１から抽出し、この抽出された部分一致検索語候補を構成する単名詞の尤度の合計尤度である部分一致尤度を計算する処理）とを繰り返す。
【０３０４】
検索補助データベース３８２中の全ての構成名詞集合群に対する認識処理が終了し、実在検索語候補リストをそれ以上更新不可能な場合（Ｓ３１５）は、対話制御部３０６は、実在検索語候補リストの上位から、検索語候補を利用者に提示するガイダンス出力命令を、音声出力部３０７へ送る。
【０３０５】
すなわち、検索装置３０１は、実在検索語候補リストの更新と検索語候補選定対話とが行われている間、優先認識対象名詞群以外の構成名詞集合群の少なくとも１集合（規定数からなる集合１つ分）について、認識処理が終了しているように構成名詞数を規定する。
【０３０６】
次に、検索装置３０１の動作をより具体的に説明する。
【０３０７】
これら有力構成名詞候補を利用して、実在検索語候補リスト作成処理を行うが、２番目表記の単名詞に関する第１サブ集合群Ｇｃ２−１については、「東急」に関する有力構成名詞候補が選択されていないので、検索装置３０１は、第１サブ集合群Ｇｃ１−１、Ｇｃ３−１に対する認識処理が終了次第、部分一致検索処理を行う。
【０３０８】
つまり、１番目の単名詞である１番目表記の単名詞集合群Ｇｃ１中の第１サブ集合群Ｇｃ１−１における有力構成名詞候補「横浜」または「横須賀」を、１番目の単名詞に持つ検索語候補であって、３番目の単名詞である３番目表記の単名詞集合群Ｇｃ３中の第１サブ集合群Ｇｃ３−１における有力構成名詞候補「ホテル」または「ホール」を、３番目の単名詞に持つ検索語候補を、検索データベース３８１から検索する。
【０３０９】
図３３（１）は、検索装置３０１において、部分一致検索語候補と部分一致尤度との例を示す図である。
【０３１０】
検索装置３０１において、「横浜／グランド／ホテル」、「横浜／東急／ホテル」「横浜／東武／ホテル」、「横須賀／セントラル／ホテル」、「横浜／セントラル／ホール」、「横浜／グランド／ホール」、「横浜／音楽／ホール」、「横浜／中央／ホール」、「横須賀／芸術／ホール」の９候補が、部分一致検索語候補として、検索されている。
【０３１１】
上記例において、たとえば「横浜／グランド／ホテル」のうちで、「横浜」と「ホテル」とが、図３２に示すように、有力構成名詞候補であり、「グランド」が有力構成名詞候補ではない。つまり、「横浜／グランド／ホテル」のうちで、一部の単名詞のみが有力構成名詞候補であり、したがって、「横浜／グランド／ホテル」は、部分一致検索語候補である。
【０３１２】
また、たとえば「横浜／グランド／ホテル」のうちで、「横浜」の認識尤度が、図３２に示すように９５であり、「ホテル」の認識尤度が、図３２に示すように８８であり、これら有力構成名詞候補の各認識尤度を加算した値（９５＋８８＝１８３）である１８３が部分一致尤度である。
【０３１３】
検索装置３０１において、部分一致尤度における閾値（第２の閾値）が、１３０であるとする。
【０３１４】
そして、これら９候補を認識対象として、最初の入力検索語に対する音声認識処理を、音声認識部３０３において、再度、実行する。
【０３１５】
この結果、図３３（２）に示すように、「横浜／東急／ホテル」、「横浜／東武／ホテル」の検索語認識尤度が、それぞれ、９５、８１であり、上記閾値（第２の閾値）である８０を超え、したがって、「横浜／東急／ホテル」、「横浜／東武／ホテル」が検索語候補として抽出さる。上位から提示確認を行うことによって、検索語特定が完了する。
【０３１６】
検索装置３０１において、上記部分一致検索語処理の結果について、利用者との確認処理のみで検索語確定可能か否かの判断基準は、複合名詞検索装置が予め規定した閾値と、検索語認識尤度とを比較し、再認識された結果についての閾値（第３の閾値）が８０であるとした場合、第３の閾値８０を越える検索語認識尤度を持つ検索語候補が、確認処理のみで検索語確定可能な条件を満たすと判断する。
【０３１７】
したがって、検索装置３０１において、再認識後において、確認処理のみで検索語確定可能な条件を満たす検索語候補は、図３３（２）に示すように、「横浜／東急／ホテル」、「横浜／東武／ホテル」である。
【０３１８】
検索装置３０１では、実在検索語リスト作成処理、検索語候補選定対話を行っている間を利用して、残りの構成名詞について認識処理するだけでなく、部分一致尤度を利用した部分一致検索語処理を行う。
【０３１９】
部分一致検索語処理の結果、算出した部分一致尤度を利用して、検索語候補選定対話を行う。規定閾値（第３の閾値）を超える部分一致尤度を有する検索語候補が存在しない場合、または、部分一致検索語処理の結果、選定される検索語候補が存在しない場合は、第２サブ集合群について認識した結果を利用した実在検索語リスト作成処理、検索語候補選定対話を続ける。これと同時に、有力構成名詞リストを更新しながら、部分一致検索語処理、その結果に対する検索語候補選定対話を繰り返す。
【０３２０】
検索装置３０１において、第２サブ集合群Ｇｃ２−２に単名詞「東急」が含まれているので、第２サブ集合群について認識した結果を利用した実在検索語リスト作成処理を行うと、選定される検索語候補が存在するようになる。一方、部分一致検索語処理の結果、算出した部分一致尤度を利用して、検索語候補選定対話を行い、規定閾値（第３の閾値）を超える部分一致尤度を有する検索語候補が存在するようになる。この場合、これら２つの結果をマージするようにしてもよい。
【０３２１】
検索装置３０１は、構成順序毎の構成名詞の集合群に対する認識結果に基づいて実在検索語リスト作成処理を行う際、同時に、検索装置が予め定めた条件を満たす有力構成名詞候補における部分的な情報を利用した部分一致検索語リストの作成処理を行う。すなわち、有力構成名詞候補を利用して部分一致尤度を算出し、規定閾値以上の部分一致尤度を有する検索語を抽出し、この抽出された検索語を、認識対象と定め、最初に入力された検索語の認識処理を、再度実行し、認識尤度を算出する。
【０３２２】
算出された認識尤度が、利用者との確認処理のみで検索語を特定することができる条件を満たすと判断できる場合は、検索語候補選定対話を行う。規定閾値以上の部分一致尤度を有する検索語候補が存在しない場合、または、検索語候補選定対話の結果、選定される検索語候補が存在しない場合は、残りの各第ｎ構成名詞集合群について認識処理した結果、更新された有力構成名詞候補を利用し、実在検索語リスト作成処理と、検索語候補選定対話と、部分一致検索語処理と、検索語候補選定対話とを繰り返す。
【０３２３】
このように、構成名詞の部分的情報を利用することによって、各構成名詞の偏りに左右されずに、正確かつ迅速に、検索処理を実行することができる
ところで、実際には、検索語を構成する各単名詞の全てが高頻度であることが少なく、また、検索語を構成する各単名詞の全てが低頻度であることが少なく、頻度に偏りがある場合が多く、各単名詞のうちで、高頻度で使用される単名詞は、第１サブ集合群に設定されるので、最初の認識処理結果で出力されるが、使用頻度の低い単名詞は、その単語が含まれるサブ集合群についての認識処理が行われるまで、認識結果として出力されないので、規定閾値以上の尤度を有する各構成名詞候補の組み合わせを作成し、実在する検索語候補を抽出しても、正解は存在しない。
【０３２４】
そこで、検索装置３０１では、全部を認識対象に定めて実時間内に正しく認識処理することが不可能である大語彙検索語の中から、部分的に有力構成名詞を含む検索語を、部分一致検索語候補として検索データベース３８１から抽出することによって、有力候補を絞込むことができる。この絞込みによって、利用者を待機させることなく、検索処理が可能になる。
【０３２５】
つまり、検索装置３０１によれば、単名詞の部分的情報を併用することによって、正解検索語を絞り込み、さらに、認識精度を認識速度とを向上させることができる。
【０３２６】
図３１に示すフローチャートにおいて、ステップＳ３０３の後に、ステップＳ３２１に進むルートとは別に、ステップＳ３１１に進むルートと、ステップＳ３３１に進むルートとが破線で示されている。これは、ステップＳ３２１〜Ｓ３２３のルーチンと、ステップＳ３１１〜Ｓ３１５のルーチンと、ステップＳ３３１、Ｓ３３２のルートとが並行処理されるという意味である。
【０３２７】
また、検索装置３０１を、プログラムの発明として把握することができる。つまり、検索装置３０１は、検索語を構成する複数の単名詞のうちで一部の単名詞のみの尤度が第１の所定の閾値を超える場合、上記第１の閾値を超える尤度を具備する上記単名詞を備え、上記検索語を構成する単名詞と同じ数の単名詞を備えている部分一致検索語候補を、検索データベースから抽出する部分一致検索語候補抽出手順と、上記抽出された部分一致検索語候補を構成する各単名詞の尤度を所定の演算方法で演算して部分一致尤度を計算する部分一致尤度計算手順と、上記計算された部分一致尤度が、所定の第２の閾値を超えている検索語候補について、音声認識処理を行う部分一致検索語候補用音声認識処理手順とをコンピュータに実行させるプログラムの例である。
【０３２８】
また、検索装置３０１は、複数の単名詞で構成されている複合名詞が検索語として登録され、しかも、上記検索語が各単名詞に区切られて登録されている検索データベースと、上記登録されている各複合名詞のｎ番目（ｎは整数値）に表記されている単名詞の群を、ｎ番目表記の単名詞集合群と呼び、上記ｎ番目表記の単名詞集合群が、その頻度の高い順に登録され、しかも、この頻度の高い順に登録されている単名詞が、その頻度の高い方から順に、所定の数毎にまとめられ、複数のサブ集合群が形成されている検索補助データベースとを使用して、複合名詞を検索するプログラムであって、利用者が上記検索語である複合名詞を単名詞毎に区切らずに、一連で音声入力すると、上記音声入力された検索語である複合名詞を単名詞毎に区切る検索語区切り手順と、上記利用者が上記検索語である複合名詞を一連で音声入力すると、上記複合名詞のｎ番目表記の単名詞については、上記ｎ番目表記の単名詞集合群で認識し、しかも上記ｎ番目表記の単名詞集合群のうちで、最も頻度が高い単名詞を含む第１サブ集合群の範囲内で認識処理し、認識尤度を対応させて、認識結果リストを作成する認識結果リスト作成手順と、上記認識処理された単名詞である構成名詞候補と、上記構成名詞候補についての認識尤度との組が認識尤度順に並べられている認識結果リストを、上記音声入力された単名詞のそれぞれについて作成し、上記認識結果リストに記載されている構成名詞候補のうちで、所定の第１の閾値を超える認識尤度を具備する構成名詞候補を、有力構成名詞候補として選出する有力構成名詞候補選出手順と、上記検索語を構成する複数の単名詞のうちで一部の単名詞のみの尤度が上記第１の所定の閾値を超える場合、上記第１の閾値を超える尤度を具備する上記単名詞を備え、上記検索語を構成する単名詞と同じ数の単名詞を備えている部分一致検索語候補を、上記検索データベースから抽出する部分一致検索語候補抽出手順と、上記抽出された部分一致検索語候補を構成する各単名詞の尤度を所定の演算方法で演算して部分一致尤度を演算する部分一致尤度演算手順と、上記演算された部分一致尤度が、所定の第２の閾値を超えている上記部分一致検索語候補について、音声認識処理を行う部分一致検索語候補用音声認識処理手順とをコンピュータに実行させるプログラムの例である。
【０３２９】
なお、商品配送サービスや、電話番号検索や郵便番号検索等における入力インタフェース部分において、検索装置３０１で行った法人名義の確定作業を、幅広く適用することができる。
【０３３０】
【発明の効果】
本発明によれば、利用者を待機させることなく、しかも、存在確率の高い検索語順に、認識結果を抽出することができるという効果を奏する。
【図面の簡単な説明】
【図１】本発明の第１の実施例である音声入力された複合名詞の検索装置１を示すブロック図である。
【図２】検索データベース８１中の法人名義が複合名詞で構成され、この複合名詞が、それを構成する単名詞毎に区切って登録されている具体例を示す図である。
【図３】検索装置１において、検索補助データベース８２の具体例を示す図である。
【図４】検索装置１において、検索語を確定する処理プロセスを具体的に示すフローチャートである。
【図５】検索装置１において、音声入力された検索語（複合名詞）を構成する各単名詞に対する認識結果と、その認識尤度とが対応している認識結果リストの具体例を示す図である。
【図６】検索装置１において、実在検索語候補リスト作成の様子を示す図である。
【図７】検索装置１において、更新された認識結果リストの具体例を示す図である。
【図８】検索装置１において、実在検索語候補リストを作成する処理を実行した結果、得られる実在検索語候補リストを示す図である。
【図９】検索装置１における検索データベース８１を一般的に示す図である。
【図１０】検索補助データベース８２を一般的に示す図である。
【図１１】本発明の第２の実施例である音声入力された複合名詞の検索装置１０１を示すブロック図である。
【図１２】検索データベース１８１中の法人名義が複合名詞で構成され、この複合名詞が、それを構成する単名詞毎に区切って登録されている具体例を示す図である。
【図１３】第２の実施例において、検索補助データベース１８２の具体例を示す図である。
【図１４】第２の実施例において、検索語を確定する処理プロセスを具体的に示すフローチャートである。
【図１５】第２の実施例において、音声入力された検索語（複合名詞）を構成する各単名詞に対する認識結果と、その認識尤度とが対応している認識結果リストの具体例を示す図である。
【図１６】第２の実施例において、更新された認識結果リストの具体例を示す図である。
【図１７】第２の実施例において、実在検索語候補リスト作成の様子を示す図である。
【図１８】本発明の第３の実施例である音声入力された複合名詞の検索装置２０１を示すブロック図である。
【図１９】複合名詞の検索装置２０１において、検索データベース２８１中の法人名義が複合名詞で構成され、この複合名詞が、それを構成する単名詞毎に区切って登録されている具体例を示す図である。
【図２０】複合名詞の検索装置２０１における順序検索補助データベース２８２の具体例を示す図である。
【図２１】複合名詞の検索装置２０１における逆順検索補助データベース２８３の具体例を示す図である。
【図２２】複合名詞の検索装置２０１における検索語確定の処理プロセスを示すフローチャートである。
【図２３】複合名詞の検索装置２０１において、順序検索した場合における認識結果リストの具体例を示す図である。
【図２４】複合名詞の検索装置２０１において、逆順検索した場合における認識結果リストの具体例を示す図である。
【図２５】複合名詞の検索装置２０１において、順序検索した場合における認識結果と、逆順検索した場合における認識結果とをマージした場合における認識結果リストの具体例を示す図である。
【図２６】複合名詞の検索装置２０１において、実在検索語候補リスト作成直前の状態を示す図である。
【図２７】本発明の第４の実施例の動作を示すフローチャートである。
【図２８】本発明の第５の実施例である音声対話型複合名詞の検索装置３０１を示すブロック図である。
【図２９】検索装置３０１において、検索データベース３８１中の法人名義が複合名詞で構成され、この複合名詞が、それを構成する単名詞毎に区切って登録されている具体例を示す図である。
【図３０】検索装置３０１において、検索補助データベース３８２に格納されているデータの具体例を示す図である。
【図３１】検索装置３０１において、検索語を確定する処理プロセスを具体的に示すフローチャートである。
【図３２】検索装置３０１において、音声入力された検索語（複合名詞）を構成する各単名詞に対する認識結果と、その認識尤度とが対応している認識結果リストの具体例を示す図である。
【図３３】検索装置３０１において、部分一致検索語候補と部分一致尤度との例と、再認識結果の例とを示す図である。
【符号の説明】
１…音声入力された複合名詞の検索装置、
２…音声入力部、
３…音声認識部、
４…音声認識結果出力部、
５…実在検索語候補リスト作成部、
６…対話制御部、
７…音声出力部、
８…システムデータベース、
８１…検索データベース、
８２…検索補助データベース、
８３…ＹＥＳ／ＮＯデータベース、
Ｇ１…第１の単名詞集合群、
Ｇ２…第２の単名詞集合群、
Ｇ３…第３の単名詞集合群、
Ｇ４…第４の単名詞集合群、
１０１…音声入力された複合名詞の検索装置、
１０２…音声入力部、
１０３…音声認識部、
１０４…音声認識結果出力部、
１０５…実在検索語候補リスト作成部、
１０６…対話制御部、
１０７…音声出力部、
１０８…システムデータベース、
１８１…検索データベース、
１８２…検索補助データベース、
１８３…ＹＥＳ／ＮＯデータベース、
Ｇａ１…１番目表記の単名詞集合群、
Ｇａ１−１…１番目表記の単名詞集合群Ｇａ１における第１サブ集合群、
Ｇａ１−２…１番目表記の単名詞集合群Ｇａ１における第２サブ集合群、
Ｇａ１−３…１番目表記の単名詞集合群Ｇａ１における第３サブ集合群、
Ｇａ２…２番目表記の単名詞集合群、
Ｇａ２−１…２番目表記の単名詞集合群Ｇａ２における第１サブ集合群、
Ｇａ２−２…２番目表記の単名詞集合群Ｇａ２における第２サブ集合群、
Ｇａ２−３…２番目表記の単名詞集合群Ｇａ２における第３サブ集合群、
２０１…音声入力された複合名詞の検索装置、
２０２…音声入力部、
２０３…音声認識部、
２０３Ｓ…音声認識装置、
２０４…音声認識結果出力部、
２０４ａ…音声認識結果リスト作成部、
２０５…実在検索語候補リスト作成部、
２０６…対話制御部、
２０７…音声出力部、
２０７Ｓ…音声出力装置、
２０８…システムデータベース、
２８１…検索データベース、
２８２…順序検索補助データベース、
２８３…逆順検索補助データベース、
２８４…ＹＥＳ／ＮＯデータベース、
３０１…音声対話型複合名詞の検索装置、
３０２…音声入力部、
３０３…音声認識部、
３０３Ｓ…音声認識用ソフトウェア、
３０４…音声認識結果出力部、
３０５…実在検索語候補リスト作成部、
３０６…対話制御部、
３０７…音声出力部、
３０７Ｓ…音声出力用ソフトウェア、
３０８…システムデータベース、
３８１…検索データベース、
３８２…検索補助データベース、
３８３…ＹＥＳ／ＮＯデータベース、
Ｇｃ１…１番目表記の単名詞集合群、
Ｇｃ１−１…１番目表記の単名詞集合群Ｇｃ１における第１サブ集合群、
Ｇｃ１−２…１番目表記の単名詞集合群Ｇｃ１における第２サブ集合群、
Ｇｃ２…２番目表記の単名詞集合群、
Ｇｃ２−１…２番目表記の単名詞集合群Ｇｃ２における第１サブ集合群、
Ｇｃ２−２…２番目表記の単名詞集合群Ｇｃ２における第２サブ集合群。[0001]
TECHNICAL FIELD OF THE INVENTION
The present invention relates to a compound noun search device and a search method for specifying a compound noun by a user who inputs a compound noun composed of a plurality of simple nouns by voice.
[0002]
That is, in the present invention, the search term to be searched is a compound noun composed of a sequence of a plurality of simple nouns, and when a large number of search terms are stored in the search database, the speech recognition processing and the search The present invention relates to an apparatus and a method for specifying a search word without causing the user to feel the time required for the search, and without causing the user to have an unnatural conversation due to a defect in the voice recognition processing or the like.
[0003]
[Prior art]
In a conventional database search process using a speech recognition device, a speech input by a user is collated with a recognition target database, and a similarity between all vocabularies in the recognition target database and the input speech is calculated as a recognition likelihood.
[0004]
With the current recognition technology, the number of vocabulary words that can be recognized within the real time of the dialog processing (when the search device and the user interact with each other does not cause the user to feel stress or unnaturalness) is limited, and the real time of the dialog processing is limited. If the number of words exceeding the time is set as the recognition target, the processing time becomes longer, and the user waits. On the other hand, if the number of vocabulary words to be recognized is large, and if the recognition target is composed of very similar vocabulary words, even if the dialogue between humans is performed, the recognition accuracy will be reduced so that mistakes in hearing and hearing loss occur. I can't avoid it.
[0005]
Furthermore, the recognition accuracy greatly depends on the speaker and the utterance environment, and there is no guarantee that 100% accuracy can always be obtained depending on the influence of ambient noise and the like.
[0006]
Therefore, in the conventional search device using the voice recognition technology, when it is determined whether or not the candidate of the result of voice recognition of the voice input by the user is correctly recognized, the user is confirmed whether the candidate is correct or not. The above-mentioned candidate presentation is repeated until a correct answer can be confirmed in the order of higher recognition likelihood.
[0007]
[Problems to be solved by the invention]
Using a speech recognition device, a search device that searches for a search word intended by a user from a predetermined database and determines the number of search words constituting the database can be processed in real time during the interactive processing. If the number is larger than the number, the user is put on standby during the recognition process, and after this standby, the candidates output by the recognition device are presented to the user, and correctness is checked to determine whether or not the recognition was successful. .
[0008]
For the user, waiting after the utterance, and being forced to confirm correctness after the waiting, leads to unnaturalness of the dialogue, and causes a problem of great stress.
[0009]
On the other hand, a system that supports operators can communicate their intentions to the operator in a natural flow of dialogue even if the user misses or misses hearing, so there is no unnaturalness in the dialogue and there is no great stress. Does not occur.
[0010]
In a search device that searches for a search term input by voice, in order to obtain user satisfaction, it is necessary to be able to respond in real time, as in the case of an operator, and to be able to accurately grasp the intention of the user. It is necessary to determine the user intention in the natural flow of dialogue with the user.
[0011]
In the present invention, the compound nouns in the number exceeding the number of search words that can be processed in the real time of the dialog processing are registered in the search database, and, further, when the compound noun in which there are many similar candidates is registered in the search database, An object of the present invention is to provide a search device, a search method, and a database for a compound noun input by voice, in which a search word input by a user by voice can be searched efficiently in a short time.
[0012]
[Means for Solving the Problems]
According to the present invention, a compound noun composed of a plurality of simple nouns is registered as a search word, and further, the search word is registered in the search database in which each of the single nouns is divided and registered. Single nouns that make up all the search terms are registered in descending order of their frequency, and the plural nouns registered in descending order of frequency are divided by a predetermined number in descending order of frequency. Then, using a database having a search assistance database in which a plurality of single noun set groups are formed, a user searches for compound nouns which are input in a series without being separated on the way.
[0013]
Embodiments and Examples of the Invention
FIG. 1 is a block diagram showing a search device 1 for a compound noun input by voice according to a first embodiment of the present invention.
[0014]
The apparatus 1 for retrieving a compound noun input by speech includes a speech input unit 2, a speech recognition unit 3, speech recognition software 3S, a speech recognition result output unit 4, a real search word candidate list creation unit 5, and a dialogue. The control unit 6 includes a control unit 6, a voice output unit 7, voice output software 7S, and a system database 8.
[0015]
In the search device 1 for a compound noun input by voice, the voice of the user P input via the voice input unit 2 is sent to the voice recognition unit 3, and the voice recognition unit 3 performs the voice recognition process on the input voice. First, the system database 8 is used. In addition, the voice recognition unit 3 executes a recognition process on the voice input by the user P using the voice recognition software 3S, and the voice recognition unit 3 executes the compound noun Is a recognition unit that can be divided for each single noun.
[0016]
The system database 8 includes a search database 81, a search auxiliary database 82, and a YES / NO database 83.
[0017]
The search database 81 is a database in which a compound noun composed of a plurality of simple nouns is registered as a search word, and the search word is registered by being divided into each simple noun.
[0018]
In the search auxiliary database 82, the single nouns constituting all the search words registered in the search database 81 are registered in descending order of the frequency, and the single nouns registered in the descending order of the frequency are registered in the order of the frequency. The database is divided into a predetermined number (500 in the search device 1) in order from the highest one to form a plurality of single noun set groups.
[0019]
The YES / NO database 83 is a database that recognizes contents (for example, YES / NO, YES / NO) to which the user P has responded.
[0020]
The voice recognition software 3 </ b> S selects a search database 81 or a search auxiliary database 82 from the system database 8 according to the processing scene of the search device 1.
[0021]
When the search word is input by voice, the search assistance database 82 is referred to. When the response to the user P is confirmed, the YES / NO database 83 is referred to.
[0022]
The voice recognition unit 3 uses voice recognition software 3S for voice recognition processing, and the voice output unit 7 uses voice output software 7S for voice output.
[0023]
Next, the search device 1 will be described more specifically.
[0024]
In the following description, a case will be described in which an input interface portion of a service for determining the name of a corporation, such as a company name, is provided with a search device 1 for a compound noun input by voice.
[0025]
There are as many as 22 million corporate names in telephone directories nationwide in Japan, and it takes a considerable amount of time to identify one corporate name out of 22 million. Therefore, with current speech recognition technology, It is impossible to recognize and process the name of the corporation within the real time of the interactive processing (when the search device and the user P interact, the time when the user P does not feel stress or unnaturalness). In this case, since there are many similar data, the recognition accuracy is very low.
[0026]
FIG. 2 is a diagram showing a specific example in which the corporate name in the search database 81 is composed of compound nouns, and the compound nouns are registered for each single noun constituting the compound noun.
[0027]
The frequency of each noun composing the corporate name registered in the search database 81 is checked, single nouns are recorded in order of frequency, and the search auxiliary database 82 is created. That is, the search auxiliary database 82 is a database that counts the frequency of each simple noun constituting the corporate name registered in the search database 81 and registers the single noun in order of frequency.
[0028]
In addition, the total number of simple nouns constituting the 22 million corporate names is 6.5 million.
[0029]
FIG. 3 is a diagram illustrating a specific example of the search auxiliary database 82 in the search device 1.
[0030]
The simple nouns in the search auxiliary database 82 are the simple nouns constituting the compound noun registered in the search database 812, and are arranged in descending order of the use frequency. Is a single noun set group.
[0031]
The set group containing the single noun having the highest use frequency is defined as a first simple noun set group G1 (a simple noun set group that is to be recognized in preference to other simple noun set groups), followed by the use frequency A set of simple nouns including simple nouns with high is referred to as a second simple noun set G2, and thereafter, a third simple noun set G3, a fourth simple noun set G4,... And
[0032]
Assuming that the number of nouns forming one set group is n, in the search device 1, n = 500, and this number is determined as follows. In other words, a single noun that can be processed by the voice recognition software 3S in real time of the dialog processing (when the search device and the user P communicate with each other, the time does not cause the user P to feel stress or unnaturalness). Assuming that the number is T and the average number of single nouns of all the search words registered in the search database 81 is M, n = T / M.
[0033]
Specifically, it is assumed that the number T of single nouns that can be processed by the voice recognition software 3S in the real time of the interactive processing is 1500, and the average number M of single nouns of all search words registered in the search database 81 is 3 Then, n = T / M = 1500/3 = 500.
[0034]
The number n of the single nouns constituting each of the single noun set groups G1, G2, G3, G4,... May be a number other than 500, but as described above, it can be processed within the real time of the interactive processing. It is determined by the number T of simple nouns (performance of the speech recognition software 3S) and the average M of the number of simple nouns included in the search words registered in the search database 81.
[0035]
FIG. 4 is a flowchart specifically illustrating a processing process for determining a search word in the search device 1.
[0036]
It is assumed that a search database 81 and a search auxiliary database 82 have been created.
[0037]
First, a request is made to the user P to input a voice for a corporate name (compound noun) to be searched (S0). In this case, it is assumed that the user P has input a compound noun in a series without separating each single noun constituting the compound noun. The voice recognition unit 3 divides compound nouns input by the user P in a series for each single noun. Then, the function k indicating the order of the single noun set group is set to 1.
[0038]
Here, consider a case in which the compound noun "Yokosuka / citizen / hospital" is input as a search word by voice. First, the speech recognition unit 3 performs recognition processing for each of the single nouns "Yokosuka", "citizens", and "hospitals" input using speech, using the first single noun set group G1 (S1). That is, the speech recognition unit 3 performs the recognition process within the range of the first single noun set group G1. Then, the speech recognition result output unit 4 outputs the recognition result.
[0039]
Of the single nouns “Yokosuka”, “Citizen”, and “Hospital” input by voice, the simple nouns “Yokosuka” and “Hospital” are included in the first simple noun set group G1, but “Citizen” As shown in FIG. 3, the frequency order of の is the 790th order, and is therefore not included in the first simple noun set group G1, but included in the second simple noun set group G2.
[0040]
FIG. 5 is a diagram showing a specific example of a recognition result list in which the recognition results for each single noun constituting the search word (compound noun) input by speech and the recognition likelihood correspond to each other in the search device 1. is there.
[0041]
When one single noun is spoken, the speech recognition result output unit 4 outputs a plurality of constituent noun candidates for the single noun and the recognition likelihood for each of the plurality of constituent noun candidates. The recognition result list is created by associating the outputted constituent noun candidates with the recognition likelihood (S1). The recognition result list created in this way is shown in FIG.
[0042]
For example, when the single noun “Yokosuka” is voice-inputted, the voice recognition unit 3 recognizes the input voice as “Yokosuka”, “Yokohama”, “Sukahama”, “Yoko-Yoko”, as shown in FIG. The recognition likelihood of “Yokosuka” is 97, the recognition likelihood of “Yokohama” is 90, the recognition likelihood of “Sukahama” is 89, and the recognition likelihood of “Yokoshiko” is The degree is 80.
[0043]
Note that the recognition likelihood for constituent noun candidates is predetermined when the search database 81 is created.
[0044]
Here, a constituent noun candidate whose recognition likelihood is equal to or greater than a predetermined threshold is referred to as a “potential constituent noun candidate”. In the search device 1, the predetermined threshold is set to 80. That is, constituent noun candidates whose recognition likelihood is 80 or more are dominant constituent noun candidates.
[0045]
Then, from the recognition results shown in FIG. 5, a single noun having a recognition likelihood of 80 or more is selected as a potential constituent noun candidate (S2).
[0046]
As shown in FIG. 5, with respect to the simple noun "Yokosuka" constituting the compound noun input by speech, four candidates of "Yokosuka", "Yokohama", "Sukahama", and "Yokosuka" are the leading constituent noun candidates. For the single noun “citizen” selected and spoken, one candidate of “municipal” is selected as a probable constituent noun candidate, and for the single noun “hospital” spoken, “hospital”, “beauty salon” Are selected as potential constituent noun candidates. Using these influential constituent noun candidates, a real search word candidate list is created (S3). The real search word candidate list is a list in which, among the search words existing in the search database 81, the search word candidates that are presented to the user P and require confirmation are collected.
[0047]
By the way, the frequency order of the simple noun “citizen” in the auxiliary database 82 is 790, as shown in FIG. 2, so the simplest noun “citizen” is not included in the first simple noun set group G1. . Therefore, the recognition result for “citizen” illustrated in FIG. 5 does not include the simple noun “citizen”.
[0048]
FIG. 6 is a diagram showing how the search device 1 creates a real search word candidate list.
[0049]
When a single noun whose recognition likelihood is 80 or more is combined in the recognition results shown in FIG. 5, the combination of the single noun registered in the search database 81 is “Yokosuka / municipal / Hospital "and" Yokohama / Municipal / Hospital ", and these two combinations are selected as" existing search word candidates "which are actual search word candidates in the search database 81. When the real search word candidates thus selected are listed, a real search word candidate list shown in the lower part of FIG. 6 is created (S3).
[0050]
In the search device 1, the search word recognition likelihood is obtained by adding the recognition likelihood of each single noun as shown in FIG. When calculating the search word recognition likelihood, the recognition likelihood of each single noun may be multiplied.
[0051]
Then, it is determined whether or not there is a candidate in the real search word candidate list that satisfies the condition that the search word can be determined only by the confirmation processing by the user P (S21).
[0052]
Here, the threshold value of the search word recognition likelihood is predetermined, and if the search word recognition likelihood of the predetermined search word candidate exceeds the threshold value of the search word recognition likelihood, the search word candidate is determined by the user P It is determined that the condition that the search term can be determined only by the confirmation processing by is satisfied. It is assumed that the threshold value of the search word recognition likelihood is set to 270 in the search device 1.
[0053]
Then, the search word candidates are presented to the user P on a display or the like, and confirmation is requested (S22). If the user P responds with YES (S23), the search process is terminated.
[0054]
Since none of the search word candidates in the real search word candidate list shown in FIG. 6 exceed the threshold 270 of the search word recognition likelihood, the search word candidates in the real search word candidate list shown in FIG. In all cases, the search term cannot be determined only by the confirmation processing by the user P.
[0055]
In preparation for the case where the search word cannot be determined as described above, processing such as recognition of a single noun and calculation of a recognition likelihood is performed in advance within the range of the second single noun set group G2. That is, the search term candidates are presented to the user P on a display or the like, and in parallel with the process of asking for confirmation (S22) and the response by the user P (S23), the single nouns within the range of the second single noun set group G2 , Calculation of the recognition likelihood (S12), re-selection of potential constituent noun candidates (S13), and updating of the existing search word candidate list (S14).
[0056]
That is, first, the function k indicating the rank of the single noun set group is incremented by 1 (S11), and k is set to 2, and the second single noun set group G1 having the second highest frequency of use after the recognition processing is completed. Recognize within a single noun set group G2 (single noun set group composed of single nouns whose frequency ranking is 500th to 1,000th), and use this recognition result to Attempt extraction (S12, S13, S14).
[0057]
In the flowchart shown in FIG. 4, after step S3, the route to step S11 is indicated by a broken line, separately from the route to step S21. This means that the routine of steps S21 to S23 and the routine of steps S11 to S15 are performed in parallel. In other words, while the dialog processing (S21 to S23) with the user P is being performed on the first single noun set group G1, the re-selection and real existence of a potential constituent noun candidate for the second single noun set group G2 are performed. Update of the search word candidate list and the like (S11 to S15) are executed.
[0058]
After that, if necessary, while the dialog processing (S21 to S23) with the user P is being performed on the second single noun set group G2, the third single noun set group G3 is Re-selection of constituent noun candidates, updating of the existing search word candidate list, and the like (S11 to S15) are performed, and further, dialog processing (S21 to S23) with the user P is performed on the third single noun set group G3. In the meantime, for the fourth single noun set group G4, re-selection of potential constituent noun candidates, updating of the existing search word candidate list, and the like (S11 to S15) are performed. These parallel processes are further repeated as necessary.
[0059]
FIG. 7 is a diagram illustrating a specific example of an updated recognition result list in the search device 1.
[0060]
FIG. 7 shows a list in which the recognition results recognized in the range of the second single noun set group G2 are added and the potential constituent noun candidates are updated.
[0061]
As shown in FIG. 2, the frequency order of the single noun “citizen” in the auxiliary database 82 is 790, so that the second single noun set group G2 includes the single noun “citizen”. In the updated recognition result list shown in FIG. 7, “citizen” is included as a probable constituent noun candidate in the recognition result for the single noun “citizen” constituting the compound noun input by voice.
[0062]
FIG. 8 is a diagram illustrating a real search word candidate list obtained as a result of executing a process of creating a real search word candidate list in the search device 1.
[0063]
As a result of executing the process of creating the real search word candidate list, the real search word candidate list shown in FIG. 8 is obtained, and the search word candidates having the search word recognition likelihood exceeding the threshold 270 are “Yokosuka / Citizens / hospitals "and" Yokohama / citizens / hospitals ", of which" Yokosuka / citizens / hospitals "are presented to the user P and their correctness is confirmed (S22).
[0064]
In the right / wrong confirmation, when a response (YES) indicating affirmativeness is obtained from the user P (S23), the processing is terminated assuming that the search word has been determined.
[0065]
By the way, when the number of vocabulary words to be recognized is large and there are many similar vocabularies, the conventional speech recognition technology requires a long recognition processing time. It is very difficult to search within a certain time without giving a stress to the user P, and the accuracy of the recognition device is inevitably reduced.
[0066]
Therefore, the search device 1 restricts the search target (search word) to only compound nouns composed of a series of simple nouns, and instructs the user P to input a search word by dividing the single noun one by one. A) The speech recognition unit 3 separates compound nouns input by the user P in series into single nouns, and for each of the separated single nouns, holds a single noun constituting a search word in order of use frequency. Recognition processing is performed with 82 as a recognition target. This is a feature of the search device 1.
[0067]
Although it is expected that the total number of simple nouns registered in the search auxiliary database 82 is smaller than the total number of search words registered in the search database 81, the total number of simple nouns registered in the search auxiliary database 82 Is often a number that cannot be processed within the real time of the interactive processing. As described above, the total number of simple nouns registered in the search auxiliary database 82 is a number that cannot be processed within the real time of the interactive processing, and the single nouns registered in the search auxiliary database 82 are processed at one time. If so, the user P must wait.
[0068]
Therefore, the retrieval device 1 sorts the single nouns in the retrieval auxiliary database 82 in the order of use frequency, divides the single nouns into a predetermined number that can be processed in the real time of the interactive processing, and generates a plurality of single noun set groups. And performs recognition processing in the order of the single nouns that are frequently used, and performs recognition processing on the remaining single noun set groups at the time when the recognition result for each single noun set group is interacted with by a search word candidate selection dialogue. As a result, as soon as the first search word candidate selection dialog for one search word is completed, the second search word candidate selection dialog is performed, and if necessary, the third, fourth,. Since the selection dialogue continues, there is no time for the user P to wait.
[0069]
That is, since the user P cannot see the processing status inside the search device 1 at all, it looks as if the recognition process registered in the search database 81 is performed collectively. .
[0070]
In addition, according to the search device 1, from the recognition results for each single noun, potential constituent noun candidates are selected, and from among all combinations of the selected potential constituent noun candidates, only the search word candidates actually existing in the search database 81 are selected. Is selected, and a search word candidate that can be determined to satisfy the condition that can be determined only by the confirmation processing by the user P is presented to the user P. Therefore, the search device 1 can extract the recognition result in the order of the search words having the highest existence probability.
[0071]
In other words, the search device 1 reduces the number of single nouns to be recognized at one time, thereby preventing the user P from waiting, and by performing recognition processing in order from the most frequently used single noun. , The recognition results can be extracted in the order of the search terms having the highest existence probability.
[0072]
In other words, when searching for compound nouns using speech recognition, the search device 1 does not make the user P feel the length of the recognition processing time resulting from the large number of recognition targets, The user P does not feel the stress caused by the necessity of repeating the correctness check, and has a highly accurate naturalness.
[0073]
FIG. 9 is a diagram generally showing a search database 81 in the search device 1.
[0074]
The search database 81 holds a number of search words for which speech recognition processing cannot be performed within the real time of the interactive processing, and each search word is registered in a form divided into simple nouns.
[0075]
That is, for example, the compound noun N ₁ Is a simple noun n ₁ , N ₂ , N ₃ The search database 81 includes a compound noun N ₁ As a simple noun n ₁ , N ₂ , N ₃ Are registered in a state separated for each single noun.
[0076]
FIG. 10 is a diagram generally showing the search auxiliary database 82.
[0077]
The search auxiliary database 82 is a database that calculates the use frequency of each simple noun that constitutes all search words registered in the search database 81, and registers the single nouns arranged in order of frequency.
[0078]
The number of single nouns registered in the search auxiliary database 82 is expected to be smaller than the number of search words in the search database 81, but is configured to be a number that cannot be processed within the real time of the interactive processing.
[0079]
In the search assistance database 82, the simple nouns are arranged in descending order of frequency (ordering), and the simple nouns are divided by the number of the simple nouns that can be processed in a predetermined real time of the dialog processing (the above-described order of simple nouns). Is divided) to form a single noun set group. A simple noun set group including the most frequent simple noun is referred to as a first simple noun set group G1.
[0080]
Further, the search device 1 can be understood as a program invention.
[0081]
That is, the search device 1 creates a search database in which a compound noun composed of a plurality of simple nouns is registered as a search word, and further, a search database is created in which the search word is divided and registered in each simple noun. Procedures and simple nouns that constitute all the search terms registered in the search database are registered in descending order of frequency, and furthermore, the nouns registered in descending order of frequency are sorted in descending order of frequency. In order, divided into a predetermined number, a search auxiliary database creation procedure to create a search auxiliary database in which a plurality of single noun set group is formed, without the user dividing the search word for each single noun, When a series of speech inputs are made, a search term separation procedure for dividing the speech input search term for each single noun, and for each of the separated single nouns, A recognition result list creating procedure for creating a recognition result list by performing a recognition process within a first union set group including a single noun having a high degree of correspondence, and making the recognition likelihood correspond to the recognition result list; And a recognition result list in which pairs of the constituent noun candidates and the recognition likelihoods of the constituent noun candidates are arranged in the order of the recognition likelihood. Among the constituent noun candidates described in (1), a constituent noun candidate having a recognition likelihood exceeding a predetermined threshold is selected as a strong constituent noun candidate, and all of the strong constituent noun candidates are selected. From the combinations, only the search terms that exist in the search database are selected, a predetermined operation is performed on the recognition likelihood of each influential constituent noun candidate, a search term recognition likelihood is calculated, and the calculated search term is calculated. Recognition A procedure for creating a real search word candidate list in which the real search word candidate list is arranged in descending order of likelihood, and the search word candidates in the above real search word candidate list are used to identify the search word only by the user's confirmation processing. This example is a program that causes a computer to execute a necessary dialogue with the user when the condition that can be satisfied is satisfied and to execute a search term candidate selection dialogue procedure for determining a search term.
[0082]
The program may be recorded on a recording medium such as an FD, a CD, a DVD, a HD, and a semiconductor memory.
[0083]
It should be noted that the work of determining the name of a corporation performed by the search device 1 can be widely applied to an input interface portion in a product delivery service, a telephone number search, a postal code search, and the like.
[0084]
FIG. 11 is a block diagram showing a search device 101 for a compound noun input by voice according to a second embodiment of the present invention.
[0085]
The apparatus 101 for retrieving a compound noun input by voice includes a voice input unit 102, a voice recognition unit 103, voice recognition software 103S, a voice recognition result output unit 104, and a real search word candidate list creation unit 105. The control unit 106 includes an audio output unit 107, audio output software 107S, and a system database 108.
[0086]
In the search device 101 for a compound noun input by voice, the voice of the user P input through the voice input unit 102 is sent to the voice recognition unit 103, and the voice recognition unit 103 performs a voice recognition process on the input voice. First, the system database 108 is used. In addition, the voice recognition unit 103 performs recognition processing on the voice input by the user P using the voice recognition software 3S. It is a recognition unit that can divide nouns into single nouns.
[0087]
The system database 108 includes a search database 181, a search auxiliary database 182, and a YES / NO database 183.
[0088]
The search database 181 is a database in which compound nouns composed of a plurality of simple nouns are registered as search words, and the search words are registered by being divided into each simple noun.
[0089]
The search auxiliary database 182 refers to a group of unified nouns of each compound noun registered as n-th (n is an integer value) a single-noun set group of the n-th notation, and Are registered in descending order of the frequency, and the nouns registered in the descending order of frequency are a predetermined number (500 in the second embodiment) in descending order of the frequency. ) Is a search assistance database in which a plurality of sub-set groups are formed.
[0090]
The YES / NO database 183 is a database that recognizes contents (for example, YES / NO, YES / NO) to which the user P has responded.
[0091]
The voice recognition software 103S selects a search database 181 or a search auxiliary database 182 from the system database 108 in accordance with the processing scene of the search device 101.
[0092]
When the search word is input by voice, the search assistance database 182 is referred to, and when a response to the user P is confirmed to be correct, the YES / NO database 183 is referred to.
[0093]
The voice recognition unit 103 uses voice recognition software 103S for voice recognition processing, and the voice output unit 107 uses voice output software 107S for voice output.
[0094]
Next, a second embodiment will be described more specifically.
[0095]
In the following description, a case will be described in which an input interface portion of a service for determining the name of a corporation, such as a company name, is provided with a search device 101 for a compound noun input by voice.
[0096]
FIG. 12 is a diagram showing a specific example in which, in the second embodiment, the corporate name in the search database 181 is composed of compound nouns, and the compound nouns are registered separately for each of the constituent simple nouns. is there.
[0097]
The frequency of each noun composing the corporate name registered in the search database 181 is checked, single nouns are recorded in order of frequency, and the search auxiliary database 182 is created.
[0098]
In addition, the total number of simple nouns constituting the 22 million corporate names is 6.5 million. In addition, among the nouns that make up 22 million cases in the corporate name, the first noun (the single noun located at the first word in the corporate name, that is, the plural nouns that make up the compound noun that is the corporate name) Among the simple nouns, the total number of the simplest noun located at the first position is about 3.6 million types, and the simplest noun in the second notation (the simple noun located at the second word in the corporate name, that is, the corporate name) The total number of the simplest nouns constituting the compound noun is approximately 2.5 million types, and the simplest noun in the third notation (the third word in the corporate name) The total number of located simple nouns, that is, the third noun among a plurality of simple nouns constituting a compound noun in the name of a corporation, is about 2.7 million types, and the single noun in the fourth notation ( A simple noun located at the fourth word in the corporate name, that is, the corporate name Among a plurality of simple nouns composing a noun, the total number of the fourth single noun) is about one million,... (The longest constituent word number is 7). Recognition processing of each single noun with respect to a compound noun is impossible, and it is expected that accuracy is low.
[0099]
FIG. 13 is a diagram illustrating a specific example of data stored in the search assistance database 182 in the second embodiment.
[0100]
The simple nouns in the search assistance database 182 are simple nouns that compose the compound noun registered in the search database 181, and a single noun set group is formed for each notation order in the compound noun. Only the single nouns in the compound noun whose order of notation is the first are collected to form a single noun set group Ga1 of the first notation, and in the single noun set group Ga1 of the first notation, the single nouns are sorted in descending order of the use frequency. The registered single nouns are grouped into 500 single nouns in descending order of their frequency to form a plurality of sub-set groups. A group Ga1-1, a second subgroup Ga1-2,..., An m-th subgroup Ga1-m are formed.
[0101]
The first sub-set group Ga1-1 including the single noun that is used most frequently is an object to be recognized in preference to the other sub-set groups Ga1-2 to Ga1-m.
[0102]
Also, by collecting only single nouns in the second order of compound nouns, a single noun set group Ga2 of the second notation is formed. Single nouns are registered, and these registered single nouns are grouped for each 500 single nouns in descending order of frequency, to form a plurality of sub-set groups. The first sub-set group Ga2-1, the second sub-set group Ga2-2, ..., the m-th sub-set group Ga2-m are formed.
[0103]
The first sub-set group Ga2-1 including the single noun that is used most frequently is a target to be recognized with priority over the other sub-set groups Ga2-2 to Ga2-m.
[0104]
Hereinafter, similarly to the above, by collecting only the nouns in the order of the n-th notation in the compound noun, a single noun set group Gan of the n-th notation is formed, and in the single noun set group Gan of the n-th notation, Simple nouns are registered in order of use frequency, and these registered simple nouns are grouped for every 500 simple nouns in descending order of the frequency to form a plurality of sub-set groups. , A first sub-set group Gan1, a second sub-set group Gan2,..., And an n-th sub-set group Gan-m are formed.
[0105]
Note that the first sub-set group Gan-1 including the single noun that is used most frequently is an object to be recognized with priority over the other sub-set groups Gan-2 to Gn-m.
[0106]
By the way, assuming that the number of nouns constituting one sub-set group is N, in the second embodiment, N = 500, and this number is determined as follows. In other words, a single noun that can be processed by the speech recognition software 103S in the real time of the dialog processing (when the search device and the user P communicate with each other, the time does not cause the user P to feel stress or unnaturalness). Assuming that the number is T and the average number of single nouns of all the search words registered in the search database 181 is M, N = T / M.
[0107]
More specifically, it is assumed that the number T of single nouns that can be processed by the voice recognition software 103S in the real time of the interactive processing is 1500, and the average number M of single nouns of all search words registered in the search database 181 is 3 Then, N = T / M = 1500/3 = 500.
[0108]
The number N of simple nouns constituting each of the sub-set groups Ga1-1 to Gan-m may be a number other than 500. It is determined by the number T (performance of the speech recognition software 103S) and the average M of the number of simple nouns included in the search words registered in the search database 181.
[0109]
FIG. 14 is a flowchart specifically showing a processing process for determining a search word in the second embodiment.
[0110]
It is assumed that a search database 181 and a search auxiliary database 182 have been created.
[0111]
First, a request is made to the user P to input a voice for a corporate name (compound noun) to be searched (S100). In this case, it is assumed that the user P has input a compound noun in a series without separating each single noun constituting the compound noun. The voice recognition unit 3 divides compound nouns input by the user P in a series for each single noun. Then, the function k indicating the order of the single noun set group is set to 1.
[0112]
Then, the function k of the sub-set group (function indicating the rank of the sub-set group) in each single noun set group relating to the compound noun as the search word is set to 1. That is, if the function k of the subset group is 1, each single noun is recognized using the subset groups Ga1-1, Ga2-1, Ga3-1,... Among the plurality of subset groups. Is done.
[0113]
Here, consider a case where the compound noun “Yokohama / Grand / Hotel” is input as a search word by voice.
[0114]
First, the compound noun input by voice is separated into single nouns “Yokohama”, “Grand”, and “Hotel” by the voice recognition unit 3, and each of these separated “Yokohama”, “Grand”, and “Hotel” is separated. , The first subset group Ga1-1 in the first notation set noun group Ga1, the first subset group Ga2-1 in the second notation noun set group Ga2, the third noun noun set group The speech recognition unit 103 performs a recognition process using the first subgroup Ga3-1 in Ga3 (S101). That is, the single noun “Yokohama” constituting the compound noun input by speech is recognized and processed within the range of the first sub-set group Ga1-1, and the single noun “ground” input by speech is subjected to the first sub-set group. Recognition processing is performed within the range of Ga2-1, and recognition processing is performed on the single noun "hotel" input by voice within the range of the first subset group Ga3-1. Then, the speech recognition result output unit 104 outputs the recognition result.
[0115]
Of the simple nouns “Yokohama”, “Grand”, and “Hotel” constituting the compound noun input by voice, the simple noun “Yokohama” is included in the first sub-set group Ga1-1 as shown in FIG. It is assumed that the simple noun “hotel” is also included in the first sub-set group Ga3-1. However, since the frequency order of the single noun “Grand” is the 951 order as shown in FIG. 13, it is not included in the first sub-set group Ga2-1 but is included in the second sub-set group Ga2-2. Have been.
[0116]
FIG. 15 shows a specific example of a recognition result list in which the recognition results for each single noun constituting the search term (compound noun) input by voice and the recognition likelihood correspond to each other in the second embodiment. FIG.
[0117]
When one single noun is input as speech, a plurality of constituent noun candidates for the single noun constituting the compound input noun and the recognition likelihood for each of the plurality of constituent noun candidates are output as a speech recognition result. The recognition result list is created by associating the outputted constituent noun candidates with the recognition likelihood by the unit 104 (S101). The recognition result list created in this way is shown in FIG.
[0118]
For example, when the speech recognition unit 103 cuts out the single noun “Yokohama” from the compound noun that is input as speech, the speech recognition unit 103 converts the cut-out speech into “Yokohama” and “Yokosuka” as shown in FIG. , “Yokoyama”, “Yokoyama”,..., The recognition likelihood of “Yokohama” is 95, the recognition likelihood of “Yokosuka” is 90, and the recognition likelihood of “Yokomizo” is 81. , “Yokoyama” has a recognition likelihood of 75.
[0119]
Note that the recognition likelihood of each single noun is determined individually by the voice recognition software 103S when the voice recognition unit 103 cuts out a single noun from a compound noun input by voice.
[0120]
Here, a constituent noun candidate whose recognition likelihood is equal to or greater than a predetermined threshold is defined as a probable constituent noun candidate. In the second embodiment, the predetermined threshold is set to 80. That is, constituent noun candidates whose recognition likelihood is 80 or more are dominant constituent noun candidates. The fact that the predetermined threshold value is 80 is preset when the databases 81 and 82 are created.
[0121]
Then, from the respective recognition results shown in FIG. 15, a single noun having a recognition likelihood of 80 or more is selected as a potential constituent noun candidate (S102).
[0122]
As shown in FIG. 15, with respect to the single noun "Yokohama" constituting the compound noun input by speech, three candidates "Yokohama", "Yokosuka", and "Yokomizo" are selected as the probable constituent noun candidates and input by voice. There is no single noun "Grand" that composes a compound noun that has not been selected as a probable constituent noun candidate. Two candidates are selected as potential constituent noun candidates.
[0123]
Then, a real search word candidate list is created using these influential constituent noun candidates (S103). Note that the actual search word candidate list is a list of search word candidates that are presented to the user P and are asked for confirmation among search words existing in the search database 181.
[0124]
By the way, as shown in FIG. 12, the frequency order of the second notation single noun “Grand” is 951 in the second notation single noun set group Ga2 stored in the auxiliary database 82, The first subset group Ga2-1 of the notation single noun set group Ga2 does not include the single noun “ground”. Therefore, the recognition result for the single noun “Ground” illustrated in FIG. 15 does not include the single noun “Grand”.
[0125]
Therefore, in the step (S103) of creating a real search word candidate list, no real search word candidate list is created.
[0126]
Since the condition for specifying the search term is not satisfied only by the confirmation processing (S121), the function k of the sub-set group is incremented by 1 (S111), and the sub-set group to be searched is Ga1-2, Ga2-2, Each simple noun that becomes Ga3-1 and constitutes the compound noun that is input by voice is recognized using the new set group Ga2-2, and the recognition likelihood is calculated (S112). Then, with the newly recognized recognition result added (using the updated recognition result list), the influential constituent noun candidates are re-selected (S113), and the existing search word candidate list is updated (S114).
[0127]
FIG. 16 is a diagram illustrating a specific example of the updated recognition result list in the second embodiment.
[0128]
FIG. 16 shows a list in which recognition results recognized in the range of the second sub-set groups Ga1-2, Ga2-2, and Ga3-2 have been added and the potential constituent noun candidates have been updated.
[0129]
As shown in FIG. 13, in the auxiliary database 82, the frequency order of the single noun “Grand” in the second single noun set group Ga2 is 951, and thus the single noun “Grand” is included in the second subset group Ga2-2. The updated recognition result list shown in FIG. 16 includes the noun “Grand”, and therefore, in the recognition result for the single noun “Grand” constituting the compound noun input by speech, "Grand" is included.
[0130]
FIG. 17 is a diagram showing how a real search word candidate list is created in the second embodiment.
[0131]
A subset of the voice input single noun “grand” for which no influential constituent noun candidate could not be selected was newly recognized, and the recognition result was combined with a single noun having a recognition likelihood of 80 or more. As shown in FIG. 17, combinations of simple nouns registered in the search database 181 are two combinations of “Yokohama / Grand / Hotel” and “Yokohama / Gold / Hall”. The combination of the two is selected as a real search word candidate that is a real search word candidate in the search database 181. When a real search word candidate list is created for the real search word candidates thus selected, the real search word candidate list shown in the lower part of FIG. 17 is updated (S114).
[0132]
In the second embodiment, the search word recognition likelihood is obtained by adding the recognition likelihood of each single noun as shown in FIG. When calculating the search word recognition likelihood, the recognition likelihood of each single noun may be multiplied.
[0133]
Then, it is determined whether or not there is a candidate in the real search word candidate list that satisfies the condition that the search word can be determined only by the confirmation processing by the user P (S121).
[0134]
Here, the threshold value of the search word recognition likelihood is predetermined, and if the search word recognition likelihood of the predetermined search word candidate exceeds the threshold value of the search word recognition likelihood, the search word candidate is determined by the user P It is determined that the condition that the search term can be determined only by the confirmation processing by is satisfied. In the second embodiment, it is assumed that the threshold value of the search word recognition likelihood is set to 280.
[0135]
Then, the search term candidates are presented to the user P on a display or the like, and confirmation is requested (S122). If the user P responds with YES (S123), the search process is terminated.
[0136]
Since one of the search word candidates in the real search word candidate list shown in FIG. 17 satisfies the condition that the threshold of the search word recognition likelihood is 280 or more, the real search word candidate list shown in FIG. The search word candidate can determine the search word only by the confirmation processing by the user P.
[0137]
If none of the search word candidates in the real search word candidate list satisfies the condition that the search word recognition likelihood threshold is 280 or more, all of the search word candidates in the real search word candidate list are: The search word cannot be determined only by the confirmation processing by the user P.
[0138]
In preparation for the case where the search term cannot be determined in this way, processing such as recognition of a single noun and calculation of a recognition likelihood in the range of the third subset group Ga1-3, G2-3, G3-3 is performed. It is executed in advance. In other words, the search term candidates are presented to the user P on a display or the like, and in parallel with the process of requesting confirmation (S122) and the response of the user P (S123), the third sub-set groups Ga1-3, Ga2-3, Recognition of a single noun, calculation of a recognition likelihood (S112), re-selection of influential constituent noun candidates (S113), and updating of a real search word candidate list (S114) are performed in the range of Ga3-3.
[0139]
In other words, the function k indicating the rank of the sub-set group is further incremented by 1 (S111), and k = 3, and the third sub-set group Ga1-3 having the second highest usage frequency after the second sub-set group after the recognition process is completed. , Ga2-3, Ga3-3 (recognize in the range of the sub-set groups Ga1-3, Ga2-3, Ga3-3 composed of simple nouns whose frequency ranking is 1001 to 1,500. Attempt to re-extract influential constituent noun candidates using the recognition result (S112, S113, S114).
[0140]
In the flowchart shown in FIG. 14, after step S3, a route to step S111 is indicated by a broken line, separately from a route to step 21. This means that the routine of steps S121 to S123 and the routine of steps S111 to S115 are processed in parallel.
[0141]
In other words, while the interactive processing (S121 to S123) with the user P is being performed on the first sub-set groups Ga1-1, Ga2-1, and Ga3-1, the second sub-set groups Ga1-2, Ga2- 2. For Ga3-2, re-selection of influential constituent noun candidates, updating of the existing search word candidate list, and the like (S111 to S115) are performed.
[0142]
After that, if necessary, the third sub-set group Ga1-2, Ga2-2, and Ga3-2 are executed while the dialogue processing with the user P (S121 to S123) is executed. For the set groups Ga1-3, Ga2-3, and Ga3-3, re-selection of influential constituent noun candidates, updating of the existing search word candidate list, and the like (S111 to S115) are performed, and further, the third subset group Ga1-3 , Ga2-3, and Ga3-3, while the dialogue processing with the user P is being performed (S121 to S123), the leading constituent nouns of the fourth sub-set groups Ga1-4, Ga2-4, and Ga3-4 Re-selection of candidates, updating of the existing search word candidate list, and the like (S111 to S115) are performed. These parallel processes are further repeated as necessary.
[0143]
By the way, when the number of vocabulary words to be recognized is large and there are many similar vocabularies, the conventional speech recognition technology requires a long recognition processing time. It is very difficult to search within a certain time without giving a stress to the user P, and the accuracy of the recognition device is inevitably reduced.
[0144]
Therefore, in the second embodiment, the search target (search word) is limited to only compound nouns composed of a series of simple nouns, and the user P is instructed to input the search word, and the user P inputs the search word. The speech recognition unit divides the compound noun into simple nouns, and for each of the separated simple nouns, a single noun set group is formed at each position described in the compound noun, and further, each single noun set group is formed. Are registered in descending order of frequency, and a plurality of single nouns registered in descending order of frequency are grouped by a predetermined number in descending order of frequency to form a plurality of subset groups. Using a database having a search assistance database, a compound noun searched by a user by voice is searched.
[0145]
As described above, according to the position of the simple noun in the compound noun, a simple noun set group is formed for each simple noun, and the simple noun set group is divided into a plurality of sub-set groups. The feature of the second embodiment is that it is recognized from a single noun having a high level.
[0146]
Although it is expected that the total number of simple nouns registered in the search auxiliary database 182 is smaller than the total number of search words registered in the search database 181, the total number of simple nouns registered in the search auxiliary database 182 is small. Is often a number that cannot be processed within the real time of the interactive processing. As described above, the total number of simple nouns registered in the search auxiliary database 182 is a number that cannot be processed in the real time of the interactive processing, and the single nouns registered in the search auxiliary database 182 are processed at one time. If so, the user P must wait.
[0147]
Therefore, in the second embodiment, the simple nouns in the search auxiliary database 182 are arranged in the order of use frequency, and the simple nouns are divided into a predetermined number that can be processed in the real time of the interactive processing. A group is formed, a recognition process is performed in the order of the single nouns that are frequently used, and a recognition process is performed on the remaining sub-set groups at a time when the recognition result for each of the sub-set groups is searched for a search word candidate. As a result, as soon as the first search word candidate selection dialog is completed for one search word, the second search word candidate selection dialog is performed, and if necessary, the third, fourth,. Since the selection dialogue continues, there is no time for the user P to wait.
[0148]
That is, since the user P cannot see the processing status inside the search device 101 at all, it looks as if the search words registered in the search database 181 are collectively processed.
[0149]
Further, according to the second embodiment, from the recognition result for each single noun, a candidate for a probable constituent noun is selected, and a search term existing in the search database 181 is selected from all combinations of the selected probable constituent noun candidates. Only the candidates are selected, and a search term candidate that can be determined to satisfy the condition that can be determined only by the confirmation processing by the user P is presented to the user P. Therefore, in the second embodiment, the recognition results can be extracted in the order of the search words having the highest existence probability.
[0150]
That is, in the second embodiment, by reducing the number of single nouns to be recognized at one time, the recognition process is performed in order from the frequently used single noun without waiting for the user P. Thus, the recognition results can be extracted in the order of the search words having the highest existence probability.
[0151]
In other words, in the second embodiment, when a compound noun is searched using speech recognition, the user P does not feel the length of the recognition processing time caused by the fact that the recognition target is large, and the user P does not make an error. The user P does not feel the stress caused by the necessity of repetition of correct / incorrect confirmation by recognition, and has high-accuracy naturalness.
[0152]
By the way, the single noun (the second simple noun in the compound noun) of the second notation of the compound noun "Yokohama / Grand / Hotel" is "Grand", in which all of the single nouns constituting all the search words are arranged in order of frequency. In this case, since it is located at the 2,450th position, all the nouns stored in the search database 181 are arranged in order of their use frequency, and the recognition processing is performed on the 500 single-word nouns. It will appear for the first time in the group. In this case, at least the recognition process and the real search word list creation process are repeated five times. In the process, if there is a real search word candidate whose search word recognition likelihood exceeds a specified threshold, the user is notified. Is presented, and the user P feels strange. However, in the second embodiment, “Grand”, which is the second noun of “Yokohama / Grand / Hotel” (the second single noun in the compound noun), is in the second noun set group Ga2. Since the frequency order is 951, if the recognition process and the real search word list creation process are executed twice, the search word can be determined, and the user P does not feel uncomfortable.
[0153]
Further, the second embodiment can be understood as a program invention.
[0154]
That is, in the second embodiment, a compound noun composed of a plurality of simple nouns is registered as a search word, and further, a search database for creating a search database in which the above-described search word is divided and registered into each simple noun is registered. The database creation procedure and a group of simple nouns described in the nth (n is an integer) of each of the registered compound nouns is called a simple noun set group in the nth notation, and a simple noun set group in the nth notation is referred to. Noun set groups are registered in descending order of their frequency, and single nouns registered in descending order of frequency are grouped by a predetermined number in descending order of their frequency. A search assistance database creation procedure for creating a search assistance database that has been formed, and when the user inputs speech in a series without separating the above search terms into single nouns, the speech input search terms are converted into single nouns Inspection When the user inputs the search term in a series of words by the word separation procedure, the single noun of the n-th notation of the compound noun is recognized by the single noun set group of the n-th notation, and the n-th notation is obtained. A recognition result list creating procedure for performing recognition processing within a first sub-set group including the most frequent single noun among the simple noun sets, and creating a recognition result list in correspondence with the recognition likelihood; A recognition result list in which a set of a constituent noun candidate that is a single noun that has been subjected to the recognition processing and a recognition likelihood of the constituent noun candidate is arranged in the order of recognition likelihood. And a constituent noun candidate having a recognition likelihood exceeding a predetermined threshold among the constituent noun candidates described in the recognition result list. From among all the combinations of the influential constituent noun candidates, only the actual search words in the search database are selected, a predetermined operation is performed on the recognition likelihood of each influential constituent noun candidate, and the search word recognition likelihood is calculated. Then, a procedure for creating a real search term candidate list arranged in the order of the calculated search term recognition likelihood in the descending order, and a search term candidate in the above real search term candidate list are checked by the user. If the condition that the search term can be specified only by the processing is satisfied, the necessary dialogue with the user is executed, and the search term candidate selection dialogue procedure for determining the search term is executed by the computer. It is an example.
[0155]
The program may be recorded on a recording medium such as an FD, a CD, a DVD, a HD, and a semiconductor memory.
[0156]
In addition, in the product delivery service, the input interface part in the telephone number search, the postal code search, and the like, the work of determining the corporate name performed in the second embodiment can be widely applied.
[0157]
FIG. 18 is a block diagram illustrating a compound noun search device 201 according to a third embodiment of the present invention.
[0158]
The apparatus 201 for retrieving a compound noun input by voice includes a voice input unit 202, a voice recognition unit 203, a voice recognition device 203S using voice recognition software, a voice recognition result output unit 204, and a voice recognition result list creation. A unit 204a, a real search word candidate list creation unit 205, a dialogue control unit 206, a voice output unit 207, a voice output device 207S using voice output software, and a system database 208.
[0159]
In the compound noun search device 201 input by voice, the voice of the user 210 input via the voice input unit 202 is sent to the voice recognition unit 203, and the voice recognition unit 203 performs a voice recognition process on the input voice. First, the system database 208 is used. In addition, the voice recognition unit 203 performs a recognition process on the voice input by the user 210 using voice recognition software in the voice recognition device 203S. Further, the voice recognition unit 203 determines that the user P It is a recognition unit that can divide a compound noun input by voice into single nouns.
[0160]
The system database 208 includes a search database 281, an order search auxiliary database 282, a reverse search auxiliary database 283, and a YES / NO database 284.
[0161]
The search database 281 is a database in which a compound noun composed of a plurality of simple nouns is registered as a search word, and the search word is registered by being divided into each simple noun.
[0162]
In the order search auxiliary database 282, a single noun set located at the nth word, which is a set of n-th (n is an integer) single noun of each compound noun registered in the search database 281 is registered. A plurality of n-th single nouns by dividing a single noun belonging to a set of simple nouns located at the n-th word in the order of frequently used single nouns and by a predetermined number (for example, 500). Subsets are configured.
[0163]
The reverse order search auxiliary database 283 includes a set of simple nouns located at the backward mth word, which is a set of simple nouns located at the backward mth word (m is an integer) of each compound noun registered in the search database 281, In the registered database, simple nouns belonging to the simple noun set located in the reverse m-th word are divided into single nouns in a frequently used simple noun and at a predetermined number of times. A simple noun subset is constructed.
[0164]
The YES / NO database 284 is a database for recognizing the contents to which the user 210 has responded (for example, YES / NO, YES / NO).
[0165]
The voice recognition software used in the voice recognition device 203S selects a search database 281, an order search auxiliary database 282, or a reverse order search auxiliary database 283 from the system database 208 according to the processing scene of the search device 201. is there.
[0166]
When the search word is input by speech, the order search auxiliary database 282 or the reverse order search auxiliary database 283 is referred to, and when a response to the user 210 is confirmed, the YES / NO database 284 is referred. .
[0167]
The voice recognition unit 203 uses voice recognition software 203S when performing voice recognition processing, and the voice output unit 207 uses voice output software 7S when outputting voice.
[0168]
In the following description, a case will be described in which an input interface part of a service for determining the name of a corporation, such as a company name, is provided with a search device 201 for a compound noun input by voice.
[0169]
FIG. 19 is a diagram showing a specific example of a compound noun search device 201 in which the corporate name in the search database 281 is composed of compound nouns, and the compound nouns are registered for each single noun constituting the compound noun. It is.
[0170]
The frequency of each noun composing the corporate name registered in the search database 281 is checked, single nouns are recorded in order of frequency, and an order search auxiliary database 282 is created. That is, the order search auxiliary database 282 is a database that counts the frequency of each single noun constituting the corporate name registered in the search database 281 and registers the single noun in order of frequency.
[0171]
In addition, the total number of simple nouns constituting the 22 million corporate names is 6.5 million.
[0172]
When the user 210 voice-inputs a series of search words without separating them into single nouns, the voice recognition unit 203 separates the voice-input compound nouns into single nouns, and outputs one word in the voice-input compound nouns. Recognition processing is performed in the order of a single noun located at the eye (first single noun from the beginning), a single noun located at the second word (second single noun from the beginning),. In the input compound noun, a simple noun located in the first word in reverse order (first last noun), a simple noun located in the second word in reverse order (second last noun), ... Is a part for executing the recognition processing.
[0173]
The speech recognition result output unit 204 is a unit that sends the recognition result to the speech recognition result list creation unit 204a.
[0174]
The speech recognition result list creating unit 204a calculates, for each single noun, the recognition result when the order search auxiliary database 282 is used and the recognition result when the reverse search auxiliary database 283 is used, for each single noun. This is a part that performs merging using the recognition likelihood, creates a recognition result list, and sends it to the actual search word candidate list creation unit 205.
[0175]
At this point, if there is a single noun that has not yet been recognized in the order search auxiliary database 282 and the reverse order search auxiliary database 283, the speech recognition unit 203 outputs The speech recognition result output unit 204 outputs the recognition result again to the speech recognition result list creation unit 204a, creates a recognition result list, and generates a recognition result list. The process of sending to the search word candidate list creation unit 205 is repeated.
[0176]
Based on the recognition result, the real search word candidate list creation unit 205 selects the influential constituent noun candidates, creates all combinations of the influential constituent noun candidates, and searches for the existing This is a part that creates the extracted real search word candidate list and sends it to the dialog control unit 206.
[0177]
The dialog control unit 206 refers to the real search word candidate list, and if the search word candidates in the real search word candidate list satisfy a predetermined condition that can specify the search word only by the confirmation processing with the user 210, the The output instruction of the processing guidance is sent to the audio output unit 207.
[0178]
If the search term candidates in the real search term candidate list do not satisfy the condition for specifying the search term only by the confirmation processing with the user 210, a recognition processing is performed on a set of single nouns having the next highest use frequency, and this recognition is performed. The execution of the search term candidate selection dialogue is repeated using the updated existing search term candidate list using the influential constituent noun candidates obtained from the processing result.
[0179]
When receiving the confirmation processing guidance output instruction, the voice output unit 207 outputs guidance for performing the correctness confirmation of the candidate designated as the search word candidate to the user 210.
[0180]
Then, when the response to the confirmation processing guidance is input again to the voice input unit 202, the voice recognition unit 203 refers to the YES / NO database 284, recognizes the response of the user 210, and outputs the voice recognition result output unit 11 Outputs a recognition result, and when a response indicating affirmativeness is obtained from the user 210, the dialogue control unit 206 sends a command for providing guidance to the user 210 to the effect that the search term identification has been completed to the voice output unit 207.
[0181]
Until the search word can be specified, the update of the existing search word candidate list and the search word candidate selection dialog are repeated, and the recognition for the set of all single nouns stored in the order search auxiliary database 282 and the reverse order inspection auxiliary database 283 is repeated. When the processing is completed and it becomes impossible to update the existing search word candidate list any more, the interaction control unit 206 presents the search word candidates to the user 210 from the top of the existing search word candidate list. A guidance output command to be transmitted to the audio output unit 207.
[0182]
During the update of the existing search word candidate list and the search word candidate selection dialogue, the search device 201 completes the recognition process for at least one set of simple nouns other than the priority recognition target. As described above, the number of simple nouns constituting the above set is defined.
[0183]
Next, a specific description will be given of a process of determining a search word in the compound noun search device 201.
[0184]
FIG. 20 is a diagram showing a specific example of the order search auxiliary database 282 in the compound noun search device 201.
[0185]
The plural nouns stored in the order retrieval auxiliary database 282 are divided into a single noun set Gbn located at the nth word, that is, a single noun set Gb1 located at the first word and a single noun set Gb1 located at the second word. .. Are classified into a single noun set Gb2, a single noun set Gb3 located at the third word, and so on.
[0186]
Further, by dividing a single noun belonging to a single noun set located at the n-th word into 500 single nouns in order of the most frequently used single noun, a plurality of n-th single noun sub-sets are formed. The single noun set Gb1 located at the word is composed of a single noun subset Gb1-1, a single noun subset Gb1-2, and a single noun subset Gb1-3 for every 500 single nouns in order of frequency of use. Are divided into..., And the noun set Gb2 located at the second word is composed of a single noun subset Gb2-1, a single noun subset Gb2-2, and a single noun .. Are divided into sub-sets Gb2-3.
[0187]
In the compound noun search device 201, 500 is set as the number c in the case of the division, but the number c in the case of the division is defined by the performance of the speech recognition device 203S used.
[0188]
FIG. 21 is a diagram showing a specific example of the reverse search auxiliary database 283 in the compound noun search device 201.
[0189]
A plurality of simple nouns stored in the backward search auxiliary database 283 are divided into a simple noun set Gbm located at the m-th backward word. In other words, a reverse order noun set H1 which is a set of simple nouns (first last noun) located in the first word in the reverse order in the voice-input compound noun, and a reverse second word in the voice-input compound noun , A reverse noun set H2, which is a set of simple nouns (the second last single noun), and a simple noun (the third last single noun) located at the third reverse word in the compound noun input by speech .. Are classified into a reverse order noun set H3, which is a set of nouns.
[0190]
In addition, by dividing a noun belonging to a set of simple nouns located in the m-th backward word into 500 single nouns in descending order of frequency of use, a reverse noun sub-set located in a plurality of the m-th backward words is divided. In other words, the reverse union noun set H1 located at the first word in the reverse order includes a reverse single noun sub-set H1-1 and a reverse single noun sub-set H1-2 for every 500 single nouns in order of frequency of use. , And the reverse unary noun sub-set H2, which is divided into the reverse single noun sub-set H1-3 and the second word in the reverse order, is placed in reverse order single noun sub-set H2- 1 and a reverse order noun sub-set H2-2, a reverse order noun sub-set H2-3, and so on.
[0191]
In the compound noun search device 201, 500 is set as the number c in the case of the division, but the number c in the case of the division is defined by the performance of the speech recognition device 203S used.
[0192]
FIG. 22 is a flowchart illustrating a process of determining a search word in the compound noun search device 201.
[0193]
First, a request is made to the user P to input a voice for a corporate name (compound noun) to be searched (S200). In this case, it is assumed that the user P has input a compound noun in a series without separating each single noun constituting the compound noun. The voice recognition unit 3 divides compound nouns input by the user P in a series for each single noun.
[0194]
First, the user 210 inputs the name of the corporation to be searched by dividing the name into simple nouns and inputting the voice (S200). The compound noun search device 201 considers a case where “Yokosuka / Central / Hotel” is input as a search word.
[0195]
Here, “1” is set as a variable n indicating the word number of a single noun constituting a compound noun in the compound noun, and a single noun set Gbn located in the n-th word is formed. “1” is set as the variable p of the rank (rank indicating the frequency of use) in the simple noun subset Gbn-p (S201).
[0196]
Then, recognition processing is performed on each of the simple noun subset Gb1-1, the simple noun subset Gb2-1, and the simple noun subset Gb3-1 for each of "Yokosuka", "Central", and "hotel" (S202). , S203). Subsequently, a recognition result list in the case of the order search is created by associating the recognition likelihood with each single noun (S204). Next, a single noun having a recognition likelihood of 80 or more is selected as a potential constituent noun candidate (S205).
[0197]
The “potential constituent noun candidate” is a constituent noun candidate whose recognition likelihood is equal to or greater than a predetermined threshold. In the compound noun search device 201, the threshold is set to 80. That is, constituent noun candidates whose recognition likelihood is 80 or more are dominant constituent noun candidates. It should be noted that the recognition likelihood used when selecting a probable constituent noun candidate is predetermined when the search database 281 is created.
[0198]
FIG. 23 is a diagram illustrating a specific example of a recognition result list when the compound noun search device 201 performs an order search.
[0199]
The “order search” uses the order search auxiliary database 282 to recognize a single noun located at the nth word in a compound noun input by speech within a range of a single noun set Gbn located at the nth word. , Search operation.
[0200]
In addition, the “reverse order search” described later uses the reverse order search auxiliary database 283 to replace the single noun located in the m-th reverse order word in the compound noun input by speech with the single noun set Gbm located in the m-th reverse order word. This is an operation of recognizing and searching for a range.
[0201]
On the other hand, in parallel with the recognition operation in the order search, a recognition operation in the reverse order search is performed.
[0202]
In other words, “1” is set as a variable m indicating the number of the simple noun that forms the compound noun in the reverse order of the compound noun, and the single noun set Gbm located at the m-th word in the reverse order is formed. “1” is set as the variable q of the rank (rank indicating the frequency of use) in the single noun subset Gbm-q (S211).
[0203]
Then, for each of “Yokosuka”, “Central”, and “Hotel”, recognition processing is performed on the reverse order noun sub-set H1-1, the reverse order noun sub-set H2-1, and the reverse order noun sub-set H3-1. (S212, S213). Subsequently, a recognition result list is created in the case of performing a reverse search by associating the recognition likelihood with each single noun (S214). Then, a single noun having a recognition likelihood of 80 or more is selected as a probable constituent noun candidate (S215).
[0204]
In the compound noun search device 201, as described above, the recognition operation in the reverse order search is performed in parallel with the recognition operation in the order search. In this case, the order search (S201 to S205) is executed by one CPU. The reverse search (S211 to S215) may be executed by another CPU, or one CPU may execute the order search (S201 to S205) and the reverse search (S211 to S215). Is also good.
[0205]
FIG. 24 is a diagram showing a specific example of a recognition result list in the case of performing a reverse search in the compound noun search device 201.
[0206]
As shown in FIG. 20, the use frequency rank in the order search auxiliary database 282 is such that the single noun “Yokosuka” located at the first word in the compound noun input by voice is at the 420th position as shown in FIG. The noun “Central” ranks 1250, and the single noun “Hotel” located in the third word ranks 892. Since “Central” and “Hotel” are ranked 500th or lower, they are not included in the single noun subsets Gb2-1 and Gb3-1 that are priority recognition targets, and are not the targets to be recognized first.
[0207]
However, the use frequency rank in the reverse search auxiliary database 283 is such that the simple noun “Central” located in the second word of the reverse order in the compound noun input by voice is in the ninth place as shown in FIG. Since the simple noun “hotel” is ranked first, it is included in the reverse order simple noun sub-sets H2-1 and H1-1, is a priority recognition target, and is recognized earlier than in the case of performing an order search.
[0208]
FIG. 25 is a diagram illustrating a specific example of a recognition result list in a case where the recognition result in the case of order search and the recognition result in the case of reverse search are merged in the compound noun search device 201.
[0209]
When the recognition result in the case of the order search shown in FIG. 23 and the recognition result in the case of the reverse search shown in FIG. 24 are merged (S221), an integrated recognition result list is completed as shown in FIG. Select a constituent noun candidate.
[0210]
In the compound noun search device 201, a single noun having a recognition likelihood of 80 or more is determined as a probable constituent noun candidate. As a result, as shown in FIG. Two candidates, "Yokohama" and "Yokosuka", are selected as potential constituent noun candidates, and "Central" is selected as a potential constituent noun candidate for the spoken single noun, "Central". With respect to “hotel”, two candidates “hotel” and “hall” are selected as potential constituent noun candidates.
[0211]
FIG. 26 is a diagram showing a state immediately before the real search word candidate list is created in the compound noun search device 201.
[0212]
The search device 201 performs a real search word list creation process using the influential constituent noun candidates in the merged recognition result shown in FIG. 25 (S222).
[0213]
The search device 201 lists real search word candidates based on all combinations of influential constituent noun candidates and compound nouns stored in the search database 281 as shown in FIG.
[0214]
In the compound noun search device 201, “Yokosuka / Central / Hotel” (search word recognition likelihood 90 + 95 + 95 = 280) is listed as a real search word candidate. In the compound noun search device 201, the search word recognition likelihood is calculated by a method of adding the recognition likelihood of each single noun as shown in FIG.
[0215]
In the compound noun search device 201, the criterion for determining whether or not the search word can be determined only by the confirmation processing with the user 210 (S223) is based on the threshold value predetermined by the compound noun search device 201 and the search word recognition likelihood. The search word candidate having a search word recognition likelihood exceeding the above-defined threshold value is determined as “satisfying the condition that the search word can be determined only by the confirmation processing”.
[0216]
The real search word candidate list is a list of search word candidates that are presented to the user 210 and require confirmation among search words that exist in the search database 281.
[0217]
In the compound noun search device 201, the specified threshold value for the search word recognition likelihood is set to 280. In the compound noun search device 201, “Yokosuka / Central / Hotel” which is a search word candidate having a search word recognition likelihood of 280 or more is selected from the existing search word list and presented to the user 210 (S224). When a response indicating affirmativeness is obtained from the user 210 with respect to the right / wrong confirmation (S225), the process is terminated assuming that the search word has been determined.
[0218]
By the way, assuming that the number of simple nouns constituting the compound noun which is the search word is 3, 4, 5,..., 8, that is, assuming that the maximum number of simple nouns is 8, the compound noun Simple nouns such as “hotel”, “bank”, and “center” that are likely to be used as the first simple noun (the last simple noun) in a compound noun are composed of three simple nouns For example, it is counted as a single noun located in the third word, and if a compound noun is composed of four simple nouns, it is counted as a single noun located in the fourth word. Even if 99% of the simple nouns of “bank” and “center” are used as the first simple noun in reverse order (the last single noun), the frequency statistics are dispersed. Therefore, as shown in FIG. 20, "hotel", "bank", and "center" are not frequently used as priority recognition targets, and therefore, a quick search is often not performed.
[0219]
However, in the compound noun search device 201, the frequency statistics are counted for each position (reverse order) of the single noun counted from the end and stored in the reverse order search auxiliary database 282, so as shown in FIG. "Hotel", "bank", "center", etc., are registered as the reverse order noun set H1 located in the reverse first word, which is the set of single nouns that are used most frequently from Become. In addition, “central”, “grand”, and the like are registered as a reverse single noun set H2 located in the second reverse word, which is a set of single nouns used most frequently from the end. Therefore, in the compound noun search device 201, the recognition operation is executed in a short time.
[0220]
In addition, in the compound noun search device 201, at the same time as the real search word list creation processing (S222), the recognition processing may be started for the single noun sub-set Gbn-2 and the reversed single noun sub-set Hm-2. . In other words, when the search term candidates are presented to the user 210 and the confirmation is requested (S224), if the recognition is performed in the range of the single noun having the next highest frequency of use, the recognition operation is performed from the viewpoint of the user 210. In between, there is no need to wait without doing anything, without performing a recognition operation, and it seems that search word candidates are being presented one after another.
[0221]
Assuming that the question from the search device 201 and the response from the user 210 to this question are one turn, the compound noun search device 201 exchanges the user 210 with the user 210 with as few dialogue turns as possible. The search word can be specified, thereby reducing the conversation time and performing the search process more quickly.
[0222]
The basic policy of the search by the compound noun search device 201 is that the speech recognition unit 203 separates a compound word (search word) input in a series for each single noun and considers the position of the separated single noun from the beginning. , The recognition process is preferentially performed in the descending order of the frequency statistic for each single noun, the recognition results for each single noun are combined, and only the actual search words are selected. In other words, the search term is specified without compensating for the gender and giving the user 210 no stress.
[0223]
In addition to the above basic policy, a major feature of the compound noun search device 201 is that, in parallel with the recognition processing performed for each single noun, the first word, the second word, For each third word,..., The frequency statistics are checked, and the reverse search auxiliary database 283 arranged in the order of the frequency statistics is preferentially subjected to the recognition processing when the voice input is completed, and the recognition result for each single noun is set. In addition, by selecting only the actual search words, the processing time of the recognition device and the imperfect accuracy are compensated for, and the search words are specified without giving the user 210 any stress.
[0224]
Further, the recognition result using the order search auxiliary database 282 and the recognition result using the reverse order search auxiliary database 283 are merged, so that the accuracy of the recognition processing can be further improved, and the search time can be further reduced. Can be shortened.
[0225]
FIG. 27 is a flowchart showing the operation of the fourth embodiment of the present invention.
[0226]
The fourth embodiment is an embodiment in which only a reverse order search is executed without executing an order search.
[0227]
In FIG. 27, first, a request is made to the user P to perform a voice input on a corporate name (compound noun) to be searched (S230). In this case, it is assumed that the user P has input a compound noun in a series without separating each single noun constituting the compound noun. The voice recognition unit 3 divides compound nouns input by the user P in a series for each single noun.
[0228]
Set “1” as a variable m indicating the number of the word in the reverse order of the simple noun constituting the compound noun, and configure the reverse order noun set Gbm located at the mth word in the reverse order “1” is set as the order q in the single noun sub-set Gbm-q (S231). Then, for each of “Yokosuka”, “Central”, and “Hotel”, recognition processing is performed on the reverse order noun sub-set H1-1, the reverse order noun sub-set H2-1, and the reverse order noun sub-set H3-1. (S232, S233). Next, a recognition result list is created in the case of performing a reverse search by associating the recognition likelihood with each single noun (S234). Then, influential constituent noun candidates are selected (S235), a real search word candidate list is created (S236), and if a condition that can specify the search word is satisfied only by the confirmation processing (S237), the search word candidate is provided to the user. The search is presented (S238), and if the response from the user is YES (S239), the search is terminated. On the other hand, if there is no candidate that satisfies the condition for specifying the search word only by the confirmation processing (S237), the search is performed. Recognition is performed in the range of the reverse unary noun subset having the next highest frequency.
[0229]
That is, in the fourth embodiment, the simple noun located at the m-th reverse word in the compound noun input by speech is changed to the m-th reverse word (m is an integer) of each compound noun registered in the database. This is a device for retrieving speech-input compound nouns that recognizes in the range of a single noun set located at the mth word in reverse order, which is a set of single nouns located.
[0230]
More specifically, in the fourth embodiment, a compound database composed of a plurality of simple nouns is registered as a search word, and further, the search word is divided and registered in each simple noun. And a set of simple nouns located at the m-th reverse order word, which is a set of simple nouns located at the m-th reverse order word (m is an integer value) of each registered compound noun, is registered in the reverse order search auxiliary database. And the user inputs a series of compound nouns, which are the above search terms, by voice without delimiting each single noun, and the voice recognition unit divides the compound noun into single nouns, and reverses m words of the compound nouns input by voice. The single noun located in the eye is recognized within the range of the single noun set located in the reverse m-th word, and the constituent noun candidate which is the recognized single noun and the recognition likelihood of the constituent noun candidate are calculated. Pairs arranged in recognition likelihood Reverse order recognition result list creating means for creating the reverse order recognition result list, and the reverse order recognition result list created for each of the single nouns input by speech, and the constituent nouns described in the reverse order recognition result list Among the candidates, a constituent noun candidate having a recognition likelihood exceeding a predetermined threshold, a reverse strong constituent noun candidate selecting means for selecting the reverse strong constituent noun candidate, and all combinations of the reverse strong constituent noun candidates, respectively By performing a predetermined operation on the recognition likelihood of the above-mentioned influential constituent noun candidates, a search word recognition likelihood is calculated, and a real search word candidate list arranged in the descending order of the calculated search word recognition likelihood is created. A real search word candidate list creating means, and among the real search word candidates, the real search word whose likelihood of the search word recognition is equal to or more than a predetermined threshold is Perform the necessary interaction between the use's a search apparatus compound nouns which are voice input and a search word candidate selecting interacting means for determining a search term.
[0231]
The above embodiment can be understood as a program embodiment. That is, in the above embodiment, a compound database composed of a plurality of simple nouns is registered as a search word, and further, the search database is registered with the search word divided into each simple noun, A set of simple nouns located at the mth word in the reverse order, which is the set of simple nouns located at the mth word in the reverse order of each compound noun (m is an integer value), is registered using the registered reverse order search auxiliary database, A procedure for searching for the input compound noun, and a search for separating the above-mentioned input search word into single nouns when the user inputs the compound noun, which is the above-mentioned search word, without separating it into single nouns. When the user inputs a series of compound nouns, which are the above search words, by speech, in a word segmentation procedure, a single noun located at the m-th reverse word of the compound noun input at the voice is located at the m-th reverse word. Simple noun set A reverse order recognition result list for generating a reverse order recognition result list in which a set of a constituent noun candidate that is recognized in the range and is a recognized noun and a recognition likelihood of the constituent noun candidate is arranged in the order of the recognition likelihood Creation procedure, the reverse order recognition result list is created for each of the input nouns, and among the constituent noun candidates described in the reverse order recognition result list, the recognition likelihood exceeding a predetermined threshold is determined. A constituent noun candidate to be provided is selected as a reverse-order strong constituent noun candidate selecting procedure as a reverse-order strong constituent noun candidate, and for each of all combinations of the reverse-order strong constituent noun candidates, a predetermined recognition likelihood of the strong constituent noun candidate is determined. A real search for calculating a search term recognition likelihood by performing an operation and creating a real search word candidate list arranged in descending order of the calculated search word recognition likelihood A candidate list creation procedure, and among the real search word candidates, for a real search word in which the search word recognition likelihood is equal to or more than a predetermined threshold, necessary dialogue with the user is executed, and the search word is 7 is an example of a program for causing a computer to execute a search term candidate selection dialogue procedure to be determined.
[0232]
Further, the program may be recorded on a recording medium such as an FD, a CD, a DVD, an HD, and a semiconductor memory.
[0233]
In addition, in the input interface part in the product delivery service, the telephone number search, the postal code search, and the like, the work of determining the corporate name performed in the above embodiment can be widely applied.
[0234]
According to the above-described embodiment, instead of considering only the constituent order from the beginning, the constituent noun that is frequently used at the end, the constituent noun that is frequently used at the second word from the end, By counting the words, even if they are used at the same end, words counted as different words from the beginning depending on the number of constituent words and not recognized as the most frequently used words can be regarded as priority recognition targets .
[0235]
In the above embodiment, the search word recognition likelihood is obtained by adding the recognition likelihood of each single noun, as shown in FIG. 26. The recognition likelihood may be multiplied.
[0236]
If the search word recognition likelihood of the search word candidate in the real search word candidate list is equal to or more than the threshold value (for example, the threshold value is 280), the search word is determined only by the confirmation processing by the user 210. On the other hand, if the search word recognition likelihood is less than the threshold, the search word cannot be determined only by the confirmation processing by the user 210.
[0237]
In preparation for the case where the search term cannot be determined as described above, processing such as recognition of a single noun and calculation of a recognition likelihood within the range of the single noun subset Gb2-p and the reverse single noun subset H2-q is performed. May be executed in advance. That is, the search term candidates are presented to the user 210 on a display or the like, and in parallel with the process of requesting confirmation (S224) and the response of the user 210 (S225), the simplest noun sub-set with the next highest frequency of use, Recognition of a single noun, calculation of a recognition likelihood (S202, S212), re-selection of potential constituent noun candidates (S204, S214), and updating of a real search word candidate list (S222) are executed within the range of the noun subset.
[0238]
In this way, since the user 210 cannot see the processing status inside the search device 201 at all, the user 210 is provided with the search words registered in the search database It looks as if the recognition process is being performed at high speed.
[0239]
FIG. 28 is a block diagram showing a voice interactive type compound noun search device 301 according to a fifth embodiment of the present invention.
[0240]
The speech interactive compound noun search device 301 includes a speech input unit 302, a speech recognition unit 303, speech recognition software 303S, a speech recognition result output unit 304, a real search word candidate list creation unit 305, and a dialogue. The control unit 306 includes an audio output unit 307 and audio output software 307S.
[0241]
In the compound noun search device 301 that is input by voice, the voice of the user P input via the voice input unit 302 is sent to the voice recognition unit 303, and the voice recognition unit 303 performs a voice recognition process on the input voice. First, the system database 308 is used. The voice recognition unit 303 performs recognition processing on the voice input by the user P using the voice recognition software 303S. Further, the voice recognition unit 303 executes the compound noun input by the user P in series. Is a recognition unit that can be divided for each single noun.
[0242]
The system database 308 includes a search database 381, a search auxiliary database 382, and a YES / NO database 383.
[0243]
The search database 381 is a database in which compound nouns composed of a plurality of simple nouns are registered as search words, and the search words are registered by being divided into single nouns.
[0244]
The search assistance database 382 calls a group of simple nouns described in the nth (n is an integer) of each registered compound noun a single noun set group in the nth notation, and a single noun set group in the nth notation. The noun set group is registered in order of the frequency, and the nouns registered in the order of the frequency are grouped by a predetermined number (500 in the search device 301) in the order of the frequency. This is a database in which a plurality of sub-set groups are formed.
[0245]
The YES / NO database 383 is a database that recognizes contents (for example, YES / NO, YES / NO) to which the user P has responded.
[0246]
The voice recognition software 303S selects the search database 381 or the search auxiliary database 382 from the system database 308 in accordance with the processing scene of the search device 301.
[0247]
When the search word is input by voice, the search assistance database 382 is referred to, and when a response to the user P is confirmed, the YES / NO database 383 is referred to.
[0248]
The voice recognition unit 303 uses voice recognition software 303S for voice recognition processing, and the voice output unit 307 uses voice output software 307S for voice output.
[0249]
When the user inputs a compound word as a search word for each single noun, the speech recognition result output unit 304 recognizes the n-th notation single noun of the compound noun in the n-th notation single noun group. In addition, among the single noun sets in the n-th notation, the recognition processing is performed within the range of the first sub-set group including the single most frequently used single noun, and the recognition result list is created by associating the recognition likelihood. It is an example of a recognition result list creation unit.
[0250]
The speech recognition result output unit 304 generates a recognition result list in which a set of a constituent noun candidate that is a single noun subjected to the recognition processing and a recognition likelihood of the constituent noun candidate is arranged in the order of the recognition likelihood. A constituent noun candidate having a recognition likelihood exceeding a predetermined first threshold among constituent noun candidates described in the recognition result list created for each of the voice-input single nouns is classified as a leading constituent noun. It is an example of an influential constituent noun candidate selecting means for selecting as a noun candidate.
[0251]
The real search word candidate list creation unit 305 includes a real search word list creation unit 351 and a partial match search word list creation unit 352.
[0252]
The real search word list creation unit 351 is a unit that extracts a real search word from the search words stored in the search database 381 based on the voice recognition result output by the voice recognition result output unit 304.
[0253]
If the likelihood of only some of the single nouns among the plurality of single nouns constituting the search word exceeds the first predetermined threshold, the partial match search word list creation unit 352 sets the first threshold to A partial match search word candidate extracting means for extracting, from the search database, partial match search word candidates having the single noun having a likelihood exceeding the same and having the same number of single nouns as the single noun constituting the search word This is an example.
[0254]
The dialogue control unit 306 includes a search word candidate selection dialogue unit 361 and a partially matching search word candidate selection dialogue unit 362.
[0255]
The partial match search word candidate selection dialogue section 362 calculates a partial match likelihood by calculating the likelihood of each single noun constituting the extracted partial match search word candidate by a predetermined calculation method. It is an example of a means.
[0256]
Note that the above-described predetermined calculation method for calculating the partial match likelihood is a method of adding the likelihood of each simple noun in the search device 301, but multiplication may be performed instead of the addition.
[0257]
The audio output unit 307 outputs audio using the audio output software 307S.
[0258]
The speech recognition unit 303 performs a speech recognition process for a partial match search word candidate for performing a speech recognition process on the partial match search word candidate whose calculated partial match likelihood exceeds a second predetermined threshold value. It is an example of the means.
[0259]
Next, the search device 301 will be described more specifically.
[0260]
In the following description, a case will be described in which an input interface part of a service for determining the name of a corporation, such as a company name, is provided with a search device 301 for a compound noun input by voice.
[0261]
FIG. 29 is a diagram showing a specific example in which, in the search device 301, the corporate name in the search database 381 is composed of a compound noun, and the compound noun is registered for each simple noun constituting the compound noun.
[0262]
The frequency of each noun composing the corporate name registered in the search database 381 is checked, single nouns are recorded in order of frequency, and the search auxiliary database 382 is created.
[0263]
In addition, the total number of simple nouns constituting the 22 million corporate names is 6.5 million. In addition, among the nouns that make up 22 million corporate names, the total number of single nouns in the first notation (single nouns located at the first word in the corporate name) is about 3.6 million types, The total number of written single nouns (single nouns located in the second word in the corporate name) is about 2.5 million types, and the third written single noun (single nouns located in the third word in the corporate name) ) Is about 2.7 million types, and the total number of the fourth noun (the single noun located at the fourth word in the name of the corporation) is about 1 million types,... In the real time of the interactive processing, recognition processing of each single noun with respect to one compound noun is impossible, and it is expected that accuracy is low.
[0264]
FIG. 30 is a diagram showing a specific example of data stored in the search auxiliary database 382 in the search device 301.
[0265]
The simple nouns in the search assistance database 382 are simple nouns that compose a compound noun registered in the search database 381, and a single noun set group is formed for each notation order in the compound noun. Only the single nouns in the compound noun whose order of notation is first are collected to form a single noun set group Gc1 of the first notation. In the single noun set group Gc1 of the first notation, the single nouns are sorted in descending order of the frequency of use. The registered single nouns are grouped into 500 single nouns in descending order of their frequency to form a plurality of sub-set groups. A group Gc1-1, a second sub-set group Gc1-2,..., And an m-th sub-set group Gc1-m are formed.
[0266]
The first sub-set group Gc1-1 including the single noun that is used most frequently is a target to be recognized with priority over the other sub-set groups Gc1-2,..., Gc1-m.
[0267]
Also, by collecting only single nouns in the second order of compound nouns, a single noun set group Gc2 in the second notation is formed. Single nouns are registered, and these registered single nouns are grouped for each 500 single nouns in descending order of frequency, to form a plurality of sub-set groups. A first sub-set group Gc2-1, a second sub-set group Gc2-2, ..., an m-th sub-set group Gc2-m are formed.
[0268]
The first sub-set group Gc2-1 including the single noun with the highest frequency of use is a target to be recognized with priority over the other sub-set groups Gc2-2,..., Gc2-m.
[0269]
Hereinafter, similarly to the above, by collecting only the nouns in the order of the n-th notation in the compound noun, a single noun set group Gcn of the n-th notation is formed, and in the single noun set group Gcn of the n-th notation, Simple nouns are registered in order of use frequency, and these registered simple nouns are grouped for every 500 simple nouns in descending order of the frequency to form a plurality of sub-set groups. , A first sub-set group Gcn1, a second sub-set group Gcn2,..., An n-th sub-set group Gcn-m are formed.
[0270]
The first sub-set group Gcn-1 including the single noun with the highest frequency of use is an object to be recognized in preference to the other sub-set groups Gcn-2,..., Gcn-m.
[0271]
By the way, assuming that the number of nouns forming one subset group is N, in the search device 301, N = 500, and this number is determined as follows. In other words, a single noun that can be processed by the speech recognition software 303S within the real time of the dialog processing (when the search device and the user P communicate with each other, the time does not cause the user P to feel stress or unnaturalness). Assuming that the number is T and the average number of single nouns of all the search words registered in the search database 381 is M, N = T / M.
[0272]
Specifically, it is assumed that the number T of simple nouns that can be processed by the speech recognition software 303S in real time of the interactive processing is 1500, and the average number M of single nouns of all search words registered in the search database 381 is 3 Then, N = T / M = 1500/3 = 500.
[0273]
Note that the number N of simple nouns constituting each of the sub-set groups Gc1-1,..., Gcn-m may be a number other than 500, but can be processed within the real time of the dialog processing as described above. It is determined by the number T of simple nouns (performance of the speech recognition software 303S) and the average M of the number of simple nouns included in the search words registered in the search database 381.
[0274]
Next, the operation of the search device 301 will be described.
[0275]
FIG. 31 is a flowchart specifically showing a processing process for determining a search word in the search device 301.
[0276]
It is assumed that a search database 381 and a search auxiliary database 382 have been created.
[0277]
First, a request is made to the user P to input a voice for a corporate name (compound noun) to be searched (S300). In this case, it is assumed that the user P has input a compound noun in a series without separating each single noun constituting the compound noun. The voice recognition unit 3 divides compound nouns input by the user P in a series for each single noun.
[0278]
Then, the function k of the sub-set group (function indicating the rank of the sub-set group) in each single noun set group relating to the compound noun as the search word is set to 1. That is, if the function k of the subset group is 1, each single noun is recognized using the subset groups Gc1-1, Gc2-1, Gc3-1,... Among the plurality of subset groups. Is done.
[0279]
Here, a case is considered where the compound noun “Yokohama / Tokyu / Hotel” is input as a search word by voice.
[0280]
First, the speech recognition unit 303 divides a compound noun input by voice into a plurality of single nouns, and the first noun set of each of the separated single nouns “Yokohama”, “Tokyu”, and “Hotel” First subset group Gc1-1 in group Gc1, first subset group Gc2-1 in second notation set Gc2, first subset group Gc3 in third notation single noun set group Gc3 The speech recognition unit 303 performs a recognition process using -1 (S301). That is, the voice-input single noun “Yokohama” is recognized and processed within the range of the first sub-set group Gc1-1, and the voice-input single noun “Tokyu” is recognized and processed within the range of the first sub-set group Gc2-1. The recognition processing is performed within the first sub-set group Gc3-1 for the single noun “hotel” that is input as speech. Then, the speech recognition result output unit 304 outputs the recognition result.
[0281]
Of the simple nouns “Yokohama”, “Tokyu”, and “Hotel” into which the speech-input compound nouns are separated, the simple noun “Yokohama” is included in the first sub-set group Gc1-1 as shown in FIG. It is assumed that the simple noun “hotel” is included in the first sub-set group Gc3-1. However, since the frequency order of the single noun “Tokyu” is the 951 order as shown in FIG. 30, it is not included in the first sub-set group Gc2-1 but is included in the second sub-set group Gc2-2. Have been.
[0282]
FIG. 32 is a diagram illustrating a specific example of a recognition result list in which the recognition results for each single noun constituting the search word (compound noun) input by speech and the recognition likelihood correspond to each other in the search device 301. is there.
[0283]
When one single noun is input by speech, the speech recognition result output unit 304 outputs a plurality of constituent noun candidates for the single input noun and the recognition likelihood for each of the plurality of constituent noun candidates. A recognition result list is created by associating the outputted constituent noun candidates with the recognition likelihood (S301). FIG. 32 shows a specific example of the recognition result list created in this way.
[0284]
For example, when the single noun "Yokohama" is input by voice, the voice recognition unit 303 outputs the input voice as "Yokohama", "Yokosuka", "Yokomizo", "Yokoyama", as shown in FIG. The recognition likelihood of “Yokohama” is 95, the recognition likelihood of “Yokosuka” is 90, the recognition likelihood of “Yokomizo” is 81, and the recognition likelihood of “Yokoyama” is 75. is there.
[0285]
Note that the recognition likelihood of each single noun is individually determined by the voice recognition software 303S when the single noun is input by voice.
[0286]
Here, a constituent noun candidate whose recognition likelihood is equal to or greater than a predetermined threshold is defined as a probable constituent noun candidate. In the search device 301, the predetermined threshold is set to 80. That is, constituent noun candidates whose recognition likelihood is 80 or more are dominant constituent noun candidates. The fact that the predetermined threshold value is 80 is preset when the databases 81 and 82 are created.
[0287]
Then, from the respective recognition results shown in FIG. 32, a single noun having a recognition likelihood of 80 or more is selected as a potential constituent noun candidate (S302).
[0288]
At this time, if there is a first sub-set group for which the recognition processing has not been completed yet in each search auxiliary database 382, the speech recognition unit 303 restarts the recognition processing for the first sub-set group, and The recognition result output unit 304 repeats the process of outputting the recognition result and sending it to the search word candidate list creation unit 305.
[0289]
In the search word candidate list creation unit 305, the real search word candidate list creation unit 3051 selects a major constituent noun candidate based on the recognition result, creates all combinations of the major constituent noun candidates, and refers to the search database 381. Meanwhile, a real search word candidate list from which real search words are extracted is created (S303), and sent to the dialog control unit 306.
[0290]
The real search word candidate list is a list of search word candidates that are presented to the user Pc and are asked for confirmation among the search words actually existing in the search database 381.
[0291]
By the way, as shown in FIG. 29, the frequency ranking of the second notation single noun “Tokyu” is 951 in the second notation single noun set group Gc2 stored in the auxiliary database 82, The first subset group Gc2-1 of the notation simple noun set group Gc2 does not include the simple noun “Tokyu”. Therefore, the recognition result for the single noun “Tokyu” shown in FIG. 32 does not include the single noun “Tokyu”.
[0292]
Therefore, in the step (S303) of creating a real search word candidate list, no real search word candidate list is created.
[0293]
At the same time, the partial match search word list creation unit 352 calculates the partial match likelihood of the partial match search word candidate (S331), extracts the partial match search word candidate from the search database 381, and sends it to the dialog control unit 306. send.
[0294]
The “partial match search word candidate” exceeds the first threshold when the likelihood of only some of the single nouns among the plurality of single nouns constituting the search word exceeds the first predetermined threshold. A search word candidate that includes the single noun having likelihood and includes the same number of single nouns as the single noun that forms the search word.
[0295]
The “partial match likelihood” is a likelihood obtained by calculating the likelihood of each single noun constituting the extracted partial match search word candidate by a predetermined calculation method.
[0296]
The dialog control unit 306 executes a search word candidate selection dialog using the existing search word candidates and the partial match search word candidates.
[0297]
In other words, if the search term candidates in the existing search word candidate list satisfy the specified conditions of the search device capable of specifying the search term only by the confirmation processing with the user, the output instruction of the confirmation processing guidance is output to the audio output unit 307. Send to
[0298]
Conversely, if the search term candidates in the real search term candidate list do not satisfy the conditions for specifying the search term only by the confirmation processing with the user (S321), or if the real search term candidates are not extracted, Using the partial match search word candidate, the voice recognition unit 303 performs voice recognition processing again on the voice first input to the voice input unit 302 (S332), and calculates the recognition likelihood.
[0299]
Then, when the calculated recognition likelihood satisfies the prescribed state of the search device capable of specifying the search word only by the confirmation processing with the user (S321), the output instruction of the confirmation processing guidance is sent to the voice output unit 307. Then, a search word candidate is presented, and confirmation is requested from the user Pc (S322).
[0300]
If the search word candidates that can be specified only by the confirmation processing are not extracted from the existing search word candidates or from the partial match search word processing using the partial match search word candidates (S321), the recognition processing ends at this point. Then, for the constituent noun set group having the next highest use frequency (S311), the influential constituent noun candidates obtained as a result of the recognition processing are used (S312, S313), and the existing search word candidate list and the partial match search word list are combined. It is updated (S314), and the search word candidate selection dialogue and the partial match search word processing are repeated (S315).
[0301]
When receiving the confirmation processing guidance output instruction, the voice output unit 307 outputs guidance to the user to confirm correctness of the candidate specified as the search word candidate.
[0302]
Then, when the response to the confirmation processing guidance is input again from the voice input unit 302, the voice recognition unit 303 refers to the YES / NO database 383, recognizes the user's response, and outputs from the voice recognition result output unit 304. If the recognition result is output and a response indicating affirmation can be obtained from the user (S315), the dialogue control unit 306 issues a command to provide guidance to the user that the search word identification has been completed. Send to 307.
[0303]
Until the search word can be specified, the search word candidate list is updated (S314), the search word candidate selection dialogue, the partial match search word processing (partial match search word candidates are extracted from the search database 381, and the extracted partial match A process of calculating a partial match likelihood, which is the total likelihood of the single nouns constituting the search word candidate).
[0304]
If the recognition processing for all constituent noun set groups in the search auxiliary database 382 has been completed and the real search word candidate list cannot be updated anymore (S315), the dialogue control unit 306 sets a higher rank in the real search word candidate list. Sends a guidance output instruction for presenting a search word candidate to the user to the voice output unit 307.
[0305]
That is, while the update of the existing search word candidate list and the search word candidate selection dialogue are being performed, the search device 301 at least one set of constituent noun sets other than the priority recognition target noun group (set 1 consisting of a specified number) ), The number of constituent nouns is defined so that the recognition processing is completed.
[0306]
Next, the operation of the search device 301 will be described more specifically.
[0307]
A real search word candidate list creation process is performed by using these influential constituent noun candidates. In the first subset Gc2-1 relating to the single noun in the second notation, the influential constituent noun candidate relating to “Tokyu” is selected. Therefore, the search device 301 performs a partial match search process as soon as the recognition process for the first sub-set groups Gc1-1 and Gc3-1 ends.
[0308]
That is, a search in which the first constituent noun candidate “Yokohama” or “Yokosuka” in the first subset group Gc1-1 in the first notation set of single nouns Gc1 as the first single noun is included in the first single noun. The potential constituent noun candidate “hotel” or “hall” in the first sub-set group Gc3-1 of the third noun, which is the third simple noun, which is the third candidate noun, is the third simple noun. A search term candidate having a noun is searched from the search database 381.
[0309]
FIG. 33A is a diagram illustrating an example of a partial match search word candidate and a partial match likelihood in the search device 301.
[0310]
In the search device 301, “Yokohama / Grand / Hotel”, “Yokohama / Tokyu / Hotel”, “Yokohama / Tobu / Hotel”, “Yokosuka / Central / Hotel”, “Yokohama / Central / Hall”, “Yokohama / Grand / Hall” , "Yokohama / Music / Hall", "Yokohama / Center / Hall", and "Yokosuka / Art / Hall" are searched as partial match search word candidates.
[0311]
In the above example, for example, of “Yokohama / Grand / Hotel”, “Yokohama” and “Hotel” are the major constituent noun candidates as shown in FIG. 32, and “Grand” is not the major constituent noun candidate. . In other words, of "Yokohama / Grand / Hotel", only a part of the single noun is a probable constituent noun candidate. Therefore, "Yokohama / Grand / Hotel" is a partial match search word candidate.
[0312]
For example, among “Yokohama / Grand / Hotel”, the recognition likelihood of “Yokohama” is 95 as shown in FIG. 32, and the recognition likelihood of “Hotel” is 88 as shown in FIG. 183, which is a value (95 + 88 = 183) obtained by adding the respective recognition likelihoods of these influential constituent noun candidates, is the partial matching likelihood.
[0313]
In the search device 301, it is assumed that the threshold (second threshold) in the partial match likelihood is 130.
[0314]
Then, with these nine candidates as recognition targets, the speech recognition unit 303 executes again the speech recognition processing for the first input search word.
[0315]
As a result, as shown in FIG. 33 (2), the search word recognition likelihoods of “Yokohama / Tokyu / Hotel” and “Yokohama / Tobu / Hotel” are 95 and 81, respectively, and the threshold (second Therefore, “Yokohama / Tokyu / Hotel” and “Yokohama / Tobu / Hotel” are extracted as search word candidates. By performing the presentation confirmation from the top, the search word specification is completed.
[0316]
In the search device 301, the criteria for determining whether or not the search word can be determined only by the confirmation process with the user with respect to the result of the partial match search word processing include a threshold value predetermined by the compound noun search device and a search word recognition likelihood. If the threshold (third threshold) for the re-recognized result is 80, the search word candidate having the search word recognition likelihood exceeding the third threshold 80 is only the confirmation process. It is determined that the condition that can determine the search word is satisfied.
[0317]
Therefore, in the search device 301, after the re-recognition, the search term candidates satisfying the condition that the search term can be determined only by the confirmation processing are “Yokohama / Tokyu / Hotel” and “Yokohama / Tobu / Hotel ".
[0318]
The search device 301 not only recognizes the remaining constituent nouns while performing the real search word list creation process and the search word candidate selection dialogue, but also performs a partial match search word using the partial match likelihood. Perform processing.
[0319]
As a result of the partial match search word processing, a search word candidate selection dialogue is performed using the calculated partial match likelihood. If there is no search word candidate having a partial match likelihood exceeding a prescribed threshold (third threshold), or if there is no search word candidate to be selected as a result of the partial match search word processing, the second sub-set The process of creating a real search word list using the results recognized for the group and the dialogue for selecting a search word candidate are continued. At the same time, while updating the influential constituent noun list, the partial match search word processing and the search word candidate selection dialogue for the result are repeated.
[0320]
In the search device 301, since the second noun group “Tokyu” is included in the second sub-set group Gc2-2, when the real search word list creation process using the result of recognition for the second sub-set group is performed, the search device 301 is selected. Search word candidates will be present. On the other hand, a search word candidate selection dialogue is performed using the calculated partial match likelihood as a result of the partial match search word processing, and there is a search word candidate having a partial match likelihood exceeding a specified threshold (third threshold). I will do it. In this case, these two results may be merged.
[0321]
When the search device 301 performs the real search word list creation process based on the recognition result for the set of constituent nouns in each configuration order, at the same time, the partial information in the potential constituent noun candidates satisfying the conditions predetermined by the search device Perform a process of creating a partial match search word list using. That is, a partial match likelihood is calculated using a probable constituent noun candidate, a search word having a partial match likelihood greater than or equal to a specified threshold is extracted, and the extracted search word is determined as a recognition target, and is first input. The recognition process of the search word thus performed is executed again to calculate the recognition likelihood.
[0322]
If it is determined that the calculated recognition likelihood satisfies the condition for specifying the search word only by the confirmation processing with the user, a search word candidate selection dialogue is performed. If there is no search word candidate having a partial match likelihood equal to or greater than the specified threshold, or if there is no search word candidate to be selected as a result of the search word candidate selection dialog, the remaining n-th constituent noun set groups As a result of the recognition processing, the updated potential constituent noun candidates are used, and the actual search word list creation processing, the search word candidate selection dialogue, the partial match search word processing, and the search word candidate selection dialogue are repeated.
[0323]
As described above, by using the partial information of the constituent nouns, the search process can be executed accurately and quickly without being influenced by the bias of each constituent noun.
By the way, in practice, all of the simple nouns constituting the search word are rarely high in frequency, and all of the simple nouns constituting the search word are rarely low in frequency, and the frequency is biased. In many cases, among the simple nouns, the simple noun that is frequently used is set in the first subset, and thus is output in the first recognition processing result. Is not output as a recognition result until a recognition process is performed on a subset group including the word, so that a combination of constituent noun candidates having a likelihood greater than or equal to a specified threshold is created, and an existing search word candidate is extracted. Even if extracted, there is no correct answer.
[0324]
Therefore, the search device 301 partially matches a search term including a major constituent noun from a large vocabulary search term that cannot be correctly recognized and processed in real time by setting all of the search terms as recognition targets. By extracting the search word candidates from the search database 381, the influential candidates can be narrowed down. This narrowing down enables search processing without waiting for the user.
[0325]
In other words, according to the search device 301, by using the partial information of the single noun together, it is possible to narrow down the correct search words and further improve the recognition accuracy and the recognition speed.
[0326]
In the flowchart shown in FIG. 31, after the step S303, the route to the step S311 and the route to the step S331 are indicated by broken lines, separately from the route to the step S321. This means that the routine of steps S321 to S323, the routine of steps S311 to S315, and the routes of steps S331 and S332 are processed in parallel.
[0327]
Further, the search device 301 can be grasped as a program invention. That is, when the likelihood of only some of the single nouns among the plurality of single nouns constituting the search word exceeds the first predetermined threshold, the search device 301 includes the likelihood exceeding the first threshold. A partial match search word candidate extraction procedure for extracting, from a search database, partial match search word candidates having the same number of single nouns as the single nouns constituting the search word, A partial match likelihood calculation procedure for calculating the partial match likelihood by calculating the likelihood of each single noun constituting the partial match search word candidate by a predetermined calculation method; and 9 is an example of a program for causing a computer to execute a partially matching search word candidate speech recognition processing procedure of performing a speech recognition process for a search word candidate exceeding a second threshold value.
[0328]
In addition, the search device 301 registers a compound noun composed of a plurality of simple nouns as a search word, and further includes a search database in which the search word is divided and registered in each simple noun, A group of simple nouns described in the nth (n is an integer value) of each compound noun is referred to as a simple noun set group of the nth notation, and the simplest noun set group of the nth notation has a high frequency. Single nouns that are registered in order and that are registered in the order of higher frequency are grouped by a predetermined number in descending order of the frequency, and a search auxiliary database in which a plurality of sub-set groups are formed. A program for searching for compound nouns using a compound noun that is a search term that is the above-mentioned search term that is input when the user inputs speech in a series without dividing the compound noun that is the above-mentioned search term for each single noun. Is separated by single noun When the user inputs the compound noun, which is the search word, in a series of search term separation procedures, the n-th single noun of the compound noun is recognized by the n-th single noun set group, Moreover, among the n-th single noun set groups, the recognition processing is performed within the range of the first sub-set group including the most frequently used single noun, and the recognition result list is created by associating the recognition likelihood. A result list creation procedure and a recognition result list in which a set of a constituent noun candidate that is a single noun subjected to the recognition processing and a recognition likelihood for the constituent noun candidate are arranged in the order of the recognition likelihood are input by the voice. For each of the simple nouns, a constituent noun candidate having a recognition likelihood exceeding a predetermined first threshold is selected as a probable constituent noun candidate from constituent noun candidates described in the recognition result list. put out A leading constituent noun candidate selecting procedure, and, if the likelihood of only some of the single nouns among the plurality of single nouns constituting the search word exceeds the first predetermined threshold, the likelihood of exceeding the first threshold is determined. A partial match search word candidate extraction step of extracting the partial match search word candidate having the same number of simple nouns as the single noun constituting the search word, comprising the single noun having the degree, and extracting the partial match search word candidate from the search database, A partial match likelihood calculation procedure for calculating the partial match likelihood by calculating the likelihood of each single noun composing the extracted partial match search word candidate by a predetermined calculation method, and the calculated partial match likelihood Is an example of a program for causing a computer to execute a partial matching search word candidate voice recognition processing procedure of performing a voice recognition process on the partial matching search word candidate exceeding a second predetermined threshold value.
[0329]
It should be noted that in the input interface portion of the commodity delivery service, the telephone number search, the postal code search, and the like, the work of determining the corporate name performed by the search device 301 can be widely applied.
[0330]
【The invention's effect】
According to the present invention, it is possible to extract a recognition result in a search word order having a high existence probability without waiting for a user.
[Brief description of the drawings]
FIG. 1 is a block diagram illustrating a search device 1 for a compound noun input by voice according to a first embodiment of the present invention.
FIG. 2 is a diagram showing a specific example in which a corporate name in a search database 81 is composed of compound nouns, and the compound nouns are registered for each single noun constituting the compound noun.
FIG. 3 is a diagram showing a specific example of a search auxiliary database 82 in the search device 1.
FIG. 4 is a flowchart specifically showing a processing process for determining a search word in the search device 1.
FIG. 5 is a diagram showing a specific example of a recognition result list in which a recognition result for each single noun constituting a search word (compound noun) input by speech and a recognition likelihood thereof correspond to each other in the search device 1; is there.
FIG. 6 is a diagram showing how a real search word candidate list is created in the search device 1.
FIG. 7 is a diagram showing a specific example of an updated recognition result list in the search device 1.
FIG. 8 is a diagram showing a real search word candidate list obtained as a result of executing a process of creating a real search word candidate list in the search device 1.
9 is a diagram generally showing a search database 81 in the search device 1. FIG.
FIG. 10 is a diagram generally showing a search assistance database 82;
FIG. 11 is a block diagram showing a compound noun search device 101 input by voice according to a second embodiment of the present invention.
FIG. 12 is a diagram showing a specific example in which the corporate name in the search database 181 is composed of compound nouns, and the compound nouns are registered separately for each simple noun constituting the compound noun.
FIG. 13 is a diagram showing a specific example of a search assistance database 182 in the second embodiment.
FIG. 14 is a flowchart specifically showing a processing process for determining a search word in the second embodiment.
FIG. 15 is a diagram illustrating a specific example of a recognition result list in which recognition results for each single noun constituting a search word (compound noun) input by voice and the recognition likelihood correspond to each other in the second embodiment; FIG.
FIG. 16 is a diagram illustrating a specific example of an updated recognition result list in the second embodiment.
FIG. 17 is a diagram showing how a real search word candidate list is created in the second embodiment.
FIG. 18 is a block diagram illustrating a compound noun search device 201 input by voice according to a third embodiment of the present invention.
FIG. 19 is a diagram showing a specific example in which in the compound noun search device 201, the corporate name in the search database 281 is composed of compound nouns, and the compound nouns are registered separately for each single noun constituting the compound noun. It is.
20 is a diagram showing a specific example of an order search auxiliary database 282 in the compound noun search device 201. FIG.
FIG. 21 is a diagram showing a specific example of a reverse search auxiliary database 283 in the compound noun search device 201.
FIG. 22 is a flowchart showing a process of determining a search word in a compound noun search device 201;
FIG. 23 is a diagram showing a specific example of a recognition result list in the case of performing an order search in the compound noun search device 201.
FIG. 24 is a diagram showing a specific example of a recognition result list when a compound noun search device 201 performs a reverse search.
FIG. 25 is a diagram showing a specific example of a recognition result list in a case where the recognition result in the case of order search and the recognition result in the case of reverse search are merged in the compound noun search device 201;
FIG. 26 is a diagram showing a state immediately before creation of a real search word candidate list in the compound noun search device 201;
FIG. 27 is a flowchart showing the operation of the fourth embodiment of the present invention.
FIG. 28 is a block diagram showing a spoken dialogue type compound noun search device 301 according to a fifth embodiment of the present invention.
FIG. 29 is a diagram showing a specific example in which, in the search device 301, the corporate name in the search database 381 is composed of compound nouns, and the compound nouns are registered separately for each single noun constituting the compound noun.
FIG. 30 is a diagram showing a specific example of data stored in a search auxiliary database 382 in the search device 301.
FIG. 31 is a flowchart specifically showing a processing process for determining a search word in the search device 301.
FIG. 32 is a diagram showing a specific example of a recognition result list in which a recognition result for each single noun constituting a search word (compound noun) input by speech and a recognition likelihood thereof correspond to each other in the search device 301; is there.
FIG. 33 is a diagram showing an example of a partial match search word candidate and a partial match likelihood, and an example of a re-recognition result in the search device 301.
[Explanation of symbols]
1: a device for retrieving a compound noun input by voice
2. Voice input unit,
3. Voice recognition unit
4 ... Speech recognition result output unit
5 ... Real search word candidate list creation unit,
6: Dialogue control unit,
7 ... Audio output unit,
8. System database,
81: Search database,
82 ... Search assistance database,
83: YES / NO database,
G1: first single noun set group,
G2: second simple noun set group,
G3: third simple noun set group,
G4: fourth simple noun set group,
101: Search device for compound nouns input by voice
102: voice input unit,
103: voice recognition unit,
104: voice recognition result output unit
105: Real search word candidate list creation unit,
106: Dialogue control unit,
107 ... audio output unit,
108: system database,
181: Search database,
182 ... search assistance database,
183: YES / NO database,
Ga1... A single noun set group in the first notation,
Ga1-1... The first sub-set group in the single noun set group Ga1 of the first notation;
Ga1-2... The second sub-set group in the single noun set group Ga1 of the first notation;
Ga1-3 ... the third sub-set group in the single noun set group Ga1 in the first notation,
Ga2... A set of simple nouns in the second notation,
Ga2-1... The first sub-set group in the simplest noun group Ga2 in the second notation;
Ga2-2... The second sub-set group in the second noun singleton set group Ga2;
Ga2-3... Third subset group in the simplest noun set group Ga2 in the second notation;
201: device for retrieving compound nouns input by voice
202: voice input unit,
203: voice recognition unit,
203S: voice recognition device,
204: voice recognition result output unit
204a: voice recognition result list creation unit,
205: Real search word candidate list creation unit,
206: Dialogue control unit,
207 ... audio output unit,
207S ... audio output device,
208: system database,
281: Search database,
282: order search auxiliary database,
283: reverse order search auxiliary database,
284: YES / NO database,
301: Spoken dialogue type compound noun search device,
302 ... voice input unit,
303 ... Speech recognition unit,
303S: voice recognition software,
304: voice recognition result output unit
305: Real search word candidate list creation unit,
306: Dialogue control unit,
307 ... audio output unit,
307S ... Audio output software,
308 ... System database,
381: Search database,
382: search assistance database,
383: YES / NO database,
Gc1... The first notation set group,
Gc1-1... The first subset group in the single noun set group Gc1 in the first notation;
Gc1-2... The second sub-set group in the single noun set group Gc1 of the first notation;
Gc2: a set of simple nouns in the second notation,
Gc2-1... A first sub-set group in the second noun set Gc2;
Gc2-2... The second sub-set group in the single noun set group Gc2 in the second notation.

Claims

A search database in which a compound noun composed of a plurality of simple nouns is registered as a search word, and wherein the search word is registered by being divided into each simple noun;
Simple nouns constituting all the search words registered in the search database are registered in descending order of their frequency, and furthermore, the nouns registered in descending order of frequency are determined in a predetermined order in descending order of frequency. A search assistance database divided into a number of groups to form a plurality of single noun set groups;
A user inputting a series of voices of the search words without separating them into single nouns; search word separating means for separating the voice-inputted search words into single nouns;
For each of the delimited single nouns, the recognition processing is performed within the range of the first single noun set including the most frequent single noun among the plurality of single noun sets, and the recognition likelihood is made to correspond. A recognition result list creating means for creating a recognition result list;
A recognition result list in which a set of a constituent noun candidate that is a single noun subjected to the recognition processing and a recognition likelihood of the constituent noun candidate is arranged in the order of the recognition likelihood is described for each of the voice-inputted single nouns. A constituent noun candidate selecting means for selecting, as a constituent noun candidate, a constituent noun candidate having a recognition likelihood exceeding a predetermined threshold among constituent noun candidates created and described in the recognition result list;
From all the combinations of the influential constituent noun candidates, only the actual search words in the search database are selected, a predetermined operation is performed on the recognition likelihood of each influential constituent noun candidate, and the search word recognition likelihood is calculated. A real search word candidate list creating means for creating a real search word candidate list arranged in descending order of the calculated search word recognition likelihood;
If the search term candidates in the above list of real search terms satisfy the condition that the search term can be specified only by the confirmation processing by the user, the necessary dialogue with the user is executed, and the search term is executed. Search word candidate selection dialogue means for determining
An apparatus for retrieving a compound noun input by voice, comprising:

In claim 1,
At the time of executing the above-mentioned search word candidate selection dialogue, recognition within the remaining one of the above-mentioned union set groups, selection of influential constituent noun candidates exceeding the above-mentioned threshold for each of the above-mentioned single nouns, An apparatus for retrieving a speech-input compound noun, comprising a parallel processing means for processing updating of a list in parallel.

A search database creation step of creating a search database in which a compound noun composed of a plurality of simple nouns is registered as a search word, and wherein the search word is registered by being divided into each simple noun;
Simple nouns constituting all the search words registered in the search database are registered in descending order of their frequency. A search auxiliary database creating step of creating a search auxiliary database in which a plurality of single noun set groups are formed by being divided by the number of
A user inputting a series of voices of the search word without separating them into single nouns; a search word separating step of separating the voice-inputted search words into single nouns;
For each of the delimited single nouns, the recognition processing is performed within the range of the first single noun set including the most frequent single noun among the plurality of single noun sets, and the recognition likelihood is made to correspond. Creating a recognition result list, and generating a recognition result list;
A recognition result list in which a set of a constituent noun candidate that is a single noun subjected to the recognition processing and a recognition likelihood of the constituent noun candidate is arranged in the order of the recognition likelihood is described for each of the voice-input single nouns. A potential constituent noun candidate selecting step of selecting, as a potential constituent noun candidate, a constituent noun candidate having a recognition likelihood exceeding a predetermined threshold from the constituent noun candidates created and described in the recognition result list;
From all combinations of the influential constituent noun candidates, select only the actual search words in the search database, perform a predetermined operation on the recognition likelihood of each influential constituent noun candidate, and calculate the search word recognition likelihood. A real search word candidate list creating step of creating a real search word candidate list arranged in descending order of the calculated search word recognition likelihood;
If the search term candidates in the above list of real search terms satisfy the condition that the search term can be specified only by the confirmation processing by the user, a necessary dialogue with the user is executed, and the search term is executed. A dialogue stage for selecting a search term candidate for determining
A method for searching for a compound noun input by voice, characterized by having:

A search database creation procedure for creating a search database in which a compound noun composed of a plurality of simple nouns is registered as a search word, and wherein the search word is registered by being divided into each simple noun;
Simple nouns constituting all the search words registered in the search database are registered in descending order of their frequency, and furthermore, the nouns registered in descending order of frequency are determined in a predetermined order in descending order of frequency. A search assistance database creation procedure for creating a search assistance database in which a plurality of single noun set groups are formed by being divided by the number of
A user inputting a series of voices without separating the above-mentioned search words for each single noun, a search word separating procedure for separating the above-mentioned voice-inputted search words for each single noun;
For each of the delimited single nouns, the recognition processing is performed within the range of the first single noun set including the most frequent single noun among the plurality of single noun sets, and the recognition likelihood is made to correspond. A recognition result list creating procedure for creating a recognition result list;
A recognition result list in which a set of a constituent noun candidate that is the recognized single noun and a recognition likelihood of the constituent noun candidate is arranged in the order of the recognition likelihood, A constituent noun candidate selection procedure for selecting, as a constituent noun candidate, a constituent noun candidate having a recognition likelihood exceeding a predetermined threshold among constituent noun candidates created and described in the recognition result list;
From all the combinations of the influential constituent noun candidates, only the actual search words in the search database are selected, a predetermined operation is performed on the recognition likelihood of each influential constituent noun candidate, and the search word recognition likelihood is calculated. A real search word candidate list creating procedure for creating a real search word candidate list arranged in descending order of the calculated search word recognition likelihood;
If the search term candidates in the above list of real search terms satisfy the condition that the search term can be specified only by the confirmation processing by the user, the necessary dialogue with the user is executed, and the search term is executed. A search term candidate selection dialogue procedure for determining
A program that causes a computer to execute.

A search database in which a compound noun composed of a plurality of simple nouns is registered as a search word, and wherein the search word is registered by being divided into each simple noun;
A group of simple nouns described at the nth (n is an integer) of each of the registered compound nouns is called a simple noun set group of the nth notation, and the simple noun set group of the nth notation is Single nouns that are registered in the order of their frequency and that are registered in the order of the frequency are grouped by a predetermined number in descending order of the frequency to form a plurality of subset groups. An auxiliary database;
A user inputting a series of voices of the search words without separating them into single nouns; search word separating means for separating the voice-inputted search words into single nouns;
The n-th single noun of the compound noun spoken by the user is recognized by the n-th notation single noun set group. Recognition result list creation means for performing recognition processing within a first sub-set group including a high single noun and creating a recognition result list in correspondence with the recognition likelihood;
A recognition result list in which a set of a constituent noun candidate that is a single noun subjected to the recognition processing and a recognition likelihood of the constituent noun candidate is arranged in the order of the recognition likelihood is described for each of the voice-inputted single nouns. A constituent noun candidate selecting means for selecting, as a constituent noun candidate, a constituent noun candidate having a recognition likelihood exceeding a predetermined threshold among constituent noun candidates created and described in the recognition result list;
From all the combinations of the influential constituent noun candidates, only the actual search words in the search database are selected, a predetermined operation is performed on the recognition likelihood of each influential constituent noun candidate, and the search word recognition likelihood is calculated. A real search word candidate list creating means for creating a real search word candidate list arranged in descending order of the calculated search word recognition likelihood;
If the search term candidates in the above list of real search terms satisfy the condition that the search term can be specified only by the confirmation processing by the user, the necessary dialogue with the user is executed, and the search term is executed. Search word candidate selection dialogue means for determining
An apparatus for retrieving a compound noun input by voice, comprising:

In claim 5,
At the time of executing the above-mentioned search word candidate selection dialogue, the recognition within the range of the remaining one sub-set group, the selection of influential constituent noun candidates exceeding the above-mentioned threshold for each single noun, and the real search word candidate list An apparatus for retrieving a speech-input compound noun, comprising a parallel processing means for processing an update in parallel.

A search database creation step of creating a search database in which a compound noun composed of a plurality of simple nouns is registered as a search word, and wherein the search word is registered by being divided into each simple noun;
A group of simple nouns described at the nth (n is an integer) of each of the registered compound nouns is called a simple noun set group of the nth notation, and the simple noun set group of the nth notation is Single nouns that are registered in the order of their frequency and that are registered in the order of the frequency are grouped by a predetermined number in descending order of the frequency to form a plurality of subset groups. Creating a search auxiliary database for creating an auxiliary database;
A user inputting a series of voices of the search word without separating them into single nouns; a search word separating step of separating the voice-inputted search words into single nouns;
The n-th single noun of the compound noun spoken by the user is recognized by the n-th notation single noun set group. A recognition result list creating step of creating a recognition result list by performing recognition processing within a first sub-set group including a high single noun and making the recognition likelihood correspond thereto;
A recognition result list in which a set of a constituent noun candidate that is a single noun subjected to the recognition processing and a recognition likelihood of the constituent noun candidate is arranged in the order of the recognition likelihood is described for each of the voice-input single nouns. A potential constituent noun candidate selecting step of selecting, as a potential constituent noun candidate, a constituent noun candidate having a recognition likelihood exceeding a predetermined threshold from the constituent noun candidates created and described in the recognition result list;
From all combinations of the influential constituent noun candidates, select only the actual search words in the search database, perform a predetermined operation on the recognition likelihood of each influential constituent noun candidate, and calculate the search word recognition likelihood. A real search word candidate list creating step of creating a real search word candidate list arranged in descending order of the calculated search word recognition likelihood;
If the search term candidates in the above list of real search terms satisfy the condition that the search term can be specified only by the confirmation processing by the user, a necessary dialogue with the user is executed, and the search term is executed. A dialogue stage for selecting a search term candidate for determining
A method for searching for a compound noun input by voice, characterized by having:

In claim 7,
At the time of executing the above-mentioned search term candidate selection dialogue, recognition within the range of the remaining one sub-set group, selection of influential constituent noun candidates exceeding the above-mentioned threshold for each single noun, A method for retrieving a speech-input compound noun, comprising a parallel processing step of processing updating in parallel.

A search database creation procedure for creating a search database in which a compound noun composed of a plurality of simple nouns is registered as a search word, and wherein the search word is registered by being divided into each simple noun;
A group of simple nouns described at the nth (n is an integer) of each of the registered compound nouns is called a simple noun set group of the nth notation, and the simple noun set group of the nth notation is Single nouns that are registered in the order of their frequency and that are registered in the order of the frequency are grouped by a predetermined number in descending order of the frequency to form a plurality of subset groups. A search auxiliary database creation procedure for creating an auxiliary database;
A user inputting a series of voices without separating the above-mentioned search words for each single noun, a search word separating procedure for separating the above-mentioned voice-inputted search words for each single noun;
When the user voice-inputs the search word in series, the n-th single noun of the compound noun is recognized by the n-th single noun set group, and the n-th single noun set group is recognized. A recognition result list creating procedure for creating a recognition result list by performing recognition processing within a range of a first sub-set group including a single noun having the highest frequency;
A recognition result list in which a set of a constituent noun candidate that is the recognized single noun and a recognition likelihood of the constituent noun candidate is arranged in the order of the recognition likelihood, A constituent noun candidate selection procedure for selecting, as a constituent noun candidate, a constituent noun candidate having a recognition likelihood exceeding a predetermined threshold among constituent noun candidates created and described in the recognition result list;
From all the combinations of the influential constituent noun candidates, only the actual search words in the search database are selected, a predetermined operation is performed on the recognition likelihood of each influential constituent noun candidate, and the search word recognition likelihood is calculated. A real search word candidate list creating procedure for creating a real search word candidate list arranged in descending order of the calculated search word recognition likelihood;
If the search term candidates in the above list of real search terms satisfy the condition that the search term can be specified only by the confirmation processing by the user, the necessary dialogue with the user is executed, and the search term is executed. A search term candidate selection dialogue procedure for determining
A program that causes a computer to execute.

A search database in which a compound noun composed of a plurality of simple nouns is registered as a search word, and wherein the search word is registered by being divided into each simple noun;
A registered reverse noun search auxiliary database, which is a set of reverse nouns located at the reverse m-th word (m is an integer value) of each of the registered compound nouns;
When the user inputs a series of voices of the compound noun, which is the above-mentioned search word, without separating it into single nouns;
When the user inputs a series of compound nouns, which are the above search words, by voice, a single noun located at the mth word in the reverse order of the compound noun input by speech is in a range of a single noun set located at the mth word in the reverse order. And generating a reverse recognition result list in which a set of a constituent noun candidate that is the recognized simple noun and the recognition likelihood of the constituent noun candidate is arranged in the recognition likelihood order. Means;
The reverse order recognition result list is created for each of the voice-input single nouns, and constituent nouns having a recognition likelihood exceeding a predetermined threshold among constituent noun candidates described in the reverse order recognition result list. A reverse-order strong constituent noun candidate selecting means for selecting a candidate as a reverse-order strong constituent noun candidate;
By performing a predetermined operation on the recognition likelihood of the influential constituent noun candidate for each of the combinations of the reverse order influential constituent noun candidates, a search word recognition likelihood is calculated, and the calculated search word recognition likelihood is calculated. Real search word candidate list creating means for creating a real search word candidate list arranged in descending order of
Among the real search word candidates, for a real search word whose search word recognition likelihood is equal to or greater than a predetermined threshold, a necessary dialogue is performed with the user to select a search word candidate for determining the search word. Means of dialogue;
An apparatus for retrieving a compound noun input by voice, comprising:

In claim 10,
A single noun belonging to a set of simple nouns located at the m-th word in the reverse order is divided into the nouns with the highest frequency of use and by a predetermined number, thereby forming a plurality of single noun sub-sets of the mth word in the reverse order,
The reverse order recognition result list creating unit performs a recognition process on a reverse order noun subset including the most frequent single noun among the simplest noun subsets of the mth word in the reverse order, and creates the reverse order recognition result list. An apparatus for retrieving compound nouns input by voice, characterized in that:

In claim 11,
At the time of executing the search word candidate selection dialogue, a recognition process to be performed in a range of a single noun sub-set of the remaining m-order reverse word of the plurality of single nouns of the reverse order m-word; An apparatus for retrieving a speech-input compound noun, comprising parallel processing means for concurrently processing the selection of a reverse-order influential constituent noun candidate having a threshold value or more for each single noun and the updating of a real search word candidate list. .

In claim 10,
A registered noun set located at the n-th word, which is a set of n-th (n is an integer) single nouns of each of the registered compound nouns;
The n-th single noun of the compound noun input by speech is recognized within the range of the single noun set located at the n-th word, and the constituent noun candidate which is the recognized single noun and the constituent noun candidate A recognition result list creating means for creating a recognition result list in which a set of the recognition likelihoods of the set is arranged in the order of the recognition likelihood;
The recognition result list is created for each of the voice-input single nouns, and constituent noun candidates having a recognition likelihood exceeding a predetermined threshold among constituent noun candidates described in the recognition result list. Means for selecting an influential constituent noun candidate as an influential constituent noun candidate;
The real search word candidate list creating means, the reverse order potential constituent noun candidate selected by the reverse order potential constituent noun candidate selection means, and the order potential constituent noun candidate selected by the order potential constituent noun candidate selection means For each combination of the merged influential constituent noun candidates, a predetermined operation is performed on the recognition likelihood of the influential constituent noun candidate to calculate a search word recognition likelihood. An apparatus for retrieving a speech-input compound noun, which is a means for creating a real search term candidate list arranged in descending order of search term recognition likelihood.

In claim 13,
A single noun belonging to a set of simple nouns located at the m-th word in the reverse order is divided into the nouns with the highest frequency of use and by a predetermined number, thereby forming a plurality of single noun sub-sets of the mth word in the reverse order,
The recognition result list creating means performs the recognition process in the range of the reverse mononoun subset including the most frequent mononoun among the monom noun subsets of the reverse mth word, and generates the recognition result list. An apparatus for retrieving compound nouns input by voice, wherein

A compound noun composed of a plurality of simple nouns is registered as a search word, and furthermore, a search database in which the above-described search word is registered by being divided into each simple noun, and a reverse order m of each of the registered compound nouns A set of simple nouns located at the m-th reverse word, which is a set of simple nouns located at the word (m is an integer value), is searched for a compound noun input by speech using the registered reverse search auxiliary database. Search method,
A user inputting a series of voices without separating a compound noun, which is the above-mentioned search word, into single nouns; a search word partitioning step, in which the above-mentioned voice-inputted search word is separated into single nouns;
When the user inputs a series of compound nouns, which are the above search words, by voice, a single noun located at the mth word in the reverse order of the compound noun input by speech is in a range of a single noun set located at the mth word in the reverse order. And generating a reverse recognition result list in which a set of a constituent noun candidate that is the recognized simple noun and the recognition likelihood of the constituent noun candidate is arranged in the recognition likelihood order. Stages;
The reverse order recognition result list is created for each of the voice-input single nouns, and constituent nouns having a recognition likelihood exceeding a predetermined threshold among constituent noun candidates described in the reverse order recognition result list. A step of selecting a candidate for a reverse order constituent noun candidate to select the candidate as a candidate for a reverse order constituent noun;
By performing a predetermined operation on the recognition likelihood of the influential constituent noun candidate for each of the combinations of the reverse order influential constituent noun candidates, a search word recognition likelihood is calculated, and the calculated search word recognition likelihood is calculated. A real search word candidate list creating step of creating a real search word candidate list arranged in descending order of
Among the real search word candidates, for a real search word whose search word recognition likelihood is equal to or greater than a predetermined threshold, a necessary dialogue is performed with the user to select a search word candidate for determining the search word. A dialogue phase;
A method for searching for a compound noun input by voice, characterized by having:

In claim 15,
A single noun belonging to a set of simple nouns located at the m-th word in the reverse order is divided into the nouns with the highest frequency of use and by a predetermined number, thereby forming a plurality of single noun sub-sets of the mth word in the reverse order,
The step of creating the reverse order recognition result list includes the step of performing recognition processing on a reverse order noun subset including the most frequent single noun among the single noun subsets of the mth word in the reverse order, and creating the reverse order recognition result list. A method for searching for a compound noun input by voice, characterized in that:

In claim 16,
At the time of executing the search word candidate selection dialogue, a recognition process to be performed in a range of a single noun sub-set of the remaining m-order reverse word of the plurality of single nouns of the reverse order m-word; A method for retrieving a speech-input compound noun, comprising a parallel processing step of concurrently processing the selection of a reverse-order influential constituent noun candidate equal to or greater than the threshold value for each single noun and updating a list of real search word candidates. .

A compound noun composed of a plurality of simple nouns is registered as a search word, and furthermore, a search database in which the above-described search word is registered by being divided into each simple noun, and a reverse order m of each of the registered compound nouns A set of simple nouns located at the m-th reverse word, which is a set of simple nouns located at the word (m is an integer value), is searched for a compound noun input by speech using the registered reverse search auxiliary database. Procedure;
A user inputting a series of voices of the compound noun, which is the above-mentioned search word, without separating them into single nouns; a search word separation procedure for separating the above-mentioned voice-inputted search words into single nouns;
When the user inputs a series of compound nouns, which are the above search words, by voice, a single noun located at the mth word in the reverse order of the compound noun input by speech is in a range of a single noun set located at the mth word in the reverse order. And generating a reverse recognition result list in which a set of a constituent noun candidate that is the recognized simple noun and the recognition likelihood of the constituent noun candidate is arranged in the recognition likelihood order. Procedure;
The reverse order recognition result list is created for each of the voice-input single nouns, and constituent nouns having a recognition likelihood exceeding a predetermined threshold among constituent noun candidates described in the reverse order recognition result list. A reverse-order strong constituent noun candidate selection procedure for selecting a candidate as a reverse-order strong constituent noun candidate;
By performing a predetermined operation on the recognition likelihood of the influential constituent noun candidate for each of the combinations of the reverse order influential constituent noun candidates, a search word recognition likelihood is calculated, and the calculated search word recognition likelihood is calculated. A real search word candidate list creating procedure for creating a real search word candidate list arranged in descending order of
Among the real search word candidates, for a real search word whose search word recognition likelihood is equal to or greater than a predetermined threshold, a necessary dialogue is performed with the user to select a search word candidate for determining the search word. Dialogue procedures;
A program that causes a computer to execute.

A search database in which a compound noun composed of a plurality of simple nouns is registered as a search word, and wherein the search word is registered by being divided into each simple noun;
A group of simple nouns described at the nth (n is an integer) of each of the registered compound nouns is called a simple noun set group of the nth notation, and the simple noun set group of the nth notation is Single nouns that are registered in the order of their frequency and that are registered in the order of the frequency are grouped by a predetermined number in descending order of the frequency to form a plurality of subset groups. An auxiliary database;
When the user inputs the compound noun, which is the search word, by a series of voices without separating the compound noun into single nouns, a search word separating procedure, which separates the compound noun, which is the voice input search word, into single nouns;
When the user inputs a series of compound nouns that are the search words by speech, the single noun of the n-th notation of the compound noun is recognized by the single noun set group of the n-th notation. A recognition result list creating means for performing a recognition process within a range of a first subset group including the most frequent single noun among the single noun set group, and creating a recognition result list in correspondence with the recognition likelihood;
A recognition result list in which a set of the constituent noun candidate that is the recognized single noun and the recognition likelihood of the constituent noun candidate is arranged in the order of the recognition likelihood, Among the constituent noun candidates created and described in the recognition result list, a constituent noun candidate having a recognition likelihood exceeding a predetermined first threshold is selected as a strong constituent noun candidate. Means;
When the likelihood of only a part of the single nouns among the plurality of simple nouns constituting the search word exceeds the first predetermined threshold, the likelihood having the likelihood exceeding the first threshold is determined. A partial match search word candidate extracting means for extracting, from the search database, a partial match search word candidate having the same number of simple nouns as the single noun constituting the search word;
Partial match likelihood calculating means for calculating the likelihood of each single noun constituting the extracted partial match search word candidate by a predetermined calculation method to calculate the partial match likelihood;
Speech recognition processing means for partial match search word candidates for performing a speech recognition process on the partial match search word candidates for which the calculated partial match likelihood exceeds a predetermined second threshold value;
A device for retrieving spoken dialogue compound nouns, comprising:

In claim 19,
As a result of the real search word candidate selection process performed using the first sub-set group and the recognition process, the calculated likelihood satisfies the condition that the search word can be specified only by the determination process with the user. If it is a degree, a necessary dialogue with the user is executed, a search word candidate selection dialogue for determining the search word, and the partial match search word candidate is extracted from the search database, and the extracted partial match search is performed. Parallel processing means for performing in parallel a partial match search word process for calculating a partial match likelihood by calculating the likelihood of each single noun constituting a word candidate by a predetermined calculation method;
As a result of executing the partial match search word processing, if a search word that can be used in the search word candidate selection dialog cannot be selected, the dialog time with the user is continuously used for the second and subsequent subset groups. Repeating means for performing the recognition processing and repeating the existing search word candidate selection processing, the search word candidate selection dialogue, the partial match search word processing, and the search word candidate selection dialogue for the result;
A device for retrieving spoken dialogue compound nouns, comprising:

A compound noun composed of a plurality of simple nouns is registered as a search word, and furthermore, a search database in which the above-described search word is divided into each simple noun and registered, and an n-th of each of the registered compound nouns A group of simple nouns described in (n is an integer value) is called a simple noun set group of the n-th notation, and the simple noun set group of the n-th notation is registered in descending order of the frequency. Simple nouns registered in order of frequency are grouped by a predetermined number in descending order of frequency, and a compound noun is formed by using a search auxiliary database in which a plurality of subset groups are formed. A search method,
A user inputting a series of voices of the compound noun, which is the above-described search word, without separating the compound noun for each single noun; a search word partitioning step of separating the compound noun, which is the above-mentioned input search word, for each single noun;
When the user voice-inputs a series of compound nouns as the search word, the n-th single noun of the compound noun is recognized by the single noun set group of the n-th notation. A recognition result list creating step of performing recognition processing within a first sub-set group including the most frequent single noun among the simple noun set group, and creating a recognition result list in correspondence with the recognition likelihood;
A recognition result list in which a set of a constituent noun candidate that is a single noun subjected to the recognition processing and a recognition likelihood of the constituent noun candidate is arranged in the order of the recognition likelihood, Among the constituent noun candidates created and described in the recognition result list, a constituent noun candidate having a recognition likelihood exceeding a predetermined first threshold is selected as a strong constituent noun candidate. Stages;
In the case where the likelihood of only some of the single nouns among the plurality of simple nouns constituting the search word exceeds the first predetermined threshold, the likelihood having the likelihood exceeding the first threshold is determined. Extracting a partial match search word candidate from the search database, wherein the partial match search word candidate has the same number of single nouns as the single noun constituting the search word;
Calculating a partial match likelihood by calculating the likelihood of each single noun constituting the extracted partial match search word candidate by a predetermined calculation method;
Performing a speech recognition process for a partial match search word candidate for which the calculated partial match likelihood exceeds a second predetermined threshold;
A method for retrieving a speech-interactive compound noun, comprising:

A compound noun composed of a plurality of simple nouns is registered as a search word, and furthermore, a search database in which the above-described search word is divided into each simple noun and registered, and an n-th of each of the registered compound nouns The group of simple nouns described in (n is an integer value) is called a simple noun set group of the n-th notation, and the simple noun set group of the n-th notation is registered in descending order of the frequency. Simple nouns registered in order of frequency are grouped by a predetermined number in descending order of frequency, and a compound noun is formed by using a search auxiliary database in which a plurality of subset groups are formed. A program to search,
When the user inputs the compound noun, which is the search word, by a series of voices without separating the compound noun into single nouns, a search word separating procedure, which separates the compound noun, which is the voice input search word, into single nouns;
When the user inputs a series of compound nouns that are the search words by speech, the single noun of the n-th notation of the compound noun is recognized by the single noun set group of the n-th notation. A recognition result list creating procedure for performing a recognition process within a range of a first subset group including the most frequent single noun among the single noun set group, and creating a recognition result list in correspondence with the recognition likelihood;
A recognition result list in which a set of the constituent noun candidate that is the recognized single noun and the recognition likelihood of the constituent noun candidate is arranged in the order of the recognition likelihood, Among the constituent noun candidates created and described in the recognition result list, a constituent noun candidate having a recognition likelihood exceeding a predetermined first threshold is selected as a strong constituent noun candidate. Procedure;
When the likelihood of only a part of the single nouns among the plurality of simple nouns constituting the search word exceeds the first predetermined threshold, the likelihood having the likelihood exceeding the first threshold is determined. A partial match search word candidate extraction procedure for extracting, from the search database, partial match search word candidates having the same number of simple nouns as the single nouns constituting the search word;
A partial match likelihood calculation procedure of calculating the partial match likelihood by calculating the likelihood of each single noun constituting the extracted partial match search word candidate by a predetermined calculation method;
A speech recognition processing procedure for a partial match search word candidate for performing a speech recognition process on the partial match search word candidate whose calculated partial match likelihood exceeds a second predetermined threshold value;
A program that causes a computer to execute.