JP2003228394A

JP2003228394A - Noun specifying device using voice input and method thereof

Info

Publication number: JP2003228394A
Application number: JP2002024847A
Authority: JP
Inventors: Kumiko Omori; 久美子大森
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 2002-01-31
Filing date: 2002-01-31
Publication date: 2003-08-15
Anticipated expiration: 2022-01-31
Also published as: JP3678360B2

Abstract

<P>PROBLEM TO BE SOLVED: To provide a noun specifying device which can automate large-scale specified object operation that a human operator has handled before such as the construction of a call center using voice recognition and realize higher- precision and higher-efficiency speedy retrieval from a database consisting of many objects not as a substitute for the human operator, and uses voice input. <P>SOLUTION: The noun specifying device which performs voice recognition processing for contents that a user inputs in voice by using a voice recognizing device and narrows down results of the voice recognition processing is characterized in that it uses information based upon differences in notation as additional information for the narrowing-down processing and specifies a noun that the user inputs in voice among many retrieval objects. <P>COPYRIGHT: (C)2003,JPO

Description

Detailed Description of the Invention

【０００１】[0001]

【発明の属する技術分野】本発明は、非常に大多数の似
通った検索対象から、利用者が目的とする情報を、一意
に特定する検索と、特定までの絞り込みとに関するもの
である。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a search for uniquely specifying information desired by a user from a very large number of similar search targets and a narrowing down to the specification.

【０００２】[0002]

【従来の技術】従来、多くの検索対象の中から利用者が
目的とする情報について、付加情報を利用して、一意に
特定する際に、情報が持つ属性を利用し、予め情報を分
類し、目的とする情報が有する属性の属性値を利用者に
尋ね、利用者が答えた属性値に基づいて、目的とする情
報を絞り込み、これによって、多くの候補の中から、目
的とする情報を絞り込み、一意に特定する手法が通常、
採用されている。2. Description of the Related Art Conventionally, when information that a user wants from a large number of search targets is uniquely specified by using additional information, the attributes of the information are used to classify the information in advance. , Ask the user for the attribute value of the attribute that the target information has, narrow down the target information based on the attribute value answered by the user, and by this, select the target information from many candidates. The method of narrowing down and uniquely identifying is usually
Has been adopted.

【０００３】一般に、検索対象である情報が、日本全国
の住所のように、都道府県名、市区郡名、町村字名のよ
うな属性を持ち、属性が階層構造を有している場合や、
社員名簿のように、所属部署、入社年次、性別等の属性
を持つ場合ならば、属性毎の属性値を利用者に尋ね、利
用者が答えた属性値に基づいて、検索候補を絞り込み、
特定することが可能である。Generally, when the information to be searched has attributes such as a prefecture name, a city / ward county name, a town / village name like an address in Japan, and the attributes have a hierarchical structure. ,
If you have attributes such as department, year of employment, gender, etc. like an employee list, ask the user for the attribute value for each attribute, narrow down the search candidates based on the attribute value answered by the user,
It is possible to specify.

【０００４】[0004]

【発明が解決しようとする課題】しかし、検索対象が何
の属性も持たない情報である場合、しかも、候補が多数
存在し、それらが互いに非常に似通った情報である場
合、検索候補を絞り込み特定するために利用可能な付加
情報が、存在しない。However, when the search target is information having no attribute, and when there are many candidates and they are very similar to each other, the search candidates are narrowed down and specified. There is no additional information available to do this.

【０００５】特に、利用者が名詞を音声入力した場合、
音声認識装置の精度が常に１００％であるという保証が
ないので、曖昧さを解消する方法がなく、認識装置が返
す候補を、利用者に順に提示し、利用者から正解である
という回答が得られるまで、利用者への提示を繰り返
す。この提示の繰り返しは、利用者に非常にストレスを
与えると言われている。そのために、現状の音声認識を
利用した検索システムが検索対象とする情報は、非常に
限られたものであり、その対象数も、非常に少ないとい
う問題がある。In particular, when the user inputs a noun by voice,
Since there is no guarantee that the accuracy of the voice recognition device will always be 100%, there is no way to resolve the ambiguity, and the candidates returned by the recognition device are presented to the user in order, and the user obtains the correct answer. Presentation to the user is repeated until it is asked. Repeating this presentation is said to be very stressful to the user. Therefore, there is a problem that the information searched by the current search system using voice recognition is very limited, and the number of the searched objects is also very small.

【０００６】本発明は、音声認識を利用したコールセン
タの構築等、従来人間オペレータが対応していた大規模
な対象の特定業務等の自動化が可能になり、また、人間
オペレータの代替のみではなく、多くの対象数からなる
データベース検索の際に、より高精度で、より高効率
で、迅速な検索を実現することができる音声入力を利用
する名詞特定装置を提供することを目的とするものであ
る。The present invention makes it possible to automate a large-scale target specific task, which was conventionally handled by a human operator, such as the construction of a call center using voice recognition, and is not only a substitute for the human operator. An object of the present invention is to provide a noun identification device that utilizes voice input, which can realize a more accurate, more efficient, and quick search when searching a database with a large number of objects. .

【０００７】[0007]

【課題を解決するための手段】本発明は、非常に数多い
情報の集合について、日本語の表記を、絞り込み用付加
情報として利用するものである。The present invention utilizes Japanese notation as additional information for narrowing down a very large number of information sets.

【０００８】[0008]

【発明の実施の形態および実施例】まず、音声入力され
た日本人個人の姓や、名を検索対象とする場合について
考える。BEST MODE FOR CARRYING OUT THE INVENTION First, let us consider a case in which the first name and last name of a voice-input Japanese individual are to be searched.

【０００９】日本人の個人姓や個人名は、固定電話加入
者数４０００万人を対象にした場合で、読みの異なりを
一意に数えた場合、個人姓は、約１８万種類、存在し、
個人名は、約１６万種類、存在することが知られてい
る。Japanese personal surnames and personal names are intended for 40 million fixed telephone subscribers, and if the readings are uniquely counted, there are about 180,000 different personal surnames.
It is known that there are about 160,000 types of personal names.

【００１０】一般に、音声認識装置を利用して、利用者
が許容できる範囲の精度と処理速度とによって認識し、
特定可能な対象数は、数百種から１０００種程度とされ
ている。こういう状況下で、十数万種の対象は、非常に
大規模であり、音声認識装置のみの利用では、精度よく
認識することが困難であるといわれている。Generally, a voice recognition device is used to recognize with a range of accuracy and processing speed that the user can accept.
The number of identifiable targets is about several hundreds to 1,000. Under such circumstances, it is said that the objects of more than 100,000 kinds are very large in scale, and it is difficult to accurately recognize them by using only the voice recognition device.

【００１１】一方、個人姓名の音声入力は、コールセン
タ業務等、カスタマケアサービスにおける顧客の特定等
において、必須の対象であり、音声入力されたものを自
動的に特定することができるようになれば、非常に需要
は大きい。On the other hand, the voice input of the personal first and last name is an indispensable target in the customer identification in the customer care service such as call center business, and if the voice input can be automatically specified. , There is a great demand.

【００１２】ところが、個人姓や個人名は、階層化され
た属性等を持たない。However, the personal surname and personal name do not have hierarchical attributes.

【００１３】本発明の実施例では、個人姓を構成する文
字列、文字数、等表記に着目をした場合、特に先頭漢字
の読みに着目し、以下のように、先頭漢字の読み情報
を、付加情報として利用することを考える。In the embodiment of the present invention, when paying attention to the notation such as the character string constituting the individual surname, the number of characters, etc., the reading of the leading kanji is particularly noted, and the reading information of the leading kanji is added as follows. Consider using it as information.

【００１４】個人姓１８万種に使用される先頭漢字の種
類は、約３００種である。したがって、先頭漢字がどの
ような漢字であるかを示す情報を、絞り込み用の付加情
報として獲得すれば、音声認識すべき対象（検索対象）
を、従来方法における音声認識対象の３００分の１に縮
小することができる。There are about 300 types of leading kanji used for 180,000 personal surnames. Therefore, if information indicating what kind of kanji is the leading kanji is acquired as additional information for narrowing down, it is the target (search target) that should be voice-recognized.
Can be reduced to 1/300 of the speech recognition target in the conventional method.

【００１５】そこで、利用者の入力に対して、音声認識
処理をし、結果を出力する。この認識処理の結果、非常
に信頼度（尤度）高く出力された候補は、正解である可
能性が高いので、利用者に提示し、確認する。Therefore, voice recognition processing is performed on the user's input, and the result is output. As a result of this recognition processing, a candidate that is output with a very high reliability (likelihood) is likely to be the correct answer, so it is presented to the user and confirmed.

【００１６】信頼度の高い候補が複数出力された場合、
または候補が出力されなかった場合には、音声入力した
名詞における先頭の漢字の読み仮名を、絞込み用の付加
情報として、利用者に尋ねる。When a plurality of highly reliable candidates are output,
Alternatively, if no candidate is output, the user is queried as the additional information for narrowing down the phonetic kana of the first kanji in the noun that is input by voice.

【００１７】音声入力した名詞における先頭の漢字の読
みを獲得できたら、その漢字のよみを有する個人姓を、
検索対象データベースから抽出し、抽出した検索対象に
対してのみ、再度、音声認識処理をし、または、候補が
複数出力された場合、その複数候補の中から、先頭漢字
が、該当する読みを持つ検索候補が存在していれば、利
用者に提示し、確認する。このようにすることによっ
て、音声認識技術のみの利用ではなく、付加情報をも利
用して音声認識結果を絞り込み、これによって、認識結
果の信頼性を向上させることができる。When the reading of the first kanji in the noun that is input by voice is acquired, the individual surname having the reading of that kanji is
Extracted from the search target database, perform voice recognition processing again only for the extracted search target, or when multiple candidates are output, the first Kanji from the multiple candidates has the corresponding reading. If a search candidate exists, it is presented to the user and confirmed. By doing so, the voice recognition result can be narrowed down not only by using only the voice recognition technology but also by using the additional information, thereby improving the reliability of the recognition result.

【００１８】次に、具体的な個人姓「増尾」を特定する
場合を例に挙げて、上記実施例を説明する。Next, the above-described embodiment will be described by taking as an example the case where a specific individual surname "Masao" is specified.

【００１９】図１は、上記実施例の動作を示すフローチ
ャートである。FIG. 1 is a flow chart showing the operation of the above embodiment.

【００２０】まず、検索対象である個人姓を入力するこ
とを、検索装置（図７に示す名詞確定装置１）が利用者
に要求する（Ｓ１）と、「ますお」を音声入力する（Ｓ
２）。First, when the search device (the noun determination device 1 shown in FIG. 7) requests the user to input the personal surname to be searched (S1), "masu" is voice-input (S).
2).

【００２１】図２は、上記実施例における認識対象リス
トＬ１の例を示す図である。FIG. 2 is a diagram showing an example of the recognition target list L1 in the above embodiment.

【００２２】検索装置（名詞確定装置１）は、検索対象
である個人姓リストを、認識対象リストＬ１として予め
保持し、この場合、使用する音声認識装置に合わせて、
つまり、入力の形態にあわせた整形し、保持している。The search device (noun determination device 1) holds in advance the personal surname list to be searched as the recognition target list L1. In this case, according to the voice recognition device to be used,
In other words, it is shaped and stored according to the input form.

【００２３】これと同時に、各個人姓について、その先
頭の漢字の読み仮名と、個人姓の文字数がいくつである
かという文字列情報とを保持する。At the same time, for each personal surname, the reading kana of the leading kanji and the character string information indicating the number of characters of the personal surname are held.

【００２４】検索装置（名詞確定装置１）は、音声入力
された「ますお」という音声を、音声認識処理し（Ｓ
３）、認識対象リストＬ１の中から、信頼度の高い（尤
度の高い）順に、正解候補を、たとえば、下記のよう
に、「１：まつお、………、４：まつだ」を出力する
（Ｓ４）。The retrieval device (noun determination device 1) performs a voice recognition process on the voice input "masu" (S).
3) From the recognition target list L1, correct answer candidates are output in the order of high reliability (highest likelihood), for example, "1: Matsuo, ........., 4: Matsuda" as shown below. Yes (S4).

【００２５】１：まつお（松尾、末尾）、２：まつの（松野、抹野）、３：ますお（増尾、益尾）、４：まつだ（松田、松多）図３は、上記実施例における漢字表記候補リストＬ２の
例を示す図である。1: Matsuo (Matsuo, tail), 2: Matsuno (Matsuno, Mino), 3: Masuo (Masuo, Masuo), 4: Matsuda (Matsuda, Matsuta). It is a figure which shows the example of the Chinese character description candidate list L2 in an example.

【００２６】漢字表記候補リストＬ２は、音声入力され
た検索対象（さとう、すずき、たなか等）と、その漢字
表記情報（佐藤、左藤、佐東、砂糖等）と、その先頭漢
字の読み情報（さ、ひだり、すな等）とが対応して記載
されているリストである。The kanji notation candidate list L2 includes voice-inputted search targets (sato, suzuki, tanaka, etc.), its kanji notation information (Sato, Sato, Sato, sugar, etc.), and its leading kanji reading information ( It is a list that corresponds to each other.

【００２７】それぞれの認識結果に対して、個人姓リス
トから考えられる漢字表記候補リストＬ２を、括弧内に
リストアップしてある。With respect to each recognition result, a kanji notation candidate list L2 that can be considered from the personal surname list is listed in parentheses.

【００２８】利用者の目的とする姓を特定するために、
検索装置（名詞確定装置１）は、「先頭の漢字は何と読
みますか？」と利用者に質問する（Ｓ５）。[0028] In order to specify the intended family name of the user,
The search device (noun determination device 1) asks the user "What do you read the first kanji?" (S5).

【００２９】利用者は、「ふえる（増える）」、または
「ぞう（増）」というように、「増」という字の読みを
答える（Ｓ６）。The user answers the reading of the character "increase" such as "increase (increase)" or "elevate (increase)" (S6).

【００３０】図４は、上記実施例である検索装置（名詞
確定装置１）に予め用意されている漢字の読み対象リス
トＬ３を示す図である。FIG. 4 is a diagram showing a kanji reading target list L3 prepared in advance in the search device (noun determination device 1) according to the above embodiment.

【００３１】漢字表記候補リストＬ２は、漢字の読み
と、その読みを持つ漢字候補の情報と、先頭漢字の読み
情報とが対応して記録されているリストである。The kanji writing candidate list L2 is a list in which readings of kanji, information of kanji candidates having the readings, and reading information of the leading kanji are associated with each other.

【００３２】利用者が、「増」という字の読みを答えた
（Ｓ６）後に、検索装置（名詞確定装置１）は、図４に
示す漢字の読み対象リストＬ３を利用し、音声認識処理
を行う（Ｓ７）。After the user answers the reading of the character "masu" (S6), the retrieval device (noun determination device 1) uses the kanji reading target list L3 shown in FIG. Perform (S7).

【００３３】図５は、上記実施例における先頭漢字読み
と個人姓との対応表Ｌ４の一例を示す図である。FIG. 5 is a diagram showing an example of the correspondence table L4 between the leading kanji reading and the personal surname in the above embodiment.

【００３４】先頭漢字読みと個人姓との対応表とＬ４
は、先頭漢字の読みと、上記先頭漢字の読みに対応する
漢字と、上記漢字を使用する個人性とが対応しているリ
ストである。Correspondence table between leading kanji reading and individual surname and L4
Is a list in which the reading of the leading kanji, the kanji corresponding to the reading of the leading kanji, and the personality of using the kanji correspond.

【００３５】先頭漢字読みと個人姓との対応表Ｌ４を利
用して、認識結果の読みを持つ漢字と、その漢字を先頭
に持つ個姓をリストアップする。「ふえる」という認識
に対しても、曖昧性が生じ、１：「ふえる」（増える）、２：「ひえる」（冷える）、３：「はえる」（生える、映える、栄える）という結果が得られたとする。Utilizing the correspondence table L4 between the leading kanji reading and the personal surname, the kanji having the reading of the recognition result and the individual family name having the kanji at the head are listed. There is ambiguity in the recognition of "fluttering", and the result is 1: "fluffing" (increasing), 2: "hiring" (cooling), 3: "flying" (growing, shining, prospering) Suppose

【００３６】図６は、上記実施例において、音声入力し
た「ますお」に基づいて、個人姓「増尾」を特定する場
合するにおける動作を示す説明図である。FIG. 6 is an explanatory diagram showing the operation in the case of identifying the individual surname "Masuo" based on the voice input "Masuo" in the above embodiment.

【００３７】ここで、音声入力された「ふえる」に対し
て、「ふえる」という読みを持つ漢字候補を、図６
（３）に示すように、全て、漢字の読み対象リストから
検索し（Ｓ８）、個人姓の先頭文字を認識した結果と、
図６（４）に示す、上記先頭漢字読みの認識結果とを合
わせて、両者の関連性をチェックする（Ｓ９）。Here, the kanji candidates that have the reading "fueru" in response to the voiced "fueru" are shown in FIG.
As shown in (3), all the results are obtained by searching the kanji reading target list (S8) and recognizing the first character of the personal surname,
The relationship between the two is checked together with the recognition result of the leading Kanji reading shown in FIG. 6 (4) (S9).

【００３８】すなわち、先頭漢字の読みを認識した結
果、得られた漢字を持つ個人姓候補が、出力結果である
図６（２）に示す「まつお」、「まつの」、「ます
お」、「まつだ」の中に存在しているか否かをチェック
する（Ｓ１０）。この場合「ますお（増尾）」が図６
（５）に示すように、該当候補として挙がってくる。That is, the individual surname candidate having the kanji obtained as a result of recognizing the reading of the leading kanji is the output result, "Matsuo", "Matsuno", and "Masuo". , "Matsuda" is checked (S10). In this case, "masuo (masuo)" is shown in Figure 6.
As shown in (5), they are listed as applicable candidates.

【００３９】つまり、上記実施例は、非常に数多い情報
の集合について、日本語の表記を、絞り込み用付加情報
として利用する実施例である。That is, the above-described embodiment is an embodiment in which the Japanese notation is used as the additional information for narrowing down, regarding a very large number of information sets.

【００４０】なお、上記「日本語の表記」は、平仮名表
記した場合の先頭文字や、先頭から２番目の文字、また
は末尾文字等、先頭から数えて特定番目の文字は何であ
るかという情報、平仮名表記した際の文字数の情報、ま
た漢字表記した際に使用される漢字の特徴、たとえば、
先頭の漢字の読み、２番目の漢字の読み等、先頭から特
定番目の漢字の読み情報、または先頭漢字の部首等を、
絞り込みのための付加情報として利用する。The above-mentioned "Japanese notation" means information such as the first character in the Hiragana notation, the second character from the beginning, the last character, etc., which is the specific character counted from the beginning. Information on the number of characters when written in hiragana, and the characteristics of the kanji used when writing kanji, for example,
Reading the first kanji, reading the second kanji, etc., reading information for the specific kanji from the beginning, or radical of the first kanji, etc.
It is used as additional information for narrowing down.

【００４１】そして、チェックした結果の候補を表示し
（Ｓ１０）、利用者に質問し（Ｓ１１）、利用者からの
回答を得（Ｓ１２）、表示された候補が正しければ、そ
の表示された候補で確定する。Then, the candidates of the checked result are displayed (S10), the user is asked a question (S11), the answer from the user is obtained (S12), and if the displayed candidate is correct, the displayed candidate is displayed. Confirm with.

【００４２】一方、表示された候補が正しくないと、利
用者に判断されれば（Ｓ１２）、音声入力した個人姓の
うちで、２番目等、ｎ番目（ｎは２以上の整数）の漢字
の読み方について、音声入力を利用者に依頼し、音声認
識し、漢字候補を検索し、利用者に質問する（Ｓ１
３）。On the other hand, if the displayed candidate is not correct and the user judges (S12), the second or the like the nth (n is an integer of 2 or more) kanji among the personal surnames input by voice. About how to read, ask the user to input voice, recognize the voice, search for Kanji candidates, and ask the user (S1
3).

【００４３】図７は、本発明の一実施例である名詞確定
装置１を示すブロック図である。FIG. 7 is a block diagram showing a noun determination device 1 which is an embodiment of the present invention.

【００４４】名詞確定装置１は、音声入力部２と、音声
認識部３と、音声認識用ソフトウェア３Ｓと、音声認識
結果出力部４と、対話制御部６と、音声出力部７と、音
声出力用ソフトウェア７Ｓと、システムデータベース８
とを有する。The noun determination device 1 includes a voice input unit 2, a voice recognition unit 3, a voice recognition software 3S, a voice recognition result output unit 4, a dialogue control unit 6, a voice output unit 7, and a voice output. Software 7S and system database 8
Have and.

【００４５】名詞確定装置１において、音声入力部２を
介して入力された利用者Ｐの音声が音声認識部３へ送ら
れ、音声認識部３は、入力音声を音声認識処理する際
に、システムデータベース８を利用する。また、音声認
識部３は、利用者Ｐによる入力音声について、音声認識
用ソフトウェア３Ｓを利用して、認識処理を実行する。In the noun determination device 1, the voice of the user P input through the voice input unit 2 is sent to the voice recognition unit 3, and the voice recognition unit 3 performs a system for voice recognition processing of the input voice. The database 8 is used. Further, the voice recognition unit 3 executes the recognition process on the voice input by the user P by using the voice recognition software 3S.

【００４６】システムデータベース８は、検索データベ
ース８１と、検索補助データベース８２と、ＹＥＳ／Ｎ
Ｏデータベース８３とによって構成されている。The system database 8 includes a search database 81, a search auxiliary database 82, and YES / N.
And an O database 83.

【００４７】検索データベース８１は、複数の個人姓が
検索語として登録されているデータベースである。The search database 81 is a database in which a plurality of individual family names are registered as search words.

【００４８】検索補助データベース８２は、検索データ
ベース８１に登録されている全個人姓（検索対象語）に
関連する認識対象リストＬ１、漢字表記候補リストＬ
２、漢字の読み対象リストＬ３、先頭漢字の読みと個人
姓との対応表Ｌ４が格納されているデータベースであ
る。The search assisting database 82 is a recognition target list L1 related to all individual surnames (search target words) registered in the search database 81 and a kanji writing candidate list L.
2. A database storing a kanji reading target list L3 and a correspondence table L4 for reading the first kanji and individual surnames.

【００４９】ＹＥＳ／ＮＯデータベース８３は、利用者
Ｐが応答した内容（たとえば、はい／いいえ、ＹＥＳ／
ＮＯ）を認識するデータベースである。The YES / NO database 83 contains the contents that the user P responded to (for example, YES / NO, YES / NO).
It is a database that recognizes (NO).

【００５０】音声認識用ソフトウェア３Ｓは、検索装置
（名詞確定装置１）の処理の場面に合わせて、検索デー
タベース８１または検索補助データベース８２を、シス
テムデータベース８から選択するものである。The voice recognition software 3S selects the search database 81 or the search auxiliary database 82 from the system database 8 in accordance with the scene of processing of the search device (noun determination device 1).

【００５１】検索語が音声入力されると、検索補助デー
タベース８２を参照し、また、利用者Ｐへの正誤確認に
対する応答を認識する場合は、ＹＥＳ／ＮＯデータベー
ス８３が参照される。When a search word is input by voice, the search auxiliary database 82 is referred to, and when the response to the user P for the correctness confirmation is recognized, the YES / NO database 83 is referred to.

【００５２】また、音声認識部３は、音声認識処理の際
に、音声認識用ソフトウェア３Ｓを使用し、音声出力部
７は、音声出力の際に、音声出力用ソフトウェア７Ｓを
使用する。Further, the voice recognition section 3 uses the voice recognition software 3S in the voice recognition processing, and the voice output section 7 uses the voice output software 7S in the voice output.

【００５３】つまり、上記実施例は、利用者が音声で入
力した内容を、音声認識装置を利用して音声認識処理
し、音声認識処理した結果を絞り込む名詞特定装置にお
いて、絞り込み用付加情報として、表記の違いによる情
報を利用し、多数の検索対象の中から、利用者が音声入
力した名詞を特定する音声入力を利用する名詞特定装置
の例である。That is, in the above embodiment, in the noun specifying device for performing the voice recognition processing of the content input by the user by the voice using the voice recognition device and narrowing down the result of the voice recognition processing, as the additional information for narrowing down, This is an example of a noun identification device that utilizes voice input to identify a noun that a user has voice input from a large number of search targets, using information based on the difference in notation.

【００５４】また、上記実施例は、利用者が音声で入力
した内容を、音声認識装置を利用して音声認識処理し、
音声認識処理した結果を絞り込む名詞特定装置におい
て、個人姓を音声入力することを利用者に要求する個人
姓入力要求手段と、入力された音声を音声認識する音声
認識手段と、上記入力された個人姓の表記のうちの少な
くとも１つについて、音声入力することを上記利用者に
要求する表記入力要求手段と、上記音声入力された個人
姓と、上記音声入力された上記表記とに基づいて、上記
音声入力された個人姓を特定する個人姓特性手段とを有
する音声入力を利用する名詞特定装置の例である。Further, in the above-described embodiment, the contents input by the user by voice are subjected to the voice recognition processing by using the voice recognition device,
In a noun identification device that narrows down the result of voice recognition processing, an individual surname input requesting unit that requests a user to input a personal surname by voice, a voice recognition unit that recognizes an input voice by voice, and the input individual Based on at least one of the notation of the family name, the notation input requesting means for requesting the user to make a voice input, the voice input personal surname, and the voice input the notation. It is an example of a noun identification device using voice input having a personal surname characteristic means for identifying a personal surname input by voice.

【００５５】この場合、上記表記は、漢字表記した際に
使用される漢字の先頭またはｎ番目（ｎは２以上の整
数）の漢字の読みの情報であり、また、先頭漢字の部首
であり、さらに、平仮名表記した際に使用される平仮名
の先頭またはｎ番目（ｎは２以上の整数）の情報または
末尾の情報であり、そして、平仮名表記した際の文字数
の情報である。In this case, the above-mentioned notation is information on the reading of the first or nth (n is an integer of 2 or more) kanji of the kanji used when the kanji is written, and is the radical of the first kanji. Further, it is the information of the beginning or the nth (n is an integer of 2 or more) or the end of the hiragana used when the hiragana is written, and the number of characters when the hiragana is written.

【００５６】また、利用者が目的とする名詞を絞り込む
過程において、所定の漢字を先頭漢字とする個人姓のう
ちで、所定の数よりも多く使用されている個人姓だけを
集めたグループを作り、そのグループのみを検索対象と
して検索するようにしてもよい。Further, in the process of narrowing down the target nouns by the user, a group is created in which only personal surnames that are used more than a predetermined number among personal surnames having a predetermined kanji as the leading kanji are used. Alternatively, only that group may be searched as the search target.

【００５７】さらに、上記実施例を、プログラムの発明
として把握することができ、つまり、上記実施例は、利
用者が音声で入力した内容を、音声認識装置を利用して
音声認識処理し、音声認識処理した結果を絞り込む場
合、絞り込み用付加情報として、表記の違いによる情報
を利用し、多数の検索対象の中から、利用者が音声入力
した名詞を特定する手順をコンピュータに実行させるプ
ログラムの例である。また、上記実施例は、利用者が音
声で入力した内容を、音声認識装置を利用して音声認識
処理し、音声認識処理した結果を絞り込む場合、個人姓
を音声入力することを利用者に要求する個人姓入力要求
手順と、入力された音声を音声認識する音声認識手順
と、上記入力された個人姓の表記のうちの少なくとも１
つについて、音声入力することを上記利用者に要求する
表記入力要求手順と、上記音声入力された個人姓と、上
記音声入力された上記表記とに基づいて、上記音声入力
された個人姓を特定する個人姓特性手順とをコンピュー
タに実行させるプログラムの例である。Furthermore, the above embodiment can be understood as a program invention, that is, in the above embodiment, the contents input by the user by voice are subjected to voice recognition processing using a voice recognition device, When narrowing down the results of recognition processing, an example of a program that causes the computer to execute the procedure to specify the noun spoken by the user from a large number of search targets, using information based on the difference in notation as additional information for narrowing down Is. Further, in the above-described embodiment, when the contents input by the user by voice are subjected to the voice recognition processing using the voice recognition device and the result of the voice recognition processing is narrowed down, the user is requested to input the personal surname by voice. At least one of a personal surname input request procedure, a voice recognition procedure for recognizing an input voice, and a notation of the input personal surname
The voice input personal name, based on the voice input personal name and the voice input personal name, the voice input input request procedure for requesting the user to input voice. 3 is an example of a program that causes a computer to execute the personal surname characteristic procedure.

【００５８】なお、上記プログラムを、所定の記録媒体
に記録するようにしてもよい。この場合、ＦＤ、ＣＤ、
ＤＶＤ、ＨＤ、半導体メモリが上記所定の記録媒体の例
である。The above program may be recorded in a predetermined recording medium. In this case, FD, CD,
DVD, HD, and semiconductor memory are examples of the predetermined recording medium.

【００５９】[0059]

【発明の効果】本発明によれば、音声認識を利用したコ
ールセンタの構築等、従来人間オペレータが対応してい
た大規模な対象の特定業務等の自動化が可能になるとい
う効果を奏し、また、人間オペレータの代替のみではな
く、多くの対象数からなるデータベース検索の際に、検
索キーとして、表記の特徴を利用するので、より高精度
で、より高効率で、迅速な検索を実現することができる
という効果を奏する。According to the present invention, there is an effect that it is possible to automate a large-scale target specific task which was conventionally handled by a human operator, such as construction of a call center utilizing voice recognition. Not only as a substitute for a human operator, but also when searching a database with a large number of targets, the notation feature is used as a search key, so that more accurate, more efficient, and faster searches can be realized. It has the effect of being able to.

[Brief description of drawings]

【図１】本発明の実施例の動作を示すフローチャートで
ある。FIG. 1 is a flowchart showing the operation of an embodiment of the present invention.

【図２】上記実施例における認識対象リストＬ１の例を
示す図である。FIG. 2 is a diagram showing an example of a recognition target list L1 in the above embodiment.

【図３】上記実施例における漢字表記候補リストＬ２の
例を示す図である。FIG. 3 is a diagram showing an example of a Chinese character notation candidate list L2 in the above embodiment.

【図４】上記実施例において、検索装置（名詞確定装置
１）に予め用意されている漢字の読み対象リストＬ３を
示す図である。FIG. 4 is a diagram showing a kanji reading target list L3 prepared in advance in the search device (noun determination device 1) in the embodiment.

【図５】先頭漢字読みと個人姓との対応表Ｌ４の一例を
示す図である。FIG. 5 is a diagram showing an example of a correspondence table L4 for reading leading kanji and individual surnames.

【図６】上記実施例において、音声入力した「ますお」
に基づいて、個人姓「増尾」を特定する場合するにおけ
る動作を示す説明図である。[Fig. 6] "masu" input by voice in the above embodiment
FIG. 10 is an explanatory diagram showing an operation in the case of identifying the individual surname “Masao” based on the.

【図７】本発明の第１の実施例である名詞確定装置１を
示すブロック図である。FIG. 7 is a block diagram showing a noun determination device 1 according to the first embodiment of the present invention.

[Explanation of symbols]

１…名詞確定装置、２…音声入力部、３…音声認識部、３Ｓ…音声認識用ソフトウェア、４…音声認識結果出力部、６…対話制御部、７…音声出力部、７Ｓ…音声出力用ソフトウェア、８…システムデータベース。 1 ... Noun determination device, 2 ... Voice input section, 3 ... voice recognition unit, 3S ... software for voice recognition, 4 ... voice recognition result output unit, 6 ... Dialogue control unit, 7 ... voice output section, 7S ... software for voice output, 8 ... System database.

Claims

[Claims]

1. A noun identification device that performs voice recognition processing of a voice input content by a user using a voice recognition device and narrows down the result of the voice recognition processing. A noun specifying device using voice input, characterized in that a noun input by a user by voice is specified from a large number of search targets using.

2. A noun identification device for performing voice recognition processing of contents input by a user using a voice recognition device using a voice recognition device and narrowing down the result of the voice recognition process, wherein the user is required to input the personal surname by voice. Requesting personal surname input requesting means; voice recognizing means for recognizing input voice; requesting the user to voice input for at least one of the notations of the personal name input by voice Notation input requesting means ;; personal surname characteristic means for specifying the voice input personal surname based on the voice input personal surname and the voice input personal notation; A noun identification device that uses voice input.

3. The claim according to claim 1 or 2, wherein the notation is information on the reading of the head or nth (n is an integer of 2 or more) of the kanji used when the kanji is written. A noun identification device that utilizes characteristic voice input.

4. The noun identification device according to claim 1 or 2, wherein the notation is a radical of the first Chinese character.

5. The method according to claim 1 or 2, wherein the notation is information at the beginning or nth (n is an integer of 2 or more) or information at the end of the hiragana used in the hiragana notation. A noun identification device that utilizes characteristic voice input.

6. The noun identifying apparatus utilizing voice input according to claim 1 or 2, wherein the notation is information on the number of characters in Hiragana notation.

7. The method according to claim 1 or 2, wherein, in the process of narrowing down the intended noun by the user, more than a predetermined number of personal surnames having a predetermined kanji as the leading kanji are used. A noun identification device using voice input, which is characterized by making a group of only individual surnames and searching only that group as a search target.

8. A noun identifying method for performing voice recognition processing on a content input by a user by using a voice recognition device and narrowing down the result of the voice recognition processing. A noun identification method using voice input, which is characterized by identifying a noun that the user has input by voice from a large number of search targets.

9. A noun identifying method for performing voice recognition processing of a voice input content by a user using a voice recognition device and narrowing down the result of the voice recognition processing, wherein the user is required to voice input an individual surname. Requesting personal last name input step; voice recognition step of recognizing input voice; requesting the user to voice input at least one of the notations of the personal last name input by voice A notation input request step; a personal surname characteristic step of specifying the voice input individual surname based on the voice input personal surname and the voice input the notation. A noun identification method that uses voice input.

10. The claim according to claim 8 or claim 9, wherein the notation is information on reading of the head or nth (n is an integer of 2 or more) of the kanji used when the kanji is written. A method for identifying nouns using a featured voice input.

11. The noun specifying method using voice input according to claim 8 or 9, wherein the notation is a radical of the leading kanji.

12. The method according to claim 8 or 9, wherein the notation is the first or nth (n is an integer of 2 or more) information or the last information of the hiragana used in the hiragana notation. A method for identifying nouns using a featured voice input.

13. The noun identifying method using voice input according to claim 8 or 9, wherein the notation is information on the number of characters in Hiragana notation.

14. The method according to claim 8 or 9, wherein, in the process of narrowing down the intended noun by the user, more than a predetermined number of personal surnames having a predetermined kanji as the leading kanji are used. A noun identification method using voice input, characterized by creating a group that collects only individual surnames and searching only that group as a search target.

15. When the user performs voice recognition processing on the content input by voice using a voice recognition device and narrows down the result of the voice recognition processing, the information depending on the notation is used as the additional information for narrowing down. , A program that causes a computer to execute a procedure for identifying a noun spoken by a user from a large number of search targets.

16. An individual requesting the user to voice-input an individual surname when the voice recognition processing is performed on the content input by the user using a voice recognition device and the result of the voice recognition processing is narrowed down. Last name input request procedure; Voice recognition procedure for recognizing voice input; Voice input request requesting the user to voice input at least one of the voice input personal last name notations A program for causing a computer to execute a procedure; a personal surname characteristic procedure for identifying the voice input personal surname based on the voice input personal surname and the voice input personal notation.

17. When voice recognition processing is performed on a content input by a user using a voice recognition device and the result of the voice recognition processing is narrowed down, information based on a difference in notation is used as additional information for narrowing down. , A computer-readable recording medium in which a program for causing a computer to execute a procedure for identifying a noun spoken by a user from a large number of retrieval targets is recorded.

18. An individual requesting the user to voice-input an individual surname when the voice-recognition device is used to perform voice-recognition processing on the content input by the user and to narrow down the result of the voice-recognition processing. First name input request procedure; Voice recognition procedure for recognizing voice input; Voice input request requesting the user to voice input at least one of the voice input personal last name notations A computer recording a program for causing a computer to execute a procedure; a personal surname characteristic procedure for specifying the voice-input personal surname based on the voice-input personal surname and the voice-input notation. A readable recording medium.