JP2018097123A

JP2018097123A - Information processing unit

Info

Publication number: JP2018097123A
Application number: JP2016240669A
Authority: JP
Inventors: 恭平増井; Kyohei Masui; 亮磨垣見; Ryoma Kakimi
Original assignee: Toyota Motor Corp
Current assignee: Toyota Motor Corp
Priority date: 2016-12-12
Filing date: 2016-12-12
Publication date: 2018-06-21
Anticipated expiration: 2036-12-12
Also published as: JP6724759B2

Abstract

PROBLEM TO BE SOLVED: To provide a technique for allowing a user to quickly select a correct character string candidate from among a plurality of displayed character string candidates.SOLUTION: In an information processing unit 1, a voice recognition section 12 voice-recognizes voice of a user and acquires a plurality of character string candidates corresponding to the voice-recognized voice from a database. A display section 16 displays the plurality of character string candidates acquired by the voice recognition section 12. An extraction section 18 extracts a different part between the character string candidates acquired by the voice recognition section 12. A display control section 20 allows the display section 16 to highlight the different part extracted by the extraction section 18. An acceptance section 22 accepts a selection instruction of the user with respect to the plurality of character string candidates displayed on the display section 16.SELECTED DRAWING: Figure 1

Description

本発明は、音声認識結果に基づいて複数の文字列候補を表示する情報処理装置に関する。 The present invention relates to an information processing apparatus that displays a plurality of character string candidates based on a speech recognition result.

ユーザの音声を音声認識し、音声認識結果に基づいて複数の文字列候補をリストとして表示し、リストの中からユーザが選択した文字列候補を正しい音声認識結果として処理する音声認識装置が知られている（例えば、特許文献１参照）。 A voice recognition device that recognizes a user's voice, displays a plurality of character string candidates as a list based on the voice recognition result, and processes the character string candidate selected by the user from the list as a correct voice recognition result is known. (For example, refer to Patent Document 1).

特表２００５−５３０２５３号公報JP 2005-530253 A

上記技術では、複数の文字列候補が表示されるだけであるため、文字列候補の中からユーザが正しい文字列候補を速やかに選択することは容易ではない。 In the above technique, since only a plurality of character string candidates are displayed, it is not easy for the user to quickly select a correct character string candidate from among the character string candidates.

本発明はこうした状況に鑑みてなされたものであり、その目的は、表示された複数の文字列候補の中から正しい文字列候補をユーザに速やかに選択させることができる技術を提供することにある。 The present invention has been made in view of such circumstances, and an object thereof is to provide a technique that allows a user to quickly select a correct character string candidate from among a plurality of displayed character string candidates. .

上記課題を解決するために、本発明のある態様の情報処理装置は、ユーザの音声を音声認識して、音声認識された前記音声に対応する複数の文字列候補をデータベースから取得する音声認識部と、前記音声認識部で取得された前記複数の文字列候補を表示する表示部と、前記音声認識部で取得された前記文字列候補の間の相違部分を抽出する抽出部と、前記抽出部で抽出された前記相違部分を前記表示部に強調表示させる表示制御部と、前記表示部に表示された前記複数の文字列候補に対するユーザの選択指示を受け付ける受付部と、を備える。 In order to solve the above problems, an information processing apparatus according to an aspect of the present invention recognizes a user's voice and acquires a plurality of character string candidates corresponding to the voice that has been voice-recognized from a database. A display unit that displays the plurality of character string candidates acquired by the voice recognition unit, an extraction unit that extracts a difference between the character string candidates acquired by the voice recognition unit, and the extraction unit A display control unit that highlights the difference portion extracted in step (b) on the display unit, and a reception unit that receives a user selection instruction for the plurality of character string candidates displayed on the display unit.

この態様によると、文字列候補の間の相違部分を表示部に強調表示させるようにしているので、ユーザは相違部分を容易に認識できる。従って、正しい文字列候補をユーザに速やかに選択させることができる。 According to this aspect, since the different portions between the character string candidates are highlighted on the display unit, the user can easily recognize the different portions. Accordingly, the user can promptly select a correct character string candidate.

本発明によれば、表示された複数の文字列候補の中から正しい文字列候補をユーザに速やかに選択させることができる。 According to the present invention, the user can promptly select a correct character string candidate from among a plurality of displayed character string candidates.

一実施形態に係る情報処理装置のブロック図である。It is a block diagram of the information processor concerning one embodiment. （ａ）〜（ｄ）は、一実施形態に係る文字列候補のデータベースの構成例を示す図である。(A)-(d) is a figure which shows the structural example of the database of the character string candidate which concerns on one Embodiment. 図１の表示部に表示された認識結果の候補リストを示す図である。It is a figure which shows the candidate list of the recognition result displayed on the display part of FIG. 図１の情報処理装置における表示処理を示すフローチャートである。3 is a flowchart showing display processing in the information processing apparatus of FIG. 1. 図１の表示部に表示された電話番号認識結果の候補リストを示す図である。It is a figure which shows the candidate list of the telephone number recognition result displayed on the display part of FIG. 変形例に係る一致部分が表示されていない文字列候補の表示例である。It is a display example of the character string candidate in which the matching part which concerns on a modification is not displayed. 変形例に係る相違部分の表示形態が変更された文字列候補の表示例である。It is a display example of the character string candidate by which the display form of the different part which concerns on a modification was changed.

図１は、一実施形態に係る情報処理装置１のブロック図である。以下では、情報処理装置１がカーナビゲーションシステムにおける音声認識に用いられる一例について説明するが、これに限らない。図１に示すように、情報処理装置１は、マイク１０と、音声認識部１２と、記憶部１４と、表示部１６と、抽出部１８と、表示制御部２０と、受付部２２と、を備える。 FIG. 1 is a block diagram of an information processing apparatus 1 according to an embodiment. Hereinafter, an example in which the information processing apparatus 1 is used for voice recognition in a car navigation system will be described, but the present invention is not limited thereto. As illustrated in FIG. 1, the information processing apparatus 1 includes a microphone 10, a voice recognition unit 12, a storage unit 14, a display unit 16, an extraction unit 18, a display control unit 20, and a reception unit 22. Prepare.

マイク１０は、ユーザの音声を音声信号に変換し、この音声信号を音声認識部１２に送信する。ユーザは、例えば、カーナビゲーションシステムに入力したい情報を発話する。このような情報として、例えば、住所、電話番号、連絡先などの目的地情報が挙げられる。 The microphone 10 converts the user's voice into a voice signal and transmits the voice signal to the voice recognition unit 12. For example, the user speaks information desired to be input to the car navigation system. Examples of such information include destination information such as an address, a telephone number, and a contact address.

音声認識部１２は、マイク１０から送信された音声信号に基づいてユーザの音声を音声認識して、音声認識された音声に対応する複数の文字列候補をデータベースから取得する。取得された複数の文字列候補の中のある文字列候補と他の文字列候補との間では、一部分が一致しており、他の部分が相違している。このような複数の文字列候補を取得することにより、マイク１０に車両の走行音などの雑音が混入して音声認識の不確実性が生じた場合であっても、取得された複数の文字列候補は正しい文字列候補を含み得る。音声に対応する複数の文字列候補を選択する方法としては、周知の技術を用いることができる。 The voice recognition unit 12 recognizes the user's voice based on the voice signal transmitted from the microphone 10 and acquires a plurality of character string candidates corresponding to the voice that has been voice-recognized from the database. A certain character string candidate and a different character string candidate among a plurality of acquired character string candidates are partially matched, and other parts are different. By acquiring such a plurality of character string candidates, even if noise such as a running sound of a vehicle is mixed in the microphone 10 and the uncertainty of voice recognition occurs, the acquired plurality of character strings Candidates may include correct character string candidates. A well-known technique can be used as a method of selecting a plurality of character string candidates corresponding to speech.

文字列候補のデータベースは、記憶部１４に予め格納されている。記憶部１４は、不揮発性メモリなどにより構成されている。 A database of character string candidates is stored in the storage unit 14 in advance. The storage unit 14 is configured by a nonvolatile memory or the like.

表示部１６は、カーナビゲーション用の地図などの各種情報を表示する液晶ディスプレイなどであり、音声認識部１２で取得された複数の文字列候補を表示する。 The display unit 16 is a liquid crystal display or the like that displays various information such as a map for car navigation, and displays a plurality of character string candidates acquired by the voice recognition unit 12.

抽出部１８は、音声認識部１２で取得された文字列候補の間の相違部分を抽出する。相違部分を抽出する具体的な方法は後述する。 The extraction unit 18 extracts a difference between character string candidates acquired by the speech recognition unit 12. A specific method for extracting the difference will be described later.

表示制御部２０は、表示部１６の表示を制御する。表示制御部２０は、抽出部１８で抽出された相違部分を表示部１６に強調表示させる。具体的には表示制御部２０は、文字列候補のうち相違部分の表示色を黒などの標準の表示色で表示部１６に表示させ、一致部分の表示色を標準の表示色より目立たないグレーなどの色で表示部１６に表示させ、これにより相違部分を相対的に強調表示させる。 The display control unit 20 controls display on the display unit 16. The display control unit 20 highlights the difference part extracted by the extraction unit 18 on the display unit 16. Specifically, the display control unit 20 causes the display unit 16 to display the display color of the different part of the character string candidates in a standard display color such as black, and the display color of the matching part is a gray that is less noticeable than the standard display color. Are displayed on the display unit 16 in such a manner that different portions are relatively highlighted.

受付部２２は、表示部１６に表示された複数の文字列候補に対するユーザの選択指示を受け付ける。受付部２２は、ユーザによる表示部１６へのタッチを検出するタッチパネルセンサ、スイッチ、または、リモコンなどである。 The accepting unit 22 accepts user selection instructions for a plurality of character string candidates displayed on the display unit 16. The receiving unit 22 is a touch panel sensor, a switch, or a remote controller that detects a touch on the display unit 16 by the user.

音声認識部１２、抽出部１８および表示制御部２０は、ハードウェア資源とソフトウェア資源の協働、またはハードウェア資源のみにより実現できる。ハードウェア資源としてアナログ素子、マイクロコンピュータ、ＤＳＰ、ＲＯＭ、ＲＡＭ、ＦＰＧＡ、その他のＬＳＩを利用できる。ソフトウェア資源としてファームウェア等のプログラムを利用できる。 The voice recognition unit 12, the extraction unit 18, and the display control unit 20 can be realized by cooperation of hardware resources and software resources, or only by hardware resources. As hardware resources, analog elements, microcomputers, DSPs, ROMs, RAMs, FPGAs, and other LSIs can be used. Firmware and other programs can be used as software resources.

図２（ａ）〜（ｄ）は、一実施形態に係る文字列候補のデータベースの構成例を示す図である。図２（ａ）に示すように、１つの文字列候補１００は、Ａパート、Ｂパート、Ｃパート、Ｄパート等の複数の単語単位に分割されて、データベースに格納されている。単語単位の数は、文字列候補１００に応じて異なる。例えば、図２（ｂ）に示すように、住所に関する文字列候補１００ａの場合、Ａパートは「愛知県」であり、Ｂパートは「豊田市」であり、Ｃパートは「トヨタ町」であり、Ｄパートは「１番」である。 FIGS. 2A to 2D are diagrams illustrating a configuration example of a database of character string candidates according to an embodiment. As shown in FIG. 2A, one character string candidate 100 is divided into a plurality of word units such as an A part, a B part, a C part, and a D part and stored in a database. The number of word units varies depending on the character string candidate 100. For example, as shown in FIG. 2B, in the case of the character string candidate 100a regarding the address, the A part is “Aichi Prefecture”, the B part is “Toyota City”, and the C part is “Toyota Town”. The D part is “No. 1”.

また、図２（ｃ）に示すように、電話番号に関する文字列候補１００ｂの場合、例えば、Ａパートは「＋８１」であり、Ｂパートは「０１」であり、Ｃパートは「０１２」であり、Ｄパートは「０１２３」である。 As shown in FIG. 2C, in the case of the character string candidate 100b related to the telephone number, for example, the A part is “+81”, the B part is “01”, and the C part is “012”. , D part is “0123”.

また、図２（ｄ）に示すように、連絡先に関する文字列候補１００ｃの場合、例えば、Ａパートは「豊田」であり、Ｂパートは「太郎」であり、Ｃパートは「職場」である。 Further, as shown in FIG. 2D, in the case of the character string candidate 100c related to the contact address, for example, the A part is “Toyota”, the B part is “Taro”, and the C part is “workplace”. .

図３は、図１の表示部１６に表示された認識結果の候補リストを示す図である。図３に示すように、音声認識の結果、複数の文字列候補が候補リストとして表示される。Ｎ−１（Ｎは２以上の整数）番目の文字列候補１００−（Ｎ−１）は、Ａ_Ｎ−１パート、Ｂ_Ｎ−１パート、Ｃ_Ｎ−１パート、Ｄ_Ｎ−１パート等を含み、Ｎ番目の文字列候補１００−Ｎは、Ａ_Ｎパート、Ｂ_Ｎパート、Ｃ_Ｎパート、Ｄ_Ｎパート等を含む。Ｎ番目の文字列候補１００−Ｎは、Ｎ−１番目の文字列候補１００−（Ｎ−１）の下段に表示される。 FIG. 3 is a diagram showing a recognition result candidate list displayed on the display unit 16 of FIG. As shown in FIG. 3, as a result of speech recognition, a plurality of character string candidates are displayed as a candidate list. The N-1 (N is an integer greater than or equal to 2) th character string candidate 100- (N-1) includes an A _N-1 part, a B _N-1 part, a C _N-1 part, a D _N-1 part, and the like. The Nth character string candidate 100-N includes an _AN part, a _BN part, a _CN part, a _DN part, and the like. The Nth character string candidate 100-N is displayed in the lower part of the (N-1) th character string candidate 100- (N-1).

ここで、表示制御部２０は、複数の文字列候補をソートして表示部１６に表示させる。ソートの方法は特に限定されないが、例えば、表示制御部２０は、Ａパート側から共通するパートの数が多い順番に複数の文字列候補をソートしてもよい。 Here, the display control unit 20 sorts the plurality of character string candidates and causes the display unit 16 to display them. Although the sorting method is not particularly limited, for example, the display control unit 20 may sort a plurality of character string candidates in order of increasing number of common parts from the A part side.

このように構成される文字列候補を用いて、抽出部１８は、相違部分を抽出する対象の文字列候補１００−Ｎと、当該文字列候補１００−Ｎの上段に隣接して表示される文字列候補１００−（Ｎ−１）との間において、左側のパートから順に、対応するパート同士が一致するか相違するか判定する。抽出部１８は、相違するパートを、文字列候補１００−（Ｎ−１）と文字列候補１００−Ｎとの間の相違部分として抽出する。表示制御部２０は、文字列候補１００−Ｎにおいて、抽出された相違部分と、相違部分より右側のパートとを表示部１６に強調表示させる。抽出部１８と表示制御部２０は、この処理を繰り返す。 Using the character string candidates configured as described above, the extraction unit 18 uses a character string candidate 100-N to be extracted as a difference portion and a character displayed adjacent to the upper stage of the character string candidate 100-N. It is determined whether the corresponding parts match or differ from the column candidate 100- (N-1) in order from the left part. The extraction unit 18 extracts different parts as different parts between the character string candidate 100- (N-1) and the character string candidate 100-N. In the character string candidate 100-N, the display control unit 20 highlights the extracted different part and the part on the right side of the different part on the display unit 16. The extraction unit 18 and the display control unit 20 repeat this process.

図４は、図１の情報処理装置１における表示処理を示すフローチャートである。ここでは、説明を明確化するため、１つの文字列候補が４つのＡパート、Ｂパート、ＣパートおよびＤパートから構成されている場合の処理を示す。この処理は、音声認識が行われる毎に行われる。 FIG. 4 is a flowchart showing display processing in the information processing apparatus 1 of FIG. Here, in order to clarify the explanation, a process when one character string candidate is composed of four A parts, B parts, C parts, and D parts is shown. This process is performed every time voice recognition is performed.

まず、表示制御部２０は、Ｎ＝０に設定し（Ｓ１０）、Ｎ＝Ｎ＋１に設定し（Ｓ１２）、Ｎ＝１であるか判定する（Ｓ１４）。Ｎ＝１である場合（Ｓ１４のＹ）、表示制御部２０は、１番目の文字列候補１００−１の全体を強調表示させ（Ｓ１６）、Ｎが文字列候補の総数と等しいか判定する（Ｓ１８）。等しい場合（Ｓ１８のＹ）、処理を終了し、等しくない場合（Ｓ１８のＮ）、Ｓ１２に戻る。 First, the display control unit 20 sets N = 0 (S10), sets N = N + 1 (S12), and determines whether N = 1 (S14). When N = 1 (Y in S14), the display control unit 20 highlights the entire first character string candidate 100-1 (S16), and determines whether N is equal to the total number of character string candidates (S16). S18). If they are equal (Y in S18), the process is terminated. If they are not equal (N in S18), the process returns to S12.

Ｓ１４においてＮ＝１でない場合（Ｓ１４のＮ）、抽出部１８は、Ａ_Ｎ−１パートがＡ_Ｎパートと一致するか判定し（Ｓ２０）、相違する場合（Ｓ２０のＮ）、表示制御部２０は、Ｎ番目の文字列候補１００−ＮのＡ_ＮパートからＤ_Ｎパートを強調表示させ（Ｓ１６）、Ｓ１８に移行する。 When N = 1 is not 1 in S14 (N in S14), the extraction unit 18 determines whether the A _N-1 part matches the A _N part (S20), and when they are different (N in S20), the display control unit 20 from N-th _{a N} part of a character string candidates 100-N to highlight _{D N} Part (S16), the process proceeds to S18.

Ａ_Ｎ−１パートがＡ_Ｎパートと一致する場合（Ｓ２０のＹ）、表示制御部２０は、Ａ_Ｎパートを目立たなくして（Ｓ２２）、抽出部１８は、Ｂ_Ｎ−１パートがＢ_Ｎパートと一致するか判定する（Ｓ２４）。相違する場合（Ｓ２４のＮ）、表示制御部２０は、Ｎ番目の文字列候補１００−ＮのＢ_ＮパートからＤ_Ｎパートを強調表示させ（Ｓ１６）、Ｓ１８に移行する。 When the A _N-1 part matches the A _N part (Y in S20), the display control unit 20 makes the A _N part inconspicuous (S22), and the extraction unit 18 determines that the B _N-1 part is the B _N part. (S24). If different (S24 of N), the display control unit 20, from the N-th character string candidates 100-N _{B N} Part highlight the _{D N} Part (S16), the process proceeds to S18.

Ｂ_Ｎ−１パートがＢ_Ｎパートと一致する場合（Ｓ２４のＹ）、表示制御部２０は、Ｂ_Ｎパートを目立たなくして（Ｓ２６）、抽出部１８は、Ｃ_Ｎ−１パートがＣ_Ｎパートと一致するか判定する（Ｓ２８）。相違する場合（Ｓ２８のＮ）、表示制御部２０は、Ｎ番目の文字列候補１００−ＮのＣ_ＮパートとＤ_Ｎパートを強調表示させ（Ｓ１６）、Ｓ１８に移行する。 If B _N-1 part coincides with _{B N} Part (S24 of Y), the display control unit 20, and obscure the _{B N} Part (S26), the extraction unit _{18, C N-1} parts are _{C N} Part (S28). If different (S28 of N), the display control unit 20, the N-th _{C N} part and _{D N} Part string candidate 100-N to highlight (S16), the process proceeds to S18.

Ｃ_Ｎ−１パートがＣ_Ｎパートと一致する場合（Ｓ２８のＹ）、表示制御部２０は、Ｎ番目の文字列候補１００−ＮのＣ_Ｎパートを目立たなくして（Ｓ３０）、Ｄ_Ｎパートを強調表示させ（Ｓ３２）、Ｓ１８に移行する。 If C _N-1 part coincides with _{C N} Part (S28 of Y), the display control unit 20, and obscuring _{C N} part of the N-th candidate character string 100-N (S30), the _{D N} Part Emphasis is displayed (S32), and the process proceeds to S18.

１つの文字列候補が３つ以下のパートまたは５つ以上のパートから構成される場合も、同様に処理を行うことができる。 The same processing can be performed when one character string candidate is composed of three or less parts or five or more parts.

次に、ユーザが電話番号を発話する一例について、情報処理装置１の全体的な動作を説明する。まず、ユーザは、カーナビゲーションシステムに目的地を設定するために、目的地の電話番号を発話する。これにより、図５に示すように、音声認識された音声に対応する複数の文字列候補１００−１〜１００−６が表示部１６に表示される。 Next, the overall operation of the information processing apparatus 1 will be described for an example in which the user utters a telephone number. First, the user utters the destination telephone number in order to set the destination in the car navigation system. Thereby, as shown in FIG. 5, a plurality of character string candidates 100-1 to 100-6 corresponding to the speech that has been speech-recognized are displayed on the display unit 16.

図５は、図１の表示部１６に表示された電話番号認識結果の候補リストを示す図である。文字列候補１００−１〜１００−６は、Ａパート側から共通するパートの数が多い順番にソートされている。１番目の文字列候補１００−１では、Ａ_１パートからＤ_１パートまでの「＋８１−０１−０１２−０１２３」が標準の表示色で表示されている。 FIG. 5 is a diagram showing a candidate list of telephone number recognition results displayed on the display unit 16 of FIG. The character string candidates 100-1 to 100-6 are sorted in descending order of the number of common parts from the A part side. In the first character string candidate 100-1, "+ 81-01-012-0123" from _{A 1} Part to _{D 1} part is displayed in the standard display color.

２番目の文字列候補１００−２では、１つ上段の文字列候補１００−１の対応するパートと一致するＡ_２パートからＣ_２パートまでの「＋８１−０１−０１２」がグレーで表示されている。つまり、一致部分はグレーアウトして表示されている。文字列候補１００−１の対応するパートと相違するＤ_２パートの「−０１２２」が標準の表示色で表示されている。結果として、「＋８１−０１−０１２」は目立たなく表示され、「−０１２２」は強調表示されている。 In the second character string candidate 100-2, one upper character string candidate 100-1 from the corresponding _{A 2-part} matching the part to a _{C 2} part is "+ 81-01-012" are displayed in gray Yes. That is, the matching part is displayed in gray. Of D ₂ parts that differ from the corresponding part of the character string candidates 100-1 "-0122" is displayed in the standard display color. As a result, “+ 81-01-012” is displayed inconspicuously and “−0122” is highlighted.

３番目の文字列候補１００−３では、１つ上段の文字列候補１００−２の対応するパートと一致するＡ_３パートからＣ_３パートまでの「＋８１−０１−０１２」がグレーで表示されている。文字列候補１００−２の対応するパートと相違するＤ_３パートの「−０２１３」が標準の表示色で表示されている。 In the third character string candidate 100-3, one upper character string candidate 100-2 from the corresponding _{A 3-part} matching the part to _{C 3} part is "+ 81-01-012" are displayed in gray Yes. Of D ₃ parts that differ from the corresponding part of the character string candidates 100-2 "-0213" is displayed in the standard display color.

４番目の文字列候補１００−４では、１つ上段の文字列候補１００−３の対応するパートと一致するＡ_４パートとＢ_４パートの「＋８１−０１」がグレーで表示されている。文字列候補１００−３の対応するパートと相違するＣ_４パートおよびそれ以降のＤ_４パートの「−１１２−０１２３」が標準の表示色で表示されている。 In the fourth string candidate 100-4, "+ 81-01" of _{A 4} parts and _{B 4} parts that match one upper corresponding part of the character string candidates 100-3 are displayed in gray. The _{C 4} parts and later _{D 4} parts differs from the corresponding part of the character string candidates 100-3 "-112-0123" is displayed in the standard display color.

５番目の文字列候補１００−５では、１つ上段の文字列候補１００−４の対応するパートと一致するＡ_５パートの「＋８１」がグレーで表示されている。文字列候補１００−４の対応するパートと相違するＢ_５パートおよびそれ以降のＤ_５パートまでの「−００−０１２−０１２３」が標準の表示色で表示されている。 In the fifth character string candidate 100-5, "+81" in the _{A 5} part that matches one upper corresponding part of the character string candidates 100-4 are displayed in gray. String candidates 100-4 to the corresponding _{B 5} parts different from the parts and later _{D 5} Part "-00-012-0123" is displayed in the standard display color.

６番目の文字列候補１００−６では、１つ上段の文字列候補１００−５の対応するパートと一致するＡ_６パートとＢ_６パートの「＋８１−００」がグレーで表示されている。文字列候補１００−４の対応するパートと相違するＣ_６パートおよびそれ以降のＤ_６パートの「−０１１−０１２３」が標準の表示色で表示されている。 In the sixth character string candidate 100-6, "+ 81-00" of _{A 6-part} and _{B 6} parts that match one upper corresponding part of the character string candidates 100-5 are displayed in gray. The corresponding _{C 6} Parts and later _{D 6} parts different from the part of the character string candidates 100-4 "-011-0123" is displayed in the standard display color.

ユーザは、受付部２２に選択指示を行うことで、表示された文字列候補１００−１〜１００−６の中から自分の音声に一致した正しい文字列候補を選択する。これにより、ユーザが発話した電話番号をカーナビゲーションシステムに入力することができる。 The user instructs the reception unit 22 to select a correct character string candidate that matches his / her voice from the displayed character string candidates 100-1 to 100-6. As a result, the telephone number spoken by the user can be input to the car navigation system.

本実施形態によれば、文字列候補の間の相違部分を表示部１６に強調表示させるようにしているので、ユーザは、ある文字列候補において、その１つ上段の文字列候補と異なる相違部分を容易に認識できる。従って、表示された複数の文字列候補の中から正しい文字列候補をユーザに速やかに選択させることができる。 According to this embodiment, since the different part between the character string candidates is highlighted on the display unit 16, the user is different from the one upper character string candidate in a certain character string candidate. Can be easily recognized. Therefore, it is possible to prompt the user to select a correct character string candidate from among the plurality of displayed character string candidates.

また、ユーザが表示部１６を直視する時間を短くすることができる。また、カーナビゲーションシステムにおける音声認識を用いた各種検索機能を使いやすくできる。 Moreover, the time for the user to directly view the display unit 16 can be shortened. In addition, various search functions using voice recognition in a car navigation system can be easily used.

以上、実施形態をもとに本発明を説明した。実施形態はあくまでも例示であり、各構成要素や各処理プロセスの組合せにいろいろな変形例が可能なこと、またそうした変形例も本発明の範囲にあることは当業者に理解されるところである。 The present invention has been described above based on the embodiments. The embodiments are merely examples, and it will be understood by those skilled in the art that various modifications can be made to the combination of each component and each processing process, and such modifications are within the scope of the present invention.

例えば、以上の実施形態では、相違部分の表示色より目立たない色で一致部分を表示させることによって相違部分を強調表示させているが、表示制御部２０は、図６に示すように、一致部分を表示部１６に表示させず、相違部分を標準の表示色で表示部１６に表示させてもよい。 For example, in the above embodiment, the difference portion is highlighted by displaying the matching portion in a color that is less conspicuous than the display color of the difference portion. However, as shown in FIG. May be displayed on the display unit 16 in a standard display color instead of being displayed on the display unit 16.

図６は、変形例に係る一致部分が表示されていない文字列候補の表示例である。１番目の文字列候補１００−１は、標準の表示色で全体が表示されている。２番目の文字列候補１００−２は、相違部分である「−０１２２」のみが標準の表示色で表示されている。このような表示によっても、相違部分を目立たせることができる。 FIG. 6 is a display example of a character string candidate that does not display a matching part according to the modification. The entire first character string candidate 100-1 is displayed in a standard display color. In the second character string candidate 100-2, only “−0122” which is a different part is displayed in the standard display color. Such a display can also make a different part conspicuous.

また、表示制御部２０は、図７に示すように、一致部分を目立たなくさせることに加え、相違部分の表示形態を変更して、相違部分を強調表示させてもよい。 Further, as shown in FIG. 7, the display control unit 20 may change the display form of the different part and highlight the different part in addition to making the matching part inconspicuous.

図７は、変形例に係る相違部分の表示形態が変更された文字列候補の表示例である。相違部分２００は、上述した実施形態と同様の標準字体で表示されている。表示制御部２０は、相違部分２００ａの字体を、相違部分２００の標準字体よりも太字で表示部１６に表示させてもよい。また、表示制御部２０は、相違部分２００ｂに下線を付して表示部１６に表示させてもよく、相違部分２００ｃを斜体で表示部１６に表示させてもよい。また、表示制御部２０は、相違部分２００ｄを拡大して表示部１６に表示させてもよく、相違部分２００ｅの背景に着色して表示部１６に表示させてもよい。 FIG. 7 is a display example of character string candidates in which the display form of the different part according to the modification is changed. The different part 200 is displayed in the same standard font as in the above-described embodiment. The display control unit 20 may display the font of the different portion 200 a on the display unit 16 in bolder than the standard font of the different portion 200. Further, the display control unit 20 may display the different part 200b on the display unit 16 with an underline, or may display the different part 200c on the display unit 16 in italics. The display control unit 20 may enlarge the different part 200d and display it on the display unit 16, or may color the background of the different part 200e and display it on the display unit 16.

この変形例では、相違部分をさらに強調することができる。よって、ユーザは相違部分をより容易に認識できる。 In this modification, the different part can be further emphasized. Therefore, the user can more easily recognize the difference.

１…情報処理装置、１０…マイク、１２…音声認識部、１４…記憶部、１６…表示部、１８…抽出部、２０…表示制御部、２２…受付部。 DESCRIPTION OF SYMBOLS 1 ... Information processing apparatus, 10 ... Microphone, 12 ... Voice recognition part, 14 ... Memory | storage part, 16 ... Display part, 18 ... Extraction part, 20 ... Display control part, 22 ... Reception part.

Claims

A voice recognition unit that recognizes a user's voice and acquires a plurality of character string candidates corresponding to the voice that has been voice-recognized from a database;
A display unit for displaying the plurality of character string candidates acquired by the voice recognition unit;
An extraction unit that extracts a difference between the character string candidates acquired by the voice recognition unit;
A display control unit that causes the display unit to highlight the difference portion extracted by the extraction unit;
A receiving unit that receives a user's selection instruction for the plurality of character string candidates displayed on the display unit;
An information processing apparatus comprising: