JP5246512B2

JP5246512B2 - Voice reading system and voice reading terminal

Info

Publication number: JP5246512B2
Application number: JP2009178921A
Authority: JP
Inventors: 健司永松
Original assignee: Hitachi Ltd
Current assignee: Hitachi Ltd
Priority date: 2009-07-31
Filing date: 2009-07-31
Publication date: 2013-07-24
Anticipated expiration: 2029-07-31
Also published as: JP2011033764A

Abstract

<P>PROBLEM TO BE SOLVED: To provide a terminal reading a spot name according to the method that a user reads frequently. <P>SOLUTION: A voice read system includes a plurality of voice read terminals for reading a plurality of words, and a reading information update server connected to the voice read terminals via a network. The voice read terminals keep combinations between the words and readings assigned to the words, and transmits the combinations to the reading information update server. The reading information update server keeps the plurality of combinations transmitted from the plurality of voice read terminals, updates the combinations, and transmits the updated combinations to the voice read terminals. The voice read terminals update the combinations that are kept according to the transmitted combinations, and based on the updated combinations, read the words. <P>COPYRIGHT: (C)2011,JPO&INPIT

Description

本発明は、音声読み上げシステムに関し、特に、複数の読みを持つ名称に優先する読みを決定する音声読み上げシステムに関する。 The present invention relates to a speech-to-speech system, and more particularly to a speech-to-speech system that determines a reading that has priority over a name having a plurality of readings.

近年、自動車に搭載されるカーナビゲーション装置、ならびに公共機関および交通機関において自動放送をする装置など、読み上げる対象となるテキストを音声データに自動変換し、音声によるアナウンスとして出力する装置が広く普及している。これらの装置を用いるシステムには、録音した音声を接続して再生する録音編集方法を用いるシステムと、発音を表した文字または符号列から音声を合成する規則合成方法を用いるシステムとがある。 In recent years, devices that automatically convert text to be read into speech data and output it as speech announcements, such as car navigation devices mounted on automobiles and devices that automatically broadcast in public and transportation facilities, have become widespread. Yes. Systems using these apparatuses include a system that uses a recording and editing method that connects and reproduces recorded speech, and a system that uses a rule synthesis method that synthesizes speech from characters or code strings that represent pronunciation.

録音編集方法は、従来、鉄道等の自動音声案内において用いられてきた。鉄道等において用いられる自動音声案内は、定型的な表現が多く使用される。このため、録音編集方法は、定型的な表現を、録音された音声の部品としてあらかじめ複数用意し、それらの録音された音声の部品を要求に従って適宜組み合わせることによって、音声を生成する方法である。しかし、録音編集方法は、あらかじめ定められた表現を組合せることによって、音声を生成するが、それ以外の手段によって、音声を生成できない。 The recording / editing method has been conventionally used in automatic voice guidance for railways and the like. Automatic voice guidance used in railways and the like often uses a fixed expression. For this reason, the recording editing method is a method of generating a sound by preparing a plurality of standard expressions as recorded sound parts in advance and appropriately combining the recorded sound parts as required. However, the recording and editing method generates sound by combining predetermined expressions, but cannot generate sound by any other means.

一方、規則合成方法は、入力された任意のテキストを音声に変換する方法である。録音編集方法は、あらかじめ想定される表現を音声によって録音しておく必要があったが、規則合成方法は、テキストのみを入力し、入力されたテキストを音声に自動変換する。このため、規則合成方法を用いるシステムは、日々更新されるニュースおよび緊急情報など、頻繁に更新される内容を読み上げるシステムとして、自動車に搭載されるカーナビゲーション装置など様々な場所において利用される。 On the other hand, the rule synthesis method is a method of converting an input arbitrary text into speech. In the recording and editing method, it is necessary to record a presumed expression by voice. In the rule synthesis method, only text is input, and the input text is automatically converted into voice. For this reason, a system using the rule composition method is used in various places such as a car navigation device mounted on an automobile as a system that reads out frequently updated contents such as daily updated news and emergency information.

一般的な規則合成方法は、まず、入力されたテキストに後述の言語処理を行い、そして、読みおよびアクセントの情報を示す中間記号列を生成した後、基本周波数パタン（すなわち、声の高さに対応する声帯の振動周期）および音素継続時間長（すなわち、発声速度に対応する各音素の長さ）などの韻律パラメータを決定する。続いて、規則合成方法は、波形生成処理によって、韻律パラメータにあわせた音声波形を生成する。韻律パラメータから音声波形を生成する方法には、音素または音節に対応する音声素片を組み合わせる、波形接続型音声合成が広く用いられる。 A general rule synthesis method first performs linguistic processing, which will be described later, on an input text, generates an intermediate symbol string indicating reading and accent information, and then sets a fundamental frequency pattern (ie, voice pitch). Prosodic parameters such as the corresponding vocal cord vibration period) and phoneme duration (ie, the length of each phoneme corresponding to the speaking rate) are determined. Subsequently, the rule synthesis method generates a speech waveform according to the prosodic parameter by waveform generation processing. As a method for generating a speech waveform from prosodic parameters, waveform connected speech synthesis is widely used in which speech segments corresponding to phonemes or syllables are combined.

前述の言語処理は、通常、入力されたテキストをそのまま読み上げるように、テキストに読みを付与する処理を含む。すなわち、「国分寺」というテキストが入力された場合には、「国分寺」というテキストには、「こくぶんじ」という読みが付与される。 The language processing described above usually includes a process of giving a reading to the text so that the input text is read as it is. That is, when the text “Kokubunji” is input, the text “Kokubunji” is given the reading “Kokubunji”.

古井貞熙著、「ディジタル音声処理」東海大学出版会出版、１９８５年９月発行Published by Sadaaki Furui, “Digital Audio Processing”, published by Tokai University Press, September 1985 T. Dutoit著、「An Introduction to Text-to-Speech Synthesis」KLUWER出版、1997年発行T. Dutoit, “An Introduction to Text-to-Speech Synthesis”, published by KLUWER, 1997

例えば、カーナビゲーション装置において、地名、交差点名、および建物名などのような地点名称（ＰＯＩ、ＰｏｉｎｔＯｆＩｎｔｅｒｅｓｔ）には、複数の読み方の情報が設定される場合がある。この複数の読み方の情報は、カーナビゲーション装置における目的地設定のための音声認識処理において、利用者がどの読み方によって指定しても目的の地点を設定できるようにするために用いられる情報である。 For example, in a car navigation device, information on a plurality of readings may be set for point names (POI, Point Of Interest) such as place names, intersection names, and building names. The information on the plurality of readings is information used for setting the target point regardless of the reading method specified by the user in the voice recognition processing for setting the destination in the car navigation apparatus.

しかしながら、この読み方の情報は、一般的に利用者からの発声を認識する音声認識において用いられ、カーナビゲーション装置が音声を読み上げるために用いられることは少ない。カーナビゲーション装置が音声を読み上げる音声読み上げにおいて、カーナビゲーション装置が地名を読み上げる場合も、利用者が発声した読み方によって読み上げられることができれば、利用者にとって利便性が向上する。 However, this reading information is generally used in voice recognition for recognizing a utterance from a user, and is rarely used by a car navigation device to read out a voice. When the car navigation apparatus reads out the voice when the car navigation apparatus reads out the voice, if the car navigation apparatus reads out the place name, the convenience can be improved for the user if it can be read out by the reading method spoken by the user.

また、従来の手段を用いて、利用者による音声によって入力した読み方を記録し、記録された読み方を用いて音声を読み上げる際の読み方を決定しても、利用者が音声によって入力したことのない地点名について、利用者が呼ぶであろう読み方を決定することはできない。 Moreover, even if the reading method input by the voice by the user is recorded using the conventional means and the reading method when the voice is read out using the recorded reading method is determined, the user has not input by the voice. It is not possible to determine the reading that the user will call for the location name.

本発明は、上記の問題を鑑みてなされたものであり、地点名称を、利用者が使用している読み方、または利用者が使用すると推測される読み方で読み上げる手法、およびその読み上げ装置を提供することを目的とする。 The present invention has been made in view of the above problems, and provides a method for reading a point name by a reading method used by a user or a reading method that is assumed to be used by a user, and a reading device therefor. For the purpose.

なお、前述の課題は、カーナビゲーション装置における課題によって例示したが、音声を読み上げる装置であれば、すべて同じ課題を持つ。 In addition, although the above-mentioned subject was illustrated by the subject in a car navigation apparatus, if it is an apparatus which reads out a sound, all have the same subject.

本発明の代表的な一例を示せば以下の通りである。すなわち、複数の単語を読み上げる（例えば、音声にて出力する）複数の音声読み上げ端末と、ネットワークを介して前記音声読み上げ端末と接続される読み情報更新サーバとを備える音声読み上げシステムであって、前記音声読み上げ端末は、前記単語と、前記単語に指定される読みとの組み合わせを保持し、前記組み合わせを、前記読み情報更新サーバに送信し、前記読み情報更新サーバは、複数の前記音声読み上げ端末から送信された、複数の前記組み合わせを保持し、前記組み合わせから、前記読みが指定されていない前記単語を取得し、複数の前記組み合わせの中から、前記単語における前記読みが、当該組み合わせの前記単語における前記読みと類似する複数の他の前記組み合わせを特定し、前記複数の他の組み合わせから、当該組み合わせにおいて前記読みが指定されていない前記単語の前記読みを抽出し、前記抽出された読みによって、前記読みが指定されていない単語の前記読みを指定し、前記単語と前記指定された読みとによって、前記組み合わせを更新し、前記更新された組み合わせを前記音声読み上げ端末に送信し、前記音声読み上げ端末は、保持された前記組み合わせを、前記送信された組み合わせによって更新し、前記更新された組み合わせに基づいて、前記単語を読み上げる。 A typical example of the present invention is as follows. That is, a speech-to-speech system comprising a plurality of speech-to-speech terminals that read a plurality of words (for example, output by speech) and a reading information update server connected to the speech-to-speech terminal via a network, The speech reading terminal holds a combination of the word and the reading specified for the word, and transmits the combination to the reading information update server. The reading information update server receives a plurality of the speech reading terminals from The transmitted plurality of the combinations are retained, and the word for which the reading is not designated is acquired from the combination, and the reading in the word is selected from the plurality of the combinations in the word of the combination. A plurality of other combinations similar to the reading are identified, and the plurality of other combinations are Extracting the reading of the word for which the reading is not specified in a combination, specifying the reading of the word for which the reading is not specified by the extracted reading, and by the word and the specified reading , Update the combination, and send the updated combination to the speech-reading terminal, the speech-reading terminal updates the held combination with the transmitted combination, and based on the updated combination Read the word.

本発明の一実施形態によると、利用者が使用している読み方で音声を読み上げる装置を提供できる。 According to an embodiment of the present invention, it is possible to provide an apparatus that reads out a voice in a reading method used by a user.

本発明の第１の実施形態の端末側装置およびサーバ側装置の構成を示すブロック図である。It is a block diagram which shows the structure of the terminal side apparatus and server side apparatus of the 1st Embodiment of this invention. 本発明の第１の実施形態の端末側装置およびサーバ側装置のハードウェアを示すブロック図である。It is a block diagram which shows the hardware of the terminal side apparatus and server side apparatus of the 1st Embodiment of this invention. 本発明の第１の実施形態の端末側装置の処理を示す説明図である。It is explanatory drawing which shows the process of the terminal side apparatus of the 1st Embodiment of this invention. 本発明の第１の実施形態の端末側装置の音声入力処理を示す説明図である。It is explanatory drawing which shows the audio | voice input process of the terminal side apparatus of the 1st Embodiment of this invention. 本発明の第１の実施形態の端末側装置による経路誘導等において用いられる地点情報が含まれた地点データベースの説明図である。It is explanatory drawing of the point database containing the point information used in the route guidance etc. by the terminal side apparatus of the 1st Embodiment of this invention. 本発明の第１の実施形態の端末側装置における読み履歴データベースを示す説明図である。It is explanatory drawing which shows the reading history database in the terminal side apparatus of the 1st Embodiment of this invention. 本発明の第１の実施形態の端末側装置における読み履歴データベースを示す説明図である。It is explanatory drawing which shows the reading history database in the terminal side apparatus of the 1st Embodiment of this invention. 本発明の第１の実施形態の端末側装置の音声合成処理を示す説明図である。It is explanatory drawing which shows the speech synthesis process of the terminal side apparatus of the 1st Embodiment of this invention. 本発明の第１の実施形態の端末側装置の音声合成処理において用いられる単語辞書を示す説明図である。It is explanatory drawing which shows the word dictionary used in the speech synthesis process of the terminal side apparatus of the 1st Embodiment of this invention. 本発明の第１の実施形態の端末側装置の読み優先順更新処理の処理を示す説明図である。It is explanatory drawing which shows the process of the reading priority order update process of the terminal side apparatus of the 1st Embodiment of this invention. 本発明の第１の実施形態の端末側装置の読み履歴送信手段から送信される読み履歴ベクトル情報の説明図である。It is explanatory drawing of the reading history vector information transmitted from the reading history transmission means of the terminal side apparatus of the 1st Embodiment of this invention. 本発明の第１の実施形態の端末側装置の読み履歴送信手段から送信される読み履歴ベクトル情報の説明図である。It is explanatory drawing of the reading history vector information transmitted from the reading history transmission means of the terminal side apparatus of the 1st Embodiment of this invention. 本発明の第１の実施形態の端末側装置の読み優先順受信手段によって受信される読み優先順データを示す説明図である。It is explanatory drawing which shows the reading priority order data received by the reading priority order receiving means of the terminal side apparatus of the 1st Embodiment of this invention. 本発明の第１の実施形態の端末側装置における読み履歴データベースを示す説明図である。It is explanatory drawing which shows the reading history database in the terminal side apparatus of the 1st Embodiment of this invention. 本発明の第１の実施形態の端末側装置の経路誘導等において用いられる地点情報が含まれた地点データベースを示す説明図である。It is explanatory drawing which shows the point database containing the point information used in the route guidance etc. of the terminal side apparatus of the 1st Embodiment of this invention. 本発明の第１の実施形態のサーバ側装置の処理を示す説明図である。It is explanatory drawing which shows the process of the server side apparatus of the 1st Embodiment of this invention. 本発明の第１の実施形態のサーバ側装置の読み履歴記憶手段によって保存される読み履歴ベクトルデータベースを示す説明図である。It is explanatory drawing which shows the reading history vector database preserve | saved by the reading history memory | storage means of the server side apparatus of the 1st Embodiment of this invention. 本発明の第１の実施形態のサーバ側装置の読み履歴登録処理を示す説明図である。It is explanatory drawing which shows the reading history registration process of the server side apparatus of the 1st Embodiment of this invention. 本発明の第１の実施形態のサーバ側装置の読み優先順決定処理を示すフローチャートである。It is a flowchart which shows the reading priority order determination process of the server side apparatus of the 1st Embodiment of this invention. 本発明の第１の実施形態のサーバ側装置の形態素解析処理に基づく読み優先順決定処理を示すフローチャートである。It is a flowchart which shows the reading priority order determination process based on the morphological analysis process of the server side apparatus of the 1st Embodiment of this invention. 本発明の第１の実施形態のサーバ側装置の読み優先順決定手段による地点名称の形態素解析結果を示す説明図である。It is explanatory drawing which shows the morphological analysis result of the point name by the reading priority order determination means of the server side apparatus of the 1st Embodiment of this invention. 本発明の第１の実施形態のサーバ側装置の読み優先順決定手段による地点名称の形態素解析結果を示す説明図である。It is explanatory drawing which shows the morphological analysis result of the point name by the reading priority order determination means of the server side apparatus of the 1st Embodiment of this invention. 本発明の第２の実施形態の端末側装置の構成を示すブロック図である。It is a block diagram which shows the structure of the terminal side apparatus of the 2nd Embodiment of this invention.

（第１の実施形態）
図１は、本発明の第１の実施形態の端末側装置１００、およびサーバ側装置１０１の構成を示すブロック図である。 (First embodiment)
FIG. 1 is a block diagram illustrating configurations of a terminal-side device 100 and a server-side device 101 according to the first embodiment of this invention.

本発明において用いられる装置は、端末側装置１００およびサーバ側装置１０１の組合せを基本とする。 The apparatus used in the present invention is basically a combination of the terminal-side apparatus 100 and the server-side apparatus 101.

図１に示す端末側装置１００は、利用者によってテキストが入力され、入力されたテキストを音声として読み上げる装置である。また、図１に示すサーバ側装置１０１は、端末側装置１００に、地点名称などの読み方、または、最も利用者が呼ぶ可能性の高い読み方を示す読み優先順を送信する装置である。 A terminal-side device 100 illustrated in FIG. 1 is a device in which text is input by a user and the input text is read out as speech. Further, the server-side device 101 shown in FIG. 1 is a device that transmits to the terminal-side device 100 a reading priority order indicating how to read a place name or the like, or how to read the user most likely.

図１に示す端末側装置１００は、読み上げるテキストが入力されるテキスト入力手段１、入力されたテキストの読みを決定する読み決定手段２、決定された読みに従って入力されたテキストを音声に変換する音声合成手段３、変換された音声を利用者に読み上げる音声出力手段９、端末側装置１００に利用者が発した音声を入力する音声入力手段７、利用者によって入力された音声に従って記録されている読み履歴を更新する読み履歴更新手段６、地点名称などの読み方、読み優先順および利用者が音声入力した読み履歴などを保存した読み履歴記憶手段５、記録されている読み履歴をサーバ側装置１０１に送信する読み履歴送信手段４、および、サーバ側装置１０１から通知された読み優先順の情報を受信して読み履歴記憶手段５の情報を更新する読み優先順受信手段８を、少なくとも備える。 A terminal-side device 100 shown in FIG. 1 includes a text input unit 1 for inputting a text to be read, a reading determination unit 2 for determining a reading of the input text, and a voice for converting the text input according to the determined reading into a voice. Synthesis means 3, voice output means 9 for reading the converted voice to the user, voice input means 7 for inputting the voice uttered by the user to the terminal side device 100, and reading recorded according to the voice inputted by the user Reading history update means 6 for updating the history, reading history storage means 5 for storing the reading of the point name, reading priority order, reading history inputted by the user, and the recorded reading history in the server side device 101 The information of the reading priority order notified from the reading history transmitting means 4 to be transmitted and the server side apparatus 101 is received and the information of the reading history storing means 5 is received. Reading priority receiving means 8 for new to comprise at least.

図１に示すサーバ側装置１０１は、端末側装置１００から送信される読み履歴ベクトル情報を受信する読み履歴受信手段１１、複数の端末側装置１００から受信した読み履歴ベクトル情報を保存する読み履歴記憶手段１３、保存される読み履歴ベクトル情報に基づいて地点名称などの読み方、および読み優先順を決定する読み優先順決定手段１４、決定された読み優先順を端末側装置１００に通知する読み優先順送信手段１２を、少なくとも備える。さらに、サーバ側装置１０１が新規の地点名称を追加する機能を有する場合には、サーバ側装置１０１は、新規読み受信手段１５を備える。 A server side apparatus 101 shown in FIG. 1 has a reading history receiving means 11 for receiving reading history vector information transmitted from the terminal side apparatus 100, and a reading history storage for storing reading history vector information received from a plurality of terminal side apparatuses 100. Means 13; reading priority order determining means 14 for determining the reading of the spot name and the like and reading priority order based on the stored reading history vector information; and reading priority order for notifying the terminal device 100 of the determined reading priority order The transmission means 12 is provided at least. Further, when the server-side device 101 has a function of adding a new spot name, the server-side device 101 includes a new reading receiving unit 15.

図２は、本発明の第１の実施形態の端末側装置１００およびサーバ側装置１０１のハードウェアを示すブロック図である。 FIG. 2 is a block diagram illustrating hardware of the terminal device 100 and the server device 101 according to the first embodiment of this invention.

端末側装置１００は、ＣＰＵ２１、メモリ２２、入力装置２３、出力装置２４、およびＮＷインターフェース２５を備える。また、サーバ側装置１０１は、ＣＰＵ２６、メモリ２７、入力装置２８、出力装置２９、およびＮＷインターフェース３０を備える。 The terminal-side device 100 includes a CPU 21, a memory 22, an input device 23, an output device 24, and an NW interface 25. The server-side device 101 includes a CPU 26, a memory 27, an input device 28, an output device 29, and an NW interface 30.

前述の各手段は、各々メモリ２２またはメモリ２７に含まれるプログラムによって実行され、必要に応じて、メモリ２２またはメモリ２７を参照および更新する手段である。 Each of the means described above is executed by a program included in the memory 22 or the memory 27, and refers to and updates the memory 22 or the memory 27 as necessary.

音声出力手段９は、スピーカなどの出力装置２４によって実装され、音声入力手段７は、マイクロフォンなどの入力装置２３によって実装される。プログラムは、ＣＰＵ２１またはＣＰＵ２６によって実行される。また、端末側装置１００とサーバ側装置１０１との間は、インターネット、ＬＡＮまたはＷＡＮなどのネットワーク２０によって接続される。 The audio output means 9 is implemented by an output device 24 such as a speaker, and the audio input means 7 is implemented by an input device 23 such as a microphone. The program is executed by the CPU 21 or the CPU 26. The terminal side device 100 and the server side device 101 are connected by a network 20 such as the Internet, LAN, or WAN.

なお、複数個の端末側装置１００が、サーバ側装置１０１に接続されてよい。 A plurality of terminal-side devices 100 may be connected to the server-side device 101.

第１の実施形態においては、図１に示す端末側装置１００がカーナビゲーション装置に備わる場合を例に、端末側装置１００およびサーバ側装置１０１に実行される処理を示す。 In the first embodiment, processing executed by the terminal-side device 100 and the server-side device 101 will be described by taking as an example the case where the terminal-side device 100 shown in FIG.

図３は、本発明の第１の実施形態の端末側装置１００の処理を示す説明図である。 FIG. 3 is an explanatory diagram illustrating processing of the terminal-side device 100 according to the first embodiment of this invention.

図３に示す説明図は、端末側装置１００による音声を入力する処理と読み方の情報を更新する処理とを示す。カーナビゲーション装置において一般的に実施される、音声の意味を解析するなどの処理は、図３において省略される。端末側装置１００とカーナビゲーション装置とは、物理的に別のハードウェアを用いてもよいし、または、ハードウェアを共用し、プログラムによってわけられていてもよい。 The explanatory diagram shown in FIG. 3 shows a process of inputting voice and a process of updating reading information by the terminal-side device 100. The processing that is generally performed in the car navigation apparatus, such as analyzing the meaning of voice, is omitted in FIG. The terminal-side device 100 and the car navigation device may use physically different hardware, or may share hardware and be separated by a program.

第１の実施形態の端末側装置１００は、音声入力処理２０１、音声合成処理２０２、または読み優先順更新処理２０３のいずれかを実行する。そして、各々の処理の後、次の処理を待つという状態を繰り返す。 The terminal-side device 100 according to the first embodiment executes any of the voice input process 201, the voice synthesis process 202, or the reading priority order update process 203. Then, after each process, the state of waiting for the next process is repeated.

音声入力処理２０１は、利用者が発声した音声を端末側装置１００に入力する処理である。音声入力処理２０１は、読み履歴更新手段６、音声入力手段７によって実行される。音声合成処理２０２は、入力された音声を合成する処理である。音声合成処理２０２は、テキスト入力手段１、読み決定手段２、音声合成手段３、および、音声出力手段９によって実行される。読み優先順更新処理２０３は、読み履歴送信手段４、読み履歴記憶手段５、および読み優先順受信手段８によって実行される。 The voice input process 201 is a process for inputting voice uttered by the user to the terminal-side device 100. The voice input process 201 is executed by the reading history update unit 6 and the voice input unit 7. The voice synthesis process 202 is a process for synthesizing the input voice. The speech synthesis process 202 is executed by the text input unit 1, the reading determination unit 2, the speech synthesis unit 3, and the speech output unit 9. The reading priority order update processing 203 is executed by the reading history transmission unit 4, the reading history storage unit 5, and the reading priority order reception unit 8.

図４は、本発明の第１の実施形態の端末側装置１００の音声入力処理２０１を示す説明図である。 FIG. 4 is an explanatory diagram illustrating the voice input process 201 of the terminal device 100 according to the first embodiment of this invention.

端末側装置１００は、音声入力処理２０１において、音声入力処理３０１、音声認識処理３０２、および読み履歴更新処理３０３を実行する。音声入力処理３０１および音声認識処理３０２は、図１に示す音声入力手段７によって実行され、読み履歴更新処理３０３は、図１に示す読み履歴更新手段６によって実行される。 In the voice input process 201, the terminal-side device 100 executes a voice input process 301, a voice recognition process 302, and a reading history update process 303. The voice input process 301 and the voice recognition process 302 are executed by the voice input means 7 shown in FIG. 1, and the reading history update process 303 is executed by the reading history update means 6 shown in FIG.

以下に示す音声入力処理２０１の処理は、例えば、利用者がカーナビゲーション装置に目的地を設定するために、利用者が地点名称を意味する音声を発してカーナビゲーション装置に地点名称を入力する場合の処理である。 The following voice input processing 201 is performed when, for example, the user inputs a point name to the car navigation device by uttering a voice meaning the point name in order to set the destination in the car navigation device. It is processing of.

音声入力処理２０１が起動された場合、まず音声入力処理３０１が実行される。カーナビゲーション装置の利用者、すなわち運転者が発した音声が、音声入力処理３０１によって端末側装置１００に入力される。 When the voice input process 201 is activated, a voice input process 301 is first executed. A voice uttered by the user of the car navigation device, that is, the driver, is input to the terminal side device 100 by the voice input processing 301.

音声入力処理２０１は、利用者によって、または自動的に起動される。端末側装置１００は、音声入力処理３０１において、マイクロフォンなどの入力装置を介して利用者が発した音声を、端末側装置１００に入力する。端末側装置１００は、入力された音声を示す音声データを音声認識処理３０２へ送る。 The voice input process 201 is activated by the user or automatically. In the voice input process 301, the terminal-side device 100 inputs voice uttered by the user via an input device such as a microphone to the terminal-side device 100. The terminal-side device 100 sends voice data indicating the input voice to the voice recognition process 302.

ここで、利用者が、音声入力処理３０１において「こくぶんじひたち」という音声を発した場合を以下に示す。 Here, the case where the user utters the voice “Kokubunji Hitachi” in the voice input process 301 is shown below.

続いて、端末側装置１００は、音声認識処理３０２によって、利用者によって入力された音声データを認識する。音声認識処理３０２における音声データの認識は、既存の音声認識アルゴリズムを利用してもよい（例えば、非特許文献１、２参照）。非特許文献１には、音声データを、テキストデータに変換する音声認識アルゴリズムが記載されている。 Subsequently, the terminal-side device 100 recognizes voice data input by the user through a voice recognition process 302. For the recognition of the voice data in the voice recognition process 302, an existing voice recognition algorithm may be used (for example, see Non-Patent Documents 1 and 2). Non-Patent Document 1 describes a speech recognition algorithm for converting speech data into text data.

この音声認識処理３０２の結果、利用者によって入力された音声データは、テキストデータに変換される。利用者が「こくぶんじひたち」と発声した場合、入力された音声データは、音声認識処理３０２によって、「こくぶんじひたち」というカナのテキストデータに変換される。 As a result of the voice recognition process 302, the voice data input by the user is converted into text data. When the user utters “Kokubunji Hitachi”, the input voice data is converted into Kana text data “Kokubunji Hitachi” by the voice recognition processing 302.

一般的なカーナビゲーション装置は、このような音声認識の処理によって認識されたテキストデータに基づいて、メモリに備わる地点名称データベースを検索し、利用者が発声した地点が具体的にどの地点を示すかを判定する。そして、カーナビゲーション装置は、判定された地点を、目的地を設定する処理などに送る。 A general car navigation apparatus searches a point name database provided in a memory based on text data recognized by such voice recognition processing, and specifically shows a point indicated by a point uttered by a user. Determine. Then, the car navigation device sends the determined point to a process for setting the destination.

ここで、音声認識処理３０２において認識されたテキストデータに基づいて、利用者が発声した地点が具体的にどの地点をさすかを判定する処理の例を、後述する。 Here, an example of processing for determining which point the user utters specifically refers to based on the text data recognized in the speech recognition processing 302 will be described later.

図５は、本発明の第１の実施形態の端末側装置１００による経路誘導等において用いられる地点情報が含まれた地点名称データベース４００の説明図である。 FIG. 5 is an explanatory diagram of the spot name database 400 including spot information used in route guidance or the like by the terminal-side device 100 according to the first embodiment of this invention.

なお、地点名称データベース４００は、カーナビゲーション装置に備わるメモリに保存されてもよいし、端末側装置１００に備わるメモリに保存されてもよい。また、カーナビゲーション装置および端末側装置１００が共有するメモリに保存されてもよい。地点名称データベース４００は、必要に応じて、カーナビゲーション装置または端末側装置１００から参照または更新される。 The location name database 400 may be stored in a memory provided in the car navigation device, or may be stored in a memory provided in the terminal-side device 100. Moreover, you may preserve | save in the memory which a car navigation apparatus and the terminal side apparatus 100 share. The location name database 400 is referred to or updated from the car navigation device or the terminal-side device 100 as necessary.

図５に示す地点名称データベース４００には、カーナビゲーション装置において用いられる可能性のある地点のリストと、それらの地点を音声によって入力された場合に、入力された音声データを照合する地点名称読みデータとが、複数含まれる。地点名称データベース４００は、地点毎に一意に付された識別子である地点ＩＤ４０１、地点の一般的な名称を示す地点名称４０２、および、地点名称の読み方を示す地点名称読み４０３を、少なくとも含む。 The point name database 400 shown in FIG. 5 includes a list of points that may be used in the car navigation device and point name reading data that collates the input voice data when those points are input by voice. Are included. The spot name database 400 includes at least a spot ID 401 that is an identifier uniquely assigned to each spot, a spot name 402 that indicates a general name of the spot, and a spot name reading 403 that indicates how to read the spot name.

図５に示す地点名称データベース４００において、地点ＩＤ４０１が「１」を示す地点名称４０２は、「日立国分寺店」であり、地点名称４０２が「日立国分寺店」を示す地点名称読み４０３は、第１候補が「ひたちこくぶんじてん」、第２候補が「こくぶんじひたち」、第３候補が「ひたちこくぶんじ」である。 In the location name database 400 shown in FIG. 5, the location name 402 whose location ID 401 indicates “1” is “Hitachi Kokubunji store”, and the location name reading 403 where the location name 402 indicates “Hitachi Kokubunji store” The candidate is “Hitako Kubunji Ten”, the second candidate is “Kokubunji Hitachi”, and the third candidate is “Hitako Kubunji”.

図４に示す音声認識処理３０２の結果、入力された音声データが「こくぶんじひたち」であると認識された場合、端末側装置１００は、図５に示す地点名称データベース４００を検索し、地点ＩＤ４０１が「１」である地点名称読み４０３の第２候補と、入力された音声データとが一致すると判定する。その結果、端末側装置１００は、入力された音声データが示す地点の地点名称４０２は、地点ＩＤ４０１が「３」である「日立国分寺店」であると判定する。 As a result of the speech recognition processing 302 shown in FIG. 4, when the input speech data is recognized as “Kokubunji Hitachi”, the terminal-side device 100 searches the location name database 400 shown in FIG. It is determined that the second candidate of the spot name reading 403 whose ID 401 is “1” matches the input voice data. As a result, the terminal-side device 100 determines that the spot name 402 of the spot indicated by the input voice data is “Hitachi Kokubunji store” whose spot ID 401 is “3”.

音声認識処理３０２において判定された地点名称４０２は、一般的に、カーナビゲーション装置における目的地を設定する処理などに送られる。本発明の第１の実施形態における音声認識処理３０２において判定された地点名称４０２は、読み履歴更新処理３０３に送られる。 The spot name 402 determined in the voice recognition process 302 is generally sent to a process for setting a destination in the car navigation device. The spot name 402 determined in the speech recognition process 302 in the first embodiment of the present invention is sent to the reading history update process 303.

音声認識処理３０２から送られた音声データの判定結果に基づいて、端末側装置１００は、読み履歴更新処理３０３において、読み履歴データベース５００の更新処理を実行する。読み履歴データベース５００の更新処理を、図６および図７を用いて示す。 Based on the determination result of the voice data sent from the voice recognition process 302, the terminal device 100 executes an update process of the reading history database 500 in the reading history update process 303. The update process of the reading history database 500 will be described with reference to FIGS.

図６は、本発明の第１の実施形態の端末側装置１００における読み履歴データベース５００を示す説明図である。 FIG. 6 is an explanatory diagram illustrating the reading history database 500 in the terminal device 100 according to the first embodiment of this invention.

図６に示す読み履歴データベース５００は、図１に示す読み履歴記憶手段５によって保存され、読み決定手段２によって参照されるデータベースである。読み履歴データベース５００が保存されるメモリは、地点名称データベース４００と同じく、端末側装置１００から参照できれば、カーナビゲーション装置または端末側装置１００のいずれの装置にあってもよい。なお、読み履歴データベース５００は、図５に示す地点名称データベース４００と同じデータによって構成されるため、後述するように地点名称データベース４００を用いてもよい。 A reading history database 500 shown in FIG. 6 is a database that is saved by the reading history storage unit 5 shown in FIG. The memory in which the reading history database 500 is stored may be in any of the car navigation apparatus and the terminal side apparatus 100 as long as it can be referred to from the terminal side apparatus 100 as in the point name database 400. Since the reading history database 500 includes the same data as the spot name database 400 shown in FIG. 5, the spot name database 400 may be used as will be described later.

読み履歴データベース５００は、地点名称を示す地点表記５０１、および地点名称の読み方を示す地点読み順５０２を、少なくとも含む。地点表記５０１は、地点名称データベース４００における地点名称４０２と同じである。地点読み順５０２は、利用者によって最近使用された地点名称に基づいて、順位が付されており、最近使用された地点名称の読み方には、第１候補が付される。 The reading history database 500 includes at least a point notation 501 indicating a point name and a point reading order 502 indicating how to read the point name. The point notation 501 is the same as the point name 402 in the point name database 400. In the point reading order 502, a rank is assigned based on the name of a spot used recently by the user, and the first candidate is assigned to the reading of the spot name used recently.

端末側装置１００は、読み履歴更新処理３０３において、音声認識処理３０２から送られた音声データの判定結果を、読み履歴データベース５００において検索する。本実施形態における端末側装置１００は、音声認識処理３０２の判定結果である地点名称の「日立国分寺店」を、読み履歴データベース５００において検索し、地点表記５０１における「日立国分寺店」と、地点読み順５０２において第２候補である「こくぶんじひたち」とを、音声入力処理３０１において入力された音声データであると判定する。 In the reading history update process 303, the terminal side device 100 searches the reading history database 500 for the determination result of the voice data sent from the voice recognition process 302. The terminal-side device 100 according to the present embodiment searches the reading history database 500 for the spot name “Hitachi Kokubunji store”, which is the determination result of the speech recognition processing 302, and reads “Hitachi Kokubunji store” in the point notation 501 as the point reading. In step 502, the second candidate “Kokubunji Hitachi” is determined to be the voice data input in the voice input process 301.

続いて、端末側装置１００は、読み履歴更新処理３０３において、読み履歴データベース５００の地点読み順５０２において、判定された「こくぶんじひたち」を、最近使用された地点名称の読み方であるため、第２候補から第１候補に更新する。この読み履歴更新処理３０３の結果を、図７に示す。 Subsequently, since the terminal-side device 100 is the method of reading the recently used spot name in the reading history update process 303, the determined “Kokubunji Hitachi” in the spot reading order 502 of the reading history database 500 is used. The second candidate is updated to the first candidate. The result of this reading history update process 303 is shown in FIG.

図７は、本発明の第１実施形態の端末側装置１００における読み履歴データベース５００を示す説明図である。 FIG. 7 is an explanatory diagram illustrating the reading history database 500 in the terminal device 100 according to the first embodiment of this invention.

地点表記５０１が「日立国分寺店」である地点読み順５０２のうち「ひたちこくぶんじてん」は、図６に示す地点読み順５０２において第２候補であったが、読み履歴更新処理３０３によって、図７に示す地点読み順５０２の下線６０１に示すように第１候補に更新される。また、地点表記５０１が「日立国分寺店」である地点読み順５０２のうち「ひたちこくぶんじてん」は、図６に示す地点読み順５０２において第１候補であったが、読み履歴更新処理３０３によって、図７に示す地点読み順５０２の下線６０２に示すように第２候補に更新される。 Of the spot reading order 502 where the spot notation 501 is “Hitachi Kokubunji store”, “Hitako Kubunjiten” was the second candidate in the spot reading order 502 shown in FIG. 7 is updated to the first candidate as indicated by the underline 601 in the point reading order 502 shown in FIG. In addition, “Hitako Kubunjiten” in the spot reading order 502 whose spot notation 501 is “Hitachi Kokubunji branch” was the first candidate in the spot reading order 502 shown in FIG. As shown by the underline 602 in the point reading order 502 shown in FIG.

なお、本実施形態の端末側装置１００は、読み履歴更新処理３０３においても図５に示す地点名称データベース４００を用い、複数の音声入力処理２０１が同時に並行して処理される場合、読み履歴更新処理３０３における更新と音声認識処理３０２における検索とが同時に処理されることによって、音声認識処理３０２の検索結果に影響を受けることはない。 Note that the terminal-side device 100 of the present embodiment uses the spot name database 400 shown in FIG. 5 in the reading history update processing 303 as well, and when a plurality of voice input processes 201 are simultaneously processed in parallel, the reading history update processing is performed. Since the update in 303 and the search in the speech recognition process 302 are processed at the same time, the search result of the speech recognition process 302 is not affected.

端末側装置１００は、読み履歴更新処理３０３の後、音声入力処理２０１を終了する。 After the reading history update process 303, the terminal-side device 100 ends the voice input process 201.

前述の音声入力処理２０１によって、端末側装置１００は、利用者から入力された音声を地点名称として認識し、利用者が最近使用した地点名称の読みを、地点名称の読みの第１候補とすることができる。 By the voice input processing 201 described above, the terminal-side device 100 recognizes the voice input from the user as the spot name, and sets the reading of the spot name recently used by the user as the first candidate for reading the spot name. be able to.

次に、図３に示す音声合成処理２０２を、説明する。 Next, the speech synthesis process 202 shown in FIG. 3 will be described.

図８は、本発明の第１の実施形態の端末側装置１００における音声合成処理２０２を示す説明図である。 FIG. 8 is an explanatory diagram illustrating the speech synthesis process 202 in the terminal-side device 100 according to the first embodiment of this invention.

音声合成処理２０２は、カーナビゲーション装置において、例えば、目的地へ至る経路に沿って利用者を誘導する音声を読み上げる場合などに、起動される。音声合成処理２０２において、端末側装置１００は、テキスト入力処理７０１、読み決定処理７０２、および音声合成処理７０３を実行する。テキスト入力処理７０１は、テキスト入力手段１によって実行され、読み決定処理７０２は、読み決定手段２によって実行され、音声合成処理７０３は、音声合成手段３によって実行される。 The voice synthesizing process 202 is activated in the car navigation device, for example, when reading out voices that guide the user along the route to the destination. In the speech synthesis process 202, the terminal-side device 100 executes a text input process 701, a reading determination process 702, and a speech synthesis process 703. The text input process 701 is executed by the text input means 1, the reading determination process 702 is executed by the reading determination means 2, and the speech synthesis process 703 is executed by the speech synthesis means 3.

まず、端末側装置１００は、音声合成処理２０２によってカーナビゲーション装置が読み上げようとする音声を示すテキストデータを、テキスト入力処理７０１において入力される。このテキストデータは、カーナビゲーション装置において行われる、目的地へ至る経路に沿って利用者を誘導する音声をカーナビゲーション装置が読み上げる処理から送られたり、センターサーバから受信した配信情報およびメール情報などを音声としてカーナビゲーション装置が読み上げる処理から送られたりする。 First, in the text input process 701, the terminal-side apparatus 100 receives text data indicating the voice that the car navigation apparatus is to read out by the voice synthesis process 202. This text data is sent from the car navigation device that reads out the voice that guides the user along the route to the destination, or the distribution information and mail information received from the center server. It is sent as a voice from a process that the car navigation device reads out.

続いて、テキスト入力処理７０１によって入力されたテキストデータ、すなわち、カーナビゲーション装置から読み上げられる音声を示すテキストデータは、読み決定処理７０２へ送られる。読み決定処理７０２は、漢字かな混じり文として送られたテキストデータに、テキストデータに含まれる文字列の読みを付与する。 Subsequently, the text data input by the text input process 701, that is, the text data indicating the voice read out from the car navigation device is sent to the reading determination process 702. A reading determination process 702 adds reading of a character string included in text data to text data sent as a kanji-kana mixed sentence.

テキストデータに含まれる文字列の読みを付与する処理は、広義には、従来の音声合成技術における言語処理（読み付与処理）も含まれる。一方でこの読み決定処理７０２において、特定の地点名に対して振り仮名を付与するように、部分的な文字列に対して読みを指定する処理とすることも可能である。 The process of giving the reading of the character string included in the text data broadly includes language processing (reading giving process) in the conventional speech synthesis technology. On the other hand, in this reading determination process 702, it is possible to specify a reading for a partial character string so as to give a pseudonym to a specific spot name.

従来の音声合成技術における読み付与処理は、形態素解析処理に基づいて構成される。形態素解析処理については、例えば文献「自然言語処理」（長尾真編、岩波書店、１９９６年発行）に詳細な記述がある。 The reading imparting process in the conventional speech synthesis technique is configured based on the morphological analysis process. The morpheme analysis process is described in detail in, for example, the document “Natural Language Processing” (Masao Nagao, Iwanami Shoten, 1996).

図９は、本発明の第１の実施形態の端末側装置１００の音声合成処理において用いられる単語辞書８００を示す説明図である。 FIG. 9 is an explanatory diagram illustrating a word dictionary 800 used in the speech synthesis process of the terminal-side device 100 according to the first embodiment of this invention.

この形態素解析処理は、一般に、図９に示す単語辞書８００を参照して解析処理が行われる。音声合成のための形態素解析処理において用いられる単語辞書８００は、少なくとも表記８０１（単語エントリーとして検索される）、品詞８０２、アクセントを含む読み８０３が含まれる。 This morpheme analysis process is generally performed with reference to the word dictionary 800 shown in FIG. The word dictionary 800 used in the morphological analysis processing for speech synthesis includes at least a notation 801 (searched as a word entry), a part of speech 802, and a reading 803 including an accent.

例えば、読み決定処理７０２に、漢字かな混じりテキスト「日立国分寺店の先を右折です」が入力され、従来の読みを付与する処理を行う場合、端末側装置１００は、形態素解析処理の単語辞書に含まれる読み情報を用いて、「ひたちこくぶんじてんのさきをうせつです」のように読みを決定する。 For example, when the kanji-kana mixed text “Hitachi Kokubunji store is right-turned” is input to the reading determination process 702 and the conventional reading process is performed, the terminal-side device 100 adds the word dictionary for the morphological analysis process. Using the reading information included, the reading is determined as follows: “I am a kid.

本発明における端末側装置１００は、読み決定処理７０２において、図９に示す単語辞書８００に加えて、図６または図７に示す読み履歴データベース５００を用いる。 The terminal-side device 100 according to the present invention uses the reading history database 500 shown in FIG. 6 or 7 in addition to the word dictionary 800 shown in FIG.

具体的には、端末側装置１００は、従来の形態素解析処理における形態素の辞書検索処理において、図９に示す単語辞書８００よりも優先して読み履歴データベース５００を検索し、読み履歴データベース５００に含まれる地点表記５０１を、単語エントリーとして検索する。 Specifically, the terminal-side device 100 searches the reading history database 500 in preference to the word dictionary 800 shown in FIG. 9 in the morpheme dictionary search processing in the conventional morpheme analysis processing, and is included in the reading history database 500. The point notation 501 to be searched is searched as a word entry.

これによって、端末側装置１００は、読み履歴データベース５００における地点表記５０１と地点読み順５０２とを優先して検索することができ、読みを付与する処理に反映することができる。具体的には、端末側装置１００は、テキスト入力処理７０１において入力されたテキストデータに含まれる地点表記の文字列「日立国分寺店」に、図６に示す読み履歴データベース５００の状態においては「ひたちこくぶんじてん」という読みが、図７に示す読み履歴データベース５００の状態においては「こくぶんじひたち」という読みを、読み決定処理７０２において付与する。 As a result, the terminal-side device 100 can preferentially search the point notation 501 and the point reading order 502 in the reading history database 500, and can reflect them in the process of giving readings. Specifically, the terminal-side device 100 adds “Hitachi” to the character string “Hitachi Kokubunji” in the point notation included in the text data input in the text input process 701 in the state of the reading history database 500 shown in FIG. In the state of the reading history database 500 shown in FIG. 7, the reading “Kokubunjiten” is given in the reading determination processing 702 as “Kokubunji Hitachi”.

この結果、前述の例に示した「日立国分寺店の先を右折です」が入力された場合、端末側装置１００は、図７に示す読み履歴データベース５００の状態において、入力されたテキストデータに「こくぶんじひたちのさきをうせつです」という読みを付与する。 As a result, when “turn right at the end of the Hitachi Kokubunji store” as shown in the above example is input, the terminal-side device 100 adds the text data to the input text data in the state of the reading history database 500 shown in FIG. "I'm obsessed with our head".

前述の処理によって読みが付与されたテキストデータは、続いて、音声合成処理７０３に送られ、音声に変換される。この音声合成処理７０３は、例えば、非特許文献１、２に記載されている方法を用いればよい。そして、端末側装置１００は、音声に変換されたデータを、図１に示す音声出力手段９によって読み上げてもよいし、カーナビゲーション装置の処理に出力を戻してもよい。 The text data to which reading is given by the above processing is then sent to the speech synthesis processing 703 and converted to speech. For the speech synthesis processing 703, for example, the methods described in Non-Patent Documents 1 and 2 may be used. And the terminal side apparatus 100 may read the data converted into the audio | voice by the audio | voice output means 9 shown in FIG. 1, and may return an output to the process of a car navigation apparatus.

以上の処理によって、第１の実施形態における端末側装置１００を備えるカーナビゲーション装置は、あらかじめ端末側装置１００に登録されている地点名称を音声として読み上げる場合、利用者が音声によって入力したことのある読みによって読み上げることができる。すなわち、「日立国分寺店」という地点を、「こくぶんじひたち」と呼んでいる利用者には、カーナビゲーション装置も「こくぶんじひたち」という音声を読み上げ、「ひたちこくぶんじ」と呼んでいる利用者には、カーナビゲーション装置も「ひたちこくぶんじ」という音声を読み上げることができる。 Through the above processing, the car navigation device including the terminal-side device 100 according to the first embodiment may have been input by the user when the point name registered in advance in the terminal-side device 100 is read as voice. Can be read out by reading. In other words, the car navigation system reads out the voice “Kokubunji Hitachi” to the user who calls the location “Hitachi Kokubunji” as “Kokubunji Hitachi” and calls it “Hitako Kbunji”. The car navigation device can also read out the voice “Hitako Kubunji” to the user who is.

これによって、本発明を適用したカーナビゲーション装置は、利用者が慣れ親しんだ名称の呼び方によって、音声ガイダンスをすることが可能となり、利用者にとって利便性が向上する。 As a result, the car navigation device to which the present invention is applied can provide voice guidance according to the name familiar to the user, which improves convenience for the user.

次に、図３に示す読み優先順更新処理２０３を、図１０を用いて説明する。 Next, the reading priority order update processing 203 shown in FIG. 3 will be described with reference to FIG.

図１０は、本発明の第１の実施形態の端末側装置１００の読み優先順更新処理２０３の処理を示す説明図である。 FIG. 10 is an explanatory diagram illustrating processing of the reading priority order update processing 203 of the terminal device 100 according to the first embodiment of this invention.

読み優先順更新処理２０３は、図１に示す読み履歴送信手段４と読み優先順受信手段８とによって実行される。 The reading priority order update processing 203 is executed by the reading history transmitting means 4 and the reading priority order receiving means 8 shown in FIG.

カーナビゲーション装置は、登録されている地点情報（地点名称を含む）が更新されることがある。具体的には、カーナビゲーション装置が備える経路誘導用の地図データは、定期的に更新されることが多く、ＰＯＩ（ＰｏｉｎｔＯｆＩｎｔｅｒｅｓｔ：地点）または道路情報などが、追加、修正、または削除される。 In the car navigation apparatus, registered spot information (including spot names) may be updated. Specifically, the route guidance map data provided in the car navigation device is often updated regularly, and POI (Point Of Interest) or road information is added, modified, or deleted. .

従来は、利用者がカーディーラー等の店舗に行って、ＣＤ−ＲＯＭ、またはＤＶＤ−ＲＯＭなどの地図情報記録メディアを交換することによって、カーナビゲーション装置が備える地図データは、更新されていた。しかし、今後のカーナビゲーション装置は、カーナビゲーション装置に接続された携帯電話、または無線ＬＡＮなどを用いて、ネットワーク経由によって更新される場合が増えていくと推測される。 Conventionally, the map data provided in the car navigation apparatus has been updated by a user going to a store such as a car dealer and exchanging map information recording media such as a CD-ROM or DVD-ROM. However, it is speculated that future car navigation devices will be increasingly updated via a network using a mobile phone connected to the car navigation device or a wireless LAN.

カーナビゲーション装置は、本実施形態における読み優先順更新処理２０３を起動および実行するように構成すれば、手動によって更新しても自動によって更新しても、いずれの更新の方法を採ってもよい。本実施形態においては、ネットワーク経由において地図データを自動的に更新される場合を後述する。 As long as the car navigation apparatus is configured to activate and execute the reading priority update process 203 in the present embodiment, it may be updated manually or automatically. In this embodiment, a case where map data is automatically updated via a network will be described later.

カーナビゲーション装置において、地図データの更新処理が起動された場合、本発明の端末側装置１００における読み優先順更新処理２０３が同時に実行される。 In the car navigation apparatus, when the map data update process is activated, the reading priority order update process 203 in the terminal side apparatus 100 of the present invention is simultaneously executed.

読み優先順更新処理２０３は、図１０に示すように、読み履歴ベクトル作成処理９０１、読み履歴送信処理９０２、読み優先順受信処理９０３、および、読み履歴更新処理９０４の順に処理される。 As shown in FIG. 10, the reading priority order update process 203 is processed in the order of a reading history vector creation process 901, a reading history transmission process 902, a reading priority order reception process 903, and a reading history update process 904.

読み優先順更新処理２０３が起動されると、まず、読み履歴ベクトル作成処理９０１が実行される。 When the reading priority order update processing 203 is activated, first, a reading history vector creation processing 901 is executed.

読み履歴ベクトル作成処理９０１は、カーナビゲーション装置の利用者が地点をどのように呼んだか、すなわちその地点の名称がどのように音声として入力されたかを指定する読み履歴ベクトル情報１０００を作成する処理である。 The reading history vector creation process 901 is a process for creating reading history vector information 1000 that specifies how the user of the car navigation apparatus has called a point, that is, how the name of the point is input as speech. is there.

図１１は、本発明の第１の実施形態の端末側装置１００の読み履歴送信手段４から送信される読み履歴ベクトル情報１０００の説明図である。 FIG. 11 is an explanatory diagram of the reading history vector information 1000 transmitted from the reading history transmission unit 4 of the terminal device 100 according to the first embodiment of this invention.

読み履歴ベクトル情報１０００は、地点名称に関する様々な情報を含んでもよいが、最も簡単には、例えば、図１１に示すように、利用者が端末側装置１００に音声入力し、読み履歴更新処理３０３によって更新された読み履歴データベース５００内における地点表記５０１と地点読み順５０２の第１候補との組みあわせを列挙したベクトル形式であってもよい。図１１に示す地点表記１００１は、図６に示す地点表記５０１であり、図１１に示す地点読み第１候補１００２は、図６に示す地点読み順５０２の第１候補である。 The reading history vector information 1000 may include various pieces of information related to the location name. In the simplest case, for example, as shown in FIG. May be a vector format listing the combinations of the point notation 501 and the first candidate in the point reading order 502 in the reading history database 500 updated by. A spot notation 1001 shown in FIG. 11 is a spot notation 501 shown in FIG. 6, and a spot reading first candidate 1002 shown in FIG. 11 is a first candidate in the spot reading order 502 shown in FIG.

また、前述のように、読み履歴データベース５００と地点データベース４００とを共用する場合の、読み履歴ベクトル情報１０００を示す。 Further, as described above, the reading history vector information 1000 when the reading history database 500 and the point database 400 are shared is shown.

図１２は、本発明の第１の実施形態の端末側装置１００の読み履歴送信手段４から送信される読み履歴ベクトル情報１０００の説明図である。 FIG. 12 is an explanatory diagram of the reading history vector information 1000 transmitted from the reading history transmission unit 4 of the terminal device 100 according to the first embodiment of this invention.

図１２に示す読み履歴ベクトル情報１０００は、地点ＩＤ１１０１ごとの地点読み第１候補１１０２を列挙したベクトル形式によって示される。図１２に示す地点読み第１候補１１０２のうち「−」を示す行は、利用者による音声の入力がまだなされていない、すなわち、読み履歴更新処理３０３が行われていない地点を示す。 The reading history vector information 1000 shown in FIG. 12 is shown in a vector format in which the point reading first candidates 1102 for each point ID 1101 are listed. The row indicating “-” in the first point reading 1102 shown in FIG. 12 indicates a point where the user has not yet input a voice, that is, the reading history update process 303 is not performed.

読み履歴ベクトル作成処理９０１によって作成された読み履歴ベクトル情報１０００は、読み履歴送信処理９０２によって、サーバ側装置１０１へ送信される。 The reading history vector information 1000 created by the reading history vector creation processing 901 is transmitted to the server side device 101 by the reading history transmission processing 902.

読み履歴ベクトル作成処理９０１と、読み履歴送信処理９０２とは、読み履歴送信手段４によって実行される。 The reading history vector creation process 901 and the reading history transmission process 902 are executed by the reading history transmission unit 4.

読み履歴ベクトル情報１０００は、端末側装置１００からサーバ側装置１０１へ、前述に示したカーナビゲーション装置の地図データの更新と同じく、携帯電話または無線ＬＡＮのようなネットワークを経由して自動的に送信されてもよい。また、例えば、ＵＳＢメモリまたはＳＤメモリカードのようなデータ記録メディアを用いた手作業によって、カーディーラー店舗に設置された地図データを更新するサーバ装置に入力され、カーディーラー店舗に設置されたサーバ装置からサーバ側装置１０１に送られてもよい。 The reading history vector information 1000 is automatically transmitted from the terminal-side device 100 to the server-side device 101 via a network such as a mobile phone or a wireless LAN, similar to the update of the map data of the car navigation device described above. May be. Further, for example, a server device installed in a car dealer store is input to a server device that updates map data installed in a car dealer store by a manual operation using a data recording medium such as a USB memory or an SD memory card. To the server-side device 101.

読み履歴ベクトル送信処理９０２が終了すると、続いて読み優先順受信処理９０３が実行される。読み優先順受信処理９０３において、端末側装置１００は、サーバ側装置１０１から読み優先順データ１２００が送信されるまで待機する。 When the reading history vector transmission processing 902 is completed, reading priority order reception processing 903 is subsequently executed. In the reading priority order reception processing 903, the terminal side device 100 stands by until the reading priority order data 1200 is transmitted from the server side device 101.

図１３は、本発明の第１の実施形態の端末側装置１００の読み優先順受信手段８によって受信される読み優先順データ１２００を示す説明図である。 FIG. 13 is an explanatory diagram illustrating the reading priority data 1200 received by the reading priority receiving unit 8 of the terminal device 100 according to the first embodiment of this invention.

図１３に示す読み優先順データ１２００は、図１２に示す読み履歴ベクトル情報１０００と同じく地点ＩＤ１２０１ごとに読み優先順データ１２００が示される形式であるが、前述のように図１１に示す読み履歴ベクトル情報１０００と同じベクトル形式であってもよい。 The reading priority order data 1200 shown in FIG. 13 has the format in which the reading priority order data 1200 is shown for each point ID 1201 as in the reading history vector information 1000 shown in FIG. 12, but as described above, the reading history vector data 1200 shown in FIG. The same vector format as the information 1000 may be used.

サーバ側装置１０１から送信される読み優先順データ１２００は、端末側装置１００において新たに追加すべき地点を示す地点ＩＤ１２０１と、その読みを示す地点読み第１候補１２０２とを含む。 The reading priority order data 1200 transmitted from the server side device 101 includes a point ID 1201 indicating a point to be newly added in the terminal side device 100, and a first point reading candidate 1202 indicating the reading.

図１２に示す読み履歴ベクトル情報１０００において、地点ＩＤ１１０１が「２」であるエントリーは、地点読み第１候補１１０２に「−」と記載されていた。これに対し、図１３に示す読み優先順データ１２００において、地点ＩＤ１２０１が「２」であるエントリーは、地点ＩＤ１２０１の地点読み第１候補１２０２に、「ひたちほんしゃ」という読みが指定される。すなわち、サーバ側装置１０１は、読みが記載されていなかった地点ＩＤに読みを指定して、読みを指定した地点ＩＤを端末側装置１００に送る。 In the reading history vector information 1000 illustrated in FIG. 12, an entry whose location ID 1101 is “2” is described as “−” in the first location reading candidate 1102. On the other hand, in the reading priority order data 1200 shown in FIG. 13, for an entry whose location ID 1201 is “2”, the reading “Hitashonsha” is designated as the first location reading candidate 1202 of the location ID 1201. That is, the server-side apparatus 101 designates a reading for a spot ID for which no reading is described, and sends the spot ID for which the reading is designated to the terminal-side apparatus 100.

このように、図１３に示す読み優先順データ１２００は、読みを追加すべき地点と読みの組みあわせを、サーバ側装置１０１によって０個以上指定された情報を含む。なお、サーバ側装置１０１によって読みを指定された地点ＩＤ１２０１には、端末側装置１００において新たに追加すべき旨を示すフラグを付加してもよい。 As described above, the reading priority order data 1200 illustrated in FIG. 13 includes information in which zero or more combinations of points to which readings should be added and readings are designated by the server-side apparatus 101. Note that a flag indicating that a new addition should be added in the terminal device 100 may be added to the point ID 1201 designated for reading by the server device 101.

端末側装置１００は、サーバ側装置１０１から図１３に示す読み優先順データ１２００を受信すると、続いて読み履歴更新処理９０４を実行する。 Upon receiving the reading priority order data 1200 shown in FIG. 13 from the server-side apparatus 101, the terminal-side apparatus 100 subsequently executes a reading history update process 904.

読み履歴更新処理９０４は、サーバ側装置１０１から受信した読み優先順データ１２００に基づいて、読み履歴データベース５００を更新する。すなわち、読み履歴データベース５００に保存される各地点の地点読み順５０２に、受信した読み優先順データ１２００に指定された読みを第１候補として設定する。 The reading history update processing 904 updates the reading history database 500 based on the reading priority order data 1200 received from the server side device 101. That is, the reading specified in the received reading priority data 1200 is set as the first candidate in the point reading order 502 of each point stored in the reading history database 500.

例えば、図１３に示す読み優先順データ１２００を受信した場合、地点ＩＤ１２０１が「２」である地点と、その地点読み第１候補１２０２が示す読みとは、新たに読み履歴データベース５００に追加すべき地点とその読みとを示す。 For example, when the reading priority order data 1200 shown in FIG. 13 is received, the point where the point ID 1201 is “2” and the reading indicated by the point reading first candidate 1202 should be newly added to the reading history database 500. The point and its reading are shown.

端末側装置１００は、読み優先順受信処理９０３において受信した読み優先順データ１２００から、追加すべき地点と読みとの組み合わせをすべて抽出する。そして、抽出された組み合わせを、対応する読み履歴データベース５００の地点表記５０１と地点読み順５０２とに、第１候補として追加する。 The terminal-side device 100 extracts all combinations of points to be added and readings from the reading priority order data 1200 received in the reading priority order receiving process 903. Then, the extracted combination is added as a first candidate to the point notation 501 and the point reading order 502 of the corresponding reading history database 500.

この結果、図７に示す読み履歴データベース５００、または図５に示す地点データベース４００は、それぞれ図１４、および図１５に示す内容に更新される。 As a result, the reading history database 500 shown in FIG. 7 or the point database 400 shown in FIG. 5 is updated to the contents shown in FIGS. 14 and 15, respectively.

図１４は、本発明の第１の実施形態の端末側装置１００における読み履歴データベース５００を示す説明図である。 FIG. 14 is an explanatory diagram illustrating the reading history database 500 in the terminal device 100 according to the first embodiment of this invention.

端末側装置１００は、地点表記５０１が「日立本店」を示すエントリーの地点読み順５０２に、「ひたちほんしゃ」という読みを下線１３０１に示すように第１候補として追加する。 The terminal-side device 100 adds the reading “Hitashonsha” as the first candidate as indicated by the underline 1301 in the point reading order 502 of the entry whose point notation 501 indicates “Hitachi head office”.

端末側装置１００は、読み優先順データ１２００から抽出した組み合わせのうち、地点ＩＤ１２０１について、地点データベース４００において検索してから、地点表記５０１を検索してもよい。 The terminal-side device 100 may search the spot notation 501 after searching the spot database 400 for the spot ID 1201 among the combinations extracted from the reading priority order data 1200.

図１５は、本発明の第１の実施形態の端末側装置１００の経路誘導等において用いられる地点情報が含まれた地点データベース４００を示す説明図である。 FIG. 15 is an explanatory diagram illustrating a point database 400 including point information used in route guidance or the like of the terminal-side device 100 according to the first embodiment of this invention.

端末側装置１００は、地点表記４０１が「２」を示すエントリーの地点名称読み４０３に、「ひたちほんしゃ」という読みを下線１４０１に示すように第１候補として追加する。 The terminal-side device 100 adds the reading “Hitachi Honsha” as the first candidate as indicated by the underline 1401 to the point name reading 403 of the entry whose point notation 401 indicates “2”.

読み履歴データベース５００、または地点データベース４００の更新が終了すると、読み優先順更新処理２０３が終了する。 When the update of the reading history database 500 or the point database 400 ends, the reading priority order update processing 203 ends.

前述の通り、読み優先順更新処理２０３によって、カーナビゲーション装置の利用者がこれまで音声によって入力したことのない地点「日立本店」に、優先すべき読み「ひたちほんしゃ」が指定される。そして、読み優先順更新処理２０３が行われた後、カーナビゲーション装置は、利用者に経路を誘導するために読み上げる音声に、例えば「日立本店の先を右折です」というテキストが指定された場合、前述の音声合成処理２０２における読み決定処理３０２によって、「ひたちほんしゃのさきをうせつです」という読みを読み上げる。 As described above, the reading priority order update process 203 designates the reading “Hitahonsha” to be prioritized at the point “Hitachi head office” that the user of the car navigation apparatus has not input by voice so far. Then, after the reading priority order update process 203 is performed, the car navigation device, for example, if the text “Reading to the right after the Hitachi head office” is specified in the voice to be read to guide the route to the user, By the reading determination process 302 in the speech synthesis process 202 described above, a reading “Hitachi no saki saku setsu sei” is read out.

次に、端末側装置１００と連携するサーバ側装置１０１の処理について示す。 Next, processing of the server side apparatus 101 that cooperates with the terminal side apparatus 100 will be described.

サーバ側装置１０１の構成は、図１に示される。以下、この図１に従って、サーバ側装置１０１における処理を示す。 The configuration of the server side device 101 is shown in FIG. The processing in the server side device 101 will be described below according to FIG.

図１６は、本発明の第１の実施形態のサーバ側装置１０１の処理を示す説明図である。 FIG. 16 is an explanatory diagram illustrating processing of the server-side device 101 according to the first embodiment of this invention.

まず、図１１または図１２に示す読み履歴ベクトル情報１０００は、端末側装置１００の読み履歴送信手段４によって、サーバ側装置１０１に送信される。読み履歴ベクトル情報１０００は、サーバ側装置１０１において、読み履歴送信手段４に対応する受信手段である読み履歴受信手段１１によって受信され、続いて、読み履歴記憶手段１３へ送られる（読み履歴受信処理１８０１）。 First, the reading history vector information 1000 shown in FIG. 11 or 12 is transmitted to the server-side device 101 by the reading history transmission means 4 of the terminal-side device 100. The reading history vector information 1000 is received by the server-side apparatus 101 by the reading history receiving means 11 which is a receiving means corresponding to the reading history transmitting means 4 and subsequently sent to the reading history storage means 13 (reading history receiving process). 1801).

なお、サーバ側装置１０１は、複数の端末側装置１００から送信される複数の読み履歴ベクトル情報１０００を受信できる。その場合、サーバ側装置１０１は、一つの端末側装置１００から読み履歴ベクトル情報１０００を受信してから、その端末側装置１００に読み優先順データ１２００を送信するまでの間、他の端末側装置１００からの読み履歴ベクトル情報１０００の送信要求を承認せず、他の端末側装置１００を待機させておいてもよい。また、複数の端末側装置１００からの送信要求を、並列に処理してもよい。後者の場合、サーバ側装置１０１は、後述する読み履歴記憶手段１３および読み優先順決定手段１４を排他的に処理する。 The server device 101 can receive a plurality of reading history vector information 1000 transmitted from a plurality of terminal devices 100. In this case, the server-side apparatus 101 receives other reading history vector information 1000 from one terminal-side apparatus 100 and transmits other reading-priority data 1200 to the terminal-side apparatus 100. Instead of approving the transmission request of the reading history vector information 1000 from 100, another terminal device 100 may be kept on standby. Further, transmission requests from a plurality of terminal-side devices 100 may be processed in parallel. In the latter case, the server side device 101 exclusively processes a reading history storage unit 13 and a reading priority order determination unit 14 which will be described later.

以下に示す本実施形態のサーバ側装置１０１は、一つの端末側装置１００から送信された読み履歴ベクトル情報１０００について処理する。 The server side apparatus 101 of this embodiment shown below processes the reading history vector information 1000 transmitted from one terminal side apparatus 100.

読み履歴ベクトル情報１０００は、端末側装置１００から送信され、読み履歴受信手段１１によって受信されると、読み履歴記憶手段１３に送られて読み履歴ベクトルデータベース１５００に保存される（読み履歴登録処理１８０２）。 When the reading history vector information 1000 is transmitted from the terminal-side device 100 and received by the reading history receiving means 11, it is sent to the reading history storage means 13 and stored in the reading history vector database 1500 (reading history registration processing 1802). ).

図１７は、本発明の第１の実施形態の読み履歴記憶手段１３によって保存される読み履歴ベクトルデータベース１５００を示す説明図である。 FIG. 17 is an explanatory diagram illustrating a reading history vector database 1500 stored by the reading history storage unit 13 according to the first embodiment of this invention.

読み履歴ベクトルデータベース１５００は、各端末側装置１００から送信される読み履歴ベクトル情報１０００を、端末側装置１００を一意に識別する端末ＩＤ１５０１とともに保存する。例えば、端末ＩＤ１５０１が「１」（以降、端末ＩＤ１と記載する）である端末側装置１００から図１１または図１２に示す読み履歴ベクトル情報１０００が送られた場合、サーバ側装置１０１の読み履歴ベクトルデータベース１５００には、図１７に示すように端末ＩＤ１のエントリーに、受信された読み履歴ベクトル情報１０００が保存される。以降、各端末ＩＤ１５０１に対応するエントリーを、読み履歴ベクトルと記載する。 The reading history vector database 1500 stores reading history vector information 1000 transmitted from each terminal device 100 together with a terminal ID 1501 that uniquely identifies the terminal device 100. For example, when the reading history vector information 1000 shown in FIG. 11 or FIG. 12 is sent from the terminal side device 100 whose terminal ID 1501 is “1” (hereinafter referred to as terminal ID 1), the reading history vector of the server side device 101 is sent. In the database 1500, the received reading history vector information 1000 is stored in the entry of the terminal ID1, as shown in FIG. Hereinafter, an entry corresponding to each terminal ID 1501 is referred to as a reading history vector.

具体的には、端末側装置１００から送信された読み履歴ベクトル情報１０００の地点ＩＤ１１０１が「１」を示す地点読み第１候補１１０２の値は、読み履歴ベクトルデータベース１５００において、一意に決定される端末ＩＤ１５０１の地点ＩＤ１として保存される。また、読み履歴ベクトル情報１０００の地点ＩＤ１１０１が「２」を示す地点読み第１候補１１０２の値は、読み履歴ベクトルデータベース１５００における地点ＩＤ２に保存される。 Specifically, the value of the point reading first candidate 1102 in which the point ID 1101 of the reading history vector information 1000 transmitted from the terminal side device 100 indicates “1” is uniquely determined in the reading history vector database 1500. It is stored as the point ID1 of ID1501. In addition, the value of the first point reading 1102 where the point ID 1101 of the reading history vector information 1000 indicates “2” is stored in the point ID 2 in the reading history vector database 1500.

他の端末ＩＤ１５０１が示す端末側装置１００から読み履歴ベクトル情報１０００が送信された場合も同じく、サーバ側装置１０１は、対応する端末ＩＤ１５０１の読み履歴ベクトルへ送られた読み履歴ベクトル情報１０００を保存する。 Similarly, when the reading history vector information 1000 is transmitted from the terminal side device 100 indicated by the other terminal ID 1501, the server side device 101 stores the reading history vector information 1000 sent to the reading history vector of the corresponding terminal ID 1501. .

読み履歴登録処理１８０２は、読み履歴記憶手段１３によって保存される読み履歴ベクトルデータベース１５００の読み履歴ベクトルを作成し、作成された読み履歴ベクトルを読み履歴ベクトルデータベース１５００に登録する処理である。 The reading history registration process 1802 is a process of creating a reading history vector of the reading history vector database 1500 stored by the reading history storage unit 13 and registering the created reading history vector in the reading history vector database 1500.

図１８は、本発明の第１の実施形態のサーバ側装置の読み履歴登録処理１８０２を示す説明図である。 FIG. 18 is an explanatory diagram illustrating the reading history registration processing 1802 of the server-side device according to the first embodiment of this invention.

読み履歴登録処理１８０２は、受信した読み履歴ベクトル情報１０００に対応する端末ＩＤ１５０１を取得し（端末ＩＤ取得処理１９０１）、取得した端末ＩＤ１５０１を付与された読み履歴ベクトルデータベース１５００の読み履歴ベクトルを作成し（登録データ作成処理１９０２）、読み履歴ベクトルデータベース１５００に登録する（排他的ＤＢ登録処理１９０３）。 The reading history registration processing 1802 acquires a terminal ID 1501 corresponding to the received reading history vector information 1000 (terminal ID acquisition processing 1901), and creates a reading history vector of the reading history vector database 1500 assigned the acquired terminal ID 1501. (Registered data creation processing 1902), registering in the reading history vector database 1500 (exclusive DB registration processing 1903).

読み履歴登録処理１８０２に続いて、後述する読み優先順決定処理１８０３が実行される。 Subsequent to the reading history registration process 1802, a reading priority order determination process 1803 described later is executed.

読み優先順決定処理１８０３は、読み履歴ベクトル情報１０００を送信した端末側装置１００に、その端末側装置１００の利用者がまだ音声によって入力していない地点、すなわち受信した読み履歴ベクトル情報１０００内に読みが指定されていない地点の、読みを決定する。なお、読み優先順決定処理１８０３は、読み優先順決定手段１４によって実行される。 The reading priority order determination processing 1803 is performed in the terminal-side device 100 that has transmitted the reading history vector information 1000 to a point that the user of the terminal-side device 100 has not yet input by voice, that is, in the received reading history vector information 1000. Determine the reading at points where no reading is specified. Note that the reading priority order determination processing 1803 is executed by the reading priority order determination means 14.

図１９は、本発明の第１の実施形態のサーバ側装置１０１の読み優先順決定処理１８０３を示すフローチャートである。 FIG. 19 is a flowchart illustrating the reading priority order determination processing 1803 of the server-side apparatus 101 according to the first embodiment of this invention.

ここで、読み履歴登録処理１８０２において、端末側装置１００から送信された読み履歴ベクトル情報１０００は、端末ＩＤ１の読み履歴ベクトルに保存されたとする。 Here, in the reading history registration process 1802, it is assumed that the reading history vector information 1000 transmitted from the terminal-side device 100 is stored in the reading history vector of the terminal ID1.

サーバ側装置１０１は、読み優先順決定処理１８０３において、まず、読み履歴ベクトルデータベース１５００から、端末ＩＤ１の読み履歴ベクトルと比較する他の読み履歴ベクトルを取得する（Ｓ２００１）。 In the reading priority order determination process 1803, the server-side apparatus 101 first acquires another reading history vector to be compared with the reading history vector of the terminal ID1 from the reading history vector database 1500 (S2001).

そして、サーバ側装置１０１は、取得された読み履歴ベクトルと端末ＩＤ１の読み履歴ベクトルとの距離、すなわち類似性を、後述する手段によって算出する（Ｓ２００２）。 Then, the server side device 101 calculates the distance between the acquired reading history vector and the reading history vector of the terminal ID1, that is, the similarity by means described later (S2002).

サーバ側装置１０１は、読み履歴ベクトルデータベース１５００に含まれる全読み履歴ベクトルと、端末ＩＤ１の読み履歴ベクトルとの比較が、すべて終了したか否かを判定する（Ｓ２００３）。 The server-side apparatus 101 determines whether or not the comparison between the all reading history vector included in the reading history vector database 1500 and the reading history vector of the terminal ID1 has been completed (S2003).

終了していない場合、サーバ側装置１０１は、Ｓ２００１に戻り、まだ端末ＩＤ１の読み履歴ベクトルと比較していない読み履歴ベクトルを取得する。 If not completed, the server-side apparatus 101 returns to S2001, and acquires a reading history vector that has not been compared with the reading history vector of the terminal ID1.

終了した場合は、サーバ側装置１０１は、Ｓ２００２において算出された距離の中から、最小の距離となる読み履歴ベクトルを取得する（Ｓ２００４）。 When the processing is completed, the server-side apparatus 101 acquires a reading history vector that is the minimum distance from the distances calculated in S2002 (S2004).

サーバ側装置１０１は、最小の距離となる読み履歴ベクトルと、端末ＩＤ１の読み履歴ベクトルとを比較し、端末ＩＤ１の読み履歴ベクトルにおいて指定されていない地点ＩＤに、最小の距離となる読み履歴ベクトルにおける同じ地点ＩＤを持つ値をコピーする（Ｓ２００５）。 The server side device 101 compares the reading history vector having the minimum distance with the reading history vector of the terminal ID1, and reads the reading history vector having the minimum distance to the point ID not specified in the reading history vector of the terminal ID1. A value having the same point ID is copied (S2005).

最後に、サーバ側装置１０１は、Ｓ２００５によって地点ＩＤの値をコピーされた読み履歴ベクトルによって、読み履歴ベクトルデータベース１５００を更新する（Ｓ２００６）。 Finally, the server side apparatus 101 updates the reading history vector database 1500 with the reading history vector copied with the value of the point ID in S2005 (S2006).

Ｓ２００３の詳細を、後述する。 Details of S2003 will be described later.

ここで、読み優先順決定手段１４（すなわち、読み優先順決定処理１８０３）によって決定される読みは、利便性の向上のため、その端末側装置１００の利用者が今後、その地点を音声によって入力する場合に使用する可能性の高い読みである必要がある。 Here, the reading determined by the reading priority order determination means 14 (that is, the reading priority order determination processing 1803) is used by the user of the terminal-side device 100 to input the point by voice for the sake of convenience. It is necessary to read with a high possibility of being used.

利用者によって使用される可能性の高い読みを決定するためのＳ２００３の方法には、例えば、後述する方法がある。 Examples of the method of S2003 for determining readings that are likely to be used by the user include a method described later.

まず、サーバ側装置１０１は、読み履歴記憶手段１３によって読み履歴ベクトルデータベース１５００に保存される読み履歴ベクトルのうち、読み履歴ベクトル情報１０００を送信した端末側装置１００、すなわち本実施形態においては、端末ＩＤ１の読み履歴ベクトルに最も近い読み履歴ベクトルを検索する。 First, the server side device 101 transmits the reading history vector information 1000 among the reading history vectors stored in the reading history vector database 1500 by the reading history storage unit 13, that is, in this embodiment, the terminal side device 100. A reading history vector closest to the reading history vector of ID1 is searched.

なお、最も近い読み履歴ベクトル、すなわち最も類似している読み履歴ベクトルを検索するために、地点ＩＤ１およびその他の地点ＩＤの値の各々を要素ととらえ、一つの読み履歴ベクトルが複数の要素によってベクトルを構成しているとみなし、そのベクトルの距離を算出することによって、最も類似する読み履歴ベクトルを検索する。 In order to search for the nearest reading history vector, that is, the most similar reading history vector, each of the values of the point ID 1 and other point IDs is regarded as an element, and one reading history vector is a vector based on a plurality of elements. The most similar reading history vector is searched by calculating the distance of the vector.

ここで、ベクトルの距離には、例えば、読みの一致または不一致する地点ＩＤの個数によって算出するハミング距離、すなわち読みが一致しなかった地点ＩＤの個数を用いることができる。このとき、読みが指定されていない地点ＩＤ（図１７に示す「−」という記号が記載されている要素）は、読みが一致したものとして算出する。 Here, for the vector distance, for example, the Hamming distance calculated by the number of spot IDs that match or do not match, that is, the number of spot IDs that do not match the readings can be used. At this time, the point ID for which reading is not designated (the element in which the symbol “-” shown in FIG. 17 is described) is calculated as the reading matches.

具体的には、図１７に示す地点ＩＤ１、地点ＩＤ２、地点ＩＤ１０００以外の要素には、「−」が記載されている場合、端末ＩＤ１の読み履歴ベクトルと、端末ＩＤ１５０１が「２」（以降、端末ＩＤ２と記載する）を示す読み履歴ベクトルとのベクトルの距離は、地点ＩＤ１において、双方の値が一致しないため、１と算出される。また、端末ＩＤ１の読み履歴ベクトルと、端末ＩＤ１５０１が「１００」（以降、端末ＩＤ１００と記載する）を示す読み履歴ベクトルとのベクトルの距離は、地点ＩＤ１において、双方の値が一致し、他のすべての値も一致しているとみなせるため、０と算出される。 Specifically, when “−” is described in the elements other than the spot ID 1, the spot ID 2, and the spot ID 1000 illustrated in FIG. 17, the reading history vector of the terminal ID 1 and the terminal ID 1501 are “2” (hereinafter, “ The vector distance from the reading history vector indicating (terminal ID2) is calculated as 1 because both values do not match at the point ID1. Further, the distance between the reading history vector of the terminal ID1 and the reading history vector in which the terminal ID 1501 indicates “100” (hereinafter referred to as the terminal ID100) is the same in both values at the point ID1. Since all values can be regarded as matching, 0 is calculated.

なお、ベクトルの距離は、ハミング距離そのものではなく、距離を算出する二つの読み履歴ベクトルが各々示すベクトルにおいて算出されたハミング距離を、「−」が記載されていない地点ＩＤの個数によって割った値を距離としてもよい。 Note that the vector distance is not the Hamming distance itself, but is a value obtained by dividing the Hamming distance calculated in the vectors respectively indicated by the two reading history vectors for calculating the distance by the number of point IDs where "-" is not described. May be a distance.

また、ベクトルの距離は、あらかじめ地点ＩＤごとに重みを設定しておいて、読みが一致しない地点の重みを合計した値を距離としてもよい。 In addition, the vector distance may be set to a weight for each point ID in advance, and a value obtained by summing the weights of points where readings do not match may be used as the distance.

このベクトルの距離を計算する処理の結果、最も距離の値が低い読み履歴ベクトルの組み合わせが、最も近い読み履歴ベクトルであると判定される。前述の具体例において、端末ＩＤ１の読み履歴ベクトルに最も近い読み履歴ベクトルを持つ端末側装置１００は、ベクトルの距離が０と算出された、端末ＩＤ１００を示す端末側装置１００であると判定される。 As a result of the process of calculating the vector distance, it is determined that the combination of the reading history vectors having the lowest distance value is the closest reading history vector. In the specific example described above, the terminal-side device 100 having the reading history vector closest to the reading history vector of the terminal ID1 is determined to be the terminal-side device 100 indicating the terminal ID100 with the vector distance calculated as 0. .

続いて、サーバ側装置１０１は、端末ＩＤ１を示す読み履歴ベクトルから最もベクトルの距離が近いと判定された読み履歴ベクトルの中から、端末ＩＤ１の読み履歴ベクトルに読みが指定されていない地点ＩＤを検索し、その地点ＩＤの読みを抽出する。前述の具体例において、サーバ側端末１０１は、最も近いと判定された端末ＩＤ１００の読み履歴ベクトルから、端末ＩＤ１の読み履歴ベクトルには読みが指定されていない読み、すなわち、地点ＩＤ２における読み「ひたちほんしゃ」を抽出する。 Subsequently, the server-side apparatus 101 selects a point ID for which reading is not specified in the reading history vector of the terminal ID1, from among the reading history vectors determined to be closest to the reading history vector indicating the terminal ID1. Search and extract the reading of the point ID. In the specific example described above, the server-side terminal 101 reads from the reading history vector of the terminal ID 100 determined to be the closest to the reading that is not specified in the reading history vector of the terminal ID1, that is, the reading “Hitachi” at the point ID2. Extract "Honsha".

前述の具体例においては一つの地点とその読みとが抽出されたが、当然ながら、複数の地点とその読みとの組み合わせが抽出されてもよい。 In the above-described specific example, one point and its reading are extracted, but it goes without saying that a combination of a plurality of points and their readings may be extracted.

前述に示す読み優先順決定処理１８０３によれば、端末ＩＤ１から受信した読み履歴ベクトルにおいて指定されていない読みを、最も距離の近い読み履歴ベクトルから抽出し、読みを指定することができる。 According to the reading priority order determination processing 1803 described above, readings that are not specified in the reading history vector received from the terminal ID 1 can be extracted from the reading history vector with the shortest distance and the reading can be specified.

しかし、図１９に示す処理のように最も距離の近い読み履歴ベクトルを取得するのではなく、任意の距離に存在する複数の読み履歴ベクトルから、指定されていない読みを抽出してもよい。これによって、受信した読み履歴ベクトルにおいても読みが指定されていなく、また、最も距離の近い読み履歴ベクトルにおいても読みが指定されていない場合に、２番目以降に距離が近い読み履歴ベクトルに指定されている読みから抽出することによって、り指定されていない読みをより減らすことが可能となる。 However, instead of acquiring the closest reading history vector as in the process shown in FIG. 19, unspecified readings may be extracted from a plurality of reading history vectors existing at an arbitrary distance. As a result, when the reading is not specified even in the received reading history vector, and when reading is not specified even in the reading history vector closest to the distance, it is specified as the reading history vector closest to the second or later distance. By extracting from the readings that are present, it is possible to further reduce the readings that are not specified.

任意の距離に存在する複数の読み履歴ベクトルから読みを抽出する場合、サーバ側装置１０１は、図１９に示すＳ２００４において、あらかじめ定められた任意の距離に存在する読み履歴ベクトルを取得する。そして、Ｓ２００５において、最も距離の近い読み履歴ベクトルから、受信した読み履歴ベクトルにおいて読みを指定されていなかった地点ＩＤを検索し、最も距離の近い読み履歴ベクトルの地点ＩＤに読みが指定されていない場合は、２番目に距離の近い読み履歴ベクトルを検索する。このように、読みが指定されている読み履歴ベクトルを検索し、指定されている読みの中でも距離が近い読みを抽出する。 When extracting readings from a plurality of reading history vectors existing at an arbitrary distance, the server-side apparatus 101 acquires reading history vectors existing at an arbitrary distance in S2004 shown in FIG. In S2005, a point ID that has not been designated for reading in the received reading history vector is searched from the reading history vector with the closest distance, and no reading is specified for the point ID of the closest reading history vector. In this case, the second closest reading history vector is searched. In this way, reading history vectors for which reading is designated are searched, and readings having a short distance among the designated readings are extracted.

以上の読み優先順決定処理１８０３は、複数の地点の読み方が同じ利用者間において、どちらか一方の利用者がいまだ音声によって入力していなかった地点を初めて呼ぶ場合、もう一方の利用者が用いる読み方と同じ読み方によって、呼ぶ傾向が高いという特徴を利用している。すなわち、「日立国分寺店」を「こくぶんじひたち」と呼ぶ端末ＩＤ１に対応する端末側装置１００の利用者は、「日立国分寺店」を「こくぶんじひたち」と呼ぶ端末ＩＤ１００に対応する端末側装置１００の利用者と同じじように「日立本店」という地名を、「ひたちほんしゃ」と呼ぶ可能性が高い。 The reading priority order determination processing 1803 described above is used by the other user when calling a point that one of the users has not yet input by voice among the same users who read the plurality of points. It uses the feature that it tends to be called by the same reading as the reading. That is, a user of the terminal-side device 100 corresponding to the terminal ID 1 that calls “Hitachi Kokubunji store” as “Kokubunji Hitachi” has a terminal that corresponds to the terminal ID 100 that calls “Hitachi Kokubunji store” as “Kokubunji Hitachi”. Like the user of the side device 100, the place name “Hitachi head office” is highly likely to be called “Hitashonsha”.

図１２に示す読み履歴ベクトル情報１０００は、前述の通り読み優先順決定処理１８０３によって更新され、図１３に示す読み優先順データ１２００のように変更される。図１３に示される読み優先順データ１２００は、読み優先順決定手段１４から読み優先順送信手段１２へ送られ、その後、携帯電話、または無線ＬＡＮなどのネットワークを介して端末側装置１００へ送信される（読み優先順送信処理１８０４）。 The reading history vector information 1000 shown in FIG. 12 is updated by the reading priority order determination processing 1803 as described above, and is changed to reading priority order data 1200 shown in FIG. The reading priority order data 1200 shown in FIG. 13 is sent from the reading priority order determining means 14 to the reading priority order sending means 12, and then sent to the terminal-side device 100 via a network such as a mobile phone or a wireless LAN. (Reading priority order transmission processing 1804).

以上によって、サーバ側装置１０１における、読み履歴ベクトルへの処理を終了する。 Thus, the processing for the reading history vector in the server side device 101 is completed.

サーバ側装置１０１は、新たな地点情報が追加される場合、サーバ側装置１０１に備わる新規読み受信手段１５が実行され、新たな地点ＩＤが追加される。 When new point information is added, the server-side device 101 executes the new reading receiving means 15 provided in the server-side device 101 and adds a new point ID.

新たな地点情報を追加する処理は、まず、図１７に示す読み履歴ベクトルデータベース１５００において、新たな地点ＩＤの列を追加し、新たな地点ＩＤの列の値に未設定を示す「−」を記載する。前述のサーバ側装置１０１の処理は、地点の読み履歴ベクトルデータベース１５００の変更のみであるため、新たに追加された地点の名称およびそれらの地点に対応する複数の読み候補を、端末側装置１００に送信することができない。新たな地点情報を、端末側装置１００に追加する処理は、例えばカーナビゲーション装置に備わる地図データ更新技術などを用いて、別途、カーナビゲーション装置から端末側装置１００に送信されてもよい。 In the process of adding new point information, first, in the reading history vector database 1500 shown in FIG. 17, a new point ID column is added, and “−” indicating unset in the value of the new point ID column is set. Describe. Since the processing of the server side device 101 described above is only the change of the point reading history vector database 1500, the name of the newly added point and a plurality of reading candidates corresponding to those points are given to the terminal side device 100. Cannot send. The process of adding new point information to the terminal device 100 may be separately transmitted from the car navigation device to the terminal device 100 using, for example, a map data update technique provided in the car navigation device.

しかし、新たな地点情報を追加する処理によって、読み履歴ベクトルデータベース１５００の読みの値に「−」を記載するだけでは、端末側装置１００に優先すべき読みを送信できない。これは新たに追加された地点への読みは、すべての端末側装置１０１の利用者が入力していないため、前述の読み履歴ベクトルの距離、すなわち類似性に基づく読み優先順決定処理１８０３によって読みを決定できないためである。 However, by adding “−” to the reading value of the reading history vector database 1500 by the process of adding new point information, reading that should be prioritized cannot be transmitted to the terminal device 100. This is because reading to the newly added point is not input by all users of the terminal-side device 101, so reading is performed by the above-described reading history vector distance, that is, reading priority order determination processing 1803 based on similarity. This is because it cannot be determined.

後述の読み決定方法は、読み履歴ベクトルデータベース１５００において、地点ＩＤが示す地点名称の文字列を形態素解析し、解析結果の距離が近い地点ＩＤに指定されている読みをもとにして、追加された地点の読みを決定する方法である。 The reading determination method to be described later is added based on the reading specified in the point ID having a short distance in the analysis result by performing a morphological analysis on the character string of the point name indicated by the point ID in the reading history vector database 1500. It is a method to determine the reading of the spot.

なお、後述の読み決定方法は、前述の読み履歴ベクトルの距離、すなわち類似性に基づく読み決定手法の代わりに読み優先順決定処理１８０３において用いられてもよい。 Note that the reading determination method described later may be used in the reading priority order determination processing 1803 instead of the above-described reading history vector distance, that is, the reading determination method based on similarity.

以下、具体例を挙げて説明する。以下の説明において、サーバ側装置１０１は、端末ＩＤ１に対応する端末側装置１００の利用者に、読み優先順決定処理１８０３をする。 Hereinafter, a specific example will be described. In the following description, the server-side apparatus 101 performs a reading priority order determination process 1803 for the user of the terminal-side apparatus 100 corresponding to the terminal ID1.

図２０は、本発明の第１の実施形態のサーバ側装置１０１の形態素解析処理に基づく読み優先順決定処理１８０３を示すフローチャートである。 FIG. 20 is a flowchart illustrating the reading priority order determination process 1803 based on the morphological analysis process of the server-side apparatus 101 according to the first embodiment of this invention.

まず、読み履歴ベクトルデータベース１５００に、新たに「日立新宿店」という地点名称の文字列が追加されたとする。サーバ側装置１０１は、新規読み受信手段１５によって、新たな地点名称である「日立新宿店」を受信し、受信した新たな地点名称に一意な地点ＩＤを割り当て、読み履歴ベクトルデータベース１５００の列を作成する。 First, it is assumed that a character string having a location name “Hitachi Shinjuku store” is newly added to the reading history vector database 1500. The server side apparatus 101 receives the new spot name “Hitachi Shinjuku store” by the new reading receiving means 15, assigns a unique spot ID to the received new spot name, and sets the column of the reading history vector database 1500. create.

サーバ側装置１０１は、この文字列に形態素解析処理を行う（Ｓ２１０１）。ここで用いる形態素解析処理は、端末側装置１００による読み決定処理７０２において用いられる処理と同じである。 The server side device 101 performs morphological analysis processing on this character string (S2101). The morpheme analysis process used here is the same as the process used in the reading determination process 702 by the terminal device 100.

この形態素解析処理の結果、サーバ側装置１０１は、「日立新宿店」から図２１に示す解析結果１６００を得る。 As a result of the morphological analysis process, the server-side apparatus 101 obtains an analysis result 1600 shown in FIG. 21 from “Hitachi Shinjuku store”.

図２１は、本発明の第１の実施形態のサーバ側装置１０１の読み優先順決定手段１４による地点名称の解析結果１６００を示す説明図である。 FIG. 21 is an explanatory diagram illustrating a spot name analysis result 1600 by the reading priority order determination unit 14 of the server-side apparatus 101 according to the first embodiment of this invention.

図２１に示す解析結果１６００は、図９に示す単語辞書８００と同じ列を含む。解析結果１６００は、表記１６０１、品詞１６０２、および読み１６０３を含む。本実施形態のサーバ側装置１０１は、図２１に示すように、地点「日立新宿店」の文字列を、「日立」、「新宿」、および「店」の形態素に分割する。 The analysis result 1600 shown in FIG. 21 includes the same column as the word dictionary 800 shown in FIG. The analysis result 1600 includes a notation 1601, a part of speech 1602, and a reading 1603. As shown in FIG. 21, the server-side device 101 of the present embodiment divides the character string of the location “Hitachi Shinjuku store” into morphemes of “Hitachi”, “Shinjuku”, and “Store”.

また、サーバ側装置１０１は、解析結果１６００のうち、品詞１６０２の列のみを抽出し、追加された地点の品詞情報ベクトルを取得する。 Further, the server-side apparatus 101 extracts only the part-of-speech 1602 column from the analysis result 1600, and acquires the part-of-speech information vector of the added point.

次に、サーバ側装置１０１は、読み履歴ベクトルデータベース１５００の端末ＩＤ１の読み履歴ベクトルにおいて、地点ＩＤの読みを取得する（Ｓ２１０２）。そして、取得した地点ＩＤに既に指定されている読みが有るか無しかを判定する（Ｓ２１０３）。指定されている読みがない場合、サーバ側装置１０１は、Ｓ２１０２に戻り、次の地点ＩＤを取得する。 Next, the server side apparatus 101 acquires the reading of the point ID in the reading history vector of the terminal ID1 in the reading history vector database 1500 (S2102). Then, it is determined whether or not there is a reading already specified in the acquired point ID (S2103). When there is no designated reading, the server side apparatus 101 returns to S2102 and acquires the next point ID.

指定されている読みが有る場合、サーバ側装置１０１は、取得された地点ＩＤに対応する地点名称の表記文字列を、Ｓ２１０１の処理と同じく形態素解析する（Ｓ２１０４）。なお、形態素解析処理は、地点情報が追加された際に一度だけ実行し、その解析結果を保存しておいてもよい。 When there is a designated reading, the server-side apparatus 101 performs a morphological analysis on the notation character string of the spot name corresponding to the acquired spot ID in the same manner as the process of S2101 (S2104). Note that the morphological analysis process may be executed once when the point information is added, and the analysis result may be stored.

例えば、サーバ側装置１０１は、地点ＩＤ１に対応する地点「日立国分寺店」の文字列から、形態素解析によって、図２２に示す解析結果１７００を取得する。 For example, the server-side apparatus 101 acquires the analysis result 1700 shown in FIG. 22 from the character string of the location “Hitachi Kokubunji store” corresponding to the location ID 1 by morphological analysis.

図２２は、本発明の第１の実施形態のサーバ側装置１０１の読み優先順決定手段１４による地点名称の解析結果１７００を示す説明図である。 FIG. 22 is an explanatory diagram illustrating a point name analysis result 1700 by the reading priority order determination unit 14 of the server-side apparatus 101 according to the first embodiment of this invention.

読み優先順決定手段１４による解析結果１７００は、図２１と同じく図９に示す単語辞書８００と同じ列を含む。本実施形態のサーバ側装置は、図２２に示すように、地点「日立国分寺店」の文字列を、「日立」、「国分寺」、および「店」に分割する。 The analysis result 1700 by the reading priority order determination unit 14 includes the same columns as the word dictionary 800 shown in FIG. 9 as in FIG. As shown in FIG. 22, the server-side device of the present embodiment divides the character string of the point “Hitachi Kokubunji store” into “Hitachi”, “Kokubunji”, and “Store”.

そして、解析結果１７００のうち品詞１６０２の列のみを抽出し、既に読みが指定されている地点ＩＤの品詞情報ベクトルを取得する。そして、サーバ側装置１０１は、Ｓ２１０２において取得された追加された地点の品詞情報ベクトルと、既に読みが指定されている地点の品詞情報ベクトルとの距離を算出する（Ｓ２１０５）。この距離計算には、前述した一致および不一致によるハミング距離などを用いてもよい。また、距離計算の手段には、品詞情報ベクトルだけではなく、表記１６０１または読み１６０３を各々情報ベクトルとし、各々の情報ベクトル間の距離を算出し、算出した距離に重みをつけて加算するなどをしてもよい。 Then, only the part-of-speech 1602 column is extracted from the analysis result 1700, and the part-of-speech information vector of the point ID for which reading is already specified is acquired. Then, the server-side apparatus 101 calculates the distance between the part-of-speech information vector of the added point acquired in S2102 and the part-of-speech information vector of the point where reading is already specified (S2105). For this distance calculation, the above-described Hamming distance due to coincidence and mismatch may be used. The distance calculation means includes not only the part-of-speech information vector but also the notation 1601 or the reading 1603 as information vectors, calculates the distance between the respective information vectors, adds the weight to the calculated distance, and the like. May be.

次に、サーバ側装置１０１は、既に読みが指定されているすべての地点ＩＤについて、追加された地点の品詞情報ベクトルからの距離を算出したか否かを判定する（Ｓ２１０６）。すべての地点ＩＤについて距離を算出していない場合、サーバ側装置１０１は、Ｓ２１０２に戻る。すべての地点ＩＤについて距離を算出した場合、サーバ側装置１０１は、追加された地点から最も距離が小さい地点を取得する（Ｓ２１０７）。 Next, the server-side apparatus 101 determines whether or not the distance from the part-of-speech information vector of the added point has been calculated for all point IDs for which reading has already been specified (S2106). When the distance is not calculated for all the spot IDs, the server side apparatus 101 returns to S2102. When the distances are calculated for all the spot IDs, the server-side apparatus 101 acquires a point having the smallest distance from the added points (S2107).

ここで、品詞情報ベクトルによって、最も距離が小さい（近い）地点を検索した結果、端末ＩＤ１において、地点「日立新宿店」から最も距離が近い地点として、地点「日立国分寺店」が取得されたとする。 Here, the point “Hitachi Kokubunji store” is acquired as the closest point from the point “Hitachi Shinjuku store” in the terminal ID1 as a result of searching for the point with the smallest (closest) distance using the part of speech information vector. .

次に、サーバ側装置１０１は、読み履歴ベクトルデータベース１５００を参照し、Ｓ２１０７において取得された最も距離が近い地点における、形態素解析結果の読み情報の順序と、その地点ＩＤに設定された読み情報とを比較する。 Next, the server-side apparatus 101 refers to the reading history vector database 1500, and the order of the reading information of the morphological analysis result at the closest point acquired in S2107 and the reading information set in the point ID Compare

例えば、読み履歴ベクトルデータベース１５００の端末ＩＤ１において、地点「日立国分寺店」は、「こくぶんじひたち」という読みを優先すべきものとして指定されているとする。そして、図２２に示す解析結果１７００の読み１６０３を用いて、「こくぶんじひたち」という読みを構成できるか否かを判定する。この判定処理には、入力文字列「こくぶんじひたち」に、図２２に示す読み１６０３を用いて、最長一致法アルゴリズムによって全体が一致する文字列を構成できるか否かを判定してもよい。なお、この判定によって、読みを構成できないと判定された場合、サーバ側装置１０１は、Ｓ２１０７に戻り、次に距離が近い地点を取得してもよい。 For example, in the terminal ID 1 of the reading history vector database 1500, it is assumed that the point “Hitachi Kokubunji store” is designated as a priority given to reading “Kokubunji Hitachi”. Then, using the reading 1603 of the analysis result 1700 shown in FIG. 22, it is determined whether or not the reading “Kokubunji Hitachi” can be configured. In this determination processing, it is possible to determine whether or not a character string that matches the whole can be formed by the longest matching algorithm using the reading 1603 shown in FIG. 22 for the input character string “Kokubunji Hitachi”. . Note that if it is determined by this determination that the reading cannot be configured, the server-side apparatus 101 may return to S2107 and acquire the next closest point.

その結果、サーバ側装置１０１は、図２２に示す２行目の形態素の読み１６０３の「こくぶんじ」と、１行目の形態素の読み１６０３「ひたち」とを結合することによって「こくぶんじひたち」という読みが構成できると判定する。 As a result, the server-side apparatus 101 combines “Kokubunji” of the morpheme reading 1603 in the second row and “Kokubunji” of the morpheme reading 1603 in the first row shown in FIG. It is determined that the reading “Hitachi” can be constructed.

そして、サーバ側装置１０１は、この２行目の形態素の品詞１６０２「地名」と、１行目の形態素の品詞１６０２「固有名詞」の順番を、追加された地点名称の解析結果１６００に適用し、「しんじゅくひたち」という読みを生成する（Ｓ２１０８）。 Then, the server-side apparatus 101 applies the order of the morpheme part of speech 1602 “place name” in the second line and the morpheme part of speech 1602 “proper noun” in the first line to the analysis result 1600 of the added point name. , “Shinjuku Hitachi” is generated (S2108).

前述の処理の結果から、端末ＩＤ１の利用者は、追加された地点「日立新宿店」を、「しんじゅくひたち」という呼び方によって呼ぶ可能性が高いことが推測される。この読み「しんじゅくひたち」は、図１７に示す読み履歴ベクトルデータベース１５００に、追加された地点「日立新宿店」に対応する地点ＩＤに保存され、前述の処理に従って端末側装置１００へ図１３に示す読み優先順データ１２００の形式によって送信される。 From the result of the above-described processing, it is presumed that the user of the terminal ID 1 is highly likely to call the added location “Hitachi Shinjuku store” by the name “Shinjuku Hitachis”. This reading “Shinjuku Hitachi” is stored in the reading history vector database 1500 shown in FIG. 17 at the point ID corresponding to the added point “Hitachi Shinjuku store”, and is transferred to the terminal side device 100 according to the above-described processing. Is transmitted in the format of reading priority order data 1200 shown in FIG.

この結果、端末側装置１００は、新たに追加された地点「日立新宿店」に「しんじゅくひたち」という読みを第１候補として設定される。そして、端末側装置１００が備わるカーナビゲーション装置における音声ガイダンスは、これ以降、文字列「日立新宿店」に「しんじゅくひたち」という音声を読み上げる。 As a result, the terminal-side device 100 sets the reading “Shinjuku Hitachi” as the first candidate at the newly added location “Hitachi Shinjuku”. Then, the voice guidance in the car navigation device provided with the terminal-side device 100 reads out the voice “Shinjuku Hitachi” in the character string “Hitachi Shinjuku” thereafter.

なお、図２０に示す処理は、前述の新規に追加される地名にも、またはどの端末ＩＤ１５０１においても読みが指定されていない未知地名にも、適用することができる読み優先順決定処理１８０３である。図１９および図２０に示す処理は、読み履歴ベクトルデータベース１５００の読み履歴ベクトル間の距離を算出する処理であり、同様の流れによって行われる。ただし、図２０においては、サーバ側装置１０１から読み優先順データを送信する対象となる端末側装置１００を示す端末ＩＤ１５０１の読み履歴ベクトルのみを用いる。 The process shown in FIG. 20 is a reading priority order determination process 1803 that can be applied to the above-described newly added place name or an unknown place name that is not specified to be read by any terminal ID 1501. . The process shown in FIGS. 19 and 20 is a process of calculating the distance between reading history vectors in the reading history vector database 1500, and is performed in the same flow. However, in FIG. 20, only the reading history vector of the terminal ID 1501 indicating the terminal side device 100 that is the target of transmitting the reading priority order data from the server side device 101 is used.

また、前述において読み履歴ベクトル間の距離を取得するために、品詞情報ベクトルを用いたが、品詞情報のほかにも様々な言語情報またはＰＯＩに関する補足情報（飲食店、施設名などのカテゴリ情報など）を解析結果に含めてもよい。また、Ｓ２１０５における距離を算出するためのアルゴリズムには、前述の品詞情報ベクトル間におけるハミング距離のほかにも、品詞またはＰＯＩカテゴリの近さを考慮した重み付き距離などの様々な方法を用いてもよい。 In addition, the part-of-speech information vector is used in order to obtain the distance between the reading history vectors in the above description, but in addition to the part-of-speech information, various language information or supplementary information on POI (category information such as restaurants and facility names) ) May be included in the analysis results. In addition to the hamming distance between the part of speech information vectors described above, various methods such as a weighted distance considering the proximity of the part of speech or the POI category may be used as the algorithm for calculating the distance in S2105. Good.

この図２０に示す処理は、対象となった読み履歴ベクトルのすべての未知地名に読み情報が指定されるまで繰り返されてもよい。 The processing shown in FIG. 20 may be repeated until reading information is specified for all unknown place names of the target reading history vector.

（第２の実施形態）
第２の実施形態では、端末側装置１００およびサーバ側装置１０１という区別を設けずに、同じ装置によって本発明の処理を行う。第２の実施形態は、例えば、外部との通信機能を持たないカーナビゲーション装置においても適用できるし、また、通信機能を有していても、利用者の読み履歴情報をサーバ側装置１０１に送信することができない場合（セキュリティ等）にも適用できる。 (Second Embodiment)
In the second embodiment, the processing of the present invention is performed by the same device without distinguishing between the terminal device 100 and the server device 101. The second embodiment can be applied to, for example, a car navigation apparatus that does not have a communication function with the outside, and even if it has a communication function, the user's reading history information is transmitted to the server side apparatus 101. It can also be applied when it is not possible (security, etc.).

図２３は、本発明の第２の実施形態の端末側装置２２００の構成を示すブロック図である。 FIG. 23 is a block diagram illustrating a configuration of the terminal-side device 2200 according to the second embodiment of this invention.

第２の実施形態において、サーバ側装置１０１は使用されない。このため、端末側装置２２００は、第１の実施形態における端末側装置１００が備える手段と同様な手段を備えるが、読み履歴ベクトル送信手段４および読み優先順受信手段８を備えない。また、読み優先順受信手段８の代わりに、第１の実施形態においてサーバ側装置１０１に備えられた読み優先順決定手段１４と同じ機能を持つ読み優先順決定手段２２０８を備える。第１の実施形態における端末側装置１００に備わる手段と異なる手段は、この読み優先順決定手段２２０８のみであるため、この手段についてのみ後述し、その他の手段については省略する。 In the second embodiment, the server side device 101 is not used. For this reason, the terminal side device 2200 includes the same units as those included in the terminal side device 100 in the first embodiment, but does not include the reading history vector transmission unit 4 and the reading priority order receiving unit 8. Further, instead of the reading priority order receiving means 8, a reading priority order determining means 2208 having the same function as the reading priority order determining means 14 provided in the server side apparatus 101 in the first embodiment is provided. Since only the reading priority order determining means 2208 is different from the means provided in the terminal-side device 100 in the first embodiment, only this means will be described later, and the other means will be omitted.

この第２の実施形態において、端末側装置２２００は、第１の実施形態のように他の端末側装置１００の読み履歴ベクトル情報１０００、または読み優先順データ１２００を用いて、未知、すなわち新規の地名に読み優先順を決定できない。そのため、読み優先順決定手段２２０８では、第１の実施形態における読み優先順決定処理１８０３の中でも、形態素および品詞情報に基づく読み決定処理、すなわち、図２０に示す形態素ベクトル間距離に基づく読み優先順決定処理を行うことによって、未知、すなわち新規の地名に読み優先順を決定する。これによって、端末側装置２２００のみによって処理する構成が可能となる。 In the second embodiment, the terminal side device 2200 uses the reading history vector information 1000 or the reading priority order data 1200 of the other terminal side devices 100 as in the first embodiment, and thus the unknown, that is, new The order of priority for reading the place names cannot be determined. For this reason, the reading priority order determination unit 2208 includes the reading priority order based on the morpheme vector distance shown in FIG. 20 among the reading priority order determination processing 1803 in the first embodiment, that is, based on the morpheme vector distance shown in FIG. By performing the determination process, the reading priority order is determined for unknown, that is, new place names. This enables a configuration in which processing is performed only by the terminal-side device 2200.

すなわち、第２の実施の形態によれば、端末側装置は、複数の単語を音声にて出力し、前記単語と、当該単語に対応する読みとの組み合わせを保持し、前記保持された単語を形態素解析によって、品詞毎の単位文字列に分割し、前記分割された単位文字列が同じ品詞である場合、前記単位文字列の読みが類似する単語を取得し、前記取得された単語の読みに基づいて、前記単位文字列を並べる順番を特定し、前記特定された単位文字列の順番によって、当該取得された単語と単位文字列の品詞の配列が類似する単語の単位文字列を並べ、前記並べられた単位文字列に基づいて、当該単語の読みを生成し、前記生成された読みによって、前記組み合わせに含まれる単語の読みを更新し、前記更新された組み合わせを用いて、前記単語を音声にて出力するためのデータを作成することを特徴とする。 That is, according to the second embodiment, the terminal-side device outputs a plurality of words by voice, holds a combination of the word and a reading corresponding to the word, and stores the held word. Dividing into unit character strings for each part of speech by morphological analysis, and when the divided unit character strings are the same part of speech, obtain a word whose reading of the unit character string is similar, and read the acquired word Based on the order of arranging the unit character strings, and by arranging the unit character strings of the words that are similar in order of part of speech of the acquired word and the unit character string according to the order of the specified unit character string, A reading of the word is generated based on the arranged unit character strings, the reading of the word included in the combination is updated by the generated reading, and the word is sounded using the updated combination. At Characterized by generating data for force.

（第３の実施形態）
また、第１および第２の実施形態の中間段階として、第３の実施形態の端末側装置１００は、サーバ側装置１０１に利用者の読み履歴ベクトル情報１０００全体を送らずに、読み優先順を決定したい地点ＩＤのみを送信し、サーバ側装置１０１によって処理された結果の読み優先順データ１２００を受信してもよい。この場合、第２の実施形態と同じく読み履歴ベクトル間の距離計算ができないため、サーバ側装置１０１における読み優先順決定処理１８０３において、形態素および品詞情報の近さに基づく図２０の処理を行う。 (Third embodiment)
As an intermediate stage between the first and second embodiments, the terminal-side device 100 of the third embodiment sets the reading priority order without sending the entire user's reading history vector information 1000 to the server-side device 101. Only the point ID to be determined may be transmitted, and the reading priority order data 1200 as a result processed by the server-side apparatus 101 may be received. In this case, since the distance between reading history vectors cannot be calculated as in the second embodiment, in the reading priority order determination processing 1803 in the server side device 101, the processing of FIG.

例えば、端末側装置１００は、通信手段を持つがサーバ側装置１０１に大量のデータを送ることができない場合、または端末側装置１００ではＣＰＵの処理性能などの制限によって、第２の実施形態を適用できない場合などに、第３の実施形態は適用可能である。 For example, the terminal-side device 100 has the communication means but cannot send a large amount of data to the server-side device 101, or the terminal-side device 100 applies the second embodiment due to limitations on the processing performance of the CPU. The third embodiment can be applied to cases where it is impossible.

本発明の第１ないし第３の実施形態によれば、例えばカーナビゲーション装置に備わる端末側装置１００は、音声入力された地点名の読み情報を記録しておく読み履歴記憶手段５と、その情報を音声入力ごとに更新する読み履歴更新手段６と、記憶されている読み履歴ベクトル情報１０００をサーバ側装置１０１に送信する読み履歴送信手段４と、サーバ側装置１０１から送信される読み優先順情報を受信する読み優先順受信手段８と、読み履歴記憶手段５に格納されている読み履歴ベクトル情報１０００を利用して漢字かなテキストへの読みを付与する読み決定手段２とを備えることによって、利用者が音声によって入力した地点名の呼び方を用いて、以降の音声ガイダンスを行うことができる。また、端末側装置１００は、利用者がまだ入力していない地点名称にも、サーバ側装置１０１から送信された読みを用いることによって、利用者がその地点名称を呼ぶ可能性の高い読みによって地点名を読み上げることができる。 According to the first to third embodiments of the present invention, for example, the terminal-side device 100 provided in the car navigation device includes the reading history storage unit 5 that records the reading information of the spot name inputted by voice, and the information. Reading history update means 6 for updating the information for each voice input, reading history transmission means 4 for transmitting stored reading history vector information 1000 to the server side apparatus 101, and reading priority order information transmitted from the server side apparatus 101 By using the reading priority order receiving means 8 for receiving the reading and the reading determining means 2 for giving a reading to the kanji text using the reading history vector information 1000 stored in the reading history storage means 5. Subsequent voice guidance can be performed using the name of the point name entered by the person by voice. The terminal-side device 100 also uses the reading transmitted from the server-side device 101 for the spot name that has not yet been input by the user, so that the user is likely to call the spot name. You can read the name.

すなわち、第３の実施の形態によれば、端末側装置は、前記組み合わせのうち、一部の単語と、当該単語に対応する読みとの組み合わせを前記サーバに送信し、前記サーバ側装置は、前記送信された一部の組み合わせを使用する。 That is, according to the third embodiment, the terminal-side device transmits a combination of a part of the words and a reading corresponding to the word to the server, and the server-side device Use the transmitted partial combination.

１００端末側装置
１０１サーバ側装置
１テキスト入力手段
２読み決定手段
３音声合成手段
４読み履歴送信手段
５読み履歴記憶手段
６読み履歴更新手段
７音声入力手段
８読み優先順受信手段
９音声出力手段
１１読み履歴受信手段
１２読み優先順送信手段
１３読み履歴記憶手段
１４読み優先順決定手段
１５新規読み受信手段
１０００読み履歴ベクトル情報
１２００読み優先順データ
１５００読み履歴ベクトルデータベース DESCRIPTION OF SYMBOLS 100 Terminal side device 101 Server side device 1 Text input means 2 Reading determination means 3 Speech synthesis means 4 Reading history transmission means 5 Reading history storage means 6 Reading history update means 7 Voice input means 8 Reading priority order receiving means 9 Voice output means 11 Reading history receiving means 12 Reading priority order sending means 13 Reading history storage means 14 Reading priority order determining means 15 New reading receiving means 1000 Reading history vector information 1200 Reading priority order data 1500 Reading history vector database

Claims

A speech reading system comprising a plurality of speech reading terminals that read out a plurality of words, and a reading information update server connected to the voice reading terminals via a network,
The voice reading terminal is
Holding a combination of the word and the reading specified for the word;
Sending the combination to the reading information update server;
The reading information update server
Transmitted from a plurality of said voice reading terminal, a plurality of the combinations held,
Obtaining the word for which the reading is not specified from the combination;
Among the plurality of combinations, the reading in the word identifies a plurality of other combinations similar to the reading in the word of the combination;
Extracting the reading of the word for which the reading is not specified in the combination from the other combinations,
The extracted readings specify the readings of words for which the reading is not specified,
Updating the combination with the word and the specified reading;
Sending the updated combination to the voice reading terminal;
The voice reading terminal is
Updating the retained combination with the transmitted combination;
A speech-to-speech system, wherein the word is read out based on the updated combination.

The voice reading terminal is
Specify the reading for the word input by the user for the word included in the combination,
2. The speech reading system according to claim 1, wherein when the word is read out, the designated reading is preferentially used.

The reading information update server
Add a new word to the combination,
The speech reading system according to claim 1, wherein the combination to which the new word is added is transmitted to the speech reading terminal.

  A speech reading system comprising a plurality of speech reading terminals that read out a plurality of words, and a reading information update server connected to the voice reading terminals via a network,
  The voice reading terminal is
  Holding a combination of the word and the reading specified for the word;
  Sending the combination to the reading information update server;
  The reading information update server
  Holding a plurality of the combinations transmitted from a plurality of the speech reading terminals,
  Dividing the plurality of words into unit character strings for each part of speech by morphological analysis,
  In the same part of speech, obtaining the other word whose character string is similar to the character string of the word,
  Based on the reading specified for the other word, identify the order in which the unit character strings are arranged,
  Arranging the unit character strings of the words based on the order of the specified unit character strings,
  Generate word readings from the aligned unit strings,
  Updating the reading of the combination with the generated reading;
  Sending the updated combination to the voice reading terminal;
  The voice reading terminal is
  Updating the retained combination with the transmitted combination;
  A speech-to-speech system, wherein the word is read out based on the updated combination.

  A speech-to-speech device that reads out multiple words,
  Holding a combination of the word and the reading specified for the word;
  Add the new word to the combination,
  Dividing the plurality of words into unit character strings for each part of speech by morphological analysis,
  In the same part of speech, obtaining the other word whose character string is similar to the character string of the word,
  Based on the reading specified for the other word, specify the order of the unit character string,
  Extracting the reading of the word from the character string of the word arranged based on the order of the specified unit character string,
  A speech-to-speech terminal, wherein the reading of the combination is updated by the extracted reading.