JP6374771B2

JP6374771B2 - Retrieval device, program, and method for retrieving vague record of user's memory

Info

Publication number: JP6374771B2
Application number: JP2014234174A
Authority: JP
Inventors: 服部　元; 元服部; 滝嶋　康弘; 康弘滝嶋
Original assignee: KDDI Corp
Current assignee: KDDI Corp
Priority date: 2014-11-19
Filing date: 2014-11-19
Publication date: 2018-08-15
Anticipated expiration: 2034-11-19
Also published as: JP2016099694A

Description

本発明は、ユーザが過去に発声又は記述したレコード（音声データ又はテキスト）を検索する技術に関する。 The present invention relates to a technique for retrieving a record (voice data or text) previously spoken or described by a user.

従来、会話を常時録音する装置について、会話の開始時刻及び終了時刻を検索キーとして、録音済みの音声データから任意の時間の会話を抽出する技術がある（例えば特許文献１参照）。この技術によれば、使用記憶容量を最小限に抑えるものであるが、記憶すべき音声データに基づく会話の開始時刻は、人手で指定する必要があり、音声データが多くなるほど、人手がかかることとなる。 2. Description of the Related Art Conventionally, there is a technique for extracting a conversation at an arbitrary time from recorded voice data using a conversation start time and an end time as search keys for an apparatus that constantly records conversation (see, for example, Patent Document 1). According to this technology, the used storage capacity is minimized, but it is necessary to manually specify the start time of the conversation based on the voice data to be stored, and the more voice data the more time is required. It becomes.

また、留守番電話機の録音機能について、音声メッセージからの感情の度合いを認識し、その感情の度合いに基づいて、音声メッセージに重要度（優先度）を割り当てる技術もある（例えば特許文献２参照）。ユーザは、優先度が高い音声メッセージほど、再生順を早くすることができる。この技術によれば、１区切りで録音された音声データに対し、自動的に優先度が割り当てられる。 There is also a technique for recognizing the degree of emotion from a voice message and assigning importance (priority) to the voice message based on the degree of emotion for the recording function of the answering machine (see, for example, Patent Document 2). The user can make the playback order earlier as the voice message has a higher priority. According to this technique, priority is automatically assigned to audio data recorded in one segment.

特開２００５−１１４８９８号公報JP 2005-114898 A 特開２００７−４９６５７号公報JP 2007-49657 A

本願の発明者らは、従来に無い新たな課題（サービス）として「ユーザにとって曖昧な記憶を検索することはできないか？」と考えた。例えば、ユーザに提供可能な以下のようなケースを想定した。
（例１）旅行の記憶を思い出すケース
旅行の話が多くなった場合、結果としてどの旅行の思い出だったのか、記憶が曖昧になる。例えば「すごくいい景色の部屋に泊まった旅館の思い出は、どの旅行だったか？」「きれいな富士山が見られたのは何年前の旅行だったか？」のようなケースがある。
（例２）仕事での会話を思い出すケース
受付業務や営業など、連続してほぼ同様の作業をしているために、誰にどのような会話をしたのか、記憶が曖昧になる。例えば「お客様の趣味や仕事内容を数日後にまた聞いてしまう」のようなケースがある。 The inventors of the present application have thought that “can an ambiguous memory be searched for a user?” As a new problem (service) that has not existed before. For example, the following cases that can be provided to the user are assumed.
(Example 1) The case of remembering the memory of a trip When there are many stories about a trip, as a result, the memory of which trip was a memory becomes ambiguous. For example, there are cases such as “What was the trip to the memory of the ryokan that stayed in a room with a very nice view?” “How many years ago was it to see the beautiful Mount Fuji?”.
(Example 2) Case of recalling conversations at work Since we are doing almost the same work continuously, such as reception work and sales, it becomes ambiguous to remember who and what kind of conversation. For example, there are cases such as "I will hear your hobbies and work contents again in a few days".

ユーザの記憶が曖昧となるレコードには、以下のような特徴がある。
（１）類似した多くの経験がある
（２）現在日時から見て遠い過去である A record in which a user's memory is ambiguous has the following characteristics.
(1) There are many similar experiences (2) The past is far from the current date and time

そこで、本発明は、従来に無い新たなサービスとして、ユーザの記憶が曖昧なレコード（音声データ又はテキスト）を検索することができる検索装置、プログラム及び方法を提供することを目的とする。 Therefore, an object of the present invention is to provide a search device, a program, and a method capable of searching for a record (voice data or text) in which a user's memory is ambiguous as a new service that has not existed before.

本発明によれば、テキストを含む複数のレコードを蓄積したレコード蓄積手段から、クエリテキストに対してユーザの記憶が曖昧と推定されるレコードを検索する検索装置であって、
レコードには、記録日時が付与されており、
レコード蓄積手段を用いて、複数のレコードを、所定数のカテゴリに分類するカテゴリ分類手段と、
レコード蓄積手段を用いて、クエリテキストと比較して、テキスト的に所定条件以上で類似するレコードを検索する類似レコード検索手段と、
検索されたレコード毎に、当該レコードが分類されるカテゴリに含まれるレコード数と、当該レコードの記録日時から現在日時までの時間差とを乗算した曖昧度を算出する曖昧度算出手段と、
クエリテキストに対して、検索結果として、曖昧度が高いレコードから順に出力する検索結果出力手段と
を有することを特徴とする。 According to the present invention, there is provided a search device for searching a record in which a user's memory is estimated to be ambiguous with respect to a query text from a record storage unit that stores a plurality of records including text,
The record has a recording date and time,
Category classification means for classifying a plurality of records into a predetermined number of categories using the record storage means;
Similar record search means for searching for records that are similar in terms of text or more in comparison with query text using record storage means;
Ambiguity calculation means for calculating the ambiguity by multiplying the number of records included in the category into which the record is classified for each searched record and the time difference from the recording date and time of the record to the current date and time;
Search result output means for outputting the query text as a search result in descending order of the ambiguity.

本発明の検索装置における他の実施形態によれば、
レコード蓄積手段は、レコード毎に、曖昧度算出手段を用いて算出された曖昧度を対応付けて蓄積することも好ましい。 According to another embodiment of the search device of the present invention,
The record storage means preferably stores the ambiguity calculated by using the ambiguity calculation means in association with each record.

本発明の検索装置における他の実施形態によれば、
マイクによって取得された音声データをテキストに変換し、該テキストを所定時間以上の無音区間毎に区分したレコードを生成し、該レコードをレコード蓄積手段に記憶する音声認識手段を更に有することも好ましい。 According to another embodiment of the search device of the present invention,
It is also preferable to further include voice recognition means for converting the voice data acquired by the microphone into text, generating a record in which the text is divided into silent sections of a predetermined time or longer, and storing the record in the record storage means.

本発明の検索装置における他の実施形態によれば、
マイク及び音声認識手段は、常時起動されたものであって、ユーザの日常的な会話におけるログとしての音声データを常時収集し、レコード蓄積手段に蓄積する
ことも好ましい。 According to another embodiment of the search device of the present invention,
It is also preferable that the microphone and the voice recognition unit are always activated, and the voice data as a log in the daily conversation of the user is always collected and stored in the record storage unit.

本発明の検索装置における他の実施形態によれば、
カテゴリ分類手段は、教師無し分類アルゴリズムを用いて、複数のレコードを各カテゴリに属するように分類することも好ましい。 According to another embodiment of the search device of the present invention,
The category classification means preferably classifies a plurality of records so as to belong to each category using an unsupervised classification algorithm.

本発明の検索装置における他の実施形態によれば、
カテゴリ分類手段は、新規レコードとして新規テキストが取得される毎に、当該新規テキストと各カテゴリに属するレコードのテキストとを比較し、当該新規テキストをいずれか１つのカテゴリに属するように分類することも好ましい。 According to another embodiment of the search device of the present invention,
Each time new text is acquired as a new record, the category classification means compares the new text with the text of a record belonging to each category, and classifies the new text so as to belong to any one category. preferable.

本発明の検索装置における他の実施形態によれば、
レコード蓄積手段に対して、曖昧度が所定値以上又は以下となるレコードを削除することも好ましい。 According to another embodiment of the search device of the present invention,
It is also preferable to delete records whose ambiguity is greater than or less than a predetermined value from the record storage means.

本発明によれば、テキストを含む複数のレコードを蓄積したレコード蓄積手段から、クエリテキストに対してユーザの記憶が曖昧と推定されるレコードを検索するように、装置に搭載されたコンピュータを機能させる検索プログラムであって、
レコードには、記録日時が付与されており、
レコード蓄積手段を用いて、複数のレコードを、所定数のカテゴリに分類するカテゴリ分類手段と、
レコード蓄積手段を用いて、クエリテキストと比較して、テキスト的に所定条件以上で類似するレコードを検索する類似レコード検索手段と、
検索されたレコード毎に、当該レコードが分類されるカテゴリに含まれるレコード数と、当該レコードの記録日時から現在日時までの時間差とを乗算した曖昧度を算出する曖昧度算出手段と、
クエリテキストに対して、検索結果として、曖昧度が高いレコードから順に出力する検索結果出力手段と
してコンピュータを機能させることを特徴とする。 According to the present invention, a computer mounted in an apparatus is caused to function so as to search a record in which a user's memory is estimated to be ambiguous with respect to a query text from a record storage unit that stores a plurality of records including text. A search program,
The record has a recording date and time,
Category classification means for classifying a plurality of records into a predetermined number of categories using the record storage means;
Similar record search means for searching for records that are similar in terms of text or more in comparison with query text using record storage means;
Ambiguity calculation means for calculating the ambiguity by multiplying the number of records included in the category into which the record is classified for each searched record and the time difference from the recording date and time of the record to the current date and time;
The computer is caused to function as a search result output means for sequentially outputting records with a high degree of ambiguity as search results for the query text.

本発明によれば、テキストを含む複数のレコードを蓄積したレコード蓄積部から、クエリテキストに対してユーザの記憶が曖昧と推定されるレコードを検索する装置の検索方法であって、
レコードには、記録日時が付与されており、
レコード蓄積部を用いて、複数のレコードを、所定数のカテゴリに分類する第１のステップと、
レコード蓄積部を用いて、クエリテキストと比較して、テキスト的に所定条件以上で類似するレコードを検索する第２のステップと、
検索されたレコード毎に、当該レコードが分類されるカテゴリに含まれるレコード数と、当該レコードの記録日時から現在日時までの時間差とを乗算した曖昧度を算出する第３のステップと、
クエリテキストに対して、検索結果として、曖昧度が高いレコードから順に出力する第４のステップと
を有することを特徴とする。 According to the present invention, there is provided a search method of an apparatus for searching a record in which a user's memory is estimated to be ambiguous with respect to a query text from a record storage unit that stores a plurality of records including text,
The record has a recording date and time,
A first step of classifying a plurality of records into a predetermined number of categories using a record storage unit;
A second step of using a record storage unit to search for records that are similar in terms of text or more compared to the query text;
A third step of calculating, for each retrieved record, an ambiguity obtained by multiplying the number of records included in the category into which the record is classified and the time difference from the recording date and time of the record to the current date and time;
And a fourth step of outputting the query text in order from the record with the higher ambiguity as a search result.

本発明の検索装置、プログラム及び方法によれば、ユーザの記憶が曖昧なレコードを検索することができる。 According to the search device, program, and method of the present invention, it is possible to search for a record in which a user's memory is ambiguous.

本発明における検索装置の機能構成図である。It is a functional block diagram of the search device in this invention. 本発明におけるカテゴリ分類及び曖昧度算出を表す説明図である。It is explanatory drawing showing the category classification | category and ambiguity calculation in this invention. 本発明における類似レコード検索及び検索結果出力を表す説明図である。It is explanatory drawing showing the similar record search and search result output in this invention. 本発明における端末及び検索サーバからなるシステム構成図である。It is a system configuration figure consisting of a terminal and a search server in the present invention.

以下、本発明の実施の形態について、図面を用いて詳細に説明する。 Hereinafter, embodiments of the present invention will be described in detail with reference to the drawings.

図１は、本発明における検索装置の機能構成図である。 FIG. 1 is a functional configuration diagram of a search device according to the present invention.

図１の検索装置１は、過去の事象としてのクエリテキストを入力し、ユーザの記憶が曖昧と推定されるレコード（音声データ又はテキスト）を検索するものである。検索対象となるレコードは、ユーザの発声又は記述したログであって、生活全般の中で常時収集されたものである。検索装置１は、ユーザからのログを取得するべく、ユーザによって常時所持されるスマートフォンやウェアラブルデバイスのような端末に実装される。 The search apparatus 1 in FIG. 1 inputs a query text as a past event, and searches for a record (voice data or text) in which a user's memory is estimated to be ambiguous. The record to be searched is a user's utterance or a written log, and is always collected in the whole life. The search device 1 is mounted on a terminal such as a smartphone or a wearable device that is always possessed by the user in order to acquire a log from the user.

図１によれば、端末１は、ハードウェアとして、マイク部１０１と、タッチパネルディスプレイ１０２と、スピーカ部１０３とを有する。マイク部１０１は、ユーザの音声を常時収集するためのものであって、例えばピンマイクのようなデバイスである。タッチパネルディスプレイ１０２は、ユーザからクエリテキストを入力し、検索結果を表示するユーザインタフェースである。スピーカ部１０３は、検索結果としての音声データを再生する。タッチパネルディスプレイ１０２に表示されたキャラクタからの声が、スピーカ部１０３から発声されることによって、ユーザは、検索結果をキャラクタエージェントが回答しているように認識することができる。 According to FIG. 1, the terminal 1 includes a microphone unit 101, a touch panel display 102, and a speaker unit 103 as hardware. The microphone unit 101 is for collecting user's voice at all times, and is a device such as a pin microphone, for example. The touch panel display 102 is a user interface that inputs query text from a user and displays search results. The speaker unit 103 reproduces audio data as a search result. When the voice from the character displayed on the touch panel display 102 is uttered from the speaker unit 103, the user can recognize the search result as if the character agent is answering.

また、図１によれば、端末１は、音声蓄積部１１と、音声認識部１２と、レコード蓄積部１３と、カテゴリ分類部１４と、曖昧度算出部１５と、類似レコード検索部１６と、検索結果出力部１７とを有する。これら機能構成部は、端末に搭載されたコンピュータを機能させるプログラムを実行することによって実現できる。 Further, according to FIG. 1, the terminal 1 includes a voice storage unit 11, a voice recognition unit 12, a record storage unit 13, a category classification unit 14, an ambiguity calculation unit 15, a similar record search unit 16, And a search result output unit 17. These functional components can be realized by executing a program that causes a computer installed in the terminal to function.

［音声蓄積部１１］
音声蓄積部１１は、マイク部１０１によって取得された音声データを蓄積する。ここで、音声データの記録単位は、レコード蓄積部１３に蓄積されたレコードと対応付けて記憶される。即ち、検索結果としてのレコードと、音声データとが対応付けられている。 [Voice storage unit 11]
The audio storage unit 11 stores the audio data acquired by the microphone unit 101. Here, the recording unit of the audio data is stored in association with the record stored in the record storage unit 13. That is, the record as the search result is associated with the audio data.

［音声認識部１２］
音声認識部１２は、マイク部１０１によって取得された音声データをテキストに変換する。ここで、テキストとは、所定時間以上の無音区間毎に区分したレコード毎に生成されたものである。例えば、無音区間がｎ秒（例えば１００秒）以上継続した場合に、テキストをレコードとして区切るものであってもよい。また、話者認識技術を用いて、当該端末１を所持するユーザのみの発話が、所定の無音区間継続した場合に、レコードとして区切るものであってもよい。 [Voice recognition unit 12]
The voice recognition unit 12 converts the voice data acquired by the microphone unit 101 into text. Here, the text is generated for each record divided into silent sections of a predetermined time or longer. For example, when the silent section continues for n seconds (for example, 100 seconds) or longer, the text may be separated as a record. In addition, the speaker recognition technology may be used to divide the record as a record when the utterance of only the user having the terminal 1 continues for a predetermined silent period.

マイク部１０１及び音声認識部１２は、常時起動されたものであって、ユーザの日常的な会話ログとしての音声データを常時収集し、レコード蓄積部１３へ出力するものであってもよい。 The microphone unit 101 and the voice recognition unit 12 may be constantly activated, and may always collect voice data as a daily conversation log of the user and output it to the record storage unit 13.

［レコード蓄積部１３］
レコード蓄積部１３は、テキストを含む複数のレコードを蓄積する。本発明によれば、レコード（テキスト）毎に、「記録日時」及び「曖昧度」がを対応付けて蓄積されている。レコード蓄積部１３は、音声認識部１２から出力されたテキスト（会話ログ）のみならず、タッチパネルディスプレイ１０２から入力されたテキスト（筆談ログ）も、蓄積することが好ましい。 [Record accumulation unit 13]
The record storage unit 13 stores a plurality of records including text. According to the present invention, “recording date and time” and “ambiguity” are stored in association with each record (text). The record storage unit 13 preferably stores not only the text (conversation log) output from the voice recognition unit 12 but also the text (handwriting log) input from the touch panel display 102.

図２は、本発明におけるカテゴリ分類及び曖昧度算出を表す説明図である。 FIG. 2 is an explanatory diagram showing category classification and ambiguity calculation in the present invention.

［カテゴリ分類部１４］
カテゴリ分類部１４は、レコード蓄積部１３に蓄積された多数のレコードを、所定数のカテゴリに分類する（図２のカテゴリ分類部１４参照）。この分類には、「教師無し分類アルゴリズム」を用いたものであってもよい。例えば、ＬＤＡ(Latent Dirichlet allocation)や、ＳＶＭ(Support Vector Machine)を用いることができる。 [Category classification unit 14]
The category classification unit 14 classifies a large number of records accumulated in the record accumulation unit 13 into a predetermined number of categories (see the category classification unit 14 in FIG. 2). For this classification, an “unsupervised classification algorithm” may be used. For example, LDA (Latent Dirichlet allocation) or SVM (Support Vector Machine) can be used.

「ＬＤＡ」とは、各レコードを、各トピックグループに属する確からしさ（トピック比率）を算出し、トピックグループに分類する技術をいう。トピック分類は、レコード（テキスト）を解析するために、bag-of-wordsで表現された文書生成過程を、確率的にモデル化したものである。ＬＤＡは、単語文書行列を次元圧縮する技術（ＬＳＩ(latent Semantic Indexin)）に対して、単語の特徴ベクトルに揺らぎに基づく確率的な枠組みを導入したものである。その圧縮した次元の集合をトピックという。 “LDA” refers to a technique for calculating the probability (topic ratio) of each record belonging to each topic group and classifying the records into topic groups. Topic classification is a probabilistic model of the document generation process expressed in bag-of-words in order to analyze records (text). LDA introduces a probabilistic framework based on fluctuations in a feature vector of a word to a technique (LSI (latent Semantic Indexin)) for dimensional compression of a word document matrix. The compressed set of dimensions is called a topic.

「ＳＶＭ」とは、パターン認識に適用されるものであって、線形入力素子を用いて２クラスのパターン識別器を構成し、学習サンプルを用いて各特徴値との距離を算出することによって、線形入力素子のパラメータを学習する。サポートベクタマシンの識別エンジンは、抽出された複数の学習特徴量を教師データとして学習する。対象特徴量（新たに入力されたログのテキストの特徴量）が、特定カテゴリの学習特徴量（当該カテゴリの特徴量）に属するか否かを判定する。サポートベクタマシンの識別エンジンによれば、明確なルールを生成せず、外見上ブラックボックスであるが、特定カテゴリ毎の学習特徴量をサポートベクトルとして生成する。 “SVM” is applied to pattern recognition, and comprises two classes of pattern discriminators using linear input elements, and calculates distances from feature values using learning samples, Learn the parameters of a linear input element. The identification engine of the support vector machine learns the extracted plurality of learning feature quantities as teacher data. It is determined whether or not the target feature amount (the feature amount of the newly input log text) belongs to the learning feature amount of the specific category (the feature amount of the category). According to the identification engine of the support vector machine, a clear rule is not generated, and it looks like a black box, but a learning feature amount for each specific category is generated as a support vector.

カテゴリ分類部１４は、例えば以下のタイミングで、レコード蓄積部１３に対してカテゴリを分類する。
（１）所定期間毎（例えば１時間毎や２週間毎）
（２）所定数（例えば１００件）以上のレコードが、レコード蓄積部１３に蓄積される毎
そして、レコード毎に、割り当てられたカテゴリＩＤ(IDentifier)が付与される。例えば300個のカテゴリに分類した場合、カテゴリＩＤは1〜300の数値となる。 The category classification unit 14 classifies the category with respect to the record storage unit 13 at the following timing, for example.
(1) Every predetermined period (for example, every hour or every two weeks)
(2) Every time a predetermined number (for example, 100) or more records are accumulated in the record accumulation unit 13, an assigned category ID (IDentifier) is assigned to each record. For example, when it is classified into 300 categories, the category ID is a numerical value of 1 to 300.

［曖昧度算出部１５］
曖昧度算出部１５は、検索されたレコード毎に、当該レコードが分類されるカテゴリに含まれるレコード数と、当該レコードの記録日時から現在日時までの時間差とを乗算した曖昧度を算出する（図２の曖昧度算出部１５参照）。
曖昧度＝カテゴリのレコード数×（現在日時―記録日時） [Ambiguity calculator 15]
The ambiguity calculation unit 15 calculates, for each searched record, an ambiguity obtained by multiplying the number of records included in the category into which the record is classified and the time difference from the recording date and time of the record to the current date and time (see FIG. 2 ambiguity calculation unit 15).
Ambiguity = number of records in category x (current date-recording date)

曖昧度は、例えば以下のように算出される。
カテゴリのレコード数＝100個
現在日時−記録日時＝24,000時間
曖昧度＝100×24,000＝2,400,000
この場合、カテゴリのレコード数が多いほど、曖昧度は高くなり、レコードの記録が過去に遠いほど、曖昧度は高くなる。 The ambiguity is calculated as follows, for example.
Number of records in category = 100 Current date and time-Recording date and time = 24,000 hours Ambiguity = 100 x 24,000 = 2,400,000
In this case, the greater the number of records in the category, the higher the ambiguity level, and the farther the record recording is in the past, the higher the ambiguity level.

図３は、本発明における類似レコード検索及び検索結果出力を表す説明図である。 FIG. 3 is an explanatory diagram showing similar record search and search result output in the present invention.

［類似レコード検索部１６］
類似レコード検索部１６は、レコード蓄積部１３を用いて、クエリテキストと比較して、テキスト的に所定条件以上で類似するレコードを検索する（図３の類似レコード検索部１６参照）。 [Similar record search unit 16]
The similar record search unit 16 uses the record storage unit 13 to search for a record that is similar in terms of text as compared to the query text (see the similar record search unit 16 in FIG. 3).

クエリテキストとレコードテキストとの間の類似度は、例えば両テキストの単語間の品詞又は意味を解析したベクトルを算出し、これらベクトルのコサイン類似度（Bag of wordsベース）を算出するものであってもよい。勿論、出現頻度が高い重要語（名詞）を蓄積した重要語辞書を用いて、重要語のみについて類似度を算出するものであってもよい。コサイン距離は、例えば以下の概念式で算出される。
クエリテキストの単語の特徴ベクトル：Ｄ
レコードテキストの単語の特徴ベクトル：Ｅ
２つの文の類似度：sim(Ｄ,Ｅ)
sim(Ｄ,Ｅ)＝cosθ＝（Ｄ・Ｅ）／(|Ｄ||Ｅ|)
コサイン距離は、同じ単語同士である場合には類似度重みを１とする。 The similarity between the query text and the record text is, for example, calculating a vector that analyzes the part of speech or meaning between words of both texts, and calculating the cosine similarity (based on Bag of words) of these vectors. Also good. Of course, the similarity may be calculated only for the important word by using the important word dictionary storing the important words (nouns) having high appearance frequency. The cosine distance is calculated by the following conceptual expression, for example.
Query text word feature vector: D
Record text word feature vector: E
Similarity between two sentences: sim (D, E)
sim (D, E) = cos θ = (D · E) / (| D || E |)
When the cosine distance is the same word, the similarity weight is set to 1.

［検索結果出力部１７］
検索結果出力部１７は、クエリテキストに対して、検索結果として、曖昧度が高いレコードから順に出力する（図３の検索結果出力部１７参照）。これによって、ユーザの記憶が曖昧なレコードから順に、ユーザへ提示することができる。 [Search result output unit 17]
The search result output unit 17 outputs the query text as a search result in descending order of the ambiguity (see the search result output unit 17 in FIG. 3). Thereby, it is possible to present to the user in order from the record in which the user's memory is ambiguous.

検索結果出力部１７は、ユーザに対するキャラクタエージェント機能を有するものであってもよい。そのために、検索結果出力部１７は、検索結果として出力すべきテキストを、音声データに変換する音声変換機能と、その音声データに連動するキャラクタを表示するキャラクタ表示機能も有する。そして、タッチパネルディスプレイ１０２に表示されたキャラクタからの声を、スピーカ部１０３から発声されるように制御する。これによって、ユーザは、検索結果を、キャラクタエージェントが回答しているように認識することができる。 The search result output unit 17 may have a character agent function for the user. For this purpose, the search result output unit 17 has a voice conversion function for converting text to be output as a search result into voice data, and a character display function for displaying a character linked to the voice data. Then, control is performed so that the voice from the character displayed on the touch panel display 102 is uttered from the speaker unit 103. Thereby, the user can recognize the search result as if the character agent is answering.

＜レコード蓄積部１３における記憶量の削減＞
レコード蓄積部１３が、過去にログとして収集された大量のレコードを全て蓄積することは、記憶容量を無駄に使うだけであって、高コストとなる。また、検索する際に、不必要なデータの検索も必要となり、検索の処理効率も低下する。そのために、できる限り、ユーザの記憶が曖昧なレコードのみを蓄積することが好ましい。 <Reduction in storage amount in the record storage unit 13>
Accumulating all the large number of records collected as logs in the past by the record accumulating unit 13 only uses the storage capacity wastefully and is expensive. Further, when searching, it is necessary to search for unnecessary data, and the processing efficiency of the search also decreases. For this reason, it is preferable to accumulate only the ambiguous records of the user as much as possible.

レコード蓄積部１３は、曖昧度が所定値以上又は以下となるレコードを削除する（２次的記憶容量へ移行する）ものであってもよい。
曖昧度が所定値以上：類似した経験が多く、且つ、現在日時から見て遠い過去
曖昧度が所定値以下：類似した経験が少なく、且つ、現在日時から見て近い過去
曖昧度が所定値以上又は以下で区別しているが、いずれのレコードであっても検索することができる。これによって、レコード蓄積部１３に、将来検索対象とならないような無駄なデータを削減し、検索の処理効率の低下を防ぐことができる。 The record storage unit 13 may delete a record whose degree of ambiguity is greater than or equal to a predetermined value (shifts to a secondary storage capacity).
Ambiguity is greater than or equal to a predetermined value: Past with a lot of similar experiences and far from the current date and time Ambiguity is less than or equal to a predetermined value: Past with few similar experiences and near from the current date and time Ambiguity is greater than or equal to a predetermined value Or, as will be described below, any record can be searched. As a result, it is possible to reduce unnecessary data that will not be searched in the future in the record storage unit 13 and prevent a decrease in search processing efficiency.

図４は、本発明における端末及び検索サーバからなるシステム構成図である。 FIG. 4 is a system configuration diagram including a terminal and a search server in the present invention.

図４は、図１と比較して、本発明を、端末及び検索サーバからなるシステムとして構成したものである。図４によれば、端末１は、ユーザインタフェースとしてのマイク部１０１、タッチパネルディスプレイ１０２及びスピーカ部１０３と、音声認識部１２とを有すると共に、音声送信部を更に有する。音声送信部は、マイク１０１部によって取得された音声データを、検索サーバ２へ送信する。 FIG. 4 shows that the present invention is configured as a system including a terminal and a search server, as compared with FIG. According to FIG. 4, the terminal 1 includes a microphone unit 101 as a user interface, a touch panel display 102 and a speaker unit 103, and a voice recognition unit 12, and further includes a voice transmission unit. The voice transmission unit transmits the voice data acquired by the microphone 101 unit to the search server 2.

また、図４によば、検索サーバ２は、更に、音声受信部と、レコード受信部と、クエリテキスト入力部とを有する。音声受信部は、端末１から音声データ自体を受信し、音声蓄積部１１に記憶させる。レコード受信部は、端末１によって音声認識されたレコードテキストを受信し、レコード蓄積部１３へ出力する。クエリテキスト入力部は、端末１から、ユーザによって入力されたクエリテキストを受信し、類似レコード検索部１６へ出力する。尚、検索結果出力部１７は、検索結果としての音声データ及び／又はテキストを端末１へ送信する。 Further, according to FIG. 4, the search server 2 further includes a voice receiving unit, a record receiving unit, and a query text input unit. The voice receiving unit receives the voice data itself from the terminal 1 and stores it in the voice storage unit 11. The record receiving unit receives the record text recognized by the terminal 1 and outputs it to the record storage unit 13. The query text input unit receives the query text input by the user from the terminal 1 and outputs it to the similar record search unit 16. The search result output unit 17 transmits voice data and / or text as a search result to the terminal 1.

以上、詳細に説明したように、本発明の検索装置、プログラム及び方法によれば、ユーザの記憶が曖昧なレコード（音声データやテキスト）を検索することができる。 As described above in detail, according to the search device, program, and method of the present invention, it is possible to search for a record (voice data or text) in which the user's memory is ambiguous.

前述した本発明の種々の実施形態について、本発明の技術思想及び見地の範囲の種々の変更、修正及び省略は、当業者によれば容易に行うことができる。前述の説明はあくまで例であって、何ら制約しようとするものではない。本発明は、特許請求の範囲及びその均等物として限定するものにのみ制約される。 Various changes, modifications, and omissions of the above-described various embodiments of the present invention can be easily made by those skilled in the art. The above description is merely an example, and is not intended to be restrictive. The invention is limited only as defined in the following claims and the equivalents thereto.

１検索装置、端末、スマートフォン
１０１マイク部
１０２タッチパネルディスプレイ
１０３スピーカ部
１１音声蓄積部
１２音声認識部
１３レコード蓄積部
１４カテゴリ分類部
１５曖昧度算出部
１６類似レコード検索部
１７検索結果出力部
２検索サーバ DESCRIPTION OF SYMBOLS 1 Search apparatus, terminal, smart phone 101 Microphone part 102 Touch panel display 103 Speaker part 11 Voice storage part 12 Voice recognition part 13 Record storage part 14 Category classification part 15 Ambiguity calculation part 16 Similar record search part 17 Search result output part 2 Search server

Claims

A search device for searching for a record in which a user's memory is estimated to be ambiguous with respect to a query text from a record storage unit that stores a plurality of records including text,
The record is given a recording date and time,
Category classification means for classifying a plurality of records into a predetermined number of categories using the record accumulation means;
Similar record search means for searching records similar in terms of text or more in comparison with the query text using the record storage means;
Ambiguity calculation means for calculating the ambiguity obtained by multiplying the number of records included in the category into which the record is classified and the time difference from the recording date and time of the record to the current date and time for each of the retrieved records;
A search apparatus comprising: search result output means for outputting the query text as a search result in order from the record with the highest ambiguity.

The search apparatus according to claim 1, wherein the record storage unit stores the ambiguity calculated by using the ambiguity calculation unit in association with each record.

Voice recognition means for converting voice data acquired by a microphone into text, generating a record in which the text is divided into silent sections of a predetermined time or longer, and storing the record in the record storage means is further provided. The search device according to claim 1 or 2.

4. The microphone and the voice recognition unit are always activated, and voice data as a log in a daily conversation of a user is always collected and stored in the record storage unit. The search device described in 1.

The search device according to any one of claims 1 to 4, wherein the category classification means classifies a plurality of records so as to belong to each category by using an unsupervised classification algorithm.

Each time the new text is acquired as a new record, the category classification means compares the new text with the text of a record belonging to each category, and classifies the new text so as to belong to any one category. The search device according to any one of claims 1 to 5, wherein:

The search apparatus according to any one of claims 1 to 6, wherein a record having an ambiguity greater than or less than a predetermined value is deleted from the record storage means.

A search program for causing a computer mounted on a device to function so as to search a record in which a user's memory is estimated to be ambiguous with respect to a query text from a record storage unit that stores a plurality of records including text,
The record is given a recording date and time,
Category classification means for classifying a plurality of records into a predetermined number of categories using the record accumulation means;
Similar record search means for searching records similar in terms of text or more in comparison with the query text using the record storage means;
Ambiguity calculation means for calculating the ambiguity obtained by multiplying the number of records included in the category into which the record is classified and the time difference from the recording date and time of the record to the current date and time for each of the retrieved records;
A search program for causing a computer to function as search result output means for outputting the query text as a search result in order from the record with the higher ambiguity.

A search method of an apparatus for searching a record in which a user's memory is estimated to be ambiguous with respect to a query text from a record storage unit that stores a plurality of records including text,
The record is given a recording date and time,
A first step of classifying a plurality of records into a predetermined number of categories using the record storage unit;
A second step of searching for a record that is textually similar to or more than a predetermined condition using the record storage unit as compared with the query text;
A third step of calculating, for each retrieved record, an ambiguity obtained by multiplying the number of records included in the category into which the record is classified and a time difference from the recording date and time of the record to the current date and time;
And a fourth step of outputting the query text as a search result in order from the record with the higher ambiguity.