JP2022026752A - Knowledge database generation device, program, and automated answering system

Info

Publication number: JP2022026752A
Application number: JP2020130361A
Authority: JP
Inventors: 幸白石; Miyuki Shiraishi; 義久石原; Yoshihisa Ishihara; 潤平小林; Junpei Kobayashi
Original assignee: Dai Nippon Printing Co Ltd
Current assignee: Dai Nippon Printing Co Ltd
Priority date: 2020-07-31
Filing date: 2020-07-31
Publication date: 2022-02-10
Anticipated expiration: 2040-07-31

Abstract

To provide a knowledge database generation device, a program, and an automated answering system that make it easy to construct a knowledge database containing highly accurate data.

SOLUTION: A knowledge DB generation server 1 includes: a character processing section 12 that recognizes or acquires a character string included in data to be registered and extracts words included in the character string; an image processing section 14 that performs image recognition processing on the data to be registered and acquires the recognized words; a word classification section 16 that uses dictionary data to acquire a classification of the words obtained by the processing of the character processing section 12 and the image processing section 14; and a data registration section 18 that associates the words and the classification with the data to be registered and stores them in a knowledge DB 3.

SELECTED DRAWING: Figure 1

Description

The present invention relates to a knowledge database generation device, a program, and an automatic response system.

Conventionally, to improve work efficiency, efforts have been made to install a display and a microphone at, for example, a company reception desk and to have a displayed virtual character or the like handle customer-facing work such as reception duties. Such a system recognizes a user's question by speech recognition via the microphone, searches a knowledge database for an answer, and answers the question by showing the answer on the display and/or speaking it through a speaker. The technique described in Patent Document 1 is one example of such a dialogue system.

Patent Document 1: Japanese Unexamined Patent Publication No. 2007-148118

To realize a dialogue system such as the one described in Patent Document 1, data must be registered in the knowledge database in advance, and both the amount and the accuracy of the registered data matter. It is therefore desirable to be able to construct a knowledge database containing highly accurate data more easily.

Accordingly, an object of the present invention is to provide a knowledge database generation device, a program, and an automatic response system that make it easy to construct a knowledge database containing highly accurate data.

The present invention solves the above problem by the following means.
A first invention is a knowledge database generation device comprising: character processing means for recognizing or acquiring a character string included in registration target data and extracting words included in the character string; image processing means for performing image recognition processing on the registration target data and acquiring the recognized words; word classification means for acquiring, using dictionary data, a classification of the words obtained as a result of the processing by the character processing means and the image processing means; and data registration means for associating the words and the classification with the registration target data and storing them in a knowledge database.
A second invention is the knowledge database generation device of the first invention, wherein the image recognition processing recognizes the scene of an image.
A third invention is the knowledge database generation device of the first or second invention, further comprising image editing means for analyzing the registration target data to check whether an image and a character string overlap and, if they do, deleting a character string area containing the character string and filling in the deleted character string area using the image surrounding it, wherein the image processing means performs its processing on the registration target data after processing by the image editing means.
A fourth invention is the knowledge database generation device of any of the first to third inventions, further comprising hypernym acquisition means for acquiring a hypernym of a word obtained as a result of the processing by the character processing means and the image processing means, wherein the word classification means further acquires the classification by treating the hypernym acquired by the hypernym acquisition means as a word.
A fifth invention is the knowledge database generation device of any of the first to fourth inventions, further comprising weight determination means for determining a weight of a word obtained as a result of the processing by the character processing means and the image processing means, wherein the data registration means further associates the weight determined by the weight determination means with the word and stores it in the knowledge database.
A sixth invention is the knowledge database generation device of the fifth invention, wherein the weight determination means determines the weight of the word based on at least one of: the frequency with which the word appears in the registration target data; the manner in which the word is emphasized; and the proportion of the area of the registration target data occupied by the image corresponding to the word.
A seventh invention is the knowledge database generation device of any of the first to sixth inventions, further comprising color information acquisition means for analyzing the registration target data and acquiring color information, wherein the data registration means further associates the color information acquired by the color information acquisition means with the registration target data and stores it in the knowledge database.
An eighth invention is the knowledge database generation device of any of the first to seventh inventions, wherein the character processing means extracts a character string that serves as a heading, the device further comprising association means for associating with one another registration target data having identical or similar headings.
A ninth invention is a program for causing a computer to function as the knowledge database generation device of any of the first to eighth inventions.
A tenth invention is an automatic response system that uses the knowledge database generated by the knowledge database generation device of the fifth or sixth invention, comprising: question receiving means for receiving a question; question analysis means for performing morphological analysis on the question received by the question receiving means after it has been converted into a character string; and answer output means for searching the knowledge database based on the analysis result of the question analysis means and outputting the registration target data obtained as search results, giving priority to data for which the total of the weights corresponding to the words is large.

According to the present invention, it is possible to provide a knowledge database generation device, a program, and an automatic response system that make it easy to construct a knowledge database containing highly accurate data.

FIG. 1 is a functional block diagram of the automatic response system according to the present embodiment.
FIG. 2 is a flowchart showing the knowledge data registration process in the knowledge DB generation server according to the present embodiment.
FIG. 3 is a diagram showing an example of registration target data to be registered in the knowledge DB according to the present embodiment.
FIG. 4 is a flowchart showing the character processing in the knowledge DB generation server according to the present embodiment.
FIG. 5 is a diagram showing a specific example of the image editing process in the character processing according to the present embodiment.
FIG. 6 is a flowchart showing the image processing in the knowledge DB generation server according to the present embodiment.
FIG. 7 is a diagram showing an example of knowledge data registered in the knowledge DB generated by the knowledge DB generation server according to the present embodiment.
FIG. 8 is a flowchart showing the question answering process in the automatic response device according to the present embodiment.
FIG. 9 is a diagram showing a specific example in the automatic response device according to the present embodiment.

Hereinafter, an embodiment for carrying out the present invention will be described with reference to the drawings. This is merely an example, and the technical scope of the present invention is not limited to it.
(Embodiment)
<Automatic response system 100>
FIG. 1 is a functional block diagram of the automatic response system 100 according to the present embodiment.
The automatic response system 100 is a system that uses the knowledge DB (database) 3 to respond to questions received by the automatic response device 4. In the automatic response system 100, the knowledge DB generation server 1 generates the knowledge DB 3 from existing media such as leaflets and pamphlets.
The automatic response system 100 includes the knowledge DB generation server 1, the knowledge DB 3, and the automatic response device 4.

<Knowledge DB generation server 1>
The knowledge DB generation server 1 is a device that generates the knowledge DB 3 by registering in it the registration target data together with various information obtained from that data.
The knowledge DB generation server 1 is, for example, a server or a personal computer (PC).
The knowledge DB generation server 1 includes a control unit 10, a storage unit 20, and a communication interface unit 29.

The control unit 10 is a CPU (central processing unit) that controls the entire knowledge DB generation server 1. By reading and executing the OS (operating system) and application programs stored in the storage unit 20 as needed, the control unit 10 cooperates with the hardware described above to perform various functions.
The control unit 10 includes a target data receiving unit 11, a character processing unit 12 (character processing means), a target data confirmation processing unit 13 (image editing means, color information acquisition means), an image processing unit 14 (image processing means), a hypernym acquisition unit 15 (hypernym acquisition means), a word classification unit 16 (word classification means), a weight determination unit 17 (weight determination means), and a data registration unit 18 (data registration means).

The target data receiving unit 11 receives registration target data. Registration target data is the data that becomes the knowledge data registered in the knowledge DB 3. It may be any of various existing media: material intended for distribution on paper, such as leaflets and pamphlets; digitized data such as Web pages and social networking services (SNS); or handwritten notes and the like. In the present embodiment, the registration target data is described as containing both text and images, like a leaflet or pamphlet. In practice, the registration target data may also include items that contain only text or only images.
When the material is on paper, it is converted into an image by, for example, a scanner or camera (not shown), and the target data receiving unit 11 receives the resulting image data as the registration target data.
For material with multiple pages, such as a pamphlet, the target data receiving unit 11 may receive all pages together as one item of registration target data, or may receive each page as a separate item.

The character processing unit 12 processes the text (character strings) included in the received registration target data.
The character processing unit 12 acquires the text included in the registration target data, or recognizes it by character recognition processing, and then acquires the words included in that text. The character processing unit 12 can obtain words from the text by natural language processing such as morphological analysis.

The target data confirmation processing unit 13 analyzes the registration target data to check whether any image and text overlap. If there is an overlap, that is, if text lies on top of an image, the target data confirmation processing unit 13 performs an image editing process that deletes the text area containing the text and fills in the deleted text area using the image surrounding it.
The target data confirmation processing unit 13 also acquires color information for the registration target data. It does so by analyzing the overall color tone of the image of the registration target data. The color information is acquired, for example, as a color code that expresses the RGB values in hexadecimal.
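The color-information step can be sketched as follows. This is only an illustration under stated assumptions: the text does not say how the overall color tone is analyzed, so the sketch simply takes the mean RGB value of all pixels, and the function name `average_color_code` and the pixel representation are hypothetical.

```python
from typing import Iterable, Tuple

Pixel = Tuple[int, int, int]  # (R, G, B), each in the range 0-255


def average_color_code(pixels: Iterable[Pixel]) -> str:
    """Return the mean color of the pixels as a hex color code.

    Assumption: "overall color tone" is approximated by the per-channel mean.
    """
    total_r = total_g = total_b = count = 0
    for r, g, b in pixels:
        total_r += r
        total_g += g
        total_b += b
        count += 1
    if count == 0:
        raise ValueError("no pixels given")
    # Integer mean per channel, formatted as two hex digits per channel.
    return "#{:02X}{:02X}{:02X}".format(
        total_r // count, total_g // count, total_b // count
    )
```

A dominant-color histogram would be an equally plausible reading of the passage; the mean is used here only because it is the simplest option.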

The image processing unit 14 performs image recognition processing on the registration target data and acquires the recognized words. More specifically, the image processing unit 14, for example, detects and analyzes each image contained in the data and acquires words related to each image. The image processing unit 14 may, for example, use existing image analysis techniques to detect and crop each image contained in the registration target data, label each image, and acquire the labels as words. To label each image, the image processing unit 14 may use an image DB (not shown) in which images and words are associated, and acquire the words associated with the images in the image DB that resemble each image. The image processing unit 14 also recognizes the scene of an image. When the target data confirmation processing unit 13 has performed the image editing process, the image processing unit 14 may detect each image from the processed target data, that is, the registration target data after the image editing process.
If the registration target data itself has been labeled in advance, the image processing unit 14 may acquire those labels.

The hypernym acquisition unit 15 acquires hypernyms of the words acquired by the character processing unit 12 and the image processing unit 14. Using, for example, a term dictionary (not shown) that associates words with hypernyms, the hypernym acquisition unit 15 acquires hypernyms that correspond to broader concepts of a word. For example, when the word is "tuna", its hypernyms are terms such as "fresh fish" and "food".
The word classification unit 16 acquires the classification of each word (including hypernyms). The word classification unit 16 refers to the scene-related term dictionary 22 (dictionary data) and classifies each word as either "object" or "scene". The word classification unit 16 may further divide the words classified as "object" into categories.
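The hypernym lookup and the object/scene classification described above might look like the following minimal sketch. The dictionary contents, the function names, and the rule that any word not listed as a scene term is treated as an object are all assumptions made for illustration; the text only states that a term dictionary and the scene-related term dictionary 22 are consulted.

```python
from typing import Dict, Iterable, List

# Hypothetical term dictionary: word -> hypernyms (broader terms).
TERM_DICTIONARY: Dict[str, List[str]] = {
    "tuna": ["fresh fish", "food"],
    "sea bream": ["fresh fish", "food"],
}

# Hypothetical scene-related term dictionary: words that denote scenes.
SCENE_TERMS = {"beach", "festival", "night view"}


def hypernyms_of(word: str) -> List[str]:
    """Return the hypernyms registered for a word, or an empty list."""
    return TERM_DICTIONARY.get(word, [])


def classify(word: str) -> str:
    """Assumption: a word is a 'scene' if listed as one, otherwise an 'object'."""
    return "scene" if word in SCENE_TERMS else "object"


def classify_with_hypernyms(words: Iterable[str]) -> Dict[str, str]:
    """Classify each word and each of its hypernyms."""
    result: Dict[str, str] = {}
    for word in words:
        for term in [word, *hypernyms_of(word)]:
            result[term] = classify(term)
    return result
```

For example, `classify_with_hypernyms(["tuna", "beach"])` classifies "tuna", "fresh fish", and "food" as objects and "beach" as a scene.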

The weight determination unit 17 determines the weight of each word. For example, the weight determination unit 17 may assign a larger weight to a word that was acquired from both the text and an image. The weight determination unit 17 may also determine the weight from the size and style of the characters, giving a larger weight to words that are set in larger characters or that are bold or underlined. Further, the weight determination unit 17 may determine the weight from the size of an image, for example assigning a larger weight the larger the proportion of the entire area of the registration target data that the image occupies.
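One way to combine these factors is sketched below. The text names the factors but gives no formula, so the base weight, the additive bonuses, and the names `WordEvidence` and `determine_weight` are hypothetical choices made only to show the idea.

```python
from dataclasses import dataclass


@dataclass
class WordEvidence:
    """Hypothetical summary of how a word appears in the registration target data."""
    in_text: bool = False
    in_image: bool = False
    font_size_ratio: float = 1.0   # word font size / body font size
    emphasized: bool = False       # bold or underlined
    image_area_ratio: float = 0.0  # share of the page area taken by the related image


def determine_weight(evidence: WordEvidence) -> float:
    """Return a weight that grows with each factor named in the description.

    Assumption: factors are combined additively on top of a base weight of 1.0.
    """
    weight = 1.0
    if evidence.in_text and evidence.in_image:
        weight += 1.0  # found in both the text and an image
    if evidence.in_text:
        weight += max(evidence.font_size_ratio - 1.0, 0.0)  # larger characters
        if evidence.emphasized:
            weight += 0.5
    if evidence.in_image:
        weight += evidence.image_area_ratio  # larger image, larger weight
    return weight
```

With these illustrative constants, a word that appears in plain body text gets 1.0, while a bold word at twice the body size that also labels an image covering a quarter of the page gets 3.75.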

The data registration unit 18 associates the words and their classifications with the registration target data and registers them in the knowledge DB 3. The data registration unit 18 may further associate the color information acquired by the target data confirmation processing unit 13 with the registration target data. It may also associate with each word the weight determined by the weight determination unit 17.
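The shape of a registered record can be illustrated with a small data structure. The class and field names below are hypothetical, and an in-memory dictionary stands in for the knowledge DB 3, whose actual schema is not given at this point in the text.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass
class WordEntry:
    word: str
    classification: str             # "object" or "scene"
    weight: Optional[float] = None  # optional, as in the description


@dataclass
class KnowledgeRecord:
    data_id: str                    # hypothetical identifier of the registration target data
    words: List[WordEntry] = field(default_factory=list)
    color_code: Optional[str] = None  # optional hex color code


class InMemoryKnowledgeDB:
    """Stand-in for the knowledge DB 3, keyed by data identifier."""

    def __init__(self) -> None:
        self.records: Dict[str, KnowledgeRecord] = {}

    def register(self, record: KnowledgeRecord) -> None:
        self.records[record.data_id] = record
```

Keeping the weight and color code optional mirrors the wording of the description, in which both associations are presented as things the data registration unit 18 may additionally do.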

The storage unit 20 is a storage area, such as a hard disk or a semiconductor memory element, for storing the programs, data, and the like that the control unit 10 needs in order to execute its various processes.
The storage unit 20 includes a program storage unit 21 and a scene-related term dictionary 22.
The program storage unit 21 is a storage area for storing programs. The program storage unit 21 stores a program 21a.
The program 21a is a program for executing the various functions of the control unit 10.

The scene-related term dictionary 22 is a dictionary that stores at least words related to scenes. A scene here means a setting or situation; the term denotes an overall state, as distinct from individual objects. The scene-related term dictionary 22 may associate each word with either "scene" or "object". The scene-related term dictionary 22 is prepared in advance.
The communication interface unit 29 is an interface for communicating with the knowledge DB 3, other DBs, other devices, and the like via a communication network.

<Knowledge DB 3>
The knowledge DB 3 is a database that stores information on the registration target data, including the words and classifications that the knowledge DB generation server 1 acquired from that data. In FIG. 1, the knowledge DB 3 is located outside the knowledge DB generation server 1. Because the knowledge DB 3 stores a large amount of information, including a large amount of image data, it has a large-capacity storage area.

<Automatic response device 4>
The automatic response device 4 is a device that answers questions using the knowledge DB 3.
The automatic response device 4 is, for example, a PC or a smartphone, and may also be a robot or the like. When it is a PC, a robot, or the like, the automatic response device 4 is installed at, for example, a tourist information center or a reception desk, and answers questions from members of the public who visit the place where it is installed. When it is a PC, a smartphone, or the like, the automatic response device 4 answers, for example, questions from its owner. In the following description, a person who asks a question using the automatic response device 4 is referred to as a user.

The automatic response device 4 includes a control unit 40, a storage unit 50, an input unit 56, a display unit 57, and a communication interface unit 59.
The control unit 40 is a CPU that controls the entire automatic response device 4. By reading and executing the OS and application programs stored in the storage unit 50 as needed, the control unit 40 cooperates with the hardware described above to perform various functions.
The control unit 40 includes a reception processing unit 41 (question receiving means, question analysis means), a DB processing unit 42 (answer output means), and an answer output unit 43 (answer output means).

The reception processing unit 41 receives questions from the user. The user enters a question by, for example, typing on a keyboard or speaking into a microphone, and the reception processing unit 41 receives it.
The reception processing unit 41 then converts the received question into text if necessary, performs morphological analysis on it, and extracts, for example, the words contained in the question.

The DB processing unit 42 searches the knowledge DB 3 based on the extracted words and generates a list of the search results. In doing so, the DB processing unit 42 may order the result list so that registration target data containing words with large weights ranks higher.
The answer output unit 43 causes the display unit 57 to output the top-ranked registration target data from the result list generated by the DB processing unit 42.
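The weight-based ordering of search results can be sketched as follows. This is a simplified illustration, not the system's actual search: the knowledge DB is modeled as a plain mapping from a data identifier to its word weights, matching is exact string equality, and the function name `rank_results` is hypothetical.

```python
from typing import Dict, List


def rank_results(knowledge: Dict[str, Dict[str, float]],
                 query_words: List[str]) -> List[str]:
    """Return identifiers of matching data, highest total matched weight first.

    `knowledge` maps a data identifier to a {word: weight} mapping.
    Data that matches none of the query words is left out of the result.
    """
    scores: Dict[str, float] = {}
    for data_id, weights in knowledge.items():
        matched = [weights[word] for word in query_words if word in weights]
        if matched:
            scores[data_id] = sum(matched)
    # Sort by descending score; ties are broken by identifier for determinism.
    return sorted(scores, key=lambda data_id: (-scores[data_id], data_id))
```

Summing the weights of all matched words follows the tenth invention, which gives priority to data whose total weight for the matched words is large.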

The storage unit 50 is a storage area, such as a hard disk or a semiconductor memory element, for storing the programs, data, and the like that the control unit 40 needs in order to execute its various processes.
The input unit 56 is, for example, an input device such as a keyboard or a mouse. The input unit 56 may also be a voice input device such as a microphone.
The display unit 57 is, for example, a display device such as an LCD (Liquid Crystal Display).
The communication interface unit 59 is an interface for communicating with the knowledge DB 3 and the like via a communication network.

Here, a computer means an information processing device that includes a control unit, a storage device, and the like. The knowledge DB generation server 1 and the automatic response device 4 are each information processing devices that include a control unit, a storage unit, and the like, and so fall within the concept of a computer.

<Description of processing>
Next, the process of registering knowledge data in the knowledge DB 3 will be described.
FIG. 2 is a flowchart showing the knowledge data registration process in the knowledge DB generation server 1 according to the present embodiment.
FIG. 3 is a diagram showing an example of registration target data to be registered in the knowledge DB 3 according to the present embodiment.
FIG. 4 is a flowchart showing the character processing in the knowledge DB generation server 1 according to the present embodiment.
FIG. 5 is a diagram showing a specific example of the image editing process in the character processing according to the present embodiment.
FIG. 6 is a flowchart showing the image processing in the knowledge DB generation server 1 according to the present embodiment.
FIG. 7 is a diagram showing an example of knowledge data 31 registered in the knowledge DB 3 generated by the knowledge DB generation server 1 according to the present embodiment.

The knowledge data registration process shown in FIG. 2 is performed for each item of registration target data. When there are multiple items, the control unit 10 repeats the knowledge data registration process once per item.
In step S11 of FIG. 2 (steps are hereinafter denoted simply by "S"), the control unit 10 (target data receiving unit 11) of the knowledge DB generation server 1 receives the registration target data. If the medium to be registered is not already digitized, it is first converted into an image with a scanner or the like, and the control unit 10 receives the resulting image data as the registration target data.
FIG. 3 shows a pamphlet 61 that has been converted into an image, as an example of registration target data.
The pamphlet 61 shown in FIG. 3 is about gourmet food. The pamphlet 61 contains texts 62a to 62c and images 63a to 63c.

In S12 of FIG. 2, the control unit 10 performs character processing.
The character processing is described here with reference to FIG. 4.
In S21 of FIG. 4, the control unit 10 (character processing unit 12) recognizes the text included in the registration target data by character recognition processing. In the example of the pamphlet 61 in FIG. 3, it recognizes the text 62a "Delicious Gourmet MAP", the text 62b "sea bream", and the text 62c "tuna". The control unit 10 may instead treat each line of the text 62a as a separate text and recognize it as three strings: "Delicious", "Gourmet", and "MAP".
In S22 of FIG. 4, the control unit 10 (character processing unit 12) acquires the words included in the text. The control unit 10 then temporarily stores the acquired words in the storage unit 20.

In S23, the control unit 10 (target data confirmation processing unit 13) checks for overlap between images and text. For example, the texts 62b and 62c of FIG. 3 overlap the images 63b and 63c, respectively.
In S24, the control unit 10 determines whether any image and text overlap. If there is at least one overlap (S24: YES), the control unit 10 proceeds to S25. If there is no overlap (S24: NO), the control unit 10 proceeds to S13 of FIG. 2.
In S25, the control unit 10 (target data confirmation processing unit 13) performs an image editing process on each portion of the registration target data where an image and text overlap, and generates processed target data. The control unit 10 then temporarily stores the processed target data in the storage unit 20 and proceeds to S13 of FIG. 2.
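
The overlap check of S23 and S24 can be sketched as an axis-aligned bounding-box intersection test. This is only a minimal illustration, assuming each text and image region is available as a rectangle; the `Box` type, the `overlaps` and `find_overlaps` helpers, and all coordinates are hypothetical and are not taken from the specification.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Box:
    """Axis-aligned rectangle (hypothetical region representation)."""
    left: int
    top: int
    right: int
    bottom: int


def overlaps(a: Box, b: Box) -> bool:
    """Return True if the two rectangles share any area."""
    return (a.left < b.right and b.left < a.right
            and a.top < b.bottom and b.top < a.bottom)


def find_overlaps(texts: dict[str, Box], images: dict[str, Box]) -> list[tuple[str, str]]:
    """List every (text id, image id) pair whose regions overlap (S23)."""
    return [(t, i)
            for t, text_box in texts.items()
            for i, image_box in images.items()
            if overlaps(text_box, image_box)]


# Made-up layout loosely modelled on pamphlet 61: text 62a stands alone,
# while texts 62b and 62c lie on top of images 63b and 63c.
texts = {
    "62a": Box(10, 10, 200, 40),
    "62b": Box(30, 120, 80, 140),
    "62c": Box(130, 120, 180, 140),
}
images = {
    "63a": Box(10, 50, 200, 100),
    "63b": Box(10, 110, 100, 200),
    "63c": Box(110, 110, 200, 200),
}

pairs = find_overlaps(texts, images)
needs_editing = bool(pairs)  # S24: YES when at least one overlap exists
```

With this layout, `pairs` contains the two overlapping pairs, so processing would continue to S25 rather than skipping directly to S13.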

The image editing process is described here with reference to FIG. 5.
FIG. 5(A) shows the image 63c of the pamphlet 61, which contains the text 62c and therefore has a portion where image and text overlap. The image editing process for the image 63c proceeds through the following steps.
First, the control unit 10 whites out the text region containing the text 62c in the image 63c. FIG. 5(B) shows the image 70 after this white-out step, which includes a white-out portion 71.
Next, the control unit 10 fills the white-out portion 71 of the image 70 with the average color of its surroundings and generates a processed image 72 that includes the filled-in image 73, as shown in FIG. 5(C).
The control unit 10 performs the same image editing process on the image 63b, which contains the text 62b of the pamphlet 61. As a result, the processed target data contains no overlap between images and text.
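
The fill step of FIG. 5 can be sketched in plain Python as follows. This is an illustrative sketch, not the patented implementation: it assumes the image is a list of rows of RGB tuples, that the text region is a known rectangle, and that "the surroundings" means the one-pixel border around that rectangle. The function name `fill_with_surrounding_average` is hypothetical, and the white-out and fill steps are merged into a single pass for brevity.

```python
def fill_with_surrounding_average(pixels, left, top, right, bottom):
    """Replace the region [left, right) x [top, bottom) with the average
    colour of the one-pixel border around it and return a new image."""
    height, width = len(pixels), len(pixels[0])
    # Collect the pixels that touch the region but lie outside it.
    border = [
        pixels[y][x]
        for y in range(max(top - 1, 0), min(bottom + 1, height))
        for x in range(max(left - 1, 0), min(right + 1, width))
        if not (top <= y < bottom and left <= x < right)
    ]
    result = [row[:] for row in pixels]  # leave the input untouched
    if not border:  # the region covers the whole image; nothing to sample
        return result
    average = tuple(sum(p[c] for p in border) // len(border) for c in range(3))
    for y in range(top, bottom):
        for x in range(left, right):
            result[y][x] = average
    return result


# A 4x4 image with a uniform background and a 2x2 block of "ink" in the middle.
background, ink = (200, 100, 50), (0, 0, 0)
image = [[background] * 4 for _ in range(4)]
for y in (1, 2):
    for x in (1, 2):
        image[y][x] = ink

edited = fill_with_surrounding_average(image, left=1, top=1, right=3, bottom=3)
```

On a real photograph the surrounding colors vary, so the filled region only approximates the background; the specification describes the fill only as using the average color of the surroundings.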

In S13 of FIG. 2, the control unit 10 performs image processing.
The image processing is described here with reference to FIG. 6.
In S31 of FIG. 6, the control unit 10 (target data confirmation processing unit 13) acquires color information of the registration target data (or of the processed target data, if the image editing process was performed). In doing so, the control unit 10 acquires the proportion of each color across the whole of the registration target data. Because the color information represents the impression given by the overall color tone of the registration target data, it can be used for loosely specified similar-image searches. The control unit 10 temporarily stores the acquired color information in the storage unit 20.
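
One way to obtain the proportion of each color in S31 is to quantize every pixel into a coarse color bin and count the bins. The sketch below assumes RGB pixels and a bin width of 64 per channel; the `color_ratios` function, the bin width, and the sample pixels are all assumptions made for illustration.

```python
from collections import Counter


def color_ratios(pixels, step=64):
    """Return the share of each quantized RGB colour over the whole image."""
    counts = Counter(
        tuple((channel // step) * step for channel in pixel)
        for row in pixels
        for pixel in row
    )
    total = sum(counts.values())
    return {color: count / total for color, count in counts.items()}


# Tiny made-up page: four reddish pixels, three white pixels, one blue pixel.
page = [
    [(250, 20, 20), (250, 20, 20), (255, 255, 255), (255, 255, 255)],
    [(240, 10, 30), (250, 20, 20), (255, 255, 255), (10, 10, 200)],
]
ratios = color_ratios(page)
```

Quantizing first keeps near-identical shades in the same bin, which suits the stated purpose of capturing the overall impression rather than exact pixel values.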

In S32, the control unit 10 (image processing unit 14) performs an image cropping process that detects and cuts out each image included in the registration target data. This process cuts out the three images 63a to 63c shown in FIG. 3. Because the control unit 10 cuts out the images from the processed target data produced by the image editing process, the images can be cut out with high accuracy.

In S33, the control unit 10 (image processing unit 14) performs image recognition processing and acquires the recognized words. The control unit 10 may use an image DB (not shown) to acquire words related to the cut-out images. The words acquired from an image include not only words relating to objects in the image but also words relating to the scene of the image as a whole. The images used in this process contain no text, so words can be acquired from them with high accuracy. The control unit 10 temporarily stores the acquired words in the storage unit 20 and then proceeds to S14 of FIG. 2.

In S14 of FIG. 2, the control unit 10 (hypernym acquisition unit 15) acquires hypernyms of the words acquired from the text and the images. The control unit 10 may acquire the hypernyms of all the words and associate each hypernym with its word. Alternatively, the control unit 10 need not acquire hypernyms for all the words; for example, it may acquire a hypernym only when multiple words share a common hypernym. The control unit 10 temporarily stores the acquired hypernyms in the storage unit 20.
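
The second option in S14, acquiring a hypernym only when several words share it, can be sketched as follows. The `HYPERNYMS` table is a hypothetical stand-in for whatever thesaurus a real system would consult, and the `common_hypernyms` helper is likewise an assumption made for illustration.

```python
from collections import defaultdict

# Hypothetical hypernym table; a real system would consult a thesaurus.
HYPERNYMS = {"sea bream": "fish", "tuna": "fish", "map": "document"}


def common_hypernyms(words, table=HYPERNYMS, min_words=2):
    """Return the hypernyms shared by at least `min_words` of the given words,
    each mapped to the words it covers."""
    members = defaultdict(list)
    for word in words:
        hypernym = table.get(word)
        if hypernym is not None:
            members[hypernym].append(word)
    return {h: ws for h, ws in members.items() if len(ws) >= min_words}


shared = common_hypernyms(["delicious", "gourmet", "map", "sea bream", "tuna"])
```

Here only "fish" is kept, because it covers both "sea bream" and "tuna"; "document" covers a single word and is dropped.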

In S15, the control unit 10 (word classification unit 16) refers to the scene-related term dictionary 22 and acquires the classification of each word (including the hypernyms). The control unit 10 then associates each word with its classification.
In S16, the control unit 10 (weight determination unit 17) determines the weight of each word. As described above, the control unit 10 can use various text-based and image-based indicators to determine each weight. The control unit 10 may give a hypernym the same weight as its corresponding word or, if the hypernym is shared by multiple words, the same weight as the word with the largest weight.
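
S15 and S16 can be sketched together as a dictionary lookup followed by a count. The sketch assumes a two-way scene/non-scene classification and a weight equal to the number of times a word was obtained from the text and the images; `SCENE_TERMS`, `classify`, `weigh`, `hypernym_weight`, and the word lists are hypothetical illustrations rather than the contents of the scene-related term dictionary 22.

```python
from collections import Counter

# Hypothetical stand-in for the scene-related term dictionary 22.
SCENE_TERMS = {"date", "night", "gourmet"}


def classify(word, scene_terms=SCENE_TERMS):
    """Classify a word as 'scene' if the dictionary lists it (S15)."""
    return "scene" if word in scene_terms else "non-scene"


def weigh(text_words, image_words):
    """Count how often each word was obtained from text and images (S16)."""
    return Counter(text_words) + Counter(image_words)


def hypernym_weight(member_words, weights):
    """Give a hypernym the largest weight among the words it covers."""
    return max(weights[word] for word in member_words)


text_words = ["delicious", "gourmet", "map", "sea bream", "tuna"]
image_words = ["sea bream", "tuna", "gourmet"]

weights = weigh(text_words, image_words)
entries = {word: (classify(word), weight) for word, weight in weights.items()}
fish_weight = hypernym_weight(["sea bream", "tuna"], weights)
```

Under these assumptions a word found in both the text and an image, such as "tuna", receives a weight of 2.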

In S17, the control unit 10 (data registration unit 18) associates the words, classifications, and weights, as well as the color information, with the registration target data and registers them in the knowledge DB 3. If the control unit 10 has acquired hypernyms for all the words, it may register only the hypernyms associated with words whose weight is at or above a predetermined value.
The control unit 10 then ends this process.

FIG. 7 shows an example of the knowledge data 31 registered in the knowledge DB 3.
The knowledge data 31 shown in FIG. 7 was generated with the pamphlet 61 shown in FIG. 3 as the registration target data.
The knowledge data 31 includes a data unit 31a, a keyword classification unit 31b, and a color information unit 31c.
The data unit 31a stores the registration target data and, if generated, the processed target data.
The keyword classification unit 31b stores words, classifications, and weights in association with one another.
For example, the entry for the word "tuna" indicates that the word was extracted from both text and an image, that its classification is "food" under "object", and that its weight is "2".
The color information unit 31c stores the color information of each color making up the pamphlet 61 as a whole in association with its proportion.
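
The three parts of the knowledge data 31 map naturally onto a small record type. The sketch below is one possible in-memory representation; the class and field names, the file names, and the color values are hypothetical, and only the "tuna" entry (sources, classification, weight) follows the example given for FIG. 7.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class KeywordEntry:
    """One row of the keyword classification unit 31b."""
    word: str
    sources: tuple[str, ...]         # where the word came from
    classification: tuple[str, ...]  # from broad to narrow
    weight: int


@dataclass
class KnowledgeData:
    """One item of knowledge data 31."""
    source_data: str                   # data unit 31a: registration target data
    edited_data: Optional[str] = None  # data unit 31a: processed target data, if any
    keywords: list[KeywordEntry] = field(default_factory=list)    # unit 31b
    color_ratios: dict[str, float] = field(default_factory=dict)  # unit 31c


record = KnowledgeData(
    source_data="pamphlet61.png",         # made-up file name
    edited_data="pamphlet61_edited.png",  # made-up file name
    keywords=[KeywordEntry("tuna", ("text", "image"), ("object", "food"), 2)],
    color_ratios={"red": 0.5, "white": 0.375, "blue": 0.125},  # made-up values
)
```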

Through this knowledge data registration process, the knowledge DB generation server 1 registers the registration target data in the knowledge DB 3 as knowledge data. The knowledge DB 3 stores a plurality of items of knowledge data 31, each corresponding to an item of registration target data. The knowledge DB 3 contains not only the text included in the registration target data but also words relating to its images, and those words cover not only the objects in an image but also the scene the image conveys. The knowledge data registered in the knowledge DB 3 can therefore serve as highly accurate data for searches.

Next, processing for automatic responses using the knowledge DB 3 is described.
FIG. 8 is a flowchart showing question answering processing in the automatic response device 4 according to the present embodiment.
FIG. 9 is a diagram showing a specific example for the automatic response device 4 according to the present embodiment.
In S41 of FIG. 8, the control unit 40 (reception processing unit 41) of the automatic response device 4 receives a question from the user via the input unit 56.
In S42, the control unit 40 (reception processing unit 41) converts the received question into text. This step is unnecessary when the question is received through an input device such as a keyboard, because it is already in text form.
In S43, the control unit 40 (reception processing unit 41) performs morphological analysis and extracts words.

In S44, the control unit 40 (DB processing unit 42) searches the knowledge DB 3 based on the extracted words. The control unit 40 (DB processing unit 42) then generates a list of search results in which registration target data containing heavily weighted words is ranked higher.
In S45, the control unit 40 (answer output unit 43) outputs the top-ranked registration target data from the generated list of search results to the display unit 57. The control unit 40 then ends this process.
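
The ranking in S44 can be sketched as scoring each registered item by the total weight of the words it shares with the question. The sketch assumes that morphological analysis (S43) has already produced the question words and that the knowledge DB is reduced to a mapping from item name to word weights; the `rank` function and all sample data are hypothetical.

```python
def rank(question_words, knowledge_db):
    """Order registered items by the total weight of the words they share
    with the question, highest first; items with no match are omitted."""
    scored = []
    for name, word_weights in knowledge_db.items():
        score = sum(word_weights.get(word, 0) for word in question_words)
        if score > 0:
            scored.append((score, name))
    # Sort by descending score, breaking ties by name for a stable order.
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [name for _, name in scored]


# Made-up knowledge DB contents.
knowledge_db = {
    "pamphlet 61": {"gourmet": 2, "delicious": 1, "tuna": 2, "date": 1},
    "flyer A": {"delicious": 1, "lunch": 2},
    "flyer B": {"parking": 3},
}
question_words = ["night", "date", "delicious", "restaurant"]

results = rank(question_words, knowledge_db)
top_answer = results[0] if results else None  # S45: output the top item
```

Summing the weights of all matching words is one reading of "ranked higher"; it corresponds to the prioritization by total weight recited in the automatic response system claim.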

FIG. 9 shows an example of a user using the system.
When the user says, "I'd like to find a restaurant with delicious food that would be good for a date at night," the control unit 40 of the automatic response device 4 receives the question via the microphone 56a (input unit 56).
As the processing result of the automatic response device 4, the control unit 40 displays the pamphlet 61 on the display 57a (display unit 57), and the speech "I found this information. How does it look?" is output from the speaker 58 as if spoken by a virtual character. The control unit 40 may also highlight the portions of the pamphlet 61 that relate to the keywords in the user's utterance.

If the user wants further information, the user can say "What else?" or operate an input unit (not shown) that indicates "next". In response, the control unit 40 may output the registration target data to the display unit 57 one item at a time, in order from the list of search results in which registration target data containing heavily weighted words is ranked higher.

As described above, the automatic response system 100 of the present embodiment provides the following effects.
(1) The knowledge DB generation server 1 acquires classifications for the words obtained from the registration target data as a whole and from the text and images it contains, and registers the registration target data in the knowledge DB 3 in association with the words and classifications. The various words obtained from the registration target data can thus be registered in the knowledge DB 3 as knowledge data, each associated with a classification.
(2) The knowledge DB generation server 1 refers to the scene-related term dictionary 22, which stores scene-related terms, and classifies each word at least according to whether or not it is a scene. Each word can thus be classified as a scene that conveys the atmosphere of the registration target data as a whole or of an image.
(3) The knowledge DB generation server 1 checks whether images and text overlap and, if they do, edits the image to remove the text and performs image processing on the processed target data containing the edited image. Targeting images that contain no text improves the accuracy of the image processing.

(4) The knowledge DB generation server 1 acquires hypernyms of the words and acquires classifications for those hypernyms, so the hypernyms and their classifications can be registered in the knowledge DB 3, making the registered knowledge data more accurate.
(5) The knowledge DB generation server 1 determines the weight of each word and registers the weight in the knowledge DB 3 in association with the word. The weight is determined by the word's frequency of appearance, its degree of emphasis, and the share of the data occupied by the corresponding image. When the knowledge DB 3 is used to answer a question, the weights may therefore allow a more accurate, better answer to be output.

(6) On receiving a question, the automatic response device 4 converts the question into text, performs morphological analysis, searches the knowledge DB 3 generated by the knowledge DB generation server 1 based on the analysis result, and outputs registration target data as answers to the question in descending order of weight. Questions can thus be answered using the knowledge DB 3, and using the weights allows the registration target data in the knowledge DB 3 that better matches the question to be output as the answer.

Although an embodiment of the present invention has been described above, the present invention is not limited to that embodiment. The effects described in the embodiment are merely a list of the most favorable effects arising from the present invention, and the effects of the present invention are not limited to those described. The embodiment described above and the modifications described below may be combined as appropriate; a detailed description of such combinations is omitted.

(Modifications)
(1) In the present embodiment, the knowledge DB generation server and the automatic response device are described as separate devices, but the invention is not limited to this. A single device may combine the functions of the knowledge DB generation server and the automatic response device. The knowledge DB may also be held, for example, in the storage unit of the knowledge DB generation server.

(2) In the present embodiment, the weight is determined by treating a word that appears multiple times as important and increasing its weight, but the invention is not limited to this. For example, the larger the share of the registration target data occupied by the image representing a word, the larger the weight may be. The weight may also be increased when the text is emphasized, for example by a large font or bold type.
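
The alternative cues in modification (2) could be combined into a single weight in many ways; the specification does not give a formula. The sketch below is purely illustrative: the `combined_weight` function, the one-point-per-10% rule for image area, and the one-point bonus for emphasis are all assumptions.

```python
def combined_weight(occurrences, image_area_percent=0, emphasized=False):
    """Combine three cues into one integer weight (illustrative formula only)."""
    area_bonus = image_area_percent // 10    # one point per 10% of the page
    emphasis_bonus = 1 if emphasized else 0  # large font or bold type
    return occurrences + area_bonus + emphasis_bonus


plain = combined_weight(occurrences=1)
prominent = combined_weight(occurrences=2, image_area_percent=30, emphasized=True)
```

A word that appears twice, whose image covers 30% of the page, and which is printed in bold would receive a higher weight than a word that appears once in plain text.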

(3) In the present embodiment, the knowledge DB is used to output registration target data in descending order of weight, but the invention is not limited to this. For example, multiple items of registration target data may be output as a list of search results, and when the user selects an item from the list, the details of the selected registration target data may be output.
(4) The color information registered in the knowledge DB in the present embodiment can be used, for example, for loosely specified similar-image searches. It can also serve as key information when the user wants to retrieve the same information again after an earlier search.
(5) Items of knowledge data generated in the present embodiment may further be associated with one another. For example, the character processing means extracts the text that serves as a heading; which text serves as a heading may be specified in advance. An association means then associates items of registration target data that have identical or similar headings.
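
The heading-based association in modification (5) can be sketched by grouping items whose headings match after normalization. Treating headings as "similar" when they differ only in letter case or spacing is an assumption made here for simplicity; the `normalize` and `associate_by_heading` helpers and the sample headings are hypothetical.

```python
from collections import defaultdict


def normalize(heading):
    """Fold case and collapse whitespace so near-identical headings match."""
    return " ".join(heading.casefold().split())


def associate_by_heading(headings):
    """Group item ids whose normalized headings are identical,
    keeping only groups with more than one member."""
    groups = defaultdict(list)
    for item_id, heading in headings.items():
        groups[normalize(heading)].append(item_id)
    return [ids for ids in groups.values() if len(ids) > 1]


related = associate_by_heading({
    "pamphlet 61": "Delicious Gourmet MAP",
    "flyer A": "delicious  gourmet map",
    "flyer B": "Parking Guide",
})
```

A looser notion of similarity, such as an edit-distance threshold, could replace `normalize` without changing the grouping logic.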

1 Knowledge DB generation server
3 Knowledge DB
4 Automatic response device
10, 40 Control unit
12 Character processing unit
13 Target data confirmation processing unit
14 Image processing unit
15 Hypernym acquisition unit
16 Word classification unit
17 Weight determination unit
18 Data registration unit
20, 50 Storage unit
21a Program
22 Scene-related term dictionary
41 Reception processing unit
42 DB processing unit
43 Answer output unit
56 Input unit
57 Display unit
61 Pamphlet
100 Automatic response system

Claims

A knowledge database generation device comprising:
a character processing means for recognizing or acquiring a character string included in registration target data and extracting a word contained in the character string;
an image processing means for performing image recognition processing on the registration target data and acquiring a recognized word;
a word classification means for using dictionary data to acquire a classification of each word obtained as a result of the processing by the character processing means and the image processing means; and
a data registration means for associating the word and the classification with the registration target data and storing them in a knowledge database.

The knowledge database generation device according to claim 1,
wherein the image recognition processing recognizes a scene of an image.

The knowledge database generation device according to claim 1 or 2,
further comprising an image editing means for analyzing the registration target data to check whether an image and the character string overlap and, if they overlap, deleting a character string region containing the character string and filling in the deleted character string region using the image surrounding the character string region,
wherein the image processing means performs its processing on the registration target data as edited by the image editing means.

The knowledge database generation device according to any one of claims 1 to 3,
further comprising a hypernym acquisition means for acquiring a hypernym of the word obtained as a result of the processing by the character processing means and the image processing means,
wherein the word classification means further acquires the classification by treating the hypernym acquired by the hypernym acquisition means as the word.

The knowledge database generation device according to any one of claims 1 to 4,
further comprising a weight determination means for determining a weight of the word obtained as a result of the processing by the character processing means and the image processing means,
wherein the data registration means further associates the weight determined by the weight determination means with the word and stores it in the knowledge database.

The knowledge database generation device according to claim 5,
wherein the weight determination means determines the weight of the word based on at least one of the frequency of appearance of the word in the registration target data, the manner in which the word is emphasized, and the proportion of the area of the registration target data occupied by the image corresponding to the word.

The knowledge database generation device according to any one of claims 1 to 6,
further comprising a color information acquisition means for analyzing the registration target data and acquiring color information,
wherein the data registration means further associates the color information acquired by the color information acquisition means with the registration target data and stores it in the knowledge database.

The knowledge database generation device according to any one of claims 1 to 7,
wherein the character processing means extracts the character string that serves as a heading,
the device further comprising an association means for associating items of the registration target data that have identical or similar headings.

A program for causing a computer to function as the knowledge database generation device according to any one of claims 1 to 8.

An automatic response system that uses the knowledge database generated by the knowledge database generation device according to claim 5 or 6, the system comprising:
a question reception means for receiving a question;
a question analysis means for performing morphological analysis on the question that has been received by the question reception means and converted into a character string; and
an answer output means for searching the knowledge database based on the analysis result of the question analysis means and outputting the registration target data obtained as search results, giving priority to the data with the larger total of the weights corresponding to the words.