JP2931553B2

JP2931553B2 - Topic processing device

Info

Publication number: JP2931553B2
Application number: JP8228129A
Authority: JP
Inventors: 一志西本; 康之角
Original assignee: EI TEI AARU CHINO EIZO TSUSHIN KENKYUSHO KK
Current assignee: EI TEI AARU CHINO EIZO TSUSHIN KENKYUSHO KK
Priority date: 1996-08-29
Filing date: 1996-08-29
Publication date: 1999-08-09
Anticipated expiration: 2016-08-29
Also published as: JPH1069482A

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】この発明は話題処理装置に関
し、特に、複数の参加者による対話を活性化するための
話題処理装置に関する。The present invention relates to a topic processing apparatus, and more particularly, to a topic processing apparatus for activating a conversation between a plurality of participants.

【０００２】[0002]

【従来の技術】対話は、新たな発想を得るために非常に
有効な創造的行為と見なすことができる。学会やシンポ
ジウムなどにおいて、その企画者や司会者がしばしば
「活発な議論を期待します」と言うのは、１つには議論
にそのような効果があることを前提としているからであ
る。また、研究者のラウンジで研究者が交わすとりとめ
もない対話の中から新たな情報が得られること、そして
新しい発想や問題解決のヒントが得られることはしばし
ば経験することである。2. Description of the Related Art Dialogue can be regarded as a very effective creative act for obtaining new ideas. In academic societies and symposiums, organizers and presenters often say that "we expect lively discussions" because, in part, it is premised that discussions have such an effect. It is also a common experience that researchers can gain new information from the disorganized dialogue they interact with in the lounge, and also gain new ideas and tips for problem solving.

【０００３】人の発想の過程には２つの重要な過程があ
るといわれる。１つは発散的思考過程であり、もう１つ
は収束的思考過程である。解の不明なある問題に対し
て、人はまず発散的思考過程においてその問題と明らか
に関連ある情報のみならず、関連が曖昧な情報やふと気
になる情報断片をも多数かき集める。こうして収集した
情報断片を、収束的思考過程において統合していく。こ
の際、一見関連が不明な情報断片同士の間に新たな関連
を見出したとき、人は新たな発想を得ることができる。It is said that there are two important processes in the process of human thinking. One is a divergent thinking process, and the other is a convergent thinking process. For a problem whose solution is unknown, a person first gathers not only information that is clearly related to the problem in the divergent thinking process, but also a lot of information that is vaguely related or suddenly anxious. The information fragments collected in this way are integrated in a convergent thinking process. At this time, when a new association is found between information fragments whose apparent association is unknown, a person can obtain a new idea.

【０００４】[0004]

【発明が解決しようとする課題】いくつかの発想法など
では、意図的にこの両過程を明確に分離することによっ
てより効果的に発想を得ようとする。たとえば、ブレイ
ンストーミングは主に発散的思考を効果的に行なうため
の手法であり、またＫＪ法は収束的思考を効果的に行な
うための手法である。Some ideas and the like try to obtain ideas more effectively by intentionally separating these two processes intentionally. For example, brainstorming is a method for mainly performing divergent thinking, and the KJ method is a method for effectively performing convergent thinking.

【０００５】一方、日常的に行なわれる対話ではこの両
者が区別されることは通常ない。しかし、見方を変えれ
ば日常の対話はこの発散的過程と収束的過程とを混合し
たものと考えられる。つまり、参加者達の発言によって
さまざまな情報が提供される（発散的過程）。ここであ
る参加者が提供された情報断片相互あるいは提供された
情報断片と自分の持つ知識の中のある情報とに何らかの
関連性を見出したとき（収束的過程）、この参加者は発
見した関連性に基づく新しい話題を提示し、対話は新た
な局面へと展開する。こうして発散と収束の過程を繰返
すことによって対話は次々と進行する。[0005] On the other hand, the two are not usually distinguished in daily conversation. However, from a different point of view, everyday dialogue is thought to be a mixture of these divergent and convergent processes. In other words, various information is provided by the participants' remarks (divergent process). Here, when a participant finds some association between the provided information fragments or between the provided information fragments and some information in his or her knowledge (convergent process), the participant finds the related Presenting new topics based on gender, the dialogue will evolve into a new phase. The dialogue proceeds one after another by repeating the process of divergence and convergence.

【０００６】それゆえに、この発明の主たる目的は、参
加している対話を常に活性化し、これによって対話の創
造性をより強力なものとするようにした話題処理装置を
提供することである。[0006] Therefore, a main object of the present invention is to provide a topic processing apparatus which always activates participating dialogues, thereby enhancing the creativity of the dialogues.

【０００７】[0007]

【課題を解決するための手段】請求項１に係る発明は、
複数の話者による対話を活性化するための話題処理装置
であって、各参加者の個々の発言を発言オブジェクトと
して形態素解析を行ない、その結果から所定の種類の単
語のみをキーワードとして抽出するキーワード抽出手段
と、抽出された各キーワードの出現頻度ならびに出現間
隔に基づき、各発言オブジェクトについてそれに含まれ
た各キーワードの重みを評価するキーワード重み計算手
段と、得られたキーワードおよびそれらの重みに基づ
き、ある時点までのすべての発言オブジェクト相互の関
連性を各発言オブジェクトが含むキーワード群の共有関
係を用いて計算し、一方ですべてのキーワード相互の関
連性をそれらのキーワードの各発言オブジェクトにおけ
る共起関係を用いて計算し、その結果発言オブジェクト
相互ならびにキーワード相互の関係を空間的構造として
表現することにより対話内容を構造化する対話構造化手
段と、得られた対話内容構造から特に発言オブジェクト
が密集している部分の空間位置情報を求めることによっ
てその時点までの対話における主要話題を検出する主要
話題検出手段とを備えて構成される。The invention according to claim 1 is
A topic processing apparatus for activating a dialogue between a plurality of speakers, wherein a morphological analysis is performed using individual utterances of each participant as a utterance object, and only keywords of a predetermined type are extracted as keywords from the result. Extracting means, keyword weight calculating means for evaluating the weight of each keyword included in each utterance object based on the appearance frequency and appearance interval of each extracted keyword, and based on the obtained keywords and their weights, The relevance of all the utterance objects up to a certain point is calculated using the shared relation of the keywords included in each utterance object, while the relevance of all the keywords is co-occurred in each utterance object of those keywords , And as a result, the utterance objects Dialogue structuring means for structuring the dialogue content by expressing the mutual relationships as a spatial structure, and obtaining the spatial position information of the part where speech objects are particularly dense from the obtained dialogue content structure. Main topic detecting means for detecting a main topic in the conversation up to the point in time.

【０００８】請求項２に係る発明は、複数の話者による
対話を活性化するための話題処理装置であって、請求項
１のキーワード抽出手段とキーワード重み計算手段と対
話構造化手段とを含み、さらに得られた対話内容構造か
ら発言オブジェクトが存在しない空白領域の位置情報を
求めることによってその時点までの対話における話題の
空白領域を検出する話題空白領域検出手段を備えて構成
される。According to a second aspect of the present invention, there is provided a topic processing apparatus for activating a dialogue between a plurality of speakers, including a keyword extracting unit, a keyword weight calculating unit, and a dialog structuring unit. And a topic blank area detecting means for detecting a blank area of a topic in the dialogue up to that time by obtaining position information of a blank area where no comment object exists from the obtained dialog content structure.

【０００９】請求項３に係る発明では、請求項２の各構
成に加えて、さらに得られた対話内容構造から特に発言
オブジェクトが密集している部分の空間位置情報を求め
ることによって、その時点までの対話における主要話題
を検出する主要話題検出手段を含む。According to the third aspect of the present invention, in addition to the respective configurations of the second aspect, the spatial position information of a portion where the utterance objects are particularly dense is obtained from the obtained dialog content structure, so that up to that point in time. And a main topic detecting means for detecting a main topic in the dialogue of the user.

【００１０】請求項４に係る発明では、請求項１または
請求項３の各構成に加えて、さらに主要話題検出手段に
よって把握された主要話題の空間位置近傍に位置するキ
ーワードを所定数集め、これらのキーワードの重みを主
要話題の空間位置からの距離に応じて評価する主要話題
関連キーワード抽出手段と、得られたキーワード群に基
づき、連想辞書を用いて連想キーワード群を想起する連
想想起手段と、想起された連想キーワード群を用いてデ
ータベースから関連情報を検索抽出する主要話題関連情
報抽出手段を含む。In the invention according to claim 4, in addition to the constitution of claim 1 or 3, a predetermined number of keywords located near the spatial position of the main topic grasped by the main topic detecting means are collected. Main topic-related keyword extraction means for evaluating the weight of the keyword according to the distance from the spatial position of the main topic, associative recall means for recalling the associative keyword group using the associative dictionary based on the obtained keyword group, A main topic related information extracting means for searching and extracting related information from a database using the recalled associative keyword group is included.

【００１１】請求項５に係る発明では、請求項２または
請求項３の各構成に加えて、さらに話題空白領域検出手
段によって把握された話題空白領域の近傍に位置するキ
ーワードを所定数集め、これらのキーワードの重みを話
題空白領域からの距離に応じて評価する空白領域周辺キ
ーワード抽出手段と、得られたキーワード群を用いてデ
ータベースから関連情報を検索抽出する空白領域関連情
報抽出手段を含む。According to a fifth aspect of the present invention, in addition to the configuration of the second or third aspect, a predetermined number of keywords located near the topic blank region detected by the topic blank region detecting means are collected. And a blank area surrounding keyword extracting means for evaluating the weight of the keyword according to the distance from the topic blank area, and a blank area related information extracting means for searching and extracting related information from a database using the obtained keyword group.

【００１２】請求項６に係る発明では、請求項３の構成
に加えて、さらに主要話題検出手段によって把握された
主要話題の空間位置近傍に位置するキーワードを所定数
集め、これらのキーワードの重みを主要話題の空間位置
からの距離に応じて評価する主要話題関連キーワード抽
出手段と、得られたキーワード群に基づき、連想辞書を
用いて連想キーワード群を想起する連想想起手段と、想
起された連想キーワード群を用いてデータベースから関
連情報を検索抽出する主要話題関連情報抽出手段と、把
握された話題空白領域の近傍に位置するキーワードを所
定数集め、これらのキーワードの重みを話題空白領域か
らの距離に応じて評価する空白領域周辺キーワード抽出
手段と、得られたキーワード群を用いてデータベースか
ら関連情報を検索抽出する空白領域関連情報抽出手段を
含む。In the invention according to claim 6, in addition to the structure of claim 3, a predetermined number of keywords located near the spatial position of the main topic grasped by the main topic detecting means are collected, and the weight of these keywords is weighted. Main topic related keyword extraction means for evaluating according to the distance from the spatial position of the main topic, associative recall means for recalling the associative keyword group using the associative dictionary based on the obtained keyword group, and recalled associative keyword A main topic related information extracting means for searching and extracting relevant information from a database using a group, and collecting a predetermined number of keywords located in the vicinity of the identified topic blank area, and weighting these keywords to a distance from the topic blank area Searching related information from the database using the keyword extraction means around the blank area and evaluating the keywords Including a blank area-related information extracting means for output.

【００１３】[0013]

【発明の実施の形態】図１はこの発明の一実施形態を示
すブロック図である。図１において、発言文がテキスト
データとして入力され、順次入力される個々のテキスト
をそれぞれ１つの発言オブジェクトと称する。この発言
オブジェクトは発言文処理モジュール１と話題構造化モ
ジュール２とに与えられる。発言文処理モジュール１は
構文解析モジュール１１とキーワード重み計算モジュー
ル１２とを含み、構文解析モジュール１１は入力された
発言オブジェクトを形態素解析し、キーワード群を抽出
する。このキーワード群は話題構造化モジュール２に与
えられるとともに、キーワード重み計算モジュール１２
に与えられる。キーワード重み計算モジュール１２は各
キーワードの重みを計算し、話題構造化モジュール２に
与える。FIG. 1 is a block diagram showing an embodiment of the present invention. In FIG. 1, a statement is input as text data, and each of the sequentially input texts is referred to as one statement object. This comment object is given to the comment sentence processing module 1 and the topic structuring module 2. The sentence processing module 1 includes a syntax analysis module 11 and a keyword weight calculation module 12, and the syntax analysis module 11 performs a morphological analysis on the input speech object and extracts a keyword group. The keyword group is provided to the topic structuring module 2 and the keyword weight calculation module 12
Given to. The keyword weight calculation module 12 calculates the weight of each keyword and gives it to the topic structuring module 2.

【００１４】話題構造化モジュール２は、その時点まで
に得られたキーワードオブジェクトと発言オブジェクト
との関連を反映して、これらを２次元空間上に配置し、
対話の内容を構造化する。発言オブジェクト／キーワー
ドオブジェクトの２次元空間配置情報は話題展開状況認
識モジュール３に与えられる。話題展開状況認識モジュ
ール３は主要話題検出モジュール３１と話題空白領域検
出モジュール３２とを含み、主要話題検出モジュール３
１は主要話題の位置を検出し、話題空白領域検出モジュ
ール３２は話題空白位置を検出する。The topic structuring module 2 arranges these in a two-dimensional space, reflecting the association between the keyword object and the comment object obtained up to that point,
Structure the content of the conversation. The two-dimensional spatial arrangement information of the comment object / keyword object is given to the topic development situation recognition module 3. The topic development situation recognition module 3 includes a main topic detection module 31 and a topic blank area detection module 32.
1 detects the position of the main topic, and the topic blank area detection module 32 detects the topic blank position.

【００１５】図２は図１に示した話題構造化モジュール
２と話題展開状況認識モジュール３の動作を説明するた
めのフローチャートである。FIG. 2 is a flowchart for explaining the operation of the topic structuring module 2 and the topic development situation recognition module 3 shown in FIG.

【００１６】次に、図１および図２を参照して、この発
明の一実施形態の具体的な動作について説明する。発言
文処理モジュール１の構文解析モジュール１１は入力さ
れた発言オブジェクトを構文解析し、各単語の品詞を決
定する。次いで、名詞および品詞を決定できなかった語
のみを取出し、この語群をその発言オブジェクトのキー
ワードとする。そして、キーワード重み計算モジュール
１２は第ｎ発言オブジェクトにおけるキーワードｗ_iの
重みＷ_wi,nを次式によって求める。Next, a specific operation of the embodiment of the present invention will be described with reference to FIGS. The syntax analysis module 11 of the statement processing module 1 parses the input statement object and determines the part of speech of each word. Next, only words for which nouns and parts of speech could not be determined are extracted, and this word group is used as a keyword of the utterance object. Then, the keyword weight calculation module 12 obtains the weight W _wi keyword w _i in the n speech _object, a _n by the following equation.

【００１７】[0017]

【数１】 (Equation 1)

【００１８】ただし、ｆ_wiは第ｎ−１発言オブジェクト
まででｗ_iがいくつの発言オブジェクトに出現したか、
ｆ_wi,nは第ｎ発言オブジェクトにｗ_iがいくつ現れた
か、ｉ_wi _,nはｗ_iが第ｎ発言オブジェクトの何発言前に
最近使用されたかを示す。また、Ｆ_g，Ｆ_l，Ｉは定数
であり、経験的にそれぞれ５，１，１０の値を与える。[0018] However, if f _wi has appeared in the w _i are a number of remarks object up to the (n-1) th speech object,
f _{wi, n} is either w _i was a number appeared in the n-th speech _object, i _wi, _n indicates whether w _i has been recently used to before what remarks of the n-th speech object. F _g , F _l , and I are constants, and empirically give values of 5, ₁ , and 10, respectively.

【００１９】このような重み付けとした理由は、次の点
にある。まず、出現頻度が高いキーワードは、どんな対
話においても一般的に使用される語であるか、あるいは
その対話の全体的な話題にかかわる語のいずれかであ
り、いずれにせよ話題の構造化における重要度が低い。
もしこのような語が大きな重みを持つと、後で述べる話
題の構造化段階において各話題クラスタを不明瞭にして
しまう可能性がある。The reason for such weighting is as follows. First, the keywords that appear frequently are either words that are commonly used in any dialogue, or words that relate to the overall topic of the dialogue, and in any case are important in structuring the topic. Low degree.
If such words have a large weight, each topic cluster may be obscured in the topic structuring stage described below.

【００２０】１発言中に多数出現する語はその発言での
重要度が高い、高頻度語であってもしばらく使用されな
かった後に使用される場合は重要度が高い、という考え
に基づく。It is based on the idea that words appearing many times in one utterance have a high importance in the utterance, and that a high-frequency word has a high importance if it is used after being unused for a while.

【００２１】次に、話題構造化モジュール２は図２に示
すステップ（図示ではＳＰと略称する）ＳＰ１におい
て、各発言オブジェクトのキーワードオブジェクトの共
有関係，逆に言えば各キーワードオブジェクトの発言オ
ブジェクトでの共起関係を用いて双対尺度法に基づい
て、各オブジェクト相互の距離を求める。ここで、双対
尺度法とは、複数の数量化属性で構成されたオブジェク
ト集合が与えられたときに、オブジェクト集合と属性集
合にそれぞれ得点数量を与えることによって、オブジェ
クト同士の属性共有性と属性同士の共起性を空間におけ
る相対的な位置関係として表現する手法である。ステッ
プＳＰ２において、空間構造を定量化する主成分のうち
寄与度の高い上位２つを取出し、各オブジェクトの２次
元空間上での位置情報を決定する。Next, in step SP1 (abbreviated as SP in FIG. 2) SP1 shown in FIG. 2, the topic structuring module 2 shares the keyword objects in each statement object with each other. The distance between the objects is obtained based on the dual scaling method using the co-occurrence relation. Here, the dual scaling method means that when an object set composed of a plurality of quantification attributes is given, the object set and the attribute set are each given a score quantity, whereby the attribute commonality between objects and the attribute This is a method of expressing the co-occurrence of as a relative positional relationship in space. In step SP2, the top two components having the highest contribution are taken out of the principal components for quantifying the spatial structure, and the position information of each object in the two-dimensional space is determined.

【００２２】話題展開状況認識モジュール３の主要話題
検出モジュール３１はステップＳＰ３において、２次元
空間を１６×１６のセルに分割する。そして、ステップ
ＳＰ４において第ｎセルについて、そのセル自身とその
周りの８つのセルに含まれる発言オブジェクトの個数の
平均を求め、これを第ｎセルの重みとする。これを１６
×１６の全セルについて計算する。ステップＳＰ５にお
いて重みが所定のしきい値θを越えるセルを選び、これ
を主要話題セルと見なし、それらのセルの中心の座標を
出力する。In step SP3, the main topic detection module 31 of the topic development situation recognition module 3 divides the two-dimensional space into 16 × 16 cells. Then, in step SP4, for the n-th cell, the average of the number of utterance objects included in the cell itself and the eight surrounding cells is obtained, and this is set as the weight of the n-th cell. This is 16
Calculate for all × 16 cells. In step SP5, a cell whose weight exceeds a predetermined threshold value θ is selected, this is regarded as a main topic cell, and the coordinates of the center of those cells are output.

【００２３】一方、話題空白領域検出モジュール３２は
ステップＳＰ６において、重みがゼロのセルが存在する
領域については、ユークリッド距離変換手法などを適用
することにより、その領域の中心位置とその中心の重み
（重みゼロでないセル領域からの距離、遠いほど重みが
大きくなる）を求める。ユークリッド距離変換手法と
は、空間上での領域中心とその中心の周辺からの距離を
求める手法である。この結果、重みが所定のしきい値φ
を越える空白領域中心を選び、これを話題空白中心と見
なし、それらの領域中心の座標を出力する。On the other hand, in step SP6, the topic blank area detection module 32 applies the Euclidean distance conversion method to the area where a cell having a weight of zero exists, thereby obtaining the center position of the area and the weight of the center ( The distance from the cell area where the weight is not zero, the greater the distance, the greater the weight). The Euclidean distance conversion method is a method of calculating the distance from the center of an area in space and the periphery of the center. As a result, the weight becomes a predetermined threshold φ
Is selected as the center of the topic blank, and the coordinates of the center of those areas are output.

【００２４】上述のごとく、この実施形態によって、主
要話題の位置と話題空白位置を検出することができる。As described above, according to this embodiment, the position of the main topic and the topic blank position can be detected.

【００２５】図３はこの発明の他の実施形態を示すブロ
ック図である。図３において、発言文処理モジュール１
と話題構造化モジュール２と話題展開状況認識モジュー
ル３は図１と同様であり、さらにこの発明の実施形態で
は、話題提供モジュール起動モジュール（話題転換検出
モジュール）４と話題提供モジュール５とが設けられ
る。話題提供モジュール起動モジュール４は話題転換点
を検出し、話題提供モジュール５は話題転換検出モジュ
ール４から指示されるタイミングで、状況に応じた話題
の種となる情報を抽出して利用者に提供する。このため
に、話題提供モジュール５はキーワード抽出および重み
計算モジュール５１と記事データベース検索モジュール
５２と連想想起モジュール５３とを含む。FIG. 3 is a block diagram showing another embodiment of the present invention. In FIG. 3, the statement processing module 1
The topic structuring module 2 and the topic development situation recognition module 3 are the same as those in FIG. 1, and in the embodiment of the present invention, a topic providing module starting module (topic conversion detecting module) 4 and a topic providing module 5 are provided. . The topic providing module starting module 4 detects a topic turning point, and the topic providing module 5 extracts information serving as a topic seed according to the situation and provides it to the user at the timing designated by the topic switching detecting module 4. . For this purpose, the topic providing module 5 includes a keyword extraction and weight calculation module 51, an article database search module 52, and an association recall module 53.

【００２６】図４は図３に示した話題提供モジュール５
の動作を説明するためのフローチャートである。なお、
図３における発言文処理モジュール１と話題構造化モジ
ュール２と話題展開状況認識モジュール３は図１と同様
の動作を行ない、主要話題位置と話題空白位置を検出し
て話題提供モジュール５に与える。話題提供モジュール
５はステップＳＰ１１において話題提供モジュール起動
モジュール４から起動命令が来たか否かを判別し、起動
命令が来なければ待機し、起動命令が来れば、ステップ
ＳＰ１２において話題展開状況認識モジュール３から話
題空白位置が検出されたか否かを判別する。FIG. 4 shows the topic providing module 5 shown in FIG.
5 is a flowchart for explaining the operation of FIG. In addition,
The comment processing module 1, topic structuring module 2, and topic development situation recognition module 3 in FIG. 3 perform the same operations as in FIG. 1, detect the main topic position and the topic blank position, and provide them to the topic providing module 5. The topic providing module 5 determines in step SP11 whether or not an activation command has been received from the topic providing module activation module 4. If no activation command has been received, the topic providing module 5 waits. If an activation command has been received, the topic development status recognition module 3 in step SP12. It is determined whether or not a topic blank position is detected from.

【００２７】話題空白領域があれば、ステップＳＰ１３
において最近入力された発言オブジェクトに最も近い話
題空白領域中心を探し、ステップＳＰ１４においてキー
ワード抽出および重み計算モジュール５１が話題空白領
域中心に近いものから順に所定数ｍ個のキーワードオブ
ジェクトを集める。同時に、各キーワードオブジェクト
について話題空白領域中心からの距離に反比例する重み
を求める。そして、ステップＳＰ１５において、獲得し
た重み付けキーワード群を検索キーワード群とする。ス
テップＳＰ１６において、検索キーワード群を用いて、
記事データベース検索モジュール５２が持つ記事データ
ベース検索を行ない、検索結果を提供する。If there is a topic blank area, step SP13
At step SP14, the keyword extraction and weight calculation module 51 collects a predetermined number m of keyword objects in order from the one closest to the center of the topic blank area. At the same time, a weight that is inversely proportional to the distance from the center of the topic blank area is obtained for each keyword object. Then, in step SP15, the obtained weighted keyword group is set as a search keyword group. In step SP16, using the search keyword group,
The article database search module 52 performs an article database search and provides search results.

【００２８】前述のステップＳＰ１２において、話題空
白領域がなければ、ステップＳＰ１７において最近入力
された発言オブジェクトに最も近い主要話題セルを探
す。そして、キーワード抽出および重み計算モジュール
５１がステップＳＰ１８において主要話題セル中心に近
いものから順に、所定数ｍ個のキーワードオブジェクト
を集める。同時に、各キーワードオブジェクトについて
主要話題セル中心からの距離に反比例する重みを求め
る。連想想起モジュール５３はステップＳＰ１９におい
て、求めた重み付けキーワード群を用いて、予めシステ
ムが持っている連想事象によって連想想起を行ない、連
想キーワード群を得る。そして、ステップＳＰ２０にお
いて、獲得した連想キーワード群を検索キーワード群と
して、ステップＳＰ１６において、検索キーワード群を
用いて、記事データベース検索モジュール５２の記事デ
ータベース検索を行なって検索結果を提供する。At step SP12, if there is no topic blank area, a main topic cell closest to the recently input utterance object at step SP17 is searched. Then, in step SP18, the keyword extraction and weight calculation module 51 collects a predetermined number m of keyword objects in order from the one closest to the main topic cell center. At the same time, a weight that is inversely proportional to the distance from the center of the main topic cell is obtained for each keyword object. In step SP19, the associative recall module 53 uses the obtained weighted keyword group to perform associative recall by an associative event that the system has in advance, and obtains an associative keyword group. Then, in step SP20, the acquired associative keywords are used as search keywords, and in step SP16, the article database search of the article database search module 52 is performed using the search keywords to provide search results.

【００２９】[0029]

【発明の効果】以上のように、この発明によれば、各参
加者の個々の発言を発言オブジェクトとして形態素解析
を行ない、その結果から所定の種類の単語のみをキーワ
ードとして抽出し、抽出した各キーワードの出現頻度な
らびに出現間隔に基づき、各発言オブジェクトについて
それに含まれた各キーワードの重みを評価し、得られた
キーワードおよびそれらの重みに基づいて、ある時点ま
でのすべての発言オブジェクト相互の関連性を各発言オ
ブジェクトが含むキーワード群の共有関係を用いて計算
し、一方ですべてのキーワード相互の関連性をそれらの
キーワードの各発言オブジェクトにおける共起関係を用
いて計算し、その発言オブジェクト相互ならびにキーワ
ード相互の関係を空間的構造として表現することにより
対話内容を構造化し、その対話内容構造から特に発言オ
ブジェクトが密集している部分の空間位置情報を求める
ことによって、その時点までの対話における主要話題を
検出するようにしたので、どんなタイプの対話について
も話題の獲得処理が可能となり、しかも空白領域を検出
することによって新たな話題の展開方向を見出すことが
できる。しかも、目的思考型のみならず、自由展開型対
話についても話題の獲得が可能となり、予めプランとし
て新たに展開すべき話題の方向を設定する必要がなく、
対話の状況に応じてその状況そのものから新たな展開方
向を獲得することが可能となる。As described above, according to the present invention, morphological analysis is performed using individual utterances of each participant as utterance objects, and only words of a predetermined type are extracted as keywords from the results, and each extracted word is extracted. Evaluate the weight of each keyword included in each statement object based on the appearance frequency and appearance interval of the keyword, and based on the obtained keywords and their weights, relevance of all statement objects to a certain point in time Is calculated using the shared relationship of the keywords included in each statement object, while the relevance of all the keywords is calculated using the co-occurrence relation of each keyword in each statement object, and the statement objects and keywords are calculated. Structuring dialogue by expressing mutual relationships as a spatial structure The main topic in the dialogue up to that point is detected by obtaining spatial position information of the part where the speech objects are particularly dense from the dialogue content structure, so that topic acquisition processing can be performed for any type of dialogue. Can be obtained, and the development direction of a new topic can be found by detecting a blank area. In addition, it is possible to acquire topics not only in the purpose thinking type but also in the free expansion type dialogue, and it is not necessary to set the direction of the topic to be newly developed as a plan in advance,
According to the situation of the dialogue, it is possible to acquire a new development direction from the situation itself.

[Brief description of the drawings]

【図１】この発明の一実施形態の概略ブロック図であ
る。FIG. 1 is a schematic block diagram of an embodiment of the present invention.

【図２】図１に示した話題構造化モジュールと話題展開
状況認識モジュールの動作を説明するためのフローチャ
ートである。FIG. 2 is a flowchart for explaining operations of a topic structuring module and a topic development situation recognition module shown in FIG. 1;

【図３】この発明の他の実施形態を示すブロック図であ
る。FIG. 3 is a block diagram showing another embodiment of the present invention.

【図４】図３に示した話題提供モジュールの動作を説明
するためのフローチャートである。FIG. 4 is a flowchart for explaining the operation of the topic providing module shown in FIG. 3;

[Explanation of symbols]

１発言文処理モジュール２話題構造化モジュール３話題展開状況認識モジュール４話題提供モジュール起動モジュール５話題提供モジュール１１構文解析モジュール１２キーワード重み計算モジュール３１主要話題検出モジュール３２話題空白領域検出モジュール５１キーワード抽出および重み計算モジュール５２記事データベース検索モジュール５３連想想起モジュール DESCRIPTION OF SYMBOLS 1 Comment sentence processing module 2 Topic structuring module 3 Topic development situation recognition module 4 Topic providing module starting module 5 Topic providing module 11 Parsing module 12 Keyword weight calculation module 31 Main topic detection module 32 Topic blank area detection module 51 Keyword extraction and Weight calculation module 52 Article database search module 53 Associative recall module

フロントページの続き (56)参考文献特開平６−236410（ＪＰ，Ａ) 特開平５−81328（ＪＰ，Ａ) 特開平７−230468（ＪＰ，Ａ) 角康之、間瀬健二、「グループディスカッションにおける話題空間の可視化」、電子情報通信学会技術研究報告、Ｖｏｌ．95、Ｎｏ．237（ＨＣＳ95− 19）、ｐ．83−ｐ．90（1995) (58)調査した分野(Int.Cl.⁶，ＤＢ名) G06F 17/20 - 17/30 ＪＩＣＳＴファイル（ＪＯＩＳ)Continuation of the front page (56) References JP-A-6-236410 (JP, A) JP-A-5-81328 (JP, A) JP-A-7-230468 (JP, A) Yasuyuki Kado, Kenji Mase, "Group Visualization of Topic Space in Discussion ”, IEICE Technical Report, Vol. 95, No. 237 (HCS95-19), p. 83-p. 90 (1995) (58) Field surveyed (Int. Cl. ⁶ , DB name) G06F 17/20-17/30 JICST file (JOIS)

Claims

(57) [Claims]

1. A topic processing apparatus for activating a conversation between a plurality of speakers, performing a morphological analysis using individual utterances of each participant as a utterance object, and extracting only words of a predetermined type from the results. Keyword extraction means for extracting as keywords, keyword weight calculation means for evaluating the weight of each keyword included in each utterance object based on the appearance frequency and appearance interval of each keyword extracted by the keyword extraction means; Means and the weights obtained by the keyword weight calculation means and based on the weights thereof, calculate the relevance of all the utterance objects up to a certain point in time using the sharing relationship of the keyword group included in each utterance object, All keywords are related to each other Dialog structuring means for calculating the dialog content by using the co-occurrence relationship of each of the utterance objects of the keywords and expressing the relationship between the utterance objects and the keywords as a spatial structure, and the dialog structuring A topic processing device comprising a main topic detection unit for detecting a main topic in a conversation up to that point by obtaining spatial position information of a part where speech objects are particularly dense from a conversation content structure obtained by the means.

2. A topic processing apparatus for activating a conversation between a plurality of speakers, performing a morphological analysis using individual utterances of each participant as a utterance object, and extracting only words of a predetermined type from the result. Keyword extraction means for extracting as keywords, keyword weight calculation means for evaluating the weight of each keyword included in each utterance object based on the appearance frequency and appearance interval of each keyword extracted by the keyword extraction means; Means and the weights obtained by the keyword weight calculation means and based on the weights thereof, calculate the relevance of all the utterance objects up to a certain point in time using the sharing relationship of the keyword group included in each utterance object, All keywords are related to each other Dialog structuring means for calculating the dialog content by using the co-occurrence relationship of each of the utterance objects of the keywords and expressing the relation between the utterance objects and the keywords as a spatial structure, and the dialog structuring A topic processing apparatus comprising: topic blank area detecting means for detecting a blank area of a topic in a dialogue up to that time by obtaining position information of a blank area where no comment object exists from a dialog content structure obtained by the means.

3. A main topic for detecting a main topic in a dialogue up to that point by obtaining spatial position information of a part where speech objects are particularly dense from the dialogue content structure obtained by the dialogue structuring means. 3. The topic processing device according to claim 2, further comprising topic detection means.

4. A method for collecting a predetermined number of keywords located near the spatial position of the main topic grasped by the main topic detecting means, and evaluating the weights of these keywords according to the distance from the spatial position of the main topic. A main topic-related keyword extracting unit, an associative recalling unit that recalls an associative keyword group using an associative dictionary based on the keyword group obtained by the main topic-related keyword extracting unit, and an associative keyword group recalled by the associative recalling unit 4. The topic processing apparatus according to claim 1, further comprising a main topic related information extracting means for searching for and extracting related information from a database by using a keyword.

5. A blank area which collects a predetermined number of keywords located near the topic blank area detected by the topic blank area detecting means, and evaluates the weight of these keywords according to the distance from the topic blank area. 4. The topic processing according to claim 2, further comprising a peripheral keyword extracting unit, and a blank region related information extracting unit that retrieves and extracts related information from a database using the keyword group obtained by the blank region surrounding keyword extracting unit. apparatus.

6. A method according to claim 1, further comprising the step of collecting a predetermined number of keywords located near the spatial position of the main topic detected by the main topic detecting means, and evaluating the weights of these keywords in accordance with the distance from the spatial position of the main topic. A main topic-related keyword extracting unit, an associative recalling unit that recalls an associative keyword group using an associative dictionary based on the keyword group obtained by the main topic-related keyword extracting unit, and an associative keyword group recalled by the associative recalling unit. Main topic related information extracting means for searching for and extracting relevant information from a database using a predetermined number of keywords located in the vicinity of the topic blank area grasped by the topic blank area detecting means; Means for extracting keywords around blank areas to be evaluated according to the distance from 4. The topic processing apparatus according to claim 3, further comprising a blank area related information extracting unit that searches and extracts related information from a database using the keyword group obtained by the blank area surrounding keyword extracting unit.