JP6075042B2

JP6075042B2 - Language processing apparatus, language processing method, and program

Info

Publication number: JP6075042B2
Application number: JP2012265710A
Authority: JP
Inventors: 清司大倉; 明潮田
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 2012-12-04
Filing date: 2012-12-04
Publication date: 2017-02-08
Anticipated expiration: 2032-12-04
Also published as: JP2014110026A

Description

本発明は、言語処理装置、言語処理方法、及びプログラムに関する。 The present invention relates to a language processing device, a language processing method, and a program.

文の読解支援に関し、文の構造をわかりやすく表示し、意味の理解を助ける言語処理方法が知られている。例えば、母国語以外の言語を読んで理解する場合、単語はわかっても文の構造がわからないと、意味がつかめないことが多い。このような場合においても、文の構造をわかりやすく表示することにより、理解を助けることができる。 Regarding sentence reading support, a language processing method that displays the structure of a sentence in an easy-to-understand manner and helps understand the meaning is known. For example, when reading and understanding a language other than your native language, it is often difficult to understand the meaning of the word without knowing the structure of the sentence. Even in such a case, displaying the sentence structure in an easy-to-understand manner can help understanding.

例えば、構文木の解析対象となるノードに付随する、句の細分類である種別の情報から、解析対象区分分類知識を参照して、該解析対象が、動詞を主辞とする動詞句類であるか、主に名詞を主辞とする非動詞句類であるか、品詞であるかを判定する例が知られている。このとき、解析対象が動詞句類であれば、動詞と、該動詞の深層格との関係を解析し、解析対象が非動詞句類であれば、名詞句と形容詞句間を含む修飾関係を解析する。また、解析対象が品詞であれば、意味解析を終了する。このように、後のキーワード抽出に必要な深層格を指定し、それ以外の不必要な深層格内部の意味解析を実行することなく、必要最小の範囲の意味構造を生成することによって解析の実行時間の短縮を実現することを目指している。また、解析全体の流れを明示的にすることによって、処理方式の保守の負担の軽減を実現することを目的としている。 For example, referring to the analysis target classification knowledge from the information of the classification that is the subcategory of the phrase attached to the node to be analyzed in the parse tree, the analysis target is a verb phrase having the verb as the main word. There are known examples of determining whether a non-verb phrase having a noun as a main word or a part of speech. At this time, if the object to be analyzed is a verb phrase, the relationship between the verb and the deep case of the verb is analyzed. If the object to be analyzed is a non-verb phrase, a modification relationship including a noun phrase and an adjective phrase is obtained. To analyze. If the analysis target is a part of speech, the semantic analysis is terminated. In this way, analysis is performed by specifying the deep case necessary for the subsequent keyword extraction and generating the semantic structure of the minimum necessary range without performing other unnecessary semantic analysis inside the deep case. The goal is to save time. Another object of the present invention is to reduce the maintenance burden of the processing method by clarifying the flow of the entire analysis.

また、対象物に関する知識の分類を行う知識分類支援装置で、検索母集団設定データの設定に応じて、機能モデルデータから検索母集団データを生成する例が知られている。この知識分類支援装置は、機能ノードに関する知識データを入力され、知識データと類似する機能ノードを検索し、類似する機能ノードの情報が格納される検索結果データを生成する。これにより、機能ツリーを構成する機能ノードに対する知識分類の割り当てを容易にすることを目的としている。この知識分類支援装置は、検索結果データの任意の情報が選択されると、任意の情報に関連する機能ノードを、分類先候補ノードとして表示部に強調表示させる。 Also, there is known an example in which search population data is generated from function model data in a knowledge classification support device that classifies knowledge about an object in accordance with the setting of search population setting data. This knowledge classification support device receives knowledge data related to a function node, searches for a function node similar to the knowledge data, and generates search result data in which information on the similar function node is stored. This is intended to facilitate the assignment of knowledge classifications to the function nodes constituting the function tree. When any information in the search result data is selected, this knowledge classification support device highlights a function node related to the arbitrary information on the display unit as a classification destination candidate node.

さらに、行動をノードおよび当該行動の遷移をアークで表現するグラフを表示する行動データ表示方法が知られている。この行動データ表示方法では、行動データ表示装置が、行動主体の行動履歴をもとに各行動の初期確率および行動間の遷移確率を算出する。行動データ表示装置は、算出した各行動の初期確率および行動間の遷移確率をもとに、行動あるいは行動から他の行動への遷移を取り除いたときの単純性を算出し、算出した単純性の最小のものを取り除く。行動データ表示装置は、単純化した後の各行動および行動間の遷移確率をもとに当該行動をノードおよび行動間の遷移をアークで表現したグラフを生成する。このとき、行動履歴中の遷移確率が類似した複数の行動をまとめて１つの行動としてグループ化すること、グループ化の際に、予め定義された行動の階層に従ってまとめることが知られている。これにより、特徴的な部分を見つけ出しやすくすることを目指している。 Furthermore, a behavior data display method for displaying a graph that represents a behavior as a node and a transition of the behavior as an arc is known. In this behavior data display method, the behavior data display device calculates the initial probability of each behavior and the transition probability between the behaviors based on the behavior history of the behavior subject. Based on the calculated initial probability of each action and the transition probability between actions, the action data display device calculates the simplicity when the action or the transition from the action to another action is removed. Remove the smallest. The behavior data display device generates a graph in which the behavior is represented by a node and the transition between the behaviors is represented by an arc based on the simplified behavior and the transition probability between the behaviors. At this time, it is known that a plurality of actions having similar transition probabilities in the action history are grouped together as one action, and are grouped according to a predefined action hierarchy at the time of grouping. This aims to make it easier to find characteristic parts.

さらにまた、仮想入力装置において、指示部から制御部に対して画面上の座標が入力され、座標に対応するボタンが代表仮想ボタンであった場合は、制御部が代表仮想ボタンを選択状態にする例が知られている。この例では、仮想入力装置は、表示状態切り換え部に対して、代表仮想ボタンに属する仮想ボタンを表示状態にする命令を送信し、表示状態切り換え部は、仮想ボタンを表示状態にする。座標に対応するボタンが仮想ボタンであった場合、制御部は指示部に付属するボタンの押状態をチェックし、仮想ボタンが押されたときは、制御部は、仮想ボタンに割り当てられた処理を実行する。座標に対応するボタンが存在しなかった場合は、選択状態の代表仮想ボタンが存在するかをチェックする。存在する場合は、制御部は、代表仮想ボタンを非選択状態にし、表示状態切り換え部に対して、代表仮想ボタンに属する仮想ボタンを非表示状態にする。これにより、少ない表示面積でも一覧性のある習得が容易なソフトウエアキーボードを提供することを目的としている。 Furthermore, in the virtual input device, when coordinates on the screen are input from the instruction unit to the control unit, and the button corresponding to the coordinates is the representative virtual button, the control unit selects the representative virtual button. Examples are known. In this example, the virtual input device transmits a command to set the virtual button belonging to the representative virtual button to the display state to the display state switching unit, and the display state switching unit sets the virtual button to the display state. When the button corresponding to the coordinates is a virtual button, the control unit checks the pressed state of the button attached to the instruction unit, and when the virtual button is pressed, the control unit performs the process assigned to the virtual button. Run. If there is no button corresponding to the coordinates, it is checked whether a representative virtual button in the selected state exists. If present, the control unit puts the representative virtual button into a non-selected state, and puts the virtual buttons belonging to the representative virtual button into a non-display state with respect to the display state switching unit. Accordingly, an object of the present invention is to provide a software keyboard that is easy to learn with a list property even with a small display area.

特開２０００−１４８７５５号公報JP 2000-148755 A 特開２００７−１１５１３９号公報JP 2007-115139 A 特開２００４−１０２６８１号公報JP 2004-102681 A 特開２０００−１８１６０８号公報JP 2000-181608 A

ところで、長文の解析には、どの単語がどの単語に係るか、どの表現がどの表現にかかるのか、理解に時間がかかることが多い。しかし、上記のような従来の意味解析や表示方法は、大局的にフレーズを捉えるためには利用できるかもしれないが、個々の単語レベルの係り受けなどに関しての表示はできない。よって、解析結果として意味のある単語が解読に必要な単語か否か判断ができず、適切な解読支援の為の表示が行なえないことがある。 By the way, in the analysis of long sentences, it often takes time to understand which word is related to which word and which expression is related to which expression. However, the conventional semantic analysis and display methods as described above may be used to capture phrases globally, but cannot display the dependency on individual word levels. Therefore, it may not be possible to determine whether a meaningful word is a word necessary for decoding as a result of analysis, and display for appropriate decoding support may not be performed.

そこで、長文であっても、解析結果を簡易に表示することが可能な言語処理装置、言語処理方法、及びプログラムを提供することを目的とする。 Therefore, an object of the present invention is to provide a language processing apparatus, a language processing method, and a program that can easily display an analysis result even for a long sentence.

ひとつの態様である言語処理装置は、句判定部と、解析部と、拡張部と、表示処理部と、表示部とを有する。句判定部は、入力された文に含まれる複数の単語を抽出して前記文の意味を解析した解析結果に基づき、複数の前記単語の夫々について、前記単語を含み一つの意味を成す句があるか否かを判定し、ある場合には前記句を出力する。解析部は、前記文の形態素解析を行い、前記形態素解析に基づき、前記文の形態素同士の関係から前記文の構文解析を行い、前記構文解析の結果に基づき、複数の前記単語のうちの互いに関係する２つの単語同士の意味上の関係、及び、複数の前記単語の夫々と前記句判定部より出力された句において主たる意味を保持する単語との意味上の関係を示す関係情報を生成する。拡張部は、前記関係情報に基づき、前記単語または前記句を前後の単語または句とつなげた別の句として表示させるか否かの判別を、前記関係情報と前記別の句として表示させることの可否との関係が予め定義されている拡張情報に従って行う。表示処理部は、前記別の句として表示させると判別された前記単語または句を一つの句としてまとめる。表示部は、前記文の意味を解析した前記解析結果及び前記表示処理部での処理の結果に基づき、前記文の中心概念として解析された語句、前記表示処理部によってまとめられた句、及び、該語句と該句との意味上の関係を表す前記関係情報を表示する。 A language processing apparatus according to one aspect includes a phrase determination unit, an analysis unit, an expansion unit, a display processing unit, and a display unit. The phrase determining unit extracts a plurality of words included in the input sentence and analyzes the meaning of the sentence, and for each of the plurality of words, a phrase including the word includes a single meaning. It is determined whether or not there is, and if there is, the phrase is output. The analysis unit performs a morphological analysis of the sentence, performs a syntax analysis of the sentence based on a relationship between the morphemes of the sentence based on the morphological analysis, and based on a result of the syntax analysis, Generates relationship information indicating a semantic relationship between two related words and a semantic relationship between each of a plurality of the words and a word having a main meaning in the phrase output from the phrase determination unit. . The extension unit displays, as the relationship information and the separate phrase, whether or not to display the word or the phrase as another phrase connected to the preceding or following word or phrase based on the relationship information. This is performed according to extension information in which the relationship with availability is defined in advance . Display processing unit summarizes the word or phrase that is determined to be displayed as the specific phrase as a phrase. The display unit, based on the analysis result of analyzing the meaning of the sentence and the result of processing in the display processing unit, the words and phrases analyzed as the central concept of the sentence , the phrases summarized by the display processing unit, and The relationship information representing the semantic relationship between the phrase and the phrase is displayed.

別の態様である言語処理方法は、言語処理装置が、入力された文に含まれる複数の単語を抽出して前記文の意味を解析した解析結果に基づき、複数の前記単語の夫々について、前記単語を含み一つの意味を成す句があるか否かを判定し、ある場合には前記句を出力する。また、言語処理装置は、前記文の形態素解析を行う。さらに、言語処理装置は、前記形態素解析に基づき、前記文の形態素同士の関係から前記文の構文解析を行う。また、言語処理装置は、前記構文解析の結果に基づき、複数の前記単語のうちの互いに関係する２つの単語同士の意味上の関係、及び、複数の前記単語の夫々と前記出力された句において主たる意味を保持する単語との意味上の関係を示す関係情報を生成する。さらに、言語処理装置は、前記関係情報に基づき、前記単語または前記句を前後の単語または句とつなげた別の句として表示させるか否かの判別を、前記関係情報と前記別の句として表示させることの可否との関係が予め定義されている拡張情報に従って行う。また、言語処理装置は、前記別の句として表示させると判別された前記単語または句を一つの句としてまとめる。さらに、言語処理装置は、前記文の意味を解析した前記解析結果及び前記一つの句としてまとめる処理の結果に基づき、前記文の中心概念として解析された語句、前記まとめられた句、及び、該語句と該句との意味上の関係を表す前記関係情報を表示する。 Language processing method according to another aspect, language processing apparatus extracts a plurality of words included in the input sentence based on the results analysis to analyze the meaning of the sentence, for each of a plurality of said word, said It is determined whether or not there is a phrase that includes a word and has one meaning, and if there is, the phrase is output. The language processing apparatus performs morphological analysis of the sentence. Furthermore, the language processing apparatus performs syntax analysis of the sentence from the relationship between the morphemes of the sentence based on the morphological analysis. In addition, the language processing device , based on the result of the syntax analysis, in the semantic relationship between two words that are related to each other among the plurality of words, and each of the plurality of the words in the output phrase Relation information indicating a semantic relation with a word having a main meaning is generated. Further, the language processing device displays, as the relation information and the another phrase, whether or not to display the word or the phrase as another phrase connected to the preceding or following word or phrase based on the relation information. This is performed according to the extension information in which the relationship with whether or not to be allowed is defined in advance . Further, the language processing unit summarizes the word or phrase that is determined to be displayed as the specific phrase as a phrase. Further, the language processing unit, based on the results of collectively Ru processing as the analysis result and the one phrase meaning analyzed the sentence, phrase parsed as central idea of the sentence, the gathered phrase and, The relationship information representing the semantic relationship between the phrase and the phrase is displayed.

なお、上述した態様に係る方法をコンピュータに行わせるためのプログラムであっても、このプログラムを当該コンピュータによって実行させることにより、上述した態様に係る方法と同様の作用効果を奏するので、前述した課題が解決される。 In addition, even if it is a program for making a computer perform the method which concerns on the aspect mentioned above, since there exists an effect similar to the method which concerns on the aspect mentioned above by making this program run by the said computer, the subject mentioned above Is resolved.

上述した態様によれば、長文であっても、解析結果を簡易に表示することが可能な言語処理装置、言語処理方法、及びプログラムを提供することができる。 According to the above-described aspect, it is possible to provide a language processing device, a language processing method, and a program that can easily display an analysis result even for a long sentence.

言語処理装置の機能の一例を示すブロック図である。It is a block diagram which shows an example of the function of a language processing apparatus. グラフ表現の一例を示す図である。It is a figure which shows an example of graph expression. グラフ表現における読解支援の一例を示す図である。It is a figure which shows an example of the reading assistance in a graph expression. グラフ表現の別の例を示す図である。It is a figure which shows another example of graph expression. ノードの縮退を示す図である。It is a figure which shows degeneration of a node. 入力文の表示例を示す図である。It is a figure which shows the example of a display of an input sentence. 形態素解析情報の一例を示す図である。It is a figure which shows an example of morphological analysis information. 品詞記号情報の一例を示す図である。It is a figure which shows an example of part of speech symbol information. 構文解析情報の一例を示す図である。It is a figure which shows an example of parsing information. 意味解析情報の一例を示す図である。It is a figure which shows an example of semantic analysis information. 拡張情報の一例を示す図である。It is a figure which shows an example of extended information. 解析結果の初期表示の一例を示す図である。It is a figure which shows an example of the initial display of an analysis result. 解析結果の初期表示の別の例を示す図である。It is a figure which shows another example of the initial display of an analysis result. グラフの展開例を示す図である。It is a figure which shows the example of expansion | deployment of a graph. グラフの別の展開例を示す図である。It is a figure which shows another example of expansion | deployment of a graph. グラフの別の展開例を示す図である。It is a figure which shows another example of expansion | deployment of a graph. 入力文文字列とノードとの対応を計算する処理を示す図である。It is a figure which shows the process which calculates a response | compatibility with an input sentence character string and a node. 名詞句及び動詞句の判定処理の一例を示す図である。It is a figure which shows an example of the determination process of a noun phrase and a verb phrase. 特定の文字列を含む名詞句の大きさの判定を示す図である。It is a figure which shows determination of the magnitude | size of the noun phrase containing a specific character string. ノードを名詞句に拡張した場合の表示例を示す図である。It is a figure which shows the example of a display at the time of expanding a node to a noun phrase. 入力文において、ユーザが特定の文字列を選択した場合の表示例を示す図である。It is a figure which shows the example of a display when a user selects a specific character string in an input sentence. 入力文において、ユーザが特定の文字列を選択した場合の別の表示例を示す図である。It is a figure which shows another example of a display when a user selects a specific character string in an input sentence. ノードの拡張の一例を示す図である。It is a figure which shows an example of expansion of a node. ノードの拡張の一例を示す図である。It is a figure which shows an example of expansion of a node. グラフ表現の表示例をノードの拡張により簡略化した例を示す図である。It is a figure which shows the example which simplified the example of a display of graph expression by expansion of the node. グラフ表現の一例を示す図である。It is a figure which shows an example of graph expression. 言語処理装置の動作を示すメインフローチャートである。It is a main flowchart which shows operation | movement of a language processing apparatus. ノード展開及び縮退処理を示すフローチャートである。It is a flowchart which shows a node expansion | deployment and degeneracy process. ノード拡張処理を示すフローチャートである。It is a flowchart which shows a node expansion process. ノード拡張判定処理を示すフローチャートである。It is a flowchart which shows a node expansion determination process. 表示方法の変形例を示す図である。It is a figure which shows the modification of a display method. 表示方法の変形例を示す図である。It is a figure which shows the modification of a display method. 文字列選択時の表示例を示す図である。It is a figure which shows the example of a display at the time of the character string selection. 文字列選択時の表示例を示す図である。It is a figure which shows the example of a display at the time of the character string selection. 標準的なコンピュータのハードウエア構成を示す図である。It is a figure which shows the hardware constitutions of a standard computer.

（第１の実施の形態）
以下、図面を参照しながら、第１の実施の形態による言語処理装置１について説明する。図１は、言語処理装置１の機能の一例を示すブロック図である。言語処理装置１は、入力された文を解析し、文の意味及び構成を簡易な表現で表示する装置である。 (First embodiment)
The language processing apparatus 1 according to the first embodiment will be described below with reference to the drawings. FIG. 1 is a block diagram illustrating an example of functions of the language processing device 1. The language processing device 1 is a device that analyzes an input sentence and displays the meaning and structure of the sentence in a simple expression.

図１に示すように、言語処理装置１は、入力部３、解析部５、名詞句判定部１３、動詞句判定部１５、ノード拡張部１７、句表示最適化部１９、対応計算部２１、表示制御部２３、解析結果記憶部２５、および表示部２７を有している。解析部５は、形態素解析部７、構文解析部９、及び意味解析部１１を有している。 As shown in FIG. 1, the language processing device 1 includes an input unit 3, an analysis unit 5, a noun phrase determination unit 13, a verb phrase determination unit 15, a node expansion unit 17, a phrase display optimization unit 19, a correspondence calculation unit 21, A display control unit 23, an analysis result storage unit 25, and a display unit 27 are provided. The analysis unit 5 includes a morphological analysis unit 7, a syntax analysis unit 9, and a semantic analysis unit 11.

入力部３は、例えば、キーボードやタッチパネルなどの入力装置である。解析部５は、入力された文を解析し、解析結果を出力する。解析部５において、形態素解析部７は、入力された文の形態素解析を行い、形態素解析情報１５０を出力する。構文解析部９は、形態素解析情報１５０に基づき構文解析を行い、構文解析情報１８０を出力する。意味解析部１１は、形態素解析情報１５０、構文解析情報１８０に基づき意味解析を行い、意味解析情報２００を出力する。 The input unit 3 is an input device such as a keyboard or a touch panel. The analysis unit 5 analyzes the input sentence and outputs an analysis result. In the analysis unit 5, the morpheme analysis unit 7 performs morpheme analysis of the input sentence and outputs morpheme analysis information 150. The syntax analysis unit 9 performs syntax analysis based on the morphological analysis information 150 and outputs syntax analysis information 180. The semantic analysis unit 11 performs semantic analysis based on the morphological analysis information 150 and the syntax analysis information 180 and outputs the semantic analysis information 200.

名詞句判定部１３は、構文解析情報１８０に基づき、名詞句を構成する語句を判定する。動詞句判定部１５は、構文解析情報１８０に基づき、動詞句を構成する語句を判定する。ノード拡張部１７は、名詞句判定部１３、動詞句判定部１５の判定結果に基づき、まとめて表示する語句を判定する。句表示最適化部１９は、ノード拡張部１７でまとめて表示すると判定された語句を、一つの語句にまとめる処理を行う。対応計算部２１は、入力部３、名詞句判定部１３、動詞句判定部１５、ノード拡張部１７、句表示最適化部１９からの出力に応じて、後述する入力文のグラフ表現のための計算処理、入力文の文字列とグラフ表現におけるノードとの対応付けのための計算処理等を行う。 The noun phrase determination unit 13 determines words constituting the noun phrase based on the syntax analysis information 180. The verb phrase determination unit 15 determines the phrases constituting the verb phrase based on the syntax analysis information 180. The node expansion unit 17 determines words to be displayed together based on the determination results of the noun phrase determination unit 13 and the verb phrase determination unit 15. The phrase display optimizing unit 19 performs processing for grouping phrases determined to be displayed together by the node expanding unit 17 into one phrase. The correspondence calculation unit 21 is used for graph expression of an input sentence, which will be described later, in accordance with outputs from the input unit 3, noun phrase determination unit 13, verb phrase determination unit 15, node expansion unit 17, and phrase display optimization unit 19. Calculation processing, calculation processing for associating a character string of an input sentence with a node in a graph expression, and the like are performed.

表示制御部２３は、解析部５、名詞句判定部１３、動詞句判定部１５、ノード拡張部１７、句表示最適化部１９等の出力に基づき、表示部２７の制御を行う。表示部２７は、例えば、液晶表示装置などの表示装置である。解析結果記憶部２５は、形態素解析情報１５０、構文解析情報１８０、意味解析情報２００等を記憶する記憶装置である。 The display control unit 23 controls the display unit 27 based on outputs from the analysis unit 5, noun phrase determination unit 13, verb phrase determination unit 15, node expansion unit 17, phrase display optimization unit 19, and the like. The display unit 27 is a display device such as a liquid crystal display device, for example. The analysis result storage unit 25 is a storage device that stores morphological analysis information 150, syntax analysis information 180, semantic analysis information 200, and the like.

図２から図５を参照しながら、本実施の形態において用いる単語間の関係のグラフ表現を説明する。図２は、グラフ表現の一例を示す図である。図２に示すように、表現例３０は「太郎は明日学校へ行く」という文をグラフ表現した例である。ここでグラフ表現とは、文を構成する単語の概念を「ノード」で示し、単語と単語との関係を、矢印の方向とその関係の内容を示す「アーク」で表すものである。ノードで示される概念は本来は単語に相当する意味記号で示されるが、本発明においてはユーザへわかりやすく表示させるという目的から、概念ではなく単語（またはその単語を主辞とする文節あるいは句）そのもので示すものとする。 A graph representation of the relationship between words used in the present embodiment will be described with reference to FIGS. FIG. 2 is a diagram illustrating an example of a graph expression. As shown in FIG. 2, the expression example 30 is an example in which a sentence “Taro goes to school tomorrow” is expressed in a graph. Here, the graph expression indicates the concept of words constituting a sentence by “nodes”, and the relationship between words is expressed by “arc” indicating the direction of the arrow and the content of the relationship. The concept represented by the node is originally represented by a semantic symbol corresponding to the word. However, in the present invention, for the purpose of easily displaying it to the user, it is not a concept but a word (or a phrase or phrase having the word as the main word) itself. It shall be indicated by

「太郎は明日学校へ行く」という文には、「太郎」「明日」「学校」「行」という概念があり、対応するノード３２〜３８が設定される。例えば、表現例３０において、ノード３２の「行」に着目すると、ノード３８の「学校」に向かって「目標」というアーク４４で、単語間の関係が表されている。アーク４４は、「学校」は「行」の目標（行き先）を表すということを示している。ノード３２の「行」からノード３６の「明日」に出ている「時間」というアーク４２は、「明日」が「行」の時間を表すことを示している。同様に、ノード３２の「行」からノード３４の「太郎」に出ているアーク４０の「動作主」は、「太郎」が「行」の動作の主体であることを表す。以下、ノードが表す概念を、ノードの内容といい、アークが示す単語間の関係を、アークの内容という。 The sentence “Taro will go to school tomorrow” has the concept of “Taro”, “Tomorrow”, “School”, and “Row”, and corresponding nodes 32 to 38 are set. For example, in the expression example 30, when attention is paid to the “row” of the node 32, the relationship between words is represented by an arc 44 of “target” toward the “school” of the node 38. The arc 44 indicates that “school” represents the goal (destination) of “line”. An arc 42 “time” appearing from “row” of node 32 to “tomorrow” of node 36 indicates that “tomorrow” represents the time of “row”. Similarly, the “actor” of the arc 40 appearing from the “row” of the node 32 to the “taro” of the node 34 represents that “taro” is the subject of the “row” operation. Hereinafter, the concept represented by the node is referred to as the node content, and the relationship between the words indicated by the arc is referred to as the arc content.

図３は、グラフ表現における読解支援の一例を示す図である。図３に示すように、例えば、文５２において、選択部５４の「学校」が、ユーザにより矢印５６のように入力部３を介して選択された場合、ノード３８は、選択ノード３７のように、例えばハイライト表示にすることにより、読解支援を行う。このとき、選択は、表現例３０のノード３８が、タッチパネルなどにより選択５８のように選択された場合に、ハイライト表示を行うようにしてもよい。 FIG. 3 is a diagram illustrating an example of reading support in the graph expression. As shown in FIG. 3, for example, in the sentence 52, when “school” of the selection unit 54 is selected by the user via the input unit 3 as indicated by an arrow 56, the node 38 is like the selection node 37. For example, reading is supported by highlighting. At this time, the selection may be performed when the node 38 of the expression example 30 is selected like the selection 58 by a touch panel or the like.

図４は、グラフ表現の別の例を示す図である。表現例６０は、「太郎は花子が行く学校へ明日行く」の構造をグラフ表現した例である。このように、修飾語句などがある場合、グラフが複雑になっていく。表現例６０では、「行」「太郎」「明日」「学校」「行」「花子」を内容とするノード３２〜３４、ノード６２、６４、およびそれらの関係を示すアーク４０〜４４、６６、６８が示されている。 FIG. 4 is a diagram illustrating another example of the graph expression. Expression example 60 is an example in which the structure of “Taro goes to school where Hanako goes tomorrow” is expressed in a graph. Thus, when there are modifiers, the graph becomes complicated. In the expression example 60, nodes 32 to 34, nodes 62 and 64 having contents of “row”, “taro”, “tomorrow”, “school”, “row”, and “Hanako”, and arcs 40 to 44, 66 indicating their relationship, 68 is shown.

しかし、表現例６０のように、全てのノードを展開して表示すると、表現が複雑になり、読解を支援するという目的は達成できない。このため、情報が多いノードに関しては縮退することができる。なお、ノードを展開するとは、そのノードに繋がった複数のノードを別々に表示することをいい、ノードを縮退するとは、ある一つのノードとそのノードに繋がった少なくとも一つのノードとを、その一つのノードで代表して表示させることをいう。このとき、代表して表示されたノードを縮退ノードという。 However, if all the nodes are expanded and displayed as in the expression example 60, the expression becomes complicated and the purpose of supporting reading comprehension cannot be achieved. For this reason, a node with a lot of information can be degenerated. Note that expanding a node means displaying a plurality of nodes connected to the node separately, and degenerating a node means that one node and at least one node connected to the node are It means to display on behalf of one node. At this time, the node displayed as a representative is called a degenerate node.

図５は、ノードの縮退を示す図である。表現例７０は、「花子が行く学校」を表す構造を、「学校」に縮退した例である。「学校」は、例えば、ハッチングや色などを変えた縮退ノード３９で表示され、縮退されたノードであることを表している。ここで「学校」を内容とする縮退ノード３９を、例えばタッチパネルや、クリック操作などによりユーザが選択すれば、「学校」の縮退ノード３９が展開され、例えば、ノード６２（「行」）が展開されて表示される。このように、ユーザが深く読みたいノードを選択して展開していけば、読解支援をすることが可能となる。 FIG. 5 is a diagram illustrating node degeneration. The expression example 70 is an example in which the structure representing “the school where Hanako goes” is reduced to “school”. “School” is, for example, displayed by a reduced node 39 in which hatching, color, or the like is changed, and represents a reduced node. Here, if the user selects the reduced node 39 having “school” as a content by, for example, a touch panel or a click operation, the reduced node 39 of “school” is expanded, for example, node 62 (“row”) is expanded. Displayed. In this way, if a user selects and expands a node that he / she wants to read deeply, reading support can be provided.

次に、図６から図２９を参照しながら、別の入力文を例として、第１の実施の形態による言語処理方法についてさらに説明する。図６は、入力文８０の表示例を示す図である。入力文８０は、図２から図５を参照しながら説明した例より、少し複雑な文となっている。 Next, the language processing method according to the first embodiment will be further described with reference to FIGS. 6 to 29, taking another input sentence as an example. FIG. 6 is a diagram illustrating a display example of the input sentence 80. The input sentence 80 is a slightly more complicated sentence than the example described with reference to FIGS.

図６に示すように、入力文８０は、以下の文である。「本技術は、校正前文書を所定の単位で区切った各要素が校正前文書において共に出現する関係である共起関係と、校正後文書を所定の単位で区切った各要素が校正後文書において共に出現する関係である共起関係とを格納した共起記憶部に基づいて、校正前文書における各要素の共起関係の出現頻度及び校正後文書における各要素の共起関係の出現頻度をそれぞれ算出する。」以下、この入力文８０を解析する場合について説明する。 As shown in FIG. 6, the input sentence 80 is the following sentence. “This technology uses a co-occurrence relationship in which each element that delimits the document before proofreading in a predetermined unit appears in the document before proofreading, and each element that delimits the document after proofreading in a predetermined unit in the document after proofreading. Based on the co-occurrence storage unit that stores the co-occurrence relationship that is the relationship that appears together, the frequency of occurrence of the co-occurrence relationship of each element in the document before proofreading and the frequency of occurrence of the co-occurrence relationship of each element in the document after proofreading, respectively Hereinafter, a case where the input sentence 80 is analyzed will be described.

図７Ａは、形態素解析情報１５０の一例を示す図である。形態素解析とは、予め解析結果記憶部２５などに備えられた辞書等を参考にして、文を構成する形態素を解析する処理である。形態素解析情報１５０は、入力文８０の形態素解析結果の一部を示す例であり、例えば、形態素解析部７で生成される。形態素解析情報１５０は、形態素番号１５２、形態素１５４、開始位置１５６、文字長１５８、品詞情報１６０、意味記号１６２を示している。形態素番号１５２は、形態素の識別番号であり、形態素１５４は、形態素の文字列である。開始位置１５６は、入力文における形態素の開始位置までの文字数であり、文字長１５８は、形態素の文字列の文字数である。品詞情報１６０は、形態素の品詞を示す記号であり、意味記号１６２は、形態素の意味を示す記号である。 FIG. 7A is a diagram illustrating an example of the morphological analysis information 150. The morpheme analysis is a process of analyzing morphemes constituting a sentence with reference to a dictionary or the like provided in advance in the analysis result storage unit 25 or the like. The morpheme analysis information 150 is an example showing a part of the morpheme analysis result of the input sentence 80, and is generated by the morpheme analysis unit 7, for example. The morpheme analysis information 150 indicates a morpheme number 152, a morpheme 154, a start position 156, a character length 158, part of speech information 160, and a semantic symbol 162. The morpheme number 152 is a morpheme identification number, and the morpheme 154 is a morpheme character string. The start position 156 is the number of characters up to the start position of the morpheme in the input sentence, and the character length 158 is the number of characters of the morpheme character string. The part of speech information 160 is a symbol indicating the part of speech of the morpheme, and the semantic symbol 162 is a symbol indicating the meaning of the morpheme.

例えば、形態素番号１５２＝「１」は、形態素１５４＝「本」である。この形態素の開始位置１５６＝「０」であり、入力文８０の先頭が開始位置であることを示している。文字長１５８＝「１」、品詞情報１６０＝「ＤＥＴ」、意味記号１６２＝「ＴＨＩＳ」である。 For example, the morpheme number 152 = “1” is the morpheme 154 = “book”. This morpheme start position 156 = "0", indicating that the head of the input sentence 80 is the start position. Character length 158 = “1”, part-of-speech information 160 = “DET”, and semantic symbol 162 = “THIS”.

図７Ｂは、品詞記号情報１７０の一例を示す図である。品詞記号情報１７０は、形態素の品詞を表す記号を示す情報であり、例えばあらかじめ解析結果記憶部２５に記憶されている。図７Ｂに示すように、例えば、品詞記号１７２＝「ＤＥＴ」は、冠詞を表し、「Ｎ」は、名詞を表し、「Ｊ」は、助詞を表す。よって、図７Ａの「本」は、「冠詞」と判別されたということである。 FIG. 7B is a diagram illustrating an example of the part of speech symbol information 170. The part-of-speech symbol information 170 is information indicating a symbol representing a part-of-speech morpheme, and is stored in advance in the analysis result storage unit 25, for example. As shown in FIG. 7B, for example, the part of speech symbol 172 = “DET” represents an article, “N” represents a noun, and “J” represents a particle. Therefore, “book” in FIG. 7A is determined to be “article”.

図８は、構文解析情報１８０の一例を示す図である。構文解析とは、予め解析結果記憶部２５などに備えられた辞書等を参考にして、文を構成する形態素同士を、意味を成す句にまとめることにより互いの関係を解析する処理である。図８に示すように、構文解析情報１８０は、構文解析の結果の一例を示す情報であり、構文解析部９で生成される。構文解析情報１８０は、構成要素１８２、品詞１８４、ラベル１８６、主辞１８８を有している。構成要素１８２は、形態素解析情報１５０で解析された形態素番号１５２、および、ラベル１８６を示す。品詞１８４は、構成要素１８２の品詞を示す。ラベル１８６は、構成要素１８２に付された名称である。主辞１８８は、構成要素１８２の中での主たる意味を保持する単語を示す形態素番号１５２である。 FIG. 8 is a diagram illustrating an example of the syntax analysis information 180. The syntax analysis is a process of analyzing the relationship between morphemes constituting a sentence into meaningful phrases with reference to a dictionary or the like provided in advance in the analysis result storage unit 25 or the like. As shown in FIG. 8, the syntax analysis information 180 is information indicating an example of the result of syntax analysis, and is generated by the syntax analysis unit 9. The parsing information 180 includes a component 182, a part of speech 184, a label 186, and a main word 188. The component 182 indicates the morpheme number 152 analyzed by the morpheme analysis information 150 and the label 186. The part of speech 184 indicates the part of speech of the component 182. The label 186 is a name given to the component 182. The main word 188 is a morpheme number 152 indicating a word holding the main meaning in the component 182.

図９は、意味解析情報２００の一例を示す図である。意味解析とは、予め解析結果記憶部２５などに備えられた辞書等を参考にして、構文解析結果から、単語の意味記号同士の関係を解析する処理である。図９に示すように、意味解析情報２００は、意味解析の結果の一例を示す情報であり、意味解析部１１で生成される。意味解析情報２００は、出発ノード２０２、到達ノード２０４、アーク２０６を有している。また、出発ノード２０２には、形態素番号２０８、意味記号２１０、形態素２１２が示され、到達ノード２０４には、形態素番号２１４、意味記号２１６、形態素２１８が示されている。出発ノード２０２は、アークが出るノード、到達ノード２０４は、アークが向かうノードである。形態素番号２０８は、ノードの内容に対応する形態素の識別番号である。意味記号２１０は、形態素の意味を示す記号である。形態素２１２は、出発ノード２０２の内容に対応する形態素である。到達ノード２０４の形態素番号２１４、意味記号２１６、形態素２１８についても同様である。 FIG. 9 is a diagram illustrating an example of the semantic analysis information 200. Semantic analysis is a process of analyzing the relationship between semantic symbols of words from a syntax analysis result with reference to a dictionary or the like provided in advance in the analysis result storage unit 25 or the like. As shown in FIG. 9, the semantic analysis information 200 is information indicating an example of the result of semantic analysis, and is generated by the semantic analysis unit 11. The semantic analysis information 200 includes a departure node 202, a reaching node 204, and an arc 206. The departure node 202 shows a morpheme number 208, a semantic symbol 210, and a morpheme 212, and the destination node 204 shows a morpheme number 214, a semantic symbol 216, and a morpheme 218. The departure node 202 is a node where the arc is generated, and the reaching node 204 is a node where the arc is directed. The morpheme number 208 is an identification number of the morpheme corresponding to the contents of the node. The meaning symbol 210 is a symbol indicating the meaning of the morpheme. The morpheme 212 is a morpheme corresponding to the contents of the departure node 202. The same applies to the morpheme number 214, the semantic symbol 216, and the morpheme 218 of the destination node 204.

意味解析情報２００は、入力文８０の一部である「格納した共起記憶部に基づいて」という文字列を解析した例である。意味解析情報２００において、例えば、出発ノード２０２の形態素番号２０８＝「６５」は、意味記号２１０＝「ＢＡＳＥＤ」であり、形態素２１２＝「基づく」である。これに対して、到達ノード２０４の形態素番号２１４＝「６４」は、意味記号２１６＝「ＰＡＲＴ」であり、形態素２１８＝「部」である。このときのアーク２０６＝「修飾」であり、「部」は、「基づく」を「修飾」していることを示している。この解析結果により、例えば、「基づく」を内容とするノードから、「部」を内容とするノードに向かう「修飾」を内容とするアークが出るグラフが作成される。他の語句についても同様にして、入力文８０を示すグラフが作成される。 The semantic analysis information 200 is an example in which a character string “based on a stored co-occurrence storage unit” that is a part of the input sentence 80 is analyzed. In the semantic analysis information 200, for example, the morpheme number 208 = “65” of the departure node 202 is the semantic symbol 210 = “BASED” and the morpheme 212 = “based”. On the other hand, the morpheme number 214 = “64” of the reaching node 204 is the semantic symbol 216 = “PART”, and the morpheme 218 = “part”. At this time, the arc 206 = “modification”, and “part” indicates that “based” is “modified”. Based on the analysis result, for example, a graph is created in which an arc having “modification” as a content is directed from a node having “based” as content to a node having “part” as content. Similarly, a graph indicating the input sentence 80 is created for other words.

図１０は、拡張情報２３０の一例を示す図である。拡張情報２３０は、アークの内容と、そのアークで接続された２つのノードをひとまとめにする（以下、ノードをひとまとめにすることを、ノードを拡張するという）か否かの関係を示す情報であり、予め例えば解析結果記憶部２５に記憶される。 FIG. 10 is a diagram illustrating an example of the extended information 230. The extension information 230 is information indicating the relationship between the contents of the arc and whether or not two nodes connected by the arc are grouped together (hereinafter, grouping the nodes together is referred to as node expansion). For example, it is stored in advance in the analysis result storage unit 25.

拡張情報２３０は、アーク２３２、及び種類２３４を有している。アーク２３２は、アークの内容を示し、種類２３４は、アークに繋がるノード同士が拡張可能であるか否かの種類を示している。図１０に示すように、例えば、アーク２３２（「修飾」）は、種類２３４＝「拡張」であり、「修飾」の関係にある２つのノードは、拡張可能であることを示している。 The extended information 230 has an arc 232 and a type 234. The arc 232 indicates the content of the arc, and the type 234 indicates the type of whether or not nodes connected to the arc can be expanded. As illustrated in FIG. 10, for example, the arc 232 (“modification”) has the type 234 = “expansion”, and indicates that two nodes in the “modification” relationship can be expanded.

次に、図１１〜図１５を参照しながら、入力文８０の解析結果の表示例について説明する。図１１は、解析結果の初期表示の一例を示す図である。図１１に示すように、この例では、表示例８２が示されている。表示例８２は、表示例８６とグラフ８３とを含んでいる。グラフ８３は、入力文８０のグラフ表現であり、一つの縮退ノード８４が表示されている。表示例８６は、入力文８０の文章表示であり、「算出する」が強調表示８８されている。「算出する」は、入力文８０に関して解析部５により解析を行うことにより、この入力文８０の中心概念として解析された語句である。 Next, display examples of the analysis result of the input sentence 80 will be described with reference to FIGS. FIG. 11 is a diagram illustrating an example of an initial display of analysis results. As shown in FIG. 11, in this example, a display example 82 is shown. The display example 82 includes a display example 86 and a graph 83. The graph 83 is a graph representation of the input sentence 80, and one degenerate node 84 is displayed. The display example 86 is a sentence display of the input sentence 80, and “Calculate” is highlighted 88. “Calculate” is a phrase that is analyzed as a central concept of the input sentence 80 by analyzing the input sentence 80 by the analysis unit 5.

図１２は、解析結果の初期表示の別の例を示す図である。図１２に示すように、この例では、表示例９０が示されている。表示例９０は、表示例９１とグラフ９２とを含んでいる。グラフ９２は、入力文８０のグラフ表現であり、縮退ノード８４が展開された例を示している。グラフ９２では、縮退ノード８４が展開されて、ノード８５の表示に変わっている。ノード８５には、縮退ノード９２がアーク１００で接続され、縮退ノード９４が、アーク１０２で接続され、縮退ノード９６が、アーク１０４で接続され、縮退ノード９８が、アーク１０６で接続されている。 FIG. 12 is a diagram illustrating another example of the initial display of the analysis result. As shown in FIG. 12, a display example 90 is shown in this example. The display example 90 includes a display example 91 and a graph 92. A graph 92 is a graph representation of the input sentence 80 and shows an example in which the degenerate node 84 is expanded. In the graph 92, the degenerated node 84 is expanded and changed to the display of the node 85. A degeneration node 92 is connected to the node 85 by an arc 100, a degeneration node 94 is connected by an arc 102, a degeneration node 96 is connected by an arc 104, and a degeneration node 98 is connected by an arc 106.

グラフ９２において、縮退ノード９２の内容は「技術」であり、縮退ノード９４の内容は「基づく」であり、縮退ノード９８の内容は「出現頻度」であり、縮退ノード９８の内容は「それぞれ」である。アーク１００の内容は「動作主」、アーク１０２の内容は「修飾」、アーク１０４の内容は「対象」、アーク１０６の内容は「方法」である。グラフ９２におけるノード及びアークの内容、及び後述する他のノード及びアークの内容は、形態素解析情報１５０、構文解析情報１８０、意味解析情報２００により解析されたものである。 In the graph 92, the contents of the degeneration node 92 are “technology”, the contents of the degeneration node 94 are “based”, the contents of the degeneration node 98 are “appearance frequency”, and the contents of the degeneration node 98 are “respectively”. It is. The content of the arc 100 is “operator”, the content of the arc 102 is “modification”, the content of the arc 104 is “target”, and the content of the arc 106 is “method”. The contents of the nodes and arcs in the graph 92 and the contents of other nodes and arcs to be described later are analyzed by the morphological analysis information 150, the syntax analysis information 180, and the semantic analysis information 200.

表示例９１は、文章表現であり、強調表示８８に加え、強調表示９３として「本技術は」、強調表示９５として「に基づいて」、強調表示９７として「出現頻度をそれぞれ」という語句が表示されている。これらの強調表示９３〜９７は、ノード９２〜９８に対応している。 The display example 91 is a sentence expression. In addition to the highlight display 88, the words “this technology” is displayed as the highlight display 93, “based on” as the highlight display 95, and “appearance frequency” are displayed as the highlight display 97, respectively. Has been. These highlights 93 to 97 correspond to the nodes 92 to 98.

なお、グラフ９２において、ノード８５のような、アークが出ているノードを選択すると、ノードが縮退され、縮退ノードの表示に戻る。この例では、ノード８５が、縮退ノード８４の表示に戻り、接続されたアーク１００〜１０６及びノード９２〜９８は表示されなくなる。また、逆に、縮退ノードが選択された場合には、例えば、縮退ノードから直接出ているアークとそれにつながるノードが表示される。 In the graph 92, when a node with an arc such as the node 85 is selected, the node is degenerated and the display returns to the degenerated node display. In this example, the node 85 returns to the display of the degenerated node 84, and the connected arcs 100 to 106 and nodes 92 to 98 are not displayed. On the other hand, when a degenerate node is selected, for example, an arc directly coming out from the degenerate node and a node connected thereto are displayed.

図１３は、グラフの展開例を示す図である。図１３に示すように、グラフ１１０は、グラフ９２において、縮退ノード９６（「出現頻度」）を選択した状態である。グラフ１１０では、縮退ノード９６が展開されてノード９７の表示になり、直接出ているアーク１２０及びそれにつながる縮退ノード１１４、アーク１１８及び、それにつながるノード１１２が示されている。グラフ１１０では、ノード１１２から出るアーク１２２および縮退ノード１１６が示されている。このように、縮退ノード９６は、例えば展開例１２６のように展開される。このとき、グラフ１１０では図示していないが、入力文８０において、例えば、図１１の表示例８６などのように、展開例１２６に対応する箇所を強調表示することが好ましい。 FIG. 13 is a diagram illustrating a development example of the graph. As illustrated in FIG. 13, the graph 110 is a state in which the degenerate node 96 (“appearance frequency”) is selected in the graph 92. In the graph 110, the degenerate node 96 is expanded to display the node 97, and the arc 120 directly appearing, the degenerate node 114 connected to the arc 120, the arc 118, and the node 112 connected to the arc 120 are shown. In graph 110, arc 122 and degenerate node 116 exiting node 112 are shown. In this way, the degenerate node 96 is expanded as in the expansion example 126, for example. At this time, although not shown in the graph 110, in the input sentence 80, it is preferable to highlight a portion corresponding to the expanded example 126, such as the display example 86 of FIG.

なお、グラフ１１０において、ノード１１２の内容は「出現頻度」であり、縮退ノード１１４、１１６の内容は「共起関係」である。アーク１１８の内容は「列挙」であり、アーク１２０、１２２の内容は「修飾」である。 In the graph 110, the content of the node 112 is “appearance frequency”, and the content of the degenerate nodes 114 and 116 is “co-occurrence relationship”. The contents of the arc 118 are “enumeration”, and the contents of the arcs 120 and 122 are “modification”.

図１４は、グラフの別の展開例を示す図である。図１４に示すように、グラフ１３０は、グラフ１１０において、縮退ノード９４（「基づく」）を選択した状態である。グラフ１３０では、縮退ノード９４が展開されてノード９５の表示になり、直接出ているアーク１３４及びそれにつながる縮退ノード１３２が示されている。このように、縮退ノード９４は、例えば展開例１３６のように展開される。なお、グラフ１３０において、ノード１３２の内容は「部」であり、アーク１３４の内容は「修飾」である。 FIG. 14 is a diagram illustrating another development example of the graph. As illustrated in FIG. 14, the graph 130 is a state in which the degenerate node 94 (“based”) is selected in the graph 110. In the graph 130, the degenerated node 94 is expanded to become a display of the node 95, and the arc 134 directly appearing and the degenerated node 132 connected thereto are shown. In this way, the degenerate node 94 is expanded as in the expansion example 136, for example. In the graph 130, the content of the node 132 is “part”, and the content of the arc 134 is “modification”.

またこのとき、図１３のグラフ１１０で展開されていた展開例１２６は、縮退例１３８のように縮退ノード９６に戻っている。これにより、注目したい箇所のみを詳細に表示させ、より簡易な表示としている。 At this time, the expanded example 126 expanded in the graph 110 of FIG. 13 has returned to the reduced node 96 as in the reduced example 138. As a result, only the part to be noticed is displayed in detail, and the display is simpler.

図１５は、グラフの別の展開例を示す図である。図１５のグラフ２２０は、図１４のグラフ１３０をさらに展開したグラフ表現の例である。図１５に示すように、グラフ２２０は、グラフ１３０において、縮退ノード１３２（「部」）を展開したグラフ表現である。 FIG. 15 is a diagram illustrating another development example of the graph. A graph 220 in FIG. 15 is an example of a graph expression in which the graph 130 in FIG. 14 is further developed. As illustrated in FIG. 15, the graph 220 is a graph representation in which the degenerated node 132 (“part”) is expanded in the graph 130.

グラフ２２０では、縮退ノード１３２が展開されてノード１３３の表示になり、直接出ているアーク２２５及びそれにつながる縮退ノード２２１が示されている。また、アーク２２６及びそれにつながる縮退ノード２２２、さらにそこから出るアーク２２７及びそれにつながるノード２２３が示されている。このように、縮退ノード９４は、例えば展開例２２８のように展開される。 In the graph 220, the degenerated node 132 is expanded to display a node 133, and the arc 225 directly appearing and the degenerated node 221 connected thereto are shown. Also shown are an arc 226 and a degenerate node 222 connected thereto, an arc 227 exiting therefrom and a node 223 connected thereto. In this way, the degenerate node 94 is expanded as in the expansion example 228, for example.

なお、グラフ２２０において、ノード２２２の内容は「記憶」であり、ノード２２３の内容は「共起」であり、縮退ノード２２１の内容は「格納」である。アーク２２５の内容は「動作主」であり、アーク２２６の内容は「修飾」であり、アーク２２７の内容は「対象」である。 In the graph 220, the content of the node 222 is “memory”, the content of the node 223 is “co-occurrence”, and the content of the degenerate node 221 is “storage”. The content of the arc 225 is “operator”, the content of the arc 226 is “modification”, and the content of the arc 227 is “target”.

続いて、図１６から図２３を参照しながら、第１の実施の形態による言語処理についてさらに説明する。図１６は、入力文文字列とノードとの対応を計算する処理を示す図である。図１６においては、入力文として文２４０「これはペンです。」が与えられた場合について説明する。文２４０は、解析部５により、「これ」「は」「ペン」「です」「。」という形態素２４２〜２５０に解析される。 Subsequently, the language processing according to the first embodiment will be further described with reference to FIGS. 16 to 23. FIG. 16 is a diagram showing processing for calculating the correspondence between the input sentence character string and the node. In FIG. 16, a case where a sentence 240 "This is a pen" is given as an input sentence will be described. The sentence 240 is analyzed by the analysis unit 5 into morphemes 242 to 250 of “this”, “ha”, “pen”, “is”, and “.”.

これらの形態素２４２〜２５０の中で、例えば、形態素２４２の「これ」と、形態素２４６の「ペン」とが、ノード２４３、２４７に対応付けられ、それらのノードの関係は、「＜ＩＳＡ＞」で表される。このとき、形態素２４２は、開始位置（オフセット）が「０」、文字数が「２」である。形態素２４６は、開始位置（オフセット）が「３」、文字数が「２」である。このようにして、入力された文の解析結果の各ノードについて、入力部３により受け付けられた文字列との対応が、対応計算部２１により計算される。 Among these morphemes 242-250, for example, “this” of the morpheme 242 and “pen” of the morpheme 246 are associated with the nodes 243 and 247, and the relationship between these nodes is “<ISA>”. It is represented by At this time, the morpheme 242 has a start position (offset) of “0” and the number of characters of “2”. The morpheme 246 has a start position (offset) of “3” and the number of characters of “2”. In this way, the correspondence calculation unit 21 calculates the correspondence between each node of the analysis result of the input sentence and the character string received by the input unit 3.

図１７は、名詞句及び動詞句の判定処理の一例を示す図である。図１７に示すように、句判定例２５０においては、文字列２５２「〜と・・・とを格納した共起記憶部に」について名詞句判定部１３及び動詞句判定部１５が判定処理を行う。文字列２５２の形態素毎の品詞は、品詞情報２５４で表される。この品詞情報２５４は、形態素解析部７による解析結果から得られる。 FIG. 17 is a diagram illustrating an example of determination processing for a noun phrase and a verb phrase. As shown in FIG. 17, in the phrase determination example 250, the noun phrase determination unit 13 and the verb phrase determination unit 15 perform the determination process on the character string 252 “in the co-occurrence storage unit storing“ ˜ ”and“ ... ”. . The part of speech for each morpheme in the character string 252 is represented by the part of speech information 254. This part-of-speech information 254 is obtained from the analysis result by the morphological analysis unit 7.

例えば、文字列２５２の一部の「共起記憶部」という文字列は、品詞情報２５４に示すように、「共起」「記憶」「部」という３つの名詞（Ｎ）で表される。このとき、句情報２５６に示すように、名詞句判定部１３により、「共起」と「記憶」は、ひとつの名詞句（ＮＰ）「共起記憶」を構成すると判定される。さらに、「共起記憶」と「部」とは、一つの名詞句を構成すると判定される。図１７に示す最小名詞句２７０は、「部」を含む最小名詞句である。 For example, a character string “co-occurrence storage unit” in a part of the character string 252 is represented by three nouns (N) “co-occurrence”, “memory”, and “part” as shown in the part-of-speech information 254. At this time, as shown in the phrase information 256, the noun phrase determination unit 13 determines that “co-occurrence” and “memory” constitute one noun phrase (NP) “co-occurrence memory”. Furthermore, it is determined that “co-occurrence memory” and “part” constitute one noun phrase. A minimum noun phrase 270 illustrated in FIG. 17 is a minimum noun phrase including “part”.

同様に、句情報２５６に示すように、「〜と」と「・・・と」とは、ともに名詞句である。さらに、句情報２５８に示すように、「〜と・・・と」は名詞句であると判定される。句情報２６０に示すように、「〜と・・・とを」も名詞句であると判定される。 Similarly, as indicated by the phrase information 256, “to and” and “... And” are both noun phrases. Furthermore, as shown in the phrase information 258, “to,...,” Is determined to be a noun phrase. As shown in the phrase information 260, “to and...” Are also determined to be noun phrases.

「格納した」は、品詞情報２５４に示すように、動詞（Ｖ）である。「〜と・・・とを格納した」は、句情報２６２に示すように、動詞句判定部１５により、動詞句（ＶＰ）であると判定される。 “Stored” is a verb (V) as shown in the part of speech information 254. “Stored with ˜ and...” Is determined as a verb phrase (VP) by the verb phrase determination unit 15 as shown in the phrase information 262.

さらに、句情報２６４に示すように、「〜と・・・とを格納した共起記憶部」は、名詞句であると判定される。同様に、句情報２６６にしめすように、文字列２５２「〜と・・・とを格納した共起記憶部に」は、名詞句であると判定される。 Furthermore, as shown in the phrase information 264, the “co-occurrence storage unit storing“ ˜ ”and“... ”Is determined to be a noun phrase. Similarly, as shown in the phrase information 266, the character string 252 “in the co-occurrence storage unit storing ˜ and...” Is determined to be a noun phrase.

図１８は、特定の文字列を含む名詞句の大きさの判定を示す図である。句判定例２８０は、「部」を含む名詞句の大きさの判定例を示している。「部」は、文字列２５２「〜と・・・とを格納した共起記憶部に」の「主辞」である。 FIG. 18 is a diagram illustrating determination of the size of a noun phrase including a specific character string. Phrase determination example 280 shows a determination example of the size of a noun phrase including “part”. The “part” is the “main word” of the character string 252 “in the co-occurrence storage unit storing“ ˜ and.

図１８に示すように、「部」が含まれる名詞句は、「部」を含む文字列の句情報を図１８の下方から上方に向かって矢印２８２のように参照し、句情報に「ＮＰ」が現われたところに対応する文字列となる。矢印２８２によれば、「部」を含む最小の文字列は、最小名詞句２７０（「共起記憶部」）である。２番目に小さい名詞句は、同様に図１８の上方に向かって矢印２８４のように句情報を参照する。これにより、２番目に小さい名詞句は名詞句２７２（「〜と・・・とを格納した共起記憶部」）である。 As shown in FIG. 18, a noun phrase including “part” refers to phrase information of a character string including “part” from the bottom to the top of FIG. "Is a character string corresponding to the place where" According to the arrow 282, the minimum character string including “part” is the minimum noun phrase 270 (“co-occurrence storage unit”). Similarly, the second smallest noun phrase refers to phrase information as shown by an arrow 284 in the upward direction in FIG. Thus, the second smallest noun phrase is the noun phrase 272 (“co-occurrence storage unit storing“ ˜ ”and“... ”).

図１９は、ノードを名詞句に拡張した場合の表示例を示す図である。図１９に示すように、表示例２９０は、表示例２９２、グラフ２９６を含んでいる。表示例２９２は、入力文８０において、ユーザにより選択された選択文字列２９４が、下線により強調表示されている例である。 FIG. 19 is a diagram illustrating a display example when a node is expanded to a noun phrase. As illustrated in FIG. 19, the display example 290 includes a display example 292 and a graph 296. The display example 292 is an example in which the selected character string 294 selected by the user in the input sentence 80 is highlighted with an underline.

グラフ２９６は、選択された選択文字列２９４に対応した解析結果のグラフ表現を示す例である。例えば図１４のグラフ１３０において、縮退ノード１３２で表されていた「部」に代えて、「部」を含む最小の名詞句に対応した簡略ノード２９８（「共起記憶部」）が示されている。簡略ノードとは、拡張された句に対応して表示されるノードをいう。 A graph 296 is an example showing a graph expression of the analysis result corresponding to the selected character string 294 selected. For example, in the graph 130 of FIG. 14, a simplified node 298 (“co-occurrence storage unit”) corresponding to the minimum noun phrase including “part” is shown instead of “part” represented by the degenerate node 132. Yes. A simplified node refers to a node that is displayed corresponding to an expanded phrase.

図２０、図２１は、入力文８０において、ユーザが特定の文字列を選択した場合の別の表示例を示す図である。図２０に示すように、表示例３１０において、文字列「頻度及び校正」という部分がユーザにより選択されているとする。選択された選択文字列３１２は、例えばハイライト表示される。 20 and 21 are diagrams showing another display example when the user selects a specific character string in the input sentence 80. As illustrated in FIG. 20, in the display example 310, it is assumed that the character string “frequency and proofread” is selected by the user. The selected selected character string 312 is highlighted, for example.

ところで、この文字列「頻度及び校正」をグラフ表現にすることを考えてみる。「頻度及び校正」は、入力文８０において直接繋がっているため、表示例３１０のように選択することができる。しかし、グラフ表現においては、「頻度」と「校正」は直接つながっていない。しかも、「頻度」は名詞句「出現頻度」の中の単語であり、「校正」は名詞句「校正後文書」の中の単語である。そこで、このような場合、選択された部分を自動的に拡張する。 By the way, let us consider making this character string “frequency and proofread” into a graph expression. Since “frequency and proofreading” are directly connected in the input sentence 80, they can be selected as in the display example 310. However, in the graph representation, “frequency” and “calibration” are not directly connected. In addition, “frequency” is a word in the noun phrase “appearance frequency”, and “proofreading” is a word in the noun phrase “document after proofreading”. In such a case, the selected portion is automatically expanded.

図２１は、解析結果の表示例を示す図である。図２１において、表示例３２０は、表示例３２２、およびグラフ３４０を含んでいる。表示例３２２は、入力文８０において、図２０においてユーザに選択された選択文字列３１２の読解支援を行なった場合に注目すべき文字列を強調表示により示した例である。グラフ３４０は、図１３のグラフ１１０をさらに展開したグラフ表現の一例を示す図である。 FIG. 21 is a diagram illustrating a display example of analysis results. In FIG. 21, a display example 320 includes a display example 322 and a graph 340. The display example 322 is an example in which, in the input sentence 80, a character string to be noticed when reading support for the selected character string 312 selected by the user in FIG. A graph 340 is a diagram illustrating an example of a graph expression in which the graph 110 in FIG. 13 is further developed.

グラフ３４０においては、グラフ１１０における縮退ノード１１６が展開されてノード１１７となり、アーク３５０及びそれに繋がるノード３４４、アーク３５２及びそれに繋がるノード３４８が表示されている。ノード３４４の内容は「校正後文書」であり、ノード３４８の内容は、「各要素」である。アーク３５０の内容は「範囲」であり、アーク３５２の内容は「関連」である。このグラフ３４０において、拡張対象３５４、３５６、３５８が設定されている。 In the graph 340, the degenerate node 116 in the graph 110 is expanded to become a node 117, and an arc 350, a node 344 connected thereto, an arc 352, and a node 348 connected thereto are displayed. The content of the node 344 is “document after proofreading”, and the content of the node 348 is “each element”. The content of the arc 350 is “range”, and the content of the arc 352 is “related”. In this graph 340, expansion targets 354, 356, and 358 are set.

図２２、図２３は、ノードの拡張の一例を示す図である。図２２は、例えば、図２１に示した拡張対象３５６におけるノードの拡張について示している。図２２に示すように、拡張対象３５６において、ノード１１２の「出現頻度」と、縮退ノード１１６の「共起関係」とが別々に表示されているが、これらをまとめて一つの簡略ノード３７２とする。簡略ノード３７２の内容は、「共起関係の出現頻度」となる。 22 and 23 are diagrams illustrating an example of node expansion. FIG. 22 illustrates, for example, node expansion in the expansion target 356 illustrated in FIG. As shown in FIG. 22, in the expansion target 356, the “appearance frequency” of the node 112 and the “co-occurrence relationship” of the degenerate node 116 are displayed separately. To do. The content of the simplified node 372 is “appearance frequency of co-occurrence relationship”.

図２３は、拡張対象３５８におけるノードの拡張について示している。図２３に示すように、拡張対象３５８において、ノード３４４の「校正後文書」とノード３４８の「各要素」とが別々に表示されているが、これらをまとめて一つの簡略ノード３７７とする。簡略ノード３７７の内容は、「校正後文書における各要素の」となる。 FIG. 23 illustrates node expansion in the expansion target 358. As shown in FIG. 23, in the expansion target 358, the “post-proof document” of the node 344 and the “each element” of the node 348 are separately displayed, and these are collectively referred to as one simple node 377. The contents of the simplified node 377 are “each element in the proofread document”.

図２４は、図２１におけるグラフ表現のグラフ３４０を、ノードの拡張により簡略化した例を示す図である。図２４に示すように、表示例３８０は、表示例３８２及びグラフ３９０を有している。表示例３８２は、入力文８０において、ユーザが選択した「頻度及び校正」の読解支援となる部分を強調表示した例である。表示例３８２において、強調表示３８４、３２８、３８６、３８８が示されている。 FIG. 24 is a diagram illustrating an example in which the graph representation graph 340 in FIG. 21 is simplified by node expansion. As illustrated in FIG. 24, the display example 380 includes a display example 382 and a graph 390. The display example 382 is an example in which, in the input sentence 80, a portion that assists reading comprehension of “frequency and proofreading” selected by the user is highlighted. In the display example 382, highlighted displays 384, 328, 386, and 388 are shown.

グラフ３９０においては、拡張ノード３７２、３７９、３９２が表示されている。これらの表示により、名詞句がまとめて表示され、表示場所も狭くてよく、簡易な表示となっている。 In the graph 390, extended nodes 372, 379, and 392 are displayed. By these displays, noun phrases are displayed together, the display location may be narrow, and the display is simple.

図２５は、グラフ表現の一例を示す図である。図２５に示すように、表示例４９０は、文字列４９２及びグラフ５００を含んでいる。文字列４９２は、「校正後文書における各要素の共起関係の出現頻度」であり、グラフ５００は、文字列４９２の解析結果をグラフ表現で示したものである。グラフ５００においては、図２３のノード３４４が、ノード５０２（「要素」）、ノード５０４（「各」）に展開され、ノード３４８が、ノード４９４（「文書」）、ノード４９６（「後」）、ノード４９８（「校正」）に展開されている。 FIG. 25 is a diagram illustrating an example of a graph expression. As shown in FIG. 25, the display example 490 includes a character string 492 and a graph 500. The character string 492 is “appearance frequency of the co-occurrence relationship of each element in the proofread document”, and the graph 500 shows the analysis result of the character string 492 in a graph expression. In the graph 500, the node 344 in FIG. 23 is expanded into a node 502 (“element”) and a node 504 (“each”), and a node 348 includes a node 494 (“document”) and a node 496 (“after”). , Node 498 (“calibration”).

次に、図２６から図２９を参照しながら、第１の実施の形態による言語処理装置１の動作について説明する。図２６は、言語処理装置の動作を示すメインフローチャート、図２７は、ノード展開及び縮退処理を示すフローチャート、図２８は、ノード拡張処理を示すフローチャート、図２９は、ノード拡張判定を示すフローチャートである。なお、言語処理装置１の動作については、図１を参照しながら説明した各機能が処理を行うものとして説明する。 Next, the operation of the language processing apparatus 1 according to the first embodiment will be described with reference to FIGS. 26 is a main flowchart showing the operation of the language processing apparatus, FIG. 27 is a flowchart showing node expansion and reduction processing, FIG. 28 is a flowchart showing node extension processing, and FIG. 29 is a flowchart showing node extension determination. . The operation of the language processing apparatus 1 will be described on the assumption that each function described with reference to FIG. 1 performs processing.

はじめに、図２６を参照しながら、言語処理装置１の処理の全体の流れを説明する。図２６に示すように、解析部５は、例えば、ユーザにより入力部３を介して入力文が入力されたことを検知する。または、解析部５は、例えば解析結果記憶部２５を含む記憶装置等から、解析対象の入力文を読み込む。解析部５は、入力された入力文を解析する（Ｓ４０１）。このとき、形態素解析部７は、形態素解析を行い、例えば形態素解析情報１５０を生成する（Ｓ４１１）。構文解析部９は、形態素解析情報１５０に基づき構文解析を行い、構文解析情報１８０を生成し（Ｓ４１２）、解析結果記憶部２５に記憶する（Ｓ４１３）。このとき、解析部５は、形態素解析情報１５０を解析結果記憶部２５に記憶するようにしてもよい。 First, the overall flow of processing of the language processing device 1 will be described with reference to FIG. As illustrated in FIG. 26, the analysis unit 5 detects that an input sentence has been input via the input unit 3 by the user, for example. Alternatively, the analysis unit 5 reads an input sentence to be analyzed from, for example, a storage device including the analysis result storage unit 25. The analysis unit 5 analyzes the input sentence that has been input (S401). At this time, the morpheme analysis unit 7 performs morpheme analysis and generates, for example, morpheme analysis information 150 (S411). The syntax analysis unit 9 performs syntax analysis based on the morphological analysis information 150, generates syntax analysis information 180 (S412), and stores it in the analysis result storage unit 25 (S413). At this time, the analysis unit 5 may store the morphological analysis information 150 in the analysis result storage unit 25.

意味解析部１１は、形態素解析情報１５０及び構文解析情報１８０に基づき意味解析を行い、意味解析情報２００を生成し（Ｓ４１４）、解析結果記憶部２５に記憶する（Ｓ４１５）。 The semantic analysis unit 11 performs semantic analysis based on the morphological analysis information 150 and the syntax analysis information 180, generates semantic analysis information 200 (S414), and stores it in the analysis result storage unit 25 (S415).

対応計算部２１は、意味解析情報２００に基づきグラフ表現のための情報を生成するとともに、ノードと文字列との対応を計算する。表示制御部２３は、対応計算部２１での計算結果に基づき、表示部２７に解析結果の初期表示を行わせる（Ｓ４０２）。初期表示は、例えば、図１１の表示例８２、図１２の表示例９０などであり、一部がハイライト表示された入力文や、グラフ表現の表示である。ノードと文字列との対応は、例えば図１６を参照しながら説明したように、形態素解析部７等の解析結果に基づき計算される。 The correspondence calculation unit 21 generates information for graph expression based on the semantic analysis information 200 and calculates the correspondence between the node and the character string. The display control unit 23 causes the display unit 27 to perform initial display of the analysis result based on the calculation result in the correspondence calculation unit 21 (S402). The initial display is, for example, the display example 82 in FIG. 11, the display example 90 in FIG. 12, and the like, and is an input sentence partially highlighted or a graph representation display. The correspondence between the node and the character string is calculated based on the analysis result of the morpheme analysis unit 7 or the like as described with reference to FIG.

対応計算部２１は、入力部３を介してユーザが処理の終了を選択したことが検知されたか否かを判別し（Ｓ４０３）、選択したと検知されると（Ｓ４０３：ＹＥＳ）、処理を終了する。 The correspondence calculation unit 21 determines whether it is detected that the user has selected the end of the process via the input unit 3 (S403), and when it is detected that the user has selected (S403: YES), the process ends. To do.

終了を選択したと検知されない場合には（Ｓ４０３：ＮＯ）、対応計算部２１は、例えば、表示例８２、表示例９０などの入力文の表示において、文字列またはノードが選択されるまで判別を繰り返す（Ｓ４０４：ＮＯ）。このとき、選択されたノードをハイライト表示など、強調表示を行うことにより、選択されたことを示すようにしてもよい。対応計算部２１が、文字列またはノードが検知されたことを判別すると（Ｓ４０４：ＹＥＳ）、ノード拡張部１７は、図２７に示すノードの展開及び縮退処理を行う（Ｓ４０５）。 When it is not detected that the end is selected (S403: NO), the correspondence calculation unit 21 determines whether a character string or a node is selected in the display of the input sentence such as the display example 82 and the display example 90, for example. Repeat (S404: NO). At this time, the selected node may be highlighted so as to indicate that it has been selected. When the correspondence calculation unit 21 determines that a character string or a node is detected (S404: YES), the node expansion unit 17 performs node expansion and contraction processing shown in FIG. 27 (S405).

このとき例えば、文字列が選択された場合には、対応計算部２１は、文字列とノードとの対応を計算することにより、選択されたノードを特定する。例えば、図２０の例のように文字列「頻度及び校正」が選択されると、対応計算部２１が、「頻度」が縮退ノード９６の「出現頻度」に対応し、「校正」が、ノード３４４の「校正後文書」に対応することを特定する。 At this time, for example, when a character string is selected, the correspondence calculation unit 21 specifies the selected node by calculating the correspondence between the character string and the node. For example, when the character string “frequency and proof” is selected as in the example of FIG. 20, the correspondence calculation unit 21 corresponds to “frequency of appearance” of the degenerate node 96, and “proof” is the node It is specified that it corresponds to “Post-proofreading document” 344.

次に、図２７を参照しながら、ノード展開縮退処理について説明する。図２７に示すように、ノード拡張部１７は、選択されたノードから直接繋がるノードの集合を取得し、それぞれにラベルを付す（Ｓ４２１）。ラベルは、例えば、拡張対象ノードＮ１〜Ｎｎ（ｎは、１以上の整数）とすることができる。ノードの拡張を行なう場合、ノード拡張部１７は、Ｓ４２１で取得された全ての拡張対象ノードＮ１〜Ｎｎについて、図２８に示すノード拡張処理を行う（Ｓ４２２）。 Next, the node expansion / reduction process will be described with reference to FIG. As shown in FIG. 27, the node expansion unit 17 acquires a set of nodes directly connected from the selected node, and attaches a label to each (S421). The labels can be, for example, expansion target nodes N1 to Nn (n is an integer of 1 or more). When expanding the node, the node expansion unit 17 performs the node expansion process shown in FIG. 28 for all the expansion target nodes N1 to Nn acquired in S421 (S422).

以下、図２６のＳ４０４において、図１２の表示例９０における縮退ノード９４の「基づく」が選択された場合を例にして説明する。図１２の例では、選択された縮退ノード９４に縮退されている全てのノードを取得する。例えば、図１５に示すように、縮退ノード９４には、ノード１３３、２２１、２２２、２２３が縮退されている。しかし、直接アークで繋がるノードはノード１３３のみである。よって、ノード１３３を拡張対象ノードＮ１とする。すなわち、取得するノードの集合は、ノード１３３（「部」）である。 Hereinafter, a case where “based” of the degenerate node 94 in the display example 90 of FIG. 12 is selected in S404 of FIG. 26 will be described as an example. In the example of FIG. 12, all nodes that have been reduced to the selected reduced node 94 are acquired. For example, as shown in FIG. 15, nodes 133, 221, 222, and 223 are degenerated in the degeneration node 94. However, only the node 133 is directly connected by an arc. Therefore, the node 133 is set as the expansion target node N1. That is, the set of nodes to be acquired is the node 133 (“part”).

図２７に戻って、句表示最適化部１９は、ノード拡張処理で拡張された名詞句、動詞句をひとまとめにする処理を行う（Ｓ４２３）。すなわち、図１８を参照しながら説明したように、拡張対象ノードＮ１＝ノード１３３（「部」）に対して、名詞句判定部１３により最小の名詞句「共起記憶部」が抽出された場合、句表示最適化部１９は、「共起記憶部」を一つのノードとする処理を行う。このとき、句表示最適化部１９は、ノード拡張部１７による拡張処理に基づき、ノードをどこまで展開するかを決定する。表示制御部２３は、句表示最適化部１９、対応計算部２１の処理結果に基づき拡張された「共起記憶部」のノード等を表示し、例えばハイライト表示など選択されたことを示す表示とする（Ｓ４２４）。 Returning to FIG. 27, the phrase display optimizing unit 19 performs a process of collecting noun phrases and verb phrases expanded by the node expansion process (S423). That is, as described with reference to FIG. 18, when the noun phrase determining unit 13 extracts the minimum noun phrase “co-occurrence storage unit” for the expansion target node N1 = node 133 (“part”). The phrase display optimization unit 19 performs a process in which the “co-occurrence storage unit” is one node. At this time, the phrase display optimizing unit 19 determines how far the node is expanded based on the expansion processing by the node expansion unit 17. The display control unit 23 displays the nodes of the “co-occurrence storage unit” expanded based on the processing results of the phrase display optimization unit 19 and the correspondence calculation unit 21, for example, a display indicating that a highlight display or the like has been selected. (S424).

続いて、図２８を参照しながら、ノード拡張処理について説明する。図２８に示すように、ノード拡張部１７は、ノード拡張処理において、処理対象ノードＮを設定する（Ｓ４３１）。上述のように、処理対象ノードＮは、拡張対象ノードＮ１＝ノード１３３（「部」）である。 Next, the node expansion process will be described with reference to FIG. As illustrated in FIG. 28, the node expansion unit 17 sets a processing target node N in the node expansion processing (S431). As described above, the processing target node N is the expansion target node N1 = node 133 (“part”).

名詞句判定部１３、動詞句判定部１５は、処理対象ノードＮが名詞句または動詞句に含まれるか否か判別する（Ｓ４３２）。対象処理ノードＮが、名詞句または動詞句に含まれる場合には（Ｓ４３２：ＹＥＳ）、名詞句判定部１３または動詞句判定部１５は、処理対象ノードＮを含む名詞句または動詞句の主辞のノード（以下、主辞ノードＮＡという）を、自動選択する（Ｓ４３３）。対象処理ノードＮが、名詞句または動詞句に含まれない場合には（Ｓ４３２：ＮＯ）、ノード拡張部１７は、図２７のＳ４２３に処理を進める。 The noun phrase determination unit 13 and the verb phrase determination unit 15 determine whether or not the processing target node N is included in the noun phrase or the verb phrase (S432). When the target processing node N is included in the noun phrase or verb phrase (S432: YES), the noun phrase determination unit 13 or the verb phrase determination unit 15 determines the main word of the noun phrase or verb phrase including the processing target node N. A node (hereinafter referred to as the main node NA) is automatically selected (S433). When the target processing node N is not included in the noun phrase or the verb phrase (S432: NO), the node expansion unit 17 advances the process to S423 in FIG.

例えば、図１８を参照しながら説明したように、「部」は、「共起記憶部」という名詞句に含まれることが判別される。この判別は、例えば、構文解析情報１８０を参照することにより取得される。ノード拡張部１７は、「共起記憶部」という名詞句の主辞ノードＮＡ＝「部」を選択する。 For example, as described with reference to FIG. 18, it is determined that “part” is included in the noun phrase “co-occurrence storage unit”. This determination is acquired by referring to the syntax analysis information 180, for example. The node extension unit 17 selects the main node NA = “part” of the noun phrase “co-occurrence storage unit”.

次に、ノード拡張部１７は、選択された主辞ノードＮＡから出るアークを列挙し、それぞれ識別番号（ＩＤ）＝１、２、・・・、ｋ（ｋは、１以上の整数）を割り振るとともに、カウンタ値ａｃ＝０とする（Ｓ４３４）。 Next, the node expansion unit 17 enumerates arcs output from the selected main word node NA and assigns identification numbers (ID) = 1, 2,..., K (k is an integer of 1 or more), respectively. The counter value ac = 0 is set (S434).

図１５に示すように、主辞ノードＮＡ＝「部」から出るアークは、アーク２２５（「動作主」）およびアーク２２６（「修飾」）である。よって、例えば、アーク２２５（「動作主」）にＩＤ＝１、アーク２２６（「修飾」）にＩＤ＝２と割り振られる。 As shown in FIG. 15, arcs exiting from the main node NA = “part” are an arc 225 (“operation main”) and an arc 226 (“modification”). Therefore, for example, ID = 1 is assigned to the arc 225 (“operator”), and ID = 2 is assigned to the arc 226 (“modification”).

図２８に戻って、ノード拡張部１７は、処理対象ノードを、処理対象ノードＮ及び主辞ノードＮＡを含む最小の名詞句に拡張する（Ｓ４３５）。例えば、処理対象ノードＮ＝「部」の場合には、最小の名詞句は、「共起記憶部」となる。 Returning to FIG. 28, the node expansion unit 17 expands the processing target node to the minimum noun phrase including the processing target node N and the main word node NA (S435). For example, when the processing target node N = “part”, the minimum noun phrase is “co-occurrence storage unit”.

つづいて、ノード拡張部１７は、図２９に示すノード拡張判定を行う（Ｓ４３６）。この判定の結果、さらに拡張を行うと判定される場合には（Ｓ４３７：ＹＥＳ）、図２９の処理において出力されたアークに繋がるノードについて、さらに、図２７に示したノード展開縮退処理を行う（Ｓ４３８）。Ｓ４３７で、さらに拡張を行わないと判定された場合には（Ｓ４３７：ＮＯ）、ノード拡張部１７は、処理を図２７のＳ４２３に進める。 Subsequently, the node expansion unit 17 performs the node expansion determination illustrated in FIG. 29 (S436). As a result of this determination, if it is determined that further expansion is to be performed (S437: YES), the node expansion / reduction process shown in FIG. 27 is further performed on the node connected to the arc output in the process of FIG. S438). If it is determined in S437 that no further expansion is to be performed (S437: NO), the node expansion unit 17 advances the processing to S423 in FIG.

さらに、図２９を参照しながら、ノード拡張判定処理について説明する。図２９に示すように、ノード拡張部１７は、Ｓ４３４で列挙した全てのアークについて処理が終了したか否かを判別する（Ｓ４４１）。ノード拡張部１７は、処理が終了した場合には（Ｓ４４１：ＹＥＳ）、図２８のＳ４３７の判定に対し「ＮＯ」を返す（Ｓ４４７）。 Further, the node extension determination process will be described with reference to FIG. As illustrated in FIG. 29, the node expansion unit 17 determines whether or not the processing has been completed for all the arcs listed in S434 (S441). When the processing is completed (S441: YES), the node expansion unit 17 returns “NO” to the determination of S437 in FIG. 28 (S447).

処理が終了していない場合には（Ｓ４４１：ＮＯ）、ノード拡張部１７は、カウンタ値ａｃ＝ａｃ＋１とし（Ｓ４４２）、ＩＤ＝ａｃのアークが処理対象アークであるか否か判別する（Ｓ４４３）。ここで、処理対象アークとは、図２８のノード拡張処理において判定された最小の名詞句に含まれないアークである。 If the processing has not ended (S441: NO), the node expansion unit 17 sets the counter value ac = ac + 1 (S442), and determines whether or not the arc with ID = ac is the processing target arc (S443). . Here, the processing target arc is an arc that is not included in the minimum noun phrase determined in the node expansion processing of FIG.

例えば、最小の名詞句が「共起記憶部」である場合、ＩＤ＝１のアークであるアーク２２５（「動作主」）は、「共起記憶部」に含まれないので、処理対象アークである。ＩＤ＝２のアークは、アーク２２６（「修飾」）であるが、これは、「部」と「記憶」とを繋ぐものであり、「共起記憶部」に含まれている。よって、処理対象アークとはならない。 For example, when the minimum noun phrase is “co-occurrence storage unit”, arc 225 (“actor”) that is an arc with ID = 1 is not included in “co-occurrence storage unit”. is there. The arc with ID = 2 is the arc 226 (“modification”), which connects “part” and “memory”, and is included in the “co-occurrence storage unit”. Therefore, it is not a processing target arc.

ＩＤ＝ａｃのアークが処理対象アークでない場合（Ｓ４４３：ＮＯ）、ノード拡張部１７は、図２８のＳ４３７の判定に対し「ＹＥＳ」を返す（Ｓ４４６）。処理対象アークである場合（Ｓ４４３：ＹＥＳ）、ノード拡張部１７は、対象のアーク（ＩＤ＝ａｃのアーク）の種類が「拡張」であるか否か判別する（Ｓ４４４）。対象のアークの種類が、「拡張」でない場合には（Ｓ４４４：ＮＯ）、ノード拡張部１７は、処理をＳ４４６に進める。対象のアークの種類が「拡張」である場合には（Ｓ４４４：ＹＥＳ）、ノード拡張部１７は、対象のアークを出力し、Ｓ４４６に進み、図２８のＳ４３７の処理に対し「ＹＥＳ」を返す。 When the arc with ID = ac is not the processing target arc (S443: NO), the node expansion unit 17 returns “YES” in response to the determination of S437 of FIG. 28 (S446). When the arc is a processing target arc (S443: YES), the node expansion unit 17 determines whether or not the type of the target arc (ID = ac) is “extended” (S444). When the type of the target arc is not “expansion” (S444: NO), the node expansion unit 17 advances the process to S446. When the type of the target arc is “expansion” (S444: YES), the node expansion unit 17 outputs the target arc, proceeds to S446, and returns “YES” to the process of S437 in FIG. .

例えば、対象のアークが、図２８においてＩＤ＝１とされたアーク２２５（「動作主」）である場合、拡張情報２３０を参照すると、「動作主」は、「非拡張」のアークであるので、ノード拡張部１７は、そのまま処理をＳ４４６に進める。図２８のＳ４３７の処理において、「ＹＥＳ」が取得されたとき、すなわち、対象のアークの種類が「拡張」である場合には、ノード拡張部１７は、対象のアークを出力する（Ｓ４４５）。ノード拡張部１７は、出力された対象のアークに直接繋がるノードについて、再び、図２７に示したノード展開縮退処理を行う（Ｓ４３８）。このようにして、全てのＩＤのアークについて処理が終了すると、ノード拡張部１７は、図２６のＳ４０３に処理を戻す。以上の処理により、例えば図１９に示したように、縮退ノード１３２（「部」）に代えて、簡略ノード２９８（「共起記憶部」）が表示され、例えばハイライト表示される。 For example, when the target arc is the arc 225 (“operator”) whose ID = 1 in FIG. 28, referring to the extended information 230, the “operator” is an “non-extended” arc. The node extension unit 17 proceeds with the process to S446 as it is. In the process of S437 of FIG. 28, when “YES” is acquired, that is, when the type of the target arc is “expansion”, the node expansion unit 17 outputs the target arc (S445). The node expansion unit 17 performs the node expansion / reduction process shown in FIG. 27 again for the node directly connected to the output target arc (S438). In this way, when the process is completed for all the arcs of the IDs, the node expansion unit 17 returns the process to S403 in FIG. Through the above processing, for example, as shown in FIG. 19, instead of the degenerate node 132 (“part”), the simplified node 298 (“co-occurrence storage unit”) is displayed, for example, highlighted.

同様の処理を、図２０に示した例で説明する。図２０の例では、入力文８０が解析され（Ｓ４０１、Ｓ４０２）、表示例３１０のように、選択文字列３１２＝「頻度及び校正」が選択されていることを示している（Ｓ４０３：ＮＯ、Ｓ４０４：ＹＥＳ）。すなわち、入力部３を介して、選択文字列３１２が選択されると、対応計算部２１が、選択された文字列とノードとの関係を計算することにより、選択された文字列からノードを特定するとともに、表示制御部２３により表示部２７に表示を行わせる。 Similar processing will be described with reference to the example shown in FIG. In the example of FIG. 20, the input sentence 80 is analyzed (S401, S402), and the selected character string 312 = “frequency and proofreading” is selected as shown in the display example 310 (S403: NO, S404: YES). That is, when the selected character string 312 is selected via the input unit 3, the correspondence calculating unit 21 calculates the relationship between the selected character string and the node, thereby specifying the node from the selected character string. At the same time, the display control unit 23 causes the display unit 27 to display.

上述したように、「頻度」は名詞句「出現頻度」の中の単語である。また、「及び」に対応するノードは存在しない。そこで、このような場合、図２６のＳ４０４において選択されるノードは、縮退ノード９６（「出現頻度」）（図２１参照）と、ノード４９８（「校正」）（図２５参照）ということになる。 As described above, “frequency” is a word in the noun phrase “appearance frequency”. Also, there is no node corresponding to “and”. Therefore, in such a case, the nodes selected in S404 in FIG. 26 are the degenerate node 96 (“appearance frequency”) (see FIG. 21) and the node 498 (“calibration”) (see FIG. 25). .

続いて、ノード拡張部１７は、縮退ノード９６（「出現頻度」）と、ノード４９８（「校正」）に直接つながるノードの集合を取得する（Ｓ４２１）。まず縮退ノード９６（「出現頻度」）について説明する。 Subsequently, the node expansion unit 17 acquires a set of nodes directly connected to the degenerate node 96 (“appearance frequency”) and the node 498 (“proofreading”) (S421). First, the degenerate node 96 (“appearance frequency”) will be described.

図２１に示すように、縮退ノード９６から直接繋がるノードは、縮退ノード１１４（「共起関係」）とノード１１２（「出現頻度」）とになる。それらを拡張対象ノードＮ１、Ｎ２として、図２８のノード拡張処理を行う（Ｓ４２２）。 As shown in FIG. 21, nodes directly connected from the degenerate node 96 are a degenerate node 114 (“co-occurrence relationship”) and a node 112 (“appearance frequency”). The node expansion process of FIG. 28 is performed with these nodes as the expansion target nodes N1 and N2 (S422).

はじめに、ノード拡張部１７は、処理対象のノード＝拡張対象ノードＮ１＝縮退ノード１１４（「共起関係」）とする（Ｓ４３１）。「共起関係」は、名詞句判定部１３により、名詞句「共起関係の出現頻度」に含まれると判断される（Ｓ４３２：ＹＥＳ）。この名詞句の主辞は、ノード９７（「出現頻度」）であると判定される（Ｓ４３３）。このとき、主辞ノードから直接出るアークは、アーク１２０（「修飾」）、及びアーク１１８（「列挙」）である（Ｓ４３４）。よって、これらのアークにＩＤが割り振られる。縮退ノード１１４は、この主辞を含む最小の名詞句「共起関係の出現頻度」に拡張される（Ｓ４３５）。 First, the node expansion unit 17 sets processing target node = expansion target node N1 = degenerated node 114 (“co-occurrence relationship”) (S431). The “co-occurrence relationship” is determined to be included in the noun phrase “appearance frequency of the co-occurrence relationship” by the noun phrase determination unit 13 (S432: YES). The main word of this noun phrase is determined to be the node 97 (“appearance frequency”) (S433). At this time, the arcs directly coming out from the main node are the arc 120 (“modification”) and the arc 118 (“enumeration”) (S434). Therefore, IDs are assigned to these arcs. The degenerate node 114 is expanded to the minimum noun phrase “occurrence frequency of co-occurrence relationship” including this main word (S435).

図２９の処理については、アーク１２０（例えば、ＩＤ＝１）に繋がる「共起関係」は、拡張された名詞句「共起関係の出現頻度」の中に入っているため、拡張の処理対象とはならない（Ｓ４４１〜Ｓ４４３：ＮＯ）。よって、アーク１２０に関する拡張は、図２４に示す簡略ノード３９２（「共起関係の出現頻度」）で完了する。 29, since the “co-occurrence relationship” connected to the arc 120 (for example, ID = 1) is included in the expanded noun phrase “occurrence frequency of the co-occurrence relationship”, the processing target of the expansion (S441 to S443: NO). Therefore, the expansion related to the arc 120 is completed at the simplified node 392 (“occurrence frequency of co-occurrence relationship”) shown in FIG.

次に、アーク１１８（「列挙」）（例えば、ＩＤ＝２）について図２９の処理が行われる。「列挙」は、図１０によれば、「拡張」のアークである（Ｓ４４４：ＹＥＳ）。よって、ノード拡張部１７は、アーク「列挙」を出力し（Ｓ４４５）、図２８のＳ４３７処理に「ＹＥＳ」を返す（Ｓ４４６）。ノード拡張部１７は、アーク１１８に繋がるノード１１２（「出現頻度」）について、図２７の処理を行う（Ｓ４３８）。 Next, the process of FIG. 29 is performed for the arc 118 (“enumeration”) (for example, ID = 2). According to FIG. 10, “enumeration” is an arc of “expansion” (S444: YES). Therefore, the node expansion unit 17 outputs the arc “enumeration” (S445), and returns “YES” to the S437 process of FIG. 28 (S446). The node expansion unit 17 performs the process of FIG. 27 for the node 112 (“appearance frequency”) connected to the arc 118 (S438).

ノード拡張部１７は、選択されたノードに直接繋がるノードとして、ノード１１２を取得する（Ｓ４２１）。ノード拡張部１７は、ノード１１２を処理対象のノードとして、図２８の処理を行う（Ｓ４２２、Ｓ４３１）。 The node expansion unit 17 acquires the node 112 as a node directly connected to the selected node (S421). The node expansion unit 17 performs the processing in FIG. 28 with the node 112 as a processing target node (S422, S431).

ノード１１２（「出現頻度」）は、名詞句「共起関係の出現頻度」に含まれる（Ｓ４３２：ＹＥＳ）。また、その主辞は、ノード１１２（「出現頻度」）である（Ｓ４３３）。ノード１１２から出るアーク１２２を、ＩＤ＝１とする（Ｓ４３４）。 The node 112 (“appearance frequency”) is included in the noun phrase “appearance frequency of co-occurrence relations” (S432: YES). The main word is the node 112 (“appearance frequency”) (S433). The arc 122 exiting from the node 112 is set to ID = 1 (S434).

図２９の処理において、ノード拡張部１７は、ノード１１２から出るアーク１２２を取得する（Ｓ４４１〜Ｓ４４３）。ノード拡張部１７は、アーク１２２は処理対象のアークでないと判別し（Ｓ４４３：ＮＯ）、ノード１１２に関する拡張処理は、図２４に示す簡略ノード３７２（「共起関係の出現頻度」）に確定して終了する。以上で、「頻度」に関する処理が完了したことになる。 In the process of FIG. 29, the node expansion unit 17 acquires the arc 122 that exits from the node 112 (S441 to S443). The node expansion unit 17 determines that the arc 122 is not an arc to be processed (S443: NO), and the expansion processing related to the node 112 is confirmed to the simplified node 372 (“appearance frequency of co-occurrence relationship”) illustrated in FIG. To finish. This completes the processing related to “frequency”.

次に、選択されたノード＝ノード４９８（「校正」）とする。ノード４９８から直接繋がるノードは、図２５に示すように、ノード４９６（「後」）である。よって、ノード拡張部１７は、ノード４９６を処理対象のノードとして、図２８の処理を行う（Ｓ４３１）。「後」は、名詞句「校正後文書」に含まれる（Ｓ４３２：ＹＥＳ）。このとき、主辞ノードＮＡ＝「文書」である（Ｓ４３３）。図２５に示すように、「文書」から出るアークは、アーク５０６（「時間」）である。また、ノード４９６（「後」）は、「校正後文書」に拡張される（Ｓ４３５）。 Next, the selected node = node 498 (“calibration”). A node directly connected from the node 498 is a node 496 (“after”) as shown in FIG. Therefore, the node expansion unit 17 performs the process of FIG. 28 using the node 496 as a processing target node (S431). “After” is included in the noun phrase “document after proofreading” (S432: YES). At this time, the main node NA = “document” (S433). As shown in FIG. 25, the arc from “document” is arc 506 (“time”). Further, the node 496 (“after”) is expanded to “post-proofreading document” (S435).

図２９の処理において、アーク５０６（「時間」）は、処理対象アークではない（Ｓ４４３：ＮＯ）。よって、ノード４９６（「後」）は、主辞を含む最小の名詞句「校正後文書」に拡張される。 In the process of FIG. 29, the arc 506 (“time”) is not a process target arc (S443: NO). Therefore, the node 496 (“after”) is expanded to the minimum noun phrase “post-proofread” including the main word.

なお上記の説明では、ノード４９８（「校正」）について「校正後文書」にノードを拡張する例を説明したが、例えば、図２１において、縮退ノード３３４（「校正後文書」）について、それ以上拡張されないことを判別する処理を行うこともできる。 In the above description, the node 498 (“proofreading”) is extended to the “post-proofreading document”. However, for example, in FIG. 21, the degenerate node 334 (“post-proofreading document”) is more than that. It is also possible to perform processing for determining that the data is not expanded.

さらに、図２５に示すように、「校正後文書」に繋がるノード１１７（「共起関係」）にはノード５０２（「要素」）もつながっている。ノードを展開する場合、上述のように、繋がる全てのノードを表示する例では、ノード５０２（「要素」）に関しても名詞句の拡張を行い、「各要素」を得、図２１の簡略ノード３４８のように表示する。 Further, as shown in FIG. 25, a node 502 (“element”) is also connected to the node 117 (“co-occurrence relationship”) connected to “the document after proofreading”. When expanding nodes, in the example in which all connected nodes are displayed as described above, noun phrases are expanded with respect to the node 502 (“element”) to obtain “each element”, and the simplified node 348 in FIG. Is displayed.

また、入力文８０では、「校正後文書における」「各要素の」は文字列としてもつながっている。よって、図２３のように、簡略ノード３７９（「校正後文書における各要素の」）のようにまとめて表示させることもできる。この結果、図２４のグラフ３９０のような表示結果を得る。 In the input sentence 80, “in the proofread document” and “for each element” are also connected as a character string. Therefore, as shown in FIG. 23, it can also be displayed collectively as a simple node 379 (“each element in the proofread document”). As a result, a display result such as a graph 390 in FIG. 24 is obtained.

このように、ユーザが「頻度および校正」を選択した場合において、本実施の形態においては、言語処理装置１が自動的に、例えば図２４の表示例３８２のように、「共起関係の出現頻度及び…共起関係の出現頻度」を選択する。また、図２４のグラフ３９０のように、該当する部分のグラフを例えば、簡略ノード３７２、３７９、３９２を用いて表示する。 As described above, when the user selects “frequency and proofreading”, in the present embodiment, the language processing apparatus 1 automatically displays “appearance of a co-occurrence relationship” as in the display example 382 of FIG. Select “Frequency and ... Frequency of occurrence of co-occurrence relationship”. In addition, as in a graph 390 of FIG. 24, a graph of a corresponding portion is displayed using, for example, simplified nodes 372, 379, and 392.

以上詳細に説明したように、第１の実施の形態による言語処理装置１においては、解析部５が、入力文を解析し、形態素解析情報１５０、構文解析情報１８０、意味解析情報２００を生成する。対応計算部２１は、解析結果に基づき、文字列とノードを関連付ける計算を行う。表示制御部２３は、表示部２７を制御して、例えばグラフ表現を行う。 As described above in detail, in the language processing apparatus 1 according to the first embodiment, the analysis unit 5 analyzes the input sentence and generates the morphological analysis information 150, the syntax analysis information 180, and the semantic analysis information 200. . The correspondence calculation unit 21 performs calculation for associating the character string with the node based on the analysis result. The display control unit 23 controls the display unit 27 to perform, for example, graph expression.

ユーザが、入力文において文字列、またはグラフにおいてノードを選択すると、言語処理装置１は、入力部３を介して選択を取得する。対応計算部２１は、選択された文字列に対応するノードを特定する。名詞句判定部１３、動詞句判定部１５は、特定されたノードに関して、名詞句または動詞句に含まれるか否かを判定する。ノードの内容が名詞句または動詞句に含まれている場合には、ノード拡張部１７は、その句の主辞から、その句に含まれないアークが出ているか否か、その句に含まれないアークは「拡張」の種類の「アーク」か否か、に基づき、さらなる拡張の有無を判定する。 When the user selects a character string in the input sentence or a node in the graph, the language processing device 1 acquires the selection via the input unit 3. The correspondence calculation unit 21 specifies a node corresponding to the selected character string. The noun phrase determination unit 13 and the verb phrase determination unit 15 determine whether the specified node is included in the noun phrase or the verb phrase. When the contents of the node are included in the noun phrase or verb phrase, the node extension unit 17 determines whether or not an arc not included in the phrase is generated from the main word of the phrase, and is not included in the phrase. Whether or not the arc is further expanded is determined based on whether or not the arc is the “arc” type.

以上説明したように、本実施の形態による言語処理装置１によれば、文の解析結果を表示する際に、単語を名詞句または動詞句に拡張して表示することができる。これにより、文字列が意味解析において意味をなすか否かを判断し、まとめて表示する単位を、文字列の読解支援という観点で意味をなす最小単位に拡張することができる。 As described above, according to the language processing apparatus 1 according to the present embodiment, when displaying a sentence analysis result, a word can be expanded and displayed as a noun phrase or a verb phrase. This makes it possible to determine whether or not the character string makes sense in the semantic analysis, and to extend the unit to be displayed together to the minimum unit that makes sense in terms of assisting reading of the character string.

また、言語処理装置１では、拡張された名詞句または動詞句の主辞から出るアークが、その名詞句または動詞句に含まれるか否かに基づき、更なる拡張を行うか否かを判別することができる。また、言語処理装置１では、単語間の関係を示すアークの内容を「拡張」「非拡張」に種類分けした種類に応じて、更なる拡張を行うか否かを判別することができる。解析結果は、更なる拡張は行わないと判別された文字列の単位で表示することができる。よって、より読解支援に適した文字列を表示する単位とすることができる。 Further, in the language processing device 1, it is determined whether or not to perform further expansion based on whether or not an arc from the main noun phrase or verb phrase is included in the noun phrase or verb phrase. Can do. Further, the language processing apparatus 1 can determine whether or not to perform further expansion according to the type of the arc content indicating the relationship between words as “extended” and “non-extended”. The analysis result can be displayed in units of character strings determined not to be further expanded. Therefore, it can be a unit for displaying a character string more suitable for reading support.

このとき、例えばグラフ表現においては、句表示最適化部１９が拡張対象となった複数のノードをひとまとめにするなど、ノード及びアークの構造を適宜変更する。これにより、単語間、フレーズ間の係り受けに関する表示を、読解支援のためにより適した単位とすることができる。よって、表示される意味を容易に判断することが可能となり、より読解しやすい表示とすることができる。 At this time, in the graph expression, for example, the phrase display optimizing unit 19 appropriately changes the structure of the nodes and arcs such as grouping a plurality of nodes to be expanded. Thereby, the display regarding the dependency between words and phrases can be made into a unit more suitable for reading comprehension support. Therefore, the meaning to be displayed can be easily determined, and the display can be more easily read.

言語処理装置１では、文の解読のための品詞情報や類似文の表示、文の解析結果としての単語に基づくグラフ表現、グラフ表現における縮退表示のみでは対応が困難であった複雑な文の解析結果を、簡易に表示することができる。よって、大局的に捉えて表示することしかできなかった長文の読解支援をより適切に行なうことができる。また、複雑な文の構造を視覚的に捉えることができる。 In the language processing device 1, the display of part-of-speech information for deciphering a sentence and the display of a similar sentence, the graph expression based on a word as a sentence analysis result, and the analysis of a complex sentence that was difficult to cope with only the degenerate display in the graph expression The result can be easily displayed. Therefore, it is possible to more appropriately support reading comprehension of long sentences that could only be captured and displayed on a global basis. In addition, it is possible to visually grasp the structure of complex sentences.

また、展開する必要のない部分については、指示を入力することにより縮退させた表示が可能である。拡張の必要ない部分については、拡張させない表示も可能である。これらは、任意に組み合わせることもできるため、ユーザの必要性に合致した形態で表示が可能である。選択する範囲が意味解析の点から適切でない場合にも、選択された部分を自動的に拡張して、理解しやすい形式に表示が可能である。このような簡易な表示により、言語的な専門知識が乏しいユーザに対しても適切に読解支援を行うことが可能となる。 In addition, a portion that does not need to be expanded can be displayed in a reduced manner by inputting an instruction. For parts that do not need to be extended, it is also possible to display the parts without extension. Since these can be arbitrarily combined, they can be displayed in a form that matches the needs of the user. Even when the range to be selected is not appropriate from the viewpoint of semantic analysis, the selected portion can be automatically expanded and displayed in an easy-to-understand format. With such a simple display, it is possible to appropriately support reading comprehension even for users who lack linguistic expertise.

（変形例）
以下、表示方法、選択方法に関する変形例について説明する。本変形例において、言語処理装置１の構成及び動作について、第１の実施の形態と同様の部分については重複説明を省略する。図３０、３１は、表示方法の変形例を示す図である。図３０に示すように、入力文５２２の表示例としては、出力結果５２４、５２６でもよい。文５２２＝「太郎は学校に行く。」の解析結果は、出力結果５２４、５２６のように、グラフ表現でなく、２つのノードと、そのアークを並列表示するようにしてもよい。 (Modification)
Hereinafter, modified examples regarding the display method and the selection method will be described. In the present modification, the redundant description of the configuration and operation of the language processing device 1 is omitted for the same parts as those in the first embodiment. 30 and 31 are diagrams showing a modification of the display method. As shown in FIG. 30, output results 524 and 526 may be displayed as examples of display of the input sentence 522. The analysis result of the sentence 522 = “Taro goes to school” may be displayed in parallel with two nodes and their arcs instead of a graph expression, as in the output results 524 and 526.

図３１は、図３０と同様に、グラフ表現でなく、２つのノードとそのアークを並列表示する別の例を示している。図３１に示すように、入力文８０の表示例は、出力結果５３４、５３６でもよい。 FIG. 31 shows another example in which two nodes and their arcs are displayed in parallel, instead of a graph representation, as in FIG. As shown in FIG. 31, the display example of the input sentence 80 may be output results 534 and 536.

図３２、３３は、文字列選択時の表示例を示す図である。図３２に示すように、表示例５５０は、入力文５５２及び初期表示５５６、選択後表示５５８を示している。この例では、入力文５５２において選択された文字列５５４＝「基づいて」について、選択前の初期表示５５６と、選択後表示５５８が示されている。これら初期表示５５６と選択後表示５５８は、同時に表示してもよいし、文字列の選択の前後で表示を切り替えるようにしてもよい。 32 and 33 are diagrams showing display examples when a character string is selected. As shown in FIG. 32, the display example 550 shows an input sentence 552, an initial display 556, and a post-selection display 558. In this example, an initial display 556 before selection and a display 558 after selection are shown for the character string 554 = “based” selected in the input sentence 552. These initial display 556 and post-selection display 558 may be displayed simultaneously, or the display may be switched before and after the character string is selected.

図３３に示すように、表示例５８０は、入力文５５２及び初期表示５９２、選択後表示５９４を示している。この例では、入力文５５２において選択された文字列５５４＝「基づいて」について、文字列選択前の初期表示５９２と、選択後表示５９４が示されている。これら初期表示５９２と選択後表示５９４は、同時に表示してもよいし、文字列の選択の前後で表示を切り替えるようにしてもよい。 As shown in FIG. 33, the display example 580 shows an input sentence 552, an initial display 592, and a post-selection display 594. In this example, for the character string 554 = “based” selected in the input sentence 552, an initial display 592 before the character string selection and a display 594 after the selection are shown. The initial display 592 and the post-selection display 594 may be displayed simultaneously, or the display may be switched before and after the character string is selected.

以上のように、上記変形例によれば、第１の実施の形態による効果に加え、さらにユーザの必要性に合致した表示を行うことが可能となる。 As described above, according to the modified example, in addition to the effects of the first embodiment, it is possible to perform display that further meets the needs of the user.

なお、本発明は、以上に述べた実施の形態に限定されるものではなく、本発明の要旨を逸脱しない範囲内で種々の構成または実施形態を採ることができる。例えば、形態素解析情報１５０、構文解析情報１８０、意味解析情報２００、拡張情報２３０の構成は上記の例に限定されない。同様の内容を示すものであれば、適用が可能である。 The present invention is not limited to the embodiments described above, and various configurations or embodiments can be adopted without departing from the gist of the present invention. For example, the configurations of the morphological analysis information 150, the syntax analysis information 180, the semantic analysis information 200, and the extended information 230 are not limited to the above example. Anything that shows similar contents can be applied.

入力文における文字列の選択は、表示されている意味構造のノードを直接選択してもよい。そのとき、名詞句判定部１３、動詞句判定部１５は、ノードの品詞に応じて名詞句判定、動詞句判定を行う。これら句の判定は、構文解析情報１８０を使わずに、例えば、Ｎグラムモデルなど、統計処理や、従来の他の手法に基づいて行ってもよい。 The selection of the character string in the input sentence may directly select the displayed semantic structure node. At that time, the noun phrase determination unit 13 and the verb phrase determination unit 15 perform noun phrase determination and verb phrase determination according to the part of speech of the node. These phrases may be determined based on statistical processing such as an N-gram model or other conventional methods without using the parsing information 180.

上記実施の形態及び変形例においては、解析部５が入力文の解析を行う場合を例にして説明したが、既に解析された結果に基づいてノードの拡張処理を行い、表示を最適化する処理を行うようにしてもよい。このとき、解析部は、入力文の解析結果である、形態素解析情報１５０、構文解析情報１８０、意味解析情報２００等を、例えば外部記憶装置や送受信装置などを介して取得するようにしてもよい。また、例えば、解析結果記憶部２５に解析を行うための辞書を予め記憶しておくことが好ましい。 In the embodiment and the modification described above, the case where the analysis unit 5 analyzes the input sentence has been described as an example. However, the process of performing the node expansion process based on the already analyzed result and optimizing the display May be performed. At this time, the analysis unit may acquire the morphological analysis information 150, the syntax analysis information 180, the semantic analysis information 200, and the like, which are the analysis results of the input sentence, for example, via an external storage device or a transmission / reception device. . In addition, for example, it is preferable to previously store a dictionary for performing analysis in the analysis result storage unit 25.

上記実施の形態及び変形例において、名詞句判定部１３、動詞句判定部１５は、句判定部の一例であり、ノード拡張部１７は、拡張部の一例であり、句表示最適化部１９は、表示処理部の一例である。表示制御部２３、表示部２７は、表示部の一例である。解析結果記憶部２５は、記憶部の一例であり、入力部３は、選択取得部の一例である。 In the above-described embodiment and modification, the noun phrase determination unit 13 and the verb phrase determination unit 15 are examples of phrase determination units, the node expansion unit 17 is an example of expansion units, and the phrase display optimization unit 19 is This is an example of a display processing unit. The display control unit 23 and the display unit 27 are examples of the display unit. The analysis result storage unit 25 is an example of a storage unit, and the input unit 3 is an example of a selection acquisition unit.

ここで、上記第１の実施の形態及び変形例による言語処理方法の動作をコンピュータに行わせるために共通に適用されるコンピュータの例について説明する。図３４は、標準的なコンピュータのハードウエア構成の一例を示すブロック図である。図３４に示すように、コンピュータ６００は、ＣｅｎｔｒａｌＰｒｏｃｅｓｓｉｎｇＵｎｉｔ（ＣＰＵ）６０２、メモリ６０４、入力装置６０６、出力装置６０８、外部記憶装置６１２、媒体駆動装置６１４、ネットワーク接続装置等がバス６１０を介して接続されている。 Here, an example of a computer that is commonly applied to cause a computer to perform the operation of the language processing method according to the first embodiment and the modification will be described. FIG. 34 is a block diagram illustrating an example of a hardware configuration of a standard computer. As shown in FIG. 34, a computer 600 includes a central processing unit (CPU) 602, a memory 604, an input device 606, an output device 608, an external storage device 612, a medium drive device 614, a network connection device, and the like via a bus 610. It is connected.

ＣＰＵ６０２は、コンピュータ６００全体の動作を制御する演算処理装置である。メモリ６０４は、コンピュータ６００の動作を制御するプログラムを予め記憶したり、プログラムを実行する際に必要に応じて作業領域として使用したりするための記憶部である。メモリ６０４は、例えばＲａｎｄｏｍＡｃｃｅｓｓＭｅｍｏｒｙ（ＲＡＭ）、ＲｅａｄＯｎｌｙＭｅｍｏｒｙ（ＲＯＭ）等である。入力装置６０６は、コンピュータの使用者により操作されると、その操作内容に対応付けられている使用者からの各種情報の入力を取得し、取得した入力情報をＣＰＵ６０２に送付する装置であり、例えばキーボード装置、マウス装置などである。出力装置６０８は、コンピュータ６００による処理結果を出力する装置であり、表示装置などが含まれる。例えば表示装置は、ＣＰＵ６０２により送付される表示データに応じてテキストや画像を表示する。 The CPU 602 is an arithmetic processing unit that controls the operation of the entire computer 600. The memory 604 is a storage unit for storing a program for controlling the operation of the computer 600 in advance or using it as a work area as needed when executing the program. The memory 604 is, for example, a Random Access Memory (RAM), a Read Only Memory (ROM), or the like. The input device 606 is a device that, when operated by a computer user, acquires various information input from the user associated with the operation content and sends the acquired input information to the CPU 602. Keyboard device, mouse device, etc. The output device 608 is a device that outputs a processing result by the computer 600, and includes a display device and the like. For example, the display device displays text and images according to display data sent by the CPU 602.

外部記憶装置６１２は、例えば、ハードディスクなどの記憶装置であり、ＣＰＵ６０２により実行される各種制御プログラムや、取得したデータ等を記憶しておく装置である。媒体駆動装置６１４は、可搬記録媒体６１６に書き込みおよび読み出しを行うための装置である。ＣＰＵ６０２は、可搬記録媒体６１６に記録されている所定の制御プログラムを、記録媒体駆動装置６１４を介して読み出して実行することによって、各種の制御処理を行うようにすることもできる。可搬記録媒体６１６は、例えばＣｏｍｐａｃｔＤｉｓｃ（ＣＤ）−ＲＯＭ、ＤｉｇｉｔａｌＶｅｒｓａｔｉｌｅＤｉｓｃ（ＤＶＤ）、ＵｎｉｖｅｒｓａｌＳｅｒｉａｌＢｕｓ（ＵＳＢ）メモリ等である。ネットワーク接続装置６１８は、有線または無線により外部との間で行われる各種データの授受の管理を行うインタフェース装置である。バス６１０は、上記各装置等を互いに接続し、データのやり取りを行う通信経路である。 The external storage device 612 is, for example, a storage device such as a hard disk, and stores various control programs executed by the CPU 602, acquired data, and the like. The medium driving device 614 is a device for writing to and reading from the portable recording medium 616. The CPU 602 can perform various control processes by reading and executing a predetermined control program recorded on the portable recording medium 616 via the recording medium driving device 614. The portable recording medium 616 is, for example, a Compact Disc (CD) -ROM, a Digital Versatile Disc (DVD), a Universal Serial Bus (USB) memory, or the like. The network connection device 618 is an interface device that manages transmission / reception of various data performed between the outside by wired or wireless. A bus 610 is a communication path for connecting the above devices and the like to exchange data.

上記第１の実施の形態及びその変形例による言語処理方法をコンピュータに実行させるプログラムは、例えば外部記憶装置６１２に記憶させる。ＣＰＵ６０２は、外部記憶装置６１２からプログラムを読み出し、コンピュータ６００に言語処理の動作を行なわせる。このとき、まず、言語処理をＣＰＵ６０２に行わせるための制御プログラムを作成して外部記憶装置６１２に記憶させておく。そして、入力装置６０６から所定の指示をＣＰＵ６０２に与えて、この制御プログラムを外部記憶装置６１２から読み出させて実行させるようにする。また、このプログラムは、可搬記録媒体６１６に記憶するようにしてもよい。 A program that causes a computer to execute the language processing method according to the first embodiment and its modification is stored in, for example, the external storage device 612. The CPU 602 reads a program from the external storage device 612 and causes the computer 600 to perform language processing. At this time, first, a control program for causing the CPU 602 to perform language processing is created and stored in the external storage device 612. Then, a predetermined instruction is given from the input device 606 to the CPU 602 so that the control program is read from the external storage device 612 and executed. Further, this program may be stored in the portable recording medium 616.

以上の実施形態に関し、さらに以下の付記を開示する。
（付記１）
入力された文に含まれる複数の単語を抽出して前記文の意味を解析した解析結果に基づき、複数の前記単語の夫々について、前記単語を含み一つの意味を成す句があるか否かを判定し、ある場合には前記句を出力する句判定部と、
複数の前記単語のうちの互いに関係する２つの単語同士の関係を示す前記関係情報及び前記句判定部の判定結果に基づき、前記単語または前記句を前後の単語または句とつなげた別の句として表示させるか否かの判別を行う拡張部と、
前記文を表示させる際に、前記別の句として表示させると判別された前記単語または句を一つの句として表示させる処理を行う表示処理部と、
前記解析結果及び前記表示処理部での処理の結果に基づき、前記文を表示する表示部と、
を有することを特徴とする言語処理装置。
（付記２）
前記関係情報と、前記単語または前記句同士を前記別の句として表示させるべきか否かにより分類される種類とを関連付けた拡張情報を記憶する記憶部をさらに有し、
前記拡張部は、前記拡張情報に応じて判別を行うことを特徴とする付記１に記載の言語処理装置。
（付記３）
前記文中の文字列が選択されたことを取得する選択取得部、
前記文字列と前記文字列に含まれる前記単語または前記句との関係を計算することにより、選択された前記単語または前記句を特定する対応計算部、
をさらに有し、
前記拡張部は、前記対応計算部により特定された前記単語または前記句に基づき前記判別を行うことを特徴とする付記１または付記２に記載の言語処理装置。
（付記４）
前記文の形態素解析を行い、前記形態素解析に基づき、前記形態素同士の関係から前記文の構文解析を行い、前記構文解析の結果に基づき前記関係情報を生成する解析部、
をさらに有し、
前記拡張部は、前記関係情報として前記解析部により生成された前記関係情報に基づき判別を行う
ことを特徴とする付記１から付記３のいずれかに記載の言語処理装置。
（付記５）
前記句判定部は、前記解析部による前記構文解析の結果に基づき前記単語を、前記単語を含む名詞句または動詞句に分類する
ことを特徴とする付記４に記載の言語処理装置。
（付記６）
前記表示部は、
前記文を前記単語または前記句と前記関係情報とを含んで表示する際に、縮退させて表示可能な複数の前記単語または前記句を決定し、前記縮退させて表示可能な複数の前記単語または前記句の中で、縮退させない状態に同時に展開する前記複数の単語または前記句の組合せを前記関係情報に基づき決定し、前記縮退させた表示と、前記縮退させない状態の表示とを切り換える指示を取得すると、前記指示及び前記組合せに基づき前記表示を切換える表示制御部
をさらに有することを特徴とする付記１から付記５のいずれかに記載の言語処理装置。
（付記７）
言語処理装置が、
入力された文に含まれる複数の単語を抽出して前記文の意味を解析した解析結果に基づき、前記単語を、前記単語を含み一つの意味を成す最小単位の句があるか否かを判定し、ある場合には前記句を出力し、
複数の前記単語のうちの互いに関係する２つの単語同士の関係を示す関係情報及び前記句判定部の判定結果に基づき、前記単語または前記句を前後の単語または句とつなげた別の句として表示させるか否かの判別を行い、
前記文を表示させる際に、前記別の句として表示させると判別された前記単語または句を一つの句として表示させる処理を行い、
前記解析結果及び前記表示させる処理の結果に基づき、前記文を表示する
ことを特徴とする言語処理方法。
（付記８）
前記関係情報と、前記単語または前記句同士を前記別の句として表示させるべきか否かにより分類される種類とを関連付けた拡張情報に応じて前記判別を行うことを特徴とする付記７に記載の言語処理方法。
（付記９）
さらに、
前記文中の文字列が選択されたことを取得して、前記文字列と前記文字列に含まれる前記単語または前記句との関係を計算することにより、選択された前記単語または前記句を特定し、
特定された前記単語または前記句に基づき前記判別を行うことを特徴とする付記７または付記８に記載の言語処理方法。
（付記１０）
さらに
前記文の形態素解析を行い、
前記形態素解析に基づき、前記形態素同士の関係から前記文の構造を解析する構文解析を行い、
前記構文解析の解析結果に基づき前記関係情報を生成する意味解析を行い、
前記意味解析により生成された前記関係情報及び前記拡張情報に基づき、前記判別を行う
ことを特徴とする付記７から付記９のいずれかに記載の言語処理方法。
（付記１１）
前記句の分類は、前記構文解析による解析結果に基づき前記単語を、前記単語を含む名詞句または動詞句に分類する
ことを特徴とする付記１０に記載の言語処理方法。
（付記１２）
前記表示する処理は、
前記文を前記単語または前記句と前記関係情報とを含んで表示する際に、縮退させて表示可能な複数の前記単語または前記句を決定し、前記縮退させて表示可能な複数の前記単語または前記句の中で、縮退させない状態に同時に展開する前記複数の単語または前記句の組合せを前記関係情報に基づき決定し、前記縮退させた表示と、前記縮退させない状態の表示とを切り換える指示を取得すると、前記指示及び前記組合せに基づき前記表示を切換える
ことを特徴とする付記７から付記１１のいずれかに記載の言語処理方法。
（付記１３）
入力された文に含まれる複数の単語を抽出して前記文の意味を解析した解析結果に基づき、前記単語を、前記単語を含み一つの意味を成す最小単位の句があるか否かを判定し、ある場合には前記句を出力し、
複数の前記単語のうちの互いに関係する２つの単語同士の関係を示す関係情報及び前記句判定部の判定結果に基づき、前記単語または前記句を前後の単語または句とつなげた別の句として表示させるか否かの判別を行い、
前記文を表示させる際に、前記別の句として表示させると判別された前記単語または句を一つの句として表示させる処理を行い、
前記解析結果及び前記表示させる処理の結果に基づき、前記文を表示する
処理をコンピュータに実行させるプログラム。
（付記１４）
前記関係情報と、前記単語または前記句同士を前記別の句として表示させるべきか否かにより分類される種類とを関連付けた拡張情報に応じて前記判別を行う
ことを特徴とする付記１３に記載のプログラム。
（付記１５）
さらに、
前記文中の文字列が選択されたことを取得して、前記文字列と前記文字列に含まれる前記単語または前記句との関係を計算することにより、選択された前記単語または前記句を特定し、
特定された前記単語または前記句に基づき前記判別を行う
ことを特徴とする付記１３または付記１４に記載のプログラム。 Regarding the above embodiment, the following additional notes are disclosed.
(Appendix 1)
Based on the analysis result obtained by extracting a plurality of words included in the input sentence and analyzing the meaning of the sentence, for each of the plurality of words, it is determined whether there is a phrase that includes the word and has one meaning. A phrase determination unit that determines and outputs the phrase if any;
As another phrase in which the word or the phrase is connected to the preceding or following word or phrase based on the relationship information indicating the relationship between two words related to each other among the plurality of words and the determination result of the phrase determination unit An extension for determining whether to display, and
When displaying the sentence, a display processing unit that performs processing to display the word or phrase determined to be displayed as the another phrase as one phrase;
Based on the analysis result and the result of processing in the display processing unit, a display unit for displaying the sentence;
A language processing apparatus comprising:
(Appendix 2)
A storage unit that stores extended information that associates the relationship information with a type classified based on whether the words or phrases should be displayed as the other phrases;
The language processing apparatus according to appendix 1, wherein the extension unit performs determination according to the extension information.
(Appendix 3)
A selection acquisition unit for acquiring that a character string in the sentence is selected;
A correspondence calculation unit that identifies the selected word or the phrase by calculating a relationship between the character string and the word or the phrase included in the character string;
Further comprising
The language processing apparatus according to appendix 1 or appendix 2, wherein the extension unit performs the determination based on the word or the phrase specified by the correspondence calculation unit.
(Appendix 4)
An analysis unit that performs morphological analysis of the sentence, performs syntax analysis of the sentence from the relationship between the morphemes based on the morphological analysis, and generates the relationship information based on a result of the syntax analysis;
Further comprising
The language processing apparatus according to any one of appendix 1 to appendix 3, wherein the extension unit performs determination based on the relationship information generated by the analysis unit as the relationship information.
(Appendix 5)
The language processing apparatus according to appendix 4, wherein the phrase determination unit classifies the word into a noun phrase or a verb phrase including the word based on a result of the parsing by the analysis unit.
(Appendix 6)
The display unit
When displaying the sentence including the word or the phrase and the relationship information, the plurality of words or the phrase that can be displayed in a reduced manner are determined and the plurality of the words that can be displayed in the reduced state or In the phrase, the plurality of words or the combination of the phrases that are simultaneously expanded to be in a non-degenerate state are determined based on the relationship information, and an instruction to switch between the reduced display and the non-degenerate display is obtained. Then, the language processing apparatus according to any one of appendix 1 to appendix 5, further comprising a display control unit that switches the display based on the instruction and the combination.
(Appendix 7)
The language processor
Based on an analysis result obtained by extracting a plurality of words included in the input sentence and analyzing the meaning of the sentence, it is determined whether or not there is a minimum unit phrase including the word and forming one meaning. And if there is, output the phrase,
Based on the relationship information indicating the relationship between two related words of the plurality of words and the determination result of the phrase determination unit, the word or the phrase is displayed as another phrase connected to the preceding or following word or phrase To determine whether or not
When displaying the sentence, performing the process of displaying the word or phrase determined to be displayed as the another phrase as one phrase,
A language processing method, wherein the sentence is displayed based on the analysis result and the processing result to be displayed.
(Appendix 8)
The appendix 7 is characterized in that the determination is performed according to extended information that associates the relationship information with a type classified based on whether the word or the phrases should be displayed as the different phrases. Language processing method.
(Appendix 9)
further,
Obtaining that the character string in the sentence is selected, and calculating the relationship between the character string and the word or the phrase included in the character string, thereby identifying the selected word or the phrase ,
The language processing method according to appendix 7 or appendix 8, wherein the discrimination is performed based on the identified word or phrase.
(Appendix 10)
Furthermore, morphological analysis of the sentence is performed,
Based on the morpheme analysis, perform a syntax analysis to analyze the structure of the sentence from the relationship between the morphemes,
Performing semantic analysis to generate the relation information based on the analysis result of the syntax analysis,
The language processing method according to any one of appendix 7 to appendix 9, wherein the determination is performed based on the relation information and the extended information generated by the semantic analysis.
(Appendix 11)
The language processing method according to appendix 10, wherein the phrase classification is performed by classifying the word into a noun phrase or a verb phrase including the word based on an analysis result by the syntax analysis.
(Appendix 12)
The processing to display is
When displaying the sentence including the word or the phrase and the relationship information, the plurality of words or the phrase that can be displayed in a reduced manner are determined and the plurality of the words that can be displayed in the reduced state or In the phrase, the plurality of words or the combination of the phrases that are simultaneously expanded to be in a non-degenerate state are determined based on the relationship information, and an instruction to switch between the reduced display and the non-degenerate display is obtained. Then, the language processing method according to any one of appendix 7 to appendix 11, wherein the display is switched based on the instruction and the combination.
(Appendix 13)
Based on an analysis result obtained by extracting a plurality of words included in the input sentence and analyzing the meaning of the sentence, it is determined whether or not there is a minimum unit phrase including the word and forming one meaning. And if there is, output the phrase,
Based on the relationship information indicating the relationship between two related words of the plurality of words and the determination result of the phrase determination unit, the word or the phrase is displayed as another phrase connected to the preceding or following word or phrase To determine whether or not
When displaying the sentence, performing the process of displaying the word or phrase determined to be displayed as the another phrase as one phrase,
A program that causes a computer to execute a process of displaying the sentence based on the analysis result and the result of the process to be displayed.
(Appendix 14)
The appendix 13 is characterized in that the determination is performed according to extended information that associates the relationship information with a type classified depending on whether the word or the phrase should be displayed as the another phrase. Program.
(Appendix 15)
further,
Obtaining that the character string in the sentence is selected, and calculating the relationship between the character string and the word or the phrase included in the character string, thereby identifying the selected word or the phrase ,
The program according to appendix 13 or appendix 14, wherein the determination is performed based on the specified word or phrase.

１言語処理装置
３入力部
５解析部
７形態素解析部
９構文解析部
１１意味解析部
１３名詞句判定部
１５動詞句判定部
１７ノード拡張部
１９句表示最適化部
２１対応計算部
２３表示制御部
２５解析結果記憶部
２７表示部
９２〜９８縮退ノード
１００〜１０６アーク
１５０形態素解析情報
１５２形態素番号
１５４形態素
１５６開始位置
１５８文字長
１６０品詞情報
１６２意味記号
１７０品詞記号情報
１７２品詞記号
１７４意味
１８０構文解析情報
１８２構成要素
１８４品詞
１８６ラベル
１８８主辞
２００意味解析情報
２０２出発ノード
２０４到達ノード
２０６アーク
２０８形態素番号
２１０意味記号
２１２形態素
２１４形態素番号
２１６意味記号
２１８形態素
３７２簡略ノード
３７９簡略ノード
３８０表示例
３８２表示例
３９０表示例
３９２簡略ノード DESCRIPTION OF SYMBOLS 1 Language processing device 3 Input part 5 Analysis part 7 Morphological analysis part 9 Syntax analysis part 11 Semantic analysis part 13 Noun phrase determination part 15 Verb phrase determination part 17 Node expansion part 19 Phrase display optimization part 21 Corresponding calculation part 23 Display control part 23 25 Analysis result storage unit 27 Display unit 92 to 98 Degenerate node 100 to 106 Arc 150 Morphological analysis information 152 Morphological number 154 Morphological unit 156 Start position 158 Character length 160 Part of speech information 162 Meaning symbol 170 Part of speech symbol information 172 Part of speech symbol 174 Meaning 180 Parsing Information 182 Component 184 Part of speech 186 Label 188 Semantics 200 Semantic analysis information 202 Departure node 204 Arrival node 206 Arc 208 Morpheme number 210 Semantic symbol 212 Morpheme 214 Morpheme number 216 Semantic symbol 218 Morpheme 372 Simplified node 379 Simplified node 380 Display Example 382 Display example 390 Display example 392 Simplified node

Claims

Based on the analysis result obtained by extracting a plurality of words included in the input sentence and analyzing the meaning of the sentence, for each of the plurality of words, it is determined whether there is a phrase that includes the word and has one meaning. A phrase determination unit that determines and outputs the phrase if any;
Perform morphological analysis of the sentence, perform syntax analysis of the sentence based on the relationship between the morphemes of the sentence based on the morphological analysis, and based on the result of the syntax analysis , semantic relationships between words, and an analysis unit for generating the relationship information indicating the semantic relation between a word that holds the primary significance in multiple clauses output from the phrase determination unit and each of the words,
Based on said relationship information, the determination of whether to display another phrase the word or the phrase concatenation and before and after the word or phrase, the relationship between the availability of it to be displayed as the another phrase said relationship information An extension unit that performs in accordance with predefined extension information ;
A display processing unit that collects the words or phrases that are determined to be displayed as the different phrases as one phrase;
Based on the analysis result obtained by analyzing the meaning of the sentence and the processing result in the display processing unit, the phrase analyzed as the central concept of the sentence , the phrase compiled by the display processing unit, and the phrase and the phrase A display unit for displaying the relationship information representing a semantic relationship with a phrase ;
A language processing apparatus comprising:

A selection acquisition unit for acquiring that a character string in the sentence is selected;
A correspondence calculation unit that identifies the selected word or the phrase by calculating a relationship between the character string and the word or the phrase included in the character string;
Further comprising
The extensions, language processing apparatus according to claim 1, characterized in that the determination as an object the corresponding calculation unit the word or the phrase specified by.

The language processing according to claim 1 or 2 , wherein the phrase determination unit classifies the word into a noun phrase or a verb phrase including the word based on a result of the parsing by the analysis unit. apparatus.

The display unit
When an instruction to switch the display of the grouped phrases is acquired by the display processing unit, instead of displaying the grouped phrases, a word holding the main meaning in the grouped phrases, the main meaning in the grouped phrases And the relationship information representing the semantic relationship between the word or phrase other than the word holding the main word and the word or phrase other than the word holding the main meaning and the word or phrase other than the word holding the main meaning. language processing apparatus according to further comprising a display control unit for displaying on the basis of the result of the process from claim 1, wherein in any one of claims 3.

The language processor
Based on the extracted plurality of words included in the input sentence analysis result of the analysis of the meaning of the sentence, for each of a plurality of said word, whether there is a phrase makes sense one includes the words Judgment, if there is, output the phrase,
Perform morphological analysis of the sentence,
Based on the morphological analysis, syntactic analysis of the sentence from the relationship between the morphemes of the sentence,
Based on the result of the parsing, a semantic relationship between two mutually related words out of the plurality of the words, and a word having a main meaning in each of the plurality of the words and the output phrase, Generate relationship information that shows the semantic relationship of
Based on said relationship information, the determination of whether to display another phrase the word or the phrase concatenation and before and after the word or phrase, the relationship between the availability of it to be displayed as the another phrase said relationship information Is performed according to predefined extended information ,
The words or phrases that are determined to be displayed as the different phrases are grouped as one phrase,
Based on the results of collectively Ru processing as the analysis result and the one phrase meaning analyzed the sentence, phrase parsed as central idea of the sentence, the gathered phrase, and, between the phrase and該句 A language processing method characterized by displaying the relationship information representing a semantic relationship .

Based on the extracted plurality of words included in the input sentence analysis result of the analysis of the meaning of the sentence, for each of a plurality of said word, whether there is a phrase makes sense one includes the words Judgment, if there is, output the phrase,
Perform morphological analysis of the sentence,
Based on the morphological analysis, syntactic analysis of the sentence from the relationship between the morphemes of the sentence,
Based on the result of the parsing, a semantic relationship between two mutually related words out of the plurality of the words, and a word having a main meaning in each of the plurality of the words and the output phrase, Generate relationship information that shows the semantic relationship of
Based on said relationship information, the determination of whether to display another phrase the word or the phrase concatenation and before and after the word or phrase, the relationship between the availability of it to be displayed as the another phrase said relationship information Is performed according to predefined extended information ,
The words or phrases that are determined to be displayed as the different phrases are grouped as one phrase,
Based on the results of collectively Ru processing as the analysis result and the one phrase meaning analyzed the sentence, phrase parsed as central idea of the sentence, the gathered phrase, and, between the phrase and該句 A program for causing a computer to execute processing for displaying the relationship information representing a semantic relationship .