JPH0269869A

JPH0269869A - Language analyzing device

Info

Publication number: JPH0269869A
Application number: JP63222890A
Authority: JP
Inventors: Toshihiko Yokogawa; 横川　壽彦
Original assignee: Ricoh Co Ltd
Current assignee: Ricoh Co Ltd
Priority date: 1988-09-06
Filing date: 1988-09-06
Publication date: 1990-03-08

Abstract

PURPOSE:To eliminate the failure of analysis due to the mistake of the punctuation of a sentence by providing the inputting means of language to be analyzed, a means to register a sentence end pattern and a sentence dividing means to divide the inputted objective language to be analyzed into sentences by using the registered sentence end pattern. CONSTITUTION:An input file 1, a manual inputting part 2 such as a keyboard, an inputting device 3, a sentence end pattern file 4, a sentence end pattern registering part 5, a sentence end pattern buffer 6, a sentence end recognizing part 7, a sentence end deciding part 8, a language analyzing part 9 and a like of character collating table 10 are provided with a control part 11 and the output terminal 12 of an analyzed result. Then, the inputted objective language to be analyzed is divided into the sentences by using the registered sentence end pattern. Accordingly, the pattern of the end of the sentence different according to the sentence can be dealt with dynamically. Thus, in the case of natural language analysis, it can be easily and correctly divided into the sentence which is the unit of the language analysis according to the format or the like of the sentence.

Description

【発明の詳細な説明】伎亙氷見本発明は、言語解析装置、より詳細には、自然言語解析
装置１機械翻訳装置の解析部に関し、例えば質問応答シ
ステム等における言語解析等に応用可能なものである。DETAILED DESCRIPTION OF THE INVENTION The present invention relates to a language analysis device, more specifically, to an analysis section of a natural language analysis device 1 machine translation device, and is applicable to, for example, language analysis in a question answering system, etc. It is.

丈米技権従来の言語解析装置においては、予めシステム製作者が
設定した文末のパターンしか文末として認識しない。例
えば、セミコロンが来た後は必ず文末にするようにする
といったように、システム作成者の定めたパターンは、
必ず文の切れ目となり、それ以外は文の切れ目とはなら
ない。しかし、文章によっては、句読点の用い方に差が
あるのは予想される。例えば、ある人は、セミコロンは
、文の切れ目として扱うのに対して、ある人は、同格の
説明を与えるために用いているといったことがある。こ
の場合、予めシステム製作者の設定した文末パターンで
は、どちらかの文書に対応するのが精いっばいである。Previous language analysis devices only recognize sentence ending patterns that are set in advance by the system manufacturer as sentence endings. For example, the pattern established by the system creator, such as making sure that after a semicolon is at the end of a sentence,
It always becomes a sentence break, and other times it does not become a sentence break. However, it is expected that there will be differences in the usage of punctuation depending on the text. For example, some people treat semicolons as sentence breaks, while others use them to explain appositions. In this case, the end-of-sentence pattern set in advance by the system manufacturer is at best compatible with either document.

また、システム製作者の予想していないような文末パタ
ーンには対応できない欠点がある。Another drawback is that it cannot handle sentence ending patterns that the system creator has not anticipated.

こうした欠点を解決するために、従来技術では。In order to solve these shortcomings, the prior art.

例えば、交編集機能を用いて、文章そのものを編集して
、システムが文末であると認識する形あるいは文を切り
たくない場合は、システムが文の切れ目と認識しない形
に変形してやる必要があり、非常に面倒であった。この
ためには、文末の標準パターンを使用者が知る必要があ
る。さらに、解析対象文中の句読点等を変更する必要が
あるので、原文のイメージを損なってしまうという欠点
もある。For example, if you want to use the cross-editing function to edit the text itself so that the system recognizes it as the end of a sentence, or if you do not want to cut the sentence, you need to transform it into a shape that the system does not recognize as a break in the sentence. It was very troublesome. For this purpose, the user needs to know the standard pattern at the end of a sentence. Furthermore, since it is necessary to change punctuation marks, etc. in the sentence to be analyzed, there is also the disadvantage that the image of the original sentence is spoiled.

また自然言語の文章を解析するいわゆる自然言語解析シ
ステム等においては、現在のところ、文を単位に解析を
するシステムが大半である。したがって、文の切れ目の
認識に失敗すると解析がうまく行かないということにな
る。上述したように、従来技術では、システム作成者が
最初に設定した文の切れ目しか許されない。それ以外の
場合は、例えば、文の切れ目であるという決まりの記号
連続等を入れるということになるが、非常に大変な労力
が必要である。ところで、文の切れ目の約束は、文章ご
とに決まっているといってよい。したがって、同じ英語
の文章でも、手紙の文と、マニュアルの文では、文の切
れ目とすべきものが異なっているのが悴通である。Furthermore, at present, most of the so-called natural language analysis systems that analyze natural language sentences are systems that analyze sentences in units of sentences. Therefore, if the recognition of sentence breaks fails, the analysis will not go well. As described above, in the prior art, only sentence breaks initially set by the system creator are allowed. In other cases, for example, it is necessary to insert a series of regular symbols that indicate breaks in sentences, but this requires a great deal of effort. By the way, it can be said that the rules for sentence breaks are fixed for each sentence. Therefore, even in the same English sentence, the text that should be used as a sentence break is different between a letter and a manual.

且−一眞本発明は、上述のごとき実情に鑑みてなされたもので、
特に１文の切れ目を示す複数の記号連続を使用者が文章
の種類に合わせて、自由に設定できるようにすることに
よって１文の切れ目をまちがえるがために解析に失敗す
るといったことをなくすことを目的としてなされたもの
である。- Kazuma The present invention was made in view of the above-mentioned circumstances,
In particular, by allowing the user to freely set a series of symbols that indicate breaks in a sentence according to the type of sentence, it is possible to eliminate failures in analysis due to mistakes in the breaks in a sentence. It was done for a purpose.

盪−一双本発明は、上記目的を達成するために、文分割装置にお
いて、解析する言語の入力手段と、文末パターンを登録
する手段と、登録した文末パターンを用いて、入力され
た解析対象言語を文に分割する文分割手段とを有するこ
と、あるいは、さらに、文末コードをもつ入力文を解析
する言語の解析手段を有し、前記文分割手段において入
力言語を文に分割した後に解析を行うようにした言語解
析装置を特徴とするものである。以下、本発明の実施例
に基づいて説明する。(ii) In order to achieve the above object, the present invention provides, in a sentence segmentation device, an input means for a language to be analyzed, a means for registering a sentence ending pattern, and an input language to be analyzed using the registered sentence ending pattern. or a language analysis means for analyzing an input sentence having a sentence-end code, and the sentence division means performs the analysis after dividing the input language into sentences. The language analysis device is characterized by the following. Hereinafter, the present invention will be explained based on examples.

第１図は、本発明の一実施例を説明するための全体構成
図で、同図は、英語の解析装置に本解析装置を応用した
例を示す、第１図において、１は入力ファイル、２はキ
ーボード等の手操作入力部。FIG. 1 is an overall configuration diagram for explaining one embodiment of the present invention, and the figure shows an example in which this analysis device is applied to an English language analysis device. In FIG. 1, 1 is an input file; 2 is a manual input section such as a keyboard.

３は入力装置、４は文末パターンファイル、５は文末パ
ターン登録部、６は文末パターンバッファ、７は文末認
定部、８は文末判定部、９は言語解析部、１０は字類照
合用テーブル、１１は制御部、１２は解析結果の出力端
で、点線内は文分割部である。3 is an input device, 4 is a sentence-end pattern file, 5 is a sentence-end pattern registration unit, 6 is a sentence-end pattern buffer, 7 is a sentence-end recognition unit, 8 is a sentence-end determination unit, 9 is a language analysis unit, 10 is a class matching table, 11 is a control section, 12 is an output end of the analysis result, and the area within the dotted line is a sentence division section.

第２〜４図は、第１図の動作説明をするためのフローチ
ャートで、以下、このフローチャートに従って説明する
。2 to 4 are flowcharts for explaining the operation of FIG. 1, and the following description will be made according to this flowchart.

言語解析装置が起動されると（第２図５ＴＡＲＴ）、文
末パターンファイル４をアクセスし、言語解析システム
中の文末パターンバッファ６に登録する。When the language analysis device is activated (FIG. 2, 5TART), the sentence-end pattern file 4 is accessed and registered in the sentence-end pattern buffer 6 in the language analysis system.

言語解析システム起動の際のパラメータとして、文末パ
ターンファイルの名称を与えることができるようにして
、所望の文末パターンファイルを用いることができるよ
うに構成しである。これは、例えば、翻訳エディタと言
ったものを用いる際は、起動の際のパラメータとして与
えることも可能であるし、また、翻訳エディタ中の一機
能として、文末パターンファイルを与えることもできる
。The structure is such that the name of the sentence-final pattern file can be given as a parameter when starting the language analysis system, so that a desired sentence-final pattern file can be used. For example, when using a translation editor, this can be given as a parameter when starting up, or a sentence-end pattern file can be given as a function in the translation editor.

文末パターンファイル４は、文末パターンと、それに対
応する文末位置指定をペアとするレコードの集合として
構成されている。したがって、言語解析システムでは、
文末パターンファイルの一レコードを−パターンとして
、レコード数だけのパターンを登録できるようにしであ
る。The end-of-sentence pattern file 4 is configured as a set of records each pairing an end-of-sentence pattern and a corresponding end-of-sentence position designation. Therefore, in a language analysis system,
This allows one record of the sentence-end pattern file to be used as a - pattern, and as many patterns as there are records to be registered.

文末パターンは、英語の場合では、アルファベット；句
読点；空白；改行；大文字；小文字等を表わすことので
きる文字、あるいは、文字連続の並びとして表わされる
。例えば、第１表のように約束しておく。文末位置指定
は１文末パターン上の何文字目の後ろに文末を設定する
かを指定するものである。文末パターンの直前に文末を
設定する場合は、Ｏとなる。In the case of English, the sentence ending pattern is expressed as a character or a sequence of consecutive characters that can represent alphabets, punctuation marks, spaces, line breaks, uppercase letters, lowercase letters, etc. For example, make promises as shown in Table 1. The end-of-sentence position designation is for specifying after which character in the end-of-sentence pattern the end of a sentence is to be set. When setting the end of a sentence immediately before the end of sentence pattern, the value is O.

第１表文末パターンと１文末位置指定の間には、タブコードが
あるものとしている。したがって、文末パターンファイ
ルのルーコードは、「文末パターン（ＴＡＢ）文末位置
指定（改行）」の形式であられされ、例示すると第５図
に示すようにあられされる。図中、ムは、タブコードを
示す。It is assumed that there is a tab code between the first table sentence end pattern and the first sentence end position specification. Therefore, the loop code of the sentence-end pattern file is written in the format of ``sentence-end pattern (TAB) sentence-end position designation (line feed)'', as shown in FIG. 5, for example. In the figure, mu indicates a tab code.

言語解析システムを起動すると（第２図５ＴＡＲＴ）　
。When the language analysis system is started (Fig. 2 5TART)
.

パラメータで指定された文末パターンファイルをオープ
ンする（第２図５ｔｅｐ２）。次いで、文末パターンフ
ァイルから、−行ずつ読み取り（第２図５ｔｅｐ２）、
ファイルの終りでなければ（第２図５ｔｅｐ２）、タブ
までの部分を文末パターンとして登録しく第２図５ｔｅ
ｐ４）、それに対応する文末位置指定を数値として文末
パターンバッファ６に登録する（第２図５ｔｅｐ５）。The sentence ending pattern file specified by the parameter is opened (FIG. 2, step 2). Next, read each - line from the sentence ending pattern file (Figure 2, step 2),
If it is not the end of the file (Fig. 2, step 2), register the part up to the tab as the end-of-text pattern.
p4), and the corresponding sentence end position designation is registered as a numerical value in the sentence end pattern buffer 6 (FIG. 2, step 5).

このとき、文末パターンの最後には、ヌル（Ｎｕｌｌ）
コードを入れるものとする。これを、文末パターンファ
イルの終了まで繰り返す。第６図は、このようにして作
られた文末パターンバッファの例である。図中、　ｒ（
）Ｊはヌルコードを意味する。従来は、これらのパター
ンを固定的にシステムに組み込んで処理するか、あるい
はプログラム的に固定的な処理を行なっていた。At this time, at the end of the sentence-end pattern, there is a null
The code shall be entered. Repeat this until the end of the sentence-end pattern file. FIG. 6 is an example of a sentence-end pattern buffer created in this manner. In the figure, r(
) J means a null code. Conventionally, these patterns have been fixedly incorporated into a system and processed, or fixedly processed using a program.

文末判定部８は、入力手段３から、入力を入力バッファ
に受は取ると、入力バッファ中の現文字位置を記憶しく
第３図５ｔｅｐｌ）、−文字ずつ、文末パターンの１文
字目と等しいか（あるいは、同じ類に属するか）を検査
する（第３図５ｔａｐ２）。第３図５ｔｅｐ２の詳細を
第４図に示す、文末パターンの文字が（Ｎｕｌｌ）なら
ば照合は成功とし、Ｎｕｌｌでなければ（第４図５ｔｅ
ｐｌ）、文末パターン現在設定の文字が￥であるかどう
かを調べる（第４図５ｔｅｐ２）。￥であれば次の文字
をとり（第４図３ｔｅｐ５）、字類照合用テーブル１０
用いて同じ類に属するかどうかの検査を行なう（第４図
５ｔｅｐ６）　ｏ字類照合用テーブルは、第７図に示す
ように￥を伴う記号に属する内容か記述されている。字
類中の字であれば、文末パターンを２文字、入力文字（
バッファ）を１文字進め、５ｔｅｐｌにもどる。１文字
目がどの文末パターンとも合致しない場合は、その文字
がそのまま。When the end-of-sentence determining unit 8 receives input from the input means 3 into the input buffer, it stores the current character position in the input buffer (Fig. 3, 5tepl), and determines whether the character is equal to the first character of the end-of-sentence pattern or not. (or whether they belong to the same class) (Figure 3, tap 2). The details of step 2 in Figure 3 are shown in Figure 4. If the characters in the sentence ending pattern are (Null), the matching is considered successful, and if it is not Null (Figure 4, 5te
pl), check whether the currently set character in the sentence ending pattern is ¥ (Figure 4, step 2). If it is ¥, take the next character (Fig. 4, step 5), and type matching table 10.
(FIG. 4, 5 step 6) The o-class matching table describes contents belonging to symbols accompanying ¥, as shown in FIG. 7. If it is a character in the class, add two characters for the sentence ending pattern and the input character (
Advance the buffer) by one character and return to 5tepl. If the first character does not match any sentence ending pattern, that character remains as is.

言語解析部に送られ、次の文字に進める。−文字目が、
合致しないパターンについては、その入力バッファ上の
位置を記憶し、２文字目以降も同様に検査し、パターン
が、ヌルコードに達した場合に１文末認定を行なう。全
てのパターンについて、失敗した場合は、その文字をそ
のまま言語解析部に送り、次の文字に進める（第３図５
ｔｅｐ３）。It is sent to the language analysis department and proceeds to the next character. -The letter is
For patterns that do not match, their positions on the input buffer are stored, and the second and subsequent characters are similarly checked, and when the pattern reaches a null code, the end of one sentence is recognized. If all the patterns fail, send the character as is to the language analysis section and proceed to the next character (Fig. 3, 5).
step 3).

文末認定部７では、記憶しておいた、入力バッファ上の
文字から、文末位置指示領分だけの文字を言語解析部９
に送る。そして、予め言語解析部側で定めである、文末
コードを送る。（本システムでは、￥ｑｌの３文字の連
続としである。）入力バッファ中のその後の１文字を無
条件で言語解析部に送り、その次の文字から、再び、文
末判定部８の処理を続ける９入力がつきた場合も、言語解析部側で定めた文末コード
を送り１文末判定部・文末認定部の処理を終了する。The end-of-sentence recognition section 7 extracts characters corresponding to the end-of-sentence position indication area from the stored characters on the input buffer to the language analysis section 9.
send to Then, the end-of-sentence code, which is determined in advance by the language analysis section, is sent. (In this system, it is a series of three characters \ql.) The next character in the input buffer is unconditionally sent to the language analysis section, and the process of the end-of-sentence determination section 8 is performed again from the next character. Continue 9 Even when an input is received, the end-of-sentence code determined by the language analysis section is sent and the processing of the end-of-sentence determination section and end-of-sentence recognition section is completed.

言語解析部９では、入力の先頭及び、直前の文末コード
の直後から１次の文末コードまでを１文として、文の解
析を繰り返し行なう。たとえば、特開昭８２−２６７８
７２〜３号公報に示した言語解析装置を利用することが
できる。また、文分割装置を、上記言語解析装置中の形
態素解析部として構成することも同様非常に簡単である
。文末コードは、文をくぎる役割りだけを示し、解析対
象からは除かれるように構成しておく。これは、文を単
位に解析を進める解析システムでは、非常に簡単に実現
できる。解析方法は５文を単位とする解析方法であれば
１文脈自由文法を用いたものであろうが、格文法や、拡
張遷移網を用いたものでもかまわない。The language analysis unit 9 repeatedly analyzes sentences, with the beginning of the input and immediately after the immediately preceding sentence end code up to the first sentence end code as one sentence. For example, JP-A-82-2678
The language analysis device shown in Publication No. 72-3 can be used. Furthermore, it is also very easy to configure the sentence segmentation device as a morphological analysis section in the language analysis device. The sentence-end code is configured so that it only serves to complete the sentence and is excluded from the analysis target. This can be achieved very easily with an analysis system that analyzes sentences in units of sentences. As for the analysis method, if it is a five-sentence analysis method, it will probably use a one-context-free grammar, but it may also use a case grammar or an extended transition network.

本システｔ１は、構成の簡単な文解析システムとして構
成しているが、それだけではなく、言語解析部は、文末
コードを抽象的な文末位置と認識できるように構成され
たものならどんなものでも可能である。This system t1 is configured as a simple sentence analysis system, but the language analysis unit can be of any type as long as it is configured to recognize the sentence-final code as an abstract sentence-final position. It is.

たとえば、第５図のような文末パターンがＳ＆Ｂされて
いたとき、Ｉ　ｈａｖｅ　ａ　ｂｏｏｋ；　Ｂｕｔ　Ｉ　ｄｏｎｔ
’ｔ　ｈａｖｅ　ａ　ｂｏｏｋ；ｅｓｐｅｃｉａｌｌｙ
　ａ　ｇｏｏｄ　ｂｏｏｋ。For example, when the sentence ending pattern shown in Figure 5 is S&B, I have a book; But I don't
't have a book; specially
a good book.

という文章が入力されたとする。Suppose that the following sentence is input.

最初のｂｏｏｋの後の；は、次が大文字であるので、文
末であると認識され、次のｂｏｏｋ；は、次が小文字で
あるので文末でないと認識される。The ; after the first book is recognized as the end of a sentence because the next character is an uppercase letter, and the next book; is recognized as not the end of a sentence because the next character is a lowercase character.

したがって、構文解析部は、Ｉ　ｈａｖｅ　ａ　ｂｏｏｋ；￥ｑ］　Ｂｕｔ　Ｉ　ｄ
ｏｎ’ｔ　ｈａｖｅ　ａ　ｂｏｏｋ；ｅｓｐｅｃｉａｌ
ｌｙ　ａ　ｇｏｏｄ　ｂｏｏｋ、￥ｑ］という形で送ら
れる。Therefore, the syntax parsing unit does the following: I have a book;\q] But I d
on't have a book; special
ly a good book, ￥q].

構文解析部は、Ｉ　ｈａｖｅ　ａ　ｂｏｏｋ；ＢｕＬ　
Ｉ　ｄｏｎ’ｔ　　ｈａｖｅ　ａ　ｂｏｏｋ；ｅｓｐｅ
ｃｉａｌｌｙ　　ａ　　ｇｏｏｄ　　ｂｏｏｋ。I have a book; BuL
I don't have a book;espe
cially a good book.

という２文として解析を行ない、正しい結果を得ること
が可能になる。By analyzing these two sentences, it is possible to obtain correct results.

とくに、文章作成者や、文書の種類によって、用法の異
なる、コロン・セミコロン等の解析に非常に有効に働く
。In particular, it is very effective in analyzing colons, semicolons, etc., which have different usage depending on the author and the type of document.

翻訳エディタ中の一機能として、実現する場合には、文
末パターンファイルが指定される度に、直前までの文末
バッファをクリアし５文末バッファへの新たな登録を行
なうようにする事によっても容易に実現できる。If this is implemented as a function in the translation editor, it can be easily achieved by clearing the previous sentence-end buffer and registering a new one in the sentence-end buffer every time a sentence-end pattern file is specified. realizable.

本実施例では、文末パターンファイルは、テキストファ
イルで構成され、その変更等は、一般のエディタ等を用
いることを前提にしている。しかし、翻訳エディタ等で
は、文末パターンを登録・変更するようにすることも、
文末パターンファイルが、テキストファイルとして構成
されているので、容易に実現可能である。In this embodiment, the end-of-sentence pattern file is composed of a text file, and changes to the file are assumed to be made using a general editor or the like. However, in translation editors, etc., it is possible to register or change the sentence ending pattern.
Since the sentence ending pattern file is configured as a text file, it can be easily implemented.

また、文末パターンファイルを、言語解析システム上で
利用しやすい形に構成し、翻訳エディタ（あるいは、文
末パターンの構成の合わせた文末パターンエディタ等）
で直接変更するように構成することも可能である。In addition, you can configure the sentence ending pattern file in a format that is easy to use on the language analysis system, and use a translation editor (or a sentence ending pattern editor that matches the structure of the sentence ending pattern, etc.).
It is also possible to configure it so that it can be changed directly.

本実施例は、英語の解析装置を例として取り上げたが、
文末パターンに登録できる字類等を適当に設定すること
によって、日本語の解析や、他の言語の解析装置にも容
易に応用できる。This example takes an English language analysis device as an example, but
By appropriately setting the classes etc. that can be registered in the sentence ending pattern, it can be easily applied to Japanese analysis and other language analysis devices.

勢−一層以上の説明から明らかなように１本発明によると１文章
によって異なる文末のパターンに動的に対応できるため
に、自然言語解析に際して、言語解析の単位である文に
、文章のフォーマット等に応じて、正しく分割すること
が容易に可能になる。As is clear from the above explanation, according to the present invention, since it is possible to dynamically deal with sentence ending patterns that vary depending on one sentence, during natural language analysis, the sentence format etc. Correct division can be easily achieved according to the following.

したがって、文章の解析の効率が非常に向上し、システ
ムの要求する文末パターンに前編集等を用いて、文の編
集を行なう必要がなくなるので、自然言語解析や機械翻
訳の前処理を軽減することができる。また、同種の文章
には、−度作成した文末パターンが有効であるので、同
種の文の大量の機械翻訳や、継続的な翻訳等に際して、
より一層の効果がある。Therefore, the efficiency of sentence analysis is greatly improved, and there is no need to edit sentences by using pre-editing to match the sentence ending pattern required by the system, reducing the amount of pre-processing required for natural language analysis and machine translation. Can be done. In addition, since the sentence ending pattern created twice is effective for sentences of the same type, it is useful for machine translation of a large number of sentences of the same type, continuous translation, etc.
It is even more effective.

[Brief explanation of the drawing]

第１図は、本発明の一実施例を説明するための全体構成
図、第２図乃至第４図は、第１図の実施例の動作を説明
するためのフローチャート、第５図は１文末パターンフ
ァイルの一例を説明するための図、第６図は、文末パタ
ーンバッファの一例を説明するための図、第７図は、字
類照合用テーブルの一例を説明するための図である。３・・・入力装置、４・・・文末パターンファイル、６
・・・文末パターンバッファ、７・・・文末認定部、８
・・・文末判定部、９・・・言語解析部。第］図第図第図第区Figure 1 is an overall configuration diagram for explaining one embodiment of the present invention, Figures 2 to 4 are flowcharts for explaining the operation of the embodiment of Figure 1, and Figure 5 is the end of the first sentence. FIG. 6 is a diagram for explaining an example of a pattern file, FIG. 6 is a diagram for explaining an example of a sentence-end pattern buffer, and FIG. 7 is a diagram for explaining an example of a class matching table. 3... Input device, 4... Sentence end pattern file, 6
...Sentence end pattern buffer, 7...Sentence end recognition section, 8
... Sentence end determination section, 9... Language analysis section. ] Fig. Fig. Fig. Section

Claims

[Claims] 1. It has a means for inputting a language to be analyzed, a means for registering a sentence ending pattern, and a sentence dividing means for dividing the input language to be analyzed into sentences using the registered sentence ending pattern. A language analysis device featuring: 2. An input means for the language to be analyzed, a means for registering a sentence ending pattern, a sentence dividing means for dividing the input language to be analyzed into sentences using the registered sentence ending pattern, and an input sentence having a sentence ending code. 1. A language analysis device comprising: language analysis means for dividing an input language into sentences, and performing analysis after dividing an input language into sentences in said sentence division means.