JPH0193798A

JPH0193798A - Division labelling apparatus

Info

Publication number: JPH0193798A
Application number: JP62251116A
Authority: JP
Inventors: Yoshiharu Abe; 芳春阿部
Original assignee: Mitsubishi Electric Corp
Current assignee: Mitsubishi Electric Corp
Priority date: 1987-10-05
Filing date: 1987-10-05
Publication date: 1989-04-12
Anticipated expiration: 2012-07-09
Also published as: JP2629205B2

Abstract

PURPOSE: To improve the accuracy of pattern matching and the accuracy of divided labeling by statistically processing variation in the section length of divided sections at the time of pattern matching. CONSTITUTION: A file device 1 stores K word voice time sequence patterns, a file device 2 stores K groups of vector label strings and section label strings and a file device 7 stores a division result obtained by m-th repeat. An initial division part 3 reads out a section length label string corresponding to the time sequence pattern of each vector stored in the device 1 from the device 2, executes time division so that each section length is proportional to the weight of a label sort and sends the result to the device 7. A statistic part 4 reads out the newest division result found out by the initial division part 3 or the best division part 6 from the device 7 and finds out the average and dispersion of cepstrum coefficients as the statistic value of vectors and finds out the average and dispersion as the statistical value of section length in each label and stores the obtained results in a file device 5. The best division is repeated in accordance with the contents of the device 5.

Description

【発明の詳細な説明】〔産業上の利用分野〕この発明は、音声バタンで代表されるベクトルの時系列
パタンを、与えられたラベル列に従って分割し、これら
分割区間にラベルを付ける装置の改良に関する。[Detailed Description of the Invention] [Field of Industrial Application] This invention is an improvement of a device that divides a time series pattern of a vector represented by a voice button according to a given label string and labels these divided sections. Regarding.

[Conventional technology]

第２図は、昭和６２年３．ｕｓ日出願の特願昭６２−０
６４９５２号公報、「分割ラベル付け装置」、記載の従
来のこの種装置の機能ブロック図である。図において、
１１）はベクトルの時系列バタンを格納するためのファ
イル装置、（２）はファイル装置＋１１中のベクトルの
時系列パタンに対して付けるべきラベル列を格納するた
めのファイル装置。Figure 2 is from March 1986. Patent application filed on US day 1986-0
64952, "Divided labeling device", is a functional block diagram of a conventional device of this kind. In the figure,
11) is a file device for storing time-series patterns of vectors; (2) is a file device for storing label strings to be attached to time-series patterns of vectors in file device+11;

（３１は初期分割部、（４１は統計部、（５；は統計部
（４）の出力するラベル別の統計量を格納するためのフ
ァイル装置、（６）は最良分割部、（７）は分割結果を
格納するためのファイル装置である。(31 is the initial division section, (41 is the statistics section, (5) is a file device for storing statistics for each label output by the statistics section (4), (6) is the best division section, (7) is This is a file device for storing division results.

ここで、初期分割部（３）或いは最良分割部（６）は。Here, the initial dividing section (3) or the best dividing section (6) is.

ファイル装置ＩＩＩ中の各ベクトルの時系列パタンを。Time series pattern of each vector in file device III.

ファイル装置（２）中の対応するラベル列に従い分割し
、その分割結果をファイル装置（７）に出力するもので
ある。一方、統計部（４）は、これら分割結果から９分
割区間に含まれるベクトルの平均あるいは分散等の統計
量をラベル別に求め、ファイル装置（５）に出力するも
のである。It divides the file device (2) according to the corresponding label string, and outputs the division results to the file device (7). On the other hand, the statistics section (4) calculates statistics such as the average or variance of the vectors included in the nine divided sections from these division results for each label, and outputs them to the file device (5).

ところで、最良分割部（６）は、ある時系列パタンを分
割するに際し、対応するラベル列の順に、ラベル別のベ
クトルの統計量をファイル装置（５）から読み取り、こ
れらを並べた系列を型版として、この型版と分割対象の
時系列パタンとの整合を、動的計画法に基づくバタン整
合技術を用いてとり。By the way, when dividing a certain time series pattern, the best dividing unit (6) reads the statistics of vectors for each label from the file device (5) in the order of the corresponding label sequence, and uses the sequence in which these are arranged as a template. As such, we use a slam matching technique based on dynamic programming to match this template with the time series pattern to be divided.

バタンと型版間の整合の尤度（尤もらしさ）が最大と６
時の時系列パタンの時間分割の状態を最良の分割結果と
して、ファイル装置（７）に格納する。The likelihood of matching between the baton and the template is maximum 6
The time division state of the time series pattern is stored in the file device (7) as the best division result.

ある時点における最良の分割結果は、統計部（４）によ
って、再び最新のラベル別統計量を求めるため用いられ
、これらは更に再度、最良分割のため用いられる。この
様な、最良の分割と、ラベル別に統計量を求める処理と
の繰り返しによって、順次改良された分割結果を得る。The best division result at a certain point in time is used again by the statistics unit (4) to obtain the latest statistics for each label, and these are used again for the best division. By repeating the best division and the process of obtaining statistics for each label, a successively improved division result is obtained.

この様な原理で動作するこの種の装置によって。With this kind of device that works on this principle.

正確な分割ラベル付けを行うためには、最良分割部で利
用される統計量の内容、及びそれに基づく型版と１時系
列パタンの間のバタン整合の技術が問題となる。In order to perform accurate segmentation labeling, the content of the statistics used in the best segmentation unit and the technique of matching between the model version and one time series pattern based on the statistics are important.

[Problem that the invention seeks to solve]

ベクトルの時系列パタンに対して、型版をより正確に整
合させる技術としては、引用文献（電子通信学会論文誌
（Ａ）、第６９−Ａ巻、第２号、ｐｐ２６１−２７０）
に述べられている様に、ベクトルの時系列パタンを分割
する際、各分割区間の区間長に上限及び下限を決め、こ
れら区間長の許容範囲の制限の下で最良な整合状態を求
める方法がある。As a technique for more accurately matching the template to the time series pattern of vectors, the cited document (Transactions of the Institute of Electronics and Communication Engineers (A), Volume 69-A, No. 2, pp 261-270)
As described in , when dividing a time series pattern of vectors, there is a method of determining upper and lower limits for the interval length of each divided interval and finding the best consistency condition under the limits of the allowable range of these interval lengths. be.

しかしながら、この方法は分割区間の区間長の許容範囲
を９人間が限られた量のデータの観察経験に基づいて決
めて与え、−旦与えたものは以後−定幅としていたため
に、データ量を増加した時に始めて明かとなると考えら
れる９分割区間の区間長の真の分布を考慮したものでは
な（、必ずしも正しい整合が行えると言う保証はない。However, in this method, the allowable range of the interval length of the divided interval was determined and given by nine people based on the experience of observing a limited amount of data, and what was given once was given a constant width from then on, so the amount of data This method does not take into account the true distribution of the section lengths of the 9-section sections, which is thought to become clear only when the number of sections is increased (there is no guarantee that correct matching can be achieved).

又、許容範囲から、はずれるような区間長の大きな変動
に対しては、誤った整合状態を与えるため１分割の精度
が低下すると言う問題点があった。Further, there is a problem in that when there is a large variation in the section length that deviates from the allowable range, an incorrect matching state is given, resulting in a decrease in the accuracy of one division.

この発明は、係る欠点を解決するためなされたもので、
バタン整合の際問題となる区間長の変動を、統計的に捉
えるようにして１分割ラベル付けの精度を改善した装置
を提供するものである。This invention was made to solve such drawbacks,
An object of the present invention is to provide an apparatus that improves the accuracy of one-segment labeling by statistically capturing fluctuations in section lengths, which are a problem during baton matching.

[Means for solving problems]

この発明に係る分割ラベル付け装置は、統計部に、最新
の分割ラベル付けの状態から、ラベル別のベクトルの統
計量の他に、ラベル別の分割区間の区間長の統計量をも
求める手段を用いると共に。The segmented labeling device according to the present invention includes means for determining, in the statistics section, not only vector statistics for each label but also statistics on the section length of segmented sections for each label from the latest segmented labeling state. Along with using.

最良分割部でも、これらラベル別のベクトルの統計量と
、ラベル別の分割区間の区間長の統計量の両者を用いて
バタン整合を行い、最良な分割ラベル付けの状態を求め
る手段を用いる様にしたものである。Even in the best segmentation section, a method is used to perform slam matching using both the statistics of vectors for each label and the statistics of the section length of segmented sections for each label to find the best segmentation labeling state. This is what I did.

〔作　用〕この発明による分割ラベル付け装置では、バタンの際問
題となる１分割区間の区間長の変動を。[Function] The divided labeling device according to the present invention prevents fluctuations in the length of one divided section, which is a problem when slamming.

統計的に捉えることができるため、バタン整合の精度が
向上し、その結果９分割ラベル付けの精度を改善した装
置が実現される。Since it can be grasped statistically, the accuracy of baton matching is improved, and as a result, an apparatus with improved accuracy of 9-division labeling is realized.

〔Example〕

第１図は、この発明の一実施例を示す機能ブロック図で
ある。各機能ブロックは、汎用ミニコンピユータ上で動
作するプログラムによって実現されている。FIG. 1 is a functional block diagram showing an embodiment of the present invention. Each functional block is realized by a program running on a general-purpose minicomputer.

図において、１１）は単語音声データの分析によって得
られた（特徴）ベクトル（具体的には、１６次元のケプ
ストラム係数ベクトル）の時系列パタンか格納されたフ
ァイル装置、　＋２１はラベル列の格納されたファイル
装置、（３）は初期分割部、１４）は統計部、（５）は
ラベル別の統計量を格納するためのファイル装置、（６
）は最良分割部、（７）は分割結果を格納するためのフ
ァイル装置である。In the figure, 11) is a file device that stores time-series patterns of (feature) vectors (specifically, 16-dimensional cepstral coefficient vectors) obtained by analyzing word audio data, and +21 is a file device that stores label sequences. (3) is the initial division section, (14) is the statistics section, (5) is the file device for storing statistics for each label, and (6) is the file device for storing statistics for each label.
) is the best division unit, and (7) is a file device for storing the division results.

この実施例では、ファイル装置Ｉｌｌ中の単語音声の時
系列パタンを、ファイル装置（２）中のラベル列に従っ
て１分割ラベル付けするものである。In this embodiment, the time-series pattern of word sounds in the file device Ill is labeled into one segment according to the label string in the file device (2).

以下、ラベル列として、その成分ラベルが、ベクトルの
統計量に対する識別のためのラベル成分（以後、ベクト
ルラベルと呼ぶ）と２区間長の統計部に対する識別のた
めのラベル成分（以後１区間長ラベルと呼ぶ）からなる
場合について説明し。Hereinafter, as a label string, the component labels are a label component for identifying vector statistics (hereinafter referred to as a vector label) and a label component for identifying a 2-interval length statistical part (hereinafter referred to as a 1-interval length label). ).

又、ベクトルの統計量として、ケプストラム係数の平均
及び分散を、又１区間長の統計量として。Also, as vector statistics, the mean and variance of cepstral coefficients, and as statistics for one interval length.

その平均及び分散を用いる場合について説明する。The case where the average and variance are used will be explained.

又１分割の繰り返しの回数（以後２ｍで表す）を、初期
分割の結果に対し９ｍ＝１となるように定義する。Furthermore, the number of repetitions of one division (hereinafter expressed as 2m) is defined so that 9m=1 for the result of the initial division.

更に、ファイル装置＋１１には、ＫＩ［Ｆｊの単語音声
の時系列パタンか格納されているものとし、これらを。Furthermore, it is assumed that the file device +11 stores time-series patterns of the word sounds of KI[Fj.

（（（Ｃ（ｋ、ｉ　、ｎ）、　Ｏ≦ｎ≦＋ｓ　）、　Ｉ
≦ｉ≦Ｉ（ｋ））、１≦に≦Ｋ）と記す。（ここで、記
号（Ｘ）はＸなる列を表すものとし、以後、断りなくこ
の記法を用いる。又。(((C(k, i, n), O≦n≦+s), I
≦i≦I(k)), and 1≦≦K). (Here, the symbol (X) represents the sequence X, and this notation will be used hereinafter without further notice.

Ｃ（ｋ、ｉ、ｎ）は第に番目の単語を分析して得られる
第１番目のフレームにおける第３次のケプスラム係数、
［１は１時系列パタンの長さ（分析フレーム数）である
。）更に、又、ファイル装置（２）には、に組のベクトルラ
ベル列、及び区間長ラベル列が格納されているものとし
、これらを、ベクトルラベル列は。C(k, i, n) is the third-order cepthrum coefficient in the first frame obtained by analyzing the th word,
[1 is the length of one time series pattern (number of analysis frames). ) Furthermore, it is assumed that the file device (2) stores a set of vector label strings and a section length label string, and these are defined as the vector label string.

（ｉｌ？ｓ　（ｋ、ｊ）、　１≦ｊ≦Ｊ（ｋ））、１≦
に≦Ｋ）（但しｌ　Ｒ３（ｋ、ｊ）　　は第に番目の時
系列パタンの第ｊ番目の分割区間に付されるベクトルラ
ベル成分を、Ｊ（ｋ）はラベル列の長さを表す。）と、
又１区間長ラベル列は。(il?s (k, j), 1≦j≦J(k)), 1≦
≦K) (where l R3(k, j) represents a vector label component attached to the j-th divided section of the th-th time series pattern, and J(k) represents the length of the label string. )and,
Also, the one section length label string is:

（ｔＲｔ　（ｋ、Ｄ、　ｌ≦ｊ≦Ｊ（ｋ））、＊≦に≦
Ｋ）（但し、　Ｒｔ　（ｋ、ｊ）は第に番目の時系列パ
タンの第ｊ番目の分割区間に付される区事長ラベル成分
を表す。）　　　　　　　　　　　　　　　　　　　１
と記すものとし、これらラベル列に含まれる異なるラベ
ルの集合を、それぞれ、ベクトルラベル集合及び区間長
ラベル集合と呼ぶ。ベクトルラベル集合は。(tRt (k, D, l≦j≦J(k)), *≦to≦
K) (However, Rt (k, j) represents the section chief label component attached to the j-th division interval of the th-th time series pattern.) 1
The sets of different labels included in these label strings are called a vector label set and an interval length label set, respectively. Vector label set is.

（ＱＳ　（ｑｓ　）、　ｌ≦ｑｓ≦ＮＱｓ　）（但し、
ＮＱｓは異なるベクトルラベルの総数を表す。）と、又９区間長ラベル集合は。(QS (qs), l≦qs≦NQs) (However,
NQs represents the total number of different vector labels. ), and the 9 interval length label set is.

（Ｑｔ　（ｑｔ　）、　ｔ≦ｑｔ≦ＮＱｔ）（但し、Ｎ
Ｑｔは異なる区間長ラベルの総数を表す。）と記すもの
とする。(Qt (qt), t≦qt≦NQt) (However, N
Qt represents the total number of different interval length labels. ).

更に、又、ファイル装置（７）に格納される第ｍ回目の
繰り返しにおける分割結果を。Furthermore, the division result at the m-th repetition is stored in the file device (7).

１１ｅ　（ｍ、ｋ　ｊ）、　１≦ｊ≦Ｊ（ｋ）−Ｈ）、
１≦に≦Ｋ）と記すものとする。（ここで、ｅ　　（ｍ
、に、ｊ）は。11e (m, k j), 1≦j≦J(k)−H),
1≦ and ≦K). (Here, e (m
, ni, j) is.

第ｍ回目の繰り返しにおける第に番目のベクトルの時系
列パタンを分割する時の、第ｊ番目の分割区間の開始端
のフレーム番号を表し１次の式を満たすものとする。It represents the frame number at the start end of the j-th division section when dividing the time-series pattern of the th vector in the m-th repetition, and satisfies the first-order equation.

＝ｅ　（ｍ、に、　１）≦ｅ　（ｍ、に、　２）≦・・
・≦ｅ　（ｍ、に、Ｊ（ｋｌ刊）＝Ｉ（ｋ）−ｚ・・・
・・・１１）これによって、第ｍ回目の繰り返しにおける第に番目の
時系列パタンの第ｊ番目の分割区間のフレーム範囲は。=e (m, ni, 1)≦e (m, ni, 2)≦...
・≦e (m, ni, J (published by kl) = I(k)-z...
...11) As a result, the frame range of the j-th divided section of the th time-series pattern in the m-th repetition is as follows.

ｅ（ｍ、に、ｊ）≦ｉｃｅ　（ｍ、に、ｊ　＋１）　（
ｉはフレーム番号）の様に与えられる。）以上の準備の下で１次に、各部の動作について説明する
。e(m, ni, j)≦ice (m, ni, j +1) (
i is a frame number). ) With the above preparations in place, the operation of each part will be explained.

初期分割部（３）は、ファイル装置（１）中の各ベクト
ルの時系パタンに対して、対応する区間長ラベル列をフ
ァイル装置（２肋）ら読み取って、各分割区間の区間長
が、その区間に付される区間長ラベルの種類によりあら
かじめ決められている重みに比例するように９時系列バ
タンを時間分割し１分割結果をファイル装置（７）に出
力する。ファイル装置＋７１に出力された初期分割結果
、即ち、第１回目。The initial dividing unit (3) reads the corresponding interval length label string from the file device (2 ribs) for the time series pattern of each vector in the file device (1), and calculates the interval length of each divided interval as follows. Nine time-series bumps are time-divided in proportion to the weight predetermined according to the type of section length label attached to the section, and one division result is output to the file device (7). The initial division result output to the file device +71, ie, the first division.

（ｍ＝１）の繰り返しにおける分割結果。(m=1) division result in repetition.

１（ｅ（ｚ、に、ｊ）、　　　＋　≦ｊ≦Ｊ（ｋ）＋１
）、　　　ｌ　≦に≦Ｋ）（ただし、１≦ｊ≦Ｊ（ｋ）
＋　１　）　　　・・・・・・（２）（但し、記号（Ｘ
）は９本第２式に限って、Ｘを超えない最大の整数を表
すものとする。）の様に求められる。（ここで、　Ｗ（
Ｒｔ（ｋ、ｊ））は。1(e(z, ni, j), + ≦j≦J(k)+1
), l ≦ to ≦K) (where 1≦j≦J(k)
+ 1) ・・・・・・(2) (However, the symbol (X
) shall represent the largest integer not exceeding X only in the second equation. ) is required. (Here, W(
Rt(k,j)) is.

区間長ラベルＲｔ　（ｋ、ｊ　）に与えられた重みを表
す。）又、統計部（４）は、初期分割部（３）又は最良
分割部（６）で求められた最新の分割結果を、ファイル
装置（７）から読み取って、ベクトルに関する統計量と
してケプストラム係数の平均及び分散を、又９区間長の
統計量として、その平均及び分散をラベル別に求め、こ
れらラベル別の統計量をファイル装置（５）に格納する
。ここで、これらラベル別の統計量は、第ｍ回目の繰り
返しにおいて９次の様に与えられる。即ち、ベクトルラ
ベル集合の第ｑｓ番目のベクトルラベルＱｓ　（ｑｓ　
）に対するケプストラム係数の平均（以後、　Ｈｓ　（
ｍ、ｑｓ、　ｎ）と記す）及び分散（以後、ｖＳ（ｍ、
ｑＳ、ｎ）と記す）は、それぞれ。It represents the weight given to the interval length label Rt (k,j). ) Also, the statistics section (4) reads the latest division results obtained by the initial division section (3) or the best division section (6) from the file device (7), and calculates the cepstral coefficients as statistics regarding the vector. The mean and variance are obtained for each label as statistics for the length of 9 sections, and these statistics for each label are stored in the file device (5). Here, the statistics for each label are given as 9th order in the m-th repetition. That is, the qs-th vector label Qs (qs
) for the average cepstral coefficient (hereinafter Hs (
m, qs, n)) and variance (hereinafter, vS(m,
qS, n) are respectively.

Ｈｓ　（ｍ、ｑｓ、ｎ）＝Ｋ　　　Ｊ（ｋｌ Σ　　Σ　　　　　　　　　　　ΣＣ（ｋ、ｉ、ｎ）ｋ
＝ｌ　ｊ　＝１．　Ｒｓ（ｋ、ｊ）＝Ｑｓ（ｑｓ）　ｅ
　（ｍ、に、　ｊ）≦ｉ（ｅ（ｍｋｊ＋１）Ｋ　　　Ｊ
ｌｋｌ Σ　　Σ　　　（ｅ（ｍ、に、ｊ＋１）−ｅ（ｍ、に、
ｊ））ｋ＝　＋　ｊ　＝　ｔ　、Ｒｓ（ｋ、ｊ　）　＝
Ｑｓ　（ｑｓ）（但し、１≦ｑｓ≦ＮＱｓ、０≦ｎ≦１
５　）　　　−・−＋ａ＋及び。Hs (m, qs, n) = K J (kl Σ Σ ΣC(k, i, n)k
=l j =1. Rs(k,j)=Qs(qs) e
(m, ni, j)≦i(e(mkj+1)K J
lkl Σ Σ (e(m, to, j+1)−e(m, to,
j))k=+j=t,Rs(k,j)=
Qs (qs) (1≦qs≦NQs, 0≦n≦1
5) -・-+a+ and.

Ｖｓ　（ｍ、ｑｓ、　ｎ）　＝（但し、Ｉ≦ｑｓ≦ＮＱｓ、Ｑ≦ｎ≦１５）　　　・−
−−−−（４）で与えられる。一方１区間長ラベルの集
合の第ｑｔ番目の区間長ラベルＱｔ　（ｑｔ　）に対す
る区間長の平均（以後、　Ｈｔ　（ｍ、ｑｔ）と記す）
及び分散（以後、　ｖｔ　（ｍ、　ｑｔ　＞と記す）は
、それぞれ。Vs (m, qs, n) = (However, I≦qs≦NQs, Q≦n≦15) ・-
---It is given by (4). On the other hand, the average interval length for the qt-th interval length label Qt (qt) of the set of 1 interval length labels (hereinafter referred to as Ht (m, qt))
and variance (hereinafter referred to as vt (m, qt >), respectively).

Σ　　　Σ　　　　１に＝ｌ　ｊ＝＋、Ｒｔ（ｋ、ｊ）＝Ｑｔ（ｑｔ）（但し
、　１≦ｑｔ≦ＮＱｔ）　　　　　　　　　　・・・・
・・（５）及び。Σ Σ 1 = l j = +, Rt (k, j) = Qt (qt) (however, 1≦qt≦NQt) ...
...(5) and.

Ｖｔ　（ｍ　ｑｔ　）　＝ Σ　　　Σ　　　　１に＝１　ｊ＝１．Ｒｔ　（ｋ、　ｊ）＝　Ｑｔ　（ｑｔ
）（但し、１≦ｑｔ≦ＮＱｔ）　　　　　　　　　　・
・・・・・（６）で与えられる。Vt (m qt ) = Σ Σ 1 = 1 j = 1. Rt (k, j) = Qt (qt
) (However, 1≦qt≦NQt) ・
...It is given by (6).

更に、最良分割部（６）は、ファイル装置＋１１中の各
時系列パタンについて、ファイル装置（２）中の対応す
るラベル列に従って、ファイル装置（５）から読み取っ
たラベル別の統計量を並べ換えることによって型版を構
成し１時系列バタンと型版との間で。Furthermore, the best dividing unit (6) rearranges the label-specific statistics read from the file device (5) for each time series pattern in the file device +11 according to the corresponding label string in the file device (2). By doing so, a template is constructed between one time series and the template.

動的計画法に基づき整合の尤度を最大化する時の分割状
態を求め、これを最良の分割結果として。Based on dynamic programming, find the partitioning state that maximizes the likelihood of matching, and use this as the best partitioning result.

ファイル装置（７）に出力する。ここで、第ｍ回目の繰
り返しにおけるラベル別統計量に基づいて求められる。Output to file device (7). Here, it is determined based on the statistics for each label in the m-th repetition.

最良の整合の尤度（以後、　Ｓ　”　（ｍ、ｋ）と記す
）は。The likelihood of the best match (hereafter denoted S ” (m, k)) is:

Ｓ　”　（ｍ、ｋ　）　＝の様に定義され、又、この時の最良の分割状態（以後＋
　　ｔｅ”　（ｍ＊ｋｅｊ）＋　１≦ｊ≦Ｊｌｋｌ　＋
　１　）と記す）は。S ” (m, k) = , and the best division state at this time (hereinafter +
te” (m*kej)+ 1≦j≦Jlkl+
1) is written as ).

（ｅ　”　（ｍ、に、ｊ　）、　１≦ｊ≦Ｊ（ｋ）＋１
）＝の様に定義される。ここで、第７式及び第８式中の
Ｓ　（ｍ、　ｋ　）は、第ｍ回目の繰り返しにおいて求
められたラベル別の統計量に基づいて、ある分割状態（
ｅ　（ｍ、に、ｊ）、　１≦ｊ≦Ｊ（ｋｌ−ｚ）に対し
決まる整合の尤度であり、ベクトルラベル列に対する整
合の尤度と１区間長ラベル列に対する整合の尤度の和と
して。(e ” (m, ni, j), 1≦j≦J(k)+1
)= is defined. Here, S (m, k) in the seventh and eighth equations is calculated based on the statistics for each label obtained in the m-th iteration.
e (m, to, j), is the likelihood of matching determined for 1≦j≦J(kl-z), and is the sum of the likelihood of matching for the vector label string and the likelihood of matching for the 1-interval length label string. As.

Ｓ　（ｍ、ｋ）＝　Ｌｓ　（ｍ、ｋ）＋　Ｌｔ（ｍ、ｋ
）　−Ｗｔ　（＋≦に≦Ｋ）・・・・・・（９）の様に定義される。ここで、Ｗｔは区間長ラベル列に対
する整合の尤度に付する重み係数、又。S (m, k) = Ls (m, k) + Lt (m, k
) −Wt (+≦≦K) (9) Defined as follows. Here, Wt is a weighting coefficient assigned to the likelihood of matching the interval length label string.

Ｌｓ　（ｍ、ｋ）及びＬｔ　（ｍ、ｔ）は、それぞれ、
ベクトルラベル列に対する整合の尤度と１区間長ラベル
列に対する整合の尤度である。Ls (m, k) and Lt (m, t) are, respectively,
These are the likelihood of matching for a vector label string and the likelihood of matching for a one-section length label string.

ベクトルラベル列に対する整合の尤度は。The likelihood of matching for a vector label sequence is.

Ｌｓ　（ｍ、ｋ）＝ｊ＝１．ｅ（ｍ、に、ｊ）ｐ！ｅ　（ｍ、に、ｊ＋　ｔ
）・・・・・・（１（１（但し＋　Ｉｓ　（ｅ　（ｍ、に、ｊ）、ｅ（ｍ、に、
ｊ＋１）、ｑｓ）は、ベクトルラベル集合の第ｑｓ番目
のベクトルラベルＱｓ（ｑｓ）　　に対する。第ｊ番目
の区間の尤度であり。Ls (m, k)=j=1. e(m,ni,j)p! e (m, ni, j+ t
)・・・・・・(1(1 (However, + Is (e (m, ni, j), e(m, ni,
j+1), qs) corresponds to the qs-th vector label Qs(qs) of the vector label set. is the likelihood of the jth interval.

ｌｓ　（ｅ　（ｍ、に、ｊ）、ｅ　（ｍ　、に、ｊ−）
−１）、ｑｓ）＝と定義される。）又１区間長ラベル列に対する整合の尤度は。ls (e (m, ni, j), e (m, ni, j-)
−1), qs)=. ) Also, the likelihood of matching for a one-section length label string is:

Ｌｔ　（ｍ、ｋ）　＝・・・・・−へ３（但し、　Ｉｔ　（ｅ　（ｍ、に、ｊ　）、　ｅ　（ｍ
、に、　ｊ＋１）、　ｑｔ）は１区間長ラベル集合の第
ｑｔ番目の区間長ラベルＱｔ（ｑｔ　）に対する。第ｊ
番目の区間の尤度で。Lt (m, k) = ......-3 (However, It (e (m, to, j), e (m
, j+1), qt) corresponds to the qt-th interval length label Qt (qt) of the one interval length label set. jth
with the likelihood of the interval.

Ｉｔ　（ｘ、ｙ、ｑｔ）＝と表現される。It (x, y, qt) = It is expressed as

さて、第７式及び第８式の区間分割法１ｅ　（ｍ、に、
　Ｄ。Now, the interval division method 1e (m, to,
D.

１≦ｊ≦Ｊ（ｋｌ−Ｍｌ　に関する最大化は、一種の組
み合せ最適化問題となり、動的計画法の原理に基づいて
９次の漸化式。The maximization regarding 1≦j≦J(kl−Ml) becomes a kind of combinatorial optimization problem, and is a 9th order recurrence formula based on the principle of dynamic programming.

ｊ＝２．ａ、・・・、Ｊ（ｋｌ−ｚについて。j=2. a, ..., J (about kl-z.

かつ、１＝２３．・・・、　Ｉ（ｋ）　＋　＋について
。And 1=23. ..., about I(k) + +.

Ｇｓ　（ｉ、ｊ）＝　Ｇｓ　（ｉ−ビ、ｊ−＋）＋１ｓ
（ｉ−？、　ｉ、　Ｑｓ−’　（Ｒｓ（ｋ、ｊ）））Ｇ
ｔ（ｉ、ｊ）＝Ｇｔ（ｉ−ｒ”、ｊ−１）＋１ｓ（ｉ−
τ”、ｉ、Ｑｔ　　’（Ｒｔ（ｋ、ｊ）））Ｎｓ（ｉ、
ｊ）＝Ｎｓ（ｉ−ｒ”、ｊ−１）＋ｕ（ｉ−げ＋’）Ｂ
（ｉ、ｊ）＝ｉ−げ　　　　　　　　　　　　　　・・
・・・・ａ４（ここに＋　Ｇｓ　（ｉ、ｊ　）、　Ｇｔ
　（ｉｊ）ｅ　Ｎｓ（ｉ、ｊ）、　Ｂ（ｉ、ｊ　）は動
的計画法における状態変数で、以下の意味を有する。Gs (i, j) = Gs (i-bi, j-+) + 1s
(i-?, i, Qs-' (Rs(k,j)))G
t(i,j)=Gt(ir”,j-1)+1s(i-
τ'', i, Qt'(Rt(k,j)))Ns(i,
j)=Ns(i-r", j-1)+u(i-ge+')B
(i, j)=i-ge...
...a4 (here + Gs (i, j), Gt
(ij)e Ns(i, j), B(i, j) are state variables in dynamic programming and have the following meanings.

Ｇｓ（ｉ、ｊ）：　ベクトルラベルに対する整合尤度の
累積値。Gs(i,j): Cumulative value of matching likelihood for vector labels.

Ｇｔ（ｉ、ｊ）　：　区間長ラベルに対する整合尤度の
累積値。Gt(i,j): Cumulative value of matching likelihood for interval length labels.

Ｎｓ（ｉ、ｊ）：ベクトルラベルに対する整合尤度の累
積回数。Ns(i,j): cumulative number of matching likelihoods for vector labels.

Ｂ（ｉ、ｊ）　：　ポックポインタ。B(i,j): Pock pointer.

尚。still.

Ｑｓ　　’　（Ｒｓ）　：　　Ｒｓ＝Ｑｓ　（ｑｓ）を
満たすベクトルラベルの番号ｑｓを返す関数。Qs' (Rs): A function that returns the vector label number qs that satisfies Rs=Qs (qs).

Ｑｔ　　（Ｒｔ）　　：　　Ｒｔ＝Ｑｔ　（ｑｔ）を満
たす区間長ラベルの番号ｑｔを返す関係。Qt (Rt): A relationship that returns the section length label number qt that satisfies Rt=Qt (qt).

１ｓ（ｘ、ｙ、ｑ）：ベクトルラベルに対する区間の尤
度（第１１式）Ｉｔ　（ｘ、ｙ、ｑ）　：区間長ラベルに対する区間の
尤度（第１３式）ｕ（ｘ、ｙ）：次式を満たす関数である。）を初期条件、即ち。1s (x, y, q): Likelihood of the interval for the vector label (Equation 11) It (x, y, q): Likelihood of the interval for the interval length label (Equation 13) u(x, y): This is a function that satisfies the following equation. ) as the initial condition, i.e.

Ｇｓ（１，ｊ）＝Ｏ（ｔ≦ｊ≦Ｊ　ｆｋ）　＋　１）Ｇ
ｔ　（１，０＝　ｏ　　（＋≦ｊ≦Ｊ　ｆｋ）　＋　１
　）Ｎｔ（＋、Ｄ＝ｏ　　（＋≦ｊ≦Ｊ（ｋ）＋１）Ｇ
ｓ　（ｉ、　１）　＝ｏｏ　　（２≦゛ｉ≦Ｉ（ｋ）＋
°＋）Ｇｔ　（ｉ、　１）　＝閃　（２≦ｉ≦Ｉ（ｋ）
＋１）Ｎｔ（ｉ、１）−１（２≦ｉ≦Ｉ（ｋ）＋ｌ）　
　　・・・・・−ｍｓの下で解き、最良の整合の尤度Ｓ
′″（ｍ、　ｋ）を。Gs(1,j)=O(t≦j≦J fk) + 1)G
t (1,0= o (+≦j≦J fk) + 1
)Nt(+, D=o (+≦j≦J(k)+1)G
s (i, 1) =oo (2≦゛i≦I(k)+
°+)Gt (i, 1) = Flash (2≦i≦I(k)
+1) Nt (i, 1) - 1 (2≦i≦I(k)+l)
Solve under ・・・・・・−ms, the likelihood of the best match S
′″(m, k).

Ｓ”　（ｍ、ｋ）＝Ｎｓ　（Ｉ（ｋ）＋　＋、　Ｊ（ｋ）＋　＋）　　　　
　　Ｊ（ｋ）・・・・・・ａｅと置くことにより解け、又、最良の分割状態。S” (m, k) = Ns (I(k)+ +, J(k)+ +)
It can be solved by putting J(k)...ae, and it is the best division state.

（ｅ”　（ｍ、に、ｊ　）＋　ｔ≦ｊ≦Ｊ（ｋ）＋ｌ）
は、バックポインタを逆向きに辿ることにより求められ
る。即ち、まず。(e” (m, ni, j) + t≦j≦J(k)+l)
is found by tracing the back pointer in the opposite direction. Namely, first.

ｅ”　　（ｍ、に、ｊ（ｋ）＋ｔ）＝Ｉ（ｋ）＋１　　
　−・・−６ｎと置き１次に。e” (m, to j(k)+t)=I(k)+1
-...-6n and place 1st order.

ｊ　＝Ｊ（ｋ）、Ｊ［ｋｌ−＋、・・・、　３．２．　
ｌ　　について。j = J(k), J[kl-+,..., 3.2.
About l.

ｅ”（ｍ、に、ｊ）＝Ｂ（ｅ”（ｍ、に、ｊ＋１）、ｊ
＋１）　　＝−・・ＵＦｊと置くことによって求められ
る。この結果、即ち。e”(m, ni, j)=B(e”(m, ni, j+1), j
+1) =-...UFj. This result, ie.

第ｍ＋１回目の繰り返しにおける分割結果。The division result at the m+1st iteration.

（（ｅ（ｍ＋ｌ、に、ｊ）、　１≦ｊ≦Ｊ　［ｋ）＋　
１　）、　ｌ≦に≦Ｋ）は、ファイル装置（７）に、　
ｅ　（ｍ＋ｌ、に、ｊ）＝ｅ”　（ｍ。((e(m+l, ni, j), 1≦j≦J [k)+
1), l≦ and ≦K) in the file device (7),
e (m+l, j)=e” (m.

ｋ、ｊ）　　と置いて出力される。k, j) and output.

この様にして求められた。繰の返しの第ｍ＋１回目にお
ける分割結果は、再び第ｍ＋を回目の繰り返しにおける
ラベル別の統計量を求めるために使われ、この統計量は
、更に１次の第ｍ＋２回目の繰り返しにおける最良の分
割結果を求めるためにも使われる。It was requested in this way. The division result at the m+1st iteration is used again to calculate the statistics for each label at the m+th iteration, and this statistic is further used to determine the best division at the m+2nd iteration of the first order. It is also used to obtain results.

ところで９以上の説明から、最良分割部（６）によって
求められた最新の分割ラベル付けの状態は。By the way, from the explanation above in 9, what is the state of the latest division labeling determined by the best division unit (6)?

第９式で定義された整合の尤度Ｓ（ｍ、ｋ）を最大とす
るものであって、しかも、この第９式の第２項として含
まれる区間長ラベル列に対する整合の尤度、　Ｌｔ　（
ｍ、ｋ）を考慮したものとなっている。Maximizes the likelihood of matching S(m, k) defined by the ninth equation, and the likelihood of matching for the interval length label string included as the second term of the ninth equation, Lt (
m, k).

更に、この区間長ラベル列に対する整合の尤度。Furthermore, the likelihood of matching for this interval length label sequence.

Ｌｔ（ｍ、ｋ）は、第１２式及び第１３式の様に区間長
のラベル別の統計量、即ち、その平均（Ｈｔ　（ｍ、ｑ
ｔ）、　１≦ｑｔ≦ＮＱｔ）、及び分散（Ｖｔ　（ｍ　
ｑｔ）、　１≦ｑｔ≦ＮＱｔ　）に基づイテ求められた
ものなので１本実施例によれば９区間長の統計的変動を
捉えたバタン整合が行え、しかも。Lt (m, k) is the statistic for each label of the interval length as shown in equations 12 and 13, that is, its average (Ht (m, q
t), 1≦qt≦NQt), and variance (Vt (m
qt), and 1≦qt≦NQt). According to this embodiment, it is possible to perform the matching that captures the statistical fluctuations of nine section lengths.

その際用いられる区間長の統計量は、常に最新の分割結
果から求めているため９本実施例は区間長データの真の
分布に近い統計量によってバタン整合を行っていると言
える。Since the section length statistics used at this time are always obtained from the latest division results, it can be said that this embodiment performs slam matching using statistics close to the true distribution of the section length data.

〔Effect of the invention〕

以上説明した様に、この発明に係る分割ラベル付け装置
では、従来のこの種装置における統計手段に、ベクトル
のラベル別統計量の他に９分割区間の区間長のラベル別
統計量を求める手段を用いると共に、最良分割手段にこ
れらベクトルのラベル別統計量と１分割区間の区間長の
ラベル別統計量に基づき、最良な分割ラベル付けの状態
を求める手段を用いているため、最良分割を求めるため
のバタン整合の際９分割区間長の変動を統計的に捉える
ことが出来、バタン整合の精度が改善され。As explained above, in the segmented labeling device according to the present invention, in addition to the statistics for each label of vectors, means for obtaining the statistics for each label of the section length of 9 segmented sections are added to the statistical means in conventional devices of this type. In addition, since the best segmentation method uses a means to determine the best segment labeling state based on the label-specific statistics of these vectors and the label-specific statistics of the interval length of one segment, it is possible to obtain the best segmentation. It is possible to statistically capture the fluctuations in the length of the 9-division section during the baton matching, and the accuracy of the baton matching is improved.

その結果１分割ラベル付けの精度が改善されると言う効
果を有する。As a result, this has the effect of improving the accuracy of one-segment labeling.

[Brief explanation of the drawing]

第１図は、この発明の一実施例による分割ラベ、ル付け
装置の機能ブロック図、第２図は、従来の装置の機能ブ
ロック図である。図において、（１）はベクトルの時系
列パタンを格納するファイル装置。（２１はラベル列を格納するファイル装置、（３）は初
期分割部、（４１は統計部、（５）はラベル別の統計量
を格納するファイル装置、（６）は最良分割部、（７）
は分割結果を格納するファイル装置である。なお１図中同一あるいは相当部分には、同一の符号を付
して示しである。第１図７　：　コア４ル躾置第２図手続補正書（自発）FIG. 1 is a functional block diagram of a divided labeling apparatus according to an embodiment of the present invention, and FIG. 2 is a functional block diagram of a conventional apparatus. In the figure, (1) is a file device that stores time-series patterns of vectors. (21 is the file device that stores the label string, (3) is the initial division section, (41 is the statistics section, (5) is the file device that stores the statistics for each label, (6) is the best division section, (7 )
is a file device that stores the division results. Note that the same or corresponding parts in FIG. 1 are indicated with the same reference numerals. Figure 1 7: Core 4 Discipline Figure 2 Procedural Amendment (Voluntary)

Claims

[Claims]

(1) A device that divides a time-series pattern of a vector according to a pair of label sequences and labels the components of this label sequence on these divided intervals, and is an initial device for determining the initial state of division labeling. A dividing means, a statistical means for calculating statistics of vectors for each label according to the latest divided labeling state, and optimal dividing labeling based on the statistics of vectors for each label obtained by the statistical means. starting from the initial dividing state determined by the initial dividing means, and determining the best dividing state based on the latest vector statistics for each label determined by the statistical means. In the segmented labeling device, the best segmented labeling state determined by the segmenting means is replaced with the latest segmented labeling state, and the statistical means calculates the statistics of vectors for each label from the latest segmented labeling state. and the statistics of the interval lengths of the division intervals for each label, and the best division means is a means for calculating the statistics of the vectors for each label and the statistics of the interval length of the division intervals for each label. A divided labeling device characterized in that it is a means for determining the best divided labeling state based on the following.

(2) The component labels of the label string consist of a label component for identifying vector statistics and a label component for identifying interval length statistics, and the two components do not necessarily match. A divided labeling apparatus according to claim 1.