JPS58159592A - Polysyllabic word recognition system - Google Patents
Polysyllabic word recognition systemInfo
- Publication number
- JPS58159592A JPS58159592A JP57024978A JP2497882A JPS58159592A JP S58159592 A JPS58159592 A JP S58159592A JP 57024978 A JP57024978 A JP 57024978A JP 2497882 A JP2497882 A JP 2497882A JP S58159592 A JPS58159592 A JP S58159592A
- Authority
- JP
- Japan
- Prior art keywords
- words
- time series
- syllable
- unit
- section
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Abstract
(57)【要約】本公報は電子出願前の出願データであるた
め要約のデータは記録されません。(57) [Summary] This bulletin contains application data before electronic filing, so abstract data is not recorded.
Description
【発明の詳細な説明】
(a)発明の技術分野
本発明は音声認識システムに係り、特に複数音節単語と
単音節単語とを認識する上でモード切替等を必要としな
い複数音節単語認識方式に関する。Detailed Description of the Invention (a) Technical Field of the Invention The present invention relates to a speech recognition system, and particularly to a multi-syllable word recognition method that does not require mode switching to recognize multi-syllable words and monosyllabic words. .
(b)技術の背景
主として単音節単語を認識する音声認識システノ・に於
゛C1複数音節単語の認識も行うことが出来れば改行と
か1句点、読点、r、J、等の特殊記号及びセーブ、プ
リント、等の制御用語の同時入力が可能になり、単語モ
ード、カナモード等モード切替用キーが不要となって音
声認識システムの構成がそれだけ簡易化し、操作性の向
上ともなる。(b) Technical background The speech recognition system mainly recognizes monosyllabic words, but if multi-syllabic words can also be recognized, special symbols such as line breaks, single periods, commas, r, J, etc., and saves, etc. It becomes possible to input control terms such as "print" simultaneously, and keys for switching modes such as word mode and kana mode are no longer required, which simplifies the configuration of the voice recognition system and improves operability.
従って単音節単語を認識すると共に複数音節単語の認識
処理速度の速い音声認識システムの出現が望まれている
。Therefore, it is desired to develop a speech recognition system that can recognize monosyllabic words and has a high recognition processing speed for multisyllabic words.
(c)発明の目的
本発明の目的は上記要望に基づきモード切替等を必要と
せず、且つ複数音節単語と単音節単語とを認識し得ると
共に複数音節単語の認識処理速度の速い音声認識システ
ムを提供することにある。(c) Object of the Invention The object of the present invention is to provide a speech recognition system that does not require mode switching, can recognize multi-syllable words and mono-syllabic words, and has a fast recognition processing speed for multi-syllable words. It is about providing.
(d)発明の構成
本発明の構成は音声の特徴バラメークを不均一サンプリ
ングして固定長の特徴パターンである縮小パラメータ時
系列を得る手段を設りて登録し、該縮小パラメータ時系
列を用いて同一手段で抽出された未知入力音声の特徴パ
ラメータを照合し複故音節単語か単音節単語かを先ず選
別する。複数音節単語と判定された単語は最も良く伯た
特徴パラメータを持つ単語が第一候補として選出され順
次複数の候補が選出される。次に該複数ff節単語は候
補選択部により照合候補が絞られてD P照合される。(d) Configuration of the Invention The configuration of the present invention includes providing and registering means for non-uniformly sampling voice feature variations to obtain a reduced parameter time series which is a feature pattern of a fixed length, and registering the reduced parameter time series using the reduced parameter time series. The feature parameters of the unknown input speech extracted by the same means are compared to determine whether it is a multi-syllable word or a monosyllabic word. For words determined to be multi-syllable words, the word with the best characteristic parameter is selected as the first candidate, and multiple candidates are sequentially selected. Next, the candidate selection unit narrows down the matching candidates for the plural ff clause words and performs DP matching.
又単音節単語と判定された単語は単音節単語認識部によ
りそのままDP照合されるものである。Furthermore, words determined to be monosyllabic words are directly subjected to DP matching by the monosyllabic word recognition unit.
尚縮小パラメータ時系列を得る手段については比較的簡
単なアルゴリズムの下で認識対象単語の内容や大きさに
拘わず効率良く認識対象候補を決定する方式(特願昭5
5−062059)が提案されており、その中に述べら
れている。簡単に説明すると未知入力単語の音声信号を
分析し、練合)ji (8号から抽出された入力特徴パ
ラメータ時系列を多くても10個以下の区間に分割して
各区間内の特徴パラメータ値を平均する様にしたもので
ある。Regarding the means for obtaining the reduced parameter time series, there is a method that efficiently determines recognition target candidates regardless of the content and size of the recognition target word under a relatively simple algorithm (Japanese Patent Application No. 1983).
5-062059) and is described therein. To briefly explain, the audio signal of an unknown input word is analyzed and refined) ji (The input feature parameter time series extracted from No. 8 is divided into at most 10 sections or less, and the feature parameter values in each section are calculated. It is designed to average.
(e)発明の実施例 図は本発明の一実施例を示す回路のブロック図である。(e) Examples of the invention The figure is a block diagram of a circuit showing one embodiment of the present invention.
先ず話者は予め音声を登録する為制御部9の制御により
切替部3を縮小パラメータ時系列抽出部4及び複数音節
単語認識部13と単音節単語認識部14に接続し、単音
節単語と特定の複数a節単語とを入力より加える。前処
理部1は音声レベル調整及びアナログディジタル変換等
を行いパラメータ抽出部2へ送る。パラメータ抽出部2
はパラメータを抽出して切替部3へ送る。切替部3より
縮小パラメータ時系列抽出部4に入った特徴パラメータ
は不均一サンプリングされ固定長の縮小パラメータ時系
列が抽出され切替部5へ送られる。切替部5に入った単
音節単語用#Ai!小パラメータ時系列は単音節用縮小
パラメータ時系列格納部6へ、複数音節単語用縮小パラ
メータ時系列は複数音節用縮小パラメータ時系列格納部
7へ格納される。又同時に単音節単語の特徴パラメータ
は単音節単語認識部14へ、複数音節単語の特徴パラメ
ータは複数音節単語認識部13に夫々格納される。First, in order to register speech in advance, the speaker connects the switching unit 3 to the reduced parameter time series extraction unit 4, the multi-syllable word recognition unit 13, and the monosyllabic word recognition unit 14 under the control of the control unit 9, and identifies it as a monosyllabic word. Add multiple a clause words from the input. The preprocessing section 1 performs audio level adjustment, analog-to-digital conversion, etc., and sends the results to the parameter extraction section 2. Parameter extraction part 2
extracts the parameters and sends them to the switching section 3. The feature parameters input from the switching unit 3 to the reduced parameter time series extraction unit 4 are non-uniformly sampled, and a fixed length reduced parameter time series is extracted and sent to the switching unit 5. #Ai for monosyllabic words entered in switching section 5! The small parameter time series is stored in the monosyllable reduced parameter time series storage section 6, and the reduced parameter time series for multi-syllable words is stored in the multi-syllable reduced parameter time series storage section 7. At the same time, the characteristic parameters of monosyllabic words are stored in the monosyllabic word recognition section 14, and the characteristic parameters of multi-syllable words are stored in the multi-syllable word recognition section 13, respectively.
次に音声認識を行なわせるため話者は制御部9の制御に
より切替部3を縮小パラメータ時系列抽出部4と記憶部
lOに接続し、切替部5を照合判定部8に接続する。入
力より入った音声は前記同様に前処理部l、パラメータ
抽出部2.切替部3゜縮小パラメータ時系列抽出部4.
切替部5を経て照合判定部8に入り単音節用縮小パラメ
ータ時系列格納部6及び複数音節用縮小パラメータ時系
列格納部7よりの縮小パラメータ時系列と照合を行・う
。照合判定部8は複数音節単語と判定すると最も良く似
た特徴パラメータを持つ単語を第−M袖として選出し順
次複数の候補を選出して候補選択its l 2−1送
る。候補選択部12は該候補を1. 2の候補に絞り複
数音節単語認識部13へ送る。ここでDP照合され認識
結果が制御部9を経゛ζ出力に送出される。141音節
単語と判定が出ると選択部11は記憶部10に人ってい
る特徴パラメータを単f1″節単語認識部14へ送り、
単音節単語の認識を行い、制御部9を経て出力へ認識結
果を送出する。Next, in order to perform speech recognition, the speaker connects the switching section 3 to the reduced parameter time series extraction section 4 and the storage section IO, and connects the switching section 5 to the collation determining section 8 under the control of the control section 9. The input audio is processed by the preprocessing section 1, the parameter extraction section 2, and the like described above. Switching unit 3° reduction parameter time series extraction unit 4.
The data passes through the switching unit 5 and enters the matching determination unit 8 where it is compared with the reduced parameter time series from the single syllable reduced parameter time series storage unit 6 and the multi-syllable reduced parameter time series storage unit 7. When the collation determining unit 8 determines that the word is a multi-syllable word, it selects the word with the most similar characteristic parameters as the -Mth sleeve, sequentially selects a plurality of candidates, and sends the candidate selection its l 2-1. The candidate selection unit 12 selects the candidates as 1. The candidates are narrowed down to two and sent to the multi-syllable word recognition unit 13. Here, the DP is collated and the recognition result is sent to the ζ output via the control section 9. When it is determined that the word is a 141-syllable word, the selection unit 11 sends the feature parameters stored in the storage unit 10 to the single f1'' syllable word recognition unit 14.
Monosyllabic words are recognized and the recognition results are sent to the output via the control unit 9.
(f)発明の詳細
な説明した如く2本発明は縮小パラメータ時系列を用い
た照合により複数音節単語を先ず分類し、その分類結果
により詳細に照合する複数音節単語の候補を絞りDP照
合で確認するため処理時間の短い、且つモード切替を不
必要とする音声認識システムを提供出来るため1 その
効果は大なるものがある。(f) Detailed explanation of the invention As described above, the present invention first classifies multi-syllabic words by matching using a reduced parameter time series, and then narrows down candidates for multi-syllabic words to be matched in detail based on the classification results and confirms them by DP matching. Therefore, it is possible to provide a speech recognition system that requires short processing time and does not require mode switching.1 The effects are significant.
図は本発明の一実施例を示す回路のブロック図である。
1は前処理部、2はパラメータ抽出部。
3.5は切替部、4は縮小パラメータ時系列抽出部、6
は単音節用縮小パラメータ時系列格納部。
7は複数音節用縮小パラメータ時系列格納部、8は照合
判定部、9は制御部、10は記憶部、11は選択部、1
2は候補選択部、13は複数音節単語認識部、14は単
音節単語認識部である。The figure is a block diagram of a circuit showing one embodiment of the present invention. 1 is a preprocessing section, and 2 is a parameter extraction section. 3.5 is a switching unit, 4 is a reduced parameter time series extraction unit, 6
is the reduced parameter time series storage unit for monosyllables. 7 is a reduced parameter time series storage unit for multiple syllables, 8 is a collation determination unit, 9 is a control unit, 10 is a storage unit, 11 is a selection unit, 1
2 is a candidate selection section, 13 is a multi-syllable word recognition section, and 14 is a monosyllabic word recognition section.
Claims (1)
認識システムに於て、音声の特徴パラメータを不均一サ
ンプリングし°ζ固定長の縮小パラメータ時系列を得る
手段と該縮小パラメータ時系列を用いて照合し複数音節
単語か単音節単語かを選別する手段とを設は該選別手段
により選別された複数音節単語は該選別時の情報により
認識する照合候補を絞って選択してから認識することを
特徴とする複数音節単語認識方式。In a speech recognition system that recognizes multi-syllable words and single-f, -clausal words, there is provided a means for non-uniformly sampling speech feature parameters to obtain a reduced parameter time series of fixed length °ζ, and the reduced parameter time series. The multi-syllable words selected by the screening means are recognized after narrowing down and selecting the matching candidates to be recognized using the information at the time of screening. A multi-syllable word recognition method characterized by:
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP57024978A JPS58159592A (en) | 1982-02-18 | 1982-02-18 | Polysyllabic word recognition system |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP57024978A JPS58159592A (en) | 1982-02-18 | 1982-02-18 | Polysyllabic word recognition system |
Publications (1)
Publication Number | Publication Date |
---|---|
JPS58159592A true JPS58159592A (en) | 1983-09-21 |
Family
ID=12153059
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
JP57024978A Pending JPS58159592A (en) | 1982-02-18 | 1982-02-18 | Polysyllabic word recognition system |
Country Status (1)
Country | Link |
---|---|
JP (1) | JPS58159592A (en) |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPS493507A (en) * | 1972-04-19 | 1974-01-12 | ||
JPS54145408A (en) * | 1978-05-06 | 1979-11-13 | Hiroya Fujisaki | Speech recognition system |
JPS56116148A (en) * | 1980-02-15 | 1981-09-11 | Nec Corp | Audio typewriter |
-
1982
- 1982-02-18 JP JP57024978A patent/JPS58159592A/en active Pending
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPS493507A (en) * | 1972-04-19 | 1974-01-12 | ||
JPS54145408A (en) * | 1978-05-06 | 1979-11-13 | Hiroya Fujisaki | Speech recognition system |
JPS56116148A (en) * | 1980-02-15 | 1981-09-11 | Nec Corp | Audio typewriter |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US4677673A (en) | Continuous speech recognition apparatus | |
US4769844A (en) | Voice recognition system having a check scheme for registration of reference data | |
US4720864A (en) | Speech recognition apparatus | |
JPS58159592A (en) | Polysyllabic word recognition system | |
JPH0566790A (en) | Speech recognition method | |
JPS58159593A (en) | Voice recognition system | |
JPS6151798B2 (en) | ||
JPS5887599A (en) | Spoken word recognition equipment | |
CN111583956B (en) | Voice processing method and device | |
JP2757356B2 (en) | Word speech recognition method and apparatus | |
KR20010018532A (en) | User Interface method using Hand-written character recognition and Speech Recognition Synchronous | |
JPH0262879B2 (en) | ||
JP2572753B2 (en) | Unspecified speaker consonant identification device | |
JPH0656557B2 (en) | Word detection method | |
JPH10340096A (en) | Voice recognition device | |
JPH06309443A (en) | Individual recognizing system combining finger print and voice | |
JPS58159594A (en) | Polysyllabic word recognition system | |
JPS63173100A (en) | Keyword extractor | |
JPS62164097A (en) | Voice discrimination system | |
JPH06167997A (en) | Speech recognizing device | |
JPH08146986A (en) | Speech recognition device | |
JPH01158499A (en) | Standing noise eliminaton system | |
JPS637398B2 (en) | ||
JPS6287997A (en) | Voice recognition equipment | |
JPS595294A (en) | Voice recognition equipment |