JP2005326677A

JP2005326677A - Voice memo printer

Info

Publication number: JP2005326677A
Application number: JP2004145400A
Authority: JP
Inventors: Yoshihiko Ikeda; 喜彦池田; Naoki Sekine; 直樹関根; Masanori Takeuchi; 雅則竹内; Junko Watanabe; 順子渡辺; Nobuo Watanabe; 伸夫渡辺; Shunji Saito; 俊次齊藤; Ekigen Yana; 益源梁; Wataru Sakurai; 渉櫻井
Original assignee: Toshiba TEC Corp
Current assignee: Toshiba TEC Corp
Priority date: 2004-05-14
Filing date: 2004-05-14
Publication date: 2005-11-24

Abstract

<P>PROBLEM TO BE SOLVED: To provide a voice memo printer with which users can leave memorandums freely, without being held by preparation times in all scenes, and which can be utilized efficiently. <P>SOLUTION: The voice memory printer extracts a word, having a language feature nearest to the feature of a voice inputted from a microphone 52, from a language pattern dictionary 60 in which language features of words which are to be used in special applications are registered and outputs it as a result of voice recognition and prints it. As a result, since language features of the words to be used in the special application are registered in the language pattern dictionary 60, a voice memory printer 1, with which incorrect recognition about voice recognition of voices uttered in the special application can be suppressed as much as possible, can be realized with a simple constitution. Thus, the voice memory printer 1, with which users can leave the memorandums freely without being caught by the preparing time in all scenes, and which can be utilized efficiently, can be provided. <P>COPYRIGHT: (C)2006,JPO&NCIPI

Description

本発明は、音声認識機能を搭載し、音声認識結果を印字可能な音声メモプリンタに関する。 The present invention relates to a voice memo printer equipped with a voice recognition function and capable of printing a voice recognition result.

マイクなどから入力された音声に基づいて生成された音声デジタルデータを解析し、人間の声をテキストに変換する音声認識技術はパーソナルコンピュータ等で活用され、キーボードによる手入力に代わる手段として普及し始めている。 Voice recognition technology that analyzes voice digital data generated based on voice input from a microphone, etc., and converts human voice into text has been utilized in personal computers, etc., and has begun to become popular as an alternative to manual input using a keyboard Yes.

一方、マイクなどから入力された音声を認識して用紙へ直接印字するものが知られている（例えば、特許文献１，２参照）。 On the other hand, there is known one that recognizes voice input from a microphone or the like and prints directly on a sheet (for example, see Patent Documents 1 and 2).

特開平５−３５４２８号公報Japanese Patent Laid-Open No. 5-35428 特開平８−２０１５号公報JP-A-8-2015

ところで、近年、集団でアイデアを出すための会議方式の一つとして、ブレーンストーミング（Brain Stormimg Method）が一般的になっている。ブレーンストーミング（ＫＪ法）は、最初の段階ではテーマについてのアイデアをアイデア単位で次々に付箋紙や小紙に書き込んでいき、アイデアが出尽くしたところでアイデアが書き出された各付箋紙等を分類し、テーマを分析するものである。 By the way, in recent years, brainstorming (Brain Stormimg Method) has become common as one of the conference methods for group ideas. In brainstorming (KJ method), the idea of the theme is written on sticky notes and small papers one after another in the initial stage, and each sticky note on which the idea is written is classified when the idea is exhausted. , Analyze themes.

しかしながら、発言をする度に付箋紙等に書き込んでいく従来のブレーンストーミングでは、付箋紙等に発言内容を書き込む等の手間により検討時間が減少し、能率が低下してしまう。 However, in conventional brainstorming in which writing is made on a sticky note each time a statement is made, the examination time is reduced due to the trouble of writing the contents of the statement on a sticky note or the like, and efficiency is lowered.

そこで、前述したような特許文献１，２のように音声認識技術を用いて人間の声を直接印字することも考えられるが、これは例えば会議の議事録を作成するために考えられたものであり、上述のようなブレーンストーミングに用いようとすると、印字したものを切ってから貼り付けるという作業という手間がかなりかかり、能率の低下は否めない。 Therefore, it is conceivable to directly print a human voice using the speech recognition technology as in Patent Documents 1 and 2 as described above, but this was conceived, for example, to create a meeting minutes. However, if it is intended to be used for brainstorming as described above, it takes a lot of labor to cut and paste the printed matter, and a reduction in efficiency cannot be denied.

また、従来の音声認識技術によれば、事前に長文を学習させ不特定に発せられた音声を誤認識なく認識させるために、高機能のパフォーマンスを有するＣＰＵの環境下で装置を動作させなければならず、非常に高価なものとなっている。 Further, according to the conventional speech recognition technology, in order to learn long sentences in advance and recognize unspecified speech without misrecognition, the device must be operated under the environment of a CPU having high performance. It is very expensive.

本発明は、あらゆるシーンで作成時間にとらわれずに気軽にメモを残すことができ、能率的に活用できる音声メモプリンタを提供することを目的とする。 An object of the present invention is to provide a voice memo printer that can easily leave a memo regardless of the creation time in any scene and can be used efficiently.

本発明は、音声認識についての誤認識を極力抑えた音声メモプリンタを簡便な構成で実現することを目的とする。 SUMMARY OF THE INVENTION An object of the present invention is to realize a voice memo printer that suppresses erroneous recognition of voice recognition as much as possible with a simple configuration.

本発明は、音声を入力するマイクと、このマイクから入力された音声アナログデータを音声デジタルデータに変換するＡ／Ｄ変換手段と、このＡ／Ｄ変換手段により変換された音声デジタルデータを周波数変換して解析する周波数解析手段と、特定用途向けの言語パターン辞書を持つ音声認識手段と、前記周波数解析手段により解析された周波数に基づき前記音声認識手段から出力された音声認識結果を印字する印字手段と、を備える。 The present invention relates to a microphone for inputting voice, A / D conversion means for converting voice analog data inputted from the microphone into voice digital data, and frequency conversion of the voice digital data converted by the A / D conversion means. Frequency analysis means for analyzing the voice, speech recognition means having a language pattern dictionary for specific applications, and printing means for printing the speech recognition result output from the voice recognition means based on the frequency analyzed by the frequency analysis means And comprising.

したがって、マイクから入力された音声の音響特徴に最も近い言語特徴を有している単語が、言語パターン辞書から抽出されて音声認識結果として出力され、印字される。これにより、言語パターン辞書には特定用途で使われる単語の言語特徴が登録されていることから、特定用途で発せられる音声についての音声認識についての誤認識を極力抑えた音声メモプリンタを簡便な構成で実現することが可能になる。 Therefore, the word having the language feature closest to the acoustic feature of the voice input from the microphone is extracted from the language pattern dictionary, output as a voice recognition result, and printed. As a result, the language features of words used for specific purposes are registered in the language pattern dictionary, so a simple configuration of a voice memo printer that minimizes misrecognition of speech recognition for voices issued for specific purposes Can be realized.

本発明によれば、言語パターン辞書に特定用途で使われる言語特徴を登録していることから、特定用途で発せられる音声についての音声認識についての誤認識を極力抑えた音声メモプリンタを簡便な構成で実現することができるので、あらゆるシーンで作成時間にとらわれずに気軽にメモを残すことができ、能率的に活用できる音声メモプリンタを提供することができる。 According to the present invention, since a language feature used for a specific application is registered in the language pattern dictionary, a simple configuration of a voice memo printer that suppresses misrecognition of voice recognition for a voice emitted for a specific application as much as possible. Therefore, it is possible to provide a voice memo printer that can easily leave a memo in any scene regardless of the creation time and can be used efficiently.

本発明の実施の一形態を図１ないし図７に基づいて説明する。 An embodiment of the present invention will be described with reference to FIGS.

ここで、図１は本発明の実施の一形態の音声メモプリンタ１をラベル排出側から示す外観斜視図、図２は音声メモプリンタ１をオペレータ装着側から示す外観斜視図、図３は音声メモプリンタ１の内部構造を示す水平断面図である。 Here, FIG. 1 is an external perspective view showing the voice memo printer 1 according to an embodiment of the present invention from the label discharge side, FIG. 2 is an external perspective view showing the voice memo printer 1 from the operator mounting side, and FIG. FIG. 2 is a horizontal sectional view showing the internal structure of the printer 1.

図１ないし図３に示すように、携帯可能なポータブルプリンタである音声メモプリンタ１のプリンタ本体１ａは、一面が開放されたケース２と、このケース２の開放された面を開閉するカバー３とより構成されている。カバー３は、ケース２に設けられた支点軸４により回動自在に支持されている。そして、ケース２には、カバー３を閉じた状態で、ロール状に巻回された長尺状の記録紙５を転動自在に収納するホッパ６が形成されている。なお、本実施の形態においては、記録紙５として台紙５ａに多数のラベル５ｂを等間隔で貼付したものを用いているが、他の記録紙を用いても良い。ラベル５ｂには粘着力の弱い糊が塗布されており、印字発行後には、付箋紙Ｐ（図７参照）としても利用可能である。 As shown in FIGS. 1 to 3, a printer main body 1a of a voice memo printer 1 which is a portable portable printer includes a case 2 with one side opened, and a cover 3 for opening and closing the opened side of the case 2. It is made up of. The cover 3 is rotatably supported by a fulcrum shaft 4 provided on the case 2. The case 2 is formed with a hopper 6 for storing the long recording paper 5 wound in a roll shape with the cover 3 closed. In the present embodiment, the recording paper 5 is obtained by attaching a large number of labels 5b to the mount 5a at equal intervals, but other recording paper may be used. The label 5b is coated with adhesive having a weak adhesive force, and can be used as a sticky note P (see FIG. 7) after the printing is issued.

このようなケース２には、ホッパ６の底部からカバー３側に向けて延出する用紙ガイド７が設けられており、この用紙ガイド７のカバー３に近い部分には、回転自在のプラテン８と、このプラテン８の長手方向に沿うラベル剥離体９とが配設されている。 In such a case 2, a paper guide 7 extending from the bottom of the hopper 6 toward the cover 3 is provided. A portion of the paper guide 7 near the cover 3 has a rotatable platen 8 and A label peeling body 9 along the longitudinal direction of the platen 8 is disposed.

図３に示すように、カバー３の内面（ホッパ６側）には、サーマルヘッド１２を備えたヘッド支持体１１が支軸１１ａを中心に回動自在に設けられている。このヘッド支持体１１は板ばね１３により一方向に付勢されており、サーマルヘッド１２はカバー３を閉じた状態でプラテン８に当接することになる。すなわち、プラテン８とサーマルヘッド１２とにより印字部１４が形成されている。 As shown in FIG. 3, a head support 11 having a thermal head 12 is provided on the inner surface of the cover 3 (on the hopper 6 side) so as to be rotatable around a support shaft 11a. The head support 11 is biased in one direction by a leaf spring 13, and the thermal head 12 comes into contact with the platen 8 with the cover 3 closed. That is, the printing unit 14 is formed by the platen 8 and the thermal head 12.

また、カバー３の自由端側の両側には、スプリング１５の付勢力によりプラテン８に圧接されたピンチローラ１６が回転自在に設けられている。さらに、カバー３には、サーマルヘッド１２とピンチローラ１６との間に配置されてラベル５ｂを排出させるラベル排出口１７と、ホッパ６内の記録紙５の浮きを押える紙押え１８とが形成されている。ケース２にはカバー３の自由端との間で台紙５ａを排出させる台紙排出口１９が形成されている。 Further, on both sides of the free end side of the cover 3, pinch rollers 16 that are pressed against the platen 8 by the urging force of the spring 15 are rotatably provided. Further, the cover 3 is formed with a label discharge port 17 that is disposed between the thermal head 12 and the pinch roller 16 and discharges the label 5b, and a paper presser 18 that presses the floating of the recording paper 5 in the hopper 6. ing. The case 2 is formed with a mount discharge port 19 for discharging the mount 5 a between the free end of the cover 3.

ケース２の上面には、バッテリ１０（図３参照）からの電力供給のＯＮ／ＯＦＦを宣言する電源スイッチ２０、ラベル５ｂに印字を行わせるフィードスイッチ２１、蓋部２２、赤外線を受光する受光窓２３が設けられている。蓋部２２は、ケース２の一つの面である上面に開口して設けられたバッテリ収納部３０（図３参照）に対してバッテリ１０を着脱する場合に開閉するものである。さらに、カバー３の両側には係止爪２４がスライド自在に設けられている（図１参照）。これらの係止爪２４は外側に向けて付勢されてケース２に係止され、カバー３を開放するときに係止爪２４を矢印マークで示すように内方スライドさせてケース２との係止状態を解除する。 On the upper surface of the case 2, a power switch 20 that declares ON / OFF of power supply from the battery 10 (see FIG. 3), a feed switch 21 that performs printing on the label 5 b, a lid 22, and a light receiving window that receives infrared rays. 23 is provided. The lid portion 22 opens and closes when the battery 10 is attached to and detached from the battery storage portion 30 (see FIG. 3) provided to be opened on the upper surface that is one surface of the case 2. Further, locking claws 24 are slidably provided on both sides of the cover 3 (see FIG. 1). These locking claws 24 are urged outward to be locked to the case 2, and when the cover 3 is opened, the locking claws 24 are slid inward as indicated by the arrow marks to engage with the case 2. Release the stop state.

また、ケース２のラベル排出口１７と同一面には、内蔵マイク５２が設けられている。本実施の形態の音声メモプリンタ１には、音声認識機能が搭載されており、この内蔵マイク５２は、この音声認識機能を実行する際に用いられるものである。 A built-in microphone 52 is provided on the same surface of the case 2 as the label discharge port 17. The voice memo printer 1 according to the present embodiment is equipped with a voice recognition function, and the built-in microphone 52 is used when executing the voice recognition function.

加えて、ケース２の上面には、ＬＥＤ５６が配設されている。本実施の形態の音声メモプリンタ１は、このＬＥＤ５６を点灯させたり点滅させることにより、音声メモプリンタ１の動作状態をオペレータに対して報知することができるようになっている。 In addition, an LED 56 is disposed on the upper surface of the case 2. The voice memo printer 1 of the present embodiment can notify the operator of the operation state of the voice memo printer 1 by turning on or blinking the LED 56.

さらに、図２に示すように、プリンタ本体１ａのカバー３とは反対側の一面には、オペレータの腰のあたりに密着される弧面２５が形成され、この弧面２５にはオペレータの衣服に対して滑りを少なくするための滑り止め２６と、この滑り止め２６に対向してオペレータのベルトに引っ掛けられるベルト掛け２７とが形成されている。 Further, as shown in FIG. 2, an arc surface 25 is formed on one surface of the printer body 1a opposite to the cover 3 so as to be in close contact with the operator's waist. On the other hand, a non-slip 26 for reducing slippage and a belt hook 27 which is hooked on the operator's belt so as to face the non-slip 26 are formed.

このような構成により、バッテリ１０がバッテリ収納部３０へと正しく収納された場合には、電源スイッチ２０がＯＮしている状態でバッテリ収納部３０の端子とバッテリ１０の端子とが接触して電気的に接続された状態となり、バッテリ１０から電力供給を必要とするサーマルヘッド１２等の各部へと電力が供給されることになる。 With such a configuration, when the battery 10 is correctly stored in the battery storage unit 30, the terminal of the battery storage unit 30 and the terminal of the battery 10 come into contact with each other while the power switch 20 is ON. Thus, power is supplied from the battery 10 to each part such as the thermal head 12 that requires power supply.

このような音声メモプリンタ１は、記録紙５をセットする場合にカバー３を開放し、ロール状に巻回された記録紙５をプリンタ本体１ａのホッパ６に収納し、カバー３が開放されている状態で記録紙５の先端をプラテン８及びラベル剥離体９を覆う位置まで引き出し、カバー３を閉塞する。これにより、図３に示すように、記録紙５の台紙５ａの先端部分が、サーマルヘッド１２とピンチローラ１６とによりプラテン８上に圧接され、また、ラベル剥離体９により台紙５ａの引き出し経路が鋭角に折曲され、ホッパ６の底面からの記録紙５の浮きが紙押え１８により阻止される。記録紙５をセットしたプリンタ本体１ａは、机上に置いて使用することも可能であるが、通常はオペレータの腰に装着した状態でも使用可能である。 In such a voice memo printer 1, when the recording paper 5 is set, the cover 3 is opened, the recording paper 5 wound in a roll shape is stored in the hopper 6 of the printer main body 1a, and the cover 3 is opened. In this state, the front end of the recording paper 5 is pulled out to a position covering the platen 8 and the label peeling body 9 and the cover 3 is closed. As a result, as shown in FIG. 3, the leading end portion of the mount 5a of the recording paper 5 is pressed against the platen 8 by the thermal head 12 and the pinch roller 16, and the pull-out path of the mount 5a is formed by the label peeling member 9. The recording paper 5 is bent at an acute angle and the recording paper 5 is prevented from floating from the bottom surface of the hopper 6. The printer main body 1a on which the recording paper 5 is set can be used by placing it on a desk, but it can also be used even when it is usually worn on the operator's waist.

次に、音声メモプリンタ１の各部の制御系の接続について図４を参照しつつ説明する。音声メモプリンタ１は、各部を集中的に制御するＣＰＵ（Central Processing Unit）４１を備えており、このＣＰＵ４１には、ＣＰＵ４１が実行するプログラム等の固定データが書き込まれているＲＯＭ（Read Only Memory）４２と、ワークデータ等の可変データを更新自在に書き込むＲＡＭ（Random Access Memory）４３と、各種情報を登録するフラッシュメモリ４４とがバスライン４５を介して接続されている。そして、サーマルヘッド１２を駆動するサーマルヘッドドライバ４６、プラテン８が連結されたモータ４７を駆動するモータドライバ４８、各種センサ４９が接続されたセンサ回路５０、カバー３の開閉によりオン、オフするカバーオープンスイッチ５１と電源スイッチ２０とフィードスイッチ２１とが接続されたスイッチ回路５４、赤外線インタフェース５５、ＬＥＤ５６が接続された点灯制御回路５７が、ＣＰＵ４１に接続されている。このように、図４に示す回路はプリンタ本体１ａの内部に設けられた基板（図示せず）上に形成されている。なお、赤外線インタフェース５５は、前述した受光窓２３の内方に配置されている。インタフェースは図ではＩ／Ｆと記す。 Next, the connection of the control system of each part of the voice memo printer 1 will be described with reference to FIG. The voice memo printer 1 includes a CPU (Central Processing Unit) 41 that centrally controls each unit. The CPU 41 stores a ROM (Read Only Memory) in which fixed data such as a program executed by the CPU 41 is written. 42, a RAM (Random Access Memory) 43 in which variable data such as work data is renewably written, and a flash memory 44 for registering various information are connected via a bus line 45. Then, a thermal head driver 46 for driving the thermal head 12, a motor driver 48 for driving a motor 47 connected to the platen 8, a sensor circuit 50 to which various sensors 49 are connected, and a cover open that is turned on and off by opening and closing the cover 3. A switch circuit 54 to which the switch 51, the power switch 20 and the feed switch 21 are connected, an infrared interface 55, and a lighting control circuit 57 to which the LED 56 is connected are connected to the CPU 41. As described above, the circuit shown in FIG. 4 is formed on a substrate (not shown) provided inside the printer main body 1a. The infrared interface 55 is disposed inside the light receiving window 23 described above. The interface is denoted as I / F in the figure.

また、ＣＰＵ４１には、音声入力用ＣＯＤＥＣ５３が接続されている。この音声入力用ＣＯＤＥＣ５３には、内蔵マイク５２が接続されている。音声入力用ＣＯＤＥＣ５３は、Ａ／Ｄ変換手段として機能するもので、内蔵マイク５２から入力された音声アナログデータを音声デジタルデータに変換してＣＰＵ４１に出力する。 The CPU 41 is connected with a voice input CODEC 53. A built-in microphone 52 is connected to the audio input CODEC 53. The audio input CODEC 53 functions as an A / D conversion unit, converts audio analog data input from the built-in microphone 52 into audio digital data, and outputs the audio digital data to the CPU 41.

さらに、ＣＰＵ４１には、音声認識エンジン５８が接続されている。この音声認識エンジン５８は、内蔵マイク５２から入力されて音声入力用ＣＯＤＥＣ５３で生成された音声デジタルデータを解析し、人間の声をテキストに変換するものである。このような音声認識エンジン５８は、例えば、人間の発声の小さな単位（音素）の音響特徴（音韻）が登録される音響辞書５９や音声認識させる単語の言語特徴が登録されている言語パターン辞書６０を用いて音声認識を行う。 Further, a speech recognition engine 58 is connected to the CPU 41. The voice recognition engine 58 analyzes voice digital data input from the built-in microphone 52 and generated by the voice input CODEC 53, and converts human voice into text. Such a speech recognition engine 58 includes, for example, an acoustic dictionary 59 in which acoustic features (phonemes) of small units (phonemes) of human speech are registered, and a language pattern dictionary 60 in which language features of words to be recognized are registered. Voice recognition is performed using.

本実施の形態の言語パターン辞書６０に登録されている音声認識させる単語は、特定用途に絞られている。特定用途では決まった言葉が発せられることが多いため、このように特定用途に絞った単語のみを言語パターン辞書６０に登録するようにすることで、言語パターン辞書６０を安価に構成することができる。具体的には、使用される用途において使用されるであろう会話や発声言語を一覧に纏め、用途別使用言語表（図示せず）とする。この用途別使用言語表に登録された各言語毎に、その言語の周波数を解析し、音声特徴（音韻情報）と言語特徴（音韻の系列情報）に分離する。このようにして分離された言語特徴が、言語パターン辞書６０に登録される。 The words to be recognized by speech registered in the language pattern dictionary 60 of the present embodiment are limited to specific applications. Since a specific word is often issued in a specific application, the language pattern dictionary 60 can be configured at low cost by registering only the words focused on the specific application in the language pattern dictionary 60 in this way. . Specifically, a list of conversations and utterance languages that will be used in the application to be used is collected and used as a use language table (not shown) for each application. For each language registered in this use language table for each application, the frequency of the language is analyzed and separated into speech features (phoneme information) and language features (phoneme series information). The language features separated in this way are registered in the language pattern dictionary 60.

音響辞書５９は、用途別でなく、音声認識全般に係わる辞書として使用される。声を発する原理は、
（１）『喉が震える』
（２）『口腔／鼻腔を通過』
と考えられることから、音響辞書５９には、声の周波数から（１）（２）の形状を特定する情報（人間の発声の小さな単位（音素）の音響特徴（音韻））を格納する。 The acoustic dictionary 59 is used as a dictionary related to voice recognition in general, not by use. The principle of speaking is
(1) “My throat trembles”
(2) “Passing through oral cavity / nasal cavity”
Therefore, the sound dictionary 59 stores information (acoustic features (phonemes) of small units (phonemes) of human utterance) that specify the shapes of (1) and (2) from the frequency of the voice.

このような構成の音声認識エンジン５８は、図５に示すように、内蔵マイク５２から入力されて音声入力用ＣＯＤＥＣ５３で生成された音声デジタルデータを周波数解析手段である周波数解析部５８ａにより周波数変換して解析し、比較部５８ｂにおいて音響辞書５９に基づいて音響特徴を抽出する（音声特徴抽出手段）。この段階では、前述した（１）（２）の形状が特定できただけで、５０音のどれかは、未だ特定できない。そこで、言語パターン辞書６０に登録されている単語の中から、単語の言語特徴が入力音声の音響特徴に最も近い単語を探して音声認識結果として出力する（言語特徴抽出手段）。このように言語パターン辞書６０と比較することで、初めて「あいうえお」等を特定することができる。不特定多数の言葉が発せられると特定は困難だが、特定の用途で発せられる言葉に絞り込むようにし、前述した（１）（２）の関係と音韻系列波形の特徴を単語全体で比較すれば、誤認識の可能性を極力抑える事ができ、このような簡便な機構で音声認識が可能となる。 As shown in FIG. 5, the speech recognition engine 58 having such a configuration performs frequency conversion on speech digital data input from the built-in microphone 52 and generated by the speech input CODEC 53 by a frequency analysis unit 58a which is a frequency analysis means. The comparison unit 58b extracts acoustic features based on the acoustic dictionary 59 (speech feature extraction means). At this stage, only the shapes (1) and (2) described above can be specified, and any of the 50 sounds cannot be specified yet. Therefore, from the words registered in the language pattern dictionary 60, a word whose language feature is closest to the acoustic feature of the input speech is searched for and output as a speech recognition result (language feature extraction means). By comparing with the language pattern dictionary 60 in this way, “Aiueo” or the like can be specified for the first time. It is difficult to specify when a large number of unspecified words are uttered, but if you try to narrow down to words that are uttered for a specific purpose and compare the relationship of (1) and (2) above and the characteristics of the phoneme sequence waveform, The possibility of misrecognition can be suppressed as much as possible, and speech recognition is possible with such a simple mechanism.

また、言語パターン辞書６０は、音声メモプリンタ１に図示しない外部機器（パーソナルコンピュータ等）を赤外線インタフェース５５を介して接続することで、当該外部機器から更新可能である。さらに、言語パターン辞書６０を格納する言語パターン格納チップ（辞書）の交換や言語パターン辞書６０の図示しない外部機器（パーソナルコンピュータ等）からのダウンロードによる登録内容の書き換えにより、言語パターン辞書６０の内容を特定用途毎に変えることも可能である。新たな言語パターン辞書６０が赤外線インタフェース５５を介してダウンロードされた場合には、旧言語パターン辞書６０は、抹消される。 The language pattern dictionary 60 can be updated from an external device by connecting an external device (such as a personal computer) (not shown) to the voice memo printer 1 via the infrared interface 55. Further, the contents of the language pattern dictionary 60 are changed by exchanging the language pattern storage chip (dictionary) for storing the language pattern dictionary 60 or rewriting the registered contents by downloading the language pattern dictionary 60 from an external device (such as a personal computer) (not shown). It is also possible to change for each specific application. When a new language pattern dictionary 60 is downloaded via the infrared interface 55, the old language pattern dictionary 60 is deleted.

次に、音声メモプリンタ１に内蔵されたＲＯＭ４２に格納された制御プログラムがＣＰＵ４１に実行させる機能のうち、本実施の形態の音声メモプリンタ１が備える特長的な機能について説明する。 Next, of the functions that the control program stored in the ROM 42 built in the voice memo printer 1 causes the CPU 41 to execute, the characteristic functions provided in the voice memo printer 1 of the present embodiment will be described.

ここで、音声メモプリンタ１のＣＰＵ４１が実行する音声印字処理について説明する。図６は、音声印字処理の流れを示すフローチャートである。図６に示すように、デジタル化された音声が入力されると（ステップＳ１のＹ）、ステップＳ２に進み、認識パターンの登録処理か、発声の音声認識処理かが判断される。 Here, the voice printing process executed by the CPU 41 of the voice memo printer 1 will be described. FIG. 6 is a flowchart showing the flow of the voice printing process. As shown in FIG. 6, when a digitized voice is input (Y in step S1), the process proceeds to step S2, and it is determined whether a recognition pattern registration process or an utterance voice recognition process.

発声の音声認識処理であると判断されると、音声認識エンジン５８による音声認識処理が実行される（ステップＳ３）。 If it is determined that the speech recognition processing is utterance, the speech recognition processing by the speech recognition engine 58 is executed (step S3).

音声認識処理において言語パターン辞書６０に登録されている単語であると判断された場合（ステップＳ４のＹ）、単語の言語特徴が入力音声の音響特徴に最も近い単語を探して音声認識結果として印字部１４に出力して印字する（ステップＳ５：印字手段）。ここで、図７は発行された付箋紙Ｐの一例を示す平面図である。図７に示すように、付箋紙Ｐには、「○○○○というアイデア」と発声した場合のテキスト「○○○○というアイデア」が印字されている。 If it is determined in the speech recognition process that the word is registered in the language pattern dictionary 60 (Y in step S4), the word whose language feature is closest to the acoustic feature of the input speech is searched and printed as a speech recognition result. The data is output to the section 14 and printed (step S5: printing means). Here, FIG. 7 is a plan view showing an example of the issued sticky note P. FIG. As shown in FIG. 7, the sticky note P is printed with the text “idea of XXX” when “speaking of an idea of XXX”.

音声認識処理において言語パターン辞書６０に登録されている単語でないと判断された場合（ステップＳ４のＮ）、音声認識せずにステップＳ１に戻る。 If it is determined in the voice recognition process that the word is not registered in the language pattern dictionary 60 (N in step S4), the process returns to step S1 without performing voice recognition.

一方、認識パターンの登録処理であると判断されると、ステップＳ６に進み、認識パターン登録処理を実行する。認識パターン登録処理は、使用される用途において使用されるであろう会話や発声言語を一覧に纏め、用途別使用言語表（図示せず）とし、この用途別使用言語表に登録された各言語毎に、その言語の周波数を解析し、音声特徴（音韻情報）と言語特徴（音韻の系列情報）に分離する。そして、このようにして分離された言語特徴を、言語パターン辞書６０に登録する。 On the other hand, if it is determined that the process is a recognition pattern registration process, the process advances to step S6 to execute a recognition pattern registration process. In the recognition pattern registration process, conversations and utterance languages that will be used in the intended use are summarized in a list, and a use language table (not shown) for each use is registered, and each language registered in this use language table for each use Each time, the frequency of the language is analyzed and separated into speech features (phoneme information) and language features (phoneme sequence information). Then, the language features separated in this way are registered in the language pattern dictionary 60.

このような音声メモプリンタ１は、あらゆるシーンで利用可能である。例えば、ブレーンストーミングにおけるアイデア出しの際にはアイデアを発声するだけで付箋紙Ｐに発声したアイデアが印字された状態で発行されてくるので、発行された付箋紙Ｐを模造紙等に貼り付けていけばよい。また、弁当店等における注文を受ける際にも、注文を受けた商品について発声するだけで付箋紙Ｐに注文された商品が印字された状態で発行されてくるので、発行された付箋紙Ｐを注文票として利用することができる。この注文票は、商品引渡しの際に商品に貼り付けておくようにすれば、商品の取り違いを防止することもできる。 Such a voice memo printer 1 can be used in any scene. For example, in brainstorming, when an idea is put out, it is issued in a state where the idea uttered is printed on the sticky note P just by uttering the idea. I'll do it. Also, when an order is received at a bento store or the like, the ordered product is issued on the sticky note P simply by speaking about the ordered product. It can be used as an order form. If the order slip is affixed to the product when the product is delivered, it is possible to prevent the product from being mixed.

このように本実施の形態によれば、内蔵マイク５２から入力された音声の特徴に最も近い言語特徴を有している単語が、言語パターン辞書６０から抽出されて音声認識結果として出力され、印字される。これにより、言語パターン辞書６０には特定用途で使われる単語の言語特徴が登録されていることから、特定用途で発せられる音声についての音声認識についての誤認識を極力抑えた音声メモプリンタ１を簡便な構成で実現することができるので、あらゆるシーンで作成時間にとらわれずに気軽にメモを残すことができ、能率的に活用できる音声メモプリンタ１を提供することができる。 As described above, according to the present embodiment, the word having the language feature closest to the feature of the voice input from the built-in microphone 52 is extracted from the language pattern dictionary 60 and output as the voice recognition result for printing. Is done. As a result, the language features of the words used in the specific application are registered in the language pattern dictionary 60. Therefore, the voice memo printer 1 that suppresses the misrecognition of the voice recognition of the voice generated in the specific application as much as possible can be simplified. Since it can be realized with a simple configuration, it is possible to provide a voice memo printer 1 that can easily leave a memo regardless of the creation time in any scene and can be used efficiently.

なお、本実施の形態においては、音声認識させる特定用途についての単語の言語特徴を登録している単一の言語パターン辞書６０を備えるようにしたが、これに限るものではなく、異なる特定用途についての単語の言語特徴をそれぞれ登録している複数の言語パターン辞書６０を備えるようにしても良い。この場合、特定用途別に言語パターン辞書６０を切り替えて使用するようにすれば良い。言語パターン辞書６０を切り替えは、入力された音声内容により切り替えるようにしても良いし、スイッチによって切り替えるようにしても良い。 In the present embodiment, the single language pattern dictionary 60 that registers the linguistic features of words for specific applications to be recognized by speech is provided. However, the present invention is not limited to this. A plurality of language pattern dictionaries 60 each registering the language characteristics of the word may be provided. In this case, the language pattern dictionary 60 may be switched and used for each specific application. The language pattern dictionary 60 may be switched according to the input voice content or may be switched by a switch.

本発明の実施の一形態の音声メモプリンタをラベル排出側から示す外観斜視図である。1 is an external perspective view showing a voice memo printer according to an embodiment of the present invention from a label discharge side. 音声メモプリンタをオペレータ装着側から示す外観斜視図である。It is an external appearance perspective view which shows a voice memo printer from the operator mounting side. 音声メモプリンタの内部構造を示す水平断面図である。It is a horizontal sectional view showing the internal structure of the voice memo printer. 音声メモプリンタの各部の制御系の接続を示すブロック図である。It is a block diagram which shows the connection of the control system of each part of a voice memo printer. 音声認識エンジンの構成を示すブロック図である。It is a block diagram which shows the structure of a speech recognition engine. 音声印字処理の流れを示すフローチャートである。It is a flowchart which shows the flow of an audio | voice printing process. 発行された付箋紙の一例を示す平面図である。It is a top view which shows an example of the issued sticky note paper.

Explanation of symbols

１…音声メモプリンタ、５２…マイク、５３…Ａ／Ｄ変換手段、５８ａ…周波数解析手段、５９…音響辞書、６０…言語パターン辞書
DESCRIPTION OF SYMBOLS 1 ... Voice memo printer, 52 ... Microphone, 53 ... A / D conversion means, 58a ... Frequency analysis means, 59 ... Acoustic dictionary, 60 ... Language pattern dictionary

Claims

A microphone for voice input,
A / D conversion means for converting audio analog data input from the microphone into audio digital data;
Frequency analysis means for frequency-converting and analyzing the audio digital data converted by the A / D conversion means;
A voice recognition means having a language pattern dictionary for specific applications;
Printing means for printing a voice recognition result output from the voice recognition means based on the frequency analyzed by the frequency analysis means;
Voice memo printer equipped with.

A microphone for voice input,
A / D conversion means for converting audio analog data input from the microphone into audio digital data;
Frequency analysis means for frequency-converting and analyzing the audio digital data converted by the A / D conversion means;
A language pattern dictionary with linguistic features for specific uses;
Language feature extraction means for selecting one of the language pattern dictionaries based on the frequency analyzed by the frequency analysis means;
Printing means for printing the speech recognition result output by the language feature extraction means;
Voice memo printer equipped with.

The language pattern dictionary is prepared for each specific application and can be exchanged according to the specific application.
The voice memo printer according to claim 1.

A plurality of language pattern dictionaries are prepared for each specific application, and the language pattern dictionary is switched according to the specific application.
The voice memo printer according to claim 1.

The registered contents of the language pattern dictionary can be rewritten from an external device.
The voice memo printer according to any one of claims 1 to 4.