JPH07191686A

JPH07191686A - Device for reading document aloud

Info

Publication number: JPH07191686A
Application number: JP5347247A
Authority: JP
Inventors: Takafumi Ito; 隆文伊藤
Original assignee: Toshiba Corp
Current assignee: Toshiba Corp
Priority date: 1993-12-27
Filing date: 1993-12-27
Publication date: 1995-07-28

Abstract

PURPOSE:To provide the document reading device which is improved in operation convenience when a document reading board is mounted on a computer in which a voice reproducing function is incorporated. CONSTITUTION:A CPU 1, a voice synthesizer 5, an oversampling digital filter 5 for return noise removal of sampling, and a 1-bit D/A converter 7 which converts digital voice data into a binary pulse train by SIGMA-DELTA conversion are integrated on one chip; and the binary pulse train which is generated here is transmitted through an IC card slot as the interface of the computer to provide a function which reads text data aloud through an LPF, an amplifier, and a loudspeaker incorporated in the computer main body.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】この発明は、漢字かな混じり文を
音声に直接変換する文章読み上げ装置の改良に関するも
のである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to an improvement of a text-to-speech device for directly converting a mixed kanji / kana sentence into a voice.

【０００２】[0002]

【従来の技術】文書作成ソフトウェアや文書編集ソフト
ウェア（テキストエディタ）等で作成された漢字かな混
じり文章に基づいて音声信号を合成出力し、それを読み
上げる文章読み上げ装置が開発された。2. Description of the Related Art A text-to-speech device has been developed in which a voice signal is synthesized and output based on a text containing kanji and kana created by text-writing software or text-editing software (text editor) and the voice is read aloud.

【０００３】この文章読み上げ装置は、通常のテキスト
データから直接自然な音声信号を生成するものであり、
漢字かな混じり文を直接耳で聞き取ることができるのた
め各種用途が考えられる。This text-to-speech device directly produces natural voice signals from ordinary text data.
It can be used for various purposes because you can directly listen to mixed kanji and kana sentences.

【０００４】文章読み上げ装置は、システムの小型化と
ホスト計算機の負荷を軽減するために文音声合成ボード
（オプションボード）として実現される。即ち、パーソ
ナルコンピュータ（ＰＣ）やワークステーション（Ｗ
Ｓ）、ワードプロセッサ（ＷＰ）等のホストコンピュー
タが持つ拡張スロットに実装される基板、または、ＩＣ
カードスロットに実装されるＩＣカードの形で提供され
る。図５にその内部構成を示す。The text-to-speech device is realized as a text-to-speech synthesis board (option board) in order to downsize the system and reduce the load on the host computer. That is, a personal computer (PC) or workstation (W
S), a board mounted in an expansion slot of a host computer such as a word processor (WP), or an IC
It is provided in the form of an IC card mounted in a card slot. FIG. 5 shows its internal configuration.

【０００５】図５を用いて構成動作について簡単に説明
すると以下のようになる。文章データはホストコンピュ
ータによって生成され、コネクタを介して文章読み上げ
装置に送られる。文章読み上げ装置は、ホストコンピュ
ータによって生成される文章データに対して、言語処理
装置５１で単語分割やアクセント情報付加などの読み上
げのための言語処理を行い、音声記号列（韻律情報と音
韻系列）を得る。音声記号列は音声合成装置５２に送ら
れ、ここで音声素片が取り出され、音声信号が規則合成
される。音声信号はアナログ増幅器５３で増幅され、そ
の音声を外部のイヤホンやスピーカ５４で聞くことがで
きる。The configuration operation will be briefly described below with reference to FIG. The text data is generated by the host computer and sent to the text reading device via the connector. The text-to-speech device performs, on the text data generated by the host computer, linguistic processing such as word division and accent information addition by the linguistic processing device 51 to read out a speech symbol string (prosodic information and phonological sequence). obtain. The voice symbol string is sent to the voice synthesizer 52, where the voice unit is extracted and the voice signal is regularly synthesized. The voice signal is amplified by the analog amplifier 53, and the voice can be heard by an external earphone or speaker 54.

【０００６】[0006]

【発明が解決しようとする課題】ところで、パーソナル
コンピュータ（ＰＣ）、ワークステーション（ＷＳ）、
ワードプロセッサ（ＷＰ）等のホストコンピュータの中
には、増幅器やスピーカを含む音声録音や再生を実現す
る回路を本体側に内蔵している機種もある。図６にその
構成例を示す。By the way, a personal computer (PC), a workstation (WS),
Some host computers, such as word processors (WP), have a built-in circuit for realizing voice recording and reproduction, including an amplifier and a speaker, in the main body. FIG. 6 shows an example of the configuration.

【０００７】図６において、符号６０は、パーソナルコ
ンピュータ（ＰＣ）、ワークステーション（ＷＳ）、ワ
ードプロセッサ（ＷＰ）等の本体（ＰＣ／ＷＳ／ＷＰ本
体と称す）、符号６１は図５に示す構成を持つ文章読み
上げ装置、符号６２，６３，６４，６５はそれぞれＰＣ
／ＷＳ／ＷＰ本体６０が内蔵するもので、６２は本体信
号源、６３は増幅器、６４はローパスフィルタ（ＬＰ
Ｆ）、６５はスピーカである。In FIG. 6, reference numeral 60 is a main body (referred to as a PC / WS / WP main body) of a personal computer (PC), workstation (WS), word processor (WP), etc., and reference numeral 61 is the configuration shown in FIG. The text reading device, 62, 63, 64, 65 are PCs, respectively
/ WS / WP body 60 is built in, 62 is a body signal source, 63 is an amplifier, 64 is a low-pass filter (LP
F) and 65 are speakers.

【０００８】一方、ＰＣ／ＷＳ／ＷＰ等のホストコンピ
ュータに装備されている汎用の拡張スロットやＩＣカー
ドスロットには、デジタル信号処理のためのインタフェ
ースしかないため、文章読み上げ装置で生成したアナロ
グ音声信号を、この拡張スロットやＩＣカードスロット
を介してホストコンピュータ本体に転送することができ
ない。On the other hand, since a general-purpose expansion slot or IC card slot equipped in a host computer such as a PC / WS / WP has an interface for digital signal processing, an analog voice signal generated by a text-to-speech device is required. Cannot be transferred to the host computer main body via the expansion slot or IC card slot.

【０００９】このため、文章読み上げ装置で合成された
読み上げ音声を、ホストコンピュータが持つスピーカ６
５で再生して聞きたいときには、汎用の拡張スロットや
ＩＣカードスロットとは別の、専用のアナログ信号回線
（接続ケーブル６８、アナログ信号入力コネクタ６９）
を設けてアナログ信号を文章読み上げ装置からホストコ
ンピュータ本体に転送する必要があった。Therefore, the read-out voice synthesized by the text-to-speech device is used by the speaker 6 of the host computer.
When you want to play and listen with 5, you can use a dedicated analog signal line (connection cable 68, analog signal input connector 69) that is different from the general-purpose expansion slot or IC card slot.
It was necessary to provide an analog signal from the text reading device to the host computer body.

【００１０】この場合、外部に接続ケーブルが必要とな
るため接続が面倒で煩わしく使い難い。また、本体側に
専用のアナログ音声入力用のインタフェースが必要にな
るといった問題があった。In this case, since a connection cable is required externally, the connection is troublesome, troublesome and difficult to use. There is also a problem that a dedicated analog voice input interface is required on the main body side.

【００１１】この発明は上記事情に鑑みてなされたもの
であり、音声再生機能を内蔵するホストコンピュータ
に、文章読み上げボードを実装した際の使い勝手を向上
させることのできる文章読み上げ装置を提供することを
目的とする。The present invention has been made in view of the above circumstances, and it is an object of the present invention to provide a text-to-speech device capable of improving usability when a text-to-speech board is mounted on a host computer having a voice reproduction function. To aim.

【００１２】[0012]

【課題を解決するための手段】本発明の文章読み上げ装
置は、テキストデータを生成するコンピュータ本体と、
コンピュータ本体により生成されるテキストデータを言
語解析してディジタル音声データに変換する文音声合成
装置と、デジタル音声データを２値のパルス列によるエ
ネルギー密度波の音声データに変換する１ビットＤ／Ａ
変換器と、１ビットＤ／Ａ変換器によって変換された２
値のパルス列をデジタルインタフェースを介して出力す
るデータ転送装置とを具備することを特徴とする。コン
ピュータは、増幅器とスピーカを含む音声再生のための
手段を持つ。また、文音声合成装置は、内蔵メモリに格
納されたプログラムに従がいコンピュータ本体から送ら
れるテキストデータに対し内蔵の言語処理用辞書を参照
しながら読み上げのための言語処理を行って音声記号列
を得、更に内蔵の音声素片データを参照して音声記号列
から音声素片を取りだし、必要なパラメータを抽出して
そのパラメータを音声合成器に出力する音声処理回路
と、音声処理回路により生成されるパラメータをデジタ
ル信号処理してデジタル発声データ変換する音声合成器
と、音声発生器によって生成されるディジタル音声デー
タをオーバーサンプリングしてサンプリング点間を補間
し、その音声データに含まれる折り返しノイズの除去を
行うディジタルフィルタと、Σ−△変換によりデジタル
音声データを２値のパルス列に変換する１ビットＤ／Ａ
コンバータと、この１ビットＤ／Ａ変換器によって変換
された２値のパルス列をデジタルインタフェースを介し
て出力するデータ転送装置と、１ビットＤ／Ａ変換器に
よって変換された２値のパルス列をアナログ音声信号に
変換するローパスフィルタと、増幅器、アナログ信号出
力コネクタを具備することを特徴とする。更に、上記音
声処理回路、音声合成器、ディジタルフィルタならびに
１ビットＤ／Ａコンバータは１チップＬＳＩに集積さ
れ、１ビットＤ／Ａコンバータにより生成されるディジ
タル音声データを一定の工業規格に準拠したインタフェ
ースを介し再送機能を持つコンピュータに供給すること
も特徴とする。A text-to-speech device of the present invention comprises a computer main body for generating text data,
A sentence-speech synthesizer for linguistically analyzing text data generated by a computer body and converting it into digital voice data, and a 1-bit D / A for converting the digital voice data into energy density wave voice data by a binary pulse train.
2 converted by a converter and a 1-bit D / A converter
And a data transfer device for outputting a pulse train of values via a digital interface. The computer has means for audio reproduction including an amplifier and a speaker. Further, the sentence-speech synthesizer performs a language process for reading a text data sent from a computer according to a program stored in a built-in memory, while referring to a built-in language processing dictionary to generate a voice symbol string. Then, it is generated by the speech processing circuit that extracts the speech unit from the speech symbol string by referring to the built-in speech unit data, extracts the necessary parameters, and outputs the parameters to the speech synthesizer, and the speech processing circuit. A voice synthesizer that processes digital parameters to convert the digital voice data to digital voice data, and the digital voice data generated by the voice generator is oversampled to interpolate between sampling points and remove aliasing noise included in the voice data. And a digital filter that performs Σ-Δ conversion to convert the digital audio data into a binary pulse train. 1-bit D / A to
A converter, a data transfer device for outputting a binary pulse train converted by the 1-bit D / A converter through a digital interface, and a binary pulse train converted by the 1-bit D / A converter for analog voice. It is characterized by including a low-pass filter for converting into a signal, an amplifier, and an analog signal output connector. Further, the voice processing circuit, the voice synthesizer, the digital filter and the 1-bit D / A converter are integrated in a 1-chip LSI, and the digital voice data generated by the 1-bit D / A converter is interfaced according to a certain industrial standard. It is also characterized in that it is supplied to a computer having a retransmission function via.

【００１３】[0013]

【作用】本発明の文章読み上げ装置は、ＣＰＵと、音声
合成器と、サンプリングの折り返しノイズ除去用のオー
バーサンプリングデジタルフィルタと、Σ−△変換によ
りデジタル音声データを２値のパルス列に変換する１ビ
ットＤ／Ａコンバータとを１チップ集積し、更にＣＰＵ
を動作させるためのプログラムや言語処理用の辞書なら
びに音声合成用の音声素片データ等を格納するＲＡＭ
と、文章データを記憶するＲＡＭと、ＰＣ／ＷＳ等の他
の情報処理装置とのインタフェースとなるＩＣカードス
ロットと、２値のパルス列をアナログ音声信号に変換す
るＬＰＦ（ローバスフィルタ）と、増幅器、イヤホンジ
ャックをクレジットカード大のＩＣカードに納め、この
ＩＣカードによりコンピュータから送られてくるテキス
トデータを読み上げる機能を実現する。具体的には、Ｃ
ＰＵは、内蔵メモリに格納されたプログラムに従がいコ
ンピュータ本体から送られるテキストデータに対し内蔵
の言語処理用辞書を参照しながら読み上げのための言語
処理を行って音声記号列を得、更に内蔵の音声素片デー
タを参照して音声記号列から音声素片を取りだし、必要
なパラメータを抽出してそのパラメータを音声合成器に
出力する。音声合成器は、ＣＰＵにより生成されるパラ
メータをデジタル信号処理してデジタル発声データに変
換し、ディジタルフィルタによって、ディジタル音声デ
ータをオーバーサンプリングしてサンプリング点間を補
間し、その音声データに含まれる折り返しノイズの除去
を行う。そして、Σ−△変換によりデジタル音声データ
を２値のパルス列に変換する。The text-to-speech device of the present invention comprises a CPU, a voice synthesizer, an oversampling digital filter for removing sampling aliasing noise, and a 1-bit converting digital voice data into a binary pulse train by Σ-Δ conversion. D / A converter and 1 chip integrated, and CPU
RAM for storing programs for operating the computer, language processing dictionaries, and voice unit data for voice synthesis
A RAM for storing text data; an IC card slot for interfacing with other information processing devices such as PC / WS; an LPF (low-pass filter) for converting a binary pulse train into an analog audio signal; and an amplifier. The earphone jack is stored in a credit card-sized IC card, and the IC card realizes a function of reading aloud text data sent from a computer. Specifically, C
The PU performs a language processing for reading the text data sent from the computer according to the program stored in the built-in memory while referring to the built-in language processing dictionary to obtain a phonetic symbol string, and further The speech unit is extracted from the speech symbol string by referring to the speech unit data, necessary parameters are extracted, and the parameters are output to the speech synthesizer. The voice synthesizer digitally processes the parameters generated by the CPU to convert them into digital voicing data, oversamples the digital voice data by a digital filter, interpolates between sampling points, and returns the data included in the voice data. Remove noise. Then, the digital audio data is converted into a binary pulse train by Σ-Δ conversion.

【００１４】また、上記Ｄ／Ａコンバータで変換された
２値音声信号をＩＣカードスロットインタフェースのデ
ジタル信号インタフェースを介してＰＣ／ＷＰ等に転送
し、その２値表現される音声信号をＰＣ／ＷＳ／ＷＰ等
コンピュータに内蔵されたローパスフィルタ（ＬＰ
Ｆ）、増幅器を通してスピーカで再生する機能を提供す
るものである。Further, the binary audio signal converted by the D / A converter is transferred to a PC / WP or the like via a digital signal interface of an IC card slot interface, and the binary audio signal is represented by PC / WS. Low pass filter (LP
F), which provides a function of reproducing with a speaker through an amplifier.

【００１５】本発明によれば、文章読み上げ装置で再生
された音声をコンピュータ本体に内蔵されたスピーカで
再生する際、汎用のＩＣカードインタフェースのデジタ
ルインタフェースを介して転送し、コンピュータ本体は
簡単なＬＰＦを通すだけでアナログ音声を再生できるの
で、アナログ音声信号を得るのに、コンピュータ本体側
に必要なのは簡単なＬＰＦだけであり、従って、外部の
接続ケーブル不要、専用のアナログ音声入力ようのイン
タフェースが不要になるといった利点を持つ。また、Ｉ
Ｃカードインタフェースから転送される音声信号は２値
のデジタルデータであるため本体側での制御が容易とな
るといった効果が得られる。According to the present invention, when the voice reproduced by the text-to-speech device is reproduced by the speaker built in the computer body, it is transferred through the digital interface of the general-purpose IC card interface, and the computer body is a simple LPF. Since analog audio can be played back only by passing it through, only a simple LPF is required on the computer side to obtain an analog audio signal. Therefore, no external connection cable is required and no dedicated analog audio input interface is required. It has the advantage of becoming. Also, I
Since the audio signal transferred from the C card interface is binary digital data, it is possible to obtain an effect that control on the main body side becomes easy.

【００１６】[0016]

【実施例】以下、図面を使用して本発明実施例について
詳細に説明する。Embodiments of the present invention will be described in detail below with reference to the drawings.

【００１７】図１は本発明の実施例を示すブロック図で
ある。FIG. 1 is a block diagram showing an embodiment of the present invention.

【００１８】図において、符号１は本発明の文書読み上
げを制御するＣＰＵである。符号２はＲＯＭであり、Ｃ
ＰＵ１を動作させるためのプログラムとデータａ、言語
処理用の辞書ｂ、そして音声合成用の音声素片データｃ
を格納している。In the figure, reference numeral 1 is a CPU for controlling the reading of a document according to the present invention. Reference numeral 2 is a ROM, C
A program and data a for operating the PU1, a dictionary b for language processing, and speech unit data c for speech synthesis.
Is stored.

【００１９】符号３はワークＲＡＭでありＣＰＵ１が処
理途中の各種データを一時的に格納するダイナミックＲ
ＡＭである。符号４は文書データ記憶用ＲＡＭであり、
外部インタフェースを通して送られてきた文章データ
（テキストデータ）を記憶しておくスタテックＲＡＭで
あり、バッテリでバックアップされている。Reference numeral 3 is a work RAM, which is a dynamic R in which the CPU 1 temporarily stores various data being processed.
AM. Reference numeral 4 is a RAM for storing document data,
This is a static RAM that stores text data (text data) sent through an external interface and is backed up by a battery.

【００２０】符号５は音声合成器であり、ＣＰＵ１から
のデータをデジタル信号処理してデジタル発声データ
（ＰＣＭ音声データ）に変換する。符号６はオーバーサ
ンプリングデジタルフィルタであり、音声合成器５で合
成されたデジタル音声データをオーバーサンプリングし
てサンプリングによる折り返しノイズを除去する。Reference numeral 5 is a voice synthesizer, which digitally processes the data from the CPU 1 to convert it into digital voicing data (PCM voice data). Reference numeral 6 is an oversampling digital filter that removes aliasing noise due to sampling by oversampling the digital voice data synthesized by the voice synthesizer 5.

【００２１】符号７は１ビットＤ／Ａコンバータであ
り、デジタルフィルタからのデジタル音声データをさら
にオーバーサンプリングしてΣ−△変換方式にて２値の
ＰＷＭ（Pulse Width Modulation）信号に変調する。Reference numeral 7 is a 1-bit D / A converter, which further oversamples the digital audio data from the digital filter and modulates it into a binary PWM (Pulse Width Modulation) signal by the Σ-Δ conversion method.

【００２２】符号８はＬＰＦ（ローパスフィルタ）であ
り、１ビットＤ／Ａコンバータ７により生成されるＰＷ
Ｍ信号からアナログ信号を生成する。符号９は増幅器で
あり、ここでアナログ音声信号を電圧／電力増幅する。
符号１０はイヤホンジャックであり、ここでアナログ音
声信号を外部に出力する。このイヤホンジャックにイヤ
ホンを直接接続するか、パワーアンプ内蔵スピーカを接
続することで読み上げ音声を聴くことができる。Reference numeral 8 is an LPF (low pass filter), which is a PW generated by the 1-bit D / A converter 7.
An analog signal is generated from the M signal. Reference numeral 9 is an amplifier, which amplifies the analog audio signal by voltage / power.
Reference numeral 10 is an earphone jack, which outputs an analog audio signal to the outside. You can listen to the read voice by connecting an earphone directly to this earphone jack or connecting a speaker with a built-in power amplifier.

【００２３】符号１１は内部のデータバスである。符号
１２は外部インタフェースであり、例えば、ＪＥＩＤＡ
バージョン４規格に準じたＩＣカードとＰＣやＷＳ等情
報処理装置とを接続するための標準的なインタフェース
である。尚、図中、ＣＰＵ１、音声合成器５、オーバー
サンプリングデジタルフィルタ６、及び１ビットＤ／Ａ
コンバータ７は１チップＬＳＩに実装される。Reference numeral 11 is an internal data bus. Reference numeral 12 is an external interface, for example, JEIDA
It is a standard interface for connecting an IC card conforming to the version 4 standard and an information processing device such as a PC or WS. In the figure, CPU 1, voice synthesizer 5, oversampling digital filter 6, and 1-bit D / A
The converter 7 is mounted on a 1-chip LSI.

【００２４】図２、及び図３はそれぞれ本発明の実施例
の動作を説明するために引用した図であり、図２はオー
バサンプリングディジタルフィルタによるオーバサンプ
リングと補間を説明するための図、図３はＰＷＭ波によ
るデータ表現の例を示す図である。FIGS. 2 and 3 are respectively cited for explaining the operation of the embodiment of the present invention, and FIG. 2 is a diagram for explaining oversampling and interpolation by an oversampling digital filter, and FIG. [Fig. 6] is a diagram showing an example of data representation by a PWM wave.

【００２５】図４は本発明が採用されるＰＣ／ＷＳ／Ｗ
Ｐ等のホストコンピュータの内部構成を示す図である。
図６に示す従来例との比較対照の意味で示した。本発明
によれば、図６に従来例として示す専用のアナログ信号
回線（接続ケーブル６８、アナログ信号入力コネクタ６
９）が不要となることを示している。尚、図４の中で、
図６と同一番号の付されたブロックは図６に示すそれと
同じものとする。FIG. 4 is a PC / WS / W to which the present invention is applied.
It is a figure which shows the internal structure of a host computer, such as P.
It is shown in the sense of comparison and control with the conventional example shown in FIG. According to the present invention, a dedicated analog signal line (connection cable 68, analog signal input connector 6) shown as a conventional example in FIG.
It shows that 9) becomes unnecessary. In addition, in FIG.
The blocks with the same numbers as in FIG. 6 are the same as those shown in FIG.

【００２６】以下、図１に示す本発明の実施例の動作に
つき、図２、及び図３を使用して説明する。The operation of the embodiment of the present invention shown in FIG. 1 will be described below with reference to FIGS. 2 and 3.

【００２７】まず、ＰＣ／ＷＳ／ＷＰ等のホストコンピ
ュータによって文章データ（テキストデータ）が生成さ
れ、外部インタフェース１２を通して本発明の文章読み
上げ装置に送られる。文章読み上げ装置では、ＣＰＵ１
がＲＯＭ２に格納されたプログラム２ａに従って動作
し、送られた文章データに対して、内蔵の言語処理用辞
書２ｂを参照しながら単語分割やアクセント情報付加な
どの読み上げのための言語処理を行ない、音声記号列
（韻律情報と音韻系列）を得る。First, text data (text data) is generated by a host computer such as a PC / WS / WP and sent to the text reading device of the present invention through the external interface 12. In the text reading device, CPU1
Operates according to the program 2a stored in the ROM 2 and performs language processing for reading the sent text data such as word division and accent information addition while referring to the built-in language processing dictionary 2b, and outputs the voice. Obtain a symbol string (prosodic information and phoneme sequence).

【００２８】ＣＰＵは更に音声素片データ２ｃを参照し
て、音声記号列から音声素片を取りだし、必要なパラメ
ータを抽出して音声合成器５にそのパラメータを送る。The CPU further refers to the speech unit data 2c, extracts a speech unit from the speech symbol string, extracts a necessary parameter, and sends the parameter to the speech synthesizer 5.

【００２９】音声合成器５はＣＰＵ１から送られたパラ
メータをデジタル信号処理して、サンプリング周波数
（ｆｓ）＝１２ＫＨｚ、量子化精度＝１２ビットのデジ
タル発声データ（ＰＣＭ音声データ）に変換する。The voice synthesizer 5 processes the parameters sent from the CPU 1 into digital signals and converts them into digital voicing data (PCM voice data) having a sampling frequency (fs) = 12 KHz and a quantization precision = 12 bits.

【００３０】音声合成器５によって生成されたＰＣＭ音
声データは、オーバーサンプリングデジタルフイルタ６
によるサンプリング周波数ｆｓを８倍（８ｆｓ＝９６Ｋ
Ｈｚ）にオーバーサンプリングしてサンプリング点間を
補間し、ＰＣＭ音声データに含まれる折り返しノイズの
除去を行う。図２にそのオーバサンプリングと補間の例
が示されている。図中、丸印は元のサンプリング点を、
×印は補間したサンプリング点を示す。The PCM voice data generated by the voice synthesizer 5 is the oversampling digital filter 6
Sampling frequency fs by 8 times (8fs = 96K
Hz) to interpolate between sampling points to remove aliasing noise included in the PCM audio data. FIG. 2 shows an example of the oversampling and interpolation. In the figure, the circles indicate the original sampling points,
A cross mark indicates an interpolated sampling point.

【００３１】オーバーサンプリングデジタルフィルタ６
によってオーバーサンプリングされたＰＣＭ音声データ
は、１ビットＤ／Ａコンバータ７でΣ−△変換によっ
て、オーバーサンプリングと量子化数の削減とノイズシ
ェーピングが行なわれ、サンプリング周波数１９２ｆｓ
（＝２．３０４ＭＨｚ）のＰＷＭ波形に変換する。Oversampling Digital Filter 6
The 1-bit D / A converter 7 performs Σ-Δ conversion on the PCM audio data oversampled by the oversampling, the quantization number reduction, and the noise shaping, and the sampling frequency 192fs
Convert to PWM waveform of (= 2.304 MHz).

【００３２】ＰＷＭ波形は、ＨｉｇｈまたはＬｏｗの２
値パルス列であり、ここではパルスの幅によってレベル
を表現する。即ち、パルス幅が広いほどレベルが大き
く、狭いほど小さい。図７にＰＷＭ波によるデータ表現
例が示されている。１ビットＤ／Ａコンバータ７によっ
て生成されたＰＷＭ音声データはアナログＬＰＦ８を通
してアナログ音声波形に変換される。ここで生成される
アナログ音声波形は増幅器９によって増幅され、イヤフ
ォンジャック１０を介して外界へ出力される。The PWM waveform is either High or Low 2
A value pulse train, in which the level is expressed by the pulse width. That is, the wider the pulse width is, the higher the level is, and the narrower the pulse width is, the smaller the level is. FIG. 7 shows an example of data representation by a PWM wave. The PWM audio data generated by the 1-bit D / A converter 7 is converted into an analog audio waveform through the analog LPF 8. The analog voice waveform generated here is amplified by the amplifier 9 and output to the outside world via the earphone jack 10.

【００３３】ディジタルフィルタ６で生成されたＰＷＭ
波形は、ＩＣカードインタフェースとなるデジタル信号
インタフェース１２を経由してＰＣ／ＷＳ／ＷＰ等のホ
ストコンピュータに送られ、ここでローパスフィルタ８
と同様の内蔵コンポーネントを通してアナログ波形に変
換され、同じく内蔵の増幅器９で増幅されスピーカ経由
で再生される。PWM generated by the digital filter 6
The waveform is sent to a host computer such as a PC / WS / WP via a digital signal interface 12 serving as an IC card interface, where the low pass filter 8
Is converted into an analog waveform through the same built-in component as above, amplified by the built-in amplifier 9 and reproduced through the speaker.

【００３４】尚、図１に示す実施例で採用したオーバー
サンプリングデジタルフィルタ６と１ビットＤ／Ａコン
バータ７は、近年、ＣＤプレーヤ等デジタルオーディオ
機器のデジタル音声の再生に頻繁に用いられているコン
ポーネントであり、また、本発明実施例ではＰＷＭ波形
を出力するために１ビットＤ／Ａコンバータを用いてい
るが、デジタル音声データを２値のパルス列によるエネ
ルギー密度波の音声データに変換する他の方式、例え
ば、ＰＤＭ（Pulse DensityModulation;パルス密度変
調）波形を出力する１ビットＤ／Ａコンバータ等を用い
てもよい。The oversampling digital filter 6 and the 1-bit D / A converter 7 used in the embodiment shown in FIG. 1 are components frequently used in recent years for reproducing digital audio of digital audio equipment such as CD players. In the embodiment of the present invention, the 1-bit D / A converter is used to output the PWM waveform. However, another method of converting digital voice data into voice data of energy density wave by a binary pulse train For example, a 1-bit D / A converter that outputs a PDM (Pulse Density Modulation) waveform may be used.

【００３５】[0035]

【発明の効果】以上説明のように本発明によれば、文章
読み上げ装置で再生された音声をＰＣ／ＷＳ／ＷＰ等の
ホストコンピュータ本体に内蔵されたスピーカで再生す
る際に、汎用のＩＣカードインタフェースとなるデジタ
ルインタフェースを介してホストコンピュータに転送
し、本体側は簡単なローパスフィルタを通すだけでアナ
ログ音声を再生できるので、以下に列挙する効果が得ら
れる。（１）ＰＣ／ＷＳ／ＷＰ等のホストコンピュータ本体に
内蔵されたスピーカで読み上げ音声を再生する際に、外
部の接続ケーブルが不要であり、使い勝手が向上する。（２）ＰＣ／ＷＳ／ＷＰ等ホストコンピュータ本体側に
専用のアナログ音声入力用のインタフェースが不要にな
る。（３）アナログ音声信号を得るのに、ホストコンピュー
タ本体側に必要なのは簡単なローパスフィルタで済む。（４）ＩＣカードインタフェースから転送される音声信
号は２値のデジタルデータであるため本体側での制御が
容易である。As described above, according to the present invention, a general-purpose IC card is used when the voice reproduced by the text-to-speech device is reproduced by the speaker built in the host computer such as PC / WS / WP. Since it can be transferred to the host computer via the digital interface that serves as the interface and the main body side can reproduce the analog voice only by passing it through a simple low-pass filter, the effects listed below can be obtained. (1) An external connection cable is not required when a reading voice is reproduced by a speaker built in a host computer such as a PC / WS / WP, and usability is improved. (2) A dedicated interface for analog voice input on the host computer body side such as PC / WS / WP is not required. (3) A simple low-pass filter is required on the host computer main body side to obtain an analog audio signal. (4) Since the audio signal transferred from the IC card interface is binary digital data, it is easy to control on the main body side.

[Brief description of drawings]

【図１】本発明の実施例を示すブロック図。FIG. 1 is a block diagram showing an embodiment of the present invention.

【図２】オーバサンプリングディジタルフィルタによる
オーバサンプリングと補間を説明するための図。FIG. 2 is a diagram for explaining oversampling and interpolation by an oversampling digital filter.

【図３】ＰＷＭ波によるデータ表現の例を示す図。FIG. 3 is a diagram showing an example of data representation by a PWM wave.

【図４】本発明の文章読み上げ装置の構成を示す図。FIG. 4 is a diagram showing a configuration of a text-to-speech device of the present invention.

【図５】文章読み上げ装置の内部構成を示す図。FIG. 5 is a diagram showing an internal configuration of a text-to-speech device.

【図６】従来の文章読み上げ装置の構成を示す図。FIG. 6 is a diagram showing a configuration of a conventional text-to-speech device.

[Explanation of symbols]

１…ＣＰＵ（音声処理回路）、２…ＲＯＭ、３…ワーク
ＲＡＭ、４…ＳＲＡＭ、５…音声合成器、６…ディジタ
ルフィルタ、７…１ビットＤ／Ａコンバータ、８…ロー
パスフィルタ、９…増幅器、１０…イヤホン、１１…内
部バス、１２…ＩＣカードインタフェース。1 ... CPU (voice processing circuit), 2 ... ROM, 3 ... Work RAM, 4 ... SRAM, 5 ... Voice synthesizer, 6 ... Digital filter, 7 ... 1-bit D / A converter, 8 ... Low-pass filter, 9 ... Amplifier 10 ... Earphone, 11 ... Internal bus, 12 ... IC card interface.

Claims

[Claims]

1. A computer main body for generating text data, a sentence / speech synthesizer for linguistically analyzing the text data generated by the computer main body for conversion into digital voice data, and an energy density of the digital voice data by a binary pulse train. A 1-bit D / A converter for converting into wave sound data, and a data transfer device for outputting a binary pulse train converted by the 1-bit D / A converter through a digital interface. Sentence reading device.

2. The text-to-speech device according to claim 1, wherein the computer has means for reproducing sound including an amplifier and a speaker.

3. The sentence voice synthesizing device performs language processing for reading aloud text data sent from a computer according to a program stored in a built-in memory, with reference to a built-in language processing dictionary. A speech processing circuit that obtains a speech symbol string, extracts the speech segment from the speech symbol string by referring to the built-in speech segment data, extracts the necessary parameters, and outputs the parameters to the speech synthesizer. A voice synthesizer that performs digital signal processing on the parameters generated by the processing circuit to convert the digital voice data, and digital voice data generated by the voice generator is oversampled to interpolate between sampling points and included in the voice data. Digital audio data is removed by Σ-Δ conversion and a digital filter that removes aliasing noise. 1 bit D converting the pulse train
/ A converter, a data transfer device for outputting the binary pulse train converted by the 1-bit D / A converter through a digital interface, and the binary pulse train converted by the 1-bit D / A converter. The text-to-speech device according to claim 1, further comprising a low-pass filter for converting into an analog voice signal, an amplifier, and an analog signal output connector.

4. The voice processing circuit, voice synthesizer, digital filter and 1-bit D / A converter are
4. The text reading system according to claim 3, wherein digital voice data integrated in a chip LSI and generated by a 1-bit D / A converter is supplied to a computer having a resending function through an interface conforming to a certain industry standard. apparatus.