JP7326879B2

JP7326879B2 - Semiconductor devices, electronic devices and moving bodies

Info

Publication number: JP7326879B2
Application number: JP2019101042A
Authority: JP
Inventors: 貴夫片山; 文仁倍賞; テリムリカード
Original assignee: Seiko Epson Corp
Current assignee: Seiko Epson Corp
Priority date: 2019-05-30
Filing date: 2019-05-30
Publication date: 2023-08-16
Anticipated expiration: 2039-05-30
Also published as: JP2020194139A

Description

本発明は、半導体装置、電子機器及び移動体に関する。 The present invention relates to semiconductor devices, electronic devices, and mobile bodies.

特許文献１には、リサンプリングにより音声データのピッチを変更した後、時間スケール変換（ＴＳＭ：Time Scale Modification）により時間スケールを元に戻す装置が記載されている。 Patent Literature 1 describes a device that restores the original time scale by time scale modification (TSM) after changing the pitch of audio data by resampling.

特開平９－３２５７９４号公報JP-A-9-325794

しかしながら、特許文献１に記載の装置では、リサンプリング及び時間スケール変換をソフトウェア処理で行うため、処理能力の高いＤＳＰが必要となり、回路規模が大きくなってしまう。 However, in the device described in Patent Document 1, resampling and time scale conversion are performed by software processing, so a DSP with high processing capability is required, resulting in a large circuit scale.

本発明に係る半導体装置の一態様は、
第１の周期でサンプリングされた複数のサンプルを含む第１の音声データを格納する音
声データ格納部と、
第１のクロック信号及び第２のクロック信号を生成するクロック生成部と、
前記第１のクロック信号に同期して、前記第１の音声データを、前記第１の音声データ
に含まれる前記複数のサンプルの一部を削除又は複製した第２の音声データに変換する時
間スケール変換部と、
前記第２のクロック信号に同期して、前記第２の音声データを、前記第１の周期でサン
プリングされた、前記第２の音声データと異なる数の複数のサンプルを含む第３の音声デ
ータに変換する専用回路であるリサンプラーと、
前記時間スケール変換部及び前記リサンプラーの動作を制御する制御情報を格納するレ
ジスターと、を含む。
One aspect of the semiconductor device according to the present invention is
an audio data storage unit that stores first audio data including a plurality of samples sampled in a first period;
a clock generator that generates a first clock signal and a second clock signal;
A time scale for converting the first audio data into second audio data in which part of the plurality of samples included in the first audio data are deleted or duplicated in synchronization with the first clock signal. a conversion unit;
The second audio data is sampled at the first period in synchronization with the second clock signal, and the third audio data includes a plurality of samples of a number different from that of the second audio data. A resampler, which is a dedicated circuit that converts to
and a register storing control information for controlling the operation of the time scale converter and the resampler.

前記半導体装置の一態様において、
前記第３の音声データは、前記第１の音声データのピッチが変更された、前記第１の音声データと同じサンプル数のデータであってもよい。 In one aspect of the semiconductor device,
The third audio data may be data having the same number of samples as the first audio data, in which the pitch of the first audio data is changed.

前記半導体装置の一態様において、
前記音声データ格納部、前記クロック生成部、前記時間スケール変換部、前記リサンプラー及び前記レジスターは、１チップの集積回路装置に含まれてもよい。 In one aspect of the semiconductor device,
The audio data storage unit, the clock generation unit, the time scale conversion unit, the resampler and the register may be included in a one-chip integrated circuit device.

前記半導体装置の一態様において、
前記制御情報は、前記半導体装置の動作モードを、第１の動作モード、第２の動作モード及び第３の動作モードを含む複数の動作モードのいずれかに設定する情報を含み、
前記第１の動作モードは、前記第１の音声データの再生時間を変更せずにピッチを変更する動作モードであり、
前記第２の動作モードは、前記第１の音声データのピッチを変更せずに再生時間を変更する動作モードであり、
前記第３の動作モードは、前記第１の音声データの再生時間及びピッチを変更する動作モードであってもよい。 In one aspect of the semiconductor device,
the control information includes information for setting the operation mode of the semiconductor device to one of a plurality of operation modes including a first operation mode, a second operation mode, and a third operation mode;
the first operation mode is an operation mode in which the pitch is changed without changing the reproduction time of the first audio data;
the second operation mode is an operation mode in which the reproduction time is changed without changing the pitch of the first audio data;
The third operation mode may be an operation mode for changing the reproduction time and pitch of the first audio data.

前記半導体装置の一態様は、
前記第３の音声データをアナログ信号に変換するＤ／Ａ変換器を含んでもよい。 One aspect of the semiconductor device is
A D/A converter that converts the third audio data into an analog signal may be included.

本発明に係る電子機器の一態様は、
前記半導体装置の一態様と、
前記半導体装置からの出力信号に基づいて音声を出力する音出力部と、を備えている。 One aspect of the electronic device according to the present invention is
an aspect of the semiconductor device;
and a sound output unit that outputs sound based on the output signal from the semiconductor device.

本発明に係る移動体の一態様は、
前記半導体装置の一態様と、
前記半導体装置からの出力信号に基づいて音声を出力する音出力部と、を備えている。 One aspect of the moving object according to the present invention is
an aspect of the semiconductor device;
and a sound output unit that outputs sound based on the output signal from the semiconductor device.

第１実施形態の半導体装置の構成例を示す図。1 is a diagram showing a configuration example of a semiconductor device according to a first embodiment; FIG. ピッチを下げる場合の音声データの変換例を示す図。FIG. 10 is a diagram showing an example of conversion of voice data when pitch is lowered; 図２の例におけるリサンプリングの詳細について説明するための図。FIG. 3 is a diagram for explaining details of resampling in the example of FIG. 2; ピッチを上げる場合の音声データの変換例を示す図。FIG. 10 is a diagram showing an example of conversion of voice data when raising the pitch; 図４の例におけるリサンプリングの詳細について説明するための図。FIG. 5 is a diagram for explaining the details of resampling in the example of FIG. 4; 半導体装置の具体的な実施例を示す図。FIG. 10 is a diagram showing a specific example of a semiconductor device; 第２実施形態の半導体装置の構成例を示す図。FIG. 4 is a diagram showing a configuration example of a semiconductor device according to a second embodiment; ピッチを下げるとともに再生時間を短縮する場合の音声データの変換例を示す図。FIG. 10 is a diagram showing an example of conversion of audio data when pitch is lowered and playback time is shortened; ピッチを上げるとともに再生時間を伸長する場合の音声データの変換例を示す図。FIG. 10 is a diagram showing an example of conversion of audio data in the case of increasing the pitch and extending the playback time; 変形例の半導体装置の構成を示す図。FIG. 10 is a diagram showing a configuration of a semiconductor device of a modified example; 変形例においてピッチを下げる場合の音声データの変換例を示す図。The figure which shows the conversion example of the audio|voice data in the case of lowering a pitch in a modification. 変形例においてピッチを上げる場合の音声データの変換例を示す図。The figure which shows the conversion example of the audio|voice data in the case of raising a pitch in a modification. 本実施形態の電子機器の機能ブロック図。FIG. 2 is a functional block diagram of the electronic device according to the embodiment; 本実施形態の電子機器の外観の一例を示す図。1A and 1B are diagrams illustrating an example of an appearance of an electronic device according to an embodiment; FIG. 本実施形態の移動体の一例を示す図。The figure which shows an example of the moving body of this embodiment.

以下、本発明の好適な実施形態について図面を用いて詳細に説明する。なお、以下に説明する実施の形態は、特許請求の範囲に記載された本発明の内容を不当に限定するものではない。また以下で説明される構成の全てが本発明の必須構成要件であるとは限らない。 Preferred embodiments of the present invention will be described in detail below with reference to the drawings. It should be noted that the embodiments described below do not unduly limit the content of the present invention described in the claims. Moreover, not all the configurations described below are essential constituent elements of the present invention.

１．半導体装置
１－１．第１実施形態
図１は、第１実施形態の半導体装置の構成例を示す図である。図１に示すように、第１実施形態の半導体装置１は、音声データ格納部１０と、クロック生成部２０と、時間スケール変換部３０と、リサンプラー４０と、レジスター５０と、を含む。 1. Semiconductor device 1-1. First Embodiment FIG. 1 is a diagram showing a configuration example of a semiconductor device according to a first embodiment. As shown in FIG. 1, the semiconductor device 1 of the first embodiment includes an audio data storage section 10, a clock generation section 20, a time scale conversion section 30, a resampler 40, and a register 50.

音声データ格納部１０は、第１の周期Ｔｓ１でサンプリングされた複数のサンプルを含む第１の音声データＤ１を格納する。音声データ格納部１０は、あらかじめ第１の音声データＤ１が格納された不揮発性メモリーであってもよいし、例えば、半導体装置１が不図示のインターフェース回路を介して外部装置から受信した第１の音声データＤ１を一時的に格納する揮発性メモリーであってもよい。第１の音声データＤ１は、例えば、パルス符号変調（ＰＣＭ：Pulse Code Modulation）された音声データであってもよい。また、音声データ格納部１０のサイズを低減させるために、第１の音声データＤ１は、適応的差分パルス符号変調（ＡＤＰＣＭ：adaptive differential pulse code modulation）された音声データ等の圧縮された音声データであってもよい。第１の周期Ｔｓ１は、第１の音声データＤ１のサンプリング周波数の逆数であり、当該サンプリング周波数は、所定の音声帯域の上限周波数の２倍以上の周波数であればよく、例えば、１５．６２５ｋＨｚであってもよい。 The audio data storage unit 10 stores first audio data D1 including a plurality of samples sampled at the first period Ts1. The audio data storage unit 10 may be a non-volatile memory in which the first audio data D1 is stored in advance, or for example, the first audio data received by the semiconductor device 1 from an external device via an interface circuit (not shown). A volatile memory that temporarily stores the audio data D1 may be used. The first audio data D1 may be, for example, pulse code modulated (PCM) audio data. Also, in order to reduce the size of the audio data storage unit 10, the first audio data D1 is compressed audio data such as adaptive differential pulse code modulated (ADPCM) audio data. There may be. The first period Ts1 is the reciprocal of the sampling frequency of the first audio data D1, and the sampling frequency may be at least twice the upper limit frequency of a predetermined audio band, for example, 15.625 kHz. There may be.

クロック生成部２０は、第１のクロック信号ＣＫ１及び第２のクロック信号ＣＫ２を生成する。第１のクロック信号ＣＫ１は時間スケール変換部（ＴＳＭ部）３０を動作させるためのクロック信号であり、第２のクロック信号ＣＫ２は時間スケール変換部３０を動作させるためのクロック信号である。クロック生成部２０は、例えば、水晶発振回路、リングオシレーター、ＣＲ発振回路等の不図示の発振回路を有し、当該発振回路の出力信号に基づいて、第１のクロック信号ＣＫ１及び第２のクロック信号ＣＫ２を生成してもよい。 The clock generator 20 generates a first clock signal CK1 and a second clock signal CK2. The first clock signal CK1 is a clock signal for operating the time scale conversion section (TSM section) 30, and the second clock signal CK2 is a clock signal for operating the time scale conversion section 30. FIG. The clock generation unit 20 has an oscillation circuit (not shown) such as a crystal oscillation circuit, a ring oscillator, a CR oscillation circuit, etc., and generates a first clock signal CK1 and a second clock signal CK1 based on the output signal of the oscillation circuit. A signal CK2 may be generated.

時間スケール変換部３０は、第１のクロック信号ＣＫ１に同期して、第１の音声データＤ１を、第１の音声データＤ１に含まれる複数のサンプルの一部を削除又は複製した第２の音声データＤ２に変換する。例えば、削除又は複製されるサンプルは、周期的に繰り返される波形を構成する複数のサンプルの一部であってもよい。第２の音声データＤ２は、第１の音声データＤ１と同じピッチであって、第１の音声データＤ１とは再生速度あるいは再生時間が異なる音声データである。第１の音声データＤ１を第２の音声データＤ２に変換する処理、すなわち、いわゆる時間スケール変換処理には公知の各種の手法を適用可能である。例えば、時間スケール変換部３０は、ソフトウェア処理により時間スケール変換処理を行ってもよい。なお、第１の音声データＤ１が圧縮された音声データである場合は、時間スケール変換部３０は、不図示の音声データ展開部により展開された第１の音声データＤ１に対して時間スケール変換処理を行ってもよい。 In synchronization with the first clock signal CK1, the time scale conversion unit 30 transforms the first audio data D1 into a second audio data obtained by deleting or duplicating some of the plurality of samples included in the first audio data D1. Convert to data D2. For example, the deleted or duplicated sample may be part of a plurality of samples that make up a periodically repeating waveform. The second audio data D2 is audio data having the same pitch as the first audio data D1, but different in reproduction speed or reproduction time from the first audio data D1. Various known techniques can be applied to the process of converting the first audio data D1 into the second audio data D2, that is, the so-called time scale conversion process. For example, the time scale conversion unit 30 may perform time scale conversion processing by software processing. When the first audio data D1 is compressed audio data, the time scale conversion unit 30 performs time scale conversion processing on the first audio data D1 expanded by an audio data expansion unit (not shown). may be performed.

リサンプラー４０は、第２のクロック信号ＣＫ２に同期して、ハードウェア処理により、第２の音声データＤ２を、第１の周期Ｔｓ１でサンプリングされた、第２の音声データＤ２と異なる数の複数のサンプルを含む第３の音声データＤ３に変換する。具体的には、リサンプラー４０は、第２の音声データＤ２に含まれる複数のサンプルを第１の周期Ｔｓ１とは異なる第２の周期Ｔｓ２でサンプリングされたものと仮定して、当該複数のサンプルに基づく補間処理及び/又は間引き処理により、第１の周期Ｔｓ１でサンプリングされた複数のサンプルを求めて第３の音声データＤ３とする。補間処理及び/又は間引き処理には公知の各種の手法を適用可能である。 In synchronization with the second clock signal CK2, the resampler 40 performs hardware processing to sample the second audio data D2 at the first period Ts1, and samples the second audio data D2 in a number different from that of the second audio data D2. is converted into third audio data D3 containing samples of . Specifically, the resampler 40 assumes that a plurality of samples included in the second audio data D2 are sampled at a second period Ts2 different from the first period Ts1. A plurality of samples sampled at the first cycle Ts1 are obtained by interpolation processing and/or thinning processing based on , and are used as third audio data D3. Various known techniques can be applied to interpolation processing and/or thinning processing.

本実施形態では、第３の音声データＤ３は、第１の音声データＤ１のピッチが変更された、第１の音声データＤ１と同じサンプル数のデータである。換言すれば、第３の音声データＤ３は、第１の音声データＤ１とピッチが異なり、再生時間あるいは再生速度が同じ音声データである。 In this embodiment, the third audio data D3 is data with the same number of samples as the first audio data D1, in which the pitch of the first audio data D1 is changed. In other words, the third audio data D3 is audio data with a different pitch and the same reproduction time or reproduction speed as the first audio data D1.

レジスター５０は、時間スケール変換部３０及びリサンプラー４０の動作を制御する制御情報を格納する。例えば、制御情報は、時間スケール変換部３０及びリサンプラー４０に処理を開始させるためのスタートビットＳＴを含んでもよい。時間スケール変換部３０及びリサンプラー４０は、スタートビットＳＴが非アクティブからアクティブに変化すると処理を開始し、処理を終了すると停止する。 The register 50 stores control information for controlling the operations of the time scale converter 30 and the resampler 40 . For example, the control information may include a start bit ST for causing the time scale converter 30 and resampler 40 to start processing. The time scale converter 30 and the resampler 40 start processing when the start bit ST changes from inactive to active, and stop when the processing ends.

また、制御情報は、第２の音声データＤ２のサンプル数と第１の音声データＤ１のサンプル数との比率を設定する比率情報ＳＲを含み、時間スケール変換部３０及びリサンプラー４０に比率情報ＳＲが供給されてもよい。例えば、比率情報ＳＲが、第２の音声データＤ２のサンプル数が第１の音声データＤ１のサンプル数の７５％であることを示す場合、時間スケール変換部３０は、第１の音声データＤ１を、第１の音声データＤ１に含まれるＮ個のサンプルの２５％を削除したＮ×０．７５個のサンプルを含む第２の音声データＤ２に変換する。そして、リサンプラー４０は、第２の音声データＤ２に含まれるＮ×０．７５個のサンプルを第１の周期Ｔｓ１の約１３３％である第２の周期Ｔｓ２でサンプリングされたものと仮定して、Ｎ×０．７５個のサンプルに基づく補間処理及び/又は間引き処理により、第１の周期Ｔｓ１でサンプリングされたＮ個のサンプルを求めて第３の音声データＤ３とする。 The control information also includes ratio information SR for setting the ratio between the number of samples of the second audio data D2 and the number of samples of the first audio data D1, and the ratio information SR is sent to the time scale conversion unit 30 and the resampler 40. may be supplied. For example, when the ratio information SR indicates that the number of samples of the second audio data D2 is 75% of the number of samples of the first audio data D1, the time scale conversion unit 30 converts the first audio data D1 into , 25% of the N samples included in the first audio data D1 are deleted and converted into second audio data D2 including N×0.75 samples. Assuming that the resampler 40 samples N×0.75 samples included in the second audio data D2 at a second period Ts2 that is approximately 133% of the first period Ts1, , N×0.75 samples, N samples sampled at the first cycle Ts1 are obtained as third audio data D3.

また、半導体装置１は、Ｄ／Ａ（Digital to Analog）変換器６０を含んでもよい。半導体装置１がＤ／Ａ変換器６０を含む場合、クロック生成部２０は、Ｄ／Ａ変換器６０を動作させるための第３のクロック信号ＣＫ３を生成する。 The semiconductor device 1 may also include a D/A (Digital to Analog) converter 60 . When the semiconductor device 1 includes the D/A converter 60, the clock generator 20 generates the third clock signal CK3 for operating the D/A converter 60. FIG.

Ｄ／Ａ変換器６０は、第３の音声データＤ３をアナログ信号である音声信号ＶＯに変換する。具体的には、Ｄ／Ａ変換器６０は、第３のクロック信号ＣＫ３に同期して、第１の周期Ｔｓ１で第３の音声データＤ３に含まれる複数のサンプルをサンプリングしてアナログ値に変換することにより音声信号ＶＯを生成する。スタートビットＳＴはＤ／Ａ変換器６０にも供給され、Ｄ／Ａ変換器６０は、スタートビットＳＴが非アクティブからアクティブに変化すると処理を開始し、処理を終了すると停止する。 The D/A converter 60 converts the third audio data D3 into an analog audio signal VO. Specifically, the D/A converter 60 samples a plurality of samples included in the third audio data D3 at a first period Ts1 in synchronization with the third clock signal CK3 and converts them into analog values. to generate the audio signal VO. The start bit ST is also supplied to the D/A converter 60, and the D/A converter 60 starts processing when the start bit ST changes from inactive to active, and stops when processing ends.

なお、半導体装置１は、リサンプラー４０におけるリサンプリングによって発生するエリアシングノイズを低減させる必要がある場合は、ローパスフィルター７０を含んでもよい。リサンプラー４０におけるリサンプリングがダウンサンプリングである場合は、ローパスフィルター７０は、リサンプラー４０の前段、具体的には時間スケール変換部３０の出力からリサンプラー４０の入力に至る信号経路に設けられ、必要に応じてリサンプラー４０の後段、具体的にはリサンプラー４０の出力からＤ／Ａ変換器６０の入力に至る信号経路にも設けられる。また、リサンプラー４０におけるリサンプリングがアップサンプリングである場合は、ローパスフィルター７０は、リサンプラー４０の後段、具体的にはリサンプラー４０の出力からＤ／Ａ変換器６０の入力に至る信号経路に設けられる。半導体装置１がローパスフィルター７０を含む場合、クロック生成部２０は、ローパスフィルター７０を動作させるための第４のクロック信号ＣＫ４を生成し、スタートビットＳＴはローパスフィルター７０にも供給される。そして、ローパスフィルター７０は、スタートビットＳＴが非アクティブからアクティブに変化すると第４のクロック信号ＣＫ４に同期した処理を開始し、処理を終了すると停止する。 Note that the semiconductor device 1 may include a low-pass filter 70 when it is necessary to reduce aliasing noise generated by resampling in the resampler 40 . When the resampling in the resampler 40 is downsampling, the low-pass filter 70 is provided in the preceding stage of the resampler 40, specifically in the signal path from the output of the time scale conversion unit 30 to the input of the resampler 40, It is also provided in a signal path after the resampler 40 , specifically, from the output of the resampler 40 to the input of the D/A converter 60 as necessary. Also, when the resampling in the resampler 40 is upsampling, the low-pass filter 70 is placed in the signal path after the resampler 40, specifically from the output of the resampler 40 to the input of the D/A converter 60. be provided. When the semiconductor device 1 includes the low-pass filter 70 , the clock generator 20 generates a fourth clock signal CK4 for operating the low-pass filter 70 and the start bit ST is also supplied to the low-pass filter 70 . The low-pass filter 70 starts processing in synchronization with the fourth clock signal CK4 when the start bit ST changes from inactive to active, and stops when the processing ends.

なお、クロック生成部２０は、スタートビットＳＴが非アクティブからアクティブに変化すると、第１のクロック信号ＣＫ１、第２のクロック信号ＣＫ２、第３のクロック信号ＣＫ３及び第４のクロック信号ＣＫ４を生成し、所定の時間が経過すると、クロック信号の生成を停止してもよい。この場合、スタートビットＳＴは、時間スケール変換部３０及びリサンプラー４０、Ｄ／Ａ変換器６０及びローパスフィルター７０に供給されなくてもよい。 Note that the clock generator 20 generates the first clock signal CK1, the second clock signal CK2, the third clock signal CK3, and the fourth clock signal CK4 when the start bit ST changes from inactive to active. , the generation of the clock signal may be stopped after a predetermined period of time. In this case, the start bit ST may not be supplied to the time scale converter 30, resampler 40, D/A converter 60 and low-pass filter 70. FIG.

また、クロック生成部２０が生成する第１のクロック信号ＣＫ１、第２のクロック信号ＣＫ２、第３のクロック信号ＣＫ３及び第４のクロック信号ＣＫ４のうちの複数のクロック信号が共通であってもよい。 Further, a plurality of clock signals among the first clock signal CK1, the second clock signal CK2, the third clock signal CK3, and the fourth clock signal CK4 generated by the clock generation unit 20 may be common. .

以下では、時間スケール変換部３０が行う処理を「時間スケール変換」という。また、リサンプラー４０が行う処理を「リサンプリング」という。また、Ｄ／Ａ変換器６０が行う処理を「Ｄ／Ａ変換」という。 Hereinafter, the processing performed by the time scale conversion unit 30 is referred to as "time scale conversion". The processing performed by the resampler 40 is called "resampling". Further, the processing performed by the D/A converter 60 is called "D/A conversion".

図２は、第１の音声データＤ１のピッチを下げる場合の第２の音声データＤ２、第３の音声データＤ３及び音声信号ＶＯの一例を示す図である。図２の例では、第１の音声データＤ１は、一定時間に同じ正弦波が４回繰り返される一定周波数の音声データであり、その再生時間はＴ１である。また、比率情報ＳＲは７５％に設定されている。 FIG. 2 is a diagram showing an example of the second audio data D2, the third audio data D3, and the audio signal VO when lowering the pitch of the first audio data D1. In the example of FIG. 2, the first audio data D1 is audio data of a constant frequency in which the same sine wave is repeated four times in a constant period of time, and its reproduction time is T1. Also, the ratio information SR is set to 75%.

まず、時間スケール変換によりＮ個のサンプルを含む第１の音声データＤ１の１つの正弦波が削除され、同じ正弦波が３回繰り返される一定周波数の第２の音声データＤ２が生成される。第２の音声データＤ２のサンプル数はＮ×０．７５であり、第２の音声データＤ２の再生時間Ｔ２は、第１の音声データＤ１の再生時間Ｔ１の７５％に短縮されている。これに対して、第２の音声データＤ２に含まれる正弦波の周波数は第１の音声データＤ１に含まれる正弦波の周波数と同じなので、時間スケール変換の前後でピッチは変わらない。すなわち、第２の音声データＤ２は、第１の音声データＤ１に対して、ピッチが同じであって再生時間が７５％に短縮された音声データである。 First, one sine wave of the first audio data D1 containing N samples is removed by time scale conversion to generate the second audio data D2 of constant frequency in which the same sine wave is repeated three times. The number of samples of the second audio data D2 is N×0.75, and the reproduction time T2 of the second audio data D2 is shortened to 75% of the reproduction time T1 of the first audio data D1. On the other hand, since the frequency of the sine wave included in the second audio data D2 is the same as the frequency of the sine wave included in the first audio data D1, the pitch does not change before and after the time scale conversion. That is, the second audio data D2 is audio data having the same pitch as the first audio data D1 and having a playback time shortened to 75%.

次に、リサンプリングにより第２の音声データＤ２のサンプル数の比率情報ＳＲの逆数である約１．３３倍のサンプルを含む第３の音声データＤ３が生成される。図３は、図２の例におけるリサンプリングの詳細について説明するための図である。図３では、第２の音声データＤ２の一部に対応して生成される第３の音声データＤ３の一部が図示されている。図３の例では、リサンプラー４０は、第２の音声データＤ２に含まれるサンプルＡ１～Ａ５等を第１の周期Ｔｓ１の約１３３％である第２の周期Ｔｓ２でサンプリングされたものと仮定して、補間処理及び/又は間引き処理により、第１の周期Ｔｓ１でサンプリングされたサンプルＢ１～Ｂ６等を求めて第３の音声データＤ３を生成する。 Next, resampling generates the third audio data D3 containing about 1.33 times as many samples as the reciprocal of the ratio information SR of the number of samples of the second audio data D2. FIG. 3 is a diagram for explaining details of resampling in the example of FIG. FIG. 3 shows part of the third audio data D3 generated corresponding to part of the second audio data D2. In the example of FIG. 3, resampler 40 assumes that samples A1 to A5, etc. included in second audio data D2 are sampled at second period Ts2, which is approximately 133% of first period Ts1. Then, by interpolation processing and/or thinning processing, the samples B1 to B6 sampled at the first period Ts1 are obtained to generate the third audio data D3.

図２に示すように、第３の音声データＤ３の再生時間は、第２の音声データＤ２の再生時間Ｔ２の約１３３％であり、第１の音声データＤ１の再生時間Ｔ１と同じである。これに対して、第３の音声データＤ３に含まれる正弦波の周波数は、第２の音声データＤ２に含まれる正弦波の周波数、すなわち第１の音声データＤ１に含まれる正弦波の周波数の７５％である。すなわち、第３の音声データＤ３は、第１の音声データＤ１に対して、再生時間が同じであり、ピッチが７５％に下がった音声データである。 As shown in FIG. 2, the reproduction time of the third audio data D3 is approximately 133% of the reproduction time T2 of the second audio data D2, which is the same as the reproduction time T1 of the first audio data D1. On the other hand, the frequency of the sine wave included in the third audio data D3 is 75 times the frequency of the sine wave included in the second audio data D2, that is, the frequency of the sine wave included in the first audio data D1. %. That is, the third audio data D3 is audio data whose reproduction time is the same as that of the first audio data D1 and whose pitch is reduced to 75%.

最後に、Ｄ／Ａ変換により、第３の音声データＤ３の再生時間、すなわち第１の音声データＤ１の再生時間Ｔ１と同じ再生時間の音声信号ＶＯが生成される。 Finally, by D/A conversion, an audio signal VO having the same reproduction time as the reproduction time of the third audio data D3, that is, the reproduction time T1 of the first audio data D1 is generated.

図４は、第１の音声データＤ１のピッチを上げる場合の第２の音声データＤ２、第３の音声データＤ３及び音声信号ＶＯの一例を示す図である。図４の例では、第１の音声データＤ１は、一定時間に同じ正弦波が４回繰り返される一定周波数の音声データであり、その再生時間はＴ１である。また、比率情報ＳＲは１２５％に設定されている。 FIG. 4 is a diagram showing an example of the second audio data D2, the third audio data D3, and the audio signal VO when the pitch of the first audio data D1 is raised. In the example of FIG. 4, the first audio data D1 is audio data of a constant frequency in which the same sine wave is repeated four times in a constant period of time, and its reproduction time is T1. Also, the ratio information SR is set to 125%.

まず、時間スケール変換によりＮ個のサンプルを含む第１の音声データＤ１の１つの正弦波が複製され、同じ正弦波が５回繰り返される一定周波数の第２の音声データＤ２が生成される。第２の音声データＤ２のサンプル数はＮ×１．２５であり、第２の音声データＤ２の再生時間Ｔ２は、第１の音声データＤ１の再生時間Ｔ１の１２５％に伸長されている。これに対して、第２の音声データＤ２に含まれる正弦波の周波数は第１の音声データＤ１に含まれる正弦波と同じなので、時間スケール変換の前後でピッチは変わらない。すなわち、第２の音声データＤ２は、第１の音声データＤ１に対して、ピッチが同じであって再生時間が１２５％に伸長された音声データである。 First, one sine wave of the first audio data D1 containing N samples is replicated by time scale conversion to generate the second audio data D2 of constant frequency in which the same sine wave is repeated five times. The number of samples of the second audio data D2 is N×1.25, and the reproduction time T2 of the second audio data D2 is extended to 125% of the reproduction time T1 of the first audio data D1. On the other hand, since the sine wave contained in the second audio data D2 has the same frequency as the sine wave contained in the first audio data D1, the pitch does not change before and after the time scale conversion. In other words, the second audio data D2 is audio data having the same pitch as the first audio data D1 and having the reproduction time extended by 125%.

次に、リサンプリングにより第２の音声データＤ２のサンプル数の比率情報ＳＲの逆数である０．８倍のサンプルを含む第３の音声データＤ３が生成される。図５は、図４の例におけるリサンプリングの詳細について説明するための図である。図５では、第２の音声データＤ２の一部に対応して生成される第３の音声データＤ３の一部が図示されている。図５の例では、リサンプラー４０は、第２の音声データＤ２に含まれるサンプルＡ１～Ａ５等を第１の周期Ｔｓ１の８０％である第２の周期Ｔｓ２でサンプリングされたものと仮定して、補間処理及び/又は間引き処理により、第１の周期Ｔｓ１でサンプリングされたサンプルＢ１～Ｂ４等を求めて第３の音声データＤ３を生成する。 Next, resampling generates the third audio data D3 containing 0.8 times as many samples as the reciprocal of the ratio information SR of the number of samples of the second audio data D2. FIG. 5 is a diagram for explaining details of resampling in the example of FIG. FIG. 5 shows part of the third audio data D3 generated corresponding to part of the second audio data D2. In the example of FIG. 5, the resampler 40 assumes that the samples A1 to A5, etc. included in the second audio data D2 are sampled at a second period Ts2 that is 80% of the first period Ts1. , interpolation processing and/or decimation processing, the samples B1 to B4, etc. sampled at the first period Ts1 are obtained to generate the third audio data D3.

図４に示すように、第３の音声データＤ３の再生時間は、第２の音声データＤ２の再生時間Ｔ２の８０％であり、第１の音声データＤ１の再生時間Ｔ１と同じである。これに対して、第３の音声データＤ３に含まれる正弦波の周波数は、第２の音声データＤ２に含まれる正弦波の周波数、すなわち第１の音声データＤ１に含まれる正弦波の周波数の１２５％である。すなわち、第３の音声データＤ３は、第１の音声データＤ１に対して、再生時間が同じであり、ピッチが１２５％に上がった音声データである。 As shown in FIG. 4, the reproduction time of the third audio data D3 is 80% of the reproduction time T2 of the second audio data D2 and is the same as the reproduction time T1 of the first audio data D1. On the other hand, the frequency of the sine wave contained in the third audio data D3 is 125 times the frequency of the sine wave contained in the second audio data D2, that is, the frequency of the sine wave contained in the first audio data D1. %. That is, the third audio data D3 is audio data whose reproduction time is the same as that of the first audio data D1 and whose pitch is raised to 125%.

図６は、半導体装置１の具体的な実施例を示す図である。図６において、図１と同様の構成要素には同じ符号が付されている。図６の例では、半導体装置１は、音声データ格納部１０、クロック生成部２０、時間スケール変換部３０、リサンプラー４０、レジスター５０、Ｄ／Ａ変換器６０、ローパスフィルター７０、ＲＯＭ（Read Only Memory）１１０、ＲＡＭ（Random Access Memory）１２０、プロセッサー１３０及びバス２００を含む、１チップの集積回路装置である。 FIG. 6 is a diagram showing a specific embodiment of the semiconductor device 1. As shown in FIG. In FIG. 6, the same reference numerals are assigned to the same components as in FIG. In the example of FIG. 6, the semiconductor device 1 includes an audio data storage unit 10, a clock generation unit 20, a time scale conversion unit 30, a resampler 40, a register 50, a D/A converter 60, a low pass filter 70, a ROM (Read Only Memory) 110 , RAM (Random Access Memory) 120 , processor 130 and bus 200 .

音声データ格納部１０、時間スケール変換部３０、リサンプラー４０、レジスター５０、Ｄ／Ａ変換器６０、ローパスフィルター７０、ＲＯＭ１１０、ＲＡＭ１２０及びプロセッサー１３０は、バス２００に接続されている。 Audio data storage unit 10 , time scale conversion unit 30 , resampler 40 , register 50 , D/A converter 60 , low pass filter 70 , ROM 110 , RAM 120 and processor 130 are connected to bus 200 .

音声データ格納部１０は、前述の第１の音声データＤ１を格納する。音声データ格納部１０は、あらかじめ第１の音声データＤ１が格納された不揮発性メモリーであってもよいし、例えば、半導体装置１が不図示のインターフェース回路を介して外部装置から受信した第１の音声データＤ１を一時的に格納する揮発性メモリーであってもよい。 The audio data storage unit 10 stores the above-described first audio data D1. The audio data storage unit 10 may be a non-volatile memory in which the first audio data D1 is stored in advance, or for example, the first audio data received by the semiconductor device 1 from an external device via an interface circuit (not shown). A volatile memory that temporarily stores the audio data D1 may be used.

クロック生成部２０は、前述の第１のクロック信号ＣＫ１、第２のクロック信号ＣＫ２、第３のクロック信号ＣＫ３及び第４のクロック信号ＣＫ４を生成する。さらに、クロック生成部２０は、第６のクロック信号ＣＫ６を生成する。 The clock generator 20 generates the first clock signal CK1, the second clock signal CK2, the third clock signal CK3 and the fourth clock signal CK4. Furthermore, the clock generator 20 generates a sixth clock signal CK6.

時間スケール変換部３０は、マイクロコントロールユニット（ＭＣＵ：Micro Control Unit）３１及びデジタル信号プロセッサー（ＤＳＰ：Digital Signal Processor）３２を含む。マイクロコントロールユニット３１は、バス２００を介して、ＲＯＭ１１０に記憶されている時間スケール変換プログラム１１１を取得して実行する。デジタル信号プロセッサー３２は、マイクロコントロールユニット３１の命令に従って動作し、バス２００を介して、音声データ格納部１０に格納されている第１の音声データＤ１あるいはＲＡＭ１２０に記憶されている第２の音声データＤ２を取得して前述の時間スケール変換を行って第２の音声データＤ２を生成する。そして、デジタル信号プロセッサー３２は、生成した第２の音声データＤ２を、バス２００を介してＲＡＭ１２０に記憶させる。このように、時間スケール変換部３０は、時間スケール変換プログラム１１１に基づくソフトウェア処理により、時間スケール変換を行う。 The time scale conversion section 30 includes a micro control unit (MCU) 31 and a digital signal processor (DSP) 32 . The micro control unit 31 acquires and executes the time scale conversion program 111 stored in the ROM 110 via the bus 200 . The digital signal processor 32 operates according to instructions from the micro control unit 31, and transmits the first audio data D1 stored in the audio data storage unit 10 or the second audio data stored in the RAM 120 via the bus 200. D2 is obtained and the time scale conversion described above is performed to generate second audio data D2. Digital signal processor 32 then causes RAM 120 to store the generated second audio data D2 via bus 200 . Thus, the time scale conversion unit 30 performs time scale conversion by software processing based on the time scale conversion program 111 .

リサンプラー４０は、バス２００を介して、ＲＡＭ１２０に記憶されている第２の音声データＤ２を取得して前述のリサンプリングを行って第３の音声データＤ３を生成する。そして、リサンプラー４０は、生成した第３の音声データＤ３を、バス２００を介してＲＡＭ１２０に記憶させる。リサンプラー４０は、時間スケール変換部３０のようにプログラムに基づくソフトウェア処理を行わず、ハードウェア処理によりリサンプリングを行う。すなわち、リサンプラー４０は、専用回路で実現される。 The resampler 40 acquires the second audio data D2 stored in the RAM 120 via the bus 200 and performs the above-described resampling to generate the third audio data D3. The resampler 40 then stores the generated third audio data D3 in the RAM 120 via the bus 200 . Unlike the time scale conversion unit 30, the resampler 40 does not perform software processing based on a program, but performs resampling by hardware processing. That is, the resampler 40 is realized by a dedicated circuit.

Ｄ／Ａ変換器６０は、リサンプラー４０が生成した第３の音声データＤ３を取得してＤ／Ａ変換を行い、アナログ信号である音声信号ＶＯを生成する。そして、音声信号ＶＯは、半導体装置１と接続される不図示のスピーカーにより音声に変換される。 The D/A converter 60 acquires the third audio data D3 generated by the resampler 40, performs D/A conversion, and generates an audio signal VO which is an analog signal. Then, the audio signal VO is converted into audio by a speaker (not shown) connected to the semiconductor device 1 .

仮に、リサンプリングがソフトウェア処理で行われる場合には、ソフトウェア処理で行われるリサンプリングとハードウェア処理で行われるＤ／Ａ変換とを同期させるための制御が必要であるが、本実施形態では、リサンプラー４０がハードウェア処理によりリサンプリングを行うので、リサンプリングとＤ／Ａ変換とを容易に同期させることができる。 If resampling is performed by software processing, control is required to synchronize resampling performed by software processing and D/A conversion performed by hardware processing. Since the resampler 40 performs resampling by hardware processing, resampling and D/A conversion can be easily synchronized.

ローパスフィルター７０は、バス２００を介して、ＲＡＭ１２０に記憶されている第２の音声データＤ２を取得してフィルター処理を行い、フィルター処理された第２の音声データＤ２を、バス２００を介してＲＡＭ１２０に記憶させる。 The low-pass filter 70 acquires the second audio data D2 stored in the RAM 120 via the bus 200, performs filtering, and transmits the filtered second audio data D2 to the RAM 120 via the bus 200. be memorized.

プロセッサー１３０は、例えば、半導体装置１と接続される不図示の操作部からの操作信号に応じて、レジスター５０に前述の制御情報を書き込む。レジスター５０に記憶される制御情報は、バス２００を介して、時間スケール変換部３０、リサンプラー４０、レジスター５０、Ｄ／Ａ変換器６０及びローパスフィルター７０に供給され、これらの各部の動作が制御される。そして、プロセッサー１３０がレジスター５０に所望の制御情報を書き込むことにより、時間スケール変換部３０、リサンプラー４０、Ｄ／Ａ変換器６０及びローパスフィルター７０が動作して所望の音声信号ＶＯが得られるので、プロセッサー１３０は、半導体装置１が組み込まれる機器に必要な各種の処理、例えば、モーターの制御やパネルの制御等に専念することができる。 The processor 130 writes the aforementioned control information to the register 50 in response to an operation signal from an operation unit (not shown) connected to the semiconductor device 1, for example. The control information stored in register 50 is supplied to time scale converter 30, resampler 40, register 50, D/A converter 60 and low pass filter 70 via bus 200 to control the operation of each of these sections. be done. When the processor 130 writes the desired control information to the register 50, the time scale converter 30, the resampler 40, the D/A converter 60 and the low pass filter 70 operate to obtain the desired audio signal VO. , the processor 130 can be devoted to various kinds of processing necessary for equipment in which the semiconductor device 1 is incorporated, such as motor control and panel control.

以上に説明したように、第１実施形態の半導体装置１は、第１の周期Ｔｓ１でサンプリングされた複数のサンプルを含む第１の音声データＤ１を格納する音声データ格納部１０と、第１のクロック信号ＣＫ１及び第２のクロック信号ＣＫ２を生成するクロック生成部２０と、第１のクロック信号ＣＫ１に同期して、第１の音声データＤ１を、第１の音声データＤ１に含まれる複数のサンプルの一部を削除又は複製した第２の音声データＤ２に変換する時間スケール変換部３０と、第２のクロック信号ＣＫ２に同期して、ハードウェア処理により、第２の音声データＤ２を、第１の周期Ｔｓ１でサンプリングされた、第２の音声データＤ２と異なる数の複数のサンプルを含む第３の音声データＤ３に変換するリサンプラー４０と、時間スケール変換部３０及びリサンプラー４０の動作を制御する制御情報を格納するレジスター５０と、を含む。 As described above, the semiconductor device 1 of the first embodiment includes the audio data storage unit 10 that stores the first audio data D1 including a plurality of samples sampled at the first period Ts1, and the first A clock generator 20 for generating a clock signal CK1 and a second clock signal CK2, and a plurality of samples included in the first audio data D1 in synchronization with the first clock signal CK1. and a time scale conversion unit 30 that converts the second audio data D2 by deleting or duplicating a part of the first audio data D2 by hardware processing in synchronization with the second clock signal CK2. A resampler 40 that converts the resampler 40 into third audio data D3 containing a number of samples different from the number of the second audio data D2 sampled at a period Ts1 of , and the operations of the time scale conversion unit 30 and the resampler 40 are controlled. and a register 50 for storing control information to be executed.

時間スケール変換部３０により、第１の音声データＤ１のピッチが変換されずに再生時間が変換された第２の音声データＤ２が生成され、リサンプラー４０により、第２の音声データＤ２のピッチ及び再生時間が変換された第３の音声データＤ３が生成されるので、第１の音声データＤ１のピッチ変換が実現することができる。そして、リサンプラー４０がハードウェア処理によりリサンプリングを行うので、例えば、図６に示したように、マイクロコントロールユニット３１やデジタル信号プロセッサー３２が時間スケール変換部３０として機能し、ソフトウェア処理により時間スケール変換を行う場合でも、リサンプリングの分だけ処理負荷が低減されるため、簡易なマイクロコントロールユニット３１やデジタル信号プロセッサー３２を用いることができる。したがって、第１実施形態の半導体装置１によれば、比較的小さな回路規模でありながら音声データのピッチ変換を行うことができる。 The time scale conversion unit 30 generates the second audio data D2 in which the reproduction time is converted without converting the pitch of the first audio data D1, and the resampler 40 converts the pitch and the pitch of the second audio data D2. Since the third audio data D3 whose reproduction time is converted is generated, the pitch conversion of the first audio data D1 can be realized. Since the resampler 40 performs resampling by hardware processing, for example, as shown in FIG. Even when conversion is performed, the processing load is reduced by the amount of resampling, so a simple micro control unit 31 and digital signal processor 32 can be used. Therefore, according to the semiconductor device 1 of the first embodiment, pitch conversion of voice data can be performed with a relatively small circuit scale.

また、第１実施形態の半導体装置１によれば、例えば、図６に示したように、プロセッサー１３０がレジスター５０に所望の制御情報を書き込むことにより、第１の音声データＤ１のピッチ変換が行われるので、プロセッサー１３０は、半導体装置１が組み込まれる機器に必要な各種の処理に専念することができる。 Further, according to the semiconductor device 1 of the first embodiment, the processor 130 writes desired control information to the register 50 as shown in FIG. Therefore, the processor 130 can concentrate on various processes necessary for equipment in which the semiconductor device 1 is installed.

また、第１実施形態の半導体装置１によれば、リサンプラー４０がハードウェア処理によりリサンプリングを行うので、半導体装置１が第３の音声データＤ３を音声信号ＶＯに変換するＤ／Ａ変換器６０を含む場合には、リサンプリングとＤ／Ａ変換器６０のハードウェア処理によるＤ／Ａ変換とを容易に同期させることができる。 Further, according to the semiconductor device 1 of the first embodiment, since the resampler 40 performs resampling by hardware processing, the semiconductor device 1 is a D/A converter that converts the third audio data D3 into the audio signal VO. 60, resampling and D/A conversion by hardware processing of the D/A converter 60 can be easily synchronized.

また、第１実施形態の半導体装置１によれば、第３の音声データＤ３が第１の音声データＤ１のピッチが変更された、第１の音声データＤ１と同じサンプル数のデータであるように制御情報が設定されることにより、第１の音声データＤ１の再生時間を変更せずにピッチを変換した第３の音声データＤ３を生成することができる。 Further, according to the semiconductor device 1 of the first embodiment, the third audio data D3 is data having the same number of samples as the first audio data D1, in which the pitch of the first audio data D1 is changed. By setting the control information, the pitch-converted third audio data D3 can be generated without changing the reproduction time of the first audio data D1.

１－２．第２実施形態
以下、第２実施形態の半導体装置１について、第１実施形態と同様の構成については同じ符号を付し、第１実施形態と同様の説明は省略又は簡略し、主として第１実施形態と異なる内容について説明する。 1-2. Second Embodiment Hereinafter, regarding the semiconductor device 1 of the second embodiment, the same reference numerals are given to the same configurations as in the first embodiment, and the same description as in the first embodiment will be omitted or simplified, and mainly the first embodiment will be described. Contents different from the form will be explained.

図７は、第２実施形態の半導体装置１の構成例を示す図である。図７に示すように、第２実施形態の半導体装置１は、第１実施形態と同様、音声データ格納部１０と、クロック生成部２０と、時間スケール変換部３０と、リサンプラー４０と、レジスター５０と、を含む。また、第２実施形態の半導体装置１は、第１実施形態と同様、Ｄ／Ａ変換器６０やローパスフィルター７０を含んでもよい。さらに、第２実施形態の半導体装置１は、セレクター８０、セレクター９０及び制御部１００を含む。 FIG. 7 is a diagram showing a configuration example of the semiconductor device 1 of the second embodiment. As shown in FIG. 7, the semiconductor device 1 of the second embodiment includes an audio data storage unit 10, a clock generation unit 20, a time scale conversion unit 30, a resampler 40, and a register as in the first embodiment. 50 and Moreover, the semiconductor device 1 of the second embodiment may include the D/A converter 60 and the low-pass filter 70 as in the first embodiment. Furthermore, the semiconductor device 1 of the second embodiment includes a selector 80 , a selector 90 and a control section 100 .

音声データ格納部１０は、第１実施形態と同様、第１の周期Ｔｓ１でサンプリングされた複数のサンプルを含む第１の音声データＤ１を格納する。 The audio data storage unit 10 stores first audio data D1 including a plurality of samples sampled at the first period Ts1, as in the first embodiment.

クロック生成部２０は、第１実施形態と同様、第１のクロック信号ＣＫ１、第２のクロック信号ＣＫ２、第３のクロック信号ＣＫ３及び第４のクロック信号ＣＫ４を生成し、さらに、制御部１００を動作させるための第５のクロック信号ＣＫ５を生成する。 The clock generator 20 generates a first clock signal CK1, a second clock signal CK2, a third clock signal CK3, and a fourth clock signal CK4, and further controls the controller 100 as in the first embodiment. A fifth clock signal CK5 is generated for operation.

レジスター５０は、時間スケール変換部３０及びリサンプラー４０の動作を制御する制御情報を格納する。第２実施形態では、制御情報は、半導体装置１の動作モードを、第１の動作モード、第２の動作モード及び第３の動作モードを含む複数の動作モードのいずれかに設定する情報である動作モード情報ＭＤを含む。第１の動作モードは、第１の音声データＤ１の再生時間を変更せずにピッチを変更する動作モードである。第２の動作モードは、第１の音声データＤ１のピッチを変更せずに再生時間を変更する動作モードである。第３の動作モードは、第１の音声データＤ１の再生時間及びピッチを変更する動作モードである。 The register 50 stores control information for controlling the operations of the time scale converter 30 and the resampler 40 . In the second embodiment, the control information is information for setting the operation mode of the semiconductor device 1 to one of a plurality of operation modes including a first operation mode, a second operation mode, and a third operation mode. It contains operation mode information MD. A first operation mode is an operation mode in which the pitch is changed without changing the reproduction time of the first audio data D1. A second operation mode is an operation mode in which the reproduction time is changed without changing the pitch of the first audio data D1. A third operation mode is an operation mode for changing the reproduction time and pitch of the first audio data D1.

また、制御情報は、第２の音声データＤ２のサンプル数と第１の音声データＤ１のサンプル数との比率を設定する第１の比率情報ＳＲ１及び第２の比率情報ＳＲ２や、時間スケール変換部３０及びリサンプラー４０の少なくとも一方に処理を開始させるためのスタートビットＳＴを含んでもよい。 Further, the control information includes first ratio information SR1 and second ratio information SR2 for setting the ratio between the number of samples of the second audio data D2 and the number of samples of the first audio data D1, and the time scale converter. A start bit ST for causing at least one of 30 and resampler 40 to start processing may be included.

制御部１００は、時間スケール変換部３０、リサンプラー４０、Ｄ／Ａ変換器６０、ローパスフィルター７０、セレクター８０及びセレクター９０の動作を制御する。具体的には、制御部１００は、第５のクロック信号ＣＫ５に同期して、動作モード情報ＭＤ、第１の比率情報ＳＲ１、第２の比率情報ＳＲ２及びスタートビットＳＴに基づいて、パルス信号である第１のスタート信号ＳＴ１、第２のスタート信号ＳＴ２、第３のスタート信号ＳＴ３、第１の選択信号ＳＥＬ１、第２の選択信号ＳＥＬ２、比率情報ＳＲを生成する。時間スケール変換部３０は、第１のスタート信号ＳＴ１により、比率情報ＳＲに基づいて第２の音声データＤ２を生成する処理を開始し、処理を終了すると停止する。リサンプラー４０及びローパスフィルター７０は、第２のスタート信号ＳＴ２により、比率情報ＳＲに基づいて第３の音声データＤ３を生成する処理を開始し、処理を終了すると停止する。Ｄ／Ａ変換器６０は、第３のスタート信号ＳＴ３により処理を開始し、処理を終了すると停止する。 The control unit 100 controls operations of the time scale conversion unit 30 , resampler 40 , D/A converter 60 , low pass filter 70 , selector 80 and selector 90 . Specifically, in synchronization with the fifth clock signal CK5, the control unit 100 generates a pulse signal based on the operation mode information MD, the first ratio information SR1, the second ratio information SR2, and the start bit ST. A first start signal ST1, a second start signal ST2, a third start signal ST3, a first selection signal SEL1, a second selection signal SEL2, and ratio information SR are generated. The time scale conversion unit 30 starts the process of generating the second audio data D2 based on the ratio information SR in response to the first start signal ST1, and stops when the process ends. The resampler 40 and the low-pass filter 70 start the process of generating the third audio data D3 based on the ratio information SR in response to the second start signal ST2, and stop when the process ends. The D/A converter 60 starts processing in response to the third start signal ST3, and stops when the processing ends.

セレクター８０は、第１の選択信号ＳＥＬ１に基づいて、第１の音声データＤ１又は第２の音声データＤ２を選択して出力する。また、セレクター９０は、第２の選択信号ＳＥＬ２に基づいて、第３の音声データＤ３又は第２の音声データＤ２を選択して出力する。 The selector 80 selects and outputs the first audio data D1 or the second audio data D2 based on the first selection signal SEL1. Also, the selector 90 selects and outputs the third audio data D3 or the second audio data D2 based on the second selection signal SEL2.

動作モード情報ＭＤにおいて第１の動作モードが設定されている場合、制御部１００は、セレクター８０に第１の音声データＤ１を選択させる第１の選択信号ＳＥＬ１、及びセレクター９０に第３の音声データＤ３を選択させる第２の選択信号ＳＥＬ２を生成する。また、制御部１００は、スタートビットＳＴが非アクティブからアクティブに変化すると、第１のスタート信号ＳＴ１、第２のスタート信号ＳＴ２及び第３のスタート信号ＳＴ３を発生させる。また、制御部１００は、比率情報ＳＲに第１の比率情報ＳＲ１を設定する。これにより、時間スケール変換部３０、リサンプラー４０、Ｄ／Ａ変換器６０及びローパスフィルター７０が第１実施形態と同様に動作する。したがって、第１の動作モードにおいて生成される音声信号ＶＯは、第１実施形態と同様、第１の音声データＤ１の再生時間を変更せずにピッチを変更した音声信号である。 When the first operation mode is set in the operation mode information MD, the control unit 100 outputs the first selection signal SEL1 that causes the selector 80 to select the first audio data D1 and the selector 90 to transmit the third audio data D1. A second selection signal SEL2 for selecting D3 is generated. Also, when the start bit ST changes from inactive to active, the controller 100 generates a first start signal ST1, a second start signal ST2 and a third start signal ST3. Also, the control unit 100 sets the first ratio information SR1 to the ratio information SR. As a result, the time scale converter 30, resampler 40, D/A converter 60, and low-pass filter 70 operate in the same manner as in the first embodiment. Therefore, the audio signal VO generated in the first operation mode is an audio signal whose pitch is changed without changing the reproduction time of the first audio data D1, as in the first embodiment.

また、動作モード情報ＭＤにおいて第２の動作モードが設定されている場合、制御部１００は、セレクター８０に第１の音声データＤ１を選択させる第１の選択信号ＳＥＬ１、及びセレクター９０に第２の音声データＤ２を選択させる第２の選択信号ＳＥＬ２を生成する。また、制御部１００は、スタートビットＳＴが非アクティブからアクティブに変化すると、第１のスタート信号ＳＴ１及び第３のスタート信号ＳＴ３を発生させる。また、制御部１００は、比率情報ＳＲに第１の比率情報ＳＲ１を設定する。これにより、時間スケール変換部３０及びＤ／Ａ変換器６０が動作し、リサンプラー４０及びローパスフィルター７０は動作しない。したがって、第２の動作モードにおいて生成される音声信号ＶＯは、第１の音声データＤ１のピッチを変更せずに再生時間を変更した音声信号である。 Further, when the second operation mode is set in the operation mode information MD, the control unit 100 outputs the first selection signal SEL1 that causes the selector 80 to select the first audio data D1, and the second selection signal SEL1 that causes the selector 90 to select the second audio data D1. A second selection signal SEL2 for selecting the audio data D2 is generated. Also, when the start bit ST changes from inactive to active, the controller 100 generates a first start signal ST1 and a third start signal ST3. Also, the control unit 100 sets the first ratio information SR1 to the ratio information SR. As a result, the time scale converter 30 and the D/A converter 60 operate, and the resampler 40 and low-pass filter 70 do not operate. Therefore, the audio signal VO generated in the second operation mode is an audio signal in which the reproduction time is changed without changing the pitch of the first audio data D1.

また、動作モード情報ＭＤにおいて第３の動作モードが設定されている場合、制御部１００は、まず、セレクター８０に第１の音声データＤ１を選択させる第１の選択信号ＳＥＬ１を生成する。また、制御部１００は、スタートビットＳＴが非アクティブからアクティブに変化すると、第１のスタート信号ＳＴ１を発生させる。また、制御部１００は、比率情報ＳＲに第１の比率情報ＳＲ１を設定する。これにより、時間スケール変換部３０が動作し、リサンプラー４０、Ｄ／Ａ変換器６０及びローパスフィルター７０は動作しない。そして、時間スケール変換部３０により、第１の音声データＤ１のピッチを変更せずに再生時間を変更した第２の音声データＤ２が生成される。 Further, when the third operation mode is set in the operation mode information MD, the control unit 100 first generates the first selection signal SEL1 that causes the selector 80 to select the first audio data D1. Also, the control unit 100 generates the first start signal ST1 when the start bit ST changes from inactive to active. Also, the control unit 100 sets the first ratio information SR1 to the ratio information SR. As a result, the time scale converter 30 operates, and the resampler 40, D/A converter 60 and low-pass filter 70 do not operate. Then, the time scale conversion unit 30 generates the second audio data D2 in which the reproduction time is changed without changing the pitch of the first audio data D1.

第２の音声データＤ２の生成が終了すると、制御部１００は、次に、セレクター８０に第２の音声データＤ２を選択させる第１の選択信号ＳＥＬ１、及びセレクター９０に第３の音声データＤ３を選択させる第２の選択信号ＳＥＬ２を生成する。また、制御部１００は、第１のスタート信号ＳＴ１、第２のスタート信号ＳＴ２及び第３のスタート信号ＳＴ３を発生させる。また、制御部１００は、比率情報ＳＲに第２の比率情報ＳＲ２を設定する。これにより、時間スケール変換部３０、リサンプラー４０、Ｄ／Ａ変換器６０及びローパスフィルター７０が動作し、セレクター８０により選択された第２の音声データＤ２の再生時間を変更せずにピッチを変更した第３の音声データＤ３が生成される。ここで、セレクター８０により選択された第２の音声データＤ２は、前述の通り、第１の音声データＤ１のピッチを変更せずに再生時間を変更した音声データであるので、第３の音声データＤ３は、第１の音声データＤ１の再生時間及びピッチを変更した音声データである。したがって、第３の動作モードにおいて生成される音声信号ＶＯは、第１の音声データＤ１の再生時間及びピッチを変更した音声信号である。なお、再生時間の変換率やピッチの変換率は、第１の比率情報ＳＲ１及び第２の比率情報ＳＲ２により任意に設定される。 When the generation of the second audio data D2 is completed, the control unit 100 next sends the first selection signal SEL1 that causes the selector 80 to select the second audio data D2 and the third audio data D3 to the selector 90. A second selection signal SEL2 for selection is generated. The control section 100 also generates a first start signal ST1, a second start signal ST2 and a third start signal ST3. Also, the control unit 100 sets the second ratio information SR2 to the ratio information SR. As a result, the time scale conversion unit 30, the resampler 40, the D/A converter 60, and the low-pass filter 70 operate to change the pitch without changing the reproduction time of the second audio data D2 selected by the selector 80. Then, third audio data D3 is generated. Here, the second audio data D2 selected by the selector 80 is the audio data obtained by changing the reproduction time without changing the pitch of the first audio data D1, as described above. D3 is audio data obtained by changing the reproduction time and pitch of the first audio data D1. Therefore, the audio signal VO generated in the third operation mode is an audio signal obtained by changing the reproduction time and pitch of the first audio data D1. Note that the reproduction time conversion rate and the pitch conversion rate are arbitrarily set by the first ratio information SR1 and the second ratio information SR2.

図８は、第３の動作モードにおいて、第１の音声データＤ１のピッチを下げるとともに再生時間を短縮する場合の第２の音声データＤ２、第３の音声データＤ３及び音声信号ＶＯの一例を示す図である。図８の例では、第１の音声データＤ１は、一定時間に同じ正弦波が４回繰り返される一定周波数の音声データであり、その再生時間はＴ１である。また、第１の比率情報ＳＲ１は７５％に設定され、第２の比率情報ＳＲ２は６７％に設定されている。 FIG. 8 shows an example of the second audio data D2, the third audio data D3, and the audio signal VO when the pitch of the first audio data D1 is lowered and the reproduction time is shortened in the third operation mode. It is a diagram. In the example of FIG. 8, the first audio data D1 is audio data of a constant frequency in which the same sine wave is repeated four times in a constant period of time, and its reproduction time is T1. Also, the first ratio information SR1 is set to 75%, and the second ratio information SR2 is set to 67%.

まず、第１回目の時間スケール変換によりＮ個のサンプルを含む第１の音声データＤ１の１つの正弦波が削除され、同じ正弦波が３回繰り返される一定周波数の第２の音声データＤ２が生成される。第２の音声データＤ２のサンプル数はＮ×０．７５であり、第２の音声データＤ２の再生時間Ｔ２は、第１の音声データＤ１の再生時間Ｔ１の７５％に短縮されている。これに対して、第２の音声データＤ２に含まれる正弦波の周波数は第１の音声データＤ１に含まれる正弦波の周波数と同じなので、時間スケール変換の前後でピッチは変わらない。すなわち、第２の音声データＤ２は、第１の音声データＤ１に対して、ピッチが同じであって再生時間が７５％に短縮された音声データである。 First, one sine wave of the first audio data D1 containing N samples is deleted by the first time scale conversion, and the second audio data D2 with a constant frequency in which the same sine wave is repeated three times is generated. be done. The number of samples of the second audio data D2 is N×0.75, and the reproduction time T2 of the second audio data D2 is shortened to 75% of the reproduction time T1 of the first audio data D1. On the other hand, since the frequency of the sine wave included in the second audio data D2 is the same as the frequency of the sine wave included in the first audio data D1, the pitch does not change before and after the time scale conversion. That is, the second audio data D2 is audio data having the same pitch as the first audio data D1 and having a playback time shortened to 75%.

次に、第２回目の時間スケール変換によりＮ×０．７５個のサンプルを含む第２の音声データＤ２の１つの正弦波が削除され、同じ正弦波が２回繰り返される一定周波数の第２の音声データＤ２が生成される。時間スケール変換により得られる第２の音声データＤ２のサンプル数はＮ×０．７５×０．６７であり、第２の音声データＤ２の再生時間Ｔ３は、第１の音声データＤ１の再生時間Ｔ１の５０％に短縮されている。これに対して、第２の音声データＤ２に含まれる正弦波の周波数は第１の音声データＤ１に含まれる正弦波の周波数と同じである。すなわち、第２の音声データＤ２は、第１の音声データＤ１に対して、ピッチが同じであって再生時間が５０％に短縮された音声データである。 Next, one sine wave of the second audio data D2 containing N×0.75 samples is removed by a second time scale conversion, and a second sound data D2 of constant frequency in which the same sine wave is repeated twice. Audio data D2 is generated. The number of samples of the second audio data D2 obtained by time scale conversion is N×0.75×0.67, and the reproduction time T3 of the second audio data D2 is equal to the reproduction time T1 of the first audio data D1. is shortened to 50% of On the other hand, the frequency of the sine wave included in the second audio data D2 is the same as the frequency of the sine wave included in the first audio data D1. That is, the second audio data D2 is audio data having the same pitch as the first audio data D1 and having a playback time shortened to 50%.

次に、リサンプリングにより第２の音声データＤ２のサンプル数の第２の比率情報ＳＲ２の逆数である１．５倍のサンプルを含む第３の音声データＤ３が生成される。第３の音声データＤ３の再生時間は、時間スケール変換後の第２の音声データＤ２の再生時間Ｔ３の１５０％であり、時間スケール変換前の第２の音声データＤ２の再生時間Ｔ２と同じであり、第１の音声データＤ１の再生時間Ｔ１の７５％である。これに対して、第３の音声データＤ３に含まれる正弦波の周波数は、第２の音声データＤ２に含まれる正弦波の周波数、すなわち第１の音声データＤ１に含まれる正弦波の周波数の６７％である。すなわち、第３の音声データＤ３は、第１の音声データＤ１に対して、再生時間が７５％に短縮され、ピッチが６７％に下がった音声データである。 Next, by resampling, the third audio data D3 including 1.5 times the number of samples of the second audio data D2, which is the reciprocal of the second ratio information SR2, is generated. The reproduction time of the third audio data D3 is 150% of the reproduction time T3 of the second audio data D2 after time scale conversion, and is the same as the reproduction time T2 of the second audio data D2 before time scale conversion. , which is 75% of the reproduction time T1 of the first audio data D1. On the other hand, the frequency of the sine wave included in the third audio data D3 is 67 times the frequency of the sine wave included in the second audio data D2, that is, the frequency of the sine wave included in the first audio data D1. %. That is, the third audio data D3 is audio data in which the reproduction time is shortened to 75% and the pitch is reduced to 67% with respect to the first audio data D1.

最後に、Ｄ／Ａ変換により、第３の音声データＤ３の再生時間Ｔ２と同じ再生時間の音声信号ＶＯが生成される。 Finally, by D/A conversion, an audio signal VO having the same reproduction time as the reproduction time T2 of the third audio data D3 is generated.

図９は、第３の動作モードにおいて、第１の音声データＤ１のピッチを上げるとともに再生時間を伸長する場合の第２の音声データＤ２、第３の音声データＤ３及び音声信号ＶＯの一例を示す図である。図９の例では、第１の音声データＤ１は、一定時間に同じ正弦波が４回繰り返される一定周波数の音声データであり、その再生時間はＴ１である。また、第１の比率情報ＳＲ１は１２５％に設定され、第２の比率情報ＳＲ２は１２０％に設定されている。 FIG. 9 shows an example of the second audio data D2, the third audio data D3, and the audio signal VO when the pitch of the first audio data D1 is increased and the reproduction time is extended in the third operation mode. It is a diagram. In the example of FIG. 9, the first audio data D1 is audio data of a constant frequency in which the same sine wave is repeated four times in a constant period of time, and its reproduction time is T1. Also, the first ratio information SR1 is set to 125%, and the second ratio information SR2 is set to 120%.

まず、第１回目の時間スケール変換によりＮ個のサンプルを含む第１の音声データＤ１の１つの正弦波が複製され、同じ正弦波が５回繰り返される一定周波数の第２の音声データＤ２が生成される。第２の音声データＤ２のサンプル数はＮ×１．２５であり、第２の音声データＤ２の再生時間Ｔ２は、第１の音声データＤ１の再生時間Ｔ１の１２５％に伸長されている。これに対して、第２の音声データＤ２に含まれる正弦波の周波数は第１の音声データＤ１に含まれる正弦波の周波数と同じなので、時間スケール変換の前後でピッチは変わらない。すなわち、第２の音声データＤ２は、第１の音声データＤ１に対して、ピッチが同じであって再生時間が１２５％に伸長された音声データである。 First, the first time scale conversion replicates one sine wave of the first audio data D1 containing N samples to generate the second audio data D2 with a constant frequency in which the same sine wave is repeated five times. be done. The number of samples of the second audio data D2 is N×1.25, and the reproduction time T2 of the second audio data D2 is extended to 125% of the reproduction time T1 of the first audio data D1. On the other hand, since the frequency of the sine wave included in the second audio data D2 is the same as the frequency of the sine wave included in the first audio data D1, the pitch does not change before and after the time scale conversion. In other words, the second audio data D2 is audio data having the same pitch as the first audio data D1 and having the reproduction time extended by 125%.

次に、第２回目の時間スケール変換によりＮ×１．２５個のサンプルを含む第２の音声データＤ２の１つの正弦波が複製され、同じ正弦波が６回繰り返される一定周波数の第２の音声データＤ２が生成される。時間スケール変換により得られる第２の音声データＤ２のサンプル数はＮ×１．２５×１．２０＝Ｎ×１．５であり、第２の音声データＤ２の再生時間Ｔ３は、第１の音声データＤ１の再生時間Ｔ１の１５０％に伸長されている。これに対して、第２の音声データＤ２に含まれる正弦波の周波数は第１の音声データＤ１に含まれる正弦波の周波数と同じである。すなわち、第２の音声データＤ２は、第１の音声データＤ１に対して、ピッチが同じであって再生時間が１５０％に伸長された音声データである。 Next, a second time-scale conversion replicates one sine wave of the second audio data D2 containing N×1.25 samples, and a second constant-frequency second audio data D2 of the same sine wave repeated six times. Audio data D2 is generated. The number of samples of the second audio data D2 obtained by time scale conversion is N×1.25×1.20=N×1.5, and the reproduction time T3 of the second audio data D2 is equal to that of the first audio. It is extended to 150% of the reproduction time T1 of data D1. On the other hand, the frequency of the sine wave included in the second audio data D2 is the same as the frequency of the sine wave included in the first audio data D1. That is, the second audio data D2 is audio data having the same pitch as the first audio data D1 and having a reproduction time extended by 150%.

次に、リサンプリングにより第２の音声データＤ２のサンプル数の第２の比率情報ＳＲ２の逆数である約０．８３倍のサンプルを含む第３の音声データＤ３が生成される。第３の音声データＤ３の再生時間は、時間スケール変換後の第２の音声データＤ２の再生時間Ｔ３の約８３％であり、時間スケール変換前の第２の音声データＤ２の再生時間Ｔ２と同じであり、第１の音声データＤ１の再生時間Ｔ１の１２５％である。これに対して、第３の音声データＤ３に含まれる正弦波の周波数は、第２の音声データＤ２に含まれる正弦波の周波数、すなわち第１の音声データＤ１に含まれる正弦波の周波数の１２０％である。すなわち、第３の音声データＤ３は、第１の音声データＤ１に対して、再生時間が１２５％に伸長され、ピッチが１２０％に上がった音声データである。 Next, by resampling, the third audio data D3 including about 0.83 times the number of samples of the second audio data D2, which is the reciprocal of the second ratio information SR2, is generated. The reproduction time of the third audio data D3 is approximately 83% of the reproduction time T3 of the second audio data D2 after time scale conversion, and is the same as the reproduction time T2 of the second audio data D2 before time scale conversion. , which is 125% of the reproduction time T1 of the first audio data D1. On the other hand, the frequency of the sine wave included in the third audio data D3 is 120 times the frequency of the sine wave included in the second audio data D2, that is, the frequency of the sine wave included in the first audio data D1. %. That is, the third audio data D3 is audio data whose reproduction time is extended by 125% and whose pitch is increased by 120% with respect to the first audio data D1.

なお、第２実施形態の半導体装置１の実施例の構成については図示を省略するが、図６に示した第１実施形態の半導体装置１に対して、クロック生成部２０がさらに前述の第５のクロック信号ＣＫ５を生成し、例えば、マイクロコントロールユニット３１がさらに前述の制御部１００として機能してもよい。 Although the configuration of the example of the semiconductor device 1 of the second embodiment is omitted from the drawing, the clock generator 20 is further added to the semiconductor device 1 of the first embodiment shown in FIG. , and the micro control unit 31 may further function as the above-described control section 100, for example.

以上に説明した第２実施形態の半導体装置１によれば、第１実施形態の半導体装置１と同様の効果を奏する。さらに、第２実施形態の半導体装置１によれば、レジスター５０に記憶される制御情報に基づき、第１の音声データＤ１の再生時間を変更せずにピッチを変更する第１の動作モード、第１の音声データＤ１のピッチを変更せずに再生時間を変更する第２の動作モード、又は第１の音声データＤ１の再生時間及びピッチを変更する第３の動作モードを選択することができるので、高い汎用性を実現することができる。 According to the semiconductor device 1 of the second embodiment described above, the same effects as those of the semiconductor device 1 of the first embodiment are obtained. Furthermore, according to the semiconductor device 1 of the second embodiment, the first operation mode in which the pitch is changed without changing the reproduction time of the first audio data D1 based on the control information stored in the register 50; It is possible to select the second operation mode in which the reproduction time is changed without changing the pitch of the first audio data D1, or the third operation mode in which the reproduction time and pitch of the first audio data D1 are changed. , high versatility can be achieved.

１－３．変形例
図１０は、変形例の半導体装置１の構成を示す図である。図１０に示すように、変形例の半導体装置１は、第１実施形態と同様、音声データ格納部１０と、クロック生成部２０と、時間スケール変換部３０と、リサンプラー４０と、レジスター５０と、を含む。 1-3. Modification FIG. 10 is a diagram showing the configuration of a semiconductor device 1 of a modification. As shown in FIG. 10, the semiconductor device 1 of the modification includes an audio data storage unit 10, a clock generation unit 20, a time scale conversion unit 30, a resampler 40, and a register 50, as in the first embodiment. ,including.

クロック生成部２０は、第１実施形態と同様、第１のクロック信号ＣＫ１及び第２のクロック信号ＣＫ２を生成する。 The clock generator 20 generates a first clock signal CK1 and a second clock signal CK2, as in the first embodiment.

リサンプラー４０は、第２のクロック信号ＣＫ２に同期して、ハードウェア処理により、第１の音声データＤ１を、第１の周期Ｔｓ１でサンプリングされた、第１の音声データＤ１と異なる数の複数のサンプルを含む第２の音声データＤ２に変換する。具体的には、リサンプラー４０は、第１の音声データＤ１に含まれる複数のサンプルを第１の周期Ｔｓ１とは異なる第２の周期Ｔｓ２でサンプリングされたものと仮定して、当該複数のサンプルに基づく補間処理及び/又は間引き処理により、第１の周期Ｔｓ１でサンプリングされた複数のサンプルを求めて第２の音声データＤ２とする。補間処理及び/又は間引き処理には公知の各種の手法を適用可能である。なお、第１の音声データＤ１が圧縮された音声データである場合は、リサンプラー４０は、不図示の音声データ展開部により展開された第１の音声データＤ１に対して補間処理及び/又は間引き処理を行ってもよい。 Synchronizing with the second clock signal CK2, the resampler 40 performs hardware processing to sample the first audio data D1 at the first period Ts1, and samples the first audio data D1 in a number different from that of the first audio data D1. is converted into second audio data D2 containing samples of . Specifically, the resampler 40 assumes that the plurality of samples included in the first audio data D1 are sampled at a second period Ts2 different from the first period Ts1, and assumes that the plurality of samples A plurality of samples sampled at the first cycle Ts1 are obtained by interpolation processing and/or thinning processing based on , and are used as the second audio data D2. Various known techniques can be applied to interpolation processing and/or thinning processing. When the first audio data D1 is compressed audio data, the resampler 40 performs interpolation processing and/or thinning on the first audio data D1 expanded by an audio data expansion unit (not shown). processing may be performed.

時間スケール変換部３０は、第１のクロック信号ＣＫ１に同期して、第２の音声データＤ２を、第２の音声データＤ２に含まれる複数のサンプルの一部を削除又は複製した第３の音声データＤ３に変換する。例えば、削除又は複製されるサンプルは、周期的に繰り返される波形を構成する複数のサンプルの一部であってもよい。第３の音声データＤ３は、第２の音声データＤ２と同じピッチであって、第２の音声データＤ２とは再生速度あるいは再生時間が異なる音声データである。第２の音声データＤ２を第３の音声データＤ３に変換する処理、すなわち、いわゆる時間スケール変換処理には公知の各種の手法を適用可能である。例えば、時間スケール変換部３０は、ソフトウェア処理により時間スケール変換処理を行ってもよい。 In synchronization with the first clock signal CK1, the time scale conversion unit 30 transforms the second audio data D2 into a third audio data obtained by deleting or duplicating some of the samples included in the second audio data D2. Convert to data D3. For example, the deleted or duplicated sample may be part of a plurality of samples that make up a periodically repeating waveform. The third audio data D3 is audio data having the same pitch as the second audio data D2, but different in reproduction speed or reproduction time from the second audio data D2. Various known techniques can be applied to the process of converting the second audio data D2 into the third audio data D3, that is, the so-called time scale conversion process. For example, the time scale conversion unit 30 may perform time scale conversion processing by software processing.

レジスター５０は、第１実施形態と同様、時間スケール変換部３０及びリサンプラー４０の動作を制御する制御情報を格納する。例えば、制御情報は、時間スケール変換部３０及びリサンプラー４０に処理を開始させるためのスタートビットＳＴを含んでもよい。時間スケール変換部３０及びリサンプラー４０は、スタートビットＳＴが非アクティブからアクティブに変化すると処理を開始し、処理を終了すると停止する。 The register 50 stores control information for controlling the operations of the time scale converter 30 and the resampler 40, as in the first embodiment. For example, the control information may include a start bit ST for causing the time scale converter 30 and resampler 40 to start processing. The time scale converter 30 and the resampler 40 start processing when the start bit ST changes from inactive to active, and stop when the processing ends.

また、制御情報は、第３の音声データＤ３のサンプル数と第２の音声データＤ２のサンプル数との比率情報ＳＲを含み、時間スケール変換部３０及びリサンプラー４０に比率情報ＳＲが供給されてもよい。例えば、比率情報ＳＲが、第３の音声データＤ３のサンプル数が第２の音声データＤ２のサンプル数の７５％であることを示す場合、リサンプラー４０は、第１の音声データＤ１に含まれるＮ個のサンプルを第１の周期Ｔｓ１の約１３３％である第２の周期Ｔｓ２でサンプリングされたものと仮定して、Ｎ個のサンプルに基づく補間処理及び/又は間引き処理により、第１の周期Ｔｓ１でサンプリングされた約Ｎ×１．３３個のサンプルを求めて第２の音声データＤ２とする。そして、時間スケール変換部３０は、第２の音声データＤ２を、第２の音声データＤ２に含まれる約Ｎ×１．３３個のサンプルの２５％を削除したＮ個のサンプルを含む第３の音声データＤ３に変換する。 Further, the control information includes ratio information SR between the number of samples of the third audio data D3 and the number of samples of the second audio data D2, and the ratio information SR is supplied to the time scale conversion unit 30 and the resampler 40. good too. For example, if the ratio information SR indicates that the number of samples of the third audio data D3 is 75% of the number of samples of the second audio data D2, the resampler 40 is included in the first audio data D1. Assuming that N samples were sampled at a second period Ts2 that is approximately 133% of the first period Ts1, interpolation and/or decimation based on the N samples yields the first period Approximately N×1.33 samples sampled at Ts1 are obtained and used as the second audio data D2. Then, the time scale conversion unit 30 transforms the second audio data D2 into a third audio data D2 containing N samples obtained by deleting 25% of approximately N×1.33 samples contained in the second audio data D2. Convert to audio data D3.

また、第１実施形態と同様、半導体装置１は、Ｄ／Ａ変換器６０を含んでもよい。半導体装置１がＤ／Ａ変換器６０を含む場合、クロック生成部２０は、Ｄ／Ａ変換器６０を動作させるための第３のクロック信号ＣＫ３を生成する。 Further, the semiconductor device 1 may include a D/A converter 60, as in the first embodiment. When the semiconductor device 1 includes the D/A converter 60, the clock generator 20 generates the third clock signal CK3 for operating the D/A converter 60. FIG.

Ｄ／Ａ変換器６０は、第１実施形態と同様、第３の音声データＤ３をアナログ信号である音声信号ＶＯに変換する。具体的には、Ｄ／Ａ変換器６０は、第３のクロック信号ＣＫ３に同期して、第１の周期Ｔｓ１で第３の音声データＤ３に含まれる複数のサンプルをサンプリングしてアナログ値に変換することにより音声信号ＶＯを生成する。スタートビットＳＴはＤ／Ａ変換器６０にも供給され、Ｄ／Ａ変換器６０は、スタートビットＳＴが非アクティブからアクティブに変化すると処理を開始し、処理を終了すると停止する。 As in the first embodiment, the D/A converter 60 converts the third audio data D3 into an analog audio signal VO. Specifically, the D/A converter 60 samples a plurality of samples included in the third audio data D3 at a first period Ts1 in synchronization with the third clock signal CK3 and converts them into analog values. to generate the audio signal VO. The start bit ST is also supplied to the D/A converter 60, and the D/A converter 60 starts processing when the start bit ST changes from inactive to active, and stops when processing ends.

なお、第１実施形態と同様、半導体装置１は、リサンプラー４０におけるリサンプリングによって発生するエリアシングノイズを低減させる必要がある場合は、ローパスフィルター７０を含んでもよい。リサンプラー４０におけるリサンプリングがダウンサンプリングである場合は、ローパスフィルター７０は、リサンプラー４０の前段、具体的には音声データ格納部１０の出力からリサンプラー４０の入力に至る信号経路に設けられ、必要に応じてリサンプラー４０の後段、具体的にはリサンプラー４０の出力から時間スケール変換部３０の入力に至る信号経路にも設けられる。また、リサンプラー４０におけるリサンプリングがアップサンプリングである場合は、ローパスフィルター７０は、リサンプラー４０の後段、具体的にはリサンプラー４０の出力から時間スケール変換部３０の入力に至る信号経路に設けられる。半導体装置１がローパスフィルター７０を含む場合、クロック生成部２０は、ローパスフィルター７０を動作させるための第４のクロック信号ＣＫ４を生成し、スタートビットＳＴはローパスフィルター７０にも供給される。そして、ローパスフィルター７０は、スタートビットＳＴが非アクティブからアクティブに変化すると第４のクロック信号ＣＫ４に同期した処理を開始し、処理を終了すると停止する。 As in the first embodiment, the semiconductor device 1 may include the low-pass filter 70 when it is necessary to reduce aliasing noise generated by resampling in the resampler 40 . When the resampling in the resampler 40 is downsampling, the low-pass filter 70 is provided in the preceding stage of the resampler 40, specifically in the signal path from the output of the audio data storage unit 10 to the input of the resampler 40, If necessary, it is also provided in a signal path subsequent to the resampler 40 , specifically, from the output of the resampler 40 to the input of the time scale conversion section 30 . Further, when the resampling in the resampler 40 is upsampling, the low-pass filter 70 is provided in the subsequent stage of the resampler 40, specifically in the signal path from the output of the resampler 40 to the input of the time scale conversion unit 30. be done. When the semiconductor device 1 includes the low-pass filter 70 , the clock generator 20 generates a fourth clock signal CK4 for operating the low-pass filter 70 and the start bit ST is also supplied to the low-pass filter 70 . The low-pass filter 70 starts processing in synchronization with the fourth clock signal CK4 when the start bit ST changes from inactive to active, and stops when the processing ends.

なお、第１実施形態と同様、クロック生成部２０は、スタートビットＳＴが非アクティブからアクティブに変化すると、第１のクロック信号ＣＫ１、第２のクロック信号ＣＫ２、第３のクロック信号ＣＫ３及び第４のクロック信号ＣＫ４を生成し、所定の時間が経過すると、クロック信号の生成を停止してもよい。この場合、スタートビットＳＴは、時間スケール変換部３０及びリサンプラー４０、Ｄ／Ａ変換器６０及びローパスフィルター７０に供給されなくてもよい。 As in the first embodiment, the clock generator 20 generates the first clock signal CK1, the second clock signal CK2, the third clock signal CK3, and the fourth clock signal CK3 when the start bit ST changes from inactive to active. The clock signal CK4 may be generated, and the generation of the clock signal may be stopped after a predetermined period of time has elapsed. In this case, the start bit ST may not be supplied to the time scale converter 30, resampler 40, D/A converter 60 and low-pass filter 70. FIG.

また、第１実施形態と同様、クロック生成部２０が生成する第１のクロック信号ＣＫ１、第２のクロック信号ＣＫ２、第３のクロック信号ＣＫ３及び第４のクロック信号ＣＫ４のうちの複数のクロック信号が共通であってもよい。 Further, as in the first embodiment, a plurality of clock signals out of the first clock signal CK1, the second clock signal CK2, the third clock signal CK3, and the fourth clock signal CK4 generated by the clock generator 20 may be common.

図１１は、変形例の半導体装置１において、第１の音声データＤ１のピッチを下げる場合の第２の音声データＤ２、第３の音声データＤ３及び音声信号ＶＯの一例を示す図である。図１１の例では、第１の音声データＤ１は、一定時間に同じ正弦波が４回繰り返される一定周波数の音声データであり、その再生時間はＴ１である。また、比率情報ＳＲは７５％に設定されている。 FIG. 11 is a diagram showing an example of the second audio data D2, the third audio data D3, and the audio signal VO when the pitch of the first audio data D1 is lowered in the semiconductor device 1 of the modification. In the example of FIG. 11, the first audio data D1 is audio data of a constant frequency in which the same sine wave is repeated four times in a constant period of time, and its reproduction time is T1. Also, the ratio information SR is set to 75%.

まず、リサンプリングにより第１の音声データＤ１のサンプル数Ｎの比率情報ＳＲの逆数である約１．３３倍のサンプルを含む第２の音声データＤ２が生成される。このリサンプリングの詳細は、図３の例と同様である。第２の音声データＤ２の再生時間Ｔ２は、第１の音声データＤ１の再生時間Ｔ１の約１３３％である。これに対して、第２の音声データＤ２に含まれる正弦波の周波数は、第１の音声データＤ１に含まれる正弦波の周波数の７５％である。すなわち、第２の音声データＤ２は、第１の音声データＤ１に対して、再生時間が約１３３％に伸長され、ピッチが７５％に下がった音声データである。 First, by resampling, the second audio data D2 including approximately 1.33 times as many samples as the reciprocal of the ratio information SR of the number N of samples of the first audio data D1 is generated. The details of this resampling are the same as in the example of FIG. The reproduction time T2 of the second audio data D2 is approximately 133% of the reproduction time T1 of the first audio data D1. On the other hand, the frequency of the sine wave included in the second audio data D2 is 75% of the frequency of the sine wave included in the first audio data D1. That is, the second audio data D2 is audio data in which the reproduction time is extended to about 133% and the pitch is reduced to 75% of the first audio data D1.

次に、時間スケール変換により第２の音声データＤ２の１つの正弦波が削除され、同じ正弦波が３回繰り返される一定周波数の第３の音声データＤ３が生成される。第３の音声データＤ３のサンプル数はＮであり、第３の音声データＤ３の再生時間は、第２の音声データＤ２の再生時間Ｔ２の７５％である。これに対して、第３の音声データＤ３に含まれる正弦波の周波数は第２の音声データＤ２に含まれる正弦波の周波数と同じなので、時間スケール変換の前後でピッチは変わらない。すなわち、第３の音声データＤ３は、第２の音声データＤ２に対して、ピッチが同じであって再生時間が７５％に短縮された音声データである。前述の通り、第２の音声データＤ２は、第１の音声データＤ１に対して、再生時間が約１３３％に伸長され、ピッチが７５％に下がった音声データであるので、第３の音声データＤ３は、第１の音声データＤ１に対して、再生時間が同じであり、ピッチが７５％に下がった音声データである。 Next, one sine wave of the second audio data D2 is removed by time scale conversion to generate third audio data D3 of constant frequency in which the same sine wave is repeated three times. The number of samples of the third audio data D3 is N, and the playback time of the third audio data D3 is 75% of the playback time T2 of the second audio data D2. On the other hand, since the frequency of the sine wave included in the third audio data D3 is the same as the frequency of the sine wave included in the second audio data D2, the pitch does not change before and after the time scale conversion. That is, the third audio data D3 is audio data that has the same pitch as the second audio data D2 and has a reproduction time shortened to 75%. As described above, the second audio data D2 is audio data whose reproduction time is extended to about 133% and the pitch is lowered to 75% of the first audio data D1. D3 is audio data whose reproduction time is the same as that of the first audio data D1 and whose pitch is lowered to 75%.

図１２は、変形例の半導体装置１において、第１の音声データＤ１のピッチを上げる場合の第２の音声データＤ２、第３の音声データＤ３及び音声信号ＶＯの一例を示す図である。図１２の例では、第１の音声データＤ１は、一定時間に同じ正弦波が４回繰り返される一定周波数の音声データであり、その再生時間はＴ１である。また、比率情報ＳＲは１２５％に設定されている。 FIG. 12 is a diagram showing an example of the second audio data D2, the third audio data D3, and the audio signal VO when the pitch of the first audio data D1 is raised in the semiconductor device 1 of the modification. In the example of FIG. 12, the first audio data D1 is audio data of a constant frequency in which the same sine wave is repeated four times in a constant period of time, and its reproduction time is T1. Also, the ratio information SR is set to 125%.

まず、リサンプリングにより第１の音声データＤ１のサンプル数Ｎの比率情報ＳＲの逆数である０．８０倍のサンプルを含む第２の音声データＤ２が生成される。このリサンプリングの詳細は、図５の例と同様である。第２の音声データＤ２の再生時間Ｔ２は、第１の音声データＤ１の再生時間Ｔ１の８０％である。これに対して、第２の音声データＤ２に含まれる正弦波の周波数は、第１の音声データＤ１に含まれる正弦波の周波数の１２５％である。すなわち、第２の音声データＤ２は、第１の音声データＤ１に対して、再生時間が８０％に短縮され、ピッチが１２５％に上がった音声データである。 First, by resampling, the second audio data D2 including 0.80 times as many samples as the reciprocal of the ratio information SR of the sample number N of the first audio data D1 is generated. The details of this resampling are the same as in the example of FIG. The reproduction time T2 of the second audio data D2 is 80% of the reproduction time T1 of the first audio data D1. On the other hand, the frequency of the sine wave included in the second audio data D2 is 125% of the frequency of the sine wave included in the first audio data D1. That is, the second audio data D2 is audio data in which the reproduction time is shortened to 80% and the pitch is raised to 125% of the first audio data D1.

次に、時間スケール変換により第２の音声データＤ２の１つの正弦波が複製され、同じ正弦波が５回繰り返される一定周波数の第３の音声データＤ３が生成される。第３の音声データＤ３のサンプル数はＮであり、第３の音声データＤ３の再生時間は、第２の音声データＤ２の再生時間Ｔ２の１２５％である。これに対して、第３の音声データＤ３に含まれる正弦波の周波数は第２の音声データＤ２に含まれる正弦波の周波数と同じなので、時間スケール変換の前後でピッチは変わらない。すなわち、第３の音声データＤ３は、第２の音声データＤ２に対して、ピッチが同じであって再生時間が１２５％に伸長された音声データである。前述の通り、第２の音声データＤ２は、第１の音声データＤ１に対して、再生時間が８０％に短縮され、ピッチが１２５％に上がった音声データであるので、第３の音声データＤ３は、第１の音声データＤ１に対して、再生時間が同じであり、ピッチが１２５％に上がった音声データである。 Next, one sine wave of the second audio data D2 is replicated by time scale conversion to generate third audio data D3 of constant frequency in which the same sine wave is repeated five times. The number of samples of the third audio data D3 is N, and the playback time of the third audio data D3 is 125% of the playback time T2 of the second audio data D2. On the other hand, since the frequency of the sine wave included in the third audio data D3 is the same as the frequency of the sine wave included in the second audio data D2, the pitch does not change before and after the time scale conversion. In other words, the third audio data D3 is audio data having the same pitch as the second audio data D2 and having the playback time extended by 125%. As described above, the second audio data D2 is audio data in which the reproduction time is shortened to 80% and the pitch is raised to 125% of the first audio data D1. is audio data whose reproduction time is the same as that of the first audio data D1 and whose pitch is raised to 125%.

以上に説明した変形例の半導体装置１によれば、上記の第１実施形態と同様の効果を奏する。 According to the semiconductor device 1 of the modified example described above, the same effects as those of the first embodiment are obtained.

なお、変形例の半導体装置１に対して、上記の第２実施形態で示したセレクター８０、セレクター９０及び制御部１００が追加されてもよい。 Note that the selector 80, the selector 90, and the control unit 100 shown in the second embodiment may be added to the semiconductor device 1 of the modified example.

２．電子機器
図１３は、上述した半導体装置１を用いた本実施形態の電子機器の構成の一例を示す機能ブロック図である。また、図１４は、本実施形態の電子機器の一例であるスマートフォンの外観の一例を示す図である。 2. Electronic Apparatus FIG. 13 is a functional block diagram showing an example of the configuration of an electronic apparatus according to the present embodiment using the semiconductor device 1 described above. Also, FIG. 14 is a diagram showing an example of the appearance of a smartphone, which is an example of the electronic device of the present embodiment.

本実施形態の電子機器３００は、半導体装置１、操作部３３０、記憶部３４０、通信部３５０、表示部３６０及び音出力部３７０を含んで構成されている。なお、本実施形態の電子機器３００は、図１３の構成要素の一部を省略又は変更し、あるいは、他の構成要素を付加した構成としてもよい。 An electronic device 300 of this embodiment includes a semiconductor device 1 , an operation unit 330 , a storage unit 340 , a communication unit 350 , a display unit 360 and a sound output unit 370 . Note that the electronic device 300 of the present embodiment may have a configuration in which some of the constituent elements in FIG. 13 are omitted or changed, or other constituent elements are added.

操作部３３０は、操作キーやボタンスイッチ等により構成される入力装置であり、ユーザーによる操作に応じた操作信号を半導体装置１に出力する。半導体装置１は、例えば、操作部３３０から入力される信号に応じて前述の制御情報を変更する。 The operation unit 330 is an input device including operation keys, button switches, and the like, and outputs an operation signal to the semiconductor device 1 in accordance with a user's operation. The semiconductor device 1 changes the aforementioned control information according to a signal input from the operation unit 330, for example.

記憶部３４０は、半導体装置１が各種の計算処理や制御処理を行うためのプログラムやデータ等を記憶している。例えば、記憶部３４０には複数の音声データが記憶されており、半導体装置１は、記憶部３４０に記憶されている複数の音声データのいずれかを読み出して、前述の第１の音声データＤ１としてもよい。記憶部３４０は、例えば、ハードディスク、フレキシブルディスク、ＭＯ、ＭＴ、各種のメモリー、ＣＤ－ＲＯＭ、又は、ＤＶＤ－ＲＯＭ等によって実現される。 The storage unit 340 stores programs, data, and the like for the semiconductor device 1 to perform various calculation processes and control processes. For example, a plurality of pieces of audio data are stored in the storage unit 340, and the semiconductor device 1 reads one of the plurality of pieces of audio data stored in the storage unit 340 and uses it as the first audio data D1. good too. The storage unit 340 is implemented by, for example, a hard disk, flexible disk, MO, MT, various memories, CD-ROM, DVD-ROM, or the like.

通信部３５０は、半導体装置１と電子機器３００の外部機器との間のデータ通信を成立させるための各種制御を行う。 The communication unit 350 performs various controls for establishing data communication between the semiconductor device 1 and an external device of the electronic device 300 .

表示部３６０は、ＬＣＤ（Liquid Crystal Display）等により構成される表示装置であり、入力される表示信号に基づいて各種の情報を表示する。表示部３６０には操作部３３０として機能するタッチパネルが設けられていてもよい。例えば、半導体装置１が表示信号を生成してもよいし、半導体装置１とは異なる不図示の半導体装置が表示信号を生成してもよい。 The display unit 360 is a display device configured by an LCD (Liquid Crystal Display) or the like, and displays various information based on input display signals. The display unit 360 may be provided with a touch panel that functions as the operation unit 330 . For example, the semiconductor device 1 may generate the display signal, or a semiconductor device (not shown) different from the semiconductor device 1 may generate the display signal.

音出力部３７０は、スピーカー等によって構成され、半導体装置１からの出力信号である音声信号に基づいて音声を出力する。 Sound output unit 370 is configured by a speaker or the like, and outputs sound based on an audio signal that is an output signal from semiconductor device 1 .

本実施形態の電子機器３００は、比較的小さな回路規模でありながら音声データのピッチ変換を行うことが可能な半導体装置１を備えていることにより、低コストで所望の音声を出力することができる。 The electronic device 300 of the present embodiment is equipped with the semiconductor device 1 capable of pitch-converting voice data while having a relatively small circuit scale, so that desired voice can be output at low cost. .

このような電子機器３００としては種々の電子機器が考えられ、例えば、炊飯器、ＩＨクッキングヒーター、掃除機、洗濯機等の各種の家庭用電気製品、電子時計、モバイル型、ラップトップ型、タブレット型などのパーソナルコンピューター、スマートフォンや携帯電話機などの移動体端末、ディジタルカメラ、インクジェットプリンターなどのインクジェット式吐出装置、ルーターやスイッチなどのストレージエリアネットワーク機器、ローカルエリアネットワーク機器、移動体端末基地局用機器、テレビ、ビデオカメラ、ビデオレコーダー、カーナビゲーション装置、リアルタイムクロック装置、ページャー、電子手帳、電子辞書、電卓、電子ゲーム機器、ゲーム用コントローラー、ワードプロセッサー、ワークステーション、テレビ電話、防犯用テレビモニター、電子双眼鏡、ＰＯＳ端末、電子体温計、血圧計、血糖計、心電図計測装置、超音波診断装置、電子内視鏡等の医療機器、魚群探知機、各種測定機器、車両、航空機、船舶等の計器類、フライトシミュレーター、ヘッドマウントディスプレイ、モーショントレース、モーショントラッキング、モーションコントローラー、歩行者自立航法（ＰＤＲ：Pedestrian Dead Reckoning）装置等が挙げられる。 Various electronic devices can be considered as such an electronic device 300, for example, various household electric appliances such as rice cookers, IH cooking heaters, vacuum cleaners, washing machines, electronic clocks, mobile types, laptop types, and tablet types. such as personal computers, mobile terminals such as smartphones and mobile phones, digital cameras, inkjet ejection devices such as inkjet printers, storage area network equipment such as routers and switches, local area network equipment, mobile terminal base station equipment, Televisions, video cameras, video recorders, car navigation devices, real-time clock devices, pagers, electronic notebooks, electronic dictionaries, calculators, electronic game machines, game controllers, word processors, workstations, videophones, security TV monitors, electronic binoculars, POS terminals, electronic thermometers, sphygmomanometers, blood glucose meters, electrocardiogram measuring devices, ultrasonic diagnostic devices, medical devices such as electronic endoscopes, fish finders, various measuring devices, instruments for vehicles, aircraft, ships, etc., flight simulators , head-mounted displays, motion tracing, motion tracking, motion controllers, pedestrian dead reckoning (PDR: Pedestrian Dead Reckoning) devices, and the like.

３．移動体
図１５は、上述した半導体装置１を用いた本実施形態の移動体の一例を示す図である。図１５に示す移動体４００は、半導体装置１、コントローラー４４０，４５０，４６０、バッテリー４７０、スピーカー４８０を含む。なお、本実施形態の移動体は、図１５の構成要素の一部を省略し、あるいは、他の構成要素を付加した構成としてもよい。 3. Mobile Object FIG. 15 is a diagram showing an example of a mobile object according to the present embodiment using the semiconductor device 1 described above. A moving body 400 shown in FIG. 15 includes a semiconductor device 1 , controllers 440 , 450 and 460 , a battery 470 and a speaker 480 . It should be noted that the moving body of this embodiment may have a configuration in which some of the constituent elements shown in FIG. 15 are omitted, or other constituent elements are added.

半導体装置１、コントローラー４４０，４５０，４６０、スピーカー４８０は、バッテリー４７０から供給される電源電圧で動作する。 The semiconductor device 1 , the controllers 440 , 450 , 460 and the speaker 480 operate with power supply voltage supplied from the battery 470 .

例えば、コントローラー４４０，４５０，４６０は、それぞれ、姿勢制御システム、横転防止システム、ブレーキシステム等の各種の制御を行う。 For example, controllers 440, 450, and 460 each perform various controls such as an attitude control system, rollover prevention system, and brake system.

スピーカー４８０は、半導体装置１からの出力信号である音声信号に基づいて音声を出力する音出力部である。 Speaker 480 is a sound output unit that outputs sound based on an audio signal that is an output signal from semiconductor device 1 .

本実施形態の移動体４００は、比較的小さな回路規模でありながら音声データのピッチ変換を行うことが可能な半導体装置１を備えていることにより、低コストで所望の音声を出力することができる。 The moving body 400 of the present embodiment is equipped with the semiconductor device 1 capable of pitch-converting voice data while having a relatively small circuit scale, so that desired voice can be output at low cost. .

このような移動体４００としては種々の移動体が考えられ、例えば、電気自動車等の自動車、ジェット機やヘリコプター等の航空機、船舶、ロケット、人工衛星等が挙げられる。 As such a moving body 400, various moving bodies are conceivable, and examples thereof include automobiles such as electric vehicles, aircraft such as jet planes and helicopters, ships, rockets, and artificial satellites.

本発明は本実施形態に限定されず、本発明の要旨の範囲内で種々の変形実施が可能である。 The present invention is not limited to this embodiment, and various modifications can be made within the scope of the present invention.

上述した実施形態および変形例は一例であって、これらに限定されるわけではない。例えば、各実施形態および各変形例を適宜組み合わせることも可能である。 The above-described embodiments and modifications are examples, and the present invention is not limited to these. For example, it is also possible to appropriately combine each embodiment and each modification.

本発明は、実施の形態で説明した構成と実質的に同一の構成、例えば、機能、方法及び結果が同一の構成、あるいは目的及び効果が同一の構成を含む。また、本発明は、実施の形態で説明した構成の本質的でない部分を置き換えた構成を含む。また、本発明は、実施の形態で説明した構成と同一の作用効果を奏する構成又は同一の目的を達成することができる構成を含む。また、本発明は、実施の形態で説明した構成に公知技術を付加した構成を含む。 The present invention includes configurations that are substantially the same as the configurations described in the embodiments, for example, configurations that have the same function, method and result, or configurations that have the same purpose and effect. Moreover, the present invention includes configurations obtained by replacing non-essential portions of the configurations described in the embodiments. In addition, the present invention includes a configuration that achieves the same effects or achieves the same purpose as the configurations described in the embodiments. In addition, the present invention includes configurations obtained by adding known techniques to the configurations described in the embodiments.

１…半導体装置、１０…音声データ格納部、２０…クロック生成部、３０…時間スケール変換部、３１…マイクロコントロールユニット、３２…デジタル信号プロセッサー、４０…リサンプラー、５０…レジスター、６０…Ｄ／Ａ変換器、７０…ローパスフィルター、８０…セレクター、９０…セレクター、１００…制御部、１１０…ＲＯＭ、１１１…時間スケール変換プログラム、１２０…ＲＡＭ、１３０…プロセッサー、２００…バス、３００…電子機器、３３０…操作部、３４０…記憶部、３５０…通信部、３６０…表示部、３７０…音出力部、４００…移動体、４４０，４５０，４６０…コントローラー、４７０…バッテリー、４８０…スピーカー DESCRIPTION OF SYMBOLS 1... Semiconductor device 10... Audio data storage part 20... Clock generation part 30... Time scale conversion part 31... Micro control unit 32... Digital signal processor 40... Resampler 50... Register 60... D/ A converter 70 Low-pass filter 80 Selector 90 Selector 100 Control unit 110 ROM 111 Time scale conversion program 120 RAM 130 Processor 200 Bus 300 Electronic device DESCRIPTION OF SYMBOLS 330... Operation part 340... Storage part 350... Communication part 360... Display part 370... Sound output part 400... Mobile body 440, 450, 460... Controller, 470... Battery, 480... Speaker

Claims

an audio data storage unit that stores first audio data including a plurality of samples sampled in a first period;
a clock generator that generates a first clock signal and a second clock signal;
A time scale for converting the first audio data into second audio data in which part of the plurality of samples included in the first audio data are deleted or duplicated in synchronization with the first clock signal. a conversion unit;
Synchronizing with the second clock signal , the second audio data is converted into third audio data sampled at the first period and including a plurality of samples of a number different from that of the second audio data. A resampler, which is a dedicated circuit for conversion,
and a register storing control information for controlling operations of the time scale converter and the resampler.

2. The semiconductor device according to claim 1, wherein said third audio data is data of the same number of samples as said first audio data, the pitch of said first audio data being changed.

3. The semiconductor device according to claim 1, wherein said audio data storage section, said clock generation section, said time scale conversion section, said resampler, and said register are included in a one-chip integrated circuit device.

the control information includes information for setting the operation mode of the semiconductor device to one of a plurality of operation modes including a first operation mode, a second operation mode, and a third operation mode;
the first operation mode is an operation mode in which the pitch is changed without changing the reproduction time of the first audio data;
the second operation mode is an operation mode in which the reproduction time is changed without changing the pitch of the first audio data;
4. The semiconductor device according to claim 1, wherein said third operation mode is an operation mode for changing reproduction time and pitch of said first audio data.

5. A D/A converter for converting said third audio data into an analog signal.
The semiconductor device according to any one of 1.

A semiconductor device according to any one of claims 1 to 5;
An electronic device, comprising: a sound output unit that outputs sound based on an output signal from the semiconductor device.

A semiconductor device according to any one of claims 1 to 5;
and a sound output unit that outputs sound based on the output signal from the semiconductor device.