JP5106664B1

JP5106664B1 - Transmission device, reception device, transmission method, reception method, communication system, communication method, and program

Info

Publication number: JP5106664B1
Application number: JP2011178578A
Authority: JP
Inventors: ウィルアーチェルアレンツ; ウダーナバンダーラ
Original assignee: Rakuten Inc
Current assignee: Rakuten Group Inc
Priority date: 2011-08-17
Filing date: 2011-08-17
Publication date: 2012-12-26
Anticipated expiration: 2031-08-17
Also published as: JP2013042399A

Abstract

【課題】特別な通信デバイスを付加しなくても、近くにある装置同士で情報をやりとりすることのできる技術を提供すること。
【解決手段】送信装置は、送信データを取得する手段と、前記送信データが符号化されてなるＰＣＭ形式の出力音声データを出力する符号化手段と、前記出力音声データをＤＡ変換器に変換させることにより、当該出力音声データに応じた音声をスピーカに出力させる出力手段と、を含む。受信装置は、マイクに入力される音声を示すＰＣＭ形式の入力音声データをＡＤ変換器を介して取得する入力手段と、前記入力音声データに基づいて前記送信データを復号する復号手段と、を含む。
【選択図】図２Provided is a technique capable of exchanging information between nearby devices without adding a special communication device.
A transmission apparatus includes means for acquiring transmission data, encoding means for outputting output audio data in PCM format obtained by encoding the transmission data, and converting the output audio data to a DA converter. Output means for outputting a sound corresponding to the output sound data to a speaker. The receiving device includes input means for acquiring input voice data in PCM format indicating voice input to the microphone via an AD converter, and decoding means for decoding the transmission data based on the input voice data. .
[Selection] Figure 2

Description

本発明は送信装置、受信装置、送信方法、受信方法、通信システム、通信方法およびプログラムに関する。 The present invention relates to a transmission device, a reception device, a transmission method, a reception method, a communication system, a communication method, and a program.

従来、比較的近くの装置の間で情報をやりとりする際には、赤外線通信などのデバイスを用いることが多かった。例えば、従来の携帯電話機は、赤外線通信用のデバイスを介して連絡先といった情報をやりとりする機能を有するものが多い。特許文献１には、赤外線通信を用いて携帯電話間で情報の送受信を行う技術が記載されている。 Conventionally, when exchanging information between relatively close apparatuses, devices such as infrared communication are often used. For example, many conventional mobile phones have a function of exchanging information such as a contact information through a device for infrared communication. Patent Document 1 describes a technique for transmitting and receiving information between mobile phones using infrared communication.

特開平１１−１５４９１６号公報JP-A-11-154916

近くにある装置の間で通信をするために赤外線通信のような特別なデバイスを用いる場合、送信側の装置および受信側の装置にそのデバイスが装備されている必要がある。例えば、近年はスマートフォンのように赤外線通信の機能を有さないものが急速に普及しており、近くにあるスマートフォン等の間で情報をやりとりすることが難しくなっている。なお、通信の方法としては無線ＬＡＮなどの電波を用いることも考えられるが、通信相手を位置関係により特定することが難しいため、セキュリティや利便性の問題が生じやすい。 When a special device such as infrared communication is used to communicate between nearby devices, the device on the transmitting side and the device on the receiving side must be equipped with the device. For example, in recent years, devices such as smartphones that do not have an infrared communication function are rapidly spreading, and it is difficult to exchange information between nearby smartphones and the like. Although it is conceivable to use radio waves such as a wireless LAN as a communication method, it is difficult to specify a communication partner based on a positional relationship, so that security and convenience problems are likely to occur.

本発明は上記課題を鑑みてなされたものであって、その目的は、特別な通信デバイスを付加しなくても、近くにある装置同士で情報をやりとりすることのできる技術を提供することにある。 The present invention has been made in view of the above problems, and an object of the present invention is to provide a technology capable of exchanging information between nearby devices without adding a special communication device. .

上記課題を解決するために、本発明にかかる送信装置は、送信データを取得する手段と、前記送信データが符号化されてなるＰＣＭ形式の出力音声データを出力する符号化手段と、前記出力音声データをＤＡ変換器に変換させることにより当該出力音声データに応じた音声をスピーカに出力させる出力手段と、を含むことを特徴とする。 In order to solve the above-mentioned problems, a transmitting apparatus according to the present invention includes a means for acquiring transmission data, an encoding means for outputting output audio data in PCM format obtained by encoding the transmission data, and the output audio. Output means for outputting a sound corresponding to the output sound data to a speaker by converting the data to a DA converter.

また、本発明にかかる受信装置は、マイクに入力される音声であって、送信データが符号化されてなる音声を示すＰＣＭ形式の入力音声データをＡＤ変換器を介して取得する入力手段と、前記入力音声データに基づいて前記送信データを復号する復号手段と、を含むことを特徴とする。 Further, the receiving device according to the present invention is an input means for acquiring, through an AD converter, PCM format input voice data indicating voice input to a microphone and indicating voice obtained by encoding transmission data; Decoding means for decoding the transmission data based on the input voice data.

また、本発明にかかる送信方法は、送信データを取得するステップと、前記送信データが符号化されてなるＰＣＭ形式の出力音声データを出力する符号化ステップと、前記出力音声データをＤＡ変換器に変換させることにより、当該出力音声データに応じた音声をスピーカに出力させるステップと、を含むことを特徴とする。 The transmission method according to the present invention includes a step of acquiring transmission data, an encoding step of outputting output audio data in PCM format obtained by encoding the transmission data, and the output audio data to a DA converter. And a step of causing the speaker to output a sound corresponding to the output sound data by performing the conversion.

また、本発明にかかる受信方法は、マイクに入力される音声であって、送信データが符号化されてなる音声に基づいてＡＤ変換器を介してＰＣＭ形式の入力音声データを取得する入力ステップと、前記入力音声データに基づいて前記送信データを復号する復号ステップと、を含むことを特徴とする。 According to another aspect of the present invention, there is provided a receiving method for obtaining PCM format input voice data via an AD converter based on voice input to a microphone and encoded transmission data; And a decoding step of decoding the transmission data based on the input voice data.

また、本発明にかかる送信用のプログラムは、送信データを取得する手段、前記送信データが符号化されてなるＰＣＭ形式の出力音声データを出力する符号化手段、および、前記出力音声データをＤＡ変換器に変換させることにより当該出力音声データに応じた音声をスピーカに出力させる出力手段、としてコンピュータを機能させることを特徴とする。 The program for transmission according to the present invention includes means for acquiring transmission data, encoding means for outputting output audio data in PCM format obtained by encoding the transmission data, and DA conversion of the output audio data. The computer is caused to function as output means for outputting a sound corresponding to the output sound data to a speaker by converting the sound into a device.

また、本発明にかかる受信用のプログラムは、マイクに入力される音声であって、送信データが符号化されてなる音声を示すＰＣＭ形式の入力音声データをＡＤ変換器を介して取得する入力手段、および、前記入力音声データに基づいて前記送信データを復号する復号手段、としてコンピュータを機能させることを特徴とする。 Further, the reception program according to the present invention is an input means for acquiring, through an AD converter, PCM format input voice data indicating voice input to a microphone and indicating voice obtained by encoding transmission data. And a computer functioning as decoding means for decoding the transmission data based on the input voice data.

また、本発明にかかる通信システムは、上述の送信装置と受信装置を含むことを特徴とする。 A communication system according to the present invention includes the above-described transmission device and reception device.

また、本発明にかかる通信方法は、送信データを取得するステップと、前記送信データが符号化されてなるＰＣＭ形式の出力音声データを出力する符号化ステップと、前記出力音声データをＤＡ変換器に変換させることにより、当該出力音声データに応じた音声をスピーカに出力させるステップと、前記スピーカが出力しマイクに入力される音声を示すＰＣＭ形式の入力音声データをＡＤ変換器を介して取得する入力ステップと、前記入力音声データに基づいて前記送信データを復号する復号ステップと、を含むことを特徴とする。 The communication method according to the present invention includes a step of acquiring transmission data, an encoding step of outputting output audio data in PCM format obtained by encoding the transmission data, and the output audio data to a DA converter. A step of causing the speaker to output a sound corresponding to the output sound data by conversion, and an input for acquiring input sound data in PCM format indicating the sound output from the speaker and input to the microphone via the AD converter And a decoding step of decoding the transmission data based on the input voice data.

本発明によれば、ＤＡ変換器の出力に応じて音を出すスピーカを含む装置と、ＡＤ変換器に音が変換された電圧信号を出力するマイクがある装置とを用いることにより、特別なデバイスを付加しなくてもそれらの装置の間で情報をやりとりすることができる。例えば、スマートフォン同士や、テレビとスマートフォンとの間などで情報をやりとりすることができる。 According to the present invention, a special device is obtained by using an apparatus including a speaker that emits sound according to the output of the DA converter and an apparatus including a microphone that outputs a voltage signal obtained by converting the sound into the AD converter. Information can be exchanged between these devices even without adding. For example, information can be exchanged between smartphones or between a TV and a smartphone.

本発明の一態様では、前記出力音声データは、非可聴周波数帯の音声を示すデータであってもよい。 In one aspect of the present invention, the output sound data may be data indicating sound in a non-audible frequency band.

本発明の一態様では、前記非可聴周波数帯は、２０ｋＨｚ以上であってもよい。 In one aspect of the present invention, the inaudible frequency band may be 20 kHz or more.

本発明の一態様では、受信装置は、前記入力音声データが示す音声から非可聴周波数帯の音声を抽出した抽出音声データを生成する抽出手段をさらに含み、前記復号手段は、前記抽出音声データに基づいて前記送信データを復号してもよい。 In one aspect of the present invention, the receiving device further includes extraction means for generating extracted voice data obtained by extracting a sound in a non-audible frequency band from the voice indicated by the input voice data, and the decoding means adds the extracted voice data to the extracted voice data. Based on this, the transmission data may be decoded.

本発明の一態様では、前記符号化手段は、前記出力音声データが示す音声の振幅変化を抑制するように当該出力音声データを出力してもよい。 In one aspect of the present invention, the encoding means may output the output sound data so as to suppress a change in sound amplitude indicated by the output sound data.

本発明の一態様では、前記符号化手段は、前記出力音声データが前記送信データに応じて複数の有音区間と隣り合う有音区間の間の無音区間とを有する音声を示し、かつ前記各有音区間は振幅が徐増する区間および振幅が徐減する区間のうち少なくとも一方を含むように当該出力音声データを出力してもよい。 In one aspect of the present invention, the encoding means indicates the speech in which the output speech data includes a plurality of speech segments and a silent segment between adjacent speech segments according to the transmission data, and The output voice data may be output so that the voiced section includes at least one of a section where the amplitude gradually increases and a section where the amplitude gradually decreases.

本発明の一態様では、前記複数の有音期間のそれぞれの時間幅は、予め定められた所定数の時間幅のうちいずれかであってよい。 In one aspect of the present invention, a time width of each of the plurality of sound periods may be any of a predetermined number of time widths.

本発明の一態様では、前記復号手段は、前記抽出音声データが示す複数の有音区間であって、前記マイクに順に入力される複数の有音区間のそれぞれの時間の長さに基づいて前記送信データを復号してもよい。 In one aspect of the present invention, the decoding means is a plurality of sound segments indicated by the extracted speech data, and is based on the length of time of each of the plurality of sound segments input in order to the microphone. The transmission data may be decoded.

本発明の実施形態にかかる通信システムの構成の一例を示す図である。It is a figure which shows an example of a structure of the communication system concerning embodiment of this invention. 本発明の実施形態にかかる通信システムの機能を示す機能ブロック図である。It is a functional block diagram which shows the function of the communication system concerning embodiment of this invention. 送信装置の処理フローの一例を示す図である。It is a figure which shows an example of the processing flow of a transmitter. 出力音声データが示す音声の時間推移を示す図である。It is a figure which shows the time transition of the audio | voice which output audio | speech data shows. 有音区間における波形の一例を示す図である。It is a figure which shows an example of the waveform in a sound area. 受信装置の処理フローの一例を示す図である。It is a figure which shows an example of the processing flow of a receiver. テレビのスピーカを用いてデータを送信するシステムの一例を示す図である。It is a figure which shows an example of the system which transmits data using the speaker of a television.

以下では、本発明の実施形態について図面に基づいて説明する。出現する構成要素のうち同一機能を有するものには同じ符号を付し、その説明を省略する。 Hereinafter, embodiments of the present invention will be described with reference to the drawings. Of the constituent elements that appear, those having the same function are given the same reference numerals, and the description thereof is omitted.

図１は、本発明の実施形態にかかる通信システムの構成の一例を示す図である。この通信システムは、送信装置１と受信装置２とを含む。送信装置１は、プロセッサ１１、メモリ１２、表示操作部１３、ＤＡ変換器１４、およびスピーカ１５を含み、受信装置２は、プロセッサ２１、メモリ２２、表示操作部２３、ＡＤ変換器２４、およびマイク２５を含む。送信装置１はＤＡ変換器１４を介して音声用のスピーカに音声を出力させる装置であればよく、例えばスマートフォンや携帯電話であってよい。受信装置２は音声用のマイクから入力される音声を示すデータをＡＤ変換器２４を介して取得する装置であればよく、例えばスマートフォンや携帯電話であってよい。 FIG. 1 is a diagram illustrating an example of a configuration of a communication system according to an embodiment of the present invention. This communication system includes a transmission device 1 and a reception device 2. The transmission device 1 includes a processor 11, a memory 12, a display operation unit 13, a DA converter 14, and a speaker 15. The reception device 2 includes a processor 21, a memory 22, a display operation unit 23, an AD converter 24, and a microphone. 25. The transmission device 1 may be any device that outputs audio to an audio speaker via the DA converter 14, and may be, for example, a smartphone or a mobile phone. The receiving device 2 may be a device that acquires data indicating sound input from a sound microphone via the AD converter 24, and may be, for example, a smartphone or a mobile phone.

プロセッサ１１およびプロセッサ２１は、それぞれメモリ１２およびメモリ２２に格納されているプログラムに従って動作しうる。またプロセッサ１１は表示操作部１３やＤＡ変換器１４を制御し、プロセッサ２１は表示操作部２３やＡＤ変換器２４を制御する。なお、上記プログラムは、インターネット等のネットワークを介して提供されるものであってもよいし、フラッシュメモリ等のコンピュータで読み取り可能な情報記録媒体に格納されて提供されるものであってもよい。 The processor 11 and the processor 21 can operate according to programs stored in the memory 12 and the memory 22, respectively. The processor 11 controls the display operation unit 13 and the DA converter 14, and the processor 21 controls the display operation unit 23 and the AD converter 24. The program may be provided via a network such as the Internet, or may be provided by being stored in a computer-readable information recording medium such as a flash memory.

メモリ１２およびメモリ２２は、ＲＡＭやフラッシュメモリ等のメモリ素子やハードディスクドライブ等によって構成されている。メモリ１２およびメモリ２２は、上記プログラムを格納する。また、メモリ１２およびメモリ２２は、各部から入力される情報や演算結果を格納する。 The memory 12 and the memory 22 are configured by memory elements such as RAM and flash memory, hard disk drives, and the like. The memory 12 and the memory 22 store the program. The memory 12 and the memory 22 store information input from each unit and calculation results.

表示操作部１３および表示操作部２３は、液晶表示パネル等の表示出力デバイスを制御する手段や、タッチパネルやリモコン等の入力デバイスを制御する手段などによって構成されている。表示操作部１３および表示操作部２３は、それぞれプロセッサ１１およびプロセッサ２１の制御に基づいて、画像データ等を表示出力デバイスに対して出力し、入力デバイスより操作者（ユーザ）からの情報を取得する。 The display operation unit 13 and the display operation unit 23 are configured by means for controlling a display output device such as a liquid crystal display panel, means for controlling an input device such as a touch panel and a remote control, and the like. The display operation unit 13 and the display operation unit 23 output image data and the like to the display output device based on the control of the processor 11 and the processor 21, respectively, and acquire information from the operator (user) from the input device. .

ＤＡ変換器１４は、プロセッサ１１の制御に基づいて、ＰＣＭ形式の出力音声データを出力信号電圧に変換する。ＤＡ変換器１４は、例えばΔΣ法を用いてＤＡ変換する。またＤＡ変換器１４は、操作者による設定に応じて出力信号電圧を変換するボリューム制御の機能も有する。 The DA converter 14 converts output audio data in PCM format into an output signal voltage based on the control of the processor 11. The DA converter 14 performs DA conversion using, for example, the ΔΣ method. The DA converter 14 also has a volume control function for converting the output signal voltage according to the setting by the operator.

スピーカ１５は、ＤＡ変換器１４が出力する出力信号電圧に応じた音声を出力する。スピーカ１５は、音声出力用のスピーカであり、もちろん可聴周波数帯の音声も出力できる。 The speaker 15 outputs sound corresponding to the output signal voltage output from the DA converter 14. The speaker 15 is a sound output speaker, and of course can output sound in an audible frequency band.

マイク２５は、入力される音声を入力信号電圧に変換する。送信装置１と通信する際にはこのマイク２５には、送信装置１が出力する音声が入力される。マイク２５は、音声入力用のマイクであり、もちろん可聴周波数帯の音声も入力できる。 The microphone 25 converts input sound into an input signal voltage. When communicating with the transmission device 1, sound output from the transmission device 1 is input to the microphone 25. The microphone 25 is a microphone for voice input, and of course can also input audio in an audible frequency band.

ＡＤ変換器２４は、プロセッサ２１の制御に基づいて、入力信号電圧をＰＣＭ形式の入力音声データに変換する。ＡＤ変換器２４は、例えばΔΣ法を用いてＡＤ変換を行う。 The AD converter 24 converts the input signal voltage into PCM format input voice data based on the control of the processor 21. The AD converter 24 performs AD conversion using, for example, the ΔΣ method.

図２は、本発明の実施形態にかかる通信システムの機能を示す機能ブロック図である。送信装置１は機能的に、送信データ取得部３１、符号化部３２、および音声出力部３３を含む。これらの機能は、プロセッサ１１がメモリ１２に格納されたプログラムを実行し、ＤＡ変換器１４や表示操作部１３等を制御することで実現される。また受信装置２は機能的に、音声入力部４１、抽出部４２および復号部４３を含む。これらの機能は、プロセッサ２１がメモリ２２に格納されたプログラムを実行し、ＡＤ変換器２４や表示操作部２３等を制御することで実現される。 FIG. 2 is a functional block diagram showing functions of the communication system according to the embodiment of the present invention. The transmission apparatus 1 functionally includes a transmission data acquisition unit 31, an encoding unit 32, and an audio output unit 33. These functions are realized by the processor 11 executing a program stored in the memory 12 and controlling the DA converter 14, the display operation unit 13, and the like. In addition, the receiving device 2 functionally includes a voice input unit 41, an extraction unit 42, and a decoding unit 43. These functions are realized by the processor 21 executing a program stored in the memory 22 and controlling the AD converter 24, the display operation unit 23, and the like.

図３は、送信装置１の処理フローの一例を示す図である。以下ではこの処理フローに従い送信装置１が有する機能を説明する。 FIG. 3 is a diagram illustrating an example of a processing flow of the transmission device 1. Below, the function which the transmission apparatus 1 has is demonstrated according to this processing flow.

送信データ取得部３１は、プロセッサ１１、メモリ１２、および表示操作部１３を中心として実現される。送信データ取得部３１は、表示操作部１３を介してユーザから入力される指示に基づいて動作し、表示操作部１３から入力される情報や、メモリ１２に記憶された情報から送信データを取得する（ステップＳ１０１）。送信装置１がスマートフォンである場合は、送信データは、例えば所有者の電話番号やメールアドレスのような情報や、スマートフォンを識別するためのＩＤといった情報である。 The transmission data acquisition unit 31 is realized centering on the processor 11, the memory 12, and the display operation unit 13. The transmission data acquisition unit 31 operates based on an instruction input from the user via the display operation unit 13, and acquires transmission data from information input from the display operation unit 13 or information stored in the memory 12. (Step S101). When the transmission device 1 is a smartphone, the transmission data is information such as information such as an owner's phone number or email address, or an ID for identifying the smartphone.

符号化部３２は、プロセッサ１１およびメモリ１２を中心として実現される。符号化部３２は、送信データが符号化されてなるＰＣＭ形式の出力音声データを出力する。 The encoding unit 32 is realized centering on the processor 11 and the memory 12. The encoding unit 32 outputs output audio data in PCM format obtained by encoding transmission data.

出力音声データの形式について説明する。図４は、出力音声データが示す音声の時間推移を示す図である。出力音声データが示す音声は、出力開始を示す同期信号５１と、出力終了を示す同期信号５２と、時間的にみて同期信号５１と同期信号５２とに挟まれる複数の伝送信号区間５３とからなる。これらの複数の伝送信号区間５３は、送信データが変調された信号である。各伝送信号区間５３は、音声が出力される有音区間であり、その出力される音声は、予め定められた２種類の音声のうち何れかである。各伝送信号区間５３は、送信データを構成するビット配列のうちいずれかに対応する。ここでは、伝送信号区間５３に出力される２種類の音声は、互いにその時間幅が異なる。具体的には、値が１となるビットに対応する伝送信号区間５３の時間幅Ｔｔ１と、値が０となるビットに対応する伝送信号区間５３の時間幅Ｔｔ０とが異なっている。本実施形態の例では、時間幅Ｔｔ１は６ｍｓ、時間幅Ｔｔ０は２ｍｓである。また、隣り合う伝送信号区間５３の間には無音区間が配置され、また同期信号５１，５２とそれらの隣の伝送信号区間５３との間にも無音区間が配置される。無音区間の時間幅Ｔｎは１０ｍｓである。なお、本実施形態では時間幅Ｔｔ１と時間幅Ｔｔ０とが異なっていれば、時間幅Ｔｔ０，Ｔｔ１，Ｔｎの時間幅は上述のものと異なっていてもよい。 The format of the output audio data will be described. FIG. 4 is a diagram showing a time transition of the voice indicated by the output voice data. The audio indicated by the output audio data includes a synchronization signal 51 indicating the start of output, a synchronization signal 52 indicating the end of output, and a plurality of transmission signal sections 53 sandwiched between the synchronization signal 51 and the synchronization signal 52 in terms of time. . The plurality of transmission signal sections 53 are signals obtained by modulating transmission data. Each transmission signal section 53 is a voiced section in which sound is output, and the output sound is one of two types of predetermined sounds. Each transmission signal section 53 corresponds to one of the bit arrangements constituting the transmission data. Here, the two types of sound output to the transmission signal section 53 have different time widths. Specifically, the time width Tt1 of the transmission signal section 53 corresponding to a bit having a value of 1 is different from the time width Tt0 of the transmission signal section 53 corresponding to a bit having a value of 0. In the example of the present embodiment, the time width Tt1 is 6 ms and the time width Tt0 is 2 ms. Further, a silent section is arranged between adjacent transmission signal sections 53, and a silent section is also arranged between the synchronization signals 51 and 52 and the adjacent transmission signal section 53. The time interval Tn of the silent section is 10 ms. In the present embodiment, as long as the time width Tt1 and the time width Tt0 are different, the time widths of the time widths Tt0, Tt1, and Tn may be different from those described above.

次に符号化部３２の処理の詳細について説明する。符号化部３２は、まず出力音声データに同期信号５１を示す音声データを設定し（ステップＳ１０２）、無音区間を示す音声データを追加する（ステップＳ１０３）。次に変数ｉに１を代入し（ステップＳ１０４）、複数の伝送信号区間５３を出力音声データに追加する処理をする。 Next, details of the processing of the encoding unit 32 will be described. The encoding unit 32 first sets audio data indicating the synchronization signal 51 in the output audio data (step S102), and adds audio data indicating a silent period (step S103). Next, 1 is substituted into the variable i (step S104), and processing for adding a plurality of transmission signal sections 53 to the output audio data is performed.

符号化部３２は、送信データからｉ番目のビットを取得する（ステップＳ１０５）。ここで、送信データは順序づけられて並ぶｎ個のビットの集合とみなす。次に、取得されたビットに応じた時間幅を有する有音区間を示す音声データを出力音声データに追加し（ステップＳ１０６）、さらに出力音声データに無音区間を示す音声データを追加する（ステップＳ１０７）。そして変数ｉの値を１増やし（ステップＳ１０８）、送信データを構成する全て（ｎ個）のビットについて処理を行っていなければ（ステップＳ１０９のＮ）、ステップＳ１０５から処理を繰り返す。全てのビットについて処理を行っていれば（ステップＳ１０９のＹ）、出力音声データに同期信号５２を示す音声データを追加する（ステップＳ１１０）。これらの処理により、符号化部３２は送信データを２進数のビット列とした場合の各ビットに対応する種類の音声を順に出力する出力音声データを出力する。 The encoding unit 32 acquires the i-th bit from the transmission data (step S105). Here, the transmission data is regarded as a set of n bits arranged in order. Next, voice data indicating a voiced section having a time width corresponding to the acquired bit is added to the output voice data (step S106), and voice data indicating a silent section is further added to the output voice data (step S107). ). Then, the value of the variable i is incremented by 1 (step S108), and if all (n) bits constituting the transmission data are not processed (N in step S109), the processing is repeated from step S105. If all the bits have been processed (Y in step S109), audio data indicating the synchronization signal 52 is added to the output audio data (step S110). Through these processes, the encoding unit 32 outputs output audio data that sequentially outputs audio of the type corresponding to each bit when the transmission data is a binary bit string.

音声出力部３３は、プロセッサ１１、メモリ１２を中心として実現される。音声出力部３３は、出力音声データをＤＡ変換器１４に変換させることによりその出力音声データが示す音声をスピーカ１５に出力させる。具体的には、まず音声出力部３３は、出力音声データをＤＡ変換器１４に出力し（ステップＳ１１１）、ＤＡ変換器１４はその出力音声データを出力信号電圧に変換する。ＤＡ変換器１４から出力される出力信号電圧はスピーカ１５に入力され、スピーカ１５は出力信号電圧に応じた音声を出力する。 The audio output unit 33 is realized centering on the processor 11 and the memory 12. The audio output unit 33 converts the output audio data to the DA converter 14 to output the audio indicated by the output audio data to the speaker 15. Specifically, the audio output unit 33 first outputs the output audio data to the DA converter 14 (step S111), and the DA converter 14 converts the output audio data into an output signal voltage. The output signal voltage output from the DA converter 14 is input to the speaker 15, and the speaker 15 outputs sound corresponding to the output signal voltage.

ここで、有音区間の信号の詳細について説明する。図５は、有音区間における波形の一例を示す図である。実線は有音区間の音声の波形を示し、破線はその波形の振幅の時間変化を示す。有音区間においては、非可聴周波数帯、具体的には２０ｋＨｚ以上かつ出力可能な最大周波数（２５ｋＨｚ程度）以下の周波数の音声が出力されている。非可聴周波数帯の音声を通信に用いることで、周りの人間が不快なノイズを認識することを防ぐことができる。また、符号化部３２は有音区間における音声の振幅の変化を抑制するように出力音声データを設定する。より具体的には、出力音声データが示す各有音区間は、音声の振幅が徐増する区間と、音声の振幅が徐減する区間とを有する。音声の振幅が徐増する区間は、無音区間から有音区間に切り替わった後の一定の区間であり、音声の振幅が徐減する区間は、有音区間から無音区間に切り替わる前の一定の区間である。本実施形態では、上述の特徴を有する有音区間の振幅は、有音区間の時間幅の２倍の周期のサイン関数から求められている。 Here, the detail of the signal of a sound section is demonstrated. FIG. 5 is a diagram illustrating an example of a waveform in a sound section. The solid line indicates the waveform of the voice in the voiced section, and the broken line indicates the time change of the amplitude of the waveform. In the voiced section, sound having a frequency of a non-audible frequency band, specifically, a frequency of 20 kHz or more and a maximum frequency (about 25 kHz) that can be output is output. By using sound in a non-audible frequency band for communication, surrounding humans can be prevented from recognizing unpleasant noise. Also, the encoding unit 32 sets the output voice data so as to suppress the change in the voice amplitude in the voiced section. More specifically, each voiced section indicated by the output voice data has a section in which the voice amplitude gradually increases and a section in which the voice amplitude gradually decreases. The section where the amplitude of the voice gradually increases is a constant section after switching from the silent section to the voice section, and the section where the amplitude of the voice gradually decreases is a fixed section before switching from the voice section to the silent section It is. In the present embodiment, the amplitude of the sound section having the above-described characteristics is obtained from a sine function having a period twice as long as the time width of the sound section.

有音区間におけるタイムステップｘにおける波形ｆ（ｘ）を示す式を以下に示す。ここで、ＰＣＭ形式の出力音声データは１６ビットであり、−３２７６８〜３２７６７の値を取ることができるとし、１タイムステップは１秒をサンプリングレートで割った時間とする。 An expression showing the waveform f (x) at time step x in the sound section is shown below. Here, the output audio data in the PCM format is 16 bits and can take values of -32768 to 32767, and one time step is a time obtained by dividing one second by the sampling rate.

ここで、γは出力音声データのサンプリングレートを非可聴周波数帯の音声の周波数で割った値である。サンプリングレートは通信に用いる非可聴周波数帯の周波数の２倍以上かつ自然数倍にするとよい。サンプリング周期と音声の周期との違いにより目的とする周波数より低い周波数が出力されることを防ぐことができるからである。本実施形態では、サンプリングレートが４４．１ｋＨｚ、音声の周波数は２２．０５ｋＨｚとしている。Ａは、有音区間のデータとして取りうる最大値を示している。Ｂは、サンプリングレートを振幅変化を示す周波数１０００でわった値である。なお、実際の出力音声データは、ｆ（ｘ）が示す値のうち小数部の情報は切り捨てられている。 Here, γ is a value obtained by dividing the sampling rate of the output audio data by the frequency of the audio in the inaudible frequency band. The sampling rate is preferably at least twice the frequency of the non-audible frequency band used for communication and a natural number. This is because a frequency lower than the target frequency can be prevented from being output due to the difference between the sampling period and the voice period. In this embodiment, the sampling rate is 44.1 kHz and the audio frequency is 22.05 kHz. A indicates a maximum value that can be taken as data of a voiced section. B is a value obtained by dividing the sampling rate by a frequency 1000 indicating an amplitude change. In the actual output audio data, the decimal part of the value indicated by f (x) is truncated.

ｆ（ｘ）の式は、非可聴周波数帯の音声を示すサイン波に、振幅の時間変化を抑制するサイン波をかけた式になっている。これにより、以下のｆｓ（ｘ）の式が示す有音区間における振幅を一定とするサイン波に比べ振幅の時間変化が抑制される。 The expression of f (x) is an expression obtained by applying a sine wave that suppresses a change in amplitude over time to a sine wave indicating sound in a non-audible frequency band. Thereby, the time change of an amplitude is suppressed compared with the sine wave which makes the amplitude in the sound area shown by the following formula | equation of fs (x) constant.

ここで、前述のように送信装置１のスピーカ１５は音声用のスピーカであるので、２０ｋＨｚ以上の非可聴周波数帯の音を出力する場合、物理的な特性のために出力信号電圧の変化に追随できない現象が起きる場合がある。その現象がおきると、スピーカ１５からより低い周波数のノイズのような音が出力される。上述の現象は、ｆｓ（ｘ）が示す有音区間の音声を出力する場合にはほぼ確実に発生するが、ｆ（ｘ）のように振幅の時間変化を制限すると、スピーカ１５が出力信号電圧の時間変化に追随しやすくなるため発生しなくなる。これにより、通信時にスピーカ１５が発生するノイズであってユーザが認識できるノイズを大幅に減少させることができる。 Here, as described above, the speaker 15 of the transmission device 1 is an audio speaker. Therefore, when a sound in an inaudible frequency band of 20 kHz or higher is output, a change in the output signal voltage follows the physical characteristics. Unusable phenomenon may occur. When this phenomenon occurs, a sound like noise at a lower frequency is output from the speaker 15. The above phenomenon occurs almost certainly when the sound in the voiced section indicated by fs (x) is output. However, when the time variation of the amplitude is limited as in f (x), the speaker 15 outputs the output signal voltage. It will not occur because it becomes easier to follow the time change. Thereby, it is possible to greatly reduce the noise generated by the speaker 15 during communication and recognized by the user.

なお、符号化部３２が出力する出力音声データにおける１つの伝送信号区間５３が、必ずしも１ビットのデータを表さなくてもよい。例えば、１つの伝送信号区間５３の時間幅の種類を４種類にし、１つの伝送信号区間５３が送信データの２ビットに対応させてもよい。また送信データをｋ進数（ｋ＞３）の値の配列に変換し、ｋ種類の音声のうちそれぞれの値に対応するものを出力するように符号化部３２が出力音声データを出力してもよい。 Note that one transmission signal section 53 in the output audio data output from the encoding unit 32 does not necessarily represent 1-bit data. For example, four types of time widths of one transmission signal section 53 may be used, and one transmission signal section 53 may correspond to 2 bits of transmission data. Further, even if the encoding unit 32 outputs the output voice data so as to convert the transmission data into an array of k-ary (k> 3) values and output k kinds of voices corresponding to the respective values. Good.

図６は、受信装置２の処理フローの一例を示す図である。以下ではこの処理フローに従い受信装置２が有する機能を説明する。 FIG. 6 is a diagram illustrating an example of a processing flow of the receiving device 2. Below, the function which the receiving apparatus 2 has according to this processing flow is demonstrated.

音声入力部４１は、プロセッサ２１、メモリ２２および表示操作部２３を中心として実現される。音声入力部４１は、表示操作部２３を介してユーザから入力される指示に基づいて、ＡＤ変換器２４からＰＣＭ形式の入力音声データを取得する（ステップＳ２０１）。この際、マイク２５に送信装置１の出力する音声を入力させることで、入力音声データに送信データが符号化されてなる音声のデータを含むようにする。なお、本実施形態では、サンプリングレートは送信装置１と同じサンプリングレートとしている。 The voice input unit 41 is realized centering on the processor 21, the memory 22, and the display operation unit 23. The voice input unit 41 acquires PCM format input voice data from the AD converter 24 based on an instruction input from the user via the display operation unit 23 (step S201). At this time, the sound output from the transmission device 1 is input to the microphone 25 so that the input sound data includes sound data obtained by encoding the transmission data. In the present embodiment, the sampling rate is the same as that of the transmission apparatus 1.

抽出部４２は、プロセッサ２１およびメモリ２２を中心として実現される。抽出部４２は、音声入力部４１が取得した入力音声データから非可聴周波数帯の音声を抽出し（ステップＳ２０２）、抽出された音声を示すデータをメモリ２２に格納する。抽出部４２は、非可聴周波数帯の音声を抽出するためにバンドパスフィルタ（bandpass filter）を用いる。この処理を行うことにより、周囲の人の声などのノイズの影響を取り除くことができる。雑音が少ない環境で通信を行う前提であれば、この処理を行なわずに復号部４３に入力音声データをそのまま渡してもよい。 The extraction unit 42 is realized centering on the processor 21 and the memory 22. The extraction unit 42 extracts a non-audible frequency band sound from the input sound data acquired by the sound input unit 41 (step S202), and stores data indicating the extracted sound in the memory 22. The extraction unit 42 uses a bandpass filter in order to extract sound in an inaudible frequency band. By performing this process, it is possible to remove the influence of noise such as the voices of surrounding people. If communication is performed in an environment with little noise, the input voice data may be passed to the decoding unit 43 as it is without performing this process.

復号部４３は、プロセッサ２１およびメモリ２２を中心として実現される。復号部４３は抽出された音声を示すデータに基づいて送信データを復号する。まず、復号部４３は抽出された音声の振幅を正規化する（ステップＳ２０３）。これにより、マイク２５に入力される音声の大きさが通信するごとに異なっていても、後続の処理で問題が生じなくなる。 The decoding unit 43 is realized centering on the processor 21 and the memory 22. The decoding unit 43 decodes the transmission data based on the data indicating the extracted voice. First, the decoding unit 43 normalizes the amplitude of the extracted speech (step S203). Thereby, even if the magnitude of the sound input to the microphone 25 is different every time communication is performed, no problem occurs in the subsequent processing.

復号部４３は、次に有音区間と無音区間との境界を検出し、複数の有音区間のそれぞれの時間幅を検出する（ステップＳ２０４）。復号部４３は、検出された各有音期間の時間幅に応じて、ビットの値の配列を取得し（ステップＳ２０５）、取得した値の配列を復号された受信データとしてメモリ２２等に出力する（ステップＳ２０６）。ここで、復号部４３はＣＲＣ訂正などの公知の誤り訂正技術を用いて受信データの精度を向上させてもよい。また、送信装置１が出力する伝送信号区間５３の時間幅の種類が３以上であれば、それに応じた値の配列を取得して受信データを復号してもよい。 Next, the decoding unit 43 detects the boundary between the voiced section and the silent section, and detects the time width of each of the plurality of voiced sections (step S204). The decoding unit 43 acquires an array of bit values according to the detected duration of each sound period (step S205), and outputs the acquired array of values to the memory 22 or the like as decoded received data. (Step S206). Here, the decoding unit 43 may improve the accuracy of the received data using a known error correction technique such as CRC correction. Further, if the type of time width of the transmission signal section 53 output by the transmission device 1 is 3 or more, the received data may be decoded by obtaining an array of values corresponding to the time width.

上述の技術により、ＤＡ変換器１４に接続されるスピーカ１５を含む送信装置１と、ＡＤ変換器２４に接続されるマイク２５を含む受信装置２とがあれば、他のデバイスを付加しなくても通信を行うことができる。非可聴周波数帯の音声は電波に比べれば指向性が強いので、送信内容を誤って別の装置が受信する恐れも少なくなり、電波を用いる場合のように混信を防ぐための機能を付加したり操作を複雑化する必要もない。しかも、非可聴周波数帯の音声を用い、かつノイズの発生も抑えられているので、スピーカを用いて通信を行っているにも関わらず、その通信により人が不快な音を認識することはない。そのため、人体への影響も少ない。 If the transmitter 1 including the speaker 15 connected to the DA converter 14 and the receiver 2 including the microphone 25 connected to the AD converter 24 are provided by the above-described technique, other devices need not be added. Can also communicate. Since sound in the non-audible frequency band is more directional than radio waves, there is less risk of another device receiving the wrong transmission content, and a function to prevent interference, such as when using radio waves, is added. There is no need to complicate the operation. In addition, since non-audible frequency band audio is used and noise generation is suppressed, even though communication is performed using a speaker, a person does not recognize an unpleasant sound. . Therefore, there is little influence on the human body.

また、ＤＡ変換器１４とスピーカ１５とを有する機器のほとんどはユーザによるボリューム調整が可能であるので、例えばセキュリティと通信エラーとのバランスを任意に調整することができる。ユーザが、送信側の機器と受信側の機器との距離が近い（例えば５ｃｍ）場合には音量を小さくし、遠い（例えば１ｍ）場合は音量を大きくするといった調整をすればよいからである。 Also, since most of the devices having the DA converter 14 and the speaker 15 can be adjusted by the user, for example, the balance between security and communication error can be arbitrarily adjusted. This is because the user may make an adjustment such that the volume is reduced when the distance between the transmission-side device and the reception-side device is short (for example, 5 cm), and the volume is increased when the distance is long (for example, 1 m).

また、上述の説明では送信装置１が出力する音声の時間幅を用いて送信データを符号化しているが、他の変調方法を用いて送信データを符号化してもよい。例えば、ＡＭ変調方式のように音声の振幅の違いによって送信するビット配列の各ビットを表してもよいし、ＦＭ変調方式のように音声の周波数の違いによってビット配列の各ビットを表してもよい。ただし、音声の時間幅を用いた方が、前者に比べてノイズ等の環境変化に強く、また後者に比べて復号に必要な計算量が少ない。 In the above description, the transmission data is encoded using the time width of the sound output from the transmission device 1, but the transmission data may be encoded using other modulation methods. For example, each bit of the bit array to be transmitted may be represented by a difference in sound amplitude as in the AM modulation method, or each bit of the bit array may be represented by a difference in sound frequency as in the FM modulation method. . However, using the time width of speech is more resistant to environmental changes such as noise than the former, and requires less computation for decoding than the latter.

これまでに説明した通信方法は、スマートフォンや携帯電話機などでは特に有効である。スマートフォンや携帯電話機のように音声を入出力するデジタル対応の携帯端末には必ずＤＡ変換器１４とそれに接続されるスピーカ１５、ＡＤ変換器２４とそれに接続されるマイク２５を装備している。この特徴により、スマートフォンや携帯電話機にプログラムをインストールすることで、それらが送信装置１や受信装置２の機能を実現できる。つまり、外付けの通信デバイスを購入したり接続したりすることなく携帯端末を用いた通信を実現できる。例えば、２台のスマートフォンのうち一方又は両方が赤外線通信機能を有していないような場合であっても、プログラムをインストールすることで、それらの携帯電話機の間でメールアドレスの交換や、ＵＲＬやメッセージの送受信等を行うことができる。もちろん携帯電話の基地局や、無線ＬＡＮの親機からの電波が来ていなくてもそれらの情報をやりとりできる。また基地局等からの電波を受信できるのであれば、非可聴帯域の音声を用いた通信を用いて、電波を介するネットワークにおいて少なくとも一方のスマートフォン等を特定する情報を送受信し、電波を介してより大きなデータを送受信することも可能である。 The communication methods described so far are particularly effective for smartphones and mobile phones. A digital-compatible portable terminal that inputs and outputs audio, such as a smartphone or a mobile phone, is always equipped with a DA converter 14 and a speaker 15 connected thereto, an AD converter 24 and a microphone 25 connected thereto. With this feature, the functions of the transmission device 1 and the reception device 2 can be realized by installing a program in a smartphone or a mobile phone. That is, communication using a mobile terminal can be realized without purchasing or connecting an external communication device. For example, even if one or both of two smartphones do not have an infrared communication function, by installing a program, e-mail address exchange, URL, Messages can be sent and received. Of course, even if there is no radio wave from the base station of the mobile phone or the base unit of the wireless LAN, such information can be exchanged. In addition, if radio waves from a base station or the like can be received, information that identifies at least one smartphone etc. is transmitted and received in a network via radio waves using communication using non-audible band audio, and more It is also possible to send and receive large data.

これまでは携帯電話機やスマートフォンの間での通信のように１対１の送受信の例を説明したが、送信側の装置が複数の受信装置２に向けてデータを送信してもよい。より具体的には、例えば、出力音声データをテレビ局やラジオ局で生成し、その出力音声データに応じた音声を各家庭のテレビ７等のスピーカから出力させてよい。テレビ７の前にある複数の携帯電話機やスマートフォン等は、その音声をマイク２５を介して受信する。 So far, an example of one-to-one transmission / reception such as communication between mobile phones and smartphones has been described. However, a transmitting apparatus may transmit data to a plurality of receiving apparatuses 2. More specifically, for example, output audio data may be generated by a television station or a radio station, and audio corresponding to the output audio data may be output from a speaker of the home television 7 or the like. A plurality of mobile phones, smartphones, and the like in front of the television 7 receive the sound via the microphone 25.

図７は、テレビ７のスピーカを用いてデータを送信するシステムの一例を示す図である。局側装置６は、テレビ局に設置される。局側装置６はプロセッサ６１、記憶部６２、および送信回路６３を含む。またテレビ７は、受信回路７１、表示回路７２、ＤＡ変換器７３およびスピーカ７４を含む。局側装置６とテレビ７とをあわせた送信システムが、図１の送信装置１に対応する。局側装置６に含まれるプロセッサ６１および記憶手段６２はそれぞれ送信装置１に含まれるプロセッサ１１およびメモリ１２に対応し、テレビ７に含まれるＤＡ変換器７３およびスピーカ７４は、それぞれ送信装置１に含まれるＤＡ変換器１４およびスピーカ１５に対応する。 FIG. 7 is a diagram illustrating an example of a system that transmits data using a speaker of the television 7. The station side device 6 is installed in a television station. The station side device 6 includes a processor 61, a storage unit 62, and a transmission circuit 63. The television 7 includes a receiving circuit 71, a display circuit 72, a DA converter 73, and a speaker 74. A transmission system that combines the station-side device 6 and the television 7 corresponds to the transmission device 1 of FIG. The processor 61 and the storage means 62 included in the station side device 6 correspond to the processor 11 and the memory 12 included in the transmission device 1, respectively, and the DA converter 73 and the speaker 74 included in the television 7 are included in the transmission device 1, respectively. Corresponds to the DA converter 14 and the speaker 15.

送信回路６３はテレビ７の表示回路７２に表示させる映像データと、スピーカ７３に出力させる出力音声データとを含む放送データを、テレビ局のアンテナを介して複数のテレビ７に送信する。また受信回路７１は放送データを受信し、その放送データに含まれる映像データを表示回路に出力し、また放送データに含まれる出力音声データをＤＡ変換器７３に出力する。表示回路７２は、映像処理回路や表示パネル等を含んでおり、映像データに基づく画像を表示する。図１に示す送信装置１とこの送信システムとの違いは、プロセッサ６１および記憶手段６２と、ＤＡ変換器７３との間に、送信回路６３および受信回路７１が存在することで、データを転送する経路が長くなっている点である。 The transmission circuit 63 transmits broadcast data including video data to be displayed on the display circuit 72 of the television 7 and output audio data to be output to the speaker 73 to the plurality of televisions 7 through the antenna of the television station. The receiving circuit 71 receives broadcast data, outputs video data included in the broadcast data to the display circuit, and outputs output audio data included in the broadcast data to the DA converter 73. The display circuit 72 includes a video processing circuit, a display panel, and the like, and displays an image based on the video data. The difference between the transmission apparatus 1 shown in FIG. 1 and this transmission system is that data is transferred because the transmission circuit 63 and the reception circuit 71 exist between the processor 61, the storage means 62, and the DA converter 73. The route is longer.

この送信システムも、機能的には送信データ取得部３１、符号化部３２、および音声出力部３３を含む。送信データ取得部３１は、プロセッサ６１、記憶部６２を中心として実現される。この送信データ取得部３１は、放送するコンテンツとなる映像データおよび音声データとそれらの映像データおよび音声データ中の先頭からの時間とに基づいて、送信データを取得する。映像データおよび音声データは、記憶部６２に記憶されており、送信データは予め映像データおよび音声データとその時間的な区間とに関連づけて記憶部６２に記憶されている。 This transmission system also functionally includes a transmission data acquisition unit 31, an encoding unit 32, and an audio output unit 33. The transmission data acquisition unit 31 is realized centering on the processor 61 and the storage unit 62. The transmission data acquisition unit 31 acquires transmission data based on video data and audio data serving as content to be broadcast and a time from the beginning of the video data and audio data. The video data and audio data are stored in the storage unit 62, and the transmission data is stored in advance in the storage unit 62 in association with the video data and audio data and their time intervals.

符号化部３２は、プロセッサ６１および記憶部６２を中心として実現される。符号化部３２は、送信データが符号化されてなるＰＣＭ形式の出力音声データを出力する。その詳細は、図１に示す送信装置１で説明したものと同じである。 The encoding unit 32 is realized centering on the processor 61 and the storage unit 62. The encoding unit 32 outputs output audio data in PCM format obtained by encoding transmission data. The details are the same as those described in the transmission apparatus 1 shown in FIG.

音声出力部３３は、プロセッサ６１および記憶部６２を中心として実現される。音声出力部３３は、符号化部３２が出力する出力音声データに放送するコンテンツとなる音声データを混合させて新たな出力音声データを生成し、その新たな出力音声データと、映像データとを含む放送データを送信回路６３に出力する。そうすることで、音声出力部３３は、出力音声データをテレビ７に含まれるＤＡ変換器７３に変換させることによりその出力音声データが示す音声をスピーカ７４に出力させる。なお、音声データを出力音声データに混合しても、受信装置２は送信データを復号することができる。バンドパスフィルタによって非可聴周波数帯の音声をほとんど含まない音声データの影響が低減されるからである。 The audio output unit 33 is realized with the processor 61 and the storage unit 62 as the center. The audio output unit 33 generates new output audio data by mixing the output audio data output from the encoding unit 32 with the audio data serving as the content to be broadcast, and includes the new output audio data and video data. Broadcast data is output to the transmission circuit 63. By doing so, the audio output unit 33 causes the speaker 74 to output the audio indicated by the output audio data by converting the output audio data to the DA converter 73 included in the television 7. Note that the reception device 2 can decode the transmission data even if the audio data is mixed with the output audio data. This is because the influence of audio data containing almost no sound in the inaudible frequency band is reduced by the band pass filter.

このようにすることで、テレビ局の放送するコンテンツに応じて情報の紹介やキャンペーンの申込を行うＷｅｂページのＵＲＬや、番組名などの情報を、既存のテレビ７やスマートフォンのハードウェアを用いて送受信することができる。テレビの視聴者は従来は画面に表示されていた情報を、文字入力等の作業を経ずに取得することができ、利便性が大きく向上する。これは特にスマートフォンや携帯電話などの携帯端末を用いる場合に有利である。携帯端末は大きさの都合でＰＣのようなキーボードを装備できないため、文字入力を苦手とするからである。 In this way, information such as the URL of the Web page for introducing information and applying for a campaign according to the content broadcast by the TV station, and information such as the program name are transmitted and received using the hardware of the existing TV 7 or smartphone. can do. The viewer of the television can acquire information that has been conventionally displayed on the screen without performing an operation such as character input, and the convenience is greatly improved. This is particularly advantageous when a mobile terminal such as a smartphone or a mobile phone is used. This is because a portable terminal cannot be equipped with a keyboard like a PC due to its size, and is not good at inputting characters.

なお、局側装置６はデジタル放送を行うラジオ局に置いてもよい。この場合は、送信データ取得部３１は、放送するコンテンツとなる音声データとその先頭からの時間とに基づいて、送信データを取得し、送信回路６３は音声データを含む放送データを送信する。また、デジタル放送に対応したラジオが放送データを受信しスピーカからその音声を出力すればよい。 The station side device 6 may be placed in a radio station that performs digital broadcasting. In this case, the transmission data acquisition unit 31 acquires transmission data based on the audio data as the content to be broadcast and the time from the beginning, and the transmission circuit 63 transmits the broadcast data including the audio data. A radio compatible with digital broadcasting may receive broadcast data and output the sound from a speaker.

１送信装置、２受信装置、６局側装置、７テレビ、１１，２１，６１プロセッサ、１２，２２メモリ、１３，２３表示操作部、１４，７３ＤＡ変換器、１５，７４スピーカ、２４ＡＤ変換器、２５マイク、３１送信データ取得部、３２符号化部、３３音声出力部、４１音声入力部、４２抽出部、４３復号部、５１，５２同期信号、５３伝送信号区間、６２記憶部、６３送信回路、７１受信回路、７２表示回路、Ｔｎ，Ｔｔ０，Ｔｔ１時間幅。 DESCRIPTION OF SYMBOLS 1 Transmitter, 2 Receiver, 6 Station side apparatus, 7 Television, 11,21,61 Processor, 12,22 Memory, 13,23 Display operation part, 14,73 DA converter, 15,74 Speaker, 24 AD conversion 25, microphone, 31 transmission data acquisition unit, 32 encoding unit, 33 audio output unit, 41 audio input unit, 42 extraction unit, 43 decoding unit, 51, 52 synchronization signal, 53 transmission signal section, 62 storage unit, 63 Transmission circuit, 71 Reception circuit, 72 Display circuit, Tn, Tt0, Tt1 Time width.

Claims

Means for obtaining transmission data;
Encoding means for outputting output sound data indicating sound in a non-audible frequency band, which is output sound data in PCM format obtained by encoding the transmission data;
Output means for outputting a sound corresponding to the output sound data to a speaker by converting the output sound data to a DA converter;
Including
It said encoding means indicates a voice and a silent section between the sound period to the output audio data is adjacent to the plurality of voiced section in accordance with the transmission data, and amplitude start of each chromatic sounds interval Output the output audio data so that it shows a sine curve that increases from time to time and then decreases ,
A transmission apparatus characterized by the above.

The encoding means determines a time width of each of the plurality of sound sections according to the transmission data, and outputs output sound data indicating sound including the plurality of sound sections.
The transmission apparatus according to claim 1, wherein:

The inaudible frequency band is 20 kHz or more,
The transmission apparatus according to claim 1 or 2 , wherein

Each time width of the plurality of sound periods is one of a predetermined number of time widths,
Transmission device according to any one of claims 1 to 3, characterized in that.

Obtaining transmission data; and
An encoding step of outputting output audio data indicating audio in a non-audible frequency band, which is output audio data in PCM format obtained by encoding the transmission data;
Converting the output audio data to a DA converter to output a sound corresponding to the output audio data to a speaker;
Including
In the encoding step, the output voice data indicates a voice having a plurality of voiced sections and a silent section between adjacent voiced sections according to the transmission data, and the amplitude of each voiced section starts. Output the output audio data so that it shows a sine curve that increases from time to time and then decreases ,
A transmission method characterized by the above.

Means for obtaining transmission data;
PCM format output audio data obtained by encoding the transmission data, and encoding means for outputting output audio data indicating audio in an inaudible frequency band; and
An output means for outputting a sound corresponding to the output sound data to a speaker by converting the output sound data to a DA converter;
Function as a computer
It said encoding means indicates a voice and a silent section between the sound period to the output audio data is adjacent to the plurality of voiced section in accordance with the transmission data, and amplitude start of each chromatic sounds interval Output the output audio data so that it shows a sine curve that increases from time to time and then decreases ,
A program characterized by that.

A voice in a non-audible frequency band that is input to a microphone and encoded with transmission data, and has a plurality of sound sections and a silent section between adjacent sound sections according to the transmission data; and Input means for acquiring, via an AD converter, PCM format input voice data indicating a voice indicating a sine curve that increases after the amplitude of each voiced section increases from the start and reaches a maximum ;
Extracting means for generating extracted voice data obtained by extracting voice in a non-audible frequency band from the voice indicated by the input voice data;
Decoding means for decoding the transmission data based on the extracted voice data;
A receiving apparatus comprising:

The decoding means decodes the transmission data based on respective lengths of a plurality of sound segments indicated by the extracted voice data and sequentially input to the microphone.
The receiving apparatus according to claim 7 .

A voice in a non-audible frequency band that is input to a microphone and encoded with transmission data, and has a plurality of sound sections and a silent section between adjacent sound sections according to the transmission data; and An input step of acquiring input voice data in PCM format via an AD converter based on voice indicating a sine curve that increases after the amplitude of each voiced section increases from the start and reaches a maximum ;
An extraction step of generating extracted voice data obtained by extracting voice in a non-audible frequency band from the voice indicated by the input voice data;
A decoding step of decoding the transmission data based on the extracted voice data;
A receiving method comprising:

A voice in a non-audible frequency band that is input to a microphone and encoded with transmission data, and has a plurality of sound sections and a silent section between adjacent sound sections according to the transmission data; and Input means for acquiring, via an AD converter, PCM format input voice data indicating a voice indicating a sine curve that increases after the amplitude of each voiced section increases from the start and reaches a maximum ;
Extraction means for generating extracted voice data obtained by extracting voice in a non-audible frequency band from the voice indicated by the input voice data; and
Decoding means for decoding the transmission data based on the extracted voice data;
As a program to make the computer function as.

A communication system including a transmission device and a reception device,
The transmitter is
Means for obtaining transmission data;
Encoding means for outputting output sound data indicating sound in a non-audible frequency band, which is output sound data in PCM format obtained by encoding the transmission data;
An output means for outputting a sound corresponding to the output sound data to a speaker by converting the output sound data to a DA converter;
Including
It said encoding means indicates a voice and a silent section between the sound period to the output audio data is adjacent to the plurality of voiced section in accordance with the transmission data, and amplitude start of each chromatic sounds interval Output the output audio data so that it shows a sine curve that increases from time to time and then decreases ,
The receiving device is:
Input means for acquiring, through an AD converter, PCM-format input sound data indicating sound output from the speaker and input to a microphone;
Extracting means for generating extracted voice data obtained by extracting voice in a non-audible frequency band from the voice indicated by the input voice data;
Decoding means for decoding the transmission data based on the extracted voice data;
including,
A communication system characterized by the above.

Obtaining transmission data; and
An encoding step of outputting output audio data indicating audio in a non-audible frequency band, which is output audio data in PCM format obtained by encoding the transmission data;
Converting the output audio data to a DA converter to output a sound corresponding to the output audio data to a speaker;
An input step of acquiring, through an AD converter, input audio data in PCM format indicating the audio output from the speaker and input to the microphone;
An extraction step of generating extracted voice data obtained by extracting voice in a non-audible frequency band from the voice indicated by the input voice data;
A decoding step of decoding the transmission data based on the extracted voice data;
Including
In the encoding step, the output voice data indicates a voice having a plurality of voiced sections and a silent section between adjacent voiced sections according to the transmission data, and the amplitude of each voiced section starts. Output the output audio data so that it shows a sine curve that increases from time to time and then decreases ,
A communication method characterized by the above.