JP4543586B2

JP4543586B2 - Audio reproduction apparatus and audio reproduction method

Info

Publication number: JP4543586B2
Application number: JP2001191774A
Authority: JP
Inventors: 宏平浅田; 和史吉田
Original assignee: Sony Corp
Current assignee: Sony Corp
Priority date: 2001-06-25
Filing date: 2001-06-25
Publication date: 2010-09-15
Anticipated expiration: 2021-06-25
Also published as: JP2003009298A

Description

【０００１】
【発明の属する技術分野】
本発明は、マルチチャンネルの音声信号を同期再生する音声再生装置及び音声再生方法に関する。
【０００２】
【従来の技術】
近年、オーディオ機器から再生される音声信号を臨場感を高めて聴取可能とするために、マルチチャンネルの音声信号を再生する音声再生システムの構築が考えられている。このような音声再生システムとしては、例えば、主に家庭室内や車両内等の空間における聴取を目的として、ＤＶＤ（Digital Versatile Disc）等の記録媒体に記録されたマルチチャンネルの音声信号を再生するものがあり、聴取者の右前方、正面前方、左前方、右後方及び左後方に設置された５つのスピーカのそれぞれから出力されるべき５チャンネルと、音声信号の低域成分のみが記録されたサブチャンネルとを有する方式、すなわち、いわゆるドルビーＡＣ−３（商標）５．１チャンネル方式と呼ばれるものが実用化されている。この５．１チャンネル方式による音声再生システムにおいては、実際に聴取者の右前方、正面前方、左前方、右後方及び左後方に設置された５つのスピーカと低域再生用のスピーカとを用いて再生することにより、臨場感のある音響再生を行うことができる。
【０００３】
【発明が解決しようとする課題】
ところで、本件出願人は、マルチスピーカによる非常に精密な指向性、音場及び音像等の制御を実現するために、上述したような５．１チャンネル方式といったように、少数のスピーカから音声を再生可能とする音声再生システムではなく、より大規模の音声再生システムの構築を進めている。
【０００４】
このような大規模な音声再生システムの構築を考えた場合、全く新規のハードウェアを設計することは勿論可能であるが、既存の汎用ハードウェアを用いて構築するのが望ましい。これは、パーソナルコンピュータに搭載可能であるいわゆるサウンドカード等の汎用ハードウェアに必要とされる機能を有する既存の安価な半導体が多く存在しており、音声再生システムを構築する上で全く新規の音声処理機構を開発するのに比べ、設計工数の効率化を計ることができ、安価に部品を調達でき、容易に設計可能であること等によるものである。
【０００５】
ここで、このようなパーソナルコンピュータ等を含むハードウェアは、物理制約上又はシステム制約上、チャンネル数に制限がある場合が多い。そのため、音声再生システムとしては、１又は複数チャンネルの音声信号を出力可能なパーソナルコンピュータ等を含むハードウェアをネットワーク等を介して複数接続し、所定の局所的制御のもとに、これら複数のハードウェアから音声信号を分散して再生させることが考えられる。この際、音声再生システムにおいては、個々のハードウェアから出力される音声信号を例えばサンプル単位で完全に同期させて再生することが必要となる。
【０００６】
したがって、音声再生システムにおいては、同期再生を実現するために、いわゆるリアルタイムＯＳ（Operating System）を必要としたり、外部信号との時間同期を正確に行いつつ通信を行うことができる特殊なハードウェアを必要とし、システム全体や通信手段の複雑化を招く要因となっていた。
【０００７】
また、音声再生システムにおいては、同期再生を実現するために、従来から映像等の同期手法として行われているものであり、各ハードウェアを接続する汎用ネットワークによるパケット通信によってハードウェア間のクロックを合わせることも可能である。
【０００８】
しかしながら、音声再生システムにおいては、ネットワーク上のトラフィックの影響に起因して、コマンドが各ハードウェアへと到達する時間にばらつきが生じる。また、音声再生システムにおいては、様々なハードウェアを相互に接続することから、これら各ハードウェアの相違によって処理時間も多様となり、この処理時間を予測することも困難である。さらに、音声再生システムにおいては、サウンドカード自体のクロックの相違もある。このため、音声再生システムにおいては、通信セッション毎の各ハードウェア間での遅延時間を特定することは困難であり、統計的手法に頼らざるを得ず、非常に不安定なものであった。したがって、このような手法は、音声のように、例えば１／４４１００秒単位でサンプルレベルの同期が必要なものには不適であった。
【０００９】
本発明は、このような実情に鑑みてなされたものであり、汎用の複数のハードウェアを用いて、マルチチャンネルの音声信号を精確に同期再生することができる音声再生装置及び音声再生方法を提供することを目的とするものである。
【００１０】
【課題を解決するための手段】
上述した目的を達成する本発明にかかる音声再生装置は、複数チャンネルの音声信号を同期再生する音声再生装置であって、所定の基準信号を生成する基準信号生成手段と、基準信号に同期して複数チャンネルの音声信号を出力処理可能な音声出力処理器を介して、音声信号を再生して出力する互いに同期が確保されていない複数の音声出力手段と、基準信号に同期して少なくとも音声出力手段の数以上の音声信号を入力処理可能な音声入力処理器を介して、音声出力手段のそれぞれから少なくとも１チャンネル分だけ供給される信号であって音声出力手段のそれぞれにおける音声信号の再生タイミングを与えるための予行信号の入来を監視する予行信号監視手段とを備え、予行信号監視手段は、音声出力手段のそれぞれから供給された複数の予行信号の入力時間についての相対的なずれ時間を求め、ずれ時間を示す時間情報を、音声出力手段のそれぞれに対して通知し、音声出力手段は、それぞれ、時間情報に基づいて、音声信号の再生を開始すべきタイミングを決定することを特徴としている。
【００１１】
このような本発明にかかる音声再生装置は、音声出力手段のそれぞれに対応する複数の予行信号の入力時間についての相対的なずれ時間を示す時間情報に基づいて、音声信号の再生を開始すべきタイミングを決定する。
【００１２】
また、上述した目的を達成する本発明にかかる音声再生方法は、複数チャンネルの音声信号を同期再生する音声再生方法であって、所定の基準信号に同期して複数チャンネルの音声信号を出力処理可能な音声出力処理器を介して、音声信号を再生して出力する互いに同期が確保されていない複数の音声出力手段のそれぞれから少なくとも１チャンネル分だけ供給される信号であって、音声出力手段のそれぞれにおける音声信号の再生タイミングを与えるための予行信号を、基準信号に同期して少なくとも音声出力手段の数以上の音声信号を入力処理可能な音声入力処理器を介して予行信号監視手段に対して入力し、音声出力手段のそれぞれから供給された複数の予行信号の入力時間についての相対的なずれ時間を求めるずれ時間算出工程と、ずれ時間を示す時間情報に基づいて、音声信号の再生を開始すべきタイミングを決定する再生開始タイミング決定工程とを備えることを特徴としている。
【００１３】
このような本発明にかかる音声再生方法は、音声出力手段のそれぞれに対応する複数の予行信号の入力時間についての相対的なずれ時間を示す時間情報に基づいて、音声信号の再生を開始すべきタイミングが決定される。
【００１４】
【発明の実施の形態】
以下、本発明を適用した具体的な実施の形態について図面を参照しながら詳細に説明する。
【００１５】
この実施の形態は、パーソナルコンピュータ等を含む複数のハードウェアを用いて、少なくとも２チャンネル以上のマルチチャンネルの音声信号を同期して再生することが可能な音声再生装置である。この音声再生装置は、マルチタスク可能であり、且つ、必ずしもリアルタイム性処理に特化していない一般的なハードウェア又はＯＳ（Operating System）若しくはＯＳに相当するシステムを用いるとともに、既存のサウンドカードやインターフェース、これに準じるハードウェア等の汎用的な音声処理手段及び通信手段を用いて、複数のハードウェアにまたがって存在する音声信号のマルチチャンネルの同期再生を可能とするものである。
【００１６】
なお、以下では、ハードウェアとは、各種プログラムを記憶するメモリ及び各種プログラムを実行するＣＰＵ（Central Processing Unit）又はＤＳＰ（Digital Signal Processor）等によって構成されたものとして説明する。また、以下では、ハードウェアに存在する音声信号とは、特に断りがない限り、例えばハードディスク等の記憶手段上に音声ファイルとして存在しているもの、及び、各ハードウェア上で信号処理されるか否かを問わず逐次的にメモリ上に存在する音声ストリームのいずれをも指し示すものとして説明する。さらに、以下では、各機能単位を示すモジュールを“サイト”と称するものとする。このサイトは、必ずしもハードウェア等の単位と一致するものではない。すなわち、音声再生装置は、複数のサイトが１つのハードウェアに実装されてもよく、複数のハードウェアに分散されて実装されてもよく、その構成は任意性があるものとする。
【００１７】
まず、本発明の第１の実施の形態について説明する。この第１の実施の形態として図１に示す音声再生装置１０は、各ハードウェア上に音声信号が音声ファイルとして分散して存在しており、これらの音声ファイルを用いて５６チャンネルの音声信号の同期再生を行うものである。
【００１８】
音声再生装置１０は、同図に示すように、各部を制御する制御手段であるコントロールサイト１１と、後述する予行信号であるトリガ信号ＴＧ_１，ＴＧ_２，・・・，ＴＧ_８を監視する予行信号監視手段であるトリガ監視サイト１２と、音声信号を再生して出力する音声出力手段である８つの音声出力サイト２０_１，２０_２，・・・，２０_８（以下、音声出力サイト２０_ｎ（ｎ＝１，２，・・・，８；サイト番号）と総称する。）と、所定の基準信号としてワード単位でクロックを生成する基準信号生成手段であるワードクロック生成器１４と、音声出力サイト２０_ｎから出力された音声信号に対してＤ／Ａ（Digital/Analog）変換を施すＤ／Ａコンバータ及びアナログ化された音声信号を増幅するアンプ等からなる５６個の出力器１５_１１，１５_１２，１５_１３，１５_１４，１５_１５，１５_１６，１５_１７，１５_２１，１５_２２，・・・，１５_８４，１５_８５，１５_８６，１５_８７（以下、出力器１５_ｎｍ（ｎ＝１，２，・・・，８、ｍ＝１，２，・・・，７；チャンネル番号）と総称する。）と、これらの出力器１５_ｎｍのそれぞれから出力された音声信号を外部へと放音する５６個のスピーカ１６_１１，１６_１２，１６_１３，１６_１４，１６_１５，１６_１６，１６_１７，１６_２１，１６_２２，・・・，１６_８４，１６_８５，１６_８６，１６_８７（以下、スピーカ１６_ｎｍ（ｎ＝１，２，・・・，８、ｍ＝１，２，・・・，７）と総称する。）とを備える。
【００１９】
コントロールサイト１１は、音声再生装置１０の統括制御を行う機能を有するとともに、音声再生装置１０とユーザとの間のインターフェースに関する機能を有するものである。コントロールサイト１１は、例えばＴＣＰ／ＩＰ（Transmission Control Protocol/Internet Protocol）やＵＤＰ（User Datagram Protocol）等の所定のプロトコルにしたがって外部との通信を行うことが可能であり、ネットワークＮＴを介してトリガ監視サイト１２及び音声出力サイト２０_ｎとの間で信号の送受信を行うことが可能とされる。音声再生装置１０は、このような一般的に周知であり多く使用されるネットワークリソースや汎用の通信プロトコルを用いることにより、特別なハードウェアを用意したり、新たに構築する必要がない。コントロールサイト１１は、コントロールプログラムＣｔを実行することが可能な環境とされており、所定の操作部や表示部等からなるユーザインターフェースＵＩを介してユーザによって再生コマンドが指示されると、コントロールプログラムＣｔの実行制御のもとに、音声ファイルの再生指示を示す制御情報をネットワークＮＴを介して音声出力サイト２０_ｎに対して通知する。
【００２０】
トリガ監視サイト１２は、少なくとも音声出力サイト２０_ｎの数以上、すなわち、８チャンネル以上の後述するトリガ信号としての音声信号を入力処理可能なサウンドカード等のサウンド処理部１３を有するパーソナルコンピュータ等を含むハードウェアの機能を有するものである。トリガ監視サイト１２は、コントロールサイト１１と同様に、ＴＣＰ／ＩＰやＵＤＰ等の所定のプロトコルにしたがってネットワークＮＴを介してコントロールサイト１１及び音声出力サイト２０_ｎとの間で信号の送受信を行うことが可能とされる。また、トリガ監視サイト１２は、サウンド処理部１３を介して音声出力サイト２０_ｎから出力される１チャンネル分の音声信号であって音声ファイルの再生タイミングを与えるための信号であるトリガ信号ＴＧ_１，ＴＧ_２，・・・，ＴＧ_８（以下、トリガ信号ＴＧ_ｎ（ｎ＝１，２，・・・，８）と総称する。）を受信することが可能とされる。このとき、トリガ監視サイト１２は、サウンド処理部１３における第１チャンネルが音声出力サイト２０_１と接続され、第２チャンネルが音声出力サイト２０_２と接続されるといったように、各チャンネルが音声出力サイト２０_ｎのそれぞれと対応させて接続される。トリガ監視サイト１２は、監視プログラムＳｖを実行することが可能な環境とされており、ワードクロック生成器１４から供給されるワードクロックＷＣＬＫ又はワード単位で各サンプルを同期せしめるのに十分なトリガ外部信号に同期して、サウンド処理部１３における処理を行う。
【００２１】
トリガ監視サイト１２は、コントロールサイト１１から全ての音声出力サイト２０_ｎに対して音声ファイルの再生指示を示す制御情報が出力されると、監視プログラムＳｖの実行制御のもとに、音声出力サイト２０_ｎのそれぞれから出力されるトリガ信号ＴＧ_ｎの入来を監視する。トリガ監視サイト１２は、トリガ信号ＴＧ_ｎを受け取ると、これらのトリガ信号ＴＧ_ｎに基づいて、音声出力サイト２０_ｎのそれぞれに対応する後述する遅れサンプル数を求め、これらの遅れサンプル数を示す遅れサンプル数情報をネットワークＮＴを介して音声出力サイト２０_ｎに対してそれぞれ出力する。ここで、遅れサンプル数とは、トリガ監視サイト１２に対するトリガ信号ＴＧ_ｎの入力時間についての相対的なずれ時間をワードクロックＷＣＬＫ単位で求めたものである。
【００２２】
なお、トリガ監視サイト１２が取り扱う音声信号の種別としては、ディジタルデータが望ましいが、サウンド処理部１３において、ワードクロックＷＣＬＫやトリガ信号ＴＧ_ｎを正確に感知でき且つ送受信可能であるならば、アナログデータでもよい。また、トリガ監視サイト１２においては、ディジタルデータを取り扱う場合であっても、例えばサンプリングレートや圧縮又は非圧縮の別といったデータフォーマットはいかなるものであってもよい。
【００２３】
音声出力サイト２０_ｎは、それぞれ、例えば８チャンネルの音声信号を出力処理可能なサウンドカード等のサウンド処理部２１_１，２１_２，・・・，２１_８（以下、サウンド処理部２１_ｎ（ｎ＝１，２，・・・，８）と総称する。）を有するパーソナルコンピュータ等を含むハードウェアの機能を有するものである。音声出力サイト２０_ｎは、それぞれ、コントロールサイト１１及びトリガ監視サイト１２と同様に、ＴＣＰ／ＩＰやＵＤＰ等の所定のプロトコルにしたがってネットワークＮＴを介してコントロールサイト１１及びトリガ監視サイト１２との間で信号の送受信を行うことが可能とされる。また、音声出力サイト２０_ｎは、それぞれ、サウンド処理部２１_ｎにおける例えば第８チャンネルのみをトリガ監視サイト１２と接続しており、このチャンネルを用いてトリガ監視サイト１２に対して１チャンネル分の音声信号であるトリガ信号ＴＧ_ｎを出力することが可能とされる。音声出力サイト２０_ｎは、それぞれ、再生処理プログラムＡ_１，Ａ_２，・・・，Ａ_８（以下、再生処理プログラムＡ_ｎ（ｎ＝１，２，・・・，８）と総称する。）を実行することが可能な環境とされており、ワードクロック生成器１４から供給されるワードクロックＷＣＬＫ又はワード単位で各サンプルを同期せしめるのに十分なトリガ外部信号に同期して、サウンド処理部２１_ｎにおける処理を行う。このとき、音声出力サイト２０_ｎにおいては、それぞれ、サウンド処理部２１_ｎにおける第１チャンネル乃至第８チャンネルの間の同期は確保されているものとする。すなわち、音声出力サイト２０_ｎにおいては、それぞれ、後述するように、コントロールサイト１１から再生指示を受け取った場合には、サウンド処理部２１_ｎにおける第１チャンネル乃至第８チャンネルから同時に音声信号を出力する。また、音声出力サイト２０_ｎは、それぞれ、図示しない記憶手段に、音声ファイルＡＦ_１，ＡＦ_２，・・・，ＡＦ_８（以下、音声ファイルＡＦ_ｎ（ｎ＝１，２，・・・，８）と総称する。）を記憶している。
【００２４】
音声出力サイト２０_ｎは、それぞれ、コントロールサイト１１からネットワークＮＴを介して供給される音声ファイルＡＦ_ｎの再生指示を示す制御情報を受け取ると、再生処理プログラムＡ_ｎの実行制御のもとに、サウンド処理部２１_ｎを介してトリガ信号ＴＧ_ｎをトリガ監視サイト１２に対して出力する。なお、このトリガ信号ＴＧ_ｎは、トリガ監視サイト１２によってトリガとして認識できる所定の波形を有する音声信号であり、例えばインパルス信号等が望ましい。また、音声出力サイト２０_ｎは、それぞれ、トリガ監視サイト１２からネットワークＮＴを介して遅れサンプル数情報を受け取ると、この遅れサンプル数と後述する再生マージンサンプル数とに基づいて、音声ファイルＡＦ_ｎの再生を開始すべきタイミングを決定し、このタイミングで音声ファイルＡＦ_ｎの再生を開始し、出力器１５_ｎｍ及びスピーカ１６_ｎｍを介して外部へと放音させる。なお、音声出力サイト２０_ｎは、それぞれ、サウンド処理部２１_１における第１チャンネル乃至第７チャンネルがそれぞれ出力器１５_１ｍと接続されるといったように、各チャンネルから出力される音声信号を、対応する出力器１５_ｎｍに対して出力する。
【００２５】
なお、各音声出力サイト２０_ｎが取り扱う音声信号の種別としては、トリガ監視サイト１２と同様に、ディジタルデータが望ましいが、サウンド処理部２１_ｎにおいて、ワードクロックＷＣＬＫやトリガ信号ＴＧ_ｎを正確に感知でき且つ送受信可能であるならば、アナログデータでもよい。また、音声出力サイト２０_ｎにおいては、それぞれ、ディジタルデータを取り扱う場合であっても、例えばサンプリングレートや圧縮又は非圧縮の別といったデータフォーマットはいかなるものであってもよい。
【００２６】
ワードクロック生成器１４は、ワード単位で正確なクロックを生成する。ワードクロック生成器１４は、生成したワードクロックＷＣＬＫを、トリガ監視サイト１２におけるサウンド処理部１３及び音声出力サイト２０_ｎにおけるサウンド処理部２１_ｎに供給する。なお、ワードクロック生成器１４は、ワード単位で各サンプルを同期せしめるのに十分なトリガ外部信号を生成するようにしてもよい。
【００２７】
出力器１５_ｎｍは、それぞれ、音声出力サイト２０_ｎから出力された音声信号に対してＤ／Ａ変換を施した後増幅する。出力器１５_ｎｍは、それぞれ、増幅した音声信号をスピーカ１６_ｎｍに対して出力する。
【００２８】
スピーカ１６_ｎｍは、それぞれ、出力器１５_ｎｍのそれぞれから出力された電気的な音声信号を音響エネルギへと変換し、外部へと放音する。
【００２９】
このような各部を備える音声再生装置１０においては、音声出力サイト２０_ｎのそれぞれに記憶されている音声ファイルＡＦ_ｎの再生指示を示す制御情報がコントロールサイト１１によって与えられると、音声出力サイト２０_ｎのそれぞれから、トリガ監視サイト１２に対してトリガ信号ＴＧ_ｎが出力される。音声再生装置１０においては、トリガ監視サイト１２により、トリガ信号ＴＧ_ｎに基づいて、音声出力サイト２０_ｎのそれぞれに対応する遅れサンプル数を求める。そして、音声再生装置１０においては、各音声出力サイト２０_ｎにより、この遅れサンプル数と再生マージンサンプル数とに基づいて、自己が再生を開始すべきタイミングを求め、このタイミングに到達した時間に音声ファイルＡＦ_ｎの再生を開始することができる。音声再生装置１０においては、このようにして、全ての音声出力サイト２０_ｎによる音声ファイルＡＦ_ｎの再生開始のタイミングを一致させることができる。そして、音声再生装置１０においては、それ以後、ワードクロック生成器１４から供給されるワードクロックＷＣＬＫに基づいて音声出力サイト２０_ｎによって再生を進行することにより、マルチチャンネルの同期再生を行うことができる。
【００３０】
なお、音声再生装置１０においては、例えば、音声出力サイト２０_１がコントロールサイト１１及び／又はトリガ監視サイト１２の機能を兼ね備えていたり、他の音声出力サイト２０_２の機能を兼ね備るといったように、複数のサイトが１つのハードウェアに実装されたり、複数のハードウェアに分散されて実装されることが可能であることは、上述したとおりである。
【００３１】
このような音声再生装置１０においては、音声出力サイト２０_ｎは、それぞれ、再生処理プログラムＡ_ｎを実行し、図２に示す一連の工程を経ることによって音声ファイルＡＦ_ｎの再生を開始する。
【００３２】
まず、音声出力サイト２０_ｎには、それぞれ、同図に示すように、ステップＳ１において、再生マージンサンプル数Ｓｍが設定される。この再生マージンサンプル数Ｓｍは、音声出力サイト２０_ｎの間において共通の値である。具体的には、再生マージンサンプル数Ｓｍは、後述する基準サンプルから音声ファイルＡＦ_ｎの再生開始時のサンプルまでの時間を表す値である。再生マージンサンプル数Ｓｍは、例えば後述する各種処理が終了するのに十分な時間を見込んで算出される値であり、ユーザによって設定されてもよく、自動的に設定されてもよい。音声出力サイト２０_ｎは、それぞれ、設定された再生マージンサンプル数Ｓｍを図示しないメモリ等に記憶する。
【００３３】
続いて、音声出力サイト２０_ｎは、それぞれ、ステップＳ２において、コントロールサイト１１からネットワークＮＴを介して音声ファイルＡＦ_ｎの再生指示を示す制御情報が到達したか否かを判別する。なお、音声出力サイト２０_ｎには、コントロールサイト１１からの再生指示がネットワークＮＴを介して与えられることから、制御情報が到達する絶対的な時刻は、ネットワークトラフィックの影響等に起因して音声出力サイト２０_ｎの間でばらつきが生じる。音声出力サイト２０_ｎは、それぞれ、コントロールサイト１１からの制御情報が到達するまで待機し、制御情報を受け取った時点で、ステップＳ３へと処理を移行する。
【００３４】
音声出力サイト２０_ｎは、それぞれ、ステップＳ３において、サウンド処理部２１_ｎにおける例えば第８チャンネル（ｍ＝８）、すなわち、トリガ監視サイト１２と接続しているトリガチャンネルに、例えばインパルス信号等の音声信号であるトリガ信号ＴＧ_ｎを再生する。なお、音声出力サイト２０_ｎのそれぞれからトリガ信号ＴＧ_ｎが再生される絶対的な時刻は、音声出力サイト２０_ｎが、それぞれ、異なる種類のサウンド処理部２１_ｎを搭載している場合があること、同一種類のサウンド処理部２１_ｎを搭載していても処理能力には若干のばらつきがあること、マルチタスクで動作している他のタスクの動作状況等に起因して予測不可能である。
【００３５】
これと同時に、音声出力サイト２０_ｎは、それぞれ、ステップＳ４において、トリガ信号ＴＧ_ｎを再生してトリガ監視サイト１２に対して出力した時点のサンプルＳ_ｎを基準として、再生サンプル数のカウントを開始する。音声出力サイト２０_ｎにおいては、それぞれ、このサンプルＳ_ｎを基準として、音声ファイルＡＦ_ｎの再生開始のタイミングが決定されることになる。
【００３６】
さらにこれと同時に、音声出力サイト２０_ｎは、それぞれ、ステップＳ５において、トリガチャンネル以外のチャンネル、すなわち、第１チャンネル乃至第７チャンネル（ｍ＝１〜７）に“０”を出力する。すなわち、音声出力サイト２０_ｎは、それぞれ、ステップＳ３において、第８チャンネルにトリガ信号ＴＧ_ｎを再生するのに対応して、第１チャンネル乃至第７チャンネルをミュートする。
【００３７】
そして、音声出力サイト２０_ｎは、それぞれ、ステップＳ６において、再生サンプル数のカウントを継続し、ステップＳ７において、トリガ監視サイト１２から対応する遅れサンプル数Ｓｄ_ｎ（ｎ＝１，２，・・・，８）を示す遅れサンプル数情報がネットワークＮＴを介して到達したか否かを判別する。なお、音声出力サイト２０_ｎには、トリガ監視サイト１２からの遅れサンプル数情報がネットワークＮＴを介して与えられることから、遅れサンプル数情報が到達する絶対的な時刻は、ネットワークトラフィックの影響等に起因して音声出力サイト２０_ｎの間でばらつきが生じる。
【００３８】
ここで、遅れサンプル数情報が到達していない場合には、音声出力サイト２０_ｎは、それぞれ、ステップＳ５乃至ステップＳ６の処理を繰り返す。
【００３９】
一方、遅れサンプル数情報が到達した場合には、音声出力サイト２０_ｎは、それぞれ、ステップＳ８において、“（再生マージンサンプル数Ｓｍ）−（遅れサンプル数Ｓｄ_ｎ）”の値を算出し、トリガ監視サイト１２に対してトリガ信号ＴＧ_ｎを出力した時点のサンプルからカウントして、この算出した値が経過したサンプルに到達した時点から、音声ファイルＡＦ_ｎの再生を開始し、一連の処理を終了する。
【００４０】
このようにして、音声再生装置１０においては、音声出力サイト２０_ｎのそれぞれによって再生処理プログラムＡ_ｎを実行し、音声ファイルＡＦ_ｎの再生を同時に開始することができる。なお、音声再生装置１０においては、それ以後、音声出力サイト２０_ｎが、それぞれ、ワードクロック生成器１４から供給されるワードクロックＷＣＬＫに基づいて音声ファイルＡＦ_ｎの再生を進行することにより、マルチチャンネルの同期再生が可能となる。
【００４１】
一方、音声再生装置１０においては、トリガ監視サイト１２は、監視プログラムＳｖを実行し、図３に示す一連の工程を経ることによって音声出力サイト２０_ｎのそれぞれに対応する遅れサンプル数Ｓｄ_１，Ｓｄ_２，・・・，Ｓｄ_８（以下、遅れサンプル数Ｓｄ_ｎ（ｎ＝１，２，・・・，８）と総称する。）を求める。
【００４２】
まず、トリガ監視サイト１２は、同図に示すように、ステップＳ１１において、音声出力サイト２０_ｎのそれぞれからサウンド処理部１３における各チャンネルに出力されるトリガ信号ＴＧ_ｎの入来を監視する。
【００４３】
続いて、トリガ監視サイト１２は、ステップＳ１２において、サウンド処理部１３における８つのチャンネルのうちのいずれかのチャンネルに、トリガ信号ＴＧ_ｎのいずれかが供給されたか否かを判別する。
【００４４】
ここで、トリガ信号ＴＧ_ｎのいずれかが供給されていない場合には、トリガ監視サイト１２は、ステップＳ１１へと処理を移行し、トリガ信号ＴＧ_ｎのいずれかが供給されるまで監視し続ける。
【００４５】
一方、トリガ信号ＴＧ_ｎのいずれかが供給された場合には、トリガ監視サイト１２は、ステップＳ１３において、供給されたトリガ信号ＴＧ_ｎが最初に供給されたものであるか否か、すなわち、初めてトリガ信号ＴＧ_ｎが供給されたのか否かを判別する。
【００４６】
ここで、トリガ信号ＴＧ_ｎが最初に供給されたものでない場合には、トリガ監視サイト１２は、ステップＳ１５へと処理を移行する。一方、トリガ信号ＴＧ_ｎが最初に供給されたものであった場合には、トリガ監視サイト１２は、ステップＳ１４において、この最初に検出したトリガ信号ＴＧ_ｎが入力された時点のサンプルを基準サンプルとして、入力サンプル数のカウントを開始し、ステップＳ１５へと処理を移行する。トリガ監視サイト１２においては、この基準サンプルに基づいて、入力サンプル数をカウントすることによって他のトリガ信号ＴＧ_ｎの入力時刻を計測することになる。
【００４７】
そして、トリガ監視サイト１２は、ステップＳ１５において、ステップＳ１４にて基準サンプルを基点としてカウントを開始しているカウント値を遅れサンプル数Ｓｄ_ｎとし、トリガ信号ＴＧ_ｎを出力した対応する音声出力サイト２０_ｎに対して、この遅れサンプル数Ｓｄ_ｎを示す遅れサンプル数情報をネットワークＮＴを介して出力する。なお、トリガ監視サイト１２は、最初にトリガ信号ＴＧ_ｎを出力した音声出力サイト２０_ｎに対しては、遅れサンプル数Ｓｄ_ｎ＝０を通知することはいうまでもない。
【００４８】
続いて、トリガ監視サイト１２は、ステップＳ１６において、入力サンプル数のカウントを継続し、ステップＳ１７において、全ての音声出力サイト２０_ｎからトリガ信号ＴＧ_ｎを受信したか否かを判別する。
【００４９】
ここで、全ての音声出力サイト２０_ｎからトリガ信号ＴＧ_ｎを受信していない場合には、トリガ監視サイト１２は、ステップＳ１１乃至ステップＳ１６の処理を繰り返す。
【００５０】
一方、全ての音声出力サイト２０_ｎからトリガ信号ＴＧ_ｎを受信した場合には、トリガ監視サイト１２は、そのまま一連の処理を終了する。
【００５１】
なお、音声出力サイト２０_ｎのいずれかが故障等によってトリガ信号ＴＧ_ｎをトリガ監視サイト１２に与えない場合があり得るが、この場合には、トリガ監視サイト１２は、ステップＳ１７からステップＳ１１へと移行するループ処理を適宜抜けるようにしてもよい。
【００５２】
このようにして、音声再生装置１０においては、トリガ監視サイト１２によって監視プログラムＳｖを実行し、音声出力サイト２０_ｎのそれぞれに対応する遅れサンプル数Ｓｄ_ｎを求めることができ、音声出力サイト２０_ｎによる同期再生を実現させることが可能となる。
【００５３】
このようなプログラムを実行するトリガ監視サイト１２及び音声出力サイト２０_ｎを備える音声再生装置１０においては、例えば図４に示すタイミングにしたがって各動作が協調して行われる。
【００５４】
すなわち、音声再生装置１０においては、ワードクロック生成器１４によって生成されるワードクロックＷＣＬＫに基づいて、トリガ監視サイト１２及び音声出力サイト２０_ｎが動作する。音声再生装置１０においては、同図上段に示すように、ユーザインターフェースＵＩを介して再生コマンドが指示されると、この再生指示を示す制御情報がコントロールサイト１１から音声出力サイト２０_ｎのそれぞれにネットワークＮＴを介して通知される。この制御情報が到達する絶対的な時刻は、上述したように、ネットワークトラフィックの影響等に起因して、音声出力サイト２０_ｎの間でばらつきが生じる。ここでは、同図中段に示すように、音声出力サイト２０_１が最も早く制御情報を受信し、音声出力サイト２０_８，音声出力サイト２０_２の順序で制御情報を受信したものとする。
【００５５】
すると、音声再生装置１０においては、上述したように、音声出力サイト２０_ｎからトリガ監視サイト１２に対してトリガ信号ＴＧ_ｎが与えられる。この音声出力サイト２０_ｎのそれぞれからトリガ信号ＴＧ_ｎが再生される絶対的な時刻は、上述したように、音声出力サイト２０_ｎにおけるサウンド処理部２１_ｎの相違や図示しないＣＰＵ等の処理能力等に起因して、音声出力サイト２０_ｎの間でばらつきが生じる。ここでは、同図中段に示すように、音声出力サイト２０_１が最も早いサンプルＳ_１の時点でトリガ信号ＴＧ_１を再生し、以後、サンプルＳ_８，Ｓ_２の時点で音声出力サイト２０_８，音声出力サイト２０_２の順序でトリガ信号ＴＧ_８，ＴＧ_２を再生したものとする。音声出力サイト２０_ｎは、それぞれ、これらのトリガ信号ＴＧ_ｎが出力された時点のサンプルＳ_ｎを基準として、再生サンプル数のカウントを開始する。また、ここでは、最も早くトリガ信号ＴＧ_１が再生されたサンプルＳ_１を基準サンプルとしている。
【００５６】
これに対応して、音声再生装置１０においては、上述したように、トリガ監視サイト１２によって基準サンプルであるサンプルＳ_１から入力サンプル数のカウントを開始し、同図下段に示すように、遅れサンプル数Ｓｄ_ｎを算出する。音声再生装置１０においては、これらの遅れサンプル数Ｓｄ_ｎが対応する音声出力サイト２０_ｎに対して通知される。この遅れサンプル数Ｓｄ_ｎが到達する絶対的な時刻は、上述したように、ネットワークトラフィックの影響等に起因して、音声出力サイト２０_ｎの間でばらつきが生じる。ここでは、同図中段に示すように、音声出力サイト２０_１が最も早く遅れサンプル数情報を受信し、音声出力サイト２０_２，音声出力サイト２０_８の順序で遅れサンプル数情報を受信したものとする。
【００５７】
そして、音声再生装置１０においては、音声出力サイト２０_ｎのそれぞれにより、トリガ監視サイト１２に対してトリガ信号ＴＧ_ｎを出力した時点のサンプルＳ_ｎからカウントして、（再生マージンサンプル数Ｓｍ−遅れサンプル数Ｓｄ_ｎ）サンプル目から、音声ファイルＡＦ_ｎの再生が一斉に開始される。なお、再生マージンサンプル数Ｓｍは、少なくとも以下の工程、すなわち、再生指示を示す制御情報を受信した音声出力サイト２０_ｎがトリガ信号ＴＧ_ｎを出力する工程、トリガ監視サイト１２が全ての音声出力サイト２０_ｎからトリガ信号ＴＧ_ｎを受信する工程、トリガ監視サイト１２が音声出力サイト２０_ｎのそれぞれに対して遅れサンプル数Ｓｄ_ｎを通知する工程、及び、全ての音声出力サイト２０_ｎが遅れサンプル数Ｓｄ_ｎを受信し、音声ファイルＡＦ_ｎの再生動作を開始する工程を行うのに十分な時間を見込んで算出される値である。
【００５８】
音声再生装置１０においては、音声出力サイト２０_ｎ間における音声ファイルＡＦ_ｎの再生開始位置を一定にし、それ以後は、ワードクロック生成器１４によって生成されるワードクロックＷＣＬＫに基づいて同期を保ちながら音声ファイルＡＦ_ｎの再生を進行する。
【００５９】
以上のように、本発明の第１の実施の形態として示す音声再生装置１０は、各音声出力サイト２０_ｎ上に分散して存在している音声ファイルＡＦ_ｎを用いて、マルチチャンネルの同期再生を行うことができる。特に、音声再生装置１０は、互いに同期が確保されておらず、動作環境が多種多様にわたる音声出力サイト２０_ｎを用いた場合であっても、確実に同期再生を実現することができる。音声再生装置１０は、例えば、Ｍチャンネルの同時再生が可能なサウンド処理部を用いて、トリガ監視サイト１２及びｎ個の音声出力サイト２０_ｎを構成することにより、（Ｍ−１）×ｎチャンネルの同期再生を実現することが可能となる。
【００６０】
なお、音声再生装置１０においては、上述したコントロールサイト１１の役割としては、ユーザがユーザインターフェースＵＩを操作したタイミングで再生を行うための再生指示を示す制御情報を発信することである。しかしながら、ユーザがユーザインターフェースＵＩを操作して再生指示が出されたタイミングから再生開始までの時間は、最初のトリガ信号ＴＧ_ｎが再生されるまでの時間と、設定した再生マージンサンプル数Ｓｍとの和によって規定される。そして、これらの時間は、ネットワークトラフィックやハードウェアの処理負荷等の要因によって毎回ばらつきが生じるものである。したがって、音声再生装置１０においては、再生開始の絶対時刻を問題とするのではなく、あくまでも再生開始のタイミングを一致させることを目的とし、絶対時刻に関しては問題とならない場合を対象としていることに注意されたい。
【００６１】
また、上述した実施の形態では、コントロールサイト１１によって再生指示のみを与えるものとして説明したが、音声再生装置１０においては、例えば、「停止」、「一時停止」、必要に応じて「巻き戻し」及び「早送り」等に相当するコマンドに対応した指示を与えることもできる。例えば、音声再生装置１０においては、再生開始後も再生サンプルのカウントを継続し、各音声ファイルＡＦ_ｎにおけるどの時点で停止するか、又は、復帰再生するか、といった情報をコントロールサイト１１を介して音声出力サイト２０_ｎに与えたり、音声ファイルＡＦ_ｎの再生位置の指定を行うことにより、音声出力サイト２０_ｎによる再生処理プログラムＡ_ｎの実行制御のもとに統一のとれた動作を実現することができる。
【００６２】
さらに、音声再生装置１０においては、コントロールサイト１１は必ずしも備える必要はない。音声再生装置１０は、コントロールサイト１１を備える代わりに、音声出力サイト２０_ｎのそれぞれが有するタイマ等を利用して、予め設定された時刻に同期再生を開始する、といったことも実現可能である。ただし、この場合、音声再生装置１０においては、音声出力サイト２０_ｎのそれぞれが有するタイマのずれ時間を想定して再生マージンサンプル数Ｓｍの値を設定するか、再生マージンサンプル数Ｓｍの値以内に収まるように、各タイマをネットワークＮＴを介して調整する必要がある。
【００６３】
さらにまた、音声再生装置１０においては、各サイトとしてパーソナルコンピュータを用いたり、サウンド処理部としてサウンドカードを用いなくとも、当該機能を実現可能なハードウェアであればよい。
【００６４】
また、音声再生装置１０においては、ＴＣＰ／ＩＰやＵＤＰ等の所定のプロトコルにしたがってネットワークＮＴを介した通信を行うものとして説明したが、トリガ監視サイト１２から音声出力サイト２０_ｎに対して遅れサンプル数Ｓｄ_ｎを通知可能であるならば、いかなるネットワーク階層のフォーマットであってもよいことはいうまでもない。
【００６５】
さらに、上述した実施の形態では、再生マージンサンプル数Ｓｍとして固定値を用いるものとして説明したが、例えば、音声再生装置１０においては、トリガ監視サイト１２による監視プログラムＳｖの実行制御のもとに、各音声出力サイト２０_ｎの遅れサンプル数Ｓｄ_ｎを算出した後、通信及び音声信号出力に際する遅れを十分に見込んだ値を再生マージンサンプル数Ｓｍとして算出し、ネットワークＮＴを通じて各音声出力サイト２０_ｎに通知して設定するようにしてもよい。このとき、音声再生装置１０は、ネットワークＮＴを介した通信を行う際に生じる遅れを統計的に測定し、この統計情報を用いて再生マージンサンプル数Ｓｍを設定することもできる。
【００６６】
さらにまた、上述した実施の形態では、サウンド処理部をワードクロックＷＣＬＫに同期させるものとして説明したが、音声再生装置１０においては、ワードクロックに限らず、サウンド処理部において同期を確保するような機構であれば、例えば、２５６倍クロックや１／２ワード単位のクロック、又は、ＳＭＰＴＥ（Society of Motion Picture and Television Engineers）のような一般的なコードに対応するものであってもよい。
【００６７】
また、上述した実施の形態では、トリガ監視サイト１２において入力サンプル数のカウントが開始される基準サンプルとして、最も早くトリガ信号ＴＧ_ｎが再生された時点のサンプルＳ_ｎを採用するものとして説明したが、音声再生装置１０においては、トリガ監視サイト１２によって任意時刻での特定サンプルを規定することができ、相対的に遅れサンプル数Ｓｄ_１，Ｓｄ_２，・・・，Ｓｄ_８の時間関係が保たれた情報が各音声出力サイト２０_ｎに通知可能であるならば、基準サンプルはいかなる時間のものであってもよい。
【００６８】
さらに、上述した実施の形態では、音声信号が音声ファイルＡＦ_ｎとして存在するものとして説明したが、音声再生装置１０においては、逐次的にメモリ上に音声ストリームとして存在するものであってもよい。
【００６９】
つぎに、本発明の第２の実施の形態について説明する。この第２の実施の形態として図５に示す音声再生装置３０は、５６チャンネルの音声ファイルがまとまって存在しており、８チャンネルの音声信号を出力処理可能な８つの音声出力サイトを用いて、５６チャンネルの音声信号の同期再生を行うものである。したがって、上述した第１の実施の形態として示した音声再生装置１０と同様の機能を有する各部については同一符号を付し、詳細な説明を省略するものとする。
【００７０】
音声再生装置３０は、同図に示すように、上述したトリガ監視サイト１２と、ワードクロック生成器１４と、５６個の出力器１５_ｎｍと、５６個のスピーカ１６_ｎｍとの他、記憶している５６チャンネルの音声ファイルＡＦを配信する配信サイト３１と、音声信号を再生して出力する８つの音声出力サイト４０_１，４０_２，・・・，４０_８（以下、音声出力サイト４０_ｎ（ｎ＝１，２，・・・，８；サイト番号）と総称する。）とを備える。
【００７１】
配信サイト３１は、上述したコントロールサイト１１と同様に、音声再生装置３０の統括制御を行う機能を有するとともに、音声再生装置３０とユーザとの間のユーザインターフェースＵＩを有するのに加え、さらに、５６チャンネルの音声ファイルＡＦを図示しない記憶手段に記憶するものである。この配信サイト３１に記憶されている音声ファイルＡＦは、所定のフォーマットで圧縮されていてもよく、非圧縮であってもよい。配信サイト３１は、例えばＴＣＰ／ＩＰやＵＤＰ等の所定のプロトコルにしたがって外部との通信を行うことが可能であり、５６チャンネルの音声ファイルＡＦを伝送するのに十分な容量を有するネットワークＮＴを介してトリガ監視サイト１２及び音声出力サイト４０_ｎとの間で信号の送受信を行うことが可能とされる。配信サイト３１は、配信プログラムＤｓを実行することが可能な環境とされており、所定の操作部や表示部等からなるユーザインターフェースＵＩを介してユーザによって再生コマンドが指示されると、配信プログラムＤｓの実行制御のもとに、音声ファイルＡＦの再生指示を示す制御情報をネットワークＮＴを介して音声出力サイト４０_ｎに対して通知するとともに、５６チャンネルの音声ファイルＡＦをストリームデータとして、７チャンネル分ずつ音声出力サイト４０_ｎに対して配信する。なお、ネットワークＮＴを介して伝送される音声ストリームは、所定のフォーマットで圧縮されていてもよく、非圧縮であってもよい。
【００７２】
音声出力サイト４０_ｎは、それぞれ、上述した音声出力サイト２０_ｎと同様に、例えば８チャンネルの音声信号を出力処理可能なサウンドカード等のサウンド処理部４１_１，４１_２，・・・，４１_８（以下、サウンド処理部４１_ｎ（ｎ＝１，２，・・・，８）と総称する。）を有するパーソナルコンピュータ等を含むハードウェアの機能を有するものである。音声出力サイト４０_ｎは、それぞれ、配信サイト３１及びトリガ監視サイト１２と同様に、ＴＣＰ／ＩＰやＵＤＰ等の所定のプロトコルにしたがってネットワークＮＴを介して配信サイト３１及びトリガ監視サイト１２との間で信号の送受信を行うことが可能とされる。また、音声出力サイト４０_ｎは、それぞれ、サウンド処理部４１_ｎにおける例えば第８チャンネルのみをトリガ監視サイト１２と接続しており、このチャンネルを用いてトリガ監視サイト１２に対してトリガ信号ＴＧ_ｎを出力することが可能とされる。
音声出力サイト４０_ｎは、それぞれ、再生処理プログラムＢ_１，Ｂ_２，・・・，Ｂ_８（以下、再生処理プログラムＢ_ｎ（ｎ＝１，２，・・・，８）と総称する。
）を実行することが可能な環境とされており、ワードクロック生成器１４から供給されるワードクロックＷＣＬＫ又はワード単位で各サンプルを同期せしめるのに十分なトリガ外部信号に同期して、サウンド処理部４１_ｎにおける処理を行う。このとき、音声出力サイト４０_ｎにおいては、それぞれ、サウンド処理部４１_ｎにおける第１チャンネル乃至第８チャンネルの間の同期は確保されているものとする。すなわち、音声出力サイト４０_ｎにおいては、それぞれ、配信サイト３１から再生指示を受け取った場合には、サウンド処理部４１_ｎにおける第１チャンネル乃至第８チャンネルから同時に音声信号を出力する。また、音声出力サイト４０_ｎは、それぞれ、配信サイト３１から配信される音声ストリームを一時的に記憶する図示しないバッファを有しており、配信サイト３１から配信される音声ストリームをこのバッファに逐次的に記憶する。音声出力サイト４０_ｎにおいては、音声ストリームが所定のフォーマットで圧縮されている場合には、これらの音声ストリームがバッファに記憶されると、このバッファを介して所定のデコード処理が行われる。そして、音声出力サイト４０_ｎにおいては、バッファからは、音声ストリームが所定容量以上記憶されると読み出され、外部へと出力される。なお、以下では、各音声出力サイト４０_ｎに配信される音声ストリームを、それぞれ、ＡＳ_１，ＡＳ_２，・・・，ＡＳ_８（以下、音声ストリームＡＳ_ｎ（ｎ＝１，２，・・・，８）と総称する。）と称するものとする。
【００７３】
音声出力サイト４０_ｎは、それぞれ、配信サイト３１からネットワークＮＴを介して供給される再生指示を示す制御情報を受け取ると、再生処理プログラムＢ_ｎの実行制御のもとに、サウンド処理部４１_ｎを介してトリガ信号ＴＧ_ｎをトリガ監視サイト１２に対して出力する。この間、音声出力サイト４０_ｎには、それぞれ、音声ストリームＡＳ_ｎが配信されており、音声出力サイト４０_ｎは、それぞれ、配信された音声ストリームＡＳ_ｎを図示しないバッファにストリーム順に記憶する。また、音声出力サイト４０_ｎは、それぞれ、トリガ監視サイト１２からネットワークＮＴを介して遅れサンプル数情報を受け取ると、この遅れサンプル数Ｓｄ_ｎと再生マージンサンプル数Ｓｍとに基づいて、バッファの先頭地点からデコードした音声ストリームＡＳ_ｎの再生を開始し、出力器１５_ｎｍ及びスピーカ１６_ｎｍを介して外部へと放音させる。
【００７４】
このような各部を備える音声再生装置３０においては、配信サイト３１に一括して記憶されている５６チャンネルの音声ファイルＡＦの再生指示を示す制御情報が与えられると、音声ファイルＡＦが７チャンネルの音声ストリームＡＳ_ｎとして、音声出力サイト４０_ｎのそれぞれに配信されるとともに、音声出力サイト４０_ｎのそれぞれから、トリガ監視サイト１２に対してトリガ信号ＴＧ_ｎが出力される。音声再生装置３０においては、トリガ監視サイト１２により、トリガ信号ＴＧ_ｎに基づいて、音声出力サイト４０_ｎのそれぞれに対応する遅れサンプル数Ｓｄ_ｎを求める。そして、音声再生装置３０においては、各音声出力サイト４０_ｎにより、この遅れサンプル数Ｓｄ_ｎと再生マージンサンプル数Ｓｍとに基づいて、自己が再生を開始すべきタイミングを求め、このタイミングに到達した時間に音声ストリームＡＳ_ｎの再生を開始することができる。音声再生装置３０においては、このようにして、全ての音声出力サイト４０_ｎによる音声ストリームＡＳ_ｎの再生開始のタイミングを一致させることができる。そして、音声再生装置３０においては、それ以後、ワードクロック生成器１４から供給されるワードクロックＷＣＬＫに基づいて音声出力サイト４０_ｎによって再生を進行することにより、マルチチャンネルの同期再生を行うことができる。
【００７５】
具体的には、音声再生装置３０においては、音声出力サイト４０_ｎは、それぞれ、再生処理プログラムＢ_ｎを実行し、図６に示す一連の工程を経ることによって音声ストリームＡＳ_ｎの再生を開始する。
【００７６】
まず、音声出力サイト４０_ｎには、それぞれ、同図に示すように、ステップＳ２１において、再生マージンサンプル数Ｓｍが設定される。この再生マージンサンプル数Ｓｍは、上述した音声再生装置１０の場合と同様に、再生指示を示す制御情報を受信した音声出力サイト４０_ｎがトリガ信号ＴＧ_ｎを出力する工程、トリガ監視サイト１２が全ての音声出力サイト４０_ｎからトリガ信号ＴＧ_ｎを受信する工程、トリガ監視サイト１２が音声出力サイト４０_ｎのそれぞれに対して遅れサンプル数Ｓｄ_ｎを通知する工程、全ての音声出力サイト４０_ｎが遅れサンプル数Ｓｄ_ｎを受信し、音声ストリームＡＳ_ｎの再生動作を開始する工程を行うのに十分な時間に加え、バッファに音声ストリームＡＳ_ｎが所定容量以上記憶されるのに十分な時間を見込んで算出される値である。
【００７７】
続いて、音声出力サイト４０_ｎは、それぞれ、ステップＳ２２において、配信サイト３１からネットワークＮＴを介して音声ファイルＡＦの再生指示を示す制御情報が到達したか否かを判別する。音声出力サイト４０_ｎは、それぞれ、配信サイト３１からの制御情報が到達するまで待機し、制御情報を受け取った時点で、ステップＳ２３へと処理を移行する。
【００７８】
音声出力サイト４０_ｎは、それぞれ、ステップＳ２３において、サウンド処理部４１_ｎにおけるトリガチャンネルに、トリガ信号ＴＧ_ｎを再生する。
【００７９】
これと同時に、音声出力サイト４０_ｎは、それぞれ、ステップＳ２４において、トリガ信号ＴＧ_ｎを再生してトリガ監視サイト１２に対して出力した時点のサンプルＳ_ｎを基準として、再生サンプル数のカウントを開始する。音声出力サイト４０_ｎにおいては、それぞれ、このサンプルＳ_ｎを基準として、音声ストリームＡＳ_ｎの再生開始のタイミングが決定されることになる。
【００８０】
さらにこれと同時に、音声出力サイト４０_ｎは、それぞれ、ステップＳ２５において、トリガチャンネル以外のチャンネルに“０”を出力する。
【００８１】
そして、音声出力サイト４０_ｎは、それぞれ、ステップＳ２６において、再生サンプル数のカウントを継続し、ステップＳ２７において、トリガ監視サイト１２から対応する遅れサンプル数Ｓｄ_ｎを示す遅れサンプル数情報がネットワークＮＴを介して到達したか否かを判別する。
【００８２】
ここで、遅れサンプル数情報が到達していない場合には、音声出力サイト４０_ｎは、それぞれ、ステップＳ２５乃至ステップＳ２６の処理を繰り返す。一方、遅れサンプル数情報が到達した場合には、音声出力サイト４０_ｎは、それぞれ、ステップＳ２８において、“（再生マージンサンプル数Ｓｍ）−（遅れサンプル数Ｓｄ_ｎ）”の値を算出し、トリガ監視サイト１２に対してトリガ信号ＴＧ_ｎを出力した時点のサンプルからカウントして、この算出した値が経過したサンプルに到達した時点から、バッファに記憶されている音声ストリームＡＳ_ｎの再生を開始し、一連の処理を終了する。
【００８３】
一方、音声再生装置３０においては、トリガ監視サイト１２は、監視プログラムＳｖを実行し、先に図３に示した一連の工程を経ることによって音声出力サイト４０_ｎのそれぞれに対応する遅れサンプル数Ｓｄ_ｎを求めることができる。
【００８４】
このようにして、音声再生装置３０においては、トリガ監視サイト１２によって監視プログラムＳｖを実行するとともに、音声出力サイト４０_ｎのそれぞれによって再生処理プログラムＢ_ｎを実行し、音声ファイルＡＦを構成する音声ストリームＡＳ_ｎの再生を同時に開始することができる。なお、音声再生装置３０においては、それ以後、音声出力サイト４０_ｎが、それぞれ、ワードクロック生成器１４から供給されるワードクロックＷＣＬＫに基づいて音声ストリームＡＳ_ｎの再生を進行することにより、マルチチャンネルの同期再生が可能となる。
【００８５】
以上のように、本発明の第２の実施の形態として示す音声再生装置３０は、配信サイト３１上に存在している音声ファイルＡＦを、各音声出力サイト４０_ｎが出力処理可能なチャンネル数分に分割して各音声出力サイト４０_ｎに配信し、配信された音声ストリームＡＳ_ｎを用いて、マルチチャンネルの同期再生を行うことができる。特に、音声再生装置３０は、互いに同期が確保されておらず、動作環境が多種多様にわたる音声出力サイト４０_ｎを用いた場合であっても、確実に同期再生を実現することができる。
【００８６】
つぎに、本発明の第３の実施の形態について説明する。この第３の実施の形態として図７に示す音声再生装置５０は、１つのソース信号に対して各音声出力サイトによって所定の信号処理を施し、５６チャンネルの音声信号の同期再生を行うものである。より換言すれば、音声再生装置５０は、１チャンネルの音声ファイルを各音声出力サイトに対してマルチキャスト配信し、配信された音声ファイルに対して各音声出力サイトによってチャンネル単位で個別の信号処理を施して同期再生することにより、例えばマルチスピーカを用いた制御、すなわち、従来ではハードウェアの能力やスピーカのチャンネル数の制限等によって小規模のシステムしか構成できず効果が低かった指向性、音場及び音像制御を、効果的に実現可能とするものである。なお、ここでも、上述した第１の実施の形態として示した音声再生装置１０又は第２の実施の形態として示した音声再生装置３０と同様の機能を有する各部については同一符号を付し、詳細な説明を省略するものとする。
【００８７】
音声再生装置５０は、同図に示すように、上述したトリガ監視サイト１２と、ワードクロック生成器１４と、５６個の出力器１５_ｎｍと、５６個のスピーカ１６_ｎｍとの他、記憶している１チャンネルのソースファイルである音声ファイルＡＦを出力するソース出力サイト５１と、音声信号を再生して出力する８つの音声出力サイト６０_１，６０_２，・・・，６０_８（以下、音声出力サイト６０_ｎ（ｎ＝１，２，・・・，８；サイト番号）と総称する。）とを備える。
【００８８】
ソース出力サイト５１は、上述したコントロールサイト１１又は配信サイト３１と同様に、音声再生装置５０の統括制御を行う機能を有するとともに、音声再生装置５０とユーザとの間のユーザインターフェースＵＩを有するのに加え、さらに、音声ファイルＡＦを図示しない記憶手段に記憶するものである。なお、このソース出力サイト５１に記憶されている音声ファイルＡＦは、１チャンネルであるものとして説明するが、複数チャンネルからなるものであってもよい。また、音声ファイルＡＦは、所定のフォーマットで圧縮されていてもよく、非圧縮であってもよい。ソース出力サイト５１は、例えばＴＣＰ／ＩＰやＵＤＰ等の所定のプロトコルにしたがって外部との通信を行うことが可能であり、ネットワークＮＴを介してトリガ監視サイト１２及び音声出力サイト６０_ｎとの間で信号の送受信を行うことが可能とされる。ソース出力サイト５１は、ソース出力プログラムＳｏを実行することが可能な環境とされており、所定の操作部や表示部等からなるユーザインターフェースＵＩを介してユーザによって再生コマンドが指示されると、ソース出力プログラムＳｏの実行制御のもとに、音声ファイルＡＦの再生指示を示す制御情報をネットワークＮＴを介して音声出力サイト６０_ｎに対して通知するとともに、１チャンネルの音声ファイルＡＦをそのまま又はストリームデータとして、全ての音声出力サイト６０_ｎに対してマルチキャスト配信する。なお、ネットワークＮＴを介して伝送される音声信号は、所定のフォーマットで圧縮されていてもよく、非圧縮であってもよい。ここでは、音声出力サイト６０_ｎに対して音声ファイルＡＦがそのままマルチキャスト配信されるものとする。
【００８９】
音声出力サイト６０_ｎは、それぞれ、上述した音声出力サイト２０_ｎ又は音声出力サイト４０_ｎと同様に、例えば８チャンネルの音声信号を出力処理可能なサウンドカード等のサウンド処理部６１_１，６１_２，・・・，６１_８（以下、サウンド処理部６１_ｎ（ｎ＝１，２，・・・，８）と総称する。）を有するパーソナルコンピュータ等を含むハードウェアの機能を有するものである。音声出力サイト６０_ｎは、それぞれ、ソース出力サイト５１及びトリガ監視サイト１２と同様に、ＴＣＰ／ＩＰやＵＤＰ等の所定のプロトコルにしたがってネットワークＮＴを介してソース出力サイト５１及びトリガ監視サイト１２との間で信号の送受信を行うことが可能とされる。また、音声出力サイト６０_ｎは、それぞれ、サウンド処理部６１_ｎにおける例えば第８チャンネルのみをトリガ監視サイト１２と接続しており、このチャンネルを用いてトリガ監視サイト１２に対してトリガ信号ＴＧ_ｎを出力することが可能とされる。音声出力サイト６０_ｎは、それぞれ、再生処理プログラムＣ_１，Ｃ_２，・・・，Ｃ_８（以下、再生処理プログラムＣ_ｎ（ｎ＝１，２，・・・，８）と総称する。）を実行することが可能な環境とされており、ワードクロック生成器１４から供給されるワードクロックＷＣＬＫ又はワード単位で各サンプルを同期せしめるのに十分なトリガ外部信号に同期して、サウンド処理部６１_ｎにおける処理を行う。このとき、音声出力サイト６０_ｎにおいては、それぞれ、サウンド処理部６１_ｎにおける第１チャンネル乃至第８チャンネルの間の同期は確保されているものとする。すなわち、音声出力サイト６０_ｎにおいては、それぞれ、ソース出力サイト５１から再生指示を受け取った場合には、サウンド処理部６１_ｎにおける第１チャンネル乃至第８チャンネルから同時に音声信号を出力する。また、音声出力サイト６０_ｎにおいては、音声ファイルＡＦが所定のフォーマットで圧縮されている場合には、この音声ファイルＡＦが図示しないバッファに一時的に記憶され、このバッファを介して所定のデコード処理が行われる。そして、音声出力サイト６０_ｎにおいては、バッファからは、音声ファイルＡＦが所定容量以上記憶されると読み出される。
【００９０】
また、音声出力サイト６０_ｎは、それぞれ、バッファから読み出された音声ファイルＡＦに対して所定の信号処理を施す。音声出力サイト６０_ｎは、それぞれ、図８に示すように、７チャンネル分の信号処理部６２_ｎｍ（ｎ＝１，２，・・・，８、ｍ＝１，２，・・・，７；チャンネル番号）を有する。これらの信号処理部６２_ｎｍは、ハードウェアとして構成することもでき、同図に示すように、再生処理プログラムＣ_ｎ等のソフトウェアによって構成することもできる。音声出力サイト６０_ｎは、それぞれ、各チャンネル毎に音声信号を信号処理部６２_ｎｍに入力する。信号処理としては、例えば、いわゆるＦＩＲ（Finite Impulse Responce）フィルタやＩＩＲ（Infinite Impulse Responce）フィルタ等のディジタルフィルタを施すことが考えられ、全体としていわゆるクロスフェード処理を施すものが考えられる。ただし、これらの信号処理部６２_ｎｍにおける遅延量は既知であり、簡単のため、各チャンネル間で一定であるものとする。
【００９１】
さらに、例えばディジタルフィルタにおけるフィルタ係数といった信号処理部６２_ｎｍにおける信号処理のパラメータは、外部から供給される信号処理方法を変更するための制御情報ＣＮＴに応じて、変更することが可能である。例えば、音声再生装置５０においては、ユーザによるユーザインターフェースＵＩの操作に応じてパラメータを変更することができ、ソース出力サイト５１から音声出力サイト６０_ｎに対してパラメータを変更する旨及びパラメータ値を示す制御情報ＣＮＴが通知される。より具体的には、例えば画像に連動して音像を移動させるようなアプリケーションを想定した場合、ユーザがユーザインターフェースＵＩたるジョイスティック等を操作することによって自己の位置を移動させるのにともない、制御情報ＣＮＴを音声出力サイト６０_ｎに対して通知し、信号処理のパラメータを変更させることにより、再現される音像の位置を移動させることができる。その他、音声出力サイト６０_ｎは、それぞれ、指向性の方向変更又は音場の残響パラメータの変更といったように、アプリケーションに応じて信号処理のパラメータを変更することができる。音声出力サイト６０_ｎにおいては、信号処理が施された音声信号が図示しないバッファに一時的に記憶され、所定容量以上記憶されると読み出される。
【００９２】
音声出力サイト６０_ｎは、それぞれ、ソース出力サイト５１からネットワークＮＴを介して供給される再生指示を示す制御情報を受け取ると、再生処理プログラムＣ_ｎの実行制御のもとに、サウンド処理部６１_ｎを介してトリガ信号ＴＧ_ｎをトリガ監視サイト１２に対して出力する。この間、音声出力サイト６０_ｎには、それぞれ、音声ファイルＡＦが配信されており、音声出力サイト６０_ｎは、それぞれ、配信された音声ファイルＡＦに対して信号処理部６２_ｎｍによって信号処理を施す。また、音声出力サイト６０_ｎは、それぞれ、トリガ監視サイト１２からネットワークＮＴを介して遅れサンプル数情報を受け取ると、この遅れサンプル数Ｓｄ_ｎと再生マージンサンプル数Ｓｍとに基づいて、信号処理後の音声信号の再生を開始し、出力器１５_ｎｍ及びスピーカ１６_ｎｍを介して外部へと放音させる。
【００９３】
このような各部を備える音声再生装置５０においては、音声出力サイト６０_ｎは、それぞれ、再生処理プログラムＣ_ｎを実行し、先に図２に示した一連の工程を経ることによって音声ファイルＡＦの再生を開始することができる。一方、音声再生装置５０においては、トリガ監視サイト１２は、監視プログラムＳｖを実行し、先に図３に示した一連の工程を経ることによって音声出力サイト６０_ｎのそれぞれに対応する遅れサンプル数Ｓｄ_ｎを求めることができる。
【００９４】
音声再生装置５０においては、ソース出力サイト５１に記憶されている１チャンネルの音声ファイルＡＦの再生指示を示す制御情報が与えられると、音声ファイルＡＦが音声出力サイト６０_ｎのそれぞれにマルチキャスト配信されるとともに、音声出力サイト６０_ｎのそれぞれから、トリガ監視サイト１２に対してトリガ信号ＴＧ_ｎが出力される。音声再生装置５０においては、トリガ監視サイト１２により、トリガ信号ＴＧ_ｎに基づいて、音声出力サイト６０_ｎのそれぞれに対応する遅れサンプル数Ｓｄ_ｎを求める。そして、音声再生装置５０においては、各音声出力サイト６０_ｎにより、この遅れサンプル数Ｓｄ_ｎと再生マージンサンプル数Ｓｍとに基づいて、自己が再生を開始すべきタイミングを求め、このタイミングに到達した時間に音声信号の再生を開始することができる。音声再生装置５０においては、このようにして、全ての音声出力サイト６０_ｎによる音声ファイルＡＦの再生開始のタイミングを一致させることができる。そして、音声再生装置５０においては、それ以後、ワードクロック生成器１４から供給されるワードクロックＷＣＬＫに基づいて音声出力サイト６０_ｎによって再生を進行することにより、マルチチャンネルの同期再生を行うことができる。
【００９５】
また、音声再生装置５０においては、途中で信号処理のパラメータを変更する場合には、ソース出力サイト５１から信号処理の内容を示す上述した制御情報ＣＮＴが音声出力サイト６０_ｎに与えられ、信号処理のパラメータが変更される。
このとき、音声再生装置５０においては、再生指示と同様に、制御情報ＣＮＴに関する遅れサンプル数を算出することにより、信号処理のパラメータを変更する時間の同期をとることができる。
【００９６】
このようにして、音声再生装置５０においては、トリガ監視サイト１２によって監視プログラムＳｖを実行するとともに、音声出力サイト６０_ｎのそれぞれによって再生処理プログラムＣ_ｎを実行し、音声ファイルＡＦの再生を同時に開始することができる。
【００９７】
以上のように、本発明の第３の実施の形態として示す音声再生装置５０は、ソース出力サイト５１上に存在している音声ファイルＡＦを、各音声出力サイト６０_ｎにマルチキャスト配信し、配信された音声ファイルＡＦに対して信号処理を施した音声信号を用いて、マルチチャンネルの同期再生を行うことができる。特に、音声再生装置５０は、互いに同期が確保されておらず、動作環境が多種多様にわたる音声出力サイト６０_ｎを用いた場合であっても、確実に同期再生を実現することができ、例えばマルチスピーカを用いた指向性、音場及び音像制御を優れた効果のもとに実現することができる。
【００９８】
つぎに、本発明の第４の実施の形態について説明する。この第４の実施の形態として図９に示す音声再生装置７０は、上述した第３の実施の形態として示した音声再生装置５０と同様に、１つのソースファイルに対して各音声出力サイトによって所定の信号処理を施し、５６チャンネルの音声信号の同期再生を行うものであるが、ネットワークトラフィックの影響を回避するために、例えばＴＣＰ／ＩＰやＵＤＰ等の所定のプロトコルにしたがって情報伝送を行うネットワークＮＴを設けず、サウンド処理部のみを用いて情報の送受信を行うものである。したがって、上述した第３の実施の形態として示した音声再生装置５０と同様の機能を有する各部については同一符号を付し、詳細な説明を省略するものとする。
【００９９】
音声再生装置７０は、同図に示すように、上述したワードクロック生成器１４と、５６個の出力器１５_ｎｍと、５６個のスピーカ１６_ｎｍとの他、記憶している１チャンネルのソースファイルである音声ファイルＡＦを出力するソース出力サイト７１と、トリガ信号ＴＧ_ｎを監視するトリガ監視サイト７３と、音声信号を再生して出力する８つの音声出力サイト８０_１，８０_２，・・・，８０_８（以下、音声出力サイト８０_ｎ（ｎ＝１，２，・・・，８；サイト番号）と総称する。）とを備える。
【０１００】
ソース出力サイト７１は、上述したソース出力サイト５１と同様に、音声再生装置７０の統括制御を行う機能を有するとともに、音声再生装置７０とユーザとの間のユーザインターフェースＵＩを有し、さらに、音声ファイルＡＦを図示しない記憶手段に記憶するものである。なお、このソース出力サイト７１に記憶されている音声ファイルＡＦは、１チャンネルであるものとして説明するが、複数チャンネルからなるものであってもよい。また、音声ファイルＡＦは、所定のフォーマットで圧縮されていてもよく、非圧縮であってもよい。ソース出力サイト７１は、少なくとも２チャンネル以上の音声信号を出力処理可能なサウンドカード等のサウンド処理部７２を有し、このサウンド処理部７２を介して、ユーザインターフェースＵＩを介したユーザによる再生指示や信号処理のパラメータ変更等を示す制御情報を音声出力サイト８０_ｎに対して通知するとともに、音声ファイルＡＦをそのまま又はストリームデータとして、全ての音声出力サイト８０_ｎに対してマルチキャスト配信することが可能とされる。このとき、ソース出力サイト７１は、サウンド処理部７２における第１チャンネルが音声ファイルＡＦの配信用に用いられ、第２チャンネルが制御情報の通知用に用いられる。
【０１０１】
また、ソース出力サイト７１は、サウンド処理部７２を介して制御情報を送受信する。この制御情報は、数値やパラメータ等の送受信を行うことが可能であればよく、その生成及び認識機構を送受信するサイト間で取り決めて使用すればいかなるものであってもよい。特に、ソース出力サイト７１及び音声出力サイト８０_ｎがディジタルデータを取り扱うものである場合には、各サイトは、制御情報を単なる信号列として扱うことができ、生成及び認識用のプログラムを単純化することができる。ソース出力サイト７１は、ソース出力プログラムＳｏを実行することが可能な環境とされており、ワードクロック生成器１４から供給されるワードクロックＷＣＬＫ又はワード単位で各サンプルを同期せしめるのに十分なトリガ外部信号に同期して、サウンド処理部７２における処理を行う。
【０１０２】
トリガ監視サイト７３は、少なくとも音声出力サイト８０_ｎの数以上、すなわち、８チャンネル以上の音声信号を入力処理及び出力処理可能なサウンドカード等のサウンド処理部７４を有し、このサウンド処理部７４を介して音声出力サイト８０_ｎから出力されるトリガ信号ＴＧ_ｎを受信することが可能とされる。また、トリガ監視サイト７３は、サウンド処理部７４を介して、遅れサンプル数Ｓｄ_ｎを音声出力サイト８０_ｎに対して通知することが可能とされる。このとき、トリガ監視サイト７３は、サウンド処理部７４における入力側の第１チャンネルが音声出力サイト８０_１と接続され、第２チャンネルが音声出力サイト８０_２と接続されるとともに、出力側の第１チャンネルが音声出力サイト８０_１と接続され、第２チャンネルが音声出力サイト８０_２と接続されるといったように、各チャンネルが音声出力サイト８０_ｎのそれぞれと対応させて接続される。トリガ監視サイト７３は、監視プログラムＳｖを実行することが可能な環境とされており、ワードクロック生成器１４から供給されるワードクロックＷＣＬＫ又はワード単位で各サンプルを同期せしめるのに十分なトリガ外部信号に同期して、サウンド処理部７４における処理を行う。
【０１０３】
トリガ監視サイト７３は、監視プログラムＳｖの実行制御のもとに、音声出力サイト８０_ｎのそれぞれから出力されるトリガ信号ＴＧ_ｎの入来を監視する。トリガ監視サイト７３は、トリガ信号ＴＧ_ｎを受け取ると、これらのトリガ信号ＴＧ_ｎに基づいて、音声出力サイト８０_ｎのそれぞれに対応する遅れサンプル数Ｓｄ_ｎを求め、これらの遅れサンプル数Ｓｄ_ｎを示す遅れサンプル数情報をサウンド処理部７４を介して音声出力サイト８０_ｎに対してそれぞれ出力する。
【０１０４】
音声出力サイト８０_ｎは、それぞれ、例えば８チャンネルの音声信号を出力処理可能であり且つ少なくとも３チャンネルの音声信号を入力処理可能なサウンドカード等のサウンド処理部８１_１，８１_２，・・・，８１_８（以下、サウンド処理部８１_ｎ（ｎ＝１，２，・・・，８）と総称する。）を有する。音声出力サイト８０_ｎは、これらのサウンド処理部８１_ｎを介してソース出力サイト７１から出力される音声ファイルＡＦ及び制御情報を受信することが可能とされ、さらに、トリガ監視サイト７３から出力される遅れサンプル数情報を受信することが可能とされる。例えば、音声出力サイト８０_ｎにおいては、それぞれ、サウンド処理部８１_ｎにおける入力側の第１チャンネルがソース出力サイト７１からの音声ファイルＡＦの受信部とされ、第２チャンネルがソース出力サイト７１からの制御情報の受信部とされ、第３チャンネルがトリガ監視サイト７３からの遅れサンプル数情報の受信部とされる。また、音声出力サイト８０_ｎは、それぞれ、サウンド処理部８１_ｎにおける出力側の例えば第８チャンネルのみをトリガ監視サイト７３と接続しており、このチャンネルを用いてトリガ監視サイト７３に対してトリガ信号ＴＧ_ｎを出力することが可能とされる。音声出力サイト８０_ｎは、それぞれ、再生処理プログラムＤ_１，Ｄ_２，・・・，Ｄ_８（以下、再生処理プログラムＤ_ｎ（ｎ＝１，２，・・・，８）と総称する。）を実行することが可能な環境とされており、ワードクロック生成器１４から供給されるワードクロックＷＣＬＫ又はワード単位で各サンプルを同期せしめるのに十分なトリガ外部信号に同期して、サウンド処理部８１_ｎにおける処理を行う。音声出力サイト８０_ｎにおいては、音声ファイルＡＦが所定のフォーマットで圧縮されている場合には、この音声ファイルＡＦが図示しないバッファに一時的に記憶され、このバッファを介して所定のデコード処理が行われる。そして、音声出力サイト８０_ｎにおいては、バッファからは、音声ファイルＡＦが所定容量以上記憶されると読み出される。
【０１０５】
また、音声出力サイト８０_ｎは、それぞれ、音声出力サイト６０_ｎと同様に、バッファから読み出された音声ファイルＡＦに対して所定の信号処理を施す。音声出力サイト８０_ｎにおいては、信号処理が施された音声信号が図示しないバッファに一時的に記憶され、所定容量以上記憶されると読み出される。
【０１０６】
音声出力サイト８０_ｎは、それぞれ、ソース出力サイト７１から供給される再生指示を示す制御情報をサウンド処理部８１_ｎを介して受け取ると、再生処理プログラムＤ_ｎの実行制御のもとに、サウンド処理部８１_ｎを介してトリガ信号ＴＧ_ｎをトリガ監視サイト７３に対して出力する。この間、音声出力サイト８０_ｎには、それぞれ、音声ファイルＡＦが配信されており、音声出力サイト８０_ｎは、それぞれ、配信された音声ファイルＡＦに対してここでは図示しない信号処理部によって信号処理を施す。また、音声出力サイト８０_ｎは、それぞれ、トリガ監視サイト７３から遅れサンプル数情報をサウンド処理部８１_ｎを介して受け取ると、この遅れサンプル数Ｓｄ_ｎと再生マージンサンプル数Ｓｍとに基づいて、信号処理後の音声信号の再生を開始し、出力器１５_ｎｍ及びスピーカ１６_ｎｍを介して外部へと放音させる。
【０１０７】
このような各部を備える音声再生装置７０においては、音声出力サイト８０_ｎは、それぞれ、再生処理プログラムＤ_ｎを実行し、先に図２に示した一連の工程を経ることによって音声ファイルＡＦの再生を開始することができる。一方、音声再生装置７０においては、トリガ監視サイト７３は、監視プログラムＳｖを実行し、先に図３に示した一連の工程を経ることによって音声出力サイト８０_ｎのそれぞれに対応する遅れサンプル数Ｓｄ_ｎを求めることができる。
【０１０８】
音声再生装置７０においては、音声再生装置５０と同様に、ソース出力サイト７１に記憶されている１チャンネルの音声ファイルＡＦの再生指示を示す制御情報が与えられると、音声ファイルＡＦが音声出力サイト８０_ｎのそれぞれにマルチキャスト配信されるとともに、音声出力サイト８０_ｎのそれぞれから、トリガ監視サイト７３に対してトリガ信号ＴＧ_ｎが出力される。音声再生装置７０においては、トリガ監視サイト７３により、トリガ信号ＴＧ_ｎに基づいて、音声出力サイト８０_ｎのそれぞれに対応する遅れサンプル数Ｓｄ_ｎを求める。そして、音声再生装置７０においては、各音声出力サイト８０_ｎにより、この遅れサンプル数Ｓｄ_ｎと再生マージンサンプル数Ｓｍとに基づいて、自己が再生を開始すべきタイミングを求め、このタイミングに到達した時間に音声信号の再生を開始することができる。音声再生装置７０においては、このようにして、全ての音声出力サイト８０_ｎによる音声ファイルＡＦの再生開始のタイミングを一致させることができる。そして、音声再生装置７０においては、それ以後、ワードクロック生成器１４から供給されるワードクロックＷＣＬＫに基づいて音声出力サイト８０_ｎによって再生を進行することにより、マルチチャンネルの同期再生を行うことができる。
【０１０９】
また、音声再生装置７０においては、音声再生装置５０と同様に、途中で信号処理のパラメータを変更する場合には、ソース出力サイト７１から信号処理の内容を示す制御情報が音声出力サイト８０_ｎに与えられ、信号処理のパラメータが変更される。このとき、音声再生装置７０においては、再生指示と同様に、制御情報に関する遅れサンプル数を算出することにより、信号処理のパラメータを変更する時間の同期をとることができる。
【０１１０】
このようにして、音声再生装置７０においては、トリガ監視サイト７３によって監視プログラムＳｖを実行するとともに、音声出力サイト８０_ｎのそれぞれによって再生処理プログラムＤ_ｎを実行し、音声ファイルＡＦの再生を同時に開始することができる。
【０１１１】
以上のように、本発明の第４の実施の形態として示す音声再生装置７０は、ソース出力サイト７１上に存在している音声ファイルＡＦを、各音声出力サイト８０_ｎにマルチキャスト配信し、配信された音声ファイルＡＦに対して信号処理を施した音声信号を用いて、マルチチャンネルの同期再生を行うことができる。特に、音声再生装置７０は、例えばＴＣＰ／ＩＰやＵＤＰ等の所定のプロトコルにしたがって情報伝送を行うネットワークを介して情報伝送を行うことがないため、ネットワークトラフィックの影響を回避することができ、確実に同期再生を実現することができる。
【０１１２】
最後に、本発明の第５の実施の形態について説明する。この第５の実施の形態として図１０に示す音声再生装置９０は、音声出力サイトにおけるサウンド処理部のうち、上述したトリガ信号を出力するトリガチャンネルを専用に設けるのではなく、音声信号を出力するチャンネルと共用するものである。すなわち、上述した実施の形態では、トリガチャンネルを使用するのは、音声信号の再生開始前に１度だけトリガ信号を再生する場合のみであり、このために専用のチャンネルを確保するのは用途によっては効率が悪い場合がある。そこで、音声再生装置９０は、ｎ個の音声出力サイトにおけるＭチャンネルの同時再生が可能なサウンド処理部のうち、全てのチャンネルを音声信号の出力に用いることにより、上述した（Ｍ−１）×ｎチャンネルの音声信号の同期再生ではなく、Ｍ×ｎチャンネルの音声信号の同期再生を実現可能とするものである。ここでは、音声再生装置９０は、８チャンネルの音声信号を出力処理可能な４つの音声出力サイトを用いて、３２チャンネルの音声信号の同期再生を行うものとして説明する。なお、ここでも、上述した第１の実施の形態として示した音声再生装置１０等と同様の機能を有する各部については同一符号を付し、詳細な説明を省略するものとする。
【０１１３】
音声再生装置９０は、同図に示すように、上述したワードクロック生成器１４と、３２個の出力器１５_１１，１５_１２，１５_１３，１５_１４，１５_１５，１５_１６，１５_１７，１５_１８，１５_２１，１５_２２，・・・，１５_４５，１５_４６，１５_４７，１５_４８（以下、出力器１５_ｎｍ（ｎ＝１，２，３，４；サイト番号、ｍ＝１，２，・・・，８；チャンネル番号）と総称する。）と、３２個のスピーカ１６_１１，１６_１２，１６_１３，１６_１４，１６_１５，１６_１６，１６_１７，１６_１８，１６_２１，１６_２２，・・・，１６_４５，１６_４６，１６_４７，１６_４８（以下、スピーカ１６_ｎｍ（ｎ＝１，２，３，４、ｍ＝１，２，・・・，８）と総称する。）との他、トリガ信号ＴＧ_ｎを監視するトリガ監視サイト９１と、音声信号を再生して出力する４つの音声出力サイト１００_１，１００_２，・・・，１００_４（以下、音声出力サイト１００_ｎ（ｎ＝１，２，３，４）と総称する。）とを備える。この音声再生装置９０は、例えば映画館のようなエンターテイメントシアタＥＴＨに適用した場合を想定しており、出力器１５_ｎｍ及びスピーカ１６_ｎｍが当該エンターテイメントシアタＥＴＨの４壁面に埋め込まれ、３２チャンネルの独立音声によって臨場感のある音場を構築可能としている。
なお、音声再生装置９０においては、４つの音声出力サイト１００_ｎのそれぞれに対応する８つずつの出力器１５_ｎｍ及びスピーカ１６_ｎｍを、１つのブロックＢＫ_ｎ（ｎ＝１，２，３，４）とし、１つの壁面を構成するものとする。
【０１１４】
トリガ監視サイト９１は、上述したトリガ監視サイト１２と同様の機能を有するとともに、上述したコントロールサイト１１と同様の機能を兼ね備え、さらに、所定の映像からなる映画ファイルＭＦを図示しない記憶手段に記憶して、この映画ファイルＭＦを配信するシネマ配信サイトとしても機能するものである。トリガ監視サイト９１は、例えばＴＣＰ／ＩＰやＵＤＰ等の所定のプロトコルにしたがって外部との通信を行うことが可能であり、ネットワークＮＴを介して音声出力サイト１００_ｎとの間で信号の送受信を行うことが可能とされる。トリガ監視サイト９１は、監視プログラムＳｖを実行することが可能な環境とされており、所定の操作部や表示部等からなるユーザインターフェースＵＩを介してユーザによって再生コマンドが指示されると、監視プログラムＳｖの実行制御のもとに、音声ファイルＡＦ_ｎの再生指示を示す制御情報をネットワークＮＴを介して音声出力サイト１００_ｎに対して通知するとともに、映画ファイルＭＦに対して画像処理部９３によって所定の画像処理を施し、音声ファイルＡＦと同期させながら、映像データＩＭとしてプロジェクタＰＪへと供給してエンターテイメントシアタＥＴＨにおけるスクリーンに投影させる。なお、このトリガ監視サイト９１に記憶されている映画ファイルＭＦは、所定のフォーマットで圧縮されていてもよく、非圧縮であってもよく、画像処理部９３によってデコード処理が行われる。
【０１１５】
また、トリガ監視サイト９１は、少なくとも音声出力サイト２０_ｎの数以上、すなわち、４チャンネル以上の音声信号を入力処理可能なサウンドカード等のサウンド処理部９２を有し、このサウンド処理部９２を介して音声出力サイト１００_ｎから出力されるトリガ信号ＴＧ_ｎを受信することが可能とされる。トリガ監視サイト９１は、ワードクロック生成器１４から供給されるワードクロックＷＣＬＫ又はワード単位で各サンプルを同期せしめるのに十分なトリガ外部信号に同期して、サウンド処理部９２における処理を行う。
【０１１６】
トリガ監視サイト９１は、監視プログラムＳｖの実行制御のもとに、音声出力サイト１００_ｎのそれぞれから出力される音声信号の波形解析を行い、後述するように、この音声信号に埋め込まれたトリガ信号ＴＧ_ｎを監視する。トリガ監視サイト９１は、トリガ信号ＴＧ_ｎを検出すると、これらのトリガ信号ＴＧ_ｎに基づいて、音声出力サイト１００_ｎのそれぞれに対応する遅れサンプル数Ｓｄ_ｎを求め、これらの遅れサンプル数Ｓｄ_ｎを示す遅れサンプル数情報をネットワークＮＴを介して音声出力サイト１００_ｎに対してそれぞれ出力する。
【０１１７】
音声出力サイト１００_ｎは、それぞれ、上述した音声出力サイト２０_ｎと同様に、例えば８チャンネルの音声信号を出力処理可能なサウンドカード等のサウンド処理部１０１_１，１０１_２，・・・，１０１_８（以下、サウンド処理部１０１_ｎ（ｎ＝１，２，・・・，８）と総称する。）を有する。音声出力サイト１００_ｎは、それぞれ、ＴＣＰ／ＩＰやＵＤＰ等の所定のプロトコルにしたがってネットワークＮＴを介してトリガ監視サイト９１との間で信号の送受信を行うことが可能とされる。また、音声出力サイト１００_ｎは、それぞれ、サウンド処理部１０１_ｎにおける例えば第８チャンネルのみを出力器１５_ｎｍとともにトリガ監視サイト９１とも接続しており、このチャンネルを用いてトリガ監視サイト９１に対してトリガ信号ＴＧ_ｎを出力することが可能とされる。なお、このトリガ信号ＴＧ_ｎは、後述するように、音声信号として音声ファイルＡＦ_ｎに埋め込まれているものである。音声出力サイト１００_ｎは、それぞれ、再生処理プログラムＥ_１，Ｅ_２，・・・，Ｅ_８（以下、再生処理プログラムＥ_ｎ（ｎ＝１，２，・・・，８）と総称する。）を実行することが可能な環境とされており、ワードクロック生成器１４から供給されるワードクロックＷＣＬＫ又はワード単位で各サンプルを同期せしめるのに十分なトリガ外部信号に同期して、サウンド処理部１０１_ｎにおける処理を行う。このとき、音声出力サイト１００_ｎにおいては、それぞれ、サウンド処理部１０１_ｎにおける第１チャンネル乃至第８チャンネルの間の同期は確保されているものとする。すなわち、音声出力サイト１００_ｎにおいては、それぞれ、トリガ監視サイト９１から再生指示を受け取った場合には、サウンド処理部１０１_ｎにおける第１チャンネル乃至第８チャンネルから同時に音声信号を出力する。また、音声出力サイト１００_ｎは、それぞれ、図示しない記憶手段に、８チャンネルの音声ファイルＡＦ_ｎを記憶している。
【０１１８】
ここで、音声ファイルＡＦ_ｎは、例えば図１１に示すように、８チャンネルの音声信号からなる。音声ファイルＡＦ_ｎには、所定時刻ｔｃ以降に、各チャンネル毎に映画コンテンツ用の音声信号Ｃ_１，Ｃ_２，・・・，Ｃ_８が記録されている。さらに、音声ファイルＡＦ_ｎには、第８チャンネルの音声信号における時刻ｔｃよりも以前の時刻に、トリガ用の音声信号Ｔ_ｎが記録されている。このトリガ用の音声信号Ｔ_ｎは、後述するように、各音声出力サイト１００_ｎにおける音声ファイルＡＦ_ｎの再生によって出力器１５_ｎｍに出力されるものであり、各音声出力サイト１００_ｎによる映画コンテンツ用の音声信号Ｃ_１，Ｃ_２，・・・，Ｃ_８の再生前に、ユーザに聴取されることになる。また、トリガ用の音声信号Ｔ_ｎは、トリガ監視サイト９１からネットワークＮＴを介して供給される再生指示を示す制御情報の到達時刻のばらつきに起因して、各音声出力サイト１００_ｎの間でばらばらに再生されるものである。そのため、トリガ用の音声信号Ｔ_ｎは、例えば、映画のタイトル音、チャイム音又は小鳥の囀り声といったように、時間的にずれて再生されても聴取者にとって不快とならない有意的な音声信号とするのが望ましい。特に、トリガ用の音声信号Ｔ_ｎは、例えば音声再生装置９０が本発明を適用したシステムであることを示すサウンドロゴとして、積極的に活用することも考えられる。
【０１１９】
トリガ用の音声信号Ｔ_ｎには、例えば電子透かしによってトリガ信号ＴＧ_ｎが埋め込まれており、トリガ検出サイト９１及び音声出力サイト１００_ｎによって特定サンプル値にて検出可能とされる。また、音声ファイルＡＦ_ｎにおいては、トリガ信号ＴＧ_ｎが検出可能な時刻ｔｔから時刻ｔｃまでの時間は、再生マージンサンプル数Ｓｍ０として定義される。この再生マージンサンプル数Ｓｍ０は、音声ファイルＡＦ_ｎを作成する際に、例えば後述する各種処理が終了するのに十分な時間を見込んで算出される値である。なお、トリガ信号ＴＧ_ｎとしては、必ずしも電子透かしによって音声信号Ｔ_ｎに埋め込むものである必要はなく、トリガ監視サイト９１及び音声出力サイト１００_ｎが、それぞれ、音声信号Ｔ_ｎの波形を観測してトリガ信号であることを判定でき、且つ、そのトリガ信号の発生サンプルが特定可能であるならば、音声信号Ｔ_ｎの波形そのものをトリガ信号ＴＧ_ｎとして扱うようにしてもよい。
【０１２０】
音声出力サイト１００_ｎは、それぞれ、トリガ監視サイト９１からネットワークＮＴを介して供給される再生指示を示す制御情報を受け取ると、再生処理プログラムＥ_ｎの実行制御のもとに、音声ファイルＡＦ_ｎの再生を開始し、サウンド処理部１０１_ｎを介して出力器１５_ｎｍ及びスピーカ１６_ｎｍへと供給して外部へと放音させるとともに、サウンド処理部１０１_ｎにおける第８チャンネルから出力される音声信号をトリガ監視サイト９１に供給する。音声出力サイト１００_ｎは、それぞれ、音声ファイルＡＦ_ｎの波形解析を行い、トリガ信号ＴＧ_ｎを検出し、そのトリガ信号ＴＧ_ｎが埋め込まれていたサンプルを特定する。なお、この時点で、出力器１５_ｎｍ及びスピーカ１６_ｎｍを介して放音される音声は、第８チャンネルから出力される音声信号Ｔ_ｎのみであり、それ以外は無音である。
また、音声出力サイト１００_ｎは、それぞれ、トリガ監視サイト９１からネットワークＮＴを介して遅れサンプル数情報を受け取ると、この遅れサンプル数に基づいて、音声ファイルＡＦ_ｎの再生をスキップし、再び再生を開始して出力器１５_ｎｍ及びスピーカ１６_ｎｍを介して外部へと放音させる。
【０１２１】
このような各部を備える音声再生装置９０においては、音声ファイルＡＦ_ｎの再生指示を示す制御情報が与えられると、音声出力サイト１００_ｎのそれぞれから音声ファイルＡＦ_ｎが再生される。音声再生装置９０においては、トリガ監視サイト９１及び音声出力サイト１００_ｎによって音声ファイルＡＦ_ｎに埋め込まれたトリガ信号ＴＧ_ｎを検出し、トリガ監視サイト９１から音声出力サイト１００_ｎに対して、トリガ信号ＴＧ_ｎに基づいて求められた音声出力サイト１００_ｎのそれぞれに対応する遅れサンプル数Ｓｄ_ｎが通知される。そして、音声再生装置９０においては、各音声出力サイト１００_ｎにより、この遅れサンプル数Ｓｄ_ｎに基づいて、音声ファイルＡＦ_ｎの再生をスキップし、スキップ先の時点から再生を再開する。音声再生装置９０においては、このようにして、全ての音声出力サイト１００_ｎによる音声ファイルＡＦ_ｎの再生開始のタイミングを一致させることができる。そして、音声再生装置９０においては、それ以後、ワードクロック生成器１４から供給されるワードクロックＷＣＬＫに基づいて音声出力サイト１００_ｎによって再生を進行することにより、マルチチャンネルの同期再生を行うことができる。
【０１２２】
具体的には、音声再生装置９０においては、音声出力サイト１００_ｎは、それぞれ、再生処理プログラムＥ_ｎを実行し、図１２に示す一連の工程を経ることによって音声ファイルＡＦ_ｎを再生する。
【０１２３】
まず、音声出力サイト１００_ｎは、それぞれ、同図に示すように、ステップＳ３１において、トリガ監視サイト９１からネットワークＮＴを介して音声ファイルＡＦ_ｎの再生指示を示す制御情報が到達したか否かを判別する。音声出力サイト１００_ｎは、それぞれ、トリガ監視サイト９１からの制御情報が到達するまで待機し、制御情報を受け取った時点で、ステップＳ３２へと処理を移行する。
【０１２４】
音声出力サイト１００_ｎは、それぞれ、ステップＳ３２において、サウンド処理部１０１_ｎを介して音声ファイルＡＦ_ｎを再生する。
【０１２５】
これと同時に、音声出力サイト１００_ｎは、それぞれ、ステップＳ３３において、音声ファイルＡＦ_ｎに埋め込まれているトリガ信号ＴＧ_ｎを検出する。
【０１２６】
さらに、音声出力サイト１００_ｎは、それぞれ、ステップＳ３４において、トリガ信号ＴＧ_ｎが埋め込まれていたサンプルとして特定したサンプルを基準として、再生サンプル数のカウントを開始する。音声出力サイト１００_ｎにおいては、それぞれ、このカウント開始のサンプルを基準として、音声ファイルＡＦ_ｎの再生開始のタイミングが決定されることになる。
【０１２７】
そして、音声出力サイト１００_ｎは、それぞれ、ステップＳ３５において、再生サンプル数のカウントを継続し、ステップＳ３６において、トリガ監視サイト９１から対応する遅れサンプル数Ｓｄ_ｎを示す遅れサンプル数情報がネットワークＮＴを介して到達したか否かを判別する。
【０１２８】
ここで、遅れサンプル数情報が到達していない場合には、音声出力サイト１００_ｎは、それぞれ、ステップＳ３５の処理を繰り返す。
【０１２９】
一方、遅れサンプル数情報が到達した場合には、音声出力サイト１００_ｎは、それぞれ、ステップＳ３７において、遅れサンプル数Ｓｄ_ｎだけ音声ファイルＡＦ_ｎの再生をスキップし、スキップ先の時点から再生を再開し、一連の処理を終了する。
【０１３０】
このようにして、音声再生装置９０においては、音声出力サイト１００_ｎのそれぞれによって再生処理プログラムＥ_ｎを実行し、音声ファイルＡＦ_ｎにおける映画コンテンツ用の音声信号の再生を同時に開始することができる。なお、音声再生装置９０においては、それ以後、音声出力サイト１００_ｎが、それぞれ、ワードクロック生成器１４から供給されるワードクロックＷＣＬＫに基づいて音声ファイルＡＦ_ｎの再生を進行することにより、マルチチャンネルの同期再生が可能となる。
【０１３１】
一方、音声再生装置９０においては、トリガ監視サイト９１は、監視プログラムＳｖを実行し、図１３に示す一連の工程を経ることによって音声出力サイト１００_ｎのそれぞれに対応する遅れサンプル数Ｓｄ_ｎを求めることができる。
【０１３２】
まず、トリガ監視サイト９１は、同図に示すように、ステップＳ４１において、再生指示を示す制御情報を音声出力サイト１００_ｎのそれぞれに対して通知する。
【０１３３】
続いて、トリガ監視サイト９１は、ステップＳ４２において、音声出力サイト１００_ｎのそれぞれからサウンド処理部９２における各チャンネルに出力される音声ファイルＡＦ_ｎの入来を監視する。
【０１３４】
続いて、トリガ監視サイト９１は、ステップＳ４３において、サウンド処理部９２における４つのチャンネルのうちのいずれかのチャンネルに、音声ファイルＡＦ_ｎのいずれかが供給されたか否かを判別する。
【０１３５】
ここで、音声ファイルＡＦ_ｎのいずれかが供給されていない場合には、トリガ監視サイト９１は、ステップＳ４２へと処理を移行し、音声ファイルＡＦ_ｎのいずれかが供給されるまで監視し続ける。
【０１３６】
一方、音声ファイルＡＦ_ｎのいずれかが供給された場合には、トリガ監視サイト９１は、ステップＳ４４において、音声ファイルＡＦ_ｎに埋め込まれているトリガ信号ＴＧ_ｎを検出する。
【０１３７】
続いて、トリガ監視サイト９１は、ステップＳ４５において、検出されたトリガ信号ＴＧ_ｎが最初に供給された音声ファイルＡＦ_ｎに埋め込まれていたものであるか否か、すなわち、初めてトリガ信号ＴＧ_ｎが検出されたのか否かを判別する。
【０１３８】
ここで、トリガ信号ＴＧ_ｎが最初に検出されたものでない場合には、トリガ監視サイト９１は、ステップＳ４７へと処理を移行する。一方、トリガ信号ＴＧ_ｎが最初に検出されたものであった場合には、トリガ監視サイト９１は、ステップＳ４６において、この最初に検出したトリガ信号ＴＧ_ｎが入力された時点のサンプルを基準サンプルとして、入力サンプル数のカウントを開始し、ステップＳ４７へと処理を移行する。トリガ監視サイト９１においては、この基準サンプルに基づいて、入力サンプル数をカウントすることによって他のトリガ信号ＴＧ_ｎの入力時刻を計測することになる。
【０１３９】
そして、トリガ監視サイト９１は、ステップＳ４７において、ステップＳ４６にて基準サンプルを基点としてカウントを開始しているカウント値を遅れサンプル数Ｓｄ_ｎとし、トリガ信号ＴＧ_ｎを出力した対応する音声出力サイト１００_ｎに対して、この遅れサンプル数Ｓｄ_ｎを示す遅れサンプル数情報をネットワークＮＴを介して出力する。なお、トリガ監視サイト９１は、最初に音声ファイルＡＦ_ｎを出力した音声出力サイト１００_ｎに対しては、遅れサンプル数Ｓｄ_ｎ＝０を通知することはいうまでもない。
【０１４０】
続いて、トリガ監視サイト９１は、ステップＳ４８において、入力サンプル数のカウントを継続し、ステップＳ４９において、全ての音声出力サイト１００_ｎからトリガ信号ＴＧ_ｎを受信したか否かを判別する。
【０１４１】
ここで、全ての音声出力サイト１００_ｎからトリガ信号ＴＧ_ｎを受信していない場合には、トリガ監視サイト９１は、ステップＳ４２乃至ステップＳ４８の処理を繰り返す。
【０１４２】
一方、全ての音声出力サイト１００_ｎからトリガ信号ＴＧ_ｎを受信した場合には、トリガ監視サイト９１は、そのまま一連の処理を終了する。
【０１４３】
なお、音声出力サイト１００_ｎのいずれかが故障等によってトリガ信号ＴＧ_ｎをトリガ監視サイト９１に与えない場合があり得るが、この場合には、トリガ監視サイト９１は、ステップＳ４９からステップＳ４２へと移行するループ処理を適宜抜けるようにしてもよい。
【０１４４】
このようにして、音声再生装置９０においては、トリガ監視サイト９１によって監視プログラムＳｖを実行し、音声出力サイト１００_ｎのそれぞれに対応する遅れサンプル数Ｓｄ_ｎを求めることができ、音声出力サイト１００_ｎによる同期再生を実現させることが可能となる。なお、トリガ監視サイト９１は、この一連の処理の間、画像処理部９３によって映画ファイルＭＦに対する画像処理を施し、音声出力サイト１００_ｎによる音声ファイルＡＦ_ｎにおける映画コンテンツ用の音声信号の再生開始と同期させて、映像データＩＭをプロジェクタＰＪを介してエンターテイメントシアタＥＴＨにおけるスクリーンに投影させる。
【０１４５】
このようなプログラムを実行するトリガ監視サイト９１及び音声出力サイト１００_ｎを備える音声再生装置９０においては、例えば図１４に示すタイミングにしたがって各動作が協調して行われる。
【０１４６】
すなわち、音声再生装置９０においては、ワードクロック生成器１４によって生成されるワードクロックＷＣＬＫに基づいて、トリガ監視サイト９１及び音声出力サイト１００_ｎが動作する。音声再生装置９０においては、同図上段に示すように、ユーザインターフェースＵＩを介して再生コマンドが指示されると、この再生指示を示す制御情報がトリガ監視サイト９１から音声出力サイト１００_ｎのそれぞれにネットワークＮＴを介して通知される。この制御情報が到達する絶対的な時刻は、上述したように、ネットワークトラフィックの影響等に起因して、音声出力サイト１００_ｎの間でばらつきが生じる。ここでは、同図中段に示すように、音声出力サイト１００_１が最も早く制御情報を受信し、音声出力サイト１００_４，音声出力サイト１００_２の順序で制御情報を受信したものとする。
【０１４７】
すると、音声再生装置９０においては、上述したように、音声出力サイト１００_ｎから音声ファイルＡＦ_ｎが再生される。この音声出力サイト１００_ｎのそれぞれから音声ファイルＡＦ_ｎが再生される絶対的な時刻は、上述したように、音声出力サイト１００_ｎにおけるサウンド処理部１０１_ｎの相違や図示しないＣＰＵ等の処理能力等に起因して、音声出力サイト１００_ｎの間でばらつきが生じる。ここでは、同図中段に示すように、音声出力サイト１００_１が最も早い時点で音声ファイルＡＦ_１の再生を開始し、以後、音声出力サイト１００_４，音声出力サイト１００_２の順序で音声ファイルＡＦ_４，ＡＦ_２の再生を開始したものとする。
【０１４８】
そして、音声再生装置９０においては、上述したように、音声出力サイト１００_ｎのそれぞれによって音声ファイルＡＦ_ｎに埋め込まれたトリガ信号ＴＧ_ｎが検出されるとともに、トリガ監視サイト９１によってもトリガ信号ＴＧ_ｎが検出される。ここでは、同図下段に示すように、トリガ監視サイト９１によって最も早いサンプルＳ_１の時点でトリガ信号ＴＧ_１を検出し、以後、トリガ信号ＴＧ_４，ＴＧ_２の順序で検出したものとする。音声出力サイト１００_ｎは、それぞれ、これらのトリガ信号ＴＧ_ｎが特定された時点のサンプルＳ_ｎを基準として、再生サンプル数のカウントを開始する。また、ここでは、最も早くトリガ信号ＴＧ_１が検出されたサンプルＳ_１を基準サンプルとしている。また、トリガ監視サイト９１は、基準サンプルであるサンプルＳ_１から入力サンプル数のカウントを開始し、同図下段に示すように、遅れサンプル数Ｓｄ_ｎを算出する。音声再生装置９０においては、これらの遅れサンプル数Ｓｄ_ｎが対応する音声出力サイト１００_ｎに対して通知される。この遅れサンプル数Ｓｄ_ｎが到達する絶対的な時刻は、上述したように、ネットワークトラフィックの影響等に起因して、音声出力サイト１００_ｎの間でばらつきが生じる。ここでは、同図中段に示すように、音声出力サイト１００_１が最も早く遅れサンプル数情報を受信し、音声出力サイト１００_２，音声出力サイト１００_４の順序で遅れサンプル数情報を受信したものとする。
【０１４９】
そして、音声再生装置９０においては、音声出力サイト１００_ｎのそれぞれにより、遅れサンプル数Ｓｄ_ｎだけ音声ファイルＡＦ_ｎの再生がスキップされ、スキップ先の時点から再生が再開される。このとき、上述した再生マージンサンプル数Ｓｍ０が、少なくとも以下の工程、すなわち、再生指示を示す制御情報を受信した音声出力サイト１００_ｎが音声ファイルＡＦ_ｎを再生する工程、トリガ監視サイト９１が全ての音声出力サイト１００_ｎからトリガ信号ＴＧ_ｎを受信する工程、トリガ監視サイト９１が音声出力サイト１００_ｎのそれぞれに対して遅れサンプル数Ｓｄ_ｎを通知する工程、及び、全ての音声出力サイト１００_ｎが遅れサンプル数Ｓｄ_ｎを受信し、再生しているサンプルを補正する工程を行うのに十分な時間を見込んで算出されている場合には、スキップされたサンプルは、映画コンテンツに無関係の音声信号、すなわち、先に図１１に示した時刻ｔｔから時刻ｔｃまでに含まれる音声信号である。音声再生装置９０においては、トリガ監視サイト９１によって最初にトリガ信号ＴＧ_ｎが検出された時点のサンプルＳ_ｎからカウントして、再生マージンサンプル数Ｓｍ０だけ経過したサンプルから、音声ファイルＡＦ_ｎにおける映画コンテンツ用の音声信号Ｃ_１，Ｃ_２，・・・，Ｃ_８の再生が一斉に開始される。
【０１５０】
音声再生装置９０においては、音声出力サイト１００_ｎ間における映画コンテンツ用の音声信号Ｃ_１，Ｃ_２，・・・，Ｃ_８の再生開始位置を一定にし、それ以後は、ワードクロック生成器１４によって生成されるワードクロックＷＣＬＫに基づいて同期を保ちながら音声ファイルＡＦ_ｎの再生を進行する。
【０１５１】
以上のように、本発明の第５の実施の形態として示す音声再生装置９０は、音声出力サイト１００_ｎにおけるサウンド処理部１０１_ｎのうち、トリガ信号ＴＧ_ｎを出力するトリガチャンネルを専用に設けずに、音声信号を出力するチャンネルと共用し、効率よく音声信号の同期再生を行うことができる。
【０１５２】
以上説明したように、本発明の第１の実施の形態乃至第５の実施の形態として示す音声再生装置は、汎用の複数のハードウェアを用いて安価に構成することができ、処理能力やチャンネル数等の制限等によって従来では実現不可能であったマルチチャンネルの音声信号の同期再生を精確に行うことができる。したがって、音声再生装置は、例えば映画館やゲーム等のエンターテイメントシステムに適用することにより、簡易な構成のもとに非常に臨場感の高い音響再生を実現することができる。
【０１５３】
なお、本発明は、上述した実施の形態に限定されるものではなく、例えば、上述した第１の実施の形態及び第２の実施の形態の末尾に記載したように、各種変更が可能であり、これは、他の実施の形態についても必要に応じて適用可能である。
【０１５４】
また、本発明は、上述した第１の実施の形態乃至第５の実施の形態のうち、任意のものを可能な範囲で適宜組み合わせてもよい。
【０１５５】
このように、本発明は、その趣旨を逸脱しない範囲で適宜変更が可能であることはいうまでもない。
【０１５６】
【発明の効果】
以上詳細に説明したように、本発明にかかる音声再生装置は、複数チャンネルの音声信号を同期再生する音声再生装置であって、所定の基準信号を生成する基準信号生成手段と、基準信号に同期して複数チャンネルの音声信号を出力処理可能な音声出力処理器を介して、音声信号を再生して出力する互いに同期が確保されていない複数の音声出力手段と、基準信号に同期して少なくとも音声出力手段の数以上の音声信号を入力処理可能な音声入力処理器を介して、音声出力手段のそれぞれから少なくとも１チャンネル分だけ供給される信号であって音声出力手段のそれぞれにおける音声信号の再生タイミングを与えるための予行信号の入来を監視する予行信号監視手段とを備え、予行信号監視手段は、音声出力手段のそれぞれから供給された複数の予行信号の入力時間についての相対的なずれ時間を求め、ずれ時間を示す時間情報を、音声出力手段のそれぞれに対して通知し、音声出力手段は、それぞれ、時間情報に基づいて、音声信号の再生を開始すべきタイミングを決定する。
【０１５７】
したがって、本発明にかかる音声再生装置は、音声出力手段のそれぞれに対応する複数の予行信号の入力時間についての相対的なずれ時間を示す時間情報に基づいて、音声信号の再生を開始すべきタイミングを決定することにより、汎用の複数のハードウェアを用いた場合であっても、マルチチャンネルの音声信号の同期再生を精確に行うことができる。
【０１５８】
また、本発明にかかる音声再生方法は、複数チャンネルの音声信号を同期再生する音声再生方法であって、所定の基準信号に同期して複数チャンネルの音声信号を出力処理可能な音声出力処理器を介して、音声信号を再生して出力する互いに同期が確保されていない複数の音声出力手段のそれぞれから少なくとも１チャンネル分だけ供給される信号であって、音声出力手段のそれぞれにおける音声信号の再生タイミングを与えるための予行信号を、基準信号に同期して少なくとも音声出力手段の数以上の音声信号を入力処理可能な音声入力処理器を介して予行信号監視手段に対して入力し、音声出力手段のそれぞれから供給された複数の予行信号の入力時間についての相対的なずれ時間を求めるずれ時間算出工程と、ずれ時間を示す時間情報に基づいて、音声信号の再生を開始すべきタイミングを決定する再生開始タイミング決定工程とを備える。
【０１５９】
したがって、本発明にかかる音声再生方法は、音声出力手段のそれぞれに対応する複数の予行信号の入力時間についての相対的なずれ時間を示す時間情報に基づいて、音声信号の再生を開始すべきタイミングが決定されることにより、汎用の複数のハードウェアを用いた場合であっても、マルチチャンネルの音声信号の同期再生を精確に行うことが可能となる。
【図面の簡単な説明】
【図１】本発明の第１の実施の形態として示す音声再生装置の構成を説明するブロック図である。
【図２】同音声再生装置が備える音声出力サイトにおいて音声ファイルの再生を開始するまでの一連の工程を説明するフローチャートである。
【図３】同音声再生装置が備えるトリガ監視サイトにおいて音声出力サイトのそれぞれに対応する遅れサンプル数を求める際の一連の工程を説明するフローチャートである。
【図４】同音声再生装置における各部の動作タイミングを説明するための図である。
【図５】本発明の第２の実施の形態として示す音声再生装置の構成を説明するブロック図である。
【図６】同音声再生装置が備える音声出力サイトにおいて音声ストリームの再生を開始するまでの一連の工程を説明するフローチャートである。
【図７】本発明の第３の実施の形態として示す音声再生装置の構成を説明するブロック図である。
【図８】同音声再生装置の構成を説明するブロック図であって、特に、音声出力サイトが有する信号処理部の構成を説明するブロック図である。
【図９】本発明の第４の実施の形態として示す音声再生装置の構成を説明するブロック図である。
【図１０】本発明の第５の実施の形態として示す音声再生装置の構成を説明するブロック図である。
【図１１】音声ファイルの構造を説明する図である。
【図１２】同音声再生装置が備える音声出力サイトにおいて音声ファイルを再生する際の一連の工程を説明するフローチャートである。
【図１３】同音声再生装置が備えるトリガ監視サイトにおいて音声出力サイトのそれぞれに対応する遅れサンプル数を求める際の一連の工程を説明するフローチャートである。
【図１４】同音声再生装置における各部の動作タイミングを説明するための図である。
【符号の説明】
１０，３０，５０，７０，９０音声再生装置、１１コントロールサイト、１２，７３，９１トリガ監視サイト、１３，２１_ｎ，４１_ｎ，６１_ｎ，７２，７４，８１_ｎ，９２，１０１_ｎサウンド処理部、１４ワードクロック生成器、１５_ｎｍ出力器、１６_ｎｍスピーカ、２０_ｎ，４０_ｎ，６０_ｎ，８０_ｎ，１００_ｎ音声出力サイト、３１配信サイト、５１，７１ソース出力サイト、６２_ｎｍ信号処理部、９３画像処理部[0001]
BACKGROUND OF THE INVENTION
The present invention relates to an audio reproducing apparatus and audio reproducing method for synchronously reproducing multi-channel audio signals.
[0002]
[Prior art]
In recent years, in order to make it possible to listen to an audio signal reproduced from an audio device with an enhanced sense of reality, it is considered to construct an audio reproduction system that reproduces a multi-channel audio signal. As such an audio reproduction system, for example, an apparatus that reproduces a multi-channel audio signal recorded on a recording medium such as a DVD (Digital Versatile Disc) mainly for the purpose of listening in a space such as a home room or a vehicle. There are five channels to be output from each of the five speakers installed at the front right, front front, left front, right rear and left rear of the listener, and a sub in which only the low frequency component of the audio signal is recorded. A system having a channel, that is, a so-called Dolby AC-3 (trademark) 5.1 channel system has been put into practical use. In this 5.1-channel audio reproduction system, five speakers and low-frequency reproduction speakers installed in front of the listener's right front, front front, left front, right rear and left rear are used. By reproducing, realistic sound reproduction can be performed.
[0003]
[Problems to be solved by the invention]
By the way, the present applicant reproduces sound from a small number of speakers, such as the 5.1 channel method as described above, in order to realize control of very precise directivity, sound field, sound image, etc. by multi-speakers. We are proceeding with the construction of a larger-scale audio reproduction system rather than a possible audio reproduction system.
[0004]
When considering the construction of such a large-scale audio reproduction system, it is of course possible to design completely new hardware, but it is desirable to construct it using existing general-purpose hardware. This is because there are many existing inexpensive semiconductors that have the functions required for general-purpose hardware such as so-called sound cards that can be installed in personal computers. Compared to developing a processing mechanism, the efficiency of design man-hours can be improved, parts can be procured at low cost, and design can be easily performed.
[0005]
Here, hardware including such a personal computer often has a limited number of channels due to physical restrictions or system restrictions. Therefore, as a sound reproduction system, a plurality of hardware including a personal computer capable of outputting one or a plurality of channels of sound signals are connected via a network or the like, and the plurality of hardware are controlled under a predetermined local control. It is conceivable to distribute and reproduce audio signals from the wear. At this time, in the audio reproduction system, it is necessary to reproduce audio signals output from individual hardware in a completely synchronized manner, for example, in units of samples.
[0006]
Therefore, in the audio reproduction system, in order to realize synchronous reproduction, a so-called real-time OS (Operating System) is required, or special hardware capable of performing communication while accurately performing time synchronization with an external signal. Necessary and complicating the entire system and communication means.
[0007]
Also, in the audio reproduction system, in order to realize synchronized reproduction, it has been conventionally performed as a synchronization method of video, etc., and the clock between hardware is set by packet communication by a general-purpose network connecting each hardware. It is also possible to match.
[0008]
However, in the audio reproduction system, the time for the command to reach each hardware varies due to the influence of traffic on the network. Also, in the audio reproduction system, since various hardware are connected to each other, the processing time varies depending on the difference of each hardware, and it is difficult to predict the processing time. Further, in the sound reproduction system, there is a difference in the clock of the sound card itself. For this reason, in the audio reproduction system, it is difficult to specify the delay time between each hardware for each communication session, and it is necessary to rely on a statistical method, which is very unstable. Therefore, such a method is unsuitable for a sample that requires sample level synchronization in units of 1/444100 seconds, such as voice.
[0009]
The present invention has been made in view of such a situation, and provides an audio reproducing apparatus and an audio reproducing method capable of accurately synchronizing and reproducing a multi-channel audio signal using a plurality of general-purpose hardware. It is intended to do.
[0010]
[Means for Solving the Problems]
An audio reproducing apparatus according to the present invention that achieves the above-described object is an audio reproducing apparatus that reproduces an audio signal of a plurality of channels in synchronization with reference signal generating means for generating a predetermined reference signal, and in synchronization with the reference signal. A plurality of audio output means for reproducing and outputting audio signals via an audio output processor capable of output processing of audio signals of a plurality of channels, and at least audio output means synchronized with a reference signal The signal is supplied for at least one channel from each of the sound output means via a sound input processor capable of processing the sound signal of more than the number, and the reproduction timing of the sound signal in each of the sound output means is given. And a replay signal monitoring means for monitoring the arrival of a replay signal for the sound, and the replay signal monitoring means includes a plurality of sound output means supplied from each of the sound output means. A relative shift time with respect to the input time of the recurring signal is obtained, and time information indicating the shift time is notified to each of the sound output means, and the sound output means each of the sound signal based on the time information. It is characterized in that the timing for starting playback is determined.
[0011]
Such an audio reproduction device according to the present invention should start reproduction of an audio signal based on time information indicating a relative shift time with respect to input times of a plurality of replay signals corresponding to each of the audio output means. Determine timing.
[0012]
The audio reproduction method according to the present invention that achieves the above-described object is an audio reproduction method for synchronously reproducing an audio signal of a plurality of channels, and can output an audio signal of a plurality of channels in synchronization with a predetermined reference signal A signal that is supplied for at least one channel from each of a plurality of audio output means that are not synchronized with each other and that reproduces and outputs an audio signal via a sound output processor, and each of the audio output means The replay signal for giving the reproduction timing of the sound signal is input to the replay signal monitoring means via a sound input processor capable of inputting and processing at least the sound signals of the sound output means in synchronization with the reference signal. A deviation time calculating step for obtaining a relative deviation time for the input times of the plurality of recurrent signals supplied from each of the audio output means; Based on the time information indicating a time that is characterized by comprising a playback start timing determination step of determining when to start playing the audio signal.
[0013]
In such an audio reproduction method according to the present invention, reproduction of an audio signal should be started based on time information indicating a relative shift time with respect to input times of a plurality of replay signals corresponding to each of the audio output means. Timing is determined.
[0014]
DETAILED DESCRIPTION OF THE INVENTION
Hereinafter, specific embodiments to which the present invention is applied will be described in detail with reference to the drawings.
[0015]
This embodiment is an audio reproducing apparatus capable of reproducing at least two channels of multi-channel audio signals in synchronization using a plurality of hardware including a personal computer. This sound reproducing apparatus uses multi-tasking and general hardware that is not necessarily specialized for real-time processing, or an OS (Operating System) or a system corresponding to the OS, and an existing sound card or interface. By using general-purpose audio processing means such as hardware conforming to this and communication means, multi-channel synchronized reproduction of audio signals existing over a plurality of hardware is enabled.
[0016]
In the following description, the hardware is assumed to be configured by a memory that stores various programs, a CPU (Central Processing Unit) that executes the various programs, a DSP (Digital Signal Processor), and the like. In the following description, unless otherwise specified, audio signals existing in hardware are those existing as audio files on storage means such as a hard disk, and whether or not signal processing is performed on each hardware. The description will be made on the assumption that any of the audio streams existing on the memory is sequentially indicated regardless of whether or not. Further, hereinafter, a module indicating each functional unit is referred to as a “site”. This site does not necessarily match the unit of hardware. That is, the audio reproducing apparatus may be implemented by a plurality of sites mounted on a single hardware, or may be distributed and mounted on a plurality of hardware, and the configuration thereof is arbitrary.
[0017]
First, a first embodiment of the present invention will be described. In the audio reproduction apparatus 10 shown in FIG. 1 as the first embodiment, audio signals are distributed as audio files on each hardware, and 56-channel audio signals are generated using these audio files. Synchronous playback is performed.
[0018]
As shown in the figure, the audio reproduction device 10 includes a control site 11 which is a control means for controlling each part, and a trigger signal TG which is a recurring signal to be described later. ₁ , TG ₂ , ..., TG ₈ Trigger monitoring site 12 which is a recurrent signal monitoring means for monitoring the sound and eight audio output sites 20 which are audio output means for reproducing and outputting the audio signal ₁ , 20 ₂ , ..., 20 ₈ (Hereafter, audio output site 20 _n (N = 1, 2,..., 8; site number). ), A word clock generator 14 which is a reference signal generating means for generating a clock in units of words as a predetermined reference signal, and an audio output site 20 _n 56 output units 15 comprising a D / A converter that performs D / A (Digital / Analog) conversion on the audio signal output from the A / D converter, an amplifier that amplifies the analog audio signal, and the like. ₁₁ , 15 ₁₂ , 15 ₁₃ , 15 ₁₄ , 15 ₁₅ , 15 ₁₆ , 15 ₁₇ , 15 ₂₁ , 15 ₂₂ , ..., 15 ₈₄ , 15 ₈₅ , 15 ₈₆ , 15 ₈₇ (Hereafter, output device 15 _nm (N = 1, 2,..., 8, m = 1, 2,..., 7; channel number). ) And these output devices 15 _nm 56 loudspeakers 16 for emitting sound signals output from the sound sources to the outside ₁₁ , 16 ₁₂ , 16 ₁₃ , 16 ₁₄ , 16 ₁₅ , 16 ₁₆ , 16 ₁₇ , 16 ₂₁ , 16 ₂₂ , ..., 16 ₈₄ , 16 ₈₅ , 16 ₈₆ , 16 ₈₇ (Hereafter, speaker 16 _nm (N = 1, 2,..., 8, m = 1, 2,..., 7). ).
[0019]
The control site 11 has a function of performing overall control of the sound reproduction device 10 and a function related to an interface between the sound reproduction device 10 and the user. The control site 11 can communicate with the outside in accordance with a predetermined protocol such as TCP / IP (Transmission Control Protocol / Internet Protocol) or UDP (User Datagram Protocol), and trigger monitoring is performed via the network NT. Site 12 and audio output site 20 _n It is possible to send and receive signals to and from. The audio reproducing apparatus 10 does not need to prepare special hardware or construct a new one by using such generally known and frequently used network resources and general-purpose communication protocols. The control site 11 has an environment in which the control program Ct can be executed. When a playback command is instructed by the user via a user interface UI including a predetermined operation unit and display unit, the control site Ct Control information indicating an audio file playback instruction is transmitted to the audio output site 20 via the network NT. _n Notify against.
[0020]
The trigger monitoring site 12 is at least an audio output site 20 _n Or more, that is, a hardware function including a personal computer having a sound processing unit 13 such as a sound card capable of inputting and processing an audio signal as a trigger signal to be described later of 8 channels or more. Similar to the control site 11, the trigger monitoring site 12 is connected to the control site 11 and the audio output site 20 via the network NT according to a predetermined protocol such as TCP / IP or UDP. _n It is possible to send and receive signals to and from. The trigger monitoring site 12 is connected to the audio output site 20 via the sound processing unit 13. _n Trigger signal TG, which is an audio signal for one channel output from, and is a signal for giving the reproduction timing of the audio file ₁ , TG ₂ , ..., TG ₈ (Hereafter, trigger signal TG _n (N = 1, 2,..., 8) ) Can be received. At this time, the trigger monitoring site 12 is configured such that the first channel in the sound processing unit 13 is the audio output site 20. ₁ And the second channel is connected to the audio output site 20. ₂ Each channel is connected to the audio output site 20 such as _n Are connected in correspondence with each of the above. The trigger monitoring site 12 is an environment capable of executing the monitoring program Sv, and is a trigger external signal sufficient to synchronize each sample with the word clock WCLK or the word unit supplied from the word clock generator 14. In synchronism with this, the sound processing unit 13 performs processing.
[0021]
The trigger monitoring site 12 is connected to all audio output sites 20 from the control site 11. _n Is output to the audio output site 20 under the execution control of the monitoring program Sv. _n Trigger signal TG output from each of _n Monitor incoming traffic. The trigger monitoring site 12 generates a trigger signal TG _n , These trigger signals TG _n Based on the audio output site 20 _n The number of delay samples to be described later corresponding to each of these is obtained, and the delay sample number information indicating these delay sample numbers is obtained via the network NT. _n Respectively. Here, the number of delayed samples is the trigger signal TG for the trigger monitoring site 12. _n The relative shift time with respect to the input time is obtained in units of the word clock WCLK.
[0022]
Note that the type of audio signal handled by the trigger monitoring site 12 is preferably digital data, but in the sound processing unit 13, the word clock WCLK and the trigger signal TG are used. _n As long as it can be accurately detected and transmitted / received, analog data may be used. In the trigger monitoring site 12, even if digital data is handled, any data format such as sampling rate or compression or non-compression may be used.
[0023]
Audio output site 20 _n Are each a sound processing unit 21 such as a sound card capable of outputting an 8-channel audio signal, for example. ₁ , 21 ₂ , ..., 21 ₈ (Hereinafter, the sound processing unit 21 _n (N = 1, 2,..., 8) ) Having hardware functions including a personal computer or the like. Audio output site 20 _n Performs transmission / reception of signals between the control site 11 and the trigger monitoring site 12 via the network NT in accordance with a predetermined protocol such as TCP / IP and UDP, respectively, similarly to the control site 11 and the trigger monitoring site 12. It is possible. Also, the audio output site 20 _n Respectively, the sound processing unit 21 _n For example, only the eighth channel is connected to the trigger monitoring site 12, and the trigger signal TG, which is an audio signal for one channel, is connected to the trigger monitoring site 12 using this channel. _n Can be output. Audio output site 20 _n Respectively, reproduction processing program A ₁ , A ₂ , ..., A ₈ (Hereinafter, reproduction processing program A _n (N = 1, 2,..., 8) The sound processing unit is synchronized with a word clock WCLK supplied from the word clock generator 14 or a trigger external signal sufficient to synchronize each sample in units of words. 21 _n The process in is performed. At this time, the audio output site 20 _n Respectively, the sound processing unit 21 _n It is assumed that synchronization between the first channel to the eighth channel is secured. That is, the audio output site 20 _n , In the case where a reproduction instruction is received from the control site 11 as will be described later, the sound processing unit 21 _n Audio signals are simultaneously output from the first channel to the eighth channel. Also, the audio output site 20 _n Are respectively stored in a storage means (not shown) in an audio file AF. ₁ , AF ₂ , ..., AF ₈ (Hereafter, audio file AF _n (N = 1, 2,..., 8) ) Is remembered.
[0024]
Audio output site 20 _n Are audio files AF supplied from the control site 11 via the network NT, respectively. _n When the control information indicating the reproduction instruction is received, the reproduction processing program A _n Under the execution control of the sound processing unit 21 _n Trigger signal TG via _n Is output to the trigger monitoring site 12. This trigger signal TG _n Is an audio signal having a predetermined waveform that can be recognized as a trigger by the trigger monitoring site 12, and is preferably an impulse signal, for example. Also, the audio output site 20 _n When the delay sample number information is received from the trigger monitoring site 12 via the network NT, the audio file AF is based on the delay sample number and the reproduction margin sample number described later. _n Decides the timing to start playback of the audio file AF at this timing _n Reproduction of the output device 15 _nm And speaker 16 _nm Sound is emitted to the outside via Audio output site 20 _n Respectively, the sound processing unit 21 ₁ The first through seventh channels are respectively output devices 15. _1m The audio signal output from each channel is connected to the corresponding output device 15 as shown in FIG. _nm Output for.
[0025]
Each audio output site 20 _n As for the type of the audio signal handled by, as with the trigger monitoring site 12, digital data is desirable, but the sound processing unit 21 _n , Word clock WCLK and trigger signal TG _n As long as it can be accurately detected and transmitted / received, analog data may be used. Also, the audio output site 20 _n In the above, even if digital data is handled, any data format such as sampling rate or compression or non-compression may be used.
[0026]
The word clock generator 14 generates an accurate clock in units of words. The word clock generator 14 sends the generated word clock WCLK to the sound processing unit 13 and the audio output site 20 at the trigger monitoring site 12. _n Sound processing unit 21 _n To supply. The word clock generator 14 may generate a trigger external signal sufficient to synchronize each sample in units of words.
[0027]
Output device 15 _nm Are the audio output sites 20 respectively. _n The audio signal output from is subjected to D / A conversion and then amplified. Output device 15 _nm Respectively, the amplified audio signal is output from the speaker 16. _nm Output for.
[0028]
Speaker 16 _nm Are output devices 15 respectively. _nm The electrical sound signal output from each of these is converted into acoustic energy and emitted to the outside.
[0029]
In the audio reproduction device 10 including such units, the audio output site 20 _n Audio file AF stored in each _n If the control information indicating the reproduction instruction is given by the control site 11, the audio output site 20 _n From each of the trigger signals TG to the trigger monitoring site 12 _n Is output. In the audio reproduction device 10, the trigger signal TG is transmitted by the trigger monitoring site 12. _n Based on the audio output site 20 _n Find the number of delayed samples corresponding to each of. In the audio reproduction device 10, each audio output site 20 _n Based on the number of delayed samples and the number of playback margin samples, the timing at which the player should start playback is obtained, and the audio file AF is reached when the timing is reached. _n Can start playing. In the audio reproduction device 10, all the audio output sites 20 are thus obtained. _n Audio file AF by _n The playback start timing can be matched. Then, in the audio reproduction device 10, the audio output site 20 is thereafter based on the word clock WCLK supplied from the word clock generator 14. _n Multi-channel synchronized playback can be performed by proceeding with playback.
[0030]
In the audio playback device 10, for example, the audio output site 20 ₁ Have the functions of the control site 11 and / or the trigger monitoring site 12, or other audio output sites 20 ₂ As described above, a plurality of sites can be mounted on a single piece of hardware or can be distributed and mounted on a plurality of pieces of hardware.
[0031]
In such an audio playback device 10, the audio output site 20 _n Respectively, reproduction processing program A _n And the audio file AF is performed through a series of steps shown in FIG. _n Start playing.
[0032]
First, the audio output site 20 _n As shown in the figure, the reproduction margin sample number Sm is set in step S1. This reproduction margin sample number Sm is determined by the audio output site 20. _n Is a common value between Specifically, the reproduction margin sample number Sm is determined from the reference sample described later from the audio file AF. _n This is a value representing the time until the sample at the start of playback. The reproduction margin sample number Sm is a value that is calculated in consideration of a time sufficient for completing various processes to be described later, for example, and may be set by the user or automatically set. Audio output site 20 _n Each stores the set reproduction margin sample number Sm in a memory (not shown) or the like.
[0033]
Next, the audio output site 20 _n Respectively, in step S2, the audio file AF from the control site 11 via the network NT. _n It is determined whether or not the control information indicating the reproduction instruction is reached. Audio output site 20 _n Since the reproduction instruction from the control site 11 is given via the network NT, the absolute time at which the control information arrives depends on the influence of network traffic and the like. _n Variation occurs between the two. Audio output site 20 _n Each waits until the control information from the control site 11 arrives, and when the control information is received, the process proceeds to step S3.
[0034]
Audio output site 20 _n Respectively, in step S3, the sound processing unit 21 _n In the eighth channel (m = 8), that is, in the trigger channel connected to the trigger monitoring site 12, the trigger signal TG which is an audio signal such as an impulse signal is provided. _n Play. Audio output site 20 _n Trigger signal TG from each of _n Is the absolute time at which the audio output site 20 _n Are different types of sound processing units 21. _n The same type of sound processing unit 21 _n Even if it is installed, the processing capability varies slightly, and it is unpredictable due to the operating status of other tasks operating in multitasking.
[0035]
At the same time, the audio output site 20 _n Respectively, in step S4, the trigger signal TG _n Sample S at the time of playback and output to the trigger monitoring site 12 _n Is used as a reference to start counting the number of playback samples. Audio output site 20 _n Respectively, this sample S _n Audio file AF based on _n The playback start timing is determined.
[0036]
At the same time, the audio output site 20 _n In step S5, “0” is output to channels other than the trigger channel, that is, the first to seventh channels (m = 1 to 7). That is, the audio output site 20 _n Respectively, in step S3, the trigger signal TG is transmitted to the eighth channel. _n The first channel to the seventh channel are muted.
[0037]
And the audio output site 20 _n Respectively continues to count the number of reproduced samples in step S6, and in step S7, the corresponding delayed sample number Sd from the trigger monitoring site 12 _n It is determined whether or not the delayed sample number information indicating (n = 1, 2,..., 8) has reached through the network NT. Audio output site 20 _n Since the delay sample number information from the trigger monitoring site 12 is given via the network NT, the absolute time at which the delay sample number information arrives depends on the influence of the network traffic and the like. _n Variation occurs between the two.
[0038]
Here, if the delayed sample number information has not arrived, the audio output site 20 _n Respectively repeat the processing of step S5 to step S6.
[0039]
On the other hand, when the delayed sample number information arrives, the audio output site 20 _n In step S8, “(reproduction margin sample number Sm) − (delay sample number Sd _n ) "And calculates the trigger signal TG to the trigger monitoring site 12. _n Is counted from the sample at the time of output, and the audio file AF is counted from the time when the calculated value is reached. _n Is started, and a series of processing ends.
[0040]
In this way, in the audio reproduction device 10, the audio output site 20 _n Each of the reproduction processing program A _n And execute the audio file AF _n Can be started simultaneously. In the audio playback device 10, the audio output site 20 is thereafter processed. _n Are respectively based on the word clock WCLK supplied from the word clock generator 14. _n By proceeding with the playback, multi-channel synchronized playback becomes possible.
[0041]
On the other hand, in the audio reproduction device 10, the trigger monitoring site 12 executes the monitoring program Sv and goes through a series of steps shown in FIG. _n The number of delayed samples Sd corresponding to each ₁ , Sd ₂ , ..., Sd ₈ (Hereinafter, the number of delayed samples Sd _n (N = 1, 2,..., 8) )
[0042]
First, the trigger monitoring site 12, as shown in FIG. _n Trigger signal TG output to each channel in the sound processing unit 13 _n Monitor incoming traffic.
[0043]
Subsequently, in step S12, the trigger monitoring site 12 sends a trigger signal TG to any one of the eight channels in the sound processing unit 13. _n It is determined whether any of the above has been supplied.
[0044]
Here, the trigger signal TG _n If any of the above is not supplied, the trigger monitoring site 12 proceeds to step S11, and trigger signal TG _n Continue to monitor until either of them is supplied.
[0045]
On the other hand, trigger signal TG _n When any of the trigger signals is supplied, the trigger monitoring site 12 sends the supplied trigger signal TG in step S13. _n Is first supplied, ie for the first time the trigger signal TG _n It is determined whether or not is supplied.
[0046]
Here, the trigger signal TG _n Is not the first supplied, the trigger monitoring site 12 moves the process to step S15. On the other hand, trigger signal TG _n , The trigger monitoring site 12 determines that the trigger signal TG detected first in step S14. _n Counting the number of input samples is started using the sample at the time when is input as a reference sample, and the process proceeds to step S15. At the trigger monitoring site 12, another trigger signal TG is counted by counting the number of input samples based on this reference sample. _n Will be measured.
[0047]
Then, in step S15, the trigger monitoring site 12 changes the count value that has started counting from the reference sample in step S14 as the number of delayed samples Sd. _n Trigger signal TG _n Corresponding audio output site 20 _n For this delay sample number Sd _n Is output via the network NT. The trigger monitoring site 12 first sets the trigger signal TG _n Audio output site 20 _n Is the number of delayed samples Sd _n Needless to say, 0 is notified.
[0048]
Subsequently, the trigger monitoring site 12 continues to count the number of input samples in step S16, and in step S17, all the voice output sites 20 _n To trigger signal TG _n Is received.
[0049]
Here, all audio output sites 20 _n To trigger signal TG _n Is not received, the trigger monitoring site 12 repeats the processes of steps S11 to S16.
[0050]
On the other hand, all audio output sites 20 _n To trigger signal TG _n Is received, the trigger monitoring site 12 ends the series of processes as it is.
[0051]
Audio output site 20 _n Either of the trigger signal TG due to failure etc. _n May not be given to the trigger monitoring site 12, but in this case, the trigger monitoring site 12 may appropriately exit the loop processing that shifts from step S17 to step S11.
[0052]
In this way, in the audio reproduction device 10, the monitoring program Sv is executed by the trigger monitoring site 12, and the audio output site 20 _n The number of delayed samples Sd corresponding to each _n Voice output site 20 _n It is possible to realize synchronized playback by means of.
[0053]
Trigger monitoring site 12 and audio output site 20 for executing such a program _n In the audio playback device 10 having the above, for example, each operation is performed in accordance with the timing shown in FIG.
[0054]
That is, in the audio reproduction device 10, the trigger monitoring site 12 and the audio output site 20 are based on the word clock WCLK generated by the word clock generator 14. _n Works. In the audio reproduction device 10, as shown in the upper part of the figure, when a reproduction command is instructed via the user interface UI, control information indicating the reproduction instruction is transmitted from the control site 11 to the audio output site 20. _n Are notified via the network NT. As described above, the absolute time at which this control information arrives is caused by the influence of network traffic, etc. _n Variation occurs between the two. Here, as shown in the middle of FIG. ₁ Receives the control information earliest and the voice output site 20 ₈ , Audio output site 20 ₂ It is assumed that the control information is received in the following order.
[0055]
Then, in the audio reproduction device 10, as described above, the audio output site 20 _n Trigger signal TG to trigger monitoring site 12 from _n Is given. This audio output site 20 _n Trigger signal TG from each of _n As described above, the absolute time at which is reproduced is the audio output site 20 _n Sound processing unit 21 _n Audio output site 20 due to the difference in the processing capacity of the CPU or the like (not shown). _n Variation occurs between the two. Here, as shown in the middle of FIG. ₁ Is the earliest sample S ₁ Trigger signal TG at ₁ , And then sample S ₈ , S ₂ Audio output site 20 at the time of ₈ , Audio output site 20 ₂ Trigger signal TG in the order ₈ , TG ₂ Shall be reproduced. Audio output site 20 _n Respectively, these trigger signals TG _n Sample S at the time when is output _n Is used as a reference to start counting the number of playback samples. Also, here, the trigger signal TG is the earliest ₁ Sample S from which was played ₁ Is the reference sample.
[0056]
In response to this, in the audio reproduction device 10, as described above, the trigger monitoring site 12 performs the sample S that is the reference sample. ₁ Starts counting the number of input samples, and as shown in the lower part of FIG. _n Is calculated. In the audio reproduction device 10, the number of delayed samples Sd _n Voice output site 20 that corresponds to _n Will be notified. This delayed sample number Sd _n As described above, the absolute time at which the voice output site 20 arrives is due to the influence of the network traffic, etc. _n Variation occurs between the two. Here, as shown in the middle of FIG. ₁ Is the earliest delay sample number information received, voice output site 20 ₂ , Audio output site 20 ₈ It is assumed that the delayed sample number information is received in the following order.
[0057]
In the audio reproduction device 10, the audio output site 20 _n , Trigger signal TG to trigger monitoring site 12 _n Sample S at the time of output _n (Reproduction margin sample number Sm−delay sample number Sd _n ) Audio file AF from sample _n Playback starts simultaneously. Note that the reproduction margin sample number Sm is at least the following process, that is, the audio output site 20 that has received the control information indicating the reproduction instruction. _n Is the trigger signal TG _n , The trigger monitoring site 12 is connected to all the audio output sites 20 _n To trigger signal TG _n The trigger monitoring site 12 receives the voice output site 20. _n The number of delayed samples Sd for each _n And all audio output sites 20 _n Is late sample number Sd _n Received, audio file AF _n This value is calculated in consideration of sufficient time for performing the step of starting the reproduction operation.
[0058]
In the audio playback device 10, the audio output site 20 _n Audio file AF _n After that, the audio file AF is maintained while maintaining the synchronization based on the word clock WCLK generated by the word clock generator 14. _n Proceed with playback.
[0059]
As described above, the audio reproduction device 10 shown as the first exemplary embodiment of the present invention has the audio output sites 20. _n Audio file AF distributed on top _n Can be used to perform multi-channel synchronized playback. In particular, the audio playback apparatuses 10 are not synchronized with each other, and the audio output site 20 has a wide variety of operating environments. _n Even in the case of using, synchronous reproduction can be surely realized. The audio reproduction device 10 uses, for example, a sound processing unit capable of simultaneous reproduction of M channels, and uses a trigger monitoring site 12 and n audio output sites 20. _n By configuring this, it is possible to realize synchronized reproduction of (M−1) × n channels.
[0060]
In the audio playback device 10, the role of the control site 11 described above is to transmit control information indicating a playback instruction for playback at the timing when the user operates the user interface UI. However, the time from the timing when the user operates the user interface UI to issue the playback instruction to the start of playback is the first trigger signal TG. _n Is defined by the sum of the time until the reproduction is performed and the set reproduction margin sample number Sm. These times vary depending on factors such as network traffic and hardware processing load. Therefore, the audio playback device 10 does not consider the absolute time of the playback start, but aims to match the timing of the playback start to the last, and is intended for the case where there is no problem with the absolute time. I want to be.
[0061]
Further, in the above-described embodiment, it has been described that only the reproduction instruction is given by the control site 11. However, in the audio reproduction device 10, for example, “stop”, “pause”, and “rewind” as necessary. It is also possible to give an instruction corresponding to a command corresponding to “fast forward” or the like. For example, in the audio reproduction device 10, the reproduction sample count is continued even after the reproduction is started, and each audio file AF _n Information about whether to stop or return to playback via the control site 11 _n Or audio file AF _n By specifying the playback position of the voice output site 20 _n Playback processing program A _n It is possible to realize a unified operation under the execution control.
[0062]
Furthermore, in the audio reproduction device 10, the control site 11 is not necessarily provided. The audio reproduction device 10 includes an audio output site 20 instead of including the control site 11. _n It is also possible to start synchronous reproduction at a preset time using a timer or the like included in each of the above. However, in this case, in the audio reproduction device 10, the audio output site 20 _n It is necessary to set the value of the reproduction margin sample number Sm on the assumption of the deviation time of the timer included in each of them, or to adjust each timer via the network NT so as to be within the value of the reproduction margin sample number Sm. .
[0063]
Furthermore, the audio playback device 10 may be hardware capable of realizing the function without using a personal computer as each site or using a sound card as a sound processing unit.
[0064]
Further, although the audio playback device 10 has been described as performing communication via the network NT in accordance with a predetermined protocol such as TCP / IP or UDP, the trigger monitoring site 12 to the audio output site 20 are described. _n The number of samples Sd delayed from _n Needless to say, any network layer format may be used.
[0065]
Further, in the above-described embodiment, the fixed value is used as the reproduction margin sample number Sm. However, for example, in the audio reproduction device 10, under the execution control of the monitoring program Sv by the trigger monitoring site 12, Each audio output site 20 _n Delay sample number Sd _n Is calculated as a reproduction margin sample number Sm with a sufficient expectation of delay in communication and audio signal output, and each audio output site 20 is transmitted through the network NT. _n You may make it notify and set. At this time, the audio reproduction device 10 can also statistically measure a delay that occurs when performing communication via the network NT, and can set the reproduction margin sample number Sm using this statistical information.
[0066]
Furthermore, in the above-described embodiment, the sound processing unit has been described as being synchronized with the word clock WCLK. However, in the audio reproduction device 10, a mechanism that ensures synchronization in the sound processing unit is not limited to the word clock. If so, for example, it may correspond to a 256-times clock, a clock of 1/2 word unit, or a general code such as SMPTE (Society of Motion Picture and Television Engineers).
[0067]
In the above-described embodiment, the trigger signal TG is earliest as a reference sample from which the count of the input samples is started at the trigger monitoring site 12. _n Sample S at the time when was played _n However, in the audio reproduction apparatus 10, a specific sample at an arbitrary time can be defined by the trigger monitoring site 12, and the number of delayed samples Sd is relatively large. ₁ , Sd ₂ , ..., Sd ₈ The information in which the time relationship is maintained is the audio output site 20 _n The reference sample can be of any time as long as it can be notified.
[0068]
Furthermore, in the above-described embodiment, the audio signal is the audio file AF. _n However, in the audio reproduction device 10, it may exist as an audio stream sequentially on the memory.
[0069]
Next, a second embodiment of the present invention will be described. As the second embodiment, the audio reproduction device 30 shown in FIG. 5 includes 56 audio files, and uses 8 audio output sites capable of outputting an 8-channel audio signal. The 56-channel audio signal is synchronously reproduced. Accordingly, parts having the same functions as those of the audio reproduction device 10 shown as the first embodiment are denoted by the same reference numerals, and detailed description thereof is omitted.
[0070]
As shown in the figure, the sound reproducing device 30 includes the trigger monitoring site 12, the word clock generator 14, and 56 output devices 15 described above. _nm And 56 speakers 16 _nm , A distribution site 31 that distributes the stored 56-channel audio file AF, and eight audio output sites 40 that reproduce and output audio signals. ₁ , 40 ₂ , ..., 40 ₈ (Hereinafter, voice output site 40 _n (N = 1, 2,..., 8; site number). ).
[0071]
Similar to the control site 11 described above, the distribution site 31 has a function of performing overall control of the audio reproduction device 30 and has a user interface UI between the audio reproduction device 30 and the user. The channel audio file AF is stored in a storage means (not shown). The audio file AF stored in the distribution site 31 may be compressed in a predetermined format or may be uncompressed. The distribution site 31 can communicate with the outside in accordance with a predetermined protocol such as TCP / IP or UDP, for example, via the network NT having a capacity sufficient to transmit the 56-channel audio file AF. Trigger monitoring site 12 and audio output site 40 _n It is possible to send and receive signals to and from. The distribution site 31 is configured to be capable of executing the distribution program Ds. When a reproduction command is instructed by the user via a user interface UI including a predetermined operation unit and display unit, the distribution program Ds. Under the execution control of the voice output site 40, the control information indicating the playback instruction of the voice file AF is transmitted via the network NT. _n And the audio output site 40 for each of the seven channels using the 56-channel audio file AF as stream data. _n Deliver against. Note that the audio stream transmitted via the network NT may be compressed in a predetermined format or may be uncompressed.
[0072]
Audio output site 40 _n Are the audio output sites 20 described above, respectively. _n Similarly to the sound processing unit 41 such as a sound card that can output and process an 8-channel audio signal, for example. ₁ , 41 ₂ , ..., 41 ₈ (Hereinafter, the sound processing unit 41 _n (N = 1, 2,..., 8) ) Having hardware functions including a personal computer or the like. Audio output site 40 _n Performs transmission / reception of signals between the distribution site 31 and the trigger monitoring site 12 via the network NT according to a predetermined protocol such as TCP / IP or UDP, respectively, in the same manner as the distribution site 31 and the trigger monitoring site 12. It is possible. Also, the audio output site 40 _n Respectively, the sound processing unit 41 _n For example, only the eighth channel is connected to the trigger monitoring site 12, and the trigger signal TG is transmitted to the trigger monitoring site 12 using this channel. _n Can be output.
Audio output site 40 _n Respectively, reproduction processing program B ₁ , B ₂ , ..., B ₈ (Hereafter, playback processing program B _n (N = 1, 2,..., 8)
The sound processing unit is synchronized with a word clock WCLK supplied from the word clock generator 14 or a trigger external signal sufficient to synchronize each sample in units of words. 41 _n The process in is performed. At this time, the audio output site 40 _n Respectively, the sound processing unit 41 _n It is assumed that synchronization between the first channel to the eighth channel is secured. That is, the voice output site 40 _n , When receiving a reproduction instruction from the distribution site 31, respectively, the sound processing unit 41 _n Audio signals are simultaneously output from the first channel to the eighth channel. Also, the audio output site 40 _n Each has a buffer (not shown) that temporarily stores an audio stream distributed from the distribution site 31, and sequentially stores the audio stream distributed from the distribution site 31 in this buffer. Audio output site 40 _n In the case where the audio streams are compressed in a predetermined format, when these audio streams are stored in a buffer, a predetermined decoding process is performed via the buffer. And the audio output site 40 _n In, when a predetermined volume or more of the audio stream is stored from the buffer, it is read out and output to the outside. In the following, each audio output site 40 _n Audio streams delivered to the AS respectively ₁ , AS ₂ , ..., AS ₈ (Hereafter, audio stream AS _n (N = 1, 2,..., 8) ).
[0073]
Audio output site 40 _n When receiving control information indicating a reproduction instruction supplied from the distribution site 31 via the network NT, the reproduction processing program B _n Under the execution control of the sound processing unit 41 _n Trigger signal TG via _n Is output to the trigger monitoring site 12. During this time, the audio output site 40 _n Respectively, the audio stream AS _n Is delivered and the audio output site 40 _n Are respectively delivered audio streams AS. _n Are stored in a buffer not shown in the order of streams. Also, the audio output site 40 _n When the delay sample number information is received from the trigger monitoring site 12 via the network NT, the delay sample number Sd _n And the audio stream AS decoded from the beginning of the buffer based on the reproduction margin sample number Sm. _n Reproduction of the output device 15 _nm And speaker 16 _nm Sound is emitted to the outside via
[0074]
In the audio reproducing device 30 having such units, when control information indicating a reproduction instruction for the 56-channel audio file AF stored in the distribution site 31 is given, the audio file AF is 7-channel audio. Stream AS _n As an audio output site 40 _n And the audio output site 40 _n From each of the trigger signals TG to the trigger monitoring site 12 _n Is output. In the sound reproducing device 30, the trigger monitoring site 12 causes the trigger signal TG. _n Based on the voice output site 40 _n The number of delayed samples Sd corresponding to each _n Ask for. In the audio reproduction device 30, each audio output site 40 _n By this delay sample number Sd _n And the playback margin sample number Sm, the timing at which the player should start playback is obtained, and at the time when this timing is reached, the audio stream AS _n Can start playing. In the audio reproducing device 30, all the audio output sites 40 are thus obtained. _n Audio stream AS _n The playback start timing can be matched. Thereafter, in the audio reproducing device 30, the audio output site 40 is based on the word clock WCLK supplied from the word clock generator 14. _n Multi-channel synchronized playback can be performed by proceeding with playback.
[0075]
Specifically, in the audio playback device 30, the audio output site 40 _n Respectively, reproduction processing program B _n And the audio stream AS is obtained through a series of steps shown in FIG. _n Start playing.
[0076]
First, the audio output site 40 _n As shown in the figure, the reproduction margin sample number Sm is set in step S21. This reproduction margin sample number Sm is the same as in the case of the audio reproduction device 10 described above, and the audio output site 40 that has received the control information indicating the reproduction instruction. _n Is the trigger signal TG _n , The trigger monitoring site 12 is connected to all the audio output sites 40. _n To trigger signal TG _n The trigger monitoring site 12 receives the voice output site 40. _n The number of delayed samples Sd for each _n Notification process, all audio output sites 40 _n Is late sample number Sd _n The audio stream AS _n In addition to sufficient time to perform the process of starting the playback operation of the audio stream AS _n Is a value calculated in anticipation of a sufficient time for storing a predetermined capacity or more.
[0077]
Subsequently, the audio output site 40 _n In step S22, it is determined whether or not control information indicating an instruction to reproduce the audio file AF has arrived from the distribution site 31 via the network NT. Audio output site 40 _n Each waits until the control information from the distribution site 31 arrives, and when the control information is received, the process proceeds to step S23.
[0078]
Audio output site 40 _n Respectively, in step S23, the sound processing unit 41 _n Trigger signal TG to the trigger channel at _n Play.
[0079]
At the same time, the audio output site 40 _n Respectively, in step S24, the trigger signal TG _n Sample S at the time of playback and output to the trigger monitoring site 12 _n Is used as a reference to start counting the number of playback samples. Audio output site 40 _n Respectively, this sample S _n Audio stream AS _n The playback start timing is determined.
[0080]
At the same time, the audio output site 40 _n In step S25, “0” is output to channels other than the trigger channel.
[0081]
And the audio output site 40 _n Respectively continues to count the number of reproduced samples in step S26, and in step S27, the corresponding delayed sample number Sd from the trigger monitoring site 12 _n It is determined whether or not the delayed sample number information indicating is reached via the network NT.
[0082]
Here, when the delayed sample number information has not arrived, the audio output site 40 _n Respectively repeat the processing of step S25 to step S26. On the other hand, when the delayed sample number information arrives, the audio output site 40 _n In step S28, “(reproduction margin sample number Sm) − (delay sample number Sd _n ) "And calculates the trigger signal TG to the trigger monitoring site 12. _n The audio stream AS stored in the buffer is counted from the time point when the calculated value is reached and the sample is reached. _n Is started, and a series of processing ends.
[0083]
On the other hand, in the audio playback device 30, the trigger monitoring site 12 executes the monitoring program Sv and goes through the series of steps shown in FIG. _n The number of delayed samples Sd corresponding to each _n Can be requested.
[0084]
In this way, in the audio reproduction device 30, the trigger monitoring site 12 executes the monitoring program Sv and the audio output site 40. _n Reproduction processing program B by each of _n And the audio stream AS constituting the audio file AF is executed. _n Can be started simultaneously. In the audio playback device 30, the audio output site 40 is thereafter used. _n Are respectively based on the word clock WCLK supplied from the word clock generator 14. _n By proceeding with the playback, multi-channel synchronized playback becomes possible.
[0085]
As described above, the audio reproduction device 30 shown as the second exemplary embodiment of the present invention converts the audio file AF existing on the distribution site 31 into each audio output site 40. _n Is divided into the number of channels that can be output, and each audio output site 40 _n Audio stream AS delivered to _n Can be used to perform multi-channel synchronized playback. In particular, the audio playback devices 30 are not synchronized with each other, and the audio output site 40 has a wide variety of operating environments. _n Even in the case of using, synchronous reproduction can be surely realized.
[0086]
Next, a third embodiment of the present invention will be described. As the third embodiment, the audio reproducing device 50 shown in FIG. 7 performs predetermined signal processing on each source signal for each source signal, and performs synchronous reproduction of 56-channel audio signals. . In other words, the audio reproduction device 50 multicasts a single channel audio file to each audio output site, and performs individual signal processing on a channel basis by each audio output site for the distributed audio file. For example, control using multiple speakers, i.e., the directivity, sound field and Sound image control can be effectively realized. In this case as well, components having the same functions as those of the audio reproduction device 10 shown as the first embodiment or the audio reproduction device 30 shown as the second embodiment are given the same reference numerals, and details Such explanation will be omitted.
[0087]
As shown in the figure, the sound reproducing device 50 includes the trigger monitoring site 12, the word clock generator 14, and 56 output devices 15 described above. _nm And 56 speakers 16 _nm In addition to the above, a source output site 51 that outputs a stored audio file AF that is a one-channel source file, and eight audio output sites 60 that reproduce and output audio signals ₁ , 60 ₂ , ..., 60 ₈ (Hereafter, audio output site 60 _n (N = 1, 2,..., 8; site number). ).
[0088]
The source output site 51 has a function of performing overall control of the audio playback device 50 and a user interface UI between the audio playback device 50 and the user, like the control site 11 or the distribution site 31 described above. In addition, the audio file AF is stored in a storage means (not shown). The audio file AF stored in the source output site 51 is described as having one channel, but may be composed of a plurality of channels. The audio file AF may be compressed in a predetermined format or may be uncompressed. The source output site 51 can communicate with the outside according to a predetermined protocol such as TCP / IP or UDP, for example, and the trigger monitoring site 12 and the audio output site 60 via the network NT. _n It is possible to send and receive signals to and from. The source output site 51 is an environment in which the source output program So can be executed. When a reproduction command is instructed by a user via a user interface UI including a predetermined operation unit and display unit, the source output site 51 Under the execution control of the output program So, control information indicating an instruction to reproduce the audio file AF is sent to the audio output site 60 via the network NT. _n 1 channel audio file AF as it is or as stream data, all audio output sites 60 _n Multicast distribution to Note that the audio signal transmitted via the network NT may be compressed in a predetermined format or uncompressed. Here, the audio output site 60 _n Suppose that the audio file AF is multicast-delivered as it is.
[0089]
Audio output site 60 _n Are the audio output sites 20 described above, respectively. _n Or audio output site 40 _n Similarly to the sound processing unit 61 such as a sound card that can output and process an 8-channel audio signal, for example. ₁ , 61 ₂ , ..., 61 ₈ (Hereinafter, the sound processing unit 61 _n (N = 1, 2,..., 8) ) Having hardware functions including a personal computer or the like. Audio output site 60 _n Transmits and receives signals to and from the source output site 51 and the trigger monitoring site 12 via the network NT according to a predetermined protocol such as TCP / IP and UDP, respectively, similarly to the source output site 51 and the trigger monitoring site 12. It is possible to perform. Also, the audio output site 60 _n Respectively, the sound processing unit 61 _n For example, only the eighth channel is connected to the trigger monitoring site 12, and the trigger signal TG is transmitted to the trigger monitoring site 12 using this channel. _n Can be output. Audio output site 60 _n Respectively, reproduction processing program C ₁ , C ₂ , ..., C ₈ (Hereinafter, reproduction processing program C _n (N = 1, 2,..., 8) The sound processing unit is synchronized with a word clock WCLK supplied from the word clock generator 14 or a trigger external signal sufficient to synchronize each sample in units of words. 61 _n The process in is performed. At this time, the audio output site 60 _n Respectively, the sound processing unit 61 _n It is assumed that synchronization between the first channel to the eighth channel is secured. That is, the audio output site 60 _n , When receiving a reproduction instruction from the source output site 51, the sound processing unit 61. _n Audio signals are simultaneously output from the first channel to the eighth channel. Also, the audio output site 60 _n When the audio file AF is compressed in a predetermined format, the audio file AF is temporarily stored in a buffer (not shown), and a predetermined decoding process is performed via this buffer. And the audio output site 60 _n Is read out from the buffer when the audio file AF is stored in a predetermined capacity or more.
[0090]
Also, the audio output site 60 _n Respectively perform predetermined signal processing on the audio file AF read from the buffer. Audio output site 60 _n , Respectively, as shown in FIG. _nm (N = 1, 2,..., 8, m = 1, 2,..., 7; channel number). These signal processing units 62 _nm Can also be configured as hardware, and as shown in FIG. _n It can also be configured by software such as. Audio output site 60 _n Respectively, the signal processing unit 62 converts the audio signal for each channel. _nm To enter. As the signal processing, for example, a digital filter such as a so-called FIR (Finite Impulse Response) filter or an IIR (Infinite Impulse Response) filter may be applied. However, these signal processing units 62 _nm The delay amount in is known and is assumed to be constant between channels for simplicity.
[0091]
Further, for example, a signal processing unit 62 such as a filter coefficient in a digital filter. _nm The signal processing parameters in can be changed according to control information CNT for changing the signal processing method supplied from the outside. For example, in the audio reproduction device 50, the parameters can be changed according to the operation of the user interface UI by the user, and the source output site 51 to the audio output site 60 can be changed. _n The control information CNT indicating the parameter change and the parameter value is notified. More specifically, for example, when an application that moves a sound image in conjunction with an image is assumed, the control information CNT is generated as the user moves his / her position by operating a joystick or the like as a user interface UI. Voice output site 60 _n The position of the reproduced sound image can be moved by changing the signal processing parameters. In addition, audio output site 60 _n Can change the parameters of the signal processing depending on the application, such as changing the direction of directivity or changing the reverberation parameter of the sound field. Audio output site 60 _n In, an audio signal that has been subjected to signal processing is temporarily stored in a buffer (not shown), and is read out when a predetermined capacity or more is stored.
[0092]
Audio output site 60 _n When receiving the control information indicating the reproduction instruction supplied from the source output site 51 via the network NT, the reproduction processing program C _n Under the execution control of the sound processing unit 61 _n Trigger signal TG via _n Is output to the trigger monitoring site 12. During this time, the audio output site 60 _n , Audio file AF is distributed, respectively, and the audio output site 60 _n Respectively, the signal processing unit 62 for the distributed audio file AF. _nm To perform signal processing. Also, the audio output site 60 _n When the delay sample number information is received from the trigger monitoring site 12 via the network NT, the delay sample number Sd _n And the reproduction margin sample number Sm, the reproduction of the audio signal after the signal processing is started, and the output unit 15 _nm And speaker 16 _nm Sound is emitted to the outside via
[0093]
In the audio reproduction device 50 including such units, the audio output site 60 _n Respectively, reproduction processing program C _n And the reproduction of the audio file AF can be started through the series of steps shown in FIG. On the other hand, in the audio reproduction device 50, the trigger monitoring site 12 executes the monitoring program Sv and goes through the series of steps shown in FIG. _n The number of delayed samples Sd corresponding to each _n Can be requested.
[0094]
In the audio playback device 50, when control information indicating a playback instruction for the 1-channel audio file AF stored in the source output site 51 is given, the audio file AF is stored in the audio output site 60. _n And the audio output site 60 _n From each of the trigger signals TG to the trigger monitoring site 12 _n Is output. In the audio reproduction device 50, the trigger monitoring site 12 causes the trigger signal TG. _n Based on the audio output site 60 _n The number of delayed samples Sd corresponding to each _n Ask for. In the audio reproduction device 50, each audio output site 60 is displayed. _n By this delay sample number Sd _n And the playback margin sample number Sm, the timing at which the player should start playback can be obtained, and playback of the audio signal can be started at the time when this timing is reached. In the audio reproducing device 50, all the audio output sites 60 are thus obtained. _n It is possible to match the reproduction start timing of the audio file AF. Thereafter, in the audio reproducing device 50, the audio output site 60 is based on the word clock WCLK supplied from the word clock generator 14. _n Multi-channel synchronized playback can be performed by proceeding with playback.
[0095]
Further, in the audio reproducing device 50, when the signal processing parameter is changed in the middle, the control information CNT indicating the content of the signal processing is transmitted from the source output site 51 to the audio output site 60. _n The signal processing parameters are changed.
At this time, the audio reproducing device 50 can synchronize the time for changing the parameters of the signal processing by calculating the number of delayed samples related to the control information CNT, similarly to the reproduction instruction.
[0096]
In this way, in the audio reproduction device 50, the monitoring program Sv is executed by the trigger monitoring site 12, and the audio output site 60 is also executed. _n Each of the reproduction processing program C _n And the reproduction of the audio file AF can be started at the same time.
[0097]
As described above, the audio reproduction device 50 shown as the third embodiment of the present invention uses the audio file AF existing on the source output site 51 as the audio output site 60. _n Multi-channel synchronized playback can be performed by using an audio signal that has been distributed to the audio file and subjected to signal processing on the distributed audio file AF. In particular, the audio playback devices 50 are not synchronized with each other, and the audio output site 60 has various operating environments. _n Even in the case of using, it is possible to reliably realize synchronous reproduction, for example, directivity, sound field and sound image control using a multi-speaker can be realized with excellent effects.
[0098]
Next, a fourth embodiment of the present invention will be described. The audio playback device 70 shown in FIG. 9 as the fourth embodiment is predetermined by each audio output site for one source file, like the audio playback device 50 shown as the third embodiment described above. In order to avoid the influence of network traffic, for example, a network NT that performs information transmission according to a predetermined protocol such as TCP / IP or UDP is performed. And transmitting / receiving information using only the sound processing unit. Accordingly, components having the same functions as those of the audio reproduction device 50 shown as the above-described third embodiment are denoted by the same reference numerals, and detailed description thereof is omitted.
[0099]
As shown in the figure, the audio reproducing device 70 includes the above-described word clock generator 14 and 56 output devices 15. _nm And 56 speakers 16 _nm , A source output site 71 that outputs a stored audio file AF that is a one-channel source file, and a trigger signal TG _n Trigger monitoring site 73 for monitoring the sound and eight sound output sites 80 for reproducing and outputting sound signals ₁ , 80 ₂ , ..., 80 ₈ (Hereafter, audio output site 80 _n (N = 1, 2,..., 8; site number). ).
[0100]
Similar to the source output site 51 described above, the source output site 71 has a function of performing overall control of the audio playback device 70, and has a user interface UI between the audio playback device 70 and the user, The file AF is stored in a storage means (not shown). The audio file AF stored in the source output site 71 is described as having one channel, but it may be composed of a plurality of channels. The audio file AF may be compressed in a predetermined format or may be uncompressed. The source output site 71 has a sound processing unit 72 such as a sound card that can output and process audio signals of at least two channels, and through this sound processing unit 72, a reproduction instruction by a user via the user interface UI or Control information indicating signal processing parameter changes, etc. _n And the audio file AF as it is or as stream data is sent to all the audio output sites 80. _n Can be distributed by multicast. At this time, in the source output site 71, the first channel in the sound processing unit 72 is used for distributing the audio file AF, and the second channel is used for notifying control information.
[0101]
The source output site 71 transmits and receives control information via the sound processing unit 72. This control information only needs to be able to transmit and receive numerical values, parameters, and the like, and any information can be used as long as the generation and recognition mechanism is agreed and used between the sites that transmit and receive. In particular, the source output site 71 and the audio output site 80 _n Can handle the control information as a simple signal sequence, and can simplify the program for generation and recognition. The source output site 71 is an environment in which the source output program So can be executed. The trigger output is sufficient to synchronize each sample with the word clock WCLK supplied from the word clock generator 14 or in units of words. The sound processing unit 72 performs processing in synchronization with the signal.
[0102]
The trigger monitoring site 73 is at least an audio output site 80. _n Or more, that is, a sound processing unit 74 such as a sound card capable of performing input processing and output processing on audio signals of 8 channels or more, and through this sound processing unit 74, an audio output site 80 _n Trigger signal TG output from _n Can be received. In addition, the trigger monitoring site 73 sends the number of delayed samples Sd via the sound processing unit 74. _n Voice output site 80 _n Can be notified. At this time, in the trigger monitoring site 73, the first channel on the input side in the sound processing unit 74 is the audio output site 80. ₁ And the second channel is connected to the audio output site 80. ₂ And the first channel on the output side is connected to the audio output site 80. ₁ And the second channel is connected to the audio output site 80. ₂ Each channel is connected to the audio output site 80 such as _n Are connected in correspondence with each of the above. The trigger monitoring site 73 is an environment in which the monitoring program Sv can be executed, and a trigger external signal sufficient to synchronize each sample with the word clock WCLK supplied from the word clock generator 14 or in units of words. In synchronism with this, the sound processing unit 74 performs processing.
[0103]
The trigger monitoring site 73 controls the voice output site 80 under the execution control of the monitoring program Sv. _n Trigger signal TG output from each of _n Monitor incoming traffic. The trigger monitoring site 73 generates a trigger signal TG _n , These trigger signals TG _n Based on the voice output site 80 _n The number of delayed samples Sd corresponding to each _n And the number of delayed samples Sd _n The information about the number of delayed samples is sent to the audio output site 80 via the sound processing unit 74. _n Respectively.
[0104]
Audio output site 80 _n The sound processing unit 81 such as a sound card that can output, for example, an 8-channel audio signal and can process an audio signal of at least 3 channels. ₁ , 81 ₂ , ..., 81 ₈ (Hereinafter, the sound processing unit 81 _n (N = 1, 2,..., 8) ). Audio output site 80 _n These sound processing units 81 _n It is possible to receive the audio file AF and control information output from the source output site 71 via the, and further to receive the delayed sample number information output from the trigger monitoring site 73. For example, an audio output site 80 _n Respectively, the sound processing unit 81 _n The first channel on the input side in FIG. 5 is a reception unit for the audio file AF from the source output site 71, the second channel is a reception unit for control information from the source output site 71, and the third channel is from the trigger monitoring site 73. The delay sample number information receiving unit. Also, the audio output site 80 _n Respectively, the sound processing unit 81 _n For example, only the eighth channel on the output side is connected to the trigger monitoring site 73, and the trigger signal TG is transmitted to the trigger monitoring site 73 using this channel. _n Can be output. Audio output site 80 _n Respectively, reproduction processing program D ₁ , D ₂ , ..., D ₈ (Hereafter, reproduction processing program D _n (N = 1, 2,..., 8) The sound processing unit is synchronized with a word clock WCLK supplied from the word clock generator 14 or a trigger external signal sufficient to synchronize each sample in units of words. 81 _n The process in is performed. Audio output site 80 _n When the audio file AF is compressed in a predetermined format, the audio file AF is temporarily stored in a buffer (not shown), and a predetermined decoding process is performed via this buffer. And voice output site 80 _n Is read out from the buffer when the audio file AF is stored in a predetermined capacity or more.
[0105]
Also, the audio output site 80 _n Are the audio output sites 60, respectively. _n Similarly to the above, predetermined signal processing is performed on the audio file AF read from the buffer. Audio output site 80 _n In, an audio signal that has been subjected to signal processing is temporarily stored in a buffer (not shown), and is read out when a predetermined capacity or more is stored.
[0106]
Audio output site 80 _n Respectively, control information indicating a reproduction instruction supplied from the source output site 71 is transmitted to the sound processing unit 81. _n Is received via the playback processing program D _n Under the execution control of the sound processing unit 81 _n Trigger signal TG via _n Is output to the trigger monitoring site 73. During this time, the audio output site 80 _n , Audio file AF is distributed, and the audio output site 80 _n Respectively, performs signal processing on the distributed audio file AF by a signal processing unit (not shown). Also, the audio output site 80 _n Respectively, the delay sample number information from the trigger monitoring site 73 is sent to the sound processing unit 81. _n This delay sample number Sd _n And the reproduction margin sample number Sm, the reproduction of the audio signal after the signal processing is started, and the output unit 15 _nm And speaker 16 _nm Sound is emitted to the outside via
[0107]
In the audio reproduction device 70 including such units, the audio output site 80 _n Respectively, reproduction processing program D _n And the reproduction of the audio file AF can be started through the series of steps shown in FIG. On the other hand, in the audio reproduction device 70, the trigger monitoring site 73 executes the monitoring program Sv and goes through the series of steps shown in FIG. _n The number of delayed samples Sd corresponding to each _n Can be requested.
[0108]
In the audio reproduction device 70, as in the audio reproduction device 50, when control information indicating a reproduction instruction for the one-channel audio file AF stored in the source output site 71 is given, the audio file AF is converted into the audio output site 80. _n And the audio output site 80 _n From each of the trigger signals TG to the trigger monitoring site 73 _n Is output. In the audio reproduction device 70, the trigger signal TG is transmitted by the trigger monitoring site 73. _n Based on the voice output site 80 _n The number of delayed samples Sd corresponding to each _n Ask for. In the audio reproduction device 70, each audio output site 80 is displayed. _n By this delay sample number Sd _n And the playback margin sample number Sm, the timing at which the player should start playback can be obtained, and playback of the audio signal can be started at the time when this timing is reached. In the audio reproduction device 70, all the audio output sites 80 are thus obtained. _n It is possible to match the reproduction start timing of the audio file AF. Thereafter, in the audio reproduction device 70, the audio output site 80 is based on the word clock WCLK supplied from the word clock generator 14 thereafter. _n Multi-channel synchronized playback can be performed by proceeding with playback.
[0109]
Further, in the audio reproduction device 70, similarly to the audio reproduction device 50, when changing the signal processing parameters on the way, control information indicating the content of the signal processing is transmitted from the source output site 71 to the audio output site 80. _n The signal processing parameters are changed. At this time, in the audio reproduction device 70, as in the case of the reproduction instruction, it is possible to synchronize the time for changing the signal processing parameters by calculating the number of delayed samples related to the control information.
[0110]
In this way, in the audio reproduction device 70, the monitoring program Sv is executed by the trigger monitoring site 73, and the audio output site 80 is also executed. _n Each of the reproduction processing program D _n And the reproduction of the audio file AF can be started at the same time.
[0111]
As described above, the audio reproduction device 70 shown as the fourth exemplary embodiment of the present invention uses the audio file AF existing on the source output site 71 as the audio output site 80. _n Multi-channel synchronized playback can be performed by using an audio signal that has been distributed to the audio file and subjected to signal processing on the distributed audio file AF. In particular, since the audio playback device 70 does not perform information transmission via a network that performs information transmission according to a predetermined protocol such as TCP / IP or UDP, for example, the influence of network traffic can be avoided and ensured. Synchronous playback can be realized.
[0112]
Finally, a fifth embodiment of the present invention will be described. The audio reproducing device 90 shown in FIG. 10 as the fifth embodiment outputs an audio signal instead of providing a dedicated trigger channel for outputting the trigger signal described above, among the sound processing units at the audio output site. Shared with the channel. That is, in the above-described embodiment, the trigger channel is used only when the trigger signal is reproduced once before the reproduction of the audio signal is started. May be inefficient. Therefore, the audio playback device 90 uses all the channels among the sound processing units capable of simultaneous playback of M channels at the n audio output sites for the output of the audio signal, so that (M−1) × described above. Rather than synchronized playback of n-channel audio signals, synchronized playback of M × n-channel audio signals can be realized. Here, it is assumed that the audio reproduction device 90 performs synchronous reproduction of a 32-channel audio signal using four audio output sites that can output an 8-channel audio signal. In this case as well, each part having the same function as that of the audio reproduction device 10 or the like shown as the first embodiment is given the same reference numeral, and detailed description thereof is omitted.
[0113]
As shown in the figure, the sound reproducing device 90 includes the above-described word clock generator 14 and 32 output devices 15. ₁₁ , 15 ₁₂ , 15 ₁₃ , 15 ₁₄ , 15 ₁₅ , 15 ₁₆ , 15 ₁₇ , 15 ₁₈ , 15 ₂₁ , 15 ₂₂ , ..., 15 ₄₅ , 15 ₄₆ , 15 ₄₇ , 15 ₄₈ (Hereafter, output device 15 _nm (N = 1, 2, 3, 4; site number, m = 1, 2,..., 8; channel number). ) And 32 speakers 16 ₁₁ , 16 ₁₂ , 16 ₁₃ , 16 ₁₄ , 16 ₁₅ , 16 ₁₆ , 16 ₁₇ , 16 ₁₈ , 16 ₂₁ , 16 ₂₂ , ..., 16 ₄₅ , 16 ₄₆ , 16 ₄₇ , 16 ₄₈ (Hereafter, speaker 16 _nm (N = 1, 2, 3, 4, m = 1, 2,..., 8) ) And trigger signal TG _n Trigger monitoring site 91 for monitoring and four audio output sites 100 for reproducing and outputting audio signals ₁ , 100 ₂ , ..., 100 ₄ (Hereinafter, the audio output site 100 _n (N = 1, 2, 3, 4) ). This audio reproduction device 90 is assumed to be applied to an entertainment theater ETH such as a movie theater, for example. _nm And speaker 16 _nm Embedded in the four walls of the entertainment theater ETH, it is possible to construct a realistic sound field with 32 channels of independent audio.
In the audio playback device 90, four audio output sites 100 are provided. _n 8 output devices 15 corresponding to each of _nm And speaker 16 _nm A block BK _n (N = 1, 2, 3, 4) and one wall surface is formed.
[0114]
The trigger monitoring site 91 has the same function as the trigger monitoring site 12 described above, and also has the same function as the control site 11 described above, and further stores a movie file MF consisting of a predetermined video in a storage means (not shown). Thus, it also functions as a cinema distribution site for distributing the movie file MF. The trigger monitoring site 91 can communicate with the outside according to a predetermined protocol such as TCP / IP or UDP, and the audio output site 100 via the network NT. _n It is possible to send and receive signals to and from. The trigger monitoring site 91 is an environment in which the monitoring program Sv can be executed. When a playback command is instructed by the user via a user interface UI including a predetermined operation unit, display unit, and the like, the monitoring program 91 Audio file AF under the control of Sv execution _n Control information indicating the reproduction instruction of the voice output site 100 via the network NT _n The image processing unit 93 performs predetermined image processing on the movie file MF, supplies it to the projector PJ as video data IM while synchronizing with the audio file AF, and displays it on the screen in the entertainment theater ETH. Project. The movie file MF stored in the trigger monitoring site 91 may be compressed in a predetermined format or may be uncompressed, and is decoded by the image processing unit 93.
[0115]
The trigger monitoring site 91 is at least the audio output site 20. _n Or more, that is, a sound processing unit 92 such as a sound card capable of inputting and processing audio signals of four or more channels, and through this sound processing unit 92, the audio output site 100 _n Trigger signal TG output from _n Can be received. The trigger monitoring site 91 performs processing in the sound processing unit 92 in synchronization with a word clock WCLK supplied from the word clock generator 14 or a trigger external signal sufficient to synchronize each sample in units of words.
[0116]
The trigger monitoring site 91 controls the audio output site 100 under the execution control of the monitoring program Sv. _n As described later, the trigger signal TG embedded in the audio signal is analyzed. _n To monitor. The trigger monitoring site 91 generates a trigger signal TG _n Is detected, these trigger signals TG _n Based on the voice output site 100 _n The number of delayed samples Sd corresponding to each _n And the number of delayed samples Sd _n The information about the number of delayed samples indicating that the voice output site 100 via the network NT _n Respectively.
[0117]
Audio output site 100 _n Are the audio output sites 20 described above, respectively. _n Similarly to the sound processing unit 101 such as a sound card that can output and process an 8-channel audio signal, for example. ₁ , 101 ₂ , ..., 101 ₈ (Hereinafter, the sound processing unit 101 _n (N = 1, 2,..., 8) ). Audio output site 100 _n Can transmit / receive signals to / from the trigger monitoring site 91 via the network NT according to a predetermined protocol such as TCP / IP or UDP. Also, the audio output site 100 _n Respectively, the sound processing unit 101 _n For example, only the eighth channel in the output unit 15 _nm The trigger monitoring site 91 is also connected to the trigger monitoring site 91, and the trigger signal TG is transmitted to the trigger monitoring site 91 using this channel. _n Can be output. This trigger signal TG _n As will be described later, the audio file AF is used as an audio signal. _n Is embedded in. Audio output site 100 _n Respectively, reproduction processing program E ₁ , E ₂ , ..., E ₈ (Hereinafter, the reproduction processing program E _n (N = 1, 2,..., 8) The sound processing unit is synchronized with a word clock WCLK supplied from the word clock generator 14 or a trigger external signal sufficient to synchronize each sample in units of words. 101 _n The process in is performed. At this time, the audio output site 100 _n , Each of the sound processing units 101 _n It is assumed that synchronization between the first channel to the eighth channel is secured. That is, the voice output site 100 _n , In the case where a reproduction instruction is received from the trigger monitoring site 91, the sound processing unit 101 _n Audio signals are simultaneously output from the first channel to the eighth channel. Also, the audio output site 100 _n Are stored in a storage means (not shown) in an 8-channel audio file AF. _n Is remembered.
[0118]
Here, audio file AF _n For example, as shown in FIG. 11, it consists of 8-channel audio signals. Audio file AF _n Includes a sound signal C for movie content for each channel after a predetermined time tc. ₁ , C ₂ , ..., C ₈ Is recorded. Furthermore, audio file AF _n Includes the trigger audio signal T at a time before the time tc in the audio signal of the eighth channel. _n Is recorded. This trigger audio signal T _n As will be described later, each audio output site 100 _n Audio file AF _n By regenerating the output device 15 _nm Output to each voice output site 100 _n Audio signal C for movie content ₁ , C ₂ , ..., C ₈ Before playback, the user listens. Also, the trigger audio signal T _n Each of the audio output sites 100 due to variations in arrival time of control information indicating a reproduction instruction supplied from the trigger monitoring site 91 via the network NT. _n It will be played back in pieces. Therefore, the trigger audio signal T _n Is preferably a significant audio signal that does not become uncomfortable for the listener, even if it is reproduced with a time lag, such as a movie title sound, chime sound, or a snarling voice of a bird. In particular, the trigger audio signal T _n For example, it may be actively used as a sound logo indicating that the audio playback device 90 is a system to which the present invention is applied.
[0119]
Audio signal T for trigger _n For example, the trigger signal TG is indicated by a digital watermark. _n Embedded in the trigger detection site 91 and the audio output site 100 _n Can be detected at a specific sample value. Audio file AF _n In the trigger signal TG _n Is defined as the reproduction margin sample number Sm0. This reproduction margin sample number Sm0 is determined by the audio file AF. _n For example, the value is calculated in consideration of a time sufficient for completing various processes to be described later. The trigger signal TG _n As for the audio signal T _n The trigger monitoring site 91 and the audio output site 100 need not be embedded in _n Are respectively audio signals T _n If it is determined that the trigger signal is a trigger signal, and the generation sample of the trigger signal can be specified, the audio signal T _n The waveform itself is the trigger signal TG _n You may make it treat as.
[0120]
Audio output site 100 _n When receiving control information indicating a reproduction instruction supplied from the trigger monitoring site 91 via the network NT, the reproduction processing program E _n Under the execution control of the audio file AF _n Playback of the sound processing unit 101 _n Via the output device 15 _nm And speaker 16 _nm And sound is emitted to the outside, and the sound processing unit 101 _n An audio signal output from the eighth channel is supplied to the trigger monitoring site 91. Audio output site 100 _n Respectively, audio file AF _n Analysis of the trigger signal TG _n And the trigger signal TG _n Identify the sample in which the was embedded. At this point, the output device 15 _nm And speaker 16 _nm The sound emitted via the sound signal is an audio signal T output from the eighth channel. _n Only, otherwise it is silent.
Also, the audio output site 100 _n When the delay sample number information is received from the trigger monitoring site 91 via the network NT, the audio file AF is received based on the delay sample number. _n Is skipped, playback is started again, and the output device 15 _nm And speaker 16 _nm Sound is emitted to the outside via
[0121]
In the audio reproducing device 90 including such units, the audio file AF _n If the control information indicating the reproduction instruction is given, the audio output site 100 _n Audio file AF from each of _n Is played. In the audio reproduction device 90, the trigger monitoring site 91 and the audio output site 100 are displayed. _n Voice file AF by _n Trigger signal TG embedded in _n From the trigger monitoring site 91 to the voice output site 100 _n Trigger signal TG _n Voice output site 100 determined based on _n The number of delayed samples Sd corresponding to each _n Will be notified. In the audio reproduction device 90, each audio output site 100 _n By this delay sample number Sd _n Based on audio file AF _n Will be skipped and playback will resume from the skip destination. In the audio reproduction device 90, all the audio output sites 100 are thus obtained. _n Audio file AF by _n The playback start timing can be matched. Thereafter, in the audio reproduction device 90, the audio output site 100 is based on the word clock WCLK supplied from the word clock generator 14. _n Multi-channel synchronized playback can be performed by proceeding with playback.
[0122]
Specifically, in the audio playback device 90, the audio output site 100 _n Respectively, reproduction processing program E _n And the audio file AF is performed through a series of steps shown in FIG. _n Play.
[0123]
First, the audio output site 100 _n As shown in the figure, in step S31, the voice file AF is sent from the trigger monitoring site 91 via the network NT in step S31. _n It is determined whether or not the control information indicating the reproduction instruction is reached. Audio output site 100 _n Respectively wait until the control information from the trigger monitoring site 91 arrives, and when the control information is received, the process proceeds to step S32.
[0124]
Audio output site 100 _n Respectively, in step S32, the sound processing unit 101 _n Audio file AF via _n Play.
[0125]
At the same time, the audio output site 100 _n Respectively, in step S33, the audio file AF _n Trigger signal TG embedded in _n Is detected.
[0126]
Furthermore, the audio output site 100 _n Respectively, in step S34, the trigger signal TG _n The count of the number of reproduction samples is started with reference to the sample specified as the sample in which is embedded. Audio output site 100 _n In this case, the audio file AF is used with reference to the count start sample. _n The playback start timing is determined.
[0127]
And the audio output site 100 _n Respectively continues to count the number of reproduced samples in step S35, and in step S36, the corresponding delayed sample number Sd from the trigger monitoring site 91. _n It is determined whether or not the delayed sample number information indicating is reached via the network NT.
[0128]
Here, if the delayed sample number information has not arrived, the audio output site 100 _n Each repeats the process of step S35.
[0129]
On the other hand, when the delayed sample number information arrives, the audio output site 100 _n Respectively, in step S37, the number of delayed samples Sd _n Only audio file AF _n The playback is skipped, the playback is restarted from the skip destination time, and the series of processes is terminated.
[0130]
In this way, in the audio reproduction device 90, the audio output site 100 is displayed. _n Each of the reproduction processing program E _n And execute the audio file AF _n The reproduction of the audio signal for the movie content in can be started simultaneously. In the audio playback device 90, the audio output site 100 is thereafter used. _n Are respectively based on the word clock WCLK supplied from the word clock generator 14. _n By proceeding with the playback, multi-channel synchronized playback becomes possible.
[0131]
On the other hand, in the audio reproduction device 90, the trigger monitoring site 91 executes the monitoring program Sv and goes through a series of steps shown in FIG. _n The number of delayed samples Sd corresponding to each _n Can be requested.
[0132]
First, as shown in the figure, the trigger monitoring site 91 sends control information indicating a reproduction instruction to the audio output site 100 in step S41. _n Notify each of them.
[0133]
Subsequently, the trigger monitoring site 91 determines that the voice output site 100 in step S42. _n Audio file AF output to each channel in the sound processing unit 92 from _n Monitor incoming traffic.
[0134]
Subsequently, in step S43, the trigger monitoring site 91 assigns the audio file AF to any one of the four channels in the sound processing unit 92. _n It is determined whether any of the above has been supplied.
[0135]
Here, audio file AF _n If any of the above is not supplied, the trigger monitoring site 91 shifts the processing to step S42, and the audio file AF _n Continue to monitor until either of them is supplied.
[0136]
On the other hand, audio file AF _n If any of the above is supplied, the trigger monitoring site 91 determines in step S44 the audio file AF. _n Trigger signal TG embedded in _n Is detected.
[0137]
Subsequently, in step S45, the trigger monitoring site 91 detects the detected trigger signal TG. _n Is the first audio file AF supplied _n The trigger signal TG for the first time. _n Whether or not has been detected is determined.
[0138]
Here, the trigger signal TG _n If is not the first detected, the trigger monitoring site 91 proceeds to step S47. On the other hand, trigger signal TG _n Is detected first, the trigger monitoring site 91 determines that the trigger signal TG detected first in step S46. _n Counting the number of input samples is started using the sample at the time when is input as a reference sample, and the process proceeds to step S47. At the trigger monitoring site 91, another trigger signal TG is counted by counting the number of input samples based on this reference sample. _n Will be measured.
[0139]
Then, in step S47, the trigger monitoring site 91 determines the count value that has started counting from the reference sample as the base point in step S46 as the number of delayed samples Sd. _n Trigger signal TG _n Corresponding audio output site 100 _n For this delay sample number Sd _n Is output via the network NT. It should be noted that the trigger monitoring site 91 starts with the audio file AF _n Output site 100 that outputs _n Is the number of delayed samples Sd _n Needless to say, 0 is notified.
[0140]
Subsequently, the trigger monitoring site 91 continues to count the number of input samples in step S48, and in step S49, all the audio output sites 100 are monitored. _n To trigger signal TG _n Is received.
[0141]
Here, all audio output sites 100 _n To trigger signal TG _n Is not received, the trigger monitoring site 91 repeats the processing from step S42 to step S48.
[0142]
On the other hand, all audio output sites 100 _n To trigger signal TG _n Is received, the trigger monitoring site 91 ends the series of processes as it is.
[0143]
Audio output site 100 _n Either of the trigger signal TG due to failure etc. _n May not be given to the trigger monitoring site 91. In this case, the trigger monitoring site 91 may appropriately exit the loop process that moves from step S49 to step S42.
[0144]
In this way, in the audio reproduction device 90, the trigger monitoring site 91 executes the monitoring program Sv, and the audio output site 100 _n The number of delayed samples Sd corresponding to each _n Voice output site 100 _n It is possible to realize synchronized playback by means of. The trigger monitoring site 91 performs image processing on the movie file MF by the image processing unit 93 during this series of processing, and the audio output site 100. _n Audio file AF by _n The video data IM is projected onto the screen in the entertainment theater ETH via the projector PJ in synchronization with the start of reproduction of the audio signal for the movie content at.
[0145]
Trigger monitoring site 91 and audio output site 100 for executing such a program _n In the audio reproducing device 90 having the above, each operation is performed in cooperation according to the timing shown in FIG. 14, for example.
[0146]
That is, in the audio reproduction device 90, the trigger monitoring site 91 and the audio output site 100 are based on the word clock WCLK generated by the word clock generator 14. _n Works. In the audio reproduction device 90, as shown in the upper part of the figure, when a reproduction command is instructed via the user interface UI, control information indicating the reproduction instruction is transmitted from the trigger monitoring site 91 to the audio output site 100. _n Are notified via the network NT. As described above, the absolute time at which this control information arrives is due to the influence of network traffic, etc. _n Variation occurs between the two. Here, as shown in the middle of FIG. ₁ Receives the control information earliest and the voice output site 100 ₄ , Voice output site 100 ₂ It is assumed that the control information is received in the following order.
[0147]
Then, in the audio reproduction device 90, as described above, the audio output site 100 _n Voice file AF _n Is played. This audio output site 100 _n Audio file AF from each of _n As described above, the absolute time at which is played is the audio output site 100. _n Sound processing unit 101 _n Voice output site 100 due to the difference in the processing capacity of the CPU or the like (not shown). _n Variation occurs between the two. Here, as shown in the middle of FIG. ₁ Is the earliest voice file AF ₁ Playback starts, and after that, the audio output site 100 ₄ , Voice output site 100 ₂ Audio file AF in the order ₄ , AF ₂ It is assumed that the playback of has started.
[0148]
Then, in the audio reproduction device 90, as described above, the audio output site 100 _n Audio file AF by each of _n Trigger signal TG embedded in _n Is detected, and the trigger signal TG is also detected by the trigger monitoring site 91. _n Is detected. Here, as shown in the lower part of FIG. ₁ Trigger signal TG at ₁ , And then trigger signal TG ₄ , TG ₂ Detected in the following order. Audio output site 100 _n Respectively, these trigger signals TG _n Sample S when is identified _n Is used as a reference to start counting the number of playback samples. Also, here, the trigger signal TG is the earliest ₁ Sample S with detected ₁ Is the reference sample. The trigger monitoring site 91 also includes a sample S that is a reference sample. ₁ Starts counting the number of input samples, and as shown in the lower part of FIG. _n Is calculated. In the audio reproduction device 90, the number of delayed samples Sd _n Voice output site 100 that corresponds to _n Will be notified. This delayed sample number Sd _n As described above, the absolute time at which the message reaches the voice output site 100 is due to the influence of network traffic or the like. _n Variation occurs between the two. Here, as shown in the middle of FIG. ₁ Is the earliest to receive late sample number information, voice output site 100 ₂ , Voice output site 100 ₄ It is assumed that the delayed sample number information is received in the following order.
[0149]
In the audio reproduction device 90, the audio output site 100 is displayed. _n The number of delayed samples Sd _n Only audio file AF _n Playback is skipped, and playback is resumed from the point of the skip destination. At this time, the above-described reproduction margin sample number Sm0 is at least the following step, that is, the audio output site 100 that has received the control information indicating the reproduction instruction. _n Is audio file AF _n , The trigger monitoring site 91 is connected to all the audio output sites 100. _n To trigger signal TG _n The trigger monitoring site 91 receives the voice output site 100. _n The number of delayed samples Sd for each _n And all audio output sites 100 _n Is late sample number Sd _n And the skipped sample is an audio signal unrelated to the movie content, i.e. 11 is an audio signal included from time tt to time tc shown in FIG. In the sound reproducing device 90, the trigger monitoring site 91 firstly executes the trigger signal TG. _n Sample S at the time when is detected _n Audio file AF from the sample that has elapsed from the number of playback margin samples Sm0 _n Audio signal C for movie content ₁ , C ₂ , ..., C ₈ Playback starts simultaneously.
[0150]
In the audio playback device 90, the audio output site 100 _n Audio signal C for movie content in between ₁ , C ₂ , ..., C ₈ After that, the audio file AF is maintained while maintaining the synchronization based on the word clock WCLK generated by the word clock generator 14. _n Proceed with playback.
[0151]
As described above, the audio reproduction device 90 shown as the fifth exemplary embodiment of the present invention is the audio output site 100. _n Sound processing unit 101 _n Among them, the trigger signal TG _n Without providing a dedicated trigger channel for outputting the audio signal, it can be shared with the channel for outputting the audio signal, and the synchronized reproduction of the audio signal can be performed efficiently.
[0152]
As described above, the audio reproduction apparatuses shown as the first to fifth embodiments of the present invention can be configured at low cost using a plurality of general-purpose hardware, and have a processing capability and a channel. Synchronous reproduction of multi-channel audio signals that could not be realized in the past due to the limitation of the number or the like can be performed accurately. Therefore, by applying the audio reproducing apparatus to an entertainment system such as a movie theater or a game, for example, it is possible to realize a highly realistic sound reproduction with a simple configuration.
[0153]
In addition, this invention is not limited to embodiment mentioned above, For example, various changes are possible as described in the end of 1st Embodiment mentioned above and 2nd Embodiment. This is applicable to other embodiments as needed.
[0154]
In the present invention, any of the first to fifth embodiments described above may be appropriately combined within a possible range.
[0155]
Thus, it goes without saying that the present invention can be modified as appropriate without departing from the spirit of the present invention.
[0156]
【The invention's effect】
As described above in detail, the audio reproduction device according to the present invention is an audio reproduction device that synchronously reproduces audio signals of a plurality of channels, and is synchronized with the reference signal generating means for generating a predetermined reference signal. A plurality of audio output means for reproducing and outputting the audio signal via an audio output processor capable of processing the output of the multi-channel audio signal, and at least the audio in synchronism with the reference signal Reproduction timing of the audio signal in each of the audio output means, which is a signal supplied for at least one channel from each of the audio output means via an audio input processor capable of inputting and processing audio signals equal to or more than the number of output means. And a replay signal monitoring means for monitoring the arrival of the replay signal for providing the sound, and the replay signal monitoring means is supplied from each of the sound output means. The relative deviation times for the input times of the number of replay signals are obtained, and time information indicating the deviation times is notified to each of the audio output means, and each of the audio output means performs audio based on the time information. Determine when to start playing the signal.
[0157]
Therefore, the audio reproduction device according to the present invention is a timing at which reproduction of an audio signal should be started based on time information indicating a relative shift time with respect to input times of a plurality of replay signals corresponding to each of the audio output means. Therefore, even when a plurality of general-purpose hardware is used, it is possible to accurately perform synchronized reproduction of multi-channel audio signals.
[0158]
The audio reproduction method according to the present invention is an audio reproduction method for synchronously reproducing audio signals of a plurality of channels, and an audio output processor capable of performing output processing of the audio signals of a plurality of channels in synchronization with a predetermined reference signal. Through which the audio signal is reproduced and output from each of a plurality of audio output means that are not synchronized with each other and supplied for at least one channel, and the reproduction timing of the audio signal in each of the audio output means The sounding signal for providing the sounding signal is input to the sounding signal monitoring means via the sound input processor capable of inputting and processing at least the number of sound signals equal to the number of the sound output means in synchronization with the reference signal. A deviation time calculation step for obtaining a relative deviation time with respect to input times of a plurality of replay signals supplied from each, and time information indicating the deviation time Based on, and a playback start timing determination step of determining when to start playing the audio signal.
[0159]
Therefore, in the audio reproduction method according to the present invention, the timing at which the reproduction of the audio signal should be started based on the time information indicating the relative shift time with respect to the input times of the plurality of replay signals corresponding to the respective audio output means. Therefore, even when a plurality of general-purpose hardware is used, it is possible to accurately perform synchronized reproduction of multi-channel audio signals.
[Brief description of the drawings]
FIG. 1 is a block diagram illustrating a configuration of an audio reproducing device shown as a first embodiment of the present invention.
FIG. 2 is a flowchart for explaining a series of steps until playback of an audio file is started at an audio output site provided in the audio playback apparatus.
FIG. 3 is a flowchart for explaining a series of steps when obtaining the number of delayed samples corresponding to each of the audio output sites at the trigger monitoring site provided in the audio reproduction apparatus.
FIG. 4 is a diagram for explaining an operation timing of each unit in the audio reproduction device.
FIG. 5 is a block diagram illustrating a configuration of an audio reproduction device shown as a second embodiment of the present invention.
FIG. 6 is a flowchart for explaining a series of steps until playback of an audio stream is started at an audio output site provided in the audio playback apparatus.
FIG. 7 is a block diagram illustrating a configuration of an audio reproducing device shown as a third embodiment of the present invention.
FIG. 8 is a block diagram illustrating the configuration of the audio reproduction device, and in particular, a block diagram illustrating a configuration of a signal processing unit included in the audio output site.
FIG. 9 is a block diagram illustrating a configuration of an audio reproducing device shown as a fourth embodiment of the present invention.
FIG. 10 is a block diagram illustrating a configuration of an audio reproducing device shown as a fifth embodiment of the present invention.
FIG. 11 is a diagram illustrating the structure of an audio file.
FIG. 12 is a flowchart for explaining a series of steps for reproducing an audio file at an audio output site provided in the audio reproducing apparatus.
FIG. 13 is a flowchart for describing a series of steps when obtaining the number of delayed samples corresponding to each of the audio output sites at the trigger monitoring site provided in the audio reproduction apparatus.
FIG. 14 is a diagram for explaining the operation timing of each unit in the audio reproduction device.
[Explanation of symbols]
10, 30, 50, 70, 90 Audio playback device, 11 Control site, 12, 73, 91 Trigger monitoring site, 13, 21 _n , 41 _n , 61 _n , 72, 74, 81 _n , 92, 101 _n Sound processor, 14 word clock generator, 15 _nm Output device, 16 _nm Speaker, 20 _n , 40 _n , 60 _n , 80 _n , 100 _n Audio output site, 31 distribution site, 51, 71 source output site, 62 _nm Signal processing unit, 93 image processing unit

Claims

An audio reproduction device for synchronously reproducing audio signals of a plurality of channels,
A reference signal generating means for generating a predetermined reference signal;
A plurality of audio output means for reproducing and outputting the audio signal via an audio output processor capable of outputting and processing audio signals of a plurality of channels in synchronization with the reference signal;
A signal that is supplied for at least one channel from each of the audio output means via an audio input processor that can input and process at least audio signals equal to or greater than the number of the audio output means in synchronization with the reference signal. Replay signal monitoring means for monitoring the arrival of a replay signal for giving the reproduction timing of the sound signal in each of the sound output means,
The recurring signal monitoring means obtains a relative shift time for the input times of a plurality of replay signals supplied from each of the sound output means, and sets time information indicating the shift time to each of the sound output means. Notification to
Each of the audio output means determines a timing at which reproduction of the audio signal should be started based on the time information.

2. The sound reproducing apparatus according to claim 1, wherein the replay signal monitoring means notifies the time information to each of the sound output means via a general-purpose network according to a predetermined protocol.

The recurring signal monitoring means outputs the time information to each of the audio output means via an audio output processor capable of outputting at least audio signals equal to or greater than the number of the audio output means in synchronization with the reference signal. Notify
2. The audio reproduction apparatus according to claim 1, wherein each of the audio output means inputs the time information via an audio input processor capable of performing an input process on the audio signal in synchronization with the reference signal.

2. The audio reproducing apparatus according to claim 1, wherein each of the audio output means is provided with a channel for outputting the replay signal among channels that can be output by the audio output processor. .

2. The audio reproduction apparatus according to claim 1, wherein the audio output means outputs the advance signal to the advance signal monitoring means when control information indicating an instruction to reproduce the audio signal is input.

6. The sound reproducing apparatus according to claim 5, further comprising a control means for notifying each of the sound output means of the control information via a general-purpose network according to a predetermined protocol.

Each of the audio output means is provided with a dedicated channel for outputting the replay signal among the channels that can be output by the audio output processor,
Each of the audio output means has storage means for storing audio signals for the number of channels obtained by subtracting the number of channels for outputting the replay signal from the number of channels that can be processed by the audio output processor. The sound reproducing apparatus according to claim 1.

A delivery means for delivering the audio signal;
Each of the audio output means is provided with a dedicated channel for outputting the replay signal among the channels that can be output by the audio output processor,
The audio output means distributes audio signals for the number of channels obtained by subtracting the number of channels for outputting the replay signal from the number of channels that can be output by the audio output processor. The sound reproducing apparatus according to claim 1.

9. The audio reproducing apparatus according to claim 8, wherein the distribution unit distributes the audio signal to each of the audio output units via a general-purpose network according to a predetermined protocol.

Source output means for delivering an audio signal to be a source signal of a predetermined channel to each of the audio output means,
2. The audio reproduction device according to claim 1, wherein each of the audio output means includes a plurality of signal processing means for performing predetermined signal processing on a channel basis for the audio signal distributed from the source output means. apparatus.

11. The audio reproduction apparatus according to claim 10, wherein the source output unit distributes the audio signal to each of the audio output units via a general-purpose network according to a predetermined protocol.

The source output means distributes the audio signal to each of the audio output means via an audio output processor capable of outputting the audio signal in synchronization with the reference signal,
The audio output means inputs the audio signal distributed from the source output means via an audio input processor capable of performing input processing on the audio signal in synchronization with the reference signal. Item 13. The audio playback device according to Item 10.

The source output means notifies the audio output means of control information indicating the contents of signal processing in each of the signal processing means,
11. The audio reproduction device according to claim 10, wherein each of the signal processing means changes a signal processing parameter in accordance with the control information.

14. The audio reproduction apparatus according to claim 13, wherein the source output means notifies the control information to each of the audio output means via a general-purpose network according to a predetermined protocol.

The source output means notifies the control information to each of the audio output means via an audio output processor capable of outputting an audio signal in synchronization with the reference signal,
14. The audio reproduction apparatus according to claim 13, wherein the audio output means inputs the control information via an audio input processor that can input and process an audio signal in synchronization with the reference signal.

The replay signal is embedded in the audio signal,
The replay signal monitoring means inputs the sound signal reproduced from each of the sound output means via the sound input processor, detects the replay signal embedded in the sound signal, and detects the deviation. 2. The audio reproducing apparatus according to claim 1, wherein time is obtained.

17. The audio reproduction device according to claim 16, wherein the replay signal is embedded in the audio signal by digital watermark.

2. The audio output means shares a channel for outputting the replay signal and a channel for outputting the audio signal in the audio output processor, respectively. Audio playback device.

An audio reproduction method for synchronously reproducing audio signals of multiple channels,
From each of a plurality of audio output means that are not synchronized with each other and that reproduce and output the audio signal via an audio output processor that can output and process audio signals of a plurality of channels in synchronization with a predetermined reference signal A signal supplied for at least one channel, and a replay signal for giving the reproduction timing of the audio signal in each of the audio output means is at least equal to the number of the audio output means in synchronization with the reference signal An audio signal is input to the advance signal monitoring unit via an input processor capable of input processing, and a relative shift time with respect to input times of a plurality of advance signals supplied from each of the audio output units is obtained. Deviation time calculation step;
And a reproduction start timing determining step for determining a timing at which reproduction of the audio signal should be started based on time information indicating the deviation time.