WO2002082413A1 - Voice guide system based on cry and song - Google Patents

Voice guide system based on cry and song Download PDF

Info

Publication number
WO2002082413A1
WO2002082413A1 PCT/JP2002/002676 JP0202676W WO02082413A1 WO 2002082413 A1 WO2002082413 A1 WO 2002082413A1 JP 0202676 W JP0202676 W JP 0202676W WO 02082413 A1 WO02082413 A1 WO 02082413A1
Authority
WO
WIPO (PCT)
Prior art keywords
sound source
unit
headset
microphone
sound
Prior art date
Application number
PCT/JP2002/002676
Other languages
French (fr)
Japanese (ja)
Inventor
Kazuhiro Nakadai
Ken-Ichi Hidai
Hiroshi Okuno
Hiroaki Kitano
Original Assignee
Japan Science And Technology Corporation
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Japan Science And Technology Corporation filed Critical Japan Science And Technology Corporation
Publication of WO2002082413A1 publication Critical patent/WO2002082413A1/en

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search

Definitions

  • the present invention relates to a sound guide system based on a call that provides a user with commentary on the animal or bird based on the sound of the animal or bird in a facility such as a natural park.
  • an object of the present invention is to provide a voice guidance system based on sounds of animals and birds, which provides a user with an explanation of the animals and birds based on the sounds of the sounds. Disclosure of the invention
  • a microphone array including a plurality of fixed microphones appropriately arranged on a site of a facility such as a natural park, a GPS, a movable microphone, and a microphone carried by a user of the facility.
  • At least one headset consisting of headphones, each fixed microphone of the microphone array and a movable microphone of each headset Sound source separation for sound source localization and separation of sound data for each sound source from the sound signal from the sound source.From the sound source localization part and the sound data for each sound source separated by the above sound source separation and sound source localization part.
  • a discriminator that identifies the type of animal that made the call, etc., and the absolute coordinate position of the sound source localized by the sound source separation '
  • a voice-based voice guidance system which comprises:
  • the sound guide system based on a call preferably includes a receiving unit in which the sound source separation and sound source setting unit receives position information by GPS from each headset and an acoustic signal by a movable microphone.
  • the guide creation unit includes a transmission unit for transmitting the commentary data, and each of the headsets transmits the position information by GFS and the acoustic signal from the movable microphone to the reception unit, and transmits the solution from the transmission unit.
  • a transmission / reception unit for receiving the description data.
  • the sound guide system based on a call according to the present invention is arranged such that the sound source separation and sound source localization unit performs a mathematical solution using a general microphone array based on acoustic signals from each fixed microphone and each movable microphone. Performs sound source separation.
  • the sound source separation 'sound source localization unit performs sound source separation by using a direction path finola.
  • the sound source separation-sound source localization unit further performs sound source separation by German 3t component analysis.
  • the discriminating unit refers to a type database including the sounds of the animals and the like, and specifies the type of the animals and the like from the sounds of the animals and the like.
  • the discriminating unit preferably performs detailed discrimination such as sex discrimination of animals and the like and situation discrimination based on the type of the animal and the like and further by referring to various detailed databases. Perform
  • the fixed microphones of the microphone array and the movable microphones of each headset collect sounds of animals and birds in the premises such as a natural park, and the collected sounds are collected.
  • the sound source separation 'sound source localization unit separates the sound into the absolute coordinate position of the sound source of each call and sound data for each sound source.
  • the discriminating unit specifies the type of animal or the like that squeaked for each sound source from the sound data of each sound source ⁇ separated by the sound source separation 'sound source localization unit, and the guide creating unit determines
  • the commentary data is created corresponding to the position information of each headset by the customer, and the output unit outputs the commentary data to the headphones of the headset.
  • the animal, bird, etc. can be easily found based on the coordinate position.
  • the sound source separation and sound source localization unit has a receiving unit that receives the position information by GPS from each headset and the acoustic signal from the movable microphone
  • the guide creation unit has a transmission unit that sends the commentary data. If each of the headsets includes a transmitting / receiving unit that transmits position information by GPS and an acoustic signal by a movable microphone to the receiving unit and receives commentary data from the transmitting unit, Each headset transmits the position information by GPS and the acoustic signal by the movable microphone to the sound source separation and sound source localization unit, and receives the commentary data from the guide creation unit. Can be transmitted and received. Therefore, the user can freely move around the premises such as a natural park while carrying the headset.
  • the sound source localization unit performs sound source separation based on acoustic signals from each fixed microphone and each movable microphone by a mathematical solution using a general micro phone array, it includes the movable microphone Sound source separation can be performed by such a mathematical solution using a microphone array in a broad sense.
  • the sound source localization unit When the sound source localization unit performs sound source separation by using a direction pass filter, sound source separation can be easily performed by using a direction pass filter.
  • the sound source separation 'sound source localization unit further performs sound source separation by independent component analysis, sound source separation can be reliably performed by using independent component analysis.
  • the discriminating unit specifies the type of the animal or the like from the call of the animal with reference to the page database provided with the sound of the animal or the like, the call of the animal or the like to be discriminated is prepared in advance in a database. Therefore, the type of the animal or the like can be quickly identified from the call of the animal or the like.
  • the discriminating unit performs detailed discrimination such as gender discrimination of animals and the like and situation discrimination based on the data of the animals and the like and further by referring to various detailed databases, a more detailed discrimination based on the call of the animals etc. Discrimination, that is, the gender of the animal, the situation under which the call is emitted, and the like can be performed.
  • FIG. 1 is a block diagram showing an electric configuration of an embodiment of a voice guidance system based on bird calls according to the present invention.
  • FIG. 2 is a schematic perspective view showing the entire configuration of the voice guidance system of FIG.
  • FIG. 3 is a block diagram showing a configuration of a head set in the voice guidance system of FIG.
  • FIG. 4 is a flowchart showing the operation of the voice guidance system of FIG. BEST MODE FOR CARRYING OUT THE INVENTION
  • FIG. 1 shows an electrical configuration of an embodiment of a voice guidance system based on a bird singing to which the present invention is applied.
  • the voice guidance system 10 sounds from a plurality of fixed microphones 11 a, lib,... Appropriately arranged in the site 10 a of a facility such as a natural park shown in FIG. 2.
  • the microphone array 11 is composed of a plurality of fixed microphones 11a, lib, Have been.
  • Each fixed microphone 11a, lib, ⁇ ⁇ ⁇ basically uses a directional microphone because it only has to collect the bird's singing from above.
  • the sound signals of the fixed microphones '11a, lib, ⁇ are input to a sound source separation' sound source localization unit 12 described later.
  • each of the fixed microphones 11a, 11b, ... may be connected to the sound source separation and sound source localization unit 12 via a cable, or may be connected wirelessly.
  • the sound source separation and sound source localization section 12 is provided, for example, in a management building 1 Ob in the site 10 a of a facility such as a natural park, and includes a fixed microphone 1 of the microphone array 11. 1a, 1 lb, '--' and the sound of the bird based on the sound signals from the movable microphones of each headset 20 are detected, and the localization (position information to absolute coordinate position) of the sound source, The song of the bird is separated.
  • the positions of the fixed microphones 11a, lib, and so on of the microphone array 11 were previously input to the sound source separation and sound source localization unit 12 and the position of each headset 20. Is detected by the position information by the GFS provided in the headset 20.
  • the sound source separation / sound source localization unit 12 includes a sound source separation unit 12a, a direction path filter 11b, and an ICA (Independent Component Analysis) unit 12c. Contains.
  • the sound source separation unit 12a uses a general microphone array based on sound signals from the fixed microphones 11a, lib,... 'Of the microphone array 11 and the movable microphones of each headset 20. It is configured to perform sound source separation by a mathematical solution.
  • the above-mentioned direction pass filter 12b is used to determine the phase difference between the binaural ears based on the sound signals from the fixed microphones 11a, lib, ... of the microphone array 11 and the movable microphones of each headset 20.
  • Sound source separation is performed using IPD and binaural 3 ⁇ 4 ⁇ difference IID.
  • the ICA unit 12c performs independent component analysis based on the acoustic signals from the individual fixed microphones 11a, lib, In addition, based on the sound signal level of each microphone according to the unknown probability distribution, the sound signal of each sound source is restored by matrix calculation to separate the sound sources, and the absolute coordinate position of the sound sources is detected.
  • the discriminating unit 13 is also provided, for example, in the management building 10b in the facility site 10a of a natural park or the like, and is provided with the sound of birds separated by the sound source separation / sound source localization unit 1.
  • the position information (absolute coordinate position) and the input of the position information of each headset 20 are used to specify the singing of each bird, and include a type discriminator 13a and a detailed discriminator 13b. In.
  • the type discriminating unit 13a refers to the type database 13c having various types of bird calls and identifies the bird from the bird calls, and provides information on the types and positions of the birds.
  • the detailed discriminator 13b based on the bird's singing from the sound source separation and sound source localization unit 12 and the bird type information from the type discriminator 13a, determines the sound of the bird according to its gender, status, etc. Referring to the detailed database 13d with singing voices, the singing of the birds is used to make detailed judgments such as gender discrimination and situation discrimination, and the detailed information on the birds is sent to the guide creation unit 14.
  • the guide creation unit 14 is also provided in the management premises 10b, for example, in the site 10a of the facility such as a natural park, and the position of the discrimination unit 13 is determined for each headset 20.
  • the position information absolute coordinate position
  • the position information is converted into the relative coordinate position of the bird position information based on the GFS position information of each headset 20 and the bird type information determined by the determination unit 13
  • the commentary data on the headset 20 is created and transmitted to the headset 20 by the transmission / reception unit 4 shown in FIG.
  • the guide creating unit 14 uses the periphery of the head set 20, that is, the head set 20 based on the position information of each head set 20.
  • the explanation is limited to the birdsong within the range where the user —Create evenings.
  • the guide creating unit 14 is provided for each head set 20.
  • the present invention is not limited to this, and one guide creating unit 14 corresponds to all the head sets 20.
  • Commentary data may be created, and a guide creation unit 14 smaller in number than the number of headsets 20 is prepared. May be created. At least one headset 20 is provided so that users of facilities such as a natural park can carry it. As shown in Fig. 3, a movable microphone 21 and a GPS 2 2, a headphone 23, a transmission / reception unit 24, and a noise canceling circuit 25.
  • the movable microphone 21 detects a sound around the headset 20 and generates an acoustic signal.
  • the above-mentioned GPS 22 has a known configuration, and detects the position of the head set 20 by receiving a radio wave from the GPS satellite to generate position information.
  • the headphone 23 provides a commentary voice to the user based on the commentary data from the guide creation unit 14.
  • the transmission / reception unit 24 transmits the sound signal from the movable microphone 21 and the position information from the GPS 22 to the sound source separation / sound source localization unit 12, It is designed to receive commentary data from 14.
  • the noise canceling circuit 25 controls the noise canceling so that the user using the headset 20 does not disturb the explanation sound from the headphones 23. .
  • the voice guide system 10 is configured as described above, and as shown in the flowchart of FIG. 4, a headset 2 is set for each user based on the birdsong in a facility such as a natural park. 0 provides commentary.
  • the fixed microphones lla, lib,... Of the microphone array 11 and the movable microphones 21 of the respective headsets 20 are used to make bird calls in facilities such as a natural park. And sends an acoustic signal to the sound source separation / sound source localization unit 12.
  • the positions of the fixed microphones 11a and 11b have been previously input to the sound source separation / sound source localization unit 12 and the position of the movable microphone 21 of each headset 20 is Headset 20 GPS location information is sound source Separation ⁇ This is known by being input to the sound source localization unit 12.
  • the sound source separation / sound source localization unit 12 sends the sound from the fixed microphones lla, lib, ... of the microphone array 11 and the movable microphone 21 of each headset 20.
  • the sound source separation unit 12a Based on the signal, the sound source separation unit 12a performs sound source separation by a mathematical solution using a general microphone array 11 and a sound source using IFD and IID by a decimation pass filter 12b.
  • the position information absolute coordinate position
  • the sound source separation / sound source localization unit 12 outputs the bird's position information G »f coordinate position) and the bird's singing together with the position information of each headset 20 to the discrimination unit 13.
  • the discrimination unit 13 converts the position information (absolute coordinate position) of the bird input from the sound source separation and sound source localization unit 12 and the call of the bird, and the position information of each headset 20 into each other.
  • the type of bird is specified by the moss discriminating unit 13a from the song of the bird using the type database 13c.
  • the type discriminating unit 13a outputs the type information and the position information (absolute coordinate position) of the bird to the guide creating unit 14, and also transmits the bird call and the type information to the detailed judging unit 13b. It is sent to the detail discriminator 13b.
  • the detailed discriminating unit 13b performs detailed discrimination of the bird using the detailed database 13d on the basis of the call and the type information of the bird, and guides the detailed information of the bird. Output to the creation unit 14.
  • step ST5 the guide creating unit 14 writes the position information (absolute coordinate position) of the bird type information, detailed information and position information (absolute coordinate position) from the discriminating unit 13 into
  • Each head set 20 ⁇ is converted into the relative coordinate position of the bird's position information based on the position information of each head set 20's GPS 22 and based on the above S ⁇ page information and detailed information.
  • the commentary data on 20 is created and transmitted to the headset 20 by a transmitting unit (not shown).
  • each headset 20 received the explanation data created by the guide creation unit 14 by the transmission / reception unit 24, and performed noise cancellation by the noise cancellation circuit 25. Later, the user can use the commentary voice from the headphone 2 3 To provide. Therefore, the user can hear the comment based on the relative coordinate position of the bird to be observed from the user's position by the commentary sound heard from the headphone 23, and can easily find the bird, The bird's situation based on the call can be ascertained.
  • the audio signals from the fixed microphones 11a, lib,... Sound source separation and sound source localization unit 12 performs sound source separation and sound source localization from the sound signal from movable microphone 21 of headset 20, and discriminator 13 recognizes the sound of birds The type and details are determined, and a description corresponding to the position of the headset 20 can be provided to the headset 20 of each user based on these.
  • sound source separation and sound source localization are also performed with reference to the sound signals from the movable microphones 21 of each headset 20 so that more accurate sound source localization can be performed with respect to the position of each user. You.
  • the voice guidance system based on the singing of a bird has been described.
  • the present invention is not limited to this, and can be applied to a voice guiding system based on the singing of an animal other than a bird.
  • each fixed microphone of the microphone array and the movable microphone of each headset collect the sounds of animals and birds in the premises such as a natural park and the like, and , Sound source separation •
  • the sound source localization unit separates the sound into the absolute coordinate position of each sound source and the sound data for each sound source.
  • the discriminating unit specifies the type of animal or the like that squeaks for each sound source from the sound source for each sound source separated by the sound source separation unit and the sound source localization unit.
  • explanation data is created corresponding to the position information of each headset, and the output unit outputs the explanation data to the headphone of the headset. This allows the user to obtain a commentary on the animal or bird that made the squeal by listening to the commentary flowing from the headphone of the headset carried by the user. The animal, bird, or the like can be easily found based on the coordinate position.
  • a voice guidance system with an extremely excellent cry which provides a user with an explanation of the animal or the bird based on the cry of the animal or the bird.

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Navigation (AREA)

Abstract

A voice guide system for providing an explanation of an animal or a bird to a user based on the cry of an animal or the song of a bird, comprising a microphone array (11) of a plurality of fixed microphones (11a, 11b, ...), at least one headset (20) comprising a GPS (22), a movable microphone (21) and a headphone (23), a section (12) for locating and separating the sound source of each cry or song from the sound signal of each fixed microphone and movable microphone, a decision section (13) for specifying the kind of animal or bird uttering the cry or song from the separated sound data of each sound source, and a guide generating section (14) for converting the absolute coordinates position of a sound source located at the sound source separating/locating section into relative coordinates position based on the GPS positioning information of each headset and generating explanation data concerning the headset from the kind of animal or bird specified at the decision section and then delivering the explanation data to the headset.

Description

明 細 書 鳴き声による音声ガイドシステム 技術分野  Description Voice-guided voice guidance system
本発明は、 自然公園等の施設において、 動物や鳥の鳴き声に基づいて、利用者 に当該動物や鳥に関する解説を提供するようにした鳴き声による音声ガイドシス テムに関するものである。 技術背景  The present invention relates to a sound guide system based on a call that provides a user with commentary on the animal or bird based on the sound of the animal or bird in a facility such as a natural park. Technology background
—般に、 自然公園等においては、利用者は、 自然公園等の敷地内を適宜に移動 しながら、 動物や鳥の自然な姿を観察できるようになつている。 そして、利用者 に対して動物や鳥に関する情報を提供するために、例えば動物や鳥の絵, 写真等 の画像と、 これに関する解説を付した案内板や、 パンフレツト等が用意されてい る。  In general, in natural parks and the like, users can observe the natural appearance of animals and birds while appropriately moving within the premises of the natural park and the like. To provide users with information on animals and birds, for example, images such as pictures and photographs of animals and birds, information boards with explanations on them, and pamphlets are prepared.
しかしながら、動物や鳥の鳴き声に関しては、利用者は通常判別することが困 難である。 このため、利用者は、 鳴き声が聞こえたとしてもその動物や鳥の種類 を判別することができない。 従って、特に野鳥の場合には、 当該野鳥の姿を探そ うとしても、 当該野鳥を知らない場合には、 野鳥自体が小さいことから容易に探 すことはできない。  However, it is usually difficult for users to discriminate the sounds of animals and birds. For this reason, the user cannot discriminate the kind of the animal or bird even if the call is heard. Therefore, especially in the case of wild birds, even if an attempt is made to find the shape of the wild bird, if the wild bird is unknown, it cannot be easily found because the wild bird itself is small.
この発明は、以上の点にかんがみて、動物や鳥の鳴き声に基づいて、 当該動物 や鳥の解説を利用者に提供するようにした、 鳴き声による音声ガイドシステムを 提供することを目的としている。 発明の開示  In view of the above points, an object of the present invention is to provide a voice guidance system based on sounds of animals and birds, which provides a user with an explanation of the animals and birds based on the sounds of the sounds. Disclosure of the invention
上記目的は、 この発明によれば、 自然公園等の施設の敷地内にて適宜に配置さ れた複数個の固定マイクから成るマイクロフォンアレイと、 施設の利用者が携帯 する G P S , 可動マイク及びへッドホンから成る少なくとも一つのへッドセット と、 上記マイクロフォンアレイの各固定マイク及び各へッドセッ トの可動マイク からの音響信号から、各鳴き声に関する音源の定位及び音源毎の音響デー夕の分 離を行なう音源分離 ·音源定位部と、 上記音源分離 ·音源定位部で分離された音 源毎の音響データから鳴き声を発した動物等の種類を特定する判別部と、 上記音 源分離'音源定位部で定位された音源の絶対座標位置を、各ヘッドセッ トの G PAccording to the present invention, there is provided a microphone array including a plurality of fixed microphones appropriately arranged on a site of a facility such as a natural park, a GPS, a movable microphone, and a microphone carried by a user of the facility. At least one headset consisting of headphones, each fixed microphone of the microphone array and a movable microphone of each headset Sound source separation for sound source localization and separation of sound data for each sound source from the sound signal from the sound source.From the sound source localization part and the sound data for each sound source separated by the above sound source separation and sound source localization part. A discriminator that identifies the type of animal that made the call, etc., and the absolute coordinate position of the sound source localized by the sound source separation '
Sによる位置情報に基づいて相対座標位置に変換すると共に、上記判別部により 特定された動物等の種類により、 当該へッドセットに関する解説データを作成し て、 当該へッドセットのへッドホンに出力するガイド作成部と、 を含んでいるこ とを特徴とする鳴き声による音声ガイドシステムにより、達成される。 A guide to convert to the relative coordinate position based on the position information by S, create commentary data on the headset based on the kind of animal etc. specified by the discriminator, and output it to the headphone of the headset The present invention is achieved by a voice-based voice guidance system, which comprises:
本発明の鳴き声による音声ガイドシステムは、好ましくは、上記音源分離 ·音 源定ィ立部が、各へッドセットからの G P Sによる位置情報及び可動マイクによる 音響信号を受信する受信部を備えており、 上記ガイド作成部が、 解説データを送 信する送信部を備えていて、上記各へッドセットが、 G F Sによる位置情報及び 可動マイクによる音響信号を上記受信部に送信すると共に、上記送信部からの解 説データを受信する送受信部を備えている。  Preferably, the sound guide system based on a call according to the present invention preferably includes a receiving unit in which the sound source separation and sound source setting unit receives position information by GPS from each headset and an acoustic signal by a movable microphone. The guide creation unit includes a transmission unit for transmitting the commentary data, and each of the headsets transmits the position information by GFS and the acoustic signal from the movable microphone to the reception unit, and transmits the solution from the transmission unit. And a transmission / reception unit for receiving the description data.
本発明の鳴き声による音声ガイドシステムは、好ましくは、上記音源分離'音 源定位部が、各固定マイク及び各可動マイクからの音響信号に基づいて、一般的 なマイクロフォンアレイを利用した数学的解法により音源分離を行なう。  Preferably, the sound guide system based on a call according to the present invention is arranged such that the sound source separation and sound source localization unit performs a mathematical solution using a general microphone array based on acoustic signals from each fixed microphone and each movable microphone. Performs sound source separation.
本発明の鳴き声による音声ガイドシステムは、好ましくは、上記音源分離'音 源定位部が、 ディレクションパスフィノレ夕により音源分離を行なう。  In the voice guidance system according to the cry of the present invention, preferably, the sound source separation 'sound source localization unit performs sound source separation by using a direction path finola.
本発明の鳴き声による音声ガイドシステムは、好ましくは、上記音源分離 -音 源定位部が、 さらに、独 3t成分分析により音源分離を行なう。  In the voice guidance system based on a call according to the present invention, preferably, the sound source separation-sound source localization unit further performs sound source separation by German 3t component analysis.
本発明の鳴き声による音声ガイドシステムは、好ましくは、 上記判別部が、各 種の動物等の鳴き声を備えた種類データベースを参照して、 動物等の鳴き声から 当該動物等の種類の特定を行なう。  In the voice guidance system according to the present invention, preferably, the discriminating unit refers to a type database including the sounds of the animals and the like, and specifies the type of the animals and the like from the sounds of the animals and the like.
本発明の鳴き声による音声ガイドシステムは、好ましくは、上記判別部が、動 物等の種類に基づいて、 さらに各種の詳細データベースを参照して、動物等の雌 雄判別, 状況判別等の詳細判別を行なう。  In the voice guidance system based on a call according to the present invention, the discriminating unit preferably performs detailed discrimination such as sex discrimination of animals and the like and situation discrimination based on the type of the animal and the like and further by referring to various detailed databases. Perform
上記構成によれば、 マイクロフォンアレイの各固定マイク及び各へッドセット の可動マイクが自然公園等の敷地内の動物や鳥の鳴き声を集音し、集音された音 響信号から、 音源分離'音源定位部が、各鳴き声の音源の絶対座標位置及び各音 源毎の音響データに分離する。 そして、判別部が、音源分離'音源定位部で分離 された各音源每の音響デ一タから各音源毎の鳴き声を発した動物等の種類を特定 して、 ガイド作成部が、 この動物等の禾顧により各へッドセッ卜の位置情報に対 応して解説データを作成し、 出力部が、 この解説データを当該へッドセットのへ ッドホンに出力する。 According to the above configuration, the fixed microphones of the microphone array and the movable microphones of each headset collect sounds of animals and birds in the premises such as a natural park, and the collected sounds are collected. From the sound signal, the sound source separation 'sound source localization unit separates the sound into the absolute coordinate position of the sound source of each call and sound data for each sound source. Then, the discriminating unit specifies the type of animal or the like that squeaked for each sound source from the sound data of each sound source た separated by the sound source separation 'sound source localization unit, and the guide creating unit determines The commentary data is created corresponding to the position information of each headset by the customer, and the output unit outputs the commentary data to the headphones of the headset.
これにより、利用者は、 自分で携帯するへッドセッ卜のへッドホンから流れる 解説を聴くことにより、 鳴き声を発した動物や鳥等に関する解説を入手すること ができると共に、 当該動物や鳥等の相対座標位置に基づいて、容易に当該動物や 鳥等を見つけることができる。  This allows the user to obtain a commentary on the animal or bird that made the cry by listening to the commentary flowing from the headphone of the headset carried by the user. The animal, bird, etc. can be easily found based on the coordinate position.
上記音源分離 ·音源定位部が、各ヘッドセットからの G P Sによる位置情報及 び可動マイクによる音響信号を受信する受信部を備えており、上記ガイド作成部 が、 解説データを送信する送信部を備えていて、上記各へッドセットが、 G P S による位置情報及び可動マイクによる音響信号を上記受信部に送信すると共に、 上記送信部からの解説デ一タを受信する送受信部を備えている場合には、各へッ ドセットが、 音源分離-音源定位部に対して G P Sによる位置情報及び可動マイ クによる音響信号を送信すると共に、 上記ガイド作成部からの解説データを受信 することによって、 ワイヤレス式にデー夕の送受信を行なうことができる。 従つ て、利用者は、 へッドセットを携帯しながら、 自然公園等の敷地内を自由に移動 することができる。  The sound source separation and sound source localization unit has a receiving unit that receives the position information by GPS from each headset and the acoustic signal from the movable microphone, and the guide creation unit has a transmission unit that sends the commentary data. If each of the headsets includes a transmitting / receiving unit that transmits position information by GPS and an acoustic signal by a movable microphone to the receiving unit and receives commentary data from the transmitting unit, Each headset transmits the position information by GPS and the acoustic signal by the movable microphone to the sound source separation and sound source localization unit, and receives the commentary data from the guide creation unit. Can be transmitted and received. Therefore, the user can freely move around the premises such as a natural park while carrying the headset.
上記音源分離 ·音源定位部が、各固定マイク及び各可動マイクからの音響信号 に基づいて、 一般的なマイクロフオンアレイを利用した数学的解法により音源分 離を行なう場合には、可動マイクを含む広義のマイクロフオンアレイを利用して このような数学的解法によつて音源分離を行なうことができる。  Sound source separationIf the sound source localization unit performs sound source separation based on acoustic signals from each fixed microphone and each movable microphone by a mathematical solution using a general micro phone array, it includes the movable microphone Sound source separation can be performed by such a mathematical solution using a microphone array in a broad sense.
上記音源分離 ·音源定位部が、 ディレクシヨンパスフィル夕により音源分離を 行なう場合には、 ディレクシヨンパスフィルタを使用することによって音源分離 を容易に行なうことができる。  When the sound source localization unit performs sound source separation by using a direction pass filter, sound source separation can be easily performed by using a direction pass filter.
上記音源分離'音源定位部が、 さらに、 独立成分分析により音源分離を行なう 場合には、 独立成分分析を使用して音源分離を確実に行なうことができる。 上記判別部が、各種の動物等の鳴き声を備えた 頁データベースを参照して、 動物等の鳴き声から当該動物等の種類の特定を行なう場合には、 判別すべき動物 等の鳴き声が前もってデータベース化されているので、動物等の鳴き声からその 動物等の種類を迅速に特定することができる。 When the sound source separation 'sound source localization unit further performs sound source separation by independent component analysis, sound source separation can be reliably performed by using independent component analysis. When the discriminating unit specifies the type of the animal or the like from the call of the animal with reference to the page database provided with the sound of the animal or the like, the call of the animal or the like to be discriminated is prepared in advance in a database. Therefore, the type of the animal or the like can be quickly identified from the call of the animal or the like.
上記判別部が、 動物等の に基づいて、 さらに各種の詳細データベースを参 照して、 動物等の雌雄判別, 状況判別等の詳細判別を行なう場合には、 動物等の 鳴き声から、 さらに詳細な判別、即ち動物の性別, どのような状況で発する鳴き 声であるか、等の判別を行なうことができる。 図面の簡単な説明  When the discriminating unit performs detailed discrimination such as gender discrimination of animals and the like and situation discrimination based on the data of the animals and the like and further by referring to various detailed databases, a more detailed discrimination based on the call of the animals etc. Discrimination, that is, the gender of the animal, the situation under which the call is emitted, and the like can be performed. BRIEF DESCRIPTION OF THE FIGURES
本発明は、以下の詳細な説明及び本発明の幾つかの実施の形態を示す添付図面 に基づいて、 より良く理解されるものとなろう。 なお、添付図面に示す実施の形 態は本発明を特定又は限定することを意、図するものではなく、単に本発明の説明 及び理解を容易とするためだけに記載されたものである。  The invention will be better understood on the basis of the following detailed description and the accompanying drawings, which show some embodiments of the invention. The embodiments shown in the accompanying drawings are not intended to specify or limit the present invention, but are described merely for facilitating the explanation and understanding of the present invention.
図中、  In the figure,
図 1は、 この発明による鳥の鳴き声による音声ガイドシステムの一実施形態の 電気的構成を示すプロック図である。  FIG. 1 is a block diagram showing an electric configuration of an embodiment of a voice guidance system based on bird calls according to the present invention.
図 2は、 図 1の音声ガイドシステムの全体構成を示す概略斜視図である。 図 3は、 図 1の音声ガイドシステムにおけるへッドセットの構成を示すプロッ ク図である。  FIG. 2 is a schematic perspective view showing the entire configuration of the voice guidance system of FIG. FIG. 3 is a block diagram showing a configuration of a head set in the voice guidance system of FIG.
図 4は、 図 1の音声ガイドシステムの動作を示すフローチャートである。 発明を実施するための最良の形態  FIG. 4 is a flowchart showing the operation of the voice guidance system of FIG. BEST MODE FOR CARRYING OUT THE INVENTION
以下、 本発明を好適な実施の形態について図面を参照して詳細に説明する。 図 1は、 この発明を適用した鳥の鳴き声による音声ガイドシステムの一実施形 態の電気的構成を示している。 図 1において、 音声ガイドシステム 1 0は、 図 2 に示す自然公園等の施設の敷地 1 0 a内にて適宜に配置された複数個の固定マイ ク 1 1 a, l i b , · · ·から鳴るマイクロフォンアレイ 1 1と、音源分離-音 源定位部 1 2と、判別部 1 3と、 ガイド作成部 1 4と、施設の利用者が携帯する へッドセット 2 0と、 から構成されている。 Hereinafter, preferred embodiments of the present invention will be described in detail with reference to the drawings. FIG. 1 shows an electrical configuration of an embodiment of a voice guidance system based on a bird singing to which the present invention is applied. In FIG. 1, the voice guidance system 10 sounds from a plurality of fixed microphones 11 a, lib,... Appropriately arranged in the site 10 a of a facility such as a natural park shown in FIG. 2. Microphone array 11, sound source separation and sound source localization unit 12, discrimination unit 13, guide creation unit 14, and carried by facility users And head set 20.
上記マイクロフォンアレイ 1 1は、 図 2に示すように、 自然公園等の施設の敷 地 1 0 a内にて適宜に固定配置された複数個の固定マイク 1 1 a, l i b, · · 'から構成されている。 各固定マイク 1 1 a, l i b, · · ·は、基本的に上方 からの鳥の鳴き声を集音すればよいので、 指向性マイクを使用する。 そして、各 固定マイク' 1 1 a, l i b, · · ·は、 その音響信号が、後述する音源分離'音 源定位部 1 2に入力されるようになっている。 なお、各固定マイク 1 1 a, 1 1 b, · · ·は、 それぞれケーブルを介して音源分離'音源定位部 1 2に接続され てもよく、 ワイヤレス式に接続されるようにしてもよい。  As shown in Fig. 2, the microphone array 11 is composed of a plurality of fixed microphones 11a, lib, Have been. Each fixed microphone 11a, lib, · · · basically uses a directional microphone because it only has to collect the bird's singing from above. The sound signals of the fixed microphones '11a, lib, ··· are input to a sound source separation' sound source localization unit 12 described later. Note that each of the fixed microphones 11a, 11b, ... may be connected to the sound source separation and sound source localization unit 12 via a cable, or may be connected wirelessly.
上記音源分離'音源定位部 1 2は、 自然公園等の施設の敷地 1 0 a内に設けら れた例えば管理棟 1 O b等に設けられており、 マイクロフォンアレイ 1 1の各固 定マイク 1 1 a, 1 l b, '- - '及び各へッドセット 20の可動マイクからの音 響信号に基づいて鳥の鳴き声を検出して、 音源である鳥の定位(位置情報〜絶対 座標位置) と、 当該鳥の鳴き声の分離を行なうようになっている。 その際、 マイ クロフオンアレイ 1 1の各固定マイク 1 1 a, l i b, · · ·の位置は、 前もつ て音源分離 ·音源定位部 1 2に入力されており、 また各へッドセット 20の位置 は、 へッドセット 20に備えられた GF Sによる位置情報により検出されること に る。  The sound source separation and sound source localization section 12 is provided, for example, in a management building 1 Ob in the site 10 a of a facility such as a natural park, and includes a fixed microphone 1 of the microphone array 11. 1a, 1 lb, '--' and the sound of the bird based on the sound signals from the movable microphones of each headset 20 are detected, and the localization (position information to absolute coordinate position) of the sound source, The song of the bird is separated. At this time, the positions of the fixed microphones 11a, lib, and so on of the microphone array 11 were previously input to the sound source separation and sound source localization unit 12 and the position of each headset 20. Is detected by the position information by the GFS provided in the headset 20.
そして、上記音源分離 ·音源定位部 1 2は、音源分離部 1 2 a, ディレクショ ンパスフィル夕 1 1 b及び I CA (I nd ep e ndent Comp onen t Ana l y s i s〜独立成分分析) 部 1 2 cを含んでいる。  The sound source separation / sound source localization unit 12 includes a sound source separation unit 12a, a direction path filter 11b, and an ICA (Independent Component Analysis) unit 12c. Contains.
上記音源分離部 1 2 aは、 マイクロフォンアレイ 1 1の各固定マイク 1 1 a, l i b, · · '及び各へッドセット 20の可動マイクからの音響信号に基づいて 、一般的なマイクロフォンアレイを利用した数学的解法により、 音源分離を実施 するように構成されている。  The sound source separation unit 12a uses a general microphone array based on sound signals from the fixed microphones 11a, lib,... 'Of the microphone array 11 and the movable microphones of each headset 20. It is configured to perform sound source separation by a mathematical solution.
上記ディレクシヨンパスフィルタ 1 2 bは、 マイクロフォンアレイ 1 1の各固 定マイク 1 1 a, l i b, · · ·及び各ヘッドセット 20の可動マイクからの音 響信号に基づいて、 両耳間位相差 I PD, 両耳間 ¾ ^差 I I Dを利用して音源の 分離を行なう。 また、 上記 I C A部 1 2 cは、個々の固定マイク 1 1 a , l i b , · · ·及び 可動マイクからの音響信号に基づいて、独立成分分析として、各マイクの音響信 号から、 互いに独立で且つ未知の確率分布に従う各マイクの音響信号レベルによ り、行列計算にて各音源の音響信号を復元して音源の分離を行なうと共に、 音源 の絶対座標位置を検出するようになっている。 The above-mentioned direction pass filter 12b is used to determine the phase difference between the binaural ears based on the sound signals from the fixed microphones 11a, lib, ... of the microphone array 11 and the movable microphones of each headset 20. Sound source separation is performed using IPD and binaural ¾ ^ difference IID. The ICA unit 12c performs independent component analysis based on the acoustic signals from the individual fixed microphones 11a, lib, In addition, based on the sound signal level of each microphone according to the unknown probability distribution, the sound signal of each sound source is restored by matrix calculation to separate the sound sources, and the absolute coordinate position of the sound sources is detected.
上記判別部 1 3は、 同様に自然公園等の施設の敷地 1 0 a内の、例えば管理棟 1 0 bに設けられており、 音源分離 ·音源定位部 1 により分離された鳥の鳴き 声と位置情報 (絶対座標位置) そして各へッドセット 2 0の位置情報が入力され ることにより、 各鳥の鳴き声の特定を行なうものであり、種類判別部 1 3 a及び 詳細判別部 1 3 bを含んでいる。  The discriminating unit 13 is also provided, for example, in the management building 10b in the facility site 10a of a natural park or the like, and is provided with the sound of birds separated by the sound source separation / sound source localization unit 1. The position information (absolute coordinate position) and the input of the position information of each headset 20 are used to specify the singing of each bird, and include a type discriminator 13a and a detailed discriminator 13b. In.
種類判別部 1 3 aは、 各種の鳥の鳴き声を備えた種類データベース 1 3 cを参 照して、 鳥の鳴き声から当該鳥の の特定を行ない、 鳥の種類情報と位置情報 The type discriminating unit 13a refers to the type database 13c having various types of bird calls and identifies the bird from the bird calls, and provides information on the types and positions of the birds.
(絶対座標位置) そして各へッドセット 2 0の位置情報をガイド作成部 1 4に送 出する。 (Absolute coordinate position) Then, the position information of each head set 20 is sent to the guide creating unit 14.
また、 詳細判別部 1 3 bは、音源分離 ·音源定位部 1 2からの鳥の鳴き声と種 類判別部 1 3 aからの鳥の種類情報に基づいて、 当該鳥の性別, 状況等による鳴 き声を備えた詳細データベース 1 3 dを参照して、 鳥の鳴き声から当該鳥の雌雄 判別, 状況判別等の詳細判別を行ない、 鳥の詳細情報を、 ガイド作成部 1 4に送 出する。  Further, the detailed discriminator 13b, based on the bird's singing from the sound source separation and sound source localization unit 12 and the bird type information from the type discriminator 13a, determines the sound of the bird according to its gender, status, etc. Referring to the detailed database 13d with singing voices, the singing of the birds is used to make detailed judgments such as gender discrimination and situation discrimination, and the detailed information on the birds is sent to the guide creation unit 14.
上記ガイド作成部 1 4は、 同様に自然公園等の施設の敷地 1 0 a内の、例えば 管理棟 1 0 bに設けられており、各へッドセット 2 0毎に、判別部 1 3からの位 置情報 (絶対座標位置) を、各へッドセット 2 0の G F Sによる位置情報に基づ いて、 鳥の位置情報の相対座標位置に変換すると共に、判別部 1 3により判別さ れた鳥の種類情報及び詳細情報に基づいて、 当該へッドセット 2 0に関する解説 データを作成し、 図 1に示す送受信部 4により当該へッドセット 2 0に対して 送 1§Τる。  The guide creation unit 14 is also provided in the management premises 10b, for example, in the site 10a of the facility such as a natural park, and the position of the discrimination unit 13 is determined for each headset 20. The position information (absolute coordinate position) is converted into the relative coordinate position of the bird position information based on the GFS position information of each headset 20 and the bird type information determined by the determination unit 13 Based on the detailed information and the detailed information, the commentary data on the headset 20 is created and transmitted to the headset 20 by the transmission / reception unit 4 shown in FIG.
ここで、 ガイド作成部 1 4は、解説データを作成する際に、個々のへッドセッ ト 2 0の位置情報に基づいて、 当該へッドセット 2 0の周辺、即ち当該へッドセ ット 2 0を使用している利用者が聞こえる範囲内の鳥の鳴き声に限定して解説デ —夕を作成する。 なお、 図示の場合、 ガイド作成部 1 4は、各へッドセット 2 0 毎に備えられているが、 これに限らず、一つのガイド作成部 1 4が、 すべてのへ ッドセット 2 0に対応して解説デ一タを作成するようにしてもよく、 またヘッド セット 2 0の数より少ない数のガイド作成部 1 4を用意しておき、 必要に応じて 、適宜へッドセット 2 0に対して解説データを作成するようにしてもよい。 上記へッドセット 2 0は、 自然公園等の施設の利用者が、 それぞれ携帯するこ とができるように少なくとも一つ用意されており、 図 3に示すように、 可動マイ ク 2 1と、 G P S 2 2と、 さらにへッドホン 2 3と、送受信部 2 4と、 ノイズキ ヤンセル回路 2 5と、 から構成されている。 Here, when creating the commentary data, the guide creating unit 14 uses the periphery of the head set 20, that is, the head set 20 based on the position information of each head set 20. The explanation is limited to the birdsong within the range where the user —Create evenings. In the illustrated case, the guide creating unit 14 is provided for each head set 20. However, the present invention is not limited to this, and one guide creating unit 14 corresponds to all the head sets 20. Commentary data may be created, and a guide creation unit 14 smaller in number than the number of headsets 20 is prepared. May be created. At least one headset 20 is provided so that users of facilities such as a natural park can carry it. As shown in Fig. 3, a movable microphone 21 and a GPS 2 2, a headphone 23, a transmission / reception unit 24, and a noise canceling circuit 25.
上記可動マイク 2 1は、 当該へッドセット 2 0の周囲の音を検出して、音響信 号を生成するようになっている。 上記 G P S 2 2は公知の構成であって、 G P S 衛星からの電波を受信することにより当該へッドせット 2 0の位置を検出して、 位置情報を生成するようになっている。 上記へッドホン 2 3は、 ガイド作成部 1 4からの解説データに基づいて、利用者に対して解説音声を提供するようになつ ている。 そして、 上記送受信部 2 4は、上言己可動マイク 2 1からの音響信号及び G P S 2 2からの位置情報を、音源分離 -音源定位部 1 2に対して送信すると共 に、上記ガイド作成部 1 4からの解説データを受信するようになっている。 さら に、上記ノイズキャンセル回路 2 5は、 当該へッドセット 2 0を使用している利 用者が、 ヘッドホン 2 3からの解説音声が邪魔にならないように、 ノイズのキヤ ンセルを; f亍なう。  The movable microphone 21 detects a sound around the headset 20 and generates an acoustic signal. The above-mentioned GPS 22 has a known configuration, and detects the position of the head set 20 by receiving a radio wave from the GPS satellite to generate position information. The headphone 23 provides a commentary voice to the user based on the commentary data from the guide creation unit 14. The transmission / reception unit 24 transmits the sound signal from the movable microphone 21 and the position information from the GPS 22 to the sound source separation / sound source localization unit 12, It is designed to receive commentary data from 14. Further, the noise canceling circuit 25 controls the noise canceling so that the user using the headset 20 does not disturb the explanation sound from the headphones 23. .
本発明実施形態による音声ガイドシステム 1 0は以上のように構成されており 、 図 4のフローチャートに示すように、 自然公園等の施設における鳥の鳴き声に 基づいて各利用者に対してへッドセット 2 0により解説が提供される。  The voice guide system 10 according to the embodiment of the present invention is configured as described above, and as shown in the flowchart of FIG. 4, a headset 2 is set for each user based on the birdsong in a facility such as a natural park. 0 provides commentary.
先ず、 図 4において、 ステップ S T 1にて、 マイクロフォンアレイ 1 1の各固 定マイク l l a, l i b , · · ·及び各へッドセット 2 0の可動マイク 2 1が、 自然公園等の施設における鳥の鳴き声を検出して、 音響信号を音源分離 ·音源定 位部 1 2に対して送出する。 ここで、各固定マイク 1 1 a, 1 1 bの位置は、前 もつて音源分離 ·音源定位部 1 2に入力されており、 また各へッドセット 2 0の 可動マイク 2 1の位置は、 当該へッドセッ ト 2 0の G P Sによる位置情報が音源 分離 ·音源定位部 1 2に入力されることにより既知である。 First, in FIG. 4, at step ST1, the fixed microphones lla, lib,... Of the microphone array 11 and the movable microphones 21 of the respective headsets 20 are used to make bird calls in facilities such as a natural park. And sends an acoustic signal to the sound source separation / sound source localization unit 12. Here, the positions of the fixed microphones 11a and 11b have been previously input to the sound source separation / sound source localization unit 12 and the position of the movable microphone 21 of each headset 20 is Headset 20 GPS location information is sound source Separation · This is known by being input to the sound source localization unit 12.
次に、 ステップ S T 2にて、音源分離'音源定位部 1 2は、 マイクロフォンァ レイ 1 1の各固定マイク l l a, l i b , · · ·及び各へッドセッ卜 2 0の可動 マイク 2 1からの音響信号に基づいて、音源分離部 1 2 aにて一般的なマイクロ フォンアレイ 1 1を利用した数学的解法による音源分離を行ない、 またディクレ クシヨンパスフィルタ 1 2 bにより I F D及び I I Dを利用した音源分離を行な い、 さらに、 I C A部 1 2 cにて独立成分分析により音源分離を行なうことによ り、音源である鳴き声を発した鳥の位置情報 (絶対座標位置) を検出すると共に 、各鳥の鳴き声を分離する。 そして、音源分離 ·音源定位部 1 2は、鳥の位置情 報 G»f座標位置) 及び鳥の鳴き声を、各へッドセット 2 0の位置情報と共に判 別部 1 3に出力する。  Next, in step ST2, the sound source separation / sound source localization unit 12 sends the sound from the fixed microphones lla, lib, ... of the microphone array 11 and the movable microphone 21 of each headset 20. Based on the signal, the sound source separation unit 12a performs sound source separation by a mathematical solution using a general microphone array 11 and a sound source using IFD and IID by a decimation pass filter 12b. By performing sound source separation by independent component analysis in the ICA unit 12c, the position information (absolute coordinate position) of the bird that emitted the squealing sound is detected, and Separate bird calls. Then, the sound source separation / sound source localization unit 12 outputs the bird's position information G »f coordinate position) and the bird's singing together with the position information of each headset 20 to the discrimination unit 13.
続いてステツプ S T 3にて、判別部 1 3は、音源分離 ·音源定位部 1 2から入 力される鳥の位置情報 (絶対座標位置)及び鳥の鳴き声, 各へッドセット 2 0の 位置情報に基づいて、禾薩判別部 1 3 aにより鳥の鳴き声から種類データベース 1 3 cを利用して鳥の種類を特定する。 そして、種類判別部 1 3 aは、鳥の種類 情報と位置情報 (絶対座標位置) をガイド作成部 1 4に出力すると共に、詳細判 別部 1 3 bに対して鳥の鳴き声及び種類情報を詳細判別部 1 3 bに送出する。 これにより、 ステップ S T 4にて、詳細判別部 1 3 bは、 鳥の鳴き声と種類情 報に基づいて詳細データベース 1 3 dを利用して鳥の詳細判別を行なって、 鳥の 詳細情報をガイド作成部 1 4に出力する。  Subsequently, in step ST3, the discrimination unit 13 converts the position information (absolute coordinate position) of the bird input from the sound source separation and sound source localization unit 12 and the call of the bird, and the position information of each headset 20 into each other. On the basis of this, the type of bird is specified by the moss discriminating unit 13a from the song of the bird using the type database 13c. Then, the type discriminating unit 13a outputs the type information and the position information (absolute coordinate position) of the bird to the guide creating unit 14, and also transmits the bird call and the type information to the detailed judging unit 13b. It is sent to the detail discriminator 13b. Accordingly, in step ST4, the detailed discriminating unit 13b performs detailed discrimination of the bird using the detailed database 13d on the basis of the call and the type information of the bird, and guides the detailed information of the bird. Output to the creation unit 14.
そして、 ステップ S T 5にて、 ガイド作成部 1 4は、判別部 1 3からの鳥の種 類情報, 詳細情報及び位置情報 (絶対座標位置) のうち、位置情報 (絶対座標位 置) を、 各へッドセット 2 0每に各へッドセット 2 0の G P S 2 2による位置情 報に基づいて鳥の位置情報の相対座標位置に変換すると共に、上記 S ^頁情報及び 詳細情報に基づいて当該へッドセット 2 0に関する解説データを作成して、 図示 しない送信部により当該へッドセット 2 0に対して送信する。  Then, in step ST5, the guide creating unit 14 writes the position information (absolute coordinate position) of the bird type information, detailed information and position information (absolute coordinate position) from the discriminating unit 13 into Each head set 20 每 is converted into the relative coordinate position of the bird's position information based on the position information of each head set 20's GPS 22 and based on the above S ^ page information and detailed information. The commentary data on 20 is created and transmitted to the headset 20 by a transmitting unit (not shown).
これにより、 ステップ S T 6にて、各へッ ドセット 2 0は、 ガイド作成部 1 4 により作成された解説データを送受信部 2 4で受信して、 ノイズキャンセル回路 2 5にてノイズキャンセルを行なった後、 へッドホン 2 3から解説音声を利用者 に提供する。 従って、 利用者は、 へッドホン 2 3から聞こえる解説音声により、 利用者の位置から観察すベき鳥に対する相対座標位置に基づく解説を聴くことが でき、 当該鳥を容易に見つけることができると共に、 そのときの鳴き声による鳥 の状況を把握することができる。 As a result, in step ST6, each headset 20 received the explanation data created by the guide creation unit 14 by the transmission / reception unit 24, and performed noise cancellation by the noise cancellation circuit 25. Later, the user can use the commentary voice from the headphone 2 3 To provide. Therefore, the user can hear the comment based on the relative coordinate position of the bird to be observed from the user's position by the commentary sound heard from the headphone 23, and can easily find the bird, The bird's situation based on the call can be ascertained.
このようにして、 本発明実施形態による音声ガイドシステム 1 0によれば、 マ イク口フォンアレイ 1 1の各固定マイク 1 1 a, l i b , · · ·からの音響信号 と、各利用者が携帯するへッドセット 2 0の可動マイク 2 1からの音響信号とか ら、音源分離 ·音源定位部 1 2により鳥の鳴き声に関する音源分離及び音源定位 を行なって、判別部 1 3により鳥の鳴き声から鳥の種類及び詳細判別を行ない、 これらに基づいて、 各利用者のへッドセット 2 0に対して、 それぞれへッドセッ ト 2 0の位置に対応した解説を提供することができる。 その際、各へッドセット 2 0の可動マイク 2 1からの音響信号も参照して、音源分離及び音源定位を行な うことにより、各利用者の位置に関してより正確な音源定位を行なうことができ る。  Thus, according to the audio guidance system 10 according to the embodiment of the present invention, the audio signals from the fixed microphones 11a, lib,... Sound source separation and sound source localization unit 12 performs sound source separation and sound source localization from the sound signal from movable microphone 21 of headset 20, and discriminator 13 recognizes the sound of birds The type and details are determined, and a description corresponding to the position of the headset 20 can be provided to the headset 20 of each user based on these. At this time, sound source separation and sound source localization are also performed with reference to the sound signals from the movable microphones 21 of each headset 20 so that more accurate sound source localization can be performed with respect to the position of each user. You.
上述した実施形態においては、鳥の鳴き声による音声ガイドシステムについて 説明したが、 これに限らず、 鳥以外の動物の鳴き声による音声ガイドシステムに ついても、 本発明を適用し得ることは明らかである。 産業上の利用可能性  In the embodiment described above, the voice guidance system based on the singing of a bird has been described. However, it is apparent that the present invention is not limited to this, and can be applied to a voice guiding system based on the singing of an animal other than a bird. Industrial applicability
以上述べたように、 この発明によれば、 マイクロフォンアレイの各固定マイク 及び各へッドセットの可動マイクが自然公園等の敷地内の動物や鳥の鳴き声を集 音し、集音された音響信号から、音源分離 ·音源定位部が、各鳴き声の音源の絶 対座標位置及び各音源毎の音響データに分離する。 そして、判別部が、音源分離 •音源定位部で分離された各音源毎の音響デ一夕から各音源毎の鳴き声を発した 動物等の種類を特定して、 ガイド作成部が、 この動物等の種類により各へッドセ ットの位置情報に対応して解説データを作成し、 出力部が、 この解説データを当 該へッドセットのへッドホンに出力する。 これにより、利用者は、 自分で携帯す るへッドセットのへッドホンから流れる解説を聴くことにより、 鳴き声を発した 動物や鳥等に関する解説を入手することができると共に、 当該動物や鳥等の相対 座標位置に基づいて、 容易に当該動物や鳥等を見つけることができる。 As described above, according to the present invention, each fixed microphone of the microphone array and the movable microphone of each headset collect the sounds of animals and birds in the premises such as a natural park and the like, and , Sound source separation • The sound source localization unit separates the sound into the absolute coordinate position of each sound source and the sound data for each sound source. Then, the discriminating unit specifies the type of animal or the like that squeaks for each sound source from the sound source for each sound source separated by the sound source separation unit and the sound source localization unit. According to the type of the headset, explanation data is created corresponding to the position information of each headset, and the output unit outputs the explanation data to the headphone of the headset. This allows the user to obtain a commentary on the animal or bird that made the squeal by listening to the commentary flowing from the headphone of the headset carried by the user. The animal, bird, or the like can be easily found based on the coordinate position.
したがって、 本発明によれば、 動物や鳥の鳴き声に基づいて、 当該動物や鳥の 解説を利用者に提供するようにした、極めて優れた鳴き声による音声ガイドシス テムが提供される。  Therefore, according to the present invention, there is provided a voice guidance system with an extremely excellent cry which provides a user with an explanation of the animal or the bird based on the cry of the animal or the bird.

Claims

請 求 の 範 囲 The scope of the claims
1 . 自然公園等の施設の敷地内にて適宜に配置された複数個の固定マイク から成るマイクロフォンアレイと、 1. A microphone array consisting of a plurality of fixed microphones arranged as appropriate on the premises of a facility such as a natural park;
施設の利用者が携帯する G F S, 可動マイク及びへッドホンから成る少なくと も一つのへッドセッ トと、  At least one headset consisting of the GFS, mobile microphone and headphone carried by the facility user;
前記マイクロフォンアレイの各固定マイク及び各へッドセットの可動マイクか らの音響信号から、各鳴き声に関する音源の定位及び音源毎の音響データの分離 を行なう音源分離 ·音源定位部と、  Sound source separation and sound source localization unit for performing sound source localization and sound data separation for each sound source from sound signals from each fixed microphone and each headset movable microphone of the microphone array,
前記音源分離 ·音源定位部で分離された音源毎の音響データから鳴き声を発し た動物等の禾難頁を特定する判別部と、  A discriminating unit that specifies a pest page of an animal or the like that has squeaked from acoustic data for each sound source separated by the sound source separation and sound source localization unit;
前記音源分離 ·音源定位部で定位された音源の絶対座標位置を、各へッドセッ 卜の G P Sによる位置情報に基づいて相対座標位置に変換すると共に、前記判別 部により特定された動物等の種類により、 当該へッドセットに関する解説データ を作成して、 当該へッドセットのへッドホンに出力するガイド作成部と、 を含んでいることを特徴とする、 鳴き声による音声ガイドシステム。  The absolute coordinate position of the sound source localized by the sound source separation / localization unit is converted into a relative coordinate position based on the positional information of each headset by GPS, and the type is determined by the type of animal or the like specified by the determination unit. And a guide creation unit for creating commentary data on the headset and outputting the data to the headphone of the headset.
2 . 前記音源分離 ·音源定位部が、各ヘッドセットからの G P Sによる位 置情報及び可動マイクによる音響信号を受信する受信部を備えており、 2. The sound source separation / sound source localization unit includes a receiving unit that receives position information by GPS from each headset and an acoustic signal by a movable microphone,
前記ガイド作成部が、解説データを送信する送信部を備えていて、  The guide creation unit includes a transmission unit that transmits commentary data,
前記各へッドセッ卜が、 G F Sによる位置情報及び可動マイクによる音響信号 を前記受信部に送信すると共に、前記送信部からの解説データを受信する送受信 部を備えていることを特徴とする、請求項 1に記載の鳴き声による音声ガイドシ ステム。  The headsets each include a transmitting / receiving unit that transmits position information by GFS and an acoustic signal by a movable microphone to the receiving unit, and receives commentary data from the transmitting unit. The voice guidance system described in 1 above.
3 . 前記音源分離 ·音源定位部が、各固定マイク及び各可動マイクからの 音響信号に基づいて、 一般的なマイクロフオンアレイを利用した数学的麟去によ り音源分離を行なうことを特徴とする、請求項 1または 2に記載の鳴き声による 音声ガイドシステム。 3. The sound source separation and sound source localization unit performs sound source separation based on acoustic signals from the fixed microphones and the movable microphones by mathematical subtraction using a general micro phone array. 3. The voice guidance system according to claim 1, wherein
4 . 前記音源分離 -音源定位部が、 ディレクシヨンパスフィルタにより音 源分離を行なうことを特徴とする、請求項 1から 3の何れかに記載の鳴き声によ る音声ガイドシステム。 4. The voice guidance system according to any one of claims 1 to 3, wherein the sound source separation-sound source localization unit performs sound source separation using a direction pass filter.
5 . 前記音源分離 ·音源定位部が、 さらに、 独立成分分析により音源分離 を行なうことを特徴とする、請求項 1から 4の何れかに記載の鳴き声による音声 ガイドシステム。 5. The voice guidance system according to any one of claims 1 to 4, wherein the sound source separation / sound source localization unit further performs sound source separation by independent component analysis.
6 . 前記判別部が、 各種の動物等の鳴き声を備えた種類データベースを参 照して、動物等の鳴き声から当該動物等の種類の特定を行なうことを特徴とする 、請求項 1から 5の何れかに記載の鳴き声による音声ガイ 6. The method according to claim 1, wherein the discriminating unit refers to a type database having sounds of various animals and the like, and specifies the type of the animals and the like from the sounds of the animals and the like. Voice guide by any of the calls
7 . 前記判別部が、 動物等の種類に基づいて、 さらに各種の詳細データべ —スを参照して、動物等の雌雄判別, 状況判別等の詳細判別を行なうことを特徴 とする、請求項 1力、ら 6の何れかに記載の鳴き声による音声ガイドシステム。 7. The discriminating unit performs a detailed discrimination such as a sex discrimination of an animal or the like and a situation discrimination based on a kind of an animal or the like and further with reference to various detailed databases. The voice guidance system according to any one of (1) and (6).
PCT/JP2002/002676 2001-04-05 2002-03-20 Voice guide system based on cry and song WO2002082413A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
JP2001107696A JP3590869B2 (en) 2001-04-05 2001-04-05 Voice guidance system by bark
JP2001-107696 2001-04-05

Publications (1)

Publication Number Publication Date
WO2002082413A1 true WO2002082413A1 (en) 2002-10-17

Family

ID=18959970

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/JP2002/002676 WO2002082413A1 (en) 2001-04-05 2002-03-20 Voice guide system based on cry and song

Country Status (2)

Country Link
JP (1) JP3590869B2 (en)
WO (1) WO2002082413A1 (en)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP6501260B2 (en) * 2015-08-20 2019-04-17 本田技研工業株式会社 Sound processing apparatus and sound processing method
JP2018099114A (en) * 2016-12-19 2018-06-28 いであ株式会社 Determination/distribution system of position and kind of wild animal
JP2019010436A (en) * 2017-06-30 2019-01-24 ヤマハ株式会社 Biological sensor and signal acquisition method of biological sensor
JP7177631B2 (en) 2018-08-24 2022-11-24 本田技研工業株式会社 Acoustic scene reconstruction device, acoustic scene reconstruction method, and program

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH09179579A (en) * 1995-12-25 1997-07-11 Casio Comput Co Ltd Retrieval device
JP2000056676A (en) * 1998-08-03 2000-02-25 Yamaha Corp Information notification device and terminal device thereof
JP2001077746A (en) * 1999-09-07 2001-03-23 Matsushita Electric Ind Co Ltd Article guidance system
JP3162510B2 (en) * 1992-10-27 2001-05-08 松下電器産業株式会社 Travel position display device provided with voice guidance device

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP3162510B2 (en) * 1992-10-27 2001-05-08 松下電器産業株式会社 Travel position display device provided with voice guidance device
JPH09179579A (en) * 1995-12-25 1997-07-11 Casio Comput Co Ltd Retrieval device
JP2000056676A (en) * 1998-08-03 2000-02-25 Yamaha Corp Information notification device and terminal device thereof
JP2001077746A (en) * 1999-09-07 2001-03-23 Matsushita Electric Ind Co Ltd Article guidance system

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
NAKADAI KAZUHIRO ET AL.: "Humanoid active audition system improved by the cover acoustics", LECTURE NOTES IN ARTIFICIAL INTELLIGENCE 1886, SUBSERIES OF LECTURE NOTES IN COMPUTER SCIENCE, PROCEEDINGS OF 6TH PACIFIC RIM INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE, September 2000 (2000-09-01), pages 544 - 554, XP002951029 *
OKUNO HIROSHI ET AL.: "Blind source separation ni yoru 2 washa doji hatsuwa ninshiki", AL CHALLENGE KENKYUKAI (DAI 1 KAI), THE SOCIETY FOR ARTIFICIAL INTELLIGENCE KENKYUKAI SHIRYO, 7 November 1998 (1998-11-07), pages 1 - 6, XP002951030 *
ONISHI NOBORU: "Robot to ningen gaikai tono shichokaku interface", KEISOKU TO SEIGYO, vol. 34, no. 4, 10 April 1995 (1995-04-10), pages 267 - 273, XP002951028 *

Also Published As

Publication number Publication date
JP2002304191A (en) 2002-10-18
JP3590869B2 (en) 2004-11-17

Similar Documents

Publication Publication Date Title
US10685638B2 (en) Audio scene apparatus
CN112584273B (en) Spatially avoiding audio generated by beamforming speaker arrays
US10645518B2 (en) Distributed audio capture and mixing
JP2022544138A (en) Systems and methods for assisting selective listening
US20150373474A1 (en) Augmented reality sound system
JP6248930B2 (en) Information processing system and program
JP2019518985A (en) Processing audio from distributed microphones
JP2017092732A (en) Auditory supporting system and auditory supporting device
EP2839461A1 (en) An audio scene apparatus
Moore et al. Microphone array speech recognition: Experiments on overlapping speech in meetings
AU2011201731A1 (en) Signal Dereverberation Using Environment Information
CN102316404B (en) Method for localizing audio source and multichannel hearing system
US11644528B2 (en) Sound source distance estimation
CN110545504A (en) Personal hearing device, external sound processing device and related computer program product
CN110996308A (en) Sound playing device and control method, control device and readable storage medium thereof
WO2002082413A1 (en) Voice guide system based on cry and song
JP5261983B2 (en) Voice communication system
JP2024043429A (en) Presence sound field reproducing device and presence sound field reproducing method
JP2024007669A (en) Sound field reproduction program using sound source and position information of sound-receiving medium, device, and method
Amin et al. Impact of microphone orientation and distance on BSS quality within interaction devices
JP2024008112A (en) Voice processing system, voice processing method, and voice processing program
JP2024043430A (en) Sound field presence reproducing device and sound field presence reproducing method
JP2022128177A (en) Sound generation device, sound reproduction device, sound reproduction method, and sound signal processing program
JP2010008859A (en) Content playback system
JP2020127103A (en) Sound field control device, sound field control system, control method for sound field control device, program, and recording medium

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A1

Designated state(s): US

AL Designated countries for regional patents

Kind code of ref document: A1

Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE TR

DFPE Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101)
121 Ep: the epo has been informed by wipo that ep was designated in this application