JP4456423B2

JP4456423B2 - Color-sound converter

Info

Publication number: JP4456423B2
Application number: JP2004188529A
Authority: JP
Inventors: 満良前川; 慎哉橋爪
Original assignee: Ishikawa Prefecture
Current assignee: Ishikawa Prefecture
Priority date: 2004-06-25
Filing date: 2004-06-25
Publication date: 2010-04-28
Anticipated expiration: 2024-06-25
Also published as: JP2006010525A

Description

この発明は、対象物の色を音色や音声で認識させる装置に用いる色−音変換装置に関するもので、光センサで検出した対象物表面の反射光から当該対象物表面の色を認識して音情報を出力する装置に関するものである。 The present invention relates to a color-sound conversion device used in an apparatus for recognizing the color of an object by timbre or voice, and recognizes the color of the object surface from the reflected light of the object surface detected by an optical sensor. The present invention relates to an apparatus for outputting information.

視覚に障害のある者は、日常生活の中で、物体の色や柄が分からず不便を強いられている場面が多い。たとえば、靴下の左右の色合わせや冠婚葬祭での適切な色の服装の選択などである。また、色覚障害者は色名での会話に苦慮しており、対象物の色を色名で知ることも重要である。 People with visual disabilities often have inconveniences in everyday life because they do not know the color or pattern of objects. For example, the right and left color matching of socks and the selection of appropriate clothing at the ceremonial occasion. In addition, people with color blindness struggle to talk by color name, and it is also important to know the color of the object by color name.

対象物の表面からの反射光を受光して音声で人に色を認識させる装置は公知であり、特許文献１には、対象物の色を色名で発声する装置が提案されており、特許文献２及び３には、対象物の色を音色に変換して発音する装置が提案されている。
特開２００２−２２５３７号公報特開昭５６−１３７２３５号公報特開昭６３−１６３１２０号公報月東充：色障害者のための色識別システムについての研究、名古屋市工業技術研究報告、８１（１９９６）１３．高橋廉：色を音声で読み上げる色センサ、トランジスタ技術、３７、７８（２０００）２６０． An apparatus for receiving reflected light from the surface of an object and causing a person to recognize a color by voice is known, and Patent Document 1 proposes an apparatus that utters a color of an object by a color name. Documents 2 and 3 propose an apparatus that generates a sound by converting the color of an object into a timbre.
JP 2002-22537 A JP-A-56-137235 JP 63-163120 A Mitsuru Tsukito: Research on color identification system for color handicapped persons, Nagoya City Industrial Technology Research Report, 81 (1996) 13. Ren Takahashi: Color sensor that reads out sound by voice, transistor technology, 37, 78 (2000) 260.

しかし従来のこの種装置は、対象物の一点の色を音色や音声で出力するに留まり、対象物表面の色の配置や変化の様子を知りたいときには、対象物表面に幾度も装置の受光部を当てて各点毎の音や声から色の分布や変化の様子を推測するしか方法がなかった。 However, this type of conventional device only outputs a single color of the object as a timbre or sound, and when it is desired to know the color arrangement or change of the surface of the object, the light receiving unit of the apparatus is repeatedly applied to the surface of the object. The only way was to guess the color distribution and change from the sound and voice of each point.

また、検出した色を色名で出力するものは、色が徐々に変化しているときにも閾値を越えたときに色名が変わるので、色が徐々に変化しているのか異なる色の領域が境界となる線を境にして隣り合っているかを知ることができない問題があった。 Also, if the detected color is output as a color name, the color name changes when the threshold is exceeded even when the color is gradually changing, so whether the color is changing gradually or different color areas There is a problem that it is not possible to know whether the two are adjacent to each other along the boundary line.

また色を音色で出力するものは、色と音色との関係を習得しないと使用することができず、視覚で色を認識できない視覚障害者は、色と音色との関係を習得する際に補助者を必要とし、かつ色と音色の関係を忘れたりあいまいになったときは、正確に色を認識できなくなるという問題が生じていた。 In addition, those that output colors as timbres cannot be used unless the relationship between colors and timbres is mastered, and visually impaired persons who cannot recognize colors visually can assist in learning the relationship between colors and timbres. When a person is required and the relationship between color and tone is forgotten or ambiguous, there is a problem that the color cannot be recognized accurately.

更に、視覚障害者がＦＡＸやコピーを行うとき、用紙のどちらの面が情報が記載された印刷面であるかを特定する必要がある。しかし、１点の色を認識するだけでは、特定は難しく、複数点の色を認識していたが、情報の記載領域が小さいときは、表裏を特定するのに多数回の操作が必要となり、誤って白紙であると判断してしまう問題もあった。 Furthermore, when a visually impaired person performs FAX or copying, it is necessary to specify which side of the sheet is a printed side on which information is described. However, it is difficult to identify only by recognizing one color, and it has recognized multiple colors, but when the information description area is small, many operations are required to identify the front and back, There was also a problem that it was mistakenly judged to be blank.

本願発明は、受光した光の色を特定するＲＧＢ値やＨＬＳ値などの色要素値を繰返し演算し、各繰返し演算毎に演算した色要素値に対応させた音データを生成し、この音データに対応する音を連続的に又は繰返し発音すると共に、指令信号を受けたときに、前記演算した色要素値に最も近い色要素値を持つ色名を予め登録された複数の色名から選択し、当該選択した色名を発声する、色−音変換装置を提供することにより、上記課題を解決したものである。 The present invention calculates repeatedly the color element values such as RGB values and HLS values specifying the color of the received light, it generates sound data to correspond to color element value computed in each iteration calculation, the sound data When a command signal is received, a color name having a color element value closest to the calculated color element value is selected from a plurality of color names registered in advance. The above-mentioned problem is solved by providing a color-sound conversion device that utters the selected color name.

本願発明に係る色−音変換装置は、請求項１に記載のように、光を受光する光センサ３と、受光した光からその光の色を特定する色要素値を演算する色要素演算手段３３、３５と、演算した色要素値に対応する音データを生成する音データ生成手段３１と、複数の色名と各色名に対応する色要素値とを登録した色名テーブル３４と、演算した色要素値に最も近い色要素値を持った色名を前記色名テーブルから取得する色名取得手段３７と、スピーカ４、９、４２と、指令ボタン２１とを備え、前記音データ生成手段は生成した音を常時スピーカに出力し、前記色名取得手段は、指令信号を受けたときに、選択した色名の声情報をスピーカに出力する構成として実現することができる。 Color according to the present invention - the sound conversion apparatus as claimed in claim 1, an optical sensor 3 for receiving light, color component calculation means for calculating a color element value identifying the color of the light from the received light 33, 35, a sound data generation means 31 for generating sound data corresponding to the calculated color element value, a color name table 34 in which a plurality of color names and color element values corresponding to each color name are registered, and a calculation A color name acquisition means 37 for acquiring a color name having a color element value closest to the color element value from the color name table; speakers 4, 9, 42; and a command button 21; The generated sound is always output to the speaker, and the color name acquisition means can be realized as a configuration that outputs voice information of the selected color name to the speaker when receiving a command signal.

また、本願発明に係る色−音変換装置は、請求項２に記載のように、光を受光する光センサ３と、受光した光からその光の色を特定する色要素値を演算する色要素演算手段３３、３５と、演算した色要素値に対応する音データを生成する音データ生成手段３１と、複数の色名と各色名に対応する色要素値とを登録した色名テーブル３４と、演算した色要素値に最も近い色要素値を持った色名を前記色名テーブルから取得する色名取得手段３７と、スピーカ４、９、４２とを備え、前記音データ生成手段は生成した音を常時スピーカに出力し、前記色名取得手段は、色名の取得を繰返し行って取得した色名が変わったときに、取得した色名の声情報をスピーカに出力する構成として実現することもできる。 The color according to the present invention - the sound conversion apparatus as claimed in claim 2, an optical sensor 3 for receiving light, color computes the color element value identifying the color of the light from the received light elements Calculation means 33, 35, sound data generation means 31 for generating sound data corresponding to the calculated color element value, a color name table 34 in which a plurality of color names and color element values corresponding to each color name are registered, Color name acquisition means 37 for acquiring a color name having a color element value closest to the calculated color element value from the color name table, and speakers 4, 9, 42, and the sound data generation means Is always output to the speaker, and the color name acquisition unit is configured to output voice information of the acquired color name to the speaker when the acquired color name is changed by repeatedly acquiring the color name. it can.

色と音とのマッピングは、種々のものが可能である。例えば色相を無限音階信号にマッピングすることができ、このときは、無限音階信号を含む音データが生成される。 Various mappings between colors and sounds are possible. For example, the hue can be mapped to an infinite scale signal, and at this time , sound data including the infinite scale signal is generated.

また色を決める要素（例えばＲＧＢ）のそれぞれを特定の楽器に対応させて各要素の値を対応する楽器の音の大きさで表現したときは、請求項３に記載のように、少なくとも３種類の楽器の音色を含む音データが生成される。 Further, when each element for determining color (for example, RGB) is associated with a specific musical instrument and the value of each element is expressed by the sound level of the corresponding musical instrument, at least three types as described in claim 3 Sound data including the timbre of the instrument is generated.

また明度について、例えば明るい色を高い音に、暗い色を低い音にして変換したときは、請求項４に記載のように、色の明度に対応して明度が高くなるほど全体としての周波数が高くなる音データが生成される。 As for the brightness, for example, when the light color is converted to a high sound and the dark color is converted to a low sound, the overall frequency increases as the lightness increases corresponding to the lightness of the color as described in claim 4. Is generated.

また彩度について、濁った色はその彩度の低さに対応して音量が大きくなるノイズを加えるようにして変換し、請求項１、２に記載のように色の彩度に対応して彩度が低くなるほど音量が大きくなるノイズを含む音データが生成される。 Also, with regard to the saturation, the turbid color is converted by adding noise that increases the volume corresponding to the low saturation, and corresponding to the saturation of the color as described in claims 1 and 2. Sound data including noise whose volume increases as the saturation decreases is generated.

この発明の装置は、対象に沿ってなぞりながら連続で色情報を音情報に変換して音声出力することができる。また必要に応じて又は自動的に、音色に併せて色名が出力されるので、これまで不可能であった下記のことが可能になる。
(１）色模様の認識
縞模様のような境界のある模様は、対象物の表面に沿って装置を移動しているときに音色が突然変化する。一方、グラデーションは、装置の移動に伴って音色が連続的に変化するので、その相違により色の変化の様子が分かる。また音色がどの方向に変化するかにより、どのような色がどのように配置されているかを認識することができ、移動方向を変えたときの音色の変化を聞くことにより、どのような模様であるかの推測も可能になる。
(２）情報の有無確認
この発明の装置で対象物表面をなぞることにより、色の変化や境界がわかるので、対象物表面に文字などの情報の有無を色の変化として認識でき、印刷面を特定することが容易になる。
(３) 音色と色名の習得と確認
色に対応する音色が出力されているときに、色名が変わったりボタンを押すなどの操作を行うことにより、その音色に対応する色名が発声されるから、音色と色名の対応を常に確認できるので、誤認識を避けることができる。また、最初から補助者なしで装置を使用することができる。 The apparatus of the present invention can continuously convert color information into sound information and output the sound while tracing along the object. Further, since the color name is output together with the timbre as necessary or automatically, the following which has been impossible until now can be performed.
(1) Color pattern recognition A pattern with a border such as a striped pattern suddenly changes in tone when moving the device along the surface of the object. On the other hand, in gradation, since the timbre changes continuously with the movement of the apparatus, the state of color change can be understood by the difference. Also, depending on which direction the timbre changes, it is possible to recognize what color is arranged and how, and by listening to the change of the timbre when the moving direction is changed, what kind of pattern is It is possible to guess if there is.
(2) Confirmation of presence / absence of information By tracing the surface of the object with the apparatus of the present invention, the color change and the boundary can be understood, so the presence / absence of information such as characters on the object surface can be recognized as the color change, It becomes easy to specify.
(3) Acquisition and confirmation of timbre and color name When the timbre corresponding to the color is output, the color name corresponding to the timbre is uttered by changing the color name or pressing a button. Therefore, the correspondence between the timbre and the color name can be confirmed at all times, so that erroneous recognition can be avoided. Also, the device can be used without assistance from the beginning.

以下の記述では、色名を表す言葉の音声情報を声といい、言葉としての意味を持たない音声情報を音という。また、色の変化の様子や境界を認識することを色模様認識と言い、音の種類で色を特定することを色名認識と言う。 In the following description, the voice information of a word representing a color name is called a voice, and the voice information having no meaning as a word is called a sound. In addition, recognizing a color change state and boundary is called color pattern recognition, and specifying a color by sound type is called color name recognition.

１.スタンドアロン型色認識装置
図１は、この発明で用いる色模様認識の確認試験に用いたスタンドアロン型の色認識装置の構成を示した図である。図中、１は公知の声による色名発声装置で、この装置は、光源２とＲＧＢそれぞれのフォトダイオードとを備えた光センサ３と、声を出力するスピーカ４とを備えている。５はパソコンで、このパソコンには所定のアルゴリズムで色のＲＧＢ信号又はＨＬＳ信号を音の属性データに変換する変換手段６を備え、得られた音データがＭＩＤＩデータに変換される。８はＭＩＤＩシンセサイザ、９は音を出力するスピーカである。 1. Stand-Alone Color Recognition Device FIG. 1 is a diagram showing a configuration of a stand-alone color recognition device used in a color pattern recognition confirmation test used in the present invention. In the figure, reference numeral 1 denotes a known voice name utterance device, which includes a light sensor 2 having a light source 2 and RGB photodiodes, and a speaker 4 for outputting a voice. Reference numeral 5 denotes a personal computer. This personal computer is provided with conversion means 6 for converting a color RGB signal or HLS signal into sound attribute data by a predetermined algorithm, and the obtained sound data is converted into MIDI data. 8 is a MIDI synthesizer, and 9 is a speaker for outputting sound.

光センサ３の信号は、Ａ／Ｄコンバータによってデジタル値に変換される。光センサ３は、温度などの条件によって検出値に誤差が出る。そのため図１の色名発声装置１では、色補正用基準データが格納メモリに格納されており、測色を行うときの装置に内蔵された図示しない基準色も同時に読取って色補正用基準データと比較し、その差により対象物１０の測色値を補正している。 The signal of the optical sensor 3 is converted into a digital value by an A / D converter. The optical sensor 3 has an error in the detection value depending on conditions such as temperature. For this reason, in the color name utterance device 1 of FIG. 1, color correction reference data is stored in a storage memory, and a reference color (not shown) incorporated in the device for performing color measurement is also read at the same time to obtain color correction reference data and In comparison, the colorimetric value of the object 10 is corrected based on the difference.

連続で測色するためには、測色時に自動校正機能を作動させて、測色の安定化を図ることが必要である。測色する間隔は、例えば０.１から１.０秒まで０.１秒間隔で変更できるようにするなど、使用者に合わせて調整可能にしてある。 In order to measure colors continuously, it is necessary to stabilize the color measurement by operating an automatic calibration function during color measurement. The color measurement interval can be adjusted according to the user, for example, by changing the interval from 0.1 to 1.0 seconds at intervals of 0.1 seconds.

図１の色名発声装置１は、図示しない指令ボタンによって作動し、当該指令ボタンがおされたとき、ＲＧＢ値を色の三属性（色相：ｈｕｅ、明度：ｌｉｇｈｔｎｅｓｓ、彩度：ｓａｔｕｒａｔｉｏｎ）であるＨＬＳ値に変換し、更にＨＬＳ値を色名データに変換する。色名データとは、ＨＬＳ値を丸めたデータで、ＨＬＳ値に対応する音声データを持つ色名テーブルを検索可能な色数に丸めたデータである。 The color name utterance device 1 in FIG. 1 is operated by a command button (not shown), and when the command button is pressed, the RGB value has three attributes of color (hue: hue, lightness: lightness, saturation: saturation). Conversion to HLS values, and further conversion of HLS values to color name data. The color name data is data obtained by rounding the HLS value and rounding the color name table having audio data corresponding to the HLS value to the number of colors that can be searched.

そして、色名データで図示しない色名テーブルを参照して、対応する色名の音声のサンプリングデータを取得し、Ｄ／Ａコンバータ１１を通しスピーカ４から発声する。その後指令ボタンが押し続けられていれば、図示しない処理切替手段は、連続して測定・演算されているＲＧＢ値またはＨＬＳ値をパソコンにに送信する。 Then, referring to a color name table (not shown) with the color name data, audio sampling data of the corresponding color name is acquired and uttered from the speaker 4 through the D / A converter 11. After that, if the command button is kept pressed, the process switching means (not shown) transmits the RGB value or HLS value continuously measured and calculated to the personal computer.

パソコンで受信したデータは、本発明で提案する色‐音マッピング方式に従った変換アルゴリズムで音に変換される。色に対応した音はＭＩＤＩ（ＭｕｓｉｃａｌＩｎｓｔｒｕｍｅｎｔＤｉｇｉｔａｌＩｎｔｅｒｆｈｃｅ）規格のメッセージとしてパソコンからＭＩＤＩ音源８に送信し、スピーカ９から発音させる。 Data received by the personal computer is converted into sound by a conversion algorithm according to the color-sound mapping method proposed in the present invention. The sound corresponding to the color is transmitted from the personal computer to the MIDI sound source 8 as a MIDI (Musical Instrument Digital Interface) standard message, and is generated from the speaker 9.

２.色-音マッピング方式
色を音に変換するためのマッピング方式として、色の表現方法や音の三要素を考慮に入れ、以下の２つの方式を提案する。 2. Color-sound mapping method The following two methods are proposed as a mapping method for converting color to sound, taking into account the three methods of color expression and sound.

３.無限音階方式
３.１色相と音のマッピング
色の表現方法で代表的な心理的属性による三属性の基本となる、色相のマッピングについて検討する。 3. Infinite scale system 3.1 Hue-to-sound mapping Hue mapping, which is the basis of three attributes based on typical psychological attributes in the color representation method, is examined.

色相は徐々に変化させると環をなすような連続性がある。音でそのような連続性を持つ要素は、Ｃ（ド）、Ｄ（レ）、Ｅ（ミ）・・・Ｃ（ド）という音階である。Ｃから次のＣまでを１オクターブといい、１オクターブ高くなると周波数が２倍に、逆に低くなると１/２になる。この１オクターブを１２の音階で均等に分ける１２平均律により色とマッピングする方法が考えられる。しかし、１オクターブ高いＣは基点のＣと一致せず、環ではなく螺旋構造となるので色の連続性が損なわれてしまう。そこで、この矛盾を解決する方法として、無限音階によるマッピングを行う。この方式を無限音階方式と呼ぶ。 When hue is gradually changed, there is a continuity that forms a ring. Elements having such continuity in sound are scales of C (do), D (le), E (mi)... C (do). From C to the next C is called one octave, and when the frequency is increased by one octave, the frequency is doubled. A method can be considered in which one octave is mapped to a color by 12 equal temperament that equally divides 12 octaves. However, C that is one octave higher does not coincide with C at the base point, and a spiral structure is formed instead of a ring, so color continuity is impaired. Therefore, as a method of solving this contradiction, mapping using an infinite scale is performed. This method is called an infinite scale method.

無限音階は、数個の音で無限に高くなり続けるように知覚的に感じる音階である。これまで実用性がないと言われてきたが、音階の連続性が実現できる点に注目し、色相環と無限音階をマッピングする。 The infinite scale is a scale that perceptually feels as if it is infinitely high with several sounds. Although it has been said that there is no practical use so far, we will focus on the point that scale continuity can be realized and map the hue circle and infinite scale.

３.２明度・彩度と音のマッピング
色相を無限音階でマッピングしただけでは色相しか表現できず、グラデーションのような微妙な色の変化を認識することはできない。色の他の属性である明度や彩度に対するマッピングも必要である。 3.2 Lightness / Saturation and Sound Mapping By mapping hues with an infinite scale, only hues can be expressed, and subtle color changes such as gradation cannot be recognized. Mapping to other attributes of color, lightness and saturation, is also necessary.

明度や彩度をマッピングするために、無限音階に対して可能な色変化について考える。まず、無限音階に対する音高の変化は、無限音階自身が音の高低であるため、音階単位での上下ではなく、無限音階の周波数分布のピークをオクターブ単位の上下で変化させることとする。次に、音色の変化は一般的には楽器音の変化であるが、楽器音が色相の認識に影響を与える危険性もあるために変化させず、楽器音の代わりにノイズ（ホワイトノイズやピンクノイズ）を合成することで音色に変化を持たせることができる。以上のことから、無限音階を色相とし、明度と彩度にマッピングする音の要素は、オクターブ単位での音の上下変化とノイズの付加とする。 To map lightness and saturation, consider possible color changes for an infinite scale. First, the change of the pitch with respect to the infinite scale is that the infinite scale itself is the pitch of the sound, so that the peak of the frequency distribution of the infinite scale is changed up and down in octaves instead of up and down in scale units. Next, the change in timbre is generally a change in instrument sound, but it does not change because there is a risk that the instrument sound affects the recognition of the hue, and instead of the instrument sound, noise (white noise or pink) By combining (noise), the timbre can be changed. From the above, the sound element mapped to lightness and saturation with the infinite scale as the hue is the vertical change of sound and the addition of noise in octave units.

３.３明度・彩度と音の相関実験
音の変化要素として、ホワイトノイズの付加とオクターブ単位の上下変化が、明度と彩度のどちらと相関が強いかを実験により確認した。実験方法は、ホワイトノイズ無し（ＯｄＢ）、４０ｄＢ、６０ｄＢの３種類と、一般的なド（Ｃ）であるＣ４（２６１.６Ｈｚ）を基準とし、２オクターブ上のＣ６（１０４６.５Ｈｚ）、２オクターブ下のＣ２（６５.４Ｈｚ）の３種類の音を組合せた９種類の音を再生し、評価を行った。音は図１の色認識装置で生成し、ＭＩＤＩ音源よりオーボエの音を鳴らし、スピーカ９から５０ｃｍ離れた場所で７０ｄＢとなるよう音量を調整した。ここでホワイトノイズはＭＩＤＩ音源に含まれない音であるため、ＷＡＶ形式で生成し、同時再生した。 3.3 Correlation experiment between lightness / saturation and sound The experiment confirmed whether the addition of white noise and the vertical change in octave units as a sound change factor had a strong correlation with lightness or saturation. The experimental method is based on three types of white noise free (OdB), 40 dB and 60 dB and C4 (261.6 Hz) which is a general de (C), C6 (1046.5 Hz) on 2 octaves, 2 Nine kinds of sounds were combined and evaluated, with three kinds of sounds of C2 (65.4 Hz) under octave. The sound was generated by the color recognition apparatus shown in FIG. 1, and an oboe sound was generated from the MIDI sound source, and the volume was adjusted to 70 dB at a location 50 cm away from the speaker 9. Here, since white noise is a sound that is not included in the MIDI sound source, it is generated in the WAV format and reproduced simultaneously.

この実験により、オクターブを基準として比較すると、ホワイトノイズ音量は彩度に与える影響が大きく、音量を大きくするとくすんだ色に感じると認められた。オクターブの変化は、ホワイトノイズ音量を基準として比較すると、明度に与える影響が大きく、オクターブを上昇させると明るい色に感じると認められた。また、ホワイトノイズを付加するとくすんだイメージが加わり、若干暗く感じるようであるが、ホワイトノイズの音量変化による明度への影響は少ない。 From this experiment, it was recognized that the white noise volume had a great effect on the saturation when compared with the octave as a reference, and that when the volume was increased, it felt a dull color. It was recognized that the octave change had a large effect on the brightness when compared with the white noise volume as a standard, and that when the octave was raised, it felt a bright color. Also, when white noise is added, a dull image is added and the image appears to be slightly dark, but the change in the volume of white noise has little effect on the brightness.

３.４無限音階方式の概要
無限音階方式の概要を図２に示す。色相は１２平均律の無限音階とマッピングし、１２分割した。基点となるＣは色聴研究で共通性の高かった赤とした。次に明度はオクターブの段階的な上下（例えば−２、−１、０、＋１、＋２の５段階）、彩度はノイズの段階的な付加（例えば０、４０、６０ｄＢの３段階）にマッピングした。また、無彩色はノイズのみの音として、白から黒へ段階的に音量を大きくした。本来、白は無音とすべきであるが、測色状態であることを示すために、３５ｄＢ程度の小さい音量とした。この方式に従い、色・音変換アルゴリズムを開発し、１８４色に対応した音が生成できる。この音をＭＩＤＩメッセージとしてＭＩＤＩ音源に送信することで音を再生する。 3.4 Overview of the infinite scale method Figure 2 shows an overview of the infinite scale method. Hue was mapped to 12 equal temperament infinite scales and divided into 12 parts. The base point C was red, which was highly common in color hearing research. Next, lightness is mapped in octave steps up and down (for example, -2, -1, 0, +1, +2), and saturation is mapped to noise in steps (for example, 0, 40, 60 dB). did. As for the achromatic color, the volume was gradually increased from white to black as noise only. Originally, white should be silent, but in order to show that it is in a colorimetric state, the volume is set to a low level of about 35 dB. According to this method, a color / sound conversion algorithm is developed, and a sound corresponding to 184 colors can be generated. The sound is reproduced by transmitting this sound as a MIDI message to the MIDI sound source.

４.三重奏方式
４.１ＲＧＢと音のマッピング
色は光の三原色であるＲＧＢの加法混色により表現することができる。このＲＧＢの各々に楽器音を対応させ、合奏することで色を表現することが可能である。必要な楽器音の数が３つであることから三重奏方式と呼ぶ。例えば、赤＝ピアノ、緑＝オーボエ、青＝ホルンとした場合に、ピアノだけの音ならば赤で、ピアノとオーボエが同じ音量で規定値の最大であれば黄色となる。ＲＧＢに対応する３つの楽器音は、各色をイメージしやすい楽器音を選択することが理想的である。しかし、色聴能力は個人差が大きいことから楽器音を特定することは難しい。そこで、楽器音を分離識別しやすく、不快感を与えなように、表１に示す５種類の三重奏を候補とした。 4. Trio system 4.1 Mapping of RGB and sound Color can be expressed by additive color mixture of RGB which is the three primary colors of light. It is possible to express a color by associating and playing an instrument sound with each of RGB. Since the number of necessary instrument sounds is three, it is called a trio system. For example, when red = piano, green = oboe, and blue = horn, the sound is red if it is a piano-only sound, and yellow if the piano and oboe have the same volume and the maximum value. Ideally, three instrument sounds corresponding to RGB are selected as instrument sounds that are easy to imagine each color. However, it is difficult to specify the instrumental sound because the color hearing ability varies greatly between individuals. Therefore, the five types of trio shown in Table 1 were used as candidates so that the instrument sounds can be easily separated and identified without causing discomfort.

色はＲＧＢの各刺激量に応じて変化するため、刺激量と音のマッピングが必要となる。マッピングする音の要素として「高さ」と「大きさ」が考えられるが、赤などの三原色自体の音は１種類の楽器音だけが聞こえることが望ましいため、自然な形で消音できる「大きさ」とマッピングした（図３）。 Since the color changes according to each of the RGB stimulus amounts, it is necessary to map the stimulus amount and the sound. “Height” and “magnitude” can be considered as elements of the sound to be mapped, but it is desirable to hear only one type of instrument sound for the three primary colors such as red. ”(FIG. 3).

４.２三重奏方式の概要
ＲＧＢに対応する３つの楽器音は、表１の中から好みの組合せを選択可能とした。また、３つの楽器音が同じ音階よりも各楽器音で音階も異なるほうが３音の識別がしやすいと考え、各楽器音にＣ・Ｅ・Ｇを割付け、和音とした。 4.2 Outline of the trio system The three instrument sounds corresponding to RGB can be selected from the combinations shown in Table 1. In addition, it is considered that it is easier to discriminate the three sounds when the three instrument sounds are different from each other than the same scale, and C, E, and G are assigned to each instrument sound to obtain a chord.

楽器間で音量のバランスが悪いと、イメージする色が偏る。ＭＩＤＩの実際の音量は、音量を指定する数値が等しくても楽器音によってバラツキがあった。そこで、どの楽器音でも同じ音量となるように、各楽器の音量を実測し、各楽器の音量補正係数を求めて補正した。音量は６４段階で変化させた。ＲＧＢの各光量をｒ、ｇ、ｂ（０≦ｒ、ｇ、ｂ＜２５６）、３種類の楽器音のＭＩＤＩ音量指定値をＶａ、Ｖｂ, Ｖｃ（０≦Ｖａ、Ｖｂ, Ｖｃ＜６４）とすると、三重奏方式は以下の数式で示すことができる。 If the volume balance between the instruments is poor, the image color will be biased. The actual volume of MIDI varies depending on the instrumental sound even if the numerical values specifying the volume are equal. Therefore, the volume of each instrument was measured and the volume correction coefficient of each instrument was obtained and corrected so that the same volume was obtained for all instrument sounds. The volume was changed in 64 steps. The RGB light amounts are r, g, b (0 ≦ r, g, b <256), and the MIDI volume designation values of the three types of instrument sounds are Va, Vb, Vc (0 ≦ Va, Vb, Vc <64). Then, the trio system can be expressed by the following mathematical formula.

ここで、ｋは定数、α、β、γは音量補正係数である。

Here, k is a constant, and α, β, and γ are volume correction coefficients.

この方式に従った色‐音変換アルゴリズムにより音を生成するＭＩＤＩメッセージを作成し、マルチティンバー機能を使ってＭＩＤＩ音源に送信し、音を再生する。これにより、無限音階よりも連続的な音が生成できる。 A MIDI message for generating a sound is generated by a color-sound conversion algorithm according to this method, and is transmitted to a MIDI sound source using a multitimbral function to reproduce the sound. Thereby, a continuous sound can be generated rather than an infinite scale.

５．実証試験結果
色を音色で呈示する方法として、この発明で提案した無限音階方式と三重奏方式について、複数人を対称とする検証試験を行った。被験者は１０〜３０代の晴眼者８人（男性５人、女性３人）と視覚障害者２人（３０代女性、５０代男性）で、晴眼者には目隠しをしてもらい、色模様サンプルに沿って測色部を移動させ、移動にともなう音の変化を聞かせ、（１）グラデーションか縞模様か？（２）縞模様なら、色数と色名は？（３）グラデーションなら、何色から何色への変化か？などの質問に回答してもらうことにより行った。検証試験により得られた結果は以下のとおりである。
・色模様の識別では、三重奏方式が無限音階方式より有効であった。しかし、無限音階方式も高い識別率であり、非常に有効な方式と考える。
・色模様の識別では、両マッピング方式とも視覚障害や音階同定能力による差はなかった。
・無限音階方式による色名認識では、音階同定能力が大きく影響し、音階同定能力の高い人ほど正答率が高かった。
・音階同定能力の低い人の色名認識では、三重奏方式の方が有効であった。
・視覚障害者が本システムにより色模様を認識できることが確認できた。 5). Verification Test Results As a method of presenting colors with timbres, a verification test was performed in which the infinite scale method and the trio method proposed in the present invention were symmetric. The test subjects were 8 sighted people in their 10s and 30s (5 men, 3 women) and 2 visually impaired people (30s women, 50s men). Move the colorimetric part along the line and hear the change in the sound as it moves. (1) Is it gradation or striped? (2) If it is a striped pattern, what is the number of colors and the color name? (3) If it is gradation, how many colors will it change? It was done by getting answers to such questions. The results obtained from the verification test are as follows.
-The trio system was more effective than the infinite scale system in identifying color patterns. However, the infinite scale method has a high identification rate and is considered to be a very effective method.
-There was no difference in color pattern identification between the two mapping methods due to visual impairment or scale identification ability.
・ In the color name recognition by the infinite scale method, the scale identification ability was greatly affected, and the higher the scale identification ability, the higher the correct answer rate.
-The trio method was more effective in recognizing color names for people with low scale identification ability.
・ It was confirmed that visually impaired persons can recognize color patterns with this system.

６.携帯型色認識装置
図１のようなスタンドアロン型の装置は、実用には不便である。図４は、この発明に係る携帯型の色認識装置の基本構成の例を示した図である。装置は、測色部２０、指令ボタン２１、音声データ生成部３０、及び出力部４０に大別される。測色部２０は、光センサ３、Ａ／Ｄコンバータ２２、色補正用基準データを格納するメモリ２３、及び色補正用の比較演算機２４を備える。 6. Portable Color Recognition Device A stand-alone type device as shown in FIG. 1 is inconvenient for practical use. FIG. 4 is a diagram showing an example of a basic configuration of a portable color recognition apparatus according to the present invention. The apparatus is roughly divided into a color measurement unit 20, a command button 21, an audio data generation unit 30, and an output unit 40. The color measurement unit 20 includes an optical sensor 3, an A / D converter 22, a memory 23 for storing color correction reference data, and a comparison calculator 24 for color correction.

音声データ生成部３０は、音データ生成部３１と声データ生成部３２とを備える。音データ生成部は測色した色に前述したようなマッピング方法で対応させた音データを出力し、声データ生成部３２は測色した色の色名に対応する声データを出力する。出力部４０は、Ｄ／Ａコンバータ４１とスピーカ４２とを備えており、測色した色に対応する音と声とは、一つのスピーカ４２から出力される。 The audio data generation unit 30 includes a sound data generation unit 31 and a voice data generation unit 32. The sound data generation unit outputs sound data corresponding to the measured color by the mapping method as described above, and the voice data generation unit 32 outputs voice data corresponding to the color name of the measured color. The output unit 40 includes a D / A converter 41 and a speaker 42, and sounds and voices corresponding to the measured colors are output from one speaker 42.

図５は、声データ生成部３２の構成を示した図で、図１で示した色名発声装置で用いられているものと同様な構成である。すなわち、演算機３３でＲＧＢ値を色の三属性のＨＬＳ値に変換し、更に色の三属性データを色名データに変換する。色名データとは、ここではＪＩＳＺ８１０２に定義されている色の事とし、色の３属性情報を丸めて、音声データに対応する色名データを持つ色名テーブル３４にある色数まで減らす。 FIG. 5 is a diagram showing the configuration of the voice data generation unit 32, which is the same configuration as that used in the color name utterance device shown in FIG. That is, the computing unit 33 converts the RGB value into a color three-attribute HLS value, and further converts the color three-attribute data into color name data. Here, the color name data is a color defined in JIS Z8102, and the three attribute information of the color is rounded down to the number of colors in the color name table 34 having the color name data corresponding to the audio data.

そして、色名データで色名テーブル３４を参照して、取得した色名の声情報(サンプリングデータ)をＤ／Ａコンバータ(図４参照)を通しスピーカ４２から発声する。 Then, referring to the color name table 34 with the color name data, voice information (sampling data) of the acquired color name is uttered from the speaker 42 through the D / A converter (see FIG. 4).

図６は、三重奏方式によるときの音データ生成部３１の構成の例を示した図である。すなわち、演算器３５でＲＧＢ値からそれぞれの楽器のパートの音量を算出し、それぞれの音量をシンセサイザ３６に対して命令することで行う。 FIG. 6 is a diagram showing an example of the configuration of the sound data generation unit 31 when using the trio system. That is, the calculator 35 calculates the volume of each musical instrument part from the RGB values and instructs the synthesizer 36 to set the volume of each instrument.

シンセサイザ３６は、一般的な構成のもので、図には電圧制御発振器（ＶＣＯ）、電圧制御フィルタ（ＶＣＦ）、電圧制御アンプ（ＶＣＡ）、ノイズジェネレータ、音色格納メモリ及びミキサーを備えたものとして示しているが、あくまで模式的に表した図である。実際にはそれぞれに信号を送るのではなく、シンセサイザ３６が持つ命令に変換して送られる。図の処理に限定されるものではなく、音階と音色を持つ複数音を発生させる装置全てに適用できる。 The synthesizer 36 has a general configuration, and is shown in the figure as having a voltage controlled oscillator (VCO), a voltage controlled filter (VCF), a voltage controlled amplifier (VCA), a noise generator, a tone color storage memory, and a mixer. However, it is a diagram schematically shown to the last. Actually, instead of sending a signal to each, it is converted into a command held by the synthesizer 36 and sent. The present invention is not limited to the processing shown in the figure, and can be applied to all devices that generate a plurality of sounds having a scale and a timbre.

一般的なシンセサイザーは、ＶＣＯで目的の音の基準周波数を発生させ、ＶＣＦで目的の音色のフィルターをかけ、ＶＣＡでＡＤＳＲ（アタック、ディケイ、サスティン、リリース）と呼ばれる値を利用して音量と音の立ち上がりと減衰を制御する。ＶＣＯでは音程を、ＶＣＦとＶＣＡで音色を、ＶＣＡで音量をそれぞれ制御する。これらは、発音する音１音分(１パート)ずつにそれぞれある。 A typical synthesizer generates a reference frequency of the target sound with the VCO, filters the target tone with the VCF, and uses a value called ADSR (Attack, Decay, Sustain, Release) with the VCA to produce volume and sound. Controls the rise and decay of The VCO controls the pitch, the VCF and VCA control the tone, and the VCA controls the volume. Each of these sounds is one sound (one part).

ＲＧＢ音で鳴らす場合、例えば、Ｒにピアノの音色、Ｇにトランペットの音色、Ｂにバイオリンの音色を設定し、検出したＲＧＢ値のそれぞれの値を対応する楽器の音量として、シンセサイザ３６に対し、それぞれの音色を処理するチャンネルに音色データと音量データを命令することで発音する。 In the case of sounding with RGB sounds, for example, a piano tone is set for R, a trumpet tone is set for G, and a violin tone is set for B, and each detected RGB value is set as the volume of the corresponding instrument. Sound is generated by instructing tone color data and volume data to a channel for processing each tone color.

図７は、前述した無限音階方式によるときの音データ生成部３１の構成の例を示した図である。すなわち、演算器３３でＲＧＢ値を色の３属性ＨＬＳ値に変換し、色相Ｈ、彩度Ｌ、明度Ｓそれぞれの値によって、音階、音程、ノイズ音量を指令する。 FIG. 7 is a diagram showing an example of the configuration of the sound data generation unit 31 when the infinite scale method is used. That is, the computing unit 33 converts the RGB value into a three-attribute HLS value of color, and commands the scale, pitch, and noise volume according to the values of hue H, saturation L, and brightness S, respectively.

色相の値は、Ｃ(ド),Ｃ#(ド＃),Ｄ(レ),Ｄ#(レ＃),Ｅ(ミ),Ｆ(ファ),Ｆ#(ファ＃),Ｇ(ソ),Ｇ#(ソ＃),Ａ(ラ),Ａ#(ラ＃),Ｂ(シ)の１２音階にそれぞれ割り当てる。 Hue values are C (do), C # (do #), D (le), D # (le #), E (mi), F (fa), F # (fa #), G (so) , G # (So #), A (La), A # (La #), and B (B) are assigned to 12 scales.

明度の値は、オクターブとして、低い明度では低いオクターブで、高い明度では高いオクターブで鳴るようにする。彩度の値は、ノイズの音量とし、低い彩度になるほどノイズの音量が大きくなるようにする。このように作成した音階データを、三重奏方式で示したようなシンセサイザー３６に命令することで発音する。 The lightness value is octave so that it sounds low octave at low lightness and high octave at high lightness. The saturation value is the noise volume, and the noise volume increases as the saturation decreases. The musical scale data created in this way is pronounced by commanding the synthesizer 36 as shown in the trio system.

指令ボタン２１は、動作スイッチであると共に音と声を切替える切替器として作動し、指令ボタン２１が操作されたとき声データ生成部３２に信号を送って色名を一回発声させたあと音データ生成部３１に戻って、指令ボタンが押し続けられる間、測色部２０の信号を音データ生成部３１に送り続ける。従って、指令ボタン２１を押すと最初に色名が発声され、指令ボタンを押し続けて対象物表面をなぞると音が色の変化に応じて変化しながら発音され、適時指令ボタン２１を一旦離して再び押せば、そのときに検出している色の色名が発声される。 The command button 21 is an operation switch and operates as a switch for switching between sound and voice. When the command button 21 is operated, the command button 21 sends a signal to the voice data generation unit 32 to utter the color name once and then the sound data. Returning to the generation unit 31, the signal of the color measurement unit 20 is continuously sent to the sound data generation unit 31 while the command button is kept pressed. Therefore, when the command button 21 is pressed, the color name is first uttered. When the command button is continuously pressed and the object surface is traced, the sound is generated while changing according to the change in color, and the command button 21 is released once when appropriate. If pressed again, the name of the color detected at that time is spoken.

指令ボタン２１の操作による装置の基本的な使用方法は、測色部を対象に押し当て、指令ボタンを押すと起点の色名が聞け、さらに指令ボタンを押し続けると色に対応した音が鳴るので、使用者が対象物をなぞりながら色の変化を音の変化として聞くことで色模様が認識できる。色と音のマッピングに慣れない初期段階では、音が変化したときに適時指令ボタンを押し、その都度色名を聞きながら色の変化を認識する。また習熟することで、音だけで何色であるか認識できるようになり、より効率的な色認識が可能になる。 The basic method of using the device by operating the command button 21 is to press the colorimetric unit, press the command button to hear the name of the starting color, and continue to press the command button to make a sound corresponding to the color. Therefore, the user can recognize the color pattern by listening to the color change as a sound change while tracing the object. At the initial stage, when you are not used to color and sound mapping, press the timely command button when the sound changes, and recognize the color change while listening to the color name each time. In addition, by learning, it becomes possible to recognize how many colors it is with only sound, and more efficient color recognition becomes possible.

色を音で認識しているときに、色名の発声を自動的に行わせることもできる。図８は、対象物の表面に沿って装置を移動させているとき、色名が変わったときに自動的に色名を発声する声データ生成部３２の構成を示した図である。図に示すように、色の三属性を色名データに変換したのち、前回に記録した色名データとの比較を行い、色名データが変化したときに測色部２０の信号を出力する。 When the color is recognized by sound, the color name can be automatically uttered. FIG. 8 is a diagram showing a configuration of the voice data generation unit 32 that automatically utters a color name when the color name changes when the apparatus is moved along the surface of the object. As shown in the figure, after the three color attributes are converted into color name data, the color name data recorded last time is compared, and when the color name data changes, a signal of the color measuring unit 20 is output.

上記実施態様においては、色と音のマッピング方式として、無限音階方式と三重奏方式を例示した。これらの方式は、人間の知覚に適合した好ましい方式であるが、この発明は、勿論これらの方式に限定されるものではない。例えば他の方式として、色相を複数の楽器音に対応させ、色の連続性は隣接された楽器を同時に鳴らし、音量を一方は大から小へ、もう一方は小から大へ変化させることで色相の連続性を出す方式も考えられ、これに彩度表現にノイズを付加し、明度表現は音の周波数の高低あるいは高低の周波数の比率を変える、またノイズとして風や波などの自然の音を用いるなど、種々の方式が考えられる。 In the said embodiment, the infinite scale system and the trio system were illustrated as a color and sound mapping system. These methods are preferable methods adapted to human perception, but the present invention is of course not limited to these methods. For example, as another method, the hue corresponds to multiple instrument sounds, the color continuity is played simultaneously by neighboring instruments, and the volume is changed by changing the volume from high to low on one side and from small to high on the other side. There is also a method of adding continuity of noise, adding noise to chroma expression, brightness expression changing the frequency ratio of sound frequency or changing the ratio of high and low frequency, and natural noise such as wind and waves as noise Various methods such as use are conceivable.

スタンドアロン型の色認識装置の例を示した図Diagram showing an example of a stand-alone color recognition device 無限音階方式の概要を示す図Figure showing an overview of the infinite scale method 三重奏方式の概要を示す図Diagram showing the outline of the trio system 携帯型の色認識装置の基本構成の例を示した図The figure which showed the example of the basic composition of the portable color recognition device 声データ生成部の例を示した図The figure which showed the example of the voice data generation part 三重奏方式によるときの音データ生成部の例を示した図The figure which showed the example of the sound data generation part at the time of a trio system 無限音階方式によるときの音データ生成部の例を示した図The figure which showed the example of the sound data generation part at the time of the infinite scale method 自動的に色名を発声する声データ生成部の例を示した図The figure which showed the example of the voice data generation part which utters a color name automatically

Explanation of symbols

３光センサ
４スピーカー
９スピーカ
21 音-声切換器
31 音データ生成部
32 声データ生成部
34 色名テーブル
42 スピーカ 3 Optical sensor 4 Speaker 9 Speaker
21 Sound-voice switch
31 Sound data generator
32 Voice data generator
34 Color name table
42 Speaker

Claims

A light sensor (3) for receiving light, color element calculation means (33, 35) for calculating a color element value for specifying the color of the light from the received light, and sound data corresponding to the calculated color element value A sound name generating means (31) to generate, a color name table (34) in which a plurality of color names and color element values corresponding to each color name are registered, and a color element value closest to the calculated color element value A color name acquisition means (37) for acquiring a color name from the color name table, a speaker (4, 9, 42), and a command button (21), and the sound data generation means adjusts the color saturation. Correspondingly, sound data including noise that increases in volume as the saturation is lowered is generated and output to the speaker, and the color name acquisition unit receives the command signal from the command button and selects the selected color. A color-sound conversion device that outputs voice information of a name to a speaker.

A light sensor (3) for receiving light, color element calculation means (33, 35) for calculating a color element value for specifying the color of the light from the received light, and sound data corresponding to the calculated color element value A color name table (34) in which sound data generation means (31) to be generated, a plurality of color names and color element values corresponding to each color name are registered, and a color element value closest to the calculated color element value A color name acquisition means (37) for acquiring a color name from the color name table; and a speaker (4, 9, 42), wherein the sound data generation means has a low saturation corresponding to the saturation of the color. The sound data including noise that increases the volume is generated and output to the speaker, and the color name acquisition unit repeatedly acquires the color name when the acquired color name changes. A color-sound conversion device that outputs voice information to a speaker.

The color-sound converter according to claim 1 or 2, wherein the sound data is sound data including at least three kinds of musical instrument tones .

The color-sound conversion apparatus according to claim 1 or 2, wherein the sound data is sound data having an overall frequency that increases as the lightness increases corresponding to the lightness of the color.