JP4657532B2 - Shape transfer device - Google Patents
Shape transfer device Download PDFInfo
- Publication number
- JP4657532B2 JP4657532B2 JP2001275845A JP2001275845A JP4657532B2 JP 4657532 B2 JP4657532 B2 JP 4657532B2 JP 2001275845 A JP2001275845 A JP 2001275845A JP 2001275845 A JP2001275845 A JP 2001275845A JP 4657532 B2 JP4657532 B2 JP 4657532B2
- Authority
- JP
- Japan
- Prior art keywords
- sound
- image data
- shape
- output
- image
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
Images
Abstract
Description
【0001】
【発明の属する技術分野】
本発明は、画像読取装置により認識され読取られた画像データ(1)の形状及び特徴を音データに変換し出力する手法である。
【0002】
【従来の技術】
従来の画像読取装置より認識され読み取られた画像データ(1)の情報を使用者に伝達する為の出力方法として、イメージスキャナやプリンタに見られる、入力された画像データ(1)をモニター又は用紙に出力し、視覚に伝達する方法が採られている。また、視覚障害者向けの歩行に見られる、認識された画像データ(1)の形状を振動および電気的刺激を発生させる事により、触覚に伝達を行なう方法、および入力された画像データ(1)の形状を、音を用いて聴覚に伝達を行なう方法が用いられている。
【0003】
【発明が解決しようとする課題】
従来の方法において、認識され読み取られた画像データ(1)の情報をモニター又は用紙へ出力する場合、暗所などの視界不良な場所で活動する場合をはじめとした視界および視覚に何らかの障害がある状態では、画像データ(1)を視覚にて認識する事は難しいという問題がある。また振動又は電気的刺激を発生させ触覚に伝達する方法、及び音を用いて聴覚に伝達する方法を用いた場合、読み取られた画像データ(1)の形状及び特徴を把握する事が難しいという問題がある。
【0004】
【課題を解決するための手段】
【0005】
CCDカメラやスキャナなどの画像読取装置より読取られた画像データ(1)の輪郭情報(2)を抽出する事で、画像データ(1)の形状及び特徴を抽出する。具体的には、画像データ(1)の横方向をX軸とし、画像データ(1)の縦方向をY軸とし、X軸方向に画像読取を行ない、画像データ(1)の形状及び特徴として抽出した輪郭情報(2)を音データに変換する。この時、X軸を音の出力される位置情報(4)に置き換え、Y軸を音の周波数(5)に置き換える事により画像データ(1)の形状及び特徴を音データとして出力し、視覚にて認識する事が困難であった視覚に障害を持つ者が画像データ(1)の形状及び特徴を聴覚にて認識する事が可能となる。
【0006】
【発明の実施の形態】
物体そのものが2次元もしくは3次元の場合でも、物体の輪郭情報(2)は2次元で表現することが可能である。又、人間の聴覚は音源定位(3)と呼ばれる出力される音がどの方向から聞こえるか位置を判定する機能にて、出力される音の方向を認識し、出力される音圧レベルによる音の音量の大小、及び音の周波数(5)により出力される音の音程の高低差及び音色を認識している。そこで、画像データ(1)の横方向をX軸とし、画像データ(1)の縦方向をY軸とし、X軸方向に画像読取を行なう事で画像データ(1)の輪郭情報(2)を画像データ(1)の形状及び特徴として抽出する。抽出された輪郭情報(2)を、音データに変換し出力する事で画像データ(1)の形状及び特徴を聴覚にて認識する事を可能にする。音データ変換時においてはX軸を音の出力される位置情報(4)に置き換え、Y軸を音の周波数(5)に置き換える。しかし、人間の聴覚能力において複数同時に出力する同一周波数の認識能力は無く、又、複数同時に出力する異なる周波数の認識能力には個人差がある。さらに出力される音の位置を認識する音源定位(3)においては、後から出力される音の位置の認識能力よりも、前及び左右から出力される音の位置の認識能力が高いことから、音の出力される位置の情報(4)を左側よりX軸方向に時間走査的に出力する事により、画像データ(1)の特徴を音データとして出力する。例として、画像データ(1)を音データとして出力を開始してよりnポイント(6)n+1ポイント(7)n+2ポイント(8)経過した各ポイントでは、出力される音データが異なる。nポイント(6)では、左側より異なる周波数の音が2音同時出力される。n+1ポイント(7)では中央より異なる周波数の音が2音同時出力されるが、nポイント(6)時に比べ周波数の高い音と周波数の低い音が出力される。n+2ポイント(8)では、音は出力されない。
【0007】
図11において、スキャナ入力部より画像データ(1)を画像読取後、画像変換(9)にて二値化処理(10)を行なう。画像読取された画像データ(1)は特徴抽出処理(11)として輪郭線追跡処理(12)を行ない、画像データ(1)の特徴を抽出する。輪郭情報(2)画像計測(13)後、画像データ(1)から音データにテーブルを置き換え(14)、画像読取を行なったX方向を音の出力される位置情報(4)、輪郭情報(2)の縦方向をY座標とし、ソート(15)を行なう。ソート(15)された画像データ(1)は、音データとして出力する。
【0008】
図12において、画像データと特定のポイントを音データに変換する例を示す。
図では、取り込まれた画像(21)と、特定のポイント(20)が画面上に混在する。特定のポイント(20)の位置は、特定のポイントが位置する周波数(22)と、特定のポイントが位置する音量(23)により解析される。
図内の(21)L.chと(21)R.chは、取り込まれた画像(21)を解析した時の左右の出力レベルと波形をイメージした波形であり、取り込まれた画像(21)の形状により周波数と左右の出力レベルが変化する。
図内の(20)L.chAと(20)R.chAは、特定のポイント(20)の位置を解析し特定のポイントが位置する周波数(22)と、特定のポイントが位置する音量(23)によって周辺の画像データと聞き分けが出きるように一定間隔で発信と無発信を繰り返した例であり、図内の(20)L.chBと(20)R.chBは、特定のポイント(20)の位置を解析し特定のポイントが位置する周波数(22)を、特定のポイントが位置する音量(23)によって、特定のポイント(20)が位置する横方向を再生する時間に一定間隔で発信した例である。
そして、図内の(21)L.chと(20)L.chA又は(20)L.chBの波形と、(21)R.chと(20)R.chA又は(20)R.chBの波形は各々合成して出力する。
【0009】
【図面の簡単な説明】
【図1】 画像データの一例を表す
【図2】 画像データの輪郭情報を抽出した図
【図3】 画像データを音データに変換する定義を表した図
【図4】 2次元図形の輪郭情報を抽出し、2軸データとして表す
【図5】 3次元図形の輪郭情報を抽出し、2軸データとして表す
【図6】 音源定位(3)を表わした図。
【図7】 取り込んだ画像に対して出力される音データを表わす図。
【図8】 出力される音データ(6)を表わす図。
【図9】 出力される音データ(7)を表わす図。
【図10】 出力される音データを表わす図。
【図11】 画像データを音データに変換するフローの例
【図12】 画像データと特定のポイントを音データに変換する例
【符号の説明】
1 画像データ
2 輪郭情報
3 音源定位
4 音の出力される位置情報
5 音の周波数
6 nポイント
7 n+1ポイント
8 n+2ポイント
9 画像変換
10 二値化処理
11 特徴抽出処理
12 輪郭線追跡処理
13 画像計測
14 テーブル置換
15 ソート
20 特定のポイント
21 取りこまれた画像
22 特定のポイントが位置する周波数
23 特定のポイントが位置する音量[0001]
BACKGROUND OF THE INVENTION
The present invention is a technique for converting the shape and characteristics of image data (1) recognized and read by an image reading device into sound data and outputting the sound data.
[0002]
[Prior art]
As an output method for transmitting information of image data (1) recognized and read by a conventional image reading apparatus to a user, the input image data (1) found on an image scanner or printer is displayed on a monitor or paper. The method is used to output to and visually transmit. In addition, a method of transmitting the shape of the recognized image data (1) seen in walking for the visually handicapped person to the sense of touch by generating vibration and electrical stimulation, and the input image data (1). The method of transmitting the shape of the sound to the auditory sense using sound is used.
[0003]
[Problems to be solved by the invention]
In the conventional method, when the information of the recognized and read image data (1) is output to a monitor or paper, there are some obstacles in the field of vision and vision, including the case of operating in a place with poor visibility such as a dark place. In the state, there is a problem that it is difficult to visually recognize the image data (1). In addition, it is difficult to grasp the shape and characteristics of the read image data (1) when using a method of generating vibration or electrical stimulation and transmitting it to the tactile sense and a method of transmitting to the auditory sense using sound. There is.
[0004]
[Means for Solving the Problems]
[0005]
By extracting the contour information (2) of the image data (1) read by an image reading device such as a CCD camera or a scanner, the shape and characteristics of the image data (1) are extracted. Specifically, the horizontal direction of the image data (1) is the X axis, the vertical direction of the image data (1) is the Y axis, and the image is read in the X axis direction. As the shape and characteristics of the image data (1), The extracted contour information (2) is converted into sound data. At this time, the X-axis is replaced with the position information (4) where the sound is output, and the Y-axis is replaced with the sound frequency (5), so that the shape and characteristics of the image data (1) are output as sound data. It is possible for a visually impaired person who is difficult to recognize to recognize the shape and characteristics of the image data (1) by hearing.
[0006]
DETAILED DESCRIPTION OF THE INVENTION
Even when the object itself is two-dimensional or three-dimensional, the contour information (2) of the object can be expressed in two dimensions. In addition, the human auditory sense is called sound source localization (3), which recognizes the direction of the output sound with a function for determining the direction from which the output sound can be heard, and the sound of the output sound pressure level is detected. It recognizes the pitch difference and tone color of the sound output by the volume level and the sound frequency (5). Therefore, the horizontal direction of the image data (1) is taken as the X axis, the vertical direction of the image data (1) is taken as the Y axis, and image information is read in the X axis direction to obtain the contour information (2) of the image data (1). Extracted as the shape and features of the image data (1). By converting the extracted contour information (2) into sound data and outputting it, the shape and characteristics of the image data (1) can be recognized by hearing. At the time of sound data conversion, the X axis is replaced with position information (4) from which sound is output, and the Y axis is replaced with sound frequency (5). However, in human hearing ability, there is no recognition ability for the same frequency that is output simultaneously, and there are individual differences in recognition ability for different frequencies that are output simultaneously. Furthermore, in the sound source localization (3) for recognizing the position of the sound to be output, the ability to recognize the position of the sound output from the front and left and right is higher than the ability to recognize the position of the sound output later. By outputting the position information (4) where the sound is output from the left side in the X-axis direction in a time-scanning manner, the characteristics of the image data (1) are output as sound data. As an example, the output sound data is different at each point when n points (6), n + 1 points (7), and n + 2 points (8) have elapsed since the start of outputting image data (1) as sound data. At n point (6), two sounds of different frequencies are output simultaneously from the left side. At n + 1 point (7), two sounds having different frequencies are output simultaneously from the center, but a sound having a higher frequency and a sound having a lower frequency than those at n point (6) are output. At n + 2 points (8), no sound is output.
[0007]
In FIG. 11, after image data (1) is read from the scanner input unit, binarization processing (10) is performed by image conversion (9). The image data (1) that has been read is subjected to outline tracking processing (12) as feature extraction processing (11) to extract the features of the image data (1). Contour information (2) After image measurement (13), the table is replaced with sound data from image data (1) (14), position information (4) in which sound is output in the X direction in which image reading is performed, contour information ( Sorting (15) is performed with the vertical direction of 2) as the Y coordinate. The sorted (15) image data (1) is output as sound data.
[0008]
FIG. 12 shows an example of converting image data and specific points into sound data.
In the figure, the captured image (21) and the specific point (20) are mixed on the screen. The position of the specific point (20) is analyzed by the frequency (22) where the specific point is located and the volume (23) where the specific point is located.
(21) L. ch and (21) R.M. ch is a waveform in which the left and right output levels and the waveform when the captured image (21) is analyzed, and the frequency and the left and right output levels change depending on the shape of the captured image (21).
(20) L. chA and (20) R. The chA analyzes the position of a specific point (20), and is spaced at regular intervals so that it can be distinguished from surrounding image data by the frequency (22) where the specific point is located and the volume (23) where the specific point is located. In the example shown in FIG. chB and (20) R. chB analyzes the position of the specific point (20), determines the frequency (22) at which the specific point is located, and the horizontal direction in which the specific point (20) is located by the volume (23) at which the specific point is located. This is an example of transmission at regular intervals during the playback time.
And (21) L. ch and (20) L. chA or (20) L. chB waveform and (21) R.R. ch and (20) R.M. chA or (20) R.I. The chB waveforms are synthesized and output.
[0009]
[Brief description of the drawings]
[Fig. 1] An example of image data [Fig. 2] A diagram extracting contour information of image data [Fig. 3] A diagram showing a definition for converting image data into sound data [Fig. 4] [Fig. Is extracted and expressed as 2-axis data. [FIG. 5] The contour information of a three-dimensional figure is extracted and expressed as 2-axis data. [FIG. 6] A diagram showing sound source localization (3).
FIG. 7 is a diagram illustrating sound data output for a captured image.
FIG. 8 is a diagram showing sound data (6) to be output.
FIG. 9 is a diagram showing sound data (7) to be output.
FIG. 10 is a diagram showing sound data to be output.
11 is an example of a flow for converting image data into sound data. FIG. 12 is an example of converting image data and a specific point into sound data.
DESCRIPTION OF
Claims (2)
画像データ(1)の横方向をX軸とし、画像データ(1)の縦方向をY軸とし、X軸方向に画像読取を行ない、画像データ(1)上の読取時のX座標に存在する図形の輪郭から当該輪郭のX座標およびY座標を含む輪郭情報(2)を抽出し、
当該輪郭情報(2)のX座標を、出力される音がどの方向から聞こえるかを判定する人間の聴覚機能である音源定位(3)を利用した音の出力される位置の情報(4)に置き換え、Y座標を音の周波数(5)に置き換え、
画像データ(1)のX軸方向に走査を行って、走査時のX座標に対応する位置の情報(4)と音の周波数(5)により特定される音を出力する事により、
画像データ(1)の形状及び特徴を聴覚にて認識させる事を可能とする形状伝達装置。A device for outputting the shape and characteristics of image data (1) recognized and read by an image reading device such as a CCD camera or a scanner as sound ,
The horizontal direction of the image data (1) is the X-axis, the vertical direction of the image data (1) is the Y-axis, the image is read in the X-axis direction, and exists at the X coordinate at the time of reading on the image data (1). Contour information (2) including the X coordinate and Y coordinate of the contour is extracted from the contour of the figure ,
The X coordinates of the contour information (2), the sound source localization (3) location of the information in the output of sound using the sound output is judged human hearing functions or heard from which direction (4) Replace the Y coordinate with the sound frequency (5) ,
By scanning the image data (1) in the X-axis direction and outputting the sound specified by the position information (4) and the sound frequency (5) corresponding to the X coordinate at the time of scanning ,
Shape transmission apparatus capable Ru is recognized by hearing the shape and characteristics of the image data (1).
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2001275845A JP4657532B2 (en) | 2001-09-12 | 2001-09-12 | Shape transfer device |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2001275845A JP4657532B2 (en) | 2001-09-12 | 2001-09-12 | Shape transfer device |
Publications (2)
Publication Number | Publication Date |
---|---|
JP2003084784A JP2003084784A (en) | 2003-03-19 |
JP4657532B2 true JP4657532B2 (en) | 2011-03-23 |
Family
ID=19100639
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
JP2001275845A Expired - Fee Related JP4657532B2 (en) | 2001-09-12 | 2001-09-12 | Shape transfer device |
Country Status (1)
Country | Link |
---|---|
JP (1) | JP4657532B2 (en) |
Families Citing this family (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP4891375B2 (en) * | 2009-09-28 | 2012-03-07 | 昌弘 黒田 | Image hearing device |
CN107157651A (en) * | 2017-06-13 | 2017-09-15 | 浙江诺尔康神经电子科技股份有限公司 | A kind of visual pattern sensory perceptual system and method based on sonic stimulation |
Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH03184540A (en) * | 1989-07-27 | 1991-08-12 | Philips Gloeilampenfab:Nv | Image-voice conversion device |
JPH08252279A (en) * | 1995-03-17 | 1996-10-01 | Hitachi Ltd | Route guiding method for presenting object with sound |
JPH0962866A (en) * | 1995-08-22 | 1997-03-07 | Nec Corp | Information presentation device |
JPH09258946A (en) * | 1996-03-26 | 1997-10-03 | Fujitsu Ltd | Information processor |
JPH11102280A (en) * | 1994-12-16 | 1999-04-13 | Hitachi Ltd | Sound output method for image information |
JP2000241544A (en) * | 1999-02-18 | 2000-09-08 | Techno Soft Systemnics:Kk | Portable non-contact obstacle distance meter for visually handicapped person |
JP2001084484A (en) * | 1999-09-13 | 2001-03-30 | Tamotsu Okazawa | Method and device for scene recognition |
-
2001
- 2001-09-12 JP JP2001275845A patent/JP4657532B2/en not_active Expired - Fee Related
Patent Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH03184540A (en) * | 1989-07-27 | 1991-08-12 | Philips Gloeilampenfab:Nv | Image-voice conversion device |
JPH11102280A (en) * | 1994-12-16 | 1999-04-13 | Hitachi Ltd | Sound output method for image information |
JPH08252279A (en) * | 1995-03-17 | 1996-10-01 | Hitachi Ltd | Route guiding method for presenting object with sound |
JPH0962866A (en) * | 1995-08-22 | 1997-03-07 | Nec Corp | Information presentation device |
JPH09258946A (en) * | 1996-03-26 | 1997-10-03 | Fujitsu Ltd | Information processor |
JP2000241544A (en) * | 1999-02-18 | 2000-09-08 | Techno Soft Systemnics:Kk | Portable non-contact obstacle distance meter for visually handicapped person |
JP2001084484A (en) * | 1999-09-13 | 2001-03-30 | Tamotsu Okazawa | Method and device for scene recognition |
Also Published As
Publication number | Publication date |
---|---|
JP2003084784A (en) | 2003-03-19 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
KR101221513B1 (en) | Graphic haptic electronic board and method for transferring visual information to visually impaired people as haptic information | |
Hoang et al. | Obstacle detection and warning system for visually impaired people based on electrode matrix and mobile Kinect | |
CN107708483B (en) | Method and system for extracting motion characteristics of a user to provide feedback to the user using hall effect sensors | |
Balakrishnan et al. | Wearable real-time stereo vision for the visually impaired. | |
CN103973971B (en) | The control method of information equipment and information equipment | |
WO2014064870A1 (en) | Image processing device and image processing method | |
US11185445B2 (en) | Portable system that allows blind or visually impaired persons to interpret the surrounding environment by sound and touch | |
EP1343108A3 (en) | Method and apparatus for recognising faces using principal component analysis and second order independent component analysis | |
JPH1069539A (en) | Scenery image input and touch output device | |
CN105117706A (en) | Image processing method and apparatus and character recognition method and apparatus | |
Balakrishnan et al. | A stereo image processing system for visually impaired | |
US20190333496A1 (en) | Spatialized verbalization of visual scenes | |
JP2011250928A (en) | Device, method and program for space recognition for visually handicapped person | |
KR20100010981A (en) | Apparatus and method for converting image information into haptic sensible signal | |
Hoang et al. | Obstacle detection and warning for visually impaired people based on electrode matrix and mobile Kinect | |
JP5002068B1 (en) | Environmental information transmission device | |
JP4657532B2 (en) | Shape transfer device | |
JP5598981B2 (en) | Perceptual stimulus information generation system | |
Pei et al. | Census-based vision for auditory depth images and speech navigation of visually impaired users | |
KR20160113760A (en) | Picture book making method and system for blind child and tactile teaching tool using the same | |
JP2015011404A (en) | Motion-recognizing and processing device | |
JP2009503628A (en) | Multimedia digital code printing apparatus and printing method | |
JP2019008482A (en) | Braille character tactile sense presentation device and image forming apparatus | |
Bangar et al. | Vocal vision for visually impaired | |
JP4891375B2 (en) | Image hearing device |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
A621 | Written request for application examination |
Free format text: JAPANESE INTERMEDIATE CODE: A621 Effective date: 20080215 |
|
A621 | Written request for application examination |
Free format text: JAPANESE INTERMEDIATE CODE: A621 Effective date: 20080215 |
|
A131 | Notification of reasons for refusal |
Free format text: JAPANESE INTERMEDIATE CODE: A131 Effective date: 20100817 |
|
RD02 | Notification of acceptance of power of attorney |
Free format text: JAPANESE INTERMEDIATE CODE: A7422 Effective date: 20100831 |
|
A521 | Request for written amendment filed |
Free format text: JAPANESE INTERMEDIATE CODE: A523 Effective date: 20101008 |
|
TRDD | Decision of grant or rejection written | ||
A01 | Written decision to grant a patent or to grant a registration (utility model) |
Free format text: JAPANESE INTERMEDIATE CODE: A01 Effective date: 20101214 |
|
A01 | Written decision to grant a patent or to grant a registration (utility model) |
Free format text: JAPANESE INTERMEDIATE CODE: A01 |
|
A61 | First payment of annual fees (during grant procedure) |
Free format text: JAPANESE INTERMEDIATE CODE: A61 Effective date: 20101222 |
|
FPAY | Renewal fee payment (event date is renewal date of database) |
Free format text: PAYMENT UNTIL: 20140107 Year of fee payment: 3 |
|
R150 | Certificate of patent or registration of utility model |
Ref document number: 4657532 Country of ref document: JP Free format text: JAPANESE INTERMEDIATE CODE: R150 Free format text: JAPANESE INTERMEDIATE CODE: R150 |
|
FPAY | Renewal fee payment (event date is renewal date of database) |
Free format text: PAYMENT UNTIL: 20140107 Year of fee payment: 3 |
|
R250 | Receipt of annual fees |
Free format text: JAPANESE INTERMEDIATE CODE: R250 |
|
LAPS | Cancellation because of no payment of annual fees |