JP4657532B2 - Shape transfer device - Google Patents

Info

Publication number
JP4657532B2
Authority
JP
Japan
Prior art keywords
sound
image data
shape
output
image
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
JP2001275845A
Other languages
Japanese (ja)
Other versions
JP2003084784A (en)
Inventor
幸治 石突
美穂 細野
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Line Media Research Co Ltd
Original Assignee
Line Media Research Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
2001-09-12
Filing date
2001-09-12
Publication date
2011-03-23
Application filed by Line Media Research Co Ltd
Priority to JP2001275845A
Publication of JP2003084784A
Application granted
Publication of JP4657532B2
Anticipated expiration
Expired - Fee Related

Abstract

PROBLEM TO BE SOLVED: To provide a shape transfer device that converts the shape and features of image data read by an image reader into sound data and outputs the sound data.
SOLUTION: Image data is read, and its contour information 2 is extracted as the shape and features of the image data. The X axis of the contour information is replaced with sound position information 4, which relies on sound source localization, the human auditory function that determines the direction of an output sound, and the Y axis is replaced with the sound frequency 5. The image data is thereby converted into sound data and output, so that the shape and features of the image data can be recognized by hearing.

Description

[0001]
[Technical field of the invention]
The present invention relates to a technique for converting the shape and features of image data (1), recognized and read by an image reading device, into sound data and outputting that sound data.
[0002]
[Prior art]
Conventionally, the information in image data (1) recognized and read by an image reading device is conveyed to the user visually, by outputting the input image data (1) to a monitor or to paper, as with image scanners and printers. Other known methods, seen in walking aids for visually impaired people, convey the shape of the recognized image data (1) to the sense of touch by generating vibration or electrical stimulation, or convey the shape of the input image data (1) to the sense of hearing by means of sound.
[0003]
[Problems to be solved by the invention]
With the conventional methods, when the information in the recognized and read image data (1) is output to a monitor or to paper, it is difficult to recognize the image data (1) visually whenever sight or visibility is impaired in some way, for example when working in a dark place or another location with poor visibility. Moreover, with the methods that convey information to the sense of touch by vibration or electrical stimulation, or to the sense of hearing by sound, it is difficult to grasp the shape and features of the read image data (1).
[0004]
[Means for Solving the Problems]
[0005]
The shape and features of image data (1) read by an image reading device such as a CCD camera or a scanner are obtained by extracting the contour information (2) of the image data (1). Specifically, the horizontal direction of the image data (1) is taken as the X axis and the vertical direction as the Y axis, the image is read along the X axis, and the contour information (2) extracted as the shape and features of the image data (1) is converted into sound data. In this conversion the X axis is replaced with the position information (4) of the output sound and the Y axis with the frequency (5) of the sound. The shape and features of the image data (1) are thus output as sound data, so that a visually impaired person, for whom visual recognition was difficult, can recognize them by hearing.
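The axis replacement described in this paragraph can be illustrated with a minimal Python sketch. The patent does not give numeric ranges or a mapping law, so the function name contour_point_to_sound, the pan range of -1.0 to +1.0, the 200 Hz to 2000 Hz band, the logarithmic frequency spacing, and the choice that the top of the image gives the highest pitch are all assumptions made only for illustration.

```python
def contour_point_to_sound(x, y, width, height, f_low=200.0, f_high=2000.0):
    """Map one contour pixel (x, y) to (pan, frequency_hz).

    Assumptions (not from the patent): pan runs from -1.0 (far left) to
    +1.0 (far right); row 0 is the top of the image and maps to f_high;
    frequencies are spaced logarithmically between f_low and f_high.
    """
    if width < 2 or height < 2:
        raise ValueError("image must be at least 2x2 pixels")
    # X axis -> position of the output sound (4)
    pan = 2.0 * x / (width - 1) - 1.0
    # Y axis -> frequency of the sound (5)
    ratio = (height - 1 - y) / (height - 1)
    freq = f_low * (f_high / f_low) ** ratio
    return pan, freq
```

For a 5 x 5 image, the centre pixel (2, 2) maps to pan 0.0 and a frequency midway (on a logarithmic scale) between the two band limits. A linear frequency scale would also satisfy the text; the logarithmic one is used here only because equal vertical steps then correspond to equal pitch intervals.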
[0006]
DETAILED DESCRIPTION OF THE INVENTION
Whether the object itself is two-dimensional or three-dimensional, its contour information (2) can be expressed in two dimensions. Human hearing recognizes the direction of an output sound through a function called sound source localization (3), which determines from which direction the sound is heard; it recognizes loudness from the sound pressure level of the output sound, and pitch and tone color from the frequency (5) of the sound. Therefore, the horizontal direction of the image data (1) is taken as the X axis and the vertical direction as the Y axis, and the image is read along the X axis so that the contour information (2) of the image data (1) is extracted as its shape and features. Converting the extracted contour information (2) into sound data and outputting it makes it possible to recognize the shape and features of the image data (1) by hearing. In the conversion to sound data, the X axis is replaced with the position information (4) of the output sound and the Y axis with the frequency (5) of the sound. However, human hearing cannot distinguish several sounds of the same frequency output simultaneously, and the ability to distinguish several different frequencies output simultaneously varies from person to person. Furthermore, in sound source localization (3), the ability to recognize the position of sounds coming from the front and from the left and right is higher than the ability to recognize the position of sounds coming from behind. For these reasons, the position information (4) of the output sound is output as a time scan from the left along the X axis, and the features of the image data (1) are thereby output as sound data.
As an example, the sound data output differs at each of the points n (6), n + 1 (7), and n + 2 (8) reached after output of the image data (1) as sound data has started. At point n (6), two sounds of different frequencies are output simultaneously from the left. At point n + 1 (7), two sounds of different frequencies are output simultaneously from the center, one higher and one lower in frequency than the sounds at point n (6). At point n + 2 (8), no sound is output.
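The time-scanned output and the three example points can be sketched as follows. This is only an illustration under assumptions: the helper scan_columns, the tiny 5 x 3 test image EXAMPLE, and the frequency band are hypothetical and do not come from the patent, and the image must be at least 2 x 2 pixels.

```python
def scan_columns(contour, f_low=200.0, f_high=2000.0):
    """Scan a binary contour image left to right, one column per time step.

    Returns a list of (pan, frequencies) pairs. pan runs from -1.0 (left)
    to +1.0 (right); frequencies holds one tone per contour pixel in the
    column, sorted from low to high. The mapping itself is an assumption.
    """
    height, width = len(contour), len(contour[0])
    steps = []
    for x in range(width):
        pan = 2.0 * x / (width - 1) - 1.0
        freqs = []
        for y in range(height):
            if contour[y][x]:
                ratio = (height - 1 - y) / (height - 1)  # row 0 (top) -> highest pitch
                freqs.append(f_low * (f_high / f_low) ** ratio)
        steps.append((pan, sorted(freqs)))
    return steps


# Hypothetical 5-row x 3-column contour standing in for points n, n + 1, n + 2.
EXAMPLE = [
    [0, 1, 0],
    [1, 0, 0],
    [0, 0, 0],
    [1, 0, 0],
    [0, 1, 0],
]
STEPS = scan_columns(EXAMPLE)
```

In this sketch the first step carries two tones on the left, the second step carries two tones at the center that are respectively lower and higher than those of the first step, and the third step is silent, which mirrors the behaviour described for points n (6), n + 1 (7), and n + 2 (8).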
[0007]
In FIG. 11, after the image data (1) is read from the scanner input unit, binarization processing (10) is performed as image conversion (9). The read image data (1) is then subjected to contour tracing processing (12) as feature extraction processing (11), which extracts the features of the image data (1). After image measurement (13) of the contour information (2), the table is replaced (14), from image data (1) to sound data: the X direction in which the image was read becomes the position information (4) of the output sound, the vertical direction of the contour information (2) becomes the Y coordinate, and sorting (15) is performed. The sorted (15) image data (1) is output as sound data.
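The flow of FIG. 11 can be approximated by the short Python sketch below. It is not the patented implementation: the function names, the threshold of 128, and the convention that dark pixels are foreground are assumptions, and the contour tracing processing (12) is replaced here by a simpler boundary test (a foreground pixel with at least one background or out-of-image 4-neighbour), which yields the same kind of (x, y) contour table for simple shapes.

```python
def binarize(gray, threshold=128):
    """Binarization (10): pixels darker than the threshold become foreground (1)."""
    return [[1 if value < threshold else 0 for value in row] for row in gray]


def contour_pixels(binary):
    """Stand-in for contour tracing (12): collect foreground pixels that touch
    the background or the image border in the 4-neighbourhood."""
    height, width = len(binary), len(binary[0])
    points = []
    for y in range(height):
        for x in range(width):
            if not binary[y][x]:
                continue
            neighbours = ((x - 1, y), (x + 1, y), (x, y - 1), (x, y + 1))
            if any(
                not (0 <= nx < width and 0 <= ny < height) or not binary[ny][nx]
                for nx, ny in neighbours
            ):
                points.append((x, y))
    return points


def to_sound_table(points):
    """Table replacement (14) and sort (15): order contour points by X
    (playback order), then by Y within each column."""
    return sorted(points)


# A 5 x 5 grayscale image holding a dark 3 x 3 square on a white background.
GRAY = [[0 if 1 <= x <= 3 and 1 <= y <= 3 else 255 for x in range(5)] for y in range(5)]
TABLE = to_sound_table(contour_pixels(binarize(GRAY)))
```

Each entry of TABLE can then be turned into a sound position and a frequency; sorting by X first is what allows the table to be played back as a left-to-right scan.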
[0008]
FIG. 12 shows an example of converting image data and a specific point into sound data.
In the figure, the captured image (21) and the specific point (20) are both present on the screen. The position of the specific point (20) is analyzed in terms of the frequency (22) at which the specific point is located and the volume (23) at which the specific point is located.
In the figure, (21) L.ch and (21) R.ch are waveforms illustrating the left and right output levels and waveforms obtained when the captured image (21) is analyzed; the frequency and the left and right output levels change with the shape of the captured image (21).
In the figure, (20) L.chA and (20) R.chA show an example in which the position of the specific point (20) is analyzed and, using the frequency (22) and the volume (23) at which the specific point is located, emission and non-emission are repeated at regular intervals so that the point can be told apart from the surrounding image data. (20) L.chB and (20) R.chB show an example in which the position of the specific point (20) is analyzed and the frequency (22) at which the specific point is located is emitted, at the volume (23) at which the specific point is located, at regular intervals during the time in which the horizontal position of the specific point (20) is being played back.
The waveforms (21) L.ch and (20) L.chA or (20) L.chB, and the waveforms (21) R.ch and (20) R.chA or (20) R.chB, are each combined and output.
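The intermittent marker tone and the per-channel mixing described for FIG. 12 might be sketched as follows. Everything numeric here is an assumption: the 8000 Hz sample rate, the amplitude, the 50 % duty cycle, and the constant-power panning law used to turn a horizontal position into left and right volumes are illustrative choices, and the function names are hypothetical.

```python
import math


def tone(freq_hz, n_samples, sample_rate=8000, amplitude=0.3):
    """Sine tone at the frequency (22) assigned to the specific point."""
    return [
        amplitude * math.sin(2.0 * math.pi * freq_hz * i / sample_rate)
        for i in range(n_samples)
    ]


def gate(samples, period, duty=0.5):
    """Alternate emission and silence at regular intervals: keep the first
    `duty` fraction of every `period` samples and mute the rest."""
    on = int(period * duty)
    return [s if (i % period) < on else 0.0 for i, s in enumerate(samples)]


def pan_gains(pan):
    """Left and right gains for pan in [-1, 1] (constant-power law, an assumption)."""
    angle = (pan + 1.0) * math.pi / 4.0
    return math.cos(angle), math.sin(angle)


def mix_marker(image_left, image_right, marker, pan):
    """Add the gated marker tone to the left and right image channels."""
    gain_left, gain_right = pan_gains(pan)
    left = [a + gain_left * m for a, m in zip(image_left, marker)]
    right = [a + gain_right * m for a, m in zip(image_right, marker)]
    return left, right
```

A typical use would generate the marker with gate(tone(880.0, 8000), period=2000) and pass it to mix_marker together with the two image channels. The gating is what lets a listener separate the specific point from the continuous sound of the surrounding image.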
[0009]
[Brief description of the drawings]
FIG. 1 shows an example of image data.
FIG. 2 is a diagram in which the contour information of the image data has been extracted.
FIG. 3 is a diagram showing the definition used to convert image data into sound data.
FIG. 4 shows the contour information of a two-dimensional figure extracted and expressed as two-axis data.
FIG. 5 shows the contour information of a three-dimensional figure extracted and expressed as two-axis data.
FIG. 6 is a diagram showing sound source localization (3).
FIG. 7 is a diagram showing the sound data output for a captured image.
FIG. 8 is a diagram showing the output sound data (6).
FIG. 9 is a diagram showing the output sound data (7).
FIG. 10 is a diagram showing the output sound data.
FIG. 11 is an example of a flow for converting image data into sound data.
FIG. 12 is an example of converting image data and a specific point into sound data.
[Description of symbols]
1 Image data
2 Contour information
3 Sound source localization
4 Position information of the output sound
5 Sound frequency
6 Point n
7 Point n + 1
8 Point n + 2
9 Image conversion
10 Binarization processing
11 Feature extraction processing
12 Contour tracing processing
13 Image measurement
14 Table replacement
15 Sort
20 Specific point
21 Captured image
22 Frequency at which the specific point is located
23 Volume at which the specific point is located

Claims (2)

1. A shape transfer device that outputs, as sound, the shape and features of image data (1) recognized and read by an image reading device such as a CCD camera or a scanner, wherein:
the horizontal direction of the image data (1) is taken as the X axis and the vertical direction of the image data (1) as the Y axis, the image is read in the X-axis direction, and contour information (2) including the X coordinate and Y coordinate of a contour is extracted from the contour of the figure present at the X coordinate being read on the image data (1);
the X coordinate of the contour information (2) is replaced with information (4) on the position from which sound is output, making use of sound source localization (3), the human auditory function that determines from which direction an output sound is heard, and the Y coordinate is replaced with the frequency (5) of the sound; and
the image data (1) is scanned in the X-axis direction, and the sound specified by the position information (4) and the sound frequency (5) corresponding to the X coordinate at the time of scanning is output,
thereby enabling the shape and features of the image data (1) to be recognized by hearing.
2. The shape transfer device according to claim 1, wherein a specific point is identified in the captured image and, by a method such as repeating emission and non-emission at regular intervals or changing the tone color, at the frequency and volume corresponding to the coordinates of that point, the positional relationship between the specific point and its surroundings can be recognized by hearing.
JP2001275845A 2001-09-12 2001-09-12 Shape transfer device Expired - Fee Related JP4657532B2 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP2001275845A JP4657532B2 (en) 2001-09-12 2001-09-12 Shape transfer device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP2001275845A JP4657532B2 (en) 2001-09-12 2001-09-12 Shape transfer device

Publications (2)

Publication Number Publication Date
JP2003084784A JP2003084784A (en) 2003-03-19
JP4657532B2 JP4657532B2 (en) 2011-03-23

Family

ID=19100639

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2001275845A Expired - Fee Related JP4657532B2 (en) 2001-09-12 2001-09-12 Shape transfer device

Country Status (1)

Country Link
JP (1) JP4657532B2 (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP4891375B2 (en) * 2009-09-28 2012-03-07 昌弘 黒田 Image hearing device
CN107157651A (en) * 2017-06-13 2017-09-15 浙江诺尔康神经电子科技股份有限公司 A kind of visual pattern sensory perceptual system and method based on sonic stimulation

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH03184540A (en) * 1989-07-27 1991-08-12 Philips Gloeilampenfab:Nv Image-voice conversion device
JPH08252279A (en) * 1995-03-17 1996-10-01 Hitachi Ltd Route guiding method for presenting object with sound
JPH0962866A (en) * 1995-08-22 1997-03-07 Nec Corp Information presentation device
JPH09258946A (en) * 1996-03-26 1997-10-03 Fujitsu Ltd Information processor
JPH11102280A (en) * 1994-12-16 1999-04-13 Hitachi Ltd Sound output method for image information
JP2000241544A (en) * 1999-02-18 2000-09-08 Techno Soft Systemnics:Kk Portable non-contact obstacle distance meter for visually handicapped person
JP2001084484A (en) * 1999-09-13 2001-03-30 Tamotsu Okazawa Method and device for scene recognition

Also Published As

Publication number Publication date
JP2003084784A (en) 2003-03-19

Similar Documents

Publication Publication Date Title
KR101221513B1 (en) Graphic haptic electronic board and method for transferring visual information to visually impaired people as haptic information
Hoang et al. Obstacle detection and warning system for visually impaired people based on electrode matrix and mobile Kinect
CN107708483B (en) Method and system for extracting motion characteristics of a user to provide feedback to the user using hall effect sensors
Balakrishnan et al. Wearable real-time stereo vision for the visually impaired.
CN103973971B (en) The control method of information equipment and information equipment
WO2014064870A1 (en) Image processing device and image processing method
US11185445B2 (en) Portable system that allows blind or visually impaired persons to interpret the surrounding environment by sound and touch
EP1343108A3 (en) Method and apparatus for recognising faces using principal component analysis and second order independent component analysis
JPH1069539A (en) Scenery image input and touch output device
CN105117706A (en) Image processing method and apparatus and character recognition method and apparatus
Balakrishnan et al. A stereo image processing system for visually impaired
US20190333496A1 (en) Spatialized verbalization of visual scenes
JP2011250928A (en) Device, method and program for space recognition for visually handicapped person
KR20100010981A (en) Apparatus and method for converting image information into haptic sensible signal
Hoang et al. Obstacle detection and warning for visually impaired people based on electrode matrix and mobile Kinect
JP5002068B1 (en) Environmental information transmission device
JP4657532B2 (en) Shape transfer device
JP5598981B2 (en) Perceptual stimulus information generation system
Pei et al. Census-based vision for auditory depth images and speech navigation of visually impaired users
KR20160113760A (en) Picture book making method and system for blind child and tactile teaching tool using the same
JP2015011404A (en) Motion-recognizing and processing device
JP2009503628A (en) Multimedia digital code printing apparatus and printing method
JP2019008482A (en) Braille character tactile sense presentation device and image forming apparatus
Bangar et al. Vocal vision for visually impaired
JP4891375B2 (en) Image hearing device

Legal Events

Date Code Title Description
A621 Written request for application examination

Free format text: JAPANESE INTERMEDIATE CODE: A621

Effective date: 20080215

A131 Notification of reasons for refusal

Free format text: JAPANESE INTERMEDIATE CODE: A131

Effective date: 20100817

RD02 Notification of acceptance of power of attorney

Free format text: JAPANESE INTERMEDIATE CODE: A7422

Effective date: 20100831

A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20101008

TRDD Decision of grant or rejection written
A01 Written decision to grant a patent or to grant a registration (utility model)

Free format text: JAPANESE INTERMEDIATE CODE: A01

Effective date: 20101214

A61 First payment of annual fees (during grant procedure)

Free format text: JAPANESE INTERMEDIATE CODE: A61

Effective date: 20101222

FPAY Renewal fee payment (event date is renewal date of database)

Free format text: PAYMENT UNTIL: 20140107

Year of fee payment: 3

R150 Certificate of patent or registration of utility model

Ref document number: 4657532

Country of ref document: JP

Free format text: JAPANESE INTERMEDIATE CODE: R150

R250 Receipt of annual fees

Free format text: JAPANESE INTERMEDIATE CODE: R250

LAPS Cancellation because of no payment of annual fees