CN108320624A - Text recognition voice machine - Google Patents
- Publication number: CN108320624A
- Application number: CN201711400969.8A
- Authority: CN (China)
- Legal status: Pending (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Classifications
-
- G—PHYSICS
- G09—EDUCATION; CRYPTOGRAPHY; DISPLAY; ADVERTISING; SEALS
- G09B—EDUCATIONAL OR DEMONSTRATION APPLIANCES; APPLIANCES FOR TEACHING, OR COMMUNICATING WITH, THE BLIND, DEAF OR MUTE; MODELS; PLANETARIA; GLOBES; MAPS; DIAGRAMS
- G09B21/00—Teaching, or communicating with, the blind, deaf or mute
- G09B21/001—Teaching or communicating with blind persons
- G09B21/006—Teaching or communicating with blind persons using audible presentation of the information
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/10—Image acquisition
- G06V10/12—Details of acquisition arrangements; Constructional details thereof
- G06V10/14—Optical characteristics of the device performing the acquisition or on the illumination arrangements
- G06V10/147—Details of sensors, e.g. sensor lenses
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V30/00—Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
- G06V30/40—Document-oriented image-based pattern recognition
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/08—Text analysis or generation of parameters for speech synthesis out of text, e.g. grapheme to phoneme translation, prosody generation or stress or intonation determination
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Health & Medical Sciences (AREA)
- Theoretical Computer Science (AREA)
- Multimedia (AREA)
- General Physics & Mathematics (AREA)
- Computer Vision & Pattern Recognition (AREA)
- General Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Artificial Intelligence (AREA)
- Acoustics & Sound (AREA)
- Computational Linguistics (AREA)
- Vascular Medicine (AREA)
- Business, Economics & Management (AREA)
- Educational Administration (AREA)
- Educational Technology (AREA)
- Electrically Operated Instructional Devices (AREA)
Abstract
The invention discloses a text recognition voice machine comprising: a camera; a memory connected to the camera; a processor connected to the memory, which reads the stored image and extracts the text from it; a sound card connected to the processor; a loudspeaker connected to the sound card; a housing that holds the memory, processor, sound card and loudspeaker; and a holder attached to the housing that detachably holds the camera. The camera captures image information, a chip working together with software extracts the text from the image, and the text is then converted into speech and read aloud through the loudspeaker. This lets blind people read ordinary books and supports education for people with disabilities. The holder makes the camera easy to attach and remove and places no restriction on its size, so any device with a camera function can be used, which gives the machine a wide range of application.
Description
Technical field
The present invention relates to an image recognition device, and in particular to a text recognition voice machine.
Background art
Blind people can read only books produced specially for the blind, and not every general publication is issued in a braille edition. To let blind people read general publications as sighted people do, a machine is needed that can convert image information into speech. The present invention addresses this problem.
Summary of the invention
To remedy the deficiencies of the prior art, the object of the present invention is to provide a text recognition voice machine that converts image information into speech, making it easier for blind people to read ordinary books and supporting education for people with disabilities.
To achieve this object, the present invention adopts the following technical solution:
A text recognition voice machine comprises: a camera; a memory connected to the camera; a processor connected to the memory, which reads the stored image and extracts the text from it; a sound card connected to the processor; a loudspeaker connected to the sound card; a housing that holds the memory, processor, sound card and loudspeaker; and a holder attached to the housing that detachably holds the camera.
In the aforementioned text recognition voice machine, the holder comprises: clamping plates that grip the camera on both sides; a fixing plate located between the clamping plates and fixed beneath the housing; and elastic members connected between the clamping plates and the fixing plate.
In the aforementioned text recognition voice machine, the elastic member is a spring.
In the aforementioned text recognition voice machine, the elastic member is a rubber strip.
In the aforementioned text recognition voice machine, the housing is provided with an engagement groove.
In the aforementioned text recognition voice machine, the clamping plate is provided with an engagement protrusion corresponding to the engagement groove.
In the aforementioned text recognition voice machine, the engagement protrusion is T-shaped in cross-section.
The beneficial effects of the invention are as follows. The text recognition voice machine captures image information with the camera, extracts the text from the image with a chip working together with software, converts the text into speech, and reads it aloud through the loudspeaker. This lets blind people read ordinary books and supports education for people with disabilities. The holder makes the camera easy to attach and remove and places no restriction on its size, so any device with a camera function can be used, which gives the machine a wide range of application.
Brief description of the drawings
Fig. 1 is a sectional view of an embodiment of the present invention.
Reference numerals in the figure:
1 camera, 2 loudspeaker, 3 housing, 301 engagement groove, 4 holder, 401 clamping plate, 402 fixing plate, 403 elastic member, 5 engagement protrusion.
Detailed description
The present invention is described in detail below with reference to the drawing and specific embodiments.
The text recognition voice machine comprises: a camera 1; a memory connected to the camera 1; a processor connected to the memory, which reads the stored image and extracts the text from it; a sound card connected to the processor; a loudspeaker 2 connected to the sound card; a housing 3 that holds the memory, processor, sound card and loudspeaker 2; and a holder 4 attached to the housing 3 that detachably holds the camera 1. Preferably, the processor is a DSC chip. The camera 1 captures image information and stores it in the memory. OCR then recognizes the text in the image, the text is converted into digital information and passed to the sound card, and the sound card converts it into sound, which the loudspeaker 2 reads aloud.
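The processing chain described above (capture an image, store it, recognize the text, convert it to sound, play it back) can be sketched in Python as follows. This is only an illustrative sketch under stated assumptions: the patent names no OCR or speech-synthesis software, so each stage is passed in as a plain function, and the class name `ReadingMachine`, its field names and the stub stages are all hypothetical.

```python
from dataclasses import dataclass, field
from typing import Callable, List


@dataclass
class ReadingMachine:
    """Illustrative model of the capture -> store -> OCR -> sound chain.

    Every name here is hypothetical; the patent does not specify software.
    """
    capture: Callable[[], bytes]        # camera 1: returns one image
    recognize: Callable[[bytes], str]   # OCR step performed by the processor
    synthesize: Callable[[str], bytes]  # text -> audio data for the sound card
    play: Callable[[bytes], None]       # loudspeaker 2: plays the audio data
    storage: List[bytes] = field(default_factory=list)  # holds captured images

    def read_page(self) -> str:
        """Run the chain once and return the recognized text."""
        image = self.capture()
        self.storage.append(image)               # the image is stored first
        text = self.recognize(self.storage[-1])  # the processor reads it back
        if text.strip():                         # skip playback for blank pages
            self.play(self.synthesize(text))
        return text


# Stub stages stand in for real hardware and OCR/speech software.
played: List[bytes] = []
machine = ReadingMachine(
    capture=lambda: b"raw-image-bytes",
    recognize=lambda image: "Chapter one",
    synthesize=lambda text: text.encode("utf-8"),
    play=played.append,
)
page_text = machine.read_page()
```

Passing the stages in as functions mirrors the point made in the text that any device with a camera function can serve as the image source: only the `capture` stage changes, and the rest of the chain stays the same.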
The holder 4 comprises: clamping plates 401 that grip the camera 1 on both sides; a fixing plate 402 located between the clamping plates 401 and fixed beneath the housing 3; and elastic members 403 connected between the clamping plates 401 and the fixing plate 402. Preferably, the elastic member 403 is a spring or a rubber strip. To mount the camera 1, the clamping plates 401 are pulled apart, the camera 1 is placed between them, and the clamping plates 401 are released; the elastic members 403 then pull the plates back against the camera 1 and clamp it. The holder 4 makes the camera 1 easy to attach and remove and places no restriction on its size, so any device with a camera function can be used, which gives the machine a wide range of application.
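The spring-loaded clamping described above can be illustrated with a small Hooke's-law calculation. The patent gives no dimensions or spring rates, so the sketch below rests on assumptions: the class name `SpringHolder`, every number, a linear elastic member on each side that is relaxed when the holder is empty, and plates that open symmetrically are all hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SpringHolder:
    """Illustrative model of the holder; all dimensions are hypothetical."""
    rest_gap_mm: float         # gap between the clamping plates when empty
    max_gap_mm: float          # widest gap the plates can be pulled to
    stiffness_n_per_mm: float  # stiffness of the elastic member on each side

    def fits(self, device_width_mm: float) -> bool:
        """A device is held only if it is wider than the rest gap and
        no wider than the fully opened gap."""
        return self.rest_gap_mm < device_width_mm <= self.max_gap_mm

    def clamping_force_n(self, device_width_mm: float) -> float:
        """Force with which each plate presses on the device (F = k * x)."""
        if not self.fits(device_width_mm):
            raise ValueError(f"{device_width_mm} mm device cannot be clamped")
        # Each plate moves outward by half of the extra width.
        extension_mm = (device_width_mm - self.rest_gap_mm) / 2
        return self.stiffness_n_per_mm * extension_mm


holder = SpringHolder(rest_gap_mm=40.0, max_gap_mm=90.0, stiffness_n_per_mm=0.5)
phone_force = holder.clamping_force_n(70.0)  # 0.5 N/mm * 15 mm of extension
```

Under these assumed numbers, any device between the rest gap and the maximum gap is gripped, and the force grows with device width, which is one way to read the statement that the camera size is not restricted.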
To keep the clamping plates 401 from shifting arbitrarily as they are moved outward, the housing 3 is provided with an engagement groove 301, and each clamping plate 401 is provided with an engagement protrusion 5 corresponding to the engagement groove 301. Preferably, the engagement protrusion 5 is T-shaped in cross-section. The engagement groove 301 is an elongated slot that restricts the clamping plates 401 to lateral, straight-line movement and prevents any other movement, so that the camera 1 is held stably.
To make the voice machine easy for the user to hold, a handle is fixed to the housing 3. Preferably, the handle is L-shaped in cross-section.
The present invention provides a text recognition voice machine that captures image information with the camera 1, extracts the text from the image with a chip working together with software, converts the text into speech, and reads it aloud through the loudspeaker 2. This lets blind people read ordinary books and supports education for people with disabilities. The holder 4 makes the camera 1 easy to attach and remove and places no restriction on its size, so any device with a camera function can be used, which gives the machine a wide range of application.
The basic principles, main features and advantages of the present invention have been shown and described above. Those skilled in the art should understand that the above embodiments do not limit the invention in any way, and that all technical solutions obtained by equivalent substitution or equivalent transformation fall within the scope of protection of the present invention.
Claims (7)
1. A text recognition voice machine, characterized by comprising: a camera; a memory connected to said camera; a processor connected to said memory, which reads the stored image and extracts the text from it; a sound card connected to said processor; a loudspeaker connected to said sound card; a housing that holds the memory, processor, sound card and loudspeaker; and a holder attached to the housing that detachably holds said camera.
2. The text recognition voice machine according to claim 1, characterized in that said holder comprises: clamping plates that grip said camera on both sides; a fixing plate located between said clamping plates and fixed beneath said housing; and elastic members connected between said clamping plates and the fixing plate.
3. The text recognition voice machine according to claim 2, characterized in that said elastic member is a spring.
4. The text recognition voice machine according to claim 2, characterized in that said elastic member is a rubber strip.
5. The text recognition voice machine according to claim 2, characterized in that said housing is provided with an engagement groove.
6. The text recognition voice machine according to claim 5, characterized in that said clamping plate is provided with an engagement protrusion corresponding to the engagement groove.
7. The text recognition voice machine according to claim 6, characterized in that said engagement protrusion is T-shaped in cross-section.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201711400969.8A CN108320624A (en) | 2017-12-22 | 2017-12-22 | Text recognition voice machine |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201711400969.8A CN108320624A (en) | 2017-12-22 | 2017-12-22 | Text recognition voice machine |
Publications (1)
Publication Number | Publication Date |
---|---|
CN108320624A (en) | 2018-07-24 |
Family
ID=62893157
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201711400969.8A (Pending) CN108320624A (en) | Text recognition voice machine | 2017-12-22 | 2017-12-22 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN108320624A (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110070042A (en) * | 2019-04-23 | 2019-07-30 | 北京字节跳动网络技术有限公司 | Character recognition method, device and electronic equipment |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102509479A (en) * | 2011-10-08 | 2012-06-20 | 沈沾俊 | Portable character recognition voice reader and method for reading characters |
CN103077625A (en) * | 2013-01-30 | 2013-05-01 | 中国盲文出版社 | Blind electronic reader and blind assistance reading method |
CN106446887A (en) * | 2016-11-07 | 2017-02-22 | 罗杰仁 | Method and device for converting picture into voice |
Similar Documents
Publication | Title | Publication Date |
---|---|---|
CN104508537B (en) | Glasses with built-in computer | |
EP1855459A3 (en) | Apparatus and method for photographing a business card in a portable terminal | |
TW201216666A (en) | Smart phone with lens | |
EP1876596A3 (en) | Recording and reproducing data | |
IT1390595B1 (en) | AID DEVICE IN READING A PRINTED TEXT | |
GB2401503C (en) | A portable data storage and image recording device capable of direct connection to a computer USB port | |
EP1603070A3 (en) | Medical image storage apparatus protecting personal information | |
EP1744264A3 (en) | Biometric information registration apparatus | |
CN108320624A (en) | Text recognition voice machine | |
CN110298349A (en) | A kind of is quickly the method and apparatus of digital content by paper book content transformation | |
TWM321184U (en) | Image processing device | |
CN203410176U (en) | Marking seal with fingerprint identification function | |
Saleous et al. | Read2Me: A cloud-based reading aid for the visually impaired | |
CN201774591U (en) | Digital camera with address book and face recognition function | |
Yoo et al. | Developing of Text Translation and Display Devices via Braille Module | |
CN108335579A (en) | Text region book reading machine | |
JP2014127197A (en) | Application software for voice reading characters recognized by camera of smartphone | |
CN108172044A (en) | A kind of English word learning apparatus | |
CN201936322U (en) | Talking character identifier | |
CN108242195A (en) | It is capable of the reading machine for the blind of automatic identification word | |
TR200101623A2 (en) | Multimedia e-book intended only for learning and memorizing the Quran. | |
CN205364891U (en) | Electronics bookmark and e -book reader | |
CN207558216U (en) | A kind of multi-functional japanese voice facility for study | |
CN201957134U (en) | File-photographing instrument | |
CN210181811U (en) | Card for memorizing English words |
Legal Events
Code | Title | Description |
---|---|---|
PB01 | Publication | |
SE01 | Entry into force of request for substantive examination | |
RJ01 | Rejection of invention patent application after publication | Application publication date: 20180724 |