CN106484297A - A kind of word pick device and method - Google Patents
A kind of word pick device and method Download PDFInfo
- Publication number
- CN106484297A CN106484297A CN201610884064.1A CN201610884064A CN106484297A CN 106484297 A CN106484297 A CN 106484297A CN 201610884064 A CN201610884064 A CN 201610884064A CN 106484297 A CN106484297 A CN 106484297A
- Authority
- CN
- China
- Prior art keywords
- word
- touch location
- module
- coordinate
- dynamic image
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000000034 method Methods 0.000 title claims abstract description 62
- 238000001514 detection method Methods 0.000 claims description 25
- 230000001143 conditioned effect Effects 0.000 claims description 19
- 201000009487 Amblyopia Diseases 0.000 abstract description 11
- 230000000007 visual effect Effects 0.000 abstract description 7
- 238000004891 communication Methods 0.000 description 23
- 230000008569 process Effects 0.000 description 14
- 230000006870 function Effects 0.000 description 9
- 238000010295 mobile communication Methods 0.000 description 8
- 238000010586 diagram Methods 0.000 description 6
- 239000006185 dispersion Substances 0.000 description 6
- 238000005516 engineering process Methods 0.000 description 6
- 238000012545 processing Methods 0.000 description 5
- 238000003825 pressing Methods 0.000 description 4
- 230000005236 sound signal Effects 0.000 description 4
- 230000005540 biological transmission Effects 0.000 description 3
- 238000006243 chemical reaction Methods 0.000 description 3
- 238000007726 management method Methods 0.000 description 3
- 230000000712 assembly Effects 0.000 description 2
- 238000000429 assembly Methods 0.000 description 2
- 230000001413 cellular effect Effects 0.000 description 2
- 230000000052 comparative effect Effects 0.000 description 2
- 238000010276 construction Methods 0.000 description 2
- 230000005611 electricity Effects 0.000 description 2
- 238000003909 pattern recognition Methods 0.000 description 2
- 230000003068 static effect Effects 0.000 description 2
- 230000001629 suppression Effects 0.000 description 2
- 230000002123 temporal effect Effects 0.000 description 2
- 101150012579 ADSL gene Proteins 0.000 description 1
- 102100020775 Adenylosuccinate lyase Human genes 0.000 description 1
- 108700040193 Adenylosuccinate lyases Proteins 0.000 description 1
- 241000196324 Embryophyta Species 0.000 description 1
- 241001062009 Indigofera Species 0.000 description 1
- 230000001133 acceleration Effects 0.000 description 1
- 230000009471 action Effects 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- 230000015572 biosynthetic process Effects 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 238000012790 confirmation Methods 0.000 description 1
- 238000012937 correction Methods 0.000 description 1
- 238000005314 correlation function Methods 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 239000000284 extract Substances 0.000 description 1
- 239000009730 ganji Substances 0.000 description 1
- 239000004973 liquid crystal related substance Substances 0.000 description 1
- 230000007774 longterm Effects 0.000 description 1
- 230000014759 maintenance of location Effects 0.000 description 1
- 235000012054 meals Nutrition 0.000 description 1
- 239000002184 metal Substances 0.000 description 1
- 239000000203 mixture Substances 0.000 description 1
- 238000013468 resource allocation Methods 0.000 description 1
- 230000002441 reversible effect Effects 0.000 description 1
- 238000010187 selection method Methods 0.000 description 1
- 238000011895 specific detection Methods 0.000 description 1
- 238000001228 spectrum Methods 0.000 description 1
- 238000005496 tempering Methods 0.000 description 1
- 239000010409 thin film Substances 0.000 description 1
- 230000002463 transducing effect Effects 0.000 description 1
- 230000009466 transformation Effects 0.000 description 1
- 238000012384 transportation and delivery Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
- G06F3/048—Interaction techniques based on graphical user interfaces [GUI]
- G06F3/0487—Interaction techniques based on graphical user interfaces [GUI] using specific features provided by the input device, e.g. functions controlled by the rotation of a mouse with dual sensing arrangements, or of the nature of the input device, e.g. tap gestures based on pressure sensed by a digitiser
- G06F3/0488—Interaction techniques based on graphical user interfaces [GUI] using specific features provided by the input device, e.g. functions controlled by the rotation of a mouse with dual sensing arrangements, or of the nature of the input device, e.g. tap gestures based on pressure sensed by a digitiser using a touch-screen or digitiser, e.g. input of commands through traced gestures
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/20—Image preprocessing
- G06V10/22—Image preprocessing by selection of a specific region containing or referencing a pattern; Locating or processing of specific regions to guide the detection or recognition
- G06V10/235—Image preprocessing by selection of a specific region containing or referencing a pattern; Locating or processing of specific regions to guide the detection or recognition based on user input or interaction
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/08—Text analysis or generation of parameters for speech synthesis out of text, e.g. grapheme to phoneme translation, prosody generation or stress or intonation determination
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- Multimedia (AREA)
- General Physics & Mathematics (AREA)
- Human Computer Interaction (AREA)
- General Engineering & Computer Science (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Acoustics & Sound (AREA)
- Health & Medical Sciences (AREA)
- Computational Linguistics (AREA)
- Telephone Function (AREA)
- Studio Devices (AREA)
Abstract
The invention discloses a kind of word pick device and method, the device includes:Taking module, the first determining module and playing module.Taking module is shot to the object before the camera of itself place terminal under default word pickup mode.Word in captured dynamic image is converted into voice and plays out by playing module.By embodiment of the present invention scheme, word content can be understood by terminal, solve the puzzlement that cannot be read that blind person or amblyopia personnel are brought because of visual problems.
Description
Technical field
The present invention relates to terminal applies field, more particularly to a kind of word pick device and method.
Background technology
At present, blind person or amblyopia personnel are because there is inconvenience in visual problems in life, for example, to dining room have a meal when
Cannot a people ordered dishes by papery menu, when going out cannot oneself viewing public transport stop board etc., currently, with various terminals application
Broad development, how by terminal applies in the life of blind person or amblyopia personnel, help which to solve to bring because of visual problems
Above-mentioned puzzlement, is the problem of person skilled urgent need to resolve.
Content of the invention
Present invention is primarily targeted at a kind of word pick device and method is proposed, word can be understood by terminal
Content, solves the puzzlement that cannot be read that blind person or amblyopia personnel bring because of visual problems.
For achieving the above object, the invention provides a kind of word pick device, the device includes:Taking module and broadcasting
Module.
Taking module, for, under default word pickup mode, entering to the object before the camera of itself place terminal
Row shoots.
Playing module, plays out for the word in captured dynamic image is converted into voice.
Alternatively, the device also includes:Detection module and pattern enter module.
Detection module, for detecting the trigger condition of word pickup mode.
Pattern enters module, for when trigger condition is detected and determine that the trigger condition is effective, entering word pickup
Pattern.
Alternatively, taking module carries out shooting to the object before the camera of itself place terminal includes:
Object before detection camera;Include Word message in wherein relative with camera on object one side.
According to pre-conditioned adjusting focal length.
To include that the middle section of word segment in the object of Word message as shooting focus and is shot.
Alternatively, the device also includes prompting module.
Prompting module, for when Word message is not included in relative with camera one side on the object, sending prompting
Information.
Prompting message includes:The vibration of the motor on predeterminated position.
Alternatively, pre-conditioned including:Word size.
Taking module includes according to pre-conditioned adjusting focal length:
Detection is when the word size in dynamic image under front focal length.
The word size for detecting is compared with default character size.
Keep working as front focal length when the word size for detecting is consistent with default character size.
When the word size for detecting is inconsistent with default character size, the focal length for adjusting camera is first Jiao
Away from making the word size in dynamic image consistent with default character size.
Alternatively, the device also includes:First determining module.
First determining module, for, before according to pre-conditioned adjusting focal length, determining according to the fingerprint size of user pre-
If character size.
Alternatively, according to the fingerprint size of user, the first determining module determines that default character size includes:
Finger print information during collection user's touch terminal screen;Finger print information includes the fingerprint size.
Take the fingerprint from fingerprint size height and width.
Fingerprint height and width are defined as the height of the word in default character size and width.
Alternatively, the device also includes:Second determining module.
Second determining module, to the touch operation of captured dynamic image and determines touch location for detection.
Playing module, is additionally operable to for corresponding word at touch location to be converted into voice and plays out.
Alternatively, the device also includes:Text point determining module.
Text point determining module is used for:
After touch location is determined, the coordinate of touch location is compared with the coordinate of each word in photo, when tactile
Touch position coordinate consistent with the coordinate of any one word in dynamic image when, determine that touch location is corresponding with word;When
When the coordinate of touch location is all inconsistent with the coordinate of each word in dynamic image, determine that touch location is not corresponding with word.
Alternatively, text point determining module is additionally operable to:
After touch location is determined, when not corresponding to word at the touch location, detecting distance current touch location is most
The position that the first near word is located.
Determine the position at the first word place and the relative direction of current touch location.
Preset motor in control respective direction is vibrated.
Alternatively, playing module by corresponding word at touch location be converted into voice play out including:
When touch location is on the straight line that a line word or a row word are located, a line word or a row word are turned
Turn to voice to play out.
Alternatively, the device also includes:Setup module.
Setup module is used for:
Default first spacing will all be kept with longitudinally adjacent word, and in the horizontal on identical straight line
Multiple words as a line word.
Default second spacing will all be kept with transversely adjacent word, and in the vertical on identical straight line
Multiple words as a line word.
Additionally, for achieving the above object, present invention also offers a kind of word pick-up method, the method includes:
Under default word pickup mode, the object before the camera of terminal is shot.
Word in captured dynamic image is converted into voice play out.
Alternatively, the method also includes:
The trigger condition of detection word pickup mode.
When trigger condition is detected and determine that the trigger condition is effective, word pickup mode is entered.
Alternatively, carrying out shooting to the object before the camera of itself place terminal includes:
Object before detection camera;Include Word message in wherein relative with camera on object one side.
According to pre-conditioned adjusting focal length.
To include that the middle section of word segment in the object of Word message as shooting focus and is shot.
Alternatively, the method also includes:
When Word message is not included in relative with camera one side on the object, prompting message is sent.
The prompting message includes:The vibration of the motor on predeterminated position.
Alternatively, pre-conditioned including:Word size.
Included according to pre-conditioned adjusting focal length:
Detection is when the word size in dynamic image under front focal length.
The word size for detecting is compared with default character size.
Keep working as front focal length when the word size for detecting is consistent with default character size.
When the word size for detecting is inconsistent with default character size, the focal length for adjusting camera is first Jiao
Away from making the word size in dynamic image consistent with default character size.
Alternatively, the method also includes:Before according to pre-conditioned adjusting focal length, determined according to the fingerprint size of user
Default character size.
Alternatively, determine that default character size includes according to the fingerprint size of user:
Finger print information during collection user's touch terminal screen;The finger print information includes fingerprint size.
Take the fingerprint from fingerprint size height and width.
Fingerprint height and width are defined as the height of the word in default character size and width.
Alternatively, the method also includes:
Detect the touch operation to captured dynamic image and determine touch location.
Corresponding word at touch location is converted into voice play out.
Alternatively, methods described also includes:
After touch location is determined, the coordinate of touch location is compared with the coordinate of each word in photo, when tactile
Touch position coordinate consistent with the coordinate of any one word in dynamic image when, determine that touch location is corresponding with word;When
When the coordinate of touch location is all inconsistent with the coordinate of each word in dynamic image, determine that touch location is not corresponding with word.
Alternatively, the method also includes:
After touch location is determined, when not corresponding to word at the touch location, detecting distance current touch location is most
The position that the first near word is located.
Determine the position at the first word place and the relative direction of current touch location.
Preset motor in control respective direction is vibrated.
Alternatively, by corresponding word at touch location be converted into voice play out including:
When touch location is on the straight line that a line word or a row word are located, a line word or a row word are turned
Turn to voice to play out.
Alternatively, the method also includes:
Default first spacing will all be kept with longitudinally adjacent word, and in the horizontal on identical straight line
Multiple words as a line word.
Default second spacing will all be kept with transversely adjacent word, and in the vertical on identical straight line
Multiple words as a line word.
The present invention proposes a kind of word pick device and method, and the device includes:Taking module, the first determining module and
Playing module.Taking module is clapped to the object before the camera of itself place terminal under default word pickup mode
Take the photograph.Word in captured dynamic image is converted into voice and plays out by playing module.By embodiment of the present invention scheme,
Word content can be understood by terminal, solve the puzzlement that cannot be read that blind person or amblyopia personnel are brought because of visual problems.
Description of the drawings
Fig. 1 is the hardware architecture diagram for realizing the optional mobile terminal of each embodiment of the present invention one;
Fig. 2 is the wireless communication system schematic diagram of mobile terminal as shown in Figure 1;
Fig. 3 is the word pick device composition frame chart of the embodiment of the present invention;
Fig. 4 is the word pick-up method flow chart of the embodiment of the present invention;
Fig. 5 is the word pick-up method schematic diagram of the embodiment of the present invention;
Fig. 6 is schematic diagram when image too hour user clicks in the word pick-up method of the embodiment of the present invention;
Schematic diagram when Fig. 7 is clicked on for user after being focused in the word pick-up method of the embodiment of the present invention;
Fig. 8 is for but listening, in the word pick-up method of the embodiment of the present invention, the embodiment schematic diagram for reminding motor.
The realization of the object of the invention, functional characteristics and advantage will be described further in conjunction with the embodiments referring to the drawings.
Specific embodiment
It should be appreciated that specific embodiment described herein is not intended to limit the present invention only in order to explain the present invention.
The optional mobile terminal of each embodiment of the present invention one is realized referring now to Description of Drawings.In follow-up description
In, using the suffix for representing such as " module ", " part " or " unit " of element only for being conducive to the explanation of the present invention,
Itself does not have specific meaning.Therefore, " module " mixedly can be used with " part ".
Mobile terminal can be implemented in a variety of manners.For example, the terminal described in the present invention can include such as to move
Phone, smart phone, notebook computer, digit broadcasting receiver, PDA (personal digital assistant), PAD (panel computer), PMP
The consolidating of the mobile terminal of (portable media player), guider etc. and such as numeral TV, desktop computer etc.
Determine terminal.Hereinafter it is assumed that terminal is mobile terminal.However, it will be understood by those skilled in the art that, except being used in particular for movement
Outside the element of purpose, construction according to the embodiment of the present invention can also apply to the terminal of fixed type.
Fig. 1 is that the hardware configuration of the mobile terminal for realizing each embodiment of the present invention is illustrated.
Mobile terminal 1 00 can include wireless communication unit 110, A/V (audio/video) input block 120, user input
Unit 130, sensing unit 140, output unit 150, memory 160, interface unit 170, controller 180 and power subsystem 190
Etc..Fig. 1 shows the mobile terminal with various assemblies, it should be understood that being not required for implementing all groups for illustrating
Part.More or less of component can alternatively be implemented.Will be discussed in more detail below the element of mobile terminal.
Wireless communication unit 110 generally includes one or more assemblies, and which allows mobile terminal 1 00 and wireless communication system
Or the radio communication between network.For example, wireless communication unit can include broadcasting reception module 111, mobile communication module
112nd, at least one of wireless Internet module 113, short range communication module 114 and location information module 115.
Broadcasting reception module 111 receives broadcast singal and/or broadcast via broadcast channel from external broadcast management server
Relevant information.Broadcast channel can include satellite channel and/or terrestrial channel.Broadcast management server can be generated and sent
The broadcast singal generated before the server or reception of broadcast singal and/or broadcast related information and/or broadcast related information
And send it to the server of terminal.Broadcast singal can include TV broadcast singal, radio signals, data broadcasting
Signal etc..And, broadcast singal may further include the broadcast singal combined with TV or radio signals.Broadcast phase
Pass information can also be provided via mobile communications network, and in this case, broadcast related information can be by mobile communication mould
Block 112 is receiving.Broadcast singal can be present in a variety of manners, and for example, which can be with the electronics of DMB (DMB)
The form of program guide (EPG), the electronic service guidebooks (ESG) of digital video broadcast-handheld (DVB-H) etc. and exist.Broadcast
Receiver module 111 can receive signal broadcast by using various types of broadcast systems.Especially, broadcasting reception module 111
Can be wide by using such as multimedia broadcasting-ground (DMB-T), DMB-satellite (DMB-S), digital video
Broadcast-hand-held (DVB-H), forward link media (MediaFLO@) Radio Data System, received terrestrial digital broadcasting integrated service
Etc. (ISDB-T) digit broadcasting system receives digital broadcasting.Broadcasting reception module 111 may be constructed such that and be adapted to provide for extensively
Broadcast the various broadcast systems of signal and above-mentioned digit broadcasting system.Via broadcasting reception module 111 receive broadcast singal and/
Or broadcast related information can be stored in memory 160 (or other types of storage medium).
Mobile communication module 112 sends radio signals to base station (for example, access point, node B etc.), exterior terminal
And at least one of server and/or receive from it radio signal.Such radio signal can be logical including voice
Words signal, video calling signal or the various types of data for sending and/or receiving according to text and/or Multimedia Message.
Wireless Internet module 113 supports the Wi-Fi (Wireless Internet Access) of mobile terminal.The module can be internally or externally
It is couple to terminal.Wi-Fi (Wireless Internet Access) technology involved by the module can include WLAN (WLAN) (Wi-Fi), Wibro
(WiMAX), Wimax (worldwide interoperability for microwave accesses), HSDPA (high-speed downlink packet access) etc..
Short range communication module 114 be for supporting the module of junction service.Some examples of short-range communication technology include indigo plant
ToothTM, RF identification (RFID), Infrared Data Association (IrDA), ultra broadband (UWB), purple honeybeeTMEtc..
Location information module 115 be for check or obtain mobile terminal positional information module.Location information module
Typical case be GPS (global positioning system).According to current technology, GPS module 115 is calculated from three or more satellites
Range information and correct time information and for calculate Information application triangulation, so as to according to longitude, latitude
Three-dimensional current location information is highly accurately calculated.Currently, defended using three for calculating the method for position and temporal information
Star and the error of the position that calculated by using other satellite correction and temporal information.Additionally, GPS module 115
Can be by Continuous plus current location information in real time come calculating speed information.
A/V input block 120 is used for receiving audio or video signal.A/V input block 120 can include 121 He of camera
Microphone 1220,121 pairs of static maps obtained by image capture apparatus in Video Capture pattern or image capture mode of camera
The view data of piece or video is processed.Picture frame after process is may be displayed on display unit 151.At camera 121
Picture frame after reason can be stored in memory 160 (or other storage mediums) or carry out via wireless communication unit 110
Send, two or more cameras 1210 can be provided according to the construction of mobile terminal.Microphone 122 can be in telephone relation mould
Sound (voice data) is received via microphone in formula, logging mode, speech recognition mode etc. operational mode, and can be by
Such acoustic processing is voice data.Audio frequency (voice) data after process can be changed in the case of telephone calling model
For the form output of mobile communication base station can be sent to via mobile communication module 112.Microphone 122 can implement all kinds
Noise eliminate (or suppression) algorithm with eliminate noise that (or suppression) is produced during receiving and sending audio signal or
Person disturbs.
User input unit 130 can generate key input data to control each of mobile terminal according to the order of user input
Plant operation.User input unit 130 allows the various types of information of user input, and can include keyboard, metal dome, touch
Plate (resistance that for example, detection causes due to being touched, pressure, the sensitive component of the change of electric capacity etc.), roller, rocking bar etc.
Deng.Especially, when touch pad is superimposed upon on display unit 151 as a layer, touch-screen can be formed.
Sensing unit 140 detect mobile terminal 1 00 current state, (for example, mobile terminal 1 00 open or close shape
State), the position of mobile terminal 1 00, user for mobile terminal 1 00 the presence or absence of contact (that is, touch input), mobile terminal
100 orientation, the acceleration or deceleration movement of mobile terminal 1 00 and direction etc., and generate for controlling mobile terminal 1 00
The order of operation or signal.For example, when mobile terminal 1 00 is embodied as sliding-type mobile phone, sensing unit 140 can be sensed
The sliding-type phone is opened or is cut out.In addition, sensing unit 140 can detect power subsystem 190 whether provide electric power or
Whether person's interface unit 170 is coupled with external device (ED).Sensing unit 140 can will be combined below including proximity transducer 1410
Touch-screen is being described to this.
Interface unit 170 is connected, as at least one external device (ED), the interface that can pass through with mobile terminal 1 00.For example,
External device (ED) can include wired or wireless head-band earphone port, external power source (or battery charger) port, wired or nothing
Line FPDP, memory card port, the port for being used for device of the connection with identification module, audio input/output (I/O) end
Mouth, video i/o port, ear port etc..Identification module can be stored for verifying user using each of mobile terminal 1 00
Kind of information and subscriber identification module (UIM), client identification module (SIM), Universal Subscriber identification module (USIM) can be included
Etc..In addition, the device (hereinafter referred to as " identifying device ") with identification module can take the form of smart card, therefore, know
Other device can be connected with mobile terminal 1 00 via port or other attachment means.Interface unit 170 can be used for receive from
The input (for example, data message, electric power etc.) of external device (ED) and the input for receiving is transferred in mobile terminal 1 00
One or more elements can be used for transmission data between mobile terminal and external device (ED).
In addition, when mobile terminal 1 00 is connected with external base, interface unit 170 can serve as allowing by which by electricity
Power provides the path of mobile terminal 1 00 from base or can serve as allowing the various command signals from base input to pass through which
It is transferred to the path of mobile terminal.The various command signals being input into from base or electric power may serve as recognizing that mobile terminal is
The no signal being accurately fitted within base.Output unit 150 is configured to defeated with the offer of vision, audio frequency and/or tactile manner
Go out signal (for example, audio signal, vision signal, alarm signal, vibration signal etc.).Output unit 150 can include to show
Unit 151, dio Output Modules 152, alarm unit 153 etc..
Display unit 151 may be displayed on the information processed in mobile terminal 1 00.For example, when mobile terminal 1 00 is in electricity
During words call mode, display unit 151 can show and call or other communicate (for example, text messaging, multimedia files
Download etc.) related user interface (UI) or graphic user interface (GUI).When mobile terminal 1 00 is in video calling pattern
Or during image capture mode, display unit 151 can show the image of capture and/or the image of reception, illustrate video or figure
UI or GUI of picture and correlation function etc..
Meanwhile, when the display unit 151 and touch pad touch-screen with formation superposed on one another as a layer, display unit
151 can serve as input unit and output device.Display unit 151 can include liquid crystal display (LCD), thin film transistor (TFT)
In LCD (TFT-LCD), Organic Light Emitting Diode (OLED) display, flexible display, three-dimensional (3D) display etc. at least
A kind of.Some in these displays may be constructed such that transparence to allow user from outside viewing, and this is properly termed as transparent
Display, typical transparent display can be, for example, TOLED (transparent organic light emitting diode) display etc..According to specific
The embodiment that wants, mobile terminal 1 00 can include two or more display units (or other display devices), for example, move
Dynamic terminal can include outernal display unit (not shown) and inner display unit (not shown).Touch-screen can be used for detection and touch
Input pressure and touch input position and touch input area.
Dio Output Modules 152 can mobile terminal in call signal reception pattern, call mode, logging mode,
When under the isotypes such as speech recognition mode, broadcast reception mode, that wireless communication unit 110 is received or in memory 160
The voice data transducing audio signal of middle storage and it is output as sound.And, dio Output Modules 152 can be provided and movement
The audio output (for example, call signal receives sound, message sink sound etc.) of the specific function correlation that terminal 100 is executed.
Dio Output Modules 152 can include loudspeaker, buzzer etc..
Alarm unit 153 can provide output to notify event to mobile terminal 1 00.Typical event is permissible
Including calling reception, message sink, key signals input, touch input etc..In addition to audio or video is exported, alarm unit
153 can provide output in a different manner with the generation of notification event.For example, alarm unit 153 can be in the form of vibration
Output is provided, when calling, message or some other entrance communication (incomingcommunication) are received, alarm list
Unit 153 can provide tactile output (that is, vibrating) to notify to user.By providing such tactile output, even if
When the mobile phone of user is in the pocket of user, user also can recognize that the generation of various events.Alarm unit 153
The output of the generation of notification event can be provided via display unit 151 or dio Output Modules 152.
Memory 160 can store software program for the process and control operation executed by controller 180 etc., Huo Zheke
Temporarily to store oneself data (for example, telephone directory, message, still image, video etc.) through exporting or will export.And
And, memory 160 can be with storage with regard to the vibration of various modes that exports when touching and being applied to touch-screen and audio signal
Data.
Memory 160 can include the storage medium of at least one type, and the storage medium includes flash memory, hard disk, many
Media card, card-type memory (for example, SD or DX memory etc.), random access storage device (RAM), static random-access storage
Device (SRAM), read-only storage (ROM), Electrically Erasable Read Only Memory (EEPROM), programmable read only memory
(PROM), magnetic storage, disk, CD etc..And, mobile terminal 1 00 can execute memory with by network connection
The network storage device cooperation of 160 store function.
Controller 180 generally controls the overall operation of mobile terminal.For example, controller 180 is executed and voice call, data
The related control of communication, video calling etc. and process.In addition, controller 180 can be included for reproducing (or playback) many matchmakers
The multi-media module 1810 of volume data, multi-media module 1810 can be constructed in controller 180, or it is so structured that and control
Device processed 180 is separated.Controller 180 can be with execution pattern identifying processing, by the handwriting input for executing on the touchscreen or figure
Piece is drawn input and is identified as character or image.
Power subsystem 190 receives external power or internal power under the control of controller 180 and provides operation each unit
Appropriate electric power needed for part and component.
Various embodiments described herein can be to use such as computer software, hardware or its any combination of calculating
Machine computer-readable recording medium is implementing.Hardware is implemented, embodiment described herein can be by using application-specific IC
(ASIC), digital signal processor (DSP), digital signal processing device (DSPD), programmable logic device (PLD), scene can
Program gate array (FPGA), processor, controller, microcontroller, microprocessor, be designed to execute function described herein
At least one in electronic unit implementing, in some cases, can be implemented in controller 180 by such embodiment.
Software is implemented, the embodiment of such as process or function can with allow to execute the single of at least one function or operation
Software module is implementing.Software code can be come by the software application (or program) that is write with any appropriate programming language
Implement, software code can be stored in memory 160 and be executed by controller 180.
So far, oneself is through describing mobile terminal according to its function.Below, for the sake of brevity, will description such as folded form,
Slide type mobile terminal in various types of mobile terminals of board-type, oscillating-type, slide type mobile terminal etc. is used as showing
Example.Therefore, the present invention can be applied to any kind of mobile terminal, and be not limited to slide type mobile terminal.
As shown in Figure 1 mobile terminal 1 00 may be constructed such that using via frame or packet transmission data all if any
Line and wireless communication system and satellite-based communication system are operating.
Referring now to the communication system that Fig. 2 description is wherein operable to according to the mobile terminal of the present invention.
Such communication system can be using different air interfaces and/or physical layer.For example, used by communication system
Air interface includes such as frequency division multiple access (FDMA), time division multiple acess (TDMA), CDMA (CDMA) and universal mobile communications system
System (UMTS) (especially, Long Term Evolution (LTE)), global system for mobile communications (GSM) etc..As non-limiting example, under
The description in face is related to cdma communication system, but such teaching is equally applicable to other types of system.
With reference to Fig. 2, cdma wireless communication system can include multiple mobile terminal 1s 00, multiple base stations (BS) 270, base station
Controller (BSC) 275 and mobile switching centre (MSC) 280.MSC280 is configured to and Public Switched Telephony Network (PSTN)
290 form interface.MSC280 is also structured to form interface with the BSC275 that can be couple to base station 270 via back haul link.
If back haul link can be constructed according to any one in the interface that Ganji knows, the interface includes such as E1/T1, ATM, IP,
PPP, frame relay, HDSL, ADSL or xDSL.It will be appreciated that system can include multiple BSC2750 as shown in Figure 2.
Each BS270 can service one or more subregions (or region), by multidirectional antenna or the day of sensing specific direction
Each subregion that line is covered is radially away from BS270.Or, each subregion can by for diversity reception two or more
Antenna is covered.Each BS270 may be constructed such that the multiple frequency distribution of support, and each frequency is distributed with specific frequency spectrum
(for example, 1.25MHz, 5MHz etc.).
Intersecting that subregion and frequency are distributed can be referred to as CDMA Channel.BS270 can also be referred to as base station transceiver
System (BTS) or other equivalent terms.In this case, term " base station " can be used for broadly representing single
BSC275 and at least one BS270.Base station can also be referred to as " cellular station ".Or, each subregion of specific BS270 can be claimed
For multiple cellular stations.
As shown in Figure 2, broadcast singal is sent to broadcsting transmitter (BT) 295 mobile terminal that operate in system
100.Broadcasting reception module 111 is arranged on to receive the broadcast sent by BT295 at mobile terminal 1 00 as shown in Figure 1
Signal.In fig. 2 it is shown that several global positioning system (GPS) satellite 300.Satellite 300 helps position multiple mobile terminals
At least one of 100.
In fig. 2, multiple satellites 300 are depicted, it is understood that be, it is possible to use any number of satellite obtains useful
Location information.GPS module 115 is generally configured to coordinate with satellite 300 to obtain the positioning that wants letter as shown in Figure 1
Breath.Substitute GPS tracking technique or outside GPS tracking technique, it is possible to use can track the position of mobile terminal other
Technology.In addition, at least one gps satellite 300 can optionally or additionally process satellite dmb transmission.
Used as a typical operation of wireless communication system, BS270 receives the reverse link from various mobile terminal 1s 00
Signal.Mobile terminal 1 00 generally participates in call, information receiving and transmitting and other types of communication.Each of the reception of certain base station 270 is anti-
Processed in specific BS270 to link signal.The data of acquisition are forwarded to the BSC275 of correlation.BSC provides call
Resource allocation and the mobile management function of the coordination including the soft switching process between BS270.BSC275 is also by the number for receiving
According to being routed to MSC280, which provides the extra route service for forming interface with PSTN290.Similarly, PSTN290 with
MSC280 forms interface, and MSC and BSC275 form interface, and BSC275 correspondingly controls BS270 with by forward link signals
It is sent to mobile terminal 1 00.
Based on above-mentioned optional mobile terminal hardware configuration and communication system, each embodiment of the inventive method is proposed.
As shown in figure 3, first embodiment of the invention proposes a kind of word pick device 1, the device includes:Taking module
01 and playing module 02.
Taking module 01, under default word pickup mode, to the object before the camera of itself place terminal
Shot.
Playing module 02, plays out for the word in captured dynamic image is converted into voice.
Alternatively, the device also includes:Detection module 03 and pattern enter module 04.
Detection module 03, for detecting the trigger condition of word pickup mode.
Pattern enters module 04, picks up for when trigger condition is detected and determine that the trigger condition is effective, entering word
Delivery formula.
Alternatively, taking module 01 carries out shooting to the object before the camera of itself place terminal includes:
Object before detection camera;Include Word message in wherein relative with camera on object one side.
According to pre-conditioned adjusting focal length.
To include that the middle section of word segment in the object of Word message as shooting focus and is shot.
Alternatively, the device also includes prompting module 05.
Prompting module, for when Word message is not included in relative with camera one side on the object, sending prompting
Information.
Prompting message includes:The vibration of the motor on predeterminated position.
Alternatively, pre-conditioned including:Word size.
Taking module 01 includes according to pre-conditioned adjusting focal length:
Detection is when the word size in dynamic image under front focal length.
The word size for detecting is compared with default character size.
Keep working as front focal length when the word size for detecting is consistent with default character size.
When the word size for detecting is inconsistent with default character size, the focal length for adjusting camera is first Jiao
Away from making the word size in dynamic image consistent with default character size.
Alternatively, the device also includes:First determining module 06.
First determining module 06, for, before according to pre-conditioned adjusting focal length, determining according to the fingerprint size of user
Default character size.
Alternatively, according to the fingerprint size of user, the first determining module 06 determines that default character size includes:
Finger print information during collection user's touch terminal screen;Finger print information includes the fingerprint size.
Take the fingerprint from fingerprint size height and width.
Fingerprint height and width are defined as the height of the word in default character size and width.
Alternatively, the device also includes:Second determining module 07.
Second determining module 07, to the touch operation of captured dynamic image and determines touch location for detection.
Playing module 02, is additionally operable to for corresponding word at touch location to be converted into voice and plays out.
Alternatively, the device also includes:Text point determining module 08.
Text point determining module 08 is used for:
After touch location is determined, the coordinate of touch location is compared with the coordinate of each word in dynamic image,
When the coordinate of touch location is consistent with the coordinate of any one word in dynamic image, determine that touch location is relative with word
Should;When the coordinate of touch location is all inconsistent with the coordinate of each word in photo, determine that touch location is not corresponding with word.
Alternatively, text point determining module 08 is additionally operable to:
After touch location is determined, when not corresponding to word at the touch location, detecting distance current touch location is most
The position that the first near word is located.
Determine the position at the first word place and the relative direction of current touch location.
Preset motor in control respective direction is vibrated.
Alternatively, playing module 03 by corresponding word at touch location be converted into voice play out including:
When touch location is on the straight line that a line word or a row word are located, a line word or a row word are turned
Turn to voice to play out.
Alternatively, the device also includes:Setup module 09.
Setup module 09 is used for:
Default first spacing will all be kept with longitudinally adjacent word, and in the horizontal on identical straight line
Multiple words as a line word.
Default second spacing will all be kept with transversely adjacent word, and in the vertical on identical straight line
Multiple words as a line word.
Additionally, for achieving the above object, present invention also offers a kind of word pick-up method, as shown in Figure 4, Figure 5, the party
Method includes S101-S102:
S101, under default word pickup mode, the object before the camera of terminal is shot.
In embodiments of the present invention, in order to help blind person or amblyopia personnel to read on the various objects such as paper, gravestone, licence plate
Word, understand word content in order to which, embodiment of the present invention scheme the preterminal object can be carried out by terminal-pair
Shoot, and the Word message in captured dynamic image is caught, the Word message is played back with speech form, solve
The puzzlement that cannot be read that blind person or amblyopia personnel are brought because of visual problems.
In embodiments of the present invention, in order to take pictures or shooting action is distinguished with general, embodiment of the present invention scheme is needed
To complete under default pattern, word pickup mode described above, the Word Input pattern is used for the camera by terminal
Object of the one side relative with camera before camera comprising Word message is found, and the object is shot, will
Word message in the dynamic image of shooting changes into voice messaging and plays out.It should be noted that carrying out speech play
When be not limited to Word message, can also be digital information, symbolic information etc..And above-mentioned dynamic image can be by shooting
The video image that head shoots out, or the real-time dynamic image that camera is captured during shooting.
In embodiments of the present invention, the word pickup mode can be entered by below scheme.
Alternatively, the method also includes S201-S202:
S201, the trigger condition of detection word pickup mode.Wherein, the trigger condition includes finger manipulation and/or voice
Order.
In embodiments of the present invention, terminal can detect the trigger condition of Message Processing pattern in real time or periodically.
In addition, in order to save terminal resource, the trigger condition can also be obtained by way of message informing, for example, when default pressure
(case includes hardware button and software for force snesor, fingerprint identification device, scanning means, speech recognition equipment and button
The button of form) etc. message of giving notice when detecting certain finger manipulation or voice command, so as to the terminal check finger exercise
Make or voice command be whether word pickup mode trigger condition.It should be noted that the trigger condition can include but not
It is limited to finger manipulation and/or voice command.In various embodiments, the trigger condition could be arranged to any one can be real
The operation that applies or order etc..For example, the trigger condition can also be a kind of gesture high up in the air, by default close sensing in terminal
Device is detecting to the gesture high up in the air.
S202, when trigger condition is detected and determine that the trigger condition is effective, enter word pickup mode.
In embodiments of the present invention, after being detected to the trigger condition of word pickup mode by step S201, also
It needs to be determined that the validity of the trigger condition.For example, when on the triggering button for detecting some default word pickup mode
Pressing operations when, need to detect the duration of the pressing operations, when the duration of pressing operations is less than or equal to default
Time threshold when then can determine that the pressing operations are invalid, i.e. the trigger condition of word pickup mode is invalid.Again for example, when pre-
If proximity transducer detect triggering Message Processing pattern gesture high up in the air when, if the retention time of the gesture high up in the air be less than
Or be equal to default time threshold, then equally can determine that the gesture high up in the air is invalid, i.e. the trigger condition nothing of word pickup mode
Effect.By the scheme of the embodiment of the present invention, the generation of maloperation can be effectively prevented.
In embodiments of the present invention, when the trigger condition that determination is detected is effective, just can be entered with triggering terminal default
Word pickup mode.Under the word pickup mode, user can be shot to preterminal object, so that terminal will
Word in the dynamic image of shooting is converted into voice, is easy to play to terminal use.
In embodiments of the present invention, preterminal object can be shot by below scheme.
Alternatively, the object before the camera of terminal is shot including S301-S302:
Object before S301, detection camera;Include Word message in wherein relative with camera on object one side.
In embodiments of the present invention, due to default word pickup mode primarily to the word in dynamic image is entered
Row is extracted, and is played out so as to problem is converted into voice.Therefore, under word pickup mode, can examine when terminal is shot
The photographed scene for surveying terminal includes the object of Word message.In embodiments of the present invention, can be by default image recognition
System completes the detection to word and identification process.
Alternatively, the method also includes:When Word message is not included in relative with camera one side on the object, send out
Go out prompting message.The prompting message includes:The vibration of the motor on predeterminated position.
In embodiments of the present invention, before terminal taking, when terminal is not detected by the presence of bag in the current scene of terminal
During object containing Word message, in order to remind user's conversion photographed scene, especially reminding blind or amblyopia personnel, can send
Default prompting message.It should be noted that the prompting message can include one or more of:The tinkle of bells, music, voice,
Vibration, flash lamp.For example, it is possible to the motor of control terminal predetermined position produces vibration.Because one can be included in terminal
Or multiple motors, different positions are respectively arranged at, so that different functions are realized, when preterminal object does not include Word message
When, only make the motor in a certain precalculated position produce vibration, so as to reach the purpose for reminding user.The predeterminated position can be terminal
On optional position, as long as facilitate user perceive motor vibration.
S302, according to pre-conditioned adjusting focal length.
In embodiments of the present invention, after the object comprising Word message of the terminal in photographed scene is captured, need
Terminal camera is focused according to default condition, meet pre-conditioned dynamic image to shoot.
Alternatively, this pre-conditioned including:Word size.
In embodiments of the present invention, the word size adjustment of word segment can be worth to suitable by focusing, so as to
User prevents the too little phenomenon generation for causing to click on mistake of the word in photo, especially for blind person and amblyopia when clicking on
For personnel, (for example, think when need not directly listen to the voice messaging being directly translated into by the Word message in dynamic image
When tempering finger and touching ability), the click of finger can be relied on determine the word that chooses and listen attentively to the content of the word,
In the case of can't see or not seeing the word in dynamic image, if its word is too little, user easily clicks on mistake always, such as
Shown in Fig. 6, this certainly will bring very poor experience sense for user.Accordingly, it would be desirable to first focused before shooting, to shoot
The satisfactory word size of dynamic image, facilitate user to click on.
Alternatively, the method also includes:Before according to pre-conditioned adjusting focal length, determined according to the fingerprint size of user
Default character size.
In embodiments of the present invention, according to the above, before shooting, need to predefine the word size
Standard, so as to terminal when being focused directly using the default value as the foundation that focuses.Due to predefining dynamic image
In word size standard be in order to avoid word too little cause click on mistake phenomenon occur, dynamic image when word is too big
The interior word that can be accommodated is very little.Therefore, in embodiments of the present invention, can be determined according to the size of user's finger or size
The standard of the word size of dynamic image.Specifically, can be realized by below scheme.
Alternatively, determine that default character size includes S401-S402 according to the fingerprint size of user:
Finger print information when S401, collection user's touch terminal screen;The finger print information includes fingerprint size.
In embodiments of the present invention, terminal can be according to the history service condition of user in user's once touch terminal screen
When gather and preserve the finger print information of user, it is also possible to gather the fingerprint letter of user under default finger print information drainage pattern
Breath, and therefrom obtain the dimension information of fingerprint.
S402, take the fingerprint from fingerprint size height and width.
In embodiments of the present invention, the fingerprint size of user includes height and the width of fingerprint.In the embodiment of the present invention
In scheme, fingerprint highly refers to longitudinally go up the distance between outline line maximum in the fingerprint profile for obtaining;Fingerprint width is referred to
Transversely the distance between outline line maximum in the fingerprint profile of acquisition.Fingerprint due to acquisition when carrying out fingerprint recognition every time
Profile can not possibly be identical, therefore can obtain a fingerprint height and width by way of multi collect is averaged
Mean value is used as fingerprint height and the standard value of width.In addition, in order that shoot when obtain sufficiently large word size, can
Therefrom to select a maximum after multi collect as fingerprint height and the standard value of width.
S403, by fingerprint height and width be defined as in default character size word height and width.
In embodiments of the present invention, after obtaining the standard value of fingerprint height and width, just can be by the finger of the standard
Line height and width are used as the standard for determining character size.For example, directly using fingerprint height and width as default word
Word height and width in size, or using as default character size after fingerprint height and the default ratio of width expansion
In word height and width.The such as preset ratio can be 1%, 5% etc..Here the preset ratio can not be arranged too
Greatly, so as not to word excessive cause photo accommodate word very little.In addition, when word size is determined, can be without while determining
The word height gone out in word size and width, can determine one of which according to the touch of user custom.For example, use
Family custom finger is laterally touched, then can only determine the width of word;User's custom finger is longitudinally touched, then can only determine text
The height of word.
By above scheme, when just can obtain shooting, the standard of word size, is carried out to camera according to the standard
Focusing just can obtain the word dynamic image of suitable user.
In embodiments of the present invention, when being focused according to word size, specifically can complete to adjust by below scheme
Burnt work.
Alternatively, S401-S404 is included according to pre-conditioned adjusting focal length:
S401, detection are when the word size in dynamic image under front focal length.
In embodiments of the present invention, before being focused according to default word size, first can detect when under front focal length
Word size in the dynamic image that camera is obtained, to judge whether the word size has met the word of default standard
Size, and be easy to be adjusted dynamic image according to current character size.In embodiments of the present invention, for working as front focal length
The detection of the word size in lower dynamic image again may be by default pattern recognition device to be carried out image recognition to realize.
S402, the word size for detecting is compared with default character size.
In embodiments of the present invention, detect after the word size in dynamic image under front focal length, by by this article
Word size is obtained when the specifying information of the word size in dynamic image under front focal length compared with default character size, and
Following process is carried out respectively for different comparative results.
S403, when the word size for detecting is consistent with default character size keep work as front focal length.
In embodiments of the present invention, when the word size for detecting is consistent with default character size, that is, detect
When word size or measures of dispersion identical with default character size is less than or equal to default measures of dispersion threshold value, both can be by
When front focal length is used as shooting focal length.
S404, when the word size for detecting is inconsistent with default character size, the focal length for adjusting camera is the
One focal length, makes the word size in dynamic image consistent with default character size.
In embodiments of the present invention, when the word size for detecting is inconsistent with default character size, that is, detect
Word size differed with default character size completely, and measures of dispersion more than default measures of dispersion threshold value when, then permissible
To being adjusted when front focal length so that the word size in dynamic image is consistent with default character size, and will adjustment
The first focal length in focal length afterwards, i.e. embodiment of the present invention scheme is defined as shooting focal length.
S303, will include that the middle section of word segment in the object of Word message focus will be shot as shooting.
In embodiments of the present invention, after determining the focal length of camera, in order that the dynamic image that shoots is with word
Based on part, the middle section of word segment in the object of Word message can be included as shooting focus.
In embodiments of the present invention, suitable shooting focal length and focus just can be obtained by above adjustment, according to this
Focal length and focus are shot the character image that just can obtain suitable user.
Alternatively, detect the touch operation to captured dynamic image and determine touch location;By at touch location pair
The word that answers is converted into voice and plays out.
In embodiments of the present invention, after carrying out dynamic image shooting by above scheme, user just can be dynamic according to this
State image obtains the Word message in photo.
It should be noted that terminal can extract the Word message in dynamic image by pattern recognition device, and will carry
The Word message for taking is arranged according to the position in dynamic image, the final electronics shape for obtaining Word message in dynamic image
Formula.After the electronic form for obtaining the Word message, directly the Word message of the electronic form can be converted into voice letter
Breath is played back, it is also possible to turned corresponding word after touch operation of the user to the dynamic image on terminal screen is detected
Turn to speech play out.Specifically, dynamic image can show that, on the interface of terminal, user can be to terminal after shooting and finishing
Dynamic image on interface is touched or is clicked on etc. operation, and the terminal-pair touch or clicking operation are detected, and are determined tactile
The position that touches or click on, to determine its corresponding word according to the position, as shown in Figure 7.
In embodiments of the present invention, can be completed using detection method, algorithm and the device that can arbitrarily implement above-mentioned
Detection scheme, is not limited for specific detection method, algorithm and device.
In embodiments of the present invention, due to, for blind person or amblyopia personnel, can't see or not see screen Dynamic Graph
As upper particular location, therefore touched position is likely to when touching and does not have word.In such a case, it is possible to pass through
Below scheme is determined with the presence or absence of word at touch location.
Alternatively, the method also includes:
After touch location is determined, the coordinate of touch location is compared with the coordinate of each word in dynamic image,
When the coordinate of touch location is consistent with the coordinate of any one word in dynamic image, determine that touch location is relative with word
Should;When the coordinate of touch location is all inconsistent with the coordinate of each word in dynamic image, determine touch location with word not
Corresponding.
In embodiments of the present invention, as terminal can be left according to screen to the word in the dynamic image of display on screen
Side determines the coordinate of each word respectively.In the same manner, terminal is it may also be determined that the concrete coordinate of the touch location of user, therefore, whole
The coordinate of user touch location can be compared by end with each word coordinate, when the two coordinates are consistent, touch location is described
Corresponding with word, i.e., touch location falls on word, when the two coordinates are inconsistent, illustrates that touch location is not right with word
Should, i.e., touch location is not fallen within word.It should be noted that in embodiment of the present invention scheme, unanimously referring to identical
Or measures of dispersion is less than or equal to default discrepancy threshold, inconsistent refer to differ completely or measures of dispersion is more than default difference
Different threshold value.
Alternatively, the method also includes S501-S502:
S501, after touch location is determined, when not corresponding to word at the touch location, detecting distance current touch position
Put the position that the first nearest word is located.
In embodiments of the present invention, in the case of not corresponding to word at touch location, terminal-pair is needed for giving
Remind, carry out the adjustment of touch location so as to user in time.In embodiment of the present invention scheme, terminal can first detect distance
The nearest word of current touch location, and word position on a terminal screen is determined, will so as to user guided user
Finger moves to corresponding position, as shown in Figure 8.Concrete guide scheme can be realized by following proposal.
S502, determine the first word be located position and current touch location relative direction.
In embodiments of the present invention, when determining the word nearest apart from current touch location, the such as embodiment of the present invention
After the positional information of the first word in scheme, such as the first word coordinate on a terminal screen, just can determine that and works as
The relative direction of the position of front touch location and first word, for example, ten o'clock direction.
Preset motor in S503, control respective direction is vibrated.
In embodiments of the present invention, can multiple directions be set in advance in terminal and motor is indicated, in step S502 really
After making the relative direction of the first word and current touch location, the preset motor in respective direction just can be controlled to be shaken
Dynamic, to guide user's next step to need the direction of adjustment.In embodiment of the present invention scheme, the particular location of the motor is really
Surely can be the horse determined by the relative direction extension along the first word and current touch location with terminal screen center as starting point
Reach, as shown in Figure 8.
It should be noted that such scheme can also be not limited to using other guide schemes in other embodiments.Example
Such as, user can be given by way of voice message guide, for example, " please be moved to the left ", " please move up ".In the present invention
Direction in embodiment, when the left side is terminal screen user oriented, indicated by abscissa negative direction;The left side be terminal screen towards
Direction during user, indicated by abscissa positive direction;When top is terminal screen user oriented, indicated by ordinate positive direction
Direction;When being terminal screen user oriented below, the direction indicated by ordinate negative direction.
S102, corresponding word at touch location is converted into voice plays out.
In embodiments of the present invention, after detecting the word of user touch place by above scheme, or user is guided
After touching word, just corresponding word at touch location can be converted into voice messaging and play out.Need explanation
, as Word message is converted into the technology that voice messaging has been comparative maturity, will not be described here, and for selection
Method for transformation, algorithm, software and device etc. be all not specifically limited.
In addition, the conversion process for the word in dynamic image to voice can be direct when word dynamic image is obtained
Carry out, i.e., directly carry out in shooting process, it is also possible to carried out after the word touched by user is determined again, concrete mode can
With the application scenarios self-defining according to user, this is not restricted.
In embodiments of the present invention, directly the Word message in dynamic image is converted when word dynamic image is got
During for voice messaging, can be directly to the word in dynamic image according to preset order, such as from top to bottom and/or from left to right
Order carry out speech play, it is also possible to according to such scheme, carry out speech play when user touches corresponding word.In order to
It is suitable for the choosing at random of two kinds of broadcast modes, corresponding play mode can be pre-set, for example, selects play mode and automatically
Play mode.Under play mode is selected, need to detect the touch operation of user, so as to enter corresponding word at touch location
Row is played.Under automatic play mode, speech play can also be carried out to the word in dynamic image according to preset order automatically.
In addition, under above-mentioned selection play mode, in order that user quickly understands the word content in dynamic image,
Playing efficiency is improved, following player method can also be adopted.
Alternatively, by corresponding word at touch location be converted into voice play out including:
When touch location is on the straight line that a line word or a row word are located, a line word or a row word are turned
Turn to voice to play out.
In embodiments of the present invention, when detecting the corresponding word of user institute touch location in a row or column word
When, directly the content corresponding to the row or the row word can be played to user.In addition, if the style of writing word has adjacent one
Row or multline text, can issue the user with prompting, for example, voice reminder, remind whether user needs to continue to play next line
Or the word content of lastrow.In the same manner, if the style of writing word has adjacent one or more columns per page word, it is also possible to issue the user with
Remind, remind whether user needs the word content for continuing to play next column or previous column.User can adopt voice confirmation side
Formula, or this default operation acknowledgement mode fed back to the prompting.Terminal plays next line or next column according to feedback result
Word content, or stop playing.
In embodiments of the present invention, before the word in terminal-pair a row or column is identified, need terminal right in advance
The concept of a row or column is defined, and is pre-defined according to this so as to terminal and goes to be confirmed whether there is a row or column word.
Specifically can be realized by below scheme.
Alternatively, the method also includes:
Default first spacing will all be kept with longitudinally adjacent word, and in the horizontal on identical straight line
Multiple words as a line word.
Default second spacing will all be kept with transversely adjacent word, and in the vertical on identical straight line
Multiple words as a line word.
In embodiments of the present invention, terminal can be examined with the distance of adjacent word to each word in dynamic image
Survey, and the coordinate of each word is can determine, which word the coordinate value according to each word determines point-blank.
Therefore, based on above-mentioned termination function, and the concept according to row and column, just can determine that a line word i.e. with longitudinally adjacent
Word all keep default first spacing, and multiple words on identical straight line in the horizontal;One row word is
Default second spacing, and in the vertical multiple texts in identical straight line on are all kept with transversely adjacent word
Word.
In embodiments of the present invention, the concrete numerical value for the first spacing in such scheme and the second spacing is not limited
System.First spacing and the second spacing can be different numerical value according to different application scenarios.
So far, whole essential characteristics of the embodiment of the present invention that is over just are introduced, it should be noted that the above is all this
One or more specific embodiments of inventive embodiments scheme, in other embodiments can also be using other embodiment party
Formula, any and same or analogous embodiment of the embodiment of the present invention, and any group of the essential characteristic of the embodiment of the present invention
Close, all within the protection domain of the embodiment of the present invention.
The present invention proposes a kind of word pick device and method, and the device includes:Taking module and playing module.Shoot
Module is shot to the object before the camera of itself place terminal under default word pickup mode.Playing module will
Word in captured dynamic image is converted into voice and plays out.By embodiment of the present invention scheme, terminal can be passed through
Understand word content, solve the puzzlement that cannot be read that blind person or amblyopia personnel are brought because of visual problems.
It should be noted that herein, term " including ", "comprising" or its any other variant are intended to non-row
The including of his property, so that a series of process including key elements, method, article or device not only include those key elements, and
And also include other key elements being not expressly set out, or also include intrinsic for this process, method, article or device institute
Key element.In the absence of more restrictions, the key element for being limited by sentence "including a ...", it is not excluded that including to be somebody's turn to do
Also there is other identical element in the process, method of key element, article or device.
The embodiments of the present invention are for illustration only, do not represent the quality of embodiment.
Through the above description of the embodiments, those skilled in the art can be understood that above-described embodiment side
Method can add the mode of required general hardware platform by software to realize, naturally it is also possible to by hardware, but in many cases
The former is more preferably embodiment.Based on such understanding, technical scheme is substantially done to prior art in other words
The part for going out contribution can be embodied in the form of software product, and the computer software product is stored in a storage medium
In (as ROM/RAM, magnetic disc, CD), use so that a station terminal equipment including some instructions (can be mobile phone, computer, clothes
Business device, air-conditioner, or network equipment etc.) execute method described in each embodiment of the present invention.
The preferred embodiments of the present invention are these are only, the scope of the claims of the present invention is not thereby limited, every using this
Equivalent structure or equivalent flow conversion that bright specification and accompanying drawing content are made, or directly or indirectly it is used in other related skills
Art field, is included within the scope of the present invention.
Claims (10)
1. a kind of word pick device, it is characterised in that described device includes:Taking module playing module;
The taking module, for, under default word pickup mode, entering to the object before the camera of itself place terminal
Row shoots;
The playing module, plays out for the word in captured dynamic image is converted into voice.
2. word pick device as claimed in claim 1, it is characterised in that described device also includes:Detection module and pattern
Enter module;
The detection module, for detecting the trigger condition of the word pickup mode;
The pattern enters module, for when the trigger condition is detected and determine that the trigger condition is effective, entering institute
State word pickup mode.
3. word pick device as claimed in claim 1, it is characterised in that the taking module is taken the photograph to itself place terminal
Include as the object before head carries out shooting:
Detect the object before the camera;Believe including word in the one side relative with the camera on wherein described object
Breath;
According to pre-conditioned adjusting focal length;
The middle section of word segment in the object including Word message as shooting focus and is shot.
4. word pick device as claimed in claim 1, it is characterised in that described device also includes:Second determining module;
Second determining module, to the touch operation of captured dynamic image and determines touch location for detection;
The playing module, is additionally operable to for corresponding word at the touch location to be converted into voice and plays out.
5. word pick device as claimed in claim 4, it is characterised in that described device also includes:Text point determines mould
Block;
The text point determining module is used for:
After the touch location is determined, the coordinate of the touch location is compared with the coordinate of each word in photo,
When the coordinate of the touch location is consistent with the coordinate of any one word in the dynamic image, the touch location is determined
Corresponding with word;When the coordinate of the touch location is all inconsistent with the coordinate of each word in the dynamic image, really
The fixed touch location is not corresponding with word.
6. a kind of word pick-up method, it is characterised in that methods described includes:
Under default word pickup mode, the object before the camera of terminal is shot;
Word in captured dynamic image is converted into voice play out.
7. word pick-up method as claimed in claim 6, it is characterised in that methods described also includes:
Detect the trigger condition of the word pickup mode;
When the trigger condition is detected and determine that the trigger condition is effective, the word pickup mode is entered.
8. word pick-up method as claimed in claim 6, it is characterised in that before the camera to itself place terminal
Object carries out shooting to be included:
Detect the object before the camera;Believe including word in the one side relative with the camera on wherein described object
Breath;
According to pre-conditioned adjusting focal length;
The middle section of word segment in the object including Word message as shooting focus and is shot.
9. word pick-up method as claimed in claim 6, it is characterised in that methods described also includes:
Detect the touch operation to captured dynamic image and determine touch location;
Corresponding word at the touch location is converted into voice play out.
10. word pick device as claimed in claim 9, it is characterised in that methods described also includes:
After the touch location is determined, the coordinate of the touch location is compared with the coordinate of each word in photo,
When the coordinate of the touch location is consistent with the coordinate of any one word in the dynamic image, the touch location is determined
Corresponding with word;When the coordinate of the touch location is all inconsistent with the coordinate of each word in the dynamic image, really
The fixed touch location is not corresponding with word.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610884064.1A CN106484297B (en) | 2016-10-10 | 2016-10-10 | Character picking device and method |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610884064.1A CN106484297B (en) | 2016-10-10 | 2016-10-10 | Character picking device and method |
Publications (2)
Publication Number | Publication Date |
---|---|
CN106484297A true CN106484297A (en) | 2017-03-08 |
CN106484297B CN106484297B (en) | 2020-03-27 |
Family
ID=58270671
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201610884064.1A Active CN106484297B (en) | 2016-10-10 | 2016-10-10 | Character picking device and method |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN106484297B (en) |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107885430A (en) * | 2017-11-07 | 2018-04-06 | 广东欧珀移动通信有限公司 | A kind of audio frequency playing method, device, storage medium and electronic equipment |
CN108875694A (en) * | 2018-07-04 | 2018-11-23 | 百度在线网络技术(北京)有限公司 | Speech output method and device |
CN112925419A (en) * | 2021-03-31 | 2021-06-08 | 读书郎教育科技有限公司 | Result screening method based on flat-plate fingertip word searching |
-
2016
- 2016-10-10 CN CN201610884064.1A patent/CN106484297B/en active Active
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107885430A (en) * | 2017-11-07 | 2018-04-06 | 广东欧珀移动通信有限公司 | A kind of audio frequency playing method, device, storage medium and electronic equipment |
CN107885430B (en) * | 2017-11-07 | 2020-07-24 | Oppo广东移动通信有限公司 | Audio playing method and device, storage medium and electronic equipment |
CN108875694A (en) * | 2018-07-04 | 2018-11-23 | 百度在线网络技术(北京)有限公司 | Speech output method and device |
CN112925419A (en) * | 2021-03-31 | 2021-06-08 | 读书郎教育科技有限公司 | Result screening method based on flat-plate fingertip word searching |
Also Published As
Publication number | Publication date |
---|---|
CN106484297B (en) | 2020-03-27 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN104808944B (en) | Touch operation inducing method and device | |
CN103685724B (en) | Mobile terminal and control method thereof | |
CN106504280A (en) | A kind of method and terminal for browsing video | |
CN106550146A (en) | A kind of chat message dispensing device and method | |
CN106210328A (en) | Information display device and method | |
CN102281349A (en) | Mobile terminal and controlling method thereof | |
CN106130734A (en) | The control method of mobile terminal and control device | |
CN104284085A (en) | Electronic device and method of operating the same | |
CN106603829A (en) | Screen capture method and mobile terminal | |
CN106843724A (en) | A kind of mobile terminal screen anti-error-touch device and method, mobile terminal | |
CN106775372A (en) | A kind of display adjusting method of suspension procedure disk, device and terminal | |
CN106157970A (en) | A kind of audio identification methods and terminal | |
CN106708321A (en) | Touch screen touch method and device and terminal | |
CN106527953A (en) | Mobile terminal and frame gesture operation method | |
CN106406737A (en) | A screen operating method and device and a mobile terminal | |
CN105739873A (en) | Screen capturing method and terminal | |
CN105786384A (en) | Device and method for adjusting focal point | |
CN107145272A (en) | A kind of icon hiding display terminal and method | |
CN106131274A (en) | Mobile terminal control device and method | |
CN106791155A (en) | A kind of volume adjustment device, volume adjusting method and mobile terminal | |
CN106648324A (en) | Hidden icon operating method, device and terminal | |
CN109240579A (en) | A kind of touch operation method, equipment and computer can storage mediums | |
CN106484297A (en) | A kind of word pick device and method | |
CN106357936A (en) | Control method of map application and mobile terminal | |
CN104731484B (en) | The method and device that picture is checked |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |