CN106484297A

CN106484297A - A kind of word pick device and method

Info

Publication number: CN106484297A
Application number: CN201610884064.1A
Authority: CN
Inventors: 李光宇; 王猛
Original assignee: Nubia Technology Co Ltd
Current assignee: Nubia Technology Co Ltd
Priority date: 2016-10-10
Filing date: 2016-10-10
Publication date: 2017-03-08
Anticipated expiration: 2036-10-10
Also published as: CN106484297B

Abstract

The invention discloses a kind of word pick device and method, the device includes：Taking module, the first determining module and playing module.Taking module is shot to the object before the camera of itself place terminal under default word pickup mode.Word in captured dynamic image is converted into voice and plays out by playing module.By embodiment of the present invention scheme, word content can be understood by terminal, solve the puzzlement that cannot be read that blind person or amblyopia personnel are brought because of visual problems.

Description

A kind of word pick device and method

Technical field

The present invention relates to terminal applies field, more particularly to a kind of word pick device and method.

Background technology

At present, blind person or amblyopia personnel are because there is inconvenience in visual problems in life, for example, to dining room have a meal when Cannot a people ordered dishes by papery menu, when going out cannot oneself viewing public transport stop board etc., currently, with various terminals application Broad development, how by terminal applies in the life of blind person or amblyopia personnel, help which to solve to bring because of visual problems Above-mentioned puzzlement, is the problem of person skilled urgent need to resolve.

Content of the invention

Present invention is primarily targeted at a kind of word pick device and method is proposed, word can be understood by terminal Content, solves the puzzlement that cannot be read that blind person or amblyopia personnel bring because of visual problems.

For achieving the above object, the invention provides a kind of word pick device, the device includes：Taking module and broadcasting Module.

Taking module, for, under default word pickup mode, entering to the object before the camera of itself place terminal Row shoots.

Playing module, plays out for the word in captured dynamic image is converted into voice.

Alternatively, the device also includes：Detection module and pattern enter module.

Detection module, for detecting the trigger condition of word pickup mode.

Pattern enters module, for when trigger condition is detected and determine that the trigger condition is effective, entering word pickup Pattern.

Alternatively, taking module carries out shooting to the object before the camera of itself place terminal includes：

Object before detection camera；Include Word message in wherein relative with camera on object one side.

According to pre-conditioned adjusting focal length.

To include that the middle section of word segment in the object of Word message as shooting focus and is shot.

Alternatively, the device also includes prompting module.

Prompting module, for when Word message is not included in relative with camera one side on the object, sending prompting Information.

Prompting message includes：The vibration of the motor on predeterminated position.

Alternatively, pre-conditioned including：Word size.

Taking module includes according to pre-conditioned adjusting focal length：

Detection is when the word size in dynamic image under front focal length.

The word size for detecting is compared with default character size.

Keep working as front focal length when the word size for detecting is consistent with default character size.

When the word size for detecting is inconsistent with default character size, the focal length for adjusting camera is first Jiao Away from making the word size in dynamic image consistent with default character size.

Alternatively, the device also includes：First determining module.

First determining module, for, before according to pre-conditioned adjusting focal length, determining according to the fingerprint size of user pre- If character size.

Alternatively, according to the fingerprint size of user, the first determining module determines that default character size includes：

Finger print information during collection user's touch terminal screen；Finger print information includes the fingerprint size.

Take the fingerprint from fingerprint size height and width.

Fingerprint height and width are defined as the height of the word in default character size and width.

Alternatively, the device also includes：Second determining module.

Second determining module, to the touch operation of captured dynamic image and determines touch location for detection.

Playing module, is additionally operable to for corresponding word at touch location to be converted into voice and plays out.

Alternatively, the device also includes：Text point determining module.

Text point determining module is used for：

After touch location is determined, the coordinate of touch location is compared with the coordinate of each word in photo, when tactile Touch position coordinate consistent with the coordinate of any one word in dynamic image when, determine that touch location is corresponding with word；When When the coordinate of touch location is all inconsistent with the coordinate of each word in dynamic image, determine that touch location is not corresponding with word.

Alternatively, text point determining module is additionally operable to：

After touch location is determined, when not corresponding to word at the touch location, detecting distance current touch location is most The position that the first near word is located.

Determine the position at the first word place and the relative direction of current touch location.

Preset motor in control respective direction is vibrated.

Alternatively, playing module by corresponding word at touch location be converted into voice play out including：

When touch location is on the straight line that a line word or a row word are located, a line word or a row word are turned Turn to voice to play out.

Alternatively, the device also includes：Setup module.

Setup module is used for：

Default first spacing will all be kept with longitudinally adjacent word, and in the horizontal on identical straight line Multiple words as a line word.

Default second spacing will all be kept with transversely adjacent word, and in the vertical on identical straight line Multiple words as a line word.

Additionally, for achieving the above object, present invention also offers a kind of word pick-up method, the method includes：

Under default word pickup mode, the object before the camera of terminal is shot.

Word in captured dynamic image is converted into voice play out.

Alternatively, the method also includes：

The trigger condition of detection word pickup mode.

When trigger condition is detected and determine that the trigger condition is effective, word pickup mode is entered.

Alternatively, carrying out shooting to the object before the camera of itself place terminal includes：

According to pre-conditioned adjusting focal length.

Alternatively, the method also includes：

When Word message is not included in relative with camera one side on the object, prompting message is sent.

The prompting message includes：The vibration of the motor on predeterminated position.

Alternatively, pre-conditioned including：Word size.

Included according to pre-conditioned adjusting focal length：

Detection is when the word size in dynamic image under front focal length.

The word size for detecting is compared with default character size.

Alternatively, the method also includes：Before according to pre-conditioned adjusting focal length, determined according to the fingerprint size of user Default character size.

Alternatively, determine that default character size includes according to the fingerprint size of user：

Finger print information during collection user's touch terminal screen；The finger print information includes fingerprint size.

Take the fingerprint from fingerprint size height and width.

Alternatively, the method also includes：

Detect the touch operation to captured dynamic image and determine touch location.

Corresponding word at touch location is converted into voice play out.

Alternatively, methods described also includes：

Alternatively, the method also includes：

Preset motor in control respective direction is vibrated.

Alternatively, by corresponding word at touch location be converted into voice play out including：

Alternatively, the method also includes：

The present invention proposes a kind of word pick device and method, and the device includes：Taking module, the first determining module and Playing module.Taking module is clapped to the object before the camera of itself place terminal under default word pickup mode Take the photograph.Word in captured dynamic image is converted into voice and plays out by playing module.By embodiment of the present invention scheme, Word content can be understood by terminal, solve the puzzlement that cannot be read that blind person or amblyopia personnel are brought because of visual problems.

Description of the drawings

Fig. 1 is the hardware architecture diagram for realizing the optional mobile terminal of each embodiment of the present invention one；

Fig. 2 is the wireless communication system schematic diagram of mobile terminal as shown in Figure 1；

Fig. 3 is the word pick device composition frame chart of the embodiment of the present invention；

Fig. 4 is the word pick-up method flow chart of the embodiment of the present invention；

Fig. 5 is the word pick-up method schematic diagram of the embodiment of the present invention；

Fig. 6 is schematic diagram when image too hour user clicks in the word pick-up method of the embodiment of the present invention；

Schematic diagram when Fig. 7 is clicked on for user after being focused in the word pick-up method of the embodiment of the present invention；

Fig. 8 is for but listening, in the word pick-up method of the embodiment of the present invention, the embodiment schematic diagram for reminding motor.

The realization of the object of the invention, functional characteristics and advantage will be described further in conjunction with the embodiments referring to the drawings.

Specific embodiment

It should be appreciated that specific embodiment described herein is not intended to limit the present invention only in order to explain the present invention.

The optional mobile terminal of each embodiment of the present invention one is realized referring now to Description of Drawings.In follow-up description In, using the suffix for representing such as " module ", " part " or " unit " of element only for being conducive to the explanation of the present invention, Itself does not have specific meaning.Therefore, " module " mixedly can be used with " part ".

Mobile terminal can be implemented in a variety of manners.For example, the terminal described in the present invention can include such as to move Phone, smart phone, notebook computer, digit broadcasting receiver, PDA (personal digital assistant), PAD (panel computer), PMP The consolidating of the mobile terminal of (portable media player), guider etc. and such as numeral TV, desktop computer etc. Determine terminal.Hereinafter it is assumed that terminal is mobile terminal.However, it will be understood by those skilled in the art that, except being used in particular for movement Outside the element of purpose, construction according to the embodiment of the present invention can also apply to the terminal of fixed type.

Fig. 1 is that the hardware configuration of the mobile terminal for realizing each embodiment of the present invention is illustrated.

Mobile terminal 1 00 can include wireless communication unit 110, A/V (audio/video) input block 120, user input Unit 130, sensing unit 140, output unit 150, memory 160, interface unit 170, controller 180 and power subsystem 190 Etc..Fig. 1 shows the mobile terminal with various assemblies, it should be understood that being not required for implementing all groups for illustrating Part.More or less of component can alternatively be implemented.Will be discussed in more detail below the element of mobile terminal.

Wireless communication unit 110 generally includes one or more assemblies, and which allows mobile terminal 1 00 and wireless communication system Or the radio communication between network.For example, wireless communication unit can include broadcasting reception module 111, mobile communication module 112nd, at least one of wireless Internet module 113, short range communication module 114 and location information module 115.

Broadcasting reception module 111 receives broadcast singal and/or broadcast via broadcast channel from external broadcast management server Relevant information.Broadcast channel can include satellite channel and/or terrestrial channel.Broadcast management server can be generated and sent The broadcast singal generated before the server or reception of broadcast singal and/or broadcast related information and/or broadcast related information And send it to the server of terminal.Broadcast singal can include TV broadcast singal, radio signals, data broadcasting Signal etc..And, broadcast singal may further include the broadcast singal combined with TV or radio signals.Broadcast phase Pass information can also be provided via mobile communications network, and in this case, broadcast related information can be by mobile communication mould Block 112 is receiving.Broadcast singal can be present in a variety of manners, and for example, which can be with the electronics of DMB (DMB) The form of program guide (EPG), the electronic service guidebooks (ESG) of digital video broadcast-handheld (DVB-H) etc. and exist.Broadcast Receiver module 111 can receive signal broadcast by using various types of broadcast systems.Especially, broadcasting reception module 111 Can be wide by using such as multimedia broadcasting-ground (DMB-T), DMB-satellite (DMB-S), digital video Broadcast-hand-held (DVB-H), forward link media (MediaFLO^@) Radio Data System, received terrestrial digital broadcasting integrated service Etc. (ISDB-T) digit broadcasting system receives digital broadcasting.Broadcasting reception module 111 may be constructed such that and be adapted to provide for extensively Broadcast the various broadcast systems of signal and above-mentioned digit broadcasting system.Via broadcasting reception module 111 receive broadcast singal and/ Or broadcast related information can be stored in memory 160 (or other types of storage medium).

Mobile communication module 112 sends radio signals to base station (for example, access point, node B etc.), exterior terminal And at least one of server and/or receive from it radio signal.Such radio signal can be logical including voice Words signal, video calling signal or the various types of data for sending and/or receiving according to text and/or Multimedia Message.

Wireless Internet module 113 supports the Wi-Fi (Wireless Internet Access) of mobile terminal.The module can be internally or externally It is couple to terminal.Wi-Fi (Wireless Internet Access) technology involved by the module can include WLAN (WLAN) (Wi-Fi), Wibro (WiMAX), Wimax (worldwide interoperability for microwave accesses), HSDPA (high-speed downlink packet access) etc..

Short range communication module 114 be for supporting the module of junction service.Some examples of short-range communication technology include indigo plant Tooth^TM, RF identification (RFID), Infrared Data Association (IrDA), ultra broadband (UWB), purple honeybee^TMEtc..

Location information module 115 be for check or obtain mobile terminal positional information module.Location information module Typical case be GPS (global positioning system).According to current technology, GPS module 115 is calculated from three or more satellites Range information and correct time information and for calculate Information application triangulation, so as to according to longitude, latitude Three-dimensional current location information is highly accurately calculated.Currently, defended using three for calculating the method for position and temporal information Star and the error of the position that calculated by using other satellite correction and temporal information.Additionally, GPS module 115 Can be by Continuous plus current location information in real time come calculating speed information.

A/V input block 120 is used for receiving audio or video signal.A/V input block 120 can include 121 He of camera Microphone 1220,121 pairs of static maps obtained by image capture apparatus in Video Capture pattern or image capture mode of camera The view data of piece or video is processed.Picture frame after process is may be displayed on display unit 151.At camera 121 Picture frame after reason can be stored in memory 160 (or other storage mediums) or carry out via wireless communication unit 110 Send, two or more cameras 1210 can be provided according to the construction of mobile terminal.Microphone 122 can be in telephone relation mould Sound (voice data) is received via microphone in formula, logging mode, speech recognition mode etc. operational mode, and can be by Such acoustic processing is voice data.Audio frequency (voice) data after process can be changed in the case of telephone calling model For the form output of mobile communication base station can be sent to via mobile communication module 112.Microphone 122 can implement all kinds Noise eliminate (or suppression) algorithm with eliminate noise that (or suppression) is produced during receiving and sending audio signal or Person disturbs.

User input unit 130 can generate key input data to control each of mobile terminal according to the order of user input Plant operation.User input unit 130 allows the various types of information of user input, and can include keyboard, metal dome, touch Plate (resistance that for example, detection causes due to being touched, pressure, the sensitive component of the change of electric capacity etc.), roller, rocking bar etc. Deng.Especially, when touch pad is superimposed upon on display unit 151 as a layer, touch-screen can be formed.

Sensing unit 140 detect mobile terminal 1 00 current state, (for example, mobile terminal 1 00 open or close shape State), the position of mobile terminal 1 00, user for mobile terminal 1 00 the presence or absence of contact (that is, touch input), mobile terminal 100 orientation, the acceleration or deceleration movement of mobile terminal 1 00 and direction etc., and generate for controlling mobile terminal 1 00 The order of operation or signal.For example, when mobile terminal 1 00 is embodied as sliding-type mobile phone, sensing unit 140 can be sensed The sliding-type phone is opened or is cut out.In addition, sensing unit 140 can detect power subsystem 190 whether provide electric power or Whether person's interface unit 170 is coupled with external device (ED).Sensing unit 140 can will be combined below including proximity transducer 1410 Touch-screen is being described to this.

Interface unit 170 is connected, as at least one external device (ED), the interface that can pass through with mobile terminal 1 00.For example, External device (ED) can include wired or wireless head-band earphone port, external power source (or battery charger) port, wired or nothing Line FPDP, memory card port, the port for being used for device of the connection with identification module, audio input/output (I/O) end Mouth, video i/o port, ear port etc..Identification module can be stored for verifying user using each of mobile terminal 1 00 Kind of information and subscriber identification module (UIM), client identification module (SIM), Universal Subscriber identification module (USIM) can be included Etc..In addition, the device (hereinafter referred to as " identifying device ") with identification module can take the form of smart card, therefore, know Other device can be connected with mobile terminal 1 00 via port or other attachment means.Interface unit 170 can be used for receive from The input (for example, data message, electric power etc.) of external device (ED) and the input for receiving is transferred in mobile terminal 1 00 One or more elements can be used for transmission data between mobile terminal and external device (ED).

In addition, when mobile terminal 1 00 is connected with external base, interface unit 170 can serve as allowing by which by electricity Power provides the path of mobile terminal 1 00 from base or can serve as allowing the various command signals from base input to pass through which It is transferred to the path of mobile terminal.The various command signals being input into from base or electric power may serve as recognizing that mobile terminal is The no signal being accurately fitted within base.Output unit 150 is configured to defeated with the offer of vision, audio frequency and/or tactile manner Go out signal (for example, audio signal, vision signal, alarm signal, vibration signal etc.).Output unit 150 can include to show Unit 151, dio Output Modules 152, alarm unit 153 etc..

Display unit 151 may be displayed on the information processed in mobile terminal 1 00.For example, when mobile terminal 1 00 is in electricity During words call mode, display unit 151 can show and call or other communicate (for example, text messaging, multimedia files Download etc.) related user interface (UI) or graphic user interface (GUI).When mobile terminal 1 00 is in video calling pattern Or during image capture mode, display unit 151 can show the image of capture and/or the image of reception, illustrate video or figure UI or GUI of picture and correlation function etc..

Meanwhile, when the display unit 151 and touch pad touch-screen with formation superposed on one another as a layer, display unit 151 can serve as input unit and output device.Display unit 151 can include liquid crystal display (LCD), thin film transistor (TFT) In LCD (TFT-LCD), Organic Light Emitting Diode (OLED) display, flexible display, three-dimensional (3D) display etc. at least A kind of.Some in these displays may be constructed such that transparence to allow user from outside viewing, and this is properly termed as transparent Display, typical transparent display can be, for example, TOLED (transparent organic light emitting diode) display etc..According to specific The embodiment that wants, mobile terminal 1 00 can include two or more display units (or other display devices), for example, move Dynamic terminal can include outernal display unit (not shown) and inner display unit (not shown).Touch-screen can be used for detection and touch Input pressure and touch input position and touch input area.

Dio Output Modules 152 can mobile terminal in call signal reception pattern, call mode, logging mode, When under the isotypes such as speech recognition mode, broadcast reception mode, that wireless communication unit 110 is received or in memory 160 The voice data transducing audio signal of middle storage and it is output as sound.And, dio Output Modules 152 can be provided and movement The audio output (for example, call signal receives sound, message sink sound etc.) of the specific function correlation that terminal 100 is executed. Dio Output Modules 152 can include loudspeaker, buzzer etc..

Alarm unit 153 can provide output to notify event to mobile terminal 1 00.Typical event is permissible Including calling reception, message sink, key signals input, touch input etc..In addition to audio or video is exported, alarm unit 153 can provide output in a different manner with the generation of notification event.For example, alarm unit 153 can be in the form of vibration Output is provided, when calling, message or some other entrance communication (incomingcommunication) are received, alarm list Unit 153 can provide tactile output (that is, vibrating) to notify to user.By providing such tactile output, even if When the mobile phone of user is in the pocket of user, user also can recognize that the generation of various events.Alarm unit 153 The output of the generation of notification event can be provided via display unit 151 or dio Output Modules 152.

Memory 160 can store software program for the process and control operation executed by controller 180 etc., Huo Zheke Temporarily to store oneself data (for example, telephone directory, message, still image, video etc.) through exporting or will export.And And, memory 160 can be with storage with regard to the vibration of various modes that exports when touching and being applied to touch-screen and audio signal Data.

Memory 160 can include the storage medium of at least one type, and the storage medium includes flash memory, hard disk, many Media card, card-type memory (for example, SD or DX memory etc.), random access storage device (RAM), static random-access storage Device (SRAM), read-only storage (ROM), Electrically Erasable Read Only Memory (EEPROM), programmable read only memory (PROM), magnetic storage, disk, CD etc..And, mobile terminal 1 00 can execute memory with by network connection The network storage device cooperation of 160 store function.

Controller 180 generally controls the overall operation of mobile terminal.For example, controller 180 is executed and voice call, data The related control of communication, video calling etc. and process.In addition, controller 180 can be included for reproducing (or playback) many matchmakers The multi-media module 1810 of volume data, multi-media module 1810 can be constructed in controller 180, or it is so structured that and control Device processed 180 is separated.Controller 180 can be with execution pattern identifying processing, by the handwriting input for executing on the touchscreen or figure Piece is drawn input and is identified as character or image.

Power subsystem 190 receives external power or internal power under the control of controller 180 and provides operation each unit Appropriate electric power needed for part and component.

Various embodiments described herein can be to use such as computer software, hardware or its any combination of calculating Machine computer-readable recording medium is implementing.Hardware is implemented, embodiment described herein can be by using application-specific IC (ASIC), digital signal processor (DSP), digital signal processing device (DSPD), programmable logic device (PLD), scene can Program gate array (FPGA), processor, controller, microcontroller, microprocessor, be designed to execute function described herein At least one in electronic unit implementing, in some cases, can be implemented in controller 180 by such embodiment. Software is implemented, the embodiment of such as process or function can with allow to execute the single of at least one function or operation Software module is implementing.Software code can be come by the software application (or program) that is write with any appropriate programming language Implement, software code can be stored in memory 160 and be executed by controller 180.

So far, oneself is through describing mobile terminal according to its function.Below, for the sake of brevity, will description such as folded form, Slide type mobile terminal in various types of mobile terminals of board-type, oscillating-type, slide type mobile terminal etc. is used as showing Example.Therefore, the present invention can be applied to any kind of mobile terminal, and be not limited to slide type mobile terminal.

As shown in Figure 1 mobile terminal 1 00 may be constructed such that using via frame or packet transmission data all if any Line and wireless communication system and satellite-based communication system are operating.

Referring now to the communication system that Fig. 2 description is wherein operable to according to the mobile terminal of the present invention.

Such communication system can be using different air interfaces and/or physical layer.For example, used by communication system Air interface includes such as frequency division multiple access (FDMA), time division multiple acess (TDMA), CDMA (CDMA) and universal mobile communications system System (UMTS) (especially, Long Term Evolution (LTE)), global system for mobile communications (GSM) etc..As non-limiting example, under The description in face is related to cdma communication system, but such teaching is equally applicable to other types of system.

With reference to Fig. 2, cdma wireless communication system can include multiple mobile terminal 1s 00, multiple base stations (BS) 270, base station Controller (BSC) 275 and mobile switching centre (MSC) 280.MSC280 is configured to and Public Switched Telephony Network (PSTN) 290 form interface.MSC280 is also structured to form interface with the BSC275 that can be couple to base station 270 via back haul link. If back haul link can be constructed according to any one in the interface that Ganji knows, the interface includes such as E1/T1, ATM, IP, PPP, frame relay, HDSL, ADSL or xDSL.It will be appreciated that system can include multiple BSC2750 as shown in Figure 2.

Each BS270 can service one or more subregions (or region), by multidirectional antenna or the day of sensing specific direction Each subregion that line is covered is radially away from BS270.Or, each subregion can by for diversity reception two or more Antenna is covered.Each BS270 may be constructed such that the multiple frequency distribution of support, and each frequency is distributed with specific frequency spectrum (for example, 1.25MHz, 5MHz etc.).

Intersecting that subregion and frequency are distributed can be referred to as CDMA Channel.BS270 can also be referred to as base station transceiver System (BTS) or other equivalent terms.In this case, term " base station " can be used for broadly representing single BSC275 and at least one BS270.Base station can also be referred to as " cellular station ".Or, each subregion of specific BS270 can be claimed For multiple cellular stations.

As shown in Figure 2, broadcast singal is sent to broadcsting transmitter (BT) 295 mobile terminal that operate in system 100.Broadcasting reception module 111 is arranged on to receive the broadcast sent by BT295 at mobile terminal 1 00 as shown in Figure 1 Signal.In fig. 2 it is shown that several global positioning system (GPS) satellite 300.Satellite 300 helps position multiple mobile terminals At least one of 100.

In fig. 2, multiple satellites 300 are depicted, it is understood that be, it is possible to use any number of satellite obtains useful Location information.GPS module 115 is generally configured to coordinate with satellite 300 to obtain the positioning that wants letter as shown in Figure 1 Breath.Substitute GPS tracking technique or outside GPS tracking technique, it is possible to use can track the position of mobile terminal other Technology.In addition, at least one gps satellite 300 can optionally or additionally process satellite dmb transmission.

Used as a typical operation of wireless communication system, BS270 receives the reverse link from various mobile terminal 1s 00 Signal.Mobile terminal 1 00 generally participates in call, information receiving and transmitting and other types of communication.Each of the reception of certain base station 270 is anti- Processed in specific BS270 to link signal.The data of acquisition are forwarded to the BSC275 of correlation.BSC provides call Resource allocation and the mobile management function of the coordination including the soft switching process between BS270.BSC275 is also by the number for receiving According to being routed to MSC280, which provides the extra route service for forming interface with PSTN290.Similarly, PSTN290 with MSC280 forms interface, and MSC and BSC275 form interface, and BSC275 correspondingly controls BS270 with by forward link signals It is sent to mobile terminal 1 00.

Based on above-mentioned optional mobile terminal hardware configuration and communication system, each embodiment of the inventive method is proposed.

As shown in figure 3, first embodiment of the invention proposes a kind of word pick device 1, the device includes：Taking module 01 and playing module 02.

Taking module 01, under default word pickup mode, to the object before the camera of itself place terminal Shot.

Playing module 02, plays out for the word in captured dynamic image is converted into voice.

Alternatively, the device also includes：Detection module 03 and pattern enter module 04.

Detection module 03, for detecting the trigger condition of word pickup mode.

Pattern enters module 04, picks up for when trigger condition is detected and determine that the trigger condition is effective, entering word Delivery formula.

Alternatively, taking module 01 carries out shooting to the object before the camera of itself place terminal includes：

According to pre-conditioned adjusting focal length.

Alternatively, the device also includes prompting module 05.

Alternatively, pre-conditioned including：Word size.

Taking module 01 includes according to pre-conditioned adjusting focal length：

Detection is when the word size in dynamic image under front focal length.

The word size for detecting is compared with default character size.

Alternatively, the device also includes：First determining module 06.

First determining module 06, for, before according to pre-conditioned adjusting focal length, determining according to the fingerprint size of user Default character size.

Alternatively, according to the fingerprint size of user, the first determining module 06 determines that default character size includes：

Take the fingerprint from fingerprint size height and width.

Alternatively, the device also includes：Second determining module 07.

Second determining module 07, to the touch operation of captured dynamic image and determines touch location for detection.

Playing module 02, is additionally operable to for corresponding word at touch location to be converted into voice and plays out.

Alternatively, the device also includes：Text point determining module 08.

Text point determining module 08 is used for：

After touch location is determined, the coordinate of touch location is compared with the coordinate of each word in dynamic image, When the coordinate of touch location is consistent with the coordinate of any one word in dynamic image, determine that touch location is relative with word Should；When the coordinate of touch location is all inconsistent with the coordinate of each word in photo, determine that touch location is not corresponding with word.

Alternatively, text point determining module 08 is additionally operable to：

Preset motor in control respective direction is vibrated.

Alternatively, playing module 03 by corresponding word at touch location be converted into voice play out including：

Alternatively, the device also includes：Setup module 09.

Setup module 09 is used for：

Additionally, for achieving the above object, present invention also offers a kind of word pick-up method, as shown in Figure 4, Figure 5, the party Method includes S101-S102：

S101, under default word pickup mode, the object before the camera of terminal is shot.

In embodiments of the present invention, in order to help blind person or amblyopia personnel to read on the various objects such as paper, gravestone, licence plate Word, understand word content in order to which, embodiment of the present invention scheme the preterminal object can be carried out by terminal-pair Shoot, and the Word message in captured dynamic image is caught, the Word message is played back with speech form, solve The puzzlement that cannot be read that blind person or amblyopia personnel are brought because of visual problems.

In embodiments of the present invention, in order to take pictures or shooting action is distinguished with general, embodiment of the present invention scheme is needed To complete under default pattern, word pickup mode described above, the Word Input pattern is used for the camera by terminal Object of the one side relative with camera before camera comprising Word message is found, and the object is shot, will Word message in the dynamic image of shooting changes into voice messaging and plays out.It should be noted that carrying out speech play When be not limited to Word message, can also be digital information, symbolic information etc..And above-mentioned dynamic image can be by shooting The video image that head shoots out, or the real-time dynamic image that camera is captured during shooting.

In embodiments of the present invention, the word pickup mode can be entered by below scheme.

Alternatively, the method also includes S201-S202：

S201, the trigger condition of detection word pickup mode.Wherein, the trigger condition includes finger manipulation and/or voice Order.

In embodiments of the present invention, terminal can detect the trigger condition of Message Processing pattern in real time or periodically. In addition, in order to save terminal resource, the trigger condition can also be obtained by way of message informing, for example, when default pressure (case includes hardware button and software for force snesor, fingerprint identification device, scanning means, speech recognition equipment and button The button of form) etc. message of giving notice when detecting certain finger manipulation or voice command, so as to the terminal check finger exercise Make or voice command be whether word pickup mode trigger condition.It should be noted that the trigger condition can include but not It is limited to finger manipulation and/or voice command.In various embodiments, the trigger condition could be arranged to any one can be real The operation that applies or order etc..For example, the trigger condition can also be a kind of gesture high up in the air, by default close sensing in terminal Device is detecting to the gesture high up in the air.

S202, when trigger condition is detected and determine that the trigger condition is effective, enter word pickup mode.

In embodiments of the present invention, after being detected to the trigger condition of word pickup mode by step S201, also It needs to be determined that the validity of the trigger condition.For example, when on the triggering button for detecting some default word pickup mode Pressing operations when, need to detect the duration of the pressing operations, when the duration of pressing operations is less than or equal to default Time threshold when then can determine that the pressing operations are invalid, i.e. the trigger condition of word pickup mode is invalid.Again for example, when pre- If proximity transducer detect triggering Message Processing pattern gesture high up in the air when, if the retention time of the gesture high up in the air be less than Or be equal to default time threshold, then equally can determine that the gesture high up in the air is invalid, i.e. the trigger condition nothing of word pickup mode Effect.By the scheme of the embodiment of the present invention, the generation of maloperation can be effectively prevented.

In embodiments of the present invention, when the trigger condition that determination is detected is effective, just can be entered with triggering terminal default Word pickup mode.Under the word pickup mode, user can be shot to preterminal object, so that terminal will Word in the dynamic image of shooting is converted into voice, is easy to play to terminal use.

In embodiments of the present invention, preterminal object can be shot by below scheme.

Alternatively, the object before the camera of terminal is shot including S301-S302：

Object before S301, detection camera；Include Word message in wherein relative with camera on object one side.

In embodiments of the present invention, due to default word pickup mode primarily to the word in dynamic image is entered Row is extracted, and is played out so as to problem is converted into voice.Therefore, under word pickup mode, can examine when terminal is shot The photographed scene for surveying terminal includes the object of Word message.In embodiments of the present invention, can be by default image recognition System completes the detection to word and identification process.

Alternatively, the method also includes：When Word message is not included in relative with camera one side on the object, send out Go out prompting message.The prompting message includes：The vibration of the motor on predeterminated position.

In embodiments of the present invention, before terminal taking, when terminal is not detected by the presence of bag in the current scene of terminal During object containing Word message, in order to remind user's conversion photographed scene, especially reminding blind or amblyopia personnel, can send Default prompting message.It should be noted that the prompting message can include one or more of：The tinkle of bells, music, voice, Vibration, flash lamp.For example, it is possible to the motor of control terminal predetermined position produces vibration.Because one can be included in terminal Or multiple motors, different positions are respectively arranged at, so that different functions are realized, when preterminal object does not include Word message When, only make the motor in a certain precalculated position produce vibration, so as to reach the purpose for reminding user.The predeterminated position can be terminal On optional position, as long as facilitate user perceive motor vibration.

S302, according to pre-conditioned adjusting focal length.

In embodiments of the present invention, after the object comprising Word message of the terminal in photographed scene is captured, need Terminal camera is focused according to default condition, meet pre-conditioned dynamic image to shoot.

Alternatively, this pre-conditioned including：Word size.

In embodiments of the present invention, the word size adjustment of word segment can be worth to suitable by focusing, so as to User prevents the too little phenomenon generation for causing to click on mistake of the word in photo, especially for blind person and amblyopia when clicking on For personnel, (for example, think when need not directly listen to the voice messaging being directly translated into by the Word message in dynamic image When tempering finger and touching ability), the click of finger can be relied on determine the word that chooses and listen attentively to the content of the word, In the case of can't see or not seeing the word in dynamic image, if its word is too little, user easily clicks on mistake always, such as Shown in Fig. 6, this certainly will bring very poor experience sense for user.Accordingly, it would be desirable to first focused before shooting, to shoot The satisfactory word size of dynamic image, facilitate user to click on.

In embodiments of the present invention, according to the above, before shooting, need to predefine the word size Standard, so as to terminal when being focused directly using the default value as the foundation that focuses.Due to predefining dynamic image In word size standard be in order to avoid word too little cause click on mistake phenomenon occur, dynamic image when word is too big The interior word that can be accommodated is very little.Therefore, in embodiments of the present invention, can be determined according to the size of user's finger or size The standard of the word size of dynamic image.Specifically, can be realized by below scheme.

Alternatively, determine that default character size includes S401-S402 according to the fingerprint size of user：

Finger print information when S401, collection user's touch terminal screen；The finger print information includes fingerprint size.

In embodiments of the present invention, terminal can be according to the history service condition of user in user's once touch terminal screen When gather and preserve the finger print information of user, it is also possible to gather the fingerprint letter of user under default finger print information drainage pattern Breath, and therefrom obtain the dimension information of fingerprint.

S402, take the fingerprint from fingerprint size height and width.

In embodiments of the present invention, the fingerprint size of user includes height and the width of fingerprint.In the embodiment of the present invention In scheme, fingerprint highly refers to longitudinally go up the distance between outline line maximum in the fingerprint profile for obtaining；Fingerprint width is referred to Transversely the distance between outline line maximum in the fingerprint profile of acquisition.Fingerprint due to acquisition when carrying out fingerprint recognition every time Profile can not possibly be identical, therefore can obtain a fingerprint height and width by way of multi collect is averaged Mean value is used as fingerprint height and the standard value of width.In addition, in order that shoot when obtain sufficiently large word size, can Therefrom to select a maximum after multi collect as fingerprint height and the standard value of width.

S403, by fingerprint height and width be defined as in default character size word height and width.

In embodiments of the present invention, after obtaining the standard value of fingerprint height and width, just can be by the finger of the standard Line height and width are used as the standard for determining character size.For example, directly using fingerprint height and width as default word Word height and width in size, or using as default character size after fingerprint height and the default ratio of width expansion In word height and width.The such as preset ratio can be 1%, 5% etc..Here the preset ratio can not be arranged too Greatly, so as not to word excessive cause photo accommodate word very little.In addition, when word size is determined, can be without while determining The word height gone out in word size and width, can determine one of which according to the touch of user custom.For example, use Family custom finger is laterally touched, then can only determine the width of word；User's custom finger is longitudinally touched, then can only determine text The height of word.

By above scheme, when just can obtain shooting, the standard of word size, is carried out to camera according to the standard Focusing just can obtain the word dynamic image of suitable user.

In embodiments of the present invention, when being focused according to word size, specifically can complete to adjust by below scheme Burnt work.

Alternatively, S401-S404 is included according to pre-conditioned adjusting focal length：

S401, detection are when the word size in dynamic image under front focal length.

In embodiments of the present invention, before being focused according to default word size, first can detect when under front focal length Word size in the dynamic image that camera is obtained, to judge whether the word size has met the word of default standard Size, and be easy to be adjusted dynamic image according to current character size.In embodiments of the present invention, for working as front focal length The detection of the word size in lower dynamic image again may be by default pattern recognition device to be carried out image recognition to realize.

S402, the word size for detecting is compared with default character size.

In embodiments of the present invention, detect after the word size in dynamic image under front focal length, by by this article Word size is obtained when the specifying information of the word size in dynamic image under front focal length compared with default character size, and Following process is carried out respectively for different comparative results.

S403, when the word size for detecting is consistent with default character size keep work as front focal length.

In embodiments of the present invention, when the word size for detecting is consistent with default character size, that is, detect When word size or measures of dispersion identical with default character size is less than or equal to default measures of dispersion threshold value, both can be by When front focal length is used as shooting focal length.

S404, when the word size for detecting is inconsistent with default character size, the focal length for adjusting camera is the One focal length, makes the word size in dynamic image consistent with default character size.

In embodiments of the present invention, when the word size for detecting is inconsistent with default character size, that is, detect Word size differed with default character size completely, and measures of dispersion more than default measures of dispersion threshold value when, then permissible To being adjusted when front focal length so that the word size in dynamic image is consistent with default character size, and will adjustment The first focal length in focal length afterwards, i.e. embodiment of the present invention scheme is defined as shooting focal length.

S303, will include that the middle section of word segment in the object of Word message focus will be shot as shooting.

In embodiments of the present invention, after determining the focal length of camera, in order that the dynamic image that shoots is with word Based on part, the middle section of word segment in the object of Word message can be included as shooting focus.

In embodiments of the present invention, suitable shooting focal length and focus just can be obtained by above adjustment, according to this Focal length and focus are shot the character image that just can obtain suitable user.

Alternatively, detect the touch operation to captured dynamic image and determine touch location；By at touch location pair The word that answers is converted into voice and plays out.

In embodiments of the present invention, after carrying out dynamic image shooting by above scheme, user just can be dynamic according to this State image obtains the Word message in photo.

It should be noted that terminal can extract the Word message in dynamic image by pattern recognition device, and will carry The Word message for taking is arranged according to the position in dynamic image, the final electronics shape for obtaining Word message in dynamic image Formula.After the electronic form for obtaining the Word message, directly the Word message of the electronic form can be converted into voice letter Breath is played back, it is also possible to turned corresponding word after touch operation of the user to the dynamic image on terminal screen is detected Turn to speech play out.Specifically, dynamic image can show that, on the interface of terminal, user can be to terminal after shooting and finishing Dynamic image on interface is touched or is clicked on etc. operation, and the terminal-pair touch or clicking operation are detected, and are determined tactile The position that touches or click on, to determine its corresponding word according to the position, as shown in Figure 7.

In embodiments of the present invention, can be completed using detection method, algorithm and the device that can arbitrarily implement above-mentioned Detection scheme, is not limited for specific detection method, algorithm and device.

In embodiments of the present invention, due to, for blind person or amblyopia personnel, can't see or not see screen Dynamic Graph As upper particular location, therefore touched position is likely to when touching and does not have word.In such a case, it is possible to pass through Below scheme is determined with the presence or absence of word at touch location.

Alternatively, the method also includes：

After touch location is determined, the coordinate of touch location is compared with the coordinate of each word in dynamic image, When the coordinate of touch location is consistent with the coordinate of any one word in dynamic image, determine that touch location is relative with word Should；When the coordinate of touch location is all inconsistent with the coordinate of each word in dynamic image, determine touch location with word not Corresponding.

In embodiments of the present invention, as terminal can be left according to screen to the word in the dynamic image of display on screen Side determines the coordinate of each word respectively.In the same manner, terminal is it may also be determined that the concrete coordinate of the touch location of user, therefore, whole The coordinate of user touch location can be compared by end with each word coordinate, when the two coordinates are consistent, touch location is described Corresponding with word, i.e., touch location falls on word, when the two coordinates are inconsistent, illustrates that touch location is not right with word Should, i.e., touch location is not fallen within word.It should be noted that in embodiment of the present invention scheme, unanimously referring to identical Or measures of dispersion is less than or equal to default discrepancy threshold, inconsistent refer to differ completely or measures of dispersion is more than default difference Different threshold value.

Alternatively, the method also includes S501-S502：

S501, after touch location is determined, when not corresponding to word at the touch location, detecting distance current touch position Put the position that the first nearest word is located.

In embodiments of the present invention, in the case of not corresponding to word at touch location, terminal-pair is needed for giving Remind, carry out the adjustment of touch location so as to user in time.In embodiment of the present invention scheme, terminal can first detect distance The nearest word of current touch location, and word position on a terminal screen is determined, will so as to user guided user Finger moves to corresponding position, as shown in Figure 8.Concrete guide scheme can be realized by following proposal.

S502, determine the first word be located position and current touch location relative direction.

In embodiments of the present invention, when determining the word nearest apart from current touch location, the such as embodiment of the present invention After the positional information of the first word in scheme, such as the first word coordinate on a terminal screen, just can determine that and works as The relative direction of the position of front touch location and first word, for example, ten o'clock direction.

Preset motor in S503, control respective direction is vibrated.

In embodiments of the present invention, can multiple directions be set in advance in terminal and motor is indicated, in step S502 really After making the relative direction of the first word and current touch location, the preset motor in respective direction just can be controlled to be shaken Dynamic, to guide user's next step to need the direction of adjustment.In embodiment of the present invention scheme, the particular location of the motor is really Surely can be the horse determined by the relative direction extension along the first word and current touch location with terminal screen center as starting point Reach, as shown in Figure 8.

It should be noted that such scheme can also be not limited to using other guide schemes in other embodiments.Example Such as, user can be given by way of voice message guide, for example, " please be moved to the left ", " please move up ".In the present invention Direction in embodiment, when the left side is terminal screen user oriented, indicated by abscissa negative direction；The left side be terminal screen towards Direction during user, indicated by abscissa positive direction；When top is terminal screen user oriented, indicated by ordinate positive direction Direction；When being terminal screen user oriented below, the direction indicated by ordinate negative direction.

S102, corresponding word at touch location is converted into voice plays out.

In embodiments of the present invention, after detecting the word of user touch place by above scheme, or user is guided After touching word, just corresponding word at touch location can be converted into voice messaging and play out.Need explanation , as Word message is converted into the technology that voice messaging has been comparative maturity, will not be described here, and for selection Method for transformation, algorithm, software and device etc. be all not specifically limited.

In addition, the conversion process for the word in dynamic image to voice can be direct when word dynamic image is obtained Carry out, i.e., directly carry out in shooting process, it is also possible to carried out after the word touched by user is determined again, concrete mode can With the application scenarios self-defining according to user, this is not restricted.

In embodiments of the present invention, directly the Word message in dynamic image is converted when word dynamic image is got During for voice messaging, can be directly to the word in dynamic image according to preset order, such as from top to bottom and/or from left to right Order carry out speech play, it is also possible to according to such scheme, carry out speech play when user touches corresponding word.In order to It is suitable for the choosing at random of two kinds of broadcast modes, corresponding play mode can be pre-set, for example, selects play mode and automatically Play mode.Under play mode is selected, need to detect the touch operation of user, so as to enter corresponding word at touch location Row is played.Under automatic play mode, speech play can also be carried out to the word in dynamic image according to preset order automatically.

In addition, under above-mentioned selection play mode, in order that user quickly understands the word content in dynamic image, Playing efficiency is improved, following player method can also be adopted.

In embodiments of the present invention, when detecting the corresponding word of user institute touch location in a row or column word When, directly the content corresponding to the row or the row word can be played to user.In addition, if the style of writing word has adjacent one Row or multline text, can issue the user with prompting, for example, voice reminder, remind whether user needs to continue to play next line Or the word content of lastrow.In the same manner, if the style of writing word has adjacent one or more columns per page word, it is also possible to issue the user with Remind, remind whether user needs the word content for continuing to play next column or previous column.User can adopt voice confirmation side Formula, or this default operation acknowledgement mode fed back to the prompting.Terminal plays next line or next column according to feedback result Word content, or stop playing.

In embodiments of the present invention, before the word in terminal-pair a row or column is identified, need terminal right in advance The concept of a row or column is defined, and is pre-defined according to this so as to terminal and goes to be confirmed whether there is a row or column word. Specifically can be realized by below scheme.

Alternatively, the method also includes：

In embodiments of the present invention, terminal can be examined with the distance of adjacent word to each word in dynamic image Survey, and the coordinate of each word is can determine, which word the coordinate value according to each word determines point-blank. Therefore, based on above-mentioned termination function, and the concept according to row and column, just can determine that a line word i.e. with longitudinally adjacent Word all keep default first spacing, and multiple words on identical straight line in the horizontal；One row word is Default second spacing, and in the vertical multiple texts in identical straight line on are all kept with transversely adjacent word Word.

In embodiments of the present invention, the concrete numerical value for the first spacing in such scheme and the second spacing is not limited System.First spacing and the second spacing can be different numerical value according to different application scenarios.

So far, whole essential characteristics of the embodiment of the present invention that is over just are introduced, it should be noted that the above is all this One or more specific embodiments of inventive embodiments scheme, in other embodiments can also be using other embodiment party Formula, any and same or analogous embodiment of the embodiment of the present invention, and any group of the essential characteristic of the embodiment of the present invention Close, all within the protection domain of the embodiment of the present invention.

The present invention proposes a kind of word pick device and method, and the device includes：Taking module and playing module.Shoot Module is shot to the object before the camera of itself place terminal under default word pickup mode.Playing module will Word in captured dynamic image is converted into voice and plays out.By embodiment of the present invention scheme, terminal can be passed through Understand word content, solve the puzzlement that cannot be read that blind person or amblyopia personnel are brought because of visual problems.

It should be noted that herein, term " including ", "comprising" or its any other variant are intended to non-row The including of his property, so that a series of process including key elements, method, article or device not only include those key elements, and And also include other key elements being not expressly set out, or also include intrinsic for this process, method, article or device institute Key element.In the absence of more restrictions, the key element for being limited by sentence "including a ...", it is not excluded that including to be somebody's turn to do Also there is other identical element in the process, method of key element, article or device.

The embodiments of the present invention are for illustration only, do not represent the quality of embodiment.

Through the above description of the embodiments, those skilled in the art can be understood that above-described embodiment side Method can add the mode of required general hardware platform by software to realize, naturally it is also possible to by hardware, but in many cases The former is more preferably embodiment.Based on such understanding, technical scheme is substantially done to prior art in other words The part for going out contribution can be embodied in the form of software product, and the computer software product is stored in a storage medium In (as ROM/RAM, magnetic disc, CD), use so that a station terminal equipment including some instructions (can be mobile phone, computer, clothes Business device, air-conditioner, or network equipment etc.) execute method described in each embodiment of the present invention.

The preferred embodiments of the present invention are these are only, the scope of the claims of the present invention is not thereby limited, every using this Equivalent structure or equivalent flow conversion that bright specification and accompanying drawing content are made, or directly or indirectly it is used in other related skills Art field, is included within the scope of the present invention.

Claims

1. a kind of word pick device, it is characterised in that described device includes：Taking module playing module；

The taking module, for, under default word pickup mode, entering to the object before the camera of itself place terminal Row shoots；

The playing module, plays out for the word in captured dynamic image is converted into voice.

2. word pick device as claimed in claim 1, it is characterised in that described device also includes：Detection module and pattern Enter module；

The detection module, for detecting the trigger condition of the word pickup mode；

The pattern enters module, for when the trigger condition is detected and determine that the trigger condition is effective, entering institute State word pickup mode.

3. word pick device as claimed in claim 1, it is characterised in that the taking module is taken the photograph to itself place terminal Include as the object before head carries out shooting：

Detect the object before the camera；Believe including word in the one side relative with the camera on wherein described object Breath；

According to pre-conditioned adjusting focal length；

The middle section of word segment in the object including Word message as shooting focus and is shot.

4. word pick device as claimed in claim 1, it is characterised in that described device also includes：Second determining module；

Second determining module, to the touch operation of captured dynamic image and determines touch location for detection；

The playing module, is additionally operable to for corresponding word at the touch location to be converted into voice and plays out.

5. word pick device as claimed in claim 4, it is characterised in that described device also includes：Text point determines mould Block；

The text point determining module is used for：

After the touch location is determined, the coordinate of the touch location is compared with the coordinate of each word in photo, When the coordinate of the touch location is consistent with the coordinate of any one word in the dynamic image, the touch location is determined Corresponding with word；When the coordinate of the touch location is all inconsistent with the coordinate of each word in the dynamic image, really The fixed touch location is not corresponding with word.

6. a kind of word pick-up method, it is characterised in that methods described includes：

Under default word pickup mode, the object before the camera of terminal is shot；

Word in captured dynamic image is converted into voice play out.

7. word pick-up method as claimed in claim 6, it is characterised in that methods described also includes：

Detect the trigger condition of the word pickup mode；

When the trigger condition is detected and determine that the trigger condition is effective, the word pickup mode is entered.

8. word pick-up method as claimed in claim 6, it is characterised in that before the camera to itself place terminal Object carries out shooting to be included：

According to pre-conditioned adjusting focal length；

9. word pick-up method as claimed in claim 6, it is characterised in that methods described also includes：

Detect the touch operation to captured dynamic image and determine touch location；

Corresponding word at the touch location is converted into voice play out.

10. word pick device as claimed in claim 9, it is characterised in that methods described also includes：