CN109740594A - Word enquiring method, apparatus and storage medium - Google Patents
Word enquiring method, apparatus and storage medium Download PDFInfo
- Publication number
- CN109740594A CN109740594A CN201811575255.5A CN201811575255A CN109740594A CN 109740594 A CN109740594 A CN 109740594A CN 201811575255 A CN201811575255 A CN 201811575255A CN 109740594 A CN109740594 A CN 109740594A
- Authority
- CN
- China
- Prior art keywords
- frame image
- word
- target
- image
- target area
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Landscapes
- Electrically Operated Instructional Devices (AREA)
Abstract
The present invention relates to a kind of word enquiring methods, device and storage medium, it include: to detect the mobile terminal when carrying out image taking, obtain the voice messaging of user, subsequently determine whether the voice messaging whether with default voice match, if, then obtain the sequential frame image shot in preset duration, and target area is determined from the sequential frame image, target word is identified from the target area, and search the corresponding essential information of the target word, the essential information is provided then to the user, being manually entered word without user can be obtained the paraphrase of word, method is simple, word enquiring is high-efficient, be conducive to improve the study schedule of user, learning effect is good.
Description
[technical field]
This application involves image identification technical fields, more specifically to a kind of word enquiring method, apparatus and storage
Medium.
[background technique]
With more more and more universalization of English, periodical and works containing English also more and more enter the view of people
Open country, therewith caused by word enquiring amount it is also more and more.In currently marketed English learning tool, the tool of verification certificate word with
Science and technology development become increasingly to facilitate, from the lookup english dictionary of beginning, finally using learning machine carry out word enquiring,
The change of looking up words method is more and more convenient for learner, still, when the word quantity for occurring needing to inquire is excessive,
It will lead to learner's repeatedly pause manual queries word, upset the thinking and inspiration of learner, influence learning efficiency.
In conclusion existing word enquiring method also needs learner to carry out being manually entered word, word is then obtained
Paraphrase, cause the low effect low with learning efficiency of the level of learning of learner.
[summary of the invention]
A kind of method, apparatus and storage medium for being designed to provide word enquiring of the application, can be manual without user
Word enquiring is inputted, is improved learning efficiency.
To solve the above-mentioned problems, the embodiment of the present application provides a kind of word enquiring method, is applied to mobile terminal, packet
It includes:
When detecting that the mobile terminal when carrying out image taking, obtains the voice messaging of user;
Judge the voice messaging whether with default voice match;
If so, obtaining the sequential frame image shot in preset duration, and target area is determined from the sequential frame image
Domain;
Target word is identified from the target area, and searches the corresponding essential information of the target word;
The essential information is provided to the user.
The embodiment of the present application also provides a kind of word enquiring devices, are applied to mobile terminal, comprising:
Obtain module: when detecting that the mobile terminal when carrying out image taking, obtains the voice messaging of user;
Judgment module: judge the voice messaging whether with default voice match;
Analysis module: if so, obtaining the sequential frame image shot in preset duration, and from the sequential frame image really
Set the goal region;
Identification module: identifying target word from the target area, and it is corresponding basic to search the target word
Information;
Output module: Xiang Suoshu user provides the essential information.
The embodiment of the present application also provides a kind of computer readable storage medium, a plurality of finger is stored in the storage medium
It enables, described instruction is suitable for being loaded by processor to execute any of the above-described word enquiring method.
Word enquiring method, apparatus provided by the present application and storage medium, when detecting that the mobile terminal carrying out figure
As shooting when, by obtain user voice messaging, subsequently determine whether the voice messaging whether with default voice match, if so,
The sequential frame image shot in preset duration is then obtained, and determines target area from the sequential frame image, from the target
Target word is identified in region, and searches the corresponding essential information of the target word, then to described in user offer
Essential information, being manually entered word without user can be obtained the paraphrase of word, and method is simple, and word enquiring is high-efficient,
Be conducive to improve the study schedule of user, learning effect is good.
[Detailed description of the invention]
To describe the technical solutions in the embodiments of the present invention more clearly, make required in being described below to embodiment
Attached drawing is briefly described, it should be apparent that, drawings in the following description are only some embodiments of the invention, for
For those skilled in the art, without creative efforts, it can also be obtained according to these attached drawings other attached
Figure.
Fig. 1 is the flow diagram of word enquiring method provided in an embodiment of the present invention.
Fig. 2 is the schematic diagram of pen tip position during word enquiring provided in an embodiment of the present invention.
Fig. 3 is another schematic diagram of pen tip position during word enquiring provided in an embodiment of the present invention.
Fig. 4 is the structural schematic diagram of word enquiring device provided in an embodiment of the present invention.
Fig. 5 is the structural schematic diagram of mobile terminal provided in an embodiment of the present invention.
Fig. 6 is another structural schematic diagram of mobile terminal provided in an embodiment of the present invention.
[specific embodiment]
For technical problems, technical solutions and advantages to be solved are more clearly understood, below in conjunction with
Accompanying drawings and embodiments, the present invention will be described in further detail.It should be appreciated that specific embodiment described herein is only used
To explain the present invention, it is not intended to limit the present invention.
A kind of word enquiring method, be applied to mobile terminal, comprising: when detect the mobile terminal carry out image taking
When, obtain the voice messaging of user;Judge the voice messaging whether with default voice match;If so, obtaining in preset duration
The sequential frame image of shooting, and target area is determined from the sequential frame image;Target word is identified from the target area,
And search the corresponding essential information of the target word;The essential information is provided to the user.
Referring to Figure 1, Fig. 1 is the flow diagram of word enquiring provided by the embodiments of the present application, the word enquiring method
Applied to mobile terminal, detailed process be can be such that
S101. when detecting that the mobile terminal when carrying out image taking, obtains the voice messaging of user.
In the present embodiment, the shooting picture for carrying out image taking is the picture of user's study word, and user is in study word
In the process, mobile terminal camera can be opened and carry out image taking, which may be displayed in screen preview frame,
In, the word source of user's study can be books, newpapers and periodicals and learning software etc., and the voice messaging of user can be by mobile whole
End built-in microphone is acquired, and the speech message issued such as user is " little Bai, verification certificate word ".
S102. judge the voice messaging whether with default voice match, if so, execute following step S103, if it is not, then
It returns and executes above-mentioned steps S101.
In the present embodiment, default voice can be manually set, for example be " verification certificate word ", when user issues voice messaging,
The speech message is matched with " verification certificate word " voice, to analyze whether user needs to enter the service of verification certificate word.
S103. the sequential frame image shot in preset duration is obtained, and determines target area from the sequential frame image.
In the present embodiment, preset duration can be manually set, and when voice messaging successful match, mobile terminal will start meter
When device start timing, it is 2 seconds that preset duration, which is such as arranged, and according to the shooting image in timing duration to the word of required inquiry
Confirmed.
For example, above-mentioned steps " determining target area from the sequential frame image " specifically include:
The sequential frame image is handled using trained neural network model, to identify target in every frame image
The position of subject;
Judge whether the target subject moves according to the position of the target subject in every frame image;
Target area is determined from the sequential frame image according to judging result.
In the present embodiment, target subject can be manually set, for example be learning pen or finger etc., and the word of inquiry can be with
For a word, or a group of words, or in short.It, can be with after speech message and the success of default voice match
By the neural network algorithm of prior learning pen tip picture, the word that needs are inquired is carried out really by the pen tip below word
Recognize, to identify the position of pen tip in every frame image, then sequential frame image is obtained in 2 seconds, is reached 2 seconds when the time
Afterwards, the sequential frame image occurred in 2 seconds is obtained, and filters out the target figure containing confirmation looking up words from sequential frame image
Picture determines target area according to the position of pen tip.
Above-mentioned steps " target area is determined from the sequential frame image according to judging result " specifically include:
When the judgment result is yes, the first frame image and tail frame image in the sequential frame image are obtained;According to the head frame figure
Picture and tail frame image determine target area;
When the judgment result is no, by the image district in any frame image, above the position of the target subject
Domain is as target area.
Above-mentioned steps " determining target area according to the first frame image and tail frame image " specifically include:
Using the position of the target subject in the first frame image as initial position, the target in the tail frame image is shot
The position of object is as final position;
By in any frame image, above the initial position and above final position between image-region as mesh
Mark region.
In the present embodiment, Fig. 2 is referred to, Fig. 2 is pen tip position during word enquiring provided in an embodiment of the present invention
Schematic diagram if the position of pen tip is moved to the position X2 from the position X1, then shows that X1 is arrived after judging result is that pen tip moves
The top of distance is target area between X2, in obtaining the sequential frame image in 2 seconds, using first frame image as initial position,
Using tail frame image as final position, the region of initial position to top between final position is target area, that is to say, that
" fruits and vegetables " is target area.
In the present embodiment, Fig. 3 is referred to, Fig. 3 is pen tip position during word enquiring provided in an embodiment of the present invention
Another schematic diagram, when judging result is pen tip there is no after movement, if the position of pen tip rests on always the place of Y1, then table
The upper area of the bright position Y1 is target area, in obtaining the sequential frame image in 2 seconds, is selected from sequential frame image any
One image determines the position of pen tip, and the position in any one image above pen tip is target area, that is to say, that
Position where " diet " is target area.
S104. target word is identified from the target area, and searches the corresponding essential information of the target word;
For example, above-mentioned steps " identifying target word from the target area " specifically include:
Letter identification is carried out to the target area, obtains multiple letters;
The sequence of multiple letter from left to right or from right to left is combined, target word is obtained.
In the present embodiment, using neural network machine learning method, depth is carried out to 26 alphabetical capital and small letter fonts
It practises, realization accurately obtains word, then carries out partial enlargement to target area, carries out the identification of letter, Neng Goushi
Malapropism imperial mother starts to extract letter according to sequence from left to right or from top to bottom to target area, the letter after extraction is pressed
It is combined according to sequence of extraction, obtains target word, be then compared target word with local word library, by local word
Essential information in library containing target word extracts, and the essential information of the target word will not be contained in local word library
Word carry out internet search, obtain the essential information of target word, such as obtain word " diet " essential information " n. (patient or
Slimmer's) diet;Diet;V. it diets;N. a large amount of ".
S105. the essential information is provided to the user.
In the present embodiment, the essential information for obtaining target word is changed into corresponding voice messaging, passes through language
The mode that sound plays is broadcast to client, the essential information of target word can also be directly displayed in photographed screen, allows user
The essential information of target word can intuitively be obtained.
It can be seen from the above, word enquiring method provided in this embodiment, by detecting that the mobile terminal is carrying out image
When shooting, obtain the voice messaging of user, subsequently determine whether the voice messaging whether with default voice match, if so, obtaining pre-
If the sequential frame image shot in duration, and target area is determined from the sequential frame image, it is identified from the target area
Target word, and search the corresponding essential information of the target word, provides the essential information then to the user, without with
Family, which is manually entered word, can be obtained the paraphrase of word, and method is simple, and word enquiring is high-efficient, be conducive to the study for improving user
Progress, learning effect are good.
The method according to described in above-described embodiment, the present embodiment will be described from the angle of word enquiring device, tool
Body says and is described in detail so that the word enquiring device is integrated in the terminal as an example that the mobile terminal can be plate electricity
Brain, mobile phone etc..
Fig. 4 is referred to, Fig. 4 is word enquiring device provided by the embodiments of the present application, is applied to first movement terminal, can be with
It include: to obtain module 10, judgment module 20, analysis module 30, identification module 40 and output module 50, in which:
(1) module 10 is obtained
Module 10 is obtained, for when detecting that the mobile terminal when carrying out image taking, obtains the voice messaging of user.
In the present embodiment, the shooting picture for carrying out image taking is the picture of user's study word, and user is in study word
In the process, mobile terminal camera can be opened and carry out image taking, which may be displayed in screen preview frame,
In, the word source of user's study can be books, newpapers and periodicals and learning software etc., and the voice messaging of user can be by mobile whole
End built-in microphone is acquired, and the speech message issued such as user is " little Bai, verification certificate word ".
(2) judgment module 20
Judgment module 20, for judge the voice messaging whether with default voice match, if so, execute analysis module
30, if it is not, then returning to execution obtains module 10.
In the present embodiment, default voice can be manually set, for example be " verification certificate word ", when user issues voice messaging,
The speech message is matched with " verification certificate word " voice, to analyze whether user needs to enter the service of verification certificate word.
(3) analysis module 30
Analysis module 30 is determined for obtaining the sequential frame image shot in preset duration, and from the sequential frame image
Target area.
In the present embodiment, preset duration can be manually set, and when voice messaging successful match, mobile terminal will start meter
When device start timing, it is 2 seconds that preset duration, which is such as arranged, and according to the shooting image in timing duration to the word of required inquiry
Confirmed.
For example, above-mentioned analysis module 30 is specifically used for:
The sequential frame image is handled using trained neural network model, to identify target in every frame image
The position of subject;
Judge whether the target subject moves according to the position of the target subject in every frame image;
Target area is determined from the sequential frame image according to judging result.
In the present embodiment, target subject can be manually set, for example be learning pen or finger etc., and the word of inquiry can be with
For a word, or a group of words, or in short.It, can be with after speech message and the success of default voice match
By the neural network algorithm of prior learning pen tip picture, the word that needs are inquired is carried out really by the pen tip below word
Recognize, to identify the position of pen tip in every frame image, then sequential frame image is obtained in 2 seconds, is reached 2 seconds when the time
Afterwards, the sequential frame image occurred in 2 seconds is obtained, and filters out the target figure containing confirmation looking up words from sequential frame image
Picture determines target area according to the position of pen tip.
Further, above-mentioned analysis module 30 is specifically used for:
When the judgment result is yes, the first frame image and tail frame image in the sequential frame image are obtained;According to the head frame figure
Picture and tail frame image determine target area;
When the judgment result is no, by the image district in any frame image, above the position of the target subject
Domain is as target area.
Further, above-mentioned analysis module 30 is specifically used for:
Using the position of the target subject in the first frame image as initial position, the target in the tail frame image is shot
The position of object is as final position;
By in any frame image, above the initial position and above final position between image-region as mesh
Mark region.
In the present embodiment, Fig. 2 is referred to, Fig. 2 is pen tip position during word enquiring provided in an embodiment of the present invention
Schematic diagram if the position of pen tip is moved to the position X2 from the position X1, then shows that X1 is arrived after judging result is that pen tip moves
The top of distance is target area between X2, in obtaining the sequential frame image in 2 seconds, using first frame image as initial position,
Using tail frame image as final position, the region of initial position to top between final position is target area, that is to say, that
" fruits and vegetables " is target area.
In the present embodiment, Fig. 3 is referred to, Fig. 3 is pen tip position during word enquiring provided in an embodiment of the present invention
Another schematic diagram, when judging result is pen tip there is no after movement, if the position of pen tip rests on always the place of Y1, then table
The upper area of the bright position Y1 is target area, in obtaining the sequential frame image in 2 seconds, is selected from sequential frame image any
One image determines the position of pen tip, and the position in any one image above pen tip is target area, that is to say, that
Position where " diet " is target area.
(4) identification module 40
Identification module 40 for identifying target word from the target area, and searches the corresponding base of the target word
This information;
For example, above-mentioned identification module 40 is specifically used for:
Letter identification is carried out to the target area, obtains multiple letters;
The sequence of multiple letter from left to right or from right to left is combined, target word is obtained.
In the present embodiment, using neural network machine learning method, depth is carried out to 26 alphabetical capital and small letter fonts
It practises, realization accurately obtains word, then carries out partial enlargement to target area, carries out the identification of letter, Neng Goushi
Malapropism imperial mother starts to extract letter according to sequence from left to right or from top to bottom to target area, the letter after extraction is pressed
It is combined according to sequence of extraction, obtains target word, be then compared target word with local word library, by local word
Essential information in library containing target word extracts, and the essential information of the target word will not be contained in local word library
Word carry out internet search, obtain the essential information of target word, such as obtain word " diet " essential information " n. (patient or
Slimmer's) diet;Diet;V. it diets;N. a large amount of ".
(5) output module 50
Output module 50, for providing the essential information to the user.
In the present embodiment, the essential information for obtaining target word is changed into corresponding voice messaging, passes through language
The mode that sound plays is broadcast to client, the essential information of target word can also be directly displayed in photographed screen, allows user
The essential information of target word can intuitively be obtained.
It can be seen from the above, word enquiring device provided in this embodiment, by detecting that the mobile terminal is carrying out image
When shooting, obtain the voice messaging of user, subsequently determine whether the voice messaging whether with default voice match, if so, obtaining pre-
If the sequential frame image shot in duration, and target area is determined from the sequential frame image, it is identified from the target area
Target word, and search the corresponding essential information of the target word, provides the essential information then to the user, without with
Family, which is manually entered word, can be obtained the paraphrase of word, and method is simple, and word enquiring is high-efficient, be conducive to the study for improving user
Progress, learning effect are good.
In addition, the embodiment of the present application also provides a kind of mobile terminal, which can be smart phone, tablet computer
Etc. equipment.As shown in figure 5, mobile terminal 200 includes processor 201, memory 202.Wherein, processor 201 and memory 202
It is electrically connected.
Processor 201 is the control centre of mobile terminal 200, utilizes various interfaces and the entire mobile terminal of connection
Various pieces by the application program of operation or load store in memory 202, and are called and are stored in memory 202
Data, execute mobile terminal various functions and processing data, thus to mobile terminal carry out integral monitoring.
In the present embodiment, processor 201 in mobile terminal 200 can according to following step, by one or one with
On the corresponding instruction of process of application program be loaded into memory 202, and be stored in memory by processor 201 to run
Application program in 202, to realize various functions:
When detecting that the mobile terminal when carrying out image taking, obtains the voice messaging of user;
Judge the voice messaging whether with default voice match;
If so, obtaining the sequential frame image shot in preset duration, and target area is determined from the sequential frame image;
Target word is identified from the target area, and searches the corresponding essential information of the target word;
The essential information is provided to the user.
Fig. 6 shows the specific block diagram of word enquiring provided in an embodiment of the present invention, which can be used for
The word enquiring method provided in above-described embodiment is provided.The mobile terminal 300 can be smart phone or tablet computer.
RF circuit 310 realizes the mutual conversion of electromagnetic wave and electric signal, thus with logical for receiving and transmitting electromagnetic wave
News network or other equipment are communicated.RF circuit 310 may include various existing for executing the circuit elements of these functions
Part, for example, antenna, RF transceiver, digital signal processor, encryption/deciphering chip, subscriber identity module (SIM) card, storage
Device etc..RF circuit 310 can carry out communicating or by wireless with various networks such as internet, intranet, wireless network
Network is communicated with other equipment.Above-mentioned wireless network may include cellular telephone networks, WLAN or Metropolitan Area Network (MAN).
Various communication standards, agreement and technology, including but not limited to global system for mobile communications can be used in above-mentioned wireless network
(Global System for Mobile Communication, GSM), enhanced mobile communication technology (Enhanced Data
GSM Environment, EDGE), Wideband CDMA Technology (Wideband Code Division Multiple
Access, WCDMA), Code Division Multiple Access (Code Division Access, CDMA), time division multiple access technology (Time
Division Multiple Access, TDMA), adopting wireless fidelity technology (Wireless Fidelity, Wi-Fi) (such as U.S.'s electricity
Gas and Electronic Engineering Association standard IEEE 802.11a, IEEE 802.11b, IEEE802.11g and/or IEEE802.11n),
The networking telephone (Voice over Internet Protocol, VoIP), worldwide interoperability for microwave accesses (Worldwide
Interoperability for Microwave Access, Wi-Max), other are for mail, instant messaging and short message
Agreement and any other suitable communications protocol, or even may include the agreement that those are not developed currently yet.
Memory 320 can be used for storing software program and module, as front camera is taken pictures automatically in above-described embodiment
Corresponding program instruction/the module of light-supplementing system, method, the software program that processor 380 is stored in memory 320 by operation
And module, thereby executing various function application and data processing, i.e. the realization front camera function of taking pictures automatic light-supplementing.
Memory 320 may include high speed random access memory, may also include nonvolatile memory, as one or more magnetic storage fills
It sets, flash memory or other non-volatile solid state memories.In some instances, memory 320 can further comprise relative to place
The remotely located memory of device 380 is managed, these remote memories can pass through network connection to mobile terminal 300.Above-mentioned network
Example include but is not limited to internet, intranet, local area network, mobile radio communication and combinations thereof.
Input unit 330 can be used for receiving the number or character information of input, and generate and user setting and function
Control related keyboard, mouse, operating stick, optics or trackball signal input.Specifically, input unit 330 may include touching
Sensitive surfaces 331 and other input equipments 332.Touch sensitive surface 331, also referred to as touch display screen or Trackpad are collected and are used
Family on it or nearby touch operation (such as user using any suitable object or attachment such as finger, stylus in touch-sensitive table
Operation on face 331 or near touch sensitive surface 331), and corresponding attachment device is driven according to preset formula.It is optional
, touch sensitive surface 331 may include both touch detecting apparatus and touch controller.Wherein, touch detecting apparatus detection is used
The touch orientation at family, and touch operation bring signal is detected, transmit a signal to touch controller;Touch controller is from touch
Touch information is received in detection device, and is converted into contact coordinate, then gives processor 380, and can receive processor 380
The order sent simultaneously is executed.Furthermore, it is possible to using multiple types such as resistance-type, condenser type, infrared ray and surface acoustic waves
Realize touch sensitive surface 331.In addition to touch sensitive surface 331, input unit 330 can also include other input equipments 332.Specifically,
Other input equipments 332 can include but is not limited to physical keyboard, function key (such as volume control button, switch key etc.),
One of trace ball, mouse, operating stick etc. are a variety of.
Display unit 340 can be used for showing information input by user or the information and mobile terminal that are supplied to user
300 various graphical user interface, these graphical user interface can by figure, text, icon, video and any combination thereof Lai
It constitutes.Display unit 340 may include display panel 341, optionally, can using LCD (Liquid Crystal Display,
Liquid crystal display), the forms such as OLED (Organic Light-Emitting Diode, Organic Light Emitting Diode) configure display
Panel 341.Further, touch sensitive surface 331 can cover display panel 341, when touch sensitive surface 331 detect on it or near
Touch operation after, send processor 380 to determine the type of touch event, be followed by subsequent processing device 380 according to touch event
Type provides corresponding visual output on display panel 341.Although in Fig. 6, touch sensitive surface 331 is with display panel 341
Output and input function as two independent components to realize, but in some embodiments it is possible to by touch sensitive surface 331 with
Display panel 341 is integrated and realizes and outputs and inputs function.
Mobile terminal 300 may also include at least one sensor 350, for example, optical sensor, motion sensor and other
Sensor.Specifically, optical sensor may include ambient light sensor and proximity sensor, wherein ambient light sensor can basis
The light and shade of ambient light adjusts the brightness of display panel 341, proximity sensor can when mobile terminal 300 is moved in one's ear,
Close display panel 341 and/or backlight.As a kind of motion sensor, gravity accelerometer can detect all directions
The size of upper (generally three axis) acceleration, can detect that size and the direction of gravity, can be used to identify mobile phone posture when static
Application (such as horizontal/vertical screen switching, dependent game, magnetometer pose calibrating), Vibration identification correlation function (for example pedometer, strikes
Hit) etc.;Gyroscope, barometer, hygrometer, thermometer, infrared sensor for can also configure as mobile terminal 300 etc. other
Sensor, details are not described herein.
Voicefrequency circuit 360, loudspeaker 361, microphone 362 can provide the audio interface between user and mobile terminal 300.
Electric signal after the audio data received conversion can be transferred to loudspeaker 361, be converted by loudspeaker 361 by voicefrequency circuit 360
For voice signal output;On the other hand, the voice signal of collection is converted to electric signal by microphone 362, is connect by voicefrequency circuit 360
Audio data is converted to after receipts, then by after the processing of audio data output processor 380, is sent to through RF circuit 310 such as another
One terminal, or audio data is exported to memory 320 to be further processed.Voicefrequency circuit 360 is also possible that earplug
Jack, to provide the communication of peripheral hardware earphone Yu mobile terminal 300.
Mobile terminal 300 can help user send and receive e-mail, is clear by transmission module 370 (such as Wi-Fi module)
Look at webpage and access streaming video etc., it provides wireless broadband internet for user and accesses.Although Fig. 6 shows transmission mould
Block 370, but it is understood that, and it is not belonging to must be configured into for mobile terminal 300, it can according to need do not changing completely
Become in the range of the essence of invention and omits.
Processor 380 is the control centre of mobile terminal 300, utilizes each of various interfaces and connection whole mobile phone
Part by running or execute the software program and/or module that are stored in memory 320, and calls and is stored in memory
Data in 320 execute the various functions and processing data of mobile terminal 300, to carry out integral monitoring to mobile phone.It is optional
, processor 380 may include one or more processing cores;In some embodiments, processor 380 can integrate application processor
And modem processor, wherein the main processing operation system of application processor, user interface and application program etc., modulatedemodulate
Processor is adjusted mainly to handle wireless communication.It is understood that above-mentioned modem processor can not also be integrated into processor
In 380.
Mobile terminal 300 further includes the power supply 390 (such as battery) powered to all parts, in some embodiments, electricity
Source can be logically contiguous by power-supply management system and processor 380, to realize management charging by power-supply management system, put
The functions such as electricity and power managed.Power supply 190 can also include one or more direct current or AC power source, recharge
The random components such as system, power failure detection circuit, power adapter or inverter, power supply status indicator.
Although being not shown, mobile terminal 300 can also include camera (such as front camera, rear camera), bluetooth
Module etc., details are not described herein.Specifically in the present embodiment, the display unit of mobile terminal is touch-screen display, mobile whole
End further includes having memory and one perhaps more than one program one of them or more than one program being stored in and deposits
In reservoir, and it is configured to execute one or more than one program by one or more than one processor to include for carrying out
The instruction operated below:
When detecting that the mobile terminal when carrying out image taking, obtains the voice messaging of user;
Judge the voice messaging whether with default voice match;
If so, obtaining the sequential frame image shot in preset duration, and target area is determined from the sequential frame image;
Target word is identified from the target area, and searches the corresponding essential information of the target word;
The essential information is provided to the user.
When it is implemented, the above modules can be used as independent entity to realize, any combination can also be carried out, is made
It is realized for same or several entities, the specific implementation of the above modules can be found in the embodiment of the method for front, herein not
It repeats again.
It will appreciated by the skilled person that all or part of the steps in the various methods of above-described embodiment can be with
It is completed by instructing, or relevant hardware is controlled by instruction to complete, which can store computer-readable deposits in one
In storage media, and is loaded and executed by processor.For this purpose, the embodiment of the present invention provides a kind of storage medium, wherein storing
There is a plurality of instruction, which can be loaded by processor, look into execute any word provided by the embodiment of the present invention
Step in inquiry method.
Wherein, which may include: read-only memory (ROM, Read Only Memory), random access memory
Body (RAM, Random Access Memory), disk or CD etc..
By the instruction stored in the storage medium, any word provided by the embodiment of the present invention can be executed and looked into
Step in inquiry method, it is thereby achieved that achieved by any word enquiring method provided by the embodiment of the present invention
Beneficial effect is detailed in the embodiment of front, and details are not described herein.
The specific implementation of above each operation can be found in the embodiment of front, and details are not described herein.
It is to sum up somebody's turn to do, although the application is disclosed above with preferred embodiment, above preferred embodiment is not to limit
The application, those skilled in the art are not departing from spirit and scope, can make various changes and profit
Decorations, therefore the protection scope of the application subjects to the scope of the claims.
Claims (10)
1. a kind of word enquiring method is applied to mobile terminal characterized by comprising
When detecting that the mobile terminal when carrying out image taking, obtains the voice messaging of user;
Judge the voice messaging whether with default voice match;
If so, obtaining the sequential frame image shot in preset duration, and target area is determined from the sequential frame image;
Target word is identified from the target area, and searches the corresponding essential information of the target word;
The essential information is provided to the user.
2. word enquiring method as described in claim 1, which is characterized in that described to determine target from the sequential frame image
The step of region, specifically includes:
The sequential frame image is handled using trained neural network model, to identify, target is clapped in every frame image
Take the photograph the position of object;
Judge whether the target subject moves according to the position of target subject described in every frame image;
Target area is determined from the sequential frame image according to judging result.
3. word enquiring method as claimed in claim 2, which is characterized in that it is described according to judging result from the successive frame figure
The step of target area is determined as in specifically includes:
When the judgment result is yes, the first frame image and tail frame image in the sequential frame image are obtained;According to the first frame figure
Picture and tail frame image determine target area;
When the judgment result is no, by the image district in any frame described image, above the position of the target subject
Domain is as target area.
4. word enquiring method as claimed in claim 3, which is characterized in that described according to the first frame image and tail frame image
The step of determining target area specifically includes:
Using the position of target subject described in the first frame image as initial position, by target described in the tail frame image
The position of subject is as final position;
By in any frame described image, above the initial position and above final position between image-region as mesh
Mark region.
5. word enquiring method as described in claim 1, which is characterized in that described to identify target from the target area
The step of word, specifically includes:
Letter identification is carried out to the target area, obtains multiple letters;
The sequence of the multiple letter from left to right or from right to left is combined, target word is obtained.
6. a kind of word enquiring device is applied to mobile terminal characterized by comprising
Module is obtained, for when detecting that the mobile terminal when carrying out image taking, obtains the voice messaging of user;
Judgment module, for judge the voice messaging whether with default voice match;
Analysis module, for judging result if so, obtaining the sequential frame image that shoots in preset duration, and from the successive frame
Target area is determined in image;
Identification module, for identifying target word from the target area, and it is corresponding basic to search the target word
Information;
Output module, for providing the essential information to the user.
7. word enquiring device as claimed in claim 6, which is characterized in that the analysis module is specifically used for:
The sequential frame image is handled using trained neural network model, to identify, target is clapped in every frame image
Take the photograph the position of object;
Judge whether the target subject moves according to the position of target subject described in every frame image;
Target area is determined from the sequential frame image according to judging result.
8. word enquiring device as claimed in claim 7, which is characterized in that the analysis module is specifically used for:
When the judgment result is yes, the first frame image and tail frame image in the sequential frame image are obtained;According to the first frame figure
Picture and tail frame image determine target area;
When the judgment result is no, by the image district in any frame described image, above the position of the target subject
Domain is as target area.
9. word enquiring device as claimed in claim 8, which is characterized in that the analysis module is specifically used for:
Using the position of target subject described in the first frame image as initial position, by target described in the tail frame image
The position of subject is as final position;
By in any frame described image, above the initial position and above final position between image-region as mesh
Mark region.
10. a kind of computer readable storage medium, which is characterized in that be stored with a plurality of instruction, the finger in the storage medium
It enables and is suitable for being loaded by processor to execute such as word enquiring method described in any one of claim 1 to 5.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201811575255.5A CN109740594A (en) | 2018-12-21 | 2018-12-21 | Word enquiring method, apparatus and storage medium |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201811575255.5A CN109740594A (en) | 2018-12-21 | 2018-12-21 | Word enquiring method, apparatus and storage medium |
Publications (1)
Publication Number | Publication Date |
---|---|
CN109740594A true CN109740594A (en) | 2019-05-10 |
Family
ID=66359599
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201811575255.5A Pending CN109740594A (en) | 2018-12-21 | 2018-12-21 | Word enquiring method, apparatus and storage medium |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN109740594A (en) |
Citations (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20050076084A1 (en) * | 2003-10-03 | 2005-04-07 | Corvigo | Dynamic message filtering |
CN101271636A (en) * | 2008-04-30 | 2008-09-24 | 东莞市步步高教育电子产品有限公司 | Point-reading machine control method with spoken language exercise module |
CN101645190A (en) * | 2009-07-22 | 2010-02-10 | 合肥讯飞数码科技有限公司 | Word inquiring system and inquiring system method thereof |
CN102169540A (en) * | 2011-03-28 | 2011-08-31 | 汉王科技股份有限公司 | Camera-based point reading positioning method and device |
US20140188462A1 (en) * | 2011-09-24 | 2014-07-03 | Lotfi A. Zadeh | Methods and Systems for Applications for Z-numbers |
CN106648367A (en) * | 2016-12-23 | 2017-05-10 | 广东小天才科技有限公司 | Clicking and reading method and clicking and reading device |
CN107451127A (en) * | 2017-07-04 | 2017-12-08 | 广东小天才科技有限公司 | A kind of word translation method and system based on image, mobile device |
CN107766826A (en) * | 2017-10-30 | 2018-03-06 | 广东小天才科技有限公司 | A kind of method and electronic equipment for searching word lexical or textual analysis |
CN107835366A (en) * | 2017-11-07 | 2018-03-23 | 广东欧珀移动通信有限公司 | Multi-medium play method, device, storage medium and electronic equipment |
CN108509136A (en) * | 2018-04-12 | 2018-09-07 | 山东音为爱智能科技有限公司 | A kind of children based on artificial intelligence paint this aid reading method |
CN108536287A (en) * | 2018-03-26 | 2018-09-14 | 深圳市深晓科技有限公司 | A kind of method and device indicating reading according to user |
-
2018
- 2018-12-21 CN CN201811575255.5A patent/CN109740594A/en active Pending
Patent Citations (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20050076084A1 (en) * | 2003-10-03 | 2005-04-07 | Corvigo | Dynamic message filtering |
CN101271636A (en) * | 2008-04-30 | 2008-09-24 | 东莞市步步高教育电子产品有限公司 | Point-reading machine control method with spoken language exercise module |
CN101645190A (en) * | 2009-07-22 | 2010-02-10 | 合肥讯飞数码科技有限公司 | Word inquiring system and inquiring system method thereof |
CN102169540A (en) * | 2011-03-28 | 2011-08-31 | 汉王科技股份有限公司 | Camera-based point reading positioning method and device |
US20140188462A1 (en) * | 2011-09-24 | 2014-07-03 | Lotfi A. Zadeh | Methods and Systems for Applications for Z-numbers |
CN106648367A (en) * | 2016-12-23 | 2017-05-10 | 广东小天才科技有限公司 | Clicking and reading method and clicking and reading device |
CN107451127A (en) * | 2017-07-04 | 2017-12-08 | 广东小天才科技有限公司 | A kind of word translation method and system based on image, mobile device |
CN107766826A (en) * | 2017-10-30 | 2018-03-06 | 广东小天才科技有限公司 | A kind of method and electronic equipment for searching word lexical or textual analysis |
CN107835366A (en) * | 2017-11-07 | 2018-03-23 | 广东欧珀移动通信有限公司 | Multi-medium play method, device, storage medium and electronic equipment |
CN108536287A (en) * | 2018-03-26 | 2018-09-14 | 深圳市深晓科技有限公司 | A kind of method and device indicating reading according to user |
CN108509136A (en) * | 2018-04-12 | 2018-09-07 | 山东音为爱智能科技有限公司 | A kind of children based on artificial intelligence paint this aid reading method |
Non-Patent Citations (3)
Title |
---|
XINHAO LIU ET AL.: "Scene text recognition with high performance CNN classifier and efficient word inference", 《2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP)》 * |
周婷: "基于视频图像的点读机坐标定位方法研究", 《中国优秀硕士学位论文全文数据库 信息科技辑》 * |
郭宝龙 等: "《数字图像处理系统工程导论》", 31 July 2012, 西安电子科技大学出版社 * |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN109035914B (en) | Learning method based on intelligent desk lamp and intelligent desk lamp | |
CN106959761B (en) | A kind of terminal photographic method, device and terminal | |
CN108735216A (en) | A kind of voice based on semantics recognition searches topic method and private tutor's equipment | |
US20190019015A1 (en) | Method and related product for recognizing live face | |
CN103744592A (en) | Information processing method and terminal | |
CN109359056A (en) | A kind of applied program testing method and device | |
CN104217172A (en) | Privacy content checking method and device | |
CN107241552A (en) | A kind of image acquiring method, device, storage medium and terminal | |
CN106933351A (en) | A kind of method for starting camera in the terminal, device and mobile terminal | |
CN108604251A (en) | A kind of sharing method and terminal device of multimedia file | |
CN109523253A (en) | A kind of method of payment and device | |
CN107908770A (en) | A kind of photo searching method and mobile terminal | |
CN105120158B (en) | The image pickup method and device of mobile terminal | |
CN107948729A (en) | Rich Media's processing method, device, storage medium and electronic equipment | |
CN109493821A (en) | Screen brightness regulation method, device and storage medium | |
CN105635553B (en) | Image shooting method and device | |
CN106126171B (en) | A kind of sound effect treatment method and mobile terminal | |
CN109561255A (en) | Terminal photographic method, device and storage medium | |
CN109656431A (en) | Information display method, device and storage medium | |
CN109413256A (en) | Contact person information processing method, device, storage medium and electronic equipment | |
CN112287317B (en) | User information input method and electronic equipment | |
CN110162707A (en) | A kind of information recommendation method, terminal and computer readable storage medium | |
CN110113782A (en) | Data transmission method, device and storage medium | |
CN106486119A (en) | A kind of method and apparatus of identification voice messaging | |
CN109634481A (en) | Text display method, device, mobile terminal and storage medium |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20190510 |