CN109740594A

CN109740594A - Word enquiring method, apparatus and storage medium

Info

Publication number: CN109740594A
Application number: CN201811575255.5A
Authority: CN
Inventors: 许玉新; 张刘哲
Original assignee: Huizhou TCL Mobile Communication Co Ltd
Current assignee: Huizhou TCL Mobile Communication Co Ltd
Priority date: 2018-12-21
Filing date: 2018-12-21
Publication date: 2019-05-10

Abstract

The present invention relates to a kind of word enquiring methods, device and storage medium, it include: to detect the mobile terminal when carrying out image taking, obtain the voice messaging of user, subsequently determine whether the voice messaging whether with default voice match, if, then obtain the sequential frame image shot in preset duration, and target area is determined from the sequential frame image, target word is identified from the target area, and search the corresponding essential information of the target word, the essential information is provided then to the user, being manually entered word without user can be obtained the paraphrase of word, method is simple, word enquiring is high-efficient, be conducive to improve the study schedule of user, learning effect is good.

Description

Word enquiring method, apparatus and storage medium

[technical field]

This application involves image identification technical fields, more specifically to a kind of word enquiring method, apparatus and storage Medium.

[background technique]

With more more and more universalization of English, periodical and works containing English also more and more enter the view of people Open country, therewith caused by word enquiring amount it is also more and more.In currently marketed English learning tool, the tool of verification certificate word with Science and technology development become increasingly to facilitate, from the lookup english dictionary of beginning, finally using learning machine carry out word enquiring, The change of looking up words method is more and more convenient for learner, still, when the word quantity for occurring needing to inquire is excessive, It will lead to learner's repeatedly pause manual queries word, upset the thinking and inspiration of learner, influence learning efficiency.

In conclusion existing word enquiring method also needs learner to carry out being manually entered word, word is then obtained Paraphrase, cause the low effect low with learning efficiency of the level of learning of learner.

[summary of the invention]

A kind of method, apparatus and storage medium for being designed to provide word enquiring of the application, can be manual without user Word enquiring is inputted, is improved learning efficiency.

To solve the above-mentioned problems, the embodiment of the present application provides a kind of word enquiring method, is applied to mobile terminal, packet It includes:

When detecting that the mobile terminal when carrying out image taking, obtains the voice messaging of user；

Judge the voice messaging whether with default voice match；

If so, obtaining the sequential frame image shot in preset duration, and target area is determined from the sequential frame image Domain；

Target word is identified from the target area, and searches the corresponding essential information of the target word；

The essential information is provided to the user.

The embodiment of the present application also provides a kind of word enquiring devices, are applied to mobile terminal, comprising:

Obtain module: when detecting that the mobile terminal when carrying out image taking, obtains the voice messaging of user；

Judgment module: judge the voice messaging whether with default voice match；

Analysis module: if so, obtaining the sequential frame image shot in preset duration, and from the sequential frame image really Set the goal region；

Identification module: identifying target word from the target area, and it is corresponding basic to search the target word Information；

Output module: Xiang Suoshu user provides the essential information.

The embodiment of the present application also provides a kind of computer readable storage medium, a plurality of finger is stored in the storage medium It enables, described instruction is suitable for being loaded by processor to execute any of the above-described word enquiring method.

Word enquiring method, apparatus provided by the present application and storage medium, when detecting that the mobile terminal carrying out figure As shooting when, by obtain user voice messaging, subsequently determine whether the voice messaging whether with default voice match, if so, The sequential frame image shot in preset duration is then obtained, and determines target area from the sequential frame image, from the target Target word is identified in region, and searches the corresponding essential information of the target word, then to described in user offer Essential information, being manually entered word without user can be obtained the paraphrase of word, and method is simple, and word enquiring is high-efficient, Be conducive to improve the study schedule of user, learning effect is good.

[Detailed description of the invention]

To describe the technical solutions in the embodiments of the present invention more clearly, make required in being described below to embodiment Attached drawing is briefly described, it should be apparent that, drawings in the following description are only some embodiments of the invention, for For those skilled in the art, without creative efforts, it can also be obtained according to these attached drawings other attached Figure.

Fig. 1 is the flow diagram of word enquiring method provided in an embodiment of the present invention.

Fig. 2 is the schematic diagram of pen tip position during word enquiring provided in an embodiment of the present invention.

Fig. 3 is another schematic diagram of pen tip position during word enquiring provided in an embodiment of the present invention.

Fig. 4 is the structural schematic diagram of word enquiring device provided in an embodiment of the present invention.

Fig. 5 is the structural schematic diagram of mobile terminal provided in an embodiment of the present invention.

Fig. 6 is another structural schematic diagram of mobile terminal provided in an embodiment of the present invention.

[specific embodiment]

For technical problems, technical solutions and advantages to be solved are more clearly understood, below in conjunction with Accompanying drawings and embodiments, the present invention will be described in further detail.It should be appreciated that specific embodiment described herein is only used To explain the present invention, it is not intended to limit the present invention.

A kind of word enquiring method, be applied to mobile terminal, comprising: when detect the mobile terminal carry out image taking When, obtain the voice messaging of user；Judge the voice messaging whether with default voice match；If so, obtaining in preset duration The sequential frame image of shooting, and target area is determined from the sequential frame image；Target word is identified from the target area, And search the corresponding essential information of the target word；The essential information is provided to the user.

Referring to Figure 1, Fig. 1 is the flow diagram of word enquiring provided by the embodiments of the present application, the word enquiring method Applied to mobile terminal, detailed process be can be such that

S101. when detecting that the mobile terminal when carrying out image taking, obtains the voice messaging of user.

In the present embodiment, the shooting picture for carrying out image taking is the picture of user's study word, and user is in study word In the process, mobile terminal camera can be opened and carry out image taking, which may be displayed in screen preview frame, In, the word source of user's study can be books, newpapers and periodicals and learning software etc., and the voice messaging of user can be by mobile whole End built-in microphone is acquired, and the speech message issued such as user is " little Bai, verification certificate word ".

S102. judge the voice messaging whether with default voice match, if so, execute following step S103, if it is not, then It returns and executes above-mentioned steps S101.

In the present embodiment, default voice can be manually set, for example be " verification certificate word ", when user issues voice messaging, The speech message is matched with " verification certificate word " voice, to analyze whether user needs to enter the service of verification certificate word.

S103. the sequential frame image shot in preset duration is obtained, and determines target area from the sequential frame image.

In the present embodiment, preset duration can be manually set, and when voice messaging successful match, mobile terminal will start meter When device start timing, it is 2 seconds that preset duration, which is such as arranged, and according to the shooting image in timing duration to the word of required inquiry Confirmed.

For example, above-mentioned steps " determining target area from the sequential frame image " specifically include:

The sequential frame image is handled using trained neural network model, to identify target in every frame image The position of subject；

Judge whether the target subject moves according to the position of the target subject in every frame image；

Target area is determined from the sequential frame image according to judging result.

In the present embodiment, target subject can be manually set, for example be learning pen or finger etc., and the word of inquiry can be with For a word, or a group of words, or in short.It, can be with after speech message and the success of default voice match By the neural network algorithm of prior learning pen tip picture, the word that needs are inquired is carried out really by the pen tip below word Recognize, to identify the position of pen tip in every frame image, then sequential frame image is obtained in 2 seconds, is reached 2 seconds when the time Afterwards, the sequential frame image occurred in 2 seconds is obtained, and filters out the target figure containing confirmation looking up words from sequential frame image Picture determines target area according to the position of pen tip.

Above-mentioned steps " target area is determined from the sequential frame image according to judging result " specifically include:

When the judgment result is yes, the first frame image and tail frame image in the sequential frame image are obtained；According to the head frame figure Picture and tail frame image determine target area；

When the judgment result is no, by the image district in any frame image, above the position of the target subject Domain is as target area.

Above-mentioned steps " determining target area according to the first frame image and tail frame image " specifically include:

Using the position of the target subject in the first frame image as initial position, the target in the tail frame image is shot The position of object is as final position；

By in any frame image, above the initial position and above final position between image-region as mesh Mark region.

In the present embodiment, Fig. 2 is referred to, Fig. 2 is pen tip position during word enquiring provided in an embodiment of the present invention Schematic diagram if the position of pen tip is moved to the position X2 from the position X1, then shows that X1 is arrived after judging result is that pen tip moves The top of distance is target area between X2, in obtaining the sequential frame image in 2 seconds, using first frame image as initial position, Using tail frame image as final position, the region of initial position to top between final position is target area, that is to say, that " fruits and vegetables " is target area.

In the present embodiment, Fig. 3 is referred to, Fig. 3 is pen tip position during word enquiring provided in an embodiment of the present invention Another schematic diagram, when judging result is pen tip there is no after movement, if the position of pen tip rests on always the place of Y1, then table The upper area of the bright position Y1 is target area, in obtaining the sequential frame image in 2 seconds, is selected from sequential frame image any One image determines the position of pen tip, and the position in any one image above pen tip is target area, that is to say, that Position where " diet " is target area.

S104. target word is identified from the target area, and searches the corresponding essential information of the target word；

For example, above-mentioned steps " identifying target word from the target area " specifically include:

Letter identification is carried out to the target area, obtains multiple letters；

The sequence of multiple letter from left to right or from right to left is combined, target word is obtained.

In the present embodiment, using neural network machine learning method, depth is carried out to 26 alphabetical capital and small letter fonts It practises, realization accurately obtains word, then carries out partial enlargement to target area, carries out the identification of letter, Neng Goushi Malapropism imperial mother starts to extract letter according to sequence from left to right or from top to bottom to target area, the letter after extraction is pressed It is combined according to sequence of extraction, obtains target word, be then compared target word with local word library, by local word Essential information in library containing target word extracts, and the essential information of the target word will not be contained in local word library Word carry out internet search, obtain the essential information of target word, such as obtain word " diet " essential information " n. (patient or Slimmer's) diet；Diet；V. it diets；N. a large amount of ".

S105. the essential information is provided to the user.

In the present embodiment, the essential information for obtaining target word is changed into corresponding voice messaging, passes through language The mode that sound plays is broadcast to client, the essential information of target word can also be directly displayed in photographed screen, allows user The essential information of target word can intuitively be obtained.

It can be seen from the above, word enquiring method provided in this embodiment, by detecting that the mobile terminal is carrying out image When shooting, obtain the voice messaging of user, subsequently determine whether the voice messaging whether with default voice match, if so, obtaining pre- If the sequential frame image shot in duration, and target area is determined from the sequential frame image, it is identified from the target area Target word, and search the corresponding essential information of the target word, provides the essential information then to the user, without with Family, which is manually entered word, can be obtained the paraphrase of word, and method is simple, and word enquiring is high-efficient, be conducive to the study for improving user Progress, learning effect are good.

The method according to described in above-described embodiment, the present embodiment will be described from the angle of word enquiring device, tool Body says and is described in detail so that the word enquiring device is integrated in the terminal as an example that the mobile terminal can be plate electricity Brain, mobile phone etc..

Fig. 4 is referred to, Fig. 4 is word enquiring device provided by the embodiments of the present application, is applied to first movement terminal, can be with It include: to obtain module 10, judgment module 20, analysis module 30, identification module 40 and output module 50, in which:

(1) module 10 is obtained

Module 10 is obtained, for when detecting that the mobile terminal when carrying out image taking, obtains the voice messaging of user.

(2) judgment module 20

Judgment module 20, for judge the voice messaging whether with default voice match, if so, execute analysis module 30, if it is not, then returning to execution obtains module 10.

(3) analysis module 30

Analysis module 30 is determined for obtaining the sequential frame image shot in preset duration, and from the sequential frame image Target area.

For example, above-mentioned analysis module 30 is specifically used for:

Further, above-mentioned analysis module 30 is specifically used for:

(4) identification module 40

Identification module 40 for identifying target word from the target area, and searches the corresponding base of the target word This information；

For example, above-mentioned identification module 40 is specifically used for:

(5) output module 50

Output module 50, for providing the essential information to the user.

It can be seen from the above, word enquiring device provided in this embodiment, by detecting that the mobile terminal is carrying out image When shooting, obtain the voice messaging of user, subsequently determine whether the voice messaging whether with default voice match, if so, obtaining pre- If the sequential frame image shot in duration, and target area is determined from the sequential frame image, it is identified from the target area Target word, and search the corresponding essential information of the target word, provides the essential information then to the user, without with Family, which is manually entered word, can be obtained the paraphrase of word, and method is simple, and word enquiring is high-efficient, be conducive to the study for improving user Progress, learning effect are good.

In addition, the embodiment of the present application also provides a kind of mobile terminal, which can be smart phone, tablet computer Etc. equipment.As shown in figure 5, mobile terminal 200 includes processor 201, memory 202.Wherein, processor 201 and memory 202 It is electrically connected.

Processor 201 is the control centre of mobile terminal 200, utilizes various interfaces and the entire mobile terminal of connection Various pieces by the application program of operation or load store in memory 202, and are called and are stored in memory 202 Data, execute mobile terminal various functions and processing data, thus to mobile terminal carry out integral monitoring.

In the present embodiment, processor 201 in mobile terminal 200 can according to following step, by one or one with On the corresponding instruction of process of application program be loaded into memory 202, and be stored in memory by processor 201 to run Application program in 202, to realize various functions:

Judge the voice messaging whether with default voice match；

If so, obtaining the sequential frame image shot in preset duration, and target area is determined from the sequential frame image；

The essential information is provided to the user.

Fig. 6 shows the specific block diagram of word enquiring provided in an embodiment of the present invention, which can be used for The word enquiring method provided in above-described embodiment is provided.The mobile terminal 300 can be smart phone or tablet computer.

RF circuit 310 realizes the mutual conversion of electromagnetic wave and electric signal, thus with logical for receiving and transmitting electromagnetic wave News network or other equipment are communicated.RF circuit 310 may include various existing for executing the circuit elements of these functions Part, for example, antenna, RF transceiver, digital signal processor, encryption/deciphering chip, subscriber identity module (SIM) card, storage Device etc..RF circuit 310 can carry out communicating or by wireless with various networks such as internet, intranet, wireless network Network is communicated with other equipment.Above-mentioned wireless network may include cellular telephone networks, WLAN or Metropolitan Area Network (MAN). Various communication standards, agreement and technology, including but not limited to global system for mobile communications can be used in above-mentioned wireless network (Global System for Mobile Communication, GSM), enhanced mobile communication technology (Enhanced Data GSM Environment, EDGE), Wideband CDMA Technology (Wideband Code Division Multiple Access, WCDMA), Code Division Multiple Access (Code Division Access, CDMA), time division multiple access technology (Time Division Multiple Access, TDMA), adopting wireless fidelity technology (Wireless Fidelity, Wi-Fi) (such as U.S.'s electricity Gas and Electronic Engineering Association standard IEEE 802.11a, IEEE 802.11b, IEEE802.11g and/or IEEE802.11n), The networking telephone (Voice over Internet Protocol, VoIP), worldwide interoperability for microwave accesses (Worldwide Interoperability for Microwave Access, Wi-Max), other are for mail, instant messaging and short message Agreement and any other suitable communications protocol, or even may include the agreement that those are not developed currently yet.

Memory 320 can be used for storing software program and module, as front camera is taken pictures automatically in above-described embodiment Corresponding program instruction/the module of light-supplementing system, method, the software program that processor 380 is stored in memory 320 by operation And module, thereby executing various function application and data processing, i.e. the realization front camera function of taking pictures automatic light-supplementing. Memory 320 may include high speed random access memory, may also include nonvolatile memory, as one or more magnetic storage fills It sets, flash memory or other non-volatile solid state memories.In some instances, memory 320 can further comprise relative to place The remotely located memory of device 380 is managed, these remote memories can pass through network connection to mobile terminal 300.Above-mentioned network Example include but is not limited to internet, intranet, local area network, mobile radio communication and combinations thereof.

Input unit 330 can be used for receiving the number or character information of input, and generate and user setting and function Control related keyboard, mouse, operating stick, optics or trackball signal input.Specifically, input unit 330 may include touching Sensitive surfaces 331 and other input equipments 332.Touch sensitive surface 331, also referred to as touch display screen or Trackpad are collected and are used Family on it or nearby touch operation (such as user using any suitable object or attachment such as finger, stylus in touch-sensitive table Operation on face 331 or near touch sensitive surface 331), and corresponding attachment device is driven according to preset formula.It is optional , touch sensitive surface 331 may include both touch detecting apparatus and touch controller.Wherein, touch detecting apparatus detection is used The touch orientation at family, and touch operation bring signal is detected, transmit a signal to touch controller；Touch controller is from touch Touch information is received in detection device, and is converted into contact coordinate, then gives processor 380, and can receive processor 380 The order sent simultaneously is executed.Furthermore, it is possible to using multiple types such as resistance-type, condenser type, infrared ray and surface acoustic waves Realize touch sensitive surface 331.In addition to touch sensitive surface 331, input unit 330 can also include other input equipments 332.Specifically, Other input equipments 332 can include but is not limited to physical keyboard, function key (such as volume control button, switch key etc.), One of trace ball, mouse, operating stick etc. are a variety of.

Display unit 340 can be used for showing information input by user or the information and mobile terminal that are supplied to user 300 various graphical user interface, these graphical user interface can by figure, text, icon, video and any combination thereof Lai It constitutes.Display unit 340 may include display panel 341, optionally, can using LCD (Liquid Crystal Display, Liquid crystal display), the forms such as OLED (Organic Light-Emitting Diode, Organic Light Emitting Diode) configure display Panel 341.Further, touch sensitive surface 331 can cover display panel 341, when touch sensitive surface 331 detect on it or near Touch operation after, send processor 380 to determine the type of touch event, be followed by subsequent processing device 380 according to touch event Type provides corresponding visual output on display panel 341.Although in Fig. 6, touch sensitive surface 331 is with display panel 341 Output and input function as two independent components to realize, but in some embodiments it is possible to by touch sensitive surface 331 with Display panel 341 is integrated and realizes and outputs and inputs function.

Mobile terminal 300 may also include at least one sensor 350, for example, optical sensor, motion sensor and other Sensor.Specifically, optical sensor may include ambient light sensor and proximity sensor, wherein ambient light sensor can basis The light and shade of ambient light adjusts the brightness of display panel 341, proximity sensor can when mobile terminal 300 is moved in one's ear, Close display panel 341 and/or backlight.As a kind of motion sensor, gravity accelerometer can detect all directions The size of upper (generally three axis) acceleration, can detect that size and the direction of gravity, can be used to identify mobile phone posture when static Application (such as horizontal/vertical screen switching, dependent game, magnetometer pose calibrating), Vibration identification correlation function (for example pedometer, strikes Hit) etc.；Gyroscope, barometer, hygrometer, thermometer, infrared sensor for can also configure as mobile terminal 300 etc. other Sensor, details are not described herein.

Voicefrequency circuit 360, loudspeaker 361, microphone 362 can provide the audio interface between user and mobile terminal 300. Electric signal after the audio data received conversion can be transferred to loudspeaker 361, be converted by loudspeaker 361 by voicefrequency circuit 360 For voice signal output；On the other hand, the voice signal of collection is converted to electric signal by microphone 362, is connect by voicefrequency circuit 360 Audio data is converted to after receipts, then by after the processing of audio data output processor 380, is sent to through RF circuit 310 such as another One terminal, or audio data is exported to memory 320 to be further processed.Voicefrequency circuit 360 is also possible that earplug Jack, to provide the communication of peripheral hardware earphone Yu mobile terminal 300.

Mobile terminal 300 can help user send and receive e-mail, is clear by transmission module 370 (such as Wi-Fi module) Look at webpage and access streaming video etc., it provides wireless broadband internet for user and accesses.Although Fig. 6 shows transmission mould Block 370, but it is understood that, and it is not belonging to must be configured into for mobile terminal 300, it can according to need do not changing completely Become in the range of the essence of invention and omits.

Processor 380 is the control centre of mobile terminal 300, utilizes each of various interfaces and connection whole mobile phone Part by running or execute the software program and/or module that are stored in memory 320, and calls and is stored in memory Data in 320 execute the various functions and processing data of mobile terminal 300, to carry out integral monitoring to mobile phone.It is optional , processor 380 may include one or more processing cores；In some embodiments, processor 380 can integrate application processor And modem processor, wherein the main processing operation system of application processor, user interface and application program etc., modulatedemodulate Processor is adjusted mainly to handle wireless communication.It is understood that above-mentioned modem processor can not also be integrated into processor In 380.

Mobile terminal 300 further includes the power supply 390 (such as battery) powered to all parts, in some embodiments, electricity Source can be logically contiguous by power-supply management system and processor 380, to realize management charging by power-supply management system, put The functions such as electricity and power managed.Power supply 190 can also include one or more direct current or AC power source, recharge The random components such as system, power failure detection circuit, power adapter or inverter, power supply status indicator.

Although being not shown, mobile terminal 300 can also include camera (such as front camera, rear camera), bluetooth Module etc., details are not described herein.Specifically in the present embodiment, the display unit of mobile terminal is touch-screen display, mobile whole End further includes having memory and one perhaps more than one program one of them or more than one program being stored in and deposits In reservoir, and it is configured to execute one or more than one program by one or more than one processor to include for carrying out The instruction operated below:

Judge the voice messaging whether with default voice match；

The essential information is provided to the user.

When it is implemented, the above modules can be used as independent entity to realize, any combination can also be carried out, is made It is realized for same or several entities, the specific implementation of the above modules can be found in the embodiment of the method for front, herein not It repeats again.

It will appreciated by the skilled person that all or part of the steps in the various methods of above-described embodiment can be with It is completed by instructing, or relevant hardware is controlled by instruction to complete, which can store computer-readable deposits in one In storage media, and is loaded and executed by processor.For this purpose, the embodiment of the present invention provides a kind of storage medium, wherein storing There is a plurality of instruction, which can be loaded by processor, look into execute any word provided by the embodiment of the present invention Step in inquiry method.

Wherein, which may include: read-only memory (ROM, Read Only Memory), random access memory Body (RAM, Random Access Memory), disk or CD etc..

By the instruction stored in the storage medium, any word provided by the embodiment of the present invention can be executed and looked into Step in inquiry method, it is thereby achieved that achieved by any word enquiring method provided by the embodiment of the present invention Beneficial effect is detailed in the embodiment of front, and details are not described herein.

The specific implementation of above each operation can be found in the embodiment of front, and details are not described herein.

It is to sum up somebody's turn to do, although the application is disclosed above with preferred embodiment, above preferred embodiment is not to limit The application, those skilled in the art are not departing from spirit and scope, can make various changes and profit Decorations, therefore the protection scope of the application subjects to the scope of the claims.

Claims

1. a kind of word enquiring method is applied to mobile terminal characterized by comprising

Judge the voice messaging whether with default voice match；

The essential information is provided to the user.

2. word enquiring method as described in claim 1, which is characterized in that described to determine target from the sequential frame image The step of region, specifically includes:

The sequential frame image is handled using trained neural network model, to identify, target is clapped in every frame image Take the photograph the position of object；

Judge whether the target subject moves according to the position of target subject described in every frame image；

3. word enquiring method as claimed in claim 2, which is characterized in that it is described according to judging result from the successive frame figure The step of target area is determined as in specifically includes:

When the judgment result is yes, the first frame image and tail frame image in the sequential frame image are obtained；According to the first frame figure Picture and tail frame image determine target area；

When the judgment result is no, by the image district in any frame described image, above the position of the target subject Domain is as target area.

4. word enquiring method as claimed in claim 3, which is characterized in that described according to the first frame image and tail frame image The step of determining target area specifically includes:

Using the position of target subject described in the first frame image as initial position, by target described in the tail frame image The position of subject is as final position；

By in any frame described image, above the initial position and above final position between image-region as mesh Mark region.

5. word enquiring method as described in claim 1, which is characterized in that described to identify target from the target area The step of word, specifically includes:

The sequence of the multiple letter from left to right or from right to left is combined, target word is obtained.

6. a kind of word enquiring device is applied to mobile terminal characterized by comprising

Module is obtained, for when detecting that the mobile terminal when carrying out image taking, obtains the voice messaging of user；

Judgment module, for judge the voice messaging whether with default voice match；

Analysis module, for judging result if so, obtaining the sequential frame image that shoots in preset duration, and from the successive frame Target area is determined in image；

Identification module, for identifying target word from the target area, and it is corresponding basic to search the target word Information；

Output module, for providing the essential information to the user.

7. word enquiring device as claimed in claim 6, which is characterized in that the analysis module is specifically used for:

8. word enquiring device as claimed in claim 7, which is characterized in that the analysis module is specifically used for:

9. word enquiring device as claimed in claim 8, which is characterized in that the analysis module is specifically used for:

10. a kind of computer readable storage medium, which is characterized in that be stored with a plurality of instruction, the finger in the storage medium It enables and is suitable for being loaded by processor to execute such as word enquiring method described in any one of claim 1 to 5.