CN106611058A - Method and device for searching test questions - Google Patents

Method and device for searching test questions Download PDF

Info

Publication number
CN106611058A
CN106611058A CN201611229381.6A CN201611229381A CN106611058A CN 106611058 A CN106611058 A CN 106611058A CN 201611229381 A CN201611229381 A CN 201611229381A CN 106611058 A CN106611058 A CN 106611058A
Authority
CN
China
Prior art keywords
examination question
search results
scoring
search
examination
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201611229381.6A
Other languages
Chinese (zh)
Inventor
丁新朗
林亚男
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Guangdong Genius Technology Co Ltd
Original Assignee
Guangdong Genius Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Guangdong Genius Technology Co Ltd filed Critical Guangdong Genius Technology Co Ltd
Priority to CN201611229381.6A priority Critical patent/CN106611058A/en
Publication of CN106611058A publication Critical patent/CN106611058A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/50Information retrieval; Database structures therefor; File system structures therefor of still image data
    • G06F16/58Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • G06F16/583Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
    • G06F16/5846Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content using extracted text

Abstract

The invention provides a method and a device for searching test questions. The method comprises the following steps: obtaining original images of target test questions, and carrying out image identification on the original images of the target test questions; carrying out full-text search in a question bank based on the result of the image identification, and obtaining test question search results; when the quantity of the obtained test question search results is two or more than two, calculating first scores corresponding to each obtained test question search result respectively according to a score mechanism of the full-text search; calculating second cores corresponding to each obtained test question search result respectively according to a similarity algorithm; carrying out weighting calculation on the first scores and the second scores according to a preset weighting linear scheme, determining a final score, and ranking and outputting the test question search results from top to bottom according to the final score. Through the scheme, the accuracy rate of test question search is improved.

Description

A kind of examination question searching method and device
Technical field
The present invention relates to communication technical field, and in particular to a kind of examination question searching method and device.
Background technology
At present, education sector is also integrating with the Internet, occurs in that many online education products, takes pictures including possessing The function such as answer questions searches topic class product.When searching topic class product and being intended to User and encounter a difficulty in operation, can obtain and include The image of exercise question simultaneously carries out image recognition to the image, and the result based on image recognition searches for the topic that user needs in backstage exam pool Mesh and answer are parsed.
However, the ambient light that can be taken due to existing image recognition technology and light are affected, use is frequently resulted in When family is using topic class product is searched, it is impossible to correctly search for out the examination question required for user and answer parsing, can not meet user's Topic demand is searched, the Consumer's Experience of such product is affected.
The content of the invention
The present invention provides a kind of examination question searching method and device, it is intended to improve the accuracy rate of examination question search.
The first aspect of the embodiment of the present invention, there is provided a kind of examination question searching method, the examination question searching method includes:
The original image of target examination question is obtained, and the original image to the target examination question carries out image recognition;
Result based on image recognition carries out full-text search in exam pool, obtains examination question Search Results;
When the quantity of the examination question Search Results for obtaining is two or more, according to the scoring of full-text search, calculating is obtained Corresponding first scoring of each examination question Search Results difference for taking;
According to similarity algorithm, corresponding second scoring of each examination question Search Results difference for obtaining is calculated;
According to default weighted linear scheme, the described first scoring and the second scoring are weighted, it is determined that most final review Point, and according to the final scoring, from high to low the examination question Search Results are ranked up and are exported.
The second aspect of the embodiment of the present invention, there is provided a kind of examination question searcher, the examination question searcher includes:
Target examination question acquiring unit, for obtaining the original image of target examination question, and to the original graph of the target examination question As carrying out image recognition;
Preliminary search unit, the result for obtaining image recognition based on the target examination question acquiring unit is entered in exam pool Row full-text search, obtains examination question Search Results;
First score calculation unit, the quantity of the examination question Search Results for getting when the preliminary search unit is two When more than individual, according to the scoring of full-text search, corresponding first scoring of each examination question Search Results difference for obtaining is calculated;
Second score calculation unit, for according to similarity algorithm, calculate that the preliminary search unit gets each Corresponding second scoring of examination question Search Results difference;
Search result determination unit, for according to default weighted linear scheme, obtaining to the first score calculation unit The first scoring and the second scoring for obtaining of the second score calculation unit be weighted, it is determined that final scoring, and root According to the final scoring, from high to low the examination question Search Results are ranked up and are exported.
Therefore, in the present invention program, the original image of target examination question is obtained first, and to the target examination question Original image carries out image recognition, and be then based on the result of image recognition carries out full-text search in exam pool, obtains examination question search As a result, when examination question Search Results include two or more result, according to the scoring of full-text search, each examination question search is calculated As a result distinguish corresponding first scoring, and according to similarity algorithm, calculate each examination question Search Results and corresponding second comment respectively Point, finally according to default weighted linear scheme, the described first scoring and the second scoring are weighted, it is determined that most final review Point, and according to the final scoring, from high to low the examination question Search Results are ranked up and are exported.So that examination question search As a result can further be screened, and finally filtered out more accurately, the higher examination question of matching degree and accordingly parsing.Relative to In prior art, the examination question result for finally searching is had influence on due to the inaccuracy of image recognition, lead to not search out The examination question of matching, the present invention program improves accuracy rate when examination question is searched for, and better met user searches topic demand, is lifted Consumer's Experience.
Description of the drawings
In order to be illustrated more clearly that the embodiment of the present invention or technical scheme of the prior art, below will be to embodiment or existing The accompanying drawing to be used needed for having technology description is briefly described, it should be apparent that, drawings in the following description are only this Some embodiments of invention, for those of ordinary skill in the art, without having to pay creative labor, may be used also To obtain other accompanying drawings according to these accompanying drawings.
Fig. 1 is the flowchart of the examination question searching method that the embodiment of the present invention one is provided;
Fig. 2 implements flow chart for examination question searching method step S102 of the offer of the embodiment of the present invention one;
Fig. 3 is the structured flowchart of the examination question searcher that the embodiment of the present invention two is provided.
Specific embodiment
To enable goal of the invention, feature, the advantage of the present invention more obvious and understandable, below in conjunction with the present invention Accompanying drawing in embodiment, is clearly and completely described to the technical scheme in the embodiment of the present invention, it is clear that described reality It is only a part of embodiment of the invention to apply example, and not all embodiments.Based on the embodiment in the present invention, the common skill in this area The every other embodiment that art personnel are obtained under the premise of creative work is not made, belongs to the model of present invention protection Enclose.
In embodiments of the present invention, the original image of target examination question is obtained first, and to the original graph of the target examination question As carrying out image recognition, be then based on the result of image recognition carries out full-text search in exam pool, obtains examination question Search Results, when When examination question Search Results include two or more result, according to the scoring of full-text search, each examination question Search Results point are calculated Not corresponding first scoring, and according to similarity algorithm, corresponding second scoring of each examination question Search Results difference is calculated, finally According to default weighted linear scheme, the described first scoring and the second scoring are weighted, it is determined that final scoring, and according to The examination question Search Results are ranked up and are exported by the final scoring from high to low.The result that examination question is searched for Further screened, and finally filtered out more accurately, the higher examination question of matching degree and accordingly parsing.
It is described in detail below in conjunction with realization of the specific embodiment to the present invention:
Embodiment one
Fig. 1 shows that the examination question searching method that the embodiment of the present invention one is provided realizes flow process, and details are as follows:
In step S101, the original image of target examination question is obtained, and the original image to above-mentioned target examination question carries out figure As identification.
In embodiments of the present invention, the original image comprising target examination question can be obtained by way of photographic head shoots, For example can be to start after camera function, target examination question is taken pictures by photographic head.Or, in step S101, also may be used Actively or passively to obtain the original image comprising target examination question from miscellaneous equipment;Or, in step S101, it is also possible to from The original image comprising target examination question is obtained in image library, is not construed as limiting herein.After the original image for getting target examination question, Text identification is carried out to the original image of above-mentioned target examination question, is realized in several ways with this, it is convenient rapidly to get bag Original image containing target examination question.Specifically, in step S101, optical character recognition (OCR, Optical can be adopted Character Recognition) technology carries out text identification to the original image comprising target examination question for getting.Certainly, Step S104 can also carry out text knowledge using other text recognition techniques to the original image comprising target examination question for getting Not, it is not construed as limiting herein.
In step s 102, the result based on image recognition carries out full-text search in exam pool, obtains examination question Search Results.
In embodiments of the present invention, examination question searcher will receive the original image that includes target examination question and right Above-mentioned original image is carried out after image recognition, based on the result of image recognition, full-text search is carried out in exam pool, is got and mesh The related Search Results of mark examination question.Wherein, exam pool includes local exam pool, also including the exam pool on the Internet.Local exam pool can Think exam pool built-in in examination question searcher, or user via the topic in the Internet download to examination question searcher Storehouse, facilitates user that required exam pool can be downloaded under networked environment with this, and the topic downloaded can be accessed under offline environment Storehouse, does not limit herein.Full-text search all enters line retrieval due to the result that can be obtained image recognition in step S101, thus The degree of association and matching degree of the Search Results for being obtained all can be higher.
Specifically, when full-text search is carried out in exam pool based on the result of image recognition, it is possible to use Lucene frameworks Full-text search is carried out to the result of image recognition.The development language that Lucene is used is Java, is a full-text search increased income Engine tool bag.It is not a complete full-text search engine, but the framework of a full-text search engine, there is provided it is complete Query engine and index engine and part text analyzing engine.Lucene provides instrument easy to use for developer Bag, realizes easily complete full-text search engine being set up in examination question searcher with this, thus user can be based on this Realize the function of full-text search.Certainly, in step S102, can also be using other retrieval frameworks or search program to image recognition Result carry out full-text search, such as Galago, Xapian or Zebra etc., here is not limited.
In step s 103, when the quantity of the examination question Search Results for obtaining is two or more, commenting according to full-text search Extension set system, calculates corresponding first scoring of each entity search result difference for obtaining.
In embodiments of the present invention, the quantity of the examination question Search Results that step S102 is finally obtained cannot determine.When When the target examination question of user's search is more novel, step S102 it is possible that the examination question Search Results for retrieving seldom, or even Without situation.Examination question Search Results quantity for obtaining is different, there are following several application scenarios:
In a kind of application scenarios, step S102 does not get any examination question search knot matched with target examination question Really.Now, step S103 can directly return the information without Search Results, and with this user is informed, it is impossible to search in existing exam pool Rope is to the exercise question matched with target examination question.
In another kind of application scenarios, step S102 only gets an examination question Search Results.Now, step S103 is direct The said one examination question Search Results for getting are returned to into user, is consulted for user.
Alternatively, in above two application scenarios, can be with step display S102 on screen, examination question searcher Specifically searched in which exam pool, and other exam pools for recommending to attempt scanning for for user, user can be voluntarily Select download exam pool to be searched again for or to be networked search again for online, more examination question Search Results are obtained with this.
In the third application scenarios, step S102 has got two or more examination question Search Results.Now, due to examination question Search Results exist multiple, thus step S103 first can carry out for the first time scoring screening to it.Global search technology is provided A kind of scoring, can score the result that search is obtained, and the scoring of Search Results is higher, represents it with target examination question Degree of association it is higher.In step s 103, each Search Results for obtaining for step S102, all can once be scored, Gained score is corresponding first scoring of each Search Results difference, and is designated as S1.Above-mentioned first scoring S1 will be by Temporary transient stores, to treat that the above-mentioned first scoring S1 of later use is further calculated.
Specifically, when being in step s 102 to carry out full-text search to the result of image recognition using Lucene frameworks, For above-mentioned the third application scenarios, step S103 can be that each examination question Search Results to obtaining carry out respectively Lucene Scoring, obtains corresponding first scoring of above-mentioned each examination question Search Results difference.When carrying out full-text search based on Lucene frameworks, Each examination question Search Results can be scored.Wherein, the Lucene corresponding to arbitrary examination question Search Results scores most Big score value is 1.0.
In step S104, according to similarity algorithm, each examination question Search Results difference corresponding second for obtaining is calculated Scoring.
In embodiments of the present invention, in step S103 it is possible that three kinds of application scenarios, and step S104 is then directed to The third application scenarios, i.e. when the examination question number of searches that step S102 is obtained is two or more, the step of just performing.In step In rapid S104, it will according to similarity algorithm, for each Search Results that step S102 is obtained, make similar to target examination question Degree contrast, is once scored again, and gained score is corresponding second scoring of each Search Results difference, is designated as S2.Above-mentioned second scoring S2 will be stored by temporary transient, to treat that the above-mentioned second scoring S2 of later use is carried out further Meter.
Specifically, above-mentioned similarity algorithm can be longest common subsequence algorithm.Then now, step S104 concrete manifestation For according to longest common subsequence algorithm, each examination question Search Results to obtaining score respectively, obtain above-mentioned each examination Corresponding second scoring of topic Search Results difference.Longest common subsequence, english abbreviation is LCS (longest Common Subsequence), it can describe the similarity between two sections of texts, thus in step S104, use most long public sub- sequence Row algorithm, effectively can calculate the similarity between each examination question Search Results and target examination question, with this to examination question Search Results are further analyzed.Alternatively, examination question searcher can according to second scoring S2 score value, from high to low Two minor sorts are carried out to examination question Search Results.
In step S105, according to default weighted linear scheme, meter is weighted to the above-mentioned first scoring and the second scoring Calculate, it is determined that final scoring, and according to above-mentioned final scoring, examination question Search Results more than above-mentioned two are arranged from high to low Sequence is simultaneously exported.
In embodiments of the present invention, when above-mentioned the third application scenarios being in, i.e., the examination question that step S102 is obtained When Search Results quantity is two or more, the examination question searcher in step S105 will be respectively directed to each examination question search knot Really, corresponding first scoring S1 and the second scoring S2 is obtained, and according to default weighted linear scheme, to each examination question search knot The first scoring S1 and the second scoring S2 corresponding to fruit is weighted, and obtains the corresponding final scoring of examination question Search Results, S3 is designated as, and according to the score value of final scoring S3, from high to low, corresponding examination question Search Results is ranked up, and will Result output after sequence, for user's access.
Further, above-mentioned default weighted linear scheme can be:S3=n1*S1+n2*S2.Wherein, n1>0, n2>0, n1 + n2=1.Specifically, n1=0.3 can be taken, takes n2=0.7, the final scoring S3 for obtaining in this case and corresponding examination question What the sequence of Search Results more met most of user searches topic demand.
Therefore, in embodiments of the present invention, examination question Search Results can twice be scored, and this is commented twice Weighted calculation is allocated as, is finally scored, it is final because obtained from because final scoring is that the front weighting scored twice is processed Examination question Search Results accuracy will be enhanced, and can be very good to solve user in light by force or by other extraneous bad borders In the case of interference, target examination question is carried out taking pictures when searching topic, examination question Search Results can be affected, cause result it is inaccurate this One problem.Accuracy rate when user searches topic is improve, the operating experience of user is improved, having better met the topic of searching of user needs Ask.
Fig. 2 shows the flow chart that implements of examination question searching method step S102 provided in an embodiment of the present invention, describes in detail It is as follows:
In step s 201, full-text search is carried out in exam pool according to the result of image recognition, obtains all related examinations Topic Search Results.
In step S202, when the quantity of above-mentioned all related examination question Search Results is two or more, according to default Data model various dimensions statistics is carried out to above-mentioned all related examination question Search Results, it is determined that including examination question Search Results quantity Most examination question classifications.
In embodiments of the present invention, the quantity of all related examination question Search Results for obtaining in step S201 is uncertain 's.Only when the quantity of all related examination question Search Results obtained in step S201 is two or more, the just meeting of step S202 Various dimensions statistics is carried out to above-mentioned all related examination question Search Results according to default data filtering model, can be screened with this The examination question Search Results gone out under optimal classification.Specifically, above-mentioned dimension including but not limited to it is following more than one:Time, region, Subject, grade.
Wherein, the time is the proposition time corresponding to examination question.For User, especially in junior-senior high school Raw user group, the exercise question that they are faced is often the middle college entrance examination very topic or middle college entrance examination simulation topic in former years, and these exercise questions exist Year information can be usually remained with exam pool.Now in step S202, examination question searcher can be counted and target examination question In related examination question Search Results, the situation of each examination question result Annual distribution, and filter out most comprising examination question Search Results The affiliated time, obtain all examination question Search Results under the affiliated time.
Corresponding, region is the proposition region corresponding to examination question.For junior-high school student user group, region A key factor for being them when examination question is searched for, because junior-senior high school examination question usually has very strong region.Now in step In S202, when examination question is searched for, examination question searcher can count each examination question to user using region as a dimension of statistics As a result the situation of Regional Distribution, and filter out comprising the most affiliated region of examination question Search Results, and obtain under the affiliated region All examination question Search Results.
Meanwhile, subject is also an important dimension.Each examination question has the subject subject belonging to it, but due to actual life In work, knowledge is to intersect, therefore some knowledge points may all occur in multiple subjects.By taking natural sciences as an example, physics, change The subject such as, biology, mathematics, all can intersect each other, particularly mathematics as natural sciences class basic subject, in thing In reason, chemistry, biology, can all there is involved.For example, when user is searched using the application topic in one mathematics as target examination question Suo Shi, if this includes physical background, the displacement of entitled certain object of calculating, speed, distance or other things using topic During reason amount, if making full-text search in exam pool to above-mentioned target examination question, having greatly can not only may retrieve under mathematic subject Examination question, and can retrieve and include physics or other section's purpose exercise questions.Now in step S202, it is it to arrange subject In a statistical dimension, subject statistics has been carried out in all examination question results to searching, it is determined that the examination affiliated now of different section After the quantity of topic Search Results, can effectively filter out and the affiliated subject identical examination question Search Results of target examination question.
And grade is directed to, and in actual life, to some knowledge point, often having learnt the grade of this knowledge point, Number of setting a question is concentrated the most, and other grades will not be investigated emphatically to the knowledge point.Thus examination question searcher is also provided with For the statistics of grade of setting a question, the examination question search knot in same grade with target examination question can be accurately filtered out Really, it is to avoid examination question Search Results super guiding principles or other do not meet the desired situation of user and occur.
Alternatively, user can voluntarily select to be counted using which kind of or which kind dimension.
Alternatively, after user can count various dimensions have been carried out, in the different examination question classifications included under different dimensions, The examination question classification that user is interested or needs voluntarily is selected, and is selected automatically without examination question searcher and is tied comprising examination question search The most examination question classification of fruit quantity.
In step S203, under the above-mentioned examination question classification most comprising examination question Search Results quantity, retain degree of association high Examination question Search Results.
In embodiments of the present invention, because step S202 can be got under certain dimension, examination question Search Results quantity is most Examination question Search Results number under many a certain examination question classifications, wherein the examination question classification is uncertain, and wherein we only need to protect The examination question Search Results high with target examination question degree of association are stayed, for related to target examination question, but the not high examination question of degree of association For Search Results, the examination question Search Results can be what is filtered.
Specifically, when the examination question Search Results quantity under above-mentioned examination question classification more than it is N number of when, then under above-mentioned examination question classification Examination question Search Results carry out relevancy ranking, retain the high top n examination question Search Results of degree of association.Wherein above-mentioned examination question classification For the examination question classification most comprising examination question Search Results quantity that step S202 determines.Degree of association is being carried out to above-mentioned examination question classification During sequence, relatedness computation can be carried out with including but not limited to following any one algorithm:TF-IDF(Term Frequency- Inverse Document Frequency) or Okapi BM25 (Best Match 25).According to above-mentioned relatedness computation Examination question Search Results are ranked up from high to low by score, and are screened based on above-mentioned relevance score, only retain sequence heel row The higher N number of examination question Search Results of the forward relevance score of name, it is not intended that examination question classification, rejects step S201 and obtain other All examination question Search Results.
If the examination question Search Results quantity under above-mentioned examination question classification is not more than N number of, retain the institute under above-mentioned examination question classification There are examination question Search Results.Now because the examination question Search Results quantity under above-mentioned examination question classification is not unnecessary N number of, then no matter examination question Search Results and the degree of association height of target examination question, all make reservation process, without the need for making to be based on degree of association under above-mentioned examination question classification Further screening.But, the examination question Search Results under other examination question classifications will be all disallowable.
Wherein, N mentioned above be it is default be more than or equal to 2 natural number.In step S203, N can be by trying It is that topic searcher initially sets, or user's sets itself.Alternatively, examination question searcher will initially set N For 100.
Therefore, in the present embodiment, it is possible to before scoring examination question result, carry out to examination question result first Once rough preliminary screening, being avoided with this all carries out the first follow-up scoring to all of examination question Search Results, and second comments Divide and final scoring.Greatly save occupancy of the examination question searching method to system resource.
It should be noted that the examination question searcher referred in the embodiment of the present invention specifically can in the way of software (example The such as form of App) and/or the mode of hardware be integrated in mobile terminal (such as smart mobile phone, panel computer, learning machine terminal) In.
One of ordinary skill in the art will appreciate that realizing that all or part of step in above-described embodiment method can be Related hardware is instructed to complete by program, corresponding program can be stored in a computer read/write memory medium, Above-mentioned storage medium, such as ROM/RAM, disk or CD.
Embodiment two
Fig. 3 shows the concrete structure block diagram of the examination question searcher that the embodiment of the present invention two is provided, for convenience of description, Illustrate only the part related to the embodiment of the present invention.The examination question searcher 3 includes:
Target examination question acquiring unit 31, for obtaining the original image of target examination question, and to the original of above-mentioned target examination question Image carries out image recognition;
Preliminary search unit 32, for obtaining the result of image recognition in exam pool based on above-mentioned target examination question acquiring unit 31 In carry out full-text search, obtain examination question Search Results;
First score calculation unit 33, the quantity of the examination question Search Results for getting when above-mentioned preliminary search unit 32 For two or more when, according to the scoring of full-text search, calculate each examination question Search Results difference corresponding first for obtaining Scoring;
Second score calculation unit 34, for according to similarity algorithm, calculating what above-mentioned preliminary search unit 32 got Corresponding second scoring of each examination question Search Results difference;
Search result determination unit 35, for according to default weighted linear scheme, to above-mentioned first score calculation unit 33 The second scoring that the first scoring for obtaining and above-mentioned second score calculation unit 34 are obtained is weighted, it is determined that most final review Point, and according to above-mentioned final scoring, from high to low above-mentioned examination question Search Results are ranked up and are exported.
Specifically, above-mentioned preliminary search unit 32, for being carried out to the result of image recognition in full using Lucene frameworks Retrieval;
Specifically, above-mentioned first scoring determining unit 33 is used for, each examination got to above-mentioned preliminary search unit 32 Topic Search Results carry out respectively Lucene scorings, obtain corresponding first scoring of above-mentioned each examination question Search Results difference.
Specifically, above-mentioned second scoring determining unit 34 is used for, when the similarity algorithm for using is longest common subsequence During algorithm, according to longest common subsequence algorithm, each examination question Search Results got to above-mentioned preliminary search unit 32 point Do not scored, obtained corresponding second scoring of above-mentioned each examination question Search Results difference.
Alternatively, above-mentioned preliminary search unit 32 also includes:
Search Results obtain subelement, and the image recognition result for being obtained according to above-mentioned target examination question acquiring unit 31 exists Full-text search is carried out in exam pool, all related examination question Search Results are obtained;
Various dimensions count subelement, and the examination questions for obtaining all correlations that subelement gets when mentioned above searching results are searched When the quantity of hitch fruit is two or more, the institute that subelement gets is obtained to mentioned above searching results according to default data model The examination question Search Results for having correlation carry out various dimensions statistics, it is determined that comprising the most examination question classification of examination question Search Results quantity;
As a result subelement is screened, for determining in above-mentioned various dimensions statistics subelement comprising examination question Search Results quantity most Under many examination question classifications, retain the high examination question Search Results of degree of association.
Specifically, the above results screening subelement is used for, if under the examination question classification of above-mentioned various dimensions statistics subelement determination Examination question Search Results quantity more than N number of, then relevancy ranking is carried out to the examination question Search Results under above-mentioned examination question classification, retain The high top n examination question Search Results of degree of association;If the examination question search under the examination question classification that above-mentioned various dimensions statistics subelement determines Fruiting quantities are not more than N number of, then retain all examination question Search Results under above-mentioned examination question classification;Wherein, N for it is default more than or Natural number equal to 2.
It should be noted that the examination question searcher in the embodiment of the present invention specifically can (such as App in the way of software Form) and/or the mode of hardware be integrated in mobile terminal (such as the terminal such as smart mobile phone, panel computer, learning machine).
It should be understood that the examination question searcher in the embodiment of the present invention can be used for realizing the whole in said method embodiment Technical scheme, the function of its each functional module can be implemented according to the method in said method embodiment, its concrete reality Existing process can refer to the associated description in above-described embodiment, and here is omitted.
Therefore, in embodiments of the present invention, examination question searcher can be first before scoring examination question result First rough preliminary screening is carried out once to examination question result, avoided with this all of examination question Search Results are all carried out it is follow-up First scoring, the second scoring and final scoring.Greatly save occupancy of the examination question searcher to system resource.
It should be noted that in several embodiments provided herein, it should be understood that disclosed device and side Method, can realize by another way.For example, device embodiment described above is only schematic, for example, above-mentioned The division of unit, only a kind of division of logic function can have other dividing mode, such as multiple units when actually realizing Or component can with reference to or be desirably integrated into another system, or some features can be ignored, or not perform.It is another, institute The coupling each other for showing or discussing or direct-coupling or communication connection can be by some interfaces, device or unit INDIRECT COUPLING or communication connection, can be electrical, mechanical or other forms.
For aforesaid each method embodiment, for easy description, therefore it is all expressed as a series of combination of actions, but It is that those skilled in the art should know, the present invention is not limited by described sequence of movement, because according to the present invention, certain A little steps can adopt other orders or while carry out.Secondly, those skilled in the art also should know, be retouched in description The embodiment stated belongs to preferred embodiment, and involved action and module might not all be necessary to the present invention.
In the above-described embodiments, the description to each embodiment all emphasizes particularly on different fields, without the portion described in detail in certain embodiment Point, may refer to the associated description of other embodiments.
Be more than to a kind of preferred embodiment provided by the present invention, for one of ordinary skill in the art, according to According to the thought of the embodiment of the present invention, will change in specific embodiments and applications, to sum up, in this specification Appearance should not be construed as limiting the invention.

Claims (10)

1. a kind of examination question searching method, it is characterised in that include:
The original image of target examination question is obtained, and the original image to the target examination question carries out image recognition;
Result based on image recognition carries out full-text search in exam pool, obtains examination question Search Results;
When the quantity of the examination question Search Results for obtaining is two or more, according to the scoring of full-text search, calculate what is obtained Corresponding first scoring of each examination question Search Results difference;
According to similarity algorithm, corresponding second scoring of each examination question Search Results difference for obtaining is calculated;
According to default weighted linear scheme, the described first scoring and the second scoring are weighted, it is determined that final scoring, and According to the final scoring, from high to low examination question Search Results are ranked up and are exported.
2. examination question searching method as claimed in claim 1, it is characterised in that the result based on image recognition is in exam pool Full-text search is carried out, examination question Search Results are obtained, including:
Full-text search is carried out in exam pool according to the result of image recognition, all related examination question Search Results are obtained;
When the quantity of all related examination question Search Results is two or more, according to default data model to the institute The examination question Search Results for having correlation carry out various dimensions statistics, it is determined that comprising the most examination question classification of examination question Search Results quantity;
Under the examination question classification most comprising examination question Search Results quantity, retain the high examination question Search Results of degree of association.
3. examination question searching method as claimed in claim 2, it is characterised in that it is described described comprising examination question Search Results quantity Under most examination question classifications, retain the high examination question Search Results of degree of association, including:
If the examination question Search Results quantity under the examination question classification is more than N number of, to the examination question search knot under the examination question classification Fruit carries out relevancy ranking, retains the high top n examination question Search Results of degree of association;
If the examination question Search Results quantity under the examination question classification is not more than N number of, retain all examinations under the examination question classification Topic Search Results;
N be it is default be more than or equal to 2 natural number.
4. the examination question searching method as described in any one of claim 1-3, it is characterised in that the result based on image recognition Full-text search is carried out in exam pool, including:
Full-text search is carried out to the result of image recognition using Lucene frameworks;
It is described when obtain examination question Search Results quantity be two or more when, according to the scoring of full-text search, calculating is obtained Corresponding first scoring of each examination question Search Results difference for taking, including:
Each examination question Search Results to obtaining carry out respectively Lucene scorings, obtain described each examination question Search Results right respectively The first scoring answered.
5. the examination question searching method as described in any one of claim 1-3, it is characterised in that the similarity algorithm is most long public affairs Common subsequence algorithm;It is described that corresponding second scoring of each examination question Search Results difference for obtaining is calculated based on similarity algorithm, Including:
According to longest common subsequence algorithm, each examination question Search Results to obtaining score respectively, obtain it is described each Corresponding second scoring of examination question Search Results difference.
6. a kind of examination question searcher, it is characterised in that the examination question searcher includes:
Target examination question acquiring unit, for obtaining the original image of target examination question, and the original image to the target examination question enters Row image recognition;
Preliminary search unit, the result for being obtained image recognition based on the target examination question acquiring unit is carried out entirely in exam pool Text retrieval, obtains examination question Search Results;
First score calculation unit, the quantity of the examination question Search Results for getting when the preliminary search unit be two with When upper, according to the scoring of full-text search, corresponding first scoring of each examination question Search Results difference for obtaining was calculated;
Second score calculation unit, for according to similarity algorithm, calculating each examination question that the preliminary search unit gets Corresponding second scoring of Search Results difference;
Search result determination unit, for according to default weighted linear scheme, the first score calculation unit is obtained the The second scoring that one scoring and the second score calculation unit are obtained is weighted, it is determined that final scoring, and according to institute Final scoring is stated, from high to low the examination question Search Results is ranked up and is exported.
7. a kind of examination question searcher as claimed in claim 6, it is characterised in that the preliminary search unit, including:
Search Results obtain subelement, for the image recognition result that obtained according to the target examination question acquiring unit in exam pool Full-text search is carried out, all related examination question Search Results are obtained;
Various dimensions count subelement, for obtaining all related examination question search knot that subelement gets when the Search Results When the quantity of fruit is two or more, all phases that subelement gets are obtained to the Search Results according to default data model The examination question Search Results of pass carry out various dimensions statistics, it is determined that comprising the most examination question classification of examination question Search Results quantity;
As a result subelement is screened, for counting the most comprising examination question Search Results quantity of subelement determination in the various dimensions Under examination question classification, retain the high examination question Search Results of degree of association.
8. a kind of examination question searcher as claimed in claim 7, it is characterised in that the result screens subelement, concrete to use In when the examination question Search Results quantity under the various dimensions count the examination question classification that subelement determines is more than N number of, to the examination Examination question Search Results under topic classification carry out relevancy ranking, retain the high top n examination question Search Results of degree of association;
When the examination question Search Results quantity under the various dimensions count the examination question classification that subelement determines is not more than N number of, retain All examination question Search Results under the examination question classification;
N be it is default be more than or equal to 2 natural number.
9. the examination question searcher as described in any one of claim 6-8, it is characterised in that the preliminary search unit is specifically used In carrying out full-text search to the result of image recognition using Lucene frameworks;
The first score calculation unit is specifically for each examination question Search Results got to the preliminary search unit point Lucene scorings are not carried out, corresponding first scoring of described each examination question Search Results difference is obtained.
10. the examination question searcher as described in any one of claim 6-8, it is characterised in that the second score calculation unit Specifically for when the similarity algorithm for using is longest common subsequence algorithm, according to longest common subsequence algorithm, to institute State each examination question Search Results that preliminary search unit gets to be scored respectively, obtain described each examination question Search Results point Not corresponding second scoring.
CN201611229381.6A 2016-12-27 2016-12-27 Method and device for searching test questions Pending CN106611058A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201611229381.6A CN106611058A (en) 2016-12-27 2016-12-27 Method and device for searching test questions

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201611229381.6A CN106611058A (en) 2016-12-27 2016-12-27 Method and device for searching test questions

Publications (1)

Publication Number Publication Date
CN106611058A true CN106611058A (en) 2017-05-03

Family

ID=58636226

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201611229381.6A Pending CN106611058A (en) 2016-12-27 2016-12-27 Method and device for searching test questions

Country Status (1)

Country Link
CN (1) CN106611058A (en)

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107909520A (en) * 2017-11-02 2018-04-13 浙江工商大学 The method and apparatus that make the test based on examination question correlation
CN108416264A (en) * 2018-01-29 2018-08-17 山东汇贸电子口岸有限公司 A kind of searching method and search module of supporting OCR to input
CN109325051A (en) * 2018-08-14 2019-02-12 广东小天才科技有限公司 It is a kind of that topic result output method and facility for study are searched based on solution model
CN111241276A (en) * 2020-01-06 2020-06-05 广东小天才科技有限公司 Topic searching method, device, equipment and storage medium
CN111563498A (en) * 2020-04-30 2020-08-21 广东小天才科技有限公司 Method and device for collecting questions, electronic equipment and storage medium
CN111652203A (en) * 2020-06-01 2020-09-11 北京字节跳动网络技术有限公司 Resource pushing method and device

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20140052716A1 (en) * 2012-08-14 2014-02-20 International Business Machines Corporation Automatic Determination of Question in Text and Determination of Candidate Responses Using Data Mining
CN103955525A (en) * 2014-05-09 2014-07-30 北京奇虎科技有限公司 Method and client for searching answer to test question
CN105373594A (en) * 2015-10-23 2016-03-02 广东小天才科技有限公司 Method and apparatus for screening repeated test questions from question bank
CN105426390A (en) * 2015-10-23 2016-03-23 广东小天才科技有限公司 Image recognition-based question search method and system

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20140052716A1 (en) * 2012-08-14 2014-02-20 International Business Machines Corporation Automatic Determination of Question in Text and Determination of Candidate Responses Using Data Mining
CN103955525A (en) * 2014-05-09 2014-07-30 北京奇虎科技有限公司 Method and client for searching answer to test question
CN105373594A (en) * 2015-10-23 2016-03-02 广东小天才科技有限公司 Method and apparatus for screening repeated test questions from question bank
CN105426390A (en) * 2015-10-23 2016-03-23 广东小天才科技有限公司 Image recognition-based question search method and system

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107909520A (en) * 2017-11-02 2018-04-13 浙江工商大学 The method and apparatus that make the test based on examination question correlation
CN108416264A (en) * 2018-01-29 2018-08-17 山东汇贸电子口岸有限公司 A kind of searching method and search module of supporting OCR to input
CN109325051A (en) * 2018-08-14 2019-02-12 广东小天才科技有限公司 It is a kind of that topic result output method and facility for study are searched based on solution model
CN111241276A (en) * 2020-01-06 2020-06-05 广东小天才科技有限公司 Topic searching method, device, equipment and storage medium
CN111563498A (en) * 2020-04-30 2020-08-21 广东小天才科技有限公司 Method and device for collecting questions, electronic equipment and storage medium
CN111563498B (en) * 2020-04-30 2024-01-19 广东小天才科技有限公司 Method and device for collecting questions, electronic equipment and storage medium
CN111652203A (en) * 2020-06-01 2020-09-11 北京字节跳动网络技术有限公司 Resource pushing method and device

Similar Documents

Publication Publication Date Title
CN112632385B (en) Course recommendation method, course recommendation device, computer equipment and medium
CN106611058A (en) Method and device for searching test questions
EP2192500B1 (en) System and method for providing robust topic identification in social indexes
CN103514299B (en) Information search method and device
Wolf et al. Clarifying vulnerability definitions and assessments using formalisation
CA3153598A1 (en) Method of and device for predicting video playback integrity
DE102016013372A1 (en) Image labeling with weak monitoring
US11774264B2 (en) Method and system for providing information to a user relating to a point-of-interest
DE112015002286T5 (en) VISUAL INTERACTIVE SEARCH
CN105512331A (en) Video recommending method and device
CN110390052B (en) Search recommendation method, training method, device and equipment of CTR (China train redundancy report) estimation model
CN108875769A (en) Data mask method, device and system and storage medium
CN113722478B (en) Multi-dimensional feature fusion similar event calculation method and system and electronic equipment
WO2016114790A1 (en) Reading difficulty level based resource recommendation
Dang et al. MOOC-KG: A MOOC knowledge graph for cross-platform online learning resources
CN113641794A (en) Resume text evaluation method and device and server
CN115952277A (en) Knowledge relationship based retrieval enhancement method, model, device and storage medium
CN112396091B (en) Social media image popularity prediction method, system, storage medium and application
CN106776910A (en) The display methods and device of a kind of Search Results
CN103279549A (en) Method and device for acquiring target data of target objects
CN107679121B (en) Mapping method and device of classification system, storage medium and computing equipment
CN112069423A (en) Information recommendation method and device, storage medium and computer equipment
CN110555196A (en) method, device, equipment and storage medium for automatically generating article
CN112527999B (en) Extraction type intelligent question-answering method and system for introducing knowledge in agricultural field
CN112163165B (en) Information recommendation method, device, equipment and computer readable storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20170503