CN106815372A - A kind of examination question De-weight method and device, user equipment based on natural sciences test item bank - Google Patents

A kind of examination question De-weight method and device, user equipment based on natural sciences test item bank Download PDF

Info

Publication number
CN106815372A
CN106815372A CN201710065948.9A CN201710065948A CN106815372A CN 106815372 A CN106815372 A CN 106815372A CN 201710065948 A CN201710065948 A CN 201710065948A CN 106815372 A CN106815372 A CN 106815372A
Authority
CN
China
Prior art keywords
examination question
character
question
examination
target
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201710065948.9A
Other languages
Chinese (zh)
Inventor
涂继宏
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Guangdong Genius Technology Co Ltd
Original Assignee
Guangdong Genius Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Guangdong Genius Technology Co Ltd filed Critical Guangdong Genius Technology Co Ltd
Priority to CN201710065948.9A priority Critical patent/CN106815372A/en
Publication of CN106815372A publication Critical patent/CN106815372A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/3331Query processing

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The embodiment of the invention discloses a kind of examination question De-weight method and device, user equipment based on natural sciences test item bank, the method includes:Reception carries the examination question duplicate removal instruction of contents of test question;The examination question duplicate removal instruction is responded, from all examination questions that natural sciences test item bank includes, search is more than multiple target examination questions of default similarity threshold with the similarity of the contents of test question;Judge with the presence or absence of the first character matched with preset keyword symbol in the examination question key character of the contents of test question, if in the presence of for each the described target examination question for searching, the second character in the target examination question being extracted successively;Compare first character and whether second character is identical, if identical, examination question deduplication operation is performed to the multiple target examination question.The embodiment of the present invention can improve the discrimination of repetition examination question, meanwhile, the precision that removal is inscribed again can be improved.

Description

A kind of examination question De-weight method and device, user equipment based on natural sciences test item bank
Technical field
The present invention relates to technical field of intelligent equipment, more particularly to a kind of examination question De-weight method based on natural sciences test item bank and Device, user equipment.
Background technology
At present, with the development of all kinds of works about test over the years, the examination question of test item bank the inside accumulation is also more and more, progressively shape Into the test item bank of magnanimity.For for the magnanimity test item bank that topic is searched for student, on the one hand, need the examination question shielding that will be repeated Fall, it is impossible to allow student to occur many identical examination questions when searching topic, this is accomplished by carrying out duplicate removal, the opposing party to the test item bank of magnanimity Face, it is to repeat examination question that student also is intended to by duplicate removal.Current De-weight method is that the similarity of content is more than into default similarity The examination question of threshold value is masked.
However, be directed to for natural sciences examination question, even if a numeral is different, or an oeprator difference, it is also not equivalent There is very big defect in Yu Yuanti, current De-weight method, easily mask more non-duplicate examination question so that repeat examination question Discrimination is not high.
The content of the invention
The embodiment of the invention discloses a kind of examination question De-weight method and device, user equipment based on natural sciences test item bank, can The discrimination of examination question is repeated to improve, meanwhile, the precision that removal is inscribed again can be improved.
Embodiment of the present invention first aspect discloses a kind of examination question De-weight method based on natural sciences test item bank, including:
Reception carries the examination question duplicate removal instruction of contents of test question;
The examination question duplicate removal instruction is responded, from all examination questions that natural sciences test item bank includes, is searched for and the contents of test question Similarity be more than multiple target examination questions of default similarity threshold;
Judge to whether there is the first character matched with preset keyword symbol in the examination question key character of the contents of test question, If in the presence of for each the described target examination question for searching, the second character in the target examination question being extracted successively;
Compare first character and whether second character is identical, if identical, the multiple target examination question is held Row examination question deduplication operation.
As a kind of optional implementation method, in embodiment of the present invention first aspect, first character and described Two characters include multiple characters, and when first character is identical with second character, methods described also includes:
The first order that multiple characters that determining first character includes occur in the contents of test question, and determine The second order that multiple characters that second character includes occur in the target examination question;
Judge whether first order is identical with the described second order, if identical, perform described to the multiple The step of target examination question performs examination question deduplication operation.
As a kind of optional implementation method, in embodiment of the present invention first aspect, judge first order with It is described second order it is identical when, methods described also includes:
Determine the residing first position in the contents of test question of each character in first character, and determine described The residing second place in the target examination question of each character in second character;
Judge whether the first position is identical with the second place, if identical, perform described to the multiple The step of target examination question performs examination question deduplication operation.
Used as a kind of optional implementation method, in embodiment of the present invention first aspect, methods described also includes:
If in the absence of the first character matched with preset keyword symbol in judging the examination question key character of the contents of test question, Examination question deduplication operation then is performed to the multiple target examination question.
It is described that the multiple target is tried in embodiment of the present invention first aspect as a kind of optional implementation method Topic performs examination question deduplication operation to be included:
Any one target examination question is selected from the multiple target examination question as reservation examination question, and is deleted except the reservation examination Remaining target examination question outside topic;Or,
The memory space shared by each described target examination question in the multiple target examination question is obtained, determines that memory space is minimum Target examination question as retain examination question, and delete except it is described reservation examination question in addition to remaining target examination question.
Embodiment of the present invention second aspect discloses a kind of examination question duplicate removal device, including:
Receiving unit, the examination question duplicate removal instruction of contents of test question is carried for receiving;
Search unit, for responding examination question duplicate removal instruction, from all examination questions that natural sciences test item bank includes, search with The similarity of the contents of test question is more than multiple target examination questions of default similarity threshold;
First judging unit, for whether there is and preset keyword in the examination question key character for judging the contents of test question Accord with the first character of matching;
Extraction unit, in the examination question key character that the contents of test question is judged when first judging unit exist with During the first character of preset keyword symbol matching, for each the described target examination question for searching, the target examination is extracted successively The second character in topic;
Whether comparing unit is identical for comparing first character and second character;
Duplicate removal unit, for when the comparing unit first character is identical with second character, to institute State multiple target examination questions and perform examination question deduplication operation.
As a kind of optional implementation method, in embodiment of the present invention second aspect:First character and described Two characters include multiple characters, and the examination question duplicate removal device also includes:
Determining unit, for when the comparing unit first character is identical with second character, it is determined that The first order that multiple characters that first character includes occur in the contents of test question, and determine second character Including multiple characters occur in the target examination question second order;
Whether the second judging unit is identical with the described second order for judging first order;
The duplicate removal unit, specifically for judging first order with the described second order when second judging unit When identical, examination question deduplication operation is performed to the multiple target examination question.
As a kind of optional implementation method, in embodiment of the present invention second aspect,
The determining unit, is additionally operable to judge first order with the second order phase when second judging unit Meanwhile, determine the residing first position in the contents of test question of each character in first character, and determine described the The residing second place in the target examination question of each character in two characters;
Second judging unit, is additionally operable to judge whether the first position is identical with the second place;
The duplicate removal unit, specifically for judging first order with the described second order when second judging unit When the identical and first position is identical with the second place, examination question deduplication operation is performed to the multiple target examination question.
Used as a kind of optional implementation method, in embodiment of the present invention second aspect, the duplicate removal unit is additionally operable to First judging unit is judged in the examination question key character of the contents of test question in the absence of the matched with preset keyword symbol During one character, examination question deduplication operation is performed to the multiple target examination question.
Used as a kind of optional implementation method, in embodiment of the present invention second aspect, the duplicate removal unit is to described more The mode that individual target examination question performs examination question deduplication operation is specially:
Any one target examination question is selected from the multiple target examination question as reservation examination question, and is deleted except the reservation examination Remaining target examination question outside topic;Or,
The memory space shared by each described target examination question in the multiple target examination question is obtained, determines that memory space is minimum Target examination question as retain examination question, and delete except it is described reservation examination question in addition to remaining target examination question.
The embodiment of the present invention third aspect discloses a kind of user equipment, including institute disclosed in embodiment of the present invention second aspect State examination question duplicate removal device.
Compared with prior art, the embodiment of the present invention possesses following beneficial effect:
In the embodiment of the present invention, user equipment can receive the examination question duplicate removal instruction for carrying contents of test question;Response examination question Duplicate removal is instructed, and from all examination questions that natural sciences test item bank includes, search is more than default similarity threshold with the similarity of contents of test question Multiple target examination questions of value;Further, user equipment may determine that whether there is in the examination question key character of contents of test question with First character of preset keyword symbol matching, if in the presence of for each the target examination question for searching, successively in extraction target examination question The second character, further, user equipment can compare the first character and whether the second character identical, right if identical Multiple target examination questions perform examination question deduplication operation.It can be seen that, implement the embodiment of the present invention, user equipment can be from similarity more than pre- If in multiple target examination questions of similarity threshold, further comparing the first character and mesh in the examination question key character of contents of test question Whether the second character in mark examination question is identical, if identical, determines that the plurality of target examination question is attached most importance to retrial topic, and user equipment can be with Examination question deduplication operation is performed to the plurality of target examination question, such that it is able to improve the discrimination of repetition examination question, meanwhile, can improve Except the precision inscribed again.
Brief description of the drawings
Technical scheme in order to illustrate more clearly the embodiments of the present invention, below by to be used needed for embodiment Accompanying drawing is briefly described, it should be apparent that, drawings in the following description are only some embodiments of the present invention, for ability For the those of ordinary skill of domain, on the premise of not paying creative work, can also obtain other attached according to these accompanying drawings Figure.
Fig. 1 is a kind of schematic flow sheet of the examination question De-weight method based on natural sciences test item bank disclosed in the embodiment of the present invention;
Fig. 2 is that the flow of another examination question De-weight method based on natural sciences test item bank disclosed in the embodiment of the present invention is illustrated Figure;
Fig. 3 is that the flow of another examination question De-weight method based on natural sciences test item bank disclosed in the embodiment of the present invention is illustrated Figure;
Fig. 4 is that the flow of another examination question De-weight method based on natural sciences test item bank disclosed in the embodiment of the present invention is illustrated Figure;
Fig. 5 is a kind of structural representation of examination question duplicate removal device disclosed in the embodiment of the present invention;
Fig. 6 is the structural representation of another examination question duplicate removal device disclosed in the embodiment of the present invention;
Fig. 7 is a kind of structural representation of user equipment disclosed in the embodiment of the present invention.
Specific embodiment
Below in conjunction with the accompanying drawing in the embodiment of the present invention, the technical scheme in the embodiment of the present invention is carried out clear, complete Site preparation is described, it is clear that described embodiment is only a part of embodiment of the invention, rather than whole embodiments.Based on this Embodiment in invention, the every other reality that those of ordinary skill in the art are obtained under the premise of creative work is not made Example is applied, the scope of protection of the invention is belonged to.
It should be noted that the term " comprising " and " having " of the embodiment of the present invention and their any deformation, it is intended that Be cover it is non-exclusive include, for example, containing process, method, system, product or the equipment of series of steps or unit not Be necessarily limited to those steps or the unit clearly listed, but may include not list clearly or for these processes, side Method, product or other intrinsic steps of equipment or unit.
The embodiment of the invention discloses a kind of examination question De-weight method and device, user equipment based on natural sciences test item bank, can The discrimination of examination question is repeated to improve.Accompanying drawing is below combined to be described in detail.
Embodiment one
Fig. 1 is referred to, Fig. 1 is a kind of stream of the examination question De-weight method based on natural sciences test item bank disclosed in the embodiment of the present invention Journey schematic diagram.As shown in figure 1, the examination question De-weight method that should be based on natural sciences test item bank may comprise steps of:
101st, user equipment receives the examination question duplicate removal instruction for carrying contents of test question.
In the embodiment of the present invention, the user equipment can be to be provided with the application program of JAVA WEB exploitations and possess networking The various electronic equipments of function, such as:Smart mobile phone, notebook computer, personal computer (Personal Computer, PC), Personal digital assistant (Personal Digital Assistant, PDA), mobile internet device (Mobile Internet Device, MID), Intelligent worn device (such as intelligent watch, Intelligent bracelet) each class of electronic devices.
Wherein, the contents of test question is the contents of test question for natural sciences examination question, and the contents of test question includes the main pass of examination question Key information, can be used for student according to the contents of test question to analyze examination question, and answer examination question.
In the embodiment of the present invention, the natural sciences examination question of the magnanimity that is stored with natural sciences test item bank can include but is not limited to mathematics Examination question, Physical Test Questions and chemical examination question etc., the natural sciences test item bank can be stored in the server, and user equipment is by connecting net Network just can be with the natural sciences examination question of natural sciences examination question library storage in access server.
Wherein, the examination question duplicate removal is instructed and performs deduplication operation for repeating examination question to multiple.
102nd, user device responsive examination question duplicate removal instruction, from all examination questions that natural sciences test item bank includes, searches for and examination question The similarity of content is more than multiple target examination questions of default similarity threshold.
In the embodiment of the present invention, a default similarity threshold can be pre-set, such as:80%.User equipment can be with The contents of test question is put into search engine (such as lucene search engines) and is scanned for, specifically, natural sciences test item bank is included All examination questions compare with the contents of test question successively, if similarity is more than default similarity threshold, it is determined that the examination question It is target examination question, and extracts multiple target examination questions that all similarities are more than default similarity threshold.
Optionally, before search, user equipment can do to contents of test question and further process, such as:Delete examination question Chinese and English space, punctuation mark in content etc., then the contents of test question after by treatment scanned in being put into search engine. Do not influence the content of natural sciences examination question due to Chinese and English space, punctuation mark, scanned for after they are deleted, sieve can be improved The accuracy of choosing.
103rd, user equipment is judged in the examination question key character of contents of test question with the presence or absence of being matched with preset keyword symbol First character, if in the presence of, step 104~105 are performed, if not existing, perform step 106.
In the embodiment of the present invention, the examination question key character of the contents of test question can include but is not limited to punctuation mark, text Symbol, numerical chracter, letter character and oeprator;Preset keyword symbol can include but is not limited to numerical chracter, word Female symbol and oeprator, furthermore it is also possible to including the text character for characterizing the implications such as numeral, letter, computing;This One character can include but is not limited to numerical chracter, letter character and oeprator, furthermore it is also possible to including for characterizing number The text character of the implications such as word, letter, computing.Wherein, punctuation mark such as " ",.;Numerical character such as 1,2,3,4 ..., word Female symbol such as a, b, c, d, A, B, C, D ..., oeprator such as+,-, ×, ÷ ....
As an example it is assumed that preset keyword symbol includes numerical chracter, letter character and oeprator, if user Equipment judges the one kind or many existed in numerical chracter, letter character and oeprator in the examination question key character of contents of test question The combination planted, then can determine there is the first word matched with preset keyword symbol in the examination question key character of the contents of test question Symbol.
Generally, for for natural sciences examination question, such as numerical chracter, letter character and oeprator will influence natural sciences to try The analysis of topic and answer, even if the similarity of examination question is more than default similarity threshold, can not ensure that the examination question is attached most importance to retrial topic. Such as, examination question 1:One long 6 decimeters, 4 decimeters wide of rectangle has altogether can be cut into how many bottoms and height is all 2 decimeters of right angle three It is angularExamination question 2:One long 5 decimeters, 4 decimeters wide of rectangle has altogether can be cut into how many bottoms and height is all 2 decimeters of right angle three It is angularThe similarity of the examination question 1 and examination question 2 is more than default similarity threshold, but the examination question 1 is two different examinations with examination question 2 Topic.
In the embodiment of the present invention, if user equipment judges do not exist and default pass in the examination question key character of contents of test question First character of key characters matching, then can directly perform step 106.
104th, user equipment is directed to each the target examination question for searching, and the second character in target examination question is extracted successively.
Wherein, second character can include but is not limited to punctuation mark, textual character, numerical chracter, letter character with And oeprator.
105th, user equipment compares the first character and whether the second character is identical, if identical, performs step 106, if it is different, Perform step 107.
In the embodiment of the present invention, for each the target examination question for searching, user equipment can compare the first character and Whether two characters are identical, if the second character and the first character all same in each target examination question, can determine the plurality of Target examination question attach most importance to retrial topic, step 106 can be performed, if the second character and the first character in each target examination question are not Together, then can determine that the plurality of target examination question, for non-duplicate examination question, can perform step 107.
106th, user equipment performs examination question deduplication operation to multiple target examination questions, and terminates this flow.
Specifically, user equipment performs examination question deduplication operation to multiple target examination questions including:
Any one target examination question is selected from multiple target examination questions as reservation examination question, and is deleted in addition to examination question is retained Remaining target examination question;Or,
Memory space in the multiple target examination questions of acquisition shared by each target examination question, determines the minimum target examination of memory space Topic deletes the remaining target examination question in addition to examination question is retained as reservation examination question.
In the embodiment of the present invention, because the plurality of target examination question is attached most importance to retrial topic, therefore only need to retain one of examination question , user equipment can select any one target examination question as reservation examination question from multiple target examination questions, and delete except reservation Remaining target examination question outside examination question.
Or, optionally, user equipment can obtain the memory space shared by each target examination question in multiple target examination questions, The minimum target examination question of memory space is determined as reservation examination question, and deletes the remaining target examination question in addition to examination question is retained, this Sample, can not only save the memory space of natural sciences test item bank, at the same time it can also store more different examination questions.
107th, user equipment retains the plurality of target examination question.
Used as another optional implementation method, for each target examination question, user equipment compares the first character and second Whether character is identical, if identical, extracts the target examination question, and counts the quantity of the target examination question of extraction, if the quantity is more than 1, then examination question deduplication operation is performed to all target examination questions for extracting, if the quantity is equal to 1, retain the target examination of extraction Topic;If it is different, then retaining the target examination question.
In the method described by Fig. 1, in the embodiment of the present invention, user equipment can receive the examination question for carrying contents of test question Duplicate removal is instructed;The duplicate removal instruction of response examination question, from all examination questions that natural sciences test item bank includes, the similarity of search and contents of test question More than multiple target examination questions of default similarity threshold;Further, user equipment may determine that the examination question of contents of test question is crucial In character with the presence or absence of with preset keyword the first character for matching of symbol, if in the presence of, for each the target examination question for searching, according to Secondary the second character extracted in target examination question, further, whether user equipment can compare the first character and the second character It is identical, if identical, examination question deduplication operation is performed to multiple target examination questions.It can be seen that, implement the embodiment of the present invention, user equipment can It is more than in multiple target examination questions of default similarity threshold with from similarity, further compares the examination question key character of contents of test question In the first character and target examination question in the second character it is whether identical, if identical, determine the plurality of target examination question for repeat Examination question, user equipment can perform examination question deduplication operation to the plurality of target examination question, such that it is able to improve the identification of repetition examination question Rate, meanwhile, the precision that removal is inscribed again can be improved.
Embodiment two
Fig. 2 is referred to, Fig. 2 is another examination question De-weight method based on natural sciences test item bank disclosed in the embodiment of the present invention Schematic flow sheet.As shown in Fig. 2 the examination question De-weight method that should be based on natural sciences test item bank may comprise steps of:
201st, user equipment receives the examination question duplicate removal instruction for carrying contents of test question.
202nd, user device responsive examination question duplicate removal instruction, from all examination questions that natural sciences test item bank includes, searches for and examination question The similarity of content is more than multiple target examination questions of default similarity threshold.
203rd, user equipment is judged in the examination question key character of contents of test question with the presence or absence of being matched with preset keyword symbol First character, if in the presence of, step 204~205 are performed, if not existing, perform step 208.
204th, user equipment is directed to each the target examination question for searching, and the second character in target examination question is extracted successively.
205th, user equipment compares the first character and whether the second character is identical, if identical, performs step 206~207, if Difference, performs step 209.
206th, the first order that multiple characters that user equipment determines the first character and includes occur in contents of test question, and The second order that multiple characters that determining the second character includes occur in target examination question.
In the embodiment of the present invention, in the case where the first character and the second character are multiple characters, even if character is homogeneous Together, but, the different order of character can equally influence analysis and the answer of examination question, namely two examination questions are different.
For example, examination question 1:One upper bottom is 6 decimeters, bottom is 4 decimeters, a height of 3 decimeters of trapezoidal area is many It is fewExamination question 2:One upper bottom is 3 decimeters, bottom is 4 decimeters, a height of 6 decimeters of trapezoidal area is how manyNumber in examination question 1 The order of character number is 6,4,3, and the order of the numerical chracter in examination question 2 is 3,4,6, it is obvious that the order influence of numerical chracter The analysis of examination question 1 and examination question 2 and answer, the examination question 1 and examination question 2 be two different problems.
207th, user equipment judges that whether the first order is identical with the second order, if identical, performs step 208, if it is different, Perform step 209.
208th, user equipment performs examination question deduplication operation to multiple target examination questions, and terminates this flow.
In the embodiment of the present invention, if user equipment judges exist and default key in the examination question key character of contents of test question First character of character match, and the first character and the second character are identical, and the first order is identical with the second order, then can be true Fixed the plurality of target examination question is attached most importance to retrial topic, can perform examination question deduplication operation to multiple target examination questions;If user equipment is sentenced In the absence of the first character matched with preset keyword symbol in the examination question key character of disconnected contents of test question, same user equipment can be with Examination question deduplication operation is performed to multiple target examination questions.
209th, user equipment retains the plurality of target examination question.
In the embodiment of the present invention, if user equipment judges exist and default key in the examination question key character of contents of test question First character and the first character of character match and the second character are different, or, if user equipment judges the examination of contents of test question There is the first character with preset keyword symbol matching in topic key character and the first character and the second character are identical and first suitable Sequence is different from the second order, then can retain the plurality of target examination question.
Wherein, implement in the method described by Fig. 2, user equipment can be more than many of default similarity threshold from similarity In individual target examination question, further compare the second word in the first character and target examination question in the examination question key character of contents of test question Whether symbol is identical, if identical, it is first suitable that multiple characters that determining whether the first character includes occur in contents of test question Whether the second order that multiple characters that sequence includes with the second character occur in target examination question is identical, if identical, it is determined that should Multiple target examination questions attach most importance to retrial topic, user equipment can to the plurality of target examination question perform examination question deduplication operation, such that it is able to Improve the discrimination for repeating examination question.
Embodiment three
Fig. 3 is referred to, Fig. 3 is another examination question De-weight method based on natural sciences test item bank disclosed in the embodiment of the present invention Schematic flow sheet.As shown in figure 3, the examination question De-weight method that should be based on natural sciences test item bank may comprise steps of:
301st, user equipment receives the examination question duplicate removal instruction for carrying contents of test question.
302nd, user device responsive examination question duplicate removal instruction, from all examination questions that natural sciences test item bank includes, searches for and examination question The similarity of content is more than multiple target examination questions of default similarity threshold.
303rd, user equipment is judged in the examination question key character of contents of test question with the presence or absence of being matched with preset keyword symbol First character, if in the presence of, step 304~305 are performed, if not existing, perform step 308.
304th, user equipment is directed to each the target examination question for searching, and the second character in target examination question is extracted successively.
305th, user equipment compares the first character and whether the second character is identical, if identical, performs step 306, if it is different, Perform step 307.
306th, user equipment determines the residing first position in contents of test question of each character in the first character, and determines The residing second place in target examination question of each character in second character.
In the embodiment of the present invention, in the case where the first character and the second character are multiple characters, even if character is homogeneous Together, but, the location of each character can equally influence analysis and the answer of examination question, namely two examination questions are different.
For example, examination question 1:One upper bottom is 6 decimeters, bottom is 4 decimeters, a height of 3 decimeters of trapezoidal area is many It is fewExamination question 2:One upper bottom is 6 decimeters, bottom is 3 decimeters, a height of 4 decimeters of trapezoidal area is how manyNumber in examination question 1 Character number 6 is identical with the location of numerical chracter 6 in examination question 2, but, numerical chracter 4 and examination question 2 in examination question 1 In the location of numerical chracter 4 be different, residing for the numerical chracter 3 in numerical chracter 3 in examination question 1 and examination question 2 Position is also different, it is obvious that the position of numerical chracter have impact on analysis and the answer, the He of examination question 1 of examination question 1 and examination question 2 Examination question 2 is two different problems.
307th, user equipment judges whether first position and the second place are identical, if identical, performs step 308, if it is different, Perform step 309.
308th, user equipment performs examination question deduplication operation to multiple target examination questions, and terminates this flow.
In the embodiment of the present invention, if user equipment judges exist and default key in the examination question key character of contents of test question First character of character match, and the first character and the second character are identical, and first position is identical with the second place, then can be true Fixed the plurality of target examination question is attached most importance to retrial topic, can perform examination question deduplication operation to multiple target examination questions;If user equipment is sentenced In the absence of the first character matched with preset keyword symbol in the examination question key character of disconnected contents of test question, same user equipment can be with Examination question deduplication operation is performed to multiple target examination questions.
309th, user equipment retains the plurality of target examination question.
In the embodiment of the present invention, if user equipment judges exist and default key in the examination question key character of contents of test question First character and the first character of character match and the second character are different, or, if user equipment judges the examination of contents of test question There is the first character with preset keyword symbol matching in topic key character and the first character and the second character are identical and first Put different from the second place, then can retain the plurality of target examination question.
Wherein, in the method described by implementing Fig. 3, user equipment can be more than many of default similarity threshold from similarity In individual target examination question, further compare the second word in the first character and target examination question in the examination question key character of contents of test question Whether symbol identical, if identical, determine whether in the first character the residing first position in contents of test question of each character with Whether the residing second place in target examination question of each character is identical in second character, if identical, determines the plurality of target Examination question attach most importance to retrial topic, user equipment can to the plurality of target examination question perform examination question deduplication operation, such that it is able to improve repetition The discrimination of examination question.
Example IV
Fig. 4 is referred to, Fig. 4 is another examination question De-weight method based on natural sciences test item bank disclosed in the embodiment of the present invention Schematic flow sheet.As shown in figure 4, the examination question De-weight method that should be based on natural sciences test item bank may comprise steps of:
401st, user equipment receives the examination question duplicate removal instruction for carrying contents of test question.
402nd, user device responsive examination question duplicate removal instruction, from all examination questions that natural sciences test item bank includes, searches for and examination question The similarity of content is more than multiple target examination questions of default similarity threshold.
404th, user equipment is judged in the examination question key character of contents of test question with the presence or absence of being matched with preset keyword symbol First character, if in the presence of, step 404~405 are performed, if not existing, perform step 409.
404th, user equipment is directed to each the target examination question for searching, and the second character in target examination question is extracted successively.
405th, user equipment compares the first character and whether the second character is identical, if identical, performs step 406~408, if Difference, performs step 410.
406th, the first order that multiple characters that user equipment determines the first character and includes occur in contents of test question, and Determine the residing first position in contents of test question of each character in the first character.
407th, the second order that multiple characters that user equipment determines the second character and includes occur in target examination question, and Determine the residing second place in target examination question of each character in the second character.
Optionally, step 406 and 407 can perform simultaneously, it is also possible to perform step 407 after first carrying out step 406, or Person, it is also possible to perform step 406 after first carrying out step 407, the embodiment of the present invention is not limited.
408th, user equipment judges whether the first order is identical with the second order, and judges first position and the second place It is whether identical, if the first order is identical with the second order and first position is identical with the second place, step 409 is performed, if first Order is different with the second place from the second different and/or first position of order, performs step 410.
Optionally, user equipment can simultaneously judge whether the first order is identical with the second order, and judge first Put with whether the second place is identical, or, user equipment can first judge whether the first order is identical with the second order, if phase Together, then judge whether first position is identical with the second place, or, user equipment can first judge first position and second Whether position is identical, if it is identical, then judge whether the first order is identical with the second order, and the embodiment of the present invention is not limited.
409th, user equipment performs examination question deduplication operation to multiple target examination questions, and terminates this flow.
410th, user equipment retains the plurality of target examination question.
Wherein, implement in the method described by Fig. 4, user equipment can be more than many of default similarity threshold from similarity In individual target examination question, further compare the second word in the first character and target examination question in the examination question key character of contents of test question Whether symbol is identical, if identical, it is first suitable that multiple characters that determining whether the first character includes occur in contents of test question Whether the second order that multiple characters that sequence includes with the second character occur in target examination question is identical, and judges the first character In the first position residing in the contents of test question of each character and each character in the second character residing the in target examination question Whether two positions are identical, if the first order is identical with the second order and first position is identical with the second place, it is determined that this is more Individual target examination question attach most importance to retrial topic, user equipment can to the plurality of target examination question perform examination question deduplication operation, such that it is able to carry Height repeats the discrimination of examination question.
Embodiment five
Fig. 5 is referred to, Fig. 5 is a kind of structural representation of examination question duplicate removal device disclosed in the embodiment of the present invention.Wherein, should Examination question duplicate removal device can be used for performing the part or all of step described in Fig. 1~Fig. 4 in method, specifically refer to Fig. 1 Associated description in~Fig. 4, will not be repeated here.As shown in figure 5, the examination question duplicate removal device can include:
Receiving unit 501, the examination question duplicate removal instruction of contents of test question is carried for receiving;
Search unit 502, for responding the examination question duplicate removal instruction, from all examination questions that natural sciences test item bank includes, searches Rope is more than multiple target examination questions of default similarity threshold with the similarity of the contents of test question;
First judging unit 503, for whether there is and default pass in the examination question key character for judging the contents of test question First character of key characters matching;
Extraction unit 504, for when in the examination question key character that first judging unit 503 judges the contents of test question During in the presence of the first character matched with preset keyword symbol, for each the described target examination question for searching, extract successively described The second character in target examination question;
Whether comparing unit 505 is identical for comparing first character and second character;
Duplicate removal unit 506 is identical with second character for comparing first character in the comparing unit 505 When, examination question deduplication operation is performed to the multiple target examination question.
Optionally, the duplicate removal unit 506, is additionally operable to judge the contents of test question in first judging unit 503 When in examination question key character in the absence of the first character matched with preset keyword symbol, examination question is performed to the multiple target examination question Deduplication operation.
Optionally, the duplicate removal unit 506 is specially to the mode that the multiple target examination question performs examination question deduplication operation:
Any one target examination question is selected from the multiple target examination question as reservation examination question, and is deleted except the reservation examination Remaining target examination question outside topic;Or,
The memory space shared by each described target examination question in the multiple target examination question is obtained, determines that memory space is minimum Target examination question as retain examination question, and delete except it is described reservation examination question in addition to remaining target examination question.
Wherein, in the examination question duplicate removal device described by Fig. 5, the multiple of default similarity threshold can be more than from similarity In target examination question, further compare the second character in the first character and target examination question in the examination question key character of contents of test question It is whether identical, if identical, determine the plurality of target examination question attach most importance to retrial topic, can to the plurality of target examination question perform examination question go Operate again, such that it is able to improve the discrimination of repetition examination question, meanwhile, the precision that removal is inscribed again can be improved.
Embodiment six
Fig. 6 is referred to, Fig. 6 is the structural representation of another examination question duplicate removal device disclosed in the embodiment of the present invention.Wherein, The examination question duplicate removal device can be used for performing the part or all of step described in Fig. 1~Fig. 4 in method, specifically refer to figure Associated description in 1~Fig. 4, will not be repeated here.Examination question duplicate removal device shown in Fig. 6 is due to the examination question duplicate removal shown in Fig. 5 Device optimizes what is obtained.In Fig. 6, the first character and the second character include multiple characters, with the examination question duplicate removal shown in Fig. 5 Device is compared, and the examination question duplicate removal device shown in Fig. 6 can also include:
Determining unit 507 is identical with second character for comparing first character in the comparing unit 505 When, the first order that multiple characters that determining first character includes occur in the contents of test question, and determine described The second order that multiple characters that second character includes occur in the target examination question;
Whether the second judging unit 508 is identical with the described second order for judging first order;
The duplicate removal unit 506, specifically for judging first order with described when second judging unit 508 When two orders are identical, examination question deduplication operation is performed to the multiple target examination question.
Optionally, the determining unit 507, be additionally operable to when second judging unit 508 judge first order with When second order is identical, the residing first position in the contents of test question of each character in first character is determined, And the residing second place in the target examination question of each character in determination second character;
Second judging unit 508, is additionally operable to judge whether the first position is identical with the second place;
The duplicate removal unit 506, specifically for judging first order with described when second judging unit 508 Two orders are identical and during the first position identical with the second place, examination question duplicate removal behaviour are performed to the multiple target examination question Make.
Wherein, implement the examination question duplicate removal device described by Fig. 6, the multiple of default similarity threshold can be more than from similarity In target examination question, further compare the second character in the first character and target examination question in the examination question key character of contents of test question It is whether identical, if identical, determine the plurality of target examination question attach most importance to retrial topic, can to the plurality of target examination question perform examination question go Operate again, such that it is able to improve the discrimination of repetition examination question, meanwhile, the precision that removal is inscribed again can be improved.
Embodiment seven
Fig. 7 is referred to, Fig. 7 is a kind of structural representation of user equipment disclosed in the embodiment of the present invention.Wherein, Fig. 7 institutes The user equipment for showing includes any one examination question duplicate removal device of Fig. 5~Fig. 6.Implement the user equipment shown in Fig. 7, weight can be improved The discrimination of retrial topic, meanwhile, the precision that removal is inscribed again can be improved.
One of ordinary skill in the art will appreciate that all or part of step in the various methods of above-described embodiment is can Completed with instructing the hardware of correlation by program, the program can be stored in a computer-readable recording medium, storage Medium include read-only storage (Read-Only Memory, ROM), random access memory (Random Access Memory, RAM), programmable read only memory (Programmable Read-only Memory, PROM), erasable programmable is read-only deposits Reservoir (Erasable Programmable Read Only Memory, EPROM), disposable programmable read-only storage (One- Time Programmable Read-Only Memory, OTPROM), the electronics formula of erasing can make carbon copies read-only storage (Electrically-Erasable Programmable Read-Only Memory, EEPROM), read-only optical disc (Compact Disc Read-Only Memory, CD-ROM) or other disk storages, magnetic disk storage, magnetic tape storage or can For carrying or computer-readable any other medium of data storage.
Above to a kind of examination question De-weight method and device, Yong Hushe based on natural sciences test item bank disclosed in the embodiment of the present invention Standby to be described in detail, specific case used herein is set forth to principle of the invention and implementation method, the above The explanation of embodiment is only intended to help and understands the method for the present invention and its core concept;Simultaneously for the general skill of this area Art personnel, according to thought of the invention, will change in specific embodiments and applications, in sum, this Description should not be construed as limiting the invention.

Claims (11)

1. a kind of examination question De-weight method based on natural sciences test item bank, it is characterised in that including:
Reception carries the examination question duplicate removal instruction of contents of test question;
The examination question duplicate removal instruction is responded, from all examination questions that natural sciences test item bank includes, the phase with the contents of test question is searched for Multiple target examination questions like degree more than default similarity threshold;
Judge to whether there is the first character matched with preset keyword symbol in the examination question key character of the contents of test question, if depositing For each the described target examination question for searching, the second character in the target examination question is being extracted successively;
Compare first character and whether second character is identical, if identical, examination is performed to the multiple target examination question Topic deduplication operation.
2. method according to claim 1, it is characterised in that first character and second character include multiple Character, when first character is identical with second character, methods described also includes:
The first order that multiple characters that determining first character includes occur in the contents of test question, and determine described The second order that multiple characters that second character includes occur in the target examination question;
Judge whether first order is identical with the described second order, if identical, perform described to the multiple target The step of examination question performs examination question deduplication operation.
3. method according to claim 2, it is characterised in that judging that first order is identical with second order When, methods described also includes:
Determine the residing first position in the contents of test question of each character in first character, and determine described second The residing second place in the target examination question of each character in character;
Judge whether the first position is identical with the second place, if identical, perform described to the multiple target The step of examination question performs examination question deduplication operation.
4. the method according to any one of claims 1 to 3, it is characterised in that methods described also includes:
If right in the absence of the first character matched with preset keyword symbol in judging the examination question key character of the contents of test question The multiple target examination question performs examination question deduplication operation.
5. method according to claim 4, it is characterised in that described that examination question duplicate removal behaviour is performed to the multiple target examination question Work includes:
Selected from the multiple target examination question any one target examination question as retain examination question, and delete except it is described reservation examination question it Outer remaining target examination question;Or,
The memory space shared by each described target examination question in the multiple target examination question is obtained, the minimum mesh of memory space is determined Mark examination question deletes the remaining target examination question in addition to the reservation examination question as reservation examination question.
6. a kind of examination question duplicate removal device, it is characterised in that including:
Receiving unit, the examination question duplicate removal instruction of contents of test question is carried for receiving;
Search unit, for responding examination question duplicate removal instruction, from all examination questions that natural sciences test item bank includes, search with it is described The similarity of contents of test question is more than multiple target examination questions of default similarity threshold;
First judging unit, for whether there is in the examination question key character for judging the contents of test question and preset keyword symbol The first character matched somebody with somebody;
Extraction unit, for existing in the examination question key character that the contents of test question is judged when first judging unit and presetting During the first character of key character matching, for each the described target examination question for searching, in extracting the target examination question successively The second character;
Whether comparing unit is identical for comparing first character and second character;
Duplicate removal unit, for when the comparing unit first character is identical with second character, to described many Individual target examination question performs examination question deduplication operation.
7. examination question duplicate removal device according to claim 6, it is characterised in that first character and second character are equal Including multiple characters, the examination question duplicate removal device also includes:
Determining unit, for when the comparing unit first character is identical with second character, it is determined that described The first order that multiple characters that first character includes occur in the contents of test question, and determine that second character includes Multiple characters occur in the target examination question second order;
Whether the second judging unit is identical with the described second order for judging first order;
The duplicate removal unit, specifically for judging that first order is identical with second order when second judging unit When, examination question deduplication operation is performed to the multiple target examination question.
8. examination question duplicate removal device according to claim 7, it is characterised in that
The determining unit, is additionally operable to judge that first order is identical with second order when second judging unit When, determine the residing first position in the contents of test question of each character in first character, and determine described second The residing second place in the target examination question of each character in character;
Second judging unit, is additionally operable to judge whether the first position is identical with the second place;
The duplicate removal unit, specifically for judging that first order is identical with second order when second judging unit And the first position it is identical with the second place when, to the multiple target examination question perform examination question deduplication operation.
9. the examination question duplicate removal device according to any one of claim 6~8, it is characterised in that the duplicate removal unit, is additionally operable to Accord with what is matched in the absence of with preset keyword in first judging unit judges the examination question key character of the contents of test question During the first character, examination question deduplication operation is performed to the multiple target examination question.
10. examination question duplicate removal device according to claim 9, it is characterised in that the duplicate removal unit is to the multiple target The mode that examination question performs examination question deduplication operation is specially:
Selected from the multiple target examination question any one target examination question as retain examination question, and delete except it is described reservation examination question it Outer remaining target examination question;Or,
The memory space shared by each described target examination question in the multiple target examination question is obtained, the minimum mesh of memory space is determined Mark examination question deletes the remaining target examination question in addition to the reservation examination question as reservation examination question.
11. a kind of user equipmenies, it is characterised in that gone including the examination question described in claim 6~claim 10 any one Refitting is put.
CN201710065948.9A 2017-02-06 2017-02-06 A kind of examination question De-weight method and device, user equipment based on natural sciences test item bank Pending CN106815372A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710065948.9A CN106815372A (en) 2017-02-06 2017-02-06 A kind of examination question De-weight method and device, user equipment based on natural sciences test item bank

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201710065948.9A CN106815372A (en) 2017-02-06 2017-02-06 A kind of examination question De-weight method and device, user equipment based on natural sciences test item bank

Publications (1)

Publication Number Publication Date
CN106815372A true CN106815372A (en) 2017-06-09

Family

ID=59111375

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710065948.9A Pending CN106815372A (en) 2017-02-06 2017-02-06 A kind of examination question De-weight method and device, user equipment based on natural sciences test item bank

Country Status (1)

Country Link
CN (1) CN106815372A (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107578659A (en) * 2017-09-27 2018-01-12 广东小天才科技有限公司 Generation method, generating means and the terminal of electronics topic
CN108984702A (en) * 2018-07-06 2018-12-11 深圳市卓帆技术有限公司 Examination question comparison method and system
CN111552782A (en) * 2020-04-30 2020-08-18 尚杰 Topic search processing method and device

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102629272A (en) * 2012-03-14 2012-08-08 北京邮电大学 Clustering based optimization method for examination system database
CN105373594A (en) * 2015-10-23 2016-03-02 广东小天才科技有限公司 Method and apparatus for screening repeated test questions from question bank
CN105824798A (en) * 2016-03-03 2016-08-03 云南电网有限责任公司教育培训评价中心 Examination question de-duplicating method of examination question base based on examination question key word likeness

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102629272A (en) * 2012-03-14 2012-08-08 北京邮电大学 Clustering based optimization method for examination system database
CN105373594A (en) * 2015-10-23 2016-03-02 广东小天才科技有限公司 Method and apparatus for screening repeated test questions from question bank
CN105824798A (en) * 2016-03-03 2016-08-03 云南电网有限责任公司教育培训评价中心 Examination question de-duplicating method of examination question base based on examination question key word likeness

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107578659A (en) * 2017-09-27 2018-01-12 广东小天才科技有限公司 Generation method, generating means and the terminal of electronics topic
CN108984702A (en) * 2018-07-06 2018-12-11 深圳市卓帆技术有限公司 Examination question comparison method and system
CN111552782A (en) * 2020-04-30 2020-08-18 尚杰 Topic search processing method and device

Similar Documents

Publication Publication Date Title
Tian et al. Towards predicting the best answers in community-based question-answering services
Keogh et al. Finding surprising patterns in a time series database in linear time and space
WO2019218514A1 (en) Method for extracting webpage target information, device, and storage medium
Fakhari et al. Combination of classification and regression in decision tree for multi-labeling image annotation and retrieval
CN104899273A (en) Personalized webpage recommendation method based on topic and relative entropy
CN110334178A (en) Data retrieval method, device, equipment and readable storage medium storing program for executing
CN103617213B (en) Method and system for identifying newspage attributive characters
CN109471944A (en) Training method, device and the readable storage medium storing program for executing of textual classification model
WO2013058994A1 (en) Methods and apparatuses for generating search expressions from content, for applying search expressions to content collections, and/or for analyzing corresponding search results
CN106815372A (en) A kind of examination question De-weight method and device, user equipment based on natural sciences test item bank
CN107085583A (en) A kind of electronic document management method and device based on content
Chantrapornchai et al. Information extraction based on named entity for tourism corpus
CN110209659A (en) A kind of resume filter method, system and computer readable storage medium
CN103853797B (en) A kind of picture retrieval method and system based on n member picture indices structures
CN114330329A (en) Service content searching method and device, electronic equipment and storage medium
CN113515589A (en) Data recommendation method, device, equipment and medium
Islam et al. Review analysis of ride-sharing applications using machine learning approaches: Bangladesh perspective
Ontoum et al. Personality type based on myers-briggs type indicator with text posting style by using traditional and deep learning
CN115658080A (en) Method and system for identifying open source code components of software
CN109344233A (en) A kind of Chinese personal name recognition method
CN110377690A (en) A kind of information acquisition method and system based on long-range Relation extraction
CN111144453A (en) Method and equipment for constructing multi-model fusion calculation model and method and equipment for identifying website data
CN114780745A (en) Method and device for constructing knowledge system, electronic equipment and storage medium
Hamed et al. DISINFORMATION DETECTION ABOUT ISLAMIC ISSUES ON SOCIAL MEDIA USING DEEP LEARNING TECHNIQUES
CN115130455A (en) Article processing method and device, electronic equipment and storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20170609

RJ01 Rejection of invention patent application after publication