CN111651963A - Technology for generating test paper by importing word into question bank - Google Patents
Technology for generating test paper by importing word into question bank Download PDFInfo
- Publication number
- CN111651963A CN111651963A CN202010407220.1A CN202010407220A CN111651963A CN 111651963 A CN111651963 A CN 111651963A CN 202010407220 A CN202010407220 A CN 202010407220A CN 111651963 A CN111651963 A CN 111651963A
- Authority
- CN
- China
- Prior art keywords
- word
- html
- test
- question
- questions
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/10—Text processing
- G06F40/12—Use of codes for handling textual entities
- G06F40/151—Transformation
Abstract
The invention discloses a technology for generating test paper by importing word into question bank, which comprises the following steps: the first step is as follows: converting word into html; the second step is that: analyzing the test questions in the html format, and analyzing corresponding test question contents according to test question patterns of different question types; the implementation principle of converting word into html is as follows: after clicking a word import button, acquiring a suffix name of an uploaded file, judging that the suffix name of the uploaded wrod document is in doc format and in docx format, if the suffix name is docx, firstly generating an XWPFDocument object by loading the document, converting the XWPFDocument object into XHTML by using the XWPFDocument object, and if the suffix name is doc, reading the doc document by using the HWPFDocument object, and converting the word into html by using a WordToHtmlConverter method. Has the advantages that: the invention adds the functions of identifying and analyzing the types of special test questions such as single-choice questions, multiple-choice questions, blank filling questions, judgment questions, short answer questions and the like on the basis of original identification of word document contents, and is convenient for users to realize batch introduction of the test questions through word document introduction.
Description
Technical Field
The invention relates to the field of word document identification, in particular to a technology for generating a test paper by importing a word into a question bank.
Background
Although many word document identification functions in the market can identify the content in a word document, the known word document identification in the market can only identify the title of the document and the content of the document, and cannot support the identification and analysis of a specific data structure, so that the problem that people who need to import test questions through the word document in the education industry cannot be met, and great inconvenience is brought to users who have the requirements.
An effective solution to the problems in the related art has not been proposed yet.
Disclosure of Invention
In order to overcome the defect that the existing word document identification function can only identify the title of a document and the content of the document, the invention provides a service for importing identification according to the word document, which not only can perfectly meet the existing document identification function in the market, but also can additionally identify special test question types such as single-choice questions, multiple-choice questions, blank filling questions, judgment questions, short-answer questions and the like on the original basis.
In order to achieve the purpose, the invention provides the following technical scheme: a technology for generating test paper by using word to import question bank comprises the following steps:
the first step is as follows: converting word into html;
the second step is that: analyzing the test questions in the html format, and analyzing the corresponding test question contents according to the test question patterns of different question types.
Further, the implementation principle of converting word into html is as follows: and after a word import button is clicked, acquiring a suffix name of the uploaded file, and judging whether the suffix name of the uploaded wrad document is in a doc format or a doc format.
Further, if the suffix name is. docx, an XWPFDcument object is generated by loading the document and converted into XHTML by using XWPFDcument, and if the suffix name is. doc, the doc document is read by an HWPFdocument object and converted into html by using a WordToHtmlConverter method.
Further, obtaining all non-text elements such as pictures and formulas in the word document according to the html obtained through analysis, converting the non-text elements into a picture stream, uploading the picture stream to a file server through the converted picture stream, obtaining addresses of the pictures in the server, and finally replacing the non-text elements such as the pictures and the formulas in the html with the obtained server picture addresses, thereby completing the process of converting the word document into the html of the plain text.
Further, the implementation principle of analyzing the html-format test questions is as follows: after the html content is obtained, analyzing layer by layer in a recursion mode according to different label styles in the content, obtaining content related to test questions in the html text, and performing regular expression matching on the test question content according to a predetermined format after the test question content is obtained.
Further, the list of the test questions and the question stem, option, answer and other specific test question information of each question are obtained in a regular expression matching mode, and the analysis of the html-style test questions is completed after the information is obtained.
Compared with the prior art, the invention has the following beneficial effects: the invention adds the functions of identifying and analyzing the types of special test questions such as single-choice questions, multiple-choice questions, blank filling questions, judgment questions, short answer questions and the like on the basis of original identification of word document contents, and is convenient for users to realize batch introduction of the test questions through word document introduction.
Detailed Description
The invention will be further described with reference to specific embodiments:
the technology for generating the test paper by using the word imported question bank comprises the following steps of:
the first step is as follows: the realization principle of converting word into html is as follows: after clicking a word import button, acquiring the suffix name of the uploaded file, judging the suffix name of the uploaded wrod document to be in a doc format or a docx format, docx, the XWPFDocument object is first generated by loading the document, converted to XHTML using XWPFDocument, if the suffix name is. doc, the doc document is read by the HWPFDocument object, word is converted to html using the wordtohtmlcoverter method, obtaining all non-text elements such as pictures, formulas and the like in the word document according to the html obtained by analyzing, converting the non-text elements into a picture stream, uploading the converted picture stream to a file server, obtaining addresses of the pictures in the server, finally replacing non-text elements such as pictures, formulas and the like in html with the obtained server picture addresses, completing the process of converting word documents into html of plain text, clicking a word import button, importing a word template, and converting the question types in the import template into html and displaying by a system;
the second step is that: analyzing the test questions in the html format, analyzing the corresponding test question contents according to the test question patterns of different question types, and realizing the analysis principle of the test questions in the html format as follows: after html content is obtained, analyzing layer by layer in a recursive mode according to different label patterns in the content, obtaining content related to test questions in a html text, performing regular expression matching on the test question content according to a format agreed in advance after the test question content is obtained, obtaining a list of the test questions and specific test question information such as a question stem, options and answers of each question in a regular expression matching mode, completing analysis on the html-pattern test questions by obtaining the information, clicking an identification button, identifying the corresponding types of the test questions by a system according to different types of the test questions on a left page, wherein the current question types supporting automatic identification comprise single-choice questions, multiple-choice questions, blank filling questions, judgment questions and short answer questions;
and finally, clicking an import test paper button, and importing the identified test questions into the system by the system.
In summary, the invention adds the functions of identifying and analyzing the types of special test questions such as single-choice questions, multiple-choice questions, blank filling questions, judgment questions, short answer questions and the like on the basis of original identification of word document contents, and is convenient for users to realize batch import of test questions through word document import.
Finally, it should be noted that: although the present invention has been described in detail with reference to the foregoing embodiments, those skilled in the art will understand that various changes, modifications and substitutions can be made without departing from the spirit and scope of the invention as defined by the appended claims. Any modification, equivalent replacement, or improvement made within the spirit and principle of the present invention should be included in the protection scope of the present invention.
Claims (6)
1. A technology for generating test paper by using word to import question bank is characterized by comprising the following steps:
the first step is as follows: converting word into html;
the second step is that: analyzing the test questions in the html format, and analyzing the corresponding test question contents according to the test question patterns of different question types.
2. The technology for generating the test paper by importing the word into the question bank according to claim 1, wherein the implementation principle of converting the word into the html is as follows: and after a word import button is clicked, acquiring a suffix name of the uploaded file, and judging whether the suffix name of the uploaded wrad document is in a doc format or a doc format.
3. The technique of claim 2, wherein if the suffix name is. docx, an XWPFDocument object is first generated by loading a document and converted into XHTML by using XWPFDocument, and if the suffix name is. doc, a doc document is read by an HWPFdocument object and word is converted into html by using a WordToHtmlConverter method.
4. The technology for generating a test paper by importing a word into a question bank according to claim 3 is characterized in that all non-text elements such as pictures and formulas in a word document are obtained according to html obtained through analysis, the non-text elements are converted into picture streams and uploaded to a file server through the converted picture streams, addresses of the pictures in the server are obtained, finally the obtained server picture addresses are used for replacing the non-text elements such as the pictures and the formulas in the html, and the process is used for completing the conversion of the word document into the html of the plain text.
5. The technology for generating the test paper by using the word import question bank as claimed in claim 1, wherein the analysis of the test paper in the html format is implemented according to the following principle: after the html content is obtained, analyzing layer by layer in a recursion mode according to different label styles in the content, obtaining content related to test questions in the html text, and performing regular expression matching on the test question content according to a predetermined format after the test question content is obtained.
6. The technology for generating test paper by importing word into question bank as claimed in claim 5, wherein the list of test questions and the question stem, option, answer and other specific test question information of each question are obtained by regular expression matching, and the analysis of html-style test questions is completed by obtaining the information.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010407220.1A CN111651963A (en) | 2020-05-14 | 2020-05-14 | Technology for generating test paper by importing word into question bank |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010407220.1A CN111651963A (en) | 2020-05-14 | 2020-05-14 | Technology for generating test paper by importing word into question bank |
Publications (1)
Publication Number | Publication Date |
---|---|
CN111651963A true CN111651963A (en) | 2020-09-11 |
Family
ID=72346661
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202010407220.1A Pending CN111651963A (en) | 2020-05-14 | 2020-05-14 | Technology for generating test paper by importing word into question bank |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN111651963A (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113065316A (en) * | 2021-03-26 | 2021-07-02 | 洛阳圣昂通网络科技有限公司 | Method for dynamically converting formal thumbnail file into html (hypertext markup language) and inputting question bank, selecting questions from question bank and composing draft and generating thumbnail file |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104199871A (en) * | 2014-08-19 | 2014-12-10 | 南京富士通南大软件技术有限公司 | High-speed test question inputting method for intelligent teaching |
CN104267953A (en) * | 2014-09-27 | 2015-01-07 | 昆明钢铁集团有限责任公司 | Control and method for importing Word test questions based on browser |
CN109614594A (en) * | 2018-11-27 | 2019-04-12 | 浙江万朋教育科技股份有限公司 | A method of topic document is resolved into exam pool data |
CN109947836A (en) * | 2019-03-21 | 2019-06-28 | 江西风向标教育科技有限公司 | English paper structural method and device |
-
2020
- 2020-05-14 CN CN202010407220.1A patent/CN111651963A/en active Pending
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104199871A (en) * | 2014-08-19 | 2014-12-10 | 南京富士通南大软件技术有限公司 | High-speed test question inputting method for intelligent teaching |
CN104267953A (en) * | 2014-09-27 | 2015-01-07 | 昆明钢铁集团有限责任公司 | Control and method for importing Word test questions based on browser |
CN109614594A (en) * | 2018-11-27 | 2019-04-12 | 浙江万朋教育科技股份有限公司 | A method of topic document is resolved into exam pool data |
CN109947836A (en) * | 2019-03-21 | 2019-06-28 | 江西风向标教育科技有限公司 | English paper structural method and device |
Non-Patent Citations (1)
Title |
---|
烟火_: "java word文档转html文件", 《HTTPS://WWW.CNBLOGS.COM/ALWAYS-ONLINE/P/4800131.HTML》 * |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113065316A (en) * | 2021-03-26 | 2021-07-02 | 洛阳圣昂通网络科技有限公司 | Method for dynamically converting formal thumbnail file into html (hypertext markup language) and inputting question bank, selecting questions from question bank and composing draft and generating thumbnail file |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Römer et al. | From student hard drive to web corpus (part 1): The design, compilation and genre classification of the Michigan Corpus of Upper-level Student Papers (MICUSP) | |
CN112052414A (en) | Data processing method and device and readable storage medium | |
CN106846961A (en) | The treating method and apparatus of electronic test paper | |
CN112115301A (en) | Video annotation method and system based on classroom notes | |
CN106940693B (en) | Single structure processing method for laboratory original record | |
Park et al. | Dublin Core metadata semantics: An analysis of the perspectives of information professionals | |
CN112183143A (en) | Reading and amending method and teaching system | |
Horner | Categories of environmental print: All logos are not created equal | |
Onwuegbuzie et al. | Evidence-Based Guidelines for Avoiding the Most Prevalent and Serious APA Error in Journal Article Submissions-The Citation Error. | |
CN109614594B (en) | Method for analyzing topic document into topic library data | |
CN111651963A (en) | Technology for generating test paper by importing word into question bank | |
CN113779345B (en) | Teaching material generation method and device, computer equipment and storage medium | |
Figueras et al. | Relating a reading comprehension test to the CEFR levels: A case of standard setting in practice with focus on judges and items | |
JP2009116519A (en) | Personal history development device | |
Johnston et al. | Communication of behaviour change interventions: can they be recognised from written descriptions? | |
Wickramanayake | Instruction and help services in the academic library websites and web pages in Sri Lanka: a content analysis | |
CN112509402A (en) | Establishment, issue and anti-counterfeiting method of competitive question bank based on three-dimensional teaching materials and teaching system thereof | |
KR20130050381A (en) | Systems and methods for document analysis | |
CN112396897A (en) | Teaching system | |
Hagedorn et al. | Research on Urban Community College Transfer and Retention: The Los Angeles TRUCCS Project. | |
CN115373649B (en) | Dynamic internet content barrier-free transformation method and device and website content barrier-free transformation method | |
CN116010569A (en) | Online answering method, system, electronic equipment and storage medium | |
CN114973798A (en) | Word learning card generation method and device | |
CN112783957A (en) | Method and system for importing word document format for English reading | |
KR102552408B1 (en) | System for providing math tutoring service based on au and method performing thereof |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
CB02 | Change of applicant information | ||
CB02 | Change of applicant information |
Address after: Room 901, 902, 903, 904, building 1, 1188 Qinzhou North Road, Xuhui District, Shanghai 200000 Applicant after: Shanghai Zhuoyue Ruixin Digital Technology Co.,Ltd. Address before: Room 901, 902, 903, 904, building 1, 1188 Qinzhou North Road, Xuhui District, Shanghai 200000 Applicant before: SHANGHAI ABLE DIGITAL & TECH Co.,Ltd. |
|
RJ01 | Rejection of invention patent application after publication | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20200911 |