CN111651963A - Technology for generating test paper by importing word into question bank - Google Patents

Info

Publication number
CN111651963A
Authority
CN
China
Prior art keywords
word
html
test
question
questions
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202010407220.1A
Other languages
Chinese (zh)
Inventor
王晖
李佳
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shanghai Able Digital & Tech Co ltd
Original Assignee
Shanghai Able Digital & Tech Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
2020-05-14
Filing date
2020-05-14
Publication date
2020-09-11
Application filed by Shanghai Able Digital & Tech Co ltd
Priority to CN202010407220.1A
Publication of CN111651963A

Classifications

    • G: PHYSICS
    • G06: COMPUTING; CALCULATING OR COUNTING
    • G06F: ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00: Handling natural language data
    • G06F40/10: Text processing
    • G06F40/12: Use of codes for handling textual entities
    • G06F40/151: Transformation

Abstract

The invention discloses a technique for generating a test paper by importing a Word document into a question bank, comprising the following steps. First step: convert the Word document into HTML. Second step: parse the test questions in the HTML, extracting the corresponding question content according to the format of each question type. The Word-to-HTML conversion works as follows: after the Word import button is clicked, the suffix (file extension) of the uploaded file is obtained, and it is determined whether the uploaded Word document is in .doc format or .docx format. If the suffix is .docx, an XWPFDocument object is first generated by loading the document and is converted into XHTML; if the suffix is .doc, the document is read through an HWPFDocument object and converted into HTML using WordToHtmlConverter. Beneficial effects: in addition to the existing recognition of Word document content, the invention identifies and parses specific test question types such as single-choice, multiple-choice, fill-in-the-blank, true/false, and short-answer questions, so that users can import test questions in batches from a Word document.

Description

Technology for generating test paper by importing word into question bank
Technical Field
The invention relates to the field of Word document recognition, and in particular to a technique for generating a test paper by importing a Word document into a question bank.
Background
Many Word document recognition tools on the market can recognize the content of a Word document, but they recognize only the document's title and body text and cannot identify or parse a specific data structure. They therefore fail to meet the needs of users in the education industry who want to import test questions from a Word document, which causes those users considerable inconvenience.
No effective solution to this problem in the related art has yet been proposed.
Disclosure of Invention
To overcome the limitation that existing Word document recognition can identify only a document's title and content, the invention provides an import-and-recognition service for Word documents. It covers the document recognition functions already available on the market and additionally identifies specific test question types such as single-choice, multiple-choice, fill-in-the-blank, true/false, and short-answer questions.
To achieve this purpose, the invention provides the following technical solution: a technique for generating a test paper by importing a Word document into a question bank, comprising the following steps:
First step: convert the Word document into HTML;
Second step: parse the test questions in the HTML, extracting the corresponding question content according to the format of each question type.
Further, the Word-to-HTML conversion works as follows: after the Word import button is clicked, the suffix of the uploaded file is obtained, and it is determined whether the uploaded Word document is in .doc format or .docx format.
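As an illustration only, the suffix check could be sketched in Java as follows. The class name WordFormatDetector, the WordFormat enum, and the case-insensitive comparison are assumptions made for this sketch and do not appear in the application.

```java
import java.util.Locale;
import java.util.Optional;

/** Hypothetical helper: classifies an uploaded file by its suffix. */
final class WordFormatDetector {

    /** The two Word formats the description distinguishes. */
    enum WordFormat { DOC, DOCX }

    /**
     * Returns the Word format implied by the file name's suffix,
     * or an empty Optional if the suffix is neither .doc nor .docx.
     */
    static Optional<WordFormat> detect(String fileName) {
        if (fileName == null) {
            return Optional.empty();
        }
        int dot = fileName.lastIndexOf('.');
        if (dot < 0 || dot == fileName.length() - 1) {
            return Optional.empty(); // no suffix at all
        }
        // Assumption: compare case-insensitively so "PAPER.DOCX" is accepted too.
        String suffix = fileName.substring(dot + 1).toLowerCase(Locale.ROOT);
        switch (suffix) {
            case "docx": return Optional.of(WordFormat.DOCX);
            case "doc":  return Optional.of(WordFormat.DOC);
            default:     return Optional.empty();
        }
    }
}
```

A caller would branch on the returned value to choose the .docx or .doc conversion path, and reject the upload when the Optional is empty.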
Further, if the suffix is .docx, an XWPFDocument object is first generated by loading the document and is converted into XHTML; if the suffix is .doc, the document is read through an HWPFDocument object and converted into HTML using WordToHtmlConverter.
Further, all non-text elements in the Word document, such as pictures and formulas, are located in the HTML obtained from the conversion and converted into picture streams; the picture streams are uploaded to a file server, the addresses of the pictures on the server are obtained, and the non-text elements in the HTML are finally replaced with those server picture addresses, completing the conversion of the Word document into plain-text HTML.
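The replacement of non-text elements with server addresses might look roughly like the Java sketch below. It assumes, purely for illustration, that the converted HTML embeds each picture as a base64 data URI in an img tag; the application does not say how pictures appear in the intermediate HTML. The class ImageSrcRewriter and the uploader callback (which stands in for the file server and returns the stored picture's address) are hypothetical.

```java
import java.util.Base64;
import java.util.function.Function;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

/** Hypothetical helper: swaps embedded pictures for file-server addresses. */
final class ImageSrcRewriter {

    // Assumption: pictures are embedded as src="data:<mime>;base64,<payload>".
    private static final Pattern DATA_URI_IMG = Pattern.compile(
            "(<img\\b[^>]*?\\bsrc=\")data:[^;\"]+;base64,([A-Za-z0-9+/=]+)(\")",
            Pattern.CASE_INSENSITIVE);

    /**
     * Decodes every embedded picture, hands the bytes to the uploader,
     * and replaces the data URI with the address the uploader returns.
     */
    static String rewrite(String html, Function<byte[], String> uploader) {
        Matcher m = DATA_URI_IMG.matcher(html);
        StringBuffer out = new StringBuffer();
        while (m.find()) {
            byte[] imageBytes = Base64.getDecoder().decode(m.group(2));
            String url = uploader.apply(imageBytes); // stand-in for the file server
            m.appendReplacement(out,
                    Matcher.quoteReplacement(m.group(1) + url + m.group(3)));
        }
        m.appendTail(out);
        return out.toString();
    }
}
```

Passing the uploader in as a function keeps the sketch independent of any particular file server; HTML without embedded pictures is returned unchanged.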
Further, the HTML-format test questions are parsed as follows: once the HTML content is obtained, it is analyzed layer by layer, recursively, according to the different tag styles it contains, to extract the content related to test questions from the HTML text; that content is then matched with regular expressions according to a format agreed in advance.
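One possible reading of the layer-by-layer recursive analysis is a depth-first walk over the element tree that collects the text of each paragraph. The Java sketch below uses the JDK's XML parser and therefore assumes well-formed XHTML input; the class HtmlTextWalker and the choice of the p element as the unit of question content are assumptions, not details taken from the application.

```java
import java.io.StringReader;
import java.util.ArrayList;
import java.util.List;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Node;
import org.w3c.dom.NodeList;
import org.xml.sax.InputSource;

/** Hypothetical helper: recursively extracts paragraph text from XHTML. */
final class HtmlTextWalker {

    /** Parses well-formed XHTML and returns the non-empty paragraph texts in order. */
    static List<String> paragraphs(String xhtml) {
        try {
            Node root = DocumentBuilderFactory.newInstance()
                    .newDocumentBuilder()
                    .parse(new InputSource(new StringReader(xhtml)))
                    .getDocumentElement();
            List<String> result = new ArrayList<>();
            collect(root, result);
            return result;
        } catch (Exception e) {
            throw new IllegalArgumentException("input is not well-formed XHTML", e);
        }
    }

    /** Descends one layer at a time; a <p> element ends the descent. */
    private static void collect(Node node, List<String> result) {
        if (node.getNodeType() == Node.ELEMENT_NODE
                && "p".equalsIgnoreCase(node.getNodeName())) {
            // getTextContent joins the text of all nested inline tags.
            String text = node.getTextContent().trim();
            if (!text.isEmpty()) {
                result.add(text);
            }
            return;
        }
        NodeList children = node.getChildNodes();
        for (int i = 0; i < children.getLength(); i++) {
            collect(children.item(i), result);
        }
    }
}
```

The resulting list of plain-text lines is the kind of input on which the subsequent regular expression matching could operate.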
Further, regular expression matching yields the list of test questions and, for each question, specific information such as the question stem, options, and answer; once this information is obtained, parsing of the HTML-format test questions is complete.
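The application does not disclose the agreed format or the regular expressions themselves. The Java sketch below therefore invents a simple format for illustration: a numbered stem ("1. ..."), lettered options ("A. ..."), and an "Answer: ..." line, each also accepted with the full-width punctuation common in Chinese documents. The class QuestionParser, its Question type, and all three patterns are hypothetical.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

/** Hypothetical parser for an assumed "stem / options / answer" layout. */
final class QuestionParser {

    /** One parsed test question. */
    static final class Question {
        String stem = "";
        final Map<String, String> options = new LinkedHashMap<>();
        String answer = "";
    }

    // \u3001 is the ideographic comma, \uFF1A the full-width colon.
    private static final Pattern STEM = Pattern.compile("^(\\d+)[.\u3001]\\s*(.+)$");
    private static final Pattern OPTION = Pattern.compile("^([A-H])[.\u3001]\\s*(.+)$");
    private static final Pattern ANSWER = Pattern.compile("^Answer[:\uFF1A]\\s*(.+)$");

    /** Groups consecutive lines into questions; a numbered stem starts a new one. */
    static List<Question> parse(List<String> lines) {
        List<Question> questions = new ArrayList<>();
        Question current = null;
        for (String raw : lines) {
            String line = raw.trim();
            Matcher m;
            if ((m = STEM.matcher(line)).matches()) {
                current = new Question();
                current.stem = m.group(2);
                questions.add(current);
            } else if (current == null) {
                continue; // text before the first numbered stem is ignored
            } else if ((m = OPTION.matcher(line)).matches()) {
                current.options.put(m.group(1), m.group(2));
            } else if ((m = ANSWER.matcher(line)).matches()) {
                current.answer = m.group(1);
            }
        }
        return questions;
    }
}
```

Lines that match none of the patterns are skipped in this sketch; a real importer would more likely append them to the current stem or report them to the user.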
Compared with the prior art, the invention has the following beneficial effects: in addition to the existing recognition of Word document content, it identifies and parses specific test question types such as single-choice, multiple-choice, fill-in-the-blank, true/false, and short-answer questions, so that users can import test questions in batches from a Word document.
Detailed Description
The invention will be further described with reference to specific embodiments:
The technique for generating a test paper by importing a Word document into a question bank comprises the following steps:
First step: convert the Word document into HTML. The conversion works as follows: after the Word import button is clicked, the suffix of the uploaded file is obtained, and it is determined whether the uploaded Word document is in .doc format or .docx format. If the suffix is .docx, an XWPFDocument object is first generated by loading the document and is converted into XHTML; if the suffix is .doc, the document is read through an HWPFDocument object and converted into HTML using WordToHtmlConverter. All non-text elements in the Word document, such as pictures and formulas, are then located in the resulting HTML and converted into picture streams; the picture streams are uploaded to a file server, the addresses of the pictures on the server are obtained, and the non-text elements in the HTML are replaced with those server picture addresses, completing the conversion of the Word document into plain-text HTML. In use, the Word import button is clicked and a Word template is imported; the system converts the questions in the imported template into HTML and displays them;
Second step: parse the test questions in the HTML, extracting the corresponding question content according to the format of each question type. The parsing works as follows: once the HTML content is obtained, it is analyzed layer by layer, recursively, according to the different tag styles it contains, to extract the content related to test questions from the HTML text. That content is then matched with regular expressions according to a format agreed in advance, which yields the list of test questions and, for each question, specific information such as the question stem, options, and answer; once this information is obtained, parsing of the HTML-format test questions is complete. In use, the recognize button is clicked and the system identifies the type of each test question shown on the left side of the page; the question types currently supported for automatic recognition are single-choice, multiple-choice, fill-in-the-blank, true/false, and short-answer questions;
Finally, the import test paper button is clicked, and the system imports the identified test questions.
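The application lists the five recognized question types but not the rules that tell them apart. The Java sketch below shows one set of heuristics that could do so; every rule in it (choice answers are capital letters, true/false answers are the words "true" or "false", blanks are marked by three or more underscores) is an assumption for illustration, as are the names QuestionTypeClassifier and QuestionType.

```java
import java.util.regex.Pattern;

/** Hypothetical classifier for the five question types named in the description. */
final class QuestionTypeClassifier {

    enum QuestionType {
        SINGLE_CHOICE, MULTIPLE_CHOICE, FILL_IN_THE_BLANK, TRUE_FALSE, SHORT_ANSWER
    }

    // Assumed conventions, not taken from the application.
    private static final Pattern CHOICE_ANSWER = Pattern.compile("[A-H]+");
    private static final Pattern BOOLEAN_ANSWER =
            Pattern.compile("true|false", Pattern.CASE_INSENSITIVE);
    private static final Pattern BLANK_MARKER = Pattern.compile("_{3,}");

    /** Picks a type from the stem text, the number of options, and the answer text. */
    static QuestionType classify(String stem, int optionCount, String answer) {
        String a = answer.trim();
        if (optionCount > 0 && CHOICE_ANSWER.matcher(a).matches()) {
            // One letter means one correct option; several letters mean several.
            return a.length() == 1
                    ? QuestionType.SINGLE_CHOICE
                    : QuestionType.MULTIPLE_CHOICE;
        }
        if (BOOLEAN_ANSWER.matcher(a).matches()) {
            return QuestionType.TRUE_FALSE;
        }
        if (BLANK_MARKER.matcher(stem).find()) {
            return QuestionType.FILL_IN_THE_BLANK;
        }
        return QuestionType.SHORT_ANSWER; // fallback when nothing else applies
    }
}
```

Short answer is used as the fallback here because it places the fewest constraints on the answer text.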
In summary, in addition to the existing recognition of Word document content, the invention identifies and parses specific test question types such as single-choice, multiple-choice, fill-in-the-blank, true/false, and short-answer questions, so that users can import test questions in batches from a Word document.
Finally, it should be noted that: although the present invention has been described in detail with reference to the foregoing embodiments, those skilled in the art will understand that various changes, modifications and substitutions can be made without departing from the spirit and scope of the invention as defined by the appended claims. Any modification, equivalent replacement, or improvement made within the spirit and principle of the present invention should be included in the protection scope of the present invention.

Claims (6)

1. A technique for generating a test paper by importing a Word document into a question bank, characterized by comprising the following steps:
First step: convert the Word document into HTML;
Second step: parse the test questions in the HTML, extracting the corresponding question content according to the format of each question type.
2. The technique for generating a test paper by importing a Word document into a question bank according to claim 1, wherein the Word-to-HTML conversion works as follows: after the Word import button is clicked, the suffix of the uploaded file is obtained, and it is determined whether the uploaded Word document is in .doc format or .docx format.
3. The technique according to claim 2, wherein if the suffix is .docx, an XWPFDocument object is first generated by loading the document and is converted into XHTML, and if the suffix is .doc, the document is read through an HWPFDocument object and converted into HTML using WordToHtmlConverter.
4. The technique for generating a test paper by importing a Word document into a question bank according to claim 3, wherein all non-text elements in the Word document, such as pictures and formulas, are located in the HTML obtained from the conversion and converted into picture streams, the picture streams are uploaded to a file server, the addresses of the pictures on the server are obtained, and the non-text elements in the HTML are finally replaced with the obtained server picture addresses, thereby completing the conversion of the Word document into plain-text HTML.
5. The technique for generating a test paper by importing a Word document into a question bank according to claim 1, wherein the HTML-format test questions are parsed as follows: once the HTML content is obtained, it is analyzed layer by layer, recursively, according to the different tag styles it contains, to extract the content related to test questions from the HTML text, and that content is then matched with regular expressions according to a format agreed in advance.
6. The technique for generating a test paper by importing a Word document into a question bank according to claim 5, wherein regular expression matching yields the list of test questions and, for each question, specific information such as the question stem, options, and answer, and parsing of the HTML-format test questions is complete once this information is obtained.
CN202010407220.1A 2020-05-14 2020-05-14 Technology for generating test paper by importing word into question bank Pending CN111651963A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010407220.1A CN111651963A (en) 2020-05-14 2020-05-14 Technology for generating test paper by importing word into question bank

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010407220.1A CN111651963A (en) 2020-05-14 2020-05-14 Technology for generating test paper by importing word into question bank

Publications (1)

Publication Number Publication Date
CN111651963A (en) 2020-09-11

Family

ID=72346661

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010407220.1A Pending CN111651963A (en) 2020-05-14 2020-05-14 Technology for generating test paper by importing word into question bank

Country Status (1)

Country Link
CN (1) CN111651963A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113065316A (en) * 2021-03-26 2021-07-02 洛阳圣昂通网络科技有限公司 Method for dynamically converting formal thumbnail file into html (hypertext markup language) and inputting question bank, selecting questions from question bank and composing draft and generating thumbnail file

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104199871A (en) * 2014-08-19 2014-12-10 南京富士通南大软件技术有限公司 High-speed test question inputting method for intelligent teaching
CN104267953A (en) * 2014-09-27 2015-01-07 昆明钢铁集团有限责任公司 Control and method for importing Word test questions based on browser
CN109614594A (en) * 2018-11-27 2019-04-12 浙江万朋教育科技股份有限公司 A method of topic document is resolved into exam pool data
CN109947836A (en) * 2019-03-21 2019-06-28 江西风向标教育科技有限公司 English paper structural method and device

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
烟火_: "java word文档转html文件" (Java: converting a Word document to an HTML file), 《HTTPS://WWW.CNBLOGS.COM/ALWAYS-ONLINE/P/4800131.HTML》 *

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
CB02 Change of applicant information

Address after: Room 901, 902, 903, 904, building 1, 1188 Qinzhou North Road, Xuhui District, Shanghai 200000

Applicant after: Shanghai Zhuoyue Ruixin Digital Technology Co.,Ltd.

Address before: Room 901, 902, 903, 904, building 1, 1188 Qinzhou North Road, Xuhui District, Shanghai 200000

Applicant before: SHANGHAI ABLE DIGITAL & TECH Co.,Ltd.

RJ01 Rejection of invention patent application after publication

Application publication date: 20200911