CN101178786A - Online dissertation management method for realizing plagiarize and format checking by network resource - Google Patents

Online dissertation management method for realizing plagiarize and format checking by network resource Download PDF

Info

Publication number
CN101178786A
CN101178786A CNA2006101181508A CN200610118150A CN101178786A CN 101178786 A CN101178786 A CN 101178786A CN A2006101181508 A CNA2006101181508 A CN A2006101181508A CN 200610118150 A CN200610118150 A CN 200610118150A CN 101178786 A CN101178786 A CN 101178786A
Authority
CN
China
Prior art keywords
paper
essay
software
automatic
online
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CNA2006101181508A
Other languages
Chinese (zh)
Inventor
李银胜
李涛
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
SHANGHAI CHENNIAO INFORMATION SCIENCE AND TECHNOLOGY Co Ltd
Original Assignee
SHANGHAI CHENNIAO INFORMATION SCIENCE AND TECHNOLOGY Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by SHANGHAI CHENNIAO INFORMATION SCIENCE AND TECHNOLOGY Co Ltd filed Critical SHANGHAI CHENNIAO INFORMATION SCIENCE AND TECHNOLOGY Co Ltd
Priority to CNA2006101181508A priority Critical patent/CN101178786A/en
Publication of CN101178786A publication Critical patent/CN101178786A/en
Pending legal-status Critical Current

Links

Images

Abstract

1)The invention comprises cooperated online essay management facing the whole life period of an essay, utilizing the Word developing environment to realize the automatic format checking of the essay, utilizing the network essay resource to realize an automatic cribbing checking method, a mechanism and software realizing method of the essay. The system based on the invention consists of a computer with internet, an automatic cribbing checking software, an automatic format checking software, and a submitting and reviewing software of the essay; the current implementing example is the Fudan University. The system realized by the invention aims at the professional operation flow of the whole life period of the essay, and supports the cooperated online management of essay creating communication, online submitting, reviewing and management. The invention aims at reducing the working time and errors of managers, reviewers and journalists so as to save redundant essay printing fee, traveling expense, telephone fee and mail carrying charge, etc., and improve the work efficiency. 2) Based on the open network literature resource, the library online resource and the system self-establishing essay resource, the automatic cribbing checking function of the system is realized through utilizing the computer software to obtain, analyze and check the literature. Based on the essay cyclostyle of an essay checking unit, the automatic format checking function of the system is realized through utilizing the computer software to obtain the essay cyclostyle and the format of the submitted essay and executing comparison so as to output an inconsistent (wrong) format item.

Description

A kind of online dissertation management method that utilizes Internet resources to realize plagiarization and format checking
Affiliated technical field:
Patent of the present invention relates to education and submits to and evaluation with the paper of publishing business.
Background technology
Patent of the present invention is towards magazinist portion, universities and colleges and international conference, for their editor, contributor, responsible reader and managerial personnel provide a computer system, supports their online collaborative work.This system checks that by an online computer, automatic plagiarism check software, automatic format software and paper are submitted to and evaluation software is formed.This system is towards the paper Life cycle, support submission person, reviewer, paper supvr, unit management person, supervisor's polygonal look rights management and personalized professional tool case, support their online collaborative work, comprise the paper creation, exchange and evaluation, the progress real-time statistics, automatically notice, directed mass-sending, functions such as data filing.Purpose be working time of reducing managerial personnel, reviewer and contributor with exchanges in error, save unnecessary paper printing expense, traveling expense, telephone charge, postal delivery expense etc., increase work efficiency.
At present, examination, text plagiarized have been had some algorithms and to have detected method for supervising, for example patent 1 " a kind of method of utilizing computer program detected electrons text to plagiarize " and patent 2 " based on the anti-reprinting of computer network or/and the anti-method for supervising of plagiarizing ".Yet these method essence are based on the detection of static text feature database.With patent 1 relatively, and the plagiarism check of this patent is the robotization of manual plagiarism check in the present paper evaluation course, is to possess under the condition of computer interface, support software remote access and operation just in the cards in network paper resource.Therefore and target, use occasion, the technological means of above-mentioned method 1 different, just analyzes this link and have certain similarity carrying out to plagiarize, and this link is not the principal character of this patent.Patent 2 is a kind of a kind of computer system implementation method of carrying out the data comparison at static, particular document, do not relate to the use of Internet resources, do not have plagiarism check mechanism and algorithm yet, on target, application scenario and technological means, all do not have the feature of this patent.
There have been some paper management systems in international conference both domestic and external, editorial office, colleges and universities, but the major function of these systems are papers long-range submission and evaluation, the function that does not all have automatic format inspection and automatic plagiarism check does not have polygonal look rights management and Workflow Management automatically as native system yet.Managerial personnel and reviewer need the evaluation of manual coordination paper, and hand inspection paper form is manually consulted various documents and judged whether to be accused of to plagiarize.
Summary of the invention
Patent of the present invention is towards magazinist portion, universities and colleges and international conference, for their editor, contributor, responsible reader and managerial personnel provide an arbitrage with computer system automatically perform format checking, automatically plagiarism check, support the method and mechanism of their online collaborative work.System based on this patent realizes by an online computer, plagiarism check software, automatic format check that software and paper are submitted to and evaluation software is formed automatically.This system is towards the paper Life cycle, support submission person, reviewer, paper supvr, unit management person, supervisor's polygonal look rights management and personalized professional tool case, support their online collaborative work, comprise the paper creation, exchange and evaluation, the progress real-time statistics, automatically notice, directed mass-sending, functions such as data filing.Purpose be working time of reducing managerial personnel, reviewer and contributor with exchanges in error, save unnecessary paper printing expense, traveling expense, telephone charge, postal delivery expense etc., increase work efficiency.
Technical scheme
The composition of 1 paper management and mechanism
The whole thesis management is divided into 5 parts, on computer system realizes:
1) front end is to submit the interface to the paper of different role user interactions.Each role's user enters the interface, tool box of oneself by its login system.The all evaluation reciprocal process all EMAIL that automatically sends of adjoint system is carried out interconnected beyond the system boundary, makes things convenient for teachers and students to link up.All roles and paper all manage by the administrator is unified in the system.Paper submits to platform to set up paper from being submitted to the complete workflow of evaluation, and the procedural informationization that this is complicated has realized the with no paper running of this flow process.
2) server end is moving paper automatically and is checking service middleware, the paper information that it submits to platform to submit to up according to paper, to checking that the Web Services that paper form, paper are plagiarized proposes call request, and check result is organized into report form deposits in the database, paper submits to the interface that report is showed the user, and the user can inquire the up-to-date audit report of paper.
3) paper checks that service middleware is responsible for and two Web Services of inspection paper get in touch.
4) effect of paper format checking and plagiarism check Web Services is to accept the call request that paper checks that service middleware sends, receive paper and check the paper of the requirement inspection that service middleware sends over, document storage in the Shared Folders (Samba shares) that own and kernel program can both be visited, and is called kernel program and begun to check.After inspection finishes, the result is returned to paper check service middleware at last.These two Web Services receive a new request at every turn and just create out a new thread and be responsible for processing, and this process at first converts document to the file of TXT form, read whole file then, with the sentence is that unit preserves, and after checking respectively, provides whole report.The form of report is to generate a check result array, and with statistics, consuming time and result that be checked through writes in this array and returns.
5) last part is a search engine, and this is the Web search engine that the third party provides, and comprises the interface of Google API and other paper searching systems.
2 functional characters and function design
1) functional character of paper administrative section:
◆ online collaborative work, roles such as the student that thesis relates to, tutor, evaluation committee member, managerial personnel, person in charge president, the chairman of degree sub-committee are collaborative online, paperless office, the hardship of remove that paper is printed repeatedly, transmitted from, the loaded down with trivial details work of modification etc. and process data being lost.
◆ the professional work interface, each role's login system is own interface, needed statistics and the Work tool of being familiar with.
◆ perfect rights management, opening the action of proposing in report and the paper business according to different role, perfect operating right management is arranged.
◆ data progress statistics, enter system, different role can be seen own every statistics of paying close attention to, as data such as presenting papers number, paper states.
◆ line personnel management, managerial personnel can the various roles of online management, evaluate management.
◆ the directed mass-sending of notice, according to the needs of communicating by letter in the paper management, for different role provides corresponding orientation notice mass-sending function respectively.The reception staff not only can see the message that is sent out in system, also can receive evaluation simultaneously and wait the job placement notice in the Email of oneself.
◆ complete flow process support, support is proposed report from opening, the expert is definite, report is checked, paper is submitted to, evaluate distribution, paper is evaluated, comment is filled in up to the whole flow process by reply.
◆ the bulk print function, support to open and propose report, academic dissertation, paper and read and appraise the batch output of book etc., print, convenient management and filing.
2) major function of this patent design:
Figure A20061011815000041
Figure A20061011815000051
Figure A20061011815000081
3) paper is checked service middleware
◆ accept the paper that the user submits to
◆ accept the document of Word form
◆ send call request to the WS that checks the paper form
◆ to checking that the WS that paper is plagiarized sends call request
◆ the result of receiving check, return to the user with check result
4) check the Web Services of two of papers
◆ accept user's call request
◆ receive the document of the needs inspection of uploading
◆ safeguard Shared Folders (Samba shares) with core energy common access
◆ activate the program of core, begin to check (Sockets communication)
◆ check result is fed back to paper submission website
Kernel program:
◆ accept the call instruction of WS, creating new thread process should request
◆ is the TEXT file with document by the Word format conversion
◆ read the masterplate form
◆ read form in full, and contrast with masterplate
◆ read and deposit array in full in
◆ sentence is carried out participle, and generate corresponding URL request
◆ mutual with Google API, obtain return message
◆ read return message, the operation alignment algorithm
◆ add up final result, generate the check result report
5) search engine
◆ keyword is encapsulated in the URL request goes
◆ from the html page that returns, obtain the result of inquiry
3 format checking technical schemes
The automatic format audit function of system is based on the paper template of paper inspection unit, obtains the form of paper template and institute's presenting papers by computer software, carries out comparison, exports inconsistent (mistake) form project.
The realization of automatic format inspection mainly is based on Word file and secondary development environment thereof.The paper data of non-Word document at first will be converted into the Word form.
Tens kinds of patterns that carry are arranged among the Office Word, and can be by the new pattern of User Defined, and every kind of pattern is made of a series of daughter element, element definition as title one is: " text+paragraph outline: 1. paragraph upper end spacing: 17 pounds; paragraph lower end spacing: 16.5 pounds; line-spacing: many times; line-spacing size: 2.4 words are capable; keep lines together, with lower end same page, runic; font size: No. two, character-spacing is adjusted No. two ", and these elements are represented with a series of Word customized labels, rough Statistics has hundreds of kind element.Use Word API that word is changed into the XML file, all format items are all brought into play its effect with the form of XML label.This is through the XML after the middle layer conversion, so Chinese tag is arranged, if direct Zhuan Huan XML, its tag be English entirely, like this for just very difficult its expression implication of understanding of some remoter tag.
In this programme, comparatively common tag has been defined semantic base, the rudimentary algorithm of format checking is as follows:
◆ extract all patterns and elements corresponding definition thereof in the standard masterplate
◆ extract all patterns and the elements corresponding definition thereof submitted in the document
◆ check its Apply Styles sentence by sentence to submitting document to
◆ if its Apply Styles exists in the masterplate pattern, then contrast its element whether with masterplate in the element of this pattern conform to one by one, then be not summed up as pattern mistake be set if be inconsistent
If its Apply Styles does not exist in the masterplate pattern, then prompt for applied pattern and do not exist, please proofread and correct.
4 plagiarism check technical schemes
The automatic plagiarism check function of system is based on the open self-built paper resource of network document resource, library's online resource, system, utilizes computer software to obtain document, decomposes document, checks that document realizes.
The search word string that existing in the world search engine can be handled all is no more than 20 individual characters (word), and the words search precision that surpasses will reduce greatly.If the key word of input is limited, then lost the globality of sentence, can't judge the phase knowledge and magnanimity of its sentence, only be that the individual words acquaintance can not say something.Particularly in long paper, indiscriminately imitate than entire chapter is large-area, plagiarizing more susceptible condition is splicing.The plagiarism person usually mixes words and phrases in the original text or viewpoint, and illustrates with oneself language and viewpoint; Or from different data source, find and oneself need, do not indicate the source, be spliced into the thing of oneself; Perhaps will indicate blending into one of quoting with the unreceipted literal of quoting.
At first we will extract as key elements such as title, author, summaries the easiest similar keyword.If the word document of submitting to has these attributes in file attribute, that can directly extract by Word API, if Word file filename does not comprise relevant information, and paper title so, the author, information such as writer identity this how to obtain not be a nothing the matter.Because not all paper all is to write according to the consolidation form of strictness, even standard format is arranged, these standards neither be unique, and this just causes above-mentioned information can not be simply to determine by locating.Our disposal route is (is example with author's name): author's name is to exist with independent morpheme, that is to say that problem is converted into, and how to judge whether it is the author's name when obtaining a morpheme.1, this morpheme should satisfy the nomenclature of name, and promptly the characteristic from morpheme itself starts with.For example surname has certain span, and name length also has certain limit simultaneously.Any one morpheme can obtain a weights c according to the rule of characteristic.2, consider that in position in full promptly the location from morpheme starts with from morpheme.For example the author's name only may appear in document front or the last certain limit, and this positional value may be absolute position or percent position, and should be defined as a scope.Equally, each morpheme all obtains a weights l.3, consider that from the context at morpheme place promptly the context from morpheme starts with.For example should unit that describe the author before and after the author's name, information such as numbering, when information conforms, this morpheme should obtain higher weights t.To sum up, a morpheme can obtain a comprehensive weights q at last, just can be considered to target when the q value reaches certain level.
The variation of Google is to begin to adopt the stem method to carry out search service.In other words, if key in " dietary " in the search box, so also can obtain some Search Results based on " diet ".That is to say that it no longer adheres rigidly to search and the identical word of input inquiry, also can provide the Search Results close with some or all query terms to the user.For example, for query terms " pet lemur dietary needs ", Google also can show the Search Results of " pet lemur diet needs " and other conjugations form.And for query word that occurs on the search results pages itself and different versions thereof, Google can be with the outstanding expression of runic." so we adopt sentence by sentence formula search, are that decollator is cut apart text with the comma earlier, then playlet are carried out the validity screening, remove some versatility statements, the sentence that can not say something as " thanks ", " by the way " etc.Remaining valid sentence subrecord in array, is used as the input keyword that calls Google search.The realization of the anti-algorithm of plagiarizing.The quality of algorithm and the situation of programming are the effect of the whole checkout procedure of decision and the key of efficient.Because the article style of Chinese and English, word, punctuation mark all is differentiated, can develop any language hardly and can both cover all, and all good especially again algorithm of efficient.So carrying out algorithm at English paper, we realize.
Our algorithm is based on a hypothesis: the paper of being accused of plagiarizing at least all is indiscriminately imitating of one whole section content.The appearance of sentence may be because the author has the article of quoting other people separately, rather than says plagiarization, but exists a lot of sentences to be found, or even the content of the big section of section is found greatly, must illustrate that then the author has plagiarized other people paper.
The anti-algorithm of plagiarizingDesign as follows:
◆ receive paper, be converted to the TXT form
◆ sentence identification ﹠amp; Word identification
◆ simple sentence search ﹠amp; Random start
◆ Bi Duijieguo ﹠amp; " Check Next " strategy
◆ repeat 3,4 steps as requested
◆ result's statistics
Because design the time is considered that article is long and may be consumed long time, so be provided with an inspection intervals on the basis of checking in full, for example, selecting inspection intervals is 10, so every 10 word, can check that once individual problem is arranged like this, be exactly how fixed starting point is, in this interval of 1-10, any begin to check that we have adopted the method for random number, can select the position that a number is checked to start with in the scope of interval starting point at every turn from a word.So just can guarantee the randomness of inspection, and be unlikely to all can only be checked through the same part of article at every turn.If whole word can search on google, the words is to plagiarize certainly so '.If there are the plagiarism of a large amount of whole sentences in entire article or whole section article, and the source concentrates on again in several pieces of articles, can judge that so certainly article plagiarizes.The benefit of doing like this is as long as whereabouts Google has inquired about, and do not spend contrast is not in twos done in the article that calls in, and relatively saves time.
Chosen that to search behind the sentence be exactly to call Google API to search for.
The Google search engine provides the WebService based on SOAP.This means that different development languages, development environment can both use this service, in addition, Google is in order to simplify java applet person's exploitation, it also provides a cover Java api interface, be used to visit WebSerivce, this makes exploitation one cover support the program of Google function of search to become very easy.The developer can be embedded into the function of search of Google in the middle of their application program.
At present, the API of Google still is in the Beta stage, therefore, also has some restrictions in the use of API.For example, use these API need apply for an account number.For the account number of free application,, limit each account number, every day and can only inquire about at most 1000 times in order to prevent the wrongful use of developer.At present, Google does not also propose formal operation mode and charge method.
Google API mainly provides three kinds of services at present, and is as follows respectively:
1) search service.The user submits a search content request to, and the Google server will be searched and surpass 2,000,000,000 web page contents, and the target of meeting consumers' demand is returned to the user, and in general this processing only needs several seconds time.
2) buffer memory service (Cache).The user submits a URL to, and the Google server will return the state-of-the-art record that searcher is visited this URL.Like this, the user is easy to the search achievement of multiplexing Google.
3) spell check.This function is used to check whether user's query requests is effective.The user submits a scope of examination to, and the Google server will return effectively, near the query requests of this content, and the query requests of returning will meet the rule searching of Google.
Google provides the WebService based on SOAP, so the user can be to the query requests of Google server submission SOAP, and then the Google server will be handled these requests, and return the Query Result of SOAP form.
With Google API mutual after, can obtain the return results of Google, at this moment we need carry out the parsing of content, judge at first whether what is not all found, if like this, this inquiry just is through with.If the content of inquiring so just reads the content of finding and that sentence of inquiring about is compared, if find that both are identical or similar, so just get off content record.
Plagiarism has following several mode:
1, copy word by word and do not miss, equal COPY, have in addition punctuation mark do not let slip yet.This is a kind of rudimentary plagiarism, and this situation is easy to detect, but uncommon.
2, plagiarize others' core viewpoint, be placed in the langue of oneself.This plagiarism is comparatively common.
3, take over others theoretical model or framework, method, theorem, add some narrations again.This plagiarism is also more.
4, rewrite others' literal, add some views of oneself, but make the people feel no new meaning after reading over, and can find the source prototype of article.
5, it is dishonest to quote others achievement, adds datedly as some place, and does not indicate in some place, just rewrites, and quantitatively length is bigger.Or quote data that others quoted in a large number and do not illustrate and quote, also belong to a kind of plagiarization to others' fruit of labour, be more common act of plagiarism.
In order to distinguish this several plagiarisms, in design, we adopt the method for similarity weights.
Standard is as follows:
Check criteria The similarity weights
Just the same 1
On the duplicate basis how or lacked a word 0.9
But the just the same order of word has transposing 0.8
A sentence has 70% the same and order of word 0.7
Other 0.7 below
In our suspection record, do not write down the record below 0.7 at present.
In the result that Google returns, the degree of association according to search word has sorted, that is to say that beginning from article one that reading and recording compares is most possibly to find similar sentence, and it is the one section word that comprises keyword in the file that searches that individual summary field is arranged in the result who returns, so compare just passable with the summary field.If these words have been found so, it next should also can as long as we compare down in short again, can be hit on the so very big degree in the return results of Google so.So just can reduce one query.This is exactly the strategy of Check Next.This strategy is for plagiarizing serious more paper, and the speed of inspection can be fast more.
Document one is from first morpheme, and each morpheme all compares Report object addition element when finding coupling in proper order with each morpheme of document two.Finish after all couplings, call the summarizing () function of Report class and conclude, the continuous coupling of adjacent element is merged.In order to save search time, we only get preceding 10 of Search Results and compare.After checking out in short, under taking out, in short check that all check up to the sentence of all these inspections so repeatedly to have finished, query script also just is through with so then.Next be the generation of some statisticss, and the expression-form of report, the support of multiple matching way.
The patent effect
This patent is towards magazinist portion, universities and colleges and international conference, for their editor, contributor, responsible reader and managerial personnel provide a computer system, supports their online collaborative work.This system checks that by an online computer, automatic plagiarism check software, automatic format software and paper are submitted to and evaluation software is formed.This system is towards the paper Life cycle, support the paper creation and exchange, towards submission person, reviewer, paper supvr, unit management person, supervisor's polygonal look rights management and personalized professional tool case, the progress real-time statistics, automatically notice, directed mass-sending, functions such as data filing.Purpose be working time of reducing managerial personnel, reviewer and contributor with exchanges in error, save unnecessary paper printing expense, traveling expense, telephone charge, postal delivery expense etc., increase work efficiency.The patent case shows, this system can exempt the workload of managerial personnel and teachers and students 60-80%, reducing unnecessary time energy consumes, reduce expenses 10,000-more than 60,000 yuan every year, increase work efficiency more than 30%, everybody is freed from work such as trifling, miscellaneous paper creation, submission, modification, evaluation, management.
Description of drawings
Introduce this patent embodiment below in conjunction with caption.
Fig. 1 is the computer system composition diagram of this patent.
Fig. 2 is the system software structure of this patent.
Fig. 3 is that the paper of this patent is checked service middleware class illustrated example.
Fig. 4 is the User login interface among this patent embodiment.
Fig. 5 is the paper managerial personnel interface among this patent embodiment.
Fig. 6 is the paper format checking interface among this patent embodiment.
Fig. 7 is the paper plagiarism check interface among this patent embodiment.
Embodiment
This patent is towards magazinist portion, universities and colleges and international conference, for their editor, contributor, responsible reader and managerial personnel provide an arbitrage with computer system automatically perform format checking, automatically plagiarism check, support the method and mechanism of their online collaborative work.System embodiment based on this patent institute describing method and mechanism is the academic dissertation submission and the estimating and examining system of Fudan University.
The composition of 1 paper management and mechanism
As shown in Figure 1, the whole thesis management is divided into 5 parts, on computer system realizes:
1) front end is to submit the interface to the paper of different role user interactions.Each role's user enters the interface, tool box of oneself by its login system.The all evaluation reciprocal process all EMAIL that automatically sends of adjoint system is carried out interconnected beyond the system boundary, makes things convenient for teachers and students to link up.All roles and paper all manage by the administrator is unified in the system.Paper submits to platform to set up paper from being submitted to the complete workflow of evaluation, and the procedural informationization that this is complicated has realized the with no paper running of this flow process.
2) server end is moving paper automatically and is checking service middleware, the paper information that it submits to platform to submit to up according to paper, to checking that the Web Services that paper form, paper are plagiarized proposes call request, and check result is organized into report form deposits in the database, paper submits to the interface that report is showed the user, and the user can inquire the up-to-date audit report of paper.
3) paper checks that service middleware is responsible for and two Web Services of inspection paper get in touch.
4) effect of paper format checking and plagiarism check Web Services is to accept the call request that paper checks that service middleware sends, receive paper and check the paper of the requirement inspection that service middleware sends over, document storage in the Shared Folders (Samba shares) that own and kernel program can both be visited, and is called kernel program and begun to check.After inspection finishes, the result is returned to paper check service middleware at last.These two Web Services receive a new request at every turn and just create out a new thread and be responsible for processing, and this process at first converts document to the file of TXT form, read whole file then, with the sentence is that unit preserves, and after checking respectively, provides whole report.The form of report is to generate a check result array, and with statistics, consuming time and result that be checked through writes in this array and returns.
5) last part is a search engine, and this is the Web search engine that the third party provides, and comprises the interface of Google API and other paper searching systems.
2 functional characters and function design
Fig. 4 is the User login interface among this patent embodiment.Fig. 5 is the paper managerial personnel interface among this patent embodiment.
1) functional character of paper administrative section
◆ online collaborative work, roles such as the student that thesis relates to, tutor, evaluation committee member, managerial personnel, person in charge president, the chairman of degree sub-committee are collaborative online, paperless office, the hardship of remove that paper is printed repeatedly, transmitted from, the loaded down with trivial details work of modification etc. and process data being lost.
◆ the professional work interface, each role's login system is own interface, needed statistics and the Work tool of being familiar with.
◆ perfect rights management, opening the action of proposing in report and the paper business according to different role, perfect operating right management is arranged.
◆ data progress statistics, enter system, different role can be seen own every statistics of paying close attention to, as data such as presenting papers number, paper states.
◆ line personnel management, managerial personnel can the various roles of online management, evaluate management.
◆ the directed mass-sending of notice, according to the needs of communicating by letter in the paper management, for different role provides corresponding orientation notice mass-sending function respectively.The reception staff not only can see the message that is sent out in system, also can receive evaluation simultaneously and wait the job placement notice in the Email of oneself.
◆ complete flow process support, support is proposed report from opening, the expert is definite, report is checked, paper is submitted to, evaluate distribution, paper is evaluated, comment is filled in up to the whole flow process by reply.
◆ the bulk print function, support to open and propose report, academic dissertation, paper and read and appraise the batch output of book etc., print, convenient management and filing.
2) major function of this patent design
Figure A20061011815000151
Figure A20061011815000161
Figure A20061011815000171
3) paper is checked service middleware
◆ accept the paper that the user submits to
◆ accept the document of Word form
◆ send call request to the WS that checks the paper form
◆ to checking that the WS that paper is plagiarized sends call request
◆ the result of receiving check, return to the user with check result
4) check the Web Services of two of papers
◆ accept user's call request
◆ receive the document of the needs inspection of uploading
◆ safeguard Shared Folders (Samba shares) with core energy common access
◆ activate the program of core, begin to check (Sockets communication)
◆ check result is fed back to paper submission website
Kernel program:
◆ accept the call instruction of WS, creating new thread process should request
◆ is the TEXT file with document by the Word format conversion
◆ read the masterplate form
◆ read form in full, and contrast with masterplate
◆ read and deposit array in full in
◆ sentence is carried out participle, and generate corresponding URL request
◆ mutual with Google API, obtain return message
◆ read return message, the operation alignment algorithm
◆ add up final result, generate the check result report
5) search engine
◆ keyword is encapsulated in the URL request goes
◆ from the html page that returns, obtain the result of inquiry
3 format checking technical schemes
Fig. 6 is the paper format checking interface among this patent embodiment.
The automatic format audit function of system is based on the paper template of paper inspection unit, obtains the form of paper template and institute's presenting papers by computer software, carries out comparison, exports inconsistent (mistake) form project.
The realization of automatic format inspection mainly is based on Word file and secondary development environment thereof.The paper data of non-Word document at first will be converted into the Word form.
Tens kinds of patterns that carry are arranged among the Office Word, and can be by the new pattern of User Defined, and every kind of pattern is made of a series of daughter element, element definition as title one is: " text+paragraph outline: 1. paragraph upper end spacing: 17 pounds; paragraph lower end spacing: 16.5 pounds; line-spacing: many times; line-spacing size: 2.4 words are capable; keep lines together, with lower end same page, runic; font size: No. two, character-spacing is adjusted No. two ", and these elements are represented with a series of Word customized labels, rough Statistics has hundreds of kind element.Use Word API that word is changed into the XML file, form is as follows:
<?xml?version=″1.0″encoding=″UTF-8″?>
<uof:UOF?xmlns:uof=″http://schemas.uof.org/cn/2003/uof″
xmlns:=″http://schemas.uof.org/cn/2003/graph″
xmlns:=″http://schemas.uof.org/cn/2003/uof-wordproc″
xmlns:=″http://schemas.uof.org/cn/2003/uof-spreadsheet″
xmlns:=″http://schemas.uof.org/cn/2003/uof-slideshow″
xmlns:xsi=″http://www.w3.org/2001/XMLSchema-instance″
xsi:schemaLocation=″http://schemas.uof.orrg/cn/2003/uof
D:\UOF\uof_schema\uof.xsd″uof:language=″cn″uof:version=″1.0″
uof:locID=″u0000″>
<uof: metadata uof:locID=" u0001 " 〉
<uof: title uof:locID=" u0002 "〉based on the paper of Google web service make an inventory the system of attacking</uof: title
<uof: founder uof:locID=" u0004 "〉</uof: the founder 〉
<uof: last author uof:locID=" u0006 " 〉
</uof: last author 〉
<uof: date created uof:locID=" u0008 "〉2006-05-30T02:00:00</uof: date created 〉
<uof: editor number of times uof:locID=" u0009 "〉31</uof: editor's number of times 〉
<uof: edit session uof:locID=" u0010 "〉P0Y0M0DT0H55M6S</uof: edit session 〉
<uof: create application program uof:locID=" u0011 "〉EIOffice 2007</uof: create application program 〉
<uof: Business Name uof:locID=" u0018 "〉Software Fudan.U</uof: Business Name 〉
</uof: metadata 〉
<uof: bookmark collection uof:locID=" u0027 " 〉
<uof: bookmark uof:locID=" u0028 " uof:attrList=" title " uof: title=" _ Toc138184650 " 〉
<uof: text position uof:locID=" u0029 " uof:attrList=" quote in the zone " word: the zone quotes=" bk_Toc138184650 "/
</uof: bookmark 〉
</uof: bookmark collection 〉
<uof: object set uof:locID=" u0033 " 〉
<uof: other objects uof:locID=" u0036 " uof:attrList=" the embedded common type private type of identifier " uof: identifier=" OBJ00002 " uof: embedded=" false "
Uof: common type=" png " 〉
<uof: data uof:locID=" u0037 " 〉
</uof: data 〉
</uof: other objects 〉
<figure: figure uof:locID=" g0000 " uof:attrList=" level identifier combination tabulate other objects " figure: level=" 4 " figure: identifier=" OBJ00001 " figure: other objects=" OBJ00002 " 〉
<figure: predefine figure uof:locID=" g0005 " 〉
<figure: classification uof:locID=" g0006 "〉11</figure: classification 〉
<figure: title uof:locID=" g0007 "〉Rectangle</figure: title 〉
<figure: generate software uof:locID=" g0008 "〉EIOffice</figure: generate software 〉
<figure: attribute uof:locID=" g0011 " 〉
<figure: line style uof:locID=" g0014 "〉single</figure: line style 〉
<figure: line thickness uof:locID=" g0016 "〉0.75</figure: the line thickness 〉
<figure: width uof:locID=" g0023 "〉36.00055</figure: width 〉
<figure: height uof:locID=" g0024 "〉36.00055</figure: highly 〉
<figure: anglec of rotation uof:locID=" g0025 "〉0.0</figure: the anglec of rotation 〉
<figure: locking aspect ratio uof:locID=" g0028〉1</figure: the locking aspect ratio 〉
<figure: print object uof:locID=" g0032 "〉true</figure: print object 〉
<figure: Web literal uof:locID=" g0033 "〉Graph</figure: the Web literal 〉
</figure: attribute 〉
</figure: predefine figure 〉
</figure: figure 〉
Can see that all format items are all brought into play its effect with the form of XML label.This is still through the XML after the middle layer conversion, so Chinese tag is arranged, if direct Zhuan Huan XML, its tag be English entirely, like this for just very difficult its expression implication of understanding of some remoter tag.
In native system, comparatively common tag has been defined semantic base, the rudimentary algorithm of format checking is as follows:
◆ extract all patterns and elements corresponding definition thereof in the standard masterplate
◆ extract all patterns and the elements corresponding definition thereof submitted in the document
◆ check its Apply Styles sentence by sentence to submitting document to
◆ if its Apply Styles exists in the masterplate pattern, then contrast its element whether with masterplate in the element of this pattern conform to one by one, then be not summed up as pattern mistake be set if be inconsistent
◆ if its Apply Styles does not exist in the masterplate pattern, then prompt for applied pattern and do not exist, please proofread and correct.
The difficult point of this part service is, after Word changes into xml, its hierarchical structure is quite complicated, especially the element that is provided with in the pattern is too much or when existing complicate list form, the form dependency structure of a short sentence can reach about 10 layers, and the enormous amount of element tags, locate statement and accurately extraction element tags exactly, be very time-consuming and account for resource, if the document of submitting to excessive (generally greater than 2M) will be in operational process because of the resource exhaustion termination that makes mistakes.So native system adopts segment transmissions word, the segmentation inspection, the method that merges audit report at last solves this problem.
4 plagiarism check technical schemes
Fig. 7 is the paper plagiarism check interface among this patent embodiment.
The automatic plagiarism check function of system is based on the open self-built paper resource of network document resource, library's online resource, system, utilizes computer software to obtain document, decomposes document, checks that document realizes.
The search word string that existing in the world search engine can be handled all is no more than 20 individual characters (word), and the words search precision that surpasses will reduce greatly.If the key word of input is limited, then lost the globality of sentence, can't judge the phase knowledge and magnanimity of its sentence, only be that the individual words acquaintance can not say something.Particularly in long paper, indiscriminately imitate than entire chapter is large-area, plagiarizing more susceptible condition is splicing.The plagiarism person usually mixes words and phrases in the original text or viewpoint, and illustrates with oneself language and viewpoint; Or from different data source, find and oneself need, do not indicate the source, be spliced into the thing of oneself; Perhaps will indicate blending into one of quoting with the unreceipted literal of quoting.
So how to send the difficult point that rational search key is this project to Google web service.
At first we will extract as key elements such as title, author, summaries the easiest similar keyword.If the word document of submitting to has these attributes in file attribute, that can directly extract by Word API, if Word file filename does not comprise relevant information, and paper title so, the author, information such as writer identity this how to obtain not be a nothing the matter.Because not all paper all is to write according to the consolidation form of strictness, even standard format is arranged, these standards neither be unique, and this just causes above-mentioned information can not be simply to determine by locating.Our disposal route is (is example with author's name): author's name is to exist with independent morpheme, that is to say that problem is converted into, and how to judge whether it is the author's name when obtaining a morpheme.1, this morpheme should satisfy the nomenclature of name, and promptly the characteristic from morpheme itself starts with.For example surname has certain span, and name length also has certain limit simultaneously.Any one morpheme can obtain a weights c according to the rule of characteristic.2, consider that in position in full promptly the location from morpheme starts with from morpheme.For example the author's name only may appear in document front or the last certain limit, and this positional value may be absolute position or percent position, and should be defined as a scope.Equally, each morpheme all obtains a weights l.3, consider that from the context at morpheme place promptly the context from morpheme starts with.For example should unit that describe the author before and after the author's name, information such as numbering, when information conforms, this morpheme should obtain higher weights t.To sum up, a morpheme can obtain a comprehensive weights q at last, just can be considered to target when the q value reaches certain level.
The variation of Google is to begin to adopt the stem method to carry out search service.In other words, if key in " dietary " in the search box, so also can obtain some Search Results based on " diet ".That is to say that it no longer adheres rigidly to search and the identical word of input inquiry, also can provide the Search Results close with some or all query terms to the user.For example, for query terms " pet lemur dietary needs ", Google also can show the Search Results of " pet lemur diet needs " and other conjugations form.And for query word that occurs on the search results pages itself and different versions thereof, Google can be with the outstanding expression of runic." so we adopt sentence by sentence formula search, are that decollator is cut apart text with the comma earlier, then playlet are carried out the validity screening, remove some versatility statements, the sentence that can not say something as " thanks ", " by the way " etc.Remaining valid sentence subrecord in array, is used as the input keyword that calls Google search.The realization of the anti-algorithm of plagiarizing.The quality of algorithm and the situation of programming are the effect of the whole checkout procedure of decision and the key of efficient.Because the article style of Chinese and English, word, punctuation mark all is differentiated, can develop any language hardly and can both cover all, and all good especially again algorithm of efficient.So carrying out algorithm at English paper, we realize.
Our algorithm is based on a hypothesis: the paper of being accused of plagiarizing at least all is indiscriminately imitating of one whole section content.The appearance of sentence may be because the author has the article of quoting other people separately, rather than says plagiarization, but exists a lot of sentences to be found, or even the content of the big section of section is found greatly, must illustrate that then the author has plagiarized other people paper
The anti-algorithm of plagiarizingDesign as follows:
◆ receive paper, be converted to the TXT form
◆ sentence identification ﹠amp; Word identification
◆ simple sentence search ﹠amp; Random start
◆ Bi Duijieguo ﹠amp; " Check Next " strategy
◆ repeat 3,4 steps as requested
◆ result's statistics
Because design the time is considered that article is long and may be consumed long time, so be provided with an inspection intervals on the basis of checking in full, for example, selecting inspection intervals is 10, so every 10 word, can check that once individual problem is arranged like this, be exactly how fixed starting point is, in this interval of 1-10, any begin to check that we have adopted the method for random number, can select the position that a number is checked to start with in the scope of interval starting point at every turn from a word.So just can guarantee the randomness of inspection, and be unlikely to all can only be checked through the same part of article at every turn.If whole word can search on google, the words is to plagiarize certainly so '.If there are the plagiarism of a large amount of whole sentences in entire article or whole section article, and the source concentrates on again in several pieces of articles, can judge that so certainly article plagiarizes.The benefit of doing like this is as long as whereabouts Google has inquired about, and do not spend contrast is not in twos done in the article that calls in, and relatively saves time.
Chosen that to search behind the sentence be exactly to call Google API to search for.
The Google search engine provides the WebService based on SOAP.This means that different development languages, development environment can both use this service, in addition, Google is in order to simplify java applet person's exploitation, it also provides a cover Java api interface, be used to visit WebSerivce, this makes exploitation one cover support the program of Google function of search to become very easy.The developer can be embedded into the function of search of Google in the middle of their application program.
At present, the API of Google still is in the Beta stage, therefore, also has some restrictions in the use of API.For example, use these API need apply for an account number.For the account number of free application,, limit each account number, every day and can only inquire about at most 1000 times in order to prevent the wrongful use of developer.At present, Google does not also propose formal operation mode and charge method.
Google API mainly provides three kinds of services at present, and is as follows respectively:
1) search service.The user submits a search content request to, and the Google server will be searched and surpass 2,000,000,000 web page contents, and the target of meeting consumers' demand is returned to the user, and in general this processing only needs several seconds time.
2) buffer memory service (Cache).The user submits a URL to, and the Google server will return the state-of-the-art record that searcher is visited this URL.Like this, the user is easy to the search achievement of multiplexing Google.
3) spell check.This function is used to check whether user's query requests is effective.The user submits a scope of examination to, and the Google server will return effectively, near the query requests of this content, and the query requests of returning will meet the rule searching of Google.
Google provides the WebService based on SOAP, so the user can be to the query requests of Google server submission SOAP, and then the Google server will be handled these requests, and return the Query Result of SOAP form.
With Google API mutual after, can obtain the return results of Google, at this moment we need carry out the parsing of content, judge at first whether what is not all found, if like this, this inquiry just is through with.If the content of inquiring so just reads the content of finding and that sentence of inquiring about is compared, if find that both are identical or similar, so just get off content record.
Plagiarism has following several mode:
1, copy word by word and do not miss, equal COPY, have in addition punctuation mark do not let slip yet.This is a kind of rudimentary plagiarism, and this situation is easy to detect, but uncommon.
2, plagiarize others' core viewpoint, be placed in the langue of oneself.This plagiarism is comparatively common.
3, take over others theoretical model or framework, method, theorem, add some narrations again.This plagiarism is also more.
4, rewrite others' literal, add some views of oneself, but make the people feel no new meaning after reading over, and can find the source prototype of article.
5, it is dishonest to quote others achievement, adds datedly as some place, and does not indicate in some place, just rewrites, and quantitatively length is bigger.Or quote data that others quoted in a large number and do not illustrate and quote, also belong to a kind of plagiarization to others' fruit of labour, be more common act of plagiarism.
In order to distinguish this several plagiarisms, in design, we adopt the method for similarity weights.
Standard is as follows:
Check criteria The similarity weights
Just the same 1
On the duplicate basis how or lacked a word 0.9
But the just the same order of word has transposing 0.8
A sentence has 70% the same and order of word 0.7
Other 0.7 below
In our suspection record, do not write down the record below 0.7 at present.
In the result that Google returns, the degree of association according to search word has sorted, that is to say that beginning from article one that reading and recording compares is most possibly to find similar sentence, and it is the one section word that comprises keyword in the file that searches that individual summary field is arranged in the result who returns, so compare just passable with the summary field.If these words have been found so, it next should also can as long as we compare down in short again, can be hit on the so very big degree in the return results of Google so.So just can reduce one query.This is exactly the strategy of Check Next.This strategy is for plagiarizing serious more paper, and the speed of inspection can be fast more.
Document one is from first morpheme, and each morpheme all compares Report object addition element when finding coupling in proper order with each morpheme of document two.Finish after all couplings, call the summarizing () function of Report class and conclude, the continuous coupling of adjacent element is merged.In order to save search time, we only get preceding 10 of Search Results and compare.After checking out in short, under taking out, in short check that all check up to the sentence of all these inspections so repeatedly to have finished, query script also just is through with so then.Next be the generation of some statisticss, and the expression-form of report, the support of multiple matching way.
5 system business processes
Flow process number The workflow content System is auxiliary automatically
1 The keeper creates number of the account for all users After creating successfully, system sends mail according to the E-MAIL address of registering to the user automatically and tells its account number cipher.
2 After each Role Users is notified, revise personal information according to the account number cipher login system in the mail System generates the demographics data automatically
3 Student's logging in system by user is filled in preliminary report and submission System sends mail notification tutor, president and degree chairman automatically and reads and makes comments report and generate the recent statistics data
4 The tutor invites the expert to participate in preliminary report and checks System sends notice automatically and invites the expert.
5 Tutor, expert, president, degree chairman read and make comments preliminary report, and provide suggestion for revision System sends notice automatically and tells the student to read and make comments the result and generate the recent statistics data
6 The student is according to reading and making comments suggestion for revision modification preliminary report and resubmiting System sends mail notification tutor, expert, president and degree chairman automatically and reads and makes comments preliminary report again and generate the recent statistics data
7 Tutor, expert, president, degree chairman read and make comments preliminary report once more, could begin presenting papers by allowing the student until meeting the requirements System sends mail automatically and tells the student to read and make comments the result and generate the recent statistics data
8 After preliminary report passed through, the student began presenting papers System sends the mail notification tutor automatically, carries out format checking and plagiarism check simultaneously and generates audit report and generate the recent statistics data
9 The tutor checks format checking and plagiarism check report, and paper is checked, and proposes suggestion for revision System sends mail automatically and tells student tutor to check suggestion and generate the recent statistics data
10 The student is according to format checking and plagiarism check is reported and tutor's suggestion modification paper is uploaded again System sends the mail notification tutor automatically, carries out format checking and plagiarism check simultaneously and generates audit report and generate the recent statistics data
11 Managerial personnel are to uploading and evaluating distribution by the paper of format checking and plagiarism check System sends the mail notification evaluation expert automatically, increases this paper automatically and generates the recent statistics data in the paper evaluation module in the evaluation expert tool box simultaneously
12 The evaluation expert evaluates paper, submits review comment to System sends mail automatically and tells the student to evaluate suggestion for revision and generate the recent statistics data
13 The student revises paper once more and uploads again System sends mail notification tutor and evaluation automatically
The expert carries out format checking and plagiarism check simultaneously and generates audit report and generate the recent statistics data
14 The evaluation expert evaluates paper repeatedly and meets the requirements until the paper modification, can allow to participate in reply System sends mail notification student's final review comment and generates the recent statistics data
15 Whether managerial personnel can participate in reply according to format checking, plagiarism check result and evaluation expert's suggestion decision paper. System sends the mail notification student automatically and generates the recent statistics data
The software configuration that 6 systems realize
The whole thesis management system software structure that the support this patent is realized as shown in Figure 2.
Paper checks that service middleware is responsible for and two Web Services of inspection paper get in touch, and its structure as shown in Figure 2.7 system development environments
System at development environment is: Tomcat+Axis+Java.The environment configurations step is as follows:
1) sets up the Java development environment, the version of JavaSDK1.3.1 or renewal.Relative address http:. //java.sun.com/J2SE
2) the Jar file of download Google API, Googleapi.jar.The developer can arrive http://www.google.com/apis and download.
3) application Google visit account number, Google supports the mode of free application at present, the user need use an email account number to obtain new account.Notice that at present free account number is supported 1000 maximum inquiry amount every day.Applied address: http://www.google.com/apis.
4) (optional) if the user need directly send SOAP request by java applet, the developer need download relevant Java software package, JAXM.Download address: http://java.sun.com/xml
Web Services deploying step is as follows:
1) the Tomcat server is installed, tomcat5.0 or renewal version, and guarantee that the tomcat server can normally move.
2) the Axis kit is unziped under the webapps catalogue of Tomcat installation directory.
3) plgpreventing.jar of the Web Service that checks service bag is copied under the axis WEB-INF lib
4) jacob.dll is copied to windows under the system32 file
5) move undeploy.bat and the deploy.bat that format checking is served successively respectively, and the undeploy.bat and the deploy.bat of plagiarism check service.
6) restart the tomcat server
7) open any browser is imported on address field Http:// localhost: port numbers/axis/services/, whether the service of inspection exists.Then represent Web Service deployment success if exist
8 software and hardware configuration
◆ software arrangements
-Microsoft.NET?Framework1.1
-Microsoft?SQL?Server?2000?Enterprise?Edition
-Microsoft?SMTP?Server
-Microsoft?IIS
-Java?SDK?1.3.2
-Tomcat?5.0
-Axis kit
◆ hardware configuration
Server should have following configuration:
The two CPU of-minimalist configuration: PIII550,512M internal memory, the quick hard disk of 15G
-recommended configuration: two strong CPU, 2048M internal memory, hot-swappable hard disks of the above high speed of 36G (disk permutation) of causing
-client should possess following configuration:
-system configuration: more than the PII233,128M internal memory, 10G hard disk
-operating system: more than the Windows98
-browser: more than the IE5.5

Claims (3)

1. support the online dissertation management method and the realization mechanism of plagiarism check and format checking.Computer system based on this patent checks that by the computing machine of an online, automatic plagiarism check software, automatic format software and paper are submitted to and evaluation software is formed.This patent is supported submission person, reviewer, paper supvr, unit management person, supervisor's polygonal look rights management towards the paper Life cycle, supports their collaborative to work online, and comprises paper creation, submission, interchange, evaluation, management.Its principal character comprises: utilize the Word development environment to realize the inspection of paper automatic format, utilize method, mechanism and the software implementation method of the automatic plagiarism check of network paper resource practice paper, personalized professional tool case and collaborative work stream mechanism, the progress real-time statistics, the directed mass-sending of notice, functions such as data filing.
2. the automatic plagiarism check of the paper in " claim 1 " and its implementation.Feature: based on the open self-built paper resource of network document resource, library's online resource, system, utilize computer software to obtain, decompose, check document, decompose the examine paper simultaneously, provide the part of being accused of plagiarizing by semantic matches and plagiarization judgment rule.
3. the paper automatic format in " claim 1 " is checked and its implementation.Feature: based on the paper template of paper inspection unit, obtain the form of paper template and institute's presenting papers, carry out comparison, export inconsistent (mistake) form project description by computer software.
CNA2006101181508A 2006-11-09 2006-11-09 Online dissertation management method for realizing plagiarize and format checking by network resource Pending CN101178786A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CNA2006101181508A CN101178786A (en) 2006-11-09 2006-11-09 Online dissertation management method for realizing plagiarize and format checking by network resource

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CNA2006101181508A CN101178786A (en) 2006-11-09 2006-11-09 Online dissertation management method for realizing plagiarize and format checking by network resource

Publications (1)

Publication Number Publication Date
CN101178786A true CN101178786A (en) 2008-05-14

Family

ID=39405029

Family Applications (1)

Application Number Title Priority Date Filing Date
CNA2006101181508A Pending CN101178786A (en) 2006-11-09 2006-11-09 Online dissertation management method for realizing plagiarize and format checking by network resource

Country Status (1)

Country Link
CN (1) CN101178786A (en)

Cited By (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102663553A (en) * 2012-04-09 2012-09-12 吴溢华 Copy edit flow system and copy edit flow method for stopping one paper for multiple journals
CN103559592A (en) * 2013-11-22 2014-02-05 孙生强 Management system for scientific achievements
CN103902514A (en) * 2014-04-04 2014-07-02 浙江大学城市学院 Word format extracting and reutilizing method
CN104133840A (en) * 2014-06-24 2014-11-05 国家电网公司 Data processing method and data processing system with system detection and biological recognition functions
CN104601627A (en) * 2013-11-01 2015-05-06 北大方正集团有限公司 Server, terminal, and file online processing method and system
CN106779413A (en) * 2016-12-16 2017-05-31 广州支点网络科技股份有限公司 Information processing method and equipment in a kind of flow of task
CN107247756A (en) * 2017-05-26 2017-10-13 华南师范大学 The accurate search method of scientific paper and system based on Temporal Reasoning
CN107609127A (en) * 2017-09-15 2018-01-19 成都昊洋文采科技有限公司 Contents management method and system based on collaboration creation
CN108197085A (en) * 2017-12-29 2018-06-22 北京仁和汇智信息技术有限公司 A kind of submission reviewing method and device
CN110069785A (en) * 2019-05-05 2019-07-30 甯航 A kind of paper Authority Contro1 and analysis platform and system based on component agreement
CN111611787A (en) * 2019-02-25 2020-09-01 中国海洋大学 Plagiarism evaluation method, system and auxiliary writing system
CN113344755A (en) * 2021-08-09 2021-09-03 江苏羽驰区块链科技研究院有限公司 User image and block chain technology evidence obtaining method for academic mischief-oriented scene

Cited By (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102663553A (en) * 2012-04-09 2012-09-12 吴溢华 Copy edit flow system and copy edit flow method for stopping one paper for multiple journals
CN104601627B (en) * 2013-11-01 2019-05-14 北大方正集团有限公司 Server, terminal, file on-line processing method and system
CN104601627A (en) * 2013-11-01 2015-05-06 北大方正集团有限公司 Server, terminal, and file online processing method and system
CN103559592A (en) * 2013-11-22 2014-02-05 孙生强 Management system for scientific achievements
CN103902514A (en) * 2014-04-04 2014-07-02 浙江大学城市学院 Word format extracting and reutilizing method
CN104133840A (en) * 2014-06-24 2014-11-05 国家电网公司 Data processing method and data processing system with system detection and biological recognition functions
CN106779413A (en) * 2016-12-16 2017-05-31 广州支点网络科技股份有限公司 Information processing method and equipment in a kind of flow of task
CN107247756A (en) * 2017-05-26 2017-10-13 华南师范大学 The accurate search method of scientific paper and system based on Temporal Reasoning
CN107247756B (en) * 2017-05-26 2021-08-06 华南师范大学 Academic paper accurate retrieval method and system based on temporal reasoning
CN107609127A (en) * 2017-09-15 2018-01-19 成都昊洋文采科技有限公司 Contents management method and system based on collaboration creation
CN108197085A (en) * 2017-12-29 2018-06-22 北京仁和汇智信息技术有限公司 A kind of submission reviewing method and device
CN111611787A (en) * 2019-02-25 2020-09-01 中国海洋大学 Plagiarism evaluation method, system and auxiliary writing system
CN110069785A (en) * 2019-05-05 2019-07-30 甯航 A kind of paper Authority Contro1 and analysis platform and system based on component agreement
CN113344755A (en) * 2021-08-09 2021-09-03 江苏羽驰区块链科技研究院有限公司 User image and block chain technology evidence obtaining method for academic mischief-oriented scene

Similar Documents

Publication Publication Date Title
CN101178786A (en) Online dissertation management method for realizing plagiarize and format checking by network resource
CN101763343A (en) Document editor principle supporting format comparison and plagiarism check and method
Korobchinsky et al. Peculiarities of content forming and analysis in internet newspaper covering music news
US9361464B2 (en) Versatile log system
CN101404037B (en) Method for detecting and positioning electronic text contents plagiary
Pääkkönen et al. Exporting Finnish digitized historical newspaper contents for offline use
CN112231494B (en) Information extraction method and device, electronic equipment and storage medium
CN107748748B (en) Full text retrieval system for water conservancy and hydropower technology standard
King et al. Managing usability for people with disabilities in a large web presence
Veal Doverton Techniques of document management: a review of text retrieval and related technologies
Yacoub et al. Document digitization lifecycle for complex magazine collection
Zhang Research on the factors influencing XBRL Financial reporting innovation
Varvel Jr et al. Google Digital Humanities Awards recipient interviews report
LU504881B1 (en) Intelligent collection method and system for engineering archives based on enabling thinking
CN110688453B (en) Scene application method, system, medium and equipment based on information classification
Shen et al. Research on anti-plagiarism system and the law of plagiarism
Zolotas et al. An architecture for the development of distributed analytics based on polystore events
Liu Research on the Evaluation of Business English Teaching Model Under the Background of Big Data
Yoshioka WC3: Analyzing the style of metadata annotation among Wikipedia articles by using Wikipedia category and the DBpedia metadata database
Bernardi et al. Reverse engineering of web applications to abstract user-centered conceptual models
Yläjääski Document management as a part of product lifecycle management
Atkinson et al. Documentation Technology Report
Hou et al. An automatic question answering and knowledge summarization approach for Q&A services
QIN Research on intelligent library book retrieval model based on QR.
Satoh et al. Documentation know-how sharing by automatic process tracking

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C02 Deemed withdrawal of patent application after publication (patent law 2001)
WD01 Invention patent application deemed withdrawn after publication

Open date: 20080514