CN113407684A - Article originality checking method, article originality checking device and storage medium - Google Patents
Article originality checking method, article originality checking device and storage medium Download PDFInfo
- Publication number
- CN113407684A CN113407684A CN202110452943.8A CN202110452943A CN113407684A CN 113407684 A CN113407684 A CN 113407684A CN 202110452943 A CN202110452943 A CN 202110452943A CN 113407684 A CN113407684 A CN 113407684A
- Authority
- CN
- China
- Prior art keywords
- article
- similarity
- originality
- sentence
- search
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000000034 method Methods 0.000 title claims abstract description 38
- 238000007689 inspection Methods 0.000 claims abstract description 29
- 238000004364 calculation method Methods 0.000 claims abstract description 19
- 238000012360 testing method Methods 0.000 claims description 13
- 238000012550 audit Methods 0.000 claims description 10
- 238000001514 detection method Methods 0.000 abstract 3
- 238000011002 quantification Methods 0.000 abstract 1
- 238000004891 communication Methods 0.000 description 4
- 238000010586 diagram Methods 0.000 description 3
- 238000012545 processing Methods 0.000 description 3
- 239000004973 liquid crystal related substance Substances 0.000 description 2
- 230000003287 optical effect Effects 0.000 description 2
- 238000013139 quantization Methods 0.000 description 2
- 230000006870 function Effects 0.000 description 1
- 230000002265 prevention Effects 0.000 description 1
- 230000008707 rearrangement Effects 0.000 description 1
- 238000012552 review Methods 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/33—Querying
- G06F16/332—Query formulation
- G06F16/3329—Natural language query formulation or dialogue systems
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/10—Text processing
- G06F40/194—Calculation of difference between files
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- General Engineering & Computer Science (AREA)
- Artificial Intelligence (AREA)
- Mathematical Physics (AREA)
- Computational Linguistics (AREA)
- General Physics & Mathematics (AREA)
- Human Computer Interaction (AREA)
- Databases & Information Systems (AREA)
- Data Mining & Analysis (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- General Health & Medical Sciences (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
The invention discloses an article originality inspection method, which comprises the following steps: receiving an article to be detected and marking paragraphs of the article; splitting the paragraph into a plurality of sentences according to punctuation marks; randomly selecting a sentence from each sentence of the first segment and the last segment of the paragraph and the middle paragraph of the paragraph, and putting the sentences into a queue to be detected; taking each sentence of the queue to be detected as a search keyword in sequence to search in a search engine, and obtaining a search result corresponding to the sentence; and after the search result is subjected to de-duplication, calculating the similarity between the search result and the corresponding sentence, and obtaining the similarity calculation result of each sentence. An article originality inspection apparatus and a storage medium are also provided. The technical scheme provided by the invention realizes similarity detection and quantification of sentences, paragraphs and whole articles in the articles, improves the original detection efficiency of the articles, does not need human intervention, saves manpower, and has short detection time and accurate result judgment.
Description
Technical Field
The invention relates to the technical field of article originality inspection, in particular to a method, a device, a storage medium and a system for article originality inspection.
Background
The originality inspection of articles is increasingly implemented using search engines, which compare the originality of articles viewed based on search results of the search engines. The method is different from the method which utilizes a professional duplicate checking or plagiarism checking tool, the checking range is wider through the checking of a search engine, and the original degree of an article can be better confirmed. However, in actual operation, it is not practical to search the contents of an article manually by an inspector, on one hand, the article often contains many contents, and a keyword needs to be extracted first and then searched according to the keyword, on the other hand, the searched contents are too miscellaneous and need to be screened one by one, which is time-consuming, labor-consuming, low in efficiency, incapable of quantifying the inspection result, and incapable of accurately judging the originality of the article.
Disclosure of Invention
The invention provides a method, a device, a storage medium and a system for checking article originality, and aims to solve the problem of article originality checking.
In order to achieve the above object, the method for checking originality of an article provided by the present invention comprises:
receiving an article to be detected and marking paragraphs of the article;
splitting the paragraph into a plurality of sentences according to punctuation marks; randomly selecting a sentence from each sentence of the first segment and the last segment of the paragraph and the middle paragraph of the paragraph, and putting the sentences into a queue to be detected;
taking each sentence of the queue to be detected as a search keyword in sequence to search in a search engine, and obtaining a search result corresponding to the sentence;
and after the search result is subjected to de-duplication, calculating the similarity between the search result and the corresponding sentence, and obtaining the similarity calculation result of each sentence.
Further, the article originality inspection method further includes:
and finding out the search result of which the similarity of each sentence is more than or equal to a preset similarity threshold.
Further, the searching each sentence in the queue to be checked as a search keyword in a search engine further comprises:
each sentence is converted into Chinese pinyin and then is used as a search keyword to be searched in a search engine.
Further, the sentence length is greater than or equal to 5 words and less than or equal to 16 words.
Further, the article originality inspection method further includes:
adding the search result with the maximum similarity calculation result of each sentence into a similarity audit library of the article;
and calculating the similarity in the similarity auditing library of the article to obtain the overall similarity auditing result of the article.
Further, the calculating the similarity in the similarity audit library of the articles includes:
extracting a source address of a search result of each sentence;
calculating source address identity according to hyperlinks of the search results to obtain a source address identity value;
and setting the similarity weight according to the source address similarity value, and calculating the similarity of the articles according to the weight.
Further, the calculation formula of the article similarity is as follows:
wherein,
s represents the similarity of the articles;
k represents a weight;
n represents the number of sentences having the same weight;
t represents the total number of sentences;
x and y are natural numbers.
Further, the weight is preset according to the source address identity value, and the weight is a percentage between 1 and 100.
Furthermore, the present invention provides an article originality inspection apparatus including a memory and a processor, the memory having stored thereon an article originality inspection program executable on the processor, the article originality inspection program, when executed by the processor, implementing the steps of the article originality inspection method as described above.
Meanwhile, the present invention also provides a storage medium, which is a computer-readable storage medium, and the storage medium stores an article originality inspection program, and the article originality inspection program can be executed by one or more processors to implement the steps of the article originality inspection method as described above.
According to the method, the device and the storage medium for checking the originality of the article, the similarity of the article is automatically searched by using the search engine, and the search result is quantized, so that the similarity check and the quantization of the sentence, paragraph and whole article in the article are realized, the efficiency of checking the originality of the article is improved, manual intervention is not needed, manpower is saved, the time consumed for checking is short, and the result judgment is accurate.
Drawings
Fig. 1 is a schematic flow chart illustrating an article originality inspection method according to an embodiment of the present invention;
fig. 2 is a schematic flowchart of a similarity calculation method in an article similarity review library according to an embodiment of the present invention;
fig. 3 is a schematic view of an internal structure of an article originality testing apparatus according to a third embodiment of the present invention;
fig. 4 is a schematic diagram of an article originality testing program module in the article originality testing apparatus according to a third embodiment of the present invention.
Detailed Description
In order to make the objects, technical solutions and advantages of the present invention more apparent, the present invention is described in further detail below with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described herein are merely illustrative of the invention and are not intended to limit the invention. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
Referring to fig. 1, in an embodiment of the present invention, a method for checking originality of an article is provided, including:
step S10: receiving an article to be detected and marking paragraphs of the article; the article herein generally refers to a word file format, and the content of the article is written in chinese, but it is needless to say that the article can also be extended to support more file formats or languages, such as pdf file format, english, etc., based on the technical idea of the present invention, and the present invention is not limited herein.
Step S20: splitting the paragraph into a plurality of sentences according to punctuation marks; randomly selecting a sentence from each sentence of the first segment and the last segment of the paragraph and the middle paragraph of the paragraph, and putting the sentences into a queue to be detected; the method comprises the steps of automatically splitting an article by using paragraph marks and punctuation marks, and marking each paragraph of the article firstly, wherein the paragraph comprises a first paragraph, a last paragraph and a middle paragraph, and the first paragraph, namely the first paragraph and the last paragraph, represents the last paragraph. When the first or last segment is too short, e.g., only has no more than 20 words, the second or last segment can be optionally added as the first and last segments. And extracting all sentences of the first segment and the last segment by punctuations, randomly selecting one sentence for the middle segment, and putting the selected sentence into a queue to be detected. The queue to be detected is a queue designated in a memory, and each queue has a length of 16 words (namely a length of 32 characters), so that sentences need to be limited not to exceed 16 words, meanwhile, too short sentences have the problem of little search opinion, and too long sentences can cause long search time and too many search results; therefore, the sentence length is set to 5 words or more and 16 words or less. The overlong sentence can be processed by splitting into two sentences or removing characters after 16 characters.
Step S30: taking each sentence of the queue to be detected as a search keyword in sequence to search in a search engine, and obtaining a search result corresponding to the sentence; here, the search engine may be a hundred-degree or other search engine, and the search result of each sentence is obtained by calling the search engine to perform a search according to the sentences in the queue to be checked as search keywords.
In another embodiment of the present invention, in step S30, in order to avoid wrongly written words or intentionally set search prevention keywords, each sentence may be converted into pinyin and then searched in a search engine as search keywords, so as to expand the search range.
Step S40: and after the search result is subjected to de-duplication, calculating the similarity between the search result and the corresponding sentence, and obtaining the similarity calculation result of each sentence. For each sentence, the search result of the sentence is obtained through a search engine, the duplication of the search result is removed firstly, repeated contents are filtered out firstly, and then the similarity calculation is carried out on the search result of the sentence.
Step S50: finding out a search result of which the similarity of each sentence is greater than or equal to a preset similarity threshold; specifically, a plurality of search results are generated by searching each sentence, are automatically arranged according to the similarity, and the search results with the similarity percentage larger than or equal to a preset similarity threshold are selected according to the similarity percentage displayed in a search engine; and when the similarity percentage is not larger than or equal to the preset similarity threshold value search result, selecting the first three of the search results as the search results of the sentence. Specifically, the preset similarity threshold may be set or adjusted according to the requirement of the similarity check, and specifically in an embodiment of the present invention, the preset similarity threshold is 60%.
Step S60: adding the search result with the maximum similarity calculation result of each sentence into a similarity audit library of the article; and then adding the search result with the maximum similarity calculation in the search result of each sentence into the similarity audit result of the article, so that the most similar search results of all sentences in the queue to be detected are all put into the similarity audit library of the article.
Step S70: and calculating the similarity in the similarity auditing library of the article to obtain the overall similarity auditing result of the article.
Referring to fig. 2, specifically, the calculating the similarity in the similarity audit library of the articles includes:
step S710: extracting a source address of a search result of each sentence; specifically, the source address of the search result is a hyperlink, and the hyperlink comprises a domain name, a path name and a page address; the same domain name representation is from the same domain name server, and the path name and the page address are also the same, the representation is from the same article or the same series of articles, and the similarity procedure is correspondingly increased.
Step S720: calculating source address identity according to hyperlinks of the search results to obtain a source address identity value; confirming the source address similarity according to the similarity of the domain name, the path name and the page address in the hyperlink; specifically, in an embodiment of the present invention, if the domain names are different, the source address identity value is 1, if the domain names are the same, the source address identity value is 5, if the path names are the same, the source address identity value is 10, and if the page addresses are the same, the source address identity value is 20.
Step S730: setting a similarity weight according to the source address similarity value, and calculating the article similarity according to the weight, wherein specifically, the calculation formula of the article similarity is as follows:
wherein,
s represents the similarity of the articles;
k represents a weight;
n represents the number of sentences having the same weight;
t represents the total number of sentences;
x and y are natural numbers.
Specifically, the weight K is preset according to a source address identity value, and the weight K is a percentage between 1 and 100; for example, the weight K is set to 1% when the source address identity value is 1, 5% when the source address identity value is 5, 10% when the source address identity value is 10, 20% when the source address identity value is 20, and so on. The calculated article similarity value may set a similarity threshold, for example, when the similarity threshold exceeds 30%, the article content is considered to be suspicious of duplication or plagiarism.
And outputting a similarity auditing result of the article, wherein the similarity auditing result of the article comprises the article similarity value and the most similar content in the search result of each sentence, and feeding back to an article writer for checking, checking and modifying through highlighting the similar content to confirm that the content of the article is original content.
The invention also provides an article originality inspection device.
Referring to fig. 3, an embodiment of the invention provides an internal structure diagram of an article originality testing apparatus, which at least includes a memory 11, a processor 12, a communication bus 13, and a network interface 14.
The memory 11 includes at least one type of readable storage medium, which includes a flash memory, a hard disk, a multimedia card, a card type memory (e.g., SD or DX memory, etc.), a magnetic memory, a magnetic disk, an optical disk, and the like. The memory 11 may in some embodiments be an internal storage unit of the article originality testing apparatus, for example a hard disk of the article originality testing apparatus. The memory 11 may also be an external storage device of the article originality testing apparatus in other embodiments, such as a plug-in hard disk provided on the article originality testing apparatus, a Smart Media Card (SMC), a Secure Digital (SD) Card, a Flash memory Card (Flash Card), and the like. Further, the memory 11 may also include both an internal storage unit of the article originality testing apparatus and an external storage device. The memory 11 can be used not only to store application software installed in the article originality testing apparatus and various types of data, such as codes of the article originality testing program, but also to temporarily store data that has been output or is to be output.
The processor 12 may be a Central Processing Unit (CPU), controller, microcontroller, microprocessor or other data Processing chip in some embodiments, and is used for executing program codes stored in the memory 11 or Processing data, such as executing an article originality inspection program.
The communication bus 13 is used to realize connection communication between these components.
The network interface 14 may optionally include a standard wired interface, a wireless interface (e.g., WI-FI interface), typically used to establish a communication link between the article originality testing apparatus and other electronic devices.
Optionally, the article originality inspection apparatus may further include a user interface, the user interface may include a Display (Display), an input unit such as a Keyboard (Keyboard), and the optional user interface may further include a standard wired interface, a wireless interface. Alternatively, in some embodiments, the display may be an LED display, a liquid crystal display, a touch-sensitive liquid crystal display, an OLED (Organic Light-Emitting Diode) touch device, or the like. The display, which may also be referred to as a display screen or display unit, is suitable for displaying information processed in the article originality inspection apparatus and for displaying a visualized user interface.
While fig. 3 shows only the article originality inspection apparatus having the components 11-14 and the article originality inspection program, those skilled in the art will appreciate that the configuration shown in fig. 3 does not constitute a limitation of the article originality inspection apparatus, and may include fewer or more components than those shown, or some components in combination, or a different arrangement of components.
In the embodiment of the article originality testing apparatus shown in fig. 3, an article originality testing program is stored in the memory 11; the processor 12 implements the following steps when executing the article originality check program stored in the memory 11:
step S10: receiving an article to be detected and marking paragraphs of the article;
step S20: splitting the paragraph into a plurality of sentences according to punctuation marks; randomly selecting a sentence from each sentence of the first segment and the last segment of the paragraph and the middle paragraph of the paragraph, and putting the sentences into a queue to be detected;
step S30: taking each sentence of the queue to be detected as a search keyword in sequence to search in a search engine, and obtaining a search result corresponding to the sentence;
step S40: calculating the similarity between the search result and the corresponding sentence after the search result is deduplicated, and obtaining the similarity calculation result of each sentence;
step S50: finding out a search result of which the similarity of each sentence is greater than or equal to a preset similarity threshold;
step S60: adding the search result with the maximum similarity calculation result of each sentence into a similarity audit library of the article;
step S70: and calculating the similarity in the similarity auditing library of the article to obtain the overall similarity auditing result of the article.
Referring to fig. 4, a schematic diagram of program modules of an article originality testing program in an embodiment of the article originality testing apparatus of the present invention is shown, in this embodiment, the article originality testing program may be divided into a splitting module 10, a searching module 20, a calculating module 30, and an output module 40, and exemplarily:
the splitting module 10 is used for executing the tasks of paragraph splitting and sentence splitting of the article to be detected;
the searching module 20 is configured to execute a task of searching the sentences in the queue to be detected as search keywords in a search engine to obtain search results corresponding to the sentences;
a calculation module 30, configured to perform a task of calculating similarity of each sentence;
and the output module 40 is used for executing a task of outputting a sentence similarity calculation structure and a similarity auditing result of the article.
The functions or operation steps of the splitting module 10, the searching module 20, the calculating module 30 and the outputting module 40 that are implemented when the program modules are executed are substantially the same as those of the above embodiments, and are not described herein again.
Furthermore, an embodiment of the present invention further provides a storage medium, which is a computer-readable storage medium, and on which an article originality check program is stored, where the article originality check program is executable by one or more processors to implement the following operations:
step S10: receiving an article to be detected and marking paragraphs of the article;
step S20: splitting the paragraph into a plurality of sentences according to punctuation marks; randomly selecting a sentence from each sentence of the first segment and the last segment of the paragraph and the middle paragraph of the paragraph, and putting the sentences into a queue to be detected;
step S30: taking each sentence of the queue to be detected as a search keyword in sequence to search in a search engine, and obtaining a search result corresponding to the sentence;
step S40: calculating the similarity between the search result and the corresponding sentence after the search result is deduplicated, and obtaining the similarity calculation result of each sentence;
step S50: finding out a search result of which the similarity of each sentence is greater than or equal to a preset similarity threshold;
step S60: adding the search result with the maximum similarity calculation result of each sentence into a similarity audit library of the article;
step S70: and calculating the similarity in the similarity auditing library of the article to obtain the overall similarity auditing result of the article.
The storage medium of the present invention is substantially the same as the embodiments of the method and apparatus for checking originality of the article, and will not be described in detail here.
Compared with the prior art, the article originality inspection method, the article originality inspection device and the storage medium provided by the invention have the advantages that the article similarity automatic search is carried out by utilizing the search engine, the search result is quantized, the similarity inspection and the quantization of sentences, paragraphs and the whole article in the article are realized, the article originality inspection efficiency is improved, the manual intervention is not needed, the labor is saved, the inspection time is short, and the result judgment is accurate.
It should be noted that the above-mentioned numbers of the embodiments of the present invention are merely for description, and do not represent the merits of the embodiments. And the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, apparatus, article, or method that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, apparatus, article, or method. Without further limitation, an element defined by the phrase "comprising an … …" does not exclude the presence of other like elements in a process, apparatus, article, or method that comprises the element.
Through the above description of the embodiments, those skilled in the art will clearly understand that the method of the above embodiments can be implemented by software plus a necessary general hardware platform, and certainly can also be implemented by hardware, but in many cases, the former is a better implementation manner. Based on such understanding, the technical solution of the present invention may be embodied in the form of a software product, which is stored in a storage medium (e.g., ROM/RAM, magnetic disk, optical disk) as described above, and includes instructions for enabling a terminal device (e.g., a drone, a mobile phone, a computer, a server, or a network device) to execute the method according to the embodiments of the present invention.
It is to be noted that the foregoing is only illustrative of the preferred embodiments of the present invention and the technical principles employed. It will be understood by those skilled in the art that the present invention is not limited to the particular embodiments described herein, but is capable of various obvious changes, rearrangements and substitutions as will now become apparent to those skilled in the art without departing from the scope of the invention. Therefore, although the present invention has been described in greater detail by the above embodiments, the present invention is not limited to the above embodiments, and may include other equivalent embodiments without departing from the spirit of the present invention, and the scope of the present invention is determined by the scope of the appended claims.
Claims (10)
1. An article originality inspection method, comprising:
receiving an article to be detected and marking paragraphs of the article;
splitting the paragraph into a plurality of sentences according to punctuation marks; randomly selecting a sentence from each sentence of the first segment and the last segment of the paragraph and the middle paragraph of the paragraph, and putting the sentences into a queue to be detected;
taking each sentence of the queue to be detected as a search keyword in sequence to search in a search engine, and obtaining a search result corresponding to the sentence;
and after the search result is subjected to de-duplication, calculating the similarity between the search result and the corresponding sentence, and obtaining the similarity calculation result of each sentence.
2. The article originality inspection method according to claim 1, characterized in that the article originality inspection method further comprises:
and finding out the search result of which the similarity of each sentence is more than or equal to a preset similarity threshold.
3. The method for checking originality of articles according to claim 1, wherein said searching each sentence in said queue to be checked as a search key in a search engine further comprises:
each sentence is converted into Chinese pinyin and then is used as a search keyword to be searched in a search engine.
4. The method of checking originality of an article according to claim 1, wherein the sentence length is 5 words or more and 16 words or less.
5. The article originality inspection method according to claim 1, characterized in that the article originality inspection method further comprises:
adding the search result with the maximum similarity calculation result of each sentence into a similarity audit library of the article;
and calculating the similarity in the similarity auditing library of the article to obtain the overall similarity auditing result of the article.
6. The method of checking originality of articles according to claim 5, wherein the calculating the similarity in the similarity audit library of articles includes:
extracting a source address of a search result of each sentence;
calculating source address identity according to hyperlinks of the search results to obtain a source address identity value;
and setting the similarity weight according to the source address similarity value, and calculating the similarity of the articles according to the weight.
7. The method of checking originality of an article according to claim 6, wherein the calculation formula of the similarity of the article is:
wherein,
s represents the similarity of the articles;
k represents a weight;
n represents the number of sentences having the same weight;
t represents the total number of sentences;
x and y are natural numbers.
8. The method of claim 7, wherein the weight is preset according to a source address identity value, and the weight is a percentage between 1 and 100.
9. An article originality inspection apparatus comprising a memory and a processor, the memory having stored thereon an article originality inspection program executable on the processor, the article originality inspection program when executed by the processor implementing the steps of the article originality inspection method as claimed in any one of claims 1 to 8.
10. A storage medium that is a computer-readable storage medium having an article originality testing program stored thereon that is executable by one or more processors to implement the steps of the article originality testing method of any one of claims 1 to 8.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110452943.8A CN113407684A (en) | 2021-04-26 | 2021-04-26 | Article originality checking method, article originality checking device and storage medium |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110452943.8A CN113407684A (en) | 2021-04-26 | 2021-04-26 | Article originality checking method, article originality checking device and storage medium |
Publications (1)
Publication Number | Publication Date |
---|---|
CN113407684A true CN113407684A (en) | 2021-09-17 |
Family
ID=77677647
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202110452943.8A Pending CN113407684A (en) | 2021-04-26 | 2021-04-26 | Article originality checking method, article originality checking device and storage medium |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN113407684A (en) |
-
2021
- 2021-04-26 CN CN202110452943.8A patent/CN113407684A/en active Pending
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN109062874B (en) | Financial data acquisition method, terminal device and medium | |
CN108874928B (en) | Resume data information analysis processing method, device, equipment and storage medium | |
CN109766438B (en) | Resume information extraction method, resume information extraction device, computer equipment and storage medium | |
CN108932294B (en) | Resume data processing method, device, equipment and storage medium based on index | |
RU2613846C2 (en) | Method and system for extracting data from images of semistructured documents | |
US20170147688A1 (en) | Automatically mining patterns for rule based data standardization systems | |
US10572726B1 (en) | Media summarizer | |
CN111460131A (en) | Method, device and equipment for extracting official document abstract and computer readable storage medium | |
CN110909123B (en) | Data extraction method and device, terminal equipment and storage medium | |
CN105302626B (en) | Analytic method of XPS (XPS) structured data | |
US11520835B2 (en) | Learning system, learning method, and program | |
CN110688315A (en) | Interface code detection report generation method, electronic device, and storage medium | |
CN109670183B (en) | Text importance calculation method, device, equipment and storage medium | |
JP5687312B2 (en) | Digital information analysis system, digital information analysis method, and digital information analysis program | |
US20120109638A1 (en) | Electronic device and method for extracting component names using the same | |
CN111046636B (en) | Method, device, computer equipment and storage medium for screening PDF file information | |
CN110489528B (en) | Electronic dictionary reconstruction method based on electronic book content and computing equipment | |
CN110191124B (en) | Web front-end development data-based website identification method and device and storage equipment | |
CN111177387A (en) | User list information processing method, electronic device and computer readable storage medium | |
CN114239562B (en) | Method, device and equipment for identifying program code blocks in document | |
CN113407684A (en) | Article originality checking method, article originality checking device and storage medium | |
JP2016018279A (en) | Document file search program, document file search device, document file search method, document information output program, document information output device, and document information output method | |
CN114969385A (en) | Knowledge graph optimization method and device based on document attribute assignment entity weight | |
CN114220113A (en) | Paper quality detection method, device and equipment | |
CN114154480A (en) | Information extraction method, device, equipment and storage medium |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication |