CN113486247B

CN113486247B - Internet online identification and reading document reading hierarchical management system

Info

Publication number: CN113486247B
Application number: CN202110846295.4A
Authority: CN
Inventors: 余东燕; 黄钧
Original assignee: Shenzhen Zhiku Information Technology Co ltd
Current assignee: Unicom Woyuedu Technology Culture Co Ltd
Priority date: 2021-07-26
Filing date: 2021-07-26
Publication date: 2022-02-01
Anticipated expiration: 2041-07-26
Also published as: CN113486247A

Abstract

The invention discloses an internet online identification and reading document reading hierarchical management system, which relates to the technical field of document reading hierarchical management and solves the technical problem in the prior art that the same online document cannot be graded for different readers. Each of a plurality of sub-terminals is uniquely matched with its corresponding login terminal, which prevents different users from being matched to the same sub-terminal, a situation that would increase the document grading workload and reduce document grading accuracy. The same browsed document is graded for different readers, so that readers are effectively prevented from viewing disliked or unsuitable content, which indirectly improves the quality of document browsing and directly improves how efficiently and how often readers use the browsing platform. Documents are pushed to the real-time login reader, which reduces the reader's document query time, and pushing suitable documents improves the reader's reading quality.

Description

Internet online identification and reading document reading hierarchical management system

Technical Field

The invention relates to the technical field of document reading hierarchical management, and in particular to an internet online identification and reading document reading hierarchical management system.

Background

Children in a growth period have limited cognitive abilities, and graded reading is important in helping them accept different things. In early childhood, simple words and pictures can establish a child's initial understanding of the surrounding world; simple reading materials combined with small games, such as storytelling games, can strengthen the confidence children gain from reading and raise their interest in reading. In an era of rapid internet development, however, it is crucial to prevent children from coming into contact with inappropriate information.

In the prior art, the same online document cannot be graded for different readers so that disliked content is shielded. If the disliked content is simply deleted, the reading requirements of other readers cannot be met, which lowers the quality of the document and prevents information from being conveyed comprehensively. In addition, readers' online browsing cannot be strictly controlled, so reader accounts may be stolen and reading quality is reduced.

In view of the above technical drawbacks, a solution is proposed.

Disclosure of Invention

The invention aims to provide an internet online identification and reading document reading hierarchical management system. Each of a plurality of sub-terminals is uniquely matched with its corresponding login terminal, which prevents different users from being matched to the same sub-terminal, a situation that would increase the document grading workload and reduce document grading accuracy; the same browsed document is graded for different readers, so that readers are effectively prevented from viewing disliked or unsuitable content, which indirectly improves the quality of document browsing and directly improves how efficiently and how often readers use the browsing platform; the paragraphs belonging to the screened document types of each real-time login reader are shielded individually, which improves the intelligence of the online browsing terminal.

The purpose of the invention can be realized by the following technical scheme:

an internet online identification and reading document reading hierarchical management system comprises an identity registration front end, a document browsing platform and a back-end input platform; the document browsing platform comprises an online browsing terminal, a document pushing unit, a document grading unit and a plurality of sub-terminals; the back-end input platform comprises a server, a document identifying and reading unit, a database and a data transmission unit;

the identity registration front end is used for collecting reader identity information and registering the reader, wherein the reader identity information comprises the name, age, occupation and identity card number of the reader; the identity information of successfully registered readers is stored in the identity registration front end, and the stored identity information serves as the access right to the document browsing platform;

the document browsing platform is used by readers who access it in real time to browse documents; a real-time login reader with access rights enters the document browsing platform, and the login terminal of the real-time login reader is in communication connection with a sub-terminal; the sub-terminals correspond to the login terminals one to one, and each sub-terminal takes the identity card number of its real-time login reader as its label, so that every sub-terminal is uniquely matched with its corresponding login terminal; the sub-terminal of the real-time login reader is in communication connection with the online browsing terminal, and the real-time login reader transmits the name of the document being browsed to the online browsing terminal through that sub-terminal; the online browsing terminal binds by document name, so that the i sub-terminals reporting the same real-time browsed document name are bound to that document, and the real-time browsed document is sent to the document grading unit together with the i bound sub-terminals;

the back-end input platform is used for acquiring documents to be provided to the document browsing platform; the server acquires the most recent update time of the database and compares it with the current system time, and if the interval between the two is greater than an update time threshold, documents are collected from the Internet in real time.
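The update check described above can be sketched as follows. This is a minimal illustration, not the patented implementation: the function name `needs_refresh` and the 24-hour threshold are assumptions, since the source does not state a threshold value.

```python
from datetime import datetime, timedelta


def needs_refresh(last_update: datetime, now: datetime, threshold: timedelta) -> bool:
    """Return True when the time since the last database update exceeds the threshold."""
    return (now - last_update) > threshold


# Hypothetical values: the threshold is chosen only for illustration.
last_update = datetime(2021, 7, 26, 8, 0)
now = datetime(2021, 7, 27, 9, 0)
if needs_refresh(last_update, now, timedelta(hours=24)):
    print("collect documents from the Internet")
```

The comparison is strict, matching the wording "greater than an update time threshold"; an interval exactly equal to the threshold does not trigger collection.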

As a preferred embodiment of the invention, the specific grading process of the document grading unit is as follows:

analyzing the i sub-terminals corresponding to the real-time browsed document, wherein i is a natural number greater than or equal to 1; analyzing the reader corresponding to each sub-terminal, dividing the readers into adult readers and underage readers, and acquiring the unsuitable document types of the underage readers and the disliked document types of the adult readers; an unsuitable document type is a document type that does not fit the age bracket of an underage reader; analyzing the real-time browsed document, dividing it into paragraphs, and constructing the set A = {DL1, DL2, …, DLn} of its paragraphs, wherein, for example, DL2 represents the document content of the second paragraph of the real-time browsed document;

analyzing the paragraphs corresponding to the elements of set A, and marking each such paragraph as o, wherein o is a positive integer greater than 1; acquiring the occurrence frequency and the number of occurrences of each Chinese character H or phrase C in each paragraph; if the occurrence frequency and the number of occurrences of a Chinese character H or phrase C are greater than the occurrence frequency threshold and the occurrence count threshold respectively, judging that Chinese character H or phrase C to be a key Chinese character or key phrase of the paragraph; judging the document type of the paragraph according to its key Chinese characters or key phrases; matching each paragraph in set A to its document type one by one; if the document type of a paragraph in set A belongs to the screened document types of the real-time login reader, analyzing the segment length of the paragraph: if the segment length is below the segment length threshold range, the paragraph is not shielded; if the segment length is within the segment length threshold range, the paragraph is shielded; and if the segment length is above the segment length threshold range, the document to which the paragraph belongs is marked as a non-conforming browsing document.
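The key character and key phrase test above can be sketched as follows. Several points are assumptions made only for illustration: the mapping `TERM_TO_TYPE`, the threshold values, the definition of occurrence frequency as occurrences divided by paragraph length in characters, and the rule of picking the most frequent key term when several qualify. The source does not specify any of these.

```python
# Hypothetical mapping from key characters or phrases to document types.
TERM_TO_TYPE = {"爱": "love", "战争": "war"}


def key_terms(paragraph, candidates, count_threshold, freq_threshold):
    """Return {term: count} for terms whose count and frequency both exceed their thresholds."""
    length = len(paragraph)
    result = {}
    for term in candidates:
        count = paragraph.count(term)
        # Assumption: frequency = occurrences / paragraph length in characters.
        freq = count / length if length else 0.0
        if count > count_threshold and freq > freq_threshold:
            result[term] = count
    return result


def paragraph_type(paragraph, count_threshold=2, freq_threshold=0.05):
    """Return the document type of a paragraph, or None if it has no key term."""
    terms = key_terms(paragraph, TERM_TO_TYPE, count_threshold, freq_threshold)
    if not terms:
        return None
    best = max(terms, key=terms.get)  # assumption: the most frequent key term decides
    return TERM_TO_TYPE[best]


sample = "战争开始了，战争持续，战争结束"
print(paragraph_type(sample))
```

Both thresholds must be exceeded for a term to count as key, which mirrors the "respectively greater than" condition in the text.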

As a preferred embodiment of the present invention, the specific pushing process of the document pushing unit is as follows:

collecting the historical browsing documents of the real-time login reader, that is, the documents the reader has finished browsing; analyzing the historical browsing documents and collecting the document types of all paragraphs in each of them; if the same document type appears repeatedly across the historical browsing documents, marking that document type as a favorite document type; if no document type appears repeatedly, sorting the document types in the historical browsing documents by segment length from largest to smallest and marking the first-ranked document type as the favorite document type; selecting documents from the database in the back-end input platform with the favorite document type and the screened document types as selection conditions, wherein none of the paragraphs of a collected document has a screened document type, and marking the collected documents as preselected documents; acquiring the number of paragraphs of the favorite document type in each preselected document, and ranking the preselected documents by that number; analyzing the ranked preselected documents and acquiring the number of times each has been browsed and the number of positive reviews it has received; if the ratio of the number of positive reviews to the number of times browsed is greater than a ratio threshold, marking the preselected document as a selected document and sending it to the online browsing terminal; and if the ratio is less than or equal to the ratio threshold, marking the preselected document as an unselected document.
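The favorite-type rule at the start of the pushing process can be sketched as follows. The data layout (each historical document as a list of `(document_type, segment_length)` pairs) and the reading of "appears repeatedly" as "appears in at least two historical documents" are assumptions for illustration.

```python
from collections import Counter


def favorite_types(history):
    """history: list of documents, each a list of (document_type, segment_length) pairs."""
    docs_per_type = Counter()
    for doc in history:
        # Count each type once per document.
        docs_per_type.update({doc_type for doc_type, _ in doc})
    # Assumption: "repeatedly" means the type occurs in two or more documents.
    repeated = sorted(t for t, n in docs_per_type.items() if n > 1)
    if repeated:
        return repeated
    # Fallback from the text: the type of the longest paragraph ranks first.
    paragraphs = [p for doc in history for p in doc]
    if not paragraphs:
        return []
    longest = max(paragraphs, key=lambda p: p[1])
    return [longest[0]]


history = [
    [("war", 120), ("love", 40)],
    [("war", 80), ("science", 200)],
]
print(favorite_types(history))
```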

As a preferred embodiment of the present invention, the specific identifying and reading process of the document identifying and reading unit is as follows:

selecting five reviewers (identifying and reading persons) to identify and read the documents, wherein the five reviewers do not all share the same favorite document type; setting an identifying and reading time, and selecting a document of any type from the documents collected in real time as the review object (identifying and reading object); within the identifying and reading time, the five reviewers read the review object; after reading is finished, acquiring the wrongly written characters found in the review object by the five reviewers, and removing duplicates to obtain the actual number of wrongly written characters; if the actual number of wrongly written characters is greater than or equal to the wrongly-written-character threshold, judging the review object to be unqualified; if the actual number is less than the threshold, carrying out a conciseness analysis on the review object; acquiring the segment length of each paragraph in the review object, and if the segment length is greater than a segment length threshold, marking the paragraph as a long paragraph; otherwise, marking the paragraph as a short paragraph; acquiring the average number of long paragraphs and the average number of short paragraphs recorded by the five reviewers, and the ratio of the former to the latter; if the ratio is greater than 2, judging that the review object fails the conciseness check; otherwise, judging the review object to be qualified; and sending the qualified review object to the server.

Compared with the prior art, the invention has the beneficial effects that:

in the invention, each of the sub-terminals is uniquely matched with its corresponding login terminal, which prevents different users from being matched to the same sub-terminal, a situation that would increase the document grading workload and reduce document grading accuracy; the same browsed document is graded for different readers, so that readers are effectively prevented from viewing disliked or unsuitable content, which indirectly improves the quality of document browsing and directly improves how efficiently and how often readers use the browsing platform; the paragraphs belonging to the screened document types of each real-time login reader are shielded individually, which improves the intelligence of the online browsing terminal;

documents are pushed to real-time login readers, which reduces their document query time, and pushing suitable documents improves their reading quality; collecting the favorite document types of readers improves pushing accuracy and prevents unsuitable documents from being pushed, which would reduce the readers' reading quality.

Drawings

In order to illustrate the embodiments of the present invention or the technical solutions in the prior art more clearly, the drawings used in the description of the embodiments or the prior art are briefly described below. The drawings described below show only some embodiments of the present invention, and those skilled in the art can obtain other drawings from them without creative effort.

Fig. 1 is an overall schematic block diagram of the present invention.

Detailed Description

The technical solutions of the present invention will be described clearly and completely with reference to the following embodiments, and it should be understood that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.

As shown in Fig. 1, an internet online identification and reading document reading hierarchical management system includes an identity registration front end, a document browsing platform and a back-end input platform, wherein the document browsing platform is in bidirectional communication connection with the identity registration front end and the back-end input platform; the document browsing platform includes an online browsing terminal, a document pushing unit, a document grading unit and a plurality of sub-terminals, and the online browsing terminal is in bidirectional communication connection with the document pushing unit, the document grading unit and the plurality of sub-terminals; the back-end input platform includes a server, a document identifying and reading unit, a database and a data transmission unit, wherein the server is in bidirectional communication connection with the document identifying and reading unit and the database, and the database is in bidirectional communication connection with the data transmission unit;

the identity registration front end collects reader identity information and registers the reader, wherein the reader identity information comprises the name, age, occupation and identity card number of the reader; the identity information of successfully registered readers is stored in the identity registration front end, and the stored identity information serves as the access right to the document browsing platform;

the document browsing platform is used by readers who access it in real time to browse documents; a real-time login reader with access rights enters the document browsing platform, and the login terminal of the real-time login reader is in communication connection with a sub-terminal; the sub-terminals correspond to the login terminals one to one, and each sub-terminal takes the identity card number of its real-time login reader as its label, so that every sub-terminal is uniquely matched with its corresponding login terminal; this prevents different users from being matched to the same sub-terminal, a situation that would increase the document grading workload and reduce document grading accuracy;

the sub-terminal of the real-time login reader is in communication connection with the online browsing terminal, and the real-time login reader transmits the name of the document being browsed to the online browsing terminal through that sub-terminal; the online browsing terminal binds by document name, so that the i sub-terminals reporting the same real-time browsed document name are bound to that document, and the real-time browsed document is sent to the document grading unit together with the i bound sub-terminals;
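The binding of sub-terminals to a real-time browsed document amounts to grouping by document name. A minimal sketch, assuming each sub-terminal reports a `(label, document_name)` pair; the labels shown are placeholder strings standing in for identity card numbers, and the document names are invented for the example.

```python
from collections import defaultdict


def bind_sub_terminals(reports):
    """Group sub-terminal labels by the name of the document each one is browsing."""
    bound = defaultdict(list)
    for label, document_name in reports:
        bound[document_name].append(label)
    return dict(bound)


# Hypothetical reports from three sub-terminals.
reports = [
    ("ID-0001", "History of Flight"),
    ("ID-0002", "Garden Stories"),
    ("ID-0003", "History of Flight"),
]
for name, labels in bind_sub_terminals(reports).items():
    # Each document and its i bound sub-terminals would go to the grading unit together.
    print(name, labels)
```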

the document grading unit is used for grading the same browsed document for different readers, which effectively prevents readers from viewing disliked or unsuitable content, thereby indirectly improving the quality of document browsing and directly improving how efficiently and how often readers use the browsing platform; the specific grading process is as follows:

step S1: analyzing the i sub-terminals corresponding to the real-time browsed document, wherein i is a natural number greater than or equal to 1; analyzing the reader corresponding to each sub-terminal, dividing the readers into adult readers and underage readers, and acquiring the unsuitable document types of the underage readers and the disliked document types of the adult readers; an unsuitable document type is a document type that does not fit the age bracket of an underage reader; a disliked document type is the document type of documents that the adult reader has historically rated poorly; the disliked document types and the unsuitable document types are marked as screened document types;

step S2: analyzing the real-time browsed document, dividing it into paragraphs, and constructing the set A = {DL1, DL2, …, DLn} of its paragraphs, wherein, for example, DL2 represents the document content of the second paragraph of the real-time browsed document;

analyzing the paragraphs corresponding to the elements of set A, and marking each such paragraph as o, wherein o is a positive integer greater than 1; acquiring the occurrence frequency and the number of occurrences of each Chinese character H or phrase C in each paragraph; if the occurrence frequency and the number of occurrences of a Chinese character H or phrase C are greater than the occurrence frequency threshold and the occurrence count threshold respectively, judging that Chinese character H or phrase C to be a key Chinese character or key phrase of the paragraph; judging the document type of the paragraph according to its key Chinese characters or key phrases, the determination of document type itself following the prior art; for example, if the key Chinese character of the corresponding document is "love", the document type is judged to be the love type, and if the key phrase of the corresponding document is "war", the document type is judged to be the war type;

step S3: matching each paragraph in set A to its document type one by one; if the document type of a paragraph in set A belongs to the screened document types of the real-time login reader, analyzing the segment length of the paragraph: if the segment length is below the segment length threshold range, the paragraph is not shielded; if the segment length is within the segment length threshold range, the paragraph is shielded; if the segment length is above the segment length threshold range, the document to which the paragraph belongs is marked as a non-conforming browsing document; shielding the paragraphs of each real-time login reader's screened document types individually improves the intelligence of the online browsing terminal, and grading the same document for different readers effectively prevents readers from browsing disliked content, improves document quality in the document browsing platform, and enhances the quality of service to readers;
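Steps S1 and S3 can be sketched together as follows. The adult age of 18, the set `UNSUITABLE_FOR_MINORS`, the `Reader` structure, and the use of `None` to stand for a shielded paragraph are all assumptions for illustration; the source gives no concrete values for the segment length threshold range.

```python
from dataclasses import dataclass, field

ADULT_AGE = 18                   # assumption: the source does not state the age boundary
UNSUITABLE_FOR_MINORS = {"war"}  # hypothetical unsuitable document types


@dataclass
class Reader:
    age: int
    disliked_types: set = field(default_factory=set)


def screened_types(reader):
    """Step S1: adults screen their disliked types, minors the unsuitable types."""
    if reader.age >= ADULT_AGE:
        return set(reader.disliked_types)
    return set(UNSUITABLE_FOR_MINORS)


def grade_document(paragraphs, screened, low, high):
    """Step S3. paragraphs: list of (text, document_type).

    Returns (conforming, view); a shielded paragraph appears as None in view.
    """
    view = []
    for text, doc_type in paragraphs:
        if doc_type not in screened or len(text) < low:
            view.append(text)   # not screened, or below the range: shown
        elif len(text) <= high:
            view.append(None)   # within the range: shielded
        else:
            return False, []    # above the range: non-conforming document
    return True, view


doc = [("a" * 5, "war"), ("b" * 20, "war"), ("c" * 30, "love")]
print(grade_document(doc, screened_types(Reader(age=10)), low=10, high=50))
```

Because the view is computed per reader, the same document yields different results for different readers without the stored document being modified.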

when the real-time login reader has finished browsing the real-time browsed document and the communication connection between the corresponding sub-terminal and the online browsing terminal has not been disconnected, the online browsing terminal generates a document pushing signal and sends it to the document pushing unit; the document pushing unit is used for pushing documents to the real-time login reader, which reduces the reader's document query time, and pushing suitable documents improves the reader's reading quality; the specific pushing process is as follows:

collecting the historical browsing documents of the real-time login reader, that is, the documents the reader has finished browsing; analyzing the historical browsing documents and collecting the document types of all paragraphs in each of them; if the same document type appears repeatedly across the historical browsing documents, marking that document type as a favorite document type; if no document type appears repeatedly, sorting the document types in the historical browsing documents by segment length from largest to smallest and marking the first-ranked document type as the favorite document type; collecting the favorite document types of readers improves pushing accuracy and prevents unsuitable documents from being pushed, which would reduce the readers' reading quality;

selecting documents from the database in the back-end input platform with the favorite document type and the screened document types as selection conditions, wherein none of the paragraphs of a collected document has a screened document type, and marking the collected documents as preselected documents; acquiring the number of paragraphs of the favorite document type in each preselected document, and ranking the preselected documents by that number; analyzing the ranked preselected documents and acquiring the number of times each has been browsed and the number of positive reviews it has received; if the ratio of the number of positive reviews to the number of times browsed is greater than a ratio threshold, marking the preselected document as a selected document and sending it to the online browsing terminal; if the ratio is less than or equal to the ratio threshold, marking the preselected document as an unselected document;
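The preselection, ranking and ratio filter can be sketched as follows. The record fields (`name`, `types`, `views`, `positive`), the descending ranking order, the requirement that a preselected document contain at least one favorite-type paragraph, and the skipping of documents with zero views are assumptions; the source states only the selection conditions and the ratio test.

```python
def push_candidates(documents, favorite, screened, ratio_threshold):
    """Return names of selected documents, ranked by favorite-type paragraph count."""
    # Preselect: has the favorite type and contains no screened type.
    preselected = [
        d for d in documents
        if favorite in d["types"] and not (screened & set(d["types"]))
    ]
    # Assumption: more favorite-type paragraphs ranks higher.
    preselected.sort(key=lambda d: d["types"].count(favorite), reverse=True)
    # Select: positive reviews / views must exceed the ratio threshold.
    return [
        d["name"] for d in preselected
        if d["views"] > 0 and d["positive"] / d["views"] > ratio_threshold
    ]


# Hypothetical database records.
documents = [
    {"name": "A", "types": ["war", "war", "science"], "views": 100, "positive": 80},
    {"name": "B", "types": ["war", "love"], "views": 100, "positive": 90},
    {"name": "C", "types": ["war", "science"], "views": 100, "positive": 10},
    {"name": "D", "types": ["war", "war", "war"], "views": 50, "positive": 40},
]
print(push_candidates(documents, "war", {"love"}, 0.5))
```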

the back-end input platform is used for acquiring documents to be provided to the document browsing platform; the server acquires the most recent update time of the database and compares it with the current system time; if the interval between the two is greater than the update time threshold, documents are collected from the Internet in real time, and the documents collected in real time are sent to the document identifying and reading unit; enhancing document browsing platform

The document identifying and reading unit is used for identifying and reading the documents collected in real time, which improves document quality and prevents unqualified documents from being transmitted to the document browsing platform, where they would reduce readers' reading quality and waste document storage space; the specific identifying and reading process is as follows:

step SS1: selecting five reviewers (identifying and reading persons) to identify and read the documents, wherein the five reviewers do not all share the same favorite document type; setting an identifying and reading time, and selecting a document of any type from the documents collected in real time as the review object (identifying and reading object); within the identifying and reading time, the five reviewers read the review object;

step SS2: after reading is finished, acquiring the wrongly written characters found in the review object by the five reviewers, and removing duplicates to obtain the actual number of wrongly written characters; if the actual number of wrongly written characters is greater than or equal to the wrongly-written-character threshold, judging the review object to be unqualified; if the actual number is less than the threshold, carrying out a conciseness analysis on the review object;

step SS3: acquiring the segment length of each paragraph in the review object, and if the segment length is greater than a segment length threshold, marking the paragraph as a long paragraph; otherwise, marking the paragraph as a short paragraph; acquiring the average number of long paragraphs and the average number of short paragraphs recorded by the five reviewers, and the ratio of the former to the latter; if the ratio is greater than 2, judging that the review object fails the conciseness check; otherwise, judging the review object to be qualified; sending the qualified review object to the server; analyzing the review object in this way prevents wrongly written characters or poorly flowing documents from reducing the readers' reading quality;
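Steps SS2 and SS3 can be sketched as follows. The report layout (each reviewer supplying a set of wrongly written characters plus counts of long and short paragraphs), the status strings, and the handling of a zero short-paragraph average are assumptions for illustration; the sample "typos" are English placeholders.

```python
from statistics import mean


def review(reports, typo_threshold, max_ratio=2):
    """reports: one dict per reviewer with keys 'typos' (set), 'long' and 'short' (counts)."""
    # SS2: merge the reviewers' findings; the set union removes duplicates.
    typos = set().union(*(r["typos"] for r in reports))
    if len(typos) >= typo_threshold:
        return "unqualified: wrongly written characters"
    # SS3: compare the average long-paragraph count with the average short-paragraph count.
    avg_long = mean(r["long"] for r in reports)
    avg_short = mean(r["short"] for r in reports)
    if avg_short:
        ratio = avg_long / avg_short
    else:
        # Assumption: with no short paragraphs the ratio is treated as unbounded.
        ratio = float("inf") if avg_long else 0.0
    if ratio > max_ratio:
        return "unqualified: conciseness"
    return "qualified"


# Hypothetical reports from five reviewers.
reports = [
    {"typos": {"teh", "recieve"}, "long": 4, "short": 6},
    {"typos": {"teh"}, "long": 5, "short": 5},
    {"typos": set(), "long": 4, "short": 6},
    {"typos": {"recieve"}, "long": 4, "short": 6},
    {"typos": set(), "long": 3, "short": 7},
]
print(review(reports, typo_threshold=3))
```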

after receiving the entered review objects (identifying and reading objects), the server sends them to the database for storage; the database counts the entered review objects, and when their number is greater than or equal to a number threshold, the entered review objects in the database are delivered to the document browsing platform through the data transmission unit.
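The count-then-deliver behavior of the database can be sketched as follows. The class name `ReviewedStore`, the callback standing in for the data transmission unit, and the clearing of the pending list after delivery are assumptions; the source does not say whether delivered objects remain counted.

```python
class ReviewedStore:
    """Holds qualified review objects and delivers them in a batch at a count threshold."""

    def __init__(self, count_threshold, deliver):
        self.count_threshold = count_threshold
        self.deliver = deliver  # stands in for the data transmission unit
        self.pending = []

    def store(self, document):
        self.pending.append(document)
        if len(self.pending) >= self.count_threshold:
            # Assumption: the counter restarts after each delivery.
            batch, self.pending = self.pending, []
            self.deliver(batch)


delivered = []
store = ReviewedStore(count_threshold=3, deliver=delivered.append)
for name in ["doc-1", "doc-2", "doc-3", "doc-4"]:
    store.store(name)
print(delivered, store.pending)
```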

The working process of the invention is as follows:

When the internet online identification and reading document reading hierarchical management system works, reader identity information is collected and registered through the identity registration front end, wherein the reader identity information comprises the name, age, occupation and identity card number of the reader; the identity information of successfully registered readers is stored in the identity registration front end, and the stored identity information serves as the access right to the document browsing platform; readers who access the document browsing platform in real time browse documents through it: a real-time login reader with access rights enters the document browsing platform, the login terminal of the real-time login reader is in communication connection with a sub-terminal, the sub-terminals correspond to the login terminals one to one, and each sub-terminal takes the identity card number of its real-time login reader as its label, so that every sub-terminal is uniquely matched with its corresponding login terminal; the sub-terminal of the real-time login reader is in communication connection with the online browsing terminal, and the real-time login reader transmits the name of the document being browsed to the online browsing terminal through that sub-terminal; the online browsing terminal binds by document name, so that the i sub-terminals reporting the same real-time browsed document name are bound to that document, and the real-time browsed document is sent to the document grading unit together with the i bound sub-terminals; documents to be provided to the document browsing platform are acquired through the back-end input platform: the server acquires the most recent update time of the database and compares it with the current system time, and if the interval between the two is greater than the update time threshold, documents are collected from the Internet in real time.

The above formulas are all calculated on dimensionless numerical values; each formula is obtained by collecting a large amount of data and performing software simulation so as to approximate the real situation as closely as possible, and the preset parameters in the formulas are set by those skilled in the art according to actual conditions.

The foregoing is merely exemplary and illustrative of the present invention and various modifications, additions and substitutions may be made by those skilled in the art to the specific embodiments described without departing from the scope of the invention as defined in the following claims.

Claims

1. An internet online identification and reading document reading hierarchical management system, characterized by comprising an identity registration front end, a document browsing platform and a back-end input platform; the document browsing platform comprises an online browsing terminal, a document pushing unit, a document grading unit and a plurality of sub-terminals; the back-end input platform comprises a server, a document identifying and reading unit, a database and a data transmission unit;

the back-end input platform is used for acquiring documents to be provided to the document browsing platform; the server acquires the most recent update time of the database and compares it with the current system time, and if the interval between the two is greater than the update time threshold, documents are collected from the Internet in real time;

the document grading unit specifically grades as follows:

2. The internet online identification and reading document reading hierarchical management system according to claim 1, wherein the specific pushing process of the document pushing unit is as follows:

3. The internet online identification and reading document reading hierarchical management system according to claim 1, wherein the specific identifying and reading process of the document identifying and reading unit is as follows: