CN113486247B - Internet online identification and reading document reading hierarchical management system - Google Patents
Internet online identification and reading document reading hierarchical management system Download PDFInfo
- Publication number
- CN113486247B CN113486247B CN202110846295.4A CN202110846295A CN113486247B CN 113486247 B CN113486247 B CN 113486247B CN 202110846295 A CN202110846295 A CN 202110846295A CN 113486247 B CN113486247 B CN 113486247B
- Authority
- CN
- China
- Prior art keywords
- document
- browsing
- time
- real
- reading
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000000034 method Methods 0.000 claims description 16
- 230000006854 communication Effects 0.000 claims description 9
- 238000004891 communication Methods 0.000 claims description 9
- 238000011156 evaluation Methods 0.000 claims description 6
- 238000012163 sequencing technique Methods 0.000 claims description 6
- 230000005540 biological transmission Effects 0.000 claims description 5
- 230000007175 bidirectional communication Effects 0.000 description 4
- 230000002708 enhancing effect Effects 0.000 description 2
- 238000007792 addition Methods 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 230000003930 cognitive ability Effects 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000012552 review Methods 0.000 description 1
- 238000012216 screening Methods 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/953—Querying, e.g. by the use of web search engines
- G06F16/9535—Search customisation based on user profiles and personalisation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F21/00—Security arrangements for protecting computers, components thereof, programs or data against unauthorised activity
- G06F21/30—Authentication, i.e. establishing the identity or authorisation of security principals
- G06F21/31—User authentication
- G06F21/32—User authentication using biometric data, e.g. fingerprints, iris scans or voiceprints
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L67/00—Network arrangements or protocols for supporting network services or applications
- H04L67/50—Network services
- H04L67/55—Push-based network services
Abstract
The invention discloses an internet online identification and reading document reading hierarchical management system, which relates to the technical field of document reading hierarchical management and solves the technical problem that the same online document cannot be subjected to hierarchical processing aiming at different readers in the prior art, and a plurality of sub-terminals are singly matched with corresponding login terminals, so that the situation that different users are matched with the same sub-terminal is prevented, the document grading task amount is increased, and the document grading accuracy is reduced; the same browsing document is graded aiming at different readers, so that the readers are effectively prevented from watching disliked or unsuitable contents, the browsing quality of the document by the users is indirectly improved, and the use efficiency and frequency of the browsing platform by the readers are directly improved; and the document is pushed to the real-time login reader, so that the document query time of the real-time login reader is reduced, and the reading quality of the reader is improved by pushing a proper document for the reader.
Description
Technical Field
The invention relates to the technical field of document reading hierarchical management, in particular to an internet online document reading hierarchical management system.
Background
For children in a growth period, the cognitive abilities of the children are limited, in order to enable the children to better accept different things, the importance of graded reading is also shown, in children in the period of children, initial knowledge of the children on the surroundings can be established through some simple words and pictures, meanwhile, according to simple reading materials, a small game such as a story telling game can be used for enhancing the confidence of the children obtained by reading and improving the interest of the children in reading, but in the era of rapid development of the internet, the fact that the children are prevented from being contacted with inappropriate information is crucial;
however, in the prior art, the same online document cannot be graded for different readers, and the disliked content is shielded, and if the disliked content is deleted, the reading requirements of other readers cannot be met, so that the quality of the corresponding document is reduced, and information transmission cannot be carried out more comprehensively; in addition, the reader cannot strictly control online browsing, so that the reader account is stolen, and the reading quality is reduced;
in view of the above technical drawbacks, a solution is proposed.
Disclosure of Invention
The invention aims to provide an internet online identification and reading document reading grading management system, wherein a plurality of sub-terminals are singly matched with corresponding login terminals, so that the increase of document grading task amount caused by matching of different users with the same sub-terminal is prevented, and the document grading accuracy is reduced; the same browsing document is graded aiming at different readers, so that the readers are effectively prevented from watching disliked or unsuitable contents, the browsing quality of the document by the users is indirectly improved, and the use efficiency and frequency of the browsing platform by the readers are directly improved; the paragraphs corresponding to the screened document types of the real-time login readers are respectively shielded, so that the intelligence of the online browsing terminal is improved.
The purpose of the invention can be realized by the following technical scheme:
an internet online identification and reading document reading hierarchical management system comprises an identity registration front end, a document browsing platform and a rear end input platform; the document browsing platform comprises an online browsing terminal, a document pushing unit, a document analyzing unit and a plurality of sub-terminals; the rear-end input platform comprises a server, a document identifying and reading unit, a database and a data transmission unit;
the identity registration front end is used for acquiring reader identity information and registering, wherein the reader identity information comprises the name, age, occupation and identification number of a reader; storing the identity information of the readers successfully registered in the identity registration front end, and setting the identity information of the readers stored in the identity registration front end as the access authority of the document browsing platform;
the document browsing platform is used for browsing documents of readers who access in real time, the real-time login readers with access rights are provided with the document browsing platform, the login terminals of the real-time login readers are in communication connection with the sub-terminals, the sub-terminals correspond to the login terminals one to one, and the sub-terminals set the identity card numbers of the real-time login readers as labels, so that the sub-terminals are all in single matching with the corresponding login terminals; the method comprises the steps that a corresponding sub-terminal of a real-time login reader is in communication connection with an online browsing terminal, the real-time login reader transmits a name of a browsed document to the online browsing terminal through the corresponding sub-terminal, the online browsing terminal binds the name of the real-time browsed document, i sub-terminals with consistent names of the real-time browsed document are bound with the corresponding real-time browsed document, and the real-time browsed document and the bound i sub-terminals are sent to a document grading unit together;
the back-end input platform is used for acquiring documents provided by the document browsing platform, the server acquires historical latest update time of the database, compares the historical latest update time with current system time, and acquires the documents of the Internet in real time if the interval duration between the historical latest update time and the current system time is greater than an update time threshold.
As a preferred embodiment of the invention, the document classification unit specifically classifies the process as follows:
analyzing i sub-terminals corresponding to the real-time browsed document, wherein i is a natural number greater than or equal to 1, analyzing the sub-terminals corresponding to readers, dividing the corresponding readers into adult readers and immature readers, and acquiring unsuitable document types of the immature readers and dislike document types of the adult readers; inappropriate document types are represented as document types that do not fit into the age bracket of an underage reader; analyzing the real-time browsing document, carrying out paragraph division on the real-time browsing document, and constructing a set A { DL1, DL2, … and DLn } of paragraphs corresponding to the real-time browsing document after paragraph division, wherein DL2 represents the document content of a second paragraph in the real-time browsing document;
analyzing paragraphs corresponding to all subsets in the set A, marking the paragraphs corresponding to all the subsets as o, wherein o is a positive integer greater than 1, acquiring the corresponding occurrence frequency and times of Chinese characters H or phrases C in each paragraph, and if the occurrence frequency and times of the corresponding Chinese characters H or phrases C are respectively greater than an occurrence frequency threshold and an occurrence time threshold, judging that the corresponding Chinese characters H or phrases C are key Chinese characters or key phrases of the corresponding paragraph; judging the document type of the corresponding paragraph according to the key Chinese characters or the key phrases; corresponding paragraphs corresponding to all subsets in the set A to corresponding types one by one, if the corresponding types of the paragraphs in the set A belong to the screened document type of the real-time login reader, analyzing the length of the corresponding paragraphs, and if the length of the corresponding paragraphs is smaller than the threshold range of the length of the corresponding paragraphs, not shielding the corresponding paragraphs; if the segment length of the corresponding paragraph is within the segment length threshold range, shielding the corresponding paragraph; and if the segment length of the corresponding paragraph is larger than the segment length threshold range, marking the document to which the corresponding paragraph belongs as a non-conforming browsing document.
As a preferred embodiment of the present invention, a document pushing unit specifically pushes the following:
collecting historical browsing documents of real-time login readers, wherein the collected historical browsing documents are documents which are browsed by the real-time login readers, analyzing the historical browsing documents, collecting document types corresponding to all paragraphs in all the historical browsing documents, and if the same document type repeatedly appears in all the historical browsing documents, marking the corresponding document types as favorite document types; if the same document type does not repeatedly appear in each history browsing document, sequencing each document type in each history browsing document according to the sequence of the segment length from large to small, and marking the first sequenced document type as a favorite document type; selecting a document from a database in a back-end input platform by taking the favorite document type and the screened document type as selection conditions, collecting the document in the database, wherein the screened document type does not exist in the document types of all paragraphs in the collected document, and marking the collected document as a preselected document; acquiring the number of paragraphs corresponding to favorite document types in a preselected document, and sequencing the preselected document according to the number of paragraphs of the favorite types; analyzing the ranked pre-selected documents, acquiring browsing times and evaluation times of the pre-selected documents, if the ratio of the evaluation times to the browsing times is larger than a ratio threshold value, marking the corresponding pre-selected documents as selected documents, and sending the selected documents to an online browsing terminal; and if the ratio of the number of good comments to the number of browsing times is less than or equal to a ratio threshold value, marking the corresponding preselected document as an unselected document.
As a preferred embodiment of the present invention, the document reviewing unit specifically reviews the document as follows:
selecting five identifying and reading persons for identifying and reading the document, wherein the favorite document types of the five identifying and reading persons are not uniform, and setting the identifying and reading time and selecting any type of document from the documents collected in real time as an identifying and reading object; in the identification time, five identification persons read the identification object; after reading is finished, acquiring the number of wrongly-written characters acquired by five identifying and reading persons in the identifying and reading object, deleting repeated wrongly-written characters to acquire the actual number of wrongly-written characters, and judging that the identifying and reading object is unqualified if the actual number of wrongly-written characters is larger than or equal to the threshold value of the number of wrongly-written characters; if the actual number of the wrongly written characters is less than the threshold value of the number of the wrongly written characters, carrying out simple degree analysis on the identification and reading object; acquiring the segment length of each paragraph in the identification object, and if the segment length is greater than a segment length threshold, marking the corresponding paragraph as a long paragraph; otherwise, marking the corresponding paragraph as a short paragraph; acquiring the average number of the long paragraphs and the short paragraphs acquired by five identifying and reading persons, acquiring the ratio of the average number of the long paragraphs to the average number of the short paragraphs, and if the corresponding ratio is greater than 2, judging that the corresponding identifying and reading object is unqualified in compactness; otherwise, judging that the corresponding authentication object is qualified; and sending the qualified authentication object to a server.
Compared with the prior art, the invention has the beneficial effects that:
in the invention, a plurality of sub-terminals are singly matched with corresponding login terminals, so that the problem that different users are matched with the same sub-terminal, the document grading task amount is increased, and the document grading accuracy is reduced; the same browsing document is graded aiming at different readers, so that the readers are effectively prevented from watching disliked or unsuitable contents, the browsing quality of the document by the users is indirectly improved, and the use efficiency and frequency of the browsing platform by the readers are directly improved; the paragraphs of the screened document types corresponding to the real-time login readers are respectively shielded, so that the intelligence of the online browsing terminal is improved;
the method has the advantages that the document pushing is carried out on the real-time login readers, so that the document query time of the real-time login readers is reduced, and meanwhile, the reading quality of the readers is improved by pushing the proper document for the readers; the method has the advantages that the favorite document types of readers are collected, document pushing accuracy is improved, and the phenomenon that the pushed document is not suitable for the readers is prevented, so that reading quality of the readers is reduced.
Drawings
In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly described below, it is obvious that the drawings in the following description are only some embodiments of the present invention, and for those skilled in the art, other drawings can be obtained according to the drawings without creative efforts.
Fig. 1 is an overall schematic block diagram of the present invention.
Detailed Description
The technical solutions of the present invention will be described clearly and completely with reference to the following embodiments, and it should be understood that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
As shown in fig. 1, an internet online identification and reading document reading hierarchical management system includes an identity registration front end, a document browsing platform and a back end input platform, wherein the document browsing platform is in bidirectional communication with the identity registration front end and the back end input platform, the document browsing platform includes an online browsing terminal, a document pushing unit, a document analyzing unit and a plurality of sub-ends, and the online browsing terminal is in bidirectional communication with the document pushing unit, the document analyzing unit and the plurality of sub-ends; the rear-end input platform comprises a server, a document identifying and reading unit, a database and a data transmission unit, wherein the server is in bidirectional communication connection with the document identifying and reading unit and the database, and the database is in bidirectional communication connection with the data transmission unit;
the identity registration front end collects reader identity information and registers the reader identity information, wherein the reader identity information comprises the name, age, occupation and identification number of a reader; storing the identity information of the readers successfully registered in the identity registration front end, and setting the identity information of the readers stored in the identity registration front end as the access authority of the document browsing platform;
the document browsing platform is used for browsing documents of readers accessed in real time, real-time login readers with access rights are provided with the document browsing platform, login terminals of the real-time login readers are in communication connection with the sub-terminals, the sub-terminals correspond to the login terminals one to one, and the sub-terminals set identity card numbers of the real-time login readers as labels, so that the sub-terminals are in single matching with the corresponding login terminals, different users are prevented from being matched with the same sub-terminal, the document grading task amount is prevented from being increased, and the document grading accuracy is reduced;
the method comprises the steps that a corresponding sub-terminal of a real-time login reader is in communication connection with an online browsing terminal, the real-time login reader transmits a name of a browsed document to the online browsing terminal through the corresponding sub-terminal, the online browsing terminal binds the name of the real-time browsed document, i sub-terminals with consistent names of the real-time browsed document are bound with the corresponding real-time browsed document, and the real-time browsed document and the bound i sub-terminals are sent to a document grading unit together;
the document grading unit is used for grading the same browsed document according to different readers, and effectively avoids the readers from watching disliked or unfit contents, so that the browsing quality of the user to the document is indirectly improved, the use efficiency and frequency of the readers to a browsing platform are also directly improved, and the specific grading process is as follows:
step S1: analyzing i sub-terminals corresponding to the real-time browsed document, wherein i is a natural number greater than or equal to 1, analyzing the sub-terminals corresponding to readers, dividing the corresponding readers into adult readers and immature readers, and acquiring unsuitable document types of the immature readers and dislike document types of the adult readers; inappropriate document types are represented as document types that do not fit into the age bracket of an underage reader; the dislike document type table is the document type corresponding to the reading document with bad history of the adult readers; marking the dislike document type and the unsuitable document type as a screening document type;
step S2: analyzing the real-time browsing document, carrying out paragraph division on the real-time browsing document, and constructing a set A { DL1, DL2, … and DLn } of paragraphs corresponding to the real-time browsing document after paragraph division, wherein DL2 represents the document content of a second paragraph in the real-time browsing document;
analyzing paragraphs corresponding to all subsets in the set A, marking the paragraphs corresponding to all the subsets as o, wherein o is a positive integer greater than 1, acquiring the corresponding occurrence frequency and times of Chinese characters H or phrases C in each paragraph, and if the occurrence frequency and times of the corresponding Chinese characters H or phrases C are respectively greater than an occurrence frequency threshold and an occurrence time threshold, judging that the corresponding Chinese characters H or phrases C are key Chinese characters or key phrases of the corresponding paragraph; judging the document type of the corresponding paragraph according to the key Chinese characters or the key word groups, if the document type is determined in the prior art, if the key Chinese characters of the corresponding document are love, judging the corresponding document type to be love type; if the keyword group of the corresponding document is a war, judging that the corresponding document type is a war type;
step S3: corresponding paragraphs corresponding to all subsets in the set A to corresponding types one by one, if the corresponding types of the paragraphs in the set A belong to the screened document type of the real-time login reader, analyzing the length of the corresponding paragraphs, and if the length of the corresponding paragraphs is smaller than the threshold range of the length of the corresponding paragraphs, not shielding the corresponding paragraphs; if the segment length of the corresponding paragraph is within the segment length threshold range, shielding the corresponding paragraph; if the segment length of the corresponding paragraph is larger than the segment length threshold range, marking the document to which the corresponding paragraph belongs as a non-conforming browsing document; the method has the advantages that the paragraphs of the screened document types corresponding to the real-time login readers are respectively shielded, the intelligence of the online browsing terminal is improved, the same document is classified according to different readers, the readers are effectively prevented from browsing the disliked content, the document quality in the document browsing platform is improved, and the service quality of the readers is enhanced;
the real-time login reader completes browsing to the real-time browsing document and the communication connection between the corresponding sub-terminal and the online browsing terminal is not disconnected, then the online browsing terminal generates a document pushing signal and sends the document pushing signal to a document pushing unit, the document pushing unit is used for carrying out document pushing on the real-time login reader, the document query time of the real-time login reader is reduced, meanwhile, the reading quality of the reader is improved for the reader to push a proper document, and the specific pushing process is as follows:
collecting historical browsing documents of real-time login readers, wherein the collected historical browsing documents are documents which are browsed by the real-time login readers, analyzing the historical browsing documents, collecting document types corresponding to all paragraphs in all the historical browsing documents, and if the same document type repeatedly appears in all the historical browsing documents, marking the corresponding document types as favorite document types; if the same document type does not repeatedly appear in each history browsing document, sequencing each document type in each history browsing document according to the sequence of the segment length from large to small, and marking the first sequenced document type as a favorite document type; the method has the advantages that the favorite document types of readers are collected, so that the document pushing accuracy is improved, and the phenomenon that the pushed document is not suitable for the readers is prevented, and the reading quality of the readers is reduced;
selecting a document from a database in a back-end input platform by taking the favorite document type and the screened document type as selection conditions, collecting the document in the database, wherein the screened document type does not exist in the document types of all paragraphs in the collected document, and marking the collected document as a preselected document; acquiring the number of paragraphs corresponding to favorite document types in a preselected document, and sequencing the preselected document according to the number of paragraphs of the favorite types; analyzing the ranked pre-selected documents, acquiring browsing times and evaluation times of the pre-selected documents, if the ratio of the evaluation times to the browsing times is larger than a ratio threshold value, marking the corresponding pre-selected documents as selected documents, and sending the selected documents to an online browsing terminal; if the ratio of the number of good comments to the number of browsing times is less than or equal to a ratio threshold value, marking the corresponding preselected document as an unselected document;
the back-end input platform is used for acquiring documents provided by the document browsing platform, the server acquires the historical latest update time of the database and compares the historical latest update time with the current system time, if the interval duration of the historical latest update time and the current system time is greater than the update time threshold, the documents of the Internet are acquired in real time, and the documents acquired in real time are sent to the document identifying and reading unit; enhancing document browsing platform
The document identifying and reading unit is used for identifying and reading documents collected in real time, so that the quality of the documents is improved, unqualified documents are prevented from being transmitted to the document browsing platform, the reading quality of readers is reduced, the document storage space is wasted, and the specific identifying and reading process is as follows:
step SS 1: selecting five identifying and reading persons for identifying and reading the document, wherein the favorite document types of the five identifying and reading persons are not uniform, and setting the identifying and reading time and selecting any type of document from the documents collected in real time as an identifying and reading object; in the identification time, five identification persons read the identification object;
step SS 2: after reading is finished, acquiring the number of wrongly-written characters acquired by five identifying and reading persons in the identifying and reading object, deleting repeated wrongly-written characters to acquire the actual number of wrongly-written characters, and judging that the identifying and reading object is unqualified if the actual number of wrongly-written characters is larger than or equal to the threshold value of the number of wrongly-written characters; if the actual number of the wrongly written characters is less than the threshold value of the number of the wrongly written characters, carrying out simple degree analysis on the identification and reading object;
step SS 3: acquiring the segment length of each paragraph in the identification object, and if the segment length is greater than a segment length threshold, marking the corresponding paragraph as a long paragraph; otherwise, marking the corresponding paragraph as a short paragraph; acquiring the average number of the long paragraphs and the short paragraphs acquired by five identifying and reading persons, acquiring the ratio of the average number of the long paragraphs to the average number of the short paragraphs, and if the corresponding ratio is greater than 2, judging that the corresponding identifying and reading object is unqualified in compactness; otherwise, judging that the corresponding authentication object is qualified; sending the qualified reference object to a server; analyzing the identification and reading object to prevent the reading quality of readers from being reduced due to wrongly written characters or unsmooth documents;
the server receives the input authentication and reading objects and then sends the input authentication and reading objects to the database for storage, the database counts the input authentication and reading objects, and when the number of the input authentication and reading objects is larger than or equal to the number threshold value, the input authentication and reading objects in the database are conveyed to the document browsing platform through the data conveying unit.
The working process of the invention is as follows:
an internet online identification reading document reading hierarchical management system collects reader identity information and registers through an identity registration front end when working, wherein the reader identity information comprises the name, age, occupation and identification number of a reader; storing the identity information of the readers successfully registered in the identity registration front end, and setting the identity information of the readers stored in the identity registration front end as the access authority of the document browsing platform; the method comprises the steps that a reader who accesses in real time through a document browsing platform browses documents, a real-time login reader with access right conducts the document browsing platform, a login terminal of the real-time login reader is in communication connection with sub-terminals, the sub-terminals correspond to the login terminal one by one, and the sub-terminals set identity card numbers of the real-time login reader as tags, so that a plurality of sub-terminals are all in single matching with the corresponding login terminal; the method comprises the steps that a corresponding sub-terminal of a real-time login reader is in communication connection with an online browsing terminal, the real-time login reader transmits a name of a browsed document to the online browsing terminal through the corresponding sub-terminal, the online browsing terminal binds the name of the real-time browsed document, i sub-terminals with consistent names of the real-time browsed document are bound with the corresponding real-time browsed document, and the real-time browsed document and the bound i sub-terminals are sent to a document grading unit together; the method comprises the steps that documents are provided for a document browsing platform through a rear-end input platform to be obtained, a server collects historical latest updating time of a database, compares the historical latest updating time with current system time, and collects the documents of the Internet in real time if the interval duration between the historical latest updating time and the current system time is larger than an updating time threshold.
The above formulas are all calculated by taking the numerical value of the dimension, the formula is a formula which obtains the latest real situation by acquiring a large amount of data and performing software simulation, and the preset parameters in the formula are set by the technical personnel in the field according to the actual situation.
The foregoing is merely exemplary and illustrative of the present invention and various modifications, additions and substitutions may be made by those skilled in the art to the specific embodiments described without departing from the scope of the invention as defined in the following claims.
Claims (3)
1. An internet online identification and reading document reading hierarchical management system is characterized by comprising an identity registration front end, a document browsing platform and a rear end input platform; the document browsing platform comprises an online browsing terminal, a document pushing unit, a document analyzing unit and a plurality of sub-terminals; the rear-end input platform comprises a server, a document identifying and reading unit, a database and a data transmission unit;
the identity registration front end is used for acquiring reader identity information and registering, wherein the reader identity information comprises the name, age, occupation and identification number of a reader; storing the identity information of the readers successfully registered in the identity registration front end, and setting the identity information of the readers stored in the identity registration front end as the access authority of the document browsing platform;
the document browsing platform is used for browsing documents of readers who access in real time, the real-time login readers with access rights are provided with the document browsing platform, the login terminals of the real-time login readers are in communication connection with the sub-terminals, the sub-terminals correspond to the login terminals one to one, and the sub-terminals set the identity card numbers of the real-time login readers as labels, so that the sub-terminals are all in single matching with the corresponding login terminals; the method comprises the steps that a corresponding sub-terminal of a real-time login reader is in communication connection with an online browsing terminal, the real-time login reader transmits a name of a browsed document to the online browsing terminal through the corresponding sub-terminal, the online browsing terminal binds the name of the real-time browsed document, i sub-terminals with consistent names of the real-time browsed document are bound with the corresponding real-time browsed document, and the real-time browsed document and the bound i sub-terminals are sent to a document grading unit together;
the back-end input platform is used for acquiring documents provided by the document browsing platform, the server acquires the historical latest update time of the database, compares the historical latest update time with the current system time, and acquires the documents of the Internet in real time if the interval duration between the historical latest update time and the current system time is greater than the update time threshold;
the document grading unit specifically grades as follows:
analyzing i sub-terminals corresponding to the real-time browsed document, wherein i is a natural number greater than or equal to 1, analyzing the sub-terminals corresponding to readers, dividing the corresponding readers into adult readers and immature readers, and acquiring unsuitable document types of the immature readers and dislike document types of the adult readers; inappropriate document types are represented as document types that do not fit into the age bracket of an underage reader; analyzing the real-time browsing document, carrying out paragraph division on the real-time browsing document, and constructing a set A { DL1, DL2, … and DLn } of paragraphs corresponding to the real-time browsing document after paragraph division, wherein DL2 represents the document content of a second paragraph in the real-time browsing document;
analyzing paragraphs corresponding to all subsets in the set A, marking the paragraphs corresponding to all the subsets as o, wherein o is a positive integer greater than 1, acquiring the corresponding occurrence frequency and times of Chinese characters H or phrases C in each paragraph, and if the occurrence frequency and times of the corresponding Chinese characters H or phrases C are respectively greater than an occurrence frequency threshold and an occurrence time threshold, judging that the corresponding Chinese characters H or phrases C are key Chinese characters or key phrases of the corresponding paragraph; judging the document type of the corresponding paragraph according to the key Chinese characters or the key phrases; corresponding paragraphs corresponding to all subsets in the set A to corresponding types one by one, if the corresponding types of the paragraphs in the set A belong to the screened document type of the real-time login reader, analyzing the length of the corresponding paragraphs, and if the length of the corresponding paragraphs is smaller than the threshold range of the length of the corresponding paragraphs, not shielding the corresponding paragraphs; if the segment length of the corresponding paragraph is within the segment length threshold range, shielding the corresponding paragraph; and if the segment length of the corresponding paragraph is larger than the segment length threshold range, marking the document to which the corresponding paragraph belongs as a non-conforming browsing document.
2. The internet online appraisal and reading document reading hierarchical management system according to claim 1, wherein the document pushing unit specifically pushes the following:
collecting historical browsing documents of real-time login readers, wherein the collected historical browsing documents are documents which are browsed by the real-time login readers, analyzing the historical browsing documents, collecting document types corresponding to all paragraphs in all the historical browsing documents, and if the same document type repeatedly appears in all the historical browsing documents, marking the corresponding document types as favorite document types; if the same document type does not repeatedly appear in each history browsing document, sequencing each document type in each history browsing document according to the sequence of the segment length from large to small, and marking the first sequenced document type as a favorite document type; selecting a document from a database in a back-end input platform by taking the favorite document type and the screened document type as selection conditions, collecting the document in the database, wherein the screened document type does not exist in the document types of all paragraphs in the collected document, and marking the collected document as a preselected document; acquiring the number of paragraphs corresponding to favorite document types in a preselected document, and sequencing the preselected document according to the number of paragraphs of the favorite types; analyzing the ranked pre-selected documents, acquiring browsing times and evaluation times of the pre-selected documents, if the ratio of the evaluation times to the browsing times is larger than a ratio threshold value, marking the corresponding pre-selected documents as selected documents, and sending the selected documents to an online browsing terminal; and if the ratio of the number of good comments to the number of browsing times is less than or equal to a ratio threshold value, marking the corresponding preselected document as an unselected document.
3. The internet online identification document reading hierarchical management system according to claim 1, wherein the document identification unit specifically identifies the following processes:
selecting five identifying and reading persons for identifying and reading the document, wherein the favorite document types of the five identifying and reading persons are not uniform, and setting the identifying and reading time and selecting any type of document from the documents collected in real time as an identifying and reading object; in the identification time, five identification persons read the identification object; after reading is finished, acquiring the number of wrongly-written characters acquired by five identifying and reading persons in the identifying and reading object, deleting repeated wrongly-written characters to acquire the actual number of wrongly-written characters, and judging that the identifying and reading object is unqualified if the actual number of wrongly-written characters is larger than or equal to the threshold value of the number of wrongly-written characters; if the actual number of the wrongly written characters is less than the threshold value of the number of the wrongly written characters, carrying out simple degree analysis on the identification and reading object; acquiring the segment length of each paragraph in the identification object, and if the segment length is greater than a segment length threshold, marking the corresponding paragraph as a long paragraph; otherwise, marking the corresponding paragraph as a short paragraph; acquiring the average number of the long paragraphs and the short paragraphs acquired by five identifying and reading persons, acquiring the ratio of the average number of the long paragraphs to the average number of the short paragraphs, and if the corresponding ratio is greater than 2, judging that the corresponding identifying and reading object is unqualified in compactness; otherwise, judging that the corresponding authentication object is qualified; and sending the qualified authentication object to a server.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110846295.4A CN113486247B (en) | 2021-07-26 | 2021-07-26 | Internet online identification and reading document reading hierarchical management system |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110846295.4A CN113486247B (en) | 2021-07-26 | 2021-07-26 | Internet online identification and reading document reading hierarchical management system |
Publications (2)
Publication Number | Publication Date |
---|---|
CN113486247A CN113486247A (en) | 2021-10-08 |
CN113486247B true CN113486247B (en) | 2022-02-01 |
Family
ID=77942789
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202110846295.4A Active CN113486247B (en) | 2021-07-26 | 2021-07-26 | Internet online identification and reading document reading hierarchical management system |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN113486247B (en) |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN115618278B (en) * | 2022-11-23 | 2023-03-10 | 万链指数(青岛)信息科技有限公司 | Classification method for digital model generated data |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101078971A (en) * | 2002-03-19 | 2007-11-28 | 电子图书系统有限公司 | Method and system for tracking electronic book reading pattern |
JP2009070120A (en) * | 2007-09-13 | 2009-04-02 | Coh Inc | Content providing system |
CN102045388A (en) * | 2010-11-25 | 2011-05-04 | 汉王科技股份有限公司 | Online reading device and method |
CN102214246A (en) * | 2011-07-18 | 2011-10-12 | 南京大学 | Method for grading Chinese electronic document reading on the Internet |
CN108280361A (en) * | 2017-01-05 | 2018-07-13 | 珠海金山办公软件有限公司 | A kind of authority classification management method and device |
CN110795753A (en) * | 2019-11-08 | 2020-02-14 | 深圳市理约云信息管理有限公司 | File security protection system, file security sharing method and security reading method |
Family Cites Families (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20060282413A1 (en) * | 2005-06-03 | 2006-12-14 | Bondi Victor J | System and method for a search engine using reading grade level analysis |
CN103186565B (en) * | 2011-12-28 | 2017-02-22 | 中国移动通信集团浙江有限公司 | Method and device for judging user preference according to web browsing behavior of user |
US8977687B2 (en) * | 2012-11-12 | 2015-03-10 | Linkedin Corporation | Techniques for enhancing a member profile with a document reading history |
CN110609814A (en) * | 2019-09-26 | 2019-12-24 | 珠海格力电器股份有限公司 | Document online browsing method, storage medium and system |
CN111680235A (en) * | 2020-05-19 | 2020-09-18 | 南京数娱天下网络科技有限公司 | Online reading system and method based on cloud computing |
CN112100587A (en) * | 2020-09-04 | 2020-12-18 | 江苏泽航创业投资有限公司 | Confidential document reading method, software system, UE (user Equipment) equipment, server and system |
-
2021
- 2021-07-26 CN CN202110846295.4A patent/CN113486247B/en active Active
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101078971A (en) * | 2002-03-19 | 2007-11-28 | 电子图书系统有限公司 | Method and system for tracking electronic book reading pattern |
JP2009070120A (en) * | 2007-09-13 | 2009-04-02 | Coh Inc | Content providing system |
CN102045388A (en) * | 2010-11-25 | 2011-05-04 | 汉王科技股份有限公司 | Online reading device and method |
CN102214246A (en) * | 2011-07-18 | 2011-10-12 | 南京大学 | Method for grading Chinese electronic document reading on the Internet |
CN108280361A (en) * | 2017-01-05 | 2018-07-13 | 珠海金山办公软件有限公司 | A kind of authority classification management method and device |
CN110795753A (en) * | 2019-11-08 | 2020-02-14 | 深圳市理约云信息管理有限公司 | File security protection system, file security sharing method and security reading method |
Non-Patent Citations (3)
Title |
---|
Book Recommender System for Wikipedia Article Readers in a University Library;Keita Tsuji;《IEEE》;20200213;第121-126页 * |
基于河南省公共图书馆儿童分级阅读服务的研究与策略;王靖雯;《河南图书馆学刊》;20180731;第132-135页 * |
高校图书馆读者借阅权限影响因素略谈;董利军;《内蒙古教育(职教版)》;20150430;第12页 * |
Also Published As
Publication number | Publication date |
---|---|
CN113486247A (en) | 2021-10-08 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN110543598B (en) | Information recommendation method and device and terminal | |
CN102483745B (en) | Co-selected image classification | |
CN111898031B (en) | Method and device for obtaining user portrait | |
CN108363821A (en) | A kind of information-pushing method, device, terminal device and storage medium | |
CN114707074B (en) | Content recommendation method, device and system | |
CN109165975B (en) | Label recommending method, device, computer equipment and storage medium | |
CN113297457B (en) | High-precision intelligent information resource pushing system and pushing method | |
CN110737821B (en) | Similar event query method, device, storage medium and terminal equipment | |
CN114238573B (en) | Text countercheck sample-based information pushing method and device | |
CN112632405A (en) | Recommendation method, device, equipment and storage medium | |
CN113688311A (en) | Information recommendation method, device and equipment based on data interaction and storage medium | |
CN114371946B (en) | Information push method and information push server based on cloud computing and big data | |
CN113486247B (en) | Internet online identification and reading document reading hierarchical management system | |
WO2023024408A1 (en) | Method for determining feature vector of user, and related device and medium | |
CN113099260B (en) | Live broadcast processing method, live broadcast platform, system, medium and electronic device | |
CN112269906B (en) | Automatic extraction method and device of webpage text | |
CN116452212B (en) | Intelligent customer service commodity knowledge base information management method and system | |
CN117520503A (en) | Financial customer service dialogue generation method, device, equipment and medium based on LLM model | |
EP4357942A1 (en) | Information processing method and apparatus based on data exchange, and device and storage medium | |
CN111428041A (en) | Case abstract generation method, device, system and storage medium | |
CN112328752B (en) | Course recommendation method and device based on search content, computer equipment and medium | |
CN112182390B (en) | Mail pushing method, device, computer equipment and storage medium | |
CN114550157A (en) | Bullet screen gathering identification method and device | |
CN113420018A (en) | User behavior data analysis method, device, equipment and storage medium | |
CN113704601A (en) | Information interaction method, device, equipment and storage medium |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant | ||
TR01 | Transfer of patent right | ||
TR01 | Transfer of patent right |
Effective date of registration: 20230619 Address after: 410000 houses in the southwest corner of the intersection of Renmin East Road and Xiaokang Road, Changsha Airport Economic Demonstration Zone, Huanghua Town, Changsha County, Changsha City, Hunan Province Patentee after: CHINA UNICOM WOYUEDU TECHNOLOGY CULTURE Co.,Ltd. Address before: 518000 f6-021-c, Hedong building, Haoyunlai Plaza, Hedong community, Xixiang street, Bao'an District, Shenzhen City, Guangdong Province Patentee before: Shenzhen Zhiku Information Technology Co.,Ltd. |