CN113486247B - Internet online identification and reading document reading hierarchical management system - Google Patents

Internet online identification and reading document reading hierarchical management system Download PDF

Info

Publication number
CN113486247B
CN113486247B CN202110846295.4A CN202110846295A CN113486247B CN 113486247 B CN113486247 B CN 113486247B CN 202110846295 A CN202110846295 A CN 202110846295A CN 113486247 B CN113486247 B CN 113486247B
Authority
CN
China
Prior art keywords
document
browsing
time
real
reading
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202110846295.4A
Other languages
Chinese (zh)
Other versions
CN113486247A (en
Inventor
余东燕
黄钧
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Unicom Woyuedu Technology Culture Co Ltd
Original Assignee
Shenzhen Zhiku Information Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shenzhen Zhiku Information Technology Co ltd filed Critical Shenzhen Zhiku Information Technology Co ltd
Priority to CN202110846295.4A priority Critical patent/CN113486247B/en
Publication of CN113486247A publication Critical patent/CN113486247A/en
Application granted granted Critical
Publication of CN113486247B publication Critical patent/CN113486247B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9535Search customisation based on user profiles and personalisation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F21/00Security arrangements for protecting computers, components thereof, programs or data against unauthorised activity
    • G06F21/30Authentication, i.e. establishing the identity or authorisation of security principals
    • G06F21/31User authentication
    • G06F21/32User authentication using biometric data, e.g. fingerprints, iris scans or voiceprints
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/50Network services
    • H04L67/55Push-based network services

Abstract

The invention discloses an internet online identification and reading document reading hierarchical management system, which relates to the technical field of document reading hierarchical management and solves the technical problem that the same online document cannot be subjected to hierarchical processing aiming at different readers in the prior art, and a plurality of sub-terminals are singly matched with corresponding login terminals, so that the situation that different users are matched with the same sub-terminal is prevented, the document grading task amount is increased, and the document grading accuracy is reduced; the same browsing document is graded aiming at different readers, so that the readers are effectively prevented from watching disliked or unsuitable contents, the browsing quality of the document by the users is indirectly improved, and the use efficiency and frequency of the browsing platform by the readers are directly improved; and the document is pushed to the real-time login reader, so that the document query time of the real-time login reader is reduced, and the reading quality of the reader is improved by pushing a proper document for the reader.

Description

Internet online identification and reading document reading hierarchical management system
Technical Field
The invention relates to the technical field of document reading hierarchical management, in particular to an internet online document reading hierarchical management system.
Background
For children in a growth period, the cognitive abilities of the children are limited, in order to enable the children to better accept different things, the importance of graded reading is also shown, in children in the period of children, initial knowledge of the children on the surroundings can be established through some simple words and pictures, meanwhile, according to simple reading materials, a small game such as a story telling game can be used for enhancing the confidence of the children obtained by reading and improving the interest of the children in reading, but in the era of rapid development of the internet, the fact that the children are prevented from being contacted with inappropriate information is crucial;
however, in the prior art, the same online document cannot be graded for different readers, and the disliked content is shielded, and if the disliked content is deleted, the reading requirements of other readers cannot be met, so that the quality of the corresponding document is reduced, and information transmission cannot be carried out more comprehensively; in addition, the reader cannot strictly control online browsing, so that the reader account is stolen, and the reading quality is reduced;
in view of the above technical drawbacks, a solution is proposed.
Disclosure of Invention
The invention aims to provide an internet online identification and reading document reading grading management system, wherein a plurality of sub-terminals are singly matched with corresponding login terminals, so that the increase of document grading task amount caused by matching of different users with the same sub-terminal is prevented, and the document grading accuracy is reduced; the same browsing document is graded aiming at different readers, so that the readers are effectively prevented from watching disliked or unsuitable contents, the browsing quality of the document by the users is indirectly improved, and the use efficiency and frequency of the browsing platform by the readers are directly improved; the paragraphs corresponding to the screened document types of the real-time login readers are respectively shielded, so that the intelligence of the online browsing terminal is improved.
The purpose of the invention can be realized by the following technical scheme:
an internet online identification and reading document reading hierarchical management system comprises an identity registration front end, a document browsing platform and a rear end input platform; the document browsing platform comprises an online browsing terminal, a document pushing unit, a document analyzing unit and a plurality of sub-terminals; the rear-end input platform comprises a server, a document identifying and reading unit, a database and a data transmission unit;
the identity registration front end is used for acquiring reader identity information and registering, wherein the reader identity information comprises the name, age, occupation and identification number of a reader; storing the identity information of the readers successfully registered in the identity registration front end, and setting the identity information of the readers stored in the identity registration front end as the access authority of the document browsing platform;
the document browsing platform is used for browsing documents of readers who access in real time, the real-time login readers with access rights are provided with the document browsing platform, the login terminals of the real-time login readers are in communication connection with the sub-terminals, the sub-terminals correspond to the login terminals one to one, and the sub-terminals set the identity card numbers of the real-time login readers as labels, so that the sub-terminals are all in single matching with the corresponding login terminals; the method comprises the steps that a corresponding sub-terminal of a real-time login reader is in communication connection with an online browsing terminal, the real-time login reader transmits a name of a browsed document to the online browsing terminal through the corresponding sub-terminal, the online browsing terminal binds the name of the real-time browsed document, i sub-terminals with consistent names of the real-time browsed document are bound with the corresponding real-time browsed document, and the real-time browsed document and the bound i sub-terminals are sent to a document grading unit together;
the back-end input platform is used for acquiring documents provided by the document browsing platform, the server acquires historical latest update time of the database, compares the historical latest update time with current system time, and acquires the documents of the Internet in real time if the interval duration between the historical latest update time and the current system time is greater than an update time threshold.
As a preferred embodiment of the invention, the document classification unit specifically classifies the process as follows:
analyzing i sub-terminals corresponding to the real-time browsed document, wherein i is a natural number greater than or equal to 1, analyzing the sub-terminals corresponding to readers, dividing the corresponding readers into adult readers and immature readers, and acquiring unsuitable document types of the immature readers and dislike document types of the adult readers; inappropriate document types are represented as document types that do not fit into the age bracket of an underage reader; analyzing the real-time browsing document, carrying out paragraph division on the real-time browsing document, and constructing a set A { DL1, DL2, … and DLn } of paragraphs corresponding to the real-time browsing document after paragraph division, wherein DL2 represents the document content of a second paragraph in the real-time browsing document;
analyzing paragraphs corresponding to all subsets in the set A, marking the paragraphs corresponding to all the subsets as o, wherein o is a positive integer greater than 1, acquiring the corresponding occurrence frequency and times of Chinese characters H or phrases C in each paragraph, and if the occurrence frequency and times of the corresponding Chinese characters H or phrases C are respectively greater than an occurrence frequency threshold and an occurrence time threshold, judging that the corresponding Chinese characters H or phrases C are key Chinese characters or key phrases of the corresponding paragraph; judging the document type of the corresponding paragraph according to the key Chinese characters or the key phrases; corresponding paragraphs corresponding to all subsets in the set A to corresponding types one by one, if the corresponding types of the paragraphs in the set A belong to the screened document type of the real-time login reader, analyzing the length of the corresponding paragraphs, and if the length of the corresponding paragraphs is smaller than the threshold range of the length of the corresponding paragraphs, not shielding the corresponding paragraphs; if the segment length of the corresponding paragraph is within the segment length threshold range, shielding the corresponding paragraph; and if the segment length of the corresponding paragraph is larger than the segment length threshold range, marking the document to which the corresponding paragraph belongs as a non-conforming browsing document.
As a preferred embodiment of the present invention, a document pushing unit specifically pushes the following:
collecting historical browsing documents of real-time login readers, wherein the collected historical browsing documents are documents which are browsed by the real-time login readers, analyzing the historical browsing documents, collecting document types corresponding to all paragraphs in all the historical browsing documents, and if the same document type repeatedly appears in all the historical browsing documents, marking the corresponding document types as favorite document types; if the same document type does not repeatedly appear in each history browsing document, sequencing each document type in each history browsing document according to the sequence of the segment length from large to small, and marking the first sequenced document type as a favorite document type; selecting a document from a database in a back-end input platform by taking the favorite document type and the screened document type as selection conditions, collecting the document in the database, wherein the screened document type does not exist in the document types of all paragraphs in the collected document, and marking the collected document as a preselected document; acquiring the number of paragraphs corresponding to favorite document types in a preselected document, and sequencing the preselected document according to the number of paragraphs of the favorite types; analyzing the ranked pre-selected documents, acquiring browsing times and evaluation times of the pre-selected documents, if the ratio of the evaluation times to the browsing times is larger than a ratio threshold value, marking the corresponding pre-selected documents as selected documents, and sending the selected documents to an online browsing terminal; and if the ratio of the number of good comments to the number of browsing times is less than or equal to a ratio threshold value, marking the corresponding preselected document as an unselected document.
As a preferred embodiment of the present invention, the document reviewing unit specifically reviews the document as follows:
selecting five identifying and reading persons for identifying and reading the document, wherein the favorite document types of the five identifying and reading persons are not uniform, and setting the identifying and reading time and selecting any type of document from the documents collected in real time as an identifying and reading object; in the identification time, five identification persons read the identification object; after reading is finished, acquiring the number of wrongly-written characters acquired by five identifying and reading persons in the identifying and reading object, deleting repeated wrongly-written characters to acquire the actual number of wrongly-written characters, and judging that the identifying and reading object is unqualified if the actual number of wrongly-written characters is larger than or equal to the threshold value of the number of wrongly-written characters; if the actual number of the wrongly written characters is less than the threshold value of the number of the wrongly written characters, carrying out simple degree analysis on the identification and reading object; acquiring the segment length of each paragraph in the identification object, and if the segment length is greater than a segment length threshold, marking the corresponding paragraph as a long paragraph; otherwise, marking the corresponding paragraph as a short paragraph; acquiring the average number of the long paragraphs and the short paragraphs acquired by five identifying and reading persons, acquiring the ratio of the average number of the long paragraphs to the average number of the short paragraphs, and if the corresponding ratio is greater than 2, judging that the corresponding identifying and reading object is unqualified in compactness; otherwise, judging that the corresponding authentication object is qualified; and sending the qualified authentication object to a server.
Compared with the prior art, the invention has the beneficial effects that:
in the invention, a plurality of sub-terminals are singly matched with corresponding login terminals, so that the problem that different users are matched with the same sub-terminal, the document grading task amount is increased, and the document grading accuracy is reduced; the same browsing document is graded aiming at different readers, so that the readers are effectively prevented from watching disliked or unsuitable contents, the browsing quality of the document by the users is indirectly improved, and the use efficiency and frequency of the browsing platform by the readers are directly improved; the paragraphs of the screened document types corresponding to the real-time login readers are respectively shielded, so that the intelligence of the online browsing terminal is improved;
the method has the advantages that the document pushing is carried out on the real-time login readers, so that the document query time of the real-time login readers is reduced, and meanwhile, the reading quality of the readers is improved by pushing the proper document for the readers; the method has the advantages that the favorite document types of readers are collected, document pushing accuracy is improved, and the phenomenon that the pushed document is not suitable for the readers is prevented, so that reading quality of the readers is reduced.
Drawings
In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly described below, it is obvious that the drawings in the following description are only some embodiments of the present invention, and for those skilled in the art, other drawings can be obtained according to the drawings without creative efforts.
Fig. 1 is an overall schematic block diagram of the present invention.
Detailed Description
The technical solutions of the present invention will be described clearly and completely with reference to the following embodiments, and it should be understood that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
As shown in fig. 1, an internet online identification and reading document reading hierarchical management system includes an identity registration front end, a document browsing platform and a back end input platform, wherein the document browsing platform is in bidirectional communication with the identity registration front end and the back end input platform, the document browsing platform includes an online browsing terminal, a document pushing unit, a document analyzing unit and a plurality of sub-ends, and the online browsing terminal is in bidirectional communication with the document pushing unit, the document analyzing unit and the plurality of sub-ends; the rear-end input platform comprises a server, a document identifying and reading unit, a database and a data transmission unit, wherein the server is in bidirectional communication connection with the document identifying and reading unit and the database, and the database is in bidirectional communication connection with the data transmission unit;
the identity registration front end collects reader identity information and registers the reader identity information, wherein the reader identity information comprises the name, age, occupation and identification number of a reader; storing the identity information of the readers successfully registered in the identity registration front end, and setting the identity information of the readers stored in the identity registration front end as the access authority of the document browsing platform;
the document browsing platform is used for browsing documents of readers accessed in real time, real-time login readers with access rights are provided with the document browsing platform, login terminals of the real-time login readers are in communication connection with the sub-terminals, the sub-terminals correspond to the login terminals one to one, and the sub-terminals set identity card numbers of the real-time login readers as labels, so that the sub-terminals are in single matching with the corresponding login terminals, different users are prevented from being matched with the same sub-terminal, the document grading task amount is prevented from being increased, and the document grading accuracy is reduced;
the method comprises the steps that a corresponding sub-terminal of a real-time login reader is in communication connection with an online browsing terminal, the real-time login reader transmits a name of a browsed document to the online browsing terminal through the corresponding sub-terminal, the online browsing terminal binds the name of the real-time browsed document, i sub-terminals with consistent names of the real-time browsed document are bound with the corresponding real-time browsed document, and the real-time browsed document and the bound i sub-terminals are sent to a document grading unit together;
the document grading unit is used for grading the same browsed document according to different readers, and effectively avoids the readers from watching disliked or unfit contents, so that the browsing quality of the user to the document is indirectly improved, the use efficiency and frequency of the readers to a browsing platform are also directly improved, and the specific grading process is as follows:
step S1: analyzing i sub-terminals corresponding to the real-time browsed document, wherein i is a natural number greater than or equal to 1, analyzing the sub-terminals corresponding to readers, dividing the corresponding readers into adult readers and immature readers, and acquiring unsuitable document types of the immature readers and dislike document types of the adult readers; inappropriate document types are represented as document types that do not fit into the age bracket of an underage reader; the dislike document type table is the document type corresponding to the reading document with bad history of the adult readers; marking the dislike document type and the unsuitable document type as a screening document type;
step S2: analyzing the real-time browsing document, carrying out paragraph division on the real-time browsing document, and constructing a set A { DL1, DL2, … and DLn } of paragraphs corresponding to the real-time browsing document after paragraph division, wherein DL2 represents the document content of a second paragraph in the real-time browsing document;
analyzing paragraphs corresponding to all subsets in the set A, marking the paragraphs corresponding to all the subsets as o, wherein o is a positive integer greater than 1, acquiring the corresponding occurrence frequency and times of Chinese characters H or phrases C in each paragraph, and if the occurrence frequency and times of the corresponding Chinese characters H or phrases C are respectively greater than an occurrence frequency threshold and an occurrence time threshold, judging that the corresponding Chinese characters H or phrases C are key Chinese characters or key phrases of the corresponding paragraph; judging the document type of the corresponding paragraph according to the key Chinese characters or the key word groups, if the document type is determined in the prior art, if the key Chinese characters of the corresponding document are love, judging the corresponding document type to be love type; if the keyword group of the corresponding document is a war, judging that the corresponding document type is a war type;
step S3: corresponding paragraphs corresponding to all subsets in the set A to corresponding types one by one, if the corresponding types of the paragraphs in the set A belong to the screened document type of the real-time login reader, analyzing the length of the corresponding paragraphs, and if the length of the corresponding paragraphs is smaller than the threshold range of the length of the corresponding paragraphs, not shielding the corresponding paragraphs; if the segment length of the corresponding paragraph is within the segment length threshold range, shielding the corresponding paragraph; if the segment length of the corresponding paragraph is larger than the segment length threshold range, marking the document to which the corresponding paragraph belongs as a non-conforming browsing document; the method has the advantages that the paragraphs of the screened document types corresponding to the real-time login readers are respectively shielded, the intelligence of the online browsing terminal is improved, the same document is classified according to different readers, the readers are effectively prevented from browsing the disliked content, the document quality in the document browsing platform is improved, and the service quality of the readers is enhanced;
the real-time login reader completes browsing to the real-time browsing document and the communication connection between the corresponding sub-terminal and the online browsing terminal is not disconnected, then the online browsing terminal generates a document pushing signal and sends the document pushing signal to a document pushing unit, the document pushing unit is used for carrying out document pushing on the real-time login reader, the document query time of the real-time login reader is reduced, meanwhile, the reading quality of the reader is improved for the reader to push a proper document, and the specific pushing process is as follows:
collecting historical browsing documents of real-time login readers, wherein the collected historical browsing documents are documents which are browsed by the real-time login readers, analyzing the historical browsing documents, collecting document types corresponding to all paragraphs in all the historical browsing documents, and if the same document type repeatedly appears in all the historical browsing documents, marking the corresponding document types as favorite document types; if the same document type does not repeatedly appear in each history browsing document, sequencing each document type in each history browsing document according to the sequence of the segment length from large to small, and marking the first sequenced document type as a favorite document type; the method has the advantages that the favorite document types of readers are collected, so that the document pushing accuracy is improved, and the phenomenon that the pushed document is not suitable for the readers is prevented, and the reading quality of the readers is reduced;
selecting a document from a database in a back-end input platform by taking the favorite document type and the screened document type as selection conditions, collecting the document in the database, wherein the screened document type does not exist in the document types of all paragraphs in the collected document, and marking the collected document as a preselected document; acquiring the number of paragraphs corresponding to favorite document types in a preselected document, and sequencing the preselected document according to the number of paragraphs of the favorite types; analyzing the ranked pre-selected documents, acquiring browsing times and evaluation times of the pre-selected documents, if the ratio of the evaluation times to the browsing times is larger than a ratio threshold value, marking the corresponding pre-selected documents as selected documents, and sending the selected documents to an online browsing terminal; if the ratio of the number of good comments to the number of browsing times is less than or equal to a ratio threshold value, marking the corresponding preselected document as an unselected document;
the back-end input platform is used for acquiring documents provided by the document browsing platform, the server acquires the historical latest update time of the database and compares the historical latest update time with the current system time, if the interval duration of the historical latest update time and the current system time is greater than the update time threshold, the documents of the Internet are acquired in real time, and the documents acquired in real time are sent to the document identifying and reading unit; enhancing document browsing platform
The document identifying and reading unit is used for identifying and reading documents collected in real time, so that the quality of the documents is improved, unqualified documents are prevented from being transmitted to the document browsing platform, the reading quality of readers is reduced, the document storage space is wasted, and the specific identifying and reading process is as follows:
step SS 1: selecting five identifying and reading persons for identifying and reading the document, wherein the favorite document types of the five identifying and reading persons are not uniform, and setting the identifying and reading time and selecting any type of document from the documents collected in real time as an identifying and reading object; in the identification time, five identification persons read the identification object;
step SS 2: after reading is finished, acquiring the number of wrongly-written characters acquired by five identifying and reading persons in the identifying and reading object, deleting repeated wrongly-written characters to acquire the actual number of wrongly-written characters, and judging that the identifying and reading object is unqualified if the actual number of wrongly-written characters is larger than or equal to the threshold value of the number of wrongly-written characters; if the actual number of the wrongly written characters is less than the threshold value of the number of the wrongly written characters, carrying out simple degree analysis on the identification and reading object;
step SS 3: acquiring the segment length of each paragraph in the identification object, and if the segment length is greater than a segment length threshold, marking the corresponding paragraph as a long paragraph; otherwise, marking the corresponding paragraph as a short paragraph; acquiring the average number of the long paragraphs and the short paragraphs acquired by five identifying and reading persons, acquiring the ratio of the average number of the long paragraphs to the average number of the short paragraphs, and if the corresponding ratio is greater than 2, judging that the corresponding identifying and reading object is unqualified in compactness; otherwise, judging that the corresponding authentication object is qualified; sending the qualified reference object to a server; analyzing the identification and reading object to prevent the reading quality of readers from being reduced due to wrongly written characters or unsmooth documents;
the server receives the input authentication and reading objects and then sends the input authentication and reading objects to the database for storage, the database counts the input authentication and reading objects, and when the number of the input authentication and reading objects is larger than or equal to the number threshold value, the input authentication and reading objects in the database are conveyed to the document browsing platform through the data conveying unit.
The working process of the invention is as follows:
an internet online identification reading document reading hierarchical management system collects reader identity information and registers through an identity registration front end when working, wherein the reader identity information comprises the name, age, occupation and identification number of a reader; storing the identity information of the readers successfully registered in the identity registration front end, and setting the identity information of the readers stored in the identity registration front end as the access authority of the document browsing platform; the method comprises the steps that a reader who accesses in real time through a document browsing platform browses documents, a real-time login reader with access right conducts the document browsing platform, a login terminal of the real-time login reader is in communication connection with sub-terminals, the sub-terminals correspond to the login terminal one by one, and the sub-terminals set identity card numbers of the real-time login reader as tags, so that a plurality of sub-terminals are all in single matching with the corresponding login terminal; the method comprises the steps that a corresponding sub-terminal of a real-time login reader is in communication connection with an online browsing terminal, the real-time login reader transmits a name of a browsed document to the online browsing terminal through the corresponding sub-terminal, the online browsing terminal binds the name of the real-time browsed document, i sub-terminals with consistent names of the real-time browsed document are bound with the corresponding real-time browsed document, and the real-time browsed document and the bound i sub-terminals are sent to a document grading unit together; the method comprises the steps that documents are provided for a document browsing platform through a rear-end input platform to be obtained, a server collects historical latest updating time of a database, compares the historical latest updating time with current system time, and collects the documents of the Internet in real time if the interval duration between the historical latest updating time and the current system time is larger than an updating time threshold.
The above formulas are all calculated by taking the numerical value of the dimension, the formula is a formula which obtains the latest real situation by acquiring a large amount of data and performing software simulation, and the preset parameters in the formula are set by the technical personnel in the field according to the actual situation.
The foregoing is merely exemplary and illustrative of the present invention and various modifications, additions and substitutions may be made by those skilled in the art to the specific embodiments described without departing from the scope of the invention as defined in the following claims.

Claims (3)

1. An internet online identification and reading document reading hierarchical management system is characterized by comprising an identity registration front end, a document browsing platform and a rear end input platform; the document browsing platform comprises an online browsing terminal, a document pushing unit, a document analyzing unit and a plurality of sub-terminals; the rear-end input platform comprises a server, a document identifying and reading unit, a database and a data transmission unit;
the identity registration front end is used for acquiring reader identity information and registering, wherein the reader identity information comprises the name, age, occupation and identification number of a reader; storing the identity information of the readers successfully registered in the identity registration front end, and setting the identity information of the readers stored in the identity registration front end as the access authority of the document browsing platform;
the document browsing platform is used for browsing documents of readers who access in real time, the real-time login readers with access rights are provided with the document browsing platform, the login terminals of the real-time login readers are in communication connection with the sub-terminals, the sub-terminals correspond to the login terminals one to one, and the sub-terminals set the identity card numbers of the real-time login readers as labels, so that the sub-terminals are all in single matching with the corresponding login terminals; the method comprises the steps that a corresponding sub-terminal of a real-time login reader is in communication connection with an online browsing terminal, the real-time login reader transmits a name of a browsed document to the online browsing terminal through the corresponding sub-terminal, the online browsing terminal binds the name of the real-time browsed document, i sub-terminals with consistent names of the real-time browsed document are bound with the corresponding real-time browsed document, and the real-time browsed document and the bound i sub-terminals are sent to a document grading unit together;
the back-end input platform is used for acquiring documents provided by the document browsing platform, the server acquires the historical latest update time of the database, compares the historical latest update time with the current system time, and acquires the documents of the Internet in real time if the interval duration between the historical latest update time and the current system time is greater than the update time threshold;
the document grading unit specifically grades as follows:
analyzing i sub-terminals corresponding to the real-time browsed document, wherein i is a natural number greater than or equal to 1, analyzing the sub-terminals corresponding to readers, dividing the corresponding readers into adult readers and immature readers, and acquiring unsuitable document types of the immature readers and dislike document types of the adult readers; inappropriate document types are represented as document types that do not fit into the age bracket of an underage reader; analyzing the real-time browsing document, carrying out paragraph division on the real-time browsing document, and constructing a set A { DL1, DL2, … and DLn } of paragraphs corresponding to the real-time browsing document after paragraph division, wherein DL2 represents the document content of a second paragraph in the real-time browsing document;
analyzing paragraphs corresponding to all subsets in the set A, marking the paragraphs corresponding to all the subsets as o, wherein o is a positive integer greater than 1, acquiring the corresponding occurrence frequency and times of Chinese characters H or phrases C in each paragraph, and if the occurrence frequency and times of the corresponding Chinese characters H or phrases C are respectively greater than an occurrence frequency threshold and an occurrence time threshold, judging that the corresponding Chinese characters H or phrases C are key Chinese characters or key phrases of the corresponding paragraph; judging the document type of the corresponding paragraph according to the key Chinese characters or the key phrases; corresponding paragraphs corresponding to all subsets in the set A to corresponding types one by one, if the corresponding types of the paragraphs in the set A belong to the screened document type of the real-time login reader, analyzing the length of the corresponding paragraphs, and if the length of the corresponding paragraphs is smaller than the threshold range of the length of the corresponding paragraphs, not shielding the corresponding paragraphs; if the segment length of the corresponding paragraph is within the segment length threshold range, shielding the corresponding paragraph; and if the segment length of the corresponding paragraph is larger than the segment length threshold range, marking the document to which the corresponding paragraph belongs as a non-conforming browsing document.
2. The internet online appraisal and reading document reading hierarchical management system according to claim 1, wherein the document pushing unit specifically pushes the following:
collecting historical browsing documents of real-time login readers, wherein the collected historical browsing documents are documents which are browsed by the real-time login readers, analyzing the historical browsing documents, collecting document types corresponding to all paragraphs in all the historical browsing documents, and if the same document type repeatedly appears in all the historical browsing documents, marking the corresponding document types as favorite document types; if the same document type does not repeatedly appear in each history browsing document, sequencing each document type in each history browsing document according to the sequence of the segment length from large to small, and marking the first sequenced document type as a favorite document type; selecting a document from a database in a back-end input platform by taking the favorite document type and the screened document type as selection conditions, collecting the document in the database, wherein the screened document type does not exist in the document types of all paragraphs in the collected document, and marking the collected document as a preselected document; acquiring the number of paragraphs corresponding to favorite document types in a preselected document, and sequencing the preselected document according to the number of paragraphs of the favorite types; analyzing the ranked pre-selected documents, acquiring browsing times and evaluation times of the pre-selected documents, if the ratio of the evaluation times to the browsing times is larger than a ratio threshold value, marking the corresponding pre-selected documents as selected documents, and sending the selected documents to an online browsing terminal; and if the ratio of the number of good comments to the number of browsing times is less than or equal to a ratio threshold value, marking the corresponding preselected document as an unselected document.
3. The internet online identification document reading hierarchical management system according to claim 1, wherein the document identification unit specifically identifies the following processes:
selecting five identifying and reading persons for identifying and reading the document, wherein the favorite document types of the five identifying and reading persons are not uniform, and setting the identifying and reading time and selecting any type of document from the documents collected in real time as an identifying and reading object; in the identification time, five identification persons read the identification object; after reading is finished, acquiring the number of wrongly-written characters acquired by five identifying and reading persons in the identifying and reading object, deleting repeated wrongly-written characters to acquire the actual number of wrongly-written characters, and judging that the identifying and reading object is unqualified if the actual number of wrongly-written characters is larger than or equal to the threshold value of the number of wrongly-written characters; if the actual number of the wrongly written characters is less than the threshold value of the number of the wrongly written characters, carrying out simple degree analysis on the identification and reading object; acquiring the segment length of each paragraph in the identification object, and if the segment length is greater than a segment length threshold, marking the corresponding paragraph as a long paragraph; otherwise, marking the corresponding paragraph as a short paragraph; acquiring the average number of the long paragraphs and the short paragraphs acquired by five identifying and reading persons, acquiring the ratio of the average number of the long paragraphs to the average number of the short paragraphs, and if the corresponding ratio is greater than 2, judging that the corresponding identifying and reading object is unqualified in compactness; otherwise, judging that the corresponding authentication object is qualified; and sending the qualified authentication object to a server.
CN202110846295.4A 2021-07-26 2021-07-26 Internet online identification and reading document reading hierarchical management system Active CN113486247B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110846295.4A CN113486247B (en) 2021-07-26 2021-07-26 Internet online identification and reading document reading hierarchical management system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110846295.4A CN113486247B (en) 2021-07-26 2021-07-26 Internet online identification and reading document reading hierarchical management system

Publications (2)

Publication Number Publication Date
CN113486247A CN113486247A (en) 2021-10-08
CN113486247B true CN113486247B (en) 2022-02-01

Family

ID=77942789

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110846295.4A Active CN113486247B (en) 2021-07-26 2021-07-26 Internet online identification and reading document reading hierarchical management system

Country Status (1)

Country Link
CN (1) CN113486247B (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115618278B (en) * 2022-11-23 2023-03-10 万链指数(青岛)信息科技有限公司 Classification method for digital model generated data

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101078971A (en) * 2002-03-19 2007-11-28 电子图书系统有限公司 Method and system for tracking electronic book reading pattern
JP2009070120A (en) * 2007-09-13 2009-04-02 Coh Inc Content providing system
CN102045388A (en) * 2010-11-25 2011-05-04 汉王科技股份有限公司 Online reading device and method
CN102214246A (en) * 2011-07-18 2011-10-12 南京大学 Method for grading Chinese electronic document reading on the Internet
CN108280361A (en) * 2017-01-05 2018-07-13 珠海金山办公软件有限公司 A kind of authority classification management method and device
CN110795753A (en) * 2019-11-08 2020-02-14 深圳市理约云信息管理有限公司 File security protection system, file security sharing method and security reading method

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20060282413A1 (en) * 2005-06-03 2006-12-14 Bondi Victor J System and method for a search engine using reading grade level analysis
CN103186565B (en) * 2011-12-28 2017-02-22 中国移动通信集团浙江有限公司 Method and device for judging user preference according to web browsing behavior of user
US8977687B2 (en) * 2012-11-12 2015-03-10 Linkedin Corporation Techniques for enhancing a member profile with a document reading history
CN110609814A (en) * 2019-09-26 2019-12-24 珠海格力电器股份有限公司 Document online browsing method, storage medium and system
CN111680235A (en) * 2020-05-19 2020-09-18 南京数娱天下网络科技有限公司 Online reading system and method based on cloud computing
CN112100587A (en) * 2020-09-04 2020-12-18 江苏泽航创业投资有限公司 Confidential document reading method, software system, UE (user Equipment) equipment, server and system

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101078971A (en) * 2002-03-19 2007-11-28 电子图书系统有限公司 Method and system for tracking electronic book reading pattern
JP2009070120A (en) * 2007-09-13 2009-04-02 Coh Inc Content providing system
CN102045388A (en) * 2010-11-25 2011-05-04 汉王科技股份有限公司 Online reading device and method
CN102214246A (en) * 2011-07-18 2011-10-12 南京大学 Method for grading Chinese electronic document reading on the Internet
CN108280361A (en) * 2017-01-05 2018-07-13 珠海金山办公软件有限公司 A kind of authority classification management method and device
CN110795753A (en) * 2019-11-08 2020-02-14 深圳市理约云信息管理有限公司 File security protection system, file security sharing method and security reading method

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
Book Recommender System for Wikipedia Article Readers in a University Library;Keita Tsuji;《IEEE》;20200213;第121-126页 *
基于河南省公共图书馆儿童分级阅读服务的研究与策略;王靖雯;《河南图书馆学刊》;20180731;第132-135页 *
高校图书馆读者借阅权限影响因素略谈;董利军;《内蒙古教育(职教版)》;20150430;第12页 *

Also Published As

Publication number Publication date
CN113486247A (en) 2021-10-08

Similar Documents

Publication Publication Date Title
CN110543598B (en) Information recommendation method and device and terminal
CN102483745B (en) Co-selected image classification
CN111898031B (en) Method and device for obtaining user portrait
CN108363821A (en) A kind of information-pushing method, device, terminal device and storage medium
CN114707074B (en) Content recommendation method, device and system
CN109165975B (en) Label recommending method, device, computer equipment and storage medium
CN113297457B (en) High-precision intelligent information resource pushing system and pushing method
CN110737821B (en) Similar event query method, device, storage medium and terminal equipment
CN114238573B (en) Text countercheck sample-based information pushing method and device
CN112632405A (en) Recommendation method, device, equipment and storage medium
CN113688311A (en) Information recommendation method, device and equipment based on data interaction and storage medium
CN114371946B (en) Information push method and information push server based on cloud computing and big data
CN113486247B (en) Internet online identification and reading document reading hierarchical management system
WO2023024408A1 (en) Method for determining feature vector of user, and related device and medium
CN113099260B (en) Live broadcast processing method, live broadcast platform, system, medium and electronic device
CN112269906B (en) Automatic extraction method and device of webpage text
CN116452212B (en) Intelligent customer service commodity knowledge base information management method and system
CN117520503A (en) Financial customer service dialogue generation method, device, equipment and medium based on LLM model
EP4357942A1 (en) Information processing method and apparatus based on data exchange, and device and storage medium
CN111428041A (en) Case abstract generation method, device, system and storage medium
CN112328752B (en) Course recommendation method and device based on search content, computer equipment and medium
CN112182390B (en) Mail pushing method, device, computer equipment and storage medium
CN114550157A (en) Bullet screen gathering identification method and device
CN113420018A (en) User behavior data analysis method, device, equipment and storage medium
CN113704601A (en) Information interaction method, device, equipment and storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
TR01 Transfer of patent right
TR01 Transfer of patent right

Effective date of registration: 20230619

Address after: 410000 houses in the southwest corner of the intersection of Renmin East Road and Xiaokang Road, Changsha Airport Economic Demonstration Zone, Huanghua Town, Changsha County, Changsha City, Hunan Province

Patentee after: CHINA UNICOM WOYUEDU TECHNOLOGY CULTURE Co.,Ltd.

Address before: 518000 f6-021-c, Hedong building, Haoyunlai Plaza, Hedong community, Xixiang street, Bao'an District, Shenzhen City, Guangdong Province

Patentee before: Shenzhen Zhiku Information Technology Co.,Ltd.