CN117786040A - Automatic text monitoring method and device - Google Patents

Automatic text monitoring method and device Download PDF

Info

Publication number
CN117786040A
CN117786040A CN202311483608.XA CN202311483608A CN117786040A CN 117786040 A CN117786040 A CN 117786040A CN 202311483608 A CN202311483608 A CN 202311483608A CN 117786040 A CN117786040 A CN 117786040A
Authority
CN
China
Prior art keywords
text
history
information
screening
preset
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202311483608.XA
Other languages
Chinese (zh)
Inventor
范瀚贤
梁植斌
丘琪
王闯
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
E Fund Management Co ltd
Original Assignee
E Fund Management Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by E Fund Management Co ltd filed Critical E Fund Management Co ltd
Priority to CN202311483608.XA priority Critical patent/CN117786040A/en
Publication of CN117786040A publication Critical patent/CN117786040A/en
Pending legal-status Critical Current

Links

Landscapes

  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The invention discloses an automatic text monitoring method and a device thereof, comprising the steps of obtaining a plurality of enterprise websites selected by a user and account passwords corresponding to each enterprise website, and accessing corresponding enterprise websites according to preset time respectively according to the account passwords corresponding to each enterprise website; when receiving notification information sent by a current visiting enterprise website, identifying first text information of the current enterprise website, and screening a current preset word for the first text information to obtain a first screening text; comparing the first screening text with the historical text, and if the first screening text is inconsistent with the historical text, reserving the first screening text; the history text is obtained by screening and obtaining history preset words of history text information; the historical text information is derived from identifying historical text information of historical enterprise websites. Through the method, the invention realizes the monitoring of the information without missing, avoids the error of manual reference, and timely follows the subsequent state to update the information text.

Description

Automatic text monitoring method and device
Technical Field
The invention relates to the field of text analysis, in particular to an automatic text monitoring method and an automatic text monitoring device.
Background
With the development of business and the growth of companies, most enterprises face the business requirement of receiving and transmitting text, and collect the notices, articles and the like of various large enterprise websites, and then sort, reply and transmit the collected information.
At present, many enterprises log in the enterprise website one by one every day through a manual investigation mode, check relevant receiving and dispatching text notices and manually collect and sort. Because of the diversity of information channels, information sources are easy to miss in the manual acquisition process, related information is further missed, and in the information sources which are not missed, operators lack key information because of too scattered information. The diversified information also brings scattered and messy defects, the burden of operators is increased, and follow-up treatment is difficult to carry out later.
So a method for monitoring text without missing does not exist at present.
Disclosure of Invention
The invention provides an automatic text monitoring method and device, which can realize the text monitoring without missing.
In order to solve the technical problems, the invention provides an automatic text monitoring method, which comprises the following steps:
acquiring a plurality of enterprise websites selected by a user and account passwords corresponding to each enterprise website, and accessing corresponding enterprise websites according to preset time respectively according to the account passwords corresponding to each enterprise website;
When receiving notification information sent by a current visiting enterprise website, identifying first text information of the current enterprise website, and screening current preset words of the first text information to obtain a first screening text;
comparing the first screening text with the historical text, and if the first screening text is inconsistent with the historical text, reserving the first screening text; the history text is obtained by screening and obtaining history preset words of history text information; the historical text information is derived from identifying historical text information of a historical enterprise website.
According to the method, a plurality of enterprise websites are logged in through account passwords, notification information sent by each enterprise website is received, text information identification is carried out on the currently accessed enterprise websites according to the received information, the identified text information is screened, and screened information is obtained; comparing the screened information with the historical information, and if the screened information is inconsistent with the historical information and updated, storing the screened information. Through the method, the invention realizes the monitoring of the information without missing, avoids the error of manual reference, and timely follows the subsequent state to update the information text.
As a preferred example, the method includes the steps of obtaining a plurality of enterprise websites selected by a user and account passwords corresponding to each enterprise website, and accessing each corresponding enterprise website according to the account passwords corresponding to each enterprise website and preset time, specifically:
acquiring a plurality of enterprise websites selected by a user and account passwords corresponding to each enterprise website, and accessing each enterprise website according to the account passwords corresponding to each enterprise website;
if the verification notification of the enterprise website is received, identifying verification information and a verification frame in the verification notification, filling the verification information into the verification frame, and accessing the enterprise website after verification.
According to the priority example, the enterprise websites, namely the corresponding account passwords, are acquired to log in respectively, verification information is identified after verification notification of the enterprise websites is received, and the verification information is filled in a verification frame, so that automatic acquisition of multiple channels and multiple information sources is realized.
In another embodiment of the invention, each enterprise website needing to be logged in creates a timing task through a program, accesses a fixed link at a designated time, and inputs an account password and a website obtained in advance through a program interface when the enterprise website needs to be logged in, so that a certain page of a certain website can be accessed in a timing manner. Logging in of multiple website platforms, also by way of such timed tasks, programs access multiple different links at the same time.
As a preferable example, the verification information and the verification box in the verification notification are identified, and the verification information is filled into the verification box, specifically:
identifying graphic information in the verification notification through an OCR image identification module, positioning and identifying text line information in the graphic information, and converting the text line information into editable verification information;
and identifying the verification frame notified by the verification code, and filling the verification information into the verification frame.
The preferred example identifies the graphic information in the verification notification, locates and identifies the text line information in the verification notification through the OCR image identification module, acquires the verification information and fills in the verification frame, improves the accuracy of character identification, and realizes automatic verification of the information.
In another embodiment of the present invention, the need to enter a verification code is sometimes encountered when logging into the system, at which time the program accesses an OCR image recognition module, recognizes graphical information in the verification code by invoking the module, and the OCR module outputs the recognized information for filling.
OCR techniques typically extract text information from images through image processing and statistical machine learning methods, including binarization, noise filtering, correlation domain analysis, and the like. According to the processing method, three stages can be divided: image preparation, text recognition and post-processing.
Image preprocessing: the purpose of image preprocessing is mainly for better text line location and discernment to improve the discernment rate of accuracy, also can carry out the image simultaneously beautifies, presents the effect of beautifying for the customer, lets the customer more easily carry out proofreading and storage, and common image preprocessing module has: background removal, tilt correction, perspective transformation, image enhancement, direction correction, reflection processing, reflection white processing, and the like.
Text line positioning: namely, all text lines of the document image are positioned, and the accuracy of the text line positioning directly influences the overall effect of the following text recognition and layout analysis.
Text line recognition: the OCR core algorithm converts text information of text lines into editable text information and performs post-processing: and correcting the recognition result according to the rule and the big data analysis, and improving the accuracy of character recognition.
As a preferred example, when receiving the notification information sent by the current visiting enterprise website, the method identifies the first text information of the current enterprise website, and further includes:
and stopping accessing the enterprise website when the notification message sent by the current accessed enterprise website is not received.
The preferred example reduces energy consumption by stopping access to the enterprise website in time when a notification message sent by the current access enterprise website is not received.
As a preferred example, the current preset word screening is performed on the first text information to obtain a first screened text, which specifically includes:
acquiring a first preset word, wherein the first preset word is the release time of the article of notification information, and when the release time of the article of the first text information is greater than the first preset word, screening through a first round;
acquiring a second preset word, wherein the second preset word is a notification information article source, and when the first text information article source screened by the first round is the same as the second preset word, screening by the second round;
acquiring a third preset word, wherein the third preset word is a notification information article label, and when the first text information article label screened by the second round is the same as the third preset word, screening by the third round;
acquiring a fourth preset word, wherein the fourth preset word is the name of a notification information article, and when the first text information screened by the third round contains the fourth preset word, the fourth round of screening is carried out;
the first text information passing through the round screening is determined as first screening text.
According to the preferred example, through preset word screening, a plurality of preset words are formulated according to the user demands, text segments are screened according to the preset words, information extraction is achieved, and text information which is not needed by the user is filtered.
As a preferable example, the determining the first text information passing through the round screening as the first screening text further includes:
acquiring a first preset state, wherein the first preset state is an article stamping requirement, and selecting an article stamping state option if the first screening text comprises the first preset state;
acquiring a second preset state, wherein the second preset state is an article examination requirement, and selecting an article examination state option if the first screening text comprises the second preset state;
and generating a state table according to the article stamping state options and the article examination state options.
The preferred example refines the seal and examination requirements corresponding to the required information by performing state screening on the first screening text.
As a preferred example, if the first screening text is inconsistent with the history text, the first screening text is retained, specifically:
and if the first screening text is consistent with the historical text, stopping accessing the enterprise website.
According to the preferred example, the access to the enterprise website is stopped in time when the first screening text is consistent with the historical text, so that the energy consumption is reduced.
In another embodiment of the present invention, the program accesses the website once at intervals, stores the fixed text information during the access, compares the fixed text information with the fixed text information, identifies and screens the keywords, compares the notification of the website with whether the notification of the website is updated last time, and records the new information if the notification of the website is updated last time. The update of the related information is determined by the preset interval set by the monitoring.
As a preferable example, the history text is obtained by screening a history preset word on the history text information, which specifically includes:
acquiring a first history preset word, wherein the first history preset word is a preset article release time, and when the article release time of the history text information is greater than the first history preset word, the first history screening is performed;
acquiring a second history preset word, wherein the second history preset word is a preset article source, and when the history text information article source screened by the first round of history is the same as the second history preset word, screening by the second history round;
acquiring a third history preset word, wherein the third history preset word is a preset article label, and when the history text information article label screened by the second round of history is the same as the third history preset word, screening by a third history round;
acquiring a fourth history preset word, wherein the fourth history preset word is a preset article keyword, and when the history text information screened by the third round of history contains the fourth history preset word, the fourth round of history screening is passed;
the history text information through the four rounds of history screening is determined as history text.
According to the preferred example, through historical preset word screening, a plurality of historical preset words are formulated according to user requirements, text segments are screened according to the historical preset words, information extraction is achieved, and text information which is not needed by a user is filtered.
As a preferable example, the determining the history text information passing through the four rounds of history screening as the history text further includes:
acquiring a first history preset state, wherein the first history preset state is an article stamping requirement, and selecting a history article stamping state option if the first screening text comprises the first history preset state;
acquiring a second history preset state, wherein the second history preset state is an article examination requirement, and selecting a history article examination state option if the first screening text comprises the second history preset state;
and generating a historical state table according to the historical article stamping state options and the historical article examination state options.
The preferred example refines the stamping and examination requirements corresponding to the required information by screening the history state of the history selection text.
As a preferable example, the historical text information is obtained by identifying historical text information of a historical enterprise website, specifically:
acquiring a plurality of historical enterprise websites selected by a user and account passwords corresponding to each historical enterprise website, and accessing each historical enterprise website according to the account passwords corresponding to each historical enterprise website;
If a history verification notification of a history enterprise website is received, identifying history verification information and a history verification frame in the history verification notification, filling the history verification information into the history verification frame, and accessing the history enterprise website after verification is passed.
According to the preferred example, the enterprise websites, namely the corresponding account passwords, are acquired to log in respectively, verification information is identified after verification notification of the enterprise websites is received, and the verification information is filled in a verification frame, so that automatic acquisition of multiple channels and multiple information sources is realized.
As a preferred example, the comparing the first screening text with the history text, if the first screening text is inconsistent with the history text, after the first screening text is reserved, generating a web page report according to the first screening text, the history text and the status table;
wherein the web page report further includes a processing status bar;
the processing status bar comprises an option to be processed, an option to be processed and an option to be unprocessed, and is used for marking according to the current report progress.
According to the preferred example, the webpage report is generated, the multiple information is uniformly displayed on one webpage report, overall management is achieved, the user can conveniently transfer the text receiving and sending work, and the user can mark the information processing state.
In another embodiment of the invention, the text information is displayed in a webpage mode, and the text information is uniformly displayed at a webpage end and used for circulation of text receiving and sending work of a user. The user can mark the processing state (such as to be processed, unprocessed, etc.) of each piece of information, or make remark labeling (such as that the text requires to cover a official seal), and can submit an application to a corresponding system (such as an OA system) for informing that the lead is required to check and reprocess.
The invention also provides an automatic text monitoring device, which comprises: the system comprises an access module, a first text screening module and a comparison module;
the access module is used for acquiring a plurality of enterprise websites selected by a user and account passwords corresponding to each enterprise website, and accessing corresponding enterprise websites according to preset time respectively according to the account passwords corresponding to each enterprise website;
the first screening text module is used for identifying first text information of a current enterprise website when receiving notification information sent by the current access enterprise website, and carrying out current preset word screening on the first text information to obtain a first screening text;
the comparison module is used for comparing the first screening text with the historical text, and if the first screening text is inconsistent with the historical text, the first screening text is reserved; the history text is obtained by screening and obtaining history preset words of history text information; the historical text information is derived from identifying historical text information of a historical enterprise website.
According to the method, a plurality of enterprise websites are logged in through account passwords through an access module, notification information sent by each enterprise website is received, text information identification is carried out on the currently accessed enterprise websites according to the received information through a first text screening module, the identified text information is screened, and screened information is obtained; the comparison module compares the screened information with the historical information, and if the screened information is inconsistent with the historical information and updated, the screened information is stored. Through the method, the invention realizes the monitoring of the information without missing, avoids the error of manual reference, and timely follows the subsequent state to update the information text.
As a preferred example, a web page reporting module is also included;
the webpage report module generates according to the first screening text, the historical text and the state table;
the web page report module further comprises a processing status bar;
the processing status bar comprises an option to be processed, an option to be processed and an option to be unprocessed, and is used for marking according to the current report progress.
The preferred example generates a webpage report through a webpage report module, uniformly displays multiple information on one webpage report, realizes overall management, is convenient for users to transfer text receiving and sending work, and can mark information processing states
Drawings
FIG. 1 is a flow chart of an automated text monitoring method in accordance with an embodiment of the present invention;
fig. 2 is a block diagram of an automated text monitoring apparatus according to one embodiment of the present invention.
Detailed Description
The following description of the embodiments of the present invention will be made more apparent and fully hereinafter with reference to the accompanying drawings, in which some, but not all embodiments of the invention are shown. All other embodiments, which can be made by those skilled in the art based on the embodiments of the invention without making any inventive effort, are intended to be within the scope of the invention.
The automatic text monitoring method and the device thereof provided by the embodiment of the invention are suitable for being used for.
Referring to fig. 1, in one embodiment of the present invention, a flowchart of an automated text monitoring method shown in fig. 1 is provided, and the method includes steps S1 to S3. The method comprises the following steps:
s1, acquiring a plurality of enterprise websites selected by a user and account passwords corresponding to each enterprise website, and accessing corresponding enterprise websites according to preset time respectively according to the account passwords corresponding to each enterprise website;
S2, when receiving notification information sent by a current visiting enterprise website, identifying first text information of the current enterprise website, and screening current preset words of the first text information to obtain a first screening text;
s3, comparing the first screening text with the historical text, and if the first screening text is inconsistent with the historical text, reserving the first screening text; the history text is obtained by screening and obtaining history preset words of history text information; the historical text information is derived from identifying historical text information of a historical enterprise website.
The invention provides an automatic text monitoring method, which comprises the steps of logging in a plurality of enterprise websites through account passwords, receiving notification information sent by each enterprise website, identifying text information of the currently accessed enterprise website according to the received information, screening the identified text information, and obtaining screened information; comparing the screened information with the historical information, and if the screened information is inconsistent with the historical information and updated, storing the screened information. Through the method, the invention realizes the monitoring of the information without missing, and avoids the error of manual reference.
In an embodiment of the present invention, the obtaining a plurality of enterprise websites selected by a user and account passwords corresponding to each enterprise website, and accessing corresponding enterprise websites according to the account passwords corresponding to each enterprise website and preset time respectively includes:
acquiring a plurality of enterprise websites selected by a user and account passwords corresponding to each enterprise website, and accessing each enterprise website according to the account passwords corresponding to each enterprise website;
if the verification notification of the enterprise website is received, identifying verification information and a verification frame in the verification notification, filling the verification information into the verification frame, and accessing the enterprise website after verification.
According to the embodiment, the enterprise websites, namely the corresponding account passwords, are acquired to log in respectively, verification information is identified after verification notification of the enterprise websites is received, and the verification information is filled in a verification frame, so that automatic acquisition of multiple channels and multiple information sources is realized.
In one embodiment of the present invention, the identifying the verification information and the verification box in the verification notification fills in the verification information to the verification box, specifically:
identifying graphic information in the verification notification through an OCR image identification module, positioning and identifying text line information in the graphic information, and converting the text line information into editable verification information;
And identifying the verification frame notified by the verification code, and filling the verification information into the verification frame.
According to the embodiment, the OCR image recognition module is used for recognizing the graphic information in the verification notification, positioning and recognizing the text line information in the verification notification, acquiring the verification information and filling the verification information in the verification frame, so that the accuracy of character recognition is improved, and automatic verification information is realized.
In an embodiment of the present invention, when receiving the notification information sent by the current access enterprise website, the identifying the first text information of the current enterprise website further includes:
and stopping accessing the enterprise website when the notification message sent by the current accessed enterprise website is not received.
According to the embodiment, the access to the enterprise website is stopped in time when the notification message sent by the current access to the enterprise website is not received, so that the energy consumption is reduced.
In an embodiment of the present invention, the current preset word screening is performed on the first text information to obtain a first screened text, which specifically includes:
acquiring a first preset word, wherein the first preset word is the release time of the article of notification information, and when the release time of the article of the first text information is greater than the first preset word, screening through a first round;
Acquiring a second preset word, wherein the second preset word is a notification information article source, and when the first text information article source screened by the first round is the same as the second preset word, screening by the second round;
acquiring a third preset word, wherein the third preset word is a notification information article label, and when the first text information article label screened by the second round is the same as the third preset word, screening by the third round;
acquiring a fourth preset word, wherein the fourth preset word is the name of a notification information article, and when the first text information screened by the third round contains the fourth preset word, the fourth round of screening is carried out;
the first text information passing through the round screening is determined as first screening text.
According to the embodiment, through preset word screening, a plurality of preset words are formulated according to the user demands, text segments are screened according to the preset words, information extraction is achieved, and text information which is not needed by the user is filtered.
In an embodiment of the present invention, the determining the first text information passing through the round screening as the first screening text further includes:
acquiring a first preset state, wherein the first preset state is an article stamping requirement, and selecting an article stamping state option if the first screening text comprises the first preset state;
Acquiring a second preset state, wherein the second preset state is an article examination requirement, and selecting an article examination state option if the first screening text comprises the second preset state;
and generating a state table according to the article stamping state options and the article examination state options.
The embodiment refines the stamping and examining requirements corresponding to the required information by carrying out state screening on the first screening text.
In an embodiment of the present invention, if the first screening text is inconsistent with the history text, the first screening text is retained, specifically:
and if the first screening text is consistent with the historical text, stopping accessing the enterprise website.
According to the embodiment, the access to the enterprise website is stopped in time when the first screening text is consistent with the historical text, so that the energy consumption is reduced.
In an embodiment of the present invention, the history text is obtained by screening a history preset word on history text information, and specifically includes:
acquiring a first history preset word, wherein the first history preset word is a preset article release time, and when the article release time of the history text information is greater than the first history preset word, the first history screening is performed;
Acquiring a second history preset word, wherein the second history preset word is a preset article source, and when the history text information article source screened by the first round of history is the same as the second history preset word, screening by the second history round;
acquiring a third history preset word, wherein the third history preset word is a preset article label, and when the history text information article label screened by the second round of history is the same as the third history preset word, screening by a third history round;
acquiring a fourth history preset word, wherein the fourth history preset word is a preset article keyword, and when the history text information screened by the third round of history contains the fourth history preset word, the fourth round of history screening is passed;
the history text information through the four rounds of history screening is determined as history text.
According to the embodiment, through historical preset word screening, a plurality of historical preset words are formulated according to user requirements, text segments are screened according to the historical preset words, information extraction is achieved, and text information which is not needed by a user is filtered.
In an embodiment of the present invention, the determining the history text information passing through the four rounds of history screening as the history text further includes:
Acquiring a first history preset state, wherein the first history preset state is an article stamping requirement, and selecting a history article stamping state option if the first screening text comprises the first history preset state;
acquiring a second history preset state, wherein the second history preset state is an article examination requirement, and selecting a history article examination state option if the first screening text comprises the second history preset state;
and generating a historical state table according to the historical article stamping state options and the historical article examination state options.
According to the embodiment, historical state screening is carried out on the historical selection text, so that the stamping and examination requirements corresponding to the required information are extracted.
In one embodiment of the present invention, the historical text information is obtained by identifying historical text information of a historical enterprise website, specifically:
acquiring a plurality of historical enterprise websites selected by a user and account passwords corresponding to each historical enterprise website, and accessing each historical enterprise website according to the account passwords corresponding to each historical enterprise website;
if a history verification notification of a history enterprise website is received, identifying history verification information and a history verification frame in the history verification notification, filling the history verification information into the history verification frame, and accessing the history enterprise website after verification is passed.
According to the embodiment, the enterprise websites, namely the corresponding account passwords, are acquired to log in respectively, verification information is identified after verification notification of the enterprise websites is received, and the verification information is filled in a verification frame, so that automatic acquisition of multiple channels and multiple information sources is realized.
In an embodiment of the present invention, comparing the first screening text with the history text, and if the first screening text is inconsistent with the history text, generating a web page report according to the first screening text, the history text and the status table after the first screening text is reserved;
wherein the web page report further includes a processing status bar;
the processing status bar comprises an option to be processed, an option to be processed and an option to be unprocessed, and is used for marking according to the current report progress.
According to the embodiment, the webpage report is generated, the multiple information is uniformly displayed on one webpage report, overall management is achieved, the user can conveniently transfer the text receiving and sending work, and the user can mark the information processing state.
Referring to fig. 2, the present invention further provides a block diagram of an automatic text monitoring device, including: an access module 1, a first screening text module 2 and a comparison module 3;
The access module 1 is used for acquiring a plurality of enterprise websites selected by a user and account passwords corresponding to each enterprise website, and accessing corresponding enterprise websites according to preset time respectively according to the account passwords corresponding to each enterprise website;
the first filtering text module 2 is used for identifying first text information of a current enterprise website when receiving notification information sent by the current visiting enterprise website, and filtering current preset words of the first text information to obtain a first filtering text;
the comparison module 3 is configured to compare the first screening text with a history text, and if the first screening text is inconsistent with the history text, reserve the first screening text; the history text is obtained by screening and obtaining history preset words of history text information; the historical text information is derived from identifying historical text information of a historical enterprise website.
According to the method, a plurality of enterprise websites are logged in through account passwords through an access module, notification information sent by each enterprise website is received, text information identification is carried out on the currently accessed enterprise websites according to the received information through a first text screening module, the identified text information is screened, and screened information is obtained; the comparison module compares the screened information with the historical information, and if the screened information is inconsistent with the historical information and updated, the screened information is stored. Through the method, the invention realizes the monitoring of the information without missing, avoids the error of manual reference, and timely follows the subsequent state to update the information text.
In a certain embodiment of the invention, the access module 1 comprises a first unit and a second unit;
the first unit is used for acquiring a plurality of enterprise websites selected by a user and account passwords corresponding to each enterprise website, and accessing each enterprise website according to the account passwords corresponding to each enterprise website;
and the second unit is used for identifying verification information and a verification frame in the verification notification if the verification notification of the enterprise website is received, filling the verification information into the verification frame, and accessing the enterprise website after verification is passed.
According to the embodiment, the enterprise websites, namely the corresponding account passwords, are acquired to log in respectively, verification information is identified after verification notification of the enterprise websites is received, and the verification information is filled in a verification frame, so that automatic acquisition of multiple channels and multiple information sources is realized.
In a certain embodiment of the invention, the second unit comprises a first subunit and a second subunit;
the first subunit is used for identifying the graphic information in the verification notification through the OCR image identification module, positioning and identifying text line information in the graphic information, and converting the text line information into editable verification information;
the second subunit is used for identifying the verification frame notified by the verification code and filling the verification information into the verification frame.
According to the embodiment, the OCR image recognition module is used for recognizing the graphic information in the verification notification, positioning and recognizing the text line information in the verification notification, acquiring the verification information and filling the verification information in the verification frame, so that the accuracy of character recognition is improved, and automatic verification information is realized.
In an embodiment of the present invention, the first screening text module 2 includes a third unit;
and the third unit is used for stopping accessing the enterprise website when the notification message sent by the current access enterprise website is not received.
According to the embodiment, the access to the enterprise website is stopped in time when the notification message sent by the current access to the enterprise website is not received, so that the energy consumption is reduced.
In one embodiment of the present invention, the first screening text module 2 includes a fourth unit, a fifth unit, a sixth unit, a seventh unit, and an eighth unit;
the fourth unit is configured to obtain a first preset word, where the first preset word is a notification information article release time, and when the article release time of the first text information is greater than the first preset word, the first preset word is screened by a first round;
the fifth unit is configured to obtain a second preset word, where the second preset word is a notification information article source, and when the first text information article source screened in the first round is the same as the second preset word, the second round is screened;
The sixth unit is configured to obtain a third preset word, where the third preset word is a notification information article tag, and when the first text information article tag screened by the second round is the same as the third preset word, the third round is screened;
the seventh unit is configured to obtain a fourth preset word, where the fourth preset word is a name of a notification information article, and when the first text information screened by the third round includes the fourth preset word, the fourth round of screening is passed;
the eighth unit is configured to determine the first text information that passes through the round screening as a first screening text.
According to the embodiment, through preset word screening, a plurality of preset words are formulated according to the user demands, text segments are screened according to the preset words, information extraction is achieved, and text information which is not needed by the user is filtered.
In one embodiment of the present invention, the eighth unit includes a third subunit, a fourth subunit, and a fifth subunit;
the third subunit is configured to obtain a first preset state, where the first preset state is an article stamping requirement, and if the first screening text includes the first preset state, select an article stamping state option;
the fourth subunit is configured to obtain a second preset state, where the second preset state is an article review requirement, and if the first screening text includes the second preset state, select an article review state option;
The fifth subunit is configured to generate a status table according to the article stamping status option and the article review status option.
The embodiment refines the stamping and examining requirements corresponding to the required information by carrying out state screening on the first screening text.
In an embodiment of the invention, the contrast module 3 comprises a ninth unit;
and the ninth unit is used for stopping accessing the enterprise website if the first screening text is consistent with the historical text.
According to the embodiment, the access to the enterprise website is stopped in time when the first screening text is consistent with the historical text, so that the energy consumption is reduced.
In an embodiment of the present invention, the comparing module 3 includes a tenth unit, an eleventh unit, a twelfth unit, a thirteenth unit, and a fourteenth unit;
the tenth unit is configured to obtain a first historical preset word, where the first historical preset word is a preset article release time, and when the article release time of the historical text information is greater than the first historical preset word, pass a first round of history screening;
the eleventh unit is configured to obtain a second preset history word, where the second preset history word is a preset article source, and when the history text information article source screened by the first round of history is the same as the second preset history word, the second preset history word is screened by the second round of history;
The twelfth unit is configured to obtain a third history preset word, where the third history preset word is a preset article tag, and when the history text information article tag screened by the second round of history is the same as the third history preset word, the third history round of history is screened;
the thirteenth unit is configured to obtain a fourth history preset word, where the fourth history preset word is a preset article keyword, and when the history text information screened by the third round of history includes the fourth history preset word, the fourth round of history screening is passed;
the fourteenth unit is configured to determine, as a history text, history text information that passes through the four rounds of history screening.
According to the embodiment, through historical preset word screening, a plurality of historical preset words are formulated according to user requirements, text segments are screened according to the historical preset words, information extraction is achieved, and text information which is not needed by a user is filtered.
In one embodiment of the present invention, the fourteenth unit includes a sixth subunit, a seventh subunit, and an eighth subunit;
the sixth subunit is configured to obtain a first history preset state, where the first history preset state is an article stamping requirement, and select a history article stamping state option if the first screening text includes the first history preset state;
The seventh subunit is configured to obtain a second history preset state, where the second history preset state is an article review requirement, and select a history article review status option if the first screening text includes the second history preset state;
the eighth subunit is configured to generate a history state table according to the history article sealing state option and the history article inspection state option.
According to the embodiment, historical state screening is carried out on the historical selection text, so that the stamping and examination requirements corresponding to the required information are extracted.
In an embodiment of the invention, the comparing module 3 comprises a fifteenth unit and a sixteenth unit;
the fifteenth unit is used for acquiring a plurality of historical enterprise websites selected by a user and account passwords corresponding to each historical enterprise website, and accessing each historical enterprise website according to the account passwords corresponding to each historical enterprise website;
the sixteenth unit is used for identifying the history verification information and the history verification frame in the history verification notification if the history verification notification of the history enterprise website is received, filling the history verification information into the history verification frame, and accessing the history enterprise website after verification.
According to the embodiment, the enterprise websites, namely the corresponding account passwords, are acquired to log in respectively, verification information is identified after verification notification of the enterprise websites is received, and the verification information is filled in a verification frame, so that automatic acquisition of multiple channels and multiple information sources is realized.
In one embodiment of the invention, the system further comprises a webpage report module;
the webpage report module generates according to the first screening text, the historical text and the state table;
the web page report module further comprises a processing status bar;
the processing status bar comprises an option to be processed, an option to be processed and an option to be unprocessed, and is used for marking according to the current report progress.
According to the embodiment, the webpage report is generated, the multiple information is uniformly displayed on one webpage report, overall management is achieved, the user can conveniently transfer the text receiving and sending work, and the user can mark the information processing state.
According to the method, a plurality of enterprise websites are logged in through account passwords, notification information sent by each enterprise website is received, text information identification is carried out on the currently accessed enterprise websites according to the received information, the identified text information is screened, and screened information is obtained; comparing the screened information with the historical information, and if the screened information is inconsistent with the historical information and updated, storing the screened information. Through the method, the invention realizes the monitoring of the information without missing, avoids the error of manual reference, and timely follows the subsequent state to update the information text.
While the foregoing is directed to the preferred embodiments of the present invention, it will be appreciated by those skilled in the art that changes and modifications may be made without departing from the principles of the invention, such changes and modifications are also intended to be within the scope of the invention.

Claims (13)

1. An automated text monitoring method, comprising:
acquiring a plurality of enterprise websites selected by a user and account passwords corresponding to each enterprise website, and accessing corresponding enterprise websites according to preset time respectively according to the account passwords corresponding to each enterprise website;
when receiving notification information sent by a current visiting enterprise website, identifying first text information of the current enterprise website, and screening current preset words of the first text information to obtain a first screening text;
comparing the first screening text with the historical text, and if the first screening text is inconsistent with the historical text, reserving the first screening text; the history text is obtained by screening and obtaining history preset words of history text information; the historical text information is derived from identifying historical text information of a historical enterprise website.
2. The automated text monitoring method of claim 1, wherein the obtaining the plurality of enterprise websites selected by the user and the account passwords corresponding to each enterprise website, and accessing the corresponding enterprise websites according to the account passwords corresponding to each enterprise website and the preset time respectively comprises:
acquiring a plurality of enterprise websites selected by a user and account passwords corresponding to each enterprise website, and accessing each enterprise website according to the account passwords corresponding to each enterprise website;
if the verification notification of the enterprise website is received, identifying verification information and a verification frame in the verification notification, filling the verification information into the verification frame, and accessing the enterprise website after verification.
3. The automated text monitoring method according to claim 2, wherein the identifying the verification information and the verification box in the verification notification fills in the verification information to the verification box, specifically:
identifying graphic information in the verification notification through an OCR image identification module, positioning and identifying text line information in the graphic information, and converting the text line information into editable verification information;
And identifying the verification frame notified by the verification code, and filling the verification information into the verification frame.
4. The automated text monitoring method of claim 1, wherein the identifying the first text information of the current enterprise website when receiving the notification information sent by the current access enterprise website, further comprises:
and stopping accessing the enterprise website when the notification message sent by the current accessed enterprise website is not received.
5. The automatic text monitoring method according to claim 1, wherein the current preset word screening is performed on the first text information to obtain a first screened text, which specifically includes:
acquiring a first preset word, wherein the first preset word is the release time of the article of notification information, and when the release time of the article of the first text information is greater than the first preset word, screening through a first round;
acquiring a second preset word, wherein the second preset word is a notification information article source, and when the first text information article source screened by the first round is the same as the second preset word, screening by the second round;
acquiring a third preset word, wherein the third preset word is a notification information article label, and when the first text information article label screened by the second round is the same as the third preset word, screening by the third round;
Acquiring a fourth preset word, wherein the fourth preset word is the name of a notification information article, and when the first text information screened by the third round contains the fourth preset word, the fourth round of screening is carried out;
the first text information passing through the round screening is determined as first screening text.
6. The automated text monitoring method of claim 5, wherein the determining the first text information that passes through the round of screening as the first screened text further comprises:
acquiring a first preset state, wherein the first preset state is an article stamping requirement, and selecting an article stamping state option if the first screening text comprises the first preset state;
acquiring a second preset state, wherein the second preset state is an article examination requirement, and selecting an article examination state option if the first screening text comprises the second preset state;
and generating a state table according to the article stamping state options and the article examination state options.
7. The automated text monitoring method of claim 1, wherein if the first screening text is inconsistent with the history text, the first screening text is retained, specifically:
And if the first screening text is consistent with the historical text, stopping accessing the enterprise website.
8. The automated text monitoring method of claim 1, wherein the historical text is obtained by screening a historical preset word of historical text information, and specifically comprises:
acquiring a first history preset word, wherein the first history preset word is a preset article release time, and when the article release time of the history text information is greater than the first history preset word, the first history screening is performed;
acquiring a second history preset word, wherein the second history preset word is a preset article source, and when the history text information article source screened by the first round of history is the same as the second history preset word, screening by the second history round;
acquiring a third history preset word, wherein the third history preset word is a preset article label, and when the history text information article label screened by the second round of history is the same as the third history preset word, screening by a third history round;
acquiring a fourth history preset word, wherein the fourth history preset word is a preset article keyword, and when the history text information screened by the third round of history contains the fourth history preset word, the fourth round of history screening is passed;
The history text information through the four rounds of history screening is determined as history text.
9. The automated text monitoring method of claim 8, wherein the determining historical text information that passed through the round of historical screening as historical text further comprises:
acquiring a first history preset state, wherein the first history preset state is an article stamping requirement, and selecting a history article stamping state option if the first screening text comprises the first history preset state;
acquiring a second history preset state, wherein the second history preset state is an article examination requirement, and selecting a history article examination state option if the first screening text comprises the second history preset state;
and generating a historical state table according to the historical article stamping state options and the historical article examination state options.
10. The automated text monitoring method of claim 1, wherein the historical text information is derived from identifying historical text information of a historical enterprise website, in particular:
acquiring a plurality of historical enterprise websites selected by a user and account passwords corresponding to each historical enterprise website, and accessing each historical enterprise website according to the account passwords corresponding to each historical enterprise website;
If a history verification notification of a history enterprise website is received, identifying history verification information and a history verification frame in the history verification notification, filling the history verification information into the history verification frame, and accessing the history enterprise website after verification is passed.
11. The automated text monitoring method of claim 1, wherein comparing the first screening text with the history text, and if the first screening text is inconsistent with the history text, generating a web page report according to the first screening text, the history text and the status table after the first screening text is retained;
wherein the web page report further includes a processing status bar;
the processing status bar comprises an option to be processed, an option to be processed and an option to be unprocessed, and is used for marking according to the current report progress.
12. An automated text monitoring apparatus, comprising: the system comprises an access module, a first text screening module and a comparison module;
the access module is used for acquiring a plurality of enterprise websites selected by a user and account passwords corresponding to each enterprise website, and accessing corresponding enterprise websites according to preset time respectively according to the account passwords corresponding to each enterprise website;
The first screening text module is used for identifying first text information of a current enterprise website when receiving notification information sent by the current access enterprise website, and carrying out current preset word screening on the first text information to obtain a first screening text;
the comparison module is used for comparing the first screening text with the historical text, and if the first screening text is inconsistent with the historical text, the first screening text is reserved; the history text is obtained by screening and obtaining history preset words of history text information; the historical text information is derived from identifying historical text information of a historical enterprise website.
13. The automated text monitoring apparatus of claim 12, further comprising a web page reporting module;
the webpage report module generates according to the first screening text, the historical text and the state table;
the web page report module further comprises a processing status bar;
the processing status bar comprises an option to be processed, an option to be processed and an option to be unprocessed, and is used for marking according to the current report progress.
CN202311483608.XA 2023-11-08 2023-11-08 Automatic text monitoring method and device Pending CN117786040A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202311483608.XA CN117786040A (en) 2023-11-08 2023-11-08 Automatic text monitoring method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202311483608.XA CN117786040A (en) 2023-11-08 2023-11-08 Automatic text monitoring method and device

Publications (1)

Publication Number Publication Date
CN117786040A true CN117786040A (en) 2024-03-29

Family

ID=90393343

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202311483608.XA Pending CN117786040A (en) 2023-11-08 2023-11-08 Automatic text monitoring method and device

Country Status (1)

Country Link
CN (1) CN117786040A (en)

Similar Documents

Publication Publication Date Title
CN103678109B (en) A kind of dump file analysis method, device and system
CN107943838B (en) Method and system for automatically acquiring xpath generated crawler script
CN105740402A (en) Method and device for acquiring semantic labels of digital images
US10296552B1 (en) System and method for automated identification of internet advertising and creating rules for blocking of internet advertising
CN108959349B (en) Financial audit inquiry system
CN112163553B (en) Material price accounting method, device, storage medium and computer equipment
CN112836018A (en) Method and device for processing emergency plan
CN111680073A (en) Financial service platform policy information recommendation method based on user data
CN111583000B (en) Method and device for identifying behavior of surrounding mark and string mark, computer equipment and storage medium
CN110188856B (en) Automatic generation method and system of environmental quality monitoring sampling label
CN117786040A (en) Automatic text monitoring method and device
CN112418813A (en) AEO qualification intelligent rating management system and method based on intelligent analysis and identification and storage medium
CN111324463A (en) Engineering file label clearing method, system, device and storage medium
CN107643968A (en) Crash log processing method and processing device
CN115565193A (en) Questionnaire information input method and device, electronic equipment and storage medium
CN114861166A (en) Popup window intercepting method, device, equipment and medium
EP2976721B1 (en) Identification of packaged items
CN105874470A (en) Interactive optical codes
CN113688346A (en) Illegal website identification method, device, equipment and storage medium
CN112035440A (en) Knowledge base management method and device, electronic equipment and storage medium
CN113327023A (en) Traversal test method and device, electronic equipment and computer readable storage medium
CN112541085B (en) Method for structuring questionnaire, apparatus for structuring questionnaire, and storage medium
CN111177501B (en) Label processing method, device and system
Hatlem et al. Intelligent tracing and process improvement of pathology workflows using character recognition
CN114283492B (en) Staff behavior-based work saturation analysis method, device, equipment and medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination