CN114328907A - Natural language processing method for early warning risk upgrade event - Google Patents

Natural language processing method for early warning risk upgrade event Download PDF

Info

Publication number
CN114328907A
CN114328907A CN202111232364.9A CN202111232364A CN114328907A CN 114328907 A CN114328907 A CN 114328907A CN 202111232364 A CN202111232364 A CN 202111232364A CN 114328907 A CN114328907 A CN 114328907A
Authority
CN
China
Prior art keywords
event
risk
early warning
model
natural language
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202111232364.9A
Other languages
Chinese (zh)
Inventor
宋超伟
王雁飞
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Zhejiang Jiaxing Digital City Laboratory Co ltd
Original Assignee
Zhejiang Jiaxing Digital City Laboratory Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Zhejiang Jiaxing Digital City Laboratory Co ltd filed Critical Zhejiang Jiaxing Digital City Laboratory Co ltd
Priority to CN202111232364.9A priority Critical patent/CN114328907A/en
Publication of CN114328907A publication Critical patent/CN114328907A/en
Pending legal-status Critical Current

Links

Images

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention provides a natural language processing method for early warning of risk upgrade events, which comprises the following steps: s1, establishing a data warehouse, and accessing a data source to the data warehouse; s2, based on natural language processing, extracting and processing event information in the data source through an event analysis related model; and S3, inputting the result value of the event analysis related model into the early warning analysis model, and carrying out risk value analysis and early warning of risk upgrading event upgrading. According to the method, various elements in the event are scientifically and efficiently analyzed through a natural language processing technology, a risk analysis model is obtained through massive event data training, the risk of converting a certain event into a high-risk case can be accurately obtained, and related workers can intervene in the event in advance, resolve contradiction disputes and prevent the high-risk case from occurring.

Description

Natural language processing method for early warning risk upgrade event
Technical Field
The invention belongs to the technical field of early warning risk upgrading events, and particularly relates to a natural language processing method for early warning risk upgrading events.
Background
Currently, with the development of social economy, civil cases caused by family marital disputes, neighborhood disputes, economic compensation disputes, labor and resources disputes, life trivia disputes and the like in daily work and life are increasingly frequent. If not correctly guided, along with the accumulation of contradictions, high-risk cases are often caused, and huge losses are brought to the lives and properties of people. In the past, no intelligent and efficient method is available for analyzing the cases of disputes and civil affairs to obtain the risk of upgrading the events into high-risk cases.
The natural language processing integrates linguistics, computer science and mathematics, and can perform entity extraction, automatic summarization, text classification and the like on texts. In a civil event, the event can be classified through natural language processing, factors such as related personnel, personnel property loss, personnel emotion and the like of the event are extracted, whether the event is further upgraded can be scientifically and accurately analyzed, and meanwhile, an algorithm model is perfected by tracking the whole life cycle of the event. In the prior art, a method for analyzing disputes and civil cases and obtaining the risk of upgrading events to high-risk cases by a natural language processing method does not exist.
Disclosure of Invention
The invention aims to solve the problems, provides a natural language processing method for early warning of risk upgrading events, can accurately obtain the risk of converting a certain event into a high-risk case, is convenient for resolving contradiction disputes, and prevents the occurrence of the high-risk case.
In order to achieve the purpose, the invention adopts the following technical scheme:
the natural language processing method for early warning of risk escalation events comprises the following steps:
s1, establishing a data warehouse, and accessing a data source to the data warehouse;
s2, based on natural language processing, extracting and processing event information in the data source through an event analysis related model;
and S3, inputting the result value of the event analysis related model into the early warning analysis model, and carrying out risk value analysis and early warning of risk upgrading event upgrading. According to the invention, various elements in the event are scientifically and efficiently analyzed by the natural language processing method, and a risk analysis model obtained by mass event data training is combined, so that the risk upgrading event can be early warned, and related personnel can intervene the event in advance, resolve contradiction disputes and prevent and reduce high-risk cases.
Further, S1 specifically includes:
s101, accessing a multi-channel data source;
and S102, cleaning the data according to the uniform format to generate uniform structured data, and writing the uniform structured data into a database. According to the method, event data information is acquired from each channel, a risk analysis model can be obtained by combining mass event data training, and the generated structured data is stored in the same database or is distributed in a plurality of databases.
Further, the data source includes event data in each field, and specifically includes: the system comprises citizen hot lines, hot spot events, non-police affair warning conditions, various help hot lines, citizen regulation data and contradiction regulation center data. The data resource library established by the invention is accessed to the event data of each field, and provides data basis and basis for subsequent event classification and analysis.
Further, the event analysis related model comprises an event classification model capable of dynamically configuring event types according to business needs, an entity extraction model for extracting event places, organizations and personnel, a property loss model for calculating property loss, a casualty model for calculating personnel and a civil emotion model for calculating the emotional intensity of the civil.
Further, S2 includes:
s201, classifying events according to event information, and dividing the events into a plurality of major classes through an event classification model, wherein a plurality of minor classes are arranged under each major class;
s202, sending the classified results into an entity extraction model for entity extraction, and extracting the characteristics of texts and categories which are in accordance with the social comprehensive treatment;
s203, performing preset keyword collision according to different entity extraction result contents, calculating a score through a property loss model and a casualty model, enabling a full text to enter a people emotion model, and obtaining the people emotion violence degree of a risk upgrading event through analysis and calculation;
and transmitting corresponding model calculation results downwards among the models in an interface mode.
Further, event classification classifies events by a hybrid algorithm combining a K-means algorithm and a bayesian network: training sample clustering through a K-means algorithm, providing an artificial intervention interface for artificial dynamic intervention to configure a training result, and classifying events by using a Bayesian network according to the training result. The invention can intervene the training result dynamically.
Further, prior to step S202, corpus feeding is performed on the entity extraction model in advance: and (3) arranging and summarizing a large number of events related to the social comprehensive treatment, then training, marking and naming each entity in the training process, and correcting according to the obtained result to obtain an entity extraction model capable of extracting the characteristics of texts and categories which are in accordance with the social comprehensive treatment.
Further, risk value calculation is carried out on the risk value analysis of the risk upgrading event according to a risk value algorithm, and the citizen turning risk value is calculated by combining the event extraction result.
Further, step S3 is followed by adjusting the weight of each element to the risk value through the historical data to optimize the risk value algorithm.
Further, the risk value early warning of the risk upgrade event provides early warning of the risk value of the event upgrade by establishing a risk upgrade event early warning trigger mechanism: and setting a corresponding early warning risk value threshold value for the event, and sending early warning information to related personnel when the risk upgrading event upgrading risk value exceeds the risk threshold value. According to the invention, when the risk value of the risk upgrade event exceeds the risk threshold value, related personnel are notified to process the risk value in a convenient and fast manner.
Compared with the prior art, the invention has the advantages that:
the natural language processing method for the early warning risk upgrade event, disclosed by the invention, scientifically and efficiently analyzes various elements in the event through a natural language processing technology, and can accurately obtain the risk of converting a certain event into a high-risk case by combining a risk analysis model obtained by mass event data training, so that related workers can intervene in the event in advance, resolve contradiction disputes and prevent the high-risk case.
Additional advantages, objects, and features of the invention will be set forth in part in the description which follows and in part will become apparent to those having ordinary skill in the art upon examination of the following or may be learned from practice of the invention.
Drawings
FIG. 1 is a flow chart of the present invention.
Detailed Description
In order to make the technical solutions of the present invention better understood, the technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the accompanying drawings.
As shown in fig. 1, the natural language processing method for warning a risk upgrade event according to the present embodiment, which warns a risk upgrade event through the natural language processing method, includes the following steps:
s1, establishing a data warehouse, and accessing a data source to the data warehouse;
s2, based on natural language processing, extracting and processing event information in the data source through an event analysis related model;
and S3, inputting the result value of the event analysis related model algorithm into the early warning analysis model, and carrying out risk value analysis and early warning of risk upgrading event upgrading. The data source in this embodiment includes event data. According to the method, various elements in the event are scientifically and efficiently analyzed through a natural language processing method, and a risk analysis model obtained through mass event data training is combined, so that the risk upgrading event can be used for early warning, relevant personnel can intervene in the event in advance according to the risk upgrading event, conflict disputes are resolved, and high-risk cases are prevented from happening.
This embodiment S1 specifically includes:
s101, accessing a multi-channel data source: in the embodiment, access event information is acquired through sources such as a large data center in the whole city, each operator, a hot line data center, each basic management platform and the like;
and S102, cleaning the data according to the uniform format to generate uniform structured data, and writing the uniform structured data into a plurality of databases.
The data repository, namely the data warehouse, established in the embodiment can be accessed to event data of each field, and the event data of each field comprises event data obtained from citizen hotlines, hot spot events, non-police affairs, various help hotlines, people mediation data, spear mediation center and the like of the whole city. The data resource library established in the embodiment is accessed to event data in each field, and can provide data basis and basis for subsequent classification and analysis.
In this embodiment, an event analysis correlation model is established based on natural language processing, and the event analysis correlation model of this embodiment includes: 1. an event classification model, namely, according to business needs, the event type can be dynamically configured; 2. an entity extraction model, which extracts event locations, organizations, personnel and the like; 3. a property loss model; 4. a casualty model; 5. the people emotion model obtains the emotional bias and the violence degree of characters or voice according to an algorithm. And inputting the result values of the 5 model algorithms into an early warning analysis model.
The present embodiment S2 includes:
s201, event classification is carried out according to event information, events are divided into a plurality of major classes through an event classification model, a plurality of minor classes are arranged under each major class, and the events are classified according to a mixed algorithm combining a K-means algorithm and a Bayesian network in the embodiment: training sample clustering through a K-means algorithm, and classifying events by using an improved Bayesian network according to a training result; the embodiment can dynamically configure the training result through automatic intervention or manual intervention of a result machine;
s202, sending the classified result into an entity extraction model for entity extraction, and extracting the text and the class characteristics which are fit for the social comprehensive treatment, wherein the text and the class characteristics are fit for the social comprehensive treatment, namely the text and the class characteristics are fit for the social comprehensive treatment;
s203, performing preset keyword collision according to different entity extraction result contents, calculating to obtain a score through a property loss model and a casualty model, enabling a full text to enter a civil emotion model, and performing analytic operations such as lexical analysis, Chinese word vector representation, word meaning similarity, Chinese DNN language model, dependency syntactic analysis, short text similarity and the like on the civil emotion model to obtain the emotional intensity of the event;
and the calculation results of each model are transmitted downwards in an interface mode. That is, in this embodiment, the model calculation result is transmitted downward through the interface between the aforementioned models.
The score in this embodiment S203 is a configuration rule that can be customized, such as whether there is casualty to obtain different scores, whether there is property loss and how much loss to obtain different scores. Full text refers to all data, the entirety of data. The calculation rule of the emotional intensity is that the excitation composition is divided into three parts according to a negative emotion machine learning algorithm, and the three parts are specifically as follows: 1-33 were mild, 34-67 were moderate, and 68-100 were severe.
In this embodiment, nlp is adopted as the method for extracting the text and category features in accordance with the social comprehensive treatment, and before step S202, the corpus is fed in advance: a large number of events related to the social comprehensive treatment are sorted and summarized, then training is carried out, labeling and naming are carried out on each entity (entity) in the training process, and the obtained result is corrected, so that a text and category feature extraction method which is fit for the social comprehensive treatment is designed.
According to the risk value analysis and early warning of the risk upgrading event, the event is converted into the high-risk case risk value through calculation by combining result information extracted by the event according to a specific algorithm model. The risk value calculation of the embodiment calculates the civil turning penalty risk value according to a risk value algorithm and by combining result information extracted by an event. The event extraction result comprises a large category, a small category, a characteristic entity, a difference rate of preset keywords and the like.
Meanwhile, the risk value early warning of the risk upgrade event of the embodiment provides the early warning of the risk value of the event upgrade by establishing a risk upgrade event early warning trigger mechanism: the early warning is carried out according to early warning risk value thresholds set by various events, and when the risk value reaches a certain value, namely the risk value is upgraded to exceed the risk threshold in a risk upgrading event, the early warning is provided for relevant departments. The embodiment can inform related personnel to process through convenient forms such as short messages, mails and the like. The early warning risk value threshold of each kind of event is set up in advance, and the technical personnel in the field can set up according to actual conditions is nimble.
In addition, in the long-term calculation process, the risk value algorithm can be optimized according to the historical data, and the weight of each element to the risk value can be adjusted. And the weight of each element to the risk value is adjusted through historical data to optimize a risk value algorithm, and each element is a result extracted from the event used as the basis for computation of the civil transfer criminal risk value and comprises each intermediate product.
The specific embodiments described herein are merely illustrative of the spirit of the invention. Various modifications or additions may be made to the described embodiments or alternatives may be employed by those skilled in the art without departing from the spirit or ambit of the invention as defined in the appended claims.

Claims (10)

1. A natural language processing method for early warning of risk escalation events is characterized by comprising the following steps:
s1, establishing a data warehouse, and accessing a data source to the data warehouse;
s2, based on natural language processing, extracting and processing event information in the data source through an event analysis related model;
and S3, inputting the result value of the event analysis related model into the early warning analysis model, and carrying out risk value analysis and early warning of risk upgrading event upgrading.
2. The natural language processing method for early warning of a risk escalation event according to claim 1, wherein the S1 specifically includes:
s101, accessing a multi-channel data source;
and S102, cleaning the data according to the uniform format to generate uniform structured data, and writing the uniform structured data into a database.
3. The natural language processing method for early warning risk escalation event according to claim 1, wherein the data source includes event data of each field, specifically including: the system comprises citizen hot lines, hot spot events, non-police affair warning conditions, various help hot lines, citizen regulation data and contradiction regulation center data.
4. The natural language processing method for early warning of risk escalation events according to claim 1, wherein the event analysis correlation models include an event classification model for dynamically configurable event types according to business needs, an entity extraction model for extracting event locations, organizations, and people, a property loss model for calculating property losses, a casualty model for calculating people, and a people emotion model for calculating the emotional intensity of people.
5. The natural language processing method for early warning of a risk escalation event according to claim 4, wherein the S2 includes:
s201, classifying events according to event information, and dividing the events into a plurality of major classes through an event classification model, wherein a plurality of minor classes are arranged under each major class;
s202, sending the classified results into an entity extraction model for entity extraction, and extracting the characteristics of texts and categories which are in accordance with the social comprehensive treatment;
s203, performing preset keyword collision according to different entity extraction result contents, calculating a score through a property loss model and a casualty model, enabling a full text to enter a people emotion model, and obtaining the people emotion violence degree of a risk upgrading event through analysis and calculation;
and transmitting corresponding model calculation results downwards among the models in an interface mode.
6. The natural language processing method for early warning of risk escalation events according to claim 5, wherein the event classification classifies events by a hybrid algorithm combining K-means algorithm and Bayesian network:
training sample clustering through a K-means algorithm, providing an artificial intervention interface for artificial dynamic intervention to configure a training result, and classifying events by using a Bayesian network according to the training result.
7. The natural language processing method for early warning of risk escalation event according to claim 5, characterized in that, before step S202, the entity extraction model is corpus-fed in advance: and (3) arranging and summarizing a large number of events related to the social comprehensive treatment, then training, marking and naming each entity in the training process, and correcting according to the obtained result to obtain an entity extraction model capable of extracting the characteristics of texts and categories which are in accordance with the social comprehensive treatment.
8. The natural language processing method for early warning risk escalation events according to claim 1, wherein the risk escalation event escalation risk value analysis performs risk value calculation according to a risk value algorithm, and calculates a civil diversion risk value by combining the event extracted result.
9. The natural language processing method for early warning of a risk escalation event according to claim 8, further comprising, after step S3: and adjusting the weight of each element on the risk value through historical data to optimize the risk value algorithm.
10. The natural language processing method for early warning of a risk escalation event according to claim 1, wherein the risk escalation event escalation risk value early warning provides early warning of an event escalation risk value by establishing a risk escalation event early warning trigger mechanism:
and setting a corresponding early warning risk value threshold value for the event, and sending early warning information to related personnel when the risk upgrading event upgrading risk value exceeds the risk threshold value.
CN202111232364.9A 2021-10-22 2021-10-22 Natural language processing method for early warning risk upgrade event Pending CN114328907A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202111232364.9A CN114328907A (en) 2021-10-22 2021-10-22 Natural language processing method for early warning risk upgrade event

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202111232364.9A CN114328907A (en) 2021-10-22 2021-10-22 Natural language processing method for early warning risk upgrade event

Publications (1)

Publication Number Publication Date
CN114328907A true CN114328907A (en) 2022-04-12

Family

ID=81044560

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202111232364.9A Pending CN114328907A (en) 2021-10-22 2021-10-22 Natural language processing method for early warning risk upgrade event

Country Status (1)

Country Link
CN (1) CN114328907A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116013027A (en) * 2022-08-05 2023-04-25 航天神舟智慧系统技术有限公司 Group event early warning method and system

Citations (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20020083080A1 (en) * 2001-02-22 2002-06-27 Classen Immunotherapies Computer algorithms and methods for product safety
US20050071217A1 (en) * 2003-09-30 2005-03-31 General Electric Company Method, system and computer product for analyzing business risk using event information extracted from natural language sources
US9552548B1 (en) * 2016-07-01 2017-01-24 Intraspexion Inc. Using classified text and deep learning algorithms to identify risk and provide early warning
WO2018063773A1 (en) * 2016-07-01 2018-04-05 Intraspexion Inc. Using classified text and deep learning algorithms to identify risk and provide early warning
CN108320256A (en) * 2017-12-08 2018-07-24 中国电子科技集团公司电子科学研究院 Social security events recognition methods, equipment and storage medium based on big data
CN110008349A (en) * 2019-02-01 2019-07-12 阿里巴巴集团控股有限公司 The method and device for the event risk assessment that computer executes
CN110008311A (en) * 2019-04-04 2019-07-12 北京邮电大学 A kind of product information security risk monitoring method based on semantic analysis
CN111062562A (en) * 2019-11-06 2020-04-24 航天信息股份有限公司 Community grid service linkage disposal control method and system
CN111260223A (en) * 2020-01-17 2020-06-09 山东省计算中心(国家超级计算济南中心) Intelligent identification and early warning method, system, medium and equipment for trial and judgment risk
CN111582762A (en) * 2020-05-29 2020-08-25 重庆木舌科技有限公司 School law risk early warning method and system based on public events
CN112750028A (en) * 2020-12-30 2021-05-04 北京知因智慧科技有限公司 Risk early warning method and device of event text based on entity extraction
CN112765485A (en) * 2021-01-18 2021-05-07 深圳市网联安瑞网络科技有限公司 Network social event prediction method, system, terminal, computer device and medium
CN112836517A (en) * 2021-01-27 2021-05-25 浪潮云信息技术股份公司 Method for processing mining risk signal based on natural language
EP3835994A1 (en) * 2019-12-13 2021-06-16 Tata Consultancy Services Limited System and method for identification and profiling adverse events

Patent Citations (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20020083080A1 (en) * 2001-02-22 2002-06-27 Classen Immunotherapies Computer algorithms and methods for product safety
US20050071217A1 (en) * 2003-09-30 2005-03-31 General Electric Company Method, system and computer product for analyzing business risk using event information extracted from natural language sources
US9552548B1 (en) * 2016-07-01 2017-01-24 Intraspexion Inc. Using classified text and deep learning algorithms to identify risk and provide early warning
WO2018063773A1 (en) * 2016-07-01 2018-04-05 Intraspexion Inc. Using classified text and deep learning algorithms to identify risk and provide early warning
CN108320256A (en) * 2017-12-08 2018-07-24 中国电子科技集团公司电子科学研究院 Social security events recognition methods, equipment and storage medium based on big data
CN110008349A (en) * 2019-02-01 2019-07-12 阿里巴巴集团控股有限公司 The method and device for the event risk assessment that computer executes
CN110008311A (en) * 2019-04-04 2019-07-12 北京邮电大学 A kind of product information security risk monitoring method based on semantic analysis
CN111062562A (en) * 2019-11-06 2020-04-24 航天信息股份有限公司 Community grid service linkage disposal control method and system
EP3835994A1 (en) * 2019-12-13 2021-06-16 Tata Consultancy Services Limited System and method for identification and profiling adverse events
CN111260223A (en) * 2020-01-17 2020-06-09 山东省计算中心(国家超级计算济南中心) Intelligent identification and early warning method, system, medium and equipment for trial and judgment risk
CN111582762A (en) * 2020-05-29 2020-08-25 重庆木舌科技有限公司 School law risk early warning method and system based on public events
CN112750028A (en) * 2020-12-30 2021-05-04 北京知因智慧科技有限公司 Risk early warning method and device of event text based on entity extraction
CN112765485A (en) * 2021-01-18 2021-05-07 深圳市网联安瑞网络科技有限公司 Network social event prediction method, system, terminal, computer device and medium
CN112836517A (en) * 2021-01-27 2021-05-25 浪潮云信息技术股份公司 Method for processing mining risk signal based on natural language

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116013027A (en) * 2022-08-05 2023-04-25 航天神舟智慧系统技术有限公司 Group event early warning method and system

Similar Documents

Publication Publication Date Title
CN110765244B (en) Method, device, computer equipment and storage medium for obtaining answering operation
CN108304468B (en) Text classification method and text classification device
Inzalkar et al. A survey on text mining-techniques and application
CN102096703B (en) Filtering method and equipment of short messages
CN110458296B (en) Method and device for marking target event, storage medium and electronic device
CN110309234B (en) Knowledge graph-based customer warehouse-holding early warning method and device and storage medium
CN110852095B (en) Statement hot spot extraction method and system
CN104834651A (en) Method and apparatus for providing answers to frequently asked questions
CN110020327A (en) A kind of resume resolution system based on vertical search engine
CN104933072A (en) Multi-language internet information analysis method
EP3994589A1 (en) System, apparatus and method of managing knowledge generated from technical data
CN110516057A (en) A kind of petition letter problem answer method and device
CN114896305A (en) Smart internet security platform based on big data technology
KR102376489B1 (en) Text document cluster and topic generation apparatus and method thereof
WO2011044025A2 (en) Method and system for taking actions based on analysis of enterprise communication messages
CN114328907A (en) Natural language processing method for early warning risk upgrade event
CN112579733A (en) Rule matching method, rule matching device, storage medium and electronic equipment
CN114757178A (en) Core product word extraction method, device, equipment and medium
CN116992010A (en) Content distribution and interaction method and system based on multi-mode large model
CN110633468A (en) Information processing method and device for object feature extraction
US20200184521A1 (en) Method and system for initiating an interface concurrent with generation of a transitory sentiment community
CN114971833A (en) Tax information processing method and related equipment
AU2019261748B2 (en) Automatic data transformations for process automations
Povoda et al. Genetic optimization of big data sentiment analysis
CN112988704A (en) AI consultation database cluster building method and system

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination