WO2019169766A1 - Electronic apparatus, method and system for early warning regarding system sensitive content, and storage medium - Google Patents
Electronic apparatus, method and system for early warning regarding system sensitive content, and storage medium Download PDFInfo
- Publication number
- WO2019169766A1 WO2019169766A1 PCT/CN2018/089460 CN2018089460W WO2019169766A1 WO 2019169766 A1 WO2019169766 A1 WO 2019169766A1 CN 2018089460 W CN2018089460 W CN 2018089460W WO 2019169766 A1 WO2019169766 A1 WO 2019169766A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- index
- content
- sensitive
- coefficient
- allergy
- Prior art date
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F21/00—Security arrangements for protecting computers, components thereof, programs or data against unauthorised activity
- G06F21/50—Monitoring users, programs or devices to maintain the integrity of platforms, e.g. of processors, firmware or operating systems
- G06F21/57—Certifying or maintaining trusted computer platforms, e.g. secure boots or power-downs, version controls, system software checks, secure updates or assessing vulnerabilities
- G06F21/577—Assessing vulnerabilities and evaluating computer system security
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/279—Recognition of textual entities
- G06F40/289—Phrasal analysis, e.g. finite state techniques or chunking
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/33—Querying
- G06F16/3331—Query processing
- G06F16/334—Query execution
- G06F16/3344—Query execution using natural language analysis
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/30—Semantic analysis
Definitions
- the present application relates to the field of communications technologies, and in particular, to an electronic device, an early warning method, system, and storage medium for sensitive content of a system.
- the company usually covers a variety of businesses, such as financial institutions, including securities, insurance, banking and other financial services, each of which can correspond to one or more social platforms, such as stock BBS, financial management.
- financial institutions including securities, insurance, banking and other financial services, each of which can correspond to one or more social platforms, such as stock BBS, financial management.
- social platforms such as stock BBS, financial management.
- a variety of sensitive information may appear in these platforms, such as pornography, politics, advertising, etc., which are prone to adverse effects.
- There are some methods in the industry to judge the content of the platform and to warn sensitive information but the evaluation of sensitive information is relatively solid, generally only pay attention to the correct rate of judgment, and it is prone to a one-size-fits-all situation.
- the purpose of the present application is to provide an electronic device, an early warning method for a sensitive content of a system, a system, and a storage medium, which are designed to hierarchically evaluate sensitive information in combination with business scenarios and user behaviors, and provide more accurate early warning of sensitive information.
- the present application provides an electronic device including a memory and a processor coupled to the memory, the memory storing a processing system operable on the processor, the processing The system implements the following steps when executed by the processor:
- the anti-allergy index processing step acquires each anti-allergy parameter corresponding to the business system and the weight corresponding to each anti-allergy parameter, and calculates a system anti-allergy index of the business system according to each anti-allergy parameter and corresponding weight;
- the sensitive index processing step acquires the content sensitivity index and the user historical behavior index of each user's published content in the business system, calculates the system sensitivity index of the user according to the content sensitivity index and the user historical behavior index, and sensitive the system of all users in the business system.
- the sum of the indices gives the sum of the system sensitivity indices;
- the early warning step is determined, and the difference between the system anti-allergy index corresponding to the business system and the sum of the system sensitivity indexes is calculated, and whether the early warning is issued is determined according to the difference.
- the present application further provides an early warning method for system sensitive content, and the method for alerting sensitive content of the system includes:
- S2 obtaining a content sensitivity index and a user historical behavior index of content published by each user in the business system, calculating a system sensitivity index of the user according to the content sensitivity index and the user historical behavior index, and adding system sensitivity indexes of all users in the business system Get the sum of the system sensitivity index;
- S3 Calculate the difference between the system anti-allergy index corresponding to the service system and the sum of the system sensitivity indexes, and determine whether to issue an early warning according to the difference.
- the present application further provides an early warning system for system sensitive content, wherein the system early warning system for sensitive content includes:
- the obtaining module is configured to obtain each anti-allergy parameter corresponding to the service system and a weight corresponding to each anti-allergy parameter, and calculate a system anti-allergy index of the service system according to each anti-allergy parameter and the corresponding weight;
- the processing module is configured to obtain a content sensitivity index and a user historical behavior index of each user published content in the business system, calculate a system sensitivity index of the user according to the content sensitivity index and the user historical behavior index, and sensitivity the system of all users in the business system.
- the sum of the indices gives the sum of the system sensitivity indices;
- the early warning module is configured to calculate a difference between the system anti-allergy index corresponding to the business system and the sum of the system sensitivity indexes, and determine whether to issue an early warning according to the difference.
- the present application also provides a computer readable storage medium having stored thereon a processing system that, when executed by a processor, implements the steps of the method described above.
- the present application calculates a system anti-allergy index for judging its ability to resist sensitive information for different business systems, and calculates a system sensitivity index for judging the degree of sensitive influence of the user's behavior on the business system. Sum, according to the difference between the two to determine whether the business system issues an early warning, this application comprehensively considers the actual application of the business system, each business system has different tolerance for sensitive information, and the rigor of users in each business system is different. Based on these two aspects, from the perspective of the practical application of the business system, combined with the business scenario and user behavior, the sensitive information is hierarchically evaluated, and the sensitive information is more accurately alerted.
- FIG. 1 is a schematic diagram of a hardware architecture of an embodiment of an electronic device according to the present application.
- FIG. 2 is a schematic flowchart of an embodiment of an early warning method for sensitive content of the system of the present application.
- the electronic device 1 is a schematic diagram of a hardware architecture of an embodiment of an electronic device of the present application.
- the electronic device 1 is an apparatus capable of automatically performing numerical calculation and/or information processing in accordance with an instruction set or stored in advance.
- the electronic device 1 may be a computer, a single network server, a server group composed of multiple network servers, or a cloud-based cloud composed of a large number of hosts or network servers, where cloud computing is a type of distributed computing.
- a super virtual computer consisting of a group of loosely coupled computers.
- the electronic device 1 may include, but is not limited to, a memory 11 communicably connected to each other through a system bus, a processor 12, and a network interface 13, and the memory 11 stores a processing system operable on the processor 12. It should be noted that FIG. 1 only shows the electronic device 1 having the components 11-13, but it should be understood that not all illustrated components are required to be implemented, and more or fewer components may be implemented instead.
- the memory 11 includes a memory and at least one type of readable storage medium.
- the memory provides a cache for the operation of the electronic device 1;
- the readable storage medium may be, for example, a flash memory, a hard disk, a multimedia card, a card type memory (eg, SD or DX memory, etc.), a random access memory (RAM), a static random access memory (SRAM).
- a non-volatile storage medium such as a read only memory (ROM), an electrically erasable programmable read only memory (EEPROM), a programmable read only memory (PROM), a magnetic memory, a magnetic disk, an optical disk, or the like.
- the readable storage medium may be an internal storage unit of the electronic device 1, such as a hard disk of the electronic device 1; in other embodiments, the non-volatile storage medium may also be external to the electronic device 1.
- the storage device for example, a plug-in hard disk provided on the electronic device 1, a smart memory card (SMC), a Secure Digital (SD) card, a flash card, or the like.
- the readable storage medium of the memory 11 is generally used to store an operating system and various types of application software installed in the electronic device 1, such as program code for storing a processing system in an embodiment of the present application. Further, the memory 11 can also be used to temporarily store various types of data that have been output or are to be output.
- the processor 12 may be a Central Processing Unit (CPU), controller, microcontroller, microprocessor, or other data processing chip in some embodiments.
- the processor 12 is typically used to control the overall operation of the electronic device 1, such as performing control and processing related to data interaction or communication with other devices.
- the processor 12 is configured to run program code or process data stored in the memory 11, such as running a processing system or the like.
- the network interface 13 may comprise a wireless network interface or a wired network interface, which is typically used to establish a communication connection between the electronic device 1 and other electronic devices.
- the processing system is stored in the memory 11 and includes at least one computer readable instruction stored in the memory 11, the at least one computer readable instruction being executable by the processor 12 to implement the methods of various embodiments of the present application;
- the at least one computer readable instruction can be classified into different logic modules depending on the functions implemented by its various parts.
- the anti-allergy index processing step acquires each anti-allergy parameter corresponding to the business system and the weight corresponding to each anti-allergy parameter, and calculates a system anti-allergy index of the business system according to each anti-allergy parameter and corresponding weight;
- the system anti-allergy index is an indicator of the ability of the business system to resist sensitive information or sensitive content. The higher the system anti-allergy index, the stronger the ability of the business system to resist sensitive vocabulary.
- the anti-allergy parameter includes a system important level coefficient, a user quantity level coefficient, and a system information propagation coefficient; in another embodiment, the anti-allergy parameter includes a system important level coefficient, a user quantity level coefficient, and a system information propagation coefficient. , emergency response coefficient, sensitive vocabulary attention coefficient.
- the system important grade coefficient has a range of [0, 1].
- the system important grade coefficient is 0, the business system has the highest importance, and the system important grade coefficient can be divided into three levels: 0 is a level. , 0.5 is the second level, and 1 is the third level;
- the user quantity level coefficient ranges from [0, 1]. When the user quantity level coefficient is 0, the user quantity is the highest, and the user quantity level coefficient can be divided into three levels: 0 is the user quantity is above 10000, and 0.5 is the user quantity. Within 1000 to 10000, 1 is less than 1000 for the user;
- the system information propagation coefficient ranges from [0, 1]. When the system information propagation coefficient is 0, the system information is the easiest to propagate.
- the system information propagation coefficient can be divided into three levels: 0 is the pure Internet as the transmission route, and 0.5 is the enterprise.
- 0 is the pure Internet as the transmission route, and 0.5 is the enterprise.
- the mixed transmission route in the network composed of the local area network and the Internet, 1 is only the local area network as the transmission route;
- the range of emergency response coefficient is [0,1].
- the emergency response coefficient is 0, the system has no emergency treatment.
- the emergency treatment coefficient can be divided into three levels: 0 for no emergency response and 0.5 for quickly deleting sensitive content in the system. 1 is to locate the forwarding address and delete it while deleting the sensitive content quickly;
- the sensitivity vocabulary attention coefficient range is [0,1]. When the sensitive vocabulary attention degree coefficient is 0, the sensitive vocabulary attention degree is the highest.
- the sensitive vocabulary attention degree coefficient can be divided into 3 levels: 0 is pornography, politics, advertisement, illegal All of the concerns, 0.5 is not harmful to the attention of advertising, etc., 1 is already sensitive to other means, so do not pay attention.
- the anti-allergy parameters include the system important grade coefficient x1, the user magnitude coefficient x2, the system information propagation coefficient x3, the emergency processing coefficient x4, the sensitive vocabulary attention degree coefficient x5, for example, the system important grade coefficient x1, the user magnitude grade coefficient x2, the system
- the more important the business system the larger the user volume, the easier the information dissemination, the weaker the emergency processing capability, and the higher the sensitivity of the sensitive vocabulary, the smaller the system anti-allergic index of the business system and the resistance to sensitive vocabulary.
- the system important level coefficient x1 may be measured according to the weight of each service scenario in the service system and the service volume corresponding to each service scenario, and the financial service system is taken as an example, and the business scenario includes a financial transaction scenario and a common transaction scenario.
- Financial transaction scenarios include, for example, bank transfers, fund purchases, etc., and general transactions include, for example, premium payments, premium renewals, and the like.
- the weight of the financial transaction scenario is k1, the weight of the common transaction scenario is k2, the traffic volume of the financial transaction scenario is c1, the traffic volume of the common transaction scenario is c2, and the traffic volume can be measured by the amount of data sent and received, and the amount of data sent and received. The larger the traffic, the larger the traffic.
- the business scenario can be further subdivided to more objectively and comprehensively judge the importance of the business system.
- a financial transaction scenario, a common transaction scenario, a core business scenario, and a non-core business scenario, etc. various scenarios have corresponding traffic volumes, and the system important rank coefficient x1 can be calculated by the same principle calculation method as above.
- the sensitive index processing step acquires the content sensitivity index and the user historical behavior index of each user's published content in the business system, calculates the system sensitivity index of the user according to the content sensitivity index and the user historical behavior index, and sensitive the system of all users in the business system.
- the sum of the indices gives the sum of the system sensitivity indices;
- the user's system sensitivity index is used to judge the degree of sensitive influence of the user's behavior on the business system. The smaller the user's system sensitivity index, the less sensitive the impact on the business system, and the greater the user's system sensitivity index, the greater the sensitivity to the business system.
- the content sensitivity index refers to the sensitivity or metric of the content published by the user. If the content involves sensitive vocabulary or sensitive information, the content sensitivity index is larger, and the sensitive vocabulary or sensitive information includes pornography, politics, advertisement, illegality, etc. .
- the process of obtaining the content sensitivity index includes: segmenting the content of the user by using the sentence as a unit, matching the word after the word segmentation with the word in the pre-established thesaurus, to match the corresponding keyword,
- the keywords include nouns, verbs, words related to pornography, politics, advertisements, illegals, etc., and the corresponding core viewpoint information of each sentence is analyzed according to the keyword.
- the keywords may be directly composed.
- the new statement, the information of the statement is the core viewpoint information, and the content sensitive index associated with the core viewpoint information is obtained according to the association relationship between the pre-established core viewpoint information and the content sensitive index.
- the content sensitive index corresponding to the core viewpoint information is obtained by identifying the core program in advance and stored in the thesaurus.
- the core view information is pornographic information
- the content sensitivity index corresponds to n1
- the core view information is illegal information
- the content sensitivity index corresponds to n2
- the core view information is political information
- the content sensitivity index corresponds to n3.
- the core viewpoint information is advertisement information
- its content sensitivity index corresponds to n4, n1 ⁇ n2 ⁇ n3 ⁇ n4.
- the user historical behavior index is a description of the behavior of the user to the content published by other users.
- the user's historical behavior index has a base of 1. If the user reports that other users publish sensitive vocabulary or sensitive information, The user's historical behavior index is (1-0.2). If the user has published a behavior involving sensitive vocabulary or sensitive information, the user's historical behavior index is (1+0.2).
- the system sensitivity index content sensitivity index*user historical behavior index of the user is calculated, and the system sensitivity indexes of all users in the business system are added. Get the sum of the system sensitivity index.
- the words in the thesaurus may be maintained, and the evaluation of the content sensitivity index corresponding to the core viewpoint information in the existing thesaurus may be adjusted in real time according to the operation of the business system, and the specific adjustment algorithm includes: according to the vocabulary The total frequency of occurrence, the breadth of vocabulary in each business system, the feedback rating of the operator, and the feedback frequency of the operator are comprehensively assessed. If the above four indicators are higher, the content sensitivity index is larger.
- the early warning step is determined, and the difference between the system anti-allergy index corresponding to the business system and the sum of the system sensitivity indexes is calculated, and whether the early warning is issued is determined according to the difference.
- the business system if the difference between the system anti-allergy index and the sum of the system sensitivity index is larger, the system anti-allergy index is larger, and the sum of the system sensitivity index is smaller, the business system
- the step specifically includes:
- the difference is greater than 90, it is determined that the service system does not issue an early warning; if the difference is less than or equal to 90 and greater than 60, it is determined that the service system issues a slight warning; if the difference is less than If it is equal to 60 and greater than 30, it is determined that the service system issues a medium warning; if the difference is less than or equal to 30, it is determined that the service system issues a severe warning.
- the present application calculates a system anti-allergy index for judging its ability to resist sensitive information for different business systems, and calculates a system sensitivity index for judging the degree of sensitive influence of the user's behavior on the business system.
- this application comprehensively considers the actual application of the business system, each business system has different tolerance for sensitive information, and the rigor of users in each business system is different. Based on these two aspects, from the perspective of the practical application of the business system, combined with the business scenario and user behavior, the sensitive information is hierarchically evaluated to provide more accurate warning of sensitive information.
- FIG. 2 is a schematic flowchart of an embodiment of an early warning method for sensitive content of a system according to the present application.
- the method for alerting sensitive content of the system includes the following steps:
- Step S1 acquiring respective anti-allergy parameters corresponding to the service system and weights corresponding to the respective anti-allergy parameters, and calculating a system anti-allergy index of the service system according to each anti-allergy parameter and corresponding weight;
- the system anti-allergy index is an indicator of the ability of the business system to resist sensitive vocabulary. The higher the system anti-allergic index, the stronger the ability of the business system to resist sensitive vocabulary.
- the anti-allergy parameter includes a system important level coefficient, a user quantity level coefficient, and a system information propagation coefficient; in another embodiment, the anti-allergy parameter includes a system important level coefficient, a user quantity level coefficient, and a system information propagation coefficient. , emergency response coefficient, sensitive vocabulary attention coefficient.
- the system important grade coefficient has a range of [0, 1].
- the system important grade coefficient is 0, the business system has the highest importance, and the system important grade coefficient can be divided into three levels: 0 is a level. , 0.5 is the second level, and 1 is the third level;
- the user quantity level coefficient ranges from [0, 1]. When the user quantity level coefficient is 0, the user quantity is the highest, and the user quantity level coefficient can be divided into three levels: 0 is the user quantity is above 10000, and 0.5 is the user quantity. Within 1000 to 10000, 1 is less than 1000 for the user;
- the system information propagation coefficient ranges from [0, 1]. When the system information propagation coefficient is 0, the system information is the easiest to propagate.
- the system information propagation coefficient can be divided into three levels: 0 is the pure Internet as the transmission route, and 0.5 is the enterprise.
- 0 is the pure Internet as the transmission route, and 0.5 is the enterprise.
- the mixed transmission route in the network composed of the local area network and the Internet, 1 is only the local area network as the transmission route;
- the range of emergency response coefficient is [0,1].
- the emergency response coefficient is 0, the system has no emergency treatment.
- the emergency treatment coefficient can be divided into three levels: 0 for no emergency response and 0.5 for quickly deleting sensitive content in the system. 1 is to locate the forwarding address and delete it while deleting the sensitive content quickly;
- the sensitivity vocabulary attention coefficient range is [0,1]. When the sensitive vocabulary attention degree coefficient is 0, the sensitive vocabulary attention degree is the highest.
- the sensitive vocabulary attention degree coefficient can be divided into 3 levels: 0 is pornography, politics, advertisement, illegal All of the concerns, 0.5 is not harmful to the attention of advertising, etc., 1 is already sensitive to other means, so do not pay attention.
- the anti-allergy parameters include the system important grade coefficient x1, the user magnitude coefficient x2, the system information propagation coefficient x3, the emergency processing coefficient x4, the sensitive vocabulary attention degree coefficient x5, for example, the system important grade coefficient x1, the user magnitude grade coefficient x2, the system
- the more important the business system the larger the user volume, the easier the information dissemination, the weaker the emergency processing capability, and the higher the sensitivity of the sensitive vocabulary, the smaller the system anti-allergic index of the business system and the resistance to sensitive vocabulary.
- the system important level coefficient x1 may be measured according to the weight of each service scenario in the service system and the service volume corresponding to each service scenario, and the financial service system is taken as an example, and the business scenario includes a financial transaction scenario and a common transaction scenario.
- Financial transaction scenarios include, for example, bank transfers, fund purchases, etc., and general transactions include, for example, premium payments, premium renewals, and the like.
- the weight of the financial transaction scenario is k1, the weight of the common transaction scenario is k2, the traffic volume of the financial transaction scenario is c1, the traffic volume of the common transaction scenario is c2, and the traffic volume can be measured by the amount of data sent and received, and the amount of data sent and received. The larger the traffic, the larger the traffic.
- the business scenario can be further subdivided to more objectively and comprehensively judge the importance of the business system.
- a financial transaction scenario, a common transaction scenario, a core business scenario, and a non-core business scenario, etc. various scenarios have corresponding traffic volumes, and the system important rank coefficient x1 can be calculated by the same principle calculation method as above.
- Step S2 acquiring a content sensitivity index and a user historical behavior index of each user published content in the service system, calculating a system sensitivity index of the user according to the content sensitivity index and the user historical behavior index, and comparing system sensitivity indexes of all users in the service system. Add the sum of the system sensitivity index;
- the user's system sensitivity index is used to judge the degree of sensitive influence of the user's behavior on the business system.
- the content sensitivity index refers to the sensitivity or metric of the content published by the user. If the content involves sensitive vocabulary or sensitive information, the content sensitivity index is larger, and the sensitive vocabulary or sensitive information includes pornography, politics, advertisement, illegality, etc. .
- the process of obtaining the content sensitivity index includes: segmenting the content of the user by using the sentence as a unit, matching the word after the word segmentation with the word in the pre-established thesaurus, to match the corresponding keyword,
- the keywords include nouns, verbs, words related to pornography, politics, advertisements, illegals, etc., and the corresponding core viewpoint information of each sentence is analyzed according to the keyword.
- the keywords may be directly composed.
- the new statement, the information of the statement is the core viewpoint information, and the content sensitive index associated with the core viewpoint information is obtained according to the association relationship between the pre-established core viewpoint information and the content sensitive index.
- the content sensitive index corresponding to the core viewpoint information is obtained by identifying the core program in advance and stored in the thesaurus.
- the core view information is pornographic information
- the content sensitivity index corresponds to n1
- the core view information is illegal information
- the content sensitivity index corresponds to n2
- the core view information is political information
- the content sensitivity index corresponds to n3.
- the core viewpoint information is advertisement information
- its content sensitivity index corresponds to n4, n1 ⁇ n2 ⁇ n3 ⁇ n4.
- the user historical behavior index is a description of the behavior of the user to the content published by other users.
- the user's historical behavior index has a base of 1. If the user reports that other users publish sensitive vocabulary or sensitive information, The user's historical behavior index is (1-0.2). If the user has published a behavior involving sensitive vocabulary or sensitive information, the user's historical behavior index is (1+0.2).
- the system sensitivity index content sensitivity index*user historical behavior index of the user is calculated, and the system sensitivity indexes of all users in the business system are added. Get the sum of the system sensitivity index.
- the words in the thesaurus may be maintained, and the evaluation of the content sensitivity index corresponding to the core viewpoint information in the existing thesaurus may be adjusted in real time according to the operation of the business system, and the specific adjustment algorithm includes: according to the vocabulary The total frequency of occurrence, the breadth of vocabulary in each business system, the feedback rating of the operator, and the feedback frequency of the operator are comprehensively assessed. If the above four indicators are higher, the content sensitivity index is larger.
- Step S3 Calculate a difference between the system anti-allergy index corresponding to the service system and the sum of the system sensitivity indexes, and determine whether to issue an early warning according to the difference.
- the step specifically includes:
- the difference is greater than 90, it is determined that the service system does not issue an early warning; if the difference is less than or equal to 90 and greater than 60, it is determined that the service system issues a slight warning; if the difference is less than If it is equal to 60 and greater than 30, it is determined that the service system issues a medium warning; if the difference is less than or equal to 30, it is determined that the service system issues a severe warning.
- the present application calculates a system anti-allergy index for judging its ability to resist sensitive information for different business systems, and calculates a system sensitivity index for judging the degree of sensitive influence of the user's behavior on the business system, according to the difference between the two.
- the value determines whether the service system issues an early warning.
- the application considers that in the actual application of the service system, each service system has different tolerances for sensitive information, and the rigor of users in each service system is different, based on these two aspects. Based on the practical application of the business system, combined with the business scenario and user behavior, the sensitive information is hierarchically evaluated to provide more accurate warning of sensitive information.
- the present application also provides a computer readable storage medium having stored thereon a processing system, the processing system being executed by a processor to implement the steps of the early warning method of system sensitive content.
- the foregoing embodiment method can be implemented by means of software plus a necessary general hardware platform, and of course, can also be through hardware, but in many cases, the former is better.
- Implementation Based on such understanding, the technical solution of the present application, which is essential or contributes to the prior art, may be embodied in the form of a software product stored in a storage medium (such as ROM/RAM, disk,
- the optical disc includes a number of instructions for causing a terminal device (which may be a mobile phone, a computer, a server, an air conditioner, or a network device, etc.) to perform the methods described in various embodiments of the present application.
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- General Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Artificial Intelligence (AREA)
- Computer Security & Cryptography (AREA)
- Computer Hardware Design (AREA)
- Computational Linguistics (AREA)
- Software Systems (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- General Health & Medical Sciences (AREA)
- Computing Systems (AREA)
- Data Mining & Analysis (AREA)
- Databases & Information Systems (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
The present application relates to an electronic apparatus, a method and system for early warning regarding system sensitive content, and a storage medium. The method comprises: acquiring each anti-sensitive parameter corresponding to a service system and a weight corresponding to each anti-sensitive parameter, and computing, according to each anti-sensitive parameter and the corresponding weight, a system anti-sensitive index of the service system; acquiring a content sensitive index of the content released by each user, and a historical user behavior index in the service system, computing a system sensitive index of the user according to the content sensitive index and the historical user behavior index, and adding the system sensitive indexes of all users in the service system to obtain a system sensitive index sum; and computing the difference between the system anti-sensitive index corresponding to the service system and the system sensitive index sum, and determining whether to raise an early warning according to the difference. According to the present application, hierarchical determination can be carried out on sensitive information by combining a service scenario and a user behavior to raise a more accurate early warning regarding sensitive information.
Description
优先权申明Priority claim
本申请基于巴黎公约申明享有2018年03月06日递交的申请号为CN 201810182928.4、名称为“电子装置、系统敏感内容的预警方法及存储介质”中国专利申请的优先权,该中国专利申请的整体内容以参考的方式结合在本申请中。This application is based on the priority of the Chinese Patent Application for the application of the Chinese Patent Application entitled "Electronic Device, Systematic Sensitive Contents and Storage Media", filed on March 6, 2018, with the application number CN 201810182928.4, which is filed on March 06, 2018. The content is incorporated herein by reference.
本申请涉及通信技术领域,尤其涉及一种电子装置、系统敏感内容的预警方法、系统及存储介质。The present application relates to the field of communications technologies, and in particular, to an electronic device, an early warning method, system, and storage medium for sensitive content of a system.
目前,对于大型综合企业机构,旗下通常涵盖多种业务,例如金融企业机构,旗下包括证券、保险、银行等各种金融业务,每种业务可以对应一个或多个社交平台,例如股票BBS、理财讲解直播等。在这些平台中可能会出现各类敏感信息,例如,色情、政治、广告等词,容易产生了不良的影响。业内有一些针对平台中的内容进行评判并对敏感信息预警的方法,但是对敏感信息的评判较固化,一般仅关注于评判的正确率,容易出现一刀切的情况。At present, for large-scale integrated enterprise institutions, the company usually covers a variety of businesses, such as financial institutions, including securities, insurance, banking and other financial services, each of which can correspond to one or more social platforms, such as stock BBS, financial management. Explain the live broadcast and so on. A variety of sensitive information may appear in these platforms, such as pornography, politics, advertising, etc., which are prone to adverse effects. There are some methods in the industry to judge the content of the platform and to warn sensitive information, but the evaluation of sensitive information is relatively solid, generally only pay attention to the correct rate of judgment, and it is prone to a one-size-fits-all situation.
发明内容Summary of the invention
本申请的目的在于提供一种电子装置、系统敏感内容的预警方法、系统及存储介质,旨在结合业务场景及用户行为对敏感信息进行层级化评判,对敏感信息进行更准确的预警。The purpose of the present application is to provide an electronic device, an early warning method for a sensitive content of a system, a system, and a storage medium, which are designed to hierarchically evaluate sensitive information in combination with business scenarios and user behaviors, and provide more accurate early warning of sensitive information.
为实现上述目的,本申请提供一种电子装置,所述电子装置包括存储器及与所述存储器连接的处理器,所述存储器中存储有可在所述处理器上运行的处理系统,所述处理系统被所述处理器执行时实现如下步骤:To achieve the above object, the present application provides an electronic device including a memory and a processor coupled to the memory, the memory storing a processing system operable on the processor, the processing The system implements the following steps when executed by the processor:
抗敏指数处理步骤,获取业务系统对应的各个抗敏参数及各个抗敏参数对应的权重,根据各个抗敏参数及对应的权重计算业务系统的系统抗敏指数;The anti-allergy index processing step acquires each anti-allergy parameter corresponding to the business system and the weight corresponding to each anti-allergy parameter, and calculates a system anti-allergy index of the business system according to each anti-allergy parameter and corresponding weight;
敏感指数处理步骤,获取业务系统中每一用户发布内容的内容敏感指数及用户历史行为指数,根据内容敏感指数及用户历史行为指数计算该用户的系统敏感指数,将业务系统中所有用户的系统敏感指数相加得到系统敏感指数总和;The sensitive index processing step acquires the content sensitivity index and the user historical behavior index of each user's published content in the business system, calculates the system sensitivity index of the user according to the content sensitivity index and the user historical behavior index, and sensitive the system of all users in the business system. The sum of the indices gives the sum of the system sensitivity indices;
确定预警步骤,计算业务系统对应的系统抗敏指数与系统敏感指数总和两者的差值,根据该差值确定是否发出预警。The early warning step is determined, and the difference between the system anti-allergy index corresponding to the business system and the sum of the system sensitivity indexes is calculated, and whether the early warning is issued is determined according to the difference.
为实现上述目的,本申请还提供一种系统敏感内容的预警方法,所述系统敏感内容的预警方法包括:To achieve the above object, the present application further provides an early warning method for system sensitive content, and the method for alerting sensitive content of the system includes:
S1,获取业务系统对应的各个抗敏参数及各个抗敏参数对应的权重,根据各个抗敏参数及对应的权重计算业务系统的系统抗敏指数;S1, obtaining respective anti-allergy parameters corresponding to the service system and weights corresponding to the respective anti-allergy parameters, and calculating a system anti-allergy index of the service system according to each anti-allergy parameter and corresponding weight;
S2,获取业务系统中每一用户发布内容的内容敏感指数及用户历史行为指数,根据内容敏感指数及用户历史行为指数计算该用户的系统敏感指数,将业务系统中所有用户的系统敏感指数相加得到系统敏感指数总和;S2: obtaining a content sensitivity index and a user historical behavior index of content published by each user in the business system, calculating a system sensitivity index of the user according to the content sensitivity index and the user historical behavior index, and adding system sensitivity indexes of all users in the business system Get the sum of the system sensitivity index;
S3,计算业务系统对应的系统抗敏指数与系统敏感指数总和两者的差值,根据该差值确定是否发出预警。S3: Calculate the difference between the system anti-allergy index corresponding to the service system and the sum of the system sensitivity indexes, and determine whether to issue an early warning according to the difference.
为实现上述目的,本申请还提供一种系统敏感内容的预警系统,所述系统敏感内容的预警系统包括:To achieve the above objective, the present application further provides an early warning system for system sensitive content, wherein the system early warning system for sensitive content includes:
获取模块,用于获取业务系统对应的各个抗敏参数及各个抗敏参数对应的权重,根据各个抗敏参数及对应的权重计算业务系统的系统抗敏指数;The obtaining module is configured to obtain each anti-allergy parameter corresponding to the service system and a weight corresponding to each anti-allergy parameter, and calculate a system anti-allergy index of the service system according to each anti-allergy parameter and the corresponding weight;
处理模块,用于获取业务系统中每一用户发布内容的内容敏感指数及用户历史行为指数,根据内容敏感指数及用户历史行为指数计算该用户的系统敏感指数,将业务系统中所有用户的系统敏感指数相加得到系统敏感指数总 和;The processing module is configured to obtain a content sensitivity index and a user historical behavior index of each user published content in the business system, calculate a system sensitivity index of the user according to the content sensitivity index and the user historical behavior index, and sensitivity the system of all users in the business system. The sum of the indices gives the sum of the system sensitivity indices;
预警模块,用于计算业务系统对应的系统抗敏指数与系统敏感指数总和两者的差值,根据该差值确定是否发出预警。The early warning module is configured to calculate a difference between the system anti-allergy index corresponding to the business system and the sum of the system sensitivity indexes, and determine whether to issue an early warning according to the difference.
本申请还提供一种计算机可读存储介质,所述计算机可读存储介质上存储有处理系统,所述处理系统被处理器执行时实现上述的方法的步骤。The present application also provides a computer readable storage medium having stored thereon a processing system that, when executed by a processor, implements the steps of the method described above.
本申请的有益效果是:本申请针对不同的业务系统计算用以评判其抵抗敏感信息的能力的系统抗敏指数、计算用以评判用户的行为给业务系统带来的敏感影响程度的系统敏感指数总和,根据两者的差值确定该业务系统是否发出预警,本申请综合考虑在业务系统的实际应用中,各个业务系统对敏感信息的包容度不同,各个业务系统中的用户的严谨程度也不同,基于这两个方面从业务系统的实际应用角度出发,结合业务场景及用户行为对敏感信息进行层级化评判,并对敏感信息进行更准确的预警。The beneficial effects of the present application are: the present application calculates a system anti-allergy index for judging its ability to resist sensitive information for different business systems, and calculates a system sensitivity index for judging the degree of sensitive influence of the user's behavior on the business system. Sum, according to the difference between the two to determine whether the business system issues an early warning, this application comprehensively considers the actual application of the business system, each business system has different tolerance for sensitive information, and the rigor of users in each business system is different. Based on these two aspects, from the perspective of the practical application of the business system, combined with the business scenario and user behavior, the sensitive information is hierarchically evaluated, and the sensitive information is more accurately alerted.
图1为本申请电子装置一实施例的硬件架构的示意图;1 is a schematic diagram of a hardware architecture of an embodiment of an electronic device according to the present application;
图2为本申请系统敏感内容的预警方法一实施例的流程示意图。FIG. 2 is a schematic flowchart of an embodiment of an early warning method for sensitive content of the system of the present application.
为了使本申请的目的、技术方案及优点更加清楚明白,以下结合附图及实施例,对本申请进行进一步详细说明。应当理解,此处所描述的具体实施例仅用以解释本申请,并不用于限定本申请。基于本申请中的实施例,本领域普通技术人员在没有做出创造性劳动前提下所获得的所有其他实施例,都属于本申请保护的范围。In order to make the objects, technical solutions, and advantages of the present application more comprehensible, the present application will be further described in detail below with reference to the accompanying drawings and embodiments. It is understood that the specific embodiments described herein are merely illustrative of the application and are not intended to be limiting. All other embodiments obtained by a person of ordinary skill in the art based on the embodiments of the present application without departing from the inventive scope are the scope of the present application.
需要说明的是,在本申请中涉及“第一”、“第二”等的描述仅用于描述目的,而不能理解为指示或暗示其相对重要性或者隐含指明所指示的技术特 征的数量。由此,限定有“第一”、“第二”的特征可以明示或者隐含地包括至少一个该特征。另外,各个实施例之间的技术方案可以相互结合,但是必须是以本领域普通技术人员能够实现为基础,当技术方案的结合出现相互矛盾或无法实现时应当认为这种技术方案的结合不存在,也不在本申请要求的保护范围之内。It should be noted that the descriptions of "first", "second" and the like in the present application are for the purpose of description only, and are not to be construed as indicating or implying their relative importance or implicitly indicating the number of technical features indicated. . Thus, features defining "first" or "second" may include at least one of the features, either explicitly or implicitly. In addition, the technical solutions between the various embodiments may be combined with each other, but must be based on the realization of those skilled in the art, and when the combination of the technical solutions is contradictory or impossible to implement, it should be considered that the combination of the technical solutions does not exist. Nor is it within the scope of protection required by this application.
参阅图1所示,是本申请电子装置一实施例的硬件架构示意图。电子装置1是一种能够按照事先设定或者存储的指令,自动进行数值计算和/或信息处理的设备。所述电子装置1可以是计算机、也可以是单个网络服务器、多个网络服务器组成的服务器组或者基于云计算的由大量主机或者网络服务器构成的云,其中云计算是分布式计算的一种,由一群松散耦合的计算机集组成的一个超级虚拟计算机。1 is a schematic diagram of a hardware architecture of an embodiment of an electronic device of the present application. The electronic device 1 is an apparatus capable of automatically performing numerical calculation and/or information processing in accordance with an instruction set or stored in advance. The electronic device 1 may be a computer, a single network server, a server group composed of multiple network servers, or a cloud-based cloud composed of a large number of hosts or network servers, where cloud computing is a type of distributed computing. A super virtual computer consisting of a group of loosely coupled computers.
在本实施例中,电子装置1可包括,但不仅限于,可通过系统总线相互通信连接的存储器11、处理器12、网络接口13,存储器11存储有可在处理器12上运行的处理系统。需要指出的是,图1仅示出了具有组件11-13的电子装置1,但是应理解的是,并不要求实施所有示出的组件,可以替代的实施更多或者更少的组件。In the present embodiment, the electronic device 1 may include, but is not limited to, a memory 11 communicably connected to each other through a system bus, a processor 12, and a network interface 13, and the memory 11 stores a processing system operable on the processor 12. It should be noted that FIG. 1 only shows the electronic device 1 having the components 11-13, but it should be understood that not all illustrated components are required to be implemented, and more or fewer components may be implemented instead.
其中,存储器11包括内存及至少一种类型的可读存储介质。内存为电子装置1的运行提供缓存;可读存储介质可为如闪存、硬盘、多媒体卡、卡型存储器(例如,SD或DX存储器等)、随机访问存储器(RAM)、静态随机访问存储器(SRAM)、只读存储器(ROM)、电可擦除可编程只读存储器(EEPROM)、可编程只读存储器(PROM)、磁性存储器、磁盘、光盘等的非易失性存储介质。在一些实施例中,可读存储介质可以是电子装置1的内部存储单元,例如该电子装置1的硬盘;在另一些实施例中,该非易失性存储介质也可以是电子装置1的外部存储设备,例如电子装置1上配备的插接式硬盘,智能存储卡(Smart Media Card,SMC),安全数字(Secure Digital,SD) 卡,闪存卡(Flash Card)等。本实施例中,存储器11的可读存储介质通常用于存储安装于电子装置1的操作系统和各类应用软件,例如存储本申请一实施例中的处理系统的程序代码等。此外,存储器11还可以用于暂时地存储已经输出或者将要输出的各类数据。The memory 11 includes a memory and at least one type of readable storage medium. The memory provides a cache for the operation of the electronic device 1; the readable storage medium may be, for example, a flash memory, a hard disk, a multimedia card, a card type memory (eg, SD or DX memory, etc.), a random access memory (RAM), a static random access memory (SRAM). A non-volatile storage medium such as a read only memory (ROM), an electrically erasable programmable read only memory (EEPROM), a programmable read only memory (PROM), a magnetic memory, a magnetic disk, an optical disk, or the like. In some embodiments, the readable storage medium may be an internal storage unit of the electronic device 1, such as a hard disk of the electronic device 1; in other embodiments, the non-volatile storage medium may also be external to the electronic device 1. The storage device, for example, a plug-in hard disk provided on the electronic device 1, a smart memory card (SMC), a Secure Digital (SD) card, a flash card, or the like. In this embodiment, the readable storage medium of the memory 11 is generally used to store an operating system and various types of application software installed in the electronic device 1, such as program code for storing a processing system in an embodiment of the present application. Further, the memory 11 can also be used to temporarily store various types of data that have been output or are to be output.
所述处理器12在一些实施例中可以是中央处理器(Central Processing Unit,CPU)、控制器、微控制器、微处理器、或其他数据处理芯片。该处理器12通常用于控制所述电子装置1的总体操作,例如执行与其他设备进行数据交互或者通信相关的控制和处理等。本实施例中,所述处理器12用于运行所述存储器11中存储的程序代码或者处理数据,例如运行处理系统等。The processor 12 may be a Central Processing Unit (CPU), controller, microcontroller, microprocessor, or other data processing chip in some embodiments. The processor 12 is typically used to control the overall operation of the electronic device 1, such as performing control and processing related to data interaction or communication with other devices. In this embodiment, the processor 12 is configured to run program code or process data stored in the memory 11, such as running a processing system or the like.
所述网络接口13可包括无线网络接口或有线网络接口,该网络接口13通常用于在所述电子装置1与其他电子设备之间建立通信连接。The network interface 13 may comprise a wireless network interface or a wired network interface, which is typically used to establish a communication connection between the electronic device 1 and other electronic devices.
所述处理系统存储在存储器11中,包括至少一个存储在存储器11中的计算机可读指令,该至少一个计算机可读指令可被处理器器12执行,以实现本申请各实施例的方法;以及,该至少一个计算机可读指令依据其各部分所实现的功能不同,可被划为不同的逻辑模块。The processing system is stored in the memory 11 and includes at least one computer readable instruction stored in the memory 11, the at least one computer readable instruction being executable by the processor 12 to implement the methods of various embodiments of the present application; The at least one computer readable instruction can be classified into different logic modules depending on the functions implemented by its various parts.
在一实施例中,上述处理系统被所述处理器12执行时实现如下步骤:In an embodiment, when the processing system is executed by the processor 12, the following steps are implemented:
抗敏指数处理步骤,获取业务系统对应的各个抗敏参数及各个抗敏参数对应的权重,根据各个抗敏参数及对应的权重计算业务系统的系统抗敏指数;The anti-allergy index processing step acquires each anti-allergy parameter corresponding to the business system and the weight corresponding to each anti-allergy parameter, and calculates a system anti-allergy index of the business system according to each anti-allergy parameter and corresponding weight;
其中,系统抗敏指数为业务系统抵抗敏感信息或敏感内容的能力的指标,系统抗敏指数越高,则业务系统抵抗敏感词汇的能力越强。在一实施例中,抗敏参数包括系统重要等级系数、用户量等级系数、系统信息传播系数;在另一实施例中,抗敏参数包括系统重要等级系数、用户量等级系数、系统信息传播系数、应急处理系数、敏感词汇关注度系数。Among them, the system anti-allergy index is an indicator of the ability of the business system to resist sensitive information or sensitive content. The higher the system anti-allergy index, the stronger the ability of the business system to resist sensitive vocabulary. In an embodiment, the anti-allergy parameter includes a system important level coefficient, a user quantity level coefficient, and a system information propagation coefficient; in another embodiment, the anti-allergy parameter includes a system important level coefficient, a user quantity level coefficient, and a system information propagation coefficient. , emergency response coefficient, sensitive vocabulary attention coefficient.
其中,系统重要等级系数越小则该业务系统越重要,用户量等级系数越 小则该业务系统的用户数量越多,系统信息传播系数越小则该业务系统的信息传播能力越强,应急处理系数越小则该业务系统的应急处理能力越弱,敏感词汇关注度系数越小则该业务系统中敏感词汇关注度最高。The smaller the system important level coefficient is, the more important the service system is. The smaller the user quantity level coefficient is, the more the number of users of the service system is. The smaller the system information propagation coefficient is, the stronger the information dissemination capability of the service system is. The smaller the coefficient, the weaker the emergency response capability of the business system, and the smaller the sensitive vocabulary attention coefficient, the higher the sensitivity of the sensitive vocabulary in the business system.
在一具体的实例中,系统重要等级系数的范围为[0,1],当系统重要等级系数为0时候业务系统的重要性最高,可以将系统重要等级系数分为3级:0为一级、0.5为二级、1为三级;In a specific example, the system important grade coefficient has a range of [0, 1]. When the system important grade coefficient is 0, the business system has the highest importance, and the system important grade coefficient can be divided into three levels: 0 is a level. , 0.5 is the second level, and 1 is the third level;
用户量等级系数的范围为[0,1],当用户量等级系数为0时候用户量最多最高,可以将用户量等级系数分为3级:0为用户量在10000以上、0.5为用户量在1000至10000以内、1为用户量在1000以内;The user quantity level coefficient ranges from [0, 1]. When the user quantity level coefficient is 0, the user quantity is the highest, and the user quantity level coefficient can be divided into three levels: 0 is the user quantity is above 10000, and 0.5 is the user quantity. Within 1000 to 10000, 1 is less than 1000 for the user;
系统信息传播系数的范围为[0,1],当系统信息传播系数为0时候系统信息最容易传播,可以将系统信息传播系数分为3级:0为仅以纯互联网为传播途径、0.5为企业局域网与互联网组成的网络中的混合传播途径、1为仅以局域网为传播途径;The system information propagation coefficient ranges from [0, 1]. When the system information propagation coefficient is 0, the system information is the easiest to propagate. The system information propagation coefficient can be divided into three levels: 0 is the pure Internet as the transmission route, and 0.5 is the enterprise. The mixed transmission route in the network composed of the local area network and the Internet, 1 is only the local area network as the transmission route;
应急处理系数的范围为[0,1],当应急处理系数为0时候系统无应急处理,可以将应急处理系数分为3级:0为无应急处理、0.5为能迅速删除系统内的敏感内容、1为能迅速删除敏感内容的同时定位转发地址协助删除;The range of emergency response coefficient is [0,1]. When the emergency response coefficient is 0, the system has no emergency treatment. The emergency treatment coefficient can be divided into three levels: 0 for no emergency response and 0.5 for quickly deleting sensitive content in the system. 1 is to locate the forwarding address and delete it while deleting the sensitive content quickly;
敏感词汇关注度系数的范围为[0,1],当敏感词汇关注度系数为0时候敏感词汇关注度最高,可以将敏感词汇关注度系数分为3级:0为色情、政治、广告、违法等全部关注、0.5为关注广告等危害性不大的、1为已有其他手段抗敏所以都不用关注。The sensitivity vocabulary attention coefficient range is [0,1]. When the sensitive vocabulary attention degree coefficient is 0, the sensitive vocabulary attention degree is the highest. The sensitive vocabulary attention degree coefficient can be divided into 3 levels: 0 is pornography, politics, advertisement, illegal All of the concerns, 0.5 is not harmful to the attention of advertising, etc., 1 is already sensitive to other means, so do not pay attention.
以抗敏参数包括系统重要等级系数x1、用户量等级系数x2、系统信息传播系数x3、应急处理系数x4、敏感词汇关注度系数x5为例,系统重要等级系数x1、用户量等级系数x2、系统信息传播系数x3、应急处理系数x4、敏感词汇关注度系数x5的权重对应分别为w1、w2、w3、w4、w5,则系统抗敏指数=x1*w1+x2*w2+x3*w3+x4*w4+x5*w5。其中,权重w1、w2、 w3、w4、w5为预先配置的大于1的数值,可以取均相同的值或者不同的值,在一实施例中,w1=w2=w3=w4=w5=20。The anti-allergy parameters include the system important grade coefficient x1, the user magnitude coefficient x2, the system information propagation coefficient x3, the emergency processing coefficient x4, the sensitive vocabulary attention degree coefficient x5, for example, the system important grade coefficient x1, the user magnitude grade coefficient x2, the system The weights of information propagation coefficient x3, emergency processing coefficient x4, and sensitive vocabulary attention degree coefficient x5 are respectively w1, w2, w3, w4, w5, then the system anti-allergy index = x1 * w1 + x2 * w2 + x3 * w3 + x4 *w4+x5*w5. The weights w1, w2, w3, w4, and w5 are pre-configured values greater than 1, and may be the same value or different values. In an embodiment, w1=w2=w3=w4=w5=20.
根据上述实例的描述,业务系统越重要、用户量越大、信息传播越容易、应急处理能力越弱、敏感词汇关注度越高,则该业务系统的系统抗敏指数越小,抵抗敏感词汇的能力越弱;反之则业务系统的系统抗敏指数越大,抵抗敏感词汇的能力越强。According to the description of the above example, the more important the business system, the larger the user volume, the easier the information dissemination, the weaker the emergency processing capability, and the higher the sensitivity of the sensitive vocabulary, the smaller the system anti-allergic index of the business system and the resistance to sensitive vocabulary. The weaker the ability; the greater the system's system sensitivity index, the stronger the ability to resist sensitive vocabulary.
在另一实例中,系统重要等级系数x1可以根据该业务系统中各业务场景的权重及各业务场景对应的业务量进行度量,以金融业务系统为例,业务场景包括金融交易场景及普通交易场景,金融交易场景包括例如银行转账、基金购买等,普通交易包括例如保费支付、保费续保等。金融交易场景的权重为k1,普通交易场景的权重为k2,金融交易场景的业务量为c1,普通交易场景的的业务量为c2,业务量可以以数据的收发量为度量,数据的收发量越大,则业务量越大。业务系统的系统重要等级系数x1=(k1+k2)/((πk1/2arctan(c1/α)+πk2/2arctan(c2/α)),其中,α为该业务系统的业务场景的平均业务量,α=(c1+c2)/2,系统重要等级系数x1的范围为[0,1]。当然,业务场景还可以进一步进行细分,以便更加客观、综合性地评判业务系统的重要性。例如包括金融交易场景、普通交易场景、核心业务场景及非核心业务场景等等,各种场景具有对应的业务量,通过与上述相同原理的计算方式可以计算得到系统重要等级系数x1。In another example, the system important level coefficient x1 may be measured according to the weight of each service scenario in the service system and the service volume corresponding to each service scenario, and the financial service system is taken as an example, and the business scenario includes a financial transaction scenario and a common transaction scenario. Financial transaction scenarios include, for example, bank transfers, fund purchases, etc., and general transactions include, for example, premium payments, premium renewals, and the like. The weight of the financial transaction scenario is k1, the weight of the common transaction scenario is k2, the traffic volume of the financial transaction scenario is c1, the traffic volume of the common transaction scenario is c2, and the traffic volume can be measured by the amount of data sent and received, and the amount of data sent and received. The larger the traffic, the larger the traffic. The system important rank coefficient of the business system is x1=(k1+k2)/((πk1/2arctan(c1/α)+πk2/2arctan(c2/α)), where α is the average traffic of the business scenario of the business system , α = (c1 + c2) / 2, the system important level coefficient x1 range is [0, 1]. Of course, the business scenario can be further subdivided to more objectively and comprehensively judge the importance of the business system. For example, a financial transaction scenario, a common transaction scenario, a core business scenario, and a non-core business scenario, etc., various scenarios have corresponding traffic volumes, and the system important rank coefficient x1 can be calculated by the same principle calculation method as above.
敏感指数处理步骤,获取业务系统中每一用户发布内容的内容敏感指数及用户历史行为指数,根据内容敏感指数及用户历史行为指数计算该用户的系统敏感指数,将业务系统中所有用户的系统敏感指数相加得到系统敏感指数总和;The sensitive index processing step acquires the content sensitivity index and the user historical behavior index of each user's published content in the business system, calculates the system sensitivity index of the user according to the content sensitivity index and the user historical behavior index, and sensitive the system of all users in the business system. The sum of the indices gives the sum of the system sensitivity indices;
其中,用户的系统敏感指数用以评判:用户的行为给业务系统带来的敏感影响程度。用户的系统敏感指数越小,则给业务系统带来的敏感影响程度 越小,用户的系统敏感指数越大,则给业务系统带来的敏感影响程度越大。Among them, the user's system sensitivity index is used to judge the degree of sensitive influence of the user's behavior on the business system. The smaller the user's system sensitivity index, the less sensitive the impact on the business system, and the greater the user's system sensitivity index, the greater the sensitivity to the business system.
其中,内容敏感指数指的是用户发布的内容的敏感程度或度量,其内容若涉及敏感词汇或者敏感信息,则内容敏感指数越大,敏感词汇或者敏感信息包括色情、政治、广告、违法等等。The content sensitivity index refers to the sensitivity or metric of the content published by the user. If the content involves sensitive vocabulary or sensitive information, the content sensitivity index is larger, and the sensitive vocabulary or sensitive information includes pornography, politics, advertisement, illegality, etc. .
在一实施例中,内容敏感指数的获取过程包括:对用户发布内容以句子为单位进行分词,将分词后的词与预先建立的词库中的词进行匹配,以匹配得到对应的关键词,其中,关键词包括名词、动词、与色情、政治、广告、违法等相关的词等,根据该关键词分析每一句子的对应的核心观点信息,在一实施例中,可以直接将关键词组成新的语句,该语句的信息即为核心观点信息,根据预先建立的核心观点信息与内容敏感指数的关联关系获取该核心观点信息关联的内容敏感指数。其中,预先通过鉴定核心程序鉴定得出核心观点信息对应的内容敏感指数并存储在词库中。在一实施例中,核心观点信息为色情信息,其内容敏感指数对应为n1,核心观点信息为违法信息,其内容敏感指数对应为n2,核心观点信息为政治信息,其内容敏感指数对应为n3,核心观点信息为广告信息,其内容敏感指数对应为n4,n1≥n2≥n3≥n4。In an embodiment, the process of obtaining the content sensitivity index includes: segmenting the content of the user by using the sentence as a unit, matching the word after the word segmentation with the word in the pre-established thesaurus, to match the corresponding keyword, The keywords include nouns, verbs, words related to pornography, politics, advertisements, illegals, etc., and the corresponding core viewpoint information of each sentence is analyzed according to the keyword. In an embodiment, the keywords may be directly composed. The new statement, the information of the statement is the core viewpoint information, and the content sensitive index associated with the core viewpoint information is obtained according to the association relationship between the pre-established core viewpoint information and the content sensitive index. Wherein, the content sensitive index corresponding to the core viewpoint information is obtained by identifying the core program in advance and stored in the thesaurus. In an embodiment, the core view information is pornographic information, the content sensitivity index corresponds to n1, the core view information is illegal information, the content sensitivity index corresponds to n2, the core view information is political information, and the content sensitivity index corresponds to n3. The core viewpoint information is advertisement information, and its content sensitivity index corresponds to n4, n1≥n2≥n3≥n4.
在一具体的实例中,例如:对于用户在某个平台系统中发布的内容“***化妆品今日促销大优惠”,该语句经分词后得到“***化妆品”、“今日”、“促销”、“大优惠”,将这些分词与词库中的词进行匹配,得到的关键词为“化妆品”、“促销”、“大优惠”,根据这些关键词,可以分析出该语句的核心观点信息为“化妆品促销优惠”,其核心观点信息属于广告信息。In a specific example, for example, for the content published by the user in a certain platform system, "*** Cosmetics Today Promotion Promotion", the sentence is divided into "*** Cosmetics", "Today", "Promotion" ", "big discount", matching these participles with the words in the thesaurus, the keywords obtained are "cosmetics", "promotion", "big discount", according to these keywords, the core viewpoint of the statement can be analyzed The information is “cosmetic promotion offer”, and its core viewpoint information belongs to advertisement information.
其中,用户历史行为指数为用户应对其他用户发布的内容的行为描述,在一实施例中,用户历史行为指数的基数为1,若用户有举报其他用户发布涉及敏感词汇或者敏感信息的行为,则用户历史行为指数为(1-0.2),若用户自己有发布涉及敏感词汇或者敏感信息的行为,则用户历史行为指数为(1+0.2)。The user historical behavior index is a description of the behavior of the user to the content published by other users. In an embodiment, the user's historical behavior index has a base of 1. If the user reports that other users publish sensitive vocabulary or sensitive information, The user's historical behavior index is (1-0.2). If the user has published a behavior involving sensitive vocabulary or sensitive information, the user's historical behavior index is (1+0.2).
在获取业务系统中每一用户发布内容的内容敏感指数及用户历史行为指数之后,计算该用户的系统敏感指数=内容敏感指数*用户历史行为指数,将业务系统中所有用户的系统敏感指数相加得到系统敏感指数总和。After obtaining the content sensitivity index and the user historical behavior index of each user's published content in the business system, the system sensitivity index=content sensitivity index*user historical behavior index of the user is calculated, and the system sensitivity indexes of all users in the business system are added. Get the sum of the system sensitivity index.
在其他实施例中,可对词库中的词进行维护,结合业务系统的运营情况,实时调整现有词库中核心观点信息对应的内容敏感指数的评定,其具体的调整算法包括:根据词汇的出现总频率、词汇的在各业务系统出现广度、运营人员的反馈评定等级、运营人员的反馈频率等进行综合评定,若上述的4个指标越高,则内容敏感指数越大。In other embodiments, the words in the thesaurus may be maintained, and the evaluation of the content sensitivity index corresponding to the core viewpoint information in the existing thesaurus may be adjusted in real time according to the operation of the business system, and the specific adjustment algorithm includes: according to the vocabulary The total frequency of occurrence, the breadth of vocabulary in each business system, the feedback rating of the operator, and the feedback frequency of the operator are comprehensively assessed. If the above four indicators are higher, the content sensitivity index is larger.
确定预警步骤,计算业务系统对应的系统抗敏指数与系统敏感指数总和两者的差值,根据该差值确定是否发出预警。The early warning step is determined, and the difference between the system anti-allergy index corresponding to the business system and the sum of the system sensitivity indexes is calculated, and whether the early warning is issued is determined according to the difference.
其中,若系统抗敏指数与系统敏感指数总和两者的差值越大,则系统抗敏指数越大,系统敏感指数总和越小,该业务系统Wherein, if the difference between the system anti-allergy index and the sum of the system sensitivity index is larger, the system anti-allergy index is larger, and the sum of the system sensitivity index is smaller, the business system
在一实施例中,该步骤具体包括:In an embodiment, the step specifically includes:
若所述差值大于预设的第一阈值,则确定该业务系统不发出预警;If the difference is greater than a preset first threshold, determining that the service system does not issue an early warning;
若所述差值小于等于该第一阈值且大于预设的第二阈值,则确定该业务系统发出轻度预警;If the difference is less than or equal to the first threshold and greater than a preset second threshold, determining that the service system issues a slight alert;
若所述差值小于等于该第二阈值且大于预设的第三阈值,则确定该业务系统发出中度预警;If the difference is less than or equal to the second threshold and greater than a preset third threshold, determining that the service system issues a medium alert;
若所述差值小于等于该第三阈值,则确定该业务系统发出重度预警。If the difference is less than or equal to the third threshold, it is determined that the service system issues a severe warning.
在一具体的实例中,若差值大于90,则确定该业务系统不发出预警;若所述差值小于等于90且大于60,则确定该业务系统发出轻度预警;若所述差值小于等于60且大于30,则确定该业务系统发出中度预警;若所述差值小于等于30,则确定该业务系统发出重度预警。In a specific example, if the difference is greater than 90, it is determined that the service system does not issue an early warning; if the difference is less than or equal to 90 and greater than 60, it is determined that the service system issues a slight warning; if the difference is less than If it is equal to 60 and greater than 30, it is determined that the service system issues a medium warning; if the difference is less than or equal to 30, it is determined that the service system issues a severe warning.
与现有技术相比,本申请针对不同的业务系统计算用以评判其抵抗敏感信息的能力的系统抗敏指数、计算用以评判用户的行为给业务系统带来的敏 感影响程度的系统敏感指数总和,根据两者的差值确定该业务系统是否发出预警,本申请综合考虑在业务系统的实际应用中,各个业务系统对敏感信息的包容度不同,各个业务系统中的用户的严谨程度也不同,基于这两个方面从业务系统的实际应用角度出发,结合业务场景及用户行为对敏感信息进行层级化评判,对敏感信息进行更准确的预警。Compared with the prior art, the present application calculates a system anti-allergy index for judging its ability to resist sensitive information for different business systems, and calculates a system sensitivity index for judging the degree of sensitive influence of the user's behavior on the business system. Sum, according to the difference between the two to determine whether the business system issues an early warning, this application comprehensively considers the actual application of the business system, each business system has different tolerance for sensitive information, and the rigor of users in each business system is different. Based on these two aspects, from the perspective of the practical application of the business system, combined with the business scenario and user behavior, the sensitive information is hierarchically evaluated to provide more accurate warning of sensitive information.
如图2所示,图2为本申请系统敏感内容的预警方法一实施例的流程示意图,该系统敏感内容的预警方法包括以下步骤:As shown in FIG. 2, FIG. 2 is a schematic flowchart of an embodiment of an early warning method for sensitive content of a system according to the present application. The method for alerting sensitive content of the system includes the following steps:
步骤S1,获取业务系统对应的各个抗敏参数及各个抗敏参数对应的权重,根据各个抗敏参数及对应的权重计算业务系统的系统抗敏指数;Step S1: acquiring respective anti-allergy parameters corresponding to the service system and weights corresponding to the respective anti-allergy parameters, and calculating a system anti-allergy index of the service system according to each anti-allergy parameter and corresponding weight;
其中,系统抗敏指数为业务系统抵抗敏感词汇的能力的指标,系统抗敏指数越高,则业务系统抵抗敏感词汇的能力越强。在一实施例中,抗敏参数包括系统重要等级系数、用户量等级系数、系统信息传播系数;在另一实施例中,抗敏参数包括系统重要等级系数、用户量等级系数、系统信息传播系数、应急处理系数、敏感词汇关注度系数。Among them, the system anti-allergy index is an indicator of the ability of the business system to resist sensitive vocabulary. The higher the system anti-allergic index, the stronger the ability of the business system to resist sensitive vocabulary. In an embodiment, the anti-allergy parameter includes a system important level coefficient, a user quantity level coefficient, and a system information propagation coefficient; in another embodiment, the anti-allergy parameter includes a system important level coefficient, a user quantity level coefficient, and a system information propagation coefficient. , emergency response coefficient, sensitive vocabulary attention coefficient.
其中,系统重要等级系数越小则该业务系统越重要,用户量等级系数越小则该业务系统的用户数量越多,系统信息传播系数越小则该业务系统的信息传播能力越强,应急处理系数越小则该业务系统的应急处理能力越弱,敏感词汇关注度系数越小则该业务系统中敏感词汇关注度最高。The smaller the system important level coefficient is, the more important the service system is. The smaller the user quantity level coefficient is, the more the number of users of the service system is. The smaller the system information propagation coefficient is, the stronger the information dissemination capability of the service system is. The smaller the coefficient, the weaker the emergency response capability of the business system, and the smaller the sensitive vocabulary attention coefficient, the higher the sensitivity of the sensitive vocabulary in the business system.
在一具体的实例中,系统重要等级系数的范围为[0,1],当系统重要等级系数为0时候业务系统的重要性最高,可以将系统重要等级系数分为3级:0为一级、0.5为二级、1为三级;In a specific example, the system important grade coefficient has a range of [0, 1]. When the system important grade coefficient is 0, the business system has the highest importance, and the system important grade coefficient can be divided into three levels: 0 is a level. , 0.5 is the second level, and 1 is the third level;
用户量等级系数的范围为[0,1],当用户量等级系数为0时候用户量最多最高,可以将用户量等级系数分为3级:0为用户量在10000以上、0.5为用户量在1000至10000以内、1为用户量在1000以内;The user quantity level coefficient ranges from [0, 1]. When the user quantity level coefficient is 0, the user quantity is the highest, and the user quantity level coefficient can be divided into three levels: 0 is the user quantity is above 10000, and 0.5 is the user quantity. Within 1000 to 10000, 1 is less than 1000 for the user;
系统信息传播系数的范围为[0,1],当系统信息传播系数为0时候系统信息最容易传播,可以将系统信息传播系数分为3级:0为仅以纯互联网为传播途径、0.5为企业局域网与互联网组成的网络中的混合传播途径、1为仅以局域网为传播途径;The system information propagation coefficient ranges from [0, 1]. When the system information propagation coefficient is 0, the system information is the easiest to propagate. The system information propagation coefficient can be divided into three levels: 0 is the pure Internet as the transmission route, and 0.5 is the enterprise. The mixed transmission route in the network composed of the local area network and the Internet, 1 is only the local area network as the transmission route;
应急处理系数的范围为[0,1],当应急处理系数为0时候系统无应急处理,可以将应急处理系数分为3级:0为无应急处理、0.5为能迅速删除系统内的敏感内容、1为能迅速删除敏感内容的同时定位转发地址协助删除;The range of emergency response coefficient is [0,1]. When the emergency response coefficient is 0, the system has no emergency treatment. The emergency treatment coefficient can be divided into three levels: 0 for no emergency response and 0.5 for quickly deleting sensitive content in the system. 1 is to locate the forwarding address and delete it while deleting the sensitive content quickly;
敏感词汇关注度系数的范围为[0,1],当敏感词汇关注度系数为0时候敏感词汇关注度最高,可以将敏感词汇关注度系数分为3级:0为色情、政治、广告、违法等全部关注、0.5为关注广告等危害性不大的、1为已有其他手段抗敏所以都不用关注。The sensitivity vocabulary attention coefficient range is [0,1]. When the sensitive vocabulary attention degree coefficient is 0, the sensitive vocabulary attention degree is the highest. The sensitive vocabulary attention degree coefficient can be divided into 3 levels: 0 is pornography, politics, advertisement, illegal All of the concerns, 0.5 is not harmful to the attention of advertising, etc., 1 is already sensitive to other means, so do not pay attention.
以抗敏参数包括系统重要等级系数x1、用户量等级系数x2、系统信息传播系数x3、应急处理系数x4、敏感词汇关注度系数x5为例,系统重要等级系数x1、用户量等级系数x2、系统信息传播系数x3、应急处理系数x4、敏感词汇关注度系数x5的权重对应分别为w1、w2、w3、w4、w5,则系统抗敏指数=x1*w1+x2*w2+x3*w3+x4*w4+x5*w5。其中,权重w1、w2、w3、w4、w5为预先配置的大于1的数值,可以取均相同的值或者不同的值,在一实施例中,w1=w2=w3=w4=w5=20。The anti-allergy parameters include the system important grade coefficient x1, the user magnitude coefficient x2, the system information propagation coefficient x3, the emergency processing coefficient x4, the sensitive vocabulary attention degree coefficient x5, for example, the system important grade coefficient x1, the user magnitude grade coefficient x2, the system The weights of information propagation coefficient x3, emergency processing coefficient x4, and sensitive vocabulary attention degree coefficient x5 are respectively w1, w2, w3, w4, w5, then the system anti-allergy index = x1 * w1 + x2 * w2 + x3 * w3 + x4 *w4+x5*w5. The weights w1, w2, w3, w4, and w5 are pre-configured values greater than 1, and may be the same value or different values. In an embodiment, w1=w2=w3=w4=w5=20.
根据上述实例的描述,业务系统越重要、用户量越大、信息传播越容易、应急处理能力越弱、敏感词汇关注度越高,则该业务系统的系统抗敏指数越小,抵抗敏感词汇的能力越弱;反之则业务系统的系统抗敏指数越大,抵抗敏感词汇的能力越强。According to the description of the above example, the more important the business system, the larger the user volume, the easier the information dissemination, the weaker the emergency processing capability, and the higher the sensitivity of the sensitive vocabulary, the smaller the system anti-allergic index of the business system and the resistance to sensitive vocabulary. The weaker the ability; the greater the system's system sensitivity index, the stronger the ability to resist sensitive vocabulary.
在另一实例中,系统重要等级系数x1可以根据该业务系统中各业务场景的权重及各业务场景对应的业务量进行度量,以金融业务系统为例,业务场景包括金融交易场景及普通交易场景,金融交易场景包括例如银行转账、 基金购买等,普通交易包括例如保费支付、保费续保等。金融交易场景的权重为k1,普通交易场景的权重为k2,金融交易场景的业务量为c1,普通交易场景的的业务量为c2,业务量可以以数据的收发量为度量,数据的收发量越大,则业务量越大。业务系统的系统重要等级系数x1=(k1+k2)/((πk1/2arctan(c1/α)+πk2/2arctan(c2/α)),其中,α为该业务系统的业务场景的平均业务量,α=(c1+c2)/2,系统重要等级系数x1的范围为[0,1]。当然,业务场景还可以进一步进行细分,以便更加客观、综合性地评判业务系统的重要性。例如包括金融交易场景、普通交易场景、核心业务场景及非核心业务场景等等,各种场景具有对应的业务量,通过与上述相同原理的计算方式可以计算得到系统重要等级系数x1。In another example, the system important level coefficient x1 may be measured according to the weight of each service scenario in the service system and the service volume corresponding to each service scenario, and the financial service system is taken as an example, and the business scenario includes a financial transaction scenario and a common transaction scenario. Financial transaction scenarios include, for example, bank transfers, fund purchases, etc., and general transactions include, for example, premium payments, premium renewals, and the like. The weight of the financial transaction scenario is k1, the weight of the common transaction scenario is k2, the traffic volume of the financial transaction scenario is c1, the traffic volume of the common transaction scenario is c2, and the traffic volume can be measured by the amount of data sent and received, and the amount of data sent and received. The larger the traffic, the larger the traffic. The system important rank coefficient of the business system is x1=(k1+k2)/((πk1/2arctan(c1/α)+πk2/2arctan(c2/α)), where α is the average traffic of the business scenario of the business system , α = (c1 + c2) / 2, the system important level coefficient x1 range is [0, 1]. Of course, the business scenario can be further subdivided to more objectively and comprehensively judge the importance of the business system. For example, a financial transaction scenario, a common transaction scenario, a core business scenario, and a non-core business scenario, etc., various scenarios have corresponding traffic volumes, and the system important rank coefficient x1 can be calculated by the same principle calculation method as above.
步骤S2,获取业务系统中每一用户发布内容的内容敏感指数及用户历史行为指数,根据内容敏感指数及用户历史行为指数计算该用户的系统敏感指数,将业务系统中所有用户的系统敏感指数相加得到系统敏感指数总和;Step S2: acquiring a content sensitivity index and a user historical behavior index of each user published content in the service system, calculating a system sensitivity index of the user according to the content sensitivity index and the user historical behavior index, and comparing system sensitivity indexes of all users in the service system. Add the sum of the system sensitivity index;
其中,用户的系统敏感指数用以评判:用户的行为给业务系统带来的敏感影响程度。用户的系统敏感指数越小,则给业务系统带来的敏感影响程度越小,用户的系统敏感指数越大,则给业务系统带来的敏感影响程度越大。Among them, the user's system sensitivity index is used to judge the degree of sensitive influence of the user's behavior on the business system. The smaller the user's system sensitivity index is, the smaller the sensitivity impact on the business system is. The greater the user's system sensitivity index, the greater the sensitivity to the business system.
其中,内容敏感指数指的是用户发布的内容的敏感程度或度量,其内容若涉及敏感词汇或者敏感信息,则内容敏感指数越大,敏感词汇或者敏感信息包括色情、政治、广告、违法等等。The content sensitivity index refers to the sensitivity or metric of the content published by the user. If the content involves sensitive vocabulary or sensitive information, the content sensitivity index is larger, and the sensitive vocabulary or sensitive information includes pornography, politics, advertisement, illegality, etc. .
在一实施例中,内容敏感指数的获取过程包括:对用户发布内容以句子为单位进行分词,将分词后的词与预先建立的词库中的词进行匹配,以匹配得到对应的关键词,其中,关键词包括名词、动词、与色情、政治、广告、违法等相关的词等,根据该关键词分析每一句子的对应的核心观点信息,在一实施例中,可以直接将关键词组成新的语句,该语句的信息即为核心观点信息,根据预先建立的核心观点信息与内容敏感指数的关联关系获取该核心 观点信息关联的内容敏感指数。其中,预先通过鉴定核心程序鉴定得出核心观点信息对应的内容敏感指数并存储在词库中。在一实施例中,核心观点信息为色情信息,其内容敏感指数对应为n1,核心观点信息为违法信息,其内容敏感指数对应为n2,核心观点信息为政治信息,其内容敏感指数对应为n3,核心观点信息为广告信息,其内容敏感指数对应为n4,n1≥n2≥n3≥n4。In an embodiment, the process of obtaining the content sensitivity index includes: segmenting the content of the user by using the sentence as a unit, matching the word after the word segmentation with the word in the pre-established thesaurus, to match the corresponding keyword, The keywords include nouns, verbs, words related to pornography, politics, advertisements, illegals, etc., and the corresponding core viewpoint information of each sentence is analyzed according to the keyword. In an embodiment, the keywords may be directly composed. The new statement, the information of the statement is the core viewpoint information, and the content sensitive index associated with the core viewpoint information is obtained according to the association relationship between the pre-established core viewpoint information and the content sensitive index. Wherein, the content sensitive index corresponding to the core viewpoint information is obtained by identifying the core program in advance and stored in the thesaurus. In an embodiment, the core view information is pornographic information, the content sensitivity index corresponds to n1, the core view information is illegal information, the content sensitivity index corresponds to n2, the core view information is political information, and the content sensitivity index corresponds to n3. The core viewpoint information is advertisement information, and its content sensitivity index corresponds to n4, n1≥n2≥n3≥n4.
在一具体的实例中,例如:对于用户在某个平台系统中发布的内容“***化妆品今日促销大优惠”,该语句经分词后得到“***化妆品”、“今日”、“促销”、“大优惠”,将这些分词与词库中的词进行匹配,得到的关键词为“化妆品”、“促销”、“大优惠”,根据这些关键词,可以分析出该语句的核心观点信息为“化妆品促销优惠”,其核心观点信息属于广告信息。In a specific example, for example, for the content published by the user in a certain platform system, "*** Cosmetics Today Promotion Promotion", the sentence is divided into "*** Cosmetics", "Today", "Promotion" ", "big discount", matching these participles with the words in the thesaurus, the keywords obtained are "cosmetics", "promotion", "big discount", according to these keywords, the core viewpoint of the statement can be analyzed The information is “cosmetic promotion offer”, and its core viewpoint information belongs to advertisement information.
其中,用户历史行为指数为用户应对其他用户发布的内容的行为描述,在一实施例中,用户历史行为指数的基数为1,若用户有举报其他用户发布涉及敏感词汇或者敏感信息的行为,则用户历史行为指数为(1-0.2),若用户自己有发布涉及敏感词汇或者敏感信息的行为,则用户历史行为指数为(1+0.2)。The user historical behavior index is a description of the behavior of the user to the content published by other users. In an embodiment, the user's historical behavior index has a base of 1. If the user reports that other users publish sensitive vocabulary or sensitive information, The user's historical behavior index is (1-0.2). If the user has published a behavior involving sensitive vocabulary or sensitive information, the user's historical behavior index is (1+0.2).
在获取业务系统中每一用户发布内容的内容敏感指数及用户历史行为指数之后,计算该用户的系统敏感指数=内容敏感指数*用户历史行为指数,将业务系统中所有用户的系统敏感指数相加得到系统敏感指数总和。After obtaining the content sensitivity index and the user historical behavior index of each user's published content in the business system, the system sensitivity index=content sensitivity index*user historical behavior index of the user is calculated, and the system sensitivity indexes of all users in the business system are added. Get the sum of the system sensitivity index.
在其他实施例中,可对词库中的词进行维护,结合业务系统的运营情况,实时调整现有词库中核心观点信息对应的内容敏感指数的评定,其具体的调整算法包括:根据词汇的出现总频率、词汇的在各业务系统出现广度、运营人员的反馈评定等级、运营人员的反馈频率等进行综合评定,若上述的4个指标越高,则内容敏感指数越大。In other embodiments, the words in the thesaurus may be maintained, and the evaluation of the content sensitivity index corresponding to the core viewpoint information in the existing thesaurus may be adjusted in real time according to the operation of the business system, and the specific adjustment algorithm includes: according to the vocabulary The total frequency of occurrence, the breadth of vocabulary in each business system, the feedback rating of the operator, and the feedback frequency of the operator are comprehensively assessed. If the above four indicators are higher, the content sensitivity index is larger.
步骤S3,计算业务系统对应的系统抗敏指数与系统敏感指数总和两者的差值,根据该差值确定是否发出预警。Step S3: Calculate a difference between the system anti-allergy index corresponding to the service system and the sum of the system sensitivity indexes, and determine whether to issue an early warning according to the difference.
在一实施例中,该步骤具体包括:In an embodiment, the step specifically includes:
若所述差值大于预设的第一阈值,则确定该业务系统不发出预警;If the difference is greater than a preset first threshold, determining that the service system does not issue an early warning;
若所述差值小于等于该第一阈值且大于预设的第二阈值,则确定该业务系统发出轻度预警;If the difference is less than or equal to the first threshold and greater than a preset second threshold, determining that the service system issues a slight alert;
若所述差值小于等于该第二阈值且大于预设的第三阈值,则确定该业务系统发出中度预警;If the difference is less than or equal to the second threshold and greater than a preset third threshold, determining that the service system issues a medium alert;
若所述差值小于等于该第三阈值,则确定该业务系统发出重度预警。If the difference is less than or equal to the third threshold, it is determined that the service system issues a severe warning.
在一具体的实例中,若差值大于90,则确定该业务系统不发出预警;若所述差值小于等于90且大于60,则确定该业务系统发出轻度预警;若所述差值小于等于60且大于30,则确定该业务系统发出中度预警;若所述差值小于等于30,则确定该业务系统发出重度预警。In a specific example, if the difference is greater than 90, it is determined that the service system does not issue an early warning; if the difference is less than or equal to 90 and greater than 60, it is determined that the service system issues a slight warning; if the difference is less than If it is equal to 60 and greater than 30, it is determined that the service system issues a medium warning; if the difference is less than or equal to 30, it is determined that the service system issues a severe warning.
本申请针对不同的业务系统计算用以评判其抵抗敏感信息的能力的系统抗敏指数、计算用以评判用户的行为给业务系统带来的敏感影响程度的系统敏感指数总和,根据两者的差值确定该业务系统是否发出预警,本申请综合考虑在业务系统的实际应用中,各个业务系统对敏感信息的包容度不同,各个业务系统中的用户的严谨程度也不同,基于这两个方面从业务系统的实际应用角度出发,结合业务场景及用户行为对敏感信息进行层级化评判,对敏感信息进行更准确的预警。The present application calculates a system anti-allergy index for judging its ability to resist sensitive information for different business systems, and calculates a system sensitivity index for judging the degree of sensitive influence of the user's behavior on the business system, according to the difference between the two. The value determines whether the service system issues an early warning. The application considers that in the actual application of the service system, each service system has different tolerances for sensitive information, and the rigor of users in each service system is different, based on these two aspects. Based on the practical application of the business system, combined with the business scenario and user behavior, the sensitive information is hierarchically evaluated to provide more accurate warning of sensitive information.
本申请还提供一种计算机可读存储介质,所述计算机可读存储介质上存储有处理系统,所述处理系统被处理器执行时实现上述的系统敏感内容的预警方法的步骤。The present application also provides a computer readable storage medium having stored thereon a processing system, the processing system being executed by a processor to implement the steps of the early warning method of system sensitive content.
上述本申请实施例序号仅仅为了描述,不代表实施例的优劣。The serial numbers of the embodiments of the present application are merely for the description, and do not represent the advantages and disadvantages of the embodiments.
通过以上的实施方式的描述,本领域的技术人员可以清楚地了解到上述实施例方法可借助软件加必需的通用硬件平台的方式来实现,当然也可以通过硬件,但很多情况下前者是更佳的实施方式。基于这样的理解,本申请的 技术方案本质上或者说对现有技术做出贡献的部分可以以软件产品的形式体现出来,该计算机软件产品存储在一个存储介质(如ROM/RAM、磁碟、光盘)中,包括若干指令用以使得一台终端设备(可以是手机,计算机,服务器,空调器,或者网络设备等)执行本申请各个实施例所述的方法。Through the description of the above embodiments, those skilled in the art can clearly understand that the foregoing embodiment method can be implemented by means of software plus a necessary general hardware platform, and of course, can also be through hardware, but in many cases, the former is better. Implementation. Based on such understanding, the technical solution of the present application, which is essential or contributes to the prior art, may be embodied in the form of a software product stored in a storage medium (such as ROM/RAM, disk, The optical disc includes a number of instructions for causing a terminal device (which may be a mobile phone, a computer, a server, an air conditioner, or a network device, etc.) to perform the methods described in various embodiments of the present application.
以上仅为本申请的优选实施例,并非因此限制本申请的专利范围,凡是利用本申请说明书及附图内容所作的等效结构或等效流程变换,或直接或间接运用在其他相关的技术领域,均同理包括在本申请的专利保护范围内。The above is only a preferred embodiment of the present application, and is not intended to limit the scope of the patent application, and the equivalent structure or equivalent process transformations made by the specification and the drawings of the present application, or directly or indirectly applied to other related technical fields. The same is included in the scope of patent protection of this application.
Claims (20)
- 一种电子装置,其特征在于,所述电子装置包括存储器及与所述存储器连接的处理器,所述存储器中存储有可在所述处理器上运行的处理系统,所述处理系统被所述处理器执行时实现如下步骤:An electronic device, comprising: a memory and a processor coupled to the memory, wherein the memory stores a processing system operable on the processor, the processing system being The processor implements the following steps when it executes:抗敏指数处理步骤,获取业务系统对应的各个抗敏参数及各个抗敏参数对应的权重,根据各个抗敏参数及对应的权重计算业务系统的系统抗敏指数;The anti-allergy index processing step acquires each anti-allergy parameter corresponding to the business system and the weight corresponding to each anti-allergy parameter, and calculates a system anti-allergy index of the business system according to each anti-allergy parameter and corresponding weight;敏感指数处理步骤,获取业务系统中每一用户发布内容的内容敏感指数及用户历史行为指数,根据内容敏感指数及用户历史行为指数计算该用户的系统敏感指数,将业务系统中所有用户的系统敏感指数相加得到系统敏感指数总和;The sensitive index processing step acquires the content sensitivity index and the user historical behavior index of each user's published content in the business system, calculates the system sensitivity index of the user according to the content sensitivity index and the user historical behavior index, and sensitive the system of all users in the business system. The sum of the indices gives the sum of the system sensitivity indices;确定预警步骤,计算业务系统对应的系统抗敏指数与系统敏感指数总和两者的差值,根据该差值确定是否发出预警。The early warning step is determined, and the difference between the system anti-allergy index corresponding to the business system and the sum of the system sensitivity indexes is calculated, and whether the early warning is issued is determined according to the difference.
- 根据权利要求1所述的电子装置,其特征在于,所述抗敏参数包括系统重要等级系数x1、用户量等级系数x2、系统信息传播系数x3、应急处理系数x4、敏感词汇关注度系数x5,所述系统重要等级系数x1、用户量等级系数x2、系统信息传播系数x3、应急处理系数x4、敏感词汇关注度系数x5的权重对应分别为w1、w2、w3、w4、w5,所述系统抗敏指数=x1*w1+x2*w2+x3*w3+x4*w4+x5*w5。The electronic device according to claim 1, wherein the anti-allergy parameter comprises a system importance level coefficient x1, a user quantity level coefficient x2, a system information propagation coefficient x3, an emergency processing coefficient x4, a sensitive vocabulary attention degree coefficient x5, The weights of the system important level coefficient x1, the user quantity level coefficient x2, the system information propagation coefficient x3, the emergency processing coefficient x4, and the sensitive vocabulary attention degree coefficient x5 are respectively w1, w2, w3, w4, w5, and the system is resistant. Sensitivity index = x1 * w1 + x2 * w2 + x3 * w3 + x4 * w4 + x5 * w5.
- 根据权利要求2所述的电子装置,其特征在于,所述系统重要等级系数x1根据该业务系统中各业务场景的权重及各业务场景对应的业务量计算得到,所述业务场景包括金融交易场景及普通交易场景,所述金融交易场景的权重为k1,所述普通交易场景的权重为k2,所述金融交易场景的业务量为c1,所述普通交易场景的的业务量为c2,所述业务系统的系统重要等级系数x1=(k1+k2)/((πk1/2arctan(c1/α)+πk2/2arctan(c2/α)),其中,所述α 为该业务系统的业务场景的平均业务量,α=(c1+c2)/2。The electronic device according to claim 2, wherein the system important level coefficient x1 is calculated according to the weight of each business scenario in the service system and the traffic volume corresponding to each service scenario, and the business scenario includes a financial transaction scenario. And a common transaction scenario, the weight of the financial transaction scenario is k1, the weight of the common transaction scenario is k2, the traffic volume of the financial transaction scenario is c1, and the traffic volume of the common transaction scenario is c2, The system important level coefficient x1 of the business system is (k1+k2)/((πk1/2arctan(c1/α)+πk2/2arctan(c2/α)), where α is the average of the business scenarios of the business system Traffic, α = (c1 + c2) / 2.
- 根据权利要求1至3任一项所述的电子装置,其特征在于,所述内容敏感指数的获取过程包括:对用户发布内容以句子为单位进行分词,将分词后的词与预先建立的词库中的词进行匹配,以匹配得到对应的关键词,根据该关键词分析每一句子的对应的核心观点信息,根据预先建立的核心观点信息与内容敏感指数的关联关系获取该核心观点信息关联的内容敏感指数。The electronic device according to any one of claims 1 to 3, wherein the process of acquiring the content sensitivity index comprises: segmenting the content of the user to the sentence, and dividing the word after the word segmentation with the pre-established word. The words in the library are matched to match the corresponding keywords, and the corresponding core viewpoint information of each sentence is analyzed according to the keyword, and the core viewpoint information association is obtained according to the association relationship between the pre-established core viewpoint information and the content sensitive index. Content sensitivity index.
- 根据权利要求1至3任一项所述的电子装置,其特征在于,所述确定预警步骤,具体包括:The electronic device according to any one of claims 1 to 3, wherein the determining the warning step comprises:若所述差值大于预设的第一阈值,则确定该业务系统不发出预警;If the difference is greater than a preset first threshold, determining that the service system does not issue an early warning;若所述差值小于等于该第一阈值且大于预设的第二阈值,则确定该业务系统发出轻度预警;If the difference is less than or equal to the first threshold and greater than a preset second threshold, determining that the service system issues a slight alert;若所述差值小于等于该第二阈值且大于预设的第三阈值,则确定该业务系统发出中度预警;If the difference is less than or equal to the second threshold and greater than a preset third threshold, determining that the service system issues a medium alert;若所述差值小于等于该第三阈值,则确定该业务系统发出重度预警。If the difference is less than or equal to the third threshold, it is determined that the service system issues a severe warning.
- 一种系统敏感内容的预警方法,其特征在于,所述系统敏感内容的预警方法包括:An early warning method for system sensitive content, characterized in that the method for warning the sensitive content of the system comprises:S1,获取业务系统对应的各个抗敏参数及各个抗敏参数对应的权重,根据各个抗敏参数及对应的权重计算业务系统的系统抗敏指数;S1, obtaining respective anti-allergy parameters corresponding to the service system and weights corresponding to the respective anti-allergy parameters, and calculating a system anti-allergy index of the service system according to each anti-allergy parameter and corresponding weight;S2,获取业务系统中每一用户发布内容的内容敏感指数及用户历史行为指数,根据内容敏感指数及用户历史行为指数计算该用户的系统敏感指数,将业务系统中所有用户的系统敏感指数相加得到系统敏感指数总和;S2: obtaining a content sensitivity index and a user historical behavior index of content published by each user in the business system, calculating a system sensitivity index of the user according to the content sensitivity index and the user historical behavior index, and adding system sensitivity indexes of all users in the business system Get the sum of the system sensitivity index;S3,计算业务系统对应的系统抗敏指数与系统敏感指数总和两者的差值,根据该差值确定是否发出预警。S3: Calculate the difference between the system anti-allergy index corresponding to the service system and the sum of the system sensitivity indexes, and determine whether to issue an early warning according to the difference.
- 根据权利要求6所述的系统敏感内容的预警方法,其特征在于,所述抗敏参数包括系统重要等级系数x1、用户量等级系数x2、系统信息传播系 数x3、应急处理系数x4、敏感词汇关注度系数x5,所述系统重要等级系数x1、用户量等级系数x2、系统信息传播系数x3、应急处理系数x4、敏感词汇关注度系数x5的权重对应分别为w1、w2、w3、w4、w5,所述系统抗敏指数=x1*w1+x2*w2+x3*w3+x4*w4+x5*w5。The early warning method for sensitive content of a system according to claim 6, wherein the anti-allergy parameter comprises a system important level coefficient x1, a user quantity level coefficient x2, a system information propagation coefficient x3, an emergency processing coefficient x4, a sensitive vocabulary concern. The degree coefficient x5, the system important level coefficient x1, the user quantity level coefficient x2, the system information propagation coefficient x3, the emergency processing coefficient x4, the sensitive vocabulary attention degree coefficient x5 have weights corresponding to w1, w2, w3, w4, w5, respectively. The system anti-allergy index = x1 * w1 + x2 * w2 + x3 * w3 + x4 * w4 + x5 * w5.
- 根据权利要求7所述的系统敏感内容的预警方法,其特征在于,所述系统重要等级系数x1根据该业务系统中各业务场景的权重及各业务场景对应的业务量计算得到,所述业务场景包括金融交易场景及普通交易场景,所述金融交易场景的权重为k1,所述普通交易场景的权重为k2,所述金融交易场景的业务量为c1,所述普通交易场景的的业务量为c2,所述业务系统的系统重要等级系数x1=(k1+k2)/((πk1/2arctan(c1/α)+πk2/2arctan(c2/α)),其中,所述α为该业务系统的业务场景的平均业务量,α=(c1+c2)/2。The early warning method for the sensitive content of the system according to claim 7, wherein the system important level coefficient x1 is calculated according to the weight of each service scenario in the service system and the service volume corresponding to each service scenario, the service scenario Including a financial transaction scenario and a common transaction scenario, the financial transaction scenario has a weight of k1, the common transaction scenario has a weight of k2, the financial transaction scenario has a traffic volume of c1, and the normal transaction scenario has a traffic volume of C2, the system important level coefficient x1=(k1+k2)/((πk1/2arctan(c1/α)+πk2/2arctan(c2/α))) of the service system, wherein the α is the service system The average traffic of the business scenario, α = (c1 + c2) / 2.
- 根据权利要求6至8任一项所述的系统敏感内容的预警方法,其特征在于,所述内容敏感指数的获取过程包括:对用户发布内容以句子为单位进行分词,将分词后的词与预先建立的词库中的词进行匹配,以匹配得到对应的关键词,根据该关键词分析每一句子的对应的核心观点信息,根据预先建立的核心观点信息与内容敏感指数的关联关系获取该核心观点信息关联的内容敏感指数。The system for alerting the sensitive content of the system according to any one of claims 6 to 8, wherein the process of obtaining the content sensitive index comprises: segmenting the content of the user by using the sentence as a unit, and dividing the word after the word segmentation The words in the pre-established thesaurus are matched to match the corresponding keywords, and the corresponding core viewpoint information of each sentence is analyzed according to the keyword, and the relationship is obtained according to the association relationship between the pre-established core viewpoint information and the content sensitive index. The content sensitivity index associated with the core view information.
- 根据权利要求6至8任一项所述的系统敏感内容的预警方法,其特征在于,所述步骤S3,具体包括:The method for alerting the sensitive content of the system according to any one of claims 6 to 8, wherein the step S3 comprises:若所述差值大于预设的第一阈值,则确定该业务系统不发出预警;If the difference is greater than a preset first threshold, determining that the service system does not issue an early warning;若所述差值小于等于该第一阈值且大于预设的第二阈值,则确定该业务系统发出轻度预警;If the difference is less than or equal to the first threshold and greater than a preset second threshold, determining that the service system issues a slight alert;若所述差值小于等于该第二阈值且大于预设的第三阈值,则确定该业务系统发出中度预警;If the difference is less than or equal to the second threshold and greater than a preset third threshold, determining that the service system issues a medium alert;若所述差值小于等于该第三阈值,则确定该业务系统发出重度预警。If the difference is less than or equal to the third threshold, it is determined that the service system issues a severe warning.
- 一种系统敏感内容的预警系统,其特征在于,所述系统敏感内容的预警系统包括:An early warning system for system sensitive content, characterized in that the early warning system for sensitive content of the system comprises:获取模块,用于获取业务系统对应的各个抗敏参数及各个抗敏参数对应的权重,根据各个抗敏参数及对应的权重计算业务系统的系统抗敏指数;The obtaining module is configured to obtain each anti-allergy parameter corresponding to the service system and a weight corresponding to each anti-allergy parameter, and calculate a system anti-allergy index of the service system according to each anti-allergy parameter and the corresponding weight;处理模块,用于获取业务系统中每一用户发布内容的内容敏感指数及用户历史行为指数,根据内容敏感指数及用户历史行为指数计算该用户的系统敏感指数,将业务系统中所有用户的系统敏感指数相加得到系统敏感指数总和;The processing module is configured to obtain a content sensitivity index and a user historical behavior index of each user published content in the business system, calculate a system sensitivity index of the user according to the content sensitivity index and the user historical behavior index, and sensitivity the system of all users in the business system. The sum of the indices gives the sum of the system sensitivity indices;预警模块,用于计算业务系统对应的系统抗敏指数与系统敏感指数总和两者的差值,根据该差值确定是否发出预警。The early warning module is configured to calculate a difference between the system anti-allergy index corresponding to the business system and the sum of the system sensitivity indexes, and determine whether to issue an early warning according to the difference.
- 根据权利要求11所述的系统敏感内容的预警系统,其特征在于,所述抗敏参数包括系统重要等级系数x1、用户量等级系数x2、系统信息传播系数x3、应急处理系数x4、敏感词汇关注度系数x5,所述系统重要等级系数x1、用户量等级系数x2、系统信息传播系数x3、应急处理系数x4、敏感词汇关注度系数x5的权重对应分别为w1、w2、w3、w4、w5,所述系统抗敏指数=x1*w1+x2*w2+x3*w3+x4*w4+x5*w5。The early warning system for sensitive content of a system according to claim 11, wherein the anti-allergy parameter comprises a system important level coefficient x1, a user quantity level coefficient x2, a system information propagation coefficient x3, an emergency processing coefficient x4, a sensitive vocabulary concern. The degree coefficient x5, the system important level coefficient x1, the user quantity level coefficient x2, the system information propagation coefficient x3, the emergency processing coefficient x4, the sensitive vocabulary attention degree coefficient x5 have weights corresponding to w1, w2, w3, w4, w5, respectively. The system anti-allergy index = x1 * w1 + x2 * w2 + x3 * w3 + x4 * w4 + x5 * w5.
- 根据权利要求12所述的系统敏感内容的预警系统,其特征在于,所述系统重要等级系数x1根据该业务系统中各业务场景的权重及各业务场景对应的业务量计算得到,所述业务场景包括金融交易场景及普通交易场景,所述金融交易场景的权重为k1,所述普通交易场景的权重为k2,所述金融交易场景的业务量为c1,所述普通交易场景的的业务量为c2,所述业务系统的系统重要等级系数x1=(k1+k2)/((πk1/2arctan(c1/α)+πk2/2arctan(c2/α)),其中,所述α为该业务系统的业务场景的平均业务量,α=(c1+c2)/2。The early warning system for system sensitive content according to claim 12, wherein the system important level coefficient x1 is calculated according to the weight of each service scenario in the service system and the traffic volume corresponding to each service scenario, the service scenario Including a financial transaction scenario and a common transaction scenario, the financial transaction scenario has a weight of k1, the common transaction scenario has a weight of k2, the financial transaction scenario has a traffic volume of c1, and the normal transaction scenario has a traffic volume of C2, the system important level coefficient x1=(k1+k2)/((πk1/2arctan(c1/α)+πk2/2arctan(c2/α))) of the service system, wherein the α is the service system The average traffic of the business scenario, α = (c1 + c2) / 2.
- 根据权利要求11至13任一项所述的系统敏感内容的预警系统,其特征在于,所述处理模块具体用于:对用户发布内容以句子为单位进行分词, 将分词后的词与预先建立的词库中的词进行匹配,以匹配得到对应的关键词,根据该关键词分析每一句子的对应的核心观点信息,根据预先建立的核心观点信息与内容敏感指数的关联关系获取该核心观点信息关联的内容敏感指数。The early warning system for sensitive content of a system according to any one of claims 11 to 13, wherein the processing module is specifically configured to: perform word segmentation on a sentence-by-sentence basis for a user to post content, and pre-establish a word after the word segmentation. The words in the thesaurus are matched to match the corresponding keywords, and the corresponding core viewpoint information of each sentence is analyzed according to the keyword, and the core viewpoint is obtained according to the association relationship between the pre-established core viewpoint information and the content sensitive index. Content-sensitive index of information association.
- 根据权利要求11至13任一项所述的系统敏感内容的预警系统,其特征在于,所述预警模块具体用于:若所述差值大于预设的第一阈值,则确定该业务系统不发出预警;若所述差值小于等于该第一阈值且大于预设的第二阈值,则确定该业务系统发出轻度预警;若所述差值小于等于该第二阈值且大于预设的第三阈值,则确定该业务系统发出中度预警;若所述差值小于等于该第三阈值,则确定该业务系统发出重度预警。The early warning system for the sensitive content of the system according to any one of claims 11 to 13, wherein the early warning module is specifically configured to: if the difference is greater than a preset first threshold, determine that the service system is not And sending an early warning; if the difference is less than or equal to the first threshold and greater than a preset second threshold, determining that the service system issues a slight warning; if the difference is less than or equal to the second threshold and greater than a preset The three thresholds determine that the service system issues a medium alert; if the difference is less than or equal to the third threshold, it is determined that the service system issues a severe alert.
- 一种计算机可读存储介质,其特征在于,所述计算机可读存储介质上存储有处理系统,所述处理系统被处理器执行时实现步骤:A computer readable storage medium, wherein the computer readable storage medium stores a processing system, and when the processing system is executed by the processor, the steps are:抗敏指数处理步骤,获取业务系统对应的各个抗敏参数及各个抗敏参数对应的权重,根据各个抗敏参数及对应的权重计算业务系统的系统抗敏指数;The anti-allergy index processing step acquires each anti-allergy parameter corresponding to the business system and the weight corresponding to each anti-allergy parameter, and calculates a system anti-allergy index of the business system according to each anti-allergy parameter and corresponding weight;敏感指数处理步骤,获取业务系统中每一用户发布内容的内容敏感指数及用户历史行为指数,根据内容敏感指数及用户历史行为指数计算该用户的系统敏感指数,将业务系统中所有用户的系统敏感指数相加得到系统敏感指数总和;The sensitive index processing step acquires the content sensitivity index and the user historical behavior index of each user's published content in the business system, calculates the system sensitivity index of the user according to the content sensitivity index and the user historical behavior index, and sensitive the system of all users in the business system. The sum of the indices gives the sum of the system sensitivity indices;确定预警步骤,计算业务系统对应的系统抗敏指数与系统敏感指数总和两者的差值,根据该差值确定是否发出预警。The early warning step is determined, and the difference between the system anti-allergy index corresponding to the business system and the sum of the system sensitivity indexes is calculated, and whether the early warning is issued is determined according to the difference.
- 根据权利要求16所述的计算机可读存储介质,其特征在于,所述抗敏参数包括系统重要等级系数x1、用户量等级系数x2、系统信息传播系数x3、应急处理系数x4、敏感词汇关注度系数x5,所述系统重要等级系数x1、用户量等级系数x2、系统信息传播系数x3、应急处理系数x4、敏感词汇关 注度系数x5的权重对应分别为w1、w2、w3、w4、w5,所述系统抗敏指数=x1*w1+x2*w2+x3*w3+x4*w4+x5*w5。The computer readable storage medium according to claim 16, wherein the anti-allergy parameter comprises a system importance level coefficient x1, a user quantity level coefficient x2, a system information propagation coefficient x3, an emergency processing coefficient x4, a sensitive vocabulary attention degree. The coefficient x5, the weight of the system important level coefficient x1, the user quantity level coefficient x2, the system information propagation coefficient x3, the emergency processing coefficient x4, and the sensitive vocabulary attention degree coefficient x5 are respectively w1, w2, w3, w4, w5, The system anti-allergy index = x1 * w1 + x2 * w2 + x3 * w3 + x4 * w4 + x5 * w5.
- 根据权利要求17所述的计算机可读存储介质,其特征在于,所述系统重要等级系数x1根据该业务系统中各业务场景的权重及各业务场景对应的业务量计算得到,所述业务场景包括金融交易场景及普通交易场景,所述金融交易场景的权重为k1,所述普通交易场景的权重为k2,所述金融交易场景的业务量为c1,所述普通交易场景的的业务量为c2,所述业务系统的系统重要等级系数x1=(k1+k2)/((πk1/2arctan(c1/α)+πk2/2arctan(c2/α)),其中,所述α为该业务系统的业务场景的平均业务量,α=(c1+c2)/2。The computer readable storage medium according to claim 17, wherein the system importance level coefficient x1 is calculated according to the weight of each service scenario in the service system and the traffic volume corresponding to each service scenario, where the service scenario includes The financial transaction scenario and the common transaction scenario, the weight of the financial transaction scenario is k1, the weight of the common transaction scenario is k2, the traffic volume of the financial transaction scenario is c1, and the traffic volume of the common transaction scenario is c2 The system important rank coefficient x1=(k1+k2)/((πk1/2arctan(c1/α)+πk2/2arctan(c2/α))), wherein the α is the service of the service system The average traffic of the scene, α = (c1 + c2) / 2.
- 根据权利要求16至18任一项所述的计算机可读存储介质,其特征在于,所述内容敏感指数的获取过程包括:对用户发布内容以句子为单位进行分词,将分词后的词与预先建立的词库中的词进行匹配,以匹配得到对应的关键词,根据该关键词分析每一句子的对应的核心观点信息,根据预先建立的核心观点信息与内容敏感指数的关联关系获取该核心观点信息关联的内容敏感指数。The computer readable storage medium according to any one of claims 16 to 18, wherein the process of obtaining the content sensitivity index comprises: segmenting the content of the user to the sentence, and segmenting the word after the word segmentation with the word The words in the established thesaurus are matched to match the corresponding keywords, and the corresponding core viewpoint information of each sentence is analyzed according to the keyword, and the core is obtained according to the association relationship between the pre-established core viewpoint information and the content sensitive index. The content sensitivity index associated with the opinion information.
- 根据权利要求16至18任一项所述的计算机可读存储介质,其特征在于,所述确定预警步骤,具体包括:The computer readable storage medium according to any one of claims 16 to 18, wherein the determining the warning step comprises:若所述差值大于预设的第一阈值,则确定该业务系统不发出预警;If the difference is greater than a preset first threshold, determining that the service system does not issue an early warning;若所述差值小于等于该第一阈值且大于预设的第二阈值,则确定该业务系统发出轻度预警;If the difference is less than or equal to the first threshold and greater than a preset second threshold, determining that the service system issues a slight alert;若所述差值小于等于该第二阈值且大于预设的第三阈值,则确定该业务系统发出中度预警;If the difference is less than or equal to the second threshold and greater than a preset third threshold, determining that the service system issues a medium alert;若所述差值小于等于该第三阈值,则确定该业务系统发出重度预警。If the difference is less than or equal to the third threshold, it is determined that the service system issues a severe warning.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810182928.4A CN108446270B (en) | 2018-03-06 | 2018-03-06 | Electronic device, early warning method of system sensitive content and storage medium |
CN201810182928.4 | 2018-03-06 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2019169766A1 true WO2019169766A1 (en) | 2019-09-12 |
Family
ID=63193761
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2018/089460 WO2019169766A1 (en) | 2018-03-06 | 2018-06-01 | Electronic apparatus, method and system for early warning regarding system sensitive content, and storage medium |
Country Status (2)
Country | Link |
---|---|
CN (1) | CN108446270B (en) |
WO (1) | WO2019169766A1 (en) |
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111079029A (en) * | 2019-12-20 | 2020-04-28 | 珠海格力电器股份有限公司 | Sensitive account detection method, storage medium and computer equipment |
CN112015869A (en) * | 2020-09-14 | 2020-12-01 | 支付宝(杭州)信息技术有限公司 | Risk detection method, device and equipment for text to be issued |
CN113779336A (en) * | 2021-09-08 | 2021-12-10 | 五八同城信息技术有限公司 | User behavior data processing method and device and electronic equipment |
CN113891104A (en) * | 2021-09-24 | 2022-01-04 | 北京沃东天骏信息技术有限公司 | Live broadcast processing method, live broadcast platform, storage medium and electronic equipment |
CN114238280A (en) * | 2021-12-20 | 2022-03-25 | 中电金信软件有限公司 | Method and device for constructing financial sensitive information standard library and electronic equipment |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111966915A (en) * | 2019-05-20 | 2020-11-20 | 腾讯科技(深圳)有限公司 | Information inspection method, computer equipment and storage medium |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103268350A (en) * | 2013-05-29 | 2013-08-28 | 安徽雷越网络科技有限公司 | Internet public opinion information monitoring system and monitoring method |
US8655939B2 (en) * | 2007-01-05 | 2014-02-18 | Digital Doors, Inc. | Electromagnetic pulse (EMP) hardened information infrastructure with extractor, cloud dispersal, secure storage, content analysis and classification and method therefor |
CN104933093A (en) * | 2015-05-19 | 2015-09-23 | 武汉泰迪智慧科技有限公司 | Regional public opinion monitoring and decision-making auxiliary system and method based on big data |
CN107633084A (en) * | 2017-09-28 | 2018-01-26 | 武汉虹旭信息技术有限责任公司 | Based on the public sentiment managing and control system and its method from media |
Family Cites Families (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104156365B (en) * | 2013-05-14 | 2018-05-11 | 中国移动通信集团湖南有限公司 | A kind of monitoring method of file, apparatus and system |
US9099006B2 (en) * | 2013-08-22 | 2015-08-04 | GM Global Technology Operations LLC | Context-aware threat response arbitration |
CN106445998B (en) * | 2016-05-26 | 2020-08-21 | 达而观信息科技(上海)有限公司 | Text content auditing method and system based on sensitive words |
CN106599713B (en) * | 2016-11-11 | 2019-04-12 | 中国电子科技网络信息安全有限公司 | A kind of database desensitization system and method based on big data |
CN106777098A (en) * | 2016-12-14 | 2017-05-31 | 齐雪成 | A kind of method and device of wisdom endowment big data public sentiment networking monitoring |
CN206451332U (en) * | 2017-01-22 | 2017-08-29 | 沈阳一通智能科技有限公司 | A kind of intelligent network warning device |
CN107633183B (en) * | 2017-09-29 | 2020-10-02 | 东南大学 | Data leakage detection method based on sensitivity of query result set |
-
2018
- 2018-03-06 CN CN201810182928.4A patent/CN108446270B/en active Active
- 2018-06-01 WO PCT/CN2018/089460 patent/WO2019169766A1/en active Application Filing
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8655939B2 (en) * | 2007-01-05 | 2014-02-18 | Digital Doors, Inc. | Electromagnetic pulse (EMP) hardened information infrastructure with extractor, cloud dispersal, secure storage, content analysis and classification and method therefor |
CN103268350A (en) * | 2013-05-29 | 2013-08-28 | 安徽雷越网络科技有限公司 | Internet public opinion information monitoring system and monitoring method |
CN104933093A (en) * | 2015-05-19 | 2015-09-23 | 武汉泰迪智慧科技有限公司 | Regional public opinion monitoring and decision-making auxiliary system and method based on big data |
CN107633084A (en) * | 2017-09-28 | 2018-01-26 | 武汉虹旭信息技术有限责任公司 | Based on the public sentiment managing and control system and its method from media |
Cited By (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111079029A (en) * | 2019-12-20 | 2020-04-28 | 珠海格力电器股份有限公司 | Sensitive account detection method, storage medium and computer equipment |
CN111079029B (en) * | 2019-12-20 | 2023-11-21 | 珠海格力电器股份有限公司 | Sensitive account detection method, storage medium and computer equipment |
CN112015869A (en) * | 2020-09-14 | 2020-12-01 | 支付宝(杭州)信息技术有限公司 | Risk detection method, device and equipment for text to be issued |
CN113779336A (en) * | 2021-09-08 | 2021-12-10 | 五八同城信息技术有限公司 | User behavior data processing method and device and electronic equipment |
CN113891104A (en) * | 2021-09-24 | 2022-01-04 | 北京沃东天骏信息技术有限公司 | Live broadcast processing method, live broadcast platform, storage medium and electronic equipment |
WO2023045939A1 (en) * | 2021-09-24 | 2023-03-30 | 北京沃东天骏信息技术有限公司 | Live broadcast processing method, live broadcast platform, storage medium and electronic device |
CN114238280A (en) * | 2021-12-20 | 2022-03-25 | 中电金信软件有限公司 | Method and device for constructing financial sensitive information standard library and electronic equipment |
Also Published As
Publication number | Publication date |
---|---|
CN108446270A (en) | 2018-08-24 |
CN108446270B (en) | 2021-06-08 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2019169766A1 (en) | Electronic apparatus, method and system for early warning regarding system sensitive content, and storage medium | |
US10467631B2 (en) | Ranking and tracking suspicious procurement entities | |
CA2580731C (en) | Fraud risk advisor | |
US20200302521A1 (en) | Method and system for software monitoring associated with an entity | |
US7558834B2 (en) | Method and system to process issue data pertaining to a system | |
WO2020177478A1 (en) | Credit-based qualification information auditing method, apparatus and device | |
AU2016288779A1 (en) | Modifying data structures to indicate derived relationships among entity data objects | |
TW202418102A (en) | System and method for a thing machine to perform models | |
US20230116345A1 (en) | Systems and methods for enterprise wide risk assessment and aggregated risk analysis | |
CN110942314A (en) | Abnormal account supervision method and device | |
WO2019095569A1 (en) | Financial analysis method based on financial and economic event on microblog, application server, and computer readable storage medium | |
CN109918189B (en) | Resource management method and related equipment | |
US20200219149A1 (en) | Validating and controlling social posts | |
CN110659981A (en) | Enterprise dependency relationship identification method and device and electronic equipment | |
CN116244751A (en) | Data desensitization method, device, electronic equipment, storage medium and program product | |
CN113609407B (en) | Regional consistency verification method and device | |
CN114580889A (en) | Operation risk management and control method, device, equipment, medium and program product | |
CN114693358A (en) | Data processing method and device, electronic equipment and storage medium | |
CN110288365B (en) | Data processing method and system, computer system and computer readable storage medium | |
CN106446093B (en) | Method and device for aggregating concepts | |
CN111275545A (en) | Method, apparatus, device and medium for on-line mortgage | |
CN111625556B (en) | Order matching method, device, equipment and storage medium based on credit investigation | |
CN116228382A (en) | Data processing method, device, electronic equipment and storage medium | |
CN118245678A (en) | Method, apparatus, device and computer readable medium for processing business data | |
TWM653797U (en) | Information processing system |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 18908594 Country of ref document: EP Kind code of ref document: A1 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
32PN | Ep: public notification in the ep bulletin as address of the adressee cannot be established |
Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 11.12.2020) |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 18908594 Country of ref document: EP Kind code of ref document: A1 |