CN110134703A - Keyword library update method and device - Google Patents

Keyword library update method and device

Info

Publication number
CN110134703A
Authority
CN
China
Prior art keywords
target
keywords
monitoring data
information
data
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201910421356.5A
Other languages
Chinese (zh)
Inventor
何晶
刘杨
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Miaozhen Information Technology Co Ltd
Miaozhen Systems Information Technology Co Ltd
Original Assignee
Miaozhen Systems Information Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Miaozhen Systems Information Technology Co Ltd
Priority to CN201910421356.5A
Publication of CN110134703A
Legal status: Pending

Classifications

    • G: PHYSICS
    • G06: COMPUTING; CALCULATING OR COUNTING
    • G06F: ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00: Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20: Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/23: Updating
    • G: PHYSICS
    • G06: COMPUTING; CALCULATING OR COUNTING
    • G06F: ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00: Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90: Details of database functions independent of the retrieved data types
    • G06F16/903: Querying
    • G06F16/90335: Query processing
    • G06F16/90344: Query processing by using string matching techniques
    • G: PHYSICS
    • G06: COMPUTING; CALCULATING OR COUNTING
    • G06F: ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00: Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90: Details of database functions independent of the retrieved data types
    • G06F16/95: Retrieval from the web
    • G06F16/953: Querying, e.g. by the use of web search engines
    • G06F16/9535: Search customisation based on user profiles and personalisation

Abstract

This application provides a keyword library update method and device. The method includes: obtaining target monitoring data and extracting the information of a target user agent (UA) from the target monitoring data; extracting, from the information of the target UA, the target keywords corresponding to the type of the target UA; and judging whether preset characters exist in the target keywords. If they exist, the target monitoring data is determined to be web page (WEB) traffic data, and the WEB keyword library is updated with the target keywords. If they do not exist, the target monitoring data is determined to be application (APP) traffic data, and the APP keyword library is updated with the target keywords. The application uses the information of the target UA in the target monitoring data to determine whether the data is WEB traffic data or APP traffic data, so that the WEB keyword library and the APP keyword library can be updated automatically. This reduces manual intervention and in turn improves the accuracy of identifying target monitoring data as WEB traffic data or APP traffic data.

Description

Keyword library update method and device
Technical field
This application relates to the technical field of data processing, and in particular to a keyword library update method and device.
Background
Users can obtain a large amount of information through the information pushed by a service platform, which makes their lives more convenient and richer. To serve users better, the service platform formulates an information push strategy for each user according to the way that user browses information, so as to improve the user experience.
In general, the server of a monitoring platform receives the monitoring data (i.e. traffic information) generated after a user browses information, and matches the monitoring data against the keywords in a pre-established web page (WEB) keyword library and the keywords in an application (APP) keyword library. It thereby determines the source type of the monitoring data (WEB traffic data or APP traffic data), so that the service platform can formulate an information push strategy according to that source type.
However, the keywords in the above WEB keyword library and APP keyword library are all collected manually. It is therefore difficult to update the two libraries as new keywords appear, the work is time-consuming and laborious, and the accuracy of determining the source type of the monitoring data is low.
Summary of the invention
In view of this, the purpose of the embodiments of the present application is to provide a keyword library update method and device that can automatically update the WEB keyword library and the APP keyword library, reduce manual intervention, and in turn improve the accuracy of identifying target monitoring data as WEB traffic data or APP traffic data.
In a first aspect, an embodiment of the present application provides a keyword library update method, including:
obtaining target monitoring data, and extracting the information of a target user agent (UA) from the target monitoring data;
extracting, from the information of the target UA, the target keywords corresponding to the type of the target UA;
judging whether preset characters exist in the target keywords;
if they exist, determining that the target monitoring data is web page (WEB) traffic data, and updating the WEB keyword library with the target keywords;
if they do not exist, determining that the target monitoring data is application (APP) traffic data, and updating the APP keyword library with the target keywords.
With reference to the first aspect, an embodiment of the present application provides a first possible implementation of the first aspect, further including:
searching the information of the target UA for any keyword in the WEB keyword library;
if one is found, determining that the target monitoring data is WEB traffic data.
With reference to the first aspect, an embodiment of the present application provides a second possible implementation of the first aspect, wherein extracting, from the information of the target UA, the target keywords corresponding to the type of the target UA includes:
converting the character string contained in the information of the target UA to lowercase;
cutting the lowercase character string according to regular expressions to obtain multiple first candidate keywords;
cutting each first candidate keyword at spaces to obtain multiple second candidate keywords;
deleting, from the multiple second candidate keywords, the candidate keywords unrelated to the type feature of the target UA, to obtain the target keywords corresponding to the type of the target UA.
With reference to the first aspect, an embodiment of the present application provides a third possible implementation of the first aspect, further including:
presenting the updated WEB keyword library to back-office staff so that the back-office staff can verify it.
In a second aspect, an embodiment of the present application further provides a keyword library updating device, including:
an obtaining module, configured to obtain target monitoring data and extract the information of a target user agent (UA) from the target monitoring data;
an extraction module, configured to extract, from the information of the target UA, the target keywords corresponding to the type of the target UA;
a judgment module, configured to judge whether preset characters exist in the target keywords;
a first update module, configured to, if they exist, determine that the target monitoring data is web page (WEB) traffic data and update the WEB keyword library with the target keywords;
a second update module, configured to, if they do not exist, determine that the target monitoring data is application (APP) traffic data and update the APP keyword library with the target keywords.
With reference to the second aspect, an embodiment of the present application provides a first possible implementation of the second aspect, further including:
a searching module, configured to search the information of the target UA for any keyword in the WEB keyword library;
and, if one is found, determine that the target monitoring data is WEB traffic data.
With reference to the second aspect, an embodiment of the present application provides a second possible implementation of the second aspect, wherein:
the extraction module is specifically configured to convert the character string contained in the information of the target UA to lowercase;
cut the lowercase character string according to regular expressions to obtain multiple first candidate keywords;
cut each first candidate keyword at spaces to obtain multiple second candidate keywords;
and delete, from the multiple second candidate keywords, the candidate keywords unrelated to the type feature of the target UA, to obtain the target keywords corresponding to the type of the target UA.
With reference to the second aspect, an embodiment of the present application provides a third possible implementation of the second aspect, further including:
a verification module, configured to present the updated WEB keyword library to back-office staff so that the back-office staff can verify it.
In a third aspect, an embodiment of the present application further provides an electronic device, including a processor, a memory and a bus. The memory stores machine-readable instructions executable by the processor. When the electronic device runs, the processor and the memory communicate through the bus, and the machine-readable instructions, when executed by the processor, perform the steps of the first aspect or of any of the first to third possible implementations of the first aspect.
In a fourth aspect, an embodiment of the present application further provides a computer-readable storage medium on which a computer program is stored. When run by a processor, the computer program performs the steps of the first aspect or of any of the first to third possible implementations of the first aspect.
In the keyword library update method and device provided by the embodiments of the present application, the method includes: obtaining target monitoring data and extracting the information of a target user agent (UA) from the target monitoring data; extracting the target keywords corresponding to the target UA from the information of the target UA; and judging whether preset characters exist in the target keywords. If they exist, the target monitoring data is determined to be WEB traffic data and the WEB keyword library is updated with the target keywords; if they do not exist, the target monitoring data is determined to be application (APP) traffic data. The embodiments of the present application use the information of the target UA in the target monitoring data to determine whether the data is WEB traffic data or APP traffic data, so that the WEB keyword library and the APP keyword library can be updated automatically. This reduces manual intervention and in turn improves the accuracy of identifying target monitoring data as WEB traffic data or APP traffic data.
To make the above objects, features and advantages of the present application clearer and easier to understand, preferred embodiments are described in detail below with reference to the accompanying drawings.
Brief description of the drawings
To illustrate the technical solutions of the embodiments of the present application more clearly, the drawings needed in the embodiments are briefly introduced below. It should be understood that the following drawings show only some embodiments of the present application and should therefore not be regarded as limiting its scope. Those of ordinary skill in the art can obtain other related drawings from these drawings without creative effort.
Fig. 1 shows a flow chart of a keyword library update method provided by an embodiment of the present application;
Fig. 2 shows a flow chart of another keyword library update method provided by an embodiment of the present application;
Fig. 3 shows a structural schematic diagram of a keyword library updating device provided by an embodiment of the present application;
Fig. 4 shows a structural schematic diagram of an electronic device provided by an embodiment of the present application.
Detailed description of the embodiments
To make the purposes, technical solutions and advantages of the embodiments of the present application clearer, the technical solutions in the embodiments are described clearly and completely below with reference to the drawings. The described embodiments are obviously only some of the embodiments of the present application, not all of them. The components of the embodiments generally described and illustrated in the drawings can be arranged and designed in a variety of different configurations. Therefore, the following detailed description of the embodiments provided in the drawings is not intended to limit the claimed scope of the present application, but merely represents selected embodiments. All other embodiments obtained by those skilled in the art on the basis of the embodiments of the present application without creative effort fall within the protection scope of the present application.
At present, the server of a monitoring platform receives the monitoring data (i.e. traffic information) generated after a user browses information, and matches the monitoring data against the keywords in a pre-established web page (WEB) keyword library and the keywords in an application (APP) keyword library to determine the source type of the monitoring data (WEB traffic data or APP traffic data). However, the keywords in the WEB keyword library and the APP keyword library are all collected manually, so it is difficult to update the two libraries as new keywords appear. As a result, determining the source type of the monitoring data is inaccurate as well as time-consuming and laborious. To address this problem, the keyword library update method and device provided by the embodiments of the present application can automatically update the WEB keyword library and the APP keyword library, reduce manual intervention, and in turn improve the accuracy of identifying target monitoring data as WEB traffic data or APP traffic data.
To facilitate understanding of the embodiments, a keyword library update method disclosed in the embodiments of the present application is first described in detail.
Fig. 1 is a flow chart of the keyword library update method of an embodiment of the present application with a server as the executing subject. The specific steps are as follows:
S101: obtain target monitoring data, and extract the information of a target user agent (UA) from the target monitoring data.
In a specific implementation, after a device plays information, it sends monitoring data to the server so that the server can record the traffic generated by the device.
The server obtains the target monitoring data sent by the device in real time, and extracts the information of the target user agent (User Agent, UA) from the target monitoring data.
The information of the target UA may include the hardware platform, system software, application software and the like.
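The source does not specify the layout of the monitoring data. As a minimal sketch of S101, assume (hypothetically) that each monitoring record reaches the server as a mapping of HTTP-style header names to values; the function name `extract_ua_info` and the record layout are illustrative assumptions, not part of the application.

```python
from typing import Mapping, Optional


def extract_ua_info(monitoring_record: Mapping[str, str]) -> Optional[str]:
    """Return the User-Agent string carried by a monitoring record, if any.

    Assumption: the record is a flat mapping of header-like names to values.
    """
    for name, value in monitoring_record.items():
        # HTTP header names are case-insensitive, so compare in lowercase.
        if name.lower() == "user-agent" and value.strip():
            return value.strip()
    # No usable UA information in this record.
    return None
```

A record without a usable UA yields `None`, which a caller could treat the same way as the unknown-traffic case described later in the text.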
S102: extract, from the information of the target UA, the target keywords corresponding to the type of the target UA.
In a specific implementation, the target keywords corresponding to the target UA are extracted from the information of the target UA. The target keywords indicate the type of the target UA, which may include Baidu Browser, 360 Browser, IE Browser and the like.
The specific method of extracting the target keywords corresponding to the target UA is described in detail below and is not repeated here.
Before the target keywords corresponding to the target UA are extracted from the information of the target UA, the information of the target UA can be searched for any keyword in the web page (WEB) keyword library. If one is found, the target monitoring data is determined to be WEB traffic data.
The information of the target UA may carry information indicating the type of the target UA, so it can be matched directly against the pre-established WEB keyword library. If any keyword in the WEB keyword library is matched, the target monitoring data can be directly determined to be WEB traffic data.
If no keyword in the WEB keyword library exists in the information of the target UA, step S102 is performed to extract the target keywords corresponding to the target UA.
The WEB keyword library is constructed in advance and may include keywords such as 360Browser, 360Aphone Browser, Xiao Mi Browser, Baidu Browser and Sogou Browser.
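The pre-match against the WEB keyword library described above can be sketched as a plain substring search. This is an illustrative sketch only: the library contents are a lowercased subset of the examples in the text, and the case-insensitive substring comparison and the name `matches_web_library` are assumptions.

```python
from typing import Iterable

# Hypothetical seed library, lowercased; entries come from the examples in the text.
WEB_KEYWORD_LIBRARY = {"360browser", "baidu browser", "sogou browser"}


def matches_web_library(ua_info: str, library: Iterable[str]) -> bool:
    """Return True if any library keyword occurs in the UA string (case-insensitive)."""
    lowered = ua_info.lower()
    return any(keyword in lowered for keyword in library)
```

When this check succeeds the data can be classified as WEB traffic immediately; only on a miss does the more expensive keyword extraction need to run.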
S103: judge whether preset characters exist in the target keywords.
In a specific implementation, some browsers have low adoption or are rarely used. When a user browses web pages with such a browser, it is difficult to match the information of the target UA in the resulting target monitoring data against the WEB keyword library, because the library contains no keyword for that browser. Therefore, the target keywords corresponding to the target UA are extracted from the information of the target UA, and the target keywords are then searched for preset characters.
The preset characters may include explorer, 115Browser, wifibrowser and the like.
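A minimal sketch of the S103 check follows, assuming that "preset characters exist in the target keywords" means that a preset string occurs as a substring of at least one target keyword. The presets are the examples from the text, lowercased because the extracted keywords are lowercase; the function name is hypothetical.

```python
from typing import Iterable, Sequence

# Examples of preset characters from the text, lowercased to match the extracted keywords.
PRESET_CHARACTERS = ("explorer", "115browser", "wifibrowser")


def contains_preset_characters(
    target_keywords: Iterable[str],
    presets: Sequence[str] = PRESET_CHARACTERS,
) -> bool:
    """Return True if any preset string is a substring of any target keyword."""
    return any(preset in keyword for keyword in target_keywords for preset in presets)
```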
S104: if they exist, determine that the target monitoring data is web page (WEB) traffic data, and update the WEB keyword library with the target keywords.
After determining that the target monitoring data is WEB traffic data, the server automatically updates the WEB keyword library with the target keywords.
The server can also present the updated WEB keyword library to back-office staff so that they can verify it.
The updated WEB keyword library improves the accuracy of identifying target monitoring data as WEB traffic data.
S105: if they do not exist, determine that the target monitoring data is application (APP) traffic data, and update the APP keyword library with the target keywords.
In a specific implementation, if no keyword in the WEB keyword library exists in the information of the target UA, and no preset characters exist in the target keywords corresponding to the target UA, the target monitoring data is determined to be APP traffic data.
Because there are far more APPs than browsers, the APP keyword library contains a comparatively large number of keywords. To avoid the time and resources that matching the target keywords against the APP keyword library would consume, the target monitoring data is determined to be APP traffic data once it is established that no keyword in the WEB keyword library exists in the information of the target UA and that no preset characters exist in the target keywords.
The APP keyword library may include MicroMessenger/, QQ/, baiduboxapp, mmbang, Weibo, AlipayClient and the like.
It is worth noting that, after the target monitoring data is determined to be APP traffic data, the target keywords can also be used directly to update the APP keyword library.
The embodiments of the present application use the information of the target UA in the target monitoring data to determine whether the data is WEB traffic data or APP traffic data. This improves the accuracy of identifying target monitoring data as WEB traffic data or APP traffic data, and automatically updates the WEB keyword library and the APP keyword library with less manual intervention. As the number of identifications grows, each update to the WEB keyword library further improves accuracy, and fewer and fewer keywords need correction, which reduces wasted manpower.
It is worth noting that, when target keywords are extracted from the information of the target UA, if no target keyword is extracted, the target monitoring data is determined to be unknown traffic, that is, neither WEB traffic data nor APP traffic data. The unknown traffic can further be sent to the client of the back-office staff so that they can judge it.
After the target monitoring data is determined to be WEB traffic data or APP traffic data, information can be pushed to the user precisely according to the result, that is, through WEB or through APP.
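Putting steps S101 to S105 together, the decision flow can be sketched as below. This is a sketch under stated assumptions rather than the application's implementation: `KeywordLibraries`, `classify_and_update`, the `review_queue` list standing in for "send to back-office staff", and the pluggable `extract` callable standing in for the extraction of S102 are all hypothetical names.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Set, Tuple


@dataclass
class KeywordLibraries:
    web: Set[str] = field(default_factory=set)   # WEB keyword library (lowercase)
    app: Set[str] = field(default_factory=set)   # APP keyword library (lowercase)
    presets: Tuple[str, ...] = ("explorer", "115browser", "wifibrowser")
    review_queue: List[str] = field(default_factory=list)  # unknown traffic for staff


def classify_and_update(
    ua_info: str,
    extract: Callable[[str], List[str]],
    libs: KeywordLibraries,
) -> str:
    """Classify one UA string as 'WEB', 'APP' or 'UNKNOWN' and update the libraries."""
    lowered = ua_info.lower()
    # Pre-match: any existing WEB keyword in the UA information means WEB traffic.
    if any(keyword in lowered for keyword in libs.web):
        return "WEB"
    # S102: extract target keywords (the extraction method is supplied by the caller).
    target_keywords = extract(ua_info)
    if not target_keywords:
        # Nothing extracted: unknown traffic, handed to back-office staff.
        libs.review_queue.append(ua_info)
        return "UNKNOWN"
    # S103/S104: preset characters present, so WEB traffic; update the WEB library.
    if any(p in k for k in target_keywords for p in libs.presets):
        libs.web.update(target_keywords)
        return "WEB"
    # S105: otherwise APP traffic; update the APP library.
    libs.app.update(target_keywords)
    return "APP"
```

Note that the APP keyword library is never searched during classification, which mirrors the text's point about avoiding matches against the large APP library. Following the text literally, every target keyword is added to the chosen library, which is one reason the verification by back-office staff matters.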
The target keywords corresponding to the type of the target UA are extracted from the information of the target UA according to the method shown in Fig. 2. The specific steps are as follows:
S201: convert the character string contained in the information of the target UA to lowercase;
S202: cut the lowercase character string according to regular expressions to obtain multiple first candidate keywords;
S203: cut and recombine each first candidate keyword at spaces to obtain multiple second candidate keywords;
S204: delete, from the multiple second candidate keywords, the candidate keywords unrelated to the type feature of the target UA, to obtain the target keywords corresponding to the type of the target UA.
In a specific implementation, the character string contained in the information of the target UA is first converted to lowercase, and the lowercase string is then cut according to regular expressions. Specifically, the regular expression "\((.*?)\)|[0-9]+x[0-9]+" is used to replace all bracketed information and all resolution information in the lowercase string, and the whole lowercase string is then cut according to the regular expression "/[^ ]*" to obtain the first candidate keywords. The first candidate keywords carry all potentially valid keywords.
Then, each candidate keyword is cut and recombined at spaces. Specifically, when each part of a first candidate keyword is recombined, every sub-keyword that matches the regular expression ".*[~!@#$%^&*(),?;"|<>{}=+_\-\[\]].*" is rejected, and all non-empty keywords that remain after recombination are returned as the second candidate keywords.
Steps S201 to S203 remove the keywords corresponding to device information and system information contained in the information of the target UA, such as "(Linux; U; Android 2.2.1; zh-cn; HTC_Wildfire_A3333 Build/FRG83D)", as well as the keywords corresponding to version information carried by a browser or APP, such as "/533.1".
Finally, the candidate keywords unrelated to the feature of the target UA are deleted from the second candidate keywords to obtain the target keywords corresponding to the target UA.
A candidate keyword unrelated to the feature of the target UA is a keyword that cannot indicate the type of the target UA. For example, keywords such as "Mozilla" and "AppleWebKit" appear both in the UA information of common WEB traffic data and in the UA information of most APP traffic data.
Further examples include chorme, mbbms, symbianos, cfnetwork, build and the like.
Extracting the target keywords corresponding to the target UA by the above method eliminates the keywords corresponding to device information, system information and version information, as well as the candidate keywords unrelated to the feature of the target UA. This improves the accuracy of matching the target keywords against the preset characters and the WEB keyword library, while saving server resources.
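Steps S201 to S204 can be sketched as a short pipeline. The regular expressions are garbled in the published text, so the patterns below are a best-effort reading and should be treated as assumptions; the names `extract_target_keywords` and `TYPE_NEUTRAL`, and the choice of type-neutral tokens (a subset of the examples in the text), are likewise illustrative.

```python
import re
from typing import List

# Assumed patterns, read from the garbled expressions in the text.
BRACKETS_OR_RESOLUTION = re.compile(r"\((.*?)\)|[0-9]+x[0-9]+")
VERSION_TOKEN = re.compile(r"/[^ ]*")
SPECIAL_CHARS = re.compile(r'[~!@#$%^&*(),?;"|<>{}=+_\-\[\]]')

# Tokens seen in both WEB and APP user agents, so they say nothing about the UA type.
TYPE_NEUTRAL = {"mozilla", "applewebkit", "mbbms", "symbianos", "cfnetwork", "build"}


def extract_target_keywords(ua_info: str) -> List[str]:
    lowered = ua_info.lower()                                   # S201
    # S202: drop bracketed device/system info and resolutions, then cut at version tokens.
    stripped = BRACKETS_OR_RESOLUTION.sub(" ", lowered)
    first_candidates = VERSION_TOKEN.split(stripped)
    second_candidates = []
    for candidate in first_candidates:                          # S203
        # Cut at spaces, reject parts containing special characters, then recombine.
        parts = [p for p in candidate.split() if not SPECIAL_CHARS.search(p)]
        recombined = " ".join(parts)
        if recombined:
            second_candidates.append(recombined)
    # S204: delete candidates that cannot indicate the UA type.
    return [k for k in second_candidates if k not in TYPE_NEUTRAL]
```

For a hypothetical WeChat-style UA string the result keeps `micromessenger` while dropping the bracketed device information, the version numbers and the generic `mozilla` and `applewebkit` tokens. Generic leftovers such as `version` survive this sketch, so a production stop list would likely be longer.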
Based on the same inventive concept, an embodiment of the present application further provides a keyword library updating device corresponding to the keyword library update method. Because the device solves the problem on a principle similar to that of the above method, its implementation may refer to the implementation of the method, and overlapping details are not repeated.
As shown in Fig. 3, the keyword library updating device provided by another embodiment of the present application includes:
an obtaining module 301, configured to obtain target monitoring data and extract the information of a target user agent (UA) from the target monitoring data;
an extraction module 302, configured to extract, from the information of the target UA, the target keywords corresponding to the type of the target UA;
a judgment module 303, configured to judge whether preset characters exist in the target keywords;
a first update module 304, configured to, if they exist, determine that the target monitoring data is web page (WEB) traffic data and update the WEB keyword library with the target keywords;
a second update module 305, configured to, if they do not exist, determine that the target monitoring data is application (APP) traffic data and update the APP keyword library with the target keywords.
In one embodiment, the above keyword library updating device further includes:
a searching module 306, configured to search the information of the target UA for any keyword in the WEB keyword library;
and, if one is found, determine that the target monitoring data is WEB traffic data.
In another embodiment, the above extraction module 302 is specifically configured to:
convert the character string contained in the information of the target UA to lowercase;
cut the lowercase character string according to regular expressions to obtain multiple first candidate keywords;
cut each first candidate keyword at spaces to obtain multiple second candidate keywords;
delete, from the multiple second candidate keywords, the candidate keywords unrelated to the type feature of the target UA, to obtain the target keywords corresponding to the type of the target UA.
In yet another embodiment, the above keyword library updating device further includes:
a verification module 307, configured to present the updated WEB keyword library to back-office staff so that the back-office staff can verify it.
Fig. 4 shows the structure of an electronic device 400 provided by an embodiment of the present invention. The electronic device 400 includes at least one processor 401, at least one network interface 404 or other user interface 403, a memory 405, and at least one communication bus 402. The communication bus 402 implements the connection and communication between these components. The electronic device 400 optionally includes a user interface 403, including a display (for example, a touch screen, LCD, CRT, holographic imaging device or projector) and a keyboard or pointing device (for example, a mouse, trackball, touch-sensitive pad or touch screen).
The memory 405 may include read-only memory and random access memory, and provides instructions and data to the processor 401. Part of the memory 405 may also include non-volatile random access memory (NVRAM).
In some embodiments, the memory 405 stores the following elements, executable modules or data structures, or a subset or superset of them:
an operating system 4051, which contains various system programs for implementing various basic services and processing hardware-based tasks;
an application program module 4052, which contains various application programs, such as a desktop (launcher), a media player and a browser, for implementing various application services.
In the embodiments of the present invention, by calling the programs or instructions stored in the memory 405, the processor 401 is configured to: obtain target monitoring data, and extract the information of a target user agent (UA) from the target monitoring data;
extract, from the information of the target UA, the target keywords corresponding to the type of the target UA;
judge whether preset characters exist in the target keywords;
if they exist, determine that the target monitoring data is web page (WEB) traffic data, and update the WEB keyword library with the target keywords;
if they do not exist, determine that the target monitoring data is application (APP) traffic data, and update the APP keyword library with the target keywords.
Optionally, the method executed by the processor 401 further includes:
searching the information of the target UA for any keyword in the WEB keyword library;
if one is found, determining that the target monitoring data is WEB traffic data.
Optionally, in the method executed by the processor 401, extracting, from the information of the target UA, the target keywords corresponding to the type of the target UA includes:
converting the character string contained in the information of the target UA to lowercase;
cutting the lowercase character string according to regular expressions to obtain multiple first candidate keywords;
cutting each first candidate keyword at spaces to obtain multiple second candidate keywords;
deleting, from the multiple second candidate keywords, the candidate keywords unrelated to the type feature of the target UA, to obtain the target keywords corresponding to the type of the target UA.
Optionally, the method executed by the processor 401 further includes:
presenting the updated WEB keyword library to back-office staff so that the back-office staff can verify it.
The computer program product of keywords database update method and device provided by the embodiment of the present application, including store The computer readable storage medium of program code, the instruction that program code includes can be used for executing the side in previous methods embodiment Method, specific implementation can be found in embodiment of the method, and details are not described herein.
Specifically, which can be general storage medium, such as mobile disk, hard disk, on the storage medium Computer program when being run, above-mentioned keywords database update method is able to carry out, so as to automatically update WEB key word library With APP key word library, manual intervention is reduced, and then improving identification target monitoring data is WEB data on flows or APP data on flows Accuracy rate.
If the functions are implemented in the form of software functional units and sold or used as independent products, they may be stored in a processor-executable non-volatile computer-readable storage medium. Based on this understanding, the technical solution of the present application in essence, or the part that contributes to the prior art, or a part of the technical solution, may be embodied in the form of a software product. The computer software product is stored in a storage medium and includes several instructions that cause a computer device (which may be a personal computer, a server, a network device, or the like) to execute all or some of the steps of the methods described in the embodiments of the present application. The aforementioned storage medium includes various media that can store program code, such as a USB flash drive, a removable hard disk, a read-only memory (ROM), a random access memory (RAM), a magnetic disk, or an optical disc.
Finally, it should be noted that the embodiments described above are only specific implementations of the present application, intended to illustrate its technical solutions rather than to limit them, and the protection scope of the present application is not limited thereto. Although the present application has been described in detail with reference to the foregoing embodiments, those of ordinary skill in the art should understand that anyone skilled in the art may, within the technical scope disclosed by the present application, still modify the technical solutions recorded in the foregoing embodiments, readily conceive of variations, or make equivalent replacements of some of the technical features. Such modifications, variations, or replacements do not cause the essence of the corresponding technical solutions to depart from the spirit and scope of the technical solutions of the embodiments of the present application, and shall all fall within the protection scope of the present application. Therefore, the protection scope of the present application shall be subject to the protection scope of the claims.

Claims (10)

1. A keyword library update method, characterized by comprising:
obtaining target monitoring data, and extracting information of a target user agent (UA) from the target monitoring data;
extracting, from the information of the target UA, target keywords corresponding to the type of the target UA;
determining whether preset characters are present in the target keywords;
if so, determining that the target monitoring data is web page (WEB) traffic data, and updating a WEB keyword library with the target keywords;
if not, determining that the target monitoring data is application (APP) traffic data, and updating an APP keyword library with the target keywords.
2. The keyword library update method according to claim 1, characterized by further comprising:
searching the information of the target UA for any keyword in the WEB keyword library;
if such a keyword is found, determining that the target monitoring data is WEB traffic data.
3. The keyword library update method according to claim 1, characterized in that extracting, from the information of the target UA, the target keywords corresponding to the type of the target UA comprises:
converting the character string contained in the information of the target UA to lowercase;
splitting the lowercase character string according to a regular expression to obtain a plurality of first candidate keywords;
splitting each first candidate keyword on spaces to obtain a plurality of second candidate keywords;
deleting, from the plurality of second candidate keywords, the candidate keywords unrelated to the type characteristics of the target UA, to obtain the target keywords corresponding to the type of the target UA.
4. The keyword library update method according to claim 1, characterized by further comprising:
presenting the updated WEB keyword library to back-end staff, so that the back-end staff can verify it.
5. A keyword library update device, characterized by comprising:
an obtaining module, configured to obtain target monitoring data and extract information of a target user agent (UA) from the target monitoring data;
an extraction module, configured to extract, from the information of the target UA, target keywords corresponding to the type of the target UA;
a judgment module, configured to determine whether preset characters are present in the target keywords;
a first update module, configured to, if so, determine that the target monitoring data is web page (WEB) traffic data and update a WEB keyword library with the target keywords;
a second update module, configured to, if not, determine that the target monitoring data is application (APP) traffic data and update an APP keyword library with the target keywords.
6. The keyword library update device according to claim 5, characterized by further comprising:
a searching module, configured to search the information of the target UA for any keyword in the WEB keyword library;
and, if such a keyword is found, determine that the target monitoring data is WEB traffic data.
7. The keyword library update device according to claim 5, characterized in that:
the extraction module is specifically configured to convert the character string contained in the information of the target UA to lowercase;
split the lowercase character string according to a regular expression to obtain a plurality of first candidate keywords;
split each first candidate keyword on spaces to obtain a plurality of second candidate keywords;
and delete, from the plurality of second candidate keywords, the candidate keywords unrelated to the type characteristics of the target UA, to obtain the target keywords corresponding to the type of the target UA.
8. The keyword library update device according to claim 9, characterized by further comprising:
a verification module, configured to present the updated WEB keyword library to back-end staff, so that the back-end staff can verify it.
9. An electronic device, characterized by comprising: a processor, a memory, and a bus, wherein the memory stores machine-readable instructions executable by the processor; when the electronic device runs, the processor and the memory communicate through the bus; and when the machine-readable instructions are executed by the processor, the steps of the keyword library update method according to any one of claims 1 to 4 are performed.
10. A computer-readable storage medium, characterized in that a computer program is stored on the computer-readable storage medium, and when the computer program is run by a processor, the steps of the keyword library update method according to any one of claims 1 to 4 are performed.
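By way of a non-limiting illustration, the decision and update steps of claim 1 might be sketched in Python as follows. The claims do not state what the preset characters are, so the value `"mozilla"` in `PRESET_CHARACTERS` is purely hypothetical; the function name `classify_and_update` and the modelling of each keyword library as an in-memory set are likewise assumptions made for this example.

```python
from typing import Iterable, Set, Tuple

# Hypothetical preset characters; the claims do not specify them.
PRESET_CHARACTERS: Tuple[str, ...] = ("mozilla",)


def classify_and_update(
    target_keywords: Iterable[str],
    web_library: Set[str],
    app_library: Set[str],
) -> str:
    """Classify monitoring data as WEB or APP traffic and update the matching library."""
    keywords = list(target_keywords)
    # Determine whether the preset characters are present in the target keywords.
    has_preset = any(preset in keyword for keyword in keywords for preset in PRESET_CHARACTERS)
    if has_preset:
        web_library.update(keywords)  # assumed update rule: add the keywords to the set
        return "WEB"
    app_library.update(keywords)
    return "APP"


web_library: Set[str] = set()
app_library: Set[str] = set()
print(classify_and_update(["mozilla", "chrome"], web_library, app_library))
print(classify_and_update(["myshopapp", "iphone"], web_library, app_library))
```

Each call updates exactly one of the two libraries, which mirrors the mutually exclusive branches of the claim.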
CN201910421356.5A 2019-05-21 2019-05-21 A kind of keywords database update method and device Pending CN110134703A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910421356.5A CN110134703A (en) 2019-05-21 2019-05-21 A kind of keywords database update method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910421356.5A CN110134703A (en) 2019-05-21 2019-05-21 A kind of keywords database update method and device

Publications (1)

Publication Number Publication Date
CN110134703A true CN110134703A (en) 2019-08-16

Family

ID=67571867

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910421356.5A Pending CN110134703A (en) 2019-05-21 2019-05-21 A kind of keywords database update method and device

Country Status (1)

Country Link
CN (1) CN110134703A (en)

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20140169752A1 (en) * 2012-12-14 2014-06-19 Motorola Solutions, Inc. Computer assisted dispatch incident report video search and tagging systems and methods
CN103186675A (en) * 2013-04-03 2013-07-03 南京安讯科技有限责任公司 Automatic webpage classification method based on network hot word identification
CN103246703A (en) * 2013-04-03 2013-08-14 百度在线网络技术(北京)有限公司 Method and equipment for determining application word banks
CN107346182A (en) * 2016-05-05 2017-11-14 北京搜狗科技发展有限公司 A kind of method for building user thesaurus and the device for building user thesaurus
US20180130465A1 (en) * 2016-11-10 2018-05-10 Linearhub Apparatus and method for correcting pronunciation by contextual recognition

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110795668A (en) * 2019-10-28 2020-02-14 北京博睿宏远数据科技股份有限公司 Website data analysis method, device, equipment and storage medium
CN113382000A (en) * 2021-06-09 2021-09-10 北京天融信网络安全技术有限公司 UA character string anomaly detection method, device, equipment and medium
CN113342866A (en) * 2021-06-22 2021-09-03 广州华多网络科技有限公司 Keyword updating method and device, computer equipment and storage medium
CN113342866B (en) * 2021-06-22 2022-06-21 广州华多网络科技有限公司 Keyword updating method and device, computer equipment and storage medium

Similar Documents

Publication Publication Date Title
US10809984B2 (en) System for generating functionality representation, indexing, searching, componentizing, and analyzing of source code in codebases and method thereof
Nguyen et al. Graph-based statistical language model for code
US9665365B2 (en) Transparently upgrading derived database objects
Robbes et al. How program history can improve code completion
US20110087670A1 (en) Systems and methods for concept mapping
CN110134703A (en) A kind of keywords database update method and device
EP2610765A1 (en) Systems and methods for migrating database data
US10303469B1 (en) Commit graph generation
CN103597469A (en) Live browser tooling in an integrated development environment
CN105550206B (en) The edition control method and device of structured query sentence
CN105528416B (en) A kind of monitoring method and system of network upgrade content
WO2016076906A1 (en) Testing insecure computing environments using random data sets generated from characterizations of real data sets
CN111444181A (en) Knowledge graph updating method and device and electronic equipment
Kpodjedo et al. Madmatch: Many-to-many approximate diagram matching for design comparison
Lamela Seijas et al. Towards property-based testing of RESTful web services
US20230012642A1 (en) Method and device for snapshotting metadata, and storage medium
US20050166115A1 (en) Method for performing software stress test
CN110989991B (en) Method and system for detecting source code clone open source software in application program
US20120284224A1 (en) Build of website knowledge tables
CN113094367A (en) Data processing method and device and server
JP6870454B2 (en) Analytical equipment, analytical programs and analytical methods
CN110309315B (en) Template file generation method and device, computer readable medium and electronic equipment
CN110851517A (en) Source data extraction method, device and equipment and computer storage medium
Ivkovic et al. Enhancing domain-specific software architecture recovery
JP5020274B2 (en) Semantic drift occurrence evaluation method and apparatus

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20190816