CN110134703A - A kind of keywords database update method and device - Google Patents
A kind of keywords database update method and device Download PDFInfo
- Publication number
- CN110134703A CN110134703A CN201910421356.5A CN201910421356A CN110134703A CN 110134703 A CN110134703 A CN 110134703A CN 201910421356 A CN201910421356 A CN 201910421356A CN 110134703 A CN110134703 A CN 110134703A
- Authority
- CN
- China
- Prior art keywords
- target
- keywords
- monitoring data
- information
- data
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/23—Updating
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/903—Querying
- G06F16/90335—Query processing
- G06F16/90344—Query processing by using string matching techniques
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/953—Querying, e.g. by the use of web search engines
- G06F16/9535—Search customisation based on user profiles and personalisation
Abstract
This application provides a kind of keywords database update method and devices, wherein the keywords database update method includes obtaining target monitoring data, and the information of destination user agent UA is extracted from target monitoring data;The corresponding target keywords of type of target UA are extracted from the information of target UA;Judge in target keywords with the presence or absence of preset characters;If it exists, it is determined that target monitoring data are webpage WEB data on flows, and update WEB key word library using target keywords;If it does not exist, it is determined that target monitoring data are application APP data on flows, and update APP key word library using target keywords.The application utilizes the information of target UA in target monitoring data, to determine that the target monitoring data are WEB data on flows, or it is APP data on flows, WEB key word library and APP key word library can be automatically updated, manual intervention is reduced, and then improves the accuracy rate that identification target monitoring data are WEB data on flows or APP data on flows.
Description
Technical field
This application involves technical field of data processing, in particular to a kind of keywords database update method and device.
Background technique
The information that user is pushed by service platform, available a large amount of information, and then convenience and the life for enriching oneself
It is living.Service platform browses the mode of information according to user to preferably service user, pushes information to be established as the user
Strategy, so that user experience is high.
In general, the server of monitoring platform, which receives user, browses monitoring data (the i.e. flow of information generated after information
Information), by monitoring data in the webpage WEB key word library that pre-establishes keyword, in application APP key word library
Keyword is matched, and then determines the source type (as WEB data on flows or being APP data on flows) of the monitoring data,
So that service platform can formulate the strategy of push information according to the source type of the monitoring data.
But all artificial collections of keyword in above-mentioned WEB key word library and APP key word library, it is difficult to according to new
Keyword update WEB key word library and APP key word library, take time and effort, and make the source class for determining the monitoring data
When type, accuracy rate is low.
Summary of the invention
In view of this, the embodiment of the present application is designed to provide a kind of keywords database update method and device, can from
It is dynamic to update WEB key word library and APP key word library, manual intervention is reduced, and then improving identification target monitoring data is WEB flow
The accuracy rate of data or APP data on flows.
In a first aspect, the embodiment of the present application provides a kind of keywords database update method, wherein include:
Target monitoring data are obtained, and extract the information of destination user agent UA from the target monitoring data;
The corresponding target keywords of type of the target UA are extracted from the information of the target UA;
Judge in the target keywords with the presence or absence of preset characters;
If it exists, it is determined that the target monitoring data are webpage WEB data on flows, and more using the target keywords
New WEB key word library;
If it does not exist, it is determined that the target monitoring data are application APP data on flows, and are closed using the target
Key word updates APP key word library.
With reference to first aspect, the embodiment of the present application provides the first possible embodiment of first aspect, wherein also
Include:
Search whether that there are any keywords in WEB key word library from the information of the target UA;
If it exists, it is determined that the target monitoring data are WEB data on flows.
With reference to first aspect, the embodiment of the present application provides second of possible embodiment of first aspect, wherein institute
State the corresponding target keywords of type that the target UA is extracted from the information of the target UA, comprising:
The character string that information by the target UA includes is converted into lowercase versions;
The character string of lowercase versions is cut according to regular expressions, obtains multiple first candidate keys;
Each first candidate key is cut according to space, obtains multiple second candidate keys;
From the multiple second candidate key, the candidate key unrelated with the type feature of the target UA is deleted
Later, the corresponding target keywords of type of the target UA are obtained.
With reference to first aspect, the embodiment of the present application provides the third possible embodiment of first aspect, wherein also
Include:
Updated WEB key word library is showed into background work personnel, so that the background work personnel verify.
Second aspect, the embodiment of the present application also provides a kind of keywords database updating devices, wherein includes:
Module is obtained, extracts target user's generation for obtaining target monitoring data, and from the target monitoring data
Manage the information of UA;
Extraction module, for extracting the corresponding target critical of type of the target UA from the information of the target UA
Word;
Judgment module, for judging in the target keywords with the presence or absence of preset characters;
First update module, for if it exists, it is determined that the target monitoring data are webpage WEB data on flows, and sharp
WEB key word library is updated with the target keywords;
Second update module, for if it does not exist, it is determined that the target monitoring data are application APP flow number
According to, and APP key word library is updated using the target keywords.
In conjunction with second aspect, the embodiment of the present application provides the first possible embodiment of second aspect, wherein also
Include:
Searching module, for searching whether that there are any keys in WEB key word library from the information of the target UA
Word;
If it exists, it is determined that the target monitoring data are WEB data on flows.
In conjunction with second aspect, the embodiment of the present application provides second of possible embodiment of second aspect, wherein packet
It includes:
The extraction module is converted into lowercase versions specifically for the character string for including by the information of the target UA;
The character string of lowercase versions is cut according to regular expressions, obtains multiple first candidate keys;
Each first candidate key is cut according to space, obtains multiple second candidate keys;
From the multiple second candidate key, the candidate key unrelated with the type feature of the target UA is deleted
Later, the corresponding target keywords of type of the target UA are obtained.
In conjunction with second aspect, the embodiment of the present application provides the third possible embodiment of second aspect, wherein also
Include:
Correction verification module, for updated WEB key word library to be showed background work personnel, so that the background work
Personnel verify.
The third aspect, the embodiment of the present application also provide a kind of electronic equipment, comprising: processor, memory and bus, it is described
Memory is stored with the executable machine readable instructions of the processor, when electronic equipment operation, the processor with it is described
By bus communication between memory, the machine readable instructions executed when being executed by the processor it is above-mentioned in a first aspect, or
The possible embodiment of the first of first aspect any possibility into the third possible embodiment of first aspect
Embodiment in step.
Fourth aspect, the embodiment of the present application also provide a kind of computer readable storage medium, the computer-readable storage medium
Computer program is stored in matter, which executes above-mentioned in a first aspect, or first aspect when being run by processor
The first possible embodiment any possible embodiment into the third possible embodiment of first aspect
In step.
A kind of keywords database update method provided by the embodiments of the present application and device, wherein the keywords database update method
Including obtaining target monitoring data, and extract from target monitoring data the information of destination user agent UA;From target UA's
The corresponding target keywords of target UA are extracted in information;Judge in target keywords with the presence or absence of preset characters;If it exists, then really
The monitoring data that set the goal are WEB data on flows, and update WEB key word library using the target keywords;If it does not exist, it is determined that
Target monitoring data are application APP data on flows.The embodiment of the present application utilizes the information of target UA in target monitoring data,
Come determine the target monitoring data be WEB data on flows, or be APP data on flows, can automatically update WEB key word library and
APP key word library reduces manual intervention, and then improving identification target monitoring data is WEB data on flows or APP data on flows
Accuracy rate.
To enable the above objects, features, and advantages of the application to be clearer and more comprehensible, preferred embodiment is cited below particularly, and cooperate
Appended attached drawing, is described in detail below.
Detailed description of the invention
Technical solution in ord to more clearly illustrate embodiments of the present application, below will be to needed in the embodiment attached
Figure is briefly described, it should be understood that the following drawings illustrates only some embodiments of the application, therefore is not construed as pair
The restriction of range for those of ordinary skill in the art without creative efforts, can also be according to this
A little attached drawings obtain other relevant attached drawings.
Fig. 1 shows a kind of flow chart of keywords database update method provided by the embodiment of the present application;
Fig. 2 shows the flow charts of another kind keywords database update method provided by the embodiment of the present application;
Fig. 3 shows a kind of structural schematic diagram of keywords database updating device provided by the embodiment of the present application;
Fig. 4 shows the structural schematic diagram of electronic equipment provided by the embodiment of the present application.
Specific embodiment
To keep the purposes, technical schemes and advantages of the embodiment of the present application clearer, below in conjunction with the embodiment of the present application
Middle attached drawing, the technical scheme in the embodiment of the application is clearly and completely described, it is clear that described embodiment is only
It is some embodiments of the present application, instead of all the embodiments.The application being usually described and illustrated herein in the accompanying drawings is real
The component for applying example can be arranged and be designed with a variety of different configurations.Therefore, below to the application's provided in the accompanying drawings
The detailed description of embodiment is not intended to limit claimed scope of the present application, but is merely representative of the selected reality of the application
Apply example.Based on embodiments herein, those skilled in the art institute obtained without making creative work
There are other embodiments, shall fall in the protection scope of this application.
Currently, the server of monitoring platform, which receives user, browses monitoring data (the i.e. flow of information generated after information
Information), by monitoring data in the webpage WEB key word library that pre-establishes keyword, in application APP key word library
Keyword is matched, and then determines the source type (as WEB data on flows or being APP data on flows) of the monitoring data.
But all artificial collections of keyword in WEB key word library and APP key word library, it is difficult to be updated according to new keyword
WEB key word library and APP key word library, when so that determining the source type of the monitoring data, accuracy rate is low, takes time and effort.Needle
To the above problem, a kind of keywords database update method provided by the embodiments of the present application and device can automatically update WEB keyword
Library and APP key word library reduce manual intervention, and then improving identification target monitoring data is WEB data on flows or APP flow number
According to accuracy rate.
For convenient for understanding the embodiment of the present application, first more to a kind of keywords database disclosed in the embodiment of the present application
New method describes in detail.
As shown in Figure 1, be the embodiment of the present application using server as executing subject when keywords database update method flow chart,
Specific step is as follows:
S101 obtains target monitoring data, and the information of destination user agent UA is extracted from target monitoring data.
In specific implementation, after device plays information, monitoring data will be sent to server, in order to server
Record the flow of equipment generation.
Server obtains the target monitoring data of equipment transmission in real time, and extracts target user from target monitoring data
Act on behalf of the information of (User Agent, UA)
Wherein, the information of target UA may include hardware platform, system software, application software etc..
S102 extracts the corresponding target keywords of type of target UA from the information of target UA.
In specific implementation, the corresponding target keywords of target UA, the target keywords are extracted from the information of target UA
The type of target UA is indicated, the type of target UA may include baidu browser, 360 browsers, IE browser etc..
The specific method for extracting the corresponding target keywords of target UA, illustrates in detail below, does not do herein excessive
It repeats.
It, can be from the information of target UA before extracting the corresponding target keywords of target UA in the information from target UA
Search whether that there are any keywords in webpage WEB key word library;If it exists, it is determined that target monitoring data are WEB flow
Data.
The information of the type of instruction target UA can be carried in the information of target UA, can directly with the WEB that pre-establishes
Key word library is matched, if being matched to any keyword in WEB key word library, can directly determine target monitoring number
According to for WEB data on flows.
If there is no any keywords in WEB key word library in the information of target UA, step 102 is carried out, extracts mesh
Mark the corresponding target keywords of UA.
Wherein, WEB key word library constructs in advance, may include 360Browser, 360Aphone Browser,
The keywords such as Xiao Mi Browser, Baidu Browser, Sogou Browser.
S103 judges in target keywords with the presence or absence of preset characters.
In specific implementation, there are the browser that the lower browser of popularity rate and user are not frequently used, Yong Huli
Webpage, the information of the target UA in the target monitoring data of generation, with the progress of WEB key word library are browsed with above-mentioned browser
Match, it is difficult to match the corresponding keyword of above-mentioned browser.Therefore, the corresponding target of target UA is extracted from the information of target UA
Keyword further searches in target keywords with the presence or absence of preset characters.
Wherein, which may include explorer, 115Browser, wifibrowser etc..
S104, and if it exists, then determine that target monitoring data are webpage WEB data on flows, and utilize the target keywords
Update WEB key word library.
After determining that target monitoring data are WEB data on flows, server by utilizing target keywords automatically update the pass WEB
Key character library.
Updated WEB key word library can also be showed background work personnel by server, so that background work personnel
It is verified.
By updated WEB key word library, it is the accurate of WEB data on flows that identification target monitoring data, which can be improved,
Rate.
S105, if it does not exist, it is determined that target monitoring data are application APP data on flows, and utilize the target
Keyword updates APP key word library.
In specific implementation, if any keyword in WEB key word library is not present in the information of target UA, and target
Also preset characters are not present in the corresponding target keywords of UA, it is determined that target monitoring data are application APP data on flows.
Due to APP quantity far more than browser quantity, the quantity of the keyword in APP key word library compares
Greatly, it when being matched in order to avoid target keywords with APP key word library, wastes time and the problem of resource, is determining target
There is no any keywords in WEB key word library in the information of UA, and determine in target keywords there is no when preset characters
Target monitoring data are application APP data on flows.
Wherein, APP key word library may include MicroMessenger/, QQ/, baiduboxapp, mmbang, Weibo,
AlipayClient etc..
It is worth noting that after determining that target monitoring data are application APP data on flows, it can also be directly sharp
APP key word library is updated with target keywords.
The embodiment of the present application utilizes the information of target UA in target monitoring data, to determine that the target monitoring data are WEB
Data on flows, or it is APP data on flows, can be improved identification target monitoring data is WEB data on flows or APP data on flows
Accuracy rate, automatically update WEB key word library, APP key word library, reduce manual intervention.With the increase of identification number, every time
Accuracy rate can be continuously improved in update to WEB key word library, and the keyword of required correction also can be fewer and fewer, to subtract
The waste of few manpower.
It is worth noting that when extracting the corresponding target keywords of target UA in the information from target UA, if not extracting
To target keywords, it is determined that the target monitoring data are unknown flow rate, that is, are not belonging to WEB data on flows, are also not belonging to APP
Data on flows.Further, which can be sent to the client of background work personnel, so that backstage work
Make personnel to judge the unknown flow rate.
It, can be according to judging result to this after determining that target monitoring data are WEB data on flows or APP data on flows
The information of user is precisely pushed, i.e., is pushed by WEB, or is pushed by APP.
The corresponding target keywords of type of target UA are extracted from the information of target UA according to method shown in Fig. 2,
In, the specific steps are as follows:
S201, the character string that the information by target UA includes are converted into lowercase versions;
S202 according to regular expressions cuts the character string of lowercase versions, obtains multiple first candidate keys;
S203 is cut and is recombinated to each first candidate key according to space, obtains multiple second candidate keys
Word;
S204, from multiple second candidate keys, delete the candidate key unrelated with the type feature of target UA it
Afterwards, the corresponding target keywords of type of target UA are obtained.
In specific implementation, the character string that the information of target UA includes is converted into lowercase versions first, according to canonical table
Cut up to character string of the formula to lowercase versions, specifically, according to regular expressions " ((.* ?)) | [0-9]+x [0-9]
+ " information and resolution information in the character strings of replacement lowercase versions in all brackets, then according to regular expression "/
[^] * " cuts the character string of entire lowercase versions, obtains the first candidate key;Wherein, in the first candidate key
Carry all possible effective keywords.
Then, candidate key is cut and is recombinated according to space, specifically, the first candidate key of recombination is every
Rejected when a part it is all can match regular expression " .* [~!@# $ %^&* (),?;" |<>{ }=+ _-[]] .* "
Sub- keyword, finally by after all recombinations non-null key return be used as the second candidate key.
By step 201-203, the corresponding keyword of the facility information for including in the information of target UA can be removed and be
The corresponding keyword of information of uniting, such as " (Linux;U;Android 2.2.1;zh-cn;HTC_Wildfire_A3333Build/
) ", and the corresponding keyword of version information etc. of similar "/533.1 " browser or APP that carries FRG83D.
Finally, deleting the candidate key unrelated with the feature of target UA from the second candidate key, obtaining target UA
Corresponding target keywords.
Wherein, the candidate key unrelated with the feature of target UA is the keyword that can not indicate the type of target UA, example
Such as " Mozilla ", the keywords such as " AppleWebKit " be not only present in the UA information of common WEB data on flows, but also existed
In the UA information of most APP datas on flows.
In addition, further including chorme, mbbms, symbianos, cfnetwork, build etc..
By the above method, extract the corresponding target keywords of target UA, can with the corresponding keyword of eliminating equipment information,
The corresponding keyword of system information, the corresponding keyword of version information, candidate key unrelated with the feature of target UA etc., mention
High target keywords and preset characters, the matched accuracy rate of WEN key word library, while saving server resource.
Based on the same inventive concept, the embodiment of the present application also provides keywords databases corresponding with keywords database update method
Updating device, the above-mentioned keywords database of principle and the embodiment of the present application solved the problems, such as due to the device in the embodiment of the present application are updated
Method is similar, therefore the implementation of device may refer to the implementation of method, and overlaps will not be repeated.
Shown in Figure 3, keywords database updating device provided by the another embodiment of the application includes:
Module 301 is obtained, extracts target user for obtaining target monitoring data, and from the target monitoring data
Act on behalf of the information of UA;
Extraction module 302, the corresponding target of type for extracting the target UA from the information of the target UA are closed
Key word;
Judgment module 303, for judging in the target keywords with the presence or absence of preset characters;
First update module 304, for if it exists, it is determined that the target monitoring data are webpage WEB data on flows, and
WEB key word library is updated using the target keywords;
Second update module 305, for if it does not exist, it is determined that the target monitoring data are application APP flow
Data, and APP key word library is updated using the target keywords.
In one embodiment, above-mentioned keywords database updating device further include:
Searching module 306, for searching whether that there are any passes in WEB key word library from the information of the target UA
Key word;
If it exists, it is determined that the target monitoring data are WEB data on flows.
In another embodiment, said extracted module 302, is specifically used for:
The character string that information by the target UA includes is converted into lowercase versions;
The character string of lowercase versions is cut according to regular expressions, obtains multiple first candidate keys;
Each first candidate key is cut according to space, obtains multiple second candidate keys;
From the multiple second candidate key, the candidate key unrelated with the type feature of the target UA is deleted
Later, the corresponding target keywords of type of the target UA are obtained.
In yet another embodiment, above-mentioned keywords database updating device further include:
Correction verification module 307, for updated WEB key word library to be showed background work personnel, so that the backstage
Staff verifies.
Fig. 4 describes the structure of a kind of electronic equipment 400 provided in an embodiment of the present invention, the electronic equipment 400 include: to
A few processor 401, at least one network interface 404 or other users interface 403, memory 405, at least one communication
Bus 402.Communication bus 402 is for realizing the connection communication between these components.The electronic equipment 400 optionally includes user
Interface 403, including display is (for example, touch screen, LCD, CRT, holographic imaging (Holographic) or projection
(Projector) etc.), keyboard or pointing device are (for example, mouse, trace ball (trackball), touch-sensitive plate or touch screen
Deng).
Memory 405 may include read-only memory and random access memory, and provide instruction sum number to processor 401
According to.The a part of of memory 405 can also include nonvolatile RAM (NVRAM).
In some embodiments, memory 405 stores following element, executable modules or data structures, or
Their subset of person or their superset:
Operating system 4051 includes various system programs, hardware based for realizing various basic businesses and processing
Task;
Application program module 4052 includes various application programs, such as desktop (launcher), media player (Media
Player), browser (Browser) etc., for realizing various applied business.
In embodiments of the present invention, by the program or instruction of calling memory 405 to store, processor 401 is used for: being obtained
Target monitoring data, and extract from the target monitoring data information of destination user agent UA;
The corresponding target keywords of type of the target UA are extracted from the information of the target UA;
Judge in the target keywords with the presence or absence of preset characters;
If it exists, it is determined that the target monitoring data are webpage WEB data on flows, and more using the target keywords
New WEB key word library;
If it does not exist, it is determined that the target monitoring data are application APP data on flows, and are closed using the target
Key word updates APP key word library.
Optionally, in the method that processor 401 executes, further includes:
Search whether that there are any keywords in WEB key word library from the information of the target UA;
If it exists, it is determined that the target monitoring data are WEB data on flows.
Optionally, described to extract the target UA's from the information of the target UA in the method that processor 401 executes
The corresponding target keywords of type, comprising:
The character string that information by the target UA includes is converted into lowercase versions;
The character string of lowercase versions is cut according to regular expressions, obtains multiple first candidate keys;
Each first candidate key is cut according to space, obtains multiple second candidate keys;
From the multiple second candidate key, the candidate key unrelated with the type feature of the target UA is deleted
Later, the corresponding target keywords of type of the target UA are obtained.
Optionally, in the method that processor 401 executes, further includes:
Updated WEB key word library is showed into background work personnel, so that the background work personnel verify.
The computer program product of keywords database update method and device provided by the embodiment of the present application, including store
The computer readable storage medium of program code, the instruction that program code includes can be used for executing the side in previous methods embodiment
Method, specific implementation can be found in embodiment of the method, and details are not described herein.
Specifically, which can be general storage medium, such as mobile disk, hard disk, on the storage medium
Computer program when being run, above-mentioned keywords database update method is able to carry out, so as to automatically update WEB key word library
With APP key word library, manual intervention is reduced, and then improving identification target monitoring data is WEB data on flows or APP data on flows
Accuracy rate.
It, can be with if the function is realized in the form of SFU software functional unit and when sold or used as an independent product
It is stored in the executable non-volatile computer-readable storage medium of a processor.Based on this understanding, the application
Technical solution substantially the part of the part that contributes to existing technology or the technical solution can be with software in other words
The form of product embodies, which is stored in a storage medium, including some instructions use so that
One computer equipment (can be personal computer, server or the network equipment etc.) executes each embodiment institute of the application
State all or part of the steps of method.And storage medium above-mentioned includes: USB flash disk, mobile hard disk, read-only memory (Read-Only
Memory, ROM), random access memory (Random Access Memory, RAM), magnetic or disk etc. is various to deposit
Store up the medium of program code.
Finally, it should be noted that embodiment described above, the only specific embodiment of the application, to illustrate the application
Technical solution, rather than its limitations, the protection scope of the application is not limited thereto, although with reference to the foregoing embodiments to this Shen
It please be described in detail, those skilled in the art should understand that: anyone skilled in the art
Within the technical scope of the present application, it can still modify to technical solution documented by previous embodiment or can be light
It is readily conceivable that variation or equivalent replacement of some of the technical features;And these modifications, variation or replacement, do not make
The essence of corresponding technical solution is detached from the spirit and scope of the embodiment of the present application technical solution, should all cover the protection in the application
Within the scope of.Therefore, the protection scope of the application shall be subject to the protection scope of the claim.
Claims (10)
1. a kind of keywords database update method characterized by comprising
Target monitoring data are obtained, and extract the information of destination user agent UA from the target monitoring data;
The corresponding target keywords of type of the target UA are extracted from the information of the target UA;
Judge in the target keywords with the presence or absence of preset characters;
If it exists, it is determined that the target monitoring data are webpage WEB data on flows, and are updated using the target keywords
WEB key word library;
If it does not exist, it is determined that the target monitoring data are application APP data on flows, and utilize the target keywords
Update APP key word library.
2. keywords database update method according to claim 1, which is characterized in that further include:
Search whether that there are any keywords in WEB key word library from the information of the target UA;
If it exists, it is determined that the target monitoring data are WEB data on flows.
3. keywords database update method according to claim 1, which is characterized in that described from the information of the target UA
Extract the corresponding target keywords of type of the target UA, comprising:
The character string that information by the target UA includes is converted into lowercase versions;
The character string of lowercase versions is cut according to regular expressions, obtains multiple first candidate keys;
Each first candidate key is cut according to space, obtains multiple second candidate keys;
From the multiple second candidate key, delete the candidate key unrelated with the type feature of the target UA it
Afterwards, the corresponding target keywords of type of the target UA are obtained.
4. keywords database update method according to claim 1, which is characterized in that further include:
Updated WEB key word library is showed into background work personnel, so that the background work personnel verify.
5. a kind of keywords database updating device characterized by comprising
Module is obtained, extracts destination user agent UA for obtaining target monitoring data, and from the target monitoring data
Information;
Extraction module, for extracting the corresponding target keywords of type of the target UA from the information of the target UA;
Judgment module, for judging in the target keywords with the presence or absence of preset characters;
First update module, for if it exists, it is determined that the target monitoring data are webpage WEB data on flows, and utilize institute
It states target keywords and updates WEB key word library;
Second update module, for if it does not exist, it is determined that the target monitoring data are application APP data on flows, and
APP key word library is updated using the target keywords.
6. keywords database updating device according to claim 5, which is characterized in that further include:
Searching module, for searching whether that there are any keywords in WEB key word library from the information of the target UA;
If it exists, it is determined that the target monitoring data are WEB data on flows.
7. keywords database updating device according to claim 5 characterized by comprising
The extraction module is converted into lowercase versions specifically for the character string for including by the information of the target UA;
The character string of lowercase versions is cut according to regular expressions, obtains multiple first candidate keys;
Each first candidate key is cut according to space, obtains multiple second candidate keys;
From the multiple second candidate key, delete the candidate key unrelated with the type feature of the target UA it
Afterwards, the corresponding target keywords of type of the target UA are obtained.
8. keywords database updating device according to claim 9, which is characterized in that further include:
Correction verification module, for updated WEB key word library to be showed background work personnel, so that the background work personnel
It is verified.
9. a kind of electronic equipment characterized by comprising processor, memory and bus, the memory are stored with the place
The executable machine readable instructions of device are managed, when electronic equipment operation, pass through bus between the processor and the memory
Communication, keywords database of the execution as described in Claims 1-4 is any be more when the machine readable instructions are executed by the processor
The step of new method.
10. a kind of computer readable storage medium, which is characterized in that be stored with computer journey on the computer readable storage medium
Sequence executes the keywords database update method as described in Claims 1-4 any one when the computer program is run by processor
The step of.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910421356.5A CN110134703A (en) | 2019-05-21 | 2019-05-21 | A kind of keywords database update method and device |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910421356.5A CN110134703A (en) | 2019-05-21 | 2019-05-21 | A kind of keywords database update method and device |
Publications (1)
Publication Number | Publication Date |
---|---|
CN110134703A true CN110134703A (en) | 2019-08-16 |
Family
ID=67571867
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201910421356.5A Pending CN110134703A (en) | 2019-05-21 | 2019-05-21 | A kind of keywords database update method and device |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN110134703A (en) |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110795668A (en) * | 2019-10-28 | 2020-02-14 | 北京博睿宏远数据科技股份有限公司 | Website data analysis method, device, equipment and storage medium |
CN113342866A (en) * | 2021-06-22 | 2021-09-03 | 广州华多网络科技有限公司 | Keyword updating method and device, computer equipment and storage medium |
CN113382000A (en) * | 2021-06-09 | 2021-09-10 | 北京天融信网络安全技术有限公司 | UA character string anomaly detection method, device, equipment and medium |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103186675A (en) * | 2013-04-03 | 2013-07-03 | 南京安讯科技有限责任公司 | Automatic webpage classification method based on network hot word identification |
CN103246703A (en) * | 2013-04-03 | 2013-08-14 | 百度在线网络技术(北京)有限公司 | Method and equipment for determining application word banks |
US20140169752A1 (en) * | 2012-12-14 | 2014-06-19 | Motorola Solutions, Inc. | Computer assisted dispatch incident report video search and tagging systems and methods |
CN107346182A (en) * | 2016-05-05 | 2017-11-14 | 北京搜狗科技发展有限公司 | A kind of method for building user thesaurus and the device for building user thesaurus |
US20180130465A1 (en) * | 2016-11-10 | 2018-05-10 | Linearhub | Apparatus and method for correcting pronunciation by contextual recognition |
-
2019
- 2019-05-21 CN CN201910421356.5A patent/CN110134703A/en active Pending
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20140169752A1 (en) * | 2012-12-14 | 2014-06-19 | Motorola Solutions, Inc. | Computer assisted dispatch incident report video search and tagging systems and methods |
CN103186675A (en) * | 2013-04-03 | 2013-07-03 | 南京安讯科技有限责任公司 | Automatic webpage classification method based on network hot word identification |
CN103246703A (en) * | 2013-04-03 | 2013-08-14 | 百度在线网络技术(北京)有限公司 | Method and equipment for determining application word banks |
CN107346182A (en) * | 2016-05-05 | 2017-11-14 | 北京搜狗科技发展有限公司 | A kind of method for building user thesaurus and the device for building user thesaurus |
US20180130465A1 (en) * | 2016-11-10 | 2018-05-10 | Linearhub | Apparatus and method for correcting pronunciation by contextual recognition |
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110795668A (en) * | 2019-10-28 | 2020-02-14 | 北京博睿宏远数据科技股份有限公司 | Website data analysis method, device, equipment and storage medium |
CN113382000A (en) * | 2021-06-09 | 2021-09-10 | 北京天融信网络安全技术有限公司 | UA character string anomaly detection method, device, equipment and medium |
CN113342866A (en) * | 2021-06-22 | 2021-09-03 | 广州华多网络科技有限公司 | Keyword updating method and device, computer equipment and storage medium |
CN113342866B (en) * | 2021-06-22 | 2022-06-21 | 广州华多网络科技有限公司 | Keyword updating method and device, computer equipment and storage medium |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US10809984B2 (en) | System for generating functionality representation, indexing, searching, componentizing, and analyzing of source code in codebases and method thereof | |
Nguyen et al. | Graph-based statistical language model for code | |
US9665365B2 (en) | Transparently upgrading derived database objects | |
Robbes et al. | How program history can improve code completion | |
US20110087670A1 (en) | Systems and methods for concept mapping | |
CN110134703A (en) | A kind of keywords database update method and device | |
EP2610765A1 (en) | Systems and methods for migrating database data | |
US10303469B1 (en) | Commit graph generation | |
CN103597469A (en) | Live browser tooling in an integrated development environment | |
CN105550206B (en) | The edition control method and device of structured query sentence | |
CN105528416B (en) | A kind of monitoring method and system of network upgrade content | |
WO2016076906A1 (en) | Testing insecure computing environments using random data sets generated from characterizations of real data sets | |
CN111444181A (en) | Knowledge graph updating method and device and electronic equipment | |
Kpodjedo et al. | Madmatch: Many-to-many approximate diagram matching for design comparison | |
Lamela Seijas et al. | Towards property-based testing of RESTful web services | |
US20230012642A1 (en) | Method and device for snapshotting metadata, and storage medium | |
US20050166115A1 (en) | Method for performing software stress test | |
CN110989991B (en) | Method and system for detecting source code clone open source software in application program | |
US20120284224A1 (en) | Build of website knowledge tables | |
CN113094367A (en) | Data processing method and device and server | |
JP6870454B2 (en) | Analytical equipment, analytical programs and analytical methods | |
CN110309315B (en) | Template file generation method and device, computer readable medium and electronic equipment | |
CN110851517A (en) | Source data extraction method, device and equipment and computer storage medium | |
Ivkovic et al. | Enhancing domain-specific software architecture recovery | |
JP5020274B2 (en) | Semantic drift occurrence evaluation method and apparatus |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20190816 |