CN103886033A - Intelligent vertical searching device and method for safety industry chain - Google Patents

Intelligent vertical searching device and method for safety industry chain Download PDF

Info

Publication number
CN103886033A
CN103886033A CN201410078014.5A CN201410078014A CN103886033A CN 103886033 A CN103886033 A CN 103886033A CN 201410078014 A CN201410078014 A CN 201410078014A CN 103886033 A CN103886033 A CN 103886033A
Authority
CN
China
Prior art keywords
spider
engine
crawl device
middleware
scheduling
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201410078014.5A
Other languages
Chinese (zh)
Other versions
CN103886033B (en
Inventor
刘欣毅
李昂生
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing insight Network Co., Ltd.
Original Assignee
WUXI XIANGXIANG BIOTECHNOLOGY Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by WUXI XIANGXIANG BIOTECHNOLOGY Co Ltd filed Critical WUXI XIANGXIANG BIOTECHNOLOGY Co Ltd
Priority to CN201410078014.5A priority Critical patent/CN103886033B/en
Publication of CN103886033A publication Critical patent/CN103886033A/en
Application granted granted Critical
Publication of CN103886033B publication Critical patent/CN103886033B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/951Indexing; Web crawling techniques

Abstract

The invention discloses an intelligent vertical searching device and method for a safety industry chain. The intelligent vertical searching device for the safety industry chain comprises a crawl device engine, namely a searcher engine, a scheduling part, a downloader, spiders, a searching factor bank, an item pipeline, a downloader middleware, a spider middleware and a scheduling middleware. The downloader captures a webpage and returns webpage content to the spiders. The spiders are classes defined by a crawl device user and are used for analyzing the webpage and capturing the content returned by a set URL, and each spider can process a domain name or a group of domain names, namely the spiders are used for defining the capturing and analyzing rule of a certain website. The scheduling middleware is a middleware placed between the crawl device engine and the scheduling part and is in charge of requests and responses sent from the crawl device to the scheduling part, and a self-defined code is provided for expanding the function of the crawl device. The advantages of reliability, accuracy, real-time performance and intelligent searching are achieved.

Description

For the intelligent uprightness searching apparatus and method of Safe industry chain
Technical field
The present invention relates to the intelligent uprightness searching apparatus and method for Safe industry chain, particularly, relate to a kind of for medicine, food and Safe of Medical Device industrial chain intelligent uprightness searching apparatus and method.
Background technology
Large data (big data), or title flood tide data, refer to related data quantity huge to seeing through current main flow Software tool, reaching acquisition, management, processing within reasonable time, also arranging and become the information that helps the more positive object of enterprise management decision-making.According to win advisory data, within 2005, global common property has been given birth to 1,300 hundred million GB(GB) data.Estimate that the year two thousand twenty will increase to 40,000,000,000,000 GB.And in the 25GB data that produce every day, only have 0.5% to be fully utilized, show its break-up value.2010, the value of large data industry was 3,200,000,000 dollars.Estimate that by 2015 this numeral will be up to 16,900,000,000 dollars.
In medicine, food, Safe of Medical Device industrial chain cloud computing cluster service platform, ten thousand parts of accumulation core business data to 200 in 2012,1,000 ten thousand parts of associated data in literature, within 2014, core business data accumulation reaches 5,000,000 parts.Increase with 250% every year.As shown in Table 1:
Table one, medicine, food, the large tables of data of Safe of Medical Device industrial chain cloud computing cluster service platform Big Data:
Based on medicine, food, Safe of Medical Device field, in the face of huge data like this, and increase year after year, at present, general search engine is mainly google, Baidu, search dog and Yahoo etc., is mainly all the search engine technique based on general, its Data Source is mainly the open web page contents in internet, and by collecting, directly present to user, centre has added its commercial activity.Its shortcoming is mainly as follows:
1. Data Source does not have authority, is the open web page contents in internet;
2. industry vertical search service can not be provided, comprise intelligent excavating and the analysis of the large data of industry Big Data;
3. lack the intelligent excavating and the authoritative foundation of analysis of the large data of vertical industry Big Data;
4. lack the search service of vertical industry closed loop;
5. the precision of Search Results is not high, and just the result of document rank presents.
Summary of the invention
The object of the invention is to, for the problems referred to above, propose a kind of intelligent uprightness searching apparatus and method for Safe industry chain, reliable to realize, accurately, in real time and the advantage of intelligent search.
For achieving the above object, the technical solution used in the present invention is:
For an intelligent uprightness searching device for Safe industry chain, comprise
Crawl device engine is searcher engine: crawl device engine is used for controlling the flow chart of data processing of whole system, and carries out the triggering of issued transaction;
Scheduling: scheduler program accepts request and the Sorted list row of joining the team from crawl device engine, and returns to scheduler program after the request of crawl device engine;
Downloader: downloader captures webpage and web page contents is returned to spider;
Spider: spider is that crawl device user oneself definition is used for analyzing web page capturing and formulates the class of the content that URL returns, and each spider can be processed a domain name or one group of domain name, is used for defining crawl and the resolution rules of specific website;
Search procatarxis word bank: comprise that standard is because of word bank, He Yu storehouse, weight factor storehouse: standard is recorded the data of medicine and apparatus because of word bank, namely first searches plain object, weight factor storehouse, storehouse, territory: the internet scope of being responsible for authenticating authority;
Project pipeline: the project that the responsible processing spider of project pipeline extracts from webpage, checking and storage data, after the page is resolved by spider, will be sent to project pipeline; Whether the process that project pipeline is carried out conventionally has: clean html data, the data that checking is resolved to are whether inspection item comprises necessary field, be that repeating data repeats just to delete, the data that are resolved to are stored in database if checked;
Downloader middleware: downloading middleware is the hook framework between crawl device engine and downloader, is responsible for processing request and the response between crawl device engine and downloader;
Spider middleware: spider middleware is the hook framework between crawl device engine and spider, is responsible for processing response input and the request output of spider; Provide the mode of a self-defined code to expand the function of crawl device;
Scheduling middleware: scheduling middleware is the middleware between crawl device engine and scheduling, is responsible for processing sending to request and the response of scheduling from crawl device engine, and provides a self-defining code to expand the function of crawl device.
According to a preferred embodiment of the invention, also comprise security authentication module: be responsible for internal user safety certification;
User behavior recognition memory module: be responsible for intelligent behavior identification and the memory of user in vertical closed-loop search, use guiding and service for user provides intelligence.
Technical solution of the present invention discloses a kind of searching method of the intelligent uprightness searching device for Safe industry chain simultaneously, comprises the following steps:
Step 1, crawl device engine are opened a domain name, and spider is processed this domain name, and allow spider obtain the URL that first crawls;
Step 2, engine obtain from spider the URL that first need to crawl, and then dispatch in scheduling as request;
Step 3, engine obtain from dispatching that page that next step crawls;
The URL that step 4, scheduling crawl the next one returns to engine, and engine sends to downloader by downloading middleware;
Step 5, after webpage is downloaded device and has downloaded, response contents is sent to crawl device engine by downloading middleware;
Step 6, crawl device engine are received the response of downloader and it are sent to spider by spider middleware and process;
Step 7, spider processing response are also returned to the project crawling, and send new request then to crawl device engine;
The project grabbing is sent to project pipeline by step 8, crawl device engine, and send request to scheduling;
Step 9, return to step 2 until then not request in scheduling disconnects contacting between engine and territory.
Technical scheme of the present invention has following beneficial effect:
Technical scheme of the present invention, with medicine, food, Safe of Medical Device industry standard and mass data are as support, data source is data accumulation and the safe document of the FDA of China and foreign countries of government organs and terminal security authenticated, it uses for government organs and terminal security certification authority, it is vertical industry specialized search engine, the result of search has secret, reliably, accurately, the feature such as real-time and intelligent, for providing, terminal user draws and pushes away two-channel intelligent service, and with the medicine of company, food, Safe of Medical Device industrial chain cloud computing cluster service platform is realized and being interconnected.
Below by drawings and Examples, technical scheme of the present invention is described in further detail.
Brief description of the drawings
Fig. 1 is the intelligent uprightness searching principle of device frame for Safe industry chain described in the embodiment of the present invention;
Fig. 2 is the intelligent uprightness searching device work block diagram for Safe industry chain.
Embodiment
Below in conjunction with accompanying drawing, the preferred embodiments of the present invention are described, should be appreciated that preferred embodiment described herein, only for description and interpretation the present invention, is not intended to limit the present invention.
As shown in Figure 1, a kind of intelligent uprightness searching device for Safe industry chain, comprises
Crawl device engine is searcher engine: crawl device engine is used for controlling the flow chart of data processing of whole system, and carries out the triggering of issued transaction;
Scheduling: scheduler program accepts request and the Sorted list row of joining the team from crawl device engine, and returns to scheduler program after the request of crawl device engine;
Downloader: downloader captures webpage and web page contents is returned to spider;
Spider: spider is that crawl device user oneself definition is used for analyzing web page capturing and formulates the class of the content that URL returns, and each spider can be processed a domain name or one group of domain name, is used for defining crawl and the resolution rules of specific website;
Search procatarxis word bank: comprise that standard is because of word bank, He Yu storehouse, weight factor storehouse: standard is recorded the data of medicine and apparatus because of word bank, namely first searches plain object, weight factor storehouse, storehouse, territory: the internet scope of being responsible for authenticating authority;
Project pipeline: the project that the responsible processing spider of project pipeline extracts from webpage, checking and storage data, after the page is resolved by spider, will be sent to project pipeline; Whether the process that project pipeline is carried out conventionally has: clean html data, the data that checking is resolved to are whether inspection item comprises necessary field, be that repeating data repeats just to delete, the data that are resolved to are stored in database if checked;
Downloader middleware: downloading middleware is the hook framework between crawl device engine and downloader, is responsible for processing request and the response between crawl device engine and downloader;
Spider middleware: spider middleware is the hook framework between crawl device engine and spider, is responsible for processing response input and the request output of spider; Provide the mode of a self-defined code to expand the function of crawl device;
Scheduling middleware: scheduling middleware is the middleware between crawl device engine and scheduling, is responsible for processing sending to request and the response of scheduling from crawl device engine, and provides a self-defining code to expand the function of crawl device.
Searcher also comprises, security authentication module: be responsible for internal user safety certification;
User behavior recognition memory module: be responsible for intelligent behavior identification and the memory of user in vertical closed-loop search, use guiding and service for user provides intelligence.
Be specially:
1) crawl device Engine(searcher engine): crawl device engine is the flow chart of data processing for controlling whole system, and carries out the triggering of issued transaction.
2) Scheduler(scheduling): scheduler program accepts request and the Sorted list row of joining the team from crawl device engine, and returns to them after the request of crawl device engine.
3) Downloader(downloader): the major responsibility of downloader is capture webpage and web page contents is returned to spider (Spiders).
4) Spiders(spider): spider is to have crawl device user oneself definition to be used for analyzing web page capturing to formulate the class of the content that URL returns, and each spider can be processed a domain name or one group of domain name.Be used in other words defining crawl and the resolution rules of specific website.
5) search procatarxis word bank: be that vertical industry authority searches prime factor, mainly comprise that standard, because of word bank (medicine and apparatus), namely first searches plain object; Weight factor storehouse; Storehouse, territory, the internet scope of responsible authenticating authority.
6) Item Pipeline(project pipeline): the prime responsibility of project pipeline is to be responsible for processing the project that has spider to extract from webpage, and his main task is checking and storage data.After the page is resolved by spider, will be sent to project pipeline, and through several specific order deal with data.The Python class that the assembly of each project pipeline is made up of a simple method.Obtained project and carried out the method for class, whether need to continuing in project pipeline of simultaneously also needing to determine carried out, next step or directly discard and do not process.The process that project pipeline is carried out conventionally has: clean html data, verify whether the data (whether inspection item comprises necessary field), the inspection that are resolved to are repeating datas (just deleting if repeated), the data that are resolved to are stored in database.
7) Downloader middlewares(downloader middleware): downloading middleware is the hook framework between crawl device engine and downloader, is mainly request and the response of processing between crawl device engine and downloader.It provides the mode of a self-defining code to expand the function of crawl device.In the middle of downloading, device is a hook framework of processing request and response.He is lightweight, crawl device is enjoyed to the system of the bottom of overall situation control.
8) Spider middlewares(spider middleware): spider middleware is the hook framework between crawl device engine and spider, and groundwork is to process the response of spider input and request output.It provides the mode of a self-defined code to expand the function of crawl device.Spider middleware is a framework that is articulated to the spider treatment mechanism of crawl device, and you can insert self-defining code and process and send to the request of spider and return to response contents and the project that spider obtains.
9) Scheduler middlewares(scheduling middleware): scheduling middleware is the middleware between crawl device engine and scheduling, and groundwork is place sends to scheduling request and response from crawl device engine.He provides a self-defining code to expand the function of crawl device.
10) security authentication module: be responsible for cluster platform internal user safety certification, be similar to QQ certification login;
11) user behavior recognition memory: be responsible for intelligent behavior identification and the memory of user in vertical closed-loop search, use guiding and service for user provides intelligence.
Technical solution of the present invention discloses a kind of searching method of the intelligent uprightness searching device for Safe industry chain simultaneously, comprises the following steps:
Step 1, crawl device engine are opened a domain name, and spider is processed this domain name, and allow spider obtain the URL that first crawls;
Step 2, engine obtain from spider the URL that first need to crawl, and then dispatch in scheduling as request;
Step 3, engine obtain from dispatching that page that next step crawls;
The URL that step 4, scheduling crawl the next one returns to engine, and engine sends to downloader by downloading middleware;
Step 5, after webpage is downloaded device and has downloaded, response contents is sent to crawl device engine by downloading middleware;
Step 6, crawl device engine are received the response of downloader and it are sent to spider by spider middleware and process;
Step 7, spider processing response are also returned to the project crawling, and send new request then to crawl device engine;
The project grabbing is sent to project pipeline by step 8, crawl device engine, and send request to scheduling;
Step 9, return to step 2 until then not request in scheduling disconnects contacting between engine and territory.
In its search engine technique framework cluster, in the time that a task is submitted, two roles of mission thread and worker thread initiating task class thread and operation class thread respectively.For the operation of a crawl device, after task class thread has started, can notify operation class thread, operation brings into operation after operation class thread waits all working class thread has started.While startup in a crawl device job task, can start a message queue (Message Queue, main operation is put and get behavior, the object that operation class grabs can be by put in queue, and will capture new object time, as long as get from queue), on each worker thread role, there is message queue node, have a duplicate removal module (bloom filter realization) simultaneously.Its search engine technique framework is as shown in Figure 2:
Technical solution of the present invention can be applicable to distributed cloud computing and large data Big Data technology, is made up of terminals such as system for cloud computing, APP/Mobile APP, PC, iPad.User need to first pass through medicine, food, the certification of Safe of Medical Device industrial chain cloud computing cluster service platform safety, or to the public user who applies for and pass through certification, can use.This search engine provides two kinds of service modes:
1) being embedded in medicine, food, Safe of Medical Device industrial chain cloud computing cluster service platform inside, is a spirit being similar in cluster platform, search service is provided at any time, and the service of user's intelligent and safe is provided in time;
2) Mobile APP, user, by mobile phone-downloaded, carries out, after safety certification login, can using the similar QQ of service mode, Baidu, search dog pop-up advertisement hurdle etc.
The technical program is also used following technology, physical connection, also comprise by multiple wireless connections modes such as Wi-Fi, 3G, 4G, GPRS, connect the application apparatus that terminal comprises the multiple support such as PC, mobile phone, IPad Mobile APP, as long as signal and data energy transmitting.
Multiple distributed terminal equipment and user, it is distributed in each manufacturing enterprise, circulation enterprise, use mechanism and regulator, can use PC, iPad, Mobile phone various electronic to use.
Finally it should be noted that: the foregoing is only the preferred embodiments of the present invention, be not limited to the present invention, although the present invention is had been described in detail with reference to previous embodiment, for a person skilled in the art, its technical scheme that still can record aforementioned each embodiment is modified, or part technical characterictic is wherein equal to replacement.Within the spirit and principles in the present invention all, any amendment of doing, be equal to replacement, improvement etc., within all should being included in protection scope of the present invention.

Claims (3)

1. for an intelligent uprightness searching device for Safe industry chain, it is characterized in that, comprise
Crawl device engine is searcher engine: crawl device engine is used for controlling the flow chart of data processing of whole system, and carries out the triggering of issued transaction;
Scheduling: scheduler program accepts request and the Sorted list row of joining the team from crawl device engine, and returns to scheduler program after the request of crawl device engine;
Downloader: downloader captures webpage and web page contents is returned to spider;
Spider: spider is that crawl device user oneself definition is used for analyzing web page capturing and formulates the class of the content that URL returns, and each spider can be processed a domain name or one group of domain name, is used for defining crawl and the resolution rules of specific website;
Search procatarxis word bank: comprise that standard is because of word bank, He Yu storehouse, weight factor storehouse: standard is recorded the data of medicine and apparatus because of word bank, namely first searches plain object, weight factor storehouse, storehouse, territory: the internet scope of being responsible for authenticating authority;
Project pipeline: the project that the responsible processing spider of project pipeline extracts from webpage, checking and storage data, after the page is resolved by spider, will be sent to project pipeline; Whether the process that project pipeline is carried out conventionally has: clean html data, the data that checking is resolved to are whether inspection item comprises necessary field, be that repeating data repeats just to delete, the data that are resolved to are stored in database if checked;
Downloader middleware: downloading middleware is the hook framework between crawl device engine and downloader, is responsible for processing request and the response between crawl device engine and downloader;
Spider middleware: spider middleware is the hook framework between crawl device engine and spider, is responsible for processing response input and the request output of spider; Provide the mode of a self-defined code to expand the function of crawl device;
Scheduling middleware: scheduling middleware is the middleware between crawl device engine and scheduling, is responsible for processing sending to request and the response of scheduling from crawl device engine, and provides a self-defining code to expand the function of crawl device.
2. the intelligent uprightness searching device for Safe industry chain according to claim 1, is characterized in that, also comprise,
Security authentication module: be responsible for internal user safety certification;
User behavior recognition memory module: be responsible for intelligent behavior identification and the memory of user in vertical closed-loop search, use guiding and service for user provides intelligence.
3. a searching method for the intelligent uprightness searching device for Safe industry chain claimed in claim 2, is characterized in that, comprises the following steps:
Step 1, crawl device engine are opened a domain name, and spider is processed this domain name, and allow spider obtain the URL that first crawls;
Step 2, engine obtain from spider the URL that first need to crawl, and then dispatch in scheduling as request;
Step 3, engine obtain from dispatching that page that next step crawls;
The URL that step 4, scheduling crawl the next one returns to engine, and engine sends to downloader by downloading middleware;
Step 5, after webpage is downloaded device and has downloaded, response contents is sent to crawl device engine by downloading middleware;
Step 6, crawl device engine are received the response of downloader and it are sent to spider by spider middleware and process;
Step 7, spider processing response are also returned to the project crawling, and send new request then to crawl device engine;
The project grabbing is sent to project pipeline by step 8, crawl device engine, and send request to scheduling;
Step 9, return to step 2 until then not request in scheduling disconnects contacting between engine and territory.
CN201410078014.5A 2014-03-05 2014-03-05 Intelligent vertical searching device and method for safety industry chain Active CN103886033B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201410078014.5A CN103886033B (en) 2014-03-05 2014-03-05 Intelligent vertical searching device and method for safety industry chain

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201410078014.5A CN103886033B (en) 2014-03-05 2014-03-05 Intelligent vertical searching device and method for safety industry chain

Publications (2)

Publication Number Publication Date
CN103886033A true CN103886033A (en) 2014-06-25
CN103886033B CN103886033B (en) 2017-02-08

Family

ID=50954925

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201410078014.5A Active CN103886033B (en) 2014-03-05 2014-03-05 Intelligent vertical searching device and method for safety industry chain

Country Status (1)

Country Link
CN (1) CN103886033B (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104820680A (en) * 2015-04-17 2015-08-05 南京大学 Universal distributed crawler scheduling system
CN111274466A (en) * 2019-12-18 2020-06-12 成都迪普曼林信息技术有限公司 Non-structural data acquisition system and method for overseas server
CN113507529A (en) * 2021-07-26 2021-10-15 上海中通吉网络技术有限公司 Method for realizing file downloading based on Web application
CN116126997B (en) * 2023-04-04 2023-06-13 北京洞悉网络有限公司 Document deduplication storage method, system, device and storage medium

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101290588A (en) * 2008-03-07 2008-10-22 重庆邮电大学 Micro-embedded real time task scheduling device and scheduling method
US7472393B2 (en) * 2000-03-21 2008-12-30 Microsoft Corporation Method and system for real time scheduler
CN102012835A (en) * 2010-12-22 2011-04-13 北京航空航天大学 Virtual central processing unit (CPU) scheduling method capable of supporting software real-time application

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7472393B2 (en) * 2000-03-21 2008-12-30 Microsoft Corporation Method and system for real time scheduler
CN101290588A (en) * 2008-03-07 2008-10-22 重庆邮电大学 Micro-embedded real time task scheduling device and scheduling method
CN102012835A (en) * 2010-12-22 2011-04-13 北京航空航天大学 Virtual central processing unit (CPU) scheduling method capable of supporting software real-time application

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
于晓红等: "数字油田安全企业搜索引擎的研究与应用", 《中国期刊全文数据库 信息系统工程》 *

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104820680A (en) * 2015-04-17 2015-08-05 南京大学 Universal distributed crawler scheduling system
CN104820680B (en) * 2015-04-17 2018-04-06 南京大学 A kind of universal distributed reptile scheduling system
CN111274466A (en) * 2019-12-18 2020-06-12 成都迪普曼林信息技术有限公司 Non-structural data acquisition system and method for overseas server
CN113507529A (en) * 2021-07-26 2021-10-15 上海中通吉网络技术有限公司 Method for realizing file downloading based on Web application
CN116126997B (en) * 2023-04-04 2023-06-13 北京洞悉网络有限公司 Document deduplication storage method, system, device and storage medium

Also Published As

Publication number Publication date
CN103886033B (en) 2017-02-08

Similar Documents

Publication Publication Date Title
CN104077402B (en) Data processing method and data handling system
CN108292323A (en) Use the database manipulation of the metadata of data source
CN109997126A (en) Event-driven is extracted, transformation, loads (ETL) processing
CN102710646B (en) Method and system for collecting phishing websites
CN101610265B (en) Service workflow process recognition method
US20150154249A1 (en) Data ingestion module for event detection and increased situational awareness
US20170235726A1 (en) Information identification and extraction
CN102662966B (en) Method and system for obtaining subject-oriented dynamic page content
CN104838413A (en) Adjusting content delivery based on user submissions
CN107087001A (en) A kind of important address spatial retrieval system in distributed internet
CN110019267A (en) A kind of metadata updates method, apparatus, system, electronic equipment and storage medium
CN106844640A (en) A kind of web data analysis and processing method
CN103886033A (en) Intelligent vertical searching device and method for safety industry chain
US11934431B2 (en) Computer-based systems configured for efficient entity resolution for database merging and reconciliation
Sangameswar et al. An algorithm for identification of natural disaster affected area
CN110020075A (en) Device is excavated in illegal website automatically
CN109729044A (en) A kind of general internet data acquisition is counter to climb system and method
CN103745006A (en) Internet information searching system and internet information searching method
US20170235835A1 (en) Information identification and extraction
WO2015084756A1 (en) Event detection through text analysis using trained event template models
CN110070344A (en) The city management system of task quantization
CN108574585B (en) System fault solution obtaining method and device
Pirnau Tool for monitoring Web sites for emergency-related posts and post analysis
CN109583210A (en) A kind of recognition methods, device and its equipment of horizontal permission loophole
CN106095984A (en) A kind of method and device obtaining structural data

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
TR01 Transfer of patent right

Effective date of registration: 20180307

Address after: No. 4, floor No. 102, No. 28, No. 102, Xinjie street, Xicheng District, Beijing City, No. 424

Patentee after: Beijing insight Network Co., Ltd.

Address before: 214000 Jiangsu city of Wuxi province Xishan Economic Development Zone in three Furong Road No. 99 Room 502 5 zuiun

Patentee before: WUXI XIANGXIANG BIOTECHNOLOGY CO., LTD.

TR01 Transfer of patent right