CN108197136A - A kind of collection of Enterprise's competition information system - Google Patents
A kind of collection of Enterprise's competition information system Download PDFInfo
- Publication number
- CN108197136A CN108197136A CN201711120740.9A CN201711120740A CN108197136A CN 108197136 A CN108197136 A CN 108197136A CN 201711120740 A CN201711120740 A CN 201711120740A CN 108197136 A CN108197136 A CN 108197136A
- Authority
- CN
- China
- Prior art keywords
- information
- module
- data
- collection
- enterprise
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/953—Querying, e.g. by the use of web search engines
- G06F16/9535—Search customisation based on user profiles and personalisation
Landscapes
- Engineering & Computer Science (AREA)
- Databases & Information Systems (AREA)
- Theoretical Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
The invention discloses a kind of collection of Enterprise's competition information system, including data obtaining module 1, information import modul 2, information adaptation module 3, information sifting module 4, information categorization module 5 and the total library module 6 of competitive intelligence;Using Competitive Intelligence System of the present invention, relevant information can be collected more fully hereinafter, can carry out information acquisition for related field much sooner, accurately.
Description
Technical field
The present invention relates to field of information acquisition, and in particular to a kind of to collect relevant information automatically using using computer technology
Information gathering system.
Background technology
At present, information acquisition means were gradually relied on the mode of manual research, artificial inquiry and document acquisition by the past, to
Using computer technology to rely on, using Internet technology as the new way transition of support.
However, at present by using the method for Internet technology collect intelligence, it is most of to there is collected relevant information
The problem of seriously disconnecting with the required information of scientific research personnel, meanwhile, presently, there are collect relevant information using internet
Method can only collection network public information, scientific research personnel's locally store information can not but be collected, also can not collect statistics point
Analysis.
Invention content
In order to promote information collecting efficiency, related information, the present invention can much sooner, comprehensively, be accurately collected
It is proposed a kind of Competitive Intelligence System, specific inventive technique is as follows:
A kind of collection of Enterprise's competition information system, including data obtaining module 1, information import modul 2, information adaptation module 3, letter
Cease screening module 4, information categorization module 5 and the total library module 6 of competitive intelligence;
Data obtaining module:It is periodically from targeted website that webpage HTML or JSON is literary using computer network crawler technology
On part, storage is downloaded to local server.The crawler technology that computer uses supports automated log on and simple identifying code to know
Other function supports page turn over operation, supports automatic identification page coded format.
Information import modul:Program is imported using the computer of exploitation, the data from internal data source are automatically imported
To local server.
Information adaptation module:The data that data obtaining module and information import modul two parts are generated are according to data mart modeling
Rule carries out automatic arranging matching, forms the unified form of competitive intelligence information.
The information obtained from internet is fitted using the methods of XPATH, JsonPath, regular expression matching
Match;For the data imported from inside data of enterprise source, it is adapted in the form of the field table of comparisons.
Information sifting
Information sifting module:Automatic duplicate removal simultaneously filters invalid information.The module calculates relevant information using similarity algorithm
With the similarity degree of system existing information, intelligent processing data automatically remove the higher information of similarity;
Meanwhile for invalid or relatively low information content data information, information sifting module is by calculating different information institutes
Information magnitude containing keyword actively rejects invalid or relatively low information content data information.
Information categorization module:Using existing subsumption algorithm and keyword, by point of information automatic sorting to tree structure
In class table, by analysis of key word meaning, system Auto-matching relative words carry out automatic indexing to information data.
The total library module of competitive intelligence:This module is for storing the information data sorted out and index data, by different data
It classifies.
According to flow chart of data processing, data obtaining module 1 and information import modul 2 and column distribution, the two respectively with information
Adaptation module 3 be connected, information adaptation module 3, information sifting module 4, information categorization module 5 and the total library module 6 of competitive intelligence according to
Secondary to be connected, information import modul 2 can be automatically imported locally store information data.
Wherein, the workflow of collection of Enterprise's competition information system first passes through acquisition of information for system and information import modul obtains
Source data, by information adaptation and information sifting module by information data of the source data processing for unified form;Then by letter
Breath classifying module is processed, indexes, the data storage total library module of competitive intelligence machined.
Information categorization module 5 is arranged on total 6 front end of library module of competitive intelligence, is conducive in time file information and classify, subtracts
The light data processing pressure of the total library module 6 of competitive intelligence.
The present invention is provided with data obtaining module 1 and information import modul 2, is conducive to user using a variety of different channels
Data source, extend system information sources;Information data passes through 3 working process of information adaptation module, can will be different
The information data of form unifies form, facilitates the processing and analysis of follow-up information;Meanwhile the present invention is sieved by setting information
Modeling block 4 intelligent automatic can filter out the higher information of multiplicity and the relatively low information of information content, improve information collection
Accuracy.
In summary described, Competitive Intelligence System collection information of the present invention is more comprehensive, and system information collection efficiency is substantially
It is promoted, information acquisition is much sooner, accurately.
Description of the drawings
Fig. 1 is system construction drawing, 1 is data obtaining module, 2 is information import modul, 3 is information adaptation module, 4 is letter
Breath screening module, 5 be information categorization module, 6 be the total library module of competitive intelligence
Specific embodiment
A kind of collection of Enterprise's competition information system, including data obtaining module 1, information import modul 2, information adaptation module 3, letter
Cease screening module 4, information categorization module 5 and the total library module 6 of competitive intelligence;
According to flow chart of data processing, data obtaining module 1 and information import modul 2 and column distribution, the two respectively with information
Adaptation module 3 be connected, information adaptation module 3, information sifting module 4, information categorization module 5 and the total library module 6 of competitive intelligence according to
It is secondary to be connected.
Claims (3)
1. a kind of collection of Enterprise's competition information system, including data obtaining module (1), information import modul (2), information adaptation module
(3), information sifting module (4), information categorization module (5) and the total library module of competitive intelligence (6);
Wherein
Data obtaining module (1) is using computer network crawler technology, periodically from targeted website by webpage HTML or JSON file
On, storage is downloaded to local server.
Information import modul (2) imports program using the computer of exploitation, future internet or from the data of internal data source from
It is dynamic to imported into local server.
Information adaptation module (3) advises the data that data obtaining module and information import modul two parts generate according to data mart modeling
Automatic arranging matching is then carried out, forms the unified form of competitive intelligence information.
The automatic duplicate removal of information sifting module (4) simultaneously filters invalid information.
Information categorization module (5) utilizes existing subsumption algorithm and keyword, by the classification of information automatic sorting to tree structure
In table, by analysis of key word meaning, system Auto-matching relative words carry out automatic indexing to information data.
The total library module of competitive intelligence (6) classifies different data for storing the information data sorted out and index data.
In collection of Enterprise's competition information system, according to flow chart of data processing, data obtaining module (1) and information import modul (2) are simultaneously
Column distribution, the two are connected respectively with information adaptation module (3), information adaptation module (3), information sifting module (4), information categorization
Module (5) and the total library module of competitive intelligence (6) are sequentially distributed arrangement.
2. collection of Enterprise's competition information system according to claim 1, it is characterised in that information categorization module (5) is arranged on competition
The total library module of information (6).
3. collection of Enterprise's competition information system according to claim 1, it is characterised in that this system setting information import modul
(2), local information is imported in time.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201711120740.9A CN108197136A (en) | 2017-11-14 | 2017-11-14 | A kind of collection of Enterprise's competition information system |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201711120740.9A CN108197136A (en) | 2017-11-14 | 2017-11-14 | A kind of collection of Enterprise's competition information system |
Publications (1)
Publication Number | Publication Date |
---|---|
CN108197136A true CN108197136A (en) | 2018-06-22 |
Family
ID=62572907
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201711120740.9A Pending CN108197136A (en) | 2017-11-14 | 2017-11-14 | A kind of collection of Enterprise's competition information system |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN108197136A (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111277560A (en) * | 2019-12-24 | 2020-06-12 | 普世(南京)智能科技有限公司 | Safe information acquisition, import and compilation method and system based on high-bandwidth physical isolation unidirectional transmission |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20070255670A1 (en) * | 2004-05-18 | 2007-11-01 | Netbreeze Gmbh | Method and System for Automatically Producing Computer-Aided Control and Analysis Apparatuses |
CN101158963A (en) * | 2007-10-31 | 2008-04-09 | 中兴通讯股份有限公司 | Information acquisition processing and retrieval system |
CN206224473U (en) * | 2016-11-25 | 2017-06-06 | 中国南方电网有限责任公司电网技术研究中心 | Information collection system |
-
2017
- 2017-11-14 CN CN201711120740.9A patent/CN108197136A/en active Pending
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20070255670A1 (en) * | 2004-05-18 | 2007-11-01 | Netbreeze Gmbh | Method and System for Automatically Producing Computer-Aided Control and Analysis Apparatuses |
CN101158963A (en) * | 2007-10-31 | 2008-04-09 | 中兴通讯股份有限公司 | Information acquisition processing and retrieval system |
CN206224473U (en) * | 2016-11-25 | 2017-06-06 | 中国南方电网有限责任公司电网技术研究中心 | Information collection system |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111277560A (en) * | 2019-12-24 | 2020-06-12 | 普世(南京)智能科技有限公司 | Safe information acquisition, import and compilation method and system based on high-bandwidth physical isolation unidirectional transmission |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN109189901B (en) | Method for automatically discovering new classification and corresponding corpus in intelligent customer service system | |
CN102542061B (en) | Intelligent product classification method | |
CN112650848A (en) | Urban railway public opinion information analysis method based on text semantic related passenger evaluation | |
CN103823824A (en) | Method and system for automatically constructing text classification corpus by aid of internet | |
CN104182465A (en) | Network-based big data processing method | |
CN107292744A (en) | Investment Trend analysis method and its system based on machine learning | |
CN109325860A (en) | Network public-opinion detection method and system for overseas investment Risk-warning | |
CN107194617A (en) | A kind of app software engineers soft skill categorizing system and method | |
CN111782806A (en) | Artificial intelligence algorithm-based similar marketing enterprise retrieval classification method and system | |
CN115794803B (en) | Engineering audit problem monitoring method and system based on big data AI technology | |
CN109710826A (en) | A kind of internet information artificial intelligence acquisition method and its system | |
CN112328792A (en) | Optimization method for recognizing credit events based on DBSCAN clustering algorithm | |
CN113761242A (en) | Big data image recognition system and method based on artificial intelligence | |
CN115238154A (en) | Search engine optimization system | |
CN108228787A (en) | According to the method and apparatus of multistage classification processing information | |
CN108197136A (en) | A kind of collection of Enterprise's competition information system | |
KR102345410B1 (en) | Big data intelligent collecting method and device | |
CN105653567A (en) | Method for quickly looking for feature character strings in text sequential data | |
CN109063063B (en) | Data processing method and device based on multi-source data | |
CN206224473U (en) | Information collection system | |
CN110597796A (en) | Big data real-time modeling method and system based on full life cycle | |
CN112668836B (en) | Risk spectrum-oriented associated risk evidence efficient mining and monitoring method and apparatus | |
CN112800219B (en) | Method and system for feeding back customer service log to return database | |
CN114064997A (en) | Artificial intelligence power dispatching decision-making system based on big data | |
CN113420622A (en) | Intelligent scanning, recognizing and filing system based on machine deep learning |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
WD01 | Invention patent application deemed withdrawn after publication | ||
WD01 | Invention patent application deemed withdrawn after publication |
Application publication date: 20180622 |