CN108197136A - A kind of collection of Enterprise's competition information system - Google Patents

A kind of collection of Enterprise's competition information system Download PDF

Info

Publication number
CN108197136A
CN108197136A CN201711120740.9A CN201711120740A CN108197136A CN 108197136 A CN108197136 A CN 108197136A CN 201711120740 A CN201711120740 A CN 201711120740A CN 108197136 A CN108197136 A CN 108197136A
Authority
CN
China
Prior art keywords
information
module
data
collection
enterprise
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201711120740.9A
Other languages
Chinese (zh)
Inventor
申敏
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
CSG Electric Power Research Institute
Research Institute of Southern Power Grid Co Ltd
Original Assignee
Research Institute of Southern Power Grid Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Research Institute of Southern Power Grid Co Ltd filed Critical Research Institute of Southern Power Grid Co Ltd
Priority to CN201711120740.9A priority Critical patent/CN108197136A/en
Publication of CN108197136A publication Critical patent/CN108197136A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9535Search customisation based on user profiles and personalisation

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses a kind of collection of Enterprise's competition information system, including data obtaining module 1, information import modul 2, information adaptation module 3, information sifting module 4, information categorization module 5 and the total library module 6 of competitive intelligence;Using Competitive Intelligence System of the present invention, relevant information can be collected more fully hereinafter, can carry out information acquisition for related field much sooner, accurately.

Description

A kind of collection of Enterprise's competition information system
Technical field
The present invention relates to field of information acquisition, and in particular to a kind of to collect relevant information automatically using using computer technology Information gathering system.
Background technology
At present, information acquisition means were gradually relied on the mode of manual research, artificial inquiry and document acquisition by the past, to Using computer technology to rely on, using Internet technology as the new way transition of support.
However, at present by using the method for Internet technology collect intelligence, it is most of to there is collected relevant information The problem of seriously disconnecting with the required information of scientific research personnel, meanwhile, presently, there are collect relevant information using internet Method can only collection network public information, scientific research personnel's locally store information can not but be collected, also can not collect statistics point Analysis.
Invention content
In order to promote information collecting efficiency, related information, the present invention can much sooner, comprehensively, be accurately collected It is proposed a kind of Competitive Intelligence System, specific inventive technique is as follows:
A kind of collection of Enterprise's competition information system, including data obtaining module 1, information import modul 2, information adaptation module 3, letter Cease screening module 4, information categorization module 5 and the total library module 6 of competitive intelligence;
Data obtaining module:It is periodically from targeted website that webpage HTML or JSON is literary using computer network crawler technology On part, storage is downloaded to local server.The crawler technology that computer uses supports automated log on and simple identifying code to know Other function supports page turn over operation, supports automatic identification page coded format.
Information import modul:Program is imported using the computer of exploitation, the data from internal data source are automatically imported To local server.
Information adaptation module:The data that data obtaining module and information import modul two parts are generated are according to data mart modeling Rule carries out automatic arranging matching, forms the unified form of competitive intelligence information.
The information obtained from internet is fitted using the methods of XPATH, JsonPath, regular expression matching Match;For the data imported from inside data of enterprise source, it is adapted in the form of the field table of comparisons.
Information sifting
Information sifting module:Automatic duplicate removal simultaneously filters invalid information.The module calculates relevant information using similarity algorithm With the similarity degree of system existing information, intelligent processing data automatically remove the higher information of similarity;
Meanwhile for invalid or relatively low information content data information, information sifting module is by calculating different information institutes Information magnitude containing keyword actively rejects invalid or relatively low information content data information.
Information categorization module:Using existing subsumption algorithm and keyword, by point of information automatic sorting to tree structure In class table, by analysis of key word meaning, system Auto-matching relative words carry out automatic indexing to information data.
The total library module of competitive intelligence:This module is for storing the information data sorted out and index data, by different data It classifies.
According to flow chart of data processing, data obtaining module 1 and information import modul 2 and column distribution, the two respectively with information Adaptation module 3 be connected, information adaptation module 3, information sifting module 4, information categorization module 5 and the total library module 6 of competitive intelligence according to Secondary to be connected, information import modul 2 can be automatically imported locally store information data.
Wherein, the workflow of collection of Enterprise's competition information system first passes through acquisition of information for system and information import modul obtains Source data, by information adaptation and information sifting module by information data of the source data processing for unified form;Then by letter Breath classifying module is processed, indexes, the data storage total library module of competitive intelligence machined.
Information categorization module 5 is arranged on total 6 front end of library module of competitive intelligence, is conducive in time file information and classify, subtracts The light data processing pressure of the total library module 6 of competitive intelligence.
The present invention is provided with data obtaining module 1 and information import modul 2, is conducive to user using a variety of different channels Data source, extend system information sources;Information data passes through 3 working process of information adaptation module, can will be different The information data of form unifies form, facilitates the processing and analysis of follow-up information;Meanwhile the present invention is sieved by setting information Modeling block 4 intelligent automatic can filter out the higher information of multiplicity and the relatively low information of information content, improve information collection Accuracy.
In summary described, Competitive Intelligence System collection information of the present invention is more comprehensive, and system information collection efficiency is substantially It is promoted, information acquisition is much sooner, accurately.
Description of the drawings
Fig. 1 is system construction drawing, 1 is data obtaining module, 2 is information import modul, 3 is information adaptation module, 4 is letter Breath screening module, 5 be information categorization module, 6 be the total library module of competitive intelligence
Specific embodiment
A kind of collection of Enterprise's competition information system, including data obtaining module 1, information import modul 2, information adaptation module 3, letter Cease screening module 4, information categorization module 5 and the total library module 6 of competitive intelligence;
According to flow chart of data processing, data obtaining module 1 and information import modul 2 and column distribution, the two respectively with information Adaptation module 3 be connected, information adaptation module 3, information sifting module 4, information categorization module 5 and the total library module 6 of competitive intelligence according to It is secondary to be connected.

Claims (3)

1. a kind of collection of Enterprise's competition information system, including data obtaining module (1), information import modul (2), information adaptation module (3), information sifting module (4), information categorization module (5) and the total library module of competitive intelligence (6);
Wherein
Data obtaining module (1) is using computer network crawler technology, periodically from targeted website by webpage HTML or JSON file On, storage is downloaded to local server.
Information import modul (2) imports program using the computer of exploitation, future internet or from the data of internal data source from It is dynamic to imported into local server.
Information adaptation module (3) advises the data that data obtaining module and information import modul two parts generate according to data mart modeling Automatic arranging matching is then carried out, forms the unified form of competitive intelligence information.
The automatic duplicate removal of information sifting module (4) simultaneously filters invalid information.
Information categorization module (5) utilizes existing subsumption algorithm and keyword, by the classification of information automatic sorting to tree structure In table, by analysis of key word meaning, system Auto-matching relative words carry out automatic indexing to information data.
The total library module of competitive intelligence (6) classifies different data for storing the information data sorted out and index data.
In collection of Enterprise's competition information system, according to flow chart of data processing, data obtaining module (1) and information import modul (2) are simultaneously Column distribution, the two are connected respectively with information adaptation module (3), information adaptation module (3), information sifting module (4), information categorization Module (5) and the total library module of competitive intelligence (6) are sequentially distributed arrangement.
2. collection of Enterprise's competition information system according to claim 1, it is characterised in that information categorization module (5) is arranged on competition The total library module of information (6).
3. collection of Enterprise's competition information system according to claim 1, it is characterised in that this system setting information import modul (2), local information is imported in time.
CN201711120740.9A 2017-11-14 2017-11-14 A kind of collection of Enterprise's competition information system Pending CN108197136A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201711120740.9A CN108197136A (en) 2017-11-14 2017-11-14 A kind of collection of Enterprise's competition information system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201711120740.9A CN108197136A (en) 2017-11-14 2017-11-14 A kind of collection of Enterprise's competition information system

Publications (1)

Publication Number Publication Date
CN108197136A true CN108197136A (en) 2018-06-22

Family

ID=62572907

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201711120740.9A Pending CN108197136A (en) 2017-11-14 2017-11-14 A kind of collection of Enterprise's competition information system

Country Status (1)

Country Link
CN (1) CN108197136A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111277560A (en) * 2019-12-24 2020-06-12 普世(南京)智能科技有限公司 Safe information acquisition, import and compilation method and system based on high-bandwidth physical isolation unidirectional transmission

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20070255670A1 (en) * 2004-05-18 2007-11-01 Netbreeze Gmbh Method and System for Automatically Producing Computer-Aided Control and Analysis Apparatuses
CN101158963A (en) * 2007-10-31 2008-04-09 中兴通讯股份有限公司 Information acquisition processing and retrieval system
CN206224473U (en) * 2016-11-25 2017-06-06 中国南方电网有限责任公司电网技术研究中心 Information collection system

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20070255670A1 (en) * 2004-05-18 2007-11-01 Netbreeze Gmbh Method and System for Automatically Producing Computer-Aided Control and Analysis Apparatuses
CN101158963A (en) * 2007-10-31 2008-04-09 中兴通讯股份有限公司 Information acquisition processing and retrieval system
CN206224473U (en) * 2016-11-25 2017-06-06 中国南方电网有限责任公司电网技术研究中心 Information collection system

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111277560A (en) * 2019-12-24 2020-06-12 普世(南京)智能科技有限公司 Safe information acquisition, import and compilation method and system based on high-bandwidth physical isolation unidirectional transmission

Similar Documents

Publication Publication Date Title
CN109189901B (en) Method for automatically discovering new classification and corresponding corpus in intelligent customer service system
CN102542061B (en) Intelligent product classification method
CN112650848A (en) Urban railway public opinion information analysis method based on text semantic related passenger evaluation
CN103823824A (en) Method and system for automatically constructing text classification corpus by aid of internet
CN104182465A (en) Network-based big data processing method
CN107292744A (en) Investment Trend analysis method and its system based on machine learning
CN109325860A (en) Network public-opinion detection method and system for overseas investment Risk-warning
CN107194617A (en) A kind of app software engineers soft skill categorizing system and method
CN111782806A (en) Artificial intelligence algorithm-based similar marketing enterprise retrieval classification method and system
CN115794803B (en) Engineering audit problem monitoring method and system based on big data AI technology
CN109710826A (en) A kind of internet information artificial intelligence acquisition method and its system
CN112328792A (en) Optimization method for recognizing credit events based on DBSCAN clustering algorithm
CN113761242A (en) Big data image recognition system and method based on artificial intelligence
CN115238154A (en) Search engine optimization system
CN108228787A (en) According to the method and apparatus of multistage classification processing information
CN108197136A (en) A kind of collection of Enterprise's competition information system
KR102345410B1 (en) Big data intelligent collecting method and device
CN105653567A (en) Method for quickly looking for feature character strings in text sequential data
CN109063063B (en) Data processing method and device based on multi-source data
CN206224473U (en) Information collection system
CN110597796A (en) Big data real-time modeling method and system based on full life cycle
CN112668836B (en) Risk spectrum-oriented associated risk evidence efficient mining and monitoring method and apparatus
CN112800219B (en) Method and system for feeding back customer service log to return database
CN114064997A (en) Artificial intelligence power dispatching decision-making system based on big data
CN113420622A (en) Intelligent scanning, recognizing and filing system based on machine deep learning

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
WD01 Invention patent application deemed withdrawn after publication
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20180622