CN102929888A - Data mining method based on web - Google Patents
Data mining method based on web Download PDFInfo
- Publication number
- CN102929888A CN102929888A CN2011102294858A CN201110229485A CN102929888A CN 102929888 A CN102929888 A CN 102929888A CN 2011102294858 A CN2011102294858 A CN 2011102294858A CN 201110229485 A CN201110229485 A CN 201110229485A CN 102929888 A CN102929888 A CN 102929888A
- Authority
- CN
- China
- Prior art keywords
- data
- web
- industry
- mining
- search condition
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Images
Landscapes
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Abstract
The invention discloses a data mining method based on web, belonging to the field of data mining, and relating to a method utilizing the web to mine data. The method comprises the steps: an input step of specific retrieval conditions: inputting the specific retrieval condition to be mined; a data search step: a user selects data sources according to own needs, and a retrieval system retrieves data matched with the specific retrieval condition from the selected data sources; a data preprocessing step: clearing, integrating and converting the searched data; a data discovering step: according to the relation between accessed web pages on a server, mining the effective, novel, potential and useful data; and a data managing step: classifying the obtained data to manage. According to the method, the information related with an industry in external environment of the industry can be collected, organized and analyzed, and the collected information can be utilized to help the industry make decisions and adjust the business strategy.
Description
Technical field
The invention belongs to Data Mining, relate to the method for utilizing the Web mining data.
Background technology
Along with automatic the popularizing etc. of words, management information system, Internet of the rapid anti-war and ecommerce of computer technology, network technology, mechanics of communication, Internet technology, office, the day by day robotization of business operation flow process of each industry, a large amount of data in the operational process of industry, have been produced, the treasure of each industry when these data and consequent information, it is recording the essential situation of managing faithfully.But in the face of like this a large amount of data, traditional data analysing method, can only obtain the surface layer information of data such as data retrieval, statistical study, can not obtain information its inherence, profound, the supvr is faced with the abundant and predicament of knowledge poorness of data.Therefore it is very important how excavating the useful knowledge of business decision from these data.
Summary of the invention
A kind of Web-based data mining method of the present invention's invention may further comprise the steps:
The input step of specific search condition is inputted specific search condition to be excavated;
The finding step of data, the user selects first the source of described data according to self needs, the data that searching system retrieval and specific search condition from selected Data Source are complementary;
The pretreatment step, to the data that find clear up, integrated, conversion;
The discovery step of data according to the contact between the page of accessing at server, is excavated effective, novel, potential, useful data;
The management process of data carries out Classification Management with the data of obtaining.
The method of the present invention's invention can be collected, put in order and analyze information relevant with the industry in the industry external environment condition, utilizes this to collect to such an extent that information can help industry to make a policy, and adjusts management strategy.
Description of drawings
Fig. 1 is the flow chart of steps of the present invention's a kind of Web-based data mining method of inventing.
Embodiment
The flow chart of steps of a kind of Web-based data mining method of the present invention invention as shown in Figure 1, it may further comprise the steps:
The input step of specific search condition is inputted specific search condition to be excavated;
The finding step of data, the user selects first the source of described data according to self needs, the data that searching system retrieval and specific search condition from selected Data Source are complementary;
The pretreatment step, to the data that find clear up, integrated, conversion;
The discovery step of data according to the contact between the page of accessing at server, is excavated effective, novel, potential, useful data;
The management process of data carries out Classification Management with the data of obtaining.
Claims (1)
1. a Web-based data mining method is characterized in that, may further comprise the steps:
The input step of specific search condition is inputted specific search condition to be excavated;
The finding step of data, the user selects first the source of described data according to self needs, the data that searching system retrieval and specific search condition from selected Data Source are complementary;
The pretreatment step, to the data that find clear up, integrated, conversion;
The discovery step of data according to the contact between the page of accessing at server, is excavated effective, novel, potential, useful data;
The management process of data carries out Classification Management with the data of obtaining.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN2011102294858A CN102929888A (en) | 2011-08-11 | 2011-08-11 | Data mining method based on web |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN2011102294858A CN102929888A (en) | 2011-08-11 | 2011-08-11 | Data mining method based on web |
Publications (1)
Publication Number | Publication Date |
---|---|
CN102929888A true CN102929888A (en) | 2013-02-13 |
Family
ID=47644687
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN2011102294858A Pending CN102929888A (en) | 2011-08-11 | 2011-08-11 | Data mining method based on web |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN102929888A (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103473636A (en) * | 2013-09-03 | 2013-12-25 | 沈效国 | System data components for collecting, analyzing and distributing internet business information |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20080097938A1 (en) * | 1998-05-01 | 2008-04-24 | Isabelle Guyon | Data mining platform for bioinformatics and other knowledge discovery |
CN101877003A (en) * | 2009-01-20 | 2010-11-03 | 国际商业机器公司 | Data analysis system and method |
CN101908191A (en) * | 2010-08-03 | 2010-12-08 | 深圳市她秀时尚电子商务有限公司 | Data analysis method and system for e-commerce |
CN102075963A (en) * | 2009-11-25 | 2011-05-25 | 中国移动通信集团贵州有限公司 | A mobile business data acquisition analysis method and a system for the same |
CN102075560A (en) * | 2010-11-19 | 2011-05-25 | 福建富士通信息软件有限公司 | Fukutomi enterprise search engine technology based on system coupling |
-
2011
- 2011-08-11 CN CN2011102294858A patent/CN102929888A/en active Pending
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20080097938A1 (en) * | 1998-05-01 | 2008-04-24 | Isabelle Guyon | Data mining platform for bioinformatics and other knowledge discovery |
CN101877003A (en) * | 2009-01-20 | 2010-11-03 | 国际商业机器公司 | Data analysis system and method |
CN102075963A (en) * | 2009-11-25 | 2011-05-25 | 中国移动通信集团贵州有限公司 | A mobile business data acquisition analysis method and a system for the same |
CN101908191A (en) * | 2010-08-03 | 2010-12-08 | 深圳市她秀时尚电子商务有限公司 | Data analysis method and system for e-commerce |
CN102075560A (en) * | 2010-11-19 | 2011-05-25 | 福建富士通信息软件有限公司 | Fukutomi enterprise search engine technology based on system coupling |
Non-Patent Citations (1)
Title |
---|
周绪倩: "基于电子商务的Web数据挖掘系统架构研究", 《中国优秀硕士学位论文全文数据库信息科技辑》 * |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103473636A (en) * | 2013-09-03 | 2013-12-25 | 沈效国 | System data components for collecting, analyzing and distributing internet business information |
CN103473636B (en) * | 2013-09-03 | 2017-08-08 | 沈效国 | A kind of system data element of collection, analysis and distribution network business information |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Wang et al. | Identifying technological topics and institution-topic distribution probability for patent competitive intelligence analysis: a case study in LTE technology | |
US9590880B2 (en) | Dynamic collection analysis and reporting of telemetry data | |
US20170364834A1 (en) | Real-time monitoring of public sentiment | |
CN102750336B (en) | Resource individuation recommendation method based on user relevance | |
Palmer | Renewables rise above fossil fuels | |
Brulé | Big data in E&P: Real-time adaptive analytics and data-flow architecture | |
CN102402539A (en) | Design technology for object-level personalized vertical search engine | |
CN104679827A (en) | Big data-based public information association method and mining engine | |
CN103838754A (en) | Information searching device and method | |
CN104731906A (en) | Automatic recruiting website resume pushing method | |
LU503512B1 (en) | Operating method for construction of knowledge graph based on naming rule and caching mechanism | |
CN105808722A (en) | Information discrimination method and system | |
CN104216889A (en) | Data transmissibility analysis and prediction method and system based on cloud service | |
CN103020083B (en) | The automatic mining method of demand recognition template, demand recognition methods and corresponding device | |
Hajirahimova | Opportunities and challenges big data in oil and gas industry | |
CN104484367A (en) | Data mining and analyzing system | |
Nagdive et al. | Web server log analysis for unstructured data using apache flume and pig | |
CN103218390A (en) | Site resource management method and device | |
Gaurav et al. | An outline on big data and big data analytics | |
CN102929888A (en) | Data mining method based on web | |
CN101894318A (en) | Position working standard-generating and information-promoting system based on user operation behavior | |
Motwani et al. | Modeling Implementation of Big Data Analytics in Oil and Gas Industries in India | |
KR20210045172A (en) | Big Data Management and System for Livestock Disease Outbreak Analysis | |
Renouf | Do Heavy and Medium Oil Waterfloods Differ? | |
Avdeev et al. | Transition to the use of digital assistants in the kinematic interpretation of the data of seismic exploration by the example of the problem of improving the quality of seismic data after summation and reliability of the tectonic model forecast |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
WD01 | Invention patent application deemed withdrawn after publication |
Application publication date: 20130213 |
|
WD01 | Invention patent application deemed withdrawn after publication |