CN104239346A - Search engine based website optimal construction system - Google Patents
Search engine based website optimal construction system Download PDFInfo
- Publication number
- CN104239346A CN104239346A CN201310246759.3A CN201310246759A CN104239346A CN 104239346 A CN104239346 A CN 104239346A CN 201310246759 A CN201310246759 A CN 201310246759A CN 104239346 A CN104239346 A CN 104239346A
- Authority
- CN
- China
- Prior art keywords
- search engine
- web
- construction system
- data
- engine based
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/951—Indexing; Web crawling techniques
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/958—Organisation or management of web site content, e.g. publishing, maintaining pages or automatic linking
Abstract
The invention discloses a search engine based website optimal construction system. The system comprises an internet, a local database, a web server and a webpage foreground module. The search engine based website optimal construction system has the advantages that the system is simple in construction structure, contents of information obtained after data collection and processing are richer and higher in quality, and a ranking result better than that of 'originals' can be obtained to realize optimization of website construction.
Description
Technical field
The present invention relates to a kind of internet site development & construction system.
Background technology
Internet provides online reading and download channel by individual multiple in society and collective in Web realease various information content and for user, and modal a kind of example is exactly press service class website.The information of this kind of website is substantially all consistent, reprints mutually, copies between them.But no matter be people or search engine, do not wish to see too much duplicate message, these information are too many has just become junk information, so present all kinds of search engine all heavily can delete a large amount of duplicate messages by looking into, cause a lot of website reprint others' information after not searched engine include, even enter the blacklist of search engine.
Summary of the invention
Goal of the invention: for the problems referred to above, the object of this invention is to provide a kind of web information flow construction system based on search engine.
Technical scheme: a kind of web information flow construction system based on search engine, comprises internet, local data base, web server, web page foreground module.
From web mining data, and be stored in local data base, carry out data extraction; Processed data are carried out classifying and putting in storage and upload to web server; By the data publication that uploads in the web server corresponding module to web page foreground.
The content that data are extracted comprises crucial character/word, text, hyperlink, user interaction, weight word.
Beneficial effect: compared with prior art, advantage of the present invention is that system constructing structure is simple, more abundant to the information content that obtains after Data Collection and process, quality is higher, also can obtain than " better ranking results reaches the object optimizing Web Hosting simultaneously.
Embodiment
Below in conjunction with specific embodiment, illustrate the present invention further, these embodiments should be understood only be not used in for illustration of the present invention and limit the scope of the invention, after having read the present invention, the amendment of those skilled in the art to the various equivalent form of value of the present invention has all fallen within the application's claims limited range.
Based on a web information flow construction system for search engine, comprise internet, local data base, web server, web page foreground module.
First carry out excavation and the collection of data from internet, and the deposit data excavated is carried out pre-service in local server.
Data according to excavating are classified to the calculating of keyword weight respectively, text classification, and hyperlink is classified.
The keyword of having classified can extract the high word of weight and be stored in the heavy keywords database of core rights, and the text of having classified and hyperlink to be stored in corresponding database and to upload in web server.
From web server, the core word of weight dictionary is mated with keyword in text library, extract the text for associative key coupling, and be published to the weight keyword describing module of webpage.
The filtration of label will be carried out in raw text content from web server, the increase process of new label, and be published to the text module of webpage.
The multiple keywords extracted are published to the TITLE label of webpage, keyword label is with in description label.
The hyperlink of the keyword of extraction in hyperlink storehouse is done relevant matches, and extracts peer link and carry out cluster.
Claims (3)
1. based on a web information flow construction system for search engine, it is characterized in that: comprise internet, local data base, web server, web page foreground module.
2. a kind of web information flow construction system based on search engine according to claim 1, is characterized in that: from web mining data, and be stored in local data base, carries out data extraction; Processed data are carried out classifying and putting in storage and upload to web server; By the data publication that uploads in the web server corresponding module to web page foreground.
3. a kind of web information flow construction system based on search engine according to claim 1, is characterized in that: the content that data are extracted comprises crucial character/word, text, hyperlink, user interaction, weight word.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201310246759.3A CN104239346A (en) | 2013-06-21 | 2013-06-21 | Search engine based website optimal construction system |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201310246759.3A CN104239346A (en) | 2013-06-21 | 2013-06-21 | Search engine based website optimal construction system |
Publications (1)
Publication Number | Publication Date |
---|---|
CN104239346A true CN104239346A (en) | 2014-12-24 |
Family
ID=52227430
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201310246759.3A Pending CN104239346A (en) | 2013-06-21 | 2013-06-21 | Search engine based website optimal construction system |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN104239346A (en) |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107103556A (en) * | 2017-05-16 | 2017-08-29 | 杭州云锄科技有限公司 | Planting management method and device |
CN107679170A (en) * | 2017-09-29 | 2018-02-09 | 肖丽媛 | A kind of web information flow method and system based on user behavior analysis |
CN110232163A (en) * | 2018-03-05 | 2019-09-13 | 上海联启网络科技有限公司 | A kind of enterprise web site construction Extension Software Platform and method |
-
2013
- 2013-06-21 CN CN201310246759.3A patent/CN104239346A/en active Pending
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107103556A (en) * | 2017-05-16 | 2017-08-29 | 杭州云锄科技有限公司 | Planting management method and device |
CN107679170A (en) * | 2017-09-29 | 2018-02-09 | 肖丽媛 | A kind of web information flow method and system based on user behavior analysis |
CN110232163A (en) * | 2018-03-05 | 2019-09-13 | 上海联启网络科技有限公司 | A kind of enterprise web site construction Extension Software Platform and method |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN101593200B (en) | Method for classifying Chinese webpages based on keyword frequency analysis | |
CN103365924B (en) | A kind of method of internet information search, device and terminal | |
CN104598577B (en) | A kind of extracting method of Web page text | |
CN107391502B (en) | Time interval data query method and device and index construction method and device | |
CN102622443A (en) | Customized screening system and method for microblog | |
CN104536956A (en) | A Microblog platform based event visualization method and system | |
CN105022827A (en) | Field subject-oriented Web news dynamic aggregation method | |
CN104679875B (en) | A kind of information data classification method based on digital newspaper | |
CN103390051A (en) | Topic detection and tracking method based on microblog data | |
US10078672B2 (en) | Search device, search method, and computer program product | |
CN102662965A (en) | Method and system of automatically discovering hot news theme on the internet | |
CN103617169A (en) | Microblog hot topic extracting method based on Hadoop | |
CN103324622A (en) | Method and device for automatic generating of front page abstract | |
CN103617174A (en) | Distributed searching method based on cloud computing | |
CN103389998A (en) | Novel Internet commercial intelligence information semantic analysis technology based on cloud service | |
CN102681994A (en) | Webpage information extracting method and system | |
CN103150335A (en) | Co-clustering-based coal mine public sentiment monitoring system | |
CN104298785A (en) | Searching method for public searching resources | |
CN102542061A (en) | Intelligent product classification method | |
CN102722499A (en) | Search engine and implementation method thereof | |
CN103559258A (en) | Webpage ranking method based on cloud computation | |
CN103544165A (en) | Neologism mining method and system | |
CN104462532A (en) | Method and device for extracting webpage text | |
CN104915405A (en) | Microblog query expansion method based on multiple layers | |
CN111859065A (en) | Big data-based public opinion listening system |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C02 | Deemed withdrawal of patent application after publication (patent law 2001) | ||
WD01 | Invention patent application deemed withdrawn after publication |
Application publication date: 20141224 |