CN104239346A - Search engine based website optimal construction system - Google Patents

Search engine based website optimal construction system Download PDF

Info

Publication number
CN104239346A
CN104239346A CN201310246759.3A CN201310246759A CN104239346A CN 104239346 A CN104239346 A CN 104239346A CN 201310246759 A CN201310246759 A CN 201310246759A CN 104239346 A CN104239346 A CN 104239346A
Authority
CN
China
Prior art keywords
search engine
web
construction system
data
engine based
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201310246759.3A
Other languages
Chinese (zh)
Inventor
江萍
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Zhenjiang Xin Ye Network Technology Co Ltd
Original Assignee
Zhenjiang Xin Ye Network Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Zhenjiang Xin Ye Network Technology Co Ltd filed Critical Zhenjiang Xin Ye Network Technology Co Ltd
Priority to CN201310246759.3A priority Critical patent/CN104239346A/en
Publication of CN104239346A publication Critical patent/CN104239346A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/951Indexing; Web crawling techniques
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/958Organisation or management of web site content, e.g. publishing, maintaining pages or automatic linking

Abstract

The invention discloses a search engine based website optimal construction system. The system comprises an internet, a local database, a web server and a webpage foreground module. The search engine based website optimal construction system has the advantages that the system is simple in construction structure, contents of information obtained after data collection and processing are richer and higher in quality, and a ranking result better than that of 'originals' can be obtained to realize optimization of website construction.

Description

A kind of web information flow construction system based on search engine
Technical field
The present invention relates to a kind of internet site development & construction system.
Background technology
Internet provides online reading and download channel by individual multiple in society and collective in Web realease various information content and for user, and modal a kind of example is exactly press service class website.The information of this kind of website is substantially all consistent, reprints mutually, copies between them.But no matter be people or search engine, do not wish to see too much duplicate message, these information are too many has just become junk information, so present all kinds of search engine all heavily can delete a large amount of duplicate messages by looking into, cause a lot of website reprint others' information after not searched engine include, even enter the blacklist of search engine.
Summary of the invention
Goal of the invention: for the problems referred to above, the object of this invention is to provide a kind of web information flow construction system based on search engine.
Technical scheme: a kind of web information flow construction system based on search engine, comprises internet, local data base, web server, web page foreground module.
From web mining data, and be stored in local data base, carry out data extraction; Processed data are carried out classifying and putting in storage and upload to web server; By the data publication that uploads in the web server corresponding module to web page foreground.
The content that data are extracted comprises crucial character/word, text, hyperlink, user interaction, weight word.
Beneficial effect: compared with prior art, advantage of the present invention is that system constructing structure is simple, more abundant to the information content that obtains after Data Collection and process, quality is higher, also can obtain than " better ranking results reaches the object optimizing Web Hosting simultaneously.
Embodiment
Below in conjunction with specific embodiment, illustrate the present invention further, these embodiments should be understood only be not used in for illustration of the present invention and limit the scope of the invention, after having read the present invention, the amendment of those skilled in the art to the various equivalent form of value of the present invention has all fallen within the application's claims limited range.
Based on a web information flow construction system for search engine, comprise internet, local data base, web server, web page foreground module.
First carry out excavation and the collection of data from internet, and the deposit data excavated is carried out pre-service in local server.
Data according to excavating are classified to the calculating of keyword weight respectively, text classification, and hyperlink is classified.
The keyword of having classified can extract the high word of weight and be stored in the heavy keywords database of core rights, and the text of having classified and hyperlink to be stored in corresponding database and to upload in web server.
From web server, the core word of weight dictionary is mated with keyword in text library, extract the text for associative key coupling, and be published to the weight keyword describing module of webpage.
The filtration of label will be carried out in raw text content from web server, the increase process of new label, and be published to the text module of webpage.
The multiple keywords extracted are published to the TITLE label of webpage, keyword label is with in description label.
The hyperlink of the keyword of extraction in hyperlink storehouse is done relevant matches, and extracts peer link and carry out cluster.

Claims (3)

1. based on a web information flow construction system for search engine, it is characterized in that: comprise internet, local data base, web server, web page foreground module.
2. a kind of web information flow construction system based on search engine according to claim 1, is characterized in that: from web mining data, and be stored in local data base, carries out data extraction; Processed data are carried out classifying and putting in storage and upload to web server; By the data publication that uploads in the web server corresponding module to web page foreground.
3. a kind of web information flow construction system based on search engine according to claim 1, is characterized in that: the content that data are extracted comprises crucial character/word, text, hyperlink, user interaction, weight word.
CN201310246759.3A 2013-06-21 2013-06-21 Search engine based website optimal construction system Pending CN104239346A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201310246759.3A CN104239346A (en) 2013-06-21 2013-06-21 Search engine based website optimal construction system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201310246759.3A CN104239346A (en) 2013-06-21 2013-06-21 Search engine based website optimal construction system

Publications (1)

Publication Number Publication Date
CN104239346A true CN104239346A (en) 2014-12-24

Family

ID=52227430

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201310246759.3A Pending CN104239346A (en) 2013-06-21 2013-06-21 Search engine based website optimal construction system

Country Status (1)

Country Link
CN (1) CN104239346A (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107103556A (en) * 2017-05-16 2017-08-29 杭州云锄科技有限公司 Planting management method and device
CN107679170A (en) * 2017-09-29 2018-02-09 肖丽媛 A kind of web information flow method and system based on user behavior analysis
CN110232163A (en) * 2018-03-05 2019-09-13 上海联启网络科技有限公司 A kind of enterprise web site construction Extension Software Platform and method

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107103556A (en) * 2017-05-16 2017-08-29 杭州云锄科技有限公司 Planting management method and device
CN107679170A (en) * 2017-09-29 2018-02-09 肖丽媛 A kind of web information flow method and system based on user behavior analysis
CN110232163A (en) * 2018-03-05 2019-09-13 上海联启网络科技有限公司 A kind of enterprise web site construction Extension Software Platform and method

Similar Documents

Publication Publication Date Title
CN101593200B (en) Method for classifying Chinese webpages based on keyword frequency analysis
CN103365924B (en) A kind of method of internet information search, device and terminal
CN104598577B (en) A kind of extracting method of Web page text
CN107391502B (en) Time interval data query method and device and index construction method and device
CN102622443A (en) Customized screening system and method for microblog
CN104536956A (en) A Microblog platform based event visualization method and system
CN105022827A (en) Field subject-oriented Web news dynamic aggregation method
CN104679875B (en) A kind of information data classification method based on digital newspaper
CN103390051A (en) Topic detection and tracking method based on microblog data
US10078672B2 (en) Search device, search method, and computer program product
CN102662965A (en) Method and system of automatically discovering hot news theme on the internet
CN103617169A (en) Microblog hot topic extracting method based on Hadoop
CN103324622A (en) Method and device for automatic generating of front page abstract
CN103617174A (en) Distributed searching method based on cloud computing
CN103389998A (en) Novel Internet commercial intelligence information semantic analysis technology based on cloud service
CN102681994A (en) Webpage information extracting method and system
CN103150335A (en) Co-clustering-based coal mine public sentiment monitoring system
CN104298785A (en) Searching method for public searching resources
CN102542061A (en) Intelligent product classification method
CN102722499A (en) Search engine and implementation method thereof
CN103559258A (en) Webpage ranking method based on cloud computation
CN103544165A (en) Neologism mining method and system
CN104462532A (en) Method and device for extracting webpage text
CN104915405A (en) Microblog query expansion method based on multiple layers
CN111859065A (en) Big data-based public opinion listening system

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C02 Deemed withdrawal of patent application after publication (patent law 2001)
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20141224