CN113392355A - Page configuration method, device, equipment and storage medium - Google Patents
Page configuration method, device, equipment and storage medium Download PDFInfo
- Publication number
- CN113392355A CN113392355A CN202110720113.9A CN202110720113A CN113392355A CN 113392355 A CN113392355 A CN 113392355A CN 202110720113 A CN202110720113 A CN 202110720113A CN 113392355 A CN113392355 A CN 113392355A
- Authority
- CN
- China
- Prior art keywords
- page
- browsing
- hot
- words
- content
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000000034 method Methods 0.000 title claims abstract description 46
- 230000011218 segmentation Effects 0.000 claims abstract description 13
- 230000000875 corresponding effect Effects 0.000 claims description 56
- 239000013598 vector Substances 0.000 claims description 41
- 230000006399 behavior Effects 0.000 claims description 19
- 238000004590 computer program Methods 0.000 claims description 15
- 238000012216 screening Methods 0.000 claims description 12
- 238000011156 evaluation Methods 0.000 claims description 8
- 238000004364 calculation method Methods 0.000 claims description 4
- 230000002596 correlated effect Effects 0.000 claims description 3
- 238000012545 processing Methods 0.000 abstract description 3
- 230000008569 process Effects 0.000 description 7
- 230000009286 beneficial effect Effects 0.000 description 4
- 230000009193 crawling Effects 0.000 description 3
- 238000005516 engineering process Methods 0.000 description 3
- 238000005457 optimization Methods 0.000 description 3
- 238000011161 development Methods 0.000 description 2
- 230000018109 developmental process Effects 0.000 description 2
- 238000010586 diagram Methods 0.000 description 2
- 230000002349 favourable effect Effects 0.000 description 2
- 238000004458 analytical method Methods 0.000 description 1
- 238000006243 chemical reaction Methods 0.000 description 1
- 238000012790 confirmation Methods 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 238000013135 deep learning Methods 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 230000006870 function Effects 0.000 description 1
- JLYXXMFPNIAWKQ-GNIYUCBRSA-N gamma-hexachlorocyclohexane Chemical compound Cl[C@H]1[C@H](Cl)[C@@H](Cl)[C@@H](Cl)[C@H](Cl)[C@H]1Cl JLYXXMFPNIAWKQ-GNIYUCBRSA-N 0.000 description 1
- JLYXXMFPNIAWKQ-UHFFFAOYSA-N gamma-hexachlorocyclohexane Natural products ClC1C(Cl)C(Cl)C(Cl)C(Cl)C1Cl JLYXXMFPNIAWKQ-UHFFFAOYSA-N 0.000 description 1
- 230000010354 integration Effects 0.000 description 1
- 230000002452 interceptive effect Effects 0.000 description 1
- 229960002809 lindane Drugs 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000003058 natural language processing Methods 0.000 description 1
- 230000003068 static effect Effects 0.000 description 1
- 230000001360 synchronised effect Effects 0.000 description 1
- 238000012549 training Methods 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/958—Organisation or management of web site content, e.g. publishing, maintaining pages or automatic linking
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/953—Querying, e.g. by the use of web search engines
- G06F16/9535—Search customisation based on user profiles and personalisation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/279—Recognition of textual entities
- G06F40/284—Lexical analysis, e.g. tokenisation or collocates
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/279—Recognition of textual entities
- G06F40/289—Phrasal analysis, e.g. finite state techniques or chunking
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/44—Arrangements for executing specific programs
- G06F9/445—Program loading or initiating
- G06F9/44505—Configuring for program initiating, e.g. using registry, configuration files
- G06F9/4451—User profiles; Roaming
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Databases & Information Systems (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Software Systems (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Computational Linguistics (AREA)
- General Health & Medical Sciences (AREA)
- Artificial Intelligence (AREA)
- Data Mining & Analysis (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Information Transfer Between Computers (AREA)
Abstract
The application relates to the technical field of data processing, and provides a page configuration method, a device, equipment and a storage medium, wherein the method comprises the following steps: acquiring a page to be configured of a target website and page content of the page to be configured, and performing word segmentation on the page content of the page to be configured to obtain keywords of the page content; acquiring a page head area of a page to be configured, and configuring a keyword in a keyword column of the page head area; and acquiring current affair hot words through a search engine, and adding the current affair hot words into the keyword column at the head of the page to complete the configuration of the page to be configured. According to the page configured by the application, when a user needs to search the target website, the target website containing the current affair hot words in the head area of the page is displayed in the front row, so that the user can quickly acquire the required target website, and the searching efficiency is improved.
Description
Technical Field
The present application relates to the field of data processing technologies, and in particular, to a page configuration method, apparatus, device, and storage medium.
Background
With the development of internet technology and the development of search engine technology, users can query information through search applications, such as querying the pronunciation, meaning of a word, profile of a person, and the like.
When a user needs to search a target website, a plurality of search results are usually displayed in a search result display page by a search application, but the search results are generally displayed in the front columns of the page contents which are current affair hotspots, and the page contents of the target website are difficult to be displayed in the front columns of the website, such as an enterprise website, so that the user is not favorable for quickly acquiring the required target website, the popularization of the websites is not favorable, and the search efficiency is low.
Disclosure of Invention
The present application mainly aims to provide a page configuration method, device, equipment and storage medium, so as to improve the search efficiency.
In order to achieve the above object, the present application provides a page configuration method, which includes the following steps:
acquiring a page to be configured of a target website and page content of the page to be configured, and performing word segmentation on the page content of the page to be configured to obtain keywords of the page content;
acquiring a page head area of the page to be configured, and configuring the keyword in a keyword column of the page head area;
and acquiring current affair hot words through a search engine, and adding the current affair hot words into the keyword column at the head of the page to complete the configuration of the page to be configured.
Preferably, when there are a plurality of current affair hotspot words, the step of adding the current affair hotspot words to the keyword column at the head of the page includes:
acquiring the search quantity of the plurality of current affair hotspot words;
setting weights for the current affair hot words in sequence according to the search amount, and screening out the current affair hot words with the weights larger than the preset weights to obtain target current affair hot words; wherein the magnitude of the weight is positively correlated with the search quantity;
and adding the target current affair hot words into the keyword column in the head area of the page.
Preferably, when there are a plurality of current affair hotspot words, the step of adding the current affair hotspot words to the keyword column at the head of the page includes:
respectively converting the plurality of current affair hot words into vectors to obtain a plurality of first word vectors;
converting the keywords in the keyword column into vectors to obtain second word vectors;
respectively calculating the similarity of the plurality of first word vectors and the second word vectors, and taking the current affair hot words corresponding to the first word vectors with the similarity greater than the preset similarity as target current affair hot words;
and adding the target current affair hot words into the keyword column in the head area of the page.
Further, after the step of completing the configuration of the page to be configured, the method further includes:
acquiring browsing data browsed by the configured page within a preset time period;
evaluating the configured page according to the browsing data to generate an evaluation result;
and adjusting current affair hot words in the keyword column configured in the page head area according to the evaluation result.
Preferably, the step of obtaining the current affair hotspot words by the search engine includes:
acquiring hot web page resources through a search engine, and determining a hot event in the hot web page resources;
performing event semantic understanding on the hot event, and determining an original event of the hot event;
extracting the keywords of the original event, and screening current affair hot words from the keywords of the original event.
Preferably, the step of determining the hot event in the hot web page resource includes:
acquiring each search result item in the hot webpage resource, and acquiring user behavior data corresponding to the search result item; the user behavior data comprises browsing content of a user in a page corresponding to the search result item and browsing time corresponding to the browsing content;
determining browsing duration corresponding to each browsing content according to the browsing content and the corresponding browsing time;
and determining the hot event corresponding to the search result item according to the browsing duration corresponding to each browsing content.
Preferably, the step of determining the hot event corresponding to the search result item according to the browsing duration corresponding to each browsing content includes:
counting browsing frequency corresponding to each browsing content;
performing weighted calculation according to the browsing duration and the browsing frequency corresponding to each browsing content to determine a heat value corresponding to each browsing content;
acquiring browsing contents with the hot values arranged at the front N bits to obtain target browsing contents; wherein N is a positive integer greater than 0;
and determining a hot event corresponding to the search result item according to the target browsing content.
The present application further provides a page configuration apparatus, which includes:
the acquisition module is used for acquiring a page to be configured of a target website and page content of the page to be configured, and performing word segmentation on the page content of the page to be configured to obtain keywords of the page content;
the configuration module is used for acquiring a page head area of the page to be configured and configuring the keyword in a keyword column of the page head area;
and the completion module is used for acquiring the current affair hot words through a search engine, adding the current affair hot words into the keyword column at the head of the page and completing the configuration of the page to be configured.
The present application further provides a computer device comprising a memory and a processor, the memory storing a computer program, the processor implementing the steps of any of the above methods when executing the computer program.
The present application also provides a computer-readable storage medium having stored thereon a computer program which, when executed by a processor, performs the steps of any of the methods described above.
According to the page configuration method, the page configuration device, the page configuration equipment and the storage medium, after a page to be configured of a target website and page content of the page to be configured are obtained, the page content of the page to be configured is subjected to word segmentation to obtain keywords of the page content; acquiring a page head area of a page to be configured, configuring a keyword in a keyword column of the page head area, and matching the keyword to the page to be configured of the target website when searching the keyword; the current affair hot words are obtained through the search engine and added into the keyword column at the head of the page, and the configuration of the page to be configured is completed, so that the target websites containing the current affair hot words in the head area of the page are displayed in the front row when a user needs to search the target websites by adding the current affair hot words into the page, the user can acquire the required target websites quickly, the popularization of the target websites is facilitated, and the search efficiency is improved.
Drawings
Fig. 1 is a schematic flowchart of a page configuration method according to an embodiment of the present application;
FIG. 2 is a block diagram illustrating a page allocating apparatus according to an embodiment of the present application;
fig. 3 is a block diagram illustrating a structure of a computer device according to an embodiment of the present application.
The implementation, functional features and advantages of the objectives of the present application will be further explained with reference to the accompanying drawings.
Detailed Description
In order to make the objects, technical solutions and advantages of the present application more apparent, the present application is described in further detail below with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described herein are merely illustrative of the present application and are not intended to limit the present application.
The SEO operation principle is introduced, the SEO is an English abbreviation of Search engine optimization (Search engine Opt imi zat i on), and Chinese is translated into 'Search engine optimization'. On the basis of knowing a natural ranking mechanism of a search engine, internal and external adjustment and optimization are carried out on a website, the keyword natural ranking of the website in the search engine is improved, more flow is obtained, and therefore the expected targets of website sales and brand construction are achieved. At present, a plurality of search engines are available in the market, such as Baidu search, dog search, Mars, 360 and the like, the search engines build search result indexes by crawling keywords, description information and text contents in a page head area of a website, and the weights of each page and a website in the search engines are calculated according to the relevance of the page hot words and current affair hot words, the direction of a weighted website and whether content participles are reasonable, wherein the larger the weight is, the more the page rank in the search results is, and the more the rank is, the more the page flow can be obtained.
Typical industry solutions to improve SEO efficiency include purchasing weight web site directions or purchases of search keyword traffic:
the point of purchasing the weight website refers to that a website with a weight value which is closer to that of the previous website purchased in a search engine is placed in a link of an enterprise website, and the site weight of the current enterprise website is indirectly improved.
The step of buying the search keyword flow refers to directly buying the keyword flow of a search engine, and when a user searches for a corresponding keyword from the search engine, the user directly jumps to a corresponding conversion page from the search engine.
Therefore, in order to solve the technical problems that an existing user is difficult to quickly acquire a required target website, and is not beneficial to popularization of the target websites and low in search efficiency, the present application provides a page configuration method, which uses a server as an execution main body and refers to fig. 1, wherein in one embodiment, the page configuration method includes the following steps:
s11, acquiring a page to be configured of a target website and page content of the page to be configured, and performing word segmentation on the page content of the page to be configured to obtain keywords of the page content;
s12, acquiring a page head area of the page to be configured, and configuring the keyword in a keyword column of the page head area;
s13, obtaining current affair hot words through a search engine, adding the current affair hot words into the keyword column at the head of the page, and completing the configuration of the page to be configured.
As described in step S11, in this step, a to-be-configured page of a target website is obtained, where the target website includes a website of a desired configuration page, such as an enterprise website, a video website, or a news website, so that the website after page configuration can be displayed in the front of a web page. Then obtaining page content of a page to be configured, segmenting the page content of the page to be configured to obtain a plurality of words, calculating word frequency of each word, screening out words with the word frequency arranged in the front row as target words, extracting subject information of the page content, calculating matching degree of the subject information and the target words, and obtaining the target word with the highest matching degree with the subject information from the target words as a keyword. And the word frequency is the frequency of each word appearing in the page content.
The page content includes text information, video information, picture information, and the like in the page. When the page content is video information or picture information, identifying content information described by the video information or the picture information, wherein the content information is text information, and segmenting the text information by a word segmentation tool to obtain keywords of the page content.
As described in the above step S12, after the keyword is acquired, the keyword is arranged in the keyword column in the page header area. The page head area is used for describing main information of page content, the keyword column is used for storing keywords of the page content, the page content is further highly summarized, when a user searches a page, the keywords of the keyword column matched with search terms are inquired according to the search terms input by the user in a search application, and then the page containing the keywords is searched.
In an embodiment, after step S12, the method may further include:
and intercepting the text content of the page, and putting the intercepted text content into a description column at the head of the page to be used as the page description of the page to be configured.
In the step, the text content of the page is obtained, and a part of description columns which are put into a head area of the page are intercepted from the text content of the page and are used as the page description of the page to be configured. The description column is a brief introduction describing the content of the page, and may be a sentence or a paragraph, so as to facilitate the user to further understand the content of the page.
As described in step S13, the current affair hot words in the network can be obtained regularly through a search engine or a web data crawling tool, and the current affair hot words are added to the keyword column in the head area of the page, so as to complete the configuration of the page to be configured, so that when a user needs to search a target website, keywords and the current affair hot words of the target website are matched, and the target websites containing the current affair hot words in the head area of the page can be displayed in the front, which is beneficial for the user to quickly obtain the required target websites, and is also beneficial for popularization of the target websites, and the search efficiency is improved.
According to the page configuration method, after a page to be configured of a target website and page content of the page to be configured are obtained, the page content of the page to be configured is segmented to obtain keywords of the page content; acquiring a page head area of a page to be configured, configuring a keyword in a keyword column of the page head area, and matching the keyword to the page to be configured of the target website when searching the keyword; the current affair hot words are obtained through the search engine and added into the keyword column at the head of the page, so that when a user needs to search a target website, the target website containing the current affair hot words in the head area of the page is displayed in the front row, the user can quickly obtain the required target website, the popularization of the target websites is facilitated, and the search efficiency is improved.
In an embodiment, when there are a plurality of current affair hotspot words, in step S12, the step of adding the current affair hotspot words to the keyword column at the head of the page may specifically include:
s121, obtaining the search quantity of the plurality of current affair hot words;
s122, sequentially setting weights for the current affair hot words according to the search amount, and screening out the current affair hot words with the weights larger than a preset weight to obtain target current affair hot words; wherein the magnitude of the weight is positively correlated with the search quantity;
s123, adding the target current affair hot words into the keyword column in the head area of the page.
As described in step S121, when adding a current affair hot word to a keyword column in a head area of a page, a search amount of a plurality of current affair hot words may be obtained from a database, where the search amount is a number of times each current affair hot word is searched, and when a user inputs a search word in a search field to perform a search, a search of the search word is performed.
As described in step S122, weights are sequentially set for the current affair hot words according to the search volume, the current affair hot words with the weights greater than the preset weight are screened out, and the current affair hot words with the weights greater than the preset weight are used as the target current affair hot words. The preset weight can be customized according to actual needs, and is not specifically limited herein. For example, the weight of the current hotspot word with the search volume of 0-100 times can be set to 0.5, the weight of the current hotspot word with the search volume of 100-1000 times can be set to 0.6, the weight of the current hotspot word with the search volume of 1000-10000 times can be set to 0.7, the weight of the current hotspot word with the search volume of more than 10000 times can be set to 0.8, the preset weight can be defined to 0.7, and the current hotspot word with the search volume of more than 10000 times can be taken as the target current hotspot word, so that the corresponding weight can be quickly matched according to the section in which the search volume falls, and the target current hotspot word at the next hottest can be simply and accurately determined based on the weight.
As described in step S123, after the target current affair hot word is determined, the target current affair hot word is added to the keyword column in the head area of the page, so that the page containing the target current affair hot word can be quickly queried during searching, and the searching efficiency is improved.
In an embodiment, when there are a plurality of current affair hotspot words, in step S12, the step of adding the current affair hotspot words to the keyword column at the head of the page may specifically include:
a121, respectively converting the plurality of current affair hot words into vectors to obtain a plurality of first word vectors;
a122, converting the keywords in the keyword column into vectors to obtain second word vectors;
a123, respectively calculating the similarity of the plurality of first word vectors and the second word vectors, and taking the current affair hot words corresponding to the first word vectors with the similarity greater than the preset similarity as target current affair hot words;
and A124, adding the target current affair hot word into a keyword column in the head area of the page.
In this embodiment, a word vector tool may be further used to convert a plurality of current affair hot words into word vectors respectively, so as to obtain first word vectors corresponding to the plurality of current affair hot words respectively, and a word vector tool is also used to convert keywords in a keyword column into word vectors so as to obtain second word vectors corresponding to the keywords, and then similarity between the plurality of first word vectors and the second word vectors is calculated respectively, and the current affair hot words corresponding to the first word vectors with similarity greater than a preset similarity are used as target current affair hot words, and the target current affair hot words are added to the keyword column in the head area of the page, so as to screen current affair hot words from the current affair hot words, so that when a user searches for the current affair hot words, a page with current affair hot words searched by the user can be obtained, and the page content of the page has a greater association with the current affair hot words, the method and the device can ensure the accuracy of page search while realizing page promotion.
For example, when the keyword in the keyword column includes "basketball", and when the current multiple-time-of-affair hot words are "yaoming", "lina" and "lindane", when the similarity between the keyword and the multiple-time-of-affair hot words is calculated, because the time-of-affair hot word of "yaoming" is a basketball player and is closer to the keyword of "basketball", the time-of-affair hot word of "yaoming" is taken as the target time-of-affair hot word and is added to the keyword column, so that when the user generally searches for "yaoming", the page containing "yaoming" in the keyword column can be matched, and the website popularization of the page is facilitated.
Preferably, the word vector tool is a word2vec word vector tool, the word2vec word vector tool is an efficient tool for representing words as real-valued vectors, processing of text contents is simplified into vector operation in a K-dimensional vector space through training by utilizing a deep learning thought, and similarity on the vector space can be used for representing similarity on text semantics.
In addition, the Word vectors output by the Word2vec Word vector tool can be used to do many natural language processing related tasks, such as clustering, finding synonyms, part-of-speech analysis, and so on. If a Word is taken as a feature, Word2vec Word vectors can map the feature to K-dimensional vector space, and can seek deeper feature representation for text data.
In an embodiment, in step S13, the step of obtaining the current affair hotspot words through the search engine may specifically include:
s131, acquiring hot webpage resources through a search engine, and determining a hot event in the hot webpage resources;
s132, performing event semantic understanding on the hot event, and determining an original event of the hot event;
s133, extracting the keywords of the original event, and screening current affair hot words from the keywords of the original event.
In the embodiment, when a current event hot word is acquired, hot web page resources ranked in the front are captured, web page contents of the hot web page resources are extracted, a hot event in the hot web page resources, namely event contents described by the hot web page resources, is determined according to the web page contents of the hot web page resources, then the hot event is subjected to event semantic understanding to obtain an original event, keywords of the original event are extracted, and the current event hot word is screened from the original event according to the keywords of the original event, so that the current event hot word is accurately obtained. The hot web page resource is an integration of all web page contents displayed on a website home page, and comprises texts, images, audio, video, interactive forms and the like.
Some hot events may not satisfy the subsequent screening condition, some hot events may satisfy the subsequent screening condition, and those hot events that satisfy the subsequent screening condition are not real hot events. Therefore, the hot events need to be further processed to obtain the original events with the highest authenticity after screening and confirmation, and current affair hot words are extracted from the original events.
In an embodiment, in step S131, the step of determining the hot event in the hot web page resource may specifically include:
s1311, obtaining each search result item in the hot webpage resource, and obtaining user behavior data corresponding to the search result items; the user behavior data comprises browsing content of a user in a page corresponding to the search result item and browsing time corresponding to the browsing content;
s1312, determining browsing duration corresponding to each browsing content according to the browsing content and the corresponding browsing time;
s1313, determining the hot event corresponding to the search result item according to the browsing duration corresponding to each browsing content.
In this embodiment, by obtaining the user behavior data corresponding to the search result item, the browsing duration corresponding to each browsing content is determined according to the browsing content in the user behavior data and the corresponding browsing time, for example, when a large number of users are in a 9: 30-11:30, when the user visits the live broadcast page and watches NBA in the current time period, the browsing time of the user is 2 hours.
And then acquiring browsing contents and search result items with the browsing duration arranged in the front, and taking the hot events corresponding to the search result items with the browsing duration arranged in the front as the hot events in the hot webpage resources.
The search result item is obtained by the user through the search keyword in the search process. For example, after the user inputs the search keyword "football" at the client, the received search result page interface returned by the search server has ten search result items on the search result page, which are ordered from top to bottom.
When user behavior data is acquired, behavior data generated when a user browses a page can be acquired by embedding points, such as buttons, text boxes, menus and the like, in elements in the page visited by the user, where the acquired behavior data is usually in a log format, and basic elements for recording the behavior data can adopt a mode of combining tasks, time, places and behaviors, that is, what behavior is generated when the user uses what mode at what time and place, and since the user behavior may include various behavior data, such as user page visiting behavior, user page element clicking behavior, browsing content and browsing time corresponding to the browsing content.
In an embodiment, in step S1313, the step of determining the hotspot event corresponding to the search result item according to the browsing duration corresponding to each browsing content may specifically include:
s13131, counting browsing frequency corresponding to each browsing content;
s13132, performing weighted calculation according to the browsing duration and the browsing frequency corresponding to each browsing content, and determining a heat value corresponding to each browsing content;
s13133, obtaining browsing contents with the heat values arranged at the front N positions to obtain target browsing contents; wherein N is a positive integer greater than 0;
s13134, determining a hot event corresponding to the search result item according to the target browsing content.
In this embodiment, weighted summation calculation may be performed according to the browsing duration and the browsing frequency corresponding to the browsing content to obtain a popularity value of each browsing content, where the popularity value is used to reflect a popularity condition of each browsing content; and then, taking the hot event corresponding to the browsing content with the hot value ranked at the top N as the hot event in the hot webpage resource, thereby accurately determining the hot event. For example, the weight of the browsing duration may be set to 0.8, the browsing frequency may be set to 0.6, and when the browsing duration of the target browsing content browsed by the user is 10 hours and the browsing frequency is 5 times, the corresponding heat value is 0.8 × 10+0.6 × 5 — 11.
In an embodiment, in step S13, after the step of completing the configuration of the page to be configured, the method may further include:
s15, acquiring browsed data of the configured page within a preset time period;
s16, evaluating the configured page according to the browsing data to generate an evaluation result;
s17, adjusting the current affair hot words in the keyword column in the page head area according to the evaluation result.
In this embodiment, after the configured page runs for a period of time, browsing data of the configured page in a preset time period can be acquired in real time, the configured page is evaluated according to the browsing data, an evaluation result is generated, and current-affair hot words added to the keyword column in the head region of the page are adjusted according to the evaluation result.
For example, when the browsing volume of the configured page is low within a preset time period, the current affair hotspot words added to the keyword column in the head area of the page need to be replaced. In addition, the current affair hot words can be acquired in real time, the current affair hot words in the keyword column of the head area of the page are updated regularly, and the searching effect is improved.
Referring to fig. 2, an embodiment of the present application further provides a page configuration apparatus, including:
the acquisition module 11 is configured to acquire a page to be configured of a target website and page content of the page to be configured, and perform word segmentation on the page content of the page to be configured to obtain keywords of the page content;
the configuration module 12 is configured to obtain a page head area of the page to be configured, and configure the keyword in a keyword column of the page head area;
and the completion module 13 is configured to obtain the current affair hot words through a search engine, add the current affair hot words to the keyword column at the head of the page, and complete the configuration of the page to be configured.
In this embodiment, a to-be-configured page of a target website is obtained, where the target website includes a website of a desired configuration page, such as an enterprise website, a video website, or a news website, so that the website after page configuration can be displayed in the front of a web page. Then obtaining page content of a page to be configured, segmenting the page content of the page to be configured to obtain a plurality of words, calculating word frequency of each word, screening out words with the word frequency arranged in the front row as target words, extracting subject information of the page content, calculating matching degree of the subject information and the target words, and obtaining the target word with the highest matching degree with the subject information from the target words as a keyword. And the word frequency is the frequency of each word appearing in the page content.
The page content includes text information, video information, picture information, and the like in the page. When the page content is video information or picture information, identifying content information described by the video information or the picture information, wherein the content information is text information, and segmenting the text information by a word segmentation tool to obtain keywords of the page content.
After the keywords are obtained, the keywords are configured in a keyword column of a page head area. The page head area is used for describing main information of page content, the keyword column is used for storing keywords of the page content, the page content is further highly summarized, when a user searches a page, the keywords of the keyword column matched with search terms are inquired according to the search terms input by the user in a search application, and then the page containing the keywords is searched.
In addition, current affair hot words in a network can be obtained through a search engine or a webpage data crawling tool regularly, and the current affair hot words are added into the keyword columns in the head area of the page, so that the configuration of the page to be configured is completed, when a user needs to search a target website, the keywords of the target website are matched with the current affair hot words, the target website containing the current affair hot words in the head area of the page can be displayed in the front row, the user can acquire the required target website quickly, the popularization of the target websites is facilitated, and the search efficiency is improved.
As described above, it can be understood that each component of the page configuration apparatus provided in this application may implement the function of any one of the above-described page configuration methods, and a detailed structure is not described again.
Referring to fig. 3, a computer device, which may be a server and whose internal structure may be as shown in fig. 3, is also provided in the embodiment of the present application. The computer device includes a processor, a memory, a network interface, and a database connected by a system bus. Wherein the computer designed processor is used to provide computational and control capabilities. The memory of the computer device comprises a storage medium and an internal memory. The storage medium stores an operating system, a computer program, and a database. The memory provides an environment for the operation of the operating system and computer programs in the storage medium. The database of the computer device is used for storing data such as page content, current affair hot words, user behavior data and the like. The network interface of the computer device is used for communicating with an external terminal through a network connection. The computer program is executed by a processor to implement a page configuration method.
The processor executes the page configuration method, and the method comprises the following steps:
acquiring a page to be configured of a target website and page content of the page to be configured, and performing word segmentation on the page content of the page to be configured to obtain keywords of the page content;
acquiring a page head area of the page to be configured, and configuring the keyword in a keyword column of the page head area;
and acquiring current affair hot words through a search engine, and adding the current affair hot words into the keyword column at the head of the page to complete the configuration of the page to be configured.
An embodiment of the present application further provides a computer-readable storage medium, on which a computer program is stored, and the computer program, when executed by a processor, implements a page configuration method, including the steps of:
acquiring a page to be configured of a target website and page content of the page to be configured, and performing word segmentation on the page content of the page to be configured to obtain keywords of the page content;
acquiring a page head area of the page to be configured, and configuring the keyword in a keyword column of the page head area;
and acquiring current affair hot words through a search engine, and adding the current affair hot words into the keyword column at the head of the page to complete the configuration of the page to be configured.
It will be understood by those skilled in the art that all or part of the processes of the methods of the embodiments described above can be implemented by a computer program, which can be stored in a computer-readable storage medium, and can include the processes of the embodiments of the methods described above when the computer program is executed. Any reference to memory, storage, database, or other medium provided herein and used in the examples may include non-volatile and/or volatile memory. Non-volatile memory can include read-only memory (ROM), Programmable ROM (PROM), Electrically Programmable ROM (EPROM), Electrically Erasable Programmable ROM (EEPROM), or flash memory. Volatile memory can include Random Access Memory (RAM) or external cache memory. By way of illustration and not limitation, RAM is available in a variety of forms such as Static RAM (SRAM), Dynamic RAM (DRAM), Synchronous DRAM (SDRAM), double-rate SDRAM (SSRSDRAM), Enhanced SDRAM (ESDRAM), Synch Lnk DRAM (SLDRAM), Rambus Direct RAM (RDRAM), direct bused dynamic RAM (DRDRAM), and bused dynamic RAM (RDRAM).
To sum up, the most beneficial effect of this application lies in:
according to the page configuration method, the page configuration device, the page configuration equipment and the storage medium, after a page to be configured of a target website and page content of the page to be configured are obtained, the page content of the page to be configured is subjected to word segmentation to obtain keywords of the page content; acquiring a page head area of a page to be configured, configuring a keyword in a keyword column of the page head area, and matching the keyword to the page to be configured of the target website when searching the keyword; the current affair hot words are obtained through the search engine and added into the keyword column at the head of the page, and the configuration of the page to be configured is completed, so that the target websites containing the current affair hot words in the head area of the page are displayed in the front row when a user needs to search the target websites by adding the current affair hot words into the page, the user can acquire the required target websites quickly, the popularization of the target websites is facilitated, and the search efficiency is improved.
It should be noted that, in this document, the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, apparatus, article, or method that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, apparatus, article, or method. Without further limitation, an element defined by the phrase "comprising an … …" does not exclude the presence of other like elements in a process, apparatus, article, or method that includes the element.
The above description is only a preferred embodiment of the present application, and not intended to limit the scope of the present application, and all modifications of equivalent structures and equivalent processes, which are made by the contents of the specification and the drawings of the present application, or which are directly or indirectly applied to other related technical fields, are also included in the scope of the present application.
Claims (10)
1. A page configuration method is characterized by comprising the following steps:
acquiring a page to be configured of a target website and page content of the page to be configured, and performing word segmentation on the page content of the page to be configured to obtain keywords of the page content;
acquiring a page head area of the page to be configured, and configuring the keyword in a keyword column of the page head area;
and acquiring current affair hot words through a search engine, and adding the current affair hot words into the keyword column at the head of the page to complete the configuration of the page to be configured.
2. The method according to claim 1, wherein when there are a plurality of the current affair hotspot words, the step of adding the current affair hotspot words to the keyword column at the head of the page comprises:
acquiring the search quantity of the plurality of current affair hotspot words;
setting weights for the current affair hot words in sequence according to the search amount, and screening out the current affair hot words with the weights larger than the preset weights to obtain target current affair hot words; wherein the magnitude of the weight is positively correlated with the search quantity;
and adding the target current affair hot words into the keyword column in the head area of the page.
3. The method according to claim 1, wherein when there are a plurality of the current affair hotspot words, the step of adding the current affair hotspot words to the keyword column at the head of the page comprises:
respectively converting the plurality of current affair hot words into vectors to obtain a plurality of first word vectors;
converting the keywords in the keyword column into vectors to obtain second word vectors;
respectively calculating the similarity of the plurality of first word vectors and the second word vectors, and taking the current affair hot words corresponding to the first word vectors with the similarity greater than the preset similarity as target current affair hot words;
and adding the target current affair hot words into the keyword column in the head area of the page.
4. The method according to claim 1, wherein after the step of completing the configuration of the page to be configured, the method further comprises:
acquiring browsing data browsed by the configured page within a preset time period;
evaluating the configured page according to the browsing data to generate an evaluation result;
and adjusting current affair hot words in the keyword column configured in the page head area according to the evaluation result.
5. The method of claim 1, wherein the step of obtaining the current affair hotspot words by a search engine comprises:
acquiring hot web page resources through a search engine, and determining a hot event in the hot web page resources;
performing event semantic understanding on the hot event, and determining an original event of the hot event;
extracting the keywords of the original event, and screening current affair hot words from the keywords of the original event.
6. The method of claim 5, wherein the step of determining the hotspot event in the hotspot web page resource comprises:
acquiring each search result item in the hot webpage resource, and acquiring user behavior data corresponding to the search result item; the user behavior data comprises browsing content of a user in a page corresponding to the search result item and browsing time corresponding to the browsing content;
determining browsing duration corresponding to each browsing content according to the browsing content and the corresponding browsing time;
and determining the hot event corresponding to the search result item according to the browsing duration corresponding to each browsing content.
7. The method according to claim 6, wherein the step of determining the hotspot event corresponding to the search result item according to the browsing duration corresponding to each browsing content comprises:
counting browsing frequency corresponding to each browsing content;
performing weighted calculation according to the browsing duration and the browsing frequency corresponding to each browsing content to determine a heat value corresponding to each browsing content;
acquiring browsing contents with the hot values arranged at the front N bits to obtain target browsing contents; wherein N is a positive integer greater than 0;
and determining a hot event corresponding to the search result item according to the target browsing content.
8. A page configuring apparatus, comprising:
the acquisition module is used for acquiring a page to be configured of a target website and page content of the page to be configured, and performing word segmentation on the page content of the page to be configured to obtain keywords of the page content;
the configuration module is used for acquiring a page head area of the page to be configured and configuring the keyword in a keyword column of the page head area;
and the completion module is used for acquiring the current affair hot words through a search engine, adding the current affair hot words into the keyword column at the head of the page and completing the configuration of the page to be configured.
9. A computer device, comprising:
a processor;
a memory;
a computer program, wherein the computer program is stored in the memory and configured to be executed by the processor, the computer program being configured to perform the page configuration method according to any of claims 1 to 7.
10. A computer-readable storage medium, characterized in that a computer program is stored on the computer-readable storage medium, which computer program, when being executed by a processor, implements the page configuration method of any one of claims 1 to 7.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110720113.9A CN113392355A (en) | 2021-06-28 | 2021-06-28 | Page configuration method, device, equipment and storage medium |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110720113.9A CN113392355A (en) | 2021-06-28 | 2021-06-28 | Page configuration method, device, equipment and storage medium |
Publications (1)
Publication Number | Publication Date |
---|---|
CN113392355A true CN113392355A (en) | 2021-09-14 |
Family
ID=77624227
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202110720113.9A Pending CN113392355A (en) | 2021-06-28 | 2021-06-28 | Page configuration method, device, equipment and storage medium |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN113392355A (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN114327441A (en) * | 2021-12-24 | 2022-04-12 | 中国联合网络通信集团有限公司 | Webpage making processing method, device, equipment and storage medium |
Citations (14)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20110270678A1 (en) * | 2010-05-03 | 2011-11-03 | Drummond Mark E | System and method for using real-time keywords for targeting advertising in web search and social media |
CN103577392A (en) * | 2013-11-15 | 2014-02-12 | 北京奇虎科技有限公司 | Keyword pushing method and device based on current browse webpage |
CN105488205A (en) * | 2015-12-09 | 2016-04-13 | 百度在线网络技术(北京)有限公司 | Page generation method and page generation apparatus |
CN105630871A (en) * | 2015-12-16 | 2016-06-01 | 广州神马移动信息科技有限公司 | Search result display method and device as well as search system |
US20160232162A1 (en) * | 2013-09-13 | 2016-08-11 | Longtail Ux Pty Ltd | Website traffic optimization |
CN106407344A (en) * | 2016-09-06 | 2017-02-15 | 努比亚技术有限公司 | Method and system for generating search engine optimization label |
CN109271574A (en) * | 2018-08-28 | 2019-01-25 | 麒麟合盛网络技术股份有限公司 | A kind of hot word recommended method and device |
CN109299413A (en) * | 2018-09-13 | 2019-02-01 | 北京搜狗科技发展有限公司 | A kind of data processing method, device and electronic equipment |
CN110502687A (en) * | 2019-08-22 | 2019-11-26 | 山东开创云软件有限公司 | A kind of web information flow method and apparatus |
CN110717092A (en) * | 2018-06-27 | 2020-01-21 | 北京京东尚科信息技术有限公司 | Method, system, device and storage medium for matching objects for articles |
CN111368185A (en) * | 2020-02-25 | 2020-07-03 | 北京字节跳动网络技术有限公司 | Data display method and device, storage medium and electronic equipment |
CN112328872A (en) * | 2020-10-27 | 2021-02-05 | 北京字节跳动网络技术有限公司 | Information display method, information search method and device |
CN112579941A (en) * | 2020-12-17 | 2021-03-30 | 京东数字科技控股股份有限公司 | Information processing method, device, equipment and storage medium |
CN112699314A (en) * | 2020-12-25 | 2021-04-23 | 百度在线网络技术(北京)有限公司 | Hot event determination method and device, electronic equipment and storage medium |
-
2021
- 2021-06-28 CN CN202110720113.9A patent/CN113392355A/en active Pending
Patent Citations (14)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20110270678A1 (en) * | 2010-05-03 | 2011-11-03 | Drummond Mark E | System and method for using real-time keywords for targeting advertising in web search and social media |
US20160232162A1 (en) * | 2013-09-13 | 2016-08-11 | Longtail Ux Pty Ltd | Website traffic optimization |
CN103577392A (en) * | 2013-11-15 | 2014-02-12 | 北京奇虎科技有限公司 | Keyword pushing method and device based on current browse webpage |
CN105488205A (en) * | 2015-12-09 | 2016-04-13 | 百度在线网络技术(北京)有限公司 | Page generation method and page generation apparatus |
CN105630871A (en) * | 2015-12-16 | 2016-06-01 | 广州神马移动信息科技有限公司 | Search result display method and device as well as search system |
CN106407344A (en) * | 2016-09-06 | 2017-02-15 | 努比亚技术有限公司 | Method and system for generating search engine optimization label |
CN110717092A (en) * | 2018-06-27 | 2020-01-21 | 北京京东尚科信息技术有限公司 | Method, system, device and storage medium for matching objects for articles |
CN109271574A (en) * | 2018-08-28 | 2019-01-25 | 麒麟合盛网络技术股份有限公司 | A kind of hot word recommended method and device |
CN109299413A (en) * | 2018-09-13 | 2019-02-01 | 北京搜狗科技发展有限公司 | A kind of data processing method, device and electronic equipment |
CN110502687A (en) * | 2019-08-22 | 2019-11-26 | 山东开创云软件有限公司 | A kind of web information flow method and apparatus |
CN111368185A (en) * | 2020-02-25 | 2020-07-03 | 北京字节跳动网络技术有限公司 | Data display method and device, storage medium and electronic equipment |
CN112328872A (en) * | 2020-10-27 | 2021-02-05 | 北京字节跳动网络技术有限公司 | Information display method, information search method and device |
CN112579941A (en) * | 2020-12-17 | 2021-03-30 | 京东数字科技控股股份有限公司 | Information processing method, device, equipment and storage medium |
CN112699314A (en) * | 2020-12-25 | 2021-04-23 | 百度在线网络技术(北京)有限公司 | Hot event determination method and device, electronic equipment and storage medium |
Non-Patent Citations (1)
Title |
---|
郑杰: "《SEO搜索引擎优化 原理+方法+实战》", 31 January 2017, 人民邮电出版社 * |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN114327441A (en) * | 2021-12-24 | 2022-04-12 | 中国联合网络通信集团有限公司 | Webpage making processing method, device, equipment and storage medium |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US9489401B1 (en) | Methods and systems for object recognition | |
US10210179B2 (en) | Dynamic feature weighting | |
JP6423845B2 (en) | Method and system for dynamically ranking images to be matched with content in response to a search query | |
US8301616B2 (en) | Search equalizer | |
JP5436665B2 (en) | Classification of simultaneously selected images | |
US8001152B1 (en) | Method and system for semantic affinity search | |
US7958109B2 (en) | Intent driven search result rich abstracts | |
US8762326B1 (en) | Personalized hot topics | |
EP3513328A1 (en) | Method and apparatus for ranking electronic information by similarity association | |
CN104537065A (en) | Search result pushing method and system | |
CN106445963B (en) | Advertisement index keyword automatic generation method and device of APP platform | |
US9639627B2 (en) | Method to search a task-based web interaction | |
CN112740202A (en) | Performing image search using content tags | |
CN111159563A (en) | Method, device and equipment for determining user interest point information and storage medium | |
JP2017220204A (en) | Method and system for matching images with content using whitelists and blacklists in response to search query | |
CN112597274A (en) | Document determination method, device, equipment and storage medium based on BM25 algorithm | |
Tang et al. | Relevant feedback based accurate and intelligent retrieval on capturing user intention for personalized websites | |
CN106919593B (en) | Searching method and device | |
CN104933099B (en) | Method and device for providing target search result for user | |
CN113392355A (en) | Page configuration method, device, equipment and storage medium | |
CN107665442B (en) | Method and device for acquiring target user | |
KR20140091375A (en) | System and method for searching semantic contents using user query expansion | |
CN111753161B (en) | Improved PageRank-based web crawler method and system | |
KR101663359B1 (en) | Method and apparatus for providing updated news contents | |
CN110825976B (en) | Website page detection method and device, electronic equipment and medium |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
WD01 | Invention patent application deemed withdrawn after publication |
Application publication date: 20210914 |
|
WD01 | Invention patent application deemed withdrawn after publication |