CN113392355A - Page configuration method, device, equipment and storage medium - Google Patents

Page configuration method, device, equipment and storage medium Download PDF

Info

Publication number
CN113392355A
CN113392355A CN202110720113.9A CN202110720113A CN113392355A CN 113392355 A CN113392355 A CN 113392355A CN 202110720113 A CN202110720113 A CN 202110720113A CN 113392355 A CN113392355 A CN 113392355A
Authority
CN
China
Prior art keywords
page
browsing
hot
words
content
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202110720113.9A
Other languages
Chinese (zh)
Inventor
余鸿飞
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Weikun Shanghai Technology Service Co Ltd
Original Assignee
Weikun Shanghai Technology Service Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Weikun Shanghai Technology Service Co Ltd filed Critical Weikun Shanghai Technology Service Co Ltd
Priority to CN202110720113.9A priority Critical patent/CN113392355A/en
Publication of CN113392355A publication Critical patent/CN113392355A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/958Organisation or management of web site content, e.g. publishing, maintaining pages or automatic linking
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9535Search customisation based on user profiles and personalisation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/279Recognition of textual entities
    • G06F40/284Lexical analysis, e.g. tokenisation or collocates
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/279Recognition of textual entities
    • G06F40/289Phrasal analysis, e.g. finite state techniques or chunking
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/44Arrangements for executing specific programs
    • G06F9/445Program loading or initiating
    • G06F9/44505Configuring for program initiating, e.g. using registry, configuration files
    • G06F9/4451User profiles; Roaming

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Software Systems (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Data Mining & Analysis (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Information Transfer Between Computers (AREA)

Abstract

The application relates to the technical field of data processing, and provides a page configuration method, a device, equipment and a storage medium, wherein the method comprises the following steps: acquiring a page to be configured of a target website and page content of the page to be configured, and performing word segmentation on the page content of the page to be configured to obtain keywords of the page content; acquiring a page head area of a page to be configured, and configuring a keyword in a keyword column of the page head area; and acquiring current affair hot words through a search engine, and adding the current affair hot words into the keyword column at the head of the page to complete the configuration of the page to be configured. According to the page configured by the application, when a user needs to search the target website, the target website containing the current affair hot words in the head area of the page is displayed in the front row, so that the user can quickly acquire the required target website, and the searching efficiency is improved.

Description

Page configuration method, device, equipment and storage medium
Technical Field
The present application relates to the field of data processing technologies, and in particular, to a page configuration method, apparatus, device, and storage medium.
Background
With the development of internet technology and the development of search engine technology, users can query information through search applications, such as querying the pronunciation, meaning of a word, profile of a person, and the like.
When a user needs to search a target website, a plurality of search results are usually displayed in a search result display page by a search application, but the search results are generally displayed in the front columns of the page contents which are current affair hotspots, and the page contents of the target website are difficult to be displayed in the front columns of the website, such as an enterprise website, so that the user is not favorable for quickly acquiring the required target website, the popularization of the websites is not favorable, and the search efficiency is low.
Disclosure of Invention
The present application mainly aims to provide a page configuration method, device, equipment and storage medium, so as to improve the search efficiency.
In order to achieve the above object, the present application provides a page configuration method, which includes the following steps:
acquiring a page to be configured of a target website and page content of the page to be configured, and performing word segmentation on the page content of the page to be configured to obtain keywords of the page content;
acquiring a page head area of the page to be configured, and configuring the keyword in a keyword column of the page head area;
and acquiring current affair hot words through a search engine, and adding the current affair hot words into the keyword column at the head of the page to complete the configuration of the page to be configured.
Preferably, when there are a plurality of current affair hotspot words, the step of adding the current affair hotspot words to the keyword column at the head of the page includes:
acquiring the search quantity of the plurality of current affair hotspot words;
setting weights for the current affair hot words in sequence according to the search amount, and screening out the current affair hot words with the weights larger than the preset weights to obtain target current affair hot words; wherein the magnitude of the weight is positively correlated with the search quantity;
and adding the target current affair hot words into the keyword column in the head area of the page.
Preferably, when there are a plurality of current affair hotspot words, the step of adding the current affair hotspot words to the keyword column at the head of the page includes:
respectively converting the plurality of current affair hot words into vectors to obtain a plurality of first word vectors;
converting the keywords in the keyword column into vectors to obtain second word vectors;
respectively calculating the similarity of the plurality of first word vectors and the second word vectors, and taking the current affair hot words corresponding to the first word vectors with the similarity greater than the preset similarity as target current affair hot words;
and adding the target current affair hot words into the keyword column in the head area of the page.
Further, after the step of completing the configuration of the page to be configured, the method further includes:
acquiring browsing data browsed by the configured page within a preset time period;
evaluating the configured page according to the browsing data to generate an evaluation result;
and adjusting current affair hot words in the keyword column configured in the page head area according to the evaluation result.
Preferably, the step of obtaining the current affair hotspot words by the search engine includes:
acquiring hot web page resources through a search engine, and determining a hot event in the hot web page resources;
performing event semantic understanding on the hot event, and determining an original event of the hot event;
extracting the keywords of the original event, and screening current affair hot words from the keywords of the original event.
Preferably, the step of determining the hot event in the hot web page resource includes:
acquiring each search result item in the hot webpage resource, and acquiring user behavior data corresponding to the search result item; the user behavior data comprises browsing content of a user in a page corresponding to the search result item and browsing time corresponding to the browsing content;
determining browsing duration corresponding to each browsing content according to the browsing content and the corresponding browsing time;
and determining the hot event corresponding to the search result item according to the browsing duration corresponding to each browsing content.
Preferably, the step of determining the hot event corresponding to the search result item according to the browsing duration corresponding to each browsing content includes:
counting browsing frequency corresponding to each browsing content;
performing weighted calculation according to the browsing duration and the browsing frequency corresponding to each browsing content to determine a heat value corresponding to each browsing content;
acquiring browsing contents with the hot values arranged at the front N bits to obtain target browsing contents; wherein N is a positive integer greater than 0;
and determining a hot event corresponding to the search result item according to the target browsing content.
The present application further provides a page configuration apparatus, which includes:
the acquisition module is used for acquiring a page to be configured of a target website and page content of the page to be configured, and performing word segmentation on the page content of the page to be configured to obtain keywords of the page content;
the configuration module is used for acquiring a page head area of the page to be configured and configuring the keyword in a keyword column of the page head area;
and the completion module is used for acquiring the current affair hot words through a search engine, adding the current affair hot words into the keyword column at the head of the page and completing the configuration of the page to be configured.
The present application further provides a computer device comprising a memory and a processor, the memory storing a computer program, the processor implementing the steps of any of the above methods when executing the computer program.
The present application also provides a computer-readable storage medium having stored thereon a computer program which, when executed by a processor, performs the steps of any of the methods described above.
According to the page configuration method, the page configuration device, the page configuration equipment and the storage medium, after a page to be configured of a target website and page content of the page to be configured are obtained, the page content of the page to be configured is subjected to word segmentation to obtain keywords of the page content; acquiring a page head area of a page to be configured, configuring a keyword in a keyword column of the page head area, and matching the keyword to the page to be configured of the target website when searching the keyword; the current affair hot words are obtained through the search engine and added into the keyword column at the head of the page, and the configuration of the page to be configured is completed, so that the target websites containing the current affair hot words in the head area of the page are displayed in the front row when a user needs to search the target websites by adding the current affair hot words into the page, the user can acquire the required target websites quickly, the popularization of the target websites is facilitated, and the search efficiency is improved.
Drawings
Fig. 1 is a schematic flowchart of a page configuration method according to an embodiment of the present application;
FIG. 2 is a block diagram illustrating a page allocating apparatus according to an embodiment of the present application;
fig. 3 is a block diagram illustrating a structure of a computer device according to an embodiment of the present application.
The implementation, functional features and advantages of the objectives of the present application will be further explained with reference to the accompanying drawings.
Detailed Description
In order to make the objects, technical solutions and advantages of the present application more apparent, the present application is described in further detail below with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described herein are merely illustrative of the present application and are not intended to limit the present application.
The SEO operation principle is introduced, the SEO is an English abbreviation of Search engine optimization (Search engine Opt imi zat i on), and Chinese is translated into 'Search engine optimization'. On the basis of knowing a natural ranking mechanism of a search engine, internal and external adjustment and optimization are carried out on a website, the keyword natural ranking of the website in the search engine is improved, more flow is obtained, and therefore the expected targets of website sales and brand construction are achieved. At present, a plurality of search engines are available in the market, such as Baidu search, dog search, Mars, 360 and the like, the search engines build search result indexes by crawling keywords, description information and text contents in a page head area of a website, and the weights of each page and a website in the search engines are calculated according to the relevance of the page hot words and current affair hot words, the direction of a weighted website and whether content participles are reasonable, wherein the larger the weight is, the more the page rank in the search results is, and the more the rank is, the more the page flow can be obtained.
Typical industry solutions to improve SEO efficiency include purchasing weight web site directions or purchases of search keyword traffic:
the point of purchasing the weight website refers to that a website with a weight value which is closer to that of the previous website purchased in a search engine is placed in a link of an enterprise website, and the site weight of the current enterprise website is indirectly improved.
The step of buying the search keyword flow refers to directly buying the keyword flow of a search engine, and when a user searches for a corresponding keyword from the search engine, the user directly jumps to a corresponding conversion page from the search engine.
Therefore, in order to solve the technical problems that an existing user is difficult to quickly acquire a required target website, and is not beneficial to popularization of the target websites and low in search efficiency, the present application provides a page configuration method, which uses a server as an execution main body and refers to fig. 1, wherein in one embodiment, the page configuration method includes the following steps:
s11, acquiring a page to be configured of a target website and page content of the page to be configured, and performing word segmentation on the page content of the page to be configured to obtain keywords of the page content;
s12, acquiring a page head area of the page to be configured, and configuring the keyword in a keyword column of the page head area;
s13, obtaining current affair hot words through a search engine, adding the current affair hot words into the keyword column at the head of the page, and completing the configuration of the page to be configured.
As described in step S11, in this step, a to-be-configured page of a target website is obtained, where the target website includes a website of a desired configuration page, such as an enterprise website, a video website, or a news website, so that the website after page configuration can be displayed in the front of a web page. Then obtaining page content of a page to be configured, segmenting the page content of the page to be configured to obtain a plurality of words, calculating word frequency of each word, screening out words with the word frequency arranged in the front row as target words, extracting subject information of the page content, calculating matching degree of the subject information and the target words, and obtaining the target word with the highest matching degree with the subject information from the target words as a keyword. And the word frequency is the frequency of each word appearing in the page content.
The page content includes text information, video information, picture information, and the like in the page. When the page content is video information or picture information, identifying content information described by the video information or the picture information, wherein the content information is text information, and segmenting the text information by a word segmentation tool to obtain keywords of the page content.
As described in the above step S12, after the keyword is acquired, the keyword is arranged in the keyword column in the page header area. The page head area is used for describing main information of page content, the keyword column is used for storing keywords of the page content, the page content is further highly summarized, when a user searches a page, the keywords of the keyword column matched with search terms are inquired according to the search terms input by the user in a search application, and then the page containing the keywords is searched.
In an embodiment, after step S12, the method may further include:
and intercepting the text content of the page, and putting the intercepted text content into a description column at the head of the page to be used as the page description of the page to be configured.
In the step, the text content of the page is obtained, and a part of description columns which are put into a head area of the page are intercepted from the text content of the page and are used as the page description of the page to be configured. The description column is a brief introduction describing the content of the page, and may be a sentence or a paragraph, so as to facilitate the user to further understand the content of the page.
As described in step S13, the current affair hot words in the network can be obtained regularly through a search engine or a web data crawling tool, and the current affair hot words are added to the keyword column in the head area of the page, so as to complete the configuration of the page to be configured, so that when a user needs to search a target website, keywords and the current affair hot words of the target website are matched, and the target websites containing the current affair hot words in the head area of the page can be displayed in the front, which is beneficial for the user to quickly obtain the required target websites, and is also beneficial for popularization of the target websites, and the search efficiency is improved.
According to the page configuration method, after a page to be configured of a target website and page content of the page to be configured are obtained, the page content of the page to be configured is segmented to obtain keywords of the page content; acquiring a page head area of a page to be configured, configuring a keyword in a keyword column of the page head area, and matching the keyword to the page to be configured of the target website when searching the keyword; the current affair hot words are obtained through the search engine and added into the keyword column at the head of the page, so that when a user needs to search a target website, the target website containing the current affair hot words in the head area of the page is displayed in the front row, the user can quickly obtain the required target website, the popularization of the target websites is facilitated, and the search efficiency is improved.
In an embodiment, when there are a plurality of current affair hotspot words, in step S12, the step of adding the current affair hotspot words to the keyword column at the head of the page may specifically include:
s121, obtaining the search quantity of the plurality of current affair hot words;
s122, sequentially setting weights for the current affair hot words according to the search amount, and screening out the current affair hot words with the weights larger than a preset weight to obtain target current affair hot words; wherein the magnitude of the weight is positively correlated with the search quantity;
s123, adding the target current affair hot words into the keyword column in the head area of the page.
As described in step S121, when adding a current affair hot word to a keyword column in a head area of a page, a search amount of a plurality of current affair hot words may be obtained from a database, where the search amount is a number of times each current affair hot word is searched, and when a user inputs a search word in a search field to perform a search, a search of the search word is performed.
As described in step S122, weights are sequentially set for the current affair hot words according to the search volume, the current affair hot words with the weights greater than the preset weight are screened out, and the current affair hot words with the weights greater than the preset weight are used as the target current affair hot words. The preset weight can be customized according to actual needs, and is not specifically limited herein. For example, the weight of the current hotspot word with the search volume of 0-100 times can be set to 0.5, the weight of the current hotspot word with the search volume of 100-1000 times can be set to 0.6, the weight of the current hotspot word with the search volume of 1000-10000 times can be set to 0.7, the weight of the current hotspot word with the search volume of more than 10000 times can be set to 0.8, the preset weight can be defined to 0.7, and the current hotspot word with the search volume of more than 10000 times can be taken as the target current hotspot word, so that the corresponding weight can be quickly matched according to the section in which the search volume falls, and the target current hotspot word at the next hottest can be simply and accurately determined based on the weight.
As described in step S123, after the target current affair hot word is determined, the target current affair hot word is added to the keyword column in the head area of the page, so that the page containing the target current affair hot word can be quickly queried during searching, and the searching efficiency is improved.
In an embodiment, when there are a plurality of current affair hotspot words, in step S12, the step of adding the current affair hotspot words to the keyword column at the head of the page may specifically include:
a121, respectively converting the plurality of current affair hot words into vectors to obtain a plurality of first word vectors;
a122, converting the keywords in the keyword column into vectors to obtain second word vectors;
a123, respectively calculating the similarity of the plurality of first word vectors and the second word vectors, and taking the current affair hot words corresponding to the first word vectors with the similarity greater than the preset similarity as target current affair hot words;
and A124, adding the target current affair hot word into a keyword column in the head area of the page.
In this embodiment, a word vector tool may be further used to convert a plurality of current affair hot words into word vectors respectively, so as to obtain first word vectors corresponding to the plurality of current affair hot words respectively, and a word vector tool is also used to convert keywords in a keyword column into word vectors so as to obtain second word vectors corresponding to the keywords, and then similarity between the plurality of first word vectors and the second word vectors is calculated respectively, and the current affair hot words corresponding to the first word vectors with similarity greater than a preset similarity are used as target current affair hot words, and the target current affair hot words are added to the keyword column in the head area of the page, so as to screen current affair hot words from the current affair hot words, so that when a user searches for the current affair hot words, a page with current affair hot words searched by the user can be obtained, and the page content of the page has a greater association with the current affair hot words, the method and the device can ensure the accuracy of page search while realizing page promotion.
For example, when the keyword in the keyword column includes "basketball", and when the current multiple-time-of-affair hot words are "yaoming", "lina" and "lindane", when the similarity between the keyword and the multiple-time-of-affair hot words is calculated, because the time-of-affair hot word of "yaoming" is a basketball player and is closer to the keyword of "basketball", the time-of-affair hot word of "yaoming" is taken as the target time-of-affair hot word and is added to the keyword column, so that when the user generally searches for "yaoming", the page containing "yaoming" in the keyword column can be matched, and the website popularization of the page is facilitated.
Preferably, the word vector tool is a word2vec word vector tool, the word2vec word vector tool is an efficient tool for representing words as real-valued vectors, processing of text contents is simplified into vector operation in a K-dimensional vector space through training by utilizing a deep learning thought, and similarity on the vector space can be used for representing similarity on text semantics.
In addition, the Word vectors output by the Word2vec Word vector tool can be used to do many natural language processing related tasks, such as clustering, finding synonyms, part-of-speech analysis, and so on. If a Word is taken as a feature, Word2vec Word vectors can map the feature to K-dimensional vector space, and can seek deeper feature representation for text data.
In an embodiment, in step S13, the step of obtaining the current affair hotspot words through the search engine may specifically include:
s131, acquiring hot webpage resources through a search engine, and determining a hot event in the hot webpage resources;
s132, performing event semantic understanding on the hot event, and determining an original event of the hot event;
s133, extracting the keywords of the original event, and screening current affair hot words from the keywords of the original event.
In the embodiment, when a current event hot word is acquired, hot web page resources ranked in the front are captured, web page contents of the hot web page resources are extracted, a hot event in the hot web page resources, namely event contents described by the hot web page resources, is determined according to the web page contents of the hot web page resources, then the hot event is subjected to event semantic understanding to obtain an original event, keywords of the original event are extracted, and the current event hot word is screened from the original event according to the keywords of the original event, so that the current event hot word is accurately obtained. The hot web page resource is an integration of all web page contents displayed on a website home page, and comprises texts, images, audio, video, interactive forms and the like.
Some hot events may not satisfy the subsequent screening condition, some hot events may satisfy the subsequent screening condition, and those hot events that satisfy the subsequent screening condition are not real hot events. Therefore, the hot events need to be further processed to obtain the original events with the highest authenticity after screening and confirmation, and current affair hot words are extracted from the original events.
In an embodiment, in step S131, the step of determining the hot event in the hot web page resource may specifically include:
s1311, obtaining each search result item in the hot webpage resource, and obtaining user behavior data corresponding to the search result items; the user behavior data comprises browsing content of a user in a page corresponding to the search result item and browsing time corresponding to the browsing content;
s1312, determining browsing duration corresponding to each browsing content according to the browsing content and the corresponding browsing time;
s1313, determining the hot event corresponding to the search result item according to the browsing duration corresponding to each browsing content.
In this embodiment, by obtaining the user behavior data corresponding to the search result item, the browsing duration corresponding to each browsing content is determined according to the browsing content in the user behavior data and the corresponding browsing time, for example, when a large number of users are in a 9: 30-11:30, when the user visits the live broadcast page and watches NBA in the current time period, the browsing time of the user is 2 hours.
And then acquiring browsing contents and search result items with the browsing duration arranged in the front, and taking the hot events corresponding to the search result items with the browsing duration arranged in the front as the hot events in the hot webpage resources.
The search result item is obtained by the user through the search keyword in the search process. For example, after the user inputs the search keyword "football" at the client, the received search result page interface returned by the search server has ten search result items on the search result page, which are ordered from top to bottom.
When user behavior data is acquired, behavior data generated when a user browses a page can be acquired by embedding points, such as buttons, text boxes, menus and the like, in elements in the page visited by the user, where the acquired behavior data is usually in a log format, and basic elements for recording the behavior data can adopt a mode of combining tasks, time, places and behaviors, that is, what behavior is generated when the user uses what mode at what time and place, and since the user behavior may include various behavior data, such as user page visiting behavior, user page element clicking behavior, browsing content and browsing time corresponding to the browsing content.
In an embodiment, in step S1313, the step of determining the hotspot event corresponding to the search result item according to the browsing duration corresponding to each browsing content may specifically include:
s13131, counting browsing frequency corresponding to each browsing content;
s13132, performing weighted calculation according to the browsing duration and the browsing frequency corresponding to each browsing content, and determining a heat value corresponding to each browsing content;
s13133, obtaining browsing contents with the heat values arranged at the front N positions to obtain target browsing contents; wherein N is a positive integer greater than 0;
s13134, determining a hot event corresponding to the search result item according to the target browsing content.
In this embodiment, weighted summation calculation may be performed according to the browsing duration and the browsing frequency corresponding to the browsing content to obtain a popularity value of each browsing content, where the popularity value is used to reflect a popularity condition of each browsing content; and then, taking the hot event corresponding to the browsing content with the hot value ranked at the top N as the hot event in the hot webpage resource, thereby accurately determining the hot event. For example, the weight of the browsing duration may be set to 0.8, the browsing frequency may be set to 0.6, and when the browsing duration of the target browsing content browsed by the user is 10 hours and the browsing frequency is 5 times, the corresponding heat value is 0.8 × 10+0.6 × 5 — 11.
In an embodiment, in step S13, after the step of completing the configuration of the page to be configured, the method may further include:
s15, acquiring browsed data of the configured page within a preset time period;
s16, evaluating the configured page according to the browsing data to generate an evaluation result;
s17, adjusting the current affair hot words in the keyword column in the page head area according to the evaluation result.
In this embodiment, after the configured page runs for a period of time, browsing data of the configured page in a preset time period can be acquired in real time, the configured page is evaluated according to the browsing data, an evaluation result is generated, and current-affair hot words added to the keyword column in the head region of the page are adjusted according to the evaluation result.
For example, when the browsing volume of the configured page is low within a preset time period, the current affair hotspot words added to the keyword column in the head area of the page need to be replaced. In addition, the current affair hot words can be acquired in real time, the current affair hot words in the keyword column of the head area of the page are updated regularly, and the searching effect is improved.
Referring to fig. 2, an embodiment of the present application further provides a page configuration apparatus, including:
the acquisition module 11 is configured to acquire a page to be configured of a target website and page content of the page to be configured, and perform word segmentation on the page content of the page to be configured to obtain keywords of the page content;
the configuration module 12 is configured to obtain a page head area of the page to be configured, and configure the keyword in a keyword column of the page head area;
and the completion module 13 is configured to obtain the current affair hot words through a search engine, add the current affair hot words to the keyword column at the head of the page, and complete the configuration of the page to be configured.
In this embodiment, a to-be-configured page of a target website is obtained, where the target website includes a website of a desired configuration page, such as an enterprise website, a video website, or a news website, so that the website after page configuration can be displayed in the front of a web page. Then obtaining page content of a page to be configured, segmenting the page content of the page to be configured to obtain a plurality of words, calculating word frequency of each word, screening out words with the word frequency arranged in the front row as target words, extracting subject information of the page content, calculating matching degree of the subject information and the target words, and obtaining the target word with the highest matching degree with the subject information from the target words as a keyword. And the word frequency is the frequency of each word appearing in the page content.
The page content includes text information, video information, picture information, and the like in the page. When the page content is video information or picture information, identifying content information described by the video information or the picture information, wherein the content information is text information, and segmenting the text information by a word segmentation tool to obtain keywords of the page content.
After the keywords are obtained, the keywords are configured in a keyword column of a page head area. The page head area is used for describing main information of page content, the keyword column is used for storing keywords of the page content, the page content is further highly summarized, when a user searches a page, the keywords of the keyword column matched with search terms are inquired according to the search terms input by the user in a search application, and then the page containing the keywords is searched.
In addition, current affair hot words in a network can be obtained through a search engine or a webpage data crawling tool regularly, and the current affair hot words are added into the keyword columns in the head area of the page, so that the configuration of the page to be configured is completed, when a user needs to search a target website, the keywords of the target website are matched with the current affair hot words, the target website containing the current affair hot words in the head area of the page can be displayed in the front row, the user can acquire the required target website quickly, the popularization of the target websites is facilitated, and the search efficiency is improved.
As described above, it can be understood that each component of the page configuration apparatus provided in this application may implement the function of any one of the above-described page configuration methods, and a detailed structure is not described again.
Referring to fig. 3, a computer device, which may be a server and whose internal structure may be as shown in fig. 3, is also provided in the embodiment of the present application. The computer device includes a processor, a memory, a network interface, and a database connected by a system bus. Wherein the computer designed processor is used to provide computational and control capabilities. The memory of the computer device comprises a storage medium and an internal memory. The storage medium stores an operating system, a computer program, and a database. The memory provides an environment for the operation of the operating system and computer programs in the storage medium. The database of the computer device is used for storing data such as page content, current affair hot words, user behavior data and the like. The network interface of the computer device is used for communicating with an external terminal through a network connection. The computer program is executed by a processor to implement a page configuration method.
The processor executes the page configuration method, and the method comprises the following steps:
acquiring a page to be configured of a target website and page content of the page to be configured, and performing word segmentation on the page content of the page to be configured to obtain keywords of the page content;
acquiring a page head area of the page to be configured, and configuring the keyword in a keyword column of the page head area;
and acquiring current affair hot words through a search engine, and adding the current affair hot words into the keyword column at the head of the page to complete the configuration of the page to be configured.
An embodiment of the present application further provides a computer-readable storage medium, on which a computer program is stored, and the computer program, when executed by a processor, implements a page configuration method, including the steps of:
acquiring a page to be configured of a target website and page content of the page to be configured, and performing word segmentation on the page content of the page to be configured to obtain keywords of the page content;
acquiring a page head area of the page to be configured, and configuring the keyword in a keyword column of the page head area;
and acquiring current affair hot words through a search engine, and adding the current affair hot words into the keyword column at the head of the page to complete the configuration of the page to be configured.
It will be understood by those skilled in the art that all or part of the processes of the methods of the embodiments described above can be implemented by a computer program, which can be stored in a computer-readable storage medium, and can include the processes of the embodiments of the methods described above when the computer program is executed. Any reference to memory, storage, database, or other medium provided herein and used in the examples may include non-volatile and/or volatile memory. Non-volatile memory can include read-only memory (ROM), Programmable ROM (PROM), Electrically Programmable ROM (EPROM), Electrically Erasable Programmable ROM (EEPROM), or flash memory. Volatile memory can include Random Access Memory (RAM) or external cache memory. By way of illustration and not limitation, RAM is available in a variety of forms such as Static RAM (SRAM), Dynamic RAM (DRAM), Synchronous DRAM (SDRAM), double-rate SDRAM (SSRSDRAM), Enhanced SDRAM (ESDRAM), Synch Lnk DRAM (SLDRAM), Rambus Direct RAM (RDRAM), direct bused dynamic RAM (DRDRAM), and bused dynamic RAM (RDRAM).
To sum up, the most beneficial effect of this application lies in:
according to the page configuration method, the page configuration device, the page configuration equipment and the storage medium, after a page to be configured of a target website and page content of the page to be configured are obtained, the page content of the page to be configured is subjected to word segmentation to obtain keywords of the page content; acquiring a page head area of a page to be configured, configuring a keyword in a keyword column of the page head area, and matching the keyword to the page to be configured of the target website when searching the keyword; the current affair hot words are obtained through the search engine and added into the keyword column at the head of the page, and the configuration of the page to be configured is completed, so that the target websites containing the current affair hot words in the head area of the page are displayed in the front row when a user needs to search the target websites by adding the current affair hot words into the page, the user can acquire the required target websites quickly, the popularization of the target websites is facilitated, and the search efficiency is improved.
It should be noted that, in this document, the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, apparatus, article, or method that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, apparatus, article, or method. Without further limitation, an element defined by the phrase "comprising an … …" does not exclude the presence of other like elements in a process, apparatus, article, or method that includes the element.
The above description is only a preferred embodiment of the present application, and not intended to limit the scope of the present application, and all modifications of equivalent structures and equivalent processes, which are made by the contents of the specification and the drawings of the present application, or which are directly or indirectly applied to other related technical fields, are also included in the scope of the present application.

Claims (10)

1. A page configuration method is characterized by comprising the following steps:
acquiring a page to be configured of a target website and page content of the page to be configured, and performing word segmentation on the page content of the page to be configured to obtain keywords of the page content;
acquiring a page head area of the page to be configured, and configuring the keyword in a keyword column of the page head area;
and acquiring current affair hot words through a search engine, and adding the current affair hot words into the keyword column at the head of the page to complete the configuration of the page to be configured.
2. The method according to claim 1, wherein when there are a plurality of the current affair hotspot words, the step of adding the current affair hotspot words to the keyword column at the head of the page comprises:
acquiring the search quantity of the plurality of current affair hotspot words;
setting weights for the current affair hot words in sequence according to the search amount, and screening out the current affair hot words with the weights larger than the preset weights to obtain target current affair hot words; wherein the magnitude of the weight is positively correlated with the search quantity;
and adding the target current affair hot words into the keyword column in the head area of the page.
3. The method according to claim 1, wherein when there are a plurality of the current affair hotspot words, the step of adding the current affair hotspot words to the keyword column at the head of the page comprises:
respectively converting the plurality of current affair hot words into vectors to obtain a plurality of first word vectors;
converting the keywords in the keyword column into vectors to obtain second word vectors;
respectively calculating the similarity of the plurality of first word vectors and the second word vectors, and taking the current affair hot words corresponding to the first word vectors with the similarity greater than the preset similarity as target current affair hot words;
and adding the target current affair hot words into the keyword column in the head area of the page.
4. The method according to claim 1, wherein after the step of completing the configuration of the page to be configured, the method further comprises:
acquiring browsing data browsed by the configured page within a preset time period;
evaluating the configured page according to the browsing data to generate an evaluation result;
and adjusting current affair hot words in the keyword column configured in the page head area according to the evaluation result.
5. The method of claim 1, wherein the step of obtaining the current affair hotspot words by a search engine comprises:
acquiring hot web page resources through a search engine, and determining a hot event in the hot web page resources;
performing event semantic understanding on the hot event, and determining an original event of the hot event;
extracting the keywords of the original event, and screening current affair hot words from the keywords of the original event.
6. The method of claim 5, wherein the step of determining the hotspot event in the hotspot web page resource comprises:
acquiring each search result item in the hot webpage resource, and acquiring user behavior data corresponding to the search result item; the user behavior data comprises browsing content of a user in a page corresponding to the search result item and browsing time corresponding to the browsing content;
determining browsing duration corresponding to each browsing content according to the browsing content and the corresponding browsing time;
and determining the hot event corresponding to the search result item according to the browsing duration corresponding to each browsing content.
7. The method according to claim 6, wherein the step of determining the hotspot event corresponding to the search result item according to the browsing duration corresponding to each browsing content comprises:
counting browsing frequency corresponding to each browsing content;
performing weighted calculation according to the browsing duration and the browsing frequency corresponding to each browsing content to determine a heat value corresponding to each browsing content;
acquiring browsing contents with the hot values arranged at the front N bits to obtain target browsing contents; wherein N is a positive integer greater than 0;
and determining a hot event corresponding to the search result item according to the target browsing content.
8. A page configuring apparatus, comprising:
the acquisition module is used for acquiring a page to be configured of a target website and page content of the page to be configured, and performing word segmentation on the page content of the page to be configured to obtain keywords of the page content;
the configuration module is used for acquiring a page head area of the page to be configured and configuring the keyword in a keyword column of the page head area;
and the completion module is used for acquiring the current affair hot words through a search engine, adding the current affair hot words into the keyword column at the head of the page and completing the configuration of the page to be configured.
9. A computer device, comprising:
a processor;
a memory;
a computer program, wherein the computer program is stored in the memory and configured to be executed by the processor, the computer program being configured to perform the page configuration method according to any of claims 1 to 7.
10. A computer-readable storage medium, characterized in that a computer program is stored on the computer-readable storage medium, which computer program, when being executed by a processor, implements the page configuration method of any one of claims 1 to 7.
CN202110720113.9A 2021-06-28 2021-06-28 Page configuration method, device, equipment and storage medium Pending CN113392355A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110720113.9A CN113392355A (en) 2021-06-28 2021-06-28 Page configuration method, device, equipment and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110720113.9A CN113392355A (en) 2021-06-28 2021-06-28 Page configuration method, device, equipment and storage medium

Publications (1)

Publication Number Publication Date
CN113392355A true CN113392355A (en) 2021-09-14

Family

ID=77624227

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110720113.9A Pending CN113392355A (en) 2021-06-28 2021-06-28 Page configuration method, device, equipment and storage medium

Country Status (1)

Country Link
CN (1) CN113392355A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114327441A (en) * 2021-12-24 2022-04-12 中国联合网络通信集团有限公司 Webpage making processing method, device, equipment and storage medium

Citations (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20110270678A1 (en) * 2010-05-03 2011-11-03 Drummond Mark E System and method for using real-time keywords for targeting advertising in web search and social media
CN103577392A (en) * 2013-11-15 2014-02-12 北京奇虎科技有限公司 Keyword pushing method and device based on current browse webpage
CN105488205A (en) * 2015-12-09 2016-04-13 百度在线网络技术(北京)有限公司 Page generation method and page generation apparatus
CN105630871A (en) * 2015-12-16 2016-06-01 广州神马移动信息科技有限公司 Search result display method and device as well as search system
US20160232162A1 (en) * 2013-09-13 2016-08-11 Longtail Ux Pty Ltd Website traffic optimization
CN106407344A (en) * 2016-09-06 2017-02-15 努比亚技术有限公司 Method and system for generating search engine optimization label
CN109271574A (en) * 2018-08-28 2019-01-25 麒麟合盛网络技术股份有限公司 A kind of hot word recommended method and device
CN109299413A (en) * 2018-09-13 2019-02-01 北京搜狗科技发展有限公司 A kind of data processing method, device and electronic equipment
CN110502687A (en) * 2019-08-22 2019-11-26 山东开创云软件有限公司 A kind of web information flow method and apparatus
CN110717092A (en) * 2018-06-27 2020-01-21 北京京东尚科信息技术有限公司 Method, system, device and storage medium for matching objects for articles
CN111368185A (en) * 2020-02-25 2020-07-03 北京字节跳动网络技术有限公司 Data display method and device, storage medium and electronic equipment
CN112328872A (en) * 2020-10-27 2021-02-05 北京字节跳动网络技术有限公司 Information display method, information search method and device
CN112579941A (en) * 2020-12-17 2021-03-30 京东数字科技控股股份有限公司 Information processing method, device, equipment and storage medium
CN112699314A (en) * 2020-12-25 2021-04-23 百度在线网络技术(北京)有限公司 Hot event determination method and device, electronic equipment and storage medium

Patent Citations (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20110270678A1 (en) * 2010-05-03 2011-11-03 Drummond Mark E System and method for using real-time keywords for targeting advertising in web search and social media
US20160232162A1 (en) * 2013-09-13 2016-08-11 Longtail Ux Pty Ltd Website traffic optimization
CN103577392A (en) * 2013-11-15 2014-02-12 北京奇虎科技有限公司 Keyword pushing method and device based on current browse webpage
CN105488205A (en) * 2015-12-09 2016-04-13 百度在线网络技术(北京)有限公司 Page generation method and page generation apparatus
CN105630871A (en) * 2015-12-16 2016-06-01 广州神马移动信息科技有限公司 Search result display method and device as well as search system
CN106407344A (en) * 2016-09-06 2017-02-15 努比亚技术有限公司 Method and system for generating search engine optimization label
CN110717092A (en) * 2018-06-27 2020-01-21 北京京东尚科信息技术有限公司 Method, system, device and storage medium for matching objects for articles
CN109271574A (en) * 2018-08-28 2019-01-25 麒麟合盛网络技术股份有限公司 A kind of hot word recommended method and device
CN109299413A (en) * 2018-09-13 2019-02-01 北京搜狗科技发展有限公司 A kind of data processing method, device and electronic equipment
CN110502687A (en) * 2019-08-22 2019-11-26 山东开创云软件有限公司 A kind of web information flow method and apparatus
CN111368185A (en) * 2020-02-25 2020-07-03 北京字节跳动网络技术有限公司 Data display method and device, storage medium and electronic equipment
CN112328872A (en) * 2020-10-27 2021-02-05 北京字节跳动网络技术有限公司 Information display method, information search method and device
CN112579941A (en) * 2020-12-17 2021-03-30 京东数字科技控股股份有限公司 Information processing method, device, equipment and storage medium
CN112699314A (en) * 2020-12-25 2021-04-23 百度在线网络技术(北京)有限公司 Hot event determination method and device, electronic equipment and storage medium

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
郑杰: "《SEO搜索引擎优化 原理+方法+实战》", 31 January 2017, 人民邮电出版社 *

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114327441A (en) * 2021-12-24 2022-04-12 中国联合网络通信集团有限公司 Webpage making processing method, device, equipment and storage medium

Similar Documents

Publication Publication Date Title
US9489401B1 (en) Methods and systems for object recognition
US10210179B2 (en) Dynamic feature weighting
JP6423845B2 (en) Method and system for dynamically ranking images to be matched with content in response to a search query
US8301616B2 (en) Search equalizer
JP5436665B2 (en) Classification of simultaneously selected images
US8001152B1 (en) Method and system for semantic affinity search
US7958109B2 (en) Intent driven search result rich abstracts
US8762326B1 (en) Personalized hot topics
EP3513328A1 (en) Method and apparatus for ranking electronic information by similarity association
CN104537065A (en) Search result pushing method and system
CN106445963B (en) Advertisement index keyword automatic generation method and device of APP platform
US9639627B2 (en) Method to search a task-based web interaction
CN112740202A (en) Performing image search using content tags
CN111159563A (en) Method, device and equipment for determining user interest point information and storage medium
JP2017220204A (en) Method and system for matching images with content using whitelists and blacklists in response to search query
CN112597274A (en) Document determination method, device, equipment and storage medium based on BM25 algorithm
Tang et al. Relevant feedback based accurate and intelligent retrieval on capturing user intention for personalized websites
CN106919593B (en) Searching method and device
CN104933099B (en) Method and device for providing target search result for user
CN113392355A (en) Page configuration method, device, equipment and storage medium
CN107665442B (en) Method and device for acquiring target user
KR20140091375A (en) System and method for searching semantic contents using user query expansion
CN111753161B (en) Improved PageRank-based web crawler method and system
KR101663359B1 (en) Method and apparatus for providing updated news contents
CN110825976B (en) Website page detection method and device, electronic equipment and medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20210914

WD01 Invention patent application deemed withdrawn after publication