CN109271580B - Search method, device, client and search engine - Google Patents

Search method, device, client and search engine Download PDF

Info

Publication number
CN109271580B
CN109271580B CN201811392116.9A CN201811392116A CN109271580B CN 109271580 B CN109271580 B CN 109271580B CN 201811392116 A CN201811392116 A CN 201811392116A CN 109271580 B CN109271580 B CN 109271580B
Authority
CN
China
Prior art keywords
page
search
content page
content
quality
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201811392116.9A
Other languages
Chinese (zh)
Other versions
CN109271580A (en
Inventor
刘俊启
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Baidu Online Network Technology Beijing Co Ltd
Original Assignee
Baidu Online Network Technology Beijing Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Baidu Online Network Technology Beijing Co Ltd filed Critical Baidu Online Network Technology Beijing Co Ltd
Priority to CN201811392116.9A priority Critical patent/CN109271580B/en
Publication of CN109271580A publication Critical patent/CN109271580A/en
Application granted granted Critical
Publication of CN109271580B publication Critical patent/CN109271580B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Abstract

The application provides a searching method, a searching device, a client and a searching engine, wherein the method comprises the following steps: the client acquires a search result page obtained by searching according to the search information by the search engine; responding to the operation of page links in the search result page, and accessing the links by the client side to obtain corresponding content pages; the client determines the page quality of the content page according to the correlation degree between the content page and the search information; the client sends the indication information of the page quality to the search engine; the indication information is used for determining the search ranking of the content page by the search engine according to the page quality of the content page. The method can realize real-time monitoring of the page quality of each content page, thereby realizing the provision of high-quality content pages for users and improving the searching and browsing experience of the users.

Description

Search method, device, client and search engine
Technical Field
The present application relates to the field of internet technologies, and in particular, to a search method, an apparatus, a client, and a search engine.
Background
With the continuous development of terminal technology, various terminal devices are becoming popular, and the mobile internet has become a main way for users to obtain information. At present, after obtaining search information input by a user, a search engine matches content pages (which can be understood as web pages) in a database according to keywords in the search information, and then sorts the content pages according to a matching degree.
However, this approach does not guarantee the page quality of the content page for the client on the terminal device.
Disclosure of Invention
The application provides a searching method, a searching device, a client and a search engine, so as to realize real-time monitoring of the page quality of each content page, thereby realizing providing of high-quality content pages for users, improving the searching and browsing experience of the users, and solving the technical problem that the page quality of the content pages cannot be guaranteed because the search engine is matched with the content pages according to keywords in search information in the prior art.
An embodiment of a first aspect of the present application provides a search method, including:
the client acquires a search result page obtained by searching according to the search information by the search engine;
responding to the operation of page links in the search result page, and accessing the links by the client to obtain corresponding content pages;
the client determines the page quality of the content page according to the correlation degree between the content page and the search information;
the client sends the indication information of the page quality to the search engine; the indication information is used for the search engine to determine the search ranking of the content page according to the page quality of the content page.
According to the searching method, the client side obtains a searching result page obtained by searching the searching engine according to the searching information, then, in response to the operation of page links in the searching result page, the client side accesses the links to obtain a corresponding content page, then, according to the correlation degree between the content page and the searching information, the page quality of the content page is determined, and finally, indicating information of the page quality is sent to the searching engine; the indication information is used for determining the search ranking of the content page by the search engine according to the page quality of the content page. In the application, the client has the characteristics of real environment and the like, the page quality of each content page is calculated on the client side, and the page quality of each content page can be monitored in real time, so that the high-quality content pages are provided for users, and the searching and browsing experience of the users is improved.
The embodiment of the second aspect of the present application provides another search method, including:
the search engine searches according to the first search information to obtain each content page;
the search engine inquires the page quality of each content page; the page quality is determined according to the degree of correlation between the corresponding content page and the second search information input by each client side;
the search engine determines the sequence of each content page according to the page quality of each content page;
and generating a search result page corresponding to the first search information according to the sequence of each content page.
According to the searching method, searching is carried out through a search engine according to first searching information to obtain each content page, and then page quality of each content page is inquired; the page quality is determined according to the correlation degree between the corresponding content page and the second search information input by each client, then, the sequence of each content page is determined according to the page quality of each content page, and finally, the search result page corresponding to the first search information is generated according to the sequence of each content page. In the application, the client has the characteristics of real environment and the like, the page quality of the content pages is calculated on the client side, and the page quality of each content page can be monitored in real time, so that the high-quality content pages are provided for users, and the searching and browsing experience of the users is improved.
An embodiment of a third aspect of the present application provides a search apparatus, including:
the acquisition module is used for acquiring a search result page obtained by searching according to the search information by the search engine;
the access module is used for responding to the operation of page links in the search result page and accessing the links to obtain corresponding content pages;
the determining module is used for determining the page quality of the content page according to the correlation degree between the content page and the search information;
the sending module is used for sending the indication information of the page quality to the search engine; the indication information is used for the search engine to determine the search ranking of the content page according to the page quality of the content page.
According to the searching device, the client side obtains the searching result page obtained by searching the searching engine according to the searching information, then, in response to the operation of page links in the searching result page, the client side accesses the links to obtain the corresponding content page, then, the page quality of the content page is determined according to the correlation degree between the content page and the searching information, and finally, the indicating information of the page quality is sent to the searching engine; the indication information is used for determining the search ranking of the content page by the search engine according to the page quality of the content page. In the application, the client has the characteristics of real environment and the like, the page quality of the content pages is calculated on the client side, and the page quality of each content page can be monitored in real time, so that the high-quality content pages are provided for users, and the searching and browsing experience of the users is improved.
An embodiment of a fourth aspect of the present application provides another search apparatus, including:
the searching module is used for searching according to the first searching information to obtain each content page;
the query module is used for querying the page quality of each content page; the page quality is determined according to the degree of correlation between the corresponding content page and the second search information input by each client side;
the sequencing module is used for determining the sequencing of each content page according to the page quality of each content page;
and the generating module is used for generating a search result page corresponding to the first search information according to the sequence of each content page.
According to the searching device, searching is carried out through the search engine according to the first searching information, each content page is obtained, and then page quality of each content page is inquired; the page quality is determined according to the correlation degree between the corresponding content page and the second search information input by each client, then, the sequence of each content page is determined according to the page quality of each content page, and finally, the search result page corresponding to the first search information is generated according to the sequence of each content page. In the application, the client has the characteristics of real environment and the like, the page quality of the content pages is calculated on the client side, and the page quality of each content page can be monitored in real time, so that the high-quality content pages are provided for users, and the searching and browsing experience of the users is improved.
In an embodiment of a fifth aspect of the present application, a client is provided, where the client includes a memory, a processor, and a computer program stored in the memory and executable on the processor, and when the processor executes the computer program, the client implements the search method as set forth in the embodiment of the first aspect of the present application.
An embodiment of a sixth aspect of the present application provides a search engine, including a memory, a processor, and a computer program stored in the memory and executable on the processor, where the processor executes the computer program to implement the search method as set forth in the embodiment of the second aspect of the present application.
An embodiment of a seventh aspect of the present application proposes a non-transitory computer-readable storage medium, on which a computer program is stored, which, when executed by a processor, implements the search method as proposed in the embodiment of the first aspect of the present application, or implements the search method as proposed in the embodiment of the second aspect of the present application.
An eighth aspect of the present application provides a computer program product, wherein when the instructions in the computer program product are executed by a processor, the method for searching as set forth in the first aspect of the present application is performed, or the method for searching as set forth in the second aspect of the present application is performed.
Additional aspects and advantages of the present application will be set forth in part in the description which follows and, in part, will be obvious from the description, or may be learned by practice of the present application.
Drawings
The foregoing and/or additional aspects and advantages of the present application will become apparent and readily appreciated from the following description of the embodiments, taken in conjunction with the accompanying drawings of which:
fig. 1 is a schematic flowchart of a searching method according to an embodiment of the present application;
fig. 2 is a schematic flowchart of a searching method according to a second embodiment of the present application;
FIG. 3 is a first diagram of a content page in an embodiment of the present application;
FIG. 4 is a second illustration of a content page in an embodiment of the present application;
FIG. 5 is a schematic diagram of a search results page in an embodiment of the present application;
fig. 6 is a schematic flowchart of a searching method provided in the third embodiment of the present application;
fig. 7 is a schematic flowchart of a searching method according to a fourth embodiment of the present application;
FIG. 8 is a schematic diagram of interaction between a client and a search engine in an embodiment of the present application;
FIG. 9 is a schematic diagram of a quality assessment system according to an embodiment of the present application;
fig. 10 is a schematic structural diagram of a search apparatus according to a fifth embodiment of the present application;
fig. 11 is a schematic structural diagram of a search apparatus according to a sixth embodiment of the present application;
fig. 12 is a schematic structural diagram of a search apparatus according to a seventh embodiment of the present application.
Detailed Description
Reference will now be made in detail to embodiments of the present application, examples of which are illustrated in the accompanying drawings, wherein like or similar reference numerals refer to the same or similar elements or elements having the same or similar function throughout. The embodiments described below with reference to the drawings are exemplary and intended to be used for explaining the present application and should not be construed as limiting the present application.
With the continuous development of terminal technology, various terminal devices are becoming popular, and the mobile internet has become a main way for users to obtain information. For example, according to the related data, 97.5% of users in china use an Application program (APP), which has 6 hundred million users and responds to searches 60 hundred million times per day on average.
At present, after obtaining search information input by a user, a search engine matches content pages in a database according to keywords in the search information, and then sorts the content pages according to a matching degree.
However, this approach does not guarantee the page quality of the content page for the client on the terminal device.
The application provides a searching method mainly aiming at the technical problem that in the prior art, a searching engine is matched with a content page according to keywords in searching information, and the page quality of the content page cannot be guaranteed.
According to the searching method, the client side obtains a searching result page obtained by searching the searching engine according to the searching information, then, in response to the operation of page links in the searching result page, the client side accesses the links to obtain a corresponding content page, then, according to the correlation degree between the content page and the searching information, the page quality of the content page is determined, and finally, indicating information of the page quality is sent to the searching engine; the indication information is used for determining the search ranking of the content page by the search engine according to the page quality of the content page. In the application, the client has the characteristics of real environment and the like, the page quality of each content page is calculated on the client side, and the page quality of each content page can be monitored in real time, so that the high-quality content pages are provided for users, and the searching and browsing experience of the users is improved.
The following describes a search method, apparatus, client, and search engine according to an embodiment of the present application with reference to the drawings.
Fig. 1 is a schematic flowchart of a searching method according to an embodiment of the present application.
The searching method can be applied to the client, wherein the client is a program which corresponds to the cloud service end and provides local service for the user.
As shown in fig. 1, the search method may include the steps of:
step 101, a client acquires a search result page obtained by a search engine according to search information.
In the embodiment of the present application, the search information is input by a user, the search information may be text information, voice information, picture information, and the like, and the input manner of the search information may include, but is not limited to, touch input (e.g., sliding, clicking, and the like), keyboard input, voice input, and the like. When a user wants to search for content online, the user can search for desired picture, video, audio, text content, etc. based on a client on the terminal device, e.g. a search-class APP. For example, the client may provide a search box in which the user may manually input search information, or the client may provide a voice input control by which the user may input search information by long-pressing the voice input control, or the display interface of the client may have a recommendation area in which the user may click on recommendation content in the recommendation area as search information, or the client may provide a picture upload control in which the user may click on the search information, or the like.
The terminal device may be a Personal Computer (PC), a cloud device or a mobile device, and the mobile device may be a hardware device having various operating systems, touch screens and/or display screens, such as a mobile phone, a tablet Computer, a Personal digital assistant, a wearable device, and an in-vehicle device.
In the embodiment of the application, after the user inputs the search information, the client can send the search information to the search engine, the search engine can collect the search result page related to the search information from the cloud server by using a specific computer program according to a certain strategy and send the search result page to the client, and correspondingly, the client can obtain the search result page obtained by searching according to the search information by the search engine.
In response to the operation on the page link in the search result page, the client accesses the link to obtain the corresponding content page, step 102.
In the embodiment of the application, the content page is a page into which the client accesses the link, and the content page is a page into which the user clicks the link, that is, the content page is a page actually browsed by the user.
In the embodiment of the application, after the client acquires the search result page obtained by the search engine through searching according to the search information, the search result page can be displayed on the display interface, so that the user can trigger the preset page link in the search result page according to the self requirement. The client side can monitor the operation of the page link in the search result page triggered by the user in real time in a monitoring mode, and when the operation of the page link in the search result page triggered by the user is monitored, the link can be accessed to obtain the corresponding content page in response to the operation of the page link in the search result page.
Step 103, the client determines the page quality of the content page according to the correlation degree between the content page and the search information.
In the embodiment of the present application, the content page may include text data, image data, audio data, and/or video data, and thus, the degree of correlation between the content page and the search information includes: text relevance, image relevance, audio relevance, and/or video relevance.
As a possible implementation manner, when the search information is text information or voice information, the client may extract semantic features of the search information and semantic features of the content page, and then determine the degree of correlation between the content page and the search information according to the similarity between the semantic features of the search information and the semantic features of the content page, so as to determine the page quality of the content page according to the degree of correlation.
Certainly, the keywords in the search information can be extracted based on a keyword extraction algorithm, and then the keywords in the search information are matched with the content page to determine the degree of correlation between the content page and the search information, so that the page quality of the content page can be determined according to the degree of correlation. For example, if the search information is "pachyrhizus" and there is no keyword such as "pachyrhizus" or "pachyrhizus" in the content page, the content page is less relevant to the search information and the page quality is poor.
As another possible implementation manner, when the search information is picture information, the client may extract image features of the search information and image features of pictures in the content page, and then determine a degree of correlation between the content page and the search information according to a similarity between the image features of the search information and the image features of the pictures in the content page, so that the page quality of the content page may be determined according to the degree of correlation.
It should be noted that, when the content page includes a video, the video frames may be extracted, the image features of each video frame may be extracted, and then the image features of the search information may be matched with the image features of each video frame to determine the degree of correlation between the content page and the search information, so that the page quality of the content page may be determined according to the degree of correlation.
In the embodiment of the application, the client has the characteristics of real environment and the like, and the page quality of the content page is calculated on the client side, so that the authenticity and the accuracy of a calculation result can be ensured on the basis of monitoring the page quality of the content page in real time.
104, the client sends page quality indication information to a search engine; the indication information is used for determining the search ranking of the content page by the search engine according to the page quality of the content page.
In the embodiment of the application, after the client determines the page quality of the content page, the client can send the indication information of the page quality to the search engine, so that after the search engine receives the indication information, the search ranking of the content page can be determined according to the page quality of the content page.
For example, when the search result page includes two page links, link a and link B, respectively, after the user clicks link a, the client accesses link a to obtain corresponding content page 1, and accesses link B to obtain corresponding content page 2, assuming that the page quality of content page 1 is s1, the page quality of content page 2 is s2, and s2> s1, the search engine may order content 2 before content page 1 after obtaining the indication information sent by the client.
According to the searching method, the client side obtains a searching result page obtained by searching the searching engine according to the searching information, then, in response to the operation of page links in the searching result page, the client side accesses the links to obtain a corresponding content page, then, according to the correlation degree between the content page and the searching information, the page quality of the content page is determined, and finally, indicating information of the page quality is sent to the searching engine; the indication information is used for determining the search ranking of the content page by the search engine according to the page quality of the content page. In the application, the client has the characteristics of real environment and the like, the page quality of each content page is calculated on the client side, and the page quality of each content page can be monitored in real time, so that the high-quality content pages are provided for users, and the searching and browsing experience of the users is improved.
It should be noted that, when the terminal device is a mobile device, because the mobile device is different from a PC, the screen of the terminal device is small, the display space is limited, and the content of the advertisement, the blank area, the floating frame, and the like in the content page seriously affects the content of the content page browsed by the user, so as to improve the search and browse experience of the user, as a possible implementation manner of the embodiment of the present application, the advertisement display area, the floating frame, the blank area, and the like in the content page may be identified, and the invalid display area proportion of the content page may be determined according to the identified advertisement display area, floating frame, and/or blank area, so that the page quality of the content page may be determined according to the correlation degree between the content page and the search information and the invalid display area proportion. The above process is described in detail below with reference to fig. 2.
Fig. 2 is a schematic flowchart of a searching method provided in the second embodiment of the present application.
As shown in fig. 2, the search method may include the steps of:
step 201, the client obtains a search result page obtained by the search engine according to the search information.
In response to the operation on the page link in the search result page, the client accesses the link to obtain the corresponding content page, step 202.
The execution process of steps 201 to 202 may refer to the execution process of steps 101 to 102 in the above embodiments, which is not described herein again.
In step 203, the client extracts the semantic features of the search information and the content page.
In the embodiment of the application, the semantic features of the search information and the semantic features of the content page can be extracted based on a feature extraction technology. Wherein, the content page may include: the semantic features of the title, the text information, the picture information, the link information, the interaction area information, the visible area information and the like in the content page can be respectively extracted, so that the semantic features of the content page can be obtained.
And step 204, determining the correlation degree between the content page and the search information according to the similarity between the semantic features of the search information and the semantic features of the content page.
As an example, referring to fig. 3, fig. 3 is a first schematic view of a content page in an embodiment of the present application. After the user inputs search information and searches, the display interface of the client can display a search result page, and the user can click a page link 1 and a page link 2 in the search result page respectively to obtain a corresponding content page 1 and a corresponding content page 2. At this time, the semantic features of the search information "pachyrhizus hirsuta", the semantic features of the content page 1, and the semantic features of the content page 2 may be extracted, respectively, and since the content page 2 does not have the keyword features such as "pachyrhizus hirsuta" and "pachyrhizus", the content page 2 has a low degree of correlation with the search information and a poor page quality, and the content page 1 has the keyword features of "pachyrhizus hirsuta", so the content page 1 has a high degree of correlation with the search information.
In step 205, the client determines the invalid display area ratio of the content page according to at least one or more combinations of the advertisement display area, the floating window and the blank area in the content page.
In the embodiment of the application, the client can identify the content page by using an image identification algorithm in the related technology, determine the advertisement display area, the floating frame and the blank area in the content page, and then determine the invalid display area proportion of the content page according to the advertisement display area, the floating frame and/or the blank area.
Step 206, weighting the correlation degree and the invalid display area ratio of the content page to determine the page quality of the content page.
In the embodiment of the application, when the invalid display area ratio of the content page is determined, the correlation degree and the invalid display area ratio of the content page may be weighted to determine the page quality of the content page. As a possible implementation manner, a weight value corresponding to the correlation degree and the invalid display area ratio may be preset, and the correlation degree and the invalid display area ratio of the content page may be weighted according to the weight value corresponding to the correlation degree and the invalid display area ratio to determine the page quality of the content page.
For example, when the degree of correlation is low and the invalid display area is large, the page quality of the content page is poor, and when the degree of correlation is high and the invalid display area is small, the page quality of the content page is good.
As an example, referring to fig. 4, fig. 4 is a schematic diagram of a content page in the embodiment of the present application. When the search information is "big master", the content page 1 corresponding to the page link 1 has an advertisement display area and a blank area, and the content page 2 corresponding to the page link 2 does not have an advertisement display area, a floating frame and a blank area, and assuming that the degree of correlation between the content page 1 and the search information is the same as the degree of correlation between the content page 2 and the search information, the page quality of the content page 2 is higher than the page quality of the content page 1.
As a possible implementation manner, in the present application, the area ratio between the picture in the content page and the display area of the terminal device may also be determined, and the correlation degree and the area ratio are weighted to determine the page quality of the content page. For example, when the area ratio is greater than the preset threshold, it indicates that the picture is large, which affects the viewing experience of the user, and therefore, it may be determined that the page quality of the content page is low.
As another possible implementation manner, in the present application, a pixel value of a picture in a content page may also be determined, and the correlation degree and the pixel value are weighted to determine the page quality of the content page. For example, when the pixel value of a picture in a content page is large, such as a large picture or a moving picture, more traffic is consumed by the user, or the network speed of the user may be reduced, and thus, it may be determined that the page quality of the content page is low.
Step 207, the client sends the indication information of the page quality to the search engine; the indication information is used for determining the search ranking of the content page by the search engine according to the page quality of the content page.
The process of step 207 may refer to the process of step 101 in the above embodiments, and is not described herein again.
As an example, as shown in fig. 4, the page quality of the content page 2 is higher than that of the content page 1, and after receiving the indication information, the search engine may order the content page 2 before the content page 1, so that the search result page received by the client may be as shown in fig. 5, where fig. 5 is a schematic diagram of the search result page in the embodiment of the present application.
In the embodiment of the application, the correlation degree and the invalid display area ratio of the content page are weighted to determine the page quality of the content page. Therefore, the page quality of the content page can be determined based on the multi-dimensional information, the accuracy of the calculation result is improved, the high-quality content page is provided for the user, and the searching and browsing experience of the user is improved.
It should be noted that, due to the random mobility of the user, when the user uses the terminal device to search for content, the network status of the terminal device may not be stable, and the mobile network needs to charge a tariff, which severely damages the benefit of the user for a page with more traffic consumption. Therefore, as another possible implementation manner of the embodiment of the present application, the client may count the access links to obtain the data traffic and/or the access speed required by the content page, weight the correlation degree and at least one of the data traffic and the access speed, and determine the page quality of the content page. The above process is described in detail below with reference to fig. 6.
Fig. 6 is a flowchart illustrating a searching method provided in the third embodiment of the present application.
As shown in fig. 6, based on the embodiment shown in fig. 1, step 103 may specifically include the following sub-steps:
in step 301, the client counts the access links to obtain the data traffic and/or access speed required by the content page.
Step 302, weighting the correlation degree and at least one of the data flow and the access speed to determine the page quality of the content page.
It will be appreciated that if the access speed is slow, two situations may occur, the first being: the data traffic required to access the link to get the content page is high, the second case being: network conditions are unstable and the user's search experience is greatly reduced, whichever is lower between data traffic and access speed. Therefore, in the present application, in order to improve the search experience of the user, the correlation degree, and at least one of the data traffic and the access speed may be weighted to determine the page quality of the content page.
For example, content page a and content page B correspond to the same degree of correlation, and if the data traffic of content page a is higher than that of content page B, it is determined that the page quality of content page B is better than that of content page a.
As a possible implementation manner, a weight value corresponding to the correlation degree, the data flow and the access speed may be preset, and the correlation degree, at least one of the data flow and the access speed may be weighted according to the weight value corresponding to the correlation degree, the data flow and the access speed, so as to determine the page quality of the content page.
It is understood that when the user is located in different regions, the content page required by the user may be different. Therefore, in the present application, in order to provide customized search services for different users, the client may determine the page quality of each content page according to the degree of correlation and the geographic location information of the user.
Also, when the network information used by the user is different, the content page required by the user may be different. For example, in a mobile network, in order to reduce traffic consumption, a user may need a content page with low data traffic, and in a wireless network, traffic usage of the current user is not limited, so in the present application, in order to provide customized search services for different users, the client may further determine the page quality of the content page according to the degree of correlation, the geographic location information of the user, and/or the network information.
In practical application, in order to improve the accuracy of page quality calculation, in the present application, the correlation degree, the data traffic, the access speed, the geographic location information of the user, and/or the network information may be weighted to determine the page quality of the content page.
For example, when the correlation degree between pages 1 and 2 is the same, assuming that the network information is a mobile network, if the data flow rate for accessing page 1 is higher than that for accessing page 2, the page quality of page 1 is lower than that of page 2.
In the embodiment of the application, the correlation degree and at least one of the data flow and the access speed are weighted to determine the page quality of the content page. Therefore, the page quality of the content page can be determined based on the multi-dimensional information, the accuracy of the calculation result is improved, the high-quality content page is provided for the user, and the searching and browsing experience of the user is improved.
It should be noted that, in the embodiment of the present application, only the step 205-.
In order to implement the above embodiment, the present application also provides a search method.
Fig. 7 is a flowchart illustrating a searching method according to a fourth embodiment of the present application.
The search method of the embodiment of the application can be applied to a search engine, wherein the search engine is a system for automatically collecting information from the Internet and providing retrieval service for users after organizing and processing the information.
As shown in fig. 7, the search method may include the steps of:
step 401, the search engine searches according to the first search information to obtain each content page.
In the embodiment of the application, the first search information is search information input by a current user when the user uses a client to search. The input manner of the first search information may include, but is not limited to, a touch input (e.g., a slide, a click, etc.), a keyboard input, a voice input, and the like.
In the embodiment of the application, after the current user inputs the first search information, the client can send the first search information to the search engine, and the search engine can collect content pages related to the first search information from the cloud server by using a specific computer program according to a certain strategy.
Step 402, a search engine inquires the page quality of each content page; the page quality is determined according to the degree of correlation between the corresponding content page and the second search information input by each client.
In the embodiment of the application, the second search information is search information input by other clients before the current user searches the first search information.
In the embodiment of the application, before a current user searches first search information, a search engine searches according to second search information input by other clients to obtain a search result page and returns the search result page to the other clients, after the other clients receive the search result page, the other clients respond to the operation of the user on page links in the search result page to access the links to obtain corresponding content pages, determine the page quality of the corresponding content pages according to the correlation degree between the corresponding content pages and the second search information input by the clients, and send indication information of the page quality of the corresponding content pages to the search engine, so that after the indication information is received by the search engine, the page quality of the corresponding content pages can be determined, and the page quality of each content page and the content pages are correspondingly stored. For a specific execution process, reference may be made to the execution processes in fig. 1 to fig. 6 in the foregoing embodiments, which are not described herein again.
By way of example, referring to fig. 8, fig. 8 is a schematic diagram of interaction between a client and a search engine in an embodiment of the present application. After the client performs quality evaluation on the content pages, the page quality of the content pages can be sent to a search engine, so that the search engine can rank the content pages in the subsequent steps based on the page quality of the content pages.
In the embodiment of the application, after the search engine searches to obtain each content page, the data stored in the database can be inquired, and the page quality of each content page is determined.
In step 403, the search engine determines the rank of each content page according to the page quality of each content page.
As a possible implementation manner, for the same content page, the page quality determined by the multiple clients for the content page may be obtained, and the average value of the page quality determined by the multiple clients for the content page is calculated to obtain the page quality of the content page.
As another possible implementation manner, for the same content page, the page quality determined by the multiple clients for the content page may be obtained, and the maximum value or the minimum value of the page quality determined by the multiple clients for the content page is used as the page quality of the content page.
Or, for the same content page, after the page quality determined by the multiple clients for the content page is obtained, the page quality of the content page may also be determined based on other algorithms, which is not limited to this.
In the embodiment of the application, after the search engine determines the page quality of each content page, the content pages may be sequentially ordered according to the level of the page quality, that is, the content page closer to the front is higher in page quality, and the content page closer to the back is lower in page quality.
Step 404, generating a search result page corresponding to the first search information according to the sequence of each content page.
In the embodiment of the application, after the content pages are sorted, the search result page corresponding to the first search information can be generated according to the sorting result.
According to the searching method, searching is carried out through a search engine according to first searching information to obtain each content page, and then page quality of each content page is inquired; the page quality is determined according to the correlation degree between the corresponding content page and the second search information input by each client, then, the sequence of each content page is determined according to the page quality of each content page, and finally, the search result page corresponding to the first search information is generated according to the sequence of each content page. In the application, the client has the characteristics of real environment and the like, the page quality of the content pages is calculated on the client side, and the page quality of each content page can be monitored in real time, so that the high-quality content pages are provided for users, and the searching and browsing experience of the users is improved.
As an example, refer to fig. 9, and fig. 9 is a schematic diagram of a quality evaluation system in an embodiment of the present application. After the client accesses the link to obtain the corresponding content page, the degree of correlation of the content page can be evaluated, and the degree of correlation between each content page and the keywords in the search information (namely, the degree of correlation between the keywords in the search information and each content page in the database) during warehousing is changed into the degree of correlation between the content page actually browsed by the user and the search information, so that the actual search requirement of the user can be determined. In addition, browsing experience dimensionality is increased, namely, the page quality of the content page is comprehensively calculated according to the correlation degree, the data flow, the access speed, the invalid display area ratio, the geographic position information and the network information, the page quality of the content page can be determined based on multi-dimensional information, the accuracy of a calculation result is improved, the high-quality content page is provided for a user, and the searching and browsing experience of the user is improved. In addition, the client has the characteristics of real environment and the like, the page quality of the content pages is calculated on the client side, the page quality of each content page can be monitored in real time, and the real-time performance and the authenticity of the calculation result are guaranteed.
In order to implement the above embodiments, the present application also provides a search apparatus.
Fig. 10 is a schematic structural diagram of a search apparatus according to a fifth embodiment of the present application.
As shown in fig. 10, the search apparatus is applied to a client, and includes: an acquisition module 110, an access module 120, a determination module 130, and a sending module 140.
The obtaining module 110 is configured to obtain a search result page obtained by a search engine performing a search according to the search information.
And the access module 120 is configured to, in response to an operation on the page link in the search result page, access the link to obtain a corresponding content page.
The determining module 130 is configured to determine the page quality of the content page according to the degree of correlation between the content page and the search information.
A sending module 140, configured to send information indicating page quality to a search engine; the indication information is used for determining the search ranking of the content page by the search engine according to the page quality of the content page.
Further, in a possible implementation manner of the embodiment of the present application, referring to fig. 11, on the basis of the embodiment shown in fig. 10, the search apparatus may further include:
the extracting module 150 is configured to extract the semantic features of the content page and the search information before determining the page quality of the content page according to the degree of correlation between the content page and the search information.
And the processing module 160 is configured to determine a degree of correlation between the content page and the search information according to a similarity between the semantic features of the search information and the semantic features of the content page.
As a possible implementation manner, the determining module 130 is specifically configured to: counting the access links to obtain the data flow and/or the access speed required by the content page; the degree of correlation is weighted with at least one of data traffic and access speed to determine the page quality of the content page.
As another possible implementation manner, the determining module 130 is specifically configured to: determining the invalid display area ratio of the content page according to at least one or more combinations of the advertisement display area, the floating window and the blank area in the content page; and weighting the correlation degree and the ratio of the invalid display area of the content page to determine the page quality of the content page.
It should be noted that the explanation of the embodiment of the search method in fig. 1 to fig. 6 is also applicable to the search apparatus of this embodiment, and is not repeated here.
According to the searching device, the client side obtains the searching result page obtained by searching the searching engine according to the searching information, then, in response to the operation of page links in the searching result page, the client side accesses the links to obtain the corresponding content page, then, the page quality of the content page is determined according to the correlation degree between the content page and the searching information, and finally, the indicating information of the page quality is sent to the searching engine; the indication information is used for determining the search ranking of the content page by the search engine according to the page quality of the content page. In the application, the client has the characteristics of real environment and the like, the page quality of the content pages is calculated on the client side, and the page quality of each content page can be monitored in real time, so that the high-quality content pages are provided for users, and the searching and browsing experience of the users is improved.
In order to implement the above embodiments, the present application also provides a search apparatus.
Fig. 12 is a schematic structural diagram of a search apparatus according to a seventh embodiment of the present application.
As shown in fig. 12, the search apparatus is applied to a search engine, and includes: a search module 210, a query module 220, a ranking module 230, and a generation module 240.
The searching module 210 is configured to perform a search according to the first search information to obtain each content page.
A query module 220, configured to query page quality of each content page; the page quality is determined according to the degree of correlation between the corresponding content page and the second search information input by each client.
As a possible implementation manner, the query module 220 is specifically configured to: inquiring the page quality determined by a plurality of clients for the same content page; and calculating the average value of the page quality determined by the plurality of clients for the same content page to obtain the page quality of the corresponding content page.
The sorting module 230 is configured to determine a sorting of each content page according to the page quality of each content page.
The generating module 240 is configured to generate a search result page corresponding to the first search information according to the ranking of each content page.
It should be noted that the explanation of the embodiment of the search method in fig. 7 is also applicable to the search apparatus of this embodiment, and is not repeated here.
According to the searching device, searching is carried out through the search engine according to the first searching information, each content page is obtained, and then page quality of each content page is inquired; the page quality is determined according to the correlation degree between the corresponding content page and the second search information input by each client, then, the sequence of each content page is determined according to the page quality of each content page, and finally, the search result page corresponding to the first search information is generated according to the sequence of each content page. In the application, the client has the characteristics of real environment and the like, the page quality of the content pages is calculated on the client side, and the page quality of each content page can be monitored in real time, so that the high-quality content pages are provided for users, and the searching and browsing experience of the users is improved.
In order to implement the foregoing embodiments, the present application further provides a client, including a memory, a processor, and a computer program stored in the memory and executable on the processor, where when the processor executes the computer program, the client implements the search method as set forth in the foregoing embodiments of fig. 1 to 6 of the present application.
In order to implement the foregoing embodiments, the present application further provides a search engine, which includes a memory, a processor, and a computer program stored in the memory and executable on the processor, and when the processor executes the computer program, the search engine implements the search method as set forth in the foregoing fig. 7 embodiment of the present application.
In order to implement the above embodiments, the present application also proposes a non-transitory computer-readable storage medium, on which a computer program is stored, wherein the program, when executed by a processor, implements a search method as proposed in the foregoing fig. 1 to fig. 6 embodiments of the present application, or implements a search method as proposed in the foregoing fig. 7 embodiments of the present application.
In order to implement the foregoing embodiments, the present application also proposes a computer program product, wherein when the instructions of the computer program product are executed by a processor, the search method proposed by the foregoing fig. 1 to fig. 6 embodiment of the present application is executed, or the search method proposed by the foregoing fig. 7 embodiment of the present application is executed.
In the description herein, reference to the description of the term "one embodiment," "some embodiments," "an example," "a specific example," or "some examples," etc., means that a particular feature, structure, material, or characteristic described in connection with the embodiment or example is included in at least one embodiment or example of the application. In this specification, the schematic representations of the terms used above are not necessarily intended to refer to the same embodiment or example. Furthermore, the particular features, structures, materials, or characteristics described may be combined in any suitable manner in any one or more embodiments or examples. Furthermore, various embodiments or examples and features of different embodiments or examples described in this specification can be combined and combined by one skilled in the art without contradiction.
Furthermore, the terms "first", "second" and "first" are used for descriptive purposes only and are not to be construed as indicating or implying relative importance or implicitly indicating the number of technical features indicated. Thus, a feature defined as "first" or "second" may explicitly or implicitly include at least one such feature. In the description of the present application, "plurality" means at least two, e.g., two, three, etc., unless specifically limited otherwise.
Any process or method descriptions in flow charts or otherwise described herein may be understood as representing modules, segments, or portions of code which include one or more executable instructions for implementing steps of a custom logic function or process, and alternate implementations are included within the scope of the preferred embodiment of the present application in which functions may be executed out of order from that shown or discussed, including substantially concurrently or in reverse order, depending on the functionality involved, as would be understood by those reasonably skilled in the art of the present application.
The logic and/or steps represented in the flowcharts or otherwise described herein, e.g., an ordered listing of executable instructions that can be considered to implement logical functions, can be embodied in any computer-readable medium for use by or in connection with an instruction execution system, apparatus, or device, such as a computer-based system, processor-containing system, or other system that can fetch the instructions from the instruction execution system, apparatus, or device and execute the instructions. For the purposes of this description, a "computer-readable medium" can be any means that can contain, store, communicate, propagate, or transport the program for use by or in connection with the instruction execution system, apparatus, or device. More specific examples (a non-exhaustive list) of the computer-readable medium would include the following: an electrical connection (electronic device) having one or more wires, a portable computer diskette (magnetic device), a Random Access Memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or flash memory), an optical fiber device, and a portable compact disc read-only memory (CDROM). Additionally, the computer-readable medium could even be paper or another suitable medium upon which the program is printed, as the program can be electronically captured, via for instance optical scanning of the paper or other medium, then compiled, interpreted or otherwise processed in a suitable manner if necessary, and then stored in a computer memory.
It should be understood that portions of the present application may be implemented in hardware, software, firmware, or a combination thereof. In the above embodiments, the various steps or methods may be implemented in software or firmware stored in memory and executed by a suitable instruction execution system. If implemented in hardware, as in another embodiment, any one or combination of the following techniques, which are known in the art, may be used: a discrete logic circuit having a logic gate circuit for implementing a logic function on a data signal, an application specific integrated circuit having an appropriate combinational logic gate circuit, a Programmable Gate Array (PGA), a Field Programmable Gate Array (FPGA), or the like.
It will be understood by those skilled in the art that all or part of the steps carried by the method for implementing the above embodiments may be implemented by hardware related to instructions of a program, which may be stored in a computer readable storage medium, and when the program is executed, the program includes one or a combination of the steps of the method embodiments.
In addition, functional units in the embodiments of the present application may be integrated into one processing module, or each unit may exist alone physically, or two or more units are integrated into one module. The integrated module can be realized in a hardware mode, and can also be realized in a software functional module mode. The integrated module, if implemented in the form of a software functional module and sold or used as a stand-alone product, may also be stored in a computer readable storage medium.
The storage medium mentioned above may be a read-only memory, a magnetic or optical disk, etc. Although embodiments of the present application have been shown and described above, it is understood that the above embodiments are exemplary and should not be construed as limiting the present application, and that variations, modifications, substitutions and alterations may be made to the above embodiments by those of ordinary skill in the art within the scope of the present application.

Claims (12)

1. A method of searching, the method comprising the steps of:
the client acquires a search result page obtained by searching according to the search information by the search engine;
responding to the operation of page links in the search result page, and accessing the links by the client to obtain corresponding content pages;
the client determines the page quality of the content page according to the correlation degree between the content page and the search information, wherein if the correlation degree is low, the page quality of the content page is poor;
the client sends the indication information of the page quality to the search engine; the indication information is used for the search engine to determine the search ranking of the content page according to the page quality of the content page.
2. The searching method according to claim 1, wherein before the client determines the page quality of the content page according to the degree of correlation between the content page and the search information, the method further comprises:
the client extracts the semantic features of the search information and the content page;
and determining the correlation degree between the content page and the search information according to the similarity between the semantic features of the search information and the semantic features of the content page.
3. The searching method according to claim 1 or 2, wherein the determining, by the client, the page quality of the content page according to the degree of correlation between the content page and the search information comprises:
the client side counts and accesses the link to obtain the data flow and/or the access speed required by the content page;
and weighting the correlation degree and at least one of the data flow and the access speed to determine the page quality of the content page.
4. The searching method according to claim 1 or 2, wherein the determining, by the client, the page quality of the content page according to the degree of correlation between the content page and the search information comprises:
the client determines the invalid display area ratio of the content page according to at least one or more combinations of the advertisement display area, the floating window and the blank area in the content page;
and weighting the correlation degree and the invalid display area ratio of the content page to determine the page quality of the content page.
5. A method of searching, the method comprising the steps of:
the search engine searches according to the first search information to obtain each content page;
the search engine inquires the page quality of each content page; the page quality is determined according to the degree of correlation between the corresponding content page and the second search information input by each client, wherein if the degree of correlation is low, the page quality of the content page is poor;
the search engine determines the sequence of each content page according to the page quality of each content page;
and generating a search result page corresponding to the first search information according to the sequence of each content page.
6. The method of claim 5, wherein the search engine queries page quality of each content page, comprising:
the search engine queries the page quality determined by a plurality of clients for the same content page;
and calculating the average value of the page quality determined by the plurality of clients for the same content page to obtain the page quality of the corresponding content page.
7. A search apparatus, characterized in that the apparatus comprises:
the acquisition module is used for acquiring a search result page obtained by searching according to the search information by the search engine;
the access module is used for responding to the operation of page links in the search result page and accessing the links to obtain corresponding content pages;
a determining module, configured to determine page quality of the content page according to a degree of correlation between the content page and the search information, where if the degree of correlation is low, the page quality of the content page is poor;
the sending module is used for sending the indication information of the page quality to the search engine; the indication information is used for the search engine to determine the search ranking of the content page according to the page quality of the content page.
8. A search apparatus, characterized in that the apparatus comprises:
the searching module is used for searching according to the first searching information to obtain each content page;
the query module is used for querying the page quality of each content page; the page quality is determined according to the degree of correlation between the corresponding content page and the second search information input by each client, wherein if the degree of correlation is low, the page quality of the content page is poor;
the sequencing module is used for determining the sequencing of each content page according to the page quality of each content page;
and the generating module is used for generating a search result page corresponding to the first search information according to the sequence of each content page.
9. A client comprising a memory, a processor and a computer program stored on the memory and executable on the processor, the processor implementing the search method according to any one of claims 1 to 4 when executing the program.
10. A search engine comprising a memory, a processor and a computer program stored on the memory and executable on the processor, the processor when executing the program implementing the search method of claim 5 or 6.
11. A non-transitory computer-readable storage medium on which a computer program is stored, the program, when executed by a processor, implementing the search method according to any one of claims 1 to 4 or implementing the search method according to claim 5 or 6.
12. A computer program product, characterized in that instructions in the computer program product, when executed by a processor, perform the search method according to any one of claims 1-4, or perform the search method according to claim 5 or 6.
CN201811392116.9A 2018-11-21 2018-11-21 Search method, device, client and search engine Active CN109271580B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201811392116.9A CN109271580B (en) 2018-11-21 2018-11-21 Search method, device, client and search engine

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201811392116.9A CN109271580B (en) 2018-11-21 2018-11-21 Search method, device, client and search engine

Publications (2)

Publication Number Publication Date
CN109271580A CN109271580A (en) 2019-01-25
CN109271580B true CN109271580B (en) 2022-04-01

Family

ID=65190428

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201811392116.9A Active CN109271580B (en) 2018-11-21 2018-11-21 Search method, device, client and search engine

Country Status (1)

Country Link
CN (1) CN109271580B (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110598739B (en) * 2019-08-07 2023-06-23 广州视源电子科技股份有限公司 Image-text conversion method, image-text conversion equipment, intelligent interaction method, intelligent interaction system, intelligent interaction equipment, intelligent interaction client, intelligent interaction server, intelligent interaction machine and intelligent interaction medium

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101105815A (en) * 2007-09-06 2008-01-16 腾讯科技(深圳)有限公司 Internet music file sequencing method, system and search method and search engine
CN101661490A (en) * 2008-08-28 2010-03-03 国际商业机器公司 Search engine, client thereof and method for searching page
US20110313773A1 (en) * 2010-05-25 2011-12-22 Keiichi Yamada Search apparatus, search method, and program
CN104809207A (en) * 2015-04-28 2015-07-29 百度在线网络技术(北京)有限公司 Search method and device
CN106095819A (en) * 2016-05-31 2016-11-09 北京奇艺世纪科技有限公司 A kind of video recommendation method and device
CN107463641A (en) * 2012-01-19 2017-12-12 谷歌公司 System and method for improving the access to search result

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101105815A (en) * 2007-09-06 2008-01-16 腾讯科技(深圳)有限公司 Internet music file sequencing method, system and search method and search engine
CN101661490A (en) * 2008-08-28 2010-03-03 国际商业机器公司 Search engine, client thereof and method for searching page
US20110313773A1 (en) * 2010-05-25 2011-12-22 Keiichi Yamada Search apparatus, search method, and program
CN107463641A (en) * 2012-01-19 2017-12-12 谷歌公司 System and method for improving the access to search result
CN104809207A (en) * 2015-04-28 2015-07-29 百度在线网络技术(北京)有限公司 Search method and device
CN106095819A (en) * 2016-05-31 2016-11-09 北京奇艺世纪科技有限公司 A kind of video recommendation method and device

Also Published As

Publication number Publication date
CN109271580A (en) 2019-01-25

Similar Documents

Publication Publication Date Title
CN108763502B (en) Information recommendation method and system
CN107862553B (en) Advertisement real-time recommendation method and device, terminal equipment and storage medium
JP6967612B2 (en) Information retrieval methods, devices and systems
TWI636416B (en) Method and system for multi-phase ranking for content personalization
KR101700352B1 (en) Generating improved document classification data using historical search results
JP6487201B2 (en) Method and apparatus for generating recommended pages
US10152479B1 (en) Selecting representative media items based on match information
WO2017071251A1 (en) Information pushing method and device
US8332775B2 (en) Adaptive user feedback window
CN106339394B (en) Information processing method and device
CN108846091B (en) Information recommendation method, device and equipment
CN108460082B (en) Recommendation method and device and electronic equipment
CN107766399B (en) Method and system for matching images to content items and machine-readable medium
WO2014143371A1 (en) Method and system for measuring user engagement using scroll dwell time
CN109168047B (en) Video recommendation method and device, server and storage medium
WO2009108576A2 (en) Prioritizing media assets for publication
CN112052387B (en) Content recommendation method, device and computer readable storage medium
CN110717093B (en) Movie recommendation system and method based on Spark
CN109753601B (en) Method and device for determining click rate of recommended information and electronic equipment
CN111125528B (en) Information recommendation method and device
CN112307366B (en) Information display method and device and computer storage medium
CN108319646B (en) Vehicle source searching method and device based on user historical behaviors
CN108304432B (en) Information push processing method, information push processing device and storage medium
CN106682049B (en) Topic display system and topic display method
CN104899306A (en) Information processing method, information display method and information display device

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant