CN110110197B - Information acquisition method and device - Google Patents

Information acquisition method and device Download PDF

Info

Publication number
CN110110197B
CN110110197B CN201711418685.1A CN201711418685A CN110110197B CN 110110197 B CN110110197 B CN 110110197B CN 201711418685 A CN201711418685 A CN 201711418685A CN 110110197 B CN110110197 B CN 110110197B
Authority
CN
China
Prior art keywords
pushed
information
search
test
search result
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201711418685.1A
Other languages
Chinese (zh)
Other versions
CN110110197A (en
Inventor
肖艳文
徐文胜
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Jingdong Century Trading Co Ltd
Beijing Jingdong Shangke Information Technology Co Ltd
Original Assignee
Beijing Jingdong Century Trading Co Ltd
Beijing Jingdong Shangke Information Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Jingdong Century Trading Co Ltd, Beijing Jingdong Shangke Information Technology Co Ltd filed Critical Beijing Jingdong Century Trading Co Ltd
Priority to CN201711418685.1A priority Critical patent/CN110110197B/en
Publication of CN110110197A publication Critical patent/CN110110197A/en
Application granted granted Critical
Publication of CN110110197B publication Critical patent/CN110110197B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9535Search customisation based on user profiles and personalisation

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The embodiment of the application discloses an information acquisition method and device. One embodiment of the method comprises: acquiring a test information set to be pushed and a reference information set to be pushed; in response to receiving a search request from any source of the same preset category, searching information to be pushed corresponding to the search request in a test set to be pushed or a reference set to be pushed based on a preset flow distribution proportion; sending a search result to a source sending the search request, wherein the search result comprises information to be pushed corresponding to the search request; and determining the push pertinence information for testing the information set to be pushed and referring to the information set to be pushed based on the log associated with the search result. According to the embodiment, the information set to be pushed in the test can be accurately obtained, the push pertinence information of the information set to be pushed in the reference can be accurately obtained in the same type of source, and the push pertinence of each piece of information to be pushed in the information set to be pushed in the test can be further improved.

Description

Information acquisition method and device
Technical Field
The embodiment of the application relates to the technical field of computers, in particular to the technical field of internet, and particularly relates to an information acquisition method and device.
Background
Information push, also called "network broadcast", is a technology for reducing information overload by pushing information required by users on the internet through a certain technical standard or protocol. The information push technology can reduce the time spent by the user in searching on the network by actively pushing information to the user.
The existing information push mode is usually to directly load various push information on the web page, and the push information has obvious difference with the content of the web page.
In the prior art, the push pertinence of information push can be purposefully improved by analyzing statistics and the like of the operation of a user on information to be pushed.
Disclosure of Invention
The embodiment of the application provides an information acquisition method and device.
In a first aspect, an embodiment of the present application provides an information obtaining method, where the method includes: the method comprises the steps of obtaining a test information set to be pushed and a reference information set to be pushed, wherein the test information set to be pushed and the reference information set to be pushed have the same information to be pushed, each piece of information to be pushed in the test information set to be pushed has at least one first label set based on a first preset rule, each piece of information to be pushed in the reference information set to be pushed has at least one second label set based on a second preset rule, and the second preset rule is different from the first preset rule; in response to receiving a search request from any source of the same preset category, searching information to be pushed corresponding to the search request in a test set to be pushed or a reference set to be pushed based on a preset flow distribution proportion; sending a search result to a source sending the search request, wherein the search result comprises information to be pushed corresponding to the search request; and determining the push pertinence information for testing the information set to be pushed and referring to the information set to be pushed based on the log associated with the search result.
In some embodiments, obtaining the test information set to be pushed and the reference information set to be pushed includes: randomly selecting a preset number of information to be pushed from a pre-established information database to be pushed to form a reference information set to be pushed; and for each piece of information to be pushed in the reference information set to be pushed, replacing a second label of the piece of information to be pushed with a first label set based on a first preset rule, and generating a test information set to be pushed.
In some embodiments, determining push targeting information for testing a set of information to be pushed and referring to the set to be pushed based on logs associated with search results comprises: determining a first attention degree of each piece of information to be pushed in a search result obtained by searching in a test set to be pushed from a log associated with the search result in a preset time period; determining a second attention degree of each piece of information to be pushed in the search results obtained by searching in the reference set to be pushed from the log associated with the search results in a preset time period; and determining pushing pertinence information of the information set to be pushed to be tested based on the first attention and determining pushing pertinence information of the information set to be pushed to be referred to based on the second attention.
In some embodiments, the log associated with the search result comprises a click data log for recording a click operation of the user on the information to be pushed in the search result; the first attention comprises a first click quantity of each piece of information to be pushed in a search result obtained by searching in a test set to be pushed and determined from a click data log in a preset time period; the second attention comprises a second click rate of each piece of information to be pushed in a search result obtained by searching in the reference set to be pushed and determined from the click data log in the preset time period.
In some embodiments, the log associated with the search result includes an exposure data log, where the exposure data log is used to record presentation information of any information to be pushed in the search result presented on the page; the first attention comprises a first exposure of each piece of information to be pushed in a search result obtained by searching in a test set to be pushed and determined from an exposure data log in a preset time period; the second attention comprises a second exposure of each piece of information to be pushed in a search result obtained by searching in the reference set to be pushed and determined from the exposure data log in the preset time period.
In some embodiments, the method further comprises: establishing a first search index for testing an information set to be pushed based on the first label; establishing a second search index which refers to the information set to be pushed based on the second label; in response to receiving a search request from any source of the same preset category, searching information to be pushed corresponding to the search request in a test set to be pushed or a reference set to be pushed based on a preset flow distribution proportion, including: in response to receiving a search request from any source of the same preset category, searching the first search index or the second search index for information to be pushed corresponding to the search request based on a preset flow distribution proportion.
In a second aspect, an embodiment of the present application further provides an information acquiring apparatus, including: the device comprises an acquisition unit, a processing unit and a processing unit, wherein the acquisition unit is configured to acquire a test information set to be pushed and a reference information set to be pushed, the test information set to be pushed and the reference information set to be pushed have the same information to be pushed, each piece of information to be pushed in the test information set to be pushed has at least one first label set based on a first preset rule, each piece of information to be pushed in the reference information set to be pushed has at least one second label set based on a second preset rule, and the second preset rule is different from the first preset rule; the searching unit is configured to respond to a search request from any source of the same preset category, and search information to be pushed corresponding to the search request in a test set to be pushed or a reference set to be pushed based on a preset flow distribution proportion; the sending unit is configured to send a search result to a source sending the search request, wherein the search result comprises information to be pushed corresponding to the search request; and a determining unit configured to determine, based on the log associated with the search result, push pertinence information that tests the set of information to be pushed and references the set of information to be pushed.
In some embodiments, the obtaining unit is further configured to: randomly selecting a preset number of information to be pushed from a pre-established information database to be pushed to form a reference information set to be pushed; and for each piece of information to be pushed in the reference information set to be pushed, replacing a second label of the piece of information to be pushed with a first label set based on a first preset rule, and generating a test information set to be pushed.
In some embodiments, the determining unit is further configured to: determining a first attention degree of each piece of information to be pushed in a search result obtained by searching in a test set to be pushed from a log associated with the search result in a preset time period; determining a second attention degree of each piece of information to be pushed in the search results obtained by searching in the reference set to be pushed from the log associated with the search results in a preset time period; and determining pushing pertinence information of the information set to be pushed to be tested based on the first attention and determining pushing pertinence information of the information set to be pushed to be referred to based on the second attention.
In some embodiments, the log associated with the search result comprises a click data log for recording a click operation of the user on the information to be pushed in the search result; the first attention comprises a first click quantity of each piece of information to be pushed in a search result obtained by searching in a test set to be pushed and determined from a click data log in a preset time period; the second attention comprises a second click rate of each piece of information to be pushed in a search result obtained by searching in the reference set to be pushed and determined from the click data log in the preset time period.
In some embodiments, the log associated with the search result includes an exposure data log, where the exposure data log is used to record presentation information of any information to be pushed in the search result presented on the page; the first attention comprises a first exposure of each piece of information to be pushed in a search result obtained by searching in a test set to be pushed and determined from an exposure data log in a preset time period; the second attention comprises a second exposure of each piece of information to be pushed in a search result obtained by searching in the reference set to be pushed and determined from the exposure data log in the preset time period.
In some embodiments, the apparatus further comprises an indexing unit; the index unit is configured to establish a first search index for testing the information set to be pushed based on the first tag, and establish a second search index for referring to the information set to be pushed based on the second tag; the search unit is further configured to: in response to receiving a search request from any source of the same preset category, searching the first search index or the second search index for information to be pushed corresponding to the search request based on a preset flow distribution proportion.
In a third aspect, an embodiment of the present application further provides an apparatus, including: one or more processors; the storage device is used for storing one or more programs, and when the one or more programs are executed by one or more processors, the one or more processors realize the information acquisition method.
In a fourth aspect, the present application further provides a computer-readable storage medium, on which a computer program is stored, where the program is executed by a processor to implement the information acquisition method as above.
The information acquisition method and the information acquisition device provided by the embodiment of the application have the advantages that by acquiring the test information set to be pushed and the reference information set to be pushed, and when receiving a search request from any source of the same preset category, based on a preset flow distribution ratio, searching the information to be pushed corresponding to the search request in the test set to be pushed or the reference set to be pushed, then sending a search result containing information to be pushed corresponding to the search request to a preset source, finally determining a set of information to be pushed and push targeted information referring to the set to be pushed based on a log associated with the search result, the push pertinence information of the test information set to be pushed and the reference information set to be pushed in the same category of sources can be accurately obtained, and then be favorable to promoting the propelling movement pertinence of each information to be propelled in the test information set to be propelled.
Drawings
Other features, objects and advantages of the present application will become more apparent upon reading of the following detailed description of non-limiting embodiments thereof, made with reference to the accompanying drawings in which:
FIG. 1 is an exemplary system architecture diagram in which the present application may be applied;
FIG. 2 is a flow diagram of one embodiment of an information acquisition method according to the present application;
FIG. 3 is a flow diagram of yet another embodiment of an information acquisition method according to the present application;
FIG. 4 is a schematic diagram of an application scenario of an information acquisition method according to the present application;
FIG. 5 is a schematic structural diagram of one embodiment of an information acquisition apparatus according to the present application;
FIG. 6 is a schematic block diagram of a computer system suitable for use in implementing a server according to embodiments of the present application.
Detailed Description
The present application will be described in further detail with reference to the following drawings and examples. It is to be understood that the specific embodiments described herein are merely illustrative of the relevant invention and not restrictive of the invention. It should be noted that, for convenience of description, only the portions related to the related invention are shown in the drawings.
It should be noted that the embodiments and features of the embodiments in the present application may be combined with each other without conflict. The present application will be described in detail below with reference to the embodiments with reference to the attached drawings.
Fig. 1 illustrates an exemplary system architecture 100 to which embodiments of the information acquisition method or information acquisition apparatus of the present application may be applied.
As shown in fig. 1, the system architecture 100 may include terminal devices 101, 102, 103, a network 104, and a server 105. The network 104 serves as a medium for providing communication links between the terminal devices 101, 102, 103 and the server 105. Network 104 may include various connection types, such as wired, wireless communication links, or fiber optic cables, to name a few.
The user may use the terminal devices 101, 102, 103 to interact with the server 105 via the network 104 to receive or send messages or the like. The terminal devices 101, 102, 103 may have various communication client applications installed thereon, such as a web browser application, a shopping application, a search application, an instant messaging tool, a mailbox client, social platform software, and the like.
The terminal devices 101, 102, 103 may be various electronic devices having a display screen and supporting web browsing, including but not limited to smart phones, tablet computers, e-book readers, MP3 players (Moving Picture Experts Group Audio Layer III, mpeg compression standard Audio Layer 3), MP4 players (Moving Picture Experts Group Audio Layer IV, mpeg compression standard Audio Layer 4), laptop portable computers, desktop computers, and the like.
The server 105 may be a server that provides various services, such as a search-class server that provides support for search requests issued by the terminal apparatuses 101, 102, 103. The search server may analyze and other processes the received data such as the search request, and feed back a processing result (e.g., a search result corresponding to the search request and including the information to be pushed) to the terminal device.
It should be noted that the information acquiring method provided in the embodiment of the present application is generally executed by the server 105, and accordingly, the information acquiring apparatus is generally disposed in the server 105.
It should be understood that the number of terminal devices, networks, and servers in fig. 1 is merely illustrative. There may be any number of terminal devices, networks, and servers, as desired for implementation.
With continued reference to FIG. 2, a flow 200 of one embodiment of an information acquisition method according to the present application is shown. The information acquisition method comprises the following steps:
step 201, obtaining a test information set to be pushed and a reference information set to be pushed.
The test information set to be pushed and the reference information set to be pushed have the same information to be pushed, each piece of information to be pushed in the test information set to be pushed has at least one first label set based on a first preset rule, each piece of information to be pushed in the reference information set to be pushed has at least one second label set based on a second preset rule, and the second preset rule is different from the first preset rule.
In this embodiment, an electronic device (for example, a server shown in fig. 1) on which the information obtaining method operates may obtain the test information set to be pushed and the reference information set to be pushed from a database stored thereon or communicatively connected thereto through a wired connection manner or a wireless connection manner. It should be noted that the wireless connection means may include, but is not limited to, a 3G/4G connection, a WiFi connection, a bluetooth connection, a WiMAX connection, a Zigbee connection, a uwb (ultra wideband) connection, and other wireless connection means now known or developed in the future.
For example, in some application scenarios, the information set to be pushed is tested and the information to be pushed contained in the information set to be pushed is referred to as the article detail information of some clothing articles. In these application scenarios, each piece of information to be pushed in the set to be pushed is tested to have a label set according to its applicable object, for example, infant clothing, teenager clothing, middle-aged and elderly clothing, and the like. And each piece of information to be pushed in the set of reference information to be pushed has a tag set according to its applicable weather, for example, winter clothes, summer clothes, spring and autumn clothes, and the like. Then, the first preset rule for testing each piece of information to be pushed in the set of information to be pushed may be understood as a rule of "classify according to applicable objects", and accordingly, the infant garment, the teenager garment, the middle-aged and elderly garments, and the like may be understood as each first label set based on the rule. Similarly, the second preset rule for each piece of information to be pushed in the set of information to be pushed may be understood as a rule of "sort by applicable weather", and accordingly, winter clothes, summer clothes, spring and autumn clothes, etc. may be understood as each second tag set based on the rule.
Step 202, in response to receiving a search request from any source of the same preset category, searching for information to be pushed corresponding to the search request in the test set to be pushed or the reference set to be pushed based on a preset traffic distribution ratio.
Here, the same predetermined source may be understood as a source belonging to any one of the categories after being classified according to a predetermined rule.
For example, in some application scenarios, the electronic device that issues the search request may be classified into a plurality of categories of sources according to a number segment where an IP Address (Internet Protocol Address) of the electronic device that issues the search request is located.
Here, the preset flow rate distribution ratio may be, for example, a specific value set in advance. In some application scenarios, the preset flow distribution ratio may be 50%, for example. In these alternative implementations, when search requests from the same preset source are received, a search may be alternately performed in the test to-be-pushed set or the reference to-be-pushed set, so that within a period of time, 50% of the search requests from the preset source of the category are searched in the test to-be-pushed set and the other 50% are searched in the reference to-be-pushed set.
Step 203, sending a search result to a source sending the search request, where the search result includes information to be pushed corresponding to the search request.
In some application scenarios, the search results may be presented in a list on a search results page. In these application scenarios, each item in the list may be a URL (Uniform resource Locator) corresponding to one of the search results. In these application scenarios, the information to be pushed may be presented on the page as some item in the list, e.g. at the top of the page. Alternatively, the information to be pushed may also be presented in a form of other search results distinguished from the search results page, for example, the information to be pushed may be presented on the right side of the page, and the other search results may be presented on the left side of the page in a list form.
And step 204, determining the push pertinence information of the test information set to be pushed and the reference information set to be pushed based on the log associated with the search result.
Here, the log associated with the search result may be any log capable of characterizing a record of a certain item of the search result being operated by the user or being presented on the terminal device used by the user.
In this step, any predetermined manner may be used to determine the push pertinence information of the test information set to be pushed and the reference information set to be pushed through the log associated with the search result.
For example, in some application scenarios, the push pertinence information indicating the push pertinence may be based on a ratio of the number of clicks of the user on the information to be pushed and the total number of clicks of other search result items except the information to be pushed in the search results within a certain period of time. Specifically, assuming that the sum of the number of clicks of the search result items in the search result by the users from the same preset category of sources is a, and meanwhile, the number of clicks of the users on the information to be pushed in the test information to be pushed set is b, and the number of clicks on the information to be pushed in the reference information to be pushed set is c, the push pertinence of the test information to be pushed set can be represented by the value of b/a, and the push pertinence of the reference information to be pushed set can be represented by the value of c/a.
The information acquiring method of the embodiment acquires the test information set to be pushed and the reference information set to be pushed, and when receiving a search request from any source of the same preset category, based on a preset flow distribution ratio, searching the information to be pushed corresponding to the search request in the test set to be pushed or the reference set to be pushed, then sending a search result containing information to be pushed corresponding to the search request to a preset source, finally determining a set of information to be pushed and push targeted information referring to the set to be pushed based on a log associated with the search result, the push pertinence information of the test information set to be pushed and the reference information set to be pushed in the same category of sources can be accurately obtained, and then be favorable to promoting the propelling movement pertinence of each information to be propelled in the test information set to be propelled.
With further reference to fig. 3, a flow 300 of yet another embodiment of an information acquisition method is shown. The process 300 of the information obtaining method includes the following steps:
301, acquiring a test information set to be pushed and a reference information set to be pushed, wherein the test information set to be pushed and the reference information set to be pushed have the same information to be pushed, each piece of information to be pushed in the test information set to be pushed has at least one first tag set based on a first preset rule, each piece of information to be pushed in the reference information set to be pushed has at least one second tag set based on a second preset rule, and the second preset rule is different from the first preset rule;
step 302, in response to receiving a search request from any source of the same preset category, searching information to be pushed corresponding to the search request in a test set to be pushed or a reference set to be pushed based on a preset flow distribution proportion;
step 303, sending a search result to a source sending the search request, where the search result includes information to be pushed corresponding to the search request.
The execution manner of the steps 301 to 303 is similar to that of the steps 201 to 203 in the embodiment shown in fig. 2, and is not described again here.
Step 304, determining a first attention degree of each piece of information to be pushed in the search results obtained by searching in the test set to be pushed from the log associated with the search results in a preset time period.
Here, the first attention degree of each piece of information to be pushed in the search result obtained by searching in the test set to be pushed may be an attention degree of the user to any piece of information to be pushed in the test set to be pushed. The degree of interest may be a specific quantitative representation (e.g., may be represented by a specific numerical value analyzed from a log associated with the search results). Alternatively, the degree of interest may be a more abstract qualitative representation (e.g., represented by a certain degree of interest level analyzed from a log associated with the search results).
Step 305, determining a second attention of each piece of information to be pushed in the search results obtained by searching in the reference set to be pushed from the log associated with the search results in a preset time period.
In this step, the determination method of the second attention of each piece of information to be pushed in the search result obtained by referring to the set to be pushed may be similar to the determination method of the first attention in step 304.
Step 306, determining to test the push pertinence information of the information set to be pushed based on the first attention and determining to refer to the push pertinence information of the information set to be pushed based on the second attention.
As can be seen from fig. 3, compared with the embodiment corresponding to fig. 2, the flow 300 of the information obtaining method in this embodiment highlights a flow of determining the push pertinence information of the test information set to be pushed and the push pertinence information of the reference information set to be pushed, which is beneficial to improving the accuracy of determining the push pertinence indicated by the push pertinence information of the test information set to be pushed and the push pertinence indicated by the push pertinence information of the reference information set to be pushed.
In some optional implementations of the embodiment, the log associated with the search result may include a click data log for recording a click operation of the user on the information to be pushed in the search result.
In these optional implementation manners, the first attention may include a first click amount of each piece of information to be pushed in search results obtained by searching in the test set to be pushed, which is determined from click data logs within a preset time period. Correspondingly, the second attention may include a second click rate of each piece of information to be pushed in the search result obtained by searching in the reference set to be pushed, which is determined from the click data log within the preset time period.
In other optional implementation manners of this embodiment, the log associated with the search result may further include an exposure data log, and the exposure data log may be used to record presentation information of any information to be pushed in the search result, which is presented on the page.
In these alternative implementations, the first attention may include a first exposure amount of each piece of information to be pushed in search results searched in the test set to be pushed, which is determined from exposure data logs within a preset time period. Accordingly, the second attention may include a second exposure amount of each piece of information to be pushed in the search result searched in the reference set to be pushed, which is determined from the exposure data log within the preset time period.
It is to be understood that, within the preset time period, the push pertinence of the set to be pushed and the push pertinence of the reference set to be pushed may be determined based on the first click quantity and the second click quantity, respectively. Alternatively, the push pertinence of the set to be pushed and the push pertinence of the reference set to be pushed may be determined based on the first exposure and the second exposure respectively within a preset time period. Or, the push pertinence of the set to be pushed and the push pertinence of the reference set to be pushed can be determined in a preset time period based on the first click quantity, the first exposure and the second click quantity and the second exposure respectively.
Alternatively, when the push pertinence of the set to be pushed and the push pertinence of the reference set to be pushed are tested within the preset period determined based on the first click amount, the first exposure amount, the second click amount, and the second exposure amount, respectively, for example, respective weights may be set for the click amount (including the first click amount and the second click amount) and the exposure amount (including the first exposure amount and the second exposure amount), the push pertinence of the test set to be pushed is determined by weighted summation of the first click amount and the first exposure amount, and the push pertinence of the reference set to be pushed is determined by weighted summation of the second click amount and the second exposure amount.
The influence of the first preset rule for setting each first label and the second preset rule for setting each second label on the push pertinence can be reflected through the determined push pertinence of the test to-be-pushed set and the determined push pertinence of the reference to-be-pushed set, and the push pertinence of the test to-be-pushed set can be improved by adjusting the first preset rule.
In some optional implementation manners of the information obtaining method according to the embodiments of the present application described above, the obtaining of the test information set to be pushed and the reference information set in steps 201 and 301 may further include:
randomly selecting a preset number of information to be pushed from a pre-established information database to be pushed to form a reference information set to be pushed.
And for each piece of information to be pushed in the reference information set to be pushed, replacing a second label of the piece of information to be pushed with a first label set based on a first preset rule, and generating a test information set to be pushed.
In these alternative implementations, the reference set of information to be pushed may be randomly selected from an existing database of information to be pushed. And the information to be pushed in the information database to be pushed has at least one second label set based on a second preset rule.
In addition, in some optional implementation manners of the above-described embodiments of the present application, the information obtaining method may further include:
establishing a first search index for testing an information set to be pushed based on the first label; and establishing a second search index referring to the information set to be pushed based on the second label.
In this way, in step 202 and step 302, when a search request from any source of the same preset category is received, a search may be performed in the first search index or the second search index, so as to improve the search efficiency of searching for information to be pushed corresponding to the search request from the test set to be pushed or the reference set to be pushed.
Fig. 4 is a schematic view of an application scenario of the information acquisition method of the present application.
In this application scenario 400, the traffic forwarding server 400 receives a search request from a user and divides the search request belonging to the same class of sources into two parts with a distribution ratio of 50%. The retrieval sorting server 402 retrieves 50% of the search requests in the test information to be pushed set 403, and returns the information to be pushed corresponding to the search requests to the user. Similarly, the retrieval ranking server 402 retrieves another 50% of the search requests in the reference information to be pushed set 404, and returns the information to be pushed corresponding to the search requests to the user. Here, the test information to be pushed set 403 and the reference information to be pushed set 404 both comprise the same information to be pushed randomly chosen from the information to be pushed database 405. And each piece of information to be pushed in the information set to be pushed is tested to have at least one first label set based on a first preset rule, each piece of information to be pushed in the information set to be pushed is referred to have at least one second label set based on a second preset rule, and the second preset rule is different from the first preset rule.
Next, the traffic forwarding server 401 and a user click behavior server (not shown in the figure) may record a click log and an exposure log of each to-be-pushed message, respectively.
Further, by analyzing and processing the click log and the exposure log, point estimation and/or interval estimation for testing the click rate and the conversion rate of each piece of information to be pushed in the information set to be pushed, and point estimation and/or interval estimation for referring to the click rate and the conversion rate of each piece of information to be pushed in the information set to be pushed can be obtained. Here, the click rate, the point estimation of the conversion rate, and/or the interval estimation of each piece of information to be pushed in the information set to be pushed may be used to represent the push pertinence of each piece of information to be pushed in the information set to be pushed, and accordingly, the point estimation and/or the interval estimation referring to the click rate, the conversion rate, and/or the interval estimation of each piece of information to be pushed in the information set to be pushed may be used to represent the push pertinence referring to each piece of information to be pushed in the information set to be pushed.
With further reference to fig. 5, as an implementation of the method shown in the above-mentioned figures, the present application provides an embodiment of an information obtaining apparatus, which corresponds to the embodiment of the method shown in fig. 2, and which can be applied to various electronic devices.
As shown in fig. 5, the information acquisition apparatus 500 of the present embodiment includes an acquisition unit 510, a search unit 520, a transmission unit 530, and a determination unit 540.
The obtaining unit 510 may be configured to obtain a test information set to be pushed and a reference information set to be pushed, where the test information set to be pushed and the reference information set to be pushed have the same information to be pushed, each information to be pushed in the test information set to be pushed has at least one first tag set based on a first preset rule, each information to be pushed in the reference information set to be pushed has at least one second tag set based on a second preset rule, and the second preset rule is different from the first preset rule.
The searching unit 520 may be configured to, in response to receiving a search request from any source of the same preset category, search for information to be pushed corresponding to the search request in the test set to be pushed or the reference set to be pushed based on a preset traffic distribution ratio.
The sending unit 530 may be configured to send a search result to a source sending the search request, where the search result includes information to be pushed corresponding to the search request.
The determining unit 540 may be configured to determine, based on the log associated with the search result, push pertinence information that tests the set of information to be pushed and references the set of information to be pushed.
In some optional implementations, the obtaining unit 510 may be further configured to: randomly selecting a preset number of information to be pushed from a pre-established information database to be pushed to form a reference information set to be pushed; and for each piece of information to be pushed in the reference information set to be pushed, replacing a second label of the piece of information to be pushed with a first label set based on a first preset rule, and generating a test information set to be pushed.
In some optional implementations, the determining unit 540 may be further configured to: determining a first attention degree of each piece of information to be pushed in a search result obtained by searching in a test set to be pushed from a log associated with the search result in a preset time period; determining a second attention degree of each piece of information to be pushed in the search results obtained by searching in the reference set to be pushed from the log associated with the search results in a preset time period; and determining pushing pertinence information of the information set to be pushed to be tested based on the first attention and determining pushing pertinence information of the information set to be pushed to be referred to based on the second attention.
In some optional implementations, the log associated with the search result may include a click data log for recording a click operation of the user on the information to be pushed in the search result; the first attention comprises a first click quantity of each piece of information to be pushed in a search result obtained by searching in a test set to be pushed and determined from a click data log in a preset time period; the second attention comprises a second click rate of each piece of information to be pushed in a search result obtained by searching in the reference set to be pushed and determined from the click data log in the preset time period.
In some optional implementations, the log associated with the search result may include an exposure data log, where the exposure data log is used to record presentation information of any information to be pushed in the search result, which is presented on the page; the first attention comprises a first exposure of each piece of information to be pushed in a search result obtained by searching in a test set to be pushed and determined from an exposure data log in a preset time period; the second attention comprises a second exposure of each piece of information to be pushed in a search result obtained by searching in the reference set to be pushed and determined from the exposure data log in the preset time period.
In some optional implementations, the information obtaining apparatus of this embodiment may further include an indexing unit (not shown in the figure).
In these alternative implementations, the indexing unit may be configured to establish a first search index that tests the information set to be pushed based on the first tag, and establish a second search index that refers to the information set to be pushed based on the second tag. In these alternative implementations, the search unit 520 may be further configured to: in response to receiving a search request from any source of the same preset category, searching the first search index or the second search index for information to be pushed corresponding to the search request based on a preset flow distribution proportion.
Referring now to FIG. 6, shown is a block diagram of a computer system 600 suitable for use in implementing a server according to embodiments of the present application. The terminal device/server shown in fig. 6 is only an example, and should not bring any limitation to the functions and the scope of use of the embodiments of the present application.
As shown in fig. 6, the computer system 600 includes a Central Processing Unit (CPU)601 that can perform various appropriate actions and processes according to a program stored in a Read Only Memory (ROM)602 or a program loaded from a storage section 608 into a Random Access Memory (RAM) 603. In the RAM 603, various programs and data necessary for the operation of the system 600 are also stored. The CPU 601, ROM 602, and RAM 603 are connected to each other via a bus 604. An input/output (I/O) interface 605 is also connected to bus 604.
The following components are connected to the I/O interface 605: an input portion 606 including a keyboard, a mouse, and the like; an output portion 607 including a display such as a Cathode Ray Tube (CRT), a Liquid Crystal Display (LCD), and the like, and a speaker; a storage section 608 including a hard disk and the like; and a communication section 609 including a network interface card such as a LAN card, a modem, or the like. The communication section 609 performs communication processing via a network such as the internet. The driver 610 is also connected to the I/O interface 605 as needed. A removable medium 611 such as a magnetic disk, an optical disk, a magneto-optical disk, a semiconductor memory, or the like is mounted on the drive 610 as necessary, so that a computer program read out therefrom is mounted in the storage section 608 as necessary.
In particular, according to an embodiment of the present disclosure, the processes described above with reference to the flowcharts may be implemented as computer software programs. For example, embodiments of the present disclosure include a computer program product comprising a computer program embodied on a computer readable medium, the computer program comprising program code for performing the method illustrated in the flow chart. In such an embodiment, the computer program may be downloaded and installed from a network through the communication section 609, and/or installed from the removable medium 611. The computer program performs the above-described functions defined in the method of the present application when executed by a Central Processing Unit (CPU) 601. It should be noted that the computer readable medium described herein can be a computer readable signal medium or a computer readable storage medium or any combination of the two. A computer readable storage medium may be, for example, but not limited to, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or any combination of the foregoing. More specific examples of the computer readable storage medium may include, but are not limited to: an electrical connection having one or more wires, a portable computer diskette, a hard disk, a Random Access Memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or flash memory), an optical fiber, a portable compact disc read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the foregoing. In the present application, a computer readable storage medium may be any tangible medium that can contain, or store a program for use by or in connection with an instruction execution system, apparatus, or device. In this application, however, a computer readable signal medium may include a propagated data signal with computer readable program code embodied therein, for example, in baseband or as part of a carrier wave. Such a propagated data signal may take many forms, including, but not limited to, electro-magnetic, optical, or any suitable combination thereof. A computer readable signal medium may also be any computer readable medium that is not a computer readable storage medium and that can communicate, propagate, or transport a program for use by or in connection with an instruction execution system, apparatus, or device. Program code embodied on a computer readable medium may be transmitted using any appropriate medium, including but not limited to: wireless, wire, fiber optic cable, RF, etc., or any suitable combination of the foregoing.
Computer program code for carrying out operations for aspects of the present application may be written in any combination of one or more programming languages, including an object oriented programming language such as Java, Smalltalk, C + + or the like and conventional procedural programming languages, such as the "C" programming language or similar programming languages. The program code may execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer or entirely on the remote computer or server. In the case of a remote computer, the remote computer may be connected to the user's computer through any type of network, including a Local Area Network (LAN) or a Wide Area Network (WAN), or the connection may be made to an external computer (for example, through the Internet using an Internet service provider).
The flowchart and block diagrams in the figures illustrate the architecture, functionality, and operation of possible implementations of systems, methods and computer program products according to various embodiments of the present application. In this regard, each block in the flowchart or block diagrams may represent a module, segment, or portion of code, which comprises one or more executable instructions for implementing the specified logical function(s). It should also be noted that, in some alternative implementations, the functions noted in the block may occur out of the order noted in the figures. For example, two blocks shown in succession may, in fact, be executed substantially concurrently, or the blocks may sometimes be executed in the reverse order, depending upon the functionality involved. It will also be noted that each block of the block diagrams and/or flowchart illustration, and combinations of blocks in the block diagrams and/or flowchart illustration, can be implemented by special purpose hardware-based systems which perform the specified functions or acts, or combinations of special purpose hardware and computer instructions.
The units described in the embodiments of the present application may be implemented by software or hardware. The described units may also be provided in a processor, and may be described as: a processor includes an acquisition unit, a search unit, a transmission unit, and a determination unit. The names of these units do not in some cases form a limitation on the units themselves, for example, the acquiring unit may also be described as a "unit that acquires a test set of information to be pushed and a reference set of information to be pushed".
As another aspect, the present application also provides a computer-readable medium, which may be contained in the apparatus described in the above embodiments; or may be present separately and not assembled into the device. The computer readable medium carries one or more programs which, when executed by the apparatus, cause the apparatus to: the method comprises the steps of obtaining a test information set to be pushed and a reference information set to be pushed, wherein the test information set to be pushed and the reference information set to be pushed have the same information to be pushed, each piece of information to be pushed in the test information set to be pushed has at least one first label set based on a first preset rule, each piece of information to be pushed in the reference information set to be pushed has at least one second label set based on a second preset rule, and the second preset rule is different from the first preset rule; in response to receiving a search request from any source of the same preset category, searching information to be pushed corresponding to the search request in a test set to be pushed or a reference set to be pushed based on a preset flow distribution proportion; sending a search result to a source sending the search request, wherein the search result comprises information to be pushed corresponding to the search request; and determining the push pertinence information for testing the information set to be pushed and referring to the information set to be pushed based on the log associated with the search result.
The above description is only a preferred embodiment of the application and is illustrative of the principles of the technology employed. It will be appreciated by those skilled in the art that the scope of the invention herein disclosed is not limited to the particular combination of features described above, but also encompasses other arrangements formed by any combination of the above features or their equivalents without departing from the spirit of the invention. For example, the above features may be replaced with (but not limited to) features having similar functions disclosed in the present application.

Claims (14)

1. An information acquisition method, comprising:
acquiring a test information set to be pushed and a reference information set to be pushed, wherein the test information set to be pushed and the reference information set to be pushed have the same information to be pushed, each piece of information to be pushed in the test information set to be pushed has at least one first label set based on a first preset rule, each piece of information to be pushed in the reference information set to be pushed has at least one second label set based on a second preset rule, and the second preset rule is different from the first preset rule;
in response to receiving a search request from any source of the same preset category, searching information to be pushed corresponding to the search request in the test set to be pushed or the reference set to be pushed based on a preset flow distribution proportion;
sending a search result to the source sending the search request, wherein the search result comprises information to be pushed corresponding to the search request; and
and determining pushing targeted information of the test information to be pushed set and the reference information to be pushed set based on a log associated with the search result, wherein the pushing targeted information is determined by the ratio of the number of clicks of the user on the information to be pushed in the search result to the total number of clicks of other search result items except the information to be pushed by the user in a certain period of time.
2. The method according to claim 1, wherein the obtaining a test information set to be pushed and a reference information set to be pushed comprises:
randomly selecting a preset number of information to be pushed from a pre-established information database to be pushed to form a reference information set to be pushed;
and for each piece of information to be pushed in the reference information set to be pushed, replacing a second label of the piece of information to be pushed with a first label set based on the first preset rule, and generating the test information set to be pushed.
3. The method of claim 1, wherein the determining push targeting information for the test set of information to be pushed and the reference set of information to be pushed based on logs associated with the search results comprises:
determining a first attention degree of each piece of information to be pushed in the search results obtained by searching in the test set to be pushed from the log associated with the search results in a preset time period;
determining a second degree of attention of each piece of information to be pushed in the search results obtained by searching in the reference set to be pushed from the log associated with the search results in a preset time period; and
determining push pertinence information of the test information set to be pushed based on the first attention and determining push pertinence information of the reference information set to be pushed based on the second attention.
4. The method of claim 3, wherein the log associated with the search result comprises a click data log for recording a click operation of a user on information to be pushed in the search result;
the first attention comprises a first click quantity of each piece of information to be pushed in a search result obtained by searching in the test set to be pushed and determined from a click data log in a preset time period;
the second attention comprises a second click rate of each piece of information to be pushed in a search result obtained by searching in the reference set to be pushed and determined from a click data log in a preset time period.
5. The method according to claim 3 or 4, wherein the log associated with the search result comprises an exposure data log, and the exposure data log is used for recording presentation information of any information to be pushed in the search result presented on a page;
the first attention comprises a first exposure amount of each piece of information to be pushed in a search result obtained by searching in the test set to be pushed and determined from an exposure data log in a preset time period;
the second attention comprises a second exposure amount of each piece of information to be pushed in a search result obtained by searching in the reference set to be pushed and determined from an exposure data log in a preset time period.
6. The method of claim 1, wherein the method further comprises:
establishing a first search index for testing the information set to be pushed based on the first label; and
establishing a second search index of the reference information set to be pushed based on the second label;
the searching for the information to be pushed corresponding to the search request in the test set to be pushed or the reference set to be pushed based on a preset flow distribution proportion in response to receiving the search request from any source of the same preset category includes:
in response to receiving a search request from any source of the same preset category, searching the first search index or the second search index for information to be pushed corresponding to the search request based on a preset traffic distribution ratio.
7. An information acquisition apparatus comprising:
the device comprises an acquisition unit, a processing unit and a processing unit, wherein the acquisition unit is configured to acquire a test information set to be pushed and a reference information set to be pushed, the test information set to be pushed and the reference information set to be pushed have the same information to be pushed, each piece of information to be pushed in the test information set to be pushed has at least one first label set based on a first preset rule, each piece of information to be pushed in the reference information set to be pushed has at least one second label set based on a second preset rule, and the second preset rule is different from the first preset rule;
the searching unit is configured to respond to a search request from any source of the same preset category, and search information to be pushed corresponding to the search request in the test set to be pushed or the reference set to be pushed based on a preset flow distribution proportion;
the sending unit is configured to send a search result to the source sending the search request, wherein the search result comprises information to be pushed corresponding to the search request; and
the determining unit is configured to determine pushing pertinence information of the test information to be pushed set and the reference information to be pushed set based on a log associated with the search result, wherein the pushing pertinence information is determined by a ratio of the number of clicks of the user on the information to be pushed in the search result to the total number of clicks of the user on other search result items except the information to be pushed in a certain period of time.
8. The apparatus of claim 7, wherein the obtaining unit is further configured to:
randomly selecting a preset number of information to be pushed from a pre-established information database to be pushed to form a reference information set to be pushed;
and for each piece of information to be pushed in the reference information set to be pushed, replacing a second label of the piece of information to be pushed with a first label set based on the first preset rule, and generating the test information set to be pushed.
9. The apparatus of claim 7, wherein the determining unit is further configured to:
determining a first attention degree of each piece of information to be pushed in the search results obtained by searching in the test set to be pushed from the log associated with the search results in a preset time period;
determining a second degree of attention of each piece of information to be pushed in the search results obtained by searching in the reference set to be pushed from the log associated with the search results in a preset time period; and
determining push pertinence information of the test information set to be pushed based on the first attention and determining push pertinence information of the reference information set to be pushed based on the second attention.
10. The device of claim 9, wherein the log associated with the search result comprises a click data log for recording a click operation of a user on information to be pushed in the search result;
the first attention comprises a first click quantity of each piece of information to be pushed in a search result obtained by searching in the test set to be pushed and determined from a click data log in a preset time period;
the second attention comprises a second click rate of each piece of information to be pushed in a search result obtained by searching in the reference set to be pushed and determined from a click data log in a preset time period.
11. The device according to claim 9 or 10, wherein the log associated with the search result comprises an exposure data log, and the exposure data log is used for recording presentation information of any information to be pushed in the search result presented on a page;
the first attention comprises a first exposure amount of each piece of information to be pushed in a search result obtained by searching in the test set to be pushed and determined from an exposure data log in a preset time period;
the second attention comprises a second exposure amount of each piece of information to be pushed in a search result obtained by searching in the reference set to be pushed and determined from an exposure data log in a preset time period.
12. The apparatus of claim 7, wherein the apparatus further comprises an indexing unit;
the index unit is configured to establish a first search index of the test information set to be pushed based on the first tag, and establish a second search index of the reference information set to be pushed based on the second tag;
the search unit is further configured to:
in response to receiving a search request from any source of the same preset category, searching the first search index or the second search index for information to be pushed corresponding to the search request based on a preset traffic distribution ratio.
13. An apparatus, comprising:
one or more processors;
a storage device for storing one or more programs,
when executed by the one or more processors, cause the one or more processors to implement the method of any one of claims 1-6.
14. A computer-readable storage medium, on which a computer program is stored, wherein the program, when executed by a processor, implements the method of any one of claims 1-6.
CN201711418685.1A 2017-12-25 2017-12-25 Information acquisition method and device Active CN110110197B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201711418685.1A CN110110197B (en) 2017-12-25 2017-12-25 Information acquisition method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201711418685.1A CN110110197B (en) 2017-12-25 2017-12-25 Information acquisition method and device

Publications (2)

Publication Number Publication Date
CN110110197A CN110110197A (en) 2019-08-09
CN110110197B true CN110110197B (en) 2021-08-03

Family

ID=67483473

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201711418685.1A Active CN110110197B (en) 2017-12-25 2017-12-25 Information acquisition method and device

Country Status (1)

Country Link
CN (1) CN110110197B (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113158032B (en) * 2021-03-18 2024-03-01 北京京东乾石科技有限公司 Information pushing method and device
CN114928647B (en) * 2022-05-17 2022-11-18 北京时代动向广告传媒有限公司 Application program pushing system and method based on big data

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103106259A (en) * 2013-01-25 2013-05-15 西北工业大学 Mobile webpage content recommending method based on situation
CN103455559A (en) * 2011-12-27 2013-12-18 北京奇虎科技有限公司 Method and device for automatically recommending application
CN104270429A (en) * 2014-09-19 2015-01-07 北京奇虎科技有限公司 Method and device for pushing application to terminal
CN104572791A (en) * 2013-10-24 2015-04-29 腾讯科技(深圳)有限公司 Method and device for evaluating search prompt system
CN105701155A (en) * 2015-12-30 2016-06-22 百度在线网络技术(北京)有限公司 Information push method and the device
CN107306355A (en) * 2016-04-20 2017-10-31 华为技术有限公司 A kind of content recommendation method and server

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR101479591B1 (en) * 2008-11-21 2015-01-08 삼성전자주식회사 Method and apparatus for searching cell of mobile communication

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103455559A (en) * 2011-12-27 2013-12-18 北京奇虎科技有限公司 Method and device for automatically recommending application
CN103106259A (en) * 2013-01-25 2013-05-15 西北工业大学 Mobile webpage content recommending method based on situation
CN104572791A (en) * 2013-10-24 2015-04-29 腾讯科技(深圳)有限公司 Method and device for evaluating search prompt system
CN104270429A (en) * 2014-09-19 2015-01-07 北京奇虎科技有限公司 Method and device for pushing application to terminal
CN105701155A (en) * 2015-12-30 2016-06-22 百度在线网络技术(北京)有限公司 Information push method and the device
CN107306355A (en) * 2016-04-20 2017-10-31 华为技术有限公司 A kind of content recommendation method and server

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
中小学网络学习平台个性化推送系统的研究与设计;方琦;《中国优秀硕士学位论文全文数据库 信息科技辑》;20140315;I138-1162 *

Also Published As

Publication number Publication date
CN110110197A (en) 2019-08-09

Similar Documents

Publication Publication Date Title
CN109460513B (en) Method and apparatus for generating click rate prediction model
CN107172151B (en) Method and device for pushing information
CN109522483B (en) Method and device for pushing information
CN111125574B (en) Method and device for generating information
WO2017035970A1 (en) Information pushing method and apparatus
CN107944481B (en) Method and apparatus for generating information
CN110298716B (en) Information pushing method and device
CN108810047B (en) Method and device for determining information push accuracy rate and server
CN111913738B (en) Access request processing method, device, computing equipment and medium
CN104067274A (en) System and method for improving access to search results
CN110866040B (en) User portrait generation method, device and system
CN109992719B (en) Method and apparatus for determining push priority information
CN111222960A (en) Room source recommendation method and system based on public traffic zone
CN108600329B (en) Method and equipment for pushing information and displaying information
CN113971243A (en) Data processing method, system, equipment and storage medium applied to questionnaire survey
CN110674404A (en) Link information generation method, device, system, storage medium and electronic equipment
CN107704357B (en) Log generation method and device
CN108932640B (en) Method and device for processing orders
CN110110197B (en) Information acquisition method and device
CN111126649B (en) Method and device for generating information
CN108011936B (en) Method and device for pushing information
CN114297476A (en) Questionnaire survey method, system, electronic equipment and storage medium based on user tags
CN108932594B (en) Information pushing method and device
CN111125502B (en) Method and device for generating information
CN107920100B (en) Information pushing method and device

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant