WO2018171288A1 - Method and apparatus for tagging information stream, terminal device, and storage medium - Google Patents

Method and apparatus for tagging information stream, terminal device, and storage medium Download PDF

Info

Publication number
WO2018171288A1
WO2018171288A1 PCT/CN2017/120182 CN2017120182W WO2018171288A1 WO 2018171288 A1 WO2018171288 A1 WO 2018171288A1 CN 2017120182 W CN2017120182 W CN 2017120182W WO 2018171288 A1 WO2018171288 A1 WO 2018171288A1
Authority
WO
WIPO (PCT)
Prior art keywords
user
tag
preset
interest
degree
Prior art date
Application number
PCT/CN2017/120182
Other languages
French (fr)
Chinese (zh)
Inventor
潘岸腾
Original Assignee
广州优视网络科技有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 广州优视网络科技有限公司 filed Critical 广州优视网络科技有限公司
Publication of WO2018171288A1 publication Critical patent/WO2018171288A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/955Retrieval from the web using information identifiers, e.g. uniform resource locators [URL]
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9535Search customisation based on user profiles and personalisation

Definitions

  • the present invention relates to the field of information processing technologies, and in particular, to a method, an apparatus, a terminal device, and a storage medium for labeling information streams.
  • the current app store or application market can display the app directly to the user, as shown in FIG. 1A, and also provides a new app release method: Add information to the app store, introduce and promote apps through interesting articles, short videos, or headlines, open the stream to see article content, videos or news, and have the stream at the bottom of the page.
  • the application for downloading as shown in FIG. 1A and FIG. 1B, when clicking on the news "Do you have the most dangerous function of WeChat?" in the information flow display page shown in FIG. 1A, the page shown in FIG. 1B is entered.
  • the news provider is also provided at the bottom of the page - the third-party application "UC headline" and the download button.
  • the information flow provided by many channels lacks description of information.
  • the specifications of information description are not uniform, and the use of information flow-application store or application market does not currently have one. A good way to unify the information with irregular descriptions from various channels, so that the labeling of the information flow can not be done automatically by means of tools, and relying on manual implementation of labeling is time-consuming and laborious, not easy to do.
  • the business that carries out the personalized recommendation information flow related to the information itself may encounter many difficulties.
  • an embodiment of the present invention provides a method for labeling an information flow, including: determining, according to a third-party application installed by the user on the terminal, the user's interest in different tags; based on the interest degree and the user information.
  • the click condition of the stream determines the matching degree of the information flow to the label; according to the matching degree, a corresponding number of labels are selected according to the preset manner to label the information flow.
  • an embodiment of the present invention provides an apparatus for labeling an information flow, including: an interest degree determining unit, configured to determine, according to a third-party application installed by the user on the terminal, the user's interest level for different labels;
  • the matching degree determining unit is configured to determine the matching degree of the information stream to the label based on the interest degree and the user's click condition of the information flow;
  • the labeling unit is configured to select a corresponding quantity label according to the matching degree to the information flow labeling according to the preset manner.
  • an embodiment of the present invention provides a terminal device, including: one or more processors; a memory; one or more applications, wherein the one or more applications are stored in the memory And configured to be executed by the one or more processors, the one or more programs configured to:
  • a corresponding number of labels are selected according to a preset manner to label the information flow.
  • an embodiment of the present invention provides a computer readable storage medium carrying one or more computer instruction programs executed by one or more processors when the computer instruction program is executed by one or more processors
  • Methods for labeling information flows including:
  • a corresponding number of labels are selected according to a preset manner to label the information flow.
  • the embodiment of the invention provides a method, a device, a terminal device and a storage medium for labeling information flow, and can analyze the user by counting the third-party application installed by the user on the terminal used by the user and clicking the information flow by the user.
  • the degree of interest in the tag, and then the degree of matching between the information stream that the user clicked and the tag is analyzed, so that a certain number of tags with the highest matching degree can be selected as the tag of the information stream, and the tag is marked, thereby realizing the application store
  • the information flow provided in the application market automatically labels the label, which solves the problem that the manual labeling is time-consuming and laborious, and is not easy to complete, and the automatic labeling of the information flow is beneficial to the subsequent personalized recommendation related to the information flow itself.
  • Information flow business
  • FIG. 1A is a screenshot of an example of an existing application store adopting an information flow recommendation application
  • Figure 1B is an example screenshot of a detail page of an information of an information stream
  • FIG. 2 is a flowchart of a method for labeling an information flow according to an embodiment of the present invention
  • FIG. 3 is a schematic block diagram of an apparatus for labeling an information flow according to an embodiment of the present disclosure
  • FIG. 4 is a block diagram of an internal structure of a terminal device according to an embodiment of the present invention.
  • the main idea of the solution is to determine the user's interest in different tags by the third-party application installed by the user on the terminal; determine the information flow tag based on the user's interest in different tags and the user clicks on the information flow. Matching degree; after obtaining the matching degree of the information stream to the label, selecting a certain number of labels as the label of the information stream according to the matching degree according to the matching degree, and marking the information flow.
  • FIG. 2 is a flow chart of a method for labeling an information flow according to an embodiment of the present invention. As shown in FIG. 2, the method for labeling an information flow according to an embodiment of the present invention includes the following steps:
  • S1 Determine the user's interest in different tags based on the third-party application installed by the user on the terminal.
  • cint p, i, j represents the degree of interest of the third party application i in the preset application library installed by the user p on the terminal for the tag j in the tag set;
  • Ct1 p,i indicates that the user p installed the third-party application i in the preset application library in the N days and maintained the number of days until today, where the user p does not install the third-party application i within N days, ct1 p,i is N;
  • Tag i,j indicates whether the third-party application i has the tag j in the preset tag set, wherein the tag i, j is 1 when there is a tag j, otherwise 0;
  • n the number of third-party applications in the preset application library
  • n the number of labels in the preset label set
  • N is an integer greater than zero.
  • N can set the number of days according to the actual needs, such as 60 days, 90 days, 180 days, and so on.
  • the third-party application in the preset application library installed by the user within a certain number of days has a weight of all the labels of the application, and the weight of the label has decayed with time, and the user is installed because the third party has installed the third party.
  • a third-party application that a user usually installs on a terminal that is used by the user, so that all third-party applications installed on the terminal need to accumulate interest values of the same tag, thereby obtaining that the user has different tags.
  • the degree of interest is as follows:
  • int p,j represents the degree of interest of a user p for the tag j in the preset tag set
  • D represents the number of third-party applications in the preset application library installed by the user p on the terminal used by the user p;
  • Cint p,i,j represents the degree of interest of the third party application i in the preset application library installed by the user p on the terminal for the tag j in the tag set.
  • the user is accumulated for the same tag's interest value due to all the third-party applications installed on the terminal, thereby obtaining the user's interest in different tags.
  • S2 determining, according to the interest degree and the click condition of the user on the information flow, the matching degree of the information flow that the user clicked on the label;
  • the user After determining a user's interest in different tags, the user needs to count the user's click on the information stream, and then determine the user click based on the user's interest in different tags and the user's click on the information stream. The degree to which the information flow matches the label.
  • the degree of interest of different users for different tags is attenuated according to the time when the user clicks on the information flow, and the matching degree of the information flow to the tags is obtained.
  • S l,j represents the degree of matching of the information stream 1 clicked by the user for the tag j in the preset tag set
  • Ct2 p,l represents the number of days that the user p last clicked on the information stream l today, where user p did not click on the information stream l within N days, ct2 p, l is N;
  • Int p,j represents the degree of interest of the user p for the tag j in the preset tag set
  • n the number of labels in the preset label set
  • U represents the number of streams in the preset stream library
  • F indicates the number of users who have clicked on the information stream l
  • N is an integer greater than 0;
  • N can set the number of days according to the actual needs, such as 60 days, 90 days, 180 days, and so on.
  • Those skilled in the art know that when counting data of a user clicking on a stream of information, it is necessary to define a period of time to have a statistical result.
  • the number of days to install a third-party application is the same, for example, 60 days, 90 days, 180 days, and the like.
  • the application store or the application market can also preset a stream library, and the information streams that need to be displayed are placed in the preset stream library.
  • S3 Select a corresponding quantity label according to the matching degree to label the information flow according to a preset manner.
  • the step of selecting a corresponding quantity of labels according to a matching manner to label the information flow according to the matching manner includes: sorting the labels in descending order according to the matching degree of the information streams to the labels; and selecting a certain number from the arrangement according to the largest to the small Label and label the information flow.
  • the corresponding number of labels are selected according to the matching degree from the largest to the smallest, and the information flow is marked.
  • 3-5 labels may be selected as labels of the information flow, that is, from large to small. Sort the 3-5 labels corresponding to the matching scores of the first 3-5.
  • a certain number of labels may be randomly selected from the plurality of labels corresponding to the matching degree that is greater than or equal to the preset threshold to label the information flow, and may also be selected according to the matching degree from large to small, for example, may be selected.
  • 3-5 labels are labeled as labels for the information stream.
  • the information flow involved in the embodiment of the present invention preferably refers to an information flow without a label.
  • the information flow can be re-labeled using the method of the embodiment of the present invention.
  • the matching degree calculated based on the tag is relatively large; when encountered A label is an unpopular label, such as "Chinese Chess”, and the matching calculated based on the label is relatively small.
  • the label “Chinese Chess” will become a hot label. It is necessary to standardize the matching degree to obtain the final matching degree of the information stream to the label.
  • the first matching degree of the different labels is calculated as follows:
  • S l,j represents the first matching degree of the information stream 1 clicked by the user for the label j in the preset label set
  • Ct2 p,l represents the number of days that the user p last clicked on the information stream l today, where user p did not click on the information stream l within N days, ct2 p, l is N;
  • Int p,j represents the degree of interest of the user p for the tag j in the preset tag set
  • n the number of labels in the preset label set
  • U represents the number of streams in the preset stream library
  • F indicates the number of users who have clicked on the information stream l
  • N is an integer greater than 0;
  • N can set the number of days according to the actual needs, such as 60 days, 90 days, 180 days, and so on.
  • Those skilled in the art know that when counting data of a user clicking on a stream of information, it is necessary to define a period of time to have a statistical result.
  • the number of days to install a third-party application is the same, for example, 60 days, 90 days, 180 days, and the like.
  • the application store or the application market can also preset a stream library, and the information streams that need to be displayed are placed in the preset stream library.
  • the first matching degree is standardized, and the final matching degree of the information stream clicked by the user on the label is obtained, as follows:
  • ptag l, j represents the degree of matching between the information stream 1 clicked by the user and the tag j in the preset tag set;
  • S l,j represents the first matching degree of the information stream 1 clicked by the user for the tag j in the preset tag set
  • U represents the number of streams in the preset stream library
  • n the number of labels in the preset label set.
  • the practical significance of performing the standardization process is to convert the matching degree of the information stream 1 clicked by the user to the label j to a multiple of the average matching degree of all the information streams for the label j, and use this multiple to measure the information flow 1 and the label j.
  • the degree of matching between the two indicates that the higher the degree of matching of the information stream 1 to the label j, the lower the multiple indicates that the matching degree of the information stream 1 to the label j is lower.
  • the matching degree of the information stream a clicked by the user on the label b is 0.3
  • the average matching degree of all the information streams to the label b is 0.1
  • the final matching degree of the information stream a to the label b is 3.
  • the method for labeling information flow by counting the third-party application installed by the user on the terminal used by the user and the user clicking the information flow, the user's interest in the label can be determined, and then the user is obtained.
  • the degree of matching between the clicked stream and the tag so that a certain number of tags with the highest matching degree can be selected as the tag of the stream, and the tag is marked, thereby realizing the automatic flow of information preset in the application store or the application market.
  • Labeling labels solves the problem of time-consuming and laborious manual labeling, and it is not easy to complete.
  • FIG. 3 is a schematic block diagram of an apparatus for labeling an information flow according to another embodiment of the present invention. As shown in FIG. 3, the apparatus for labeling an information flow according to an embodiment of the present invention includes:
  • a degree of interest determining unit configured to determine, according to a third party application installed by the user on the terminal, the degree of interest of the user for different tags
  • a matching degree determining unit configured to determine a matching degree of the information stream to the tag based on the interest degree and the click condition of the user on the information flow;
  • a labeling unit configured to select, according to the matching degree, a corresponding quantity of labels to label the information flow according to a preset manner.
  • the interest degree determining unit is specifically configured to determine, by counting, that each tag of the third-party application in the preset application library installed by the user in a period of time accounts for attenuating the weight of all tags of the application with time, and determining The user's interest in different tags in the tag set due to the installation of the third-party application; the interest degree determining unit is specifically used to interest the same tag for all third-party applications installed on the terminal.
  • the degrees are accumulated to determine the user's interest in different tags.
  • the interest degree determining unit is further configured to calculate, by using the following formula, the interest degree of each third-party application installed by the user on the terminal for different labels in the label set:
  • cint p, i, j represents the degree of interest of the third party application i in the preset application library installed by the user p on the terminal for the tag j in the tag set;
  • Ct1 p,i means that the user p installs the third-party application i in the preset application library in the N days on the terminal and keeps the number of days until today, wherein the user p does not install ct1 p, N is N in N days;
  • Tag i,j indicates whether the third-party application i has the tag j in the preset tag set, wherein the tag i, j is 1 when there is a tag j, otherwise 0;
  • n the number of third-party applications in the preset application library
  • n the number of labels in the preset label set
  • N is an integer greater than 0;
  • the interest degree determining unit is further configured to determine the user's interest in different tags by using the following formula:
  • int p,j represents the degree of interest of a user p for the tag j in the preset tag set
  • D represents the number of third-party applications in the preset application library installed by the user p on the terminal.
  • the matching degree determining unit is specifically configured to obtain a matching degree of the information stream to the label by performing statistics on the degree of interest of different users for different tags as the user clicks on the information flow.
  • S l,j represents the degree of matching of the information stream 1 clicked by the user for the tag j in the preset tag set
  • Ct2 p,l represents the number of days that the user p last clicked on the information stream l today, where user p did not click on the information stream l within N days, ct2 p, l is N;
  • Int p,j represents the degree of interest of the user p for the tag j in the preset tag set
  • n the number of labels in the preset label set
  • U represents the number of streams in the preset stream library
  • F indicates the number of users who have clicked on the information stream l
  • N is an integer greater than zero.
  • the first matching degree of the different labels is calculated as follows:
  • S l,j represents the first matching degree of the information stream 1 clicked by the user for the label j in the preset label set
  • Ct2 p,l represents the number of days that the user p last clicked on the information stream l today, where user p did not click on the information stream l within N days, ct2 p, l is N;
  • Int p,j represents the degree of interest of the user p for the tag j in the preset tag set
  • n the number of labels in the preset label set
  • U represents the number of streams in the preset stream library
  • F indicates the number of users who have clicked on the information stream l
  • N is an integer greater than 0;
  • N can set the number of days according to the actual needs, such as 60 days, 90 days, 180 days, and so on.
  • Those skilled in the art know that when counting data of a user clicking on a stream of information, it is necessary to define a period of time to have a statistical result.
  • the number of days to install a third-party application is the same, for example, 60 days, 90 days, 180 days, and the like.
  • the application store or the application market can also preset a stream library, and the information streams that need to be displayed are placed in the preset stream library.
  • the first matching degree is standardized, and the final matching degree of the information stream clicked by the user on the label is obtained, as follows:
  • ptag l, j represents the degree of matching between the information stream 1 clicked by the user and the tag j in the preset tag set;
  • S l,j represents the first matching degree of the information stream 1 clicked by the user for the tag j in the preset tag set
  • U represents the number of streams in the preset stream library
  • n the number of labels in the preset label set.
  • the labeling unit is specifically configured to arrange the labels in descending order of the matching degree of the labels according to the information flow.
  • the labeling unit is specifically used to select a certain number of labels from the largest to the smallest in the arrangement, and mark the information flow.
  • the labeling unit selects a corresponding number of labels to mark the information flow based on the matching degree from the largest to the smallest, for example, 3-5 labels may be selected as labels of the information stream, that is, Sort the 3-5 labels corresponding to the matching degrees of the first 3-5 names from large to small.
  • a certain number of labels may be randomly selected from the plurality of labels corresponding to the matching degree that are greater than or equal to the preset threshold to label the information flow, and of course, the matching degree may be selected from the largest to the smallest, for example, 3-5 labels are selected as labels for the information stream for labeling.
  • the device for labeling the information flow can determine the user's interest in the label by counting the third-party application installed by the user on the terminal used by the user and the user clicking the information flow, thereby obtaining the user.
  • the degree of matching between the clicked stream and the tag so that a certain number of tags with the highest matching degree can be selected as the tag of the stream, and the tag is marked, thereby realizing the automatic flow of information preset in the application store or the application market.
  • Labeling labels solves the problem of time-consuming and laborious manual labeling, and it is not easy to complete.
  • a computer program product for providing a method for labeling an information stream according to an embodiment of the present invention includes a computer readable storage medium storing program code, the program code comprising instructions for executing the method described in the foregoing method embodiment
  • program code comprising instructions for executing the method described in the foregoing method embodiment
  • the embodiment of the present invention provides a terminal device, which is specifically as follows:
  • the terminal device includes a processor 410, a memory 420, an internal memory 430, a network interface 440, and a display screen 450 connected through a system bus.
  • the processor 410 is configured to implement a computing function and a function of controlling the operation of the terminal device, and the processor 410 is configured to perform the method for labeling the information flow provided by the above embodiment.
  • the processor 410 is configured to determine, according to the third party application installed by the user on the terminal, the degree of interest of the user for different tags; determining the matching degree of the information flow to the tag based on the degree of interest and the click condition of the user on the information flow; Set the mode to select a corresponding number of labels to label the information flow.
  • the memory 420 is a non-volatile storage medium storing an operating system 421, a database 422, and a computer program for implementing the information stream labeling-based method provided by the above embodiments, and candidate intermediate data generated by executing the computer program, and Result data.
  • Network interface 440 is used to communicate with the server, and network interface 440 includes a radio frequency transceiver.
  • an embodiment of the present invention further provides a computer readable storage medium carrying one or more computer instruction programs, and when the computer instruction program is executed by one or more processors, one or more processors execute the implementation
  • the method for labeling information flow includes: determining, according to a third-party application installed by the user on the terminal, the user's interest in different tags; determining the matching degree of the information flow to the tag based on the interest degree and the user's click on the information flow According to the matching degree, a corresponding number of labels are selected according to a preset manner to mark the information flow.
  • the foregoing program may be stored in a computer readable storage medium, and the program is executed when executed.
  • the foregoing storage medium includes: a mobile storage device, a random access memory (RAM), a read-only memory (ROM), a magnetic disk, or an optical disk.
  • RAM random access memory
  • ROM read-only memory
  • magnetic disk or an optical disk.
  • optical disk A medium that can store program code.
  • the functions may be stored in a computer readable storage medium if implemented in the form of a software functional unit and sold or used as a standalone product.
  • the technical solution of the present invention which is essential or contributes to the prior art, or a part of the technical solution, may be embodied in the form of a software product, which is stored in a storage medium, including
  • the instructions are used to cause a computer device (which may be a personal computer, tablet, smartphone, server, or network device, etc.) to perform all or part of the steps of the methods described in various embodiments of the present invention.
  • the foregoing storage medium includes various media that can store program codes, such as a USB flash drive, a removable hard disk, a read only memory (ROM), a random access memory (RAM), a magnetic disk, or an optical disk.

Abstract

A method for tagging an information stream, comprising: determining degrees of interest of a user to different tags on the basis a third party application installed by the user on a terminal (S1); determining degrees of matching between an information stream and the tags on the basis of the degrees of interest and information about tapping the information stream by the user (S2); and selecting a particular number of tags on the basis of the degrees of matching in a preset manner, and tagging the information stream (S3).

Description

给信息流标注标签的方法、装置、终端设备及存储介质Method, device, terminal device and storage medium for labeling information flow 技术领域Technical field
本发明涉及信息处理技术领域,具体而言涉及一种给信息流标注标签的方法、装置、终端设备及存储介质。The present invention relates to the field of information processing technologies, and in particular, to a method, an apparatus, a terminal device, and a storage medium for labeling information streams.
背景技术Background technique
随着互联网技术和智能移动终端技术的快速发展,很多在计算机终端上实现的功能(例如购物、阅读)也都可以在智能移动终端上实现,例如使用智能手机或平板电脑等。另外,这些功能的实现需要在智能移动终端上安装相应的应用程序。例如,网上购物,需要安装例如淘宝客户端,听音乐需要安装音乐播放器客户端等。由此,很多软件公司提供了应用商店或应用市场,例如豌豆荚或者PP助手等。用户可以打开应用商店或者应用市场,从而能够快速搜索和下载所需要的各种应用程序,包括影音播放类、系统工具类、通讯社交类、网上购物类、阅读类等,当然还可以下载游戏等休闲娱乐类应用程序(APP)。With the rapid development of Internet technologies and smart mobile terminal technologies, many functions implemented on computer terminals (such as shopping, reading) can also be implemented on smart mobile terminals, such as using a smart phone or a tablet. In addition, the implementation of these functions requires the installation of the corresponding application on the smart mobile terminal. For example, online shopping requires installing a Taobao client, for example, to listen to music, and to install a music player client. As a result, many software companies offer application stores or application markets, such as pea pods or PP assistants. Users can open the app store or the app market, so they can quickly search and download the various applications they need, including video playback, system tools, communication and social, online shopping, reading, etc. Of course, you can download games, etc. Entertainment app (APP).
为了不断提升用户使用应用商店或者应用市场的良好体验感,目前的应用商店或应用市场除了能够将应用直接展示给用户之外,如图1A所示,还提供了一种新的应用发行方式:在应用商店增加信息流,通过有趣的文章、短视频或头条新闻等对应用进行介绍和推销,打开信息流能看到文章内容、视频或新闻等,并且页面底部会有提供该信息流的可供下载的应用,如图1A和图1B所示,当点击图1A所示的信息流展示页面中的新闻“微信最危险的功能你关了吗?”时,进入图1B所示的页面,上面除了介绍这篇新闻的详细内容时,页面底部还提供了该新闻的提供者-第三方应用“UC头条”及下载按钮。然而,“信息流”的信息来源渠道很多,许多渠道提供的信息流缺乏对信息的描述,另外各渠道对信息描述的规范不统一,而且信 息流的使用方-应用商店或应用市场目前没有一个好的方法来统一描述来自各种渠道的带有不规范描述的信息,由此给信息流标注标签的工作无法借助工具自动完成,而依赖人工实现标注标签,则费时费力、不容易做。In order to continuously improve the user's good experience in using the app store or the application market, the current app store or application market can display the app directly to the user, as shown in FIG. 1A, and also provides a new app release method: Add information to the app store, introduce and promote apps through interesting articles, short videos, or headlines, open the stream to see article content, videos or news, and have the stream at the bottom of the page. The application for downloading, as shown in FIG. 1A and FIG. 1B, when clicking on the news "Do you have the most dangerous function of WeChat?" in the information flow display page shown in FIG. 1A, the page shown in FIG. 1B is entered. In addition to the details of this news, the news provider is also provided at the bottom of the page - the third-party application "UC headline" and the download button. However, there are many sources of information sources for “information flow”. The information flow provided by many channels lacks description of information. In addition, the specifications of information description are not uniform, and the use of information flow-application store or application market does not currently have one. A good way to unify the information with irregular descriptions from various channels, so that the labeling of the information flow can not be done automatically by means of tools, and relying on manual implementation of labeling is time-consuming and laborious, not easy to do.
在信息流没有标注标签的情况下,会导致开展与信息本身相关的个性化推荐信息流的业务会遇到很多困难。In the case where the information flow is not labeled, the business that carries out the personalized recommendation information flow related to the information itself may encounter many difficulties.
发明内容Summary of the invention
本发明实施例的目的在于提供一种给信息流标注标签的方法和装置,以改善上述问题。It is an object of embodiments of the present invention to provide a method and apparatus for labeling information streams to improve the above problems.
根据第一方面,本发明实施例提供了一种给信息流标注标签的方法,包括:基于用户在终端上安装的第三方应用确定该用户对于不同标签的兴趣度;基于兴趣度和用户对信息流的点击情况确定信息流对标签的匹配度;基于匹配度按预设方式选取对应的一定数量标签给信息流标注。According to a first aspect, an embodiment of the present invention provides a method for labeling an information flow, including: determining, according to a third-party application installed by the user on the terminal, the user's interest in different tags; based on the interest degree and the user information. The click condition of the stream determines the matching degree of the information flow to the label; according to the matching degree, a corresponding number of labels are selected according to the preset manner to label the information flow.
根据第二方面,本发明实施例提供了一种给信息流标注标签的装置,包括:兴趣度确定单元,用于基于用户在终端上安装的第三方应用确定该用户对于不同标签的兴趣度;匹配度确定单元,用于基于兴趣度和用户对信息流的点击情况确定信息流对标签的匹配度;标注单元,用于基于匹配度按预设方式选取对应的一定数量标签给信息流标注。According to a second aspect, an embodiment of the present invention provides an apparatus for labeling an information flow, including: an interest degree determining unit, configured to determine, according to a third-party application installed by the user on the terminal, the user's interest level for different labels; The matching degree determining unit is configured to determine the matching degree of the information stream to the label based on the interest degree and the user's click condition of the information flow; the labeling unit is configured to select a corresponding quantity label according to the matching degree to the information flow labeling according to the preset manner.
根据第三方面,本发明实施例提供了一种终端设备,包括:一个或多个处理器;存储器;一个或多个应用程序,其中所述一个或多个应用程序被存储在所述存储器中并被配置为由所述一个或多个处理器执行,所述一个或多个程序配置用于:According to a third aspect, an embodiment of the present invention provides a terminal device, including: one or more processors; a memory; one or more applications, wherein the one or more applications are stored in the memory And configured to be executed by the one or more processors, the one or more programs configured to:
基于用户在终端上安装的第三方应用确定该用户对于不同标签的兴趣度;Determining the user's interest in different tags based on the third-party application installed by the user on the terminal;
基于兴趣度和用户对信息流的点击情况确定信息流对标签的匹配度;Determining the matching degree of the information flow to the tag based on the degree of interest and the user's click on the information flow;
基于匹配度按预设方式选取对应的一定数量标签给信息流标注。According to the matching degree, a corresponding number of labels are selected according to a preset manner to label the information flow.
根据第四方面,本发明实施例提供了一种计算机可读存储介质,其上 承载一个或多个计算机指令程序,计算机指令程序被一个或多个处理器执行时,一个或多个处理器执行给信息流标注标签的方法,包括:According to a fourth aspect, an embodiment of the present invention provides a computer readable storage medium carrying one or more computer instruction programs executed by one or more processors when the computer instruction program is executed by one or more processors Methods for labeling information flows, including:
基于用户在终端上安装的第三方应用确定该用户对于不同标签的兴趣度;Determining the user's interest in different tags based on the third-party application installed by the user on the terminal;
基于兴趣度和用户对信息流的点击情况确定信息流对标签的匹配度;Determining the matching degree of the information flow to the tag based on the degree of interest and the user's click on the information flow;
基于匹配度按预设方式选取对应的一定数量标签给信息流标注。According to the matching degree, a corresponding number of labels are selected according to a preset manner to label the information flow.
本发明实施例提供了一种给信息流标注标签的方法、装置、终端设备及存储介质,通过统计用户在其使用的终端上安装的第三方应用和用户点击信息流的情况,能够分析出用户对标签的兴趣度,进而分析出用户点击过的信息流与标签的匹配度,从而可以选取一定数量匹配度最高的标签作为该信息流的标签,对其进行标注,由此可以实现对应用商店或应用市场中提供的信息流自动标注标签,解决了人工标注标签的费时费力、不容易完成的问题,而且通过为信息流自动标注标签,有利于后续开展的与信息流本身相关的个性化推荐信息流的业务。The embodiment of the invention provides a method, a device, a terminal device and a storage medium for labeling information flow, and can analyze the user by counting the third-party application installed by the user on the terminal used by the user and clicking the information flow by the user. The degree of interest in the tag, and then the degree of matching between the information stream that the user clicked and the tag is analyzed, so that a certain number of tags with the highest matching degree can be selected as the tag of the information stream, and the tag is marked, thereby realizing the application store Or the information flow provided in the application market automatically labels the label, which solves the problem that the manual labeling is time-consuming and laborious, and is not easy to complete, and the automatic labeling of the information flow is beneficial to the subsequent personalized recommendation related to the information flow itself. Information flow business.
附图说明DRAWINGS
图1A是现有的应用商店采用信息流方式推荐应用的一个实例截图;FIG. 1A is a screenshot of an example of an existing application store adopting an information flow recommendation application;
图1B是信息流的一个信息的详情页的一个实例截图;Figure 1B is an example screenshot of a detail page of an information of an information stream;
图2是本发明实施例提供的给信息流标注标签的方法的流程图;2 is a flowchart of a method for labeling an information flow according to an embodiment of the present invention;
图3是本发实施例提供的给信息流标注标签的装置的示意性框图;3 is a schematic block diagram of an apparatus for labeling an information flow according to an embodiment of the present disclosure;
图4为本发明实施例提供的终端设备的内部结构框图。FIG. 4 is a block diagram of an internal structure of a terminal device according to an embodiment of the present invention.
具体实施方式detailed description
下面将结合本发明实施例和附图,对本发明实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例仅是本发明一部分实施例,而不是全部的实施例。通常在此处附图中描述和示出的本发明实施例的组件可以以各种不同的配置来布置和设计。因此,以下对在附图中提供的本发明的实施例的详细描述并非旨在限制要求保护的本发明的范围,而是仅仅 表示本发明的选定实施例。基于本发明的实施例,本领域技术人员在没有做出创造性劳动的前提下所获得的所有其他实施例,都属于本发明保护的范围。The technical solutions in the embodiments of the present invention are clearly and completely described in the following with reference to the embodiments of the present invention and the accompanying drawings. It is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. The components of the embodiments of the invention, which are generally described and illustrated in the figures herein, may be arranged and designed in various different configurations. Therefore, the following detailed description of the embodiments of the invention in the claims All other embodiments obtained by those skilled in the art based on the embodiments of the present invention without creative efforts are within the scope of the present invention.
本方案的主要思路是:通过用户在终端上安装的第三方应用确定该用户对于不同标签的兴趣度;基于用户对于不同标签的兴趣度以及该用户点击信息流的情况来确定信息流对标签的匹配度;在得到了信息流对标签的匹配度后,基于所述匹配度按预设方式选取一定数量的标签作为所述信息流的标签,给所述信息流标注上。The main idea of the solution is to determine the user's interest in different tags by the third-party application installed by the user on the terminal; determine the information flow tag based on the user's interest in different tags and the user clicks on the information flow. Matching degree; after obtaining the matching degree of the information stream to the label, selecting a certain number of labels as the label of the information stream according to the matching degree according to the matching degree, and marking the information flow.
第一实施例First embodiment
图2是本发明实施例的给信息流标注标签的方法的流程图。如图2所示,本发明实施例的给信息流标注标签的方法包括以下步骤:2 is a flow chart of a method for labeling an information flow according to an embodiment of the present invention. As shown in FIG. 2, the method for labeling an information flow according to an embodiment of the present invention includes the following steps:
S1:基于用户在终端上安装的第三方应用确定该用户对于不同标签的兴趣度。S1: Determine the user's interest in different tags based on the third-party application installed by the user on the terminal.
基于上述思路,需要统计一个用户在其使用的终端上到目前为止都安装了哪些第三方应用。通过统计在一段天数内用户安装的预置应用库里的第三方应用具有的每个标签占该应用具有的所有标签的权重随时间衰减而得出该用户因安装了该第三方应用而对于标签集合里的不同标签的兴趣度,将该用户因在终端上安装的所有第三方应用而对于相同标签的兴趣度值进行累加,从而得出该用户对于不同标签的兴趣度。这样,基于这些安装的第三方应用来确定该用户对于不同标签的兴趣度。Based on the above ideas, it is necessary to count which third-party applications a user has installed so far on the terminal they use. By counting the weight of all the tags of the third-party application in the preset application library installed by the user within a certain number of days, the weight of all the tags of the application is decayed with time, and the user is attached to the third-party application. The degree of interest of different tags in the collection, the user is accumulated for the same tag's interest value due to all third-party applications installed on the terminal, thereby obtaining the user's interest in different tags. In this way, the user's interest in different tags is determined based on these installed third party applications.
首先计算所述用户在终端上安装的每个第三方应用对于标签集合里的不同标签的兴趣度,计算方法如下:First, calculate the degree of interest of each third-party application installed by the user on the terminal for different tags in the tag set, and the calculation method is as follows:
Figure PCTCN2017120182-appb-000001
Figure PCTCN2017120182-appb-000001
其中:cint p,i,j表示某个用户p在终端上安装的预置应用库里的第三方应用i对于标签集合里的标签j的兴趣度; Where: cint p, i, j represents the degree of interest of the third party application i in the preset application library installed by the user p on the terminal for the tag j in the tag set;
ct1 p,i表示用户p在终端上在N天内安装了预置应用库里的第三方应用i并保持到今天的天数,其中用户p在N天内没有安装第三方应用i时ct1 p,i为N; Ct1 p,i indicates that the user p installed the third-party application i in the preset application library in the N days and maintained the number of days until today, where the user p does not install the third-party application i within N days, ct1 p,i is N;
tag i,j表示第三方应用i是否具有预置标签集合里的标签j,其中当有标签j时tag i,j为1,否则为0; Tag i,j indicates whether the third-party application i has the tag j in the preset tag set, wherein the tag i, j is 1 when there is a tag j, otherwise 0;
Figure PCTCN2017120182-appb-000002
表示第三方应用i具有的预置标签集合里的标签数量的累加之和;
Figure PCTCN2017120182-appb-000002
Represents the sum of the number of labels in the preset label set that the third party application i has;
n表示预置应用库里的第三方应用的数量;n represents the number of third-party applications in the preset application library;
m表示预置标签集合里的标签数量;m represents the number of labels in the preset label set;
N为大于0的整数。N is an integer greater than zero.
N可以根据实践需要自行设定天数,例如60天、90天、180天等。N can set the number of days according to the actual needs, such as 60 days, 90 days, 180 days, and so on.
本领域技术人员都知道开发应用商店或应用市场的目的和作用,应用商店或应用市场里预置了应用库,该应用库里放置了由不同第三方应用程序开发商开发的大量的各种应用。另外,应用商店或应用市场提供的第三方应用都具有1个或多个标签,这些标签也都来自于在开发应用商店或应用市场时预置的标签集合,这里不对这些常规技术做过多介绍了。Those skilled in the art are aware of the purpose and function of developing an application store or application market, and an application library is preset in the application store or the application market, and a large number of various applications developed by different third-party application developers are placed in the application library. . In addition, third-party applications provided by the app store or the app market have one or more tags, all of which come from a collection of tags that are preset when developing the app store or app market. There is no introduction to these conventional technologies. It is.
通过以上运算就可以统计在一段天数内用户安装的预置应用库里的第三方应用具有的每个标签占该应用具有的所有标签的权重随时间衰减而得出该用户因安装了该第三方应用而对于标签集合里的不同标签的兴趣度。Through the above operation, it can be counted that the third-party application in the preset application library installed by the user within a certain number of days has a weight of all the labels of the application, and the weight of the label has decayed with time, and the user is installed because the third party has installed the third party. Application and interest in different tags in the tag collection.
另外,一个用户通常会在其使用的终端上安装的多个第三方应用,所以需要将在终端上安装的所有第三方应用对于相同标签的兴趣度值进行累加,从而得出该用户对于不同标签的兴趣度,方法如下:In addition, a third-party application that a user usually installs on a terminal that is used by the user, so that all third-party applications installed on the terminal need to accumulate interest values of the same tag, thereby obtaining that the user has different tags. The degree of interest is as follows:
Figure PCTCN2017120182-appb-000003
Figure PCTCN2017120182-appb-000003
其中:int p,j表示某个用户p对于预置标签集合里的标签j的兴趣度; Where: int p,j represents the degree of interest of a user p for the tag j in the preset tag set;
D表示用户p在其使用的终端上安装的预置应用库里的第三方应用的 数量;D represents the number of third-party applications in the preset application library installed by the user p on the terminal used by the user p;
cint p,i,j表示某个用户p在终端上安装的预置应用库里的第三方应用i对于标签集合里的标签j的兴趣度。 Cint p,i,j represents the degree of interest of the third party application i in the preset application library installed by the user p on the terminal for the tag j in the tag set.
这样,将该用户因在终端上安装的所有第三方应用而对于相同标签的兴趣度值进行累加,从而得出该用户对于不同标签的兴趣度。In this way, the user is accumulated for the same tag's interest value due to all the third-party applications installed on the terminal, thereby obtaining the user's interest in different tags.
S2:基于所述兴趣度和所述用户对信息流的点击情况确定用户点击过的信息流对标签的匹配度;S2: determining, according to the interest degree and the click condition of the user on the information flow, the matching degree of the information flow that the user clicked on the label;
上面确定了一个用户对于不同标签的兴趣度后,在这里需要统计该用户对信息流的点击情况,然后结合所述用户对于不同标签的兴趣度和该用户对信息流的点击情况来确定用户点击过的信息流对标签的匹配度。After determining a user's interest in different tags, the user needs to count the user's click on the information stream, and then determine the user click based on the user's interest in different tags and the user's click on the information stream. The degree to which the information flow matches the label.
即,通过对不同用户对于不同标签的兴趣度随着用户点击信息流的时间而衰减进行统计,得到信息流对标签的匹配度。That is, the degree of interest of different users for different tags is attenuated according to the time when the user clicks on the information flow, and the matching degree of the information flow to the tags is obtained.
具体地,统计用户对信息流的点击数据,再结合所述用户对于不同标签的兴趣度,计算所述用户所点击的信息流对于标签集合里的不同标签的匹配度,计算方法如下:Specifically, the user clicks the data of the information flow, and combines the interest degree of the user with different tags, and calculates the matching degree of the information flow clicked by the user on different tags in the tag set, and the calculation method is as follows:
Figure PCTCN2017120182-appb-000004
Figure PCTCN2017120182-appb-000004
其中:S l,j表示用户所点击的信息流l对于预置标签集合里的标签j的匹配度; Where: S l,j represents the degree of matching of the information stream 1 clicked by the user for the tag j in the preset tag set;
ct2 p,l表示用户p最后一次点击信息流l距离今天的天数,其中用户p在N天内没有点击信息流l时ct2 p,l为N; Ct2 p,l represents the number of days that the user p last clicked on the information stream l today, where user p did not click on the information stream l within N days, ct2 p, l is N;
int p,j表示用户p对于预置标签集合里的标签j的兴趣度; Int p,j represents the degree of interest of the user p for the tag j in the preset tag set;
m表示预置标签集合里的标签数量;m represents the number of labels in the preset label set;
U表示预置信息流库里的信息流数量;U represents the number of streams in the preset stream library;
F表示点击过信息流l的用户数量;F indicates the number of users who have clicked on the information stream l;
N为大于0的整数;N is an integer greater than 0;
N可以根据实践需要自行设定天数,例如60天、90天、180天等。N can set the number of days according to the actual needs, such as 60 days, 90 days, 180 days, and so on.
本领域技术人员都知道,统计用户点击信息流的数据时,需要限定一个时间段才能有统计结果,这里优选设定统计用户点击信息流的数据的天数与上面介绍的统计用户在使用的终端上安装第三方应用的天数相同,例如选取60天、90天、180天等。另外,应用商店或应用市场也可以预置一个信息流库,需要展示的信息流都放置在该预置信息流库里。Those skilled in the art know that when counting data of a user clicking on a stream of information, it is necessary to define a period of time to have a statistical result. Here, it is preferable to set a number of days for which the data of the user clicks on the information stream is compared with the terminal used by the statistical user described above. The number of days to install a third-party application is the same, for example, 60 days, 90 days, 180 days, and the like. In addition, the application store or the application market can also preset a stream library, and the information streams that need to be displayed are placed in the preset stream library.
S3:基于所述匹配度按预设方式选取对应的一定数量标签给所述信息流标注。S3: Select a corresponding quantity label according to the matching degree to label the information flow according to a preset manner.
具体地,基于匹配度按预设方式选取对应的一定数量标签给信息流标注的步骤,包括:将标签按照信息流对标签的匹配度降序排列;从排列中按照从大到小选取一定数量的标签,并对信息流进行标注。Specifically, the step of selecting a corresponding quantity of labels according to a matching manner to label the information flow according to the matching manner includes: sorting the labels in descending order according to the matching degree of the information streams to the labels; and selecting a certain number from the arrangement according to the largest to the small Label and label the information flow.
优选地,基于所述匹配度从大到小顺序选取对应的一定数量标签给所述信息流标注,例如可以选取3-5个标签作为所述信息流的标签来进行标注,即从大到小排序在前3-5名的匹配度所分别对应的3-5个标签。或者,也可以从大于或等于预设阈值的所述匹配度对应的多个标签中随机选取一定数量标签给所述信息流标注,当然也可以按匹配度从大到小顺序选取,例如可以选取3-5个标签作为所述信息流的标签来进行标注。Preferably, the corresponding number of labels are selected according to the matching degree from the largest to the smallest, and the information flow is marked. For example, 3-5 labels may be selected as labels of the information flow, that is, from large to small. Sort the 3-5 labels corresponding to the matching scores of the first 3-5. Alternatively, a certain number of labels may be randomly selected from the plurality of labels corresponding to the matching degree that is greater than or equal to the preset threshold to label the information flow, and may also be selected according to the matching degree from large to small, for example, may be selected. 3-5 labels are labeled as labels for the information stream.
本发明实施例中涉及到的信息流优选指没有标注标签的信息流。当然即便通过人工方式给一部分信息流标注了标签,也可以使用本发明实施例的方法重新给这些信息流标注标签。The information flow involved in the embodiment of the present invention preferably refers to an information flow without a label. Of course, even if a part of the information flow is manually labeled, the information flow can be re-labeled using the method of the embodiment of the present invention.
另外,在上述的计算用户点击过的信息流对标签的匹配度时,当遇到的标签是热门的标签,例如:“社交”,基于该标签计算出来的匹配度比较大;当遇到的标签是冷门的标签,例如:“中国象棋”,基于该标签计算出 来的匹配度比较小。为了得到较为客观的匹配度,避免受到标签自身今天为冷门明天为热门的社会情绪影响,例如当中国象棋变成为世界性比赛而被社会大众广为关注时标签“中国象棋”会成为热门标签,很有必要对所述匹配度进行标准化处理,从而得出信息流对标签的最终的匹配度。In addition, in the above calculation of the matching degree of the information flow clicked by the user, when the tag encountered is a popular tag, for example: "social", the matching degree calculated based on the tag is relatively large; when encountered A label is an unpopular label, such as "Chinese Chess", and the matching calculated based on the label is relatively small. In order to get a more objective match, avoid the social emotions that the label itself is hot today, such as when the Chinese chess becomes a worldwide competition and the public is widely concerned, the label "Chinese Chess" will become a hot label. It is necessary to standardize the matching degree to obtain the final matching degree of the information stream to the label.
在一个优选实施例中,首先和上述步骤S2讲述的类似,统计用户对信息流的点击数据,再结合所述用户对于不同标签的兴趣度,计算所述用户所点击的信息流对于标签集合里的不同标签的第一匹配度,计算方法如下:In a preferred embodiment, first, similar to the description in step S2 above, the user clicks on the data of the information stream, and in combination with the user's interest in different tags, calculates the information stream clicked by the user for the tag set. The first matching degree of the different labels is calculated as follows:
Figure PCTCN2017120182-appb-000005
Figure PCTCN2017120182-appb-000005
其中:S l,j表示用户所点击的信息流l对于预置标签集合里的标签j的第一匹配度; Where: S l,j represents the first matching degree of the information stream 1 clicked by the user for the label j in the preset label set;
ct2 p,l表示用户p最后一次点击信息流l距离今天的天数,其中用户p在N天内没有点击信息流l时ct2 p,l为N; Ct2 p,l represents the number of days that the user p last clicked on the information stream l today, where user p did not click on the information stream l within N days, ct2 p, l is N;
int p,j表示用户p对于预置标签集合里的标签j的兴趣度; Int p,j represents the degree of interest of the user p for the tag j in the preset tag set;
m表示预置标签集合里的标签数量;m represents the number of labels in the preset label set;
U表示预置信息流库里的信息流数量;U represents the number of streams in the preset stream library;
F表示点击过信息流l的用户数量;F indicates the number of users who have clicked on the information stream l;
N为大于0的整数;N is an integer greater than 0;
N可以根据实践需要自行设定天数,例如60天、90天、180天等。N can set the number of days according to the actual needs, such as 60 days, 90 days, 180 days, and so on.
本领域技术人员都知道,统计用户点击信息流的数据时,需要限定一个时间段才能有统计结果,这里优选设定统计用户点击信息流的数据的天数与上面介绍的统计用户在使用的终端上安装第三方应用的天数相同,例如选取60天、90天、180天等。另外,应用商店或应用市场也可以预置一 个信息流库,需要展示的信息流都放置在该预置信息流库里。Those skilled in the art know that when counting data of a user clicking on a stream of information, it is necessary to define a period of time to have a statistical result. Here, it is preferable to set a number of days for which the data of the user clicks on the information stream is compared with the terminal used by the statistical user described above. The number of days to install a third-party application is the same, for example, 60 days, 90 days, 180 days, and the like. In addition, the application store or the application market can also preset a stream library, and the information streams that need to be displayed are placed in the preset stream library.
然后,对所述第一匹配度进行标准化处理,得出用户所点击的信息流对标签的最终匹配度,方法如下:Then, the first matching degree is standardized, and the final matching degree of the information stream clicked by the user on the label is obtained, as follows:
Figure PCTCN2017120182-appb-000006
Figure PCTCN2017120182-appb-000006
其中:ptag l,j表示用户所点击的信息流l与预置标签集合里的标签j之间的匹配度; Where: ptag l, j represents the degree of matching between the information stream 1 clicked by the user and the tag j in the preset tag set;
S l,j表示用户所点击的信息流l对于预置标签集合里的标签j的第一匹配度; S l,j represents the first matching degree of the information stream 1 clicked by the user for the tag j in the preset tag set;
Figure PCTCN2017120182-appb-000007
Figure PCTCN2017120182-appb-000007
U表示预置信息流库里的信息流数量;U represents the number of streams in the preset stream library;
m表示预置标签集合里的标签数量。m represents the number of labels in the preset label set.
执行标准化处理的实际意义是:将用户所点击的信息流l对于标签j的匹配度转换为对所有信息流对于标签j的平均匹配度的倍数,用这个倍数来衡量信息流l与标签j之间的匹配程度,倍数越高表示信息流l对标签j的匹配度越高,倍数越低表示信息流l对标签j的匹配度越低。例如:用户所点击的信息流a对标签b的匹配度为0.3,所有信息流对标签的b的平均匹配度为0.1,则信息流a对标签b的最终匹配度是3。The practical significance of performing the standardization process is to convert the matching degree of the information stream 1 clicked by the user to the label j to a multiple of the average matching degree of all the information streams for the label j, and use this multiple to measure the information flow 1 and the label j. The degree of matching between the two indicates that the higher the degree of matching of the information stream 1 to the label j, the lower the multiple indicates that the matching degree of the information stream 1 to the label j is lower. For example, the matching degree of the information stream a clicked by the user on the label b is 0.3, and the average matching degree of all the information streams to the label b is 0.1, and the final matching degree of the information stream a to the label b is 3.
根据本发明实施例的给信息流标注标签的方法,通过统计用户在其使用的终端上安装的第三方应用和用户点击信息流的情况,能够判断出用户对标签的兴趣度,进而得出用户点击过的信息流与标签的匹配度,从而可以选取一定数量匹配度最高的标签作为该信息流的标签,对其进行标注,由此可以实现对应用商店或应用市场中预置的信息流自动标注标签,解决了人工标注标签的费时费力、不容易完成的问题,而且通过为信息流自动标注标签,有利于后续开展的与信息流本身相关的个性化推荐信息流的业 务。According to the method for labeling information flow according to an embodiment of the present invention, by counting the third-party application installed by the user on the terminal used by the user and the user clicking the information flow, the user's interest in the label can be determined, and then the user is obtained. The degree of matching between the clicked stream and the tag, so that a certain number of tags with the highest matching degree can be selected as the tag of the stream, and the tag is marked, thereby realizing the automatic flow of information preset in the application store or the application market. Labeling labels solves the problem of time-consuming and laborious manual labeling, and it is not easy to complete. Moreover, by automatically labeling the information stream, it is beneficial to the subsequent personalized recommendation information flow related to the information flow itself.
第二实施例Second embodiment
图3是本发明另一个实施例提供的给信息流标注标签的装置的示意性框图。如图3所示,本发明实施例的给信息流标注标签的装置包括:FIG. 3 is a schematic block diagram of an apparatus for labeling an information flow according to another embodiment of the present invention. As shown in FIG. 3, the apparatus for labeling an information flow according to an embodiment of the present invention includes:
兴趣度确定单元,用于基于用户在终端上安装的第三方应用确定该用户对于不同标签的兴趣度;a degree of interest determining unit, configured to determine, according to a third party application installed by the user on the terminal, the degree of interest of the user for different tags;
匹配度确定单元,用于基于所述兴趣度和用户对信息流的点击情况确定信息流对标签的匹配度;a matching degree determining unit, configured to determine a matching degree of the information stream to the tag based on the interest degree and the click condition of the user on the information flow;
标注单元,用于基于所述匹配度按预设方式选取对应的一定数量标签给所述信息流标注。And a labeling unit, configured to select, according to the matching degree, a corresponding quantity of labels to label the information flow according to a preset manner.
优选地,所述兴趣度确定单元具体用于通过统计在一段天数内用户安装的预置应用库里的第三方应用具有的每个标签占该应用具有的所有标签的权重随时间衰减,确定该用户因安装了该第三方应用而对于标签集合里的不同标签的兴趣度;所述兴趣度确定单元,具体还用于将该用户因在终端上安装的所有第三方应用而对于相同标签的兴趣度值进行累加,确定该用户对于不同标签的兴趣度。Preferably, the interest degree determining unit is specifically configured to determine, by counting, that each tag of the third-party application in the preset application library installed by the user in a period of time accounts for attenuating the weight of all tags of the application with time, and determining The user's interest in different tags in the tag set due to the installation of the third-party application; the interest degree determining unit is specifically used to interest the same tag for all third-party applications installed on the terminal. The degrees are accumulated to determine the user's interest in different tags.
其中,所述兴趣度确定单元具体还用于通过下述公式,计算所述用户在终端上安装的每个第三方应用对于标签集合里的不同标签的兴趣度:The interest degree determining unit is further configured to calculate, by using the following formula, the interest degree of each third-party application installed by the user on the terminal for different labels in the label set:
Figure PCTCN2017120182-appb-000008
Figure PCTCN2017120182-appb-000008
其中:cint p,i,j表示某个用户p在终端上安装的预置应用库里的第三方应用i对于标签集合里的标签j的兴趣度; Where: cint p, i, j represents the degree of interest of the third party application i in the preset application library installed by the user p on the terminal for the tag j in the tag set;
ct1 p,i表示用户p在终端上在N天内安装了预置应用库里的第三方应用i并保持到今天的天数,其中用户p在N天内无安装时ct1 p,i为N; Ct1 p,i means that the user p installs the third-party application i in the preset application library in the N days on the terminal and keeps the number of days until today, wherein the user p does not install ct1 p, N is N in N days;
tag i,j表示第三方应用i是否具有预置标签集合里的标签j,其中当有标签j时tag i,j为1,否则为0; Tag i,j indicates whether the third-party application i has the tag j in the preset tag set, wherein the tag i, j is 1 when there is a tag j, otherwise 0;
Figure PCTCN2017120182-appb-000009
表示第三方应用i具有的预置标签集合里的标签数量的累加之和;
Figure PCTCN2017120182-appb-000009
Represents the sum of the number of labels in the preset label set that the third party application i has;
n表示预置应用库里的第三方应用的数量;n represents the number of third-party applications in the preset application library;
m表示预置标签集合里的标签数量;m represents the number of labels in the preset label set;
N为大于0的整数;N is an integer greater than 0;
所述兴趣度确定单元具体还用于通过以下公式,确定该用户对于不同标签的兴趣度:The interest degree determining unit is further configured to determine the user's interest in different tags by using the following formula:
Figure PCTCN2017120182-appb-000010
Figure PCTCN2017120182-appb-000010
其中:int p,j表示某个用户p对于预置标签集合里的标签j的兴趣度; Where: int p,j represents the degree of interest of a user p for the tag j in the preset tag set;
D表示用户p在终端上安装的预置应用库里的第三方应用的数量。D represents the number of third-party applications in the preset application library installed by the user p on the terminal.
优选地,所述匹配度确定单元具体用于通过对不同用户对于不同标签的兴趣度随着用户点击信息流的时间而衰减进行统计,得到信息流对标签的匹配度。Preferably, the matching degree determining unit is specifically configured to obtain a matching degree of the information stream to the label by performing statistics on the degree of interest of different users for different tags as the user clicks on the information flow.
其中所述匹配度确定单元具体用于通过以下公式,确定信息流对标签的匹配度:The matching degree determining unit is specifically configured to determine the matching degree of the information flow to the label by using the following formula:
Figure PCTCN2017120182-appb-000011
Figure PCTCN2017120182-appb-000011
其中:S l,j表示用户所点击的信息流l对于预置标签集合里的标签j的匹配度; Where: S l,j represents the degree of matching of the information stream 1 clicked by the user for the tag j in the preset tag set;
ct2 p,l表示用户p最后一次点击信息流l距离今天的天数,其中用户p在N天内没有点击信息流l时ct2 p,l为N; Ct2 p,l represents the number of days that the user p last clicked on the information stream l today, where user p did not click on the information stream l within N days, ct2 p, l is N;
int p,j表示用户p对于预置标签集合里的标签j的兴趣度; Int p,j represents the degree of interest of the user p for the tag j in the preset tag set;
m表示预置标签集合里的标签数量;m represents the number of labels in the preset label set;
U表示预置信息流库里的信息流数量;U represents the number of streams in the preset stream library;
F表示点击过信息流l的用户数量;F indicates the number of users who have clicked on the information stream l;
N为大于0的整数。N is an integer greater than zero.
正如上面在方法的优选实施例里介绍的,有必要对所述匹配度进行标准化处理,因此这里也可以对所述匹配度做同样的处理。As described above in the preferred embodiment of the method, it is necessary to normalize the degree of matching, so that the same degree of processing can be done here.
在一个优选实施例中,首先和上述步骤S2讲述的类似,统计用户对信息流的点击数据,再结合所述用户对于不同标签的兴趣度,计算所述用户所点击的信息流对于标签集合里的不同标签的第一匹配度,计算方法如下:In a preferred embodiment, first, similar to the description in step S2 above, the user clicks on the data of the information stream, and in combination with the user's interest in different tags, calculates the information stream clicked by the user for the tag set. The first matching degree of the different labels is calculated as follows:
Figure PCTCN2017120182-appb-000012
Figure PCTCN2017120182-appb-000012
其中:S l,j表示用户所点击的信息流l对于预置标签集合里的标签j的第一匹配度; Where: S l,j represents the first matching degree of the information stream 1 clicked by the user for the label j in the preset label set;
ct2 p,l表示用户p最后一次点击信息流l距离今天的天数,其中用户p在N天内没有点击信息流l时ct2 p,l为N; Ct2 p,l represents the number of days that the user p last clicked on the information stream l today, where user p did not click on the information stream l within N days, ct2 p, l is N;
int p,j表示用户p对于预置标签集合里的标签j的兴趣度; Int p,j represents the degree of interest of the user p for the tag j in the preset tag set;
m表示预置标签集合里的标签数量;m represents the number of labels in the preset label set;
U表示预置信息流库里的信息流数量;U represents the number of streams in the preset stream library;
F表示点击过信息流l的用户数量;F indicates the number of users who have clicked on the information stream l;
N为大于0的整数;N is an integer greater than 0;
N可以根据实践需要自行设定天数,例如60天、90天、180天等。N can set the number of days according to the actual needs, such as 60 days, 90 days, 180 days, and so on.
本领域技术人员都知道,统计用户点击信息流的数据时,需要限定一个时间段才能有统计结果,这里优选设定统计用户点击信息流的数据的天 数与上面介绍的统计用户在使用的终端上安装第三方应用的天数相同,例如选取60天、90天、180天等。另外,应用商店或应用市场也可以预置一个信息流库,需要展示的信息流都放置在该预置信息流库里。Those skilled in the art know that when counting data of a user clicking on a stream of information, it is necessary to define a period of time to have a statistical result. Here, it is preferable to set a number of days for which the data of the user clicks on the information stream is compared with the terminal used by the statistical user described above. The number of days to install a third-party application is the same, for example, 60 days, 90 days, 180 days, and the like. In addition, the application store or the application market can also preset a stream library, and the information streams that need to be displayed are placed in the preset stream library.
然后,对所述第一匹配度进行标准化处理,得出用户所点击的信息流对标签的最终匹配度,方法如下:Then, the first matching degree is standardized, and the final matching degree of the information stream clicked by the user on the label is obtained, as follows:
Figure PCTCN2017120182-appb-000013
Figure PCTCN2017120182-appb-000013
其中:ptag l,j表示用户所点击的信息流l与预置标签集合里的标签j之间的匹配度; Where: ptag l, j represents the degree of matching between the information stream 1 clicked by the user and the tag j in the preset tag set;
S l,j表示用户所点击的信息流l对于预置标签集合里的标签j的第一匹配度; S l,j represents the first matching degree of the information stream 1 clicked by the user for the tag j in the preset tag set;
Figure PCTCN2017120182-appb-000014
Figure PCTCN2017120182-appb-000014
U表示预置信息流库里的信息流数量;U represents the number of streams in the preset stream library;
m表示预置标签集合里的标签数量。m represents the number of labels in the preset label set.
具体地,标注单元,具体用于将标签按照信息流对标签的匹配度降序排列。Specifically, the labeling unit is specifically configured to arrange the labels in descending order of the matching degree of the labels according to the information flow.
标注单元,具体还用于从排列中按照从大到小选取一定数量的标签,并对信息流进行标注。The labeling unit is specifically used to select a certain number of labels from the largest to the smallest in the arrangement, and mark the information flow.
优选地,所述标注单元基于所述匹配度从大到小顺序选取对应的一定数量标签给所述信息流标注,例如可以选取3-5个标签作为所述信息流的标签来进行标注,即从大到小排序在前3-5名的匹配度所分别对应的3-5个标签。或者,也可以从大于或等于预设阈值的所述匹配度对应的多个标签中随机选取一定数量标签给所述信息流标注,当然也可以再按匹配度从大到小顺序选取,例如可以选取3-5个标签作为所述信息流的标签来进行标注。Preferably, the labeling unit selects a corresponding number of labels to mark the information flow based on the matching degree from the largest to the smallest, for example, 3-5 labels may be selected as labels of the information stream, that is, Sort the 3-5 labels corresponding to the matching degrees of the first 3-5 names from large to small. Alternatively, a certain number of labels may be randomly selected from the plurality of labels corresponding to the matching degree that are greater than or equal to the preset threshold to label the information flow, and of course, the matching degree may be selected from the largest to the smallest, for example, 3-5 labels are selected as labels for the information stream for labeling.
所属领域的技术人员可以清楚地了解到,为描述的方便和简洁,上面结合图3描述的装置的具体工作过程,可以参考前述方法实施例中的对应过程,在此不再重复描述。A person skilled in the art can clearly understand that for the convenience and brevity of the description, the specific working process of the device described above in connection with FIG. 3 can refer to the corresponding process in the foregoing method embodiments, and the description is not repeated here.
根据本发明实施例的给信息流标注标签的装置,通过统计用户在其使用的终端上安装的第三方应用和用户点击信息流的情况,能够判断出用户对标签的兴趣度,进而得出用户点击过的信息流与标签的匹配度,从而可以选取一定数量匹配度最高的标签作为该信息流的标签,对其进行标注,由此可以实现对应用商店或应用市场中预置的信息流自动标注标签,解决了人工标注标签的费时费力、不容易完成的问题,而且通过为信息流自动标注标签,有利于后续开展的与信息流本身相关的个性化推荐信息流的业务。According to the embodiment of the present invention, the device for labeling the information flow can determine the user's interest in the label by counting the third-party application installed by the user on the terminal used by the user and the user clicking the information flow, thereby obtaining the user. The degree of matching between the clicked stream and the tag, so that a certain number of tags with the highest matching degree can be selected as the tag of the stream, and the tag is marked, thereby realizing the automatic flow of information preset in the application store or the application market. Labeling labels solves the problem of time-consuming and laborious manual labeling, and it is not easy to complete. Moreover, by automatically labeling the information stream, it is beneficial to the subsequent personalized recommendation information flow related to the information flow itself.
本发明实施例所提供的给信息流标注标签的方法的计算机程序产品,包括存储了程序代码的计算机可读存储介质,所述程序代码包括的指令可用于执行前面方法实施例中所述的方法,具体实现可参见方法实施例,在此不再赘述。A computer program product for providing a method for labeling an information stream according to an embodiment of the present invention includes a computer readable storage medium storing program code, the program code comprising instructions for executing the method described in the foregoing method embodiment For specific implementation, refer to the method embodiment, and details are not described herein again.
第四实施例Fourth embodiment
为进一步说明上述给信息流标注标签的方法,本发明实施例提供了一种终端设备,具体如下:To further illustrate the foregoing method for labeling information flows, the embodiment of the present invention provides a terminal device, which is specifically as follows:
如图4所示,终端设备包括通过系统总线连接的处理器410、存储器420、内存储器430、网络接口440和显示屏450。处理器410用于实现计算功能和控制终端装置工作的功能,处理器410被配置为执行上述实施例提供的给信息流标注标签的方法。处理器410用于基于用户在终端上安装的第三方应用确定该用户对于不同标签的兴趣度;基于兴趣度和用户对信息流的点击情况确定信息流对标签的匹配度;基于匹配度按预设方式选取对应的一定数量标签给信息流标注。存储器420是一种非易失性存储介质,存储有操作系统421、数据库422和用于实现上述实施例提供的基于信息流标注标签的方法的计算机程序,以及执行计算机程序产生的候选中间数据以及结果数据。网络接口440用于与服务器通信,网络接口440包括射频 收发器。As shown in FIG. 4, the terminal device includes a processor 410, a memory 420, an internal memory 430, a network interface 440, and a display screen 450 connected through a system bus. The processor 410 is configured to implement a computing function and a function of controlling the operation of the terminal device, and the processor 410 is configured to perform the method for labeling the information flow provided by the above embodiment. The processor 410 is configured to determine, according to the third party application installed by the user on the terminal, the degree of interest of the user for different tags; determining the matching degree of the information flow to the tag based on the degree of interest and the click condition of the user on the information flow; Set the mode to select a corresponding number of labels to label the information flow. The memory 420 is a non-volatile storage medium storing an operating system 421, a database 422, and a computer program for implementing the information stream labeling-based method provided by the above embodiments, and candidate intermediate data generated by executing the computer program, and Result data. Network interface 440 is used to communicate with the server, and network interface 440 includes a radio frequency transceiver.
进一步地,本发明实施例还提供一种计算机可读存储介质,其上承载一个或多个计算机指令程序,计算机指令程序被一个或多个处理器执行时,一个或多个处理器执行实现一种给信息流标注标签的方法,包括:基于用户在终端上安装的第三方应用确定该用户对于不同标签的兴趣度;基于兴趣度和用户对信息流的点击情况确定信息流对标签的匹配度;基于匹配度按预设方式选取对应的一定数量标签给信息流标注。Further, an embodiment of the present invention further provides a computer readable storage medium carrying one or more computer instruction programs, and when the computer instruction program is executed by one or more processors, one or more processors execute the implementation The method for labeling information flow includes: determining, according to a third-party application installed by the user on the terminal, the user's interest in different tags; determining the matching degree of the information flow to the tag based on the interest degree and the user's click on the information flow According to the matching degree, a corresponding number of labels are selected according to a preset manner to mark the information flow.
本领域普通技术人员可以理解:实现上述方法实施例的全部或部分步骤可以通过程序指令相关的硬件来完成,前述的程序可以存储于一计算机可读取存储介质中,该程序在执行时,执行包括上述任意方法实施例的步骤;而前述的存储介质包括:移动存储设备、随机存取存储器(RAM,Random Access Memory)、只读存储器(ROM,Read-Only Memory)、磁碟或者光盘等各种可以存储程序代码的介质。A person skilled in the art can understand that all or part of the steps of implementing the above method embodiments may be completed by using hardware related to the program instructions. The foregoing program may be stored in a computer readable storage medium, and the program is executed when executed. The foregoing storage medium includes: a mobile storage device, a random access memory (RAM), a read-only memory (ROM), a magnetic disk, or an optical disk. A medium that can store program code.
或者,所述功能如果以软件功能单元的形式实现并作为独立的产品销售或使用时,可以存储在一个计算机可读取存储介质中。基于这样的理解,本发明的技术方案本质上或者说对现有技术做出贡献的部分或者该技术方案的部分可以以软件产品的形式体现出来,该计算机软件产品存储在一个存储介质中,包括若干指令用以使得一台计算机设备(可以是个人计算机,平板电脑,智能手机,服务器,或者网络设备等)执行本发明各个实施例所述方法的全部或部分步骤。而前述的存储介质包括:U盘、移动硬盘、只读存储器(ROM)、随机存取存储器(RAM)、磁碟或者光盘等各种可以存储程序代码的介质。Alternatively, the functions may be stored in a computer readable storage medium if implemented in the form of a software functional unit and sold or used as a standalone product. Based on such understanding, the technical solution of the present invention, which is essential or contributes to the prior art, or a part of the technical solution, may be embodied in the form of a software product, which is stored in a storage medium, including The instructions are used to cause a computer device (which may be a personal computer, tablet, smartphone, server, or network device, etc.) to perform all or part of the steps of the methods described in various embodiments of the present invention. The foregoing storage medium includes various media that can store program codes, such as a USB flash drive, a removable hard disk, a read only memory (ROM), a random access memory (RAM), a magnetic disk, or an optical disk.
以上所述,仅为本发明的具体实施方式,但本发明的保护范围并不局限于此,任何熟悉本技术领域的技术人员在本发明揭露的技术范围内,可轻易想到变化或替换,都应涵盖在本发明的保护范围之内。因此,本发明的保护范围应以所述权利要求的保护范围为准。The above is only a specific embodiment of the present invention, but the scope of the present invention is not limited thereto, and any person skilled in the art can easily think of changes or substitutions within the technical scope of the present invention. It should be covered by the scope of the present invention. Therefore, the scope of the invention should be determined by the scope of the appended claims.

Claims (25)

  1. 一种给信息流标注标签的方法,包括:A method of labeling information flows, including:
    基于用户在终端上安装的第三方应用确定该用户对于不同标签的兴趣度;Determining the user's interest in different tags based on the third-party application installed by the user on the terminal;
    基于所述兴趣度和用户对信息流的点击情况确定信息流对标签的匹配度;Determining a degree of matching of the information stream to the tag based on the degree of interest and the user's click on the information stream;
    基于所述匹配度按预设方式选取对应的一定数量标签给所述信息流标注。And selecting a corresponding quantity label according to the matching degree to label the information flow.
  2. 根据权利要求1所述的方法,基于用户在终端上安装的第三方应用确定该用户对于不同标签的兴趣度的步骤,包括:The method according to claim 1, wherein the step of determining the degree of interest of the user for different tags based on a third-party application installed by the user on the terminal comprises:
    通过统计在一段天数内用户安装的预置应用库里的第三方应用具有的每个标签占该应用具有的所有标签的权重随时间衰减,确定该用户因安装了该第三方应用而对于标签集合里的不同标签的兴趣度;By counting the weight of all the tags of the third-party application in the preset application library installed by the user in a period of days, the weight of all the tags of the application is attenuated over time, and determining that the user has installed the third-party application for the tag set. Interest in different labels;
    将该用户因在终端上安装的所有第三方应用而对于相同标签的兴趣度值进行累加,确定该用户对于不同标签的兴趣度。The user is accumulated for the same tag's interest value due to all third-party applications installed on the terminal, and the user's interest in different tags is determined.
  3. 根据权利要求1或2所述的方法,基于用户在终端上安装的第三方应用确定该用户对于不同标签的兴趣度的步骤,包括:The method according to claim 1 or 2, wherein the step of determining the degree of interest of the user for different tags based on the third-party application installed by the user on the terminal comprises:
    通过下述公式,计算所述用户在终端上安装的每个第三方应用对于标签集合里的不同标签的兴趣度:Calculate the interest of each third-party application installed by the user on the terminal for different tags in the tag set by using the following formula:
    Figure PCTCN2017120182-appb-100001
    Figure PCTCN2017120182-appb-100001
    其中:cint p,i,j表示某个用户p在终端上安装的预置应用库里的第三方应用i对于标签集合里的标签j的兴趣度; Where: cint p, i, j represents the degree of interest of the third party application i in the preset application library installed by the user p on the terminal for the tag j in the tag set;
    ct1 p,i表示用户p在终端上在N天内安装了预置应用库里的第三方应用i并保持到今天的天数,其中用户p在N天内无安装时ct1 p,i为N; Ct1 p,i means that the user p installs the third-party application i in the preset application library in the N days on the terminal and keeps the number of days until today, wherein the user p does not install ct1 p, N is N in N days;
    tag i,j表示第三方应用i是否具有预置标签集合里的标签j,其中当有标签j时tag i,j为1,否则为0; Tag i,j indicates whether the third-party application i has the tag j in the preset tag set, wherein the tag i, j is 1 when there is a tag j, otherwise 0;
    Figure PCTCN2017120182-appb-100002
    表示第三方应用i具有的预置标签集合里的标签数量的累加之和;
    Figure PCTCN2017120182-appb-100002
    Represents the sum of the number of labels in the preset label set that the third party application i has;
    n表示预置应用库里的第三方应用的数量;n represents the number of third-party applications in the preset application library;
    m表示预置标签集合里的标签数量;m represents the number of labels in the preset label set;
    N为大于0的整数;N is an integer greater than 0;
    其中,将在终端上安装的所有第三方应用对于相同标签的兴趣度值进行累加,确定该用户对于不同标签的兴趣度,包括:The third party application installed on the terminal accumulates the interest value of the same tag to determine the user's interest in different tags, including:
    通过下述公式,确定该用户对于不同标签的兴趣度:Determine the user's interest in different tags by the following formula:
    Figure PCTCN2017120182-appb-100003
    Figure PCTCN2017120182-appb-100003
    其中:int p,j表示某个用户p对于预置标签集合里的标签j的兴趣度; Where: int p,j represents the degree of interest of a user p for the tag j in the preset tag set;
    D表示用户p在终端上安装的预置应用库里的第三方应用的数量。D represents the number of third-party applications in the preset application library installed by the user p on the terminal.
  4. 根据权利要求1所述的方法,基于所述兴趣度和用户对信息流的点击情况确定信息流对标签的匹配度的步骤,包括:The method according to claim 1, wherein the step of determining the matching degree of the information stream to the tag based on the degree of interest and the user's click on the information flow comprises:
    通过对不同用户对于不同标签的兴趣度随着用户点击信息流的时间而衰减进行统计,得到信息流对标签的匹配度。The degree of interest of different users for different tags is attenuated as the user clicks on the information flow, and the matching degree of the information flow to the tags is obtained.
  5. 根据权利要求1或4所述的方法,确定信息流对标签的匹配度的步骤,包括:The method according to claim 1 or 4, wherein the step of determining the matching degree of the information stream to the label comprises:
    通过以下公式,确定信息流对标签的匹配度:Determine the matching of the information flow to the label by the following formula:
    Figure PCTCN2017120182-appb-100004
    Figure PCTCN2017120182-appb-100004
    其中:S l,j表示用户所点击的信息流l对于预置标签集合里的标签j的匹配度; Where: S l,j represents the degree of matching of the information stream 1 clicked by the user for the tag j in the preset tag set;
    ct2 p,l表示用户p最后一次点击信息流l距离今天的天数,其中用户p在N天内没有点击信息流l时ct2 p,l为N; Ct2 p,l represents the number of days that the user p last clicked on the information stream l today, where user p did not click on the information stream l within N days, ct2 p, l is N;
    int p,j表示用户p对于预置标签集合里的标签j的兴趣度; Int p,j represents the degree of interest of the user p for the tag j in the preset tag set;
    m表示预置标签集合里的标签数量;m represents the number of labels in the preset label set;
    U表示预置信息流库里的信息流数量;U represents the number of streams in the preset stream library;
    F表示点击过信息流l的用户数量;F indicates the number of users who have clicked on the information stream l;
    N为大于0的整数。N is an integer greater than zero.
  6. 根据权利要求1或4所述的方法,确定信息流对标签的匹配度的步骤,包括:The method according to claim 1 or 4, wherein the step of determining the matching degree of the information stream to the label comprises:
    通过统计用户对信息流的点击情况和所述用户对于不同标签的兴趣度,计算所述用户所点击的信息流对于标签集合里的不同标签的第一匹配度;Calculating a first matching degree of the information flow clicked by the user for different tags in the tag set by counting a user's click on the information flow and the user's interest in different tags;
    对所述第一匹配度进行标准化处理,得出信息流对不同标签的最终匹配度。The first matching degree is normalized to obtain a final matching degree of the information stream to different tags.
  7. 根据权利要求6所述的方法,其特征在于,通过统计用户对信息流的点击情况和所述用户对于不同标签的兴趣度,计算所述用户所点击的信息流对于标签集合里的不同标签的第一匹配度,包括:The method according to claim 6, wherein the information flow clicked by the user is calculated for different tags in the tag set by counting the user's click on the information stream and the user's interest in different tags. The first match, including:
    通过以下公式,计算所述用户所点击的信息流对于标签集合里的不同标签的第一匹配度:The first matching degree of the information stream clicked by the user for different labels in the label set is calculated by the following formula:
    Figure PCTCN2017120182-appb-100005
    Figure PCTCN2017120182-appb-100005
    其中:S l,j表示用户所点击的信息流l对于预置标签集合里的标签j的第 一匹配度; Where: S l,j represents the first matching degree of the information stream 1 clicked by the user for the label j in the preset label set;
    ct2 p,l表示用户p最后一次点击信息流l距离今天的天数,其中用户p在N天内没有点击信息流l时ct2 p,l为N; Ct2 p,l represents the number of days that the user p last clicked on the information stream l today, where user p did not click on the information stream l within N days, ct2 p, l is N;
    int p,j表示用户p对于预置标签集合里的标签j的兴趣度; Int p,j represents the degree of interest of the user p for the tag j in the preset tag set;
    m表示预置标签集合里的标签数量;m represents the number of labels in the preset label set;
    U表示预置信息流库里的信息流数量;U represents the number of streams in the preset stream library;
    F表示点击过信息流l的用户数量;F indicates the number of users who have clicked on the information stream l;
    N为大于0的整数;N is an integer greater than 0;
    其中,对所述第一匹配度进行标准化处理,得出信息流对不同标签的最终匹配度,包括:The first matching degree is normalized to obtain a final matching degree of the information stream to different labels, including:
    通过以下公式,对所述第一匹配度进行标准化处理,得出信息流对不同标签的最终匹配度:The first matching degree is normalized by the following formula to obtain the final matching degree of the information stream to different labels:
    Figure PCTCN2017120182-appb-100006
    Figure PCTCN2017120182-appb-100006
    其中:ptag l,j表示用户所点击的信息流l与预置标签集合里的标签j之间的匹配度; Where: ptag l, j represents the degree of matching between the information stream 1 clicked by the user and the tag j in the preset tag set;
    S l,j表示用户所点击的信息流l对于预置标签集合里的标签j的第一匹配度; S l,j represents the first matching degree of the information stream 1 clicked by the user for the tag j in the preset tag set;
    Figure PCTCN2017120182-appb-100007
    Figure PCTCN2017120182-appb-100007
    U表示预置信息流库里的信息流数量;U represents the number of streams in the preset stream library;
    m表示预置标签集合里的标签数量。m represents the number of labels in the preset label set.
  8. 根据权利要求5-7任一项所述的方法,基于所述匹配度按预设方式选取对应的一定数量标签给所述信息流标注的步骤,包括:The method according to any one of claims 5-7, wherein the step of selecting a corresponding number of labels to mark the information flow according to the matching degree in a preset manner comprises:
    将标签按照所述信息流对标签的匹配度降序排列;Arranging the labels in descending order according to the matching degree of the information flow to the labels;
    从所述排列中按照从大到小选取一定数量的标签,并对所述信息流进 行标注。A certain number of tags are selected from the arrangement in order from large to small, and the information flow is labeled.
  9. 一种给信息流标注标签的装置,包括:A device for labeling information flows, including:
    兴趣度确定单元,用于基于用户在终端上安装的第三方应用确定该用户对于不同标签的兴趣度;a degree of interest determining unit, configured to determine, according to a third party application installed by the user on the terminal, the degree of interest of the user for different tags;
    匹配度确定单元,用于基于所述兴趣度和用户对信息流的点击情况确定信息流对标签的匹配度;a matching degree determining unit, configured to determine a matching degree of the information stream to the tag based on the interest degree and the click condition of the user on the information flow;
    标注单元,用于基于所述匹配度按预设方式选取对应的一定数量标签给所述信息流标注。And a labeling unit, configured to select, according to the matching degree, a corresponding quantity of labels to label the information flow according to a preset manner.
  10. 根据权利要求9所述的装置,所述兴趣度确定单元,具体用于通过统计在一段天数内用户安装的预置应用库里的第三方应用具有的每个标签占该应用具有的所有标签的权重随时间衰减,确定该用户因安装了该第三方应用而对于标签集合里的不同标签的兴趣度;The apparatus according to claim 9, wherein the interest degree determining unit is configured to collect, by counting, a third-party application in a preset application library installed by a user within a period of days, each label of the application has all the labels of the application. The weight is attenuated over time, determining the user's interest in different tags in the tag set due to the installation of the third party application;
    所述兴趣度确定单元,具体还用于将该用户因在终端上安装的所有第三方应用而对于相同标签的兴趣度值进行累加,确定该用户对于不同标签的兴趣度。The interest degree determining unit is further configured to accumulate the interest value of the same tag by the user for all third-party applications installed on the terminal, and determine the interest degree of the user for different tags.
  11. 根据权利要求9或10所述的装置,所述兴趣度确定单元具体还用于通过下述公式,计算所述用户在终端上安装的每个第三方应用对于标签集合里的不同标签的兴趣度:The apparatus according to claim 9 or 10, wherein the interest degree determining unit is further configured to calculate, by using the following formula, the degree of interest of each third-party application installed by the user on the terminal for different labels in the label set. :
    Figure PCTCN2017120182-appb-100008
    Figure PCTCN2017120182-appb-100008
    其中:cint p,i,j表示某个用户p在终端上安装的预置应用库里的第三方应用i对于标签集合里的标签j的兴趣度; Where: cint p, i, j represents the degree of interest of the third party application i in the preset application library installed by the user p on the terminal for the tag j in the tag set;
    ct1 p,i表示用户p在终端上在N天内安装了预置应用库里的第三方应用i并保持到今天的天数,其中用户p在N天内无安装时ct1 p,i为N; Ct1 p,i means that the user p installs the third-party application i in the preset application library in the N days on the terminal and keeps the number of days until today, wherein the user p does not install ct1 p, N is N in N days;
    tag i,j表示第三方应用i是否具有预置标签集合里的标签j,其中当有标签j时tag i,j为1,否则为0; Tag i,j indicates whether the third-party application i has the tag j in the preset tag set, wherein the tag i, j is 1 when there is a tag j, otherwise 0;
    Figure PCTCN2017120182-appb-100009
    表示第三方应用i具有的预置标签集合里的标签数量的累加之和;
    Figure PCTCN2017120182-appb-100009
    Represents the sum of the number of labels in the preset label set that the third party application i has;
    n表示预置应用库里的第三方应用的数量;n represents the number of third-party applications in the preset application library;
    m表示预置标签集合里的标签数量;m represents the number of labels in the preset label set;
    N为大于0的整数;N is an integer greater than 0;
    所述兴趣度确定单元具体还用于通过以下公式,确定该用户对于不同标签的兴趣度:The interest degree determining unit is further configured to determine the user's interest in different tags by using the following formula:
    Figure PCTCN2017120182-appb-100010
    Figure PCTCN2017120182-appb-100010
    其中:int p,j表示某个用户p对于预置标签集合里的标签j的兴趣度; Where: int p,j represents the degree of interest of a user p for the tag j in the preset tag set;
    D表示用户p在终端上安装的预置应用库里的第三方应用的数量。D represents the number of third-party applications in the preset application library installed by the user p on the terminal.
  12. 根据权利要求9所述的装置,所述匹配度确定单元具体用于通过对不同用户对于不同标签的兴趣度随着用户点击信息流的时间而衰减进行统计,得到信息流对标签的匹配度。The apparatus according to claim 9, wherein the matching degree determining unit is configured to obtain a matching degree of the information stream to the label by performing statistics on the degree of interest of different users for different tags as the user clicks on the information flow.
  13. 根据权利要求9或12所述的装置,所述匹配度确定单元具体用于通过以下公式,确定信息流对标签的匹配度:The apparatus according to claim 9 or 12, wherein the matching degree determining unit is specifically configured to determine the matching degree of the information stream to the label by using the following formula:
    Figure PCTCN2017120182-appb-100011
    Figure PCTCN2017120182-appb-100011
    其中:S l,j表示用户所点击的信息流l对于预置标签集合里的标签j的匹配度; Where: S l,j represents the degree of matching of the information stream 1 clicked by the user for the tag j in the preset tag set;
    ct2 p,l表示用户p最后一次点击信息流l距离今天的天数,其中用户p在N天内没有点击信息流l时ct2 p,l为N; Ct2 p,l represents the number of days that the user p last clicked on the information stream l today, where user p did not click on the information stream l within N days, ct2 p, l is N;
    int p,j表示用户p对于预置标签集合里的标签j的兴趣度; Int p,j represents the degree of interest of the user p for the tag j in the preset tag set;
    m表示预置标签集合里的标签数量;m represents the number of labels in the preset label set;
    U表示预置信息流库里的信息流数量;U represents the number of streams in the preset stream library;
    F表示点击过信息流l的用户数量;F indicates the number of users who have clicked on the information stream l;
    N为大于0的整数。N is an integer greater than zero.
  14. 根据权利要求9或12所述的装置,所述匹配度确定单元具体用于通过统计用户对信息流的点击情况和所述用户对于不同标签的兴趣度,计算所述用户所点击的信息流对于标签集合里的不同标签的第一匹配度;The apparatus according to claim 9 or 12, wherein the matching degree determining unit is specifically configured to calculate, by counting a user's click on the information stream and the user's interest in different tags, calculating the information flow clicked by the user. The first match of the different tags in the tag set;
    所述匹配度确定单元具体还用于对所述第一匹配度进行标准化处理,得出信息流对不同标签的最终匹配度。The matching degree determining unit is further configured to perform normalization processing on the first matching degree to obtain a final matching degree of the information stream to different labels.
  15. 根据权利要求14所述的装置,The device of claim 14
    所述匹配度确定单元,具体还用于通过以下公式,计算所述用户所点击的信息流对于标签集合里的不同标签的第一匹配度:The matching degree determining unit is further configured to calculate, by using the following formula, a first matching degree of the information stream clicked by the user for different labels in the label set:
    Figure PCTCN2017120182-appb-100012
    Figure PCTCN2017120182-appb-100012
    其中:S l,j表示用户所点击的信息流l对于预置标签集合里的标签j的第一匹配度; Where: S l,j represents the first matching degree of the information stream 1 clicked by the user for the label j in the preset label set;
    ct2 p,l表示用户p最后一次点击信息流l距离今天的天数,其中用户p在N天内没有点击信息流l时ct2 p,l为N; Ct2 p,l represents the number of days that the user p last clicked on the information stream l today, where user p did not click on the information stream l within N days, ct2 p, l is N;
    int p,j表示用户p对于预置标签集合里的标签j的兴趣度; Int p,j represents the degree of interest of the user p for the tag j in the preset tag set;
    m表示预置标签集合里的标签数量;m represents the number of labels in the preset label set;
    U表示预置信息流库里的信息流数量;U represents the number of streams in the preset stream library;
    F表示点击过信息流l的用户数量;F indicates the number of users who have clicked on the information stream l;
    N为大于0的整数;N is an integer greater than 0;
    所述匹配度确定单元,具体还用于通过以下公式,对所述第一匹配度进行标准化处理,得出信息流对不同标签的最终匹配度:The matching degree determining unit is further configured to perform normalization processing on the first matching degree by using the following formula to obtain a final matching degree of the information stream to different labels:
    Figure PCTCN2017120182-appb-100013
    Figure PCTCN2017120182-appb-100013
    其中:ptag l,j表示用户所点击的信息流l与预置标签集合里的标签j之间的匹配度; Where: ptag l, j represents the degree of matching between the information stream 1 clicked by the user and the tag j in the preset tag set;
    S l,j表示用户所点击的信息流l对于预置标签集合里的标签j的第一匹配度; S l,j represents the first matching degree of the information stream 1 clicked by the user for the tag j in the preset tag set;
    Figure PCTCN2017120182-appb-100014
    Figure PCTCN2017120182-appb-100014
    U表示预置信息流库里的信息流数量;U represents the number of streams in the preset stream library;
    m表示预置标签集合里的标签数量。m represents the number of labels in the preset label set.
  16. 根据权利要求13-15任一项所述的装置,所述标注单元,具体用于将标签按照所述信息流对标签的匹配度降序排列;The device according to any one of claims 13-15, wherein the labeling unit is specifically configured to arrange the labels in descending order according to the matching degree of the information flow to the labels;
    所述标注单元,具体还用于从所述排列中按照从大到小选取一定数量的标签,并对所述信息流进行标注。The labeling unit is further configured to select a certain number of labels from the largest to the smallest in the arrangement, and mark the information flow.
  17. 一种终端设备,包括:A terminal device comprising:
    一个或多个处理器;One or more processors;
    存储器;Memory
    一个或多个应用程序,其中所述一个或多个应用程序被存储在所述存储器中并被配置为由所述一个或多个处理器执行,所述一个或多个程序配置用于:One or more applications, wherein the one or more applications are stored in the memory and configured to be executed by the one or more processors, the one or more programs configured to:
    基于用户在终端上安装的第三方应用确定该用户对于不同标签的兴趣度;Determining the user's interest in different tags based on the third-party application installed by the user on the terminal;
    基于所述兴趣度和用户对信息流的点击情况确定信息流对标签的匹配度;Determining a degree of matching of the information stream to the tag based on the degree of interest and the user's click on the information stream;
    基于所述匹配度按预设方式选取对应的一定数量标签给所述信息流标注。And selecting a corresponding quantity label according to the matching degree to label the information flow.
  18. 根据权利要求17所述的终端设备,所述一个或多个程序配置具体用于:The terminal device according to claim 17, wherein the one or more program configurations are specifically configured to:
    通过统计在一段天数内用户安装的预置应用库里的第三方应用具有的每个标签占该应用具有的所有标签的权重随时间衰减,确定该用户因安装了该第三方应用而对于标签集合里的不同标签的兴趣度;By counting the weight of all the tags of the third-party application in the preset application library installed by the user in a period of days, the weight of all the tags of the application is attenuated over time, and determining that the user has installed the third-party application for the tag set. Interest in different labels;
    将该用户因在终端上安装的所有第三方应用而对于相同标签的兴趣度值进行累加,确定该用户对于不同标签的兴趣度。The user is accumulated for the same tag's interest value due to all third-party applications installed on the terminal, and the user's interest in different tags is determined.
  19. 根据权利要求18所述的终端设备,所述一个或多个程序配置具体用于:The terminal device according to claim 18, wherein the one or more program configurations are specifically configured to:
    通过下述公式,计算所述用户在终端上安装的每个第三方应用对于标签集合里的不同标签的兴趣度:Calculate the interest of each third-party application installed by the user on the terminal for different tags in the tag set by using the following formula:
    Figure PCTCN2017120182-appb-100015
    Figure PCTCN2017120182-appb-100015
    其中:cint p,i,j表示某个用户p在终端上安装的预置应用库里的第三方应用i对于标签集合里的标签j的兴趣度; Where: cint p, i, j represents the degree of interest of the third party application i in the preset application library installed by the user p on the terminal for the tag j in the tag set;
    ct1 p,i表示用户p在终端上在N天内安装了预置应用库里的第三方应用i并保持到今天的天数,其中用户p在N天内无安装时ct1 p,i为N; Ct1 p,i means that the user p installs the third-party application i in the preset application library in the N days on the terminal and keeps the number of days until today, wherein the user p does not install ct1 p, N is N in N days;
    tag i,j表示第三方应用i是否具有预置标签集合里的标签j,其中当有标签j时tag i,j为1,否则为0; Tag i,j indicates whether the third-party application i has the tag j in the preset tag set, wherein the tag i, j is 1 when there is a tag j, otherwise 0;
    Figure PCTCN2017120182-appb-100016
    表示第三方应用i具有的预置标签集合里的标签数量的累加之和;
    Figure PCTCN2017120182-appb-100016
    Represents the sum of the number of labels in the preset label set that the third party application i has;
    n表示预置应用库里的第三方应用的数量;n represents the number of third-party applications in the preset application library;
    m表示预置标签集合里的标签数量;m represents the number of labels in the preset label set;
    N为大于0的整数;N is an integer greater than 0;
    其中,所述一个或多个程序配置具体用于:Wherein the one or more program configurations are specifically used to:
    通过以下公式,确定该用户对于不同标签的兴趣度:Determine the user's interest in different tags by the following formula:
    Figure PCTCN2017120182-appb-100017
    Figure PCTCN2017120182-appb-100017
    其中:int p,j表示某个用户p对于预置标签集合里的标签j的兴趣度; Where: int p,j represents the degree of interest of a user p for the tag j in the preset tag set;
    D表示用户p在终端上安装的预置应用库里的第三方应用的数量。D represents the number of third-party applications in the preset application library installed by the user p on the terminal.
  20. 根据权利要求17所述的终端设备,所述一个或多个程序配置具体用于:The terminal device according to claim 17, wherein the one or more program configurations are specifically configured to:
    通过对不同用户对于不同标签的兴趣度随着用户点击信息流的时间而衰减进行统计,得到信息流对标签的匹配度。The degree of interest of different users for different tags is attenuated as the user clicks on the information flow, and the matching degree of the information flow to the tags is obtained.
  21. 根据权利要求17或20所述的终端设备,所述一个或多个程序配置具体用于:The terminal device according to claim 17 or 20, wherein the one or more program configurations are specifically used to:
    通过以下公式,确定信息流对标签的匹配度:Determine the matching of the information flow to the label by the following formula:
    Figure PCTCN2017120182-appb-100018
    Figure PCTCN2017120182-appb-100018
    其中:S l,j表示用户所点击的信息流l对于预置标签集合里的标签j的匹配度; Where: S l,j represents the degree of matching of the information stream 1 clicked by the user for the tag j in the preset tag set;
    ct2 p,l表示用户p最后一次点击信息流l距离今天的天数,其中用户p在N天内没有点击信息流l时ct2 p,l为N; Ct2 p,l represents the number of days that the user p last clicked on the information stream l today, where user p did not click on the information stream l within N days, ct2 p, l is N;
    int p,j表示用户p对于预置标签集合里的标签j的兴趣度; Int p,j represents the degree of interest of the user p for the tag j in the preset tag set;
    m表示预置标签集合里的标签数量;m represents the number of labels in the preset label set;
    U表示预置信息流库里的信息流数量;U represents the number of streams in the preset stream library;
    F表示点击过信息流l的用户数量;F indicates the number of users who have clicked on the information stream l;
    N为大于0的整数。N is an integer greater than zero.
  22. 根据权利要求17或20所述的终端设备,所述一个或多个程序配置具体用于:The terminal device according to claim 17 or 20, wherein the one or more program configurations are specifically used to:
    通过统计用户对信息流的点击情况和所述用户对于不同标签的兴趣度,计算所述用户所点击的信息流对于标签集合里的不同标签的第一匹配度;Calculating a first matching degree of the information flow clicked by the user for different tags in the tag set by counting a user's click on the information flow and the user's interest in different tags;
    对所述第一匹配度进行标准化处理,得出信息流对不同标签的最终匹配度。The first matching degree is normalized to obtain a final matching degree of the information stream to different tags.
  23. 根据权利要求22所述的终端设备,其特征在于,所述一个或多个程序配置具体用于:The terminal device according to claim 22, wherein the one or more program configurations are specifically configured to:
    通过以下公式,计算所述用户所点击的信息流对于标签集合里的不同标签的第一匹配度:The first matching degree of the information stream clicked by the user for different labels in the label set is calculated by the following formula:
    Figure PCTCN2017120182-appb-100019
    Figure PCTCN2017120182-appb-100019
    其中:S l,j表示用户所点击的信息流l对于预置标签集合里的标签j的第一匹配度; Where: S l,j represents the first matching degree of the information stream 1 clicked by the user for the label j in the preset label set;
    ct2 p,l表示用户p最后一次点击信息流l距离今天的天数,其中用户p在N天内没有点击信息流l时ct2 p,l为N; Ct2 p,l represents the number of days that the user p last clicked on the information stream l today, where user p did not click on the information stream l within N days, ct2 p, l is N;
    int p,j表示用户p对于预置标签集合里的标签j的兴趣度; Int p,j represents the degree of interest of the user p for the tag j in the preset tag set;
    m表示预置标签集合里的标签数量;m represents the number of labels in the preset label set;
    U表示预置信息流库里的信息流数量;U represents the number of streams in the preset stream library;
    F表示点击过信息流l的用户数量;F indicates the number of users who have clicked on the information stream l;
    N为大于0的整数;N is an integer greater than 0;
    其中,所述一个或多个程序配置具体用于:Wherein the one or more program configurations are specifically used to:
    通过以下公式,对所述第一匹配度进行标准化处理,得出信息流对不同标签的最终匹配度:The first matching degree is normalized by the following formula to obtain the final matching degree of the information stream to different labels:
    Figure PCTCN2017120182-appb-100020
    Figure PCTCN2017120182-appb-100020
    其中:ptag l,j表示用户所点击的信息流l与预置标签集合里的标签j之间的匹配度; Where: ptag l, j represents the degree of matching between the information stream 1 clicked by the user and the tag j in the preset tag set;
    S l,j表示用户所点击的信息流l对于预置标签集合里的标签j的第一匹配度; S l,j represents the first matching degree of the information stream 1 clicked by the user for the tag j in the preset tag set;
    Figure PCTCN2017120182-appb-100021
    Figure PCTCN2017120182-appb-100021
    U表示预置信息流库里的信息流数量;U represents the number of streams in the preset stream library;
    m表示预置标签集合里的标签数量。m represents the number of labels in the preset label set.
  24. 根据权利要求21-23任一项所述的终端设备,所述一个或多个程序配置具体用于:The terminal device according to any one of claims 21 to 23, wherein the one or more program configurations are specifically used to:
    将标签按照所述信息流对标签的匹配度降序排列;Arranging the labels in descending order according to the matching degree of the information flow to the labels;
    从所述排列中按照从大到小选取一定数量的标签,并对所述信息流进行标注。A certain number of tags are selected from the arrangement in order from large to small, and the information flow is labeled.
  25. 一种计算机可读存储介质,其上承载一个或多个计算机指令程序,所述计算机指令程序被一个或多个处理器执行时,所述一个或多个处理器执行权利要求1至8任一项所述的方法。A computer readable storage medium having one or more computer program programs thereon, the one or more processors executing any one of claims 1 to 8 when the computer program program is executed by one or more processors The method described in the item.
PCT/CN2017/120182 2017-03-22 2017-12-29 Method and apparatus for tagging information stream, terminal device, and storage medium WO2018171288A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201710172953.X 2017-03-22
CN201710172953.XA CN106960033B (en) 2017-03-22 2017-03-22 Method and device for labeling information stream

Publications (1)

Publication Number Publication Date
WO2018171288A1 true WO2018171288A1 (en) 2018-09-27

Family

ID=59470859

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2017/120182 WO2018171288A1 (en) 2017-03-22 2017-12-29 Method and apparatus for tagging information stream, terminal device, and storage medium

Country Status (2)

Country Link
CN (1) CN106960033B (en)
WO (1) WO2018171288A1 (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111522854A (en) * 2020-03-18 2020-08-11 大箴(杭州)科技有限公司 Data labeling method and device, storage medium and computer equipment
CN112784151A (en) * 2019-11-08 2021-05-11 北京搜狗科技发展有限公司 Method and related device for determining recommendation information

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106960033B (en) * 2017-03-22 2021-09-14 阿里巴巴(中国)有限公司 Method and device for labeling information stream
CN108900922B (en) * 2018-07-20 2021-03-19 广州方硅信息技术有限公司 Method and device for setting label of live broadcast component

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103389971A (en) * 2013-07-04 2013-11-13 北京卓易讯畅科技有限公司 Method and equipment for determining high-quality grade of comment content corresponding to application
CN104239571A (en) * 2014-09-30 2014-12-24 北京奇虎科技有限公司 Method and device for application recommendation
CN104750789A (en) * 2015-03-12 2015-07-01 百度在线网络技术(北京)有限公司 Label recommendation method and device
US20150310106A1 (en) * 2014-04-29 2015-10-29 Baidu Online Network Technology (Beijing) Co., Ltd Method and apparatus for providing information and method and apparatus for providing search result
CN105824961A (en) * 2016-03-31 2016-08-03 北京奇艺世纪科技有限公司 Tag determining method and device
CN106960033A (en) * 2017-03-22 2017-07-18 广州优视网络科技有限公司 A kind of method and apparatus that label is marked to information flow

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8275771B1 (en) * 2010-02-26 2012-09-25 Google Inc. Non-text content item search
CN102867016A (en) * 2012-07-18 2013-01-09 北京开心人信息技术有限公司 Label-based social network user interest mining method and device
CN105893478B (en) * 2016-03-29 2019-10-29 广州华多网络科技有限公司 A kind of tag extraction method and apparatus

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103389971A (en) * 2013-07-04 2013-11-13 北京卓易讯畅科技有限公司 Method and equipment for determining high-quality grade of comment content corresponding to application
US20150310106A1 (en) * 2014-04-29 2015-10-29 Baidu Online Network Technology (Beijing) Co., Ltd Method and apparatus for providing information and method and apparatus for providing search result
CN104239571A (en) * 2014-09-30 2014-12-24 北京奇虎科技有限公司 Method and device for application recommendation
CN104750789A (en) * 2015-03-12 2015-07-01 百度在线网络技术(北京)有限公司 Label recommendation method and device
CN105824961A (en) * 2016-03-31 2016-08-03 北京奇艺世纪科技有限公司 Tag determining method and device
CN106960033A (en) * 2017-03-22 2017-07-18 广州优视网络科技有限公司 A kind of method and apparatus that label is marked to information flow

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112784151A (en) * 2019-11-08 2021-05-11 北京搜狗科技发展有限公司 Method and related device for determining recommendation information
CN112784151B (en) * 2019-11-08 2024-02-06 北京搜狗科技发展有限公司 Method and related device for determining recommended information
CN111522854A (en) * 2020-03-18 2020-08-11 大箴(杭州)科技有限公司 Data labeling method and device, storage medium and computer equipment
CN111522854B (en) * 2020-03-18 2023-08-01 大箴(杭州)科技有限公司 Data labeling method and device, storage medium and computer equipment

Also Published As

Publication number Publication date
CN106960033B (en) 2021-09-14
CN106960033A (en) 2017-07-18

Similar Documents

Publication Publication Date Title
WO2018121700A1 (en) Method and device for recommending application information based on installed application, terminal device, and storage medium
WO2018192491A1 (en) Information pushing method and device
WO2018171288A1 (en) Method and apparatus for tagging information stream, terminal device, and storage medium
WO2018157818A1 (en) Method and apparatus for inferring preference of user, terminal device, and storage medium
WO2016197774A1 (en) Multimedia data pushing method and apparatus, and storage medium
WO2018192496A1 (en) Trend information generation method and device, storage medium and electronic device
CN111125574B (en) Method and device for generating information
WO2018188378A1 (en) Method and device for tagging label for application, terminal and computer readable storage medium
US20200356572A1 (en) Search ranking method and apparatus, electronic device and storage medium
CN107911448B (en) Content pushing method and device
WO2021082484A1 (en) Awr report automatic acquisition method and apparatus, electronic device, and storage medium
TW201710993A (en) Method, apparatus and system for detecting fraudulent software promotion
CN109753601B (en) Method and device for determining click rate of recommended information and electronic equipment
WO2017088496A1 (en) Search recommendation method, device, apparatus and computer storage medium
US10769196B2 (en) Method and apparatus for displaying electronic photo, and mobile device
WO2017206376A1 (en) Searching method, searching device and non-volatile computer storage medium
WO2017020779A1 (en) Service information push method and system
WO2018171295A1 (en) Method and apparatus for tagging article, terminal, and computer readable storage medium
WO2019072098A1 (en) Method and system for identifying core product terms
WO2020257991A1 (en) User identification method and related product
CN109284367B (en) Method and device for processing text
CN113076416A (en) Information heat evaluation method and device and electronic equipment
CN111311294A (en) Data processing method, device, medium and electronic equipment
WO2016184052A1 (en) Card addition method, device, and apparatus and computer storage medium
CN111782913A (en) Method and device for determining brand intention words

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 17901946

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 17901946

Country of ref document: EP

Kind code of ref document: A1

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205 DATED 31/01/2020)