WO2019113977A1 - Method, device, and server for processing written articles, and storage medium - Google Patents

Method, device, and server for processing written articles, and storage medium Download PDF

Info

Publication number
WO2019113977A1
WO2019113977A1 PCT/CN2017/116646 CN2017116646W WO2019113977A1 WO 2019113977 A1 WO2019113977 A1 WO 2019113977A1 CN 2017116646 W CN2017116646 W CN 2017116646W WO 2019113977 A1 WO2019113977 A1 WO 2019113977A1
Authority
WO
WIPO (PCT)
Prior art keywords
promotion
article
target
feature
candidate
Prior art date
Application number
PCT/CN2017/116646
Other languages
French (fr)
Chinese (zh)
Inventor
周莜
徐澜
谢奕
阳丹
Original Assignee
腾讯科技(深圳)有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 腾讯科技(深圳)有限公司 filed Critical 腾讯科技(深圳)有限公司
Priority to CN201780054780.XA priority Critical patent/CN110325986B/en
Priority to PCT/CN2017/116646 priority patent/WO2019113977A1/en
Publication of WO2019113977A1 publication Critical patent/WO2019113977A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/10Text processing
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce

Definitions

  • the present invention relates to communication technologies, and in particular, to an article processing method, apparatus, server, and storage medium based on a self-media platform.
  • the articles published in the self-media platform carry the user's appeals for expressing emotions, disseminating information and socializing.
  • the related technologies provide a scheme for adding promotional information to the article. When the article reaches the user and is viewed, the promotional information added in the article is The user presents the article in the process of viewing the article, and realizes the effect of promoting the object.
  • a technical solution adopted by the related technology is to add the promotion information to the article published by the user on the self-media platform, and then push the article with the promotion information to the user, and the technical solution is read for the user.
  • the perception in the process of the article causes a lot of interference, which leads to a decrease in the user acceptance of the published article, which in turn affects the effect of object promotion;
  • a technical solution adopted by the related art is that a special account is opened from the media platform, and an article of various promotion information is published through an account, and the access traffic of the dedicated account has a large fluctuation. Sex, especially in the initial stage of account creation, it is difficult to support the timeliness of the promotion target and the needs of a specific user group.
  • embodiments of the present invention are expected to provide an article processing party based on a self-media platform.
  • the method, device, server and storage medium can realize the ideal fusion of promotion information in self-media articles, and promote the good timeliness of information reaching users.
  • an embodiment of the present invention provides an article processing method based on a self-media platform, including:
  • the promotion article to which the promotion information is added is sent.
  • an embodiment of the present invention provides an article processing method based on a self-media platform, where the method is performed by a server, where the server includes one or more processors and a memory, and one or more programs, where The one or more programs are stored in a memory, the program may include one or more units each corresponding to a set of instructions, the one or more processors being configured to execute instructions;
  • Methods include:
  • the promotion article to which the promotion information is added is sent.
  • an embodiment of the present invention provides an article processing apparatus based on a self-media platform, including:
  • a receiving unit configured to receive a target article sent by the client, where the client is used to connect to the self-media platform, and the target article is submitted by the user of the self-media platform through the first client;
  • a determining unit configured to determine, in the target article, a promotion article for presenting the promotion information, and a promotion location for adding the promotion information in the promotion article;
  • a generating unit configured to generate promotion information according to the determined material that matches the target promotion object
  • Adding a unit configured to add the promotion information to a corresponding promotion location in the promotion article according to the determined promotion location
  • the sending unit is configured to send the promotion article to which the promotion information is added.
  • an embodiment of the present invention provides a server, including:
  • a memory configured to store an executable program
  • the processor configured to execute the executable program stored in the memory, implements the above-described self-media platform-based article processing method.
  • an embodiment of the present invention provides a storage medium, where an executable program is stored, and when the executable program is executed by a processor, the article processing party based on the self-media platform is implemented. law.
  • the source of the target article can come from any user terminal in the social network, breaking the limitation of the article collecting the specific topic, and the batching and automatic addition of the promotion information can be realized; the location of the promotion information is automatically selected, and the position is flexible. It can avoid the sudden emergence of promotion information, make the content of the article and the content of the promotion information natural; through the process of publishing the article and reaching the user, the delivery of the promotion information is completed, and the promotion information is promoted by relying on the release/send traffic of the self-media platform itself. The information is covered by the traffic from the media platform and reaches the user in real time.
  • FIG. 1A is a schematic diagram of an optional application scenario of an article processing method based on a self-media platform according to an embodiment of the present invention
  • FIG. 1B is a schematic diagram of an optional application scenario of an article processing method based on a self-media platform according to an embodiment of the present disclosure
  • FIG. 2A is a schematic diagram of an optional presentation manner of a promotion article according to an embodiment of the present invention.
  • FIG. 2B is a schematic diagram of an optional presentation manner of a promotion article according to an embodiment of the present invention.
  • FIG. 2C is a schematic diagram of an optional presentation manner of a promotion article according to an embodiment of the present invention.
  • FIG. 3 is a schematic structural diagram of an optional hardware of an article processing apparatus based on a self-media platform according to an embodiment of the present disclosure
  • FIG. 4 is an optional schematic flowchart of an article processing method based on a self-media platform according to an embodiment of the present invention
  • FIG. 5A is a schematic diagram of performing topic prediction by using a keyword-topic classifier according to an embodiment of the present invention.
  • FIG. 5B is a schematic diagram of performing similarity calculation by using a text-to-text similarity classifier according to an embodiment of the present invention.
  • FIG. 5C is a similarity diagram of an image-image similarity classifier according to an embodiment of the present invention. Schematic diagram of the calculation;
  • 5D is a schematic diagram of performing similarity calculation by using a text-image similarity classifier according to an embodiment of the present invention
  • 6A is an optional schematic diagram of a text material according to an embodiment of the present invention.
  • FIG. 6B is an optional schematic diagram of a text material according to an embodiment of the present invention.
  • FIG. 7A is a schematic diagram of adding promotion information in a promotion article according to an embodiment of the present invention.
  • FIG. 7B is a schematic diagram of adding promotion information in a promotion article according to an embodiment of the present invention.
  • FIG. 7C is a schematic diagram of adding promotion information in a promotion article according to an embodiment of the present invention.
  • FIG. 8 is a schematic diagram of promotion information according to an embodiment of the present invention.
  • 9A is a schematic diagram of a manner of displaying promotion information according to an embodiment of the present invention.
  • 9B is a schematic diagram of a manner of displaying promotion information according to an embodiment of the present invention.
  • 9C is a schematic diagram of a manner of displaying promotion information according to an embodiment of the present invention.
  • FIG. 10 is an optional schematic flowchart of an article processing method based on a self-media platform according to an embodiment of the present disclosure
  • FIG. 11 is an optional schematic flowchart of an article processing method based on a self-media platform according to an embodiment of the present disclosure
  • FIG. 12 is a schematic structural diagram of a composition processing apparatus based on a self-media platform according to an embodiment of the present invention.
  • the terms “including”, “including” or any of them are used.
  • the other variations are intended to cover a non-exclusive inclusion, such that a method or apparatus that includes a plurality of elements includes not only the elements that are specifically described, but also other elements that are not explicitly listed, or The inherent elements.
  • an element defined by the phrase “comprising a " does not exclude the presence of additional related elements in the method or device including the element (eg, a step in the method or a unit in the device)
  • the unit here may be part of a circuit, part of a processor, part of a program or software, etc.).
  • the article processing method based on the self-media platform provided by the embodiment of the present invention includes a series of steps, but the article processing method based on the self-media platform provided by the embodiment of the present invention is not limited to the described steps, and the present invention is similarly
  • the article processing device based on the self-media platform provided by the embodiment includes a series of units, but the device provided by the embodiment of the present invention is not limited to including the unit explicitly described, and may also include when the related information is acquired or processed based on the information. The unit that needs to be set.
  • Self-media platform also known as self-media, an information platform set up on the Internet for users (including individual users, groups, organizations, etc.) to publish articles, depending on the server and the implementation of the server deployment.
  • Software supports front-end access and background processing
  • self-media platforms such as Weibo, blogs, personal websites, forum communities, and public numbers for various social applications.
  • the accounts from the media platform can be different types of individuals, organizations, groups, and enterprises. After registering the account from the media, the news, dynamics, and other articles submitted by the client are related to the user's own preferences, dynamics, or business. The end is pushed to the appropriate user via the self-media platform.
  • promotion objects such as advertisements.
  • Word vector using a word-to-vector mapping model such as word-to-vector (Word2Vector), based on the degree of semantic similarity between different words, the vector obtained by mapping words into vector space, the distance between different word vectors It is negatively related to the degree of semantic similarity of the corresponding words, that is, the smaller the distance of the word vectors of the two words (such as the Euclidean distance), the closer the semantics of the two words are.
  • Word2Vector word-to-vector
  • Content features mapping a plurality of feature words extracted from the article into corresponding word vectors, and combining them, also referred to as content feature vectors.
  • Word segmentation also known as word segmentation, according to a certain word segmentation strategy refers to the division of characters in an article into separate words.
  • Stop words words that are filtered from the article and do not affect the classification decision of the article; usually the general words do not have a clear meaning (only if they are put into a complete sentence), for example, pronouns , articles such as articles and numerals, modal particles, adverbs, prepositions and conjunctions.
  • a classifier model also called a classifier, is a model for classification obtained by means of machine learning, and is used for predicting an article as a target category based on the sample characteristics of the article to indicate that the article is a target category. The probability.
  • the classifier model in this paper can use the two-classifier model of Support Vector Machines (SVM), the word bag-based classifier model, the classifier model based on prior probability and sparse features, based on neural network and depth.
  • SVM Support Vector Machines
  • the classifier model of the classifier model such as learning, if not specified, the classifier model described in this paper is used for two classifications, such as judging whether it belongs to a topic, and determining whether the article belongs to the target category.
  • Machine Learning through the sample of the training set (referred to as The sample is trained on the sample features and whether it belongs to the target category (such as beauty articles), and the classifier model is trained so that the trained classifier model has the performance of determining whether the article sample of the test set belongs to the target category.
  • the target category such as beauty articles
  • the training set including the article that trains the classifier model, the vector representation of the article and the prior classification results are used to construct the training samples to train the classifier model, so that the classifier model has the performance of classifying the target category by the article to be tested.
  • a test set comprising articles to be tested (classified), the vector representation of the article being used to input a classifier model to predict scores belonging to the target category.
  • Embodiments of the present invention provide an article processing method based on a self media platform, an article processing device based on a self media platform based on an article processing method based on a self media platform, and an executable program for implementing an article processing method based on a self media platform.
  • Storage medium for the implementation of the article processing method based on the self-media platform, the embodiment of the present invention provides a solution implemented by the terminal side and the server side. Next, an exemplary implementation scenario of the article processing will be described.
  • FIG. 1A and FIG. 1B are schematic diagrams of an optional application scenario of an article processing method based on a self-media platform according to an embodiment of the present invention.
  • the user terminal is not limited to a mobile phone.
  • the server may be any commercial or dedicated server.
  • the server is divided into two categories, namely, a social network server 21 and an advertisement back server. 22, and in practical applications, each type of server can be set according to the actual situation one or more.
  • the user terminal 11 to the user terminal 15 can exchange information with the social network server 21 and the advertisement background server 22 through a wired network, a wireless network, or a combination of the two, and each user terminal can send and receive information (such as an article) through the server. Delivery, etc.
  • the article processing method of the embodiment of the present invention will be described below with reference to FIG. 1A and FIG. 1B. It should be noted that the network shown in FIG. 1A and FIG. 1B is merely an example for easy understanding and does not constitute the network architecture of the present invention. Any restrictions.
  • the advertisement backend server 22 obtains from the advertiser terminal. Taking the promotion article carrying the promotion information (such as advertisement), and then sending the promotion article to the social network server 21 to send the promotion article carrying the promotion information to the social network through the social network server 21, so that the social network user can receive and read the carrier. Promote promotional articles for information.
  • the promotion article carrying the promotion information may be that the advertiser writes the advertisement for the specific group (such as the public number user) or the specific product (such as a designated shampoo), for each product or a specific group of people for promotion.
  • Articles also known as soft texts
  • FIG. 2A is a schematic diagram of an optional presentation manner of a promotion article provided by an advertiser according to an embodiment of the present invention.
  • the advertiser is an advertiser for the public number user (ie, through social interaction).
  • the web client pays attention to the user of the public number to write the soft text, and then the advertiser sends the soft text to the advertisement background server 22, and the advertisement background server 22 sends the soft text to the terminal of the user who pays attention to the public number through the social network server 21, as shown in FIG. 2A.
  • the user jumps to the interface 2 by clicking anywhere in the interface 1, and both sees the content of his own attention and the advertisement of the advertiser.
  • the social network server 21 obtains articles for a particular topic collection from a user, and then sends the collected articles to an advertisement backend server 22, which adds promotional information to the collected articles ( For example, an advertisement is sent to the social network through the social network server 21, so that the social network user can receive and read the promotion article with the promotion information.
  • FIG. 2B is a schematic diagram of an optional presentation manner of a promotion article according to an embodiment of the present invention.
  • the public number initiates an essay activity on a specific theme (such as Valentine's Day).
  • the social network server 21 that carries the public number function is obtained
  • the article sent by the user participating in the activity sends the obtained article to the advertisement background server 22, and the advertisement background server 22 adds an advertisement at the end of the article, obtains a promotion article for adding the advertisement, and sends the promotion article to the social network server 21 through the public.
  • the number will send the promotion article to the social network, and the social network user jumps to the interface 2 by clicking anywhere in the interface 1 to read the article content and add the advertisement.
  • the source of the above-mentioned implementations has a large limitation due to the fact that the article is collected from the user, and is limited to the user-generated content (UGC) article of the specific topic collected, and the advertisement content is mainly displayed. Users are unable to further manipulate the displayed ads, reducing the user’s desire to purchase ads and understanding desires.
  • URC user-generated content
  • the social network server 21 obtains an article to be published from a user, and then transmits the obtained article to an advertisement backend server 22, which adds a topic related to the article topic at a specific location of the article.
  • the promotion information (such as an advertisement) is promoted and the promotion article is sent to the social network through the social network server 21.
  • FIG. 2C is a schematic diagram of an optional presentation manner of the promotion article provided by the social network server 21, and the social network server 21 is operated from the operation user terminal of the news release platform.
  • the side obtains the article to be published, and then sends the obtained article to the advertisement background server 22, and the advertisement background server 22 adds the advertisement related to the article topic to the promotion article at the end of the article, and sends the promotion article to the social network server 21 through the news.
  • the publishing platform sends the promotion article to the social network, and the user jumps to the interface 2 by clicking anywhere in the interface 1, and can see the promotion article with the added advertisement.
  • the advertisement added in the article has a high degree of fit with the article theme
  • the combination of the advertisement and the article content is hard due to the fixed location, and the promotion information and the content in the article are not highly correlated or even Unrelated situations reduce the acceptance of promotional information.
  • embodiments of the present invention are based on articles from a media platform.
  • the implementation of the method may include: the social network server 21 carrying the self-media platform function receives the to-be-published article submitted by the user of the media platform through the first client connected to the media platform, and sends the to-be-published article to the advertisement background server 22
  • the advertisement background server 22 determines, in the article to be published, a promotion article for presenting the promotion information, and a promotion location for adding the promotion information in the promotion article;
  • the social network server 21 determines the target promotion of the candidate promotion object that matches the promotion article.
  • the object and the material matching the target promotion object adding the promotion information including the material to the promotion location of the promotion article according to the determined promotion location; and sending the promotion article with the promotion information to the social network server 21, and the social network server 21 sends The promotion article of the promotion information is added to the second client of the media platform for presentation.
  • the implementation of the article processing method based on the self-media platform of the embodiment of the present invention may include: the social network server 21 carrying the self-media platform function receives the user from the media platform by connecting to the media platform.
  • the target article submitted by the first client; the social network server 21 obtains the candidate promotion object from the advertisement background server 22 or obtains the candidate promotion object stored by itself, and stores the material of the candidate promotion object itself, and determines the presentation for the presentation in the target article.
  • the social network server 21 determines the target promotion object matching the promotion article in the candidate promotion object, and the material matching the target promotion object; The location, adding the promotion information including the material to the promotion position of the promotion article; sending the promotion article adding the promotion information to the second client from the media platform for presentation.
  • the target article submitted by the client may include the article to be published and the original article; the original article here refers to the article that has been released and retracted through the self-media platform.
  • FIG. 3 an exemplary hardware structure of an apparatus corresponding to the self-media platform-based article processing method for implementing an embodiment of the present invention is described with reference to FIG. 3, and the article processing apparatus based on the self-media platform may be implemented in various forms, such as a terminal (eg, Various types of computer equipment, such as desktop computers, laptops or smart phones), servers, etc., are independent of computer equipment such as terminals and servers.
  • the self-media platform-based article processing method of the embodiment of the present invention is implemented in a coordinated manner.
  • the hardware structure of the article processing apparatus based on the self-media platform of the embodiment of the present invention is described in detail below. It can be understood that FIG. 3 only shows an exemplary structure of the article processing apparatus based on the self-media platform, and not all the structures, as needed. Part or all of the structure shown in Fig. 3 is implemented.
  • FIG. 3 is a schematic diagram of an optional hardware structure of an article processing apparatus based on a self-media platform according to an embodiment of the present disclosure, which can be applied to a server in the foregoing application scenario, such as a background server that can be a microblog/WeChat.
  • the article processing apparatus 100 shown in FIG. 3 includes at least one processor 101, a memory 102, and at least one network interface 103.
  • the various components in article processing device 100 are coupled together by bus system 104.
  • bus system 104 is used to implement connection communication between these components.
  • the bus system 104 includes, in addition to the data bus, a power bus, a control bus, and a status signal bus. However, for clarity of description, various buses are labeled as bus system 104 in FIG.
  • the memory 102 may be a volatile memory or a non-volatile memory, and may also include both volatile and non-volatile memory.
  • the memory 102 in the embodiment of the present invention is used to store various types of data to support the operation of the article processing apparatus 100 based on the self media platform.
  • Examples of such data include: any computer program for operating on the self-media platform-based article processing apparatus 100, such as the executable program 1021, the program implementing the self-media platform-based article processing method of the embodiment of the present invention may be included in It can be executed in the program 1021.
  • Network interface 103 may include one or more communication modules, such as a mobile communication module and a wireless internet module.
  • the article processing method based on the self-media platform disclosed in the embodiment of the present invention may be applied to the processor 101 or implemented by the processor 101.
  • Processor 101 may be an integrated circuit chip with signal processing capabilities. In the implementation process, the steps of the method in the embodiment of the present invention may be completed by using an integrated logic circuit of hardware in the processor 101 or an instruction in a software form.
  • the processor 101 can be a general purpose processor, a digital signal processor (DSP), or other programmable logic device, discrete gate or transistor logic device, discrete hardware component, or the like.
  • DSP digital signal processor
  • the processor 101 can implement or perform the various methods, steps, and logic blocks disclosed in the embodiments of the present invention.
  • a general purpose processor can be a microprocessor or any conventional processor or the like.
  • the steps of the method disclosed in the embodiment of the present invention may be directly implemented as a hardware decoding processor, or may be performed by a combination of hardware and software modules in the decoding processor.
  • the software module may be located in a storage medium, and the storage medium is located in the memory 102.
  • the processor 101 reads the information in the memory 102, and completes the steps of the above-described self-media platform-based article processing method provided by the embodiment of the present invention.
  • the article processing apparatus 100 based on the self-media platform may be configured by one or more Application Specific Integrated Circuits (ASICs), DSPs, Programmable Logic Devices (PLDs), and complexities.
  • a Programmable Logic Device (CPLD) is used to execute the self-media platform-based article processing method of the embodiment of the present invention.
  • the embodiment of the present invention can be implemented as an advertisement background server implemented by the article processing device based on the self-media platform as an information promotion platform function, and the advertisement background server passes the social network.
  • the server receives the to-be-published article from the self-media platform client, determines the promotion article in the article to be published, obtains the candidate promotion object information, determines the target promotion object, and the material corresponding to the target promotion object, and generates promotion information including the material, and determines Promote the promotion location of the promotion information in the article, add the promotion information to the promotion location in the promotion article, and add the promotion article of the promotion information to the client presentation of the self-media platform through the social network server.
  • the article processing device of the platform can also be implemented in other application environments, such as a background server of a news push APP, a background server of a media website, a server of an information promotion platform, etc., and the article processing device based on the self media platform is not excluded from being provided. Any application environment from the media platform function.
  • FIG. 4 is a schematic flowchart of an article processing method based on the self-media platform provided by the embodiment of the present invention.
  • the embodiment of the present invention will be described as a social network server deployed from a media platform based on a self-media platform.
  • the article processing method based on the self-media platform provided by the embodiment of the present invention includes:
  • Step 301 The publisher sends the to-be-published article to the social network server by using the first client from the media platform.
  • Articles to be published may be articles to be published on self-media platforms (such as Weibo, public number, forum community, etc.).
  • the social network server carries the functions of a self-media platform (such as Weibo, public number, QQ space, forum community, etc.), and the social network may be an Internet social entity based on Weibo, public number, etc., which can be used by users to publish articles, and the publisher of the article.
  • a self-media platform such as Weibo, public number, QQ space, forum community, etc.
  • the social network may be an Internet social entity based on Weibo, public number, etc., which can be used by users to publish articles, and the publisher of the article.
  • the article to be published is sent to the social network server for sending to the client of the self-media platform through the social network server.
  • the self-media platform's spontaneous traffic about the article is used as the carrier of the promotion information, that is, the source of the article can come from any user terminal in the social network, breaking the limitation of the article collecting the specific topic.
  • the public number operation user as the publisher of the article may submit the article through the client, for example, may be an article written by the user about the theme of mood, food, makeup, etc., or may be downloaded from the Internet and the above.
  • Topic-related articles which will be sent by the client to the social network server hosting the public number function to serve through the social network service. Send the article to the user who is following the public number.
  • the Weibo user acts as a publisher, and the article to be published is sent to the social network server hosting the Weibo function through the Weibo client, and the social network server sends the article to the user who pays attention to the publisher's Weibo account.
  • the social network server may directly send the to-be-published article to other users of the social network, or may perform the synthesis process of the promotion information for publishing the article to perform the synthesis.
  • the publication of articles with promotional information may directly send the to-be-published article to other users of the social network, or may perform the synthesis process of the promotion information for publishing the article to perform the synthesis.
  • the following takes the steps 302 to 305 to describe the synthesis processing of the promotion information of the social network server to be published.
  • Step 302 The social network server determines, in the article to be published, a promotion article for presenting the promotion information, and a location for promoting the promotion information in the promotion article.
  • the promotion information is an advertisement
  • articles that are not suitable for advertisement addition such as academic articles, so it is necessary to filter the articles to be published, and determine an article capable of presenting (adding) advertisements, that is, promotion articles.
  • the social network server may acquire the candidate promotion object through the advertisement background server, and itself stores the material information of the candidate promotion object, and the social network server may determine the promotion article for presenting the promotion information by:
  • the topic similarity degree is calculated by the topic of the article to be published and the topic represented by the candidate promotion object, and the article to be published that satisfies the topic similarity condition (such as the similarity exceeds the preset topic similarity threshold) is determined as the promotion for presenting the promotion information. article.
  • the feature words representing the topic of the article and the candidate promotion object are respectively input into a preset learning model (such as word2vec), and the theme feature vector of the corresponding article theme and the candidate promotion object is obtained through the learning model mapping, and then calculated.
  • the similarity between the topic feature vector of the article and the topic feature vector of the candidate promotion object, and the article whose similarity exceeds the similarity threshold is selected as the promotion article.
  • the topic features and candidate promotion of the article to be published can be achieved as follows: Subject similarity calculation of the subject feature of the object: input the keyword extracted from the article to be published, input the classifier model according to the feature word, and obtain the topic corresponding to the article to be published calculated by the classifier model; The keyword extracted from the material of the object is input into a classifier model for classifying the topic according to the feature word, and the topic corresponding to the candidate promotion object is obtained; and the semantic distance is determined according to the semantic distance of the topic corresponding to the article to be published and the topic corresponding to the candidate promotion object.
  • the topic similarity of negative correlations For example, calculating the Euclidean distance of the topic vector corresponding to the published article and the topic vector corresponding to the candidate promotion object, and then calculating the reciprocal thereof as the topic similarity.
  • the articles and the topics of the candidate promotion objects may be level-set according to a preset classification standard, such as a first-level theme, a second-level theme, and a third-level theme according to a hierarchical level from high to low;
  • the theme of each level may include multiple topics of the next level.
  • the first level theme may be military, sports, entertainment, finance, and the first level theme is entertainment, which may include food,
  • second-level themes such as travel, movies, and music.
  • the second-level theme is music.
  • it can include multiple third-level themes such as jazz music and classical music.
  • the social network server divides the topic of the article to be published into the corresponding first-level topic, and calculates the similarity between the first-level topic of the article to be published and the first-level topic of the candidate promotion object.
  • the similarity satisfies the similarity condition (for example, the topic similarity threshold exceeding the first-level topic)
  • the article to be published is a promotion article for presenting the promotion information.
  • the article is roughly screened to obtain the promotion article, but in other embodiments, the article may be filtered based on the second-level topic or the third-level topic to obtain the promotion article, but Since the first level topic includes a plurality of second level topics and a plurality of third level topics, that is, the feature vector dimension corresponding to the first level theme is lower than the dimension of the feature vector corresponding to the second/third level theme, and the social network
  • the traffic received by the server from the media platform is very large. Therefore, the screening of the article using the first-level theme is obviously faster than the filtering based on the second/third-level theme, that is, the promotion of the promotion information can be quickly determined.
  • the manner in which the topic of the article to be published is divided into the first-level topic is similar to the manner of dividing into the second-level theme and the third-level theme, and can be implemented by using a classifier of the corresponding level as a classifier. For example, by extracting features of the article (at least one of the text feature and the image feature), the extracted feature is mapped to a corresponding feature vector through a preset learning model, and the feature vector of the obtained article is input into a classifier of a different level.
  • the mapping obtains the topic of the corresponding level; for example, input the feature word of the extracted article into the preset word2vec model, obtain the corresponding feature vector of multiple dimensions, and input the obtained feature vector of the plurality of dimensions into the first classifier to obtain the corresponding The first level theme. For example, if the feature vector of the plurality of dimensions of the obtained article is input to the first-level classifier, the probability of the first-level theme is 10% for sports, 80% for entertainment, and 5% for financial. The most probable "entertainment" is selected as the final first-level theme.
  • the acquisition of different ranks of classifiers can be obtained in one of the following ways:
  • Supervised learning methods such as manually annotating text and/or pictures corresponding to several topics, and training a specific level of feature-thematic classifier model with the characteristics of the annotated data (text and/or picture features), obtained through training
  • the feature-theme classifier implements a mapping of the corresponding level of topics.
  • Unsupervised learning method clustering the text and/or picture features of the article to get the theme of the corresponding article.
  • the candidate promotion object may be a certain number of promotion objects prior to the prioritization (such as the auction ranking) obtained from the promotion system (such as the advertisement back-end server).
  • the social network server may also determine a promotional article for presenting promotional information by:
  • the social network server matches the subject of the to-be-published article with at least one of the name of the candidate promotion object, the category of the candidate, and the corresponding promotion information keyword, and determines that the matching condition is met (ie, the similarity condition is met, for example, the similarity reaches the threshold).
  • the article is a promotional article for presenting promotional information.
  • the subject of the above-mentioned article to be published may be one of the following two types:
  • the keyword may include: a keyword extracted from an article title or article content (such as each paragraph of the article); for example, the title of the article to be published is “Qingdao Food Analysis”, and the extracted keyword is “Gourmet”. .
  • FIG. 5A is a schematic diagram of a topic prediction using a keyword-topic classifier model according to an embodiment of the present invention.
  • a keyword-topic model may be a classifier model obtained by pre-training, and implements article keywords and articles.
  • the keyword extracted from the content of the article is “sugar”, “cookie”, “ instant noodles”, “chocolate”, and the vector corresponding to these keywords is input into the keyword-theme model to predict the theme.
  • the probability of getting the theme "food” is 80%
  • the probability of getting the theme "entertainment” is 3%
  • the result with the highest probability (food) is selected as the subject of the predicted article.
  • the candidate promotion object may be a service (such as a movie, a game, etc.) or a product (such as cosmetics, clothes, shoes, etc.);
  • the name, category, and promotion information keyword corresponding to the object may be a service name (such as a movie name), a service category (such as a movie), an advertisement word (such as "Who said the car cannot fly-XXX”);
  • the object is a product.
  • the name, category, and promotion information keyword corresponding to the candidate promotion object may be the product name corresponding to the product (such as women's brand-XX), product category (such as clothes), and advertising words (such as "from France.” Romantic, fashion clothing - XX").
  • the subject of the article to be published is: fruit, and it is determined that the article to be published satisfies the matching condition;
  • the promotion position for adding the promotion information in the promotion article is explained next.
  • the promotion location for adding the promotion information in the promotion article may be determined by: determining, according to the topic feature included in the promotion article, a paragraph having the included topic feature in the promotion article; when the included topic feature When the topic feature of the promotion information satisfies the topic similarity condition, it is determined that the position of the corresponding paragraph is a promotion position for adding the promotion information.
  • the promotion article may include one or more topics. Different topics may be distributed in different paragraphs of the promotion article. The positions corresponding to different paragraphs (corresponding to different topics) may be the middle position of the article, the end position of the article, or The position where two adjacent topics (paragraphs) are handed over; for example, when the promotion article contains only one topic, the topic feature and the topic feature of the promotion information satisfy the topic similarity condition, and the position of the paragraph including the topic (the end position of the article) In order to promote the location; when the promotion article contains two or more topics, multiple topics are distributed in different paragraphs, when at least one of the plurality of topic features and the topic feature of the promotion information satisfy the topic similarity condition, The position where the paragraph in which the topic feature satisfying the topic similarity condition is placed and the adjacent paragraph is used as the promotion position.
  • the selection of the location of the promotion information is automatically realized, the position is flexible, and the occurrence of the promotion information can be avoided, so that the content of the article and the content of the promotion information are naturally connected, and it is easy to be accepted by the user in the process of reading the article.
  • the promotion location for adding the promotion information in the promotion article may also be determined by: when the promotion information is added to the position between the adjacent paragraphs in the promotion article, according to the content style of the promotion article Whether the same type of content is segmented by the promotion information, and/or the display ratio occupied by the promotion information in the content style determines a corresponding completeness; when the completeness satisfies the preset completeness condition, determining The location between the adjacent paragraphs is the promotion location where the promotion information is added.
  • the promotion information when the promotion information is added at a position where the promotion article can add the promotion information (ie, the candidate location, for example, the middle position of any two paragraphs), according to whether the same type of content in the content style of the article is divided by the promotion information, Determining the completeness of the content style. If the same type of content is segmented by the promotion information, the content style is destroyed, and the corresponding completeness is 0; if the content of the same type in the content style of the article is not segmented by the promotion information, the content style is It is still complete, and the corresponding degree of completeness is 1.
  • the candidate position meets the integrity condition and can be used as a promotion location.
  • the end position of the article content can be used as a promotion location for adding promotion information (advertisement), so that the content style of the article does not destroyed.
  • the end position of each type or each picture may be used as a promotion position for adding the promotion information;
  • the promotion information is added in the middle of the article, the influence on the content style of the article is minimal, and the ideal integration degree of the promotion information in the article is formed.
  • the display ratio determined by the promotion information in the content style of the article is determined.
  • the social network server determines the promotion article for presenting the promotion information. And after the promotion location for adding the promotion information in the promotion article, the target promotion object matching the promotion article needs to be determined; the material corresponding to the target promotion object is used to generate the promotion information; that is, step 303 is performed: the social network server determines to match the promotion article. Target promotion object.
  • the target promotion object matching the promotion article may have one or more, and the target promotion object matching the promotion article may be determined by: performing the content feature of the promotion article and the content feature of the candidate promotion object.
  • the content similarity calculation determines the candidate promotion object that satisfies the content similarity condition as the target promotion object.
  • the social network server may determine the target promotion object that matches the promotion article by: the social network server may obtain the plurality of promotion objects to be promoted from the advertisement background server, and then perform the obtained plurality of promotion objects. Firstly, the candidate promotion object set matching the promotion article is obtained, and then the obtained candidate promotion object is subjected to secondary screening to obtain the target promotion object matching the promotion article.
  • the social network server determines a candidate promotion object that satisfies the topic similarity condition of the promotion article, forms a candidate promotion object set, completes a screening, and then determines each candidate promotion object in the candidate promotion object set and the promotion article about at least one type feature.
  • the similarity degree includes the image feature and the text feature, and the candidate promotion object whose similarity satisfies the similarity condition of the corresponding type feature is determined as the target promotion object, and the secondary screening is completed.
  • the candidate promotion object of the degree condition is determined as the target promotion object, which comprises: determining the similarity between the extracted image feature and the feature image feature of the promotion article, and determining the candidate promotion object as the target promotion when the determined similarity exceeds the similarity threshold of the image feature Object; or, to determine the similarity between the extracted text features and the text features of the promotional article, When the determined similarity exceeds the similarity threshold of the text feature, it is determined that the candidate promotion object is the target promotion object.
  • the feature extraction operation may be performed before performing feature similarity calculation, such as: extracting image features composed of colors, textures, and shapes; and/or performing word segmentation processing, filtering out stop words for word segmentation results , get the text features composed of feature words.
  • the candidate promotion object set matching the promotion article may be obtained by inputting at least one type of feature of the candidate promotion object into a classifier model for class classification according to the feature word, and obtaining a classifier model calculation output.
  • the process of classifying a topic by a classifier model according to a feature word may include: combining a word vector having a plurality of feature words to form an input vector (a word vector of each feature word is output according to a semantic-vector model), and predicting different according to the input vector The probability of the topic, taking the topic corresponding to the maximum probability as the topic to which the candidate promotion object belongs.
  • the candidate promotion object set matching the promotion article may be selected from the promotion objects to be promoted based on the second-level theme, for example, by obtaining the following Promoting the matching of candidate objects in the article matching: determining the second-level topic feature of the promotion object and the second-level topic feature of the promotion article, and calculating the similarity between the second-level topic feature of the promotion object and the second-level topic feature of the promotion article, When the preset second-level topic similarity threshold (which can be set according to actual needs, such as 70%) is exceeded, it is determined as a candidate promotion object that matches the promotion article.
  • the preset second-level topic similarity threshold which can be set according to actual needs, such as 70%
  • the second-level topic similarity calculation it is necessary to obtain the second-level theme of the promotion article and the promotion object, obtain the feature vector of the promotion object/promotion article, and input the acquired feature vector into the second classifier to obtain the corresponding first. Secondary theme.
  • the feature vector of the multiple dimensions of the promotion object/promotional article is mapped to obtain the corresponding second-level theme. Therefore, the above process of mapping to the corresponding second-level topic is equivalent to performing feature vectors of multiple dimensions. Dimensionality reduction The process, in this way, reduces the difficulty of the algorithm of the article processing.
  • the purpose of the extraction is to obtain a semantic description of the text from the article or the paragraph.
  • the two main operations of preprocessing and text feature extraction may be included, wherein the preprocessing may include the following steps. :
  • Step 1 invalid character filtering; for example: if the article is derived from a web page, it is usually necessary to filter the HTML tag by means of a regular expression or the like.
  • step 1 it is often necessary to first encode and convert the content obtained in step 1. Then, regular expressions can be used to match the punctuation marks and line breaks to divide the paragraphs of the article into sentences. Finally, the Chinese word segmentation can be used to divide the sentences into one sentence. Separate words.
  • Step 3 Filter the stop words
  • words of irrelevant semantics such as "", “ground”, etc.
  • word extraction may be further performed, so that subsequent text feature extraction is more convenient.
  • the pre-processing of the text feature extraction is completed, and then the text feature can be extracted.
  • the text feature can be extracted in one of the following ways:
  • Keyword extraction such as algorithm implementation using word frequency-inverse document frequency (TF-IDF, Term Frequency-Inverse Document Frequency).
  • BOW Bit of Words
  • Deep learning models such as Word Embedding, map words to get word vectors and operate them through word vectors.
  • the extraction of image features is described.
  • the purpose of the extraction is to obtain the image from the article.
  • the semantic description of the picture, in one implementation, the extraction of image features in the article can be one of the following ways:
  • Adopt global statistical features such as histogram, contrast, geometric invariant moment, and so on.
  • Text features such as Linear Back Projection (LBP), Generalized Search Trees (GIST), corner features (such as Harris corner detection, etc.) , edge features (such as multi-level edge detection algorithm - Canny operator), shape features (such as Hough transform).
  • LBP Linear Back Projection
  • GIST Generalized Search Trees
  • corner features such as Harris corner detection, etc.
  • edge features such as multi-level edge detection algorithm - Canny operator
  • shape features such as Hough transform
  • Feature extraction is performed by using at least one of a Scale-invariant feature transform (SIFT), a Histogram of Oriented Gradient (HOG), and a Haar classifier.
  • SIFT Scale-invariant feature transform
  • HOG Histogram of Oriented Gradient
  • Haar classifier a Haar classifier
  • CNN network has many specific implementation methods, such as AlexNet, VGG, ResNet, etc. In actual implementation, it can be used for general data training such as ImageNet.
  • AlexNet AlexNet
  • VGG VGG
  • ResNet ResNet
  • the result of the last convolutional layer of the model is characteristic of the CNN model.
  • the candidate promotion object selected by the second-level topic is subjected to secondary screening to obtain a target promotion object that matches the promotion article.
  • the candidate promotion object collection may be selected by the following manner.
  • the target promotion object matching the promotion article extracting the feature of the candidate promotion object, the extracted feature includes at least one type of the image feature and the text feature; calculating the similarity between the extracted feature and the corresponding type feature of the promotion article; When the similarity threshold of the corresponding type feature is exceeded, the target promotion object matching the promotion article is determined. In this way, the adaptation of the target promotion object is automatically realized, so that the promotion information (advertisement) in the promotion article has a high degree of fit with the article content, and does not create a process for the user to read the article. Interference has improved the user's reading experience.
  • the following describes the target promotion object obtained by pre-training to filter out the target promotion object that matches the promotion article from the candidate promotion object set.
  • FIG. 5B is a schematic diagram of the similarity calculation using the text-to-text similarity classifier according to the embodiment of the present invention.
  • the candidate is extracted.
  • the text-text similarity exceeds the text-text similarity threshold, determine that the candidate promotion object matches the promotion article.
  • Target promotion object determines that the candidate promotion object matches the promotion article.
  • FIG. 5C is a schematic diagram of the similarity calculation using the image-image similarity classifier according to the embodiment of the present invention
  • FIG. 5C the extraction promotion
  • the image feature of the object and the image feature of the article are extended, and the corresponding image-image similarity classifier is input.
  • the candidate promotion object is determined to be the target matching the promotion article. Promote the object.
  • the similarity can be calculated from the feature vector of the image, as obtained in the following manner:
  • the target promotion object matching the promotion article may be selected from the candidate promotion object set by calculating a third-level theme of the candidate promotion object in the candidate promotion object set and the third promotion article.
  • the similarity of the level topic, when the third level topic similarity threshold is exceeded, is determined as the target promotion object that matches the promotion article.
  • the screening from the set of candidate promotion objects can be implemented in the following manner.
  • Target promotion object matching the promotion article extracting the image features of the candidate promotion object and promoting the text feature of the article, determining the similarity between the image feature of the candidate promotion object and the text feature of the promotion article; when the text and image similarity threshold is exceeded, Determine the target promotion object that matches the promotion article.
  • FIG. 5D is a schematic diagram of performing similarity calculation by using a text-image similarity classifier according to an embodiment of the present invention. Referring to FIG. 5D, extracting image features of the promotion object and text features of the promotion article, and inputting corresponding text-image similarity classification. When the text-image similarity exceeds the text-image similarity threshold, it is determined that the promotion object is a promotion object that matches the promotion article.
  • the target promotion object matching the promotion article may be determined by: calculating a similarity between the obtained promotion object to be promoted and at least one of the image feature and the text feature of the promotion article; determining that the similarity satisfies the corresponding The promotion object of the similarity condition of the type feature is the target promotion object matching the promotion article.
  • performing feature extraction operations on at least one of the following types of material of the candidate promotion object and the promotion article extracting image features composed of colors, textures, and shapes; performing word segmentation processing, filtering and filtering the word segmentation results Using the word, the text feature composed of the feature word is obtained; determining the similarity between the candidate promotion object and the promotion article regarding the at least one type feature: the candidate promotion object satisfying the similarity condition of the corresponding type feature is determined as the target promotion object.
  • the candidate promotion object is the target promotion object; for example, calculating the text feature of the candidate promotion object text material (pre-stored in the media platform, such as classification information, advertisement words), and promoting the text feature of the text in the article Similarity, if the picture similarity condition is satisfied (greater than the text feature similarity threshold), the candidate promotion object is the target promotion object.
  • This implementation eliminates the need to The process of determining the set of candidate promotion objects directly determines the target promotion object based on the image features and/or text features of the promotion article.
  • the target promotion object matching the promotion article may also be determined by determining the similarity between the image feature of the candidate promotion object and the text feature of the promotion article; when the determined similarity exceeds the text and image similarity threshold When the candidate promotion object is determined as the target promotion object.
  • the feature vector refers to the feature vector in the embodiment of the present invention
  • the similarity between the candidate promotion object and the different types of features of the promotion article can be calculated, and then the threshold comparison is performed to determine the target promotion object.
  • the image features of the candidate promotion object are used here, and the similarity calculation is performed using the text features of the article because: for all candidate promotion objects, there will be corresponding image material in the self-media platform, and all the articles are Including text, it can guarantee that the similarity of the two can always be calculated; avoiding the problem that the similarity cannot be calculated by using the same type of features because the text material of the candidate promotion object is missing from the media platform and the image material is missing in the text.
  • step 302 there is no dependency relationship between step 302 and step 303, and the execution order is interchangeable.
  • step 304 the social network server determines the material that matches the target promotion object, and forms promotion information including the material.
  • the material that matches the target promotion object can be determined as follows:
  • Extracting the character keyword from the promotion article comprising: combining at least one of the character keyword and the tag keyword of the target promotion object with the template content of the promotion object to form a first text material that matches the promotion object.
  • the character keyword can be the title of the author or other person appearing in the article, such as: American friends, star balls, etc.; and the way to extract the character keywords can be extracted based on the semantic analysis method.
  • the tag keyword is a keyword used to identify the features, functions, etc. of the promotion object, and each promotion The object has a corresponding label keyword for identifying the feature, function, and the like of the promotion object.
  • the label keyword may be: moisturizing and hydrating.
  • a template for generating a text material (which may be a unified template or a template for classification of promotion objects for different topics) is preset for the promotion object, and a fixed text description is set in the template. And the blank text position to be supplemented, when the character keyword and/or the tag key of the promotion object are substituted into the template, the text material corresponding to the promotion object is formed.
  • FIG. 6A is an optional schematic diagram of the text material provided by the embodiment of the present invention.
  • the tag keyword is popular, and after entering it into the template, the text material generated by the text template + dynamic text (ie, the tag keyword) is: This is also the popular recommendation.
  • FIG. 6B is an optional schematic diagram of the text material provided by the embodiment of the present invention.
  • the character of the promotion object is the big cake, the label keyword is popular, and the text material generated by the text template + dynamic text (ie, the label keyword and the character keyword) is substituted into the template: this is also a big cake.
  • Popular items recommended by the main focus are also a big cake.
  • the template is combined with the mask template to form a corresponding text material: star ball A highly recommended mask that moisturizes and replenishes water.
  • the social network server can determine the material that matches the target promotion object by:
  • Image recognition is performed on the target promotion object, and the image recognition result representing the attribute of the target promotion object is obtained; the image recognition result is combined with the description information of the target promotion object to form a second text material that matches the target promotion object.
  • the material matching the target promotion object may include at least one of the first text material and the second text material.
  • the image recognition result represents the attributes of the target promotion object: such as the name (what is the promotion object, such as clothes and shoes), color, style, etc.;
  • the description information of the target promotion object may be information that is presented in a keyword form, and identifies related content of the target promotion object from different dimensions, such as a price description and a source of the target promotion object; and the description information of the target promotion object often includes A hyperlink that enables the user to interact with the target promotion object, so that when the user clicks on the description information, the page jumps to the corresponding page.
  • the target promotion information includes image material in addition to the formed text material, and the image material can be obtained by: image feature of the image material of the candidate promotion object and image features of the promotion article When the matching condition of the image feature is satisfied (for example, the similarity exceeds the preset threshold), the image material that matches the target promotion object is determined.
  • the image material can also be obtained by:
  • FIG. 7A and FIG. 7B are schematic diagrams showing the addition of promotion information to the end position of a picture in a promotion article according to an embodiment of the present invention.
  • block 72 is a picture in the article.
  • the promotion information added at the end position wherein the block 71 is a text module for carrying the text material generated by the promotion object-based template included in the promotion information, and the block 73 is a picture module for carrying the image material included in the promotion information.
  • the text material portion included in the promotion information includes the above description information in addition to the text material generated based on the template of the promotion object.
  • FIG. 7C is a schematic diagram of adding promotion information at the end position of the promotion article according to an embodiment of the present invention.
  • the block 70 corresponds to the promotion information
  • the block 77 is a text module, and is used to carry the promotion information.
  • the block 78 is a picture module, and is used to carry the image material included in the promotion information
  • the block 79 is a description information module, and is used to carry the description information of the target promotion object included in the promotion information (such as the promotion object). Details and sources).
  • the material matched by the target promotion object may be composed of image material and description information of the target promotion object.
  • FIG. 8 is a schematic diagram of the promotion information provided by the embodiment of the present invention.
  • the block 81 is a picture module, and is used to carry the image material included in the promotion information
  • the block 82 is a description information module, and is used for Carrying description information of the target promotion object included in the promotion information.
  • generating promotional information based on the obtained material may be accomplished by obtaining fixed content for first (time or location first) presentation in the promotional information, the fixed content being used to guide viewing additions After the promotion information; the fixed content obtained, and the obtained material are filled into the promotion information template to obtain the promotion information.
  • step 305 is performed: adding the promotion information to the promotion location according to the determined promotion location.
  • the synthetic promotion article and the promotion information including the material are synthesized, and the synthesized article is obtained.
  • the social network server may also set the display manner of the promotion information.
  • FIG. 9A to FIG. 9C are schematic diagrams showing the display manner of the promotion information provided by the embodiment of the present invention, for example, setting the text module correspondingly.
  • the content is hidden, that is, the display manner of the text material generated by the template based on the promotion object in the promotion information is hidden, as shown in FIG. 9A, when the user clicks on the text module part in the picture, the hidden text material can be displayed; or, for example, As shown in FIG. 9B, the text module of the promotion information is displayed to display the fixed content (the promotion information after the viewing is added); or, as shown in FIG. 9C, the text module of the promotion information is dynamically displayed, and the text generated based on the template of the promotion object is dynamically displayed.
  • the content of the material such as scrolling through the content of the text material.
  • the social network server performs the process of adding the promotion information to the promotion article, and then performs step 306: the social network server sends the added promotion article to the self-media level.
  • the second client in Taichung.
  • the social network server may obtain the user's article preference based on the self-media account that the user logs in (eg, the article category that is preferred by the user based on the user's article reading record, and the largest number of users that can be read by the user.
  • the article actively pushes a promotion article with promotion information to the client of the user's self-media platform based on the user's preference to the client from the media platform for presentation.
  • the social network server may send the promotion article with the promotion information to the client of the self-media platform for presentation based on the read request sent by the user terminal (ie, terminal pull).
  • step 307 is performed to display the promotion article.
  • the user can understand the added promotion information while seeing the content of the article that he is concerned about, and the user's reading experience is enhanced due to the natural transition of the promotion information and the article.
  • the first client and the second client in the self-media platform are described, and the publisher of the article may also be the reader of the article (ie, the first client and the second client are the same client), and the publisher of the article The same user is the reader of the article.
  • the first client submits the article to be published to the social network server
  • the submitted article to be published is determined to be the promotion article and there is a target promotion object matching the same
  • the first client obtains the article submitted by itself, it also receives the promotion information added with the material matching the target promotion object, and then presents the original content of the article, and displays the promotion location according to the promotion information in the article. Promotional information is presented when the original content of the article is promoted to the appropriate location.
  • the publisher of the article is not the same user as the reader of the article (ie, the first client and the second client are different clients), and at this time, the second client can pull according to the user's access request.
  • the promotion information may be presented in one of the following ways:
  • the self-media platform to spontaneously report the traffic of the article as the carrier of the promotion information, that is, the source of the article can come from any user terminal in the social network, breaking the limitation of the article collecting the specific topic, and realizing the batch of the promotion information.
  • the promotion information can be reached in time to reach the user;
  • the location is flexible, and can avoid the promotion of information.
  • the abruptness makes the content of the article and the content of the promotion information natural, and is easy to accept in the process of reading the article.
  • FIG. 10 is a schematic flowchart of an optional article processing method based on the self-media platform provided by the server side in the embodiment of the present invention.
  • the promotion information is used as an advertisement
  • the promotion object is an advertisement object (advertisement product).
  • the article processing method based on the self-media platform provided by the embodiment of the present invention includes:
  • Step 401 The server performs semantic analysis on the article.
  • the article mentioned here is the self-media article to be published by the server or the article that has been published on the self-media platform but has been withdrawn.
  • the semantics of the article is used to understand the title of the article, from the media name (such as the public name). ), the author's name, understand the text of the entire article. Select the topic that belongs to this article as the basis for matching the advertising object. In turn, you can filter out articles that match the ad object.
  • the feature information that the advertisement object can match includes: a category of the advertisement object, an advertisement word, a name of the advertisement object, and the like.
  • the feature information that the article can match includes keywords and the like.
  • Semantic analysis that is, semantic understanding, refers to the transformation of unstructured or semi-structured natural language text into structured information that can be deeply processed by computers, and classified, analyzed, and so on.
  • Step 402 Identify the subject of the article.
  • it can be obtained by training the keyword-topic model, extracting the keywords of the article through semantic analysis, and then inputting the trained keyword-topic model to get the topic of the article.
  • Step 403 Determine whether the article matches the advertisement object based on the article theme. If the matching performs step 404, if there is no match, the article does not advertise.
  • the server may match the topic of the article with at least one of the name, category, and advertisement word corresponding to the advertisement object, and determine that the article satisfying the matching condition can be matched widely.
  • the article of the object may match the topic of the article with at least one of the name, category, and advertisement word corresponding to the advertisement object, and determine that the article satisfying the matching condition can be matched widely.
  • Step 404 Perform semantic analysis of the segmentation of the article.
  • a semantic analysis of the segmentation of the article yields whether the article has multiple (two or more) topics.
  • Step 405 Identify whether the article has multiple topics.
  • Step 406 Determine that there are multiple topics in the article, and perform step 408.
  • Step 407 Determine that the article has a single topic, and perform step 409.
  • Step 408 Mark multiple locations where the advertisement is added in the article.
  • Step 409 Add the location of the advertisement at the end of the article.
  • Step 410 Select a set of alternative advertisement object sets from the advertisement object library according to the article theme.
  • the set of alternative advertising objects (eg, determining that the similarity reaches a preset threshold) can be determined by calculating the similarity of the subject of the advertising object to the topic of the article.
  • Step 411 Match the advertisement object according to the text content of the article.
  • the pre-trained text-image similarity classifier can be used to input the text feature of the article and the image feature of the advertisement object to obtain the similarity between the two, when only the position where the advertisement is added is marked at the end of the article. You only need to match the most similar ad object. When you mark multiple ads in the article, you can select the corresponding number of ad objects according to the similarity.
  • Step 412 Perform image recognition on the matched advertising object, and obtain description information of the advertising object.
  • the advertising object may only have image information, and the image recognition of the advertising object can be obtained.
  • the material information of the advertisement object such as the specific content of the advertisement object, such as clothes, shoes, and the like.
  • the description information of the advertisement object includes the source, price, description details, and the like of the advertisement object.
  • Step 413 Add text according to the image recognition result and the description information of the advertisement object.
  • the above-mentioned added text is the text material of the advertisement for adding the article.
  • Step 414 The description information of the extracted advertisement object is displayed on the advertisement image.
  • the description information of the corresponding advertisement object includes a corresponding hyperlink, and when the user clicks, jumps to the corresponding page, such as jumping to the purchase page of the advertisement object.
  • Step 415 Add the added text and the interactive advertising object as an advertisement to the article.
  • the interactive means refers to a hyperlink included in the advertisement, and the user clicks to perform a page jump.
  • the server carrying the self-media platform first determines the promotion article for presenting the promotion information, and determines the target promotion object based on the determined promotion article, but in actual application, The target promotion object can be determined first, and then the promotion article for presenting the promotion information can be determined.
  • the article processing method based on the self-media platform is described in detail.
  • FIG. 11 is a schematic flowchart of an optional article processing method based on the self-media platform provided by the server side in the embodiment of the present invention.
  • the self-media platform can be carried on a social network server with social functions, and the promotion information is an advertisement, and the promotion object is an advertisement object (advertisement product).
  • the self-media provided by the embodiment of the present invention is provided.
  • the article's article processing methods include:
  • Step 501 The first client sends the target article to the self-media platform.
  • the client connects to the self-media platform, and the target article is submitted by the user of the self-media platform through the client.
  • the target article includes the article to be published and the original article.
  • Step 502 Determine a target promotion object and a target promotion object in the candidate promotion object. Matching material.
  • the self-media platform may determine the target promotion object among the candidate promotion objects stored in the self-media platform by:
  • determining a candidate promotion object that satisfies a topic similarity condition with a topic of the historical target article, forming a candidate promotion object set, and determining a similarity between each candidate promotion object in the candidate promotion object set and the historical target article regarding at least one type feature The feature includes an image feature and a text feature; the candidate promotion object whose similarity satisfies the similarity condition of the corresponding type feature is determined as the target promotion object.
  • Step 503 Determine, according to the determined target promotion object, a promotion article for presenting the promotion information in the received target article.
  • the theme feature of the target promotion object and the theme feature of the target article are subjected to topic similarity calculation, and the target article satisfying the topic similarity condition is determined as Describe the promotion article.
  • the keyword extracted from the target article is input into a classifier model that classifies the topic according to the feature word, and the topic corresponding to the target article calculated by the classifier model is obtained; the keyword extracted from the material of the candidate promotion object is input according to the keyword.
  • the classifier model of the feature word is used to obtain the topic corresponding to the candidate promotion object; according to the semantic distance of the topic corresponding to the target article and the topic corresponding to the candidate promotion object, the topic similarity with the negative relationship of the semantic distance is determined, and the received feature is received.
  • the target article in the target article that satisfies the topic similarity condition is determined as a promotion article.
  • Step 504 Determine a promotion location for adding promotion information in the promotion article.
  • the promotion location for adding promotional information in the promotion article may be determined as follows:
  • Step 505 Generate promotion information according to the determined material that matches the target promotion object.
  • the promotion information can be generated by:
  • the fixed content for first presentation in the promotion information is obtained, and the fixed content is used to guide the viewing of the added promotion information; and the obtained fixed content and the obtained material are filled into the promotion information template to obtain the promotion information.
  • Step 506 Add the promotion information to the corresponding promotion location in the promotion article according to the determined promotion location.
  • Step 507 Send a promotion article to which the promotion information is added to the second client.
  • Step 508 The second client displays the promotion article.
  • FIG. 12 is a schematic structural diagram of an article processing apparatus based on a self-media platform according to an embodiment of the present invention, including:
  • the receiving unit 31 is configured to receive a target article sent by the client, where the client is used to connect to the self-media platform, and the target article is submitted by the user of the self-media platform through the first client;
  • the determining unit 32 is configured to determine, in the target article, a promotion article for presenting the promotion information, and a promotion location for adding the promotion information in the promotion article;
  • the generating unit 33 is configured to generate promotion information according to the determined material that matches the target promotion object;
  • the adding unit 34 is configured to add the promotion information according to the determined promotion location Add to the corresponding promotion location in the promotion article;
  • the sending unit 35 is configured to send the promotion article to which the promotion information is added.
  • the determining unit 32 is further configured to perform topic similarity calculation on the topic feature of the target article and the topic feature of the candidate promotion object, and determine the target article that satisfies the topic similarity condition as the promotion. article;
  • the determining unit 32 is further configured to perform content similarity calculation on the content feature of the historical target article and the content feature of the candidate promotion object, and determine the candidate promotion object that satisfies the content similarity condition as the target. Promotion target;
  • the historical target article is received and sent by the self-media platform prior to the target article;
  • the determining unit 32 is further configured to determine, in the promotion article, a paragraph having the included topic feature according to the topic feature included in the promotion article;
  • determining the location of the paragraph is a promotion location for adding the promotion information.
  • the determining unit 32 is further configured to: when the promotion information is added at a position between adjacent paragraphs in the promotion article, according to whether the same type of content in the content style of the promotion article is Segmented by the promotion information, and/or a display ratio occupied by the promotion information in the content style, determining a corresponding completeness;
  • determining the position between the adjacent paragraphs is Add the promotion location of the promotion information.
  • the determining unit 32 is further configured to input a keyword extracted from the to-be-published article into a classifier model that classifies topics according to feature words, and obtain the calculated output of the classifier model.
  • the determining unit 32 is further configured to perform a feature extraction operation on at least one of the following types of material of the candidate promotion object and the promotion article: extracting image features composed of colors, textures, and shapes Perform word segmentation processing, filter out the stop words on the word segmentation results, and obtain text features composed of feature words;
  • the candidate promotion object that satisfies the similarity condition of the corresponding type feature is determined as the target promotion object.
  • the determining unit 32 is further configured to: determine a candidate promotion object that satisfies a topic similarity condition with a topic of the promotion article, and form a candidate promotion object set;
  • the image includes an image feature and a text feature
  • the candidate promotion object whose similarity satisfies the similarity condition of the corresponding type feature is determined as the target promotion object.
  • the determining unit 32 is further configured to input at least one type of feature of the candidate promotion object into a classifier model that performs topic classification according to the feature word, and obtain the candidate of the classifier model calculation output. Promote the subject to which the object belongs;
  • the candidate promotion object that satisfies the topic similarity condition with the topic of the promotion article is determined.
  • the determining unit 32 is further configured to determine a similarity between an image feature of the candidate promotion object and a text feature of the promotion article;
  • the candidate promotion object is the target promotion object.
  • the determining unit 32 is further configured to extract a character keyword from the promotion article
  • the determining unit 32 is further configured to perform image recognition on the target promotion object, and obtain an image recognition result that is used to represent the promotion object attribute;
  • the determining unit 32 is further configured to: when the image feature of the image material of the candidate promotion object and the image feature of the promotion article satisfy the matching condition of the image feature, the image that satisfies the matching condition is The material is an image material corresponding to the target promotion object.
  • the generating unit 33 is further configured to obtain fixed content for first presentation in the promotion information, where the fixed content is used to guide viewing the added promotion information;
  • the fixed content and the obtained material are filled into a promotion information template to obtain the promotion information.
  • the embodiment of the invention further provides a server, including:
  • a memory configured to store an executable program
  • the processor configured to execute the executable program stored in the memory, implements the above-described self-media platform-based article processing method.
  • the embodiment of the present invention further provides a readable storage medium, which may include: a mobile storage device, a random access memory (RAM), a read-only memory (ROM), a magnetic disk, or A variety of media such as optical discs that can store program code.
  • a readable storage medium stores an executable program
  • the executable program is configured to implement the above-described self-media platform-based article processing method when executed by a processor.
  • the embodiment of the present invention receives a target article sent by a client, where the client is used to connect to the self-media platform, and the target article is submitted by a user of the self-media platform through the client; Determining, in the article, a promotion article for presenting the promotion information, and a promotion location for adding the promotion information in the promotion article; determining a target promotion object and the target in the candidate promotion object stored in the self-media platform Promoting the matching material of the object; generating the promotion information according to the determined material matching the target promotion object; adding the promotion information to the corresponding promotion location in the promotion article according to the determined promotion location; sending The promotion article with the promotion information is added.
  • the source of the target article can come from any user terminal in the social network, breaking the limitation of the article collecting the specific topic, and realizing the batching and automatic addition of the promotion information; automatically selecting the location of the promotion information, the location Flexible, able to avoid the abrupt appearance of promotional information, making the content of the article and the promotion information
  • the content is connected naturally; through the release of the article and the process of reaching the user, the delivery of the promotion information is completed, and the promotion information is realized by relying on the release/send traffic of the self-media platform itself, and the promotion information can cover the access traffic from the media platform and reach the user in real time.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • General Engineering & Computer Science (AREA)
  • General Health & Medical Sciences (AREA)
  • Business, Economics & Management (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Artificial Intelligence (AREA)
  • General Business, Economics & Management (AREA)
  • Accounting & Taxation (AREA)
  • Development Economics (AREA)
  • Economics (AREA)
  • Marketing (AREA)
  • Finance (AREA)
  • Strategic Management (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)
  • Information Transfer Between Computers (AREA)

Abstract

Provided are a method, device, and server for processing written articles, and a server, employing user-generated content platforms, and a storage medium. The method comprises: receiving target articles sent by a client, wherein the client is used to connect to a user-generated content platform, and the target articles are submitted by users of the user-generated content platform by means of the client; determining a promotional article for presenting promotional information from among the target articles, as well as a position for adding the promotional information in the promotional article; determining a target promotional object from among candidate promotional objects stored in the user-generated content media platform and materials matching the target promotional object; generating promotional information according to the determined materials matching the target promotional object; according to the determined promotional position, adding the promotional information to the corresponding position in the promotional article; and sending the promotional article with the added promotional information.

Description

文章处理方法、装置、服务器及存储介质Article processing method, device, server and storage medium 技术领域Technical field
本发明涉及通信技术,尤其涉及一种基于自媒体平台的文章处理方法、装置、服务器及存储介质。The present invention relates to communication technologies, and in particular, to an article processing method, apparatus, server, and storage medium based on a self-media platform.
背景技术Background technique
随着互联网特别移动互联网的发展,微博、博客和公众号等自媒体平台成为普遍使用的社交途径,在自媒体平台中针对产品或服务等推广对象的宣传,成为推广商品、服务等各种推广对象的普遍使用的技术手段。With the development of the Internet's special mobile Internet, micro-blogs, blogs, and public accounts have become popular social channels. In the self-media platform, the promotion of products or services has become a promotion of goods and services. Promote the universal use of technical means.
在自媒体平台中发布的文章承载了用户表达情绪、传播信息和社交的诉求,相关技术提供在文章中添加推广信息的方案,当文章触达用户并被观看时,文章中添加的推广信息在用户观看文章的过程中呈现,实现宣传推广对象的效果。The articles published in the self-media platform carry the user's appeals for expressing emotions, disseminating information and socializing. The related technologies provide a scheme for adding promotional information to the article. When the article reaches the user and is viewed, the promotional information added in the article is The user presents the article in the process of viewing the article, and realizes the effect of promoting the object.
对于在文章中添加推广信息,相关技术采用的一种技术方案是,将用户在自媒体平台发表的文章中添加推广信息,然后向用户推送添加有推广信息的文章,这种技术方案对于用户阅读文章过程中的感知造成很大的干扰,导致发布的文章的用户接受度会下降,进而影响对象推广的效果;For the promotion information added in the article, a technical solution adopted by the related technology is to add the promotion information to the article published by the user on the self-media platform, and then push the article with the promotion information to the user, and the technical solution is read for the user. The perception in the process of the article causes a lot of interference, which leads to a decrease in the user acceptance of the published article, which in turn affects the effect of object promotion;
另外,相关技术还采用的一种技术方案是,在自媒体平台开设专用的账号,通过账号发布各种推广信息的文章的方案,由于这种专用账号在发布的的访问流量具有很大的波动性,特别是在账号创建的初期,难以支撑宣传推广对象的时效性和覆盖特定用户群体的需求。In addition, a technical solution adopted by the related art is that a special account is opened from the media platform, and an article of various promotion information is published through an account, and the access traffic of the dedicated account has a large fluctuation. Sex, especially in the initial stage of account creation, it is difficult to support the timeliness of the promotion target and the needs of a specific user group.
发明内容Summary of the invention
有鉴于此,本发明实施例期望提供一种基于自媒体平台的文章处理方 法、装置、服务器及存储介质,能够实现推广信息在自媒体文章的理想的融合,以及推广信息触达用户的良好时效性。In view of this, embodiments of the present invention are expected to provide an article processing party based on a self-media platform. The method, device, server and storage medium can realize the ideal fusion of promotion information in self-media articles, and promote the good timeliness of information reaching users.
为达到上述目的,本发明实施例的技术方案是这样实现的:To achieve the above objective, the technical solution of the embodiment of the present invention is implemented as follows:
第一方面,本发明实施例提供一种基于自媒体平台的文章处理方法,包括:In a first aspect, an embodiment of the present invention provides an article processing method based on a self-media platform, including:
接收客户端发送的目标文章,其中,所述客户端用于连接自媒体平台,所述目标文章由所述自媒体平台的用户通过所述客户端提交;Receiving a target article sent by the client, where the client is used to connect to the self-media platform, and the target article is submitted by the user of the self-media platform through the client;
在所述目标文章中确定用于呈现推广信息的推广文章、以及所述推广文章中用于添加推广信息的推广位置;Determining, in the target article, a promotion article for presenting promotion information, and a promotion location for adding promotion information in the promotion article;
在存储于所述自媒体平台的候选推广对象中确定目标推广对象、以及与所述目标推广对象匹配的素材;Determining, in a candidate promotion object stored in the self-media platform, a target promotion object and a material matching the target promotion object;
根据所确定的与所述目标推广对象匹配的素材生成推广信息;Generating promotion information according to the determined material that matches the target promotion object;
根据所确定的所述推广位置,将所述推广信息添加到所述推广文章中相应的推广位置;And adding the promotion information to a corresponding promotion location in the promotion article according to the determined promotion location;
发送添加有所述推广信息的所述推广文章。The promotion article to which the promotion information is added is sent.
第二方面,本发明实施例提供一种基于自媒体平台的文章处理方法,所述方法由服务器执行,所述服务器包括有一个或多个处理器以及存储器,以及一个或一个以上的程序,其中,所述一个或一个以上的程序存储于存储器中,所述程序可以包括一个或一个以上的每一个对应于一组指令的单元,所述一个或多个处理器被配置为执行指令;所述方法包括:In a second aspect, an embodiment of the present invention provides an article processing method based on a self-media platform, where the method is performed by a server, where the server includes one or more processors and a memory, and one or more programs, where The one or more programs are stored in a memory, the program may include one or more units each corresponding to a set of instructions, the one or more processors being configured to execute instructions; Methods include:
接收客户端发送的目标文章,其中,所述客户端用于连接所述自媒体平台,所述目标文章由所述自媒体平台的用户通过所述客户端提交;Receiving a target article sent by the client, where the client is used to connect to the self-media platform, and the target article is submitted by the user of the self-media platform through the client;
在所述目标文章中确定用于呈现推广信息的推广文章、以及所述推广文章中用于添加推广信息的推广位置;Determining, in the target article, a promotion article for presenting promotion information, and a promotion location for adding promotion information in the promotion article;
在存储于所述自媒体平台的候选推广对象中确定目标推广对象、以及 与所述目标推广对象匹配的素材;Determining a target promotion object among candidate promotion objects stored in the self-media platform, and a material that matches the target promotion object;
根据所确定的与所述目标推广对象匹配的素材生成推广信息;Generating promotion information according to the determined material that matches the target promotion object;
根据所确定的所述推广位置,将所述推广信息添加到所述推广文章中相应的推广位置;And adding the promotion information to a corresponding promotion location in the promotion article according to the determined promotion location;
发送添加有所述推广信息的所述推广文章。The promotion article to which the promotion information is added is sent.
第三方面,本发明实施例提供一种基于自媒体平台的文章处理装置,包括:In a third aspect, an embodiment of the present invention provides an article processing apparatus based on a self-media platform, including:
接收单元,配置为接收客户端发送的目标文章,其中,所述客户端用于连接所述自媒体平台,所述目标文章由所述自媒体平台的用户通过所述第一客户端提交;a receiving unit, configured to receive a target article sent by the client, where the client is used to connect to the self-media platform, and the target article is submitted by the user of the self-media platform through the first client;
确定单元,配置为在所述目标文章中确定用于呈现推广信息的推广文章、以及所述推广文章中用于添加推广信息的推广位置;a determining unit configured to determine, in the target article, a promotion article for presenting the promotion information, and a promotion location for adding the promotion information in the promotion article;
以及,配置为在存储于所述自媒体平台的候选推广对象中确定目标推广对象、以及与所述目标推广对象匹配的素材;And configured to determine a target promotion object and a material matching the target promotion object among the candidate promotion objects stored in the self-media platform;
生成单元,配置为根据所确定的与所述目标推广对象匹配的素材生成推广信息;a generating unit, configured to generate promotion information according to the determined material that matches the target promotion object;
添加单元,配置为根据所确定的所述推广位置,将所述推广信息添加到所述推广文章中相应的推广位置;Adding a unit, configured to add the promotion information to a corresponding promotion location in the promotion article according to the determined promotion location;
发送单元,配置为发送添加有所述推广信息的所述推广文章。The sending unit is configured to send the promotion article to which the promotion information is added.
第四方面,本发明实施例提供一种服务器,包括:In a fourth aspect, an embodiment of the present invention provides a server, including:
存储器,配置为存储可执行程序;a memory configured to store an executable program;
处理器,配置为执行所述存储器中存储的可执行程序时,实现上述基于自媒体平台的文章处理方法。The processor, configured to execute the executable program stored in the memory, implements the above-described self-media platform-based article processing method.
第五方面,本发明实施例提供一种存储介质,存储有可执行程序,所述可执行程序被处理器执行时,实现上述的基于自媒体平台的文章处理方 法。In a fifth aspect, an embodiment of the present invention provides a storage medium, where an executable program is stored, and when the executable program is executed by a processor, the article processing party based on the self-media platform is implemented. law.
应用本发明上述实施例具有以下有益效果:The above embodiments of the present invention have the following beneficial effects:
目标文章的来源可以来自社交网络中任意一个用户终端,打破了靠征集特定主题的文章的局限性,可实现推广信息的批量化和自动化添加;自动实现推广信息的位置的选定,位置灵活,能够避免推广信息的出现突兀,使得文章内容与推广信息的内容衔接自然;通过文章发布以及触达用户的过程完成推广信息的传递,依赖自媒体平台自身的发布/发送流量实现了推广信息,推广信息得以覆盖自媒体平台的访问流量并实时触达用户。The source of the target article can come from any user terminal in the social network, breaking the limitation of the article collecting the specific topic, and the batching and automatic addition of the promotion information can be realized; the location of the promotion information is automatically selected, and the position is flexible. It can avoid the sudden emergence of promotion information, make the content of the article and the content of the promotion information natural; through the process of publishing the article and reaching the user, the delivery of the promotion information is completed, and the promotion information is promoted by relying on the release/send traffic of the self-media platform itself. The information is covered by the traffic from the media platform and reaches the user in real time.
附图说明DRAWINGS
图1A为本发明实施例提供的基于自媒体平台的文章处理方法的一个可选的应用场景示意图;1A is a schematic diagram of an optional application scenario of an article processing method based on a self-media platform according to an embodiment of the present invention;
图1B为本发明实施例提供的基于自媒体平台的文章处理方法的一个可选的应用场景示意图;FIG. 1B is a schematic diagram of an optional application scenario of an article processing method based on a self-media platform according to an embodiment of the present disclosure;
图2A为本发明实施例提供的推广文章的一种可选的呈现方式示意图;2A is a schematic diagram of an optional presentation manner of a promotion article according to an embodiment of the present invention;
图2B为本发明实施例提供的推广文章的一种可选的呈现方式示意图;2B is a schematic diagram of an optional presentation manner of a promotion article according to an embodiment of the present invention;
图2C为本发明实施例提供的推广文章的一种可选的呈现方式示意图;2C is a schematic diagram of an optional presentation manner of a promotion article according to an embodiment of the present invention;
图3为本发明实施例提供的基于自媒体平台的文章处理装置的一个可选的硬件结构示意图;FIG. 3 is a schematic structural diagram of an optional hardware of an article processing apparatus based on a self-media platform according to an embodiment of the present disclosure;
图4为本发明实施例提供的基于自媒体平台的文章处理方法的一个可选的流程示意图;4 is an optional schematic flowchart of an article processing method based on a self-media platform according to an embodiment of the present invention;
图5A为本发明实施例提供的利用关键字-主题分类器进行主题预测的示意图;FIG. 5A is a schematic diagram of performing topic prediction by using a keyword-topic classifier according to an embodiment of the present invention; FIG.
图5B为本发明实施例提供的利用文本-文本相似度分类器进行相似度计算的示意图;FIG. 5B is a schematic diagram of performing similarity calculation by using a text-to-text similarity classifier according to an embodiment of the present invention; FIG.
图5C为本发明实施例提供的利用图像-图像相似度分类器进行相似度 计算的示意图;FIG. 5C is a similarity diagram of an image-image similarity classifier according to an embodiment of the present invention; Schematic diagram of the calculation;
图5D为本发明实施例提供的利用文本-图像相似度分类器进行相似度计算的示意图;5D is a schematic diagram of performing similarity calculation by using a text-image similarity classifier according to an embodiment of the present invention;
图6A为本发明实施例提供的文字素材的一个可选的示意图;6A is an optional schematic diagram of a text material according to an embodiment of the present invention;
图6B为本发明实施例提供的文字素材的一个可选的示意图;FIG. 6B is an optional schematic diagram of a text material according to an embodiment of the present invention; FIG.
图7A为本发明实施例提供的在推广文章中添加推广信息的示意图;FIG. 7A is a schematic diagram of adding promotion information in a promotion article according to an embodiment of the present invention; FIG.
图7B为本发明实施例提供的在推广文章中添加推广信息的示意图;FIG. 7B is a schematic diagram of adding promotion information in a promotion article according to an embodiment of the present invention; FIG.
图7C为本发明实施例提供的在推广文章中添加推广信息的示意图;FIG. 7C is a schematic diagram of adding promotion information in a promotion article according to an embodiment of the present invention; FIG.
图8为本发明实施例提供的推广信息的示意图;FIG. 8 is a schematic diagram of promotion information according to an embodiment of the present invention;
图9A为本发明实施例提供的推广信息的显示方式的示意图;9A is a schematic diagram of a manner of displaying promotion information according to an embodiment of the present invention;
图9B为本发明实施例提供的推广信息的显示方式的示意图;9B is a schematic diagram of a manner of displaying promotion information according to an embodiment of the present invention;
图9C为本发明实施例提供的推广信息的显示方式的示意图;9C is a schematic diagram of a manner of displaying promotion information according to an embodiment of the present invention;
图10为本发明实施例提供的基于自媒体平台的文章处理方法的一个可选的流程示意图;FIG. 10 is an optional schematic flowchart of an article processing method based on a self-media platform according to an embodiment of the present disclosure;
图11为本发明实施例提供的基于自媒体平台的文章处理方法的一个可选的流程示意图;FIG. 11 is an optional schematic flowchart of an article processing method based on a self-media platform according to an embodiment of the present disclosure;
图12为本发明实施例提供的基于自媒体平台的文章处理装置的组成结构示意图。FIG. 12 is a schematic structural diagram of a composition processing apparatus based on a self-media platform according to an embodiment of the present invention.
具体实施方式Detailed ways
以下结合附图及实施例,对本发明进行进一步详细说明。应当理解,此处所提供的实施例仅仅用以解释本发明,并不用于限定本发明。另外,以下所提供的实施例是用于实施本发明的部分实施例,而非提供实施本发明的全部实施例,在不冲突的情况下,本发明实施例记载的技术方案可以任意组合的方式实施。The present invention will be further described in detail below with reference to the accompanying drawings and embodiments. It is to be understood that the examples are provided to illustrate the invention and not to limit the invention. In addition, the embodiments provided below are part of the embodiments for implementing the present invention, and do not provide all the embodiments for implementing the present invention. In the case of no conflict, the technical solutions described in the embodiments of the present invention may be combined in any combination. Implementation.
需要说明的是,在本发明实施例中,术语“包括”、“包含”或者其任 何其他变体意在涵盖非排他性的包含,从而使得包括一系列要素的方法或者装置不仅包括所明确记载的要素,而且还包括没有明确列出的其他要素,或者是还包括为实施方法或者装置所固有的要素。在没有更多限制的情况下,由语句“包括一个……”限定的要素,并不排除在包括该要素的方法或者装置中还存在另外的相关要素(例如方法中的步骤或者装置中的单元,这里的单元可以是部分电路、部分处理器、部分程序或软件等等)。It should be noted that, in the embodiments of the present invention, the terms "including", "including" or any of them are used. The other variations are intended to cover a non-exclusive inclusion, such that a method or apparatus that includes a plurality of elements includes not only the elements that are specifically described, but also other elements that are not explicitly listed, or The inherent elements. In the absence of further limitation, an element defined by the phrase "comprising a ..." does not exclude the presence of additional related elements in the method or device including the element (eg, a step in the method or a unit in the device) The unit here may be part of a circuit, part of a processor, part of a program or software, etc.).
例如,本发明实施例提供的基于自媒体平台的文章处理方法包含了一系列的步骤,但是本发明实施例提供的基于自媒体平台的文章处理方法不限于所记载的步骤,同样地,本发明实施例提供的基于自媒体平台的文章处理装置包括了一系列单元,但是本发明实施例提供的装置不限于包括所明确记载的单元,还可以包括为获取相关信息、或基于信息进行处理时所需要设置的单元。For example, the article processing method based on the self-media platform provided by the embodiment of the present invention includes a series of steps, but the article processing method based on the self-media platform provided by the embodiment of the present invention is not limited to the described steps, and the present invention is similarly The article processing device based on the self-media platform provided by the embodiment includes a series of units, but the device provided by the embodiment of the present invention is not limited to including the unit explicitly described, and may also include when the related information is acquired or processed based on the information. The unit that needs to be set.
对本发明实施例进行进一步详细说明之前,对本发明实施例中涉及的名词和术语进行说明,本发明实施例中涉及的名词和术语适用于如下的解释。Before the embodiments of the present invention are further described in detail, the nouns and terms involved in the embodiments of the present invention are explained. The nouns and terms involved in the embodiments of the present invention are applicable to the following explanations.
1)自媒体平台,也称为自媒体,互联网中设置的用于供用户(包括个人用户、团体和组织等)发布文章的信息平台,依赖于服务器以及在服务器部署的实现自媒体功能的相关软件(支持前端访问和后台处理);自媒体平台如微博、博客、个人网站、论坛社区和各种社交应用的公众号等。1) Self-media platform, also known as self-media, an information platform set up on the Internet for users (including individual users, groups, organizations, etc.) to publish articles, depending on the server and the implementation of the server deployment. Software (supports front-end access and background processing); self-media platforms such as Weibo, blogs, personal websites, forum communities, and public numbers for various social applications.
自媒体平台的账户可以是个人、组织、团体和企业等不同类型,通过注册自媒体的账户以后,在客户端提交的新闻、动态等与用户自身的偏好、动态或业务相关的文章,通过客户端经由自媒体平台推送到合适的用户。The accounts from the media platform can be different types of individuals, organizations, groups, and enterprises. After registering the account from the media, the news, dynamics, and other articles submitted by the client are related to the user's own preferences, dynamics, or business. The end is pushed to the appropriate user via the self-media platform.
2)文章,用于在自媒体平台发布的文章,文章的内容包括文字和图片的一种或组合。2) Articles for articles published on the self-media platform, the content of which includes one or a combination of text and images.
3)推广信息,针对推广对象进行宣传的适用于在互联网进行传播的各 种类型的信息,推广信息中所宣传的对象称为推广对象,例如广告。3) Promotion of information, for the promotion of the object of promotion, applicable to the spread of the Internet Types of information, the objects advertised in the promotion information are called promotion objects, such as advertisements.
4)词向量,利用词到向量的映射模型如词到向量(Word2Vector),根据不同词之间的语义的近似程度,将词映射到向量空间中而得到的向量,不同词向量之间的距离与对对应的词在语义上的近似度程度负相关,即两个词的词向量的距离(如欧式距离)越小,则这两个词的语义越接近。4) Word vector, using a word-to-vector mapping model such as word-to-vector (Word2Vector), based on the degree of semantic similarity between different words, the vector obtained by mapping words into vector space, the distance between different word vectors It is negatively related to the degree of semantic similarity of the corresponding words, that is, the smaller the distance of the word vectors of the two words (such as the Euclidean distance), the closer the semantics of the two words are.
5)主题特征,将表示主题的关键字映射成相应的词向量,并进行组合得到,也称主题特征向量。5) The topic feature, mapping the keywords representing the theme into corresponding word vectors, and combining them, also called theme feature vectors.
6)内容特征,将从文章中提取的多个特征词映射成相应的词向量,并进行组合得到,也称内容特征向量。6) Content features, mapping a plurality of feature words extracted from the article into corresponding word vectors, and combining them, also referred to as content feature vectors.
7)分词,又称为切词,按照一定的分词策略指的是将文章中的字符分割为单独的词。7) Word segmentation, also known as word segmentation, according to a certain word segmentation strategy refers to the division of characters in an article into separate words.
8)停用词,从文章中过滤的对文章的分类决策不会产生影响的词;通常通用词不具有明确意义(只有将其放入一个完整的句子中才有一定作用),例如,代词、冠词和数词、语气助词、副词、介词和连词等功能词。8) Stop words, words that are filtered from the article and do not affect the classification decision of the article; usually the general words do not have a clear meaning (only if they are put into a complete sentence), for example, pronouns , articles such as articles and numerals, modal particles, adverbs, prepositions and conjunctions.
9)特征词,对文章进行分词后,从文章中过滤停用词后,从剩余的词中提取得到的可以表示文章主题的词。9) Feature words, after the article is segmented, after filtering the stop words from the article, the words that can be extracted from the remaining words can represent the subject of the article.
10)分类器模型,也称为分类器,即通过机器学习的方式获得的用于分类的模型,用于根据文章的样本特征,预测文章是目标类别的文章的得分用以表示文章是目标类别的概率。10) a classifier model, also called a classifier, is a model for classification obtained by means of machine learning, and is used for predicting an article as a target category based on the sample characteristics of the article to indicate that the article is a target category. The probability.
例如,本文中分类器模型可以采用支持向量机(SVM,Support Vector Machines)的二分类器模型、基于词袋的分类器模型、基于先验概率和稀疏特征的分类器模型、基于神经网络和深度学习的分类器模型等类别的分类器模型,如无特别说明,本文中所记载的分类器模型用于二分类,如判断是否属于一个主题,判断文章是否属于目标类别。For example, the classifier model in this paper can use the two-classifier model of Support Vector Machines (SVM), the word bag-based classifier model, the classifier model based on prior probability and sparse features, based on neural network and depth. The classifier model of the classifier model such as learning, if not specified, the classifier model described in this paper is used for two classifications, such as judging whether it belongs to a topic, and determining whether the article belongs to the target category.
11)机器学习(Machine Learning),通过对训练集的文章样本(简称为 样本)进行样本特征和是否属于目标类别(如美妆类文章)的标记,对分类器模型进行训练,使训练后的分类器模型具有对测试集的文章样本判断是否属于目标类别的性能。11) Machine Learning, through the sample of the training set (referred to as The sample is trained on the sample features and whether it belongs to the target category (such as beauty articles), and the classifier model is trained so that the trained classifier model has the performance of determining whether the article sample of the test set belongs to the target category.
12)训练集,包括训练分类器模型的文章,文章的向量表示和先验的分类结果用于构造训练样本以训练分类器模型,使分类器模型具有对待测试文章就目标类别进行二分类的性能。12) The training set, including the article that trains the classifier model, the vector representation of the article and the prior classification results are used to construct the training samples to train the classifier model, so that the classifier model has the performance of classifying the target category by the article to be tested. .
13)测试集,包括待测试(分类)的文章,文章的向量表示用于输入分类器模型以预测属于目标类别的得分。13) A test set comprising articles to be tested (classified), the vector representation of the article being used to input a classifier model to predict scores belonging to the target category.
本发明实施例提供基于自媒体平台的文章处理方法、实施基于自媒体平台的文章处理方法的基于自媒体平台的文章处理装置、以及存储用于实现基于自媒体平台的文章处理方法的可执行程序的存储介质。就基于自媒体平台的文章处理方法的实施而言,本发明实施例提供终端侧实施和服务器侧实施的方案,接下来将对文章处理的示例性实施场景进行说明。Embodiments of the present invention provide an article processing method based on a self media platform, an article processing device based on a self media platform based on an article processing method based on a self media platform, and an executable program for implementing an article processing method based on a self media platform. Storage medium. For the implementation of the article processing method based on the self-media platform, the embodiment of the present invention provides a solution implemented by the terminal side and the server side. Next, an exemplary implementation scenario of the article processing will be described.
图1A及图1B为本发明实施例提供的基于自媒体平台的文章处理方法的可选的应用场景示意图,如图1A、图1B所示,在本发明实施例中,用户终端不限于手机、平板电脑、PC机等类型,服务器可以采用任何商用或专用的服务器,在本发明实施例中,基于服务器实现的功能的不同,将其划分为两类,分别为社交网络服务器21及广告后台服务器22,而在实际应用中,每类服务器均可以依据实际情况设置一个或多个。用户终端11至用户终端15可通过有线网络、无线网络或二者的组合与社交网络服务器21及广告后台服务器22进行信息交互,各个用户终端之间可以通过服务器进行信息(如文章)收发、广告投放等。以下结合图1A及图1B对本发明实施例的文章处理方法进行说明,需要说明的是,图1A及图1B所示的网络仅仅是一种示例,以便于理解,而不对本发明的网络架构构成任何限制。1A and FIG. 1B are schematic diagrams of an optional application scenario of an article processing method based on a self-media platform according to an embodiment of the present invention. As shown in FIG. 1A and FIG. 1B, in the embodiment of the present invention, the user terminal is not limited to a mobile phone. For a tablet, a PC, or the like, the server may be any commercial or dedicated server. In the embodiment of the present invention, based on the functions implemented by the server, the server is divided into two categories, namely, a social network server 21 and an advertisement back server. 22, and in practical applications, each type of server can be set according to the actual situation one or more. The user terminal 11 to the user terminal 15 can exchange information with the social network server 21 and the advertisement background server 22 through a wired network, a wireless network, or a combination of the two, and each user terminal can send and receive information (such as an article) through the server. Delivery, etc. The article processing method of the embodiment of the present invention will be described below with reference to FIG. 1A and FIG. 1B. It should be noted that the network shown in FIG. 1A and FIG. 1B is merely an example for easy understanding and does not constitute the network architecture of the present invention. Any restrictions.
参见图1A,在一些实施例中,广告后台服务器22从广告主终端处获 取携带推广信息(如广告)的推广文章,然后将推广文章发送至社交网络服务器21,以通过社交网络服务器21将携带推广信息的推广文章发送至社交网络,使得社交网络用户得以接收和阅读携带推广信息的推广文章。Referring to FIG. 1A, in some embodiments, the advertisement backend server 22 obtains from the advertiser terminal. Taking the promotion article carrying the promotion information (such as advertisement), and then sending the promotion article to the social network server 21 to send the promotion article carrying the promotion information to the social network through the social network server 21, so that the social network user can receive and read the carrier. Promote promotional articles for information.
其中,上述携带推广信息的推广文章可以为,广告商为广告主针对特定人群(如公众号用户)或特定商品(如某指定洗发水)、针对每个商品或特定人群逐个撰写用于推广的文章(也称为软文),可以包括文本和/或图片的形式,将广告和文章内容融合在一起。The promotion article carrying the promotion information may be that the advertiser writes the advertisement for the specific group (such as the public number user) or the specific product (such as a designated shampoo), for each product or a specific group of people for promotion. Articles (also known as soft texts) can be in the form of text and/or images that combine advertising and article content.
以社交网络服务器21承载公众号功能为例,参见图2A,图2A为本发明实施例提供的推广文章的一种可选的呈现方式示意图,广告商为广告主针对公众号用户(即通过社交网络客户端关注了公众号的用户)进行软文撰写,然后广告主将软文发送至广告后台服务器22,广告后台服务器22通过社交网络服务器21将软文发送至关注公众号的用户的终端,如图2A所示,用户在访问公众号的过程中,通过点击界面1中任意位置跳转到界面2,既看到了自身关注的内容,也了解了广告主发布的广告。For example, FIG. 2A is a schematic diagram of an optional presentation manner of a promotion article provided by an advertiser according to an embodiment of the present invention. The advertiser is an advertiser for the public number user (ie, through social interaction). The web client pays attention to the user of the public number to write the soft text, and then the advertiser sends the soft text to the advertisement background server 22, and the advertisement background server 22 sends the soft text to the terminal of the user who pays attention to the public number through the social network server 21, as shown in FIG. 2A. In the process of accessing the public number, the user jumps to the interface 2 by clicking anywhere in the interface 1, and both sees the content of his own attention and the advertisement of the advertiser.
然而,上述实现方式的人力成本很高,需要针对不同的广告主撰写相应的文章,无法做到规模化和批量化,同时由于专门撰写软文导致无法满足实时推广的需求。However, the labor cost of the above implementation method is very high, and it is necessary to write corresponding articles for different advertisers, and it is impossible to achieve scale and batchization, and at the same time, the demand for real-time promotion cannot be met due to the special writing of soft texts.
参见图1A,在一些实施例中,社交网络服务器21从用户处获取针对特定主题征集的文章,然后将征集的文章发送至广告后台服务器22,广告后台服务器22在征集的文章中添加推广信息(如广告),通过社交网络服务器21将添加有推广信息的文章发送至社交网络,使得社交网络用户得以接收和阅读添加推广信息的推广文章。Referring to FIG. 1A, in some embodiments, the social network server 21 obtains articles for a particular topic collection from a user, and then sends the collected articles to an advertisement backend server 22, which adds promotional information to the collected articles ( For example, an advertisement is sent to the social network through the social network server 21, so that the social network user can receive and read the promotion article with the promotion information.
以社交网络服务器21承载公众号功能为例,参见图2B,图2B为本发明实施例提供的推广文章的一种可选的呈现方式示意图,公众号发起特定主题(如情人节)的征文活动,承载公众号功能的社交网络服务器21获得 参与该活动的用户发送的文章,将获得的文章发送至广告后台服务器22,广告后台服务器22在文章的文末添加广告,得到添加广告的推广文章,将推广文章发送至社交网络服务器21,通过公众号将推广文章发送至社交网络,社交网络用户通过点击界面1中任意位置跳转到界面2,进行文章内容及添加广告的阅读。For example, FIG. 2B is a schematic diagram of an optional presentation manner of a promotion article according to an embodiment of the present invention. The public number initiates an essay activity on a specific theme (such as Valentine's Day). , the social network server 21 that carries the public number function is obtained The article sent by the user participating in the activity sends the obtained article to the advertisement background server 22, and the advertisement background server 22 adds an advertisement at the end of the article, obtains a promotion article for adding the advertisement, and sends the promotion article to the social network server 21 through the public. The number will send the promotion article to the social network, and the social network user jumps to the interface 2 by clicking anywhere in the interface 1 to read the article content and add the advertisement.
然而,上述实现方式的文章来源由于依赖于向用户征集文章因而存在较大的局限性,仅限于征集的特定主题的用户原创内容(UGC,User Generated Content)文章,且广告内容以展示为主,用户无法针对展示的广告进一步进行操作,降低了用户对广告产品的购买率及了解欲望。However, the source of the above-mentioned implementations has a large limitation due to the fact that the article is collected from the user, and is limited to the user-generated content (UGC) article of the specific topic collected, and the advertisement content is mainly displayed. Users are unable to further manipulate the displayed ads, reducing the user’s desire to purchase ads and understanding desires.
参见图1A,在一些实施例中,社交网络服务器21从用户处获取待发布的文章,然后将获取的文章发送至广告后台服务器22,广告后台服务器22在文章的特定位置添加与文章主题相关的推广信息(如广告)得到推广文章,通过社交网络服务器21将推广文章发送至社交网络。Referring to FIG. 1A, in some embodiments, the social network server 21 obtains an article to be published from a user, and then transmits the obtained article to an advertisement backend server 22, which adds a topic related to the article topic at a specific location of the article. The promotion information (such as an advertisement) is promoted and the promotion article is sent to the social network through the social network server 21.
以社交网络服务器21承载新闻发布平台功能为例,参见图2C,图2C为本发明实施例提供的推广文章的一种可选的呈现方式示意图,社交网络服务器21从新闻发布平台的运营用户终端侧获取待发布的文章,然后将获取的文章发送至广告后台服务器22,广告后台服务器22在文章的结束添加与文章主题相关的广告得到推广文章,将推广文章发送至社交网络服务器21,通过新闻发布平台将推广文章发送至社交网络,用户通过点击界面1中任意位置跳转到界面2,得以看到添加了广告的推广文章。For example, FIG. 2C is a schematic diagram of an optional presentation manner of the promotion article provided by the social network server 21, and the social network server 21 is operated from the operation user terminal of the news release platform. The side obtains the article to be published, and then sends the obtained article to the advertisement background server 22, and the advertisement background server 22 adds the advertisement related to the article topic to the promotion article at the end of the article, and sends the promotion article to the social network server 21 through the news. The publishing platform sends the promotion article to the social network, and the user jumps to the interface 2 by clicking anywhere in the interface 1, and can see the promotion article with the added advertisement.
然而,上述实现方式中,虽然文章中添加的广告与文章主题的契合度较高,然而由于添加位置固定,使得广告与文章内容的结合生硬,出现推广信息与文章中内容关联性不高甚至毫无关联的情况,降低了推广信息的接受度。However, in the above implementation manner, although the advertisement added in the article has a high degree of fit with the article theme, the combination of the advertisement and the article content is hard due to the fixed location, and the promotion information and the content in the article are not highly correlated or even Unrelated situations reduce the acceptance of promotional information.
参见图1B,在一些实施例中,本发明实施例基于自媒体平台的文章处 理方法的实现可以包括:承载有自媒体平台功能的社交网络服务器21接收自媒体平台的用户通过连接自媒体平台的第一客户端提交的待发布文章,将待发布文章发送给广告后台服务器22;广告后台服务器22在待发布文章中,确定用于呈现推广信息的推广文章、以及推广文章中用于添加推广信息的推广位置;社交网络服务器21确定候选推广对象中与推广文章匹配的目标推广对象、以及与目标推广对象匹配的素材;根据确定的推广位置,添加包括素材的推广信息至推广文章的推广位置;将添加有推广信息的推广文章发送给社交网络服务器21,社交网络服务器21发送添加推广信息的推广文章至自媒体平台的第二客户端进行呈现。Referring to FIG. 1B, in some embodiments, embodiments of the present invention are based on articles from a media platform. The implementation of the method may include: the social network server 21 carrying the self-media platform function receives the to-be-published article submitted by the user of the media platform through the first client connected to the media platform, and sends the to-be-published article to the advertisement background server 22 The advertisement background server 22 determines, in the article to be published, a promotion article for presenting the promotion information, and a promotion location for adding the promotion information in the promotion article; the social network server 21 determines the target promotion of the candidate promotion object that matches the promotion article. The object and the material matching the target promotion object; adding the promotion information including the material to the promotion location of the promotion article according to the determined promotion location; and sending the promotion article with the promotion information to the social network server 21, and the social network server 21 sends The promotion article of the promotion information is added to the second client of the media platform for presentation.
参见图1B,在一些实施例中,本发明实施例基于自媒体平台的文章处理方法的实现可以包括:承载有自媒体平台功能的社交网络服务器21接收自媒体平台的用户通过连接自媒体平台的第一客户端提交的目标文章;社交网络服务器21从广告后台服务器22处获得候选推广对象或者获得自身存储的候选推广对象,而自身存储了候选推广对象的素材,在目标文章中确定用于呈现推广信息的推广文章、以及推广文章中用于添加推广信息的推广位置;社交网络服务器21确定候选推广对象中与推广文章匹配的目标推广对象、以及与目标推广对象匹配的素材;根据确定的推广位置,添加包括素材的推广信息至推广文章的推广位置;发送添加推广信息的推广文章至自媒体平台的第二客户端进行呈现。其中,客户端提交的目标文章可以包括待发布文章及原始文章;这里的原始文章指已经通过自媒体平台被发布过又被撤回的文章。Referring to FIG. 1B, in some embodiments, the implementation of the article processing method based on the self-media platform of the embodiment of the present invention may include: the social network server 21 carrying the self-media platform function receives the user from the media platform by connecting to the media platform. The target article submitted by the first client; the social network server 21 obtains the candidate promotion object from the advertisement background server 22 or obtains the candidate promotion object stored by itself, and stores the material of the candidate promotion object itself, and determines the presentation for the presentation in the target article. a promotion article of the promotion information, and a promotion location for adding the promotion information in the promotion article; the social network server 21 determines the target promotion object matching the promotion article in the candidate promotion object, and the material matching the target promotion object; The location, adding the promotion information including the material to the promotion position of the promotion article; sending the promotion article adding the promotion information to the second client from the media platform for presentation. Among them, the target article submitted by the client may include the article to be published and the original article; the original article here refers to the article that has been released and retracted through the self-media platform.
接下来根据图3说明实现本发明实施例的基于自媒体平台的文章处理方法对应的装置的示例性的硬件结构,基于自媒体平台的文章处理装置可以以各种形式来实施,例如终端(如台式机电脑、笔记本电脑或智能手机)、服务器等各种类型的计算机设备,由终端、服务器等计算机设备采用独立 或协同的方式实现本发明实施例的基于自媒体平台的文章处理方法。下面对本发明实施例的基于自媒体平台的文章处理装置的硬件结构做详细说明,可以理解,图3仅仅示出了基于自媒体平台的文章处理装置的示例性结构而非全部结构,根据需要可以实施图3示出的部分结构或全部结构。Next, an exemplary hardware structure of an apparatus corresponding to the self-media platform-based article processing method for implementing an embodiment of the present invention is described with reference to FIG. 3, and the article processing apparatus based on the self-media platform may be implemented in various forms, such as a terminal (eg, Various types of computer equipment, such as desktop computers, laptops or smart phones), servers, etc., are independent of computer equipment such as terminals and servers. The self-media platform-based article processing method of the embodiment of the present invention is implemented in a coordinated manner. The hardware structure of the article processing apparatus based on the self-media platform of the embodiment of the present invention is described in detail below. It can be understood that FIG. 3 only shows an exemplary structure of the article processing apparatus based on the self-media platform, and not all the structures, as needed. Part or all of the structure shown in Fig. 3 is implemented.
参见图3,图3为本发明实施例提供的基于自媒体平台的文章处理装置的一个可选的硬件结构示意图,可以应用于前述应用场景中的服务器,如可以为微博/微信的后台服务器;自媒体网站的后台服务器,图3所示的文章处理装置100包括:至少一个处理器101、存储器102、至少一个网络接口103。文章处理装置100中的各个组件通过总线系统104耦合在一起。可以理解,总线系统104用于实现这些组件之间的连接通信。总线系统104除包括数据总线之外,还包括电源总线、控制总线和状态信号总线。但是为了清楚说明起见,在图3中将各种总线都标为总线系统104。Referring to FIG. 3, FIG. 3 is a schematic diagram of an optional hardware structure of an article processing apparatus based on a self-media platform according to an embodiment of the present disclosure, which can be applied to a server in the foregoing application scenario, such as a background server that can be a microblog/WeChat. From the background server of the media website, the article processing apparatus 100 shown in FIG. 3 includes at least one processor 101, a memory 102, and at least one network interface 103. The various components in article processing device 100 are coupled together by bus system 104. It will be appreciated that the bus system 104 is used to implement connection communication between these components. The bus system 104 includes, in addition to the data bus, a power bus, a control bus, and a status signal bus. However, for clarity of description, various buses are labeled as bus system 104 in FIG.
其中,存储器102可以是易失性存储器或非易失性存储器,也可包括易失性和非易失性存储器两者。The memory 102 may be a volatile memory or a non-volatile memory, and may also include both volatile and non-volatile memory.
本发明实施例中的存储器102用于存储各种类型的数据以支持基于自媒体平台的文章处理装置100的操作。这些数据的示例包括:用于在基于自媒体平台的文章处理装置100上操作的任何计算机程序,如可执行程序1021,实现本发明实施例的基于自媒体平台的文章处理方法的程序可以包含在可执行程序1021中。The memory 102 in the embodiment of the present invention is used to store various types of data to support the operation of the article processing apparatus 100 based on the self media platform. Examples of such data include: any computer program for operating on the self-media platform-based article processing apparatus 100, such as the executable program 1021, the program implementing the self-media platform-based article processing method of the embodiment of the present invention may be included in It can be executed in the program 1021.
网络接口103可以包括一个或多个通信模块,如包括移动通信模块及无线互联网模块。 Network interface 103 may include one or more communication modules, such as a mobile communication module and a wireless internet module.
本发明实施例揭示的基于自媒体平台的文章处理方法可以应用于处理器101中,或者由处理器101实现。处理器101可能是一种集成电路芯片,具有信号的处理能力。在实现过程中,本发明实施例方法的各步骤可以通过处理器101中的硬件的集成逻辑电路或者软件形式的指令完成。上述的 处理器101可以是通用处理器、数字信号处理器(DSP,Digital Signal Processor),或者其他可编程逻辑器件、分立门或者晶体管逻辑器件、分立硬件组件等。处理器101可以实现或者执行本发明实施例中的公开的各方法、步骤及逻辑框图。通用处理器可以是微处理器或者任何常规的处理器等。结合本发明实施例所公开的方法的步骤,可以直接体现为硬件译码处理器执行完成,或者用译码处理器中的硬件及软件模块组合执行完成。软件模块可以位于存储介质中,该存储介质位于存储器102,处理器101读取存储器102中的信息,结合其硬件完成本发明实施例提供的上述基于自媒体平台的文章处理方法的步骤。The article processing method based on the self-media platform disclosed in the embodiment of the present invention may be applied to the processor 101 or implemented by the processor 101. Processor 101 may be an integrated circuit chip with signal processing capabilities. In the implementation process, the steps of the method in the embodiment of the present invention may be completed by using an integrated logic circuit of hardware in the processor 101 or an instruction in a software form. abovementioned The processor 101 can be a general purpose processor, a digital signal processor (DSP), or other programmable logic device, discrete gate or transistor logic device, discrete hardware component, or the like. The processor 101 can implement or perform the various methods, steps, and logic blocks disclosed in the embodiments of the present invention. A general purpose processor can be a microprocessor or any conventional processor or the like. The steps of the method disclosed in the embodiment of the present invention may be directly implemented as a hardware decoding processor, or may be performed by a combination of hardware and software modules in the decoding processor. The software module may be located in a storage medium, and the storage medium is located in the memory 102. The processor 101 reads the information in the memory 102, and completes the steps of the above-described self-media platform-based article processing method provided by the embodiment of the present invention.
在示例性实施例中,基于自媒体平台的文章处理装置100可以被一个或多个应用专用集成电路(ASIC,Application Specific Integrated Circuit)、DSP、可编程逻辑器件(PLD,Programmable Logic Device)、复杂可编程逻辑器件(CPLD,Complex Programmable Logic Device),用于执行本发明实施例的基于自媒体平台的文章处理方法。In an exemplary embodiment, the article processing apparatus 100 based on the self-media platform may be configured by one or more Application Specific Integrated Circuits (ASICs), DSPs, Programmable Logic Devices (PLDs), and complexities. A Programmable Logic Device (CPLD) is used to execute the self-media platform-based article processing method of the embodiment of the present invention.
基于上述基于自媒体平台的文章处理方法的应用场景及基于自媒体平台的文章处理装置,接下来对本发明实施例的基于自媒体平台的文章处理方法的实现过程进行说明。Based on the application scenario of the article processing method based on the self-media platform and the article processing device based on the self-media platform, the implementation process of the article processing method based on the self-media platform according to the embodiment of the present invention is described.
作为上述基于自媒体平台的文章处理方法的一个可选实施例,本发明实施例可将基于自媒体平台的文章处理装置实施为信息推广平台功能的广告后台服务器进行说明,广告后台服务器通过社交网络服务器接收来自自媒体平台客户端的待发布文章,确定待发布文章中的推广文章,获取候选推广对象信息,确定目标推广对象,以及与目标推广对象对应的素材,并生成包括素材的推广信息,确定推广文章中呈现推广信息的推广位置,将推广信息添加至推广文章中的推广位置,并将添加了推广信息的推广文章,通过社交网络服务器发送至自媒体平台的客户端呈现。当然,基于自媒体 平台的文章处理装置也可以实施到其他应用环境中,例如新闻推送APP的后台服务器、自媒体网站的后台服务器、信息推广平台的服务器等,本文不排除基于自媒体平台的文章处理装置实施为提供自媒体平台功能的任意应用环境。As an optional embodiment of the article processing method based on the self-media platform, the embodiment of the present invention can be implemented as an advertisement background server implemented by the article processing device based on the self-media platform as an information promotion platform function, and the advertisement background server passes the social network. The server receives the to-be-published article from the self-media platform client, determines the promotion article in the article to be published, obtains the candidate promotion object information, determines the target promotion object, and the material corresponding to the target promotion object, and generates promotion information including the material, and determines Promote the promotion location of the promotion information in the article, add the promotion information to the promotion location in the promotion article, and add the promotion article of the promotion information to the client presentation of the self-media platform through the social network server. Of course, based on self-media The article processing device of the platform can also be implemented in other application environments, such as a background server of a news push APP, a background server of a media website, a server of an information promotion platform, etc., and the article processing device based on the self media platform is not excluded from being provided. Any application environment from the media platform function.
作为上述基于自媒体平台的文章处理方法的另一个可选实施例,图4示出了本发明实施例提供的基于自媒体平台的文章处理方法的一个可选的流程示意图,参见图4,本发明实施例将以基于自媒体平台的文章处理装置实施为部署自媒体平台功能的社交网络服务器进行说明,本发明实施例提供的基于自媒体平台的文章处理方法包括:As an alternative embodiment of the article processing method based on the self-media platform, FIG. 4 is a schematic flowchart of an article processing method based on the self-media platform provided by the embodiment of the present invention. The embodiment of the present invention will be described as a social network server deployed from a media platform based on a self-media platform. The article processing method based on the self-media platform provided by the embodiment of the present invention includes:
步骤301:发布者通过自媒体平台的第一客户端发送待发布文章给社交网络服务器。Step 301: The publisher sends the to-be-published article to the social network server by using the first client from the media platform.
待发布文章可以为待于自媒体平台(如微博、公众号、论坛社区等)发布的文章。Articles to be published may be articles to be published on self-media platforms (such as Weibo, public number, forum community, etc.).
社交网络服务器承载有自媒体平台(如微博、公众号、QQ空间、论坛社区等)功能,社交网络可以为基于微博、公众号等可以供用户发布文章的互联网社交实体,文章的发布者可以为社交网络中的任意用户,用户基于终端上的自媒体平台的客户端完成文章的撰写后,将待发布的文章发送至社交网络服务器,以通过社交网络服务器发送至自媒体平台的客户端进行呈现。如此,利用自媒体平台自发的关于文章的流量作为推广信息的载体,即文章的来源可以来自社交网络中任意一个用户终端,打破了靠征集特定主题的文章的局限性。The social network server carries the functions of a self-media platform (such as Weibo, public number, QQ space, forum community, etc.), and the social network may be an Internet social entity based on Weibo, public number, etc., which can be used by users to publish articles, and the publisher of the article. For any user in the social network, after the user completes the writing of the article based on the client of the self-media platform on the terminal, the article to be published is sent to the social network server for sending to the client of the self-media platform through the social network server. Present it. In this way, the self-media platform's spontaneous traffic about the article is used as the carrier of the promotion information, that is, the source of the article can come from any user terminal in the social network, breaking the limitation of the article collecting the specific topic.
示例性地,公众号运营用户作为文章的发布者,可以通过客户端提交文章,例如可以是用户撰写的关于心情、美食、化妆等主题进行分享的文章,也可以是从互联网中下载的与上述主题相关的文章,将待发布文章由客户端发送至承载有公众号功能的社交网络服务器,以通过社交网络服务 器将文章发送至关注公众号的用户。Exemplarily, the public number operation user as the publisher of the article may submit the article through the client, for example, may be an article written by the user about the theme of mood, food, makeup, etc., or may be downloaded from the Internet and the above. Topic-related articles, which will be sent by the client to the social network server hosting the public number function to serve through the social network service. Send the article to the user who is following the public number.
再如,微博用户作为发布者,待发布文章通过微博客户端发送至承载有微博功能的社交网络服务器,社交网络服务器将文章发送关注发布者的微博账号的用户。For another example, the Weibo user acts as a publisher, and the article to be published is sent to the social network server hosting the Weibo function through the Weibo client, and the social network server sends the article to the user who pays attention to the publisher's Weibo account.
在一个实施例中,社交网络服务器接收了发布者终端发送的待发布文章后,可以直接将待发布文章发送至社交网络的其他用户,也可以对待发布文章进行推广信息的合成处理,以进行合成有推广信息的文章的发布。In an embodiment, after receiving the article to be published sent by the publisher terminal, the social network server may directly send the to-be-published article to other users of the social network, or may perform the synthesis process of the promotion information for publishing the article to perform the synthesis. The publication of articles with promotional information.
下面结合步骤302至步骤305,对社交网络服务器对待发布文章进行推广信息的合成处理进行说明。The following takes the steps 302 to 305 to describe the synthesis processing of the promotion information of the social network server to be published.
步骤302:社交网络服务器在待发布文章中,确定用于呈现推广信息的推广文章、以及推广文章中用于添加推广信息的位置。Step 302: The social network server determines, in the article to be published, a promotion article for presenting the promotion information, and a location for promoting the promotion information in the promotion article.
当推广信息为广告时,其中可能存在不适合进行广告添加的文章,如学术性文章,因此需要对待发布文章进行筛选,确定能够呈现(添加)广告的文章,即推广文章。When the promotion information is an advertisement, there may be articles that are not suitable for advertisement addition, such as academic articles, so it is necessary to filter the articles to be published, and determine an article capable of presenting (adding) advertisements, that is, promotion articles.
在一些实施例中,社交网络服务器可以通过广告后台服务器获取候选推广对象,且自身存储了候选推广对象的素材信息,社交网络服务器可以通过如下方式确定用于呈现推广信息的推广文章:In some embodiments, the social network server may acquire the candidate promotion object through the advertisement background server, and itself stores the material information of the candidate promotion object, and the social network server may determine the promotion article for presenting the promotion information by:
将待发布文章的主题与候选推广对象表示的主题进行主题相似度计算,将满足主题相似度条件(如相似度超过预设主题相似度阈值)的待发布文章确定为用于呈现推广信息的推广文章。The topic similarity degree is calculated by the topic of the article to be published and the topic represented by the candidate promotion object, and the article to be published that satisfies the topic similarity condition (such as the similarity exceeds the preset topic similarity threshold) is determined as the promotion for presenting the promotion information. article.
在一实施例中,分别将表征文章及候选推广对象的主题的特征词输入预设的学习模型(如word2vec),经该学习模型映射得到对应文章主题及候选推广对象的主题特征向量,然后计算文章的主题特征向量与候选推广对象的主题特征向量之间的相似度,选取相似度超过相似度阈值的文章作为推广文章。例如,可通过如下方式实现待发布文章的主题特征与候选推广 对象的主题特征的主题相似度计算:将从待发布文章提取的关键词,输入根据特征词进行主题分类的分类器模型,获得分类器模型计算输出的待发布文章对应的主题;将从候选推广对象的素材提取的关键词,输入根据特征词进行主题分类的分类器模型,获得候选推广对象对应的主题;根据待发布文章所对应主题与候选推广对象所对应主题的语义距离,确定与语义距离负相关关系的主题相似度。例如,计算发布文章所对应主题向量与候选推广对象所对应主题向量的欧式距离,然后计算其倒数,作为主题相似度。In an embodiment, the feature words representing the topic of the article and the candidate promotion object are respectively input into a preset learning model (such as word2vec), and the theme feature vector of the corresponding article theme and the candidate promotion object is obtained through the learning model mapping, and then calculated. The similarity between the topic feature vector of the article and the topic feature vector of the candidate promotion object, and the article whose similarity exceeds the similarity threshold is selected as the promotion article. For example, the topic features and candidate promotion of the article to be published can be achieved as follows: Subject similarity calculation of the subject feature of the object: input the keyword extracted from the article to be published, input the classifier model according to the feature word, and obtain the topic corresponding to the article to be published calculated by the classifier model; The keyword extracted from the material of the object is input into a classifier model for classifying the topic according to the feature word, and the topic corresponding to the candidate promotion object is obtained; and the semantic distance is determined according to the semantic distance of the topic corresponding to the article to be published and the topic corresponding to the candidate promotion object. The topic similarity of negative correlations. For example, calculating the Euclidean distance of the topic vector corresponding to the published article and the topic vector corresponding to the candidate promotion object, and then calculating the reciprocal thereof as the topic similarity.
在一些实施例中,可依据预设的分类标准对文章及候选推广对象的主题进行等级设置,如按照等级层次由高到低设置为第一级主题、第二级主题及第三级主题;其中,每个等级的主题可以包括其下一级的多个主题,举例说明:第一级主题可以为军事、体育、娱乐、财经,以第一级主题为娱乐为例,其可以包括美食、旅游、电影和音乐等多个第二级主题,以第二级主题为音乐为例,其可以包括爵士音乐、古典音乐等多个第三级主题。In some embodiments, the articles and the topics of the candidate promotion objects may be level-set according to a preset classification standard, such as a first-level theme, a second-level theme, and a third-level theme according to a hierarchical level from high to low; The theme of each level may include multiple topics of the next level. For example, the first level theme may be military, sports, entertainment, finance, and the first level theme is entertainment, which may include food, There are a number of second-level themes such as travel, movies, and music. The second-level theme is music. For example, it can include multiple third-level themes such as jazz music and classical music.
相应的,在一实施例中,社交网络服务器将待发布文章的主题划分到对应的第一级主题,计算待发布文章的第一级主题与候选推广对象的第一级主题的相似度,当相似度满足相似度条件(例如超出第一级主题的主题相似度阈值)时,确定待发布文章为用于呈现推广信息的推广文章。也就是说,基于第一级主题特征对待发布文章进行粗略筛选,得到推广文章,然而在另一些实施例中,亦可基于第二级主题或第三级主题对文章进行筛选得到推广文章,但由于第一级主题包括多个第二级主题及多个第三级主题,也即第一级主题对应的特征向量维度低于第二/第三级主题对应的特征向量的维度,而社交网络服务器接收的自媒体平台中的文章流量很大,因此采用第一级主题进行文章的筛选显然要比基于第二/第三级主题进行筛选速度要快,即快速判断出可以添加推广信息的推广文章,减少网络延迟。 Correspondingly, in an embodiment, the social network server divides the topic of the article to be published into the corresponding first-level topic, and calculates the similarity between the first-level topic of the article to be published and the first-level topic of the candidate promotion object. When the similarity satisfies the similarity condition (for example, the topic similarity threshold exceeding the first-level topic), it is determined that the article to be published is a promotion article for presenting the promotion information. That is to say, based on the first-level topic feature, the article is roughly screened to obtain the promotion article, but in other embodiments, the article may be filtered based on the second-level topic or the third-level topic to obtain the promotion article, but Since the first level topic includes a plurality of second level topics and a plurality of third level topics, that is, the feature vector dimension corresponding to the first level theme is lower than the dimension of the feature vector corresponding to the second/third level theme, and the social network The traffic received by the server from the media platform is very large. Therefore, the screening of the article using the first-level theme is obviously faster than the filtering based on the second/third-level theme, that is, the promotion of the promotion information can be quickly determined. Article to reduce network latency.
在一实施例中,将待发布文章的主题划分到第一级主题的方式与划分到第二级主题、第三级主题的方式类似,均可通过对应等级的分类器实现,作为使用分类器的示例,通过提取文章的特征(文本特征、图像特征至少之一),将提取的特征通过预设的学习模型映射得到相应的特征向量,将得到的文章的特征向量输入不同等级的分类器,映射得到相应等级的主题;例如将提取的文章的特征词输入预设的word2vec模型,得到相应的多个维度的特征向量,将得到的多个维度的特征向量输入第一级分类器,得到对应的第一级主题。例如,将得到的文章的多个维度的特征向量输入第一级分类器后输出得到的第一级主题为体育的概率为10%、为娱乐的概率为80%、为财经的概率为5%,选取概率最高的“娱乐”作为最终的第一级主题。In an embodiment, the manner in which the topic of the article to be published is divided into the first-level topic is similar to the manner of dividing into the second-level theme and the third-level theme, and can be implemented by using a classifier of the corresponding level as a classifier. For example, by extracting features of the article (at least one of the text feature and the image feature), the extracted feature is mapped to a corresponding feature vector through a preset learning model, and the feature vector of the obtained article is input into a classifier of a different level. The mapping obtains the topic of the corresponding level; for example, input the feature word of the extracted article into the preset word2vec model, obtain the corresponding feature vector of multiple dimensions, and input the obtained feature vector of the plurality of dimensions into the first classifier to obtain the corresponding The first level theme. For example, if the feature vector of the plurality of dimensions of the obtained article is input to the first-level classifier, the probability of the first-level theme is 10% for sports, 80% for entertainment, and 5% for financial. The most probable "entertainment" is selected as the final first-level theme.
在一些实施例中,不同等级分类器的获取可通过以下方式之一得到:In some embodiments, the acquisition of different ranks of classifiers can be obtained in one of the following ways:
1)有监督的学习方法,如采用人工标注文本和/或图片对应若干个主题,并用标注数据的特征(文本和/或图片特征)训练一个特定等级的特征-主题分类器模型,通过训练得到的特征-主题分类器实现相应等级的主题的映射。2)无监督的学习方法,对文章的文本和/或图片特征进行聚类,得到对应文章的主题。1) Supervised learning methods, such as manually annotating text and/or pictures corresponding to several topics, and training a specific level of feature-thematic classifier model with the characteristics of the annotated data (text and/or picture features), obtained through training The feature-theme classifier implements a mapping of the corresponding level of topics. 2) Unsupervised learning method, clustering the text and/or picture features of the article to get the theme of the corresponding article.
候选推广对象可以为从推广系统(如广告后台服务器)处获得的优先级排序(如竞价排名)在前的一定数量的推广对象。The candidate promotion object may be a certain number of promotion objects prior to the prioritization (such as the auction ranking) obtained from the promotion system (such as the advertisement back-end server).
在一些实施例中,社交网络服务器还可以通过如下方式确定用于呈现推广信息的推广文章:In some embodiments, the social network server may also determine a promotional article for presenting promotional information by:
社交网络服务器将待发布文章的主题,与候选推广对象的名称、所属类别和对应的推广信息关键字至少之一进行匹配,确定满足匹配条件(即满足相似度条件,如相似度达到阈值)的文章为用于呈现推广信息的推广文章。其中,上述待发布文章的主题可以为以下两种之一: The social network server matches the subject of the to-be-published article with at least one of the name of the candidate promotion object, the category of the candidate, and the corresponding promotion information keyword, and determines that the matching condition is met (ie, the similarity condition is met, for example, the similarity reaches the threshold). The article is a promotional article for presenting promotional information. The subject of the above-mentioned article to be published may be one of the following two types:
1)待发布文章的关键字;1) keywords of the article to be published;
例如,关键字可以包括:从文章标题或文章内容(如文章的每个段落)中提取的关键字;例如:待发布的文章的标题为“青岛美食剖析”,提取得到关键字为“美食”。For example, the keyword may include: a keyword extracted from an article title or article content (such as each paragraph of the article); for example, the title of the article to be published is “Qingdao Food Analysis”, and the extracted keyword is “Gourmet”. .
2)利用关键字-主题模型对文章进行主题预测得到的主题;2) Using the keyword-topic model to predict the topic of the article;
图5A为本发明实施例提供的利用关键字-主题分类器模型进行主题预测的示意图,参见图5A,关键字-主题模型可以为预先进行训练得到的分类器模型,实现了文章关键字与文章主题的关系映射,两个或两个以上的关键字通过关键字-主题分类器映射得到文章的主题。例如:关键字为从文章内容中提取得到的关键字为“糖”、“饼干”、“方便面”、“巧克力”,将这些关键字对应的向量输入关键字-主题模型,对主题进行预测,可得到主题为“食品”的概率为80%,得到主题为“娱乐”的概率为3%,选取概率最大的结果(食品)作为预测得到的文章的主题。FIG. 5A is a schematic diagram of a topic prediction using a keyword-topic classifier model according to an embodiment of the present invention. Referring to FIG. 5A, a keyword-topic model may be a classifier model obtained by pre-training, and implements article keywords and articles. The relationship mapping of the topic, two or more keywords through the keyword-topic classifier mapping to get the topic of the article. For example, the keyword extracted from the content of the article is “sugar”, “cookie”, “ instant noodles”, “chocolate”, and the vector corresponding to these keywords is input into the keyword-theme model to predict the theme. The probability of getting the theme "food" is 80%, the probability of getting the theme "entertainment" is 3%, and the result with the highest probability (food) is selected as the subject of the predicted article.
接下来对本发明实施例中的候选推广对象进行说明,候选推广对象可以为服务(如电影、游戏等)或产品(如化妆品、衣服、鞋等);以候选推广对象是服务为例,候选推广对象对应的名称、所属类别和推广信息关键字可以为服务对应的服务名称(如电影名称)、服务类别(如电影)、广告词(如“谁说汽车不能飞-XXX”);以候选推广对象是产品为例,候选推广对象对应的名称、所属类别和推广信息关键字可以为产品对应的产品名称(如女装品牌-XX)、产品类别(如衣服)、广告词(如“来自法兰西的浪漫、时尚服饰-XX”)。Next, the candidate promotion object in the embodiment of the present invention is described. The candidate promotion object may be a service (such as a movie, a game, etc.) or a product (such as cosmetics, clothes, shoes, etc.); The name, category, and promotion information keyword corresponding to the object may be a service name (such as a movie name), a service category (such as a movie), an advertisement word (such as "Who said the car cannot fly-XXX"); The object is a product. For example, the name, category, and promotion information keyword corresponding to the candidate promotion object may be the product name corresponding to the product (such as women's brand-XX), product category (such as clothes), and advertising words (such as "from France." Romantic, fashion clothing - XX").
基于上述对文章的主题、候选推广对象等的说明,在一些实施例中,将待发布文章的主题,与推广对象的名称、所属类别和对应的推广信息关键字至少之一进行匹配时,可以通过以下方式之一确定用于呈现推广信息的推广文章: Based on the foregoing description of the topic of the article, the candidate promotion object, and the like, in some embodiments, when the topic of the article to be published is matched with at least one of the name of the promotion object, the category of the promotion, and the corresponding promotion information keyword, Promote promotional articles for presenting promotional information in one of the following ways:
1)候选推广对象的名称、所属类别和对应的推广信息关键字至少之一所对应的内容,包括了待发布文章的主题;例如:候选推广对象对应的推广信息关键字为:水果王后-山竹,待发布的文章的主题为:水果,确定该待发布的文章满足匹配条件;1) The content corresponding to at least one of the name of the candidate promotion object, the category to which it belongs, and the corresponding promotion information keyword, including the subject of the article to be published; for example, the promotion information keyword corresponding to the candidate promotion object is: Queen of Fruits - Mangosteen The subject of the article to be published is: fruit, and it is determined that the article to be published satisfies the matching condition;
2)分别计算候选推广对象的名称、所属类别和对应的推广信息关键字与待发布文章的主题的相似度,当计算得到的三个相似度值至少之一超过预设的相似度阈值时,确定待发布文章满足匹配条件。2) separately calculating the similarity between the name of the candidate promotion object, the category and the corresponding promotion information keyword and the topic of the article to be published, and when at least one of the calculated three similarity values exceeds the preset similarity threshold, Make sure the article to be published meets the matching criteria.
在确定了用于呈现推广信息的推广文章之后,接下来对推广文章中用于添加推广信息的推广位置进行说明。After the promotion article for presenting the promotion information is determined, the promotion position for adding the promotion information in the promotion article is explained next.
在一些实施例中,可以通过如下方式确定推广文章中用于添加推广信息的推广位置:根据推广文章包括的主题特征,在推广文章中确定具有所包括的主题特征的段落;当所包括的主题特征与推广信息的主题特征满足主题相似度条件时,确定对应该段落的位置为用于添加推广信息的推广位置。In some embodiments, the promotion location for adding the promotion information in the promotion article may be determined by: determining, according to the topic feature included in the promotion article, a paragraph having the included topic feature in the promotion article; when the included topic feature When the topic feature of the promotion information satisfies the topic similarity condition, it is determined that the position of the corresponding paragraph is a promotion position for adding the promotion information.
推广文章包括的主题可以有一个或一个以上,不同的主题可分布于推广文章的不同段落中,对应不同段落的位置(对应不同主题的位置)可以为文章的中间位置、文章的结束位置、或相邻两个主题(段落)交接的位置;例如:当推广文章仅包含一个主题,该主题特征与推广信息的主题特征满足主题相似度条件,对应包括该主题的段落的位置(文章的结束位置)为推广位置;当推广文章包含两个或两个以上的主题,多个主题分布在不同的段落中,当多个主题特征中至少之一与推广信息的主题特征满足主题相似度条件时,将满足主题相似度条件的主题特征所在的段落与相邻段落交接的位置作为推广位置。如此,自动实现推广信息的位置的选定,位置灵活,能够避免推广信息的出现突兀,使得文章内容与推广信息的内容衔接自然,易于在用户在阅读文章的过程中接受。 The promotion article may include one or more topics. Different topics may be distributed in different paragraphs of the promotion article. The positions corresponding to different paragraphs (corresponding to different topics) may be the middle position of the article, the end position of the article, or The position where two adjacent topics (paragraphs) are handed over; for example, when the promotion article contains only one topic, the topic feature and the topic feature of the promotion information satisfy the topic similarity condition, and the position of the paragraph including the topic (the end position of the article) In order to promote the location; when the promotion article contains two or more topics, multiple topics are distributed in different paragraphs, when at least one of the plurality of topic features and the topic feature of the promotion information satisfy the topic similarity condition, The position where the paragraph in which the topic feature satisfying the topic similarity condition is placed and the adjacent paragraph is used as the promotion position. In this way, the selection of the location of the promotion information is automatically realized, the position is flexible, and the occurrence of the promotion information can be avoided, so that the content of the article and the content of the promotion information are naturally connected, and it is easy to be accepted by the user in the process of reading the article.
在一些实施例中,还可以通过如下方式确定推广文章中用于添加推广信息的推广位置:当在推广文章中相邻段落之间的位置添加推广信息时,根据所述推广文章的内容样式中相同类型的内容是否被所述推广信息分割,和/或所述推广信息在所述内容样式中所占用的显示比例,确定相应的完整度;当完整度满足预设的完整度条件时,确定该相邻段落之间的位置为添加推广信息的推广位置。In some embodiments, the promotion location for adding the promotion information in the promotion article may also be determined by: when the promotion information is added to the position between the adjacent paragraphs in the promotion article, according to the content style of the promotion article Whether the same type of content is segmented by the promotion information, and/or the display ratio occupied by the promotion information in the content style determines a corresponding completeness; when the completeness satisfies the preset completeness condition, determining The location between the adjacent paragraphs is the promotion location where the promotion information is added.
示例性地,当在推广文章可以添加推广信息的位置(即候选位置,例如,任意两个段落的中间位置)添加推广信息后,根据文章的内容样式中相同类型的内容是否被推广信息分割,确定内容样式的完整度,如果相同类型的内容被推广信息分割,则内容样式被破坏,相应的完整度为0;若文章的内容样式中相同类型的内容未被推广信息分割时,则内容样式仍然完整,对应的完整度为1,候选位置符合完整度条件从而能够作为推广位置。Illustratively, when the promotion information is added at a position where the promotion article can add the promotion information (ie, the candidate location, for example, the middle position of any two paragraphs), according to whether the same type of content in the content style of the article is divided by the promotion information, Determining the completeness of the content style. If the same type of content is segmented by the promotion information, the content style is destroyed, and the corresponding completeness is 0; if the content of the same type in the content style of the article is not segmented by the promotion information, the content style is It is still complete, and the corresponding degree of completeness is 1. The candidate position meets the integrity condition and can be used as a promotion location.
例如,若推广文章的内容样式仅包含文本,则为了保证文章中内容样式的完整性,可令文章内容的结束位置作为用于添加推广信息(广告)的推广位置,从而文章的内容样式不会被破坏。For example, if the content style of the promotion article only contains text, in order to ensure the integrity of the content style in the article, the end position of the article content can be used as a promotion location for adding promotion information (advertisement), so that the content style of the article does not destroyed.
又例如,若推广文章的内容样本除了包括文本,还包含多个类别或多个图片时,可以令每一类或每一个图片的结束位置作为用于添加推广信息的推广位置;如此,可以使得在文章的中间位置添加推广信息时对文章内容样式的影响最小,形成推广信息在文章中理想的融入度。For another example, if the content sample of the promotion article includes a plurality of categories or a plurality of pictures in addition to the text, the end position of each type or each picture may be used as a promotion position for adding the promotion information; When the promotion information is added in the middle of the article, the influence on the content style of the article is minimal, and the ideal integration degree of the promotion information in the article is formed.
示例性地,当在推广文章可以添加推广信息的位置(即候选位置,例如,任意两个段落的中间位置)添加推广信息后,根据推广信息在文章的内容样式中所占用的显示比例确定对应的完整度,占用的显示比例越大则对应的内容完整度越小,二者具有负相关的关系(可以采用反比例关系);当完整度小于完整度阈值时,说明候选位置不符合完整度条件。Illustratively, when the promotion information is added at a position where the promotion article can add the promotion information (ie, the candidate location, for example, the middle position of any two paragraphs), the display ratio determined by the promotion information in the content style of the article is determined. The degree of completeness, the larger the proportion of the occupied display, the smaller the completeness of the corresponding content, the negative correlation between the two (inverse proportional relationship can be adopted); when the completeness is less than the completeness threshold, the candidate position does not meet the integrity condition .
在实际实施时,社交网络服务器确定了用于呈现推广信息的推广文章 以及推广文章中用于添加推广信息的推广位置后,需要确定与推广文章匹配的目标推广对象;目标推广对象对应的素材用于生成推广信息;即执行步骤303:社交网络服务器确定与推广文章匹配的目标推广对象。In actual implementation, the social network server determines the promotion article for presenting the promotion information. And after the promotion location for adding the promotion information in the promotion article, the target promotion object matching the promotion article needs to be determined; the material corresponding to the target promotion object is used to generate the promotion information; that is, step 303 is performed: the social network server determines to match the promotion article. Target promotion object.
基于本发明上述实施例,与推广文章匹配的目标推广对象可以有一个或多个,可通过如下方式确定与推广文章匹配的目标推广对象:将推广文章的内容特征与候选推广对象的内容特征进行内容相似度计算,将满足内容相似度条件的候选推广对象确定为目标推广对象。According to the above embodiment of the present invention, the target promotion object matching the promotion article may have one or more, and the target promotion object matching the promotion article may be determined by: performing the content feature of the promotion article and the content feature of the candidate promotion object. The content similarity calculation determines the candidate promotion object that satisfies the content similarity condition as the target promotion object.
在一实施例中,社交网络服务器可通过如下方式确定与推广文章匹配的目标推广对象:社交网络服务器可从广告后台服务器处获得待推广的多个推广对象,然后对获得的多个推广对象进行首次筛选得到与推广文章匹配的候选推广对象集合,然后对得到的候选推广对象进行二次筛选得到与推广文章匹配的目标推广对象。In an embodiment, the social network server may determine the target promotion object that matches the promotion article by: the social network server may obtain the plurality of promotion objects to be promoted from the advertisement background server, and then perform the obtained plurality of promotion objects. Firstly, the candidate promotion object set matching the promotion article is obtained, and then the obtained candidate promotion object is subjected to secondary screening to obtain the target promotion object matching the promotion article.
例如,社交网络服务器确定与推广文章的主题满足主题相似度条件的候选推广对象,形成候选推广对象集合,完成一次筛选;然后确定候选推广对象集合中各候选推广对象与推广文章关于至少一个类型特征的相似度,特征包括图像特征和文本特征,将相似度满足相应类型特征的相似度条件的候选推广对象确定为目标推广对象,完成二次筛选。通过如上方式确定目标推广对象,首先基于主题进行筛选,然后基于特征进行筛选,目的在于节约全部使用特征进行筛选对算力的消耗。For example, the social network server determines a candidate promotion object that satisfies the topic similarity condition of the promotion article, forms a candidate promotion object set, completes a screening, and then determines each candidate promotion object in the candidate promotion object set and the promotion article about at least one type feature. The similarity degree includes the image feature and the text feature, and the candidate promotion object whose similarity satisfies the similarity condition of the corresponding type feature is determined as the target promotion object, and the secondary screening is completed. By determining the target promotion object in the above manner, the first screening is based on the theme, and then the feature is selected for screening, and the purpose is to save all the use characteristics to filter and consume the computing power.
这里需要说明的是,这里提到的相应类型特征的相似度,即图像特征与图像特征的相似度,或者,文本特征与文本特征的相似度;也即,将相似度满足相应类型特征的相似度条件的候选推广对象确定为目标推广对象,包括:确定所提取的图像特征与推广文章图像特征的相似度,当所确定的相似度超出图像特征的相似度阈值时,确定候选推广对象为目标推广对象;或者,确定所提取的文本特征与推广文章文本特征的相似度,当所 确定的相似度超出文本特征的相似度阈值时,确定候选推广对象为目标推广对象。在实际实施时,在进行特征的相似度计算之前可以执行特征提取的操作,如:提取由颜色、纹理和形状构成的图像特征;和/或,进行分词处理,对分词结果过滤掉停用词,得到由特征词构成的文本特征。What needs to be explained here is that the similarity of the corresponding type of features mentioned here, that is, the similarity between the image features and the image features, or the similarity between the text features and the text features; that is, the similarity satisfies the similarity of the corresponding type features. The candidate promotion object of the degree condition is determined as the target promotion object, which comprises: determining the similarity between the extracted image feature and the feature image feature of the promotion article, and determining the candidate promotion object as the target promotion when the determined similarity exceeds the similarity threshold of the image feature Object; or, to determine the similarity between the extracted text features and the text features of the promotional article, When the determined similarity exceeds the similarity threshold of the text feature, it is determined that the candidate promotion object is the target promotion object. In actual implementation, the feature extraction operation may be performed before performing feature similarity calculation, such as: extracting image features composed of colors, textures, and shapes; and/or performing word segmentation processing, filtering out stop words for word segmentation results , get the text features composed of feature words.
在一些实施例中,可通过如下方式得到与推广文章匹配的候选推广对象集合:将候选推广对象的至少一个类型的特征,输入根据特征词进行主题分类的分类器模型,获得分类器模型计算输出的候选推广对象所属的主题;当映射得到的主题与推广文章的主题的相似度超出主题相似度阈值时,确定为与推广文章的主题满足主题相似度条件的候选推广对象。其中,分类器模型根据特征词进行主题分类的过程可以包括:具有多个特征词的词向量组合形成输入向量(每次特征词的词向量根据语义-向量模型输出),根据输入向量预测属于不同主题的概率,取最大概率对应的主题作为候选推广对象所属的主题。In some embodiments, the candidate promotion object set matching the promotion article may be obtained by inputting at least one type of feature of the candidate promotion object into a classifier model for class classification according to the feature word, and obtaining a classifier model calculation output. The topic to which the candidate promotion object belongs; when the similarity between the mapped topic and the topic of the promotion article exceeds the topic similarity threshold, the candidate promotion object that satisfies the topic similarity condition with the topic of the promotion article is determined. The process of classifying a topic by a classifier model according to a feature word may include: combining a word vector having a plurality of feature words to form an input vector (a word vector of each feature word is output according to a semantic-vector model), and predicting different according to the input vector The probability of the topic, taking the topic corresponding to the maximum probability as the topic to which the candidate promotion object belongs.
接续将主题划分为三个等级进行举例说明,在一些实施例中,可基于第二级主题从待推广的推广对象中筛选出与推广文章匹配的候选推广对象集合,如可通过如下方式得到与推广文章匹配的候选推广对象集合:确定推广对象的第二级主题特征及推广文章的第二级主题特征,计算推广对象的第二级主题特征与推广文章的第二级主题特征的相似度,当超出预设的第二级主题相似度阈值(可以依据实际需要进行设定,如70%)时,确定为与推广文章匹配的候选推广对象。The categorization is divided into three levels for exemplification. In some embodiments, the candidate promotion object set matching the promotion article may be selected from the promotion objects to be promoted based on the second-level theme, for example, by obtaining the following Promoting the matching of candidate objects in the article matching: determining the second-level topic feature of the promotion object and the second-level topic feature of the promotion article, and calculating the similarity between the second-level topic feature of the promotion object and the second-level topic feature of the promotion article, When the preset second-level topic similarity threshold (which can be set according to actual needs, such as 70%) is exceeded, it is determined as a candidate promotion object that matches the promotion article.
当然,在进行第二级主题相似度计算之前需要得到推广文章及推广对象的第二级主题,获取推广对象/推广文章的特征向量,将获取的特征向量输入二级分类器,得到对应的第二级主题。在实际实施时为将推广对象/推广文章的多个维度的特征向量映射得到对应的第二级主题,因此,上述映射到相应的第二级主题的过程相当于对多个维度的特征向量进行降维处理 的过程,如此,降低了文章处理的算法难度。Of course, before the second-level topic similarity calculation, it is necessary to obtain the second-level theme of the promotion article and the promotion object, obtain the feature vector of the promotion object/promotion article, and input the acquired feature vector into the second classifier to obtain the corresponding first. Secondary theme. In actual implementation, the feature vector of the multiple dimensions of the promotion object/promotional article is mapped to obtain the corresponding second-level theme. Therefore, the above process of mapping to the corresponding second-level topic is equivalent to performing feature vectors of multiple dimensions. Dimensionality reduction The process, in this way, reduces the difficulty of the algorithm of the article processing.
接下来对主题映射过程中不同类型的特征的提取分别进行说明。Next, the extraction of different types of features in the topic mapping process will be described separately.
首先,对于文本特征的提取来说,提取目的是从文章或段落中获得文本的语义描述,在一实施中,可以包括预处理及文本特征提取两个主要操作,其中,预处理可以包括如下步骤:First, for the extraction of text features, the purpose of the extraction is to obtain a semantic description of the text from the article or the paragraph. In an implementation, the two main operations of preprocessing and text feature extraction may be included, wherein the preprocessing may include the following steps. :
步骤1、无效字符过滤;例如:文章如果是来源于网页,通常需要通过正则表达式等方式将HTML的Tag过滤掉。Step 1, invalid character filtering; for example: if the article is derived from a web page, it is usually necessary to filter the HTML tag by means of a regular expression or the like.
步骤2、分词处理;Step 2, word segmentation;
在实际实施时,往往需要先对步骤1得到内容进行编码转换,然后可利用正则表达式匹配标点符号、分行符实现将文章的段落划分为句子,最后可利用中文分词法将句子划分为一个一个单独的词。In actual implementation, it is often necessary to first encode and convert the content obtained in step 1. Then, regular expressions can be used to match the punctuation marks and line breaks to divide the paragraphs of the article into sentences. Finally, the Chinese word segmentation can be used to divide the sentences into one sentence. Separate words.
步骤3、过滤停用词;Step 3. Filter the stop words;
在实际实施时,可根据预先设定的词典过滤“的”、“地”等无关语义的词。在一些实施例中,执行完步骤3后,还可以进一步进行特征词的提取,使得后续的文本特征提取更简便。In actual implementation, words of irrelevant semantics such as "", "ground", etc., may be filtered according to a preset dictionary. In some embodiments, after step 3 is performed, feature word extraction may be further performed, so that subsequent text feature extraction is more convenient.
在上述步骤1至步骤3执行完成后,即完成对文本特征提取的预处理,之后,即可进行文本特征的提取,在实际实施时,可采用如下方式之一进行文本特征的提取:After the execution of the above steps 1 to 3 is completed, the pre-processing of the text feature extraction is completed, and then the text feature can be extracted. In actual implementation, the text feature can be extracted in one of the following ways:
1)关键词提取,如采用词频-逆向文件频率(TF-IDF,Term Frequency-Inverse Document Frequency)等算法实施。1) Keyword extraction, such as algorithm implementation using word frequency-inverse document frequency (TF-IDF, Term Frequency-Inverse Document Frequency).
2)词袋(BOW,Bag of Words)模型,忽略语法将文本表示为词集,即词的组合。2) BOW (Bag of Words) model, ignoring the grammar to represent the text as a set of words, that is, a combination of words.
3)深度学习模型,例如Word Embedding,将词映射得到词向量,通过词向量进行运算。3) Deep learning models, such as Word Embedding, map words to get word vectors and operate them through word vectors.
接下来,对图像特征的提取进行说明,提取的目的是从文章图片中获 得图片的语义描述,在一实施中,对文章中图像特征的提取可以采用以下方式之一:Next, the extraction of image features is described. The purpose of the extraction is to obtain the image from the article. The semantic description of the picture, in one implementation, the extraction of image features in the article can be one of the following ways:
1)采用图片矩阵的代数特征,例如通过奇异值分解(SVD,Singular value decomposition)、可编程计数器阵列(PCA,Programmable Counter Array)等方法对表示图片的矩阵降维得到。1) Using the algebraic features of the picture matrix, for example, by Singular value decomposition (SVD, Singular value decomposition), Programmable Counter Array (PCA), etc., the matrix of the represented picture is reduced in dimension.
2)采用全局统计特征,例如直方图、对比度、几何不变矩Hu矩等。2) Adopt global statistical features such as histogram, contrast, geometric invariant moment, and so on.
3)采用局部直观特征,例如纹理特征(如采用线性反投影算法(LBP,Linear Back Projection)、通用搜索树(GIST,Generalized Search Trees)等)、角点特征(如采用Harris角点检测等)、边缘特征(如采用多级边缘检测算法-Canny算子)、形状特征(如采用Hough变换)等。3) Use local intuitive features, such as texture features (such as Linear Back Projection (LBP), Generalized Search Trees (GIST), corner features (such as Harris corner detection, etc.) , edge features (such as multi-level edge detection algorithm - Canny operator), shape features (such as Hough transform).
4)采用尺度不变特征变换(SIFT,Scale-invariant feature transform)、方向梯度直方图(HOG,Histogram of Oriented Gradient)、Haar分类器至少之一进行特征提取。4) Feature extraction is performed by using at least one of a Scale-invariant feature transform (SIFT), a Histogram of Oriented Gradient (HOG), and a Haar classifier.
5)采用卷积神经网络(CNN,Convolutional Neural Network)进行特征提取,CNN网络有多种具体实现方式,例如AlexNet、VGG、ResNet等;在实际实施时,可以采用ImageNet等公开数据集训练的通用模型的最后一个卷积层的结果作为CNN模型的特征。5) Using Convolutional Neural Network (CNN) for feature extraction, CNN network has many specific implementation methods, such as AlexNet, VGG, ResNet, etc. In actual implementation, it can be used for general data training such as ImageNet. The result of the last convolutional layer of the model is characteristic of the CNN model.
接下来,对经第二级主题进行筛选得到的候选推广对象进行二次筛选得到与推广文章匹配的目标推广对象进行说明,在一些实施例中,可通过以下方式实现从候选推广对象集合中筛选出与推广文章匹配的目标推广对象:提取候选推广对象的特征,所提取的特征包括图像特征和文本特征中至少一个类型的特征;计算所提取的特征与推广文章的相应类型特征的相似度;当超出相应类型特征的相似度阈值时,确定为与推广文章匹配的目标推广对象。如此,自动实现目标推广对象的适配,使得推广文章中的推广信息(广告)与文章内容的契合度较高,不会对用户阅读文章的过程造 成干扰,提高了用户的阅读体验。Next, the candidate promotion object selected by the second-level topic is subjected to secondary screening to obtain a target promotion object that matches the promotion article. In some embodiments, the candidate promotion object collection may be selected by the following manner. The target promotion object matching the promotion article: extracting the feature of the candidate promotion object, the extracted feature includes at least one type of the image feature and the text feature; calculating the similarity between the extracted feature and the corresponding type feature of the promotion article; When the similarity threshold of the corresponding type feature is exceeded, the target promotion object matching the promotion article is determined. In this way, the adaptation of the target promotion object is automatically realized, so that the promotion information (advertisement) in the promotion article has a high degree of fit with the article content, and does not create a process for the user to read the article. Interference has improved the user's reading experience.
下面对通过预先训练得到的分类器实现上述从候选推广对象集合中筛选出与推广文章匹配的目标推广对象进行说明。The following describes the target promotion object obtained by pre-training to filter out the target promotion object that matches the promotion article from the candidate promotion object set.
在一个示例中,针对待发布文章和候选推广对象均具有文本素材的情况,图5B为本发明实施例提供的利用文本-文本相似度分类器进行相似度计算的示意图,参见图5B,提取候选推广对象的文本特征及推广文章的文本特征,输入对应的文本-文本相似度分类器,当得到文本-文本相似度超出文本-文本相似度阈值时,确定该候选推广对象为与推广文章匹配的目标推广对象。In an example, for the case that the article to be published and the candidate promotion object both have the text material, FIG. 5B is a schematic diagram of the similarity calculation using the text-to-text similarity classifier according to the embodiment of the present invention. Referring to FIG. 5B, the candidate is extracted. Promote the text feature of the object and promote the text feature of the article, and input the corresponding text-text similarity classifier. When the text-text similarity exceeds the text-text similarity threshold, determine that the candidate promotion object matches the promotion article. Target promotion object.
在又一个示例中,待发布文章和候选推广对象均具有图像素材的情况,图5C为本发明实施例提供的利用图像-图像相似度分类器进行相似度计算的示意图,参见图5C,提取推广对象的图像特征及推广文章的图像特征,输入对应的图像-图像相似度分类器,当得到图像-图像相似度超出图像-图像相似度阈值时,确定该候选推广对象为与推广文章匹配的目标推广对象。在一些实施例中,可通过图像的特征向量计算相似度,如采用以下方式得到:In another example, the case where the article to be published and the candidate promotion object both have the image material, FIG. 5C is a schematic diagram of the similarity calculation using the image-image similarity classifier according to the embodiment of the present invention, and FIG. 5C, the extraction promotion The image feature of the object and the image feature of the article are extended, and the corresponding image-image similarity classifier is input. When the image-image similarity exceeds the image-image similarity threshold, the candidate promotion object is determined to be the target matching the promotion article. Promote the object. In some embodiments, the similarity can be calculated from the feature vector of the image, as obtained in the following manner:
1)欧式距离:将向量想象为N维空间的点,欧式距离衡量点与点之间的距离;1) Euclidean distance: imagine the vector as the point of the N-dimensional space, and the Euclidean distance measures the distance between the point and the point;
2)Cosine相似度:衡量两个向量之间的夹角大小;2) Cosine similarity: measure the angle between two vectors;
3)Jaccard相似度:把两个向量看作一个集合,衡量集合间的重合度。3) Jaccard similarity: Think of two vectors as a set, measuring the degree of coincidence between sets.
在一些实施例中,可通过以下方式实现从候选推广对象集合中筛选出与推广文章匹配的目标推广对象:计算候选推广对象集合中候选推广对象的第三级主题与所述推广文章的第三级主题的相似度,当超出第三级主题相似度阈值时,确定为与所述推广文章匹配的目标推广对象。In some embodiments, the target promotion object matching the promotion article may be selected from the candidate promotion object set by calculating a third-level theme of the candidate promotion object in the candidate promotion object set and the third promotion article. The similarity of the level topic, when the third level topic similarity threshold is exceeded, is determined as the target promotion object that matches the promotion article.
在一些实施例中,可通过以下方式实现从候选推广对象集合中筛选出 与推广文章匹配的目标推广对象:提取候选推广对象的图像特征及推广文章的文本特征,确定候选推广对象的图像特征与推广文章的文本特征的相似度;当超出文本与图像相似度阈值时,确定为与推广文章匹配的目标推广对象。In some embodiments, the screening from the set of candidate promotion objects can be implemented in the following manner. Target promotion object matching the promotion article: extracting the image features of the candidate promotion object and promoting the text feature of the article, determining the similarity between the image feature of the candidate promotion object and the text feature of the promotion article; when the text and image similarity threshold is exceeded, Determine the target promotion object that matches the promotion article.
下面对通过预先训练得到的文本-图像相似度分类器,实现上述候选推广对象的图像特征与推广文章的文本特征相似度的计算进行说明;The following is a description of the calculation of the text feature similarity of the image features of the candidate promotion object and the promotion article by using the text-image similarity classifier obtained by the pre-training;
图5D为本发明实施例提供的利用文本-图像相似度分类器进行相似度计算的示意图,参见图5D,提取推广对象的图像特征及推广文章的文本特征,输入对应的文本-图像相似度分类器,当得到文本-图像相似度超出文本-图像相似度阈值时,确定该推广对象为与推广文章匹配的推广对象。FIG. 5D is a schematic diagram of performing similarity calculation by using a text-image similarity classifier according to an embodiment of the present invention. Referring to FIG. 5D, extracting image features of the promotion object and text features of the promotion article, and inputting corresponding text-image similarity classification. When the text-image similarity exceeds the text-image similarity threshold, it is determined that the promotion object is a promotion object that matches the promotion article.
在一些实施例中,还可通过如下方式确定与推广文章匹配的目标推广对象:计算获得的待推广的推广对象与推广文章的图像特征和文本特征至少之一的相似度;确定相似度满足相应类型特征的相似度条件的推广对象,为与推广文章匹配的目标推广对象。作为一种实施方式,例如,对候选推广对象的素材以及推广文章执行以下类型至少之一的特征提取操作:提取由颜色、纹理和形状构成的图像特征;进行分词处理,对分词结果过滤掉停用词,得到由特征词构成的文本特征;确定候选推广对象与推广文章关于至少一个类型特征的相似度:将满足相应类型特征的相似度条件的候选推广对象确定为目标推广对象。举例说明,计算候选推广对象的图片素材(在自媒体平台中预先存储)提取的图像特征,与推广文章中图片的图像特征的相似度,如果满足图片相似度条件(大于图像特征相似度阈值),则候选推广对象为目标推广对象;又例如,计算候选推广对象的文字素材(在自媒体平台中预先存储,例如分类信息、广告词)提取的文本特征,与推广文章中文字的文本特征的相似度,如果满足图片相似度条件(大于文本特征相似度阈值),则候选推广对象为目标推广对象。该实现方式省去了确 定候选推广对象集合的过程,直接基于推广文章的图像特征和/或文本特征确定目标推广对象。In some embodiments, the target promotion object matching the promotion article may be determined by: calculating a similarity between the obtained promotion object to be promoted and at least one of the image feature and the text feature of the promotion article; determining that the similarity satisfies the corresponding The promotion object of the similarity condition of the type feature is the target promotion object matching the promotion article. As an implementation manner, for example, performing feature extraction operations on at least one of the following types of material of the candidate promotion object and the promotion article: extracting image features composed of colors, textures, and shapes; performing word segmentation processing, filtering and filtering the word segmentation results Using the word, the text feature composed of the feature word is obtained; determining the similarity between the candidate promotion object and the promotion article regarding the at least one type feature: the candidate promotion object satisfying the similarity condition of the corresponding type feature is determined as the target promotion object. For example, calculating the similarity between the image features extracted by the picture material of the candidate promotion object (pre-stored in the media platform) and the image features of the image in the promotion article, if the image similarity condition is satisfied (greater than the image feature similarity threshold) , the candidate promotion object is the target promotion object; for example, calculating the text feature of the candidate promotion object text material (pre-stored in the media platform, such as classification information, advertisement words), and promoting the text feature of the text in the article Similarity, if the picture similarity condition is satisfied (greater than the text feature similarity threshold), the candidate promotion object is the target promotion object. This implementation eliminates the need to The process of determining the set of candidate promotion objects directly determines the target promotion object based on the image features and/or text features of the promotion article.
在一些实施例中,还可通过如下方式确定与推广文章匹配的目标推广对象:确定候选推广对象的图像特征与推广文章的文本特征的相似度;当确定的相似度超出文字与图像相似度阈值时,确定候选推广对象为目标推广对象。这里需要说明的是,由于本发明实施例中的特征均指的特征向量,可以计算候选推广对象及推广文章的不同类型特征的相似度,然后进行阈值比较确定目标推广对象。然而,此处使用候选推广对象的图像特征,以及使用文章的文本特征进行相似度计算,是因为:对于所有的候选推广对象,在自媒体平台中都会有对应的图像素材,而所有的文章都包括文字,能够保证总是能够计算二者的相似度;避免了因为自媒体平台中缺失候选推广对象的文字素材、以及文本中缺失图像素材、从而使用相同类型特征无法计算相似度的问题。In some embodiments, the target promotion object matching the promotion article may also be determined by determining the similarity between the image feature of the candidate promotion object and the text feature of the promotion article; when the determined similarity exceeds the text and image similarity threshold When the candidate promotion object is determined as the target promotion object. It should be noted that, because the feature vector refers to the feature vector in the embodiment of the present invention, the similarity between the candidate promotion object and the different types of features of the promotion article can be calculated, and then the threshold comparison is performed to determine the target promotion object. However, the image features of the candidate promotion object are used here, and the similarity calculation is performed using the text features of the article because: for all candidate promotion objects, there will be corresponding image material in the self-media platform, and all the articles are Including text, it can guarantee that the similarity of the two can always be calculated; avoiding the problem that the similarity cannot be calculated by using the same type of features because the text material of the candidate promotion object is missing from the media platform and the image material is missing in the text.
需要说明的是,在本发明实施例中,步骤302及步骤303并不存在依赖关系,其执行顺序可互换。It should be noted that, in the embodiment of the present invention, there is no dependency relationship between step 302 and step 303, and the execution order is interchangeable.
接下来,执行步骤304:社交网络服务器确定与目标推广对象匹配的素材,形成包括素材的推广信息。Next, step 304 is performed: the social network server determines the material that matches the target promotion object, and forms promotion information including the material.
在一些实施例中,可通过如下方式确定与目标推广对象匹配的素材:In some embodiments, the material that matches the target promotion object can be determined as follows:
从推广文章中提取人物关键字;将人物关键字和目标推广对象的标签关键字至少之一与推广对象的模板内容组合,形成与推广对象匹配的第一文字素材。Extracting the character keyword from the promotion article; combining at least one of the character keyword and the tag keyword of the target promotion object with the template content of the promotion object to form a first text material that matches the promotion object.
在实际实施时,人物关键字可以为文章中出现的文章作者对自己或他人的称谓,如:美国朋友、明星球球等;而提取人物关键字的方式可以为基于语义分析的方法进行提取。In actual implementation, the character keyword can be the title of the author or other person appearing in the article, such as: American friends, star balls, etc.; and the way to extract the character keywords can be extracted based on the semantic analysis method.
标签关键字为用于标识推广对象的特征、功能等的关键字,每个推广 对象都存在对应的标签关键字用于标识该推广对象的特征、功能等,如对于推广对象为某款面膜来说,其标签关键字可以为:保湿、补水。The tag keyword is a keyword used to identify the features, functions, etc. of the promotion object, and each promotion The object has a corresponding label keyword for identifying the feature, function, and the like of the promotion object. For example, if the promotion object is a certain mask, the label keyword may be: moisturizing and hydrating.
在一发明实施例中,针对推广对象预先设定了用于生成文字素材的模板(可以为统一的模板,或针对不同主题的推广对象分类设置的模板),模板中设置有固定的文字描述,以及待补充的空白文字位置,当将人物关键字和/或推广对象的标签关键字代入模板后,形成对应推广对象的文字素材。In an embodiment of the present invention, a template for generating a text material (which may be a unified template or a template for classification of promotion objects for different topics) is preset for the promotion object, and a fixed text description is set in the template. And the blank text position to be supplemented, when the character keyword and/or the tag key of the promotion object are substituted into the template, the text material corresponding to the promotion object is formed.
对将推广对象的标签关键字代入模板后,形成对应推广对象的文字素材的实现方式举例说明:图6A为本发明实施例提供的文字素材的一个可选的示意图;参见图6A,推广对象的标签关键字为人气款、将其代入模板后得到文字模板+动态文字(即标签关键字)生成的文字素材为:这款也是重点推荐的人气款。An example of the implementation of the text material corresponding to the promotion object after the label keyword of the promotion object is substituted into the template is as follows: FIG. 6A is an optional schematic diagram of the text material provided by the embodiment of the present invention; The tag keyword is popular, and after entering it into the template, the text material generated by the text template + dynamic text (ie, the tag keyword) is: This is also the popular recommendation.
对将推广对象的人物关键字和标签关键字代入模板后,形成对应推广对象的文字素材的实现方式举例说明:图6B为本发明实施例提供的文字素材的一个可选的示意图;参见图6B,推广对象的人物关键字为大饼、标签关键字为人气款,将其代入模板后得到文字模板+动态文字(即标签关键字及人物关键字)生成的文字素材为:这款也是大饼重点推荐的人气款。An example of the implementation of the text material corresponding to the promotion object after the character keyword and the tag keyword of the promotion object are substituted into the template: FIG. 6B is an optional schematic diagram of the text material provided by the embodiment of the present invention; The character of the promotion object is the big cake, the label keyword is popular, and the text material generated by the text template + dynamic text (ie, the label keyword and the character keyword) is substituted into the template: this is also a big cake. Popular items recommended by the main focus.
再如,当推广对象为面膜、提取的人物关键字为明星球球、推广对象的标签关键字为保湿、补水时,将其与面膜的模板进行组合,形成对应的文字素材为:明星球球大力推荐的面膜,既保湿又补水。For example, when the promotion object is a mask, the extracted character keyword is a star ball, and the label keyword of the promotion object is moisturizing or hydrating, the template is combined with the mask template to form a corresponding text material: star ball A highly recommended mask that moisturizes and replenishes water.
在一些实施例中,社交网络服务器可通过如下方式确定与目标推广对象匹配的素材:In some embodiments, the social network server can determine the material that matches the target promotion object by:
对目标推广对象进行图像识别,得到表征目标推广对象属性的图像识别结果;将图像识别结果与目标推广对象的描述信息组合,形成与目标推广对象匹配的第二文字素材。在一些实施例中,与目标推广对象匹配的素材可以包括上述第一文字素材、第二文字素材至少之一。 Image recognition is performed on the target promotion object, and the image recognition result representing the attribute of the target promotion object is obtained; the image recognition result is combined with the description information of the target promotion object to form a second text material that matches the target promotion object. In some embodiments, the material matching the target promotion object may include at least one of the first text material and the second text material.
图像识别结果表征目标推广对象的属性:如名称(推广对象具体是什么,如衣服鞋子)、颜色、款式等;The image recognition result represents the attributes of the target promotion object: such as the name (what is the promotion object, such as clothes and shoes), color, style, etc.;
目标推广对象的描述信息可以为以关键字形式呈现的、从不同维度标识目标推广对象的相关内容的信息,如目标推广对象的价格描述、来源等;在,目标推广对象的描述信息往往包括可以实现用户与目标推广对象交互的超级链接,使得用户点击描述信息时进行页面跳转至相应页面。The description information of the target promotion object may be information that is presented in a keyword form, and identifies related content of the target promotion object from different dimensions, such as a price description and a source of the target promotion object; and the description information of the target promotion object often includes A hyperlink that enables the user to interact with the target promotion object, so that when the user clicks on the description information, the page jumps to the corresponding page.
在实际实施时,目标推广信息除包括形成的文字素材外,还包括图像素材;而对图像素材的获取可通过如下方式:当候选推广对象的图像素材的图像特征、与推广文章的图像特征的满足图像特征的匹配条件(如相似度超过预设阈值)时,确定为与目标推广对象匹配的图像素材。In actual implementation, the target promotion information includes image material in addition to the formed text material, and the image material can be obtained by: image feature of the image material of the candidate promotion object and image features of the promotion article When the matching condition of the image feature is satisfied (for example, the similarity exceeds the preset threshold), the image material that matches the target promotion object is determined.
在一些实施例中,还可通过如下方式获取图像素材:In some embodiments, the image material can also be obtained by:
从目标推广对象的原始推广信息中直接提取图像,作为与目标推广对象匹配的图像素材,然后将所提取的图像作为图像素材连同形成的文字素材与推广文章合成。需要说明的是,推广信息中包括的图像素材可以为一个或一个以上的图片,且该图片既可以为目标推广对象对应的图片,还可以为与目标推广对象对应的图片相关联的其它图片。如图7A、图7B所示,图7A、图7B为本发明实施例提供的在推广文章中图片的结束位置添加推广信息的示意图,在图7A、图7B中,块72为在文章中图片的结束位置添加的推广信息,其中,块71为文字模块,用于承载推广信息包括的基于推广对象的模板生成的文字素材,块73为图片模块,用于承载推广信息包括的图像素材。The image is directly extracted from the original promotion information of the target promotion object, and is used as an image material matching the target promotion object, and then the extracted image is synthesized as an image material together with the formed text material and the promotion article. It should be noted that the image material included in the promotion information may be one or more images, and the image may be a picture corresponding to the target promotion object, or may be another picture associated with the picture corresponding to the target promotion object. As shown in FIG. 7A and FIG. 7B, FIG. 7A and FIG. 7B are schematic diagrams showing the addition of promotion information to the end position of a picture in a promotion article according to an embodiment of the present invention. In FIG. 7A and FIG. 7B, block 72 is a picture in the article. The promotion information added at the end position, wherein the block 71 is a text module for carrying the text material generated by the promotion object-based template included in the promotion information, and the block 73 is a picture module for carrying the image material included in the promotion information.
在一些实施例中,推广信息包括的文字素材部分除包括基于推广对象的模板生成的文字素材外,还包括上述描述信息。如图7C所示,图7C为本发明实施例提供的在推广文章的结束位置添加推广信息的示意图,在图7C中,块70对应推广信息,块77为文字模块,用于承载推广信息包括的 基于推广对象的模板生成的文字素材、块78为图片模块,用于承载推广信息包括的图像素材、块79为描述信息模块,用于承载推广信息包括的目标推广对象的描述信息(如推广对象的详情及来源)。In some embodiments, the text material portion included in the promotion information includes the above description information in addition to the text material generated based on the template of the promotion object. As shown in FIG. 7C, FIG. 7C is a schematic diagram of adding promotion information at the end position of the promotion article according to an embodiment of the present invention. In FIG. 7C, the block 70 corresponds to the promotion information, and the block 77 is a text module, and is used to carry the promotion information. of The text material generated based on the template of the promotion object, the block 78 is a picture module, and is used to carry the image material included in the promotion information, and the block 79 is a description information module, and is used to carry the description information of the target promotion object included in the promotion information (such as the promotion object). Details and sources).
在一些实施例中,目标推广对象匹配的素材可以由目标推广对象的图像素材及描述信息组成。如图8所示,图8为本发明实施例提供的推广信息的示意图,在图8中,块81为图片模块,用于承载推广信息包括的图像素材、块82为描述信息模块,用于承载推广信息包括的目标推广对象的描述信息。In some embodiments, the material matched by the target promotion object may be composed of image material and description information of the target promotion object. As shown in FIG. 8, FIG. 8 is a schematic diagram of the promotion information provided by the embodiment of the present invention. In FIG. 8, the block 81 is a picture module, and is used to carry the image material included in the promotion information, and the block 82 is a description information module, and is used for Carrying description information of the target promotion object included in the promotion information.
在一些实施例中,根据所获得的素材生成推广信息可通过如下方式实现:获得用于在推广信息中首先(时间或位置上最先)呈现的固定内容,所述固定内容用于引导观看添加后的推广信息;将获得的固定内容、以及所获得的素材填充至推广信息模板,得到推广信息。In some embodiments, generating promotional information based on the obtained material may be accomplished by obtaining fixed content for first (time or location first) presentation in the promotional information, the fixed content being used to guide viewing additions After the promotion information; the fixed content obtained, and the obtained material are filled into the promotion information template to obtain the promotion information.
社交网络服务器基于确定的文字素材形成推广信息后,执行步骤305:根据确定的推广位置,将推广信息添加至推广位置。如此,合成推广文章与包括素材的推广信息,得到经过合成处理的推广文章。After the social network server forms the promotion information based on the determined text material, step 305 is performed: adding the promotion information to the promotion location according to the determined promotion location. In this way, the synthetic promotion article and the promotion information including the material are synthesized, and the synthesized article is obtained.
在一些实施例中,社交网络服务器还可设置推广信息的显示方式,参见图9A至9C,图9A至9C均为本发明实施例提供的推广信息的显示方式的示意图,例如:设置文字模块对应的内容隐藏,即设置推广信息中基于推广对象的模板生成的文字素材的显示方式为隐藏,如图9A所示,当用户点击图片中的文字模块部分时可以显示隐藏的文字素材;或者,如图9B所示,设置推广信息中文字模块显示固定内容(引导观看添加后的推广信息);或者,如图9C所示,设置推广信息中文字模块中,动态显示基于推广对象的模板生成的文字素材的内容,如滚动显示文字素材的内容。In some embodiments, the social network server may also set the display manner of the promotion information. Referring to FIG. 9A to FIG. 9C, FIG. 9A to FIG. 9C are schematic diagrams showing the display manner of the promotion information provided by the embodiment of the present invention, for example, setting the text module correspondingly. The content is hidden, that is, the display manner of the text material generated by the template based on the promotion object in the promotion information is hidden, as shown in FIG. 9A, when the user clicks on the text module part in the picture, the hidden text material can be displayed; or, for example, As shown in FIG. 9B, the text module of the promotion information is displayed to display the fixed content (the promotion information after the viewing is added); or, as shown in FIG. 9C, the text module of the promotion information is dynamically displayed, and the text generated based on the template of the promotion object is dynamically displayed. The content of the material, such as scrolling through the content of the text material.
至此,社交网络服务器对推广文章进行推广信息的添加处理描述完成,然后执行步骤306:社交网络服务器发送经添加处理的推广文章至自媒体平 台中的第二客户端。At this point, the social network server performs the process of adding the promotion information to the promotion article, and then performs step 306: the social network server sends the added promotion article to the self-media level. The second client in Taichung.
在一些实施例中,社交网络服务器可基于用户登录的自媒体账号获取该用户的文章偏好(如基于该用户的文章阅读记录得到用户偏爱的文章类别,可将用户阅读过的数量最多的一类文章作为用户偏爱的文章类别),基于用户的偏爱向该用户的自媒体平台的客户端主动推送添加有推广信息的推广文章至自媒体平台的客户端进行呈现。In some embodiments, the social network server may obtain the user's article preference based on the self-media account that the user logs in (eg, the article category that is preferred by the user based on the user's article reading record, and the largest number of users that can be read by the user. As a user-favored article category, the article actively pushes a promotion article with promotion information to the client of the user's self-media platform based on the user's preference to the client from the media platform for presentation.
在另一实施例中,社交网络服务器可基于用户终端发送的阅读请求(即终端拉取),将添加有推广信息的推广文章发送至自媒体平台的客户端进行呈现。In another embodiment, the social network server may send the promotion article with the promotion information to the client of the self-media platform for presentation based on the read request sent by the user terminal (ie, terminal pull).
自媒体平台中的第二客户端接收到经添加处理的推广文章后,执行步骤307:显示推广文章。如此,使得用户在看到自身关注的文章内容的同时,还了解了添加的推广信息,且由于推广信息与文章的过渡自然,增强了用户的阅读感受。After the second client in the media platform receives the added promotion article, step 307 is performed to display the promotion article. In this way, the user can understand the added promotion information while seeing the content of the article that he is concerned about, and the user's reading experience is enhanced due to the natural transition of the promotion information and the article.
对自媒体平台中的第一客户端及第二客户端进行说明,文章的发布者亦可为文章的阅读者(即第一客户端与第二客户端为同一客户端),文章的发布者与文章的阅读者为同一用户,在该场景下,第一客户端在向社交网络服务器提交待发布文章后,若提交的该待发布文章被确定为推广文章且存在与之匹配的目标推广对象,第一客户端获得自身提交的文章的同时,亦接收了添加有与目标推广对象匹配的素材的推广信息,之后,呈现文章的原始内容,并根据文章中添加推广信息的推广位置,当显示推广文章的原始内容至相应位置时呈现推广信息。The first client and the second client in the self-media platform are described, and the publisher of the article may also be the reader of the article (ie, the first client and the second client are the same client), and the publisher of the article The same user is the reader of the article. In this scenario, after the first client submits the article to be published to the social network server, if the submitted article to be published is determined to be the promotion article and there is a target promotion object matching the same When the first client obtains the article submitted by itself, it also receives the promotion information added with the material matching the target promotion object, and then presents the original content of the article, and displays the promotion location according to the promotion information in the article. Promotional information is presented when the original content of the article is promoted to the appropriate location.
在另一场景下,文章的发布者与文章的阅读者不是同一用户(即第一客户端与第二客户端为不同客户端),此时,第二客户端可以根据用户的访问请求拉取社交网络中发布的推广文章,或者根据与发布用户的社交(关注/订阅)关系获得社交网络服务器推送的推广文章,之后,呈现文章的原 始内容,并根据文章中添加推广信息的推广位置,当显示推广文章的原始内容至相应位置时呈现推广信息。In another scenario, the publisher of the article is not the same user as the reader of the article (ie, the first client and the second client are different clients), and at this time, the second client can pull according to the user's access request. A promotion article published in a social network, or a promotion article pushed by a social network server according to a social (concern/subscription) relationship with a publishing user, and then presenting the original article Start content, and according to the promotion location of the promotion information in the article, when the original content of the promotion article is displayed to the corresponding location, the promotion information is presented.
在一些实施例中,基于对推广信息的显示方式的设置,当显示推广文章的原始内容至相应位置时可采用以下方式之一呈现推广信息:In some embodiments, based on the setting of the display manner of the promotion information, when the original content of the promotion article is displayed to the corresponding location, the promotion information may be presented in one of the following ways:
1)呈现推广信息中的图像素材,并在推广信息中的文字素材被触发(如接收到用户的点击操作)时呈现文字素材;例如,开始仅显示图像素材,文字素材处于隐藏不可见状态,当用户点击文字素材所处位置时,呈现文字素材;1) presenting the image material in the promotion information, and presenting the text material when the text material in the promotion information is triggered (such as receiving the user's click operation); for example, starting to display only the image material, the text material is hidden and invisible. Rendering text material when the user clicks on the location of the text material;
2)响应于对添加推广信息的推广位置呈现的固定内容的操作(如点击操作),呈现推广信息中的文字素材及图像素材;固定内容用于引导观看推广信息;也即,开始呈现的为用于引导用户观看推广信息的固定内容,当用户触发时显示图像素材及文字素材;2) in response to the operation of fixing the fixed content presented by the promotion location of the promotion information (such as a click operation), presenting the text material and the image material in the promotion information; the fixed content is used to guide the viewing of the promotion information; that is, the presentation is started. Fixed content for guiding the user to view the promotion information, and displaying the image material and the text material when the user triggers;
可见,上面两种方式仅在用户触发的情况下才显示推广信息中的文字素材及图像素材,这在一定程度上降低了对用户进行文章阅读的干扰;It can be seen that the above two methods only display the text material and the image material in the promotion information only when the user triggers, which reduces the interference to the user to read the article to a certain extent;
3)当显示推广文章的原始内容至相应位置时,直接呈现推广信息中的文字素材及图像素材。3) When the original content of the promotion article is displayed to the corresponding location, the text material and the image material in the promotion information are directly presented.
应用上述实施例,具备以下有益效果:Applying the above embodiments, the following beneficial effects are obtained:
1)利用自媒体平台自发的关于文章的流量作为推广信息的载体,即文章的来源可以来自社交网络中任意一个用户终端,打破了靠征集特定主题的文章的局限性,可实现推广信息的批量化和自动化添加,借助与自媒体平台的流量频发的特性,可以实现推广信息及时触达用户;1) Using the self-media platform to spontaneously report the traffic of the article as the carrier of the promotion information, that is, the source of the article can come from any user terminal in the social network, breaking the limitation of the article collecting the specific topic, and realizing the batch of the promotion information. Adding and automating the addition, with the characteristics of frequent traffic with the self-media platform, the promotion information can be reached in time to reach the user;
2)自动实现推广对象、以及相应的文本素材的适配,使得推广文章中的推广信息(广告)与文章内容的契合度较高,不会对用户阅读文章的过程造成干扰,提高了用户的阅读体验;2) Automatically realize the promotion object and the adaptation of the corresponding text material, so that the promotion information (advertisement) in the promotion article has a high degree of fit with the article content, and does not interfere with the process of the user reading the article, thereby improving the user's Reading experience
3)自动实现推广信息的位置的选定,位置灵活,能够避免推广信息的 出现突兀,使得文章内容与推广信息的内容衔接自然,易于在用户在阅读文章的过程中接受。3) Automatically realize the location of the promotion information, the location is flexible, and can avoid the promotion of information. The abruptness makes the content of the article and the content of the promotion information natural, and is easy to accept in the process of reading the article.
作为上述基于自媒体平台的文章处理方法的另一个可选实施例,图10示出了本发明实施例服务器侧提供的基于自媒体平台的文章处理方法的一个可选的流程示意图,在本实施例中,以推广信息为广告、推广对象为广告对象(广告商品)为例进行说明,参见图10,本发明实施例提供的基于自媒体平台的文章处理方法包括:As an alternative embodiment of the above-described self-media platform-based article processing method, FIG. 10 is a schematic flowchart of an optional article processing method based on the self-media platform provided by the server side in the embodiment of the present invention. For example, the promotion information is used as an advertisement, and the promotion object is an advertisement object (advertisement product). For example, referring to FIG. 10, the article processing method based on the self-media platform provided by the embodiment of the present invention includes:
步骤401:服务器对文章进行语义分析。Step 401: The server performs semantic analysis on the article.
这里提到的文章为服务器获取的待发布的自媒体文章或者已经在自媒体平台被发布过但又被撤回的文章,通过对文章进行语义分析理解文章的标题,自媒体名称(如公众号名称)、作者名称,理解整篇文章的文字。从中挑选属于本篇文章的主题(topic),作为匹配广告对象的依据。进而可筛选出匹配广告对象的文章。The article mentioned here is the self-media article to be published by the server or the article that has been published on the self-media platform but has been withdrawn. The semantics of the article is used to understand the title of the article, from the media name (such as the public name). ), the author's name, understand the text of the entire article. Select the topic that belongs to this article as the basis for matching the advertising object. In turn, you can filter out articles that match the ad object.
其中,广告对象可供匹配的特征信息包括:广告对象的类别、广告词、广告对象的名称等。文章可供匹配的特征信息包括关键词等。The feature information that the advertisement object can match includes: a category of the advertisement object, an advertisement word, a name of the advertisement object, and the like. The feature information that the article can match includes keywords and the like.
语义分析即语义理解,指将非结构化或半结构化的自然语言文本转化为计算机可深层处理的结构化信息、并进行分类、分析等操作。Semantic analysis, that is, semantic understanding, refers to the transformation of unstructured or semi-structured natural language text into structured information that can be deeply processed by computers, and classified, analyzed, and so on.
步骤402:识别文章的主题。Step 402: Identify the subject of the article.
在实际实施时,可通过训练得到的关键字-主题模型得到,通过语义分析提取文章的关键字,然后输入训练得到的关键字-主题模型得到文章的主题。In actual implementation, it can be obtained by training the keyword-topic model, extracting the keywords of the article through semantic analysis, and then inputting the trained keyword-topic model to get the topic of the article.
步骤403:基于文章主题判断文章是否匹配广告对象,如果匹配执行步骤404,如果不匹配,文章不出广告。Step 403: Determine whether the article matches the advertisement object based on the article theme. If the matching performs step 404, if there is no match, the article does not advertise.
在实际实施时,服务器可以将文章的主题,与广告对象对应的名称、类别和广告词至少之一进行匹配,确定满足匹配条件的文章为可以匹配广 告对象的文章。In actual implementation, the server may match the topic of the article with at least one of the name, category, and advertisement word corresponding to the advertisement object, and determine that the article satisfying the matching condition can be matched widely. The article of the object.
如果通过上述匹配确定文章不适合添加广告,即不存在匹配的广告,不对文章添加广告,直接将其发布至社交网络。If it is determined by the above matching that the article is not suitable for adding an advertisement, that is, there is no matching advertisement, no advertisement is added to the article, and the article is directly posted to the social network.
步骤404:对文章进行分段的语义分析。Step 404: Perform semantic analysis of the segmentation of the article.
对文章进行分段的语义分析得到文章是否存在多个(两个或两个以上)的主题。A semantic analysis of the segmentation of the article yields whether the article has multiple (two or more) topics.
步骤405:识别文章是否存在多个主题。Step 405: Identify whether the article has multiple topics.
步骤406:确定文章存在多个主题,执行步骤408。Step 406: Determine that there are multiple topics in the article, and perform step 408.
步骤407:确定文章存在单一主题,执行步骤409。Step 407: Determine that the article has a single topic, and perform step 409.
步骤408:在文章中标记多个添加广告的位置。Step 408: Mark multiple locations where the advertisement is added in the article.
在实际实施时,当确定文章存在多个主题时,可选择在对应主题的位置添加广告,如在两个相邻主题的交界处。In actual implementation, when it is determined that there are multiple topics in the article, it is optional to add an advertisement at the location of the corresponding topic, such as at the junction of two adjacent topics.
步骤409:在文章结束标记添加广告的位置。Step 409: Add the location of the advertisement at the end of the article.
步骤410:根据文章主题从广告对象库中选出一组备选广告对象集合。Step 410: Select a set of alternative advertisement object sets from the advertisement object library according to the article theme.
可通过计算广告对象的主题与文章主题的相似度确定备选广告对象集合(如确定相似度达到预设阈值的)。The set of alternative advertising objects (eg, determining that the similarity reaches a preset threshold) can be determined by calculating the similarity of the subject of the advertising object to the topic of the article.
步骤411:根据文章的文本内容匹配广告对象。Step 411: Match the advertisement object according to the text content of the article.
在实际实施时,可采用预先训练得到的文本-图像相似度分类器,输入文章的文本特征及广告对象的图像特征,得到二者的相似度,当仅在文章结束标记了添加广告的位置时,仅需匹配相似度最高的广告对象即可,当在文章中标记多个添加广告的位置时,则可依据相似度的排序选取相应数量的广告对象。In actual implementation, the pre-trained text-image similarity classifier can be used to input the text feature of the article and the image feature of the advertisement object to obtain the similarity between the two, when only the position where the advertisement is added is marked at the end of the article. You only need to match the most similar ad object. When you mark multiple ads in the article, you can select the corresponding number of ad objects according to the similarity.
步骤412:对匹配得到的广告对象进行图像识别,并获取广告对象的描述信息。Step 412: Perform image recognition on the matched advertising object, and obtain description information of the advertising object.
广告对象可能仅有图片信息,则通过对广告对象进行图像识别可得到 对应广告对象的素材信息,如该广告对象的具体内容是什么,如衣服、鞋等。广告对象的描述信息包括该广告对象的来源、价格、描述详情等等。The advertising object may only have image information, and the image recognition of the advertising object can be obtained. Corresponding to the material information of the advertisement object, such as the specific content of the advertisement object, such as clothes, shoes, and the like. The description information of the advertisement object includes the source, price, description details, and the like of the advertisement object.
步骤413:根据图像识别结果及广告对象的描述信息,添加文字。Step 413: Add text according to the image recognition result and the description information of the advertisement object.
在实际实施时,上述添加的文字即为用于添加文章的广告的文字素材。In actual implementation, the above-mentioned added text is the text material of the advertisement for adding the article.
步骤414:抽取广告对象的描述信息显示在广告图片上。Step 414: The description information of the extracted advertisement object is displayed on the advertisement image.
在实际实施时,对应广告对象的描述信息包括相应的超级链接,当用户点击时,跳转至相应的页面,如跳转至广告对象的购买页。In actual implementation, the description information of the corresponding advertisement object includes a corresponding hyperlink, and when the user clicks, jumps to the corresponding page, such as jumping to the purchase page of the advertisement object.
步骤415:将所述添加文字及可交互的广告对象作为广告添加至文章。Step 415: Add the added text and the interactive advertising object as an advertisement to the article.
所述可交互的指的是广告包括的超级链接,用户点击可进行页面跳转。The interactive means refers to a hyperlink included in the advertisement, and the user clicks to perform a page jump.
前述基于自媒体平台的文章处理方法的实施例,以承载有自媒体平台的服务器先确定用于呈现推广信息的推广文章,在基于确定的推广文章确定目标推广对象,然而在实际应用中,还可以先确定目标推广对象,再确定用于呈现推广信息的推广文章,接下来对此方式的基于自媒体平台的文章处理方法进行详细说明。In the foregoing embodiment of the article processing method based on the self-media platform, the server carrying the self-media platform first determines the promotion article for presenting the promotion information, and determines the target promotion object based on the determined promotion article, but in actual application, The target promotion object can be determined first, and then the promotion article for presenting the promotion information can be determined. Next, the article processing method based on the self-media platform is described in detail.
作为基于自媒体平台的文章处理方法的另一个可选实施例,图11示出了本发明实施例服务器侧提供的基于自媒体平台的文章处理方法的一个可选的流程示意图,在本实施例中,自媒体平台可以承载于具有社交功能的社交网络服务器上,以推广信息为广告、推广对象为广告对象(广告商品)为例进行说明,参见图11,本发明实施例提供的基于自媒体平台的文章处理方法包括:As an alternative embodiment of the article processing method based on the self-media platform, FIG. 11 is a schematic flowchart of an optional article processing method based on the self-media platform provided by the server side in the embodiment of the present invention. The self-media platform can be carried on a social network server with social functions, and the promotion information is an advertisement, and the promotion object is an advertisement object (advertisement product). For example, referring to FIG. 11 , the self-media provided by the embodiment of the present invention is provided. The article's article processing methods include:
步骤501:第一客户端发送目标文章至自媒体平台。Step 501: The first client sends the target article to the self-media platform.
这里,客户端连接所述自媒体平台,所述目标文章由所述自媒体平台的用户通过所述客户端提交。在实际应用中,目标文章包括待发布文章及原始文章。Here, the client connects to the self-media platform, and the target article is submitted by the user of the self-media platform through the client. In practical applications, the target article includes the article to be published and the original article.
步骤502:在候选推广对象中确定目标推广对象、以及与目标推广对象 匹配的素材。Step 502: Determine a target promotion object and a target promotion object in the candidate promotion object. Matching material.
在一实施例中,自媒体平台可通过如下方式在存储于自媒体平台的候选推广对象中确定目标推广对象:In an embodiment, the self-media platform may determine the target promotion object among the candidate promotion objects stored in the self-media platform by:
将历史目标文章的内容特征与所述候选推广对象的内容特征进行内容相似度计算,将满足内容相似度条件的候选推广对象确定为目标推广对象;其中,所述历史目标文章先于所述目标文章在所述自媒体平台接收并发送。Performing content similarity calculation on the content feature of the historical target article and the content feature of the candidate promotion object, and determining the candidate promotion object satisfying the content similarity condition as the target promotion object; wherein the historical target article precedes the target The article is received and sent on the self-media platform.
例如:确定与历史目标文章的主题满足主题相似度条件的候选推广对象,形成候选推广对象集合;确定候选推广对象集合中各候选推广对象与历史目标文章关于至少一个类型特征的相似度,所述特征包括图像特征和文本特征;将相似度满足相应类型特征的相似度条件的候选推广对象确定为目标推广对象。For example, determining a candidate promotion object that satisfies a topic similarity condition with a topic of the historical target article, forming a candidate promotion object set, and determining a similarity between each candidate promotion object in the candidate promotion object set and the historical target article regarding at least one type feature, The feature includes an image feature and a text feature; the candidate promotion object whose similarity satisfies the similarity condition of the corresponding type feature is determined as the target promotion object.
步骤503:基于确定的目标推广对象,在接收的目标文章中确定用于呈现推广信息的推广文章。Step 503: Determine, according to the determined target promotion object, a promotion article for presenting the promotion information in the received target article.
在一实施例中,基于确定的所述目标推广对象,将所述目标推广对象的主题特征与所述目标文章的主题特征进行主题相似度计算,将满足主题相似度条件的目标文章确定为所述推广文章。例如:将从目标文章提取的关键词,输入根据特征词进行主题分类的分类器模型,获得分类器模型计算输出的目标文章对应的主题;将从候选推广对象的素材提取的关键词,输入根据特征词进行主题分类的分类器模型,获得候选推广对象对应的主题;根据目标文章所对应主题与候选推广对象所对应主题的语义距离,确定与语义距离负相关关系的主题相似度,将接收的目标文章中满足主题相似度条件的目标文章确定为推广文章。In an embodiment, based on the determined target promotion object, the theme feature of the target promotion object and the theme feature of the target article are subjected to topic similarity calculation, and the target article satisfying the topic similarity condition is determined as Describe the promotion article. For example, the keyword extracted from the target article is input into a classifier model that classifies the topic according to the feature word, and the topic corresponding to the target article calculated by the classifier model is obtained; the keyword extracted from the material of the candidate promotion object is input according to the keyword. The classifier model of the feature word is used to obtain the topic corresponding to the candidate promotion object; according to the semantic distance of the topic corresponding to the target article and the topic corresponding to the candidate promotion object, the topic similarity with the negative relationship of the semantic distance is determined, and the received feature is received. The target article in the target article that satisfies the topic similarity condition is determined as a promotion article.
步骤504:确定推广文章中用于添加推广信息的推广位置。Step 504: Determine a promotion location for adding promotion information in the promotion article.
在一实施例中,可通过如下方式确定推广文章中用于添加推广信息的推广位置: In an embodiment, the promotion location for adding promotional information in the promotion article may be determined as follows:
根据推广文章包括的主题特征,在推广文章中确定具有所包括的主题特征的段落;当所包括的主题特征与推广信息的主题特征满足主题相似度条件时,确定所述段落的位置为用于添加所述推广信息的推广位置。Determining a paragraph having the included topic feature in the promotion article according to the topic feature included in the promotion article; determining the location of the paragraph for adding when the included topic feature and the topic feature of the promotion information satisfy the topic similarity condition The promotion location of the promotion information.
步骤505:根据所确定的与目标推广对象匹配的素材生成推广信息。Step 505: Generate promotion information according to the determined material that matches the target promotion object.
在一实施例中,可通过如下方式生成推广信息:In an embodiment, the promotion information can be generated by:
获得用于在推广信息中首先呈现的固定内容,所述固定内容用于引导观看添加后的推广信息;将获得的固定内容、以及所获得的素材填充至推广信息模板,得到推广信息。The fixed content for first presentation in the promotion information is obtained, and the fixed content is used to guide the viewing of the added promotion information; and the obtained fixed content and the obtained material are filled into the promotion information template to obtain the promotion information.
步骤506:根据所确定的推广位置,将推广信息添加到推广文章中相应的推广位置。Step 506: Add the promotion information to the corresponding promotion location in the promotion article according to the determined promotion location.
步骤507:发送添加有所述推广信息的推广文章至第二客户端。Step 507: Send a promotion article to which the promotion information is added to the second client.
步骤508:第二客户端显示推广文章。Step 508: The second client displays the promotion article.
本发明实施例还提供了一种基于自媒体平台的文章处理装置300,参见图12,图12为本发明实施例提供的基于自媒体平台的文章处理装置的组成结构示意图,包括:The embodiment of the present invention further provides an article processing apparatus 300 based on a self-media platform. Referring to FIG. 12, FIG. 12 is a schematic structural diagram of an article processing apparatus based on a self-media platform according to an embodiment of the present invention, including:
接收单元31,配置为接收客户端发送的目标文章,其中,所述客户端用于连接所述自媒体平台,所述目标文章由所述自媒体平台的用户通过所述第一客户端提交;The receiving unit 31 is configured to receive a target article sent by the client, where the client is used to connect to the self-media platform, and the target article is submitted by the user of the self-media platform through the first client;
确定单元32,配置为在所述目标文章中确定用于呈现推广信息的推广文章、以及所述推广文章中用于添加推广信息的推广位置;The determining unit 32 is configured to determine, in the target article, a promotion article for presenting the promotion information, and a promotion location for adding the promotion information in the promotion article;
在存储于所述自媒体平台的候选推广对象中确定目标推广对象、以及与所述目标推广对象匹配的素材;Determining, in a candidate promotion object stored in the self-media platform, a target promotion object and a material matching the target promotion object;
生成单元33,配置为根据所确定的与所述目标推广对象匹配的素材生成推广信息;The generating unit 33 is configured to generate promotion information according to the determined material that matches the target promotion object;
添加单元34,配置为根据所确定的所述推广位置,将所述推广信息添 加到所述推广文章中相应的推广位置;The adding unit 34 is configured to add the promotion information according to the determined promotion location Add to the corresponding promotion location in the promotion article;
发送单元35,配置为发送添加有所述推广信息的所述推广文章。The sending unit 35 is configured to send the promotion article to which the promotion information is added.
在一些实施例中,所述确定单元32,还配置为将所述目标文章的主题特征与候选推广对象的主题特征进行主题相似度计算,将满足主题相似度条件的目标文章确定为所述推广文章;In some embodiments, the determining unit 32 is further configured to perform topic similarity calculation on the topic feature of the target article and the topic feature of the candidate promotion object, and determine the target article that satisfies the topic similarity condition as the promotion. article;
以及配置为基于确定的所述推广文章,将所述推广文章的内容特征与所述候选推广对象的内容特征进行内容相似度计算,将满足内容相似度条件的候选推广对象确定为目标推广对象。And configuring, according to the determined promotion article, performing content similarity calculation on the content feature of the promotion article and the content feature of the candidate promotion object, and determining the candidate promotion object satisfying the content similarity condition as the target promotion object.
在一些实施例中,所述确定单元32,还配置为将历史目标文章的内容特征与所述候选推广对象的内容特征进行内容相似度计算,将满足内容相似度条件的候选推广对象确定为目标推广对象;In some embodiments, the determining unit 32 is further configured to perform content similarity calculation on the content feature of the historical target article and the content feature of the candidate promotion object, and determine the candidate promotion object that satisfies the content similarity condition as the target. Promotion target;
其中,所述历史目标文章先于所述目标文章在所述自媒体平台接收并发送;The historical target article is received and sent by the self-media platform prior to the target article;
以及配置为基于确定的所述目标推广对象,将所述目标推广对象的主题特征与所述目标文章的主题特征进行主题相似度计算,将满足主题相似度条件的目标文章确定为所述推广文章。And configuring, according to the determined target promotion object, performing theme similarity calculation on the theme feature of the target promotion object and the theme feature of the target article, and determining a target article satisfying the topic similarity condition as the promotion article .
在一些实施例中,所述确定单元32,还配置为根据所述推广文章包括的主题特征,在所述推广文章中确定具有所包括的主题特征的段落;In some embodiments, the determining unit 32 is further configured to determine, in the promotion article, a paragraph having the included topic feature according to the topic feature included in the promotion article;
当所包括的主题特征与所述推广信息的主题特征满足主题相似度条件时,确定所述段落的位置为用于添加所述推广信息的推广位置。When the included topic feature and the topic feature of the promotion information satisfy the topic similarity condition, determining the location of the paragraph is a promotion location for adding the promotion information.
在一些实施例中,所述确定单元32,还配置为当在所述推广文章中相邻段落之间的位置添加所述推广信息时,根据所述推广文章的内容样式中相同类型的内容是否被所述推广信息分割,和/或所述推广信息在所述内容样式中所占用的显示比例,确定相应的完整度;In some embodiments, the determining unit 32 is further configured to: when the promotion information is added at a position between adjacent paragraphs in the promotion article, according to whether the same type of content in the content style of the promotion article is Segmented by the promotion information, and/or a display ratio occupied by the promotion information in the content style, determining a corresponding completeness;
当所述完整度满足完整度条件时,确定所述相邻段落之间的位置为添 加所述推广信息的推广位置。When the completeness satisfies the integrity condition, determining the position between the adjacent paragraphs is Add the promotion location of the promotion information.
在一些实施例中,所述确定单元32,还配置为将从所述待发布文章提取的关键词,输入根据特征词进行主题分类的分类器模型,获得所述分类器模型计算输出的所述待发布文章对应的主题;In some embodiments, the determining unit 32 is further configured to input a keyword extracted from the to-be-published article into a classifier model that classifies topics according to feature words, and obtain the calculated output of the classifier model. The topic corresponding to the article to be published;
将从所述候选推广对象的素材提取的关键词,输入根据特征词进行主题分类的分类器模型,获得所述候选推广对象对应的主题;Inputting a keyword extracted from the material of the candidate promotion object into a classifier model that performs topic classification according to the feature word, and obtaining a topic corresponding to the candidate promotion object;
根据所述待发布文章所对应主题与所述候选推广对象所对应主题的语义距离,确定与所述语义距离负相关关系的主题相似度。And determining a topic similarity that is negatively related to the semantic distance according to a semantic distance of a topic corresponding to the to-be-published article and a topic corresponding to the candidate promotion object.
在一些实施例中,所述确定单元32,还配置为对所述候选推广对象的素材以及所述推广文章执行以下类型至少之一的特征提取操作:提取由颜色、纹理和形状构成的图像特征;进行分词处理,对分词结果过滤掉停用词,得到由特征词构成的文本特征;In some embodiments, the determining unit 32 is further configured to perform a feature extraction operation on at least one of the following types of material of the candidate promotion object and the promotion article: extracting image features composed of colors, textures, and shapes Perform word segmentation processing, filter out the stop words on the word segmentation results, and obtain text features composed of feature words;
确定所述候选推广对象与所述推广文章关于至少一个类型特征的相似度:Determining the similarity between the candidate promotion object and the promotion article regarding at least one type of feature:
将满足相应类型特征的相似度条件的候选推广对象确定为目标推广对象。The candidate promotion object that satisfies the similarity condition of the corresponding type feature is determined as the target promotion object.
在一些实施例中,所述确定单元32,还配置为确定与所述推广文章的主题满足主题相似度条件的候选推广对象,形成候选推广对象集合;In some embodiments, the determining unit 32 is further configured to: determine a candidate promotion object that satisfies a topic similarity condition with a topic of the promotion article, and form a candidate promotion object set;
确定所述候选推广对象集合中各候选推广对象与所述推广文章关于至少一个类型特征的相似度,所述特征包括图像特征和文本特征;Determining a similarity between each candidate promotion object in the candidate promotion object set and the promotion article with respect to at least one type of feature, the image includes an image feature and a text feature;
将相似度满足相应类型特征的相似度条件的候选推广对象确定为目标推广对象。The candidate promotion object whose similarity satisfies the similarity condition of the corresponding type feature is determined as the target promotion object.
在一些实施例中,所述确定单元32,还配置为将候选推广对象的至少一个类型的特征,输入根据特征词进行主题分类的分类器模型,获得所述分类器模型计算输出的所述候选推广对象所属的主题; In some embodiments, the determining unit 32 is further configured to input at least one type of feature of the candidate promotion object into a classifier model that performs topic classification according to the feature word, and obtain the candidate of the classifier model calculation output. Promote the subject to which the object belongs;
当映射得到的主题与所述推广文章的主题的相似度超出主题相似度阈值时,确定为与所述推广文章的主题满足主题相似度条件的候选推广对象。When the similarity between the mapped topic and the topic of the promotion article exceeds the topic similarity threshold, the candidate promotion object that satisfies the topic similarity condition with the topic of the promotion article is determined.
在一些实施例中,所述确定单元32,还配置为确定候选推广对象的图像特征与所述推广文章的文本特征的相似度;In some embodiments, the determining unit 32 is further configured to determine a similarity between an image feature of the candidate promotion object and a text feature of the promotion article;
当确定的所述相似度超出文字与图像相似度阈值时,确定所述候选推广对象为目标推广对象。When the determined similarity exceeds the text and image similarity threshold, it is determined that the candidate promotion object is the target promotion object.
在一些实施例中,所述确定单元32,还配置为从所述推广文章中提取人物关键字;In some embodiments, the determining unit 32 is further configured to extract a character keyword from the promotion article;
将所述人物关键字和所述目标推广对象的标签关键字至少之一,与所述目标推广对象的模板内容组合,形成与所述目标推广对象对应的文字素材。And combining at least one of the character keyword and the tag keyword of the target promotion object with the template content of the target promotion object to form a character material corresponding to the target promotion object.
在一些实施例中,所述确定单元32,还配置为对所述目标推广对象进行图像识别,得到表征所述推广对象属性的图像识别结果;In some embodiments, the determining unit 32 is further configured to perform image recognition on the target promotion object, and obtain an image recognition result that is used to represent the promotion object attribute;
将所述图像识别结果与所述目标推广对象的描述信息组合,形成与所述目标推广对象对应的文字素材。Combining the image recognition result with the description information of the target promotion object to form a text material corresponding to the target promotion object.
在一些实施例中,所述确定单元32,还配置为当候选推广对象的图像素材的图像特征、与所述推广文章的图像特征满足图像特征的匹配条件时,将满足所述匹配条件的图像素材作为与所述目标推广对象对应的图像素材。In some embodiments, the determining unit 32 is further configured to: when the image feature of the image material of the candidate promotion object and the image feature of the promotion article satisfy the matching condition of the image feature, the image that satisfies the matching condition is The material is an image material corresponding to the target promotion object.
在一些实施例中,所述生成单元33,还配置为获得用于在所述推广信息中首先呈现的固定内容,所述固定内容用于引导观看添加后的所述推广信息;In some embodiments, the generating unit 33 is further configured to obtain fixed content for first presentation in the promotion information, where the fixed content is used to guide viewing the added promotion information;
将所述固定内容、以及所获得的素材填充至推广信息模板,得到所述推广信息。The fixed content and the obtained material are filled into a promotion information template to obtain the promotion information.
本发明实施例还提供了一种服务器,包括: The embodiment of the invention further provides a server, including:
存储器,配置为存储可执行程序;a memory configured to store an executable program;
处理器,配置为执行所述存储器中存储的可执行程序时,实现上述基于自媒体平台的文章处理方法。The processor, configured to execute the executable program stored in the memory, implements the above-described self-media platform-based article processing method.
本发明实施例还提供了一种可读存储介质,存储介质可以包括:移动存储设备、随机存取存储器(RAM,Random Access Memory)、只读存储器(ROM,Read-Only Memory)、磁碟或者光盘等各种可以存储程序代码的介质。所述可读存储介质存储有可执行程序;The embodiment of the present invention further provides a readable storage medium, which may include: a mobile storage device, a random access memory (RAM), a read-only memory (ROM), a magnetic disk, or A variety of media such as optical discs that can store program code. The readable storage medium stores an executable program;
所述可执行程序,用于被处理器执行时实现上述基于自媒体平台的文章处理方法。The executable program is configured to implement the above-described self-media platform-based article processing method when executed by a processor.
以上所述,仅为本发明的具体实施方式,但本发明的保护范围并不局限于此,任何熟悉本技术领域的技术人员在本发明揭露的技术范围内,可轻易想到变化或替换,都应涵盖在本发明的保护范围之内。因此,本发明的保护范围应以所述权利要求的保护范围为准。The above is only a specific embodiment of the present invention, but the scope of the present invention is not limited thereto, and any person skilled in the art can easily think of changes or substitutions within the technical scope of the present invention. It should be covered by the scope of the present invention. Therefore, the scope of the invention should be determined by the scope of the appended claims.
工业实用性Industrial applicability
本发明实施例接收客户端发送的目标文章,其中,所述客户端用于连接所述自媒体平台,所述目标文章由所述自媒体平台的用户通过所述客户端提交;在所述目标文章中确定用于呈现推广信息的推广文章、以及所述推广文章中用于添加推广信息的推广位置;在存储于所述自媒体平台的候选推广对象中确定目标推广对象、以及与所述目标推广对象匹配的素材;根据所确定的与所述目标推广对象匹配的素材生成推广信息;根据所确定的所述推广位置,将所述推广信息添加到所述推广文章中相应的推广位置;发送添加有所述推广信息的所述推广文章。如此,目标文章的来源可以来自社交网络中任意一个用户终端,打破了靠征集特定主题的文章的局限性,可实现推广信息的批量化和自动化添加;自动实现推广信息的位置的选定,位置灵活,能够避免推广信息的出现突兀,使得文章内容与推广信息的内 容衔接自然;通过文章发布以及触达用户的过程完成推广信息的传递,依赖自媒体平台自身的发布/发送流量实现了推广信息,推广信息得以覆盖自媒体平台的访问流量并实时触达用户。 The embodiment of the present invention receives a target article sent by a client, where the client is used to connect to the self-media platform, and the target article is submitted by a user of the self-media platform through the client; Determining, in the article, a promotion article for presenting the promotion information, and a promotion location for adding the promotion information in the promotion article; determining a target promotion object and the target in the candidate promotion object stored in the self-media platform Promoting the matching material of the object; generating the promotion information according to the determined material matching the target promotion object; adding the promotion information to the corresponding promotion location in the promotion article according to the determined promotion location; sending The promotion article with the promotion information is added. In this way, the source of the target article can come from any user terminal in the social network, breaking the limitation of the article collecting the specific topic, and realizing the batching and automatic addition of the promotion information; automatically selecting the location of the promotion information, the location Flexible, able to avoid the abrupt appearance of promotional information, making the content of the article and the promotion information The content is connected naturally; through the release of the article and the process of reaching the user, the delivery of the promotion information is completed, and the promotion information is realized by relying on the release/send traffic of the self-media platform itself, and the promotion information can cover the access traffic from the media platform and reach the user in real time.

Claims (21)

  1. 一种基于自媒体平台的文章处理方法,包括:An article processing method based on a self-media platform, comprising:
    接收客户端发送的目标文章,其中,所述客户端用于连接所述自媒体平台,所述目标文章由所述自媒体平台的用户通过所述客户端提交;Receiving a target article sent by the client, where the client is used to connect to the self-media platform, and the target article is submitted by the user of the self-media platform through the client;
    在所述目标文章中确定用于呈现推广信息的推广文章、以及所述推广文章中用于添加推广信息的推广位置;Determining, in the target article, a promotion article for presenting promotion information, and a promotion location for adding promotion information in the promotion article;
    在存储于所述自媒体平台的候选推广对象中确定目标推广对象、以及与所述目标推广对象匹配的素材;Determining, in a candidate promotion object stored in the self-media platform, a target promotion object and a material matching the target promotion object;
    根据所确定的与所述目标推广对象匹配的素材生成推广信息;Generating promotion information according to the determined material that matches the target promotion object;
    根据所确定的所述推广位置,将所述推广信息添加到所述推广文章中相应的推广位置;And adding the promotion information to a corresponding promotion location in the promotion article according to the determined promotion location;
    发送添加有所述推广信息的所述推广文章。The promotion article to which the promotion information is added is sent.
  2. 如权利要求1所述的方法,其中,所述在所述目标文章中确定用于呈现推广信息的推广文章,包括:The method of claim 1 wherein said determining a promotional article for presenting promotional information in said target article comprises:
    将所述目标文章的主题特征与候选推广对象的主题特征进行主题相似度计算,将满足主题相似度条件的目标文章确定为所述推广文章;Performing topic similarity calculation on the topic feature of the target article and the topic feature of the candidate promotion object, and determining the target article satisfying the topic similarity condition as the promotion article;
    所述在存储于所述自媒体平台的候选推广对象中确定目标推广对象,包括:Determining the target promotion object in the candidate promotion objects stored in the self-media platform, including:
    基于确定的所述推广文章,将所述推广文章的内容特征与所述候选推广对象的内容特征进行内容相似度计算,将满足内容相似度条件的候选推广对象确定为目标推广对象。Based on the determined promotion article, the content feature of the promotion article and the content feature of the candidate promotion object are subjected to content similarity calculation, and the candidate promotion object satisfying the content similarity condition is determined as the target promotion object.
  3. 如权利要求1所述的方法,其中,所述在存储于所述自媒体平台的候选推广对象中确定目标推广对象,包括:The method of claim 1, wherein the determining the target promotion object among the candidate promotion objects stored in the self-media platform comprises:
    将历史目标文章的内容特征与所述候选推广对象的内容特征进行内容相似度计算,将满足内容相似度条件的候选推广对象确定为目标推广对象; Performing content similarity calculation on the content feature of the historical target article and the content feature of the candidate promotion object, and determining the candidate promotion object satisfying the content similarity condition as the target promotion object;
    其中,所述历史目标文章先于所述目标文章在所述自媒体平台接收并发送;The historical target article is received and sent by the self-media platform prior to the target article;
    所述在所述目标文章中确定用于呈现推广信息的推广文章,包括:Determining, in the target article, a promotion article for presenting promotion information, including:
    基于确定的所述目标推广对象,将所述目标推广对象的主题特征与所述目标文章的主题特征进行主题相似度计算,将满足主题相似度条件的目标文章确定为所述推广文章。Based on the determined target promotion object, the theme feature of the target promotion object and the theme feature of the target article are subjected to topic similarity calculation, and the target article satisfying the topic similarity condition is determined as the promotion article.
  4. 如权利要求1或2所述的方法,其中,所述确定所述推广文章中用于添加推广信息的推广位置,包括:The method according to claim 1 or 2, wherein said determining a promotion location for adding promotion information in said promotion article comprises:
    根据所述推广文章包括的主题特征,在所述推广文章中确定具有所包括的主题特征的段落;Determining a paragraph having the included topic feature in the promotion article according to the topic feature included in the promotion article;
    当所包括的主题特征与所述推广信息的主题特征满足主题相似度条件时,确定所述段落的位置为用于添加所述推广信息的推广位置。When the included topic feature and the topic feature of the promotion information satisfy the topic similarity condition, determining the location of the paragraph is a promotion location for adding the promotion information.
  5. 如权利要求1或2所述的方法,其中,所述确定所述推广文章中用于添加推广信息的推广位置,包括:The method according to claim 1 or 2, wherein said determining a promotion location for adding promotion information in said promotion article comprises:
    当在所述推广文章中相邻段落之间的位置添加所述推广信息时,When the promotion information is added at a position between adjacent paragraphs in the promotion article,
    根据所述推广文章的内容样式中相同类型的内容是否被所述推广信息分割,和/或所述推广信息在所述内容样式中所占用的显示比例,确定相应的完整度;Whether the content of the same type in the content style of the promotion article is segmented by the promotion information, and/or the display ratio occupied by the promotion information in the content style, determining a corresponding completeness;
    当所述完整度满足完整度条件时,确定所述相邻段落之间的位置为添加所述推广信息的推广位置。When the degree of completeness satisfies the integrity condition, determining a position between the adjacent paragraphs is a promotion position to which the promotion information is added.
  6. 如权利要求2所述的方法,其中,所述将确定的所述推广文章的内容特征与所述候选推广对象的内容特征进行内容相似度计算,将满足内容相似度条件的候选推广对象确定为目标推广对象,包括:The method according to claim 2, wherein the content feature of the determined promotion article and the content feature of the candidate promotion object are subjected to content similarity calculation, and the candidate promotion object satisfying the content similarity condition is determined as Target promotion targets, including:
    确定与所述推广文章的主题满足主题相似度条件的候选推广对象,形成候选推广对象集合; Determining a candidate promotion object that satisfies a topic similarity condition with a topic of the promotion article, and forming a candidate promotion object set;
    确定所述候选推广对象集合中各候选推广对象与所述推广文章关于至少一个类型特征的相似度,所述特征包括图像特征和文本特征;Determining a similarity between each candidate promotion object in the candidate promotion object set and the promotion article with respect to at least one type of feature, the image includes an image feature and a text feature;
    将相似度满足相应类型特征的相似度条件的候选推广对象确定为目标推广对象。The candidate promotion object whose similarity satisfies the similarity condition of the corresponding type feature is determined as the target promotion object.
  7. 如权利要求6所述的方法,其中,所述确定与所述推广文章的主题满足主题相似度条件的候选推广对象集合,包括:The method of claim 6, wherein the determining the set of candidate promotion objects that satisfy the topic similarity condition with the topic of the promotion article comprises:
    将候选推广对象的至少一个类型的特征,输入根据特征词进行主题分类的分类器模型,获得所述分类器模型计算输出的所述候选推广对象所属的主题;Entering at least one type of feature of the candidate promotion object into a classifier model for classifying the topic according to the feature word, and obtaining a topic to which the candidate promotion object is calculated and output by the classifier model;
    当映射得到的主题与所述推广文章的主题的相似度超出主题相似度阈值时,确定为与所述推广文章的主题满足主题相似度条件的候选推广对象。When the similarity between the mapped topic and the topic of the promotion article exceeds the topic similarity threshold, the candidate promotion object that satisfies the topic similarity condition with the topic of the promotion article is determined.
  8. 如权利要求1所述的方法,其中,所述确定与所述目标推广对象匹配的素材,包括:The method of claim 1, wherein the determining the material that matches the target promotion object comprises:
    对所述目标推广对象进行图像识别,得到表征所述推广对象属性的图像识别结果;Performing image recognition on the target promotion object, and obtaining an image recognition result characterizing the attribute of the promotion object;
    将所述图像识别结果与所述目标推广对象的描述信息组合,形成与所述目标推广对象对应的文字素材。Combining the image recognition result with the description information of the target promotion object to form a text material corresponding to the target promotion object.
  9. 如权利要求1所述的方法,其中,所述确定与所述目标推广对象匹配的素材,包括:The method of claim 1, wherein the determining the material that matches the target promotion object comprises:
    当候选推广对象的图像素材的图像特征、与所述推广文章的图像特征满足图像特征的匹配条件时,将满足所述匹配条件的图像素材作为与所述目标推广对象对应的图像素材。When the image feature of the image material of the candidate promotion object and the image feature of the promotion article satisfy the matching condition of the image feature, the image material that satisfies the matching condition is used as the image material corresponding to the target promotion object.
  10. 如权利要求1所述的方法,其中,所述根据所确定的与所述目标推广对象匹配的素材生成推广信息,包括:The method of claim 1, wherein the generating the promotion information according to the determined material that matches the target promotion object comprises:
    获得用于在所述推广信息中首先呈现的固定内容,所述固定内容用于 引导观看添加后的所述推广信息;Obtaining fixed content for first presentation in the promotion information, the fixed content being used for Guiding to view the added promotional information;
    将所述固定内容、以及所获得的素材填充至推广信息模板,得到所述推广信息。The fixed content and the obtained material are filled into a promotion information template to obtain the promotion information.
  11. 一种基于自媒体平台的文章处理方法,所述方法由服务器执行,所述服务器包括有一个或多个处理器以及存储器,以及一个或一个以上的程序,其中,所述一个或一个以上的程序存储于存储器中,所述程序可以包括一个或一个以上的每一个对应于一组指令的单元,所述一个或多个处理器被配置为执行指令;所述方法包括:An article processing method based on a self-media platform, the method being performed by a server, the server comprising one or more processors and a memory, and one or more programs, wherein the one or more programs Stored in a memory, the program can include one or more units each corresponding to a set of instructions, the one or more processors being configured to execute instructions; the method comprising:
    接收客户端发送的目标文章,其中,所述客户端用于连接所述自媒体平台,所述目标文章由所述自媒体平台的用户通过所述客户端提交;Receiving a target article sent by the client, where the client is used to connect to the self-media platform, and the target article is submitted by the user of the self-media platform through the client;
    在所述目标文章中确定用于呈现推广信息的推广文章、以及所述推广文章中用于添加推广信息的推广位置;Determining, in the target article, a promotion article for presenting promotion information, and a promotion location for adding promotion information in the promotion article;
    在存储于所述自媒体平台的候选推广对象中确定目标推广对象、以及与所述目标推广对象匹配的素材;Determining, in a candidate promotion object stored in the self-media platform, a target promotion object and a material matching the target promotion object;
    根据所确定的与所述目标推广对象匹配的素材生成推广信息;Generating promotion information according to the determined material that matches the target promotion object;
    根据所确定的所述推广位置,将所述推广信息添加到所述推广文章中相应的推广位置;And adding the promotion information to a corresponding promotion location in the promotion article according to the determined promotion location;
    发送添加有所述推广信息的所述推广文章。The promotion article to which the promotion information is added is sent.
  12. 如权利要求11所述的方法,其中,所述在所述目标文章中确定用于呈现推广信息的推广文章,包括:The method of claim 11 wherein said determining a promotional article for presenting promotional information in said target article comprises:
    将所述目标文章的主题特征与候选推广对象的主题特征进行主题相似度计算,将满足主题相似度条件的目标文章确定为推广文章。A topic similarity calculation is performed on the topic feature of the target article and the topic feature of the candidate promotion object, and the target article satisfying the topic similarity condition is determined as a promotion article.
  13. 如权利要求12所述的方法,其中,所述将所述目标文章的主题特征与候选推广对象的主题特征进行主题相似度计算,包括:The method of claim 12, wherein the subject feature of the target article and the topic feature of the candidate promotion object are subjected to topic similarity calculation, including:
    将从所述目标文章提取的关键词,输入根据特征词进行主题分类的分 类器模型,获得所述分类器模型计算输出的所述目标文章对应的主题;Subdividing the keywords extracted from the target article into sub-categories based on feature words a classifier model, obtaining a theme corresponding to the target article output by the classifier model;
    将从所述候选推广对象的素材提取的关键词,输入根据特征词进行主题分类的分类器模型,获得所述候选推广对象对应的主题;Inputting a keyword extracted from the material of the candidate promotion object into a classifier model that performs topic classification according to the feature word, and obtaining a topic corresponding to the candidate promotion object;
    根据所述目标文章所对应主题与所述候选推广对象所对应主题的语义距离,确定与所述语义距离负相关关系的主题相似度。And determining a topic similarity that is negatively related to the semantic distance according to a semantic distance of a topic corresponding to the target article and a topic corresponding to the candidate promotion object.
  14. 如权利要求11所述的方法,其中,所述在存储于所述自媒体平台的候选推广对象中确定目标推广对象,包括:The method of claim 11, wherein the determining the target promotion object among the candidate promotion objects stored in the self-media platform comprises:
    将确定的所述推广文章的内容特征与所述候选推广对象的内容特征进行内容相似度计算,将满足内容相似度条件的候选推广对象确定为目标推广对象。And determining the content similarity of the determined content feature of the promotion article and the content feature of the candidate promotion object, and determining the candidate promotion object that satisfies the content similarity condition as the target promotion object.
  15. 如权利要求14所述的方法,其中,所述将确定的所述推广文章的内容特征与所述候选推广对象的内容特征进行内容相似度计算,将满足内容相似度条件的推广对象确定为目标推广对象,包括:The method according to claim 14, wherein the content feature of the determined promotion article and the content feature of the candidate promotion object are subjected to content similarity calculation, and the promotion object satisfying the content similarity condition is determined as a target. Promotion targets, including:
    对所述候选推广对象的素材以及所述推广文章执行以下类型至少之一的特征提取操作:提取由颜色、纹理和形状构成的图像特征;进行分词处理,对分词结果过滤掉停用词,得到由特征词构成的文本特征;Performing a feature extraction operation on at least one of the following types of material of the candidate promotion object and the promotion article: extracting image features composed of colors, textures, and shapes; performing word segmentation processing, filtering out stop words for word segmentation results, and obtaining a text feature consisting of feature words;
    确定所述候选推广对象与所述推广文章关于至少一个类型特征的相似度:Determining the similarity between the candidate promotion object and the promotion article regarding at least one type of feature:
    将满足相应类型特征的相似度条件的候选推广对象确定为目标推广对象。The candidate promotion object that satisfies the similarity condition of the corresponding type feature is determined as the target promotion object.
  16. 如权利要求14所述的方法,其中,所述将确定的所述推广文章的内容特征与所述候选推广对象的内容特征进行内容相似度计算,将满足内容相似度条件的候选推广对象确定为目标推广对象,包括:The method according to claim 14, wherein the content feature of the determined promotion article and the content feature of the candidate promotion object are subjected to content similarity calculation, and the candidate promotion object satisfying the content similarity condition is determined as Target promotion targets, including:
    确定候选推广对象的图像特征与所述推广文章的文本特征的相似度;Determining a similarity between an image feature of the candidate promotion object and a text feature of the promotion article;
    当确定的所述相似度超出文字与图像相似度阈值时,确定所述候选推 广对象为目标推广对象。Determining the candidate push when the determined similarity exceeds a text and image similarity threshold A wide object is a target promotion object.
  17. 如权利要求11所述的方法,其中,所述确定与所述目标推广对象匹配的素材,包括:The method of claim 11 wherein said determining a material that matches said target promotion object comprises:
    从所述推广文章中提取人物关键字;Extracting a character keyword from the promotion article;
    将所述人物关键字和所述目标推广对象的标签关键字至少之一,与所述目标推广对象的模板内容组合,形成与所述目标推广对象对应的文字素材。And combining at least one of the character keyword and the tag keyword of the target promotion object with the template content of the target promotion object to form a character material corresponding to the target promotion object.
  18. 一种基于自媒体平台的文章处理装置,包括:An article processing device based on a self-media platform, comprising:
    接收单元,配置为接收客户端发送的目标文章,其中,所述客户端用于连接所述自媒体平台,所述目标文章由所述自媒体平台的用户通过所述客户端提交;a receiving unit, configured to receive a target article sent by the client, where the client is used to connect to the self-media platform, and the target article is submitted by the user of the self-media platform through the client;
    确定单元,配置为在所述目标文章中确定用于呈现推广信息的推广文章、以及所述推广文章中用于添加推广信息的推广位置;a determining unit configured to determine, in the target article, a promotion article for presenting the promotion information, and a promotion location for adding the promotion information in the promotion article;
    以及,配置为在存储于所述自媒体平台的候选推广对象中确定目标推广对象、以及与所述目标推广对象匹配的素材;And configured to determine a target promotion object and a material matching the target promotion object among the candidate promotion objects stored in the self-media platform;
    生成单元,配置为根据所确定的与所述目标推广对象匹配的素材生成推广信息;a generating unit, configured to generate promotion information according to the determined material that matches the target promotion object;
    添加单元,配置为根据所确定的所述推广位置,将所述推广信息添加到所述推广文章中相应的推广位置;Adding a unit, configured to add the promotion information to a corresponding promotion location in the promotion article according to the determined promotion location;
    发送单元,配置为发送添加有所述推广信息的所述推广文章。The sending unit is configured to send the promotion article to which the promotion information is added.
  19. 一种服务器,包括:A server that includes:
    存储器,配置为存储可执行程序;a memory configured to store an executable program;
    处理器,配置为执行所述存储器中存储的可执行程序时,实现如权利要求1至10任一项所述的基于自媒体平台的文章处理方法。The self-media platform-based article processing method according to any one of claims 1 to 10, when the processor is configured to execute the executable program stored in the memory.
  20. 一种存储介质,存储有可执行程序,所述可执行程序被处理器执 行时,实现如权利要求1至10任一项所述的基于自媒体平台的文章处理方法。A storage medium storing an executable program, the executable program being executed by a processor In the line, the self-media platform-based article processing method according to any one of claims 1 to 10 is implemented.
  21. 一种存储介质,存储有可执行程序,所述可执行程序被处理器执行时,实现如权利要求11至17任一项所述的基于自媒体平台的文章处理方法。 A storage medium storing an executable program, the executable program being executed by a processor, implementing the self-media platform-based article processing method according to any one of claims 11 to 17.
PCT/CN2017/116646 2017-12-15 2017-12-15 Method, device, and server for processing written articles, and storage medium WO2019113977A1 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN201780054780.XA CN110325986B (en) 2017-12-15 2017-12-15 Article processing method, article processing device, server and storage medium
PCT/CN2017/116646 WO2019113977A1 (en) 2017-12-15 2017-12-15 Method, device, and server for processing written articles, and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2017/116646 WO2019113977A1 (en) 2017-12-15 2017-12-15 Method, device, and server for processing written articles, and storage medium

Publications (1)

Publication Number Publication Date
WO2019113977A1 true WO2019113977A1 (en) 2019-06-20

Family

ID=66818894

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2017/116646 WO2019113977A1 (en) 2017-12-15 2017-12-15 Method, device, and server for processing written articles, and storage medium

Country Status (2)

Country Link
CN (1) CN110325986B (en)
WO (1) WO2019113977A1 (en)

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110334356A (en) * 2019-07-15 2019-10-15 腾讯科技(深圳)有限公司 Article matter method for determination of amount, article screening technique and corresponding device
CN110737783A (en) * 2019-10-08 2020-01-31 腾讯科技(深圳)有限公司 method, device and computing equipment for recommending multimedia content
CN110781377A (en) * 2019-09-03 2020-02-11 腾讯科技(深圳)有限公司 Article recommendation method and device
CN110874313A (en) * 2019-11-18 2020-03-10 北京百度网讯科技有限公司 Writing tool testing method and device
CN111353532A (en) * 2020-02-26 2020-06-30 北京三快在线科技有限公司 Image generation method and device, computer-readable storage medium and electronic device
CN112149653A (en) * 2020-09-16 2020-12-29 北京达佳互联信息技术有限公司 Information processing method, information processing device, electronic equipment and storage medium
CN112364610A (en) * 2020-12-01 2021-02-12 深圳市房多多网络科技有限公司 Method and device for inserting building card in house source article and computing equipment
CN112465530A (en) * 2019-09-06 2021-03-09 阳光学院 Big data-based tool for network marketing

Families Citing this family (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111210258A (en) * 2019-12-23 2020-05-29 北京三快在线科技有限公司 Advertisement putting method and device, electronic equipment and readable storage medium
CN111292134A (en) * 2020-02-25 2020-06-16 上海昌投网络科技有限公司 Method and device for judging whether WeChat public number can be advertised
CN111885399B (en) * 2020-06-29 2023-06-13 腾讯科技(武汉)有限公司 Content distribution method, device, electronic equipment and storage medium
CN112800083B (en) * 2021-02-24 2022-03-18 山东省住房和城乡建设发展研究院 Government decision-oriented government affair big data analysis method and equipment
CN113379481A (en) * 2021-05-25 2021-09-10 北京大米科技有限公司 Data processing method and device
CN115271822B (en) * 2022-08-11 2023-08-11 北京创新乐知网络技术有限公司 Popularization information delivery method and device

Citations (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1674001A (en) * 2005-04-04 2005-09-28 栾奕 Method for establishing key word indx advertisement for articles in internet
US20060116926A1 (en) * 2004-11-27 2006-06-01 Chen Michael W Method and system for internet publishing and advertising forums
CN101071443A (en) * 2007-06-26 2007-11-14 腾讯科技(深圳)有限公司 Content-related advertising identifying method and content-related advertising server
US20090312040A1 (en) * 2008-06-13 2009-12-17 Embarq Holdings Company, Llc System and method for inserting advertisements into SMS messages
CN102262632A (en) * 2010-05-28 2011-11-30 国际商业机器公司 Method and system for processing text
CN102402763A (en) * 2011-11-30 2012-04-04 江苏奇异点网络有限公司 Method of inserting advertisements into documents of document service website
CN103177383A (en) * 2013-03-21 2013-06-26 北京亿部文化有限公司 Method for implanting advertisements in electronic books
CN103853824A (en) * 2014-03-03 2014-06-11 沈之锐 In-text advertisement releasing method and system based on deep semantic mining
CN105593888A (en) * 2013-10-08 2016-05-18 株式会社纬兹 Advertisement informatin sharing system
CN106326379A (en) * 2016-08-16 2017-01-11 廖文广 Management system and method for embedded advertisement in webpage article

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20020049791A1 (en) * 2000-07-11 2002-04-25 Bridgewell, Inc. Method and system for using a personal electronic document for advertising
TWI352934B (en) * 2007-11-27 2011-11-21 Inst Information Industry Advertisement selection systems and methods for in

Patent Citations (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20060116926A1 (en) * 2004-11-27 2006-06-01 Chen Michael W Method and system for internet publishing and advertising forums
CN1674001A (en) * 2005-04-04 2005-09-28 栾奕 Method for establishing key word indx advertisement for articles in internet
CN101071443A (en) * 2007-06-26 2007-11-14 腾讯科技(深圳)有限公司 Content-related advertising identifying method and content-related advertising server
US20090312040A1 (en) * 2008-06-13 2009-12-17 Embarq Holdings Company, Llc System and method for inserting advertisements into SMS messages
CN102262632A (en) * 2010-05-28 2011-11-30 国际商业机器公司 Method and system for processing text
CN102402763A (en) * 2011-11-30 2012-04-04 江苏奇异点网络有限公司 Method of inserting advertisements into documents of document service website
CN103177383A (en) * 2013-03-21 2013-06-26 北京亿部文化有限公司 Method for implanting advertisements in electronic books
CN105593888A (en) * 2013-10-08 2016-05-18 株式会社纬兹 Advertisement informatin sharing system
CN103853824A (en) * 2014-03-03 2014-06-11 沈之锐 In-text advertisement releasing method and system based on deep semantic mining
CN106326379A (en) * 2016-08-16 2017-01-11 廖文广 Management system and method for embedded advertisement in webpage article

Cited By (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110334356B (en) * 2019-07-15 2023-08-04 腾讯科技(深圳)有限公司 Article quality determining method, article screening method and corresponding device
CN110334356A (en) * 2019-07-15 2019-10-15 腾讯科技(深圳)有限公司 Article matter method for determination of amount, article screening technique and corresponding device
CN110781377A (en) * 2019-09-03 2020-02-11 腾讯科技(深圳)有限公司 Article recommendation method and device
CN110781377B (en) * 2019-09-03 2024-02-20 深圳市雅阅科技有限公司 Article recommendation method and device
CN112465530A (en) * 2019-09-06 2021-03-09 阳光学院 Big data-based tool for network marketing
CN110737783A (en) * 2019-10-08 2020-01-31 腾讯科技(深圳)有限公司 method, device and computing equipment for recommending multimedia content
CN110737783B (en) * 2019-10-08 2023-01-17 腾讯科技(深圳)有限公司 Method and device for recommending multimedia content and computing equipment
CN110874313B (en) * 2019-11-18 2023-07-25 北京百度网讯科技有限公司 Writing tool testing method and device
CN110874313A (en) * 2019-11-18 2020-03-10 北京百度网讯科技有限公司 Writing tool testing method and device
CN111353532A (en) * 2020-02-26 2020-06-30 北京三快在线科技有限公司 Image generation method and device, computer-readable storage medium and electronic device
CN112149653A (en) * 2020-09-16 2020-12-29 北京达佳互联信息技术有限公司 Information processing method, information processing device, electronic equipment and storage medium
CN112149653B (en) * 2020-09-16 2024-03-29 北京达佳互联信息技术有限公司 Information processing method, information processing device, electronic equipment and storage medium
CN112364610A (en) * 2020-12-01 2021-02-12 深圳市房多多网络科技有限公司 Method and device for inserting building card in house source article and computing equipment

Also Published As

Publication number Publication date
CN110325986B (en) 2022-02-11
CN110325986A (en) 2019-10-11

Similar Documents

Publication Publication Date Title
WO2019113977A1 (en) Method, device, and server for processing written articles, and storage medium
Kumar et al. Sentiment analysis of multimodal twitter data
Alzate et al. Mining the text of online consumer reviews to analyze brand image and brand positioning
Klostermann et al. Extracting brand information from social networks: Integrating image, text, and social tagging data
US10496752B1 (en) Consumer insights analysis using word embeddings
WO2021174890A1 (en) Data recommendation method and apparatus, and computer device and storage medium
US11113714B2 (en) Filtering machine for sponsored content
US9830404B2 (en) Analyzing language dependency structures
US10685183B1 (en) Consumer insights analysis using word embeddings
US11182806B1 (en) Consumer insights analysis by identifying a similarity in public sentiments for a pair of entities
Chehal et al. Implementation and comparison of topic modeling techniques based on user reviews in e-commerce recommendations
US8306962B1 (en) Generating targeted paid search campaigns
US10311479B2 (en) System for producing promotional media content and method thereof
FR3102276A1 (en) METHODS AND SYSTEMS FOR SUMMARIZING MULTIPLE DOCUMENTS USING AN AUTOMATIC LEARNING APPROACH
US20150178786A1 (en) Pictollage: Image-Based Contextual Advertising Through Programmatically Composed Collages
CN107958385B (en) Bidding based on buyer defined function
US20140108143A1 (en) Social content distribution network
US10558759B1 (en) Consumer insights analysis using word embeddings
US10509863B1 (en) Consumer insights analysis using word embeddings
US10803248B1 (en) Consumer insights analysis using word embeddings
Hsu et al. Effects of sentiment on recommendations in social network
US20190303413A1 (en) Embedding media content items in text of electronic documents
US11030539B1 (en) Consumer insights analysis using word embeddings
US20140025496A1 (en) Social content distribution network
Brown et al. Transforming unstructured data into useful information

Legal Events

Date Code Title Description
NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 17934601

Country of ref document: EP

Kind code of ref document: A1