WO2022127339A1 - Website registration-based user portrait generating method and apparatus, device and medium - Google Patents

Website registration-based user portrait generating method and apparatus, device and medium Download PDF

Info

Publication number
WO2022127339A1
WO2022127339A1 PCT/CN2021/124602 CN2021124602W WO2022127339A1 WO 2022127339 A1 WO2022127339 A1 WO 2022127339A1 CN 2021124602 W CN2021124602 W CN 2021124602W WO 2022127339 A1 WO2022127339 A1 WO 2022127339A1
Authority
WO
WIPO (PCT)
Prior art keywords
registered
user
website
current
user portrait
Prior art date
Application number
PCT/CN2021/124602
Other languages
French (fr)
Chinese (zh)
Inventor
王天宇
Original Assignee
深圳壹账通智能科技有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 深圳壹账通智能科技有限公司 filed Critical 深圳壹账通智能科技有限公司
Publication of WO2022127339A1 publication Critical patent/WO2022127339A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9536Search customisation based on social or collaborative filtering
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/906Clustering; Classification
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/951Indexing; Web crawling techniques
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9535Search customisation based on user profiles and personalisation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/958Organisation or management of web site content, e.g. publishing, maintaining pages or automatic linking

Definitions

  • the present application relates to a method, device, device and medium for generating user portraits based on website registration.
  • the user's attribute label such as education, gender, etc.
  • a method, apparatus, device and medium for generating user portraits based on website registration are provided.
  • a method for generating user portraits based on website registration comprising:
  • the registered website list is to crawl the corresponding registration records including the registered user ID and the registration mark from the server of the preset website in advance, and according to the user ID, the registration mark represents the registered registration record. classified;
  • User portraits are calculated based on the number of registered websites in each category.
  • a device for generating user portraits based on website registration comprising:
  • the website list acquisition module is used to obtain the list of registered websites corresponding to the user, and the registered website list is to crawl the corresponding registration records including the registered user ID and the registration mark from the server of the preset website in advance, and according to the user ID.
  • the registration mark is obtained by classifying the registered registration records;
  • a classification module for comparing the registered website in the website registration list with the identification of the standard website of the preset classification, to classify the registered website
  • the portrait generation module is used to calculate the user portrait according to the number of registered websites in each category.
  • a computer device comprising a memory and one or more processors, the memory having computer-readable instructions stored therein, the computer-readable instructions, when executed by the processor, cause the one or more processors to execute The following steps:
  • the registered website list is to crawl the corresponding registration records including the registered user ID and the registration mark from the server of the preset website in advance, and according to the user ID, the registration mark represents the registered registration record. classified;
  • User portraits are calculated based on the number of registered websites in each category.
  • One or more computer-readable storage media storing computer-readable instructions that, when executed by one or more processors, cause the one or more processors to perform the following steps:
  • the registered website list is to crawl the corresponding registration records including the registered user ID and the registration mark from the server of the preset website in advance, and according to the user ID, the registration mark represents the registered registration record. classified;
  • User portraits are calculated based on the number of registered websites in each category.
  • the above-mentioned method, device, equipment and medium for generating user portraits based on website registration fully consider the user's website registration situation, and quantify and structure each person's website registration situation, so that user portraits can be obtained according to the user's website registration situation, Improves the accuracy of user portraits.
  • FIG. 1 is an application scenario diagram of a method for generating user portraits based on website registration according to one or more embodiments.
  • FIG. 2 is a schematic flowchart of a method for generating user portraits based on website registration according to one or more embodiments.
  • FIG. 3 is a schematic diagram of classification of registration websites according to one or more embodiments.
  • FIG. 4 is a schematic flowchart according to one or more embodiments of step S208 in the embodiment shown in FIG. 2 .
  • FIG. 5 is a flowchart according to another or more embodiments of step S208 in the embodiment shown in FIG. 2 .
  • FIG. 6 is a block diagram of an apparatus for generating user portraits based on website registration according to one or more embodiments.
  • FIG. 7 is a block diagram of a computer device in accordance with one or more embodiments.
  • the method for generating user portraits based on website registration can be applied to the application environment shown in FIG. 1 .
  • the terminal 102 communicates with the server 104 through the network.
  • the server 104 can obtain the list of registered websites corresponding to the user from the terminal 102, for example, by traversing the applications installed in the terminal 102 to obtain the list of corresponding registered websites. Use time and installation time to obtain the corresponding registered website list, so that the server 104 can compare the registered website in the website registration list with the identification of the standard website of the preset classification, to classify the registered website, and count The number of registered websites in each category, so as to calculate the user portrait according to the number of registered websites in each category.
  • the terminal 102 can be, but is not limited to, various personal computers, notebook computers, smart phones, tablet computers and portable wearable devices, and the server 104 can be implemented by an independent server or a server cluster composed of multiple servers.
  • FIG. 2 a method for generating user portraits based on website registration is provided, and the method is applied to the server in FIG. 1 as an example to illustrate, including the following steps:
  • S202 Obtain a list of registered websites corresponding to the user.
  • the list of registered websites is to crawl the corresponding registration records including the registered user ID and the registration logo from the server of the preset website in advance, and classify the registered registration records representing the registration according to the user ID. owned.
  • the list of registered websites may be obtained according to a preset website, for example, crawling information on whether the corresponding user is registered from the server of each website. In order to ensure the security of the user's privacy, in this embodiment only Obtain the information about whether the user is registered. As for the specific information of registration, it will not be crawled. Preferably, the information about whether the user is registered can be set by means of a flag. If the flag is 0, the user is not registered. Otherwise, The user has been registered, and the registered information of the user is stored in the corresponding user's registered website list.
  • the registered website list is obtained by crawling the corresponding registration records including the registered user ID and the registration mark from the server of the preset website in advance, and classifying the registration records representing the registration by the registration mark according to the user ID.
  • the acquiring the list of registered websites corresponding to the user may be connected to the user terminal, acquiring the application programs that have been installed in the user terminal, or being connected to each website server, reading the registered users in each website server, and Generate a list of registered sites.
  • S204 Compare the registered website in the website registration list with the identification of the standard website of the preset classification, so as to classify the registered website.
  • FIG. 3 is a schematic diagram of classification of registration websites in one embodiment.
  • the server can preset the type of registered website, for example, according to the dimensions of the user portrait, such as four dimensions of wealth, risk, interest, and industry, each dimension includes several different types of website collections, and each collection covers several Register the website.
  • the categories of the websites can be preset, for example, including: insurance practitioner websites, insurance websites, car club websites, programmer websites, movie websites, early childhood education websites, two-dimensional websites, legal websites , high-end hotel websites, public exam websites, airline websites, accounting websites, marriage and love websites, refueling and charging websites, construction websites, fitness websites, teacher websites, financial investment service/information websites, overseas Travel/Quality Tourism Websites, Financial Management Websites, International Students Websites, Papers and Periodical Websites, Travel Websites, Food Websites, Beauty and Skin Care Websites, Cute Pet Websites, Maternal and Baby Websites, Quality Life Websites, Cars Maintenance websites, automobile websites, comprehensive automobile portal websites, luxury websites, photography websites, online loan websites, health websites, doctor websites, game websites, primary and secondary education websites, comprehensive education websites, video site.
  • the server classifies registration sites according to the above categories.
  • the server can compare the logo of the registered website with the logo of the preset standard website to determine the classification of the registered website.
  • the reason why the website logo is used is that the logo adopts the method of serial code, not complicated natural language, which can improve the efficiency of classification.
  • the server when classifying registered websites, can set a counter corresponding to each type. When there are registered websites that are classified into this type, the number of the counters is incremented, and after processing the registered website list of the same user is completed , the counter is cleared, so as to complete the statistical work of the number of registered websites.
  • S208 Calculate and obtain the user portrait according to the number of registered websites in each category.
  • the number of registrations of some websites is much larger than that of other websites, so it is meaningless to compare the number of registrations between websites.
  • the larger the number of registrations the more obvious the performance in this dimension.
  • the data/modeler can introduce the number of registrations of different types of websites as a feature, or use it directly as the threshold of the rule. The specific usage method depends on the scenario. For details, please refer to the following. .
  • the above-mentioned list of registered websites and user portraits can also be stored in a node of a blockchain.
  • the above-mentioned method, device, equipment and medium for generating user portraits based on website registration fully consider the user's website registration situation, and quantify and structure each person's website registration situation, so that the user portrait can be obtained according to the user's website registration situation, Improves the accuracy of user portraits.
  • FIG. 4 is a schematic flowchart of step S208 in the embodiment shown in FIG. 2 .
  • step S208 is based on the number of registered websites in each category. Calculate the user portrait, including:
  • S402 Acquire multiple preset scenes, multiple tags corresponding to each scene, and thresholds corresponding to the multiple tags.
  • the corresponding relationship between scenarios and user portraits is established based on business experience in various industries and scenarios.
  • only four types of application scenarios are set: customer value, product demand, rights and interests, and channels, but the corresponding sub-scenarios are expanded based on 40 website categories.
  • the application scenarios can be set to more Multiple, wherein the category tags of each scene may include multiple categories, for example, each type of website corresponds to one, or multiple related types of websites correspond to one category tag.
  • S406 From the counted number of registered websites in each category, select the current registered number corresponding to the currently registered website type.
  • the method for generating the user portrait may include: if a user's registered website includes 3 luxury websites and 2 international student websites, and based on the accumulated historical data statistics, the average number of registered luxury websites is 1.5 The average number of registrations for international student websites is 0.3.
  • the specific threshold of each type of website can also be set according to business experience. If the customer's net worth level is required to be higher, the threshold can be appropriately increased before making a judgment), and The person registered well above average for both types of sites, so he added a "Potential High Net Worth Client" label to the user.
  • multiple tags can be added for the user, and the combination of the multiple tags is the user portrait.
  • the corresponding relationship between the scene and the user portrait is established, so that the scene can be used against the user's registration website, and then the user label can be obtained through the user's registration website, so that the user portrait can be obtained.
  • FIG. 5 is a flowchart of another embodiment of step S208 in the embodiment shown in FIG. 2.
  • step S208 that is, according to each classification
  • the number of registered websites in the user profile is calculated, including:
  • S504 From the counted number of registered websites in each category, select the current registered number corresponding to the currently registered website type.
  • S506 Perform model training according to the current number of registrations to obtain a user portrait model, and obtain a user portrait according to the user portrait model.
  • the current scenario can be set according to the needs of the model, such as a marketing scenario or a risk control scenario, etc.
  • Each scenario corresponds to the corresponding registered website type, and the number of registrations corresponding to the registered website type corresponding to this scenario is obtained.
  • the user portrait model can be obtained by training the model according to the obtained data, for example, adding the obtained number of registered website types to the training data for model training, that is, including other types of features, and adding website Type this feature, which makes the model more complete.
  • the user portrait is processed according to the model obtained by training.
  • performing model training according to the current number of registrations to obtain a user portrait model includes: generating a first feature vector of a first preset dimension according to the current number of registrations; acquiring a second preset dimension generated according to basic user information second feature vector; generating a user portrait model according to the first feature vector and the second feature vector; obtaining the user portrait according to the user portrait model, including: obtaining a user portrait representing the probability of product demand according to the user portrait model; the above method further includes: according to the product The demand probability sorts the users, and pushes the corresponding products to the users according to the sorting.
  • the number of registered website types is introduced into the model as a feature for prediction/recommendation in different scenarios. For example, if there is an existing demand, it is necessary to find out which people in a batch of customer samples have video membership requirements, that is, to predict who has a higher probability of getting a response by pushing video membership rights, then the "number of registered video websites" can be used as Features are introduced, and model training is performed in combination with other dimensional data. For example, a decision tree model is used to predict the probability of each person's response to the push of rights and interests, and then they are sorted by probability. The business side can choose the top X% of customers for key marketing.
  • performing model training according to the current number of registrations to obtain a user portrait model includes: generating a user portrait model based on a scorecard model according to the current number of registrations; obtaining a user portrait according to the user portrait model, including: comparing the current number of registrations with The number of websites in each segment in the scorecard model is compared to determine the user risk score; the corresponding user portrait is obtained according to the user risk score.
  • the above-mentioned embodiment is a marketing scenario, and the marketing scenario uses more decision tree models, while the present embodiment is a risk control scenario, involving more scorecard models based on logistic regression.
  • the larger the total score of the credit score the better the credit of the customer, so the user portrait can be obtained according to this setting.
  • steps in the flowcharts of FIGS. 2 , 4 and 5 are sequentially displayed in accordance with the arrows, these steps are not necessarily executed in the order indicated by the arrows. Unless explicitly stated herein, the execution of these steps is not strictly limited to the order, and these steps may be performed in other orders. Moreover, at least a part of the steps in FIG. 2 , FIG. 4 and FIG. 5 may include multiple sub-steps or multiple stages, and these sub-steps or stages are not necessarily executed at the same time, but may be executed at different times. The order of execution of the sub-steps or phases is also not necessarily sequential, but may be performed alternately or alternately with other steps or at least a portion of the sub-steps or phases of the other steps.
  • a device for generating user portraits based on website registration including: a website list acquisition module 100, a classification module 200, a statistics module 300 and a portrait generation module 400, wherein:
  • the website list obtaining module 100 is used to obtain a list of registered websites corresponding to the user.
  • the registered website list is to crawl the corresponding registration records including the registered user ID and the registered logo from the server of the preset website in advance, and to characterize the registered logo according to the user ID.
  • the registered registration records are classified;
  • the classification module 200 is used to compare the registered website in the website registration list with the identifier of the standard website of the preset classification, so as to classify the registered website;
  • a statistics module 300 for counting the number of registered websites in each category.
  • the portrait generation module 400 is configured to calculate and obtain the user portrait according to the number of registered websites in each category.
  • the above-mentioned portrait generation module 400 includes:
  • a first scene acquisition unit configured to acquire multiple preset scenes, multiple tags corresponding to each scene, and thresholds corresponding to the multiple tags
  • the first currently registered website type acquiring unit is used to acquire the current registered website type corresponding to each of the multiple scenarios
  • the quantity selection unit is used to select the current registration quantity corresponding to the current registration website type from the number of registered websites in each category of the statistics;
  • a comparison unit for comparing the current registration number with a threshold to obtain a label
  • the first portrait generation unit is used for combining the obtained tags to obtain the user portrait.
  • the above-mentioned portrait generation module 400 includes:
  • the second scene obtaining unit is used to obtain the current scene and the currently registered website type corresponding to the current scene;
  • the second current registered website type acquisition unit is used to select the current registered number corresponding to the current registered website type from the number of registered websites in each category of the statistics;
  • the model generation unit is used to perform model training according to the current number of registrations to obtain a user portrait model, and obtain a user portrait according to the user portrait model.
  • the above-mentioned model generation unit may include:
  • the first feature vector generating subunit is used to generate the first feature vector of the first preset dimension according to the current registration quantity
  • the second feature vector generating subunit is used to obtain the second feature vector of the second preset dimension generated according to the basic information of the user;
  • the first model generation subunit is used for generating a user portrait model according to the first feature vector and the second feature vector;
  • the above-mentioned model generation unit is further configured to obtain a user portrait representing the probability of product demand according to the user portrait model;
  • the above-mentioned device for generating user portraits based on website registration may also include:
  • the push module is used to sort users according to the probability of product demand, and push corresponding products to users according to the sorting.
  • the above-mentioned model generation unit may include:
  • the second model generation subunit is used to generate a user portrait model based on the scorecard model according to the current registration number
  • a score calculation subunit for comparing the current number of registrations with the number of sites for each segment in the scorecard model to determine a user risk score
  • the portrait generation sub-unit is used to obtain the corresponding user portrait according to the user risk score.
  • Each module in the above-mentioned device for generating user portrait based on website registration can be implemented in whole or in part by software, hardware and combinations thereof.
  • the above modules can be embedded in or independent of the processor in the computer device in the form of hardware, or stored in the memory in the computer device in the form of software, so that the processor can call and execute the operations corresponding to the above modules.
  • a computer device in one embodiment, is provided, and the computer device can be a server, and its internal structure diagram can be as shown in FIG. 7 .
  • the computer device includes a processor, memory, a network interface, and a database connected by a system bus. Among them, the processor of the computer device is used to provide computing and control capabilities.
  • the memory of the computer device includes non-volatile storage media, internal memory.
  • the non-volatile storage medium stores an operating system, computer readable instructions and a database.
  • the internal memory provides an environment for the execution of the operating system and computer-readable instructions in the non-volatile storage medium.
  • the computer device's database is used to store a list of registered websites.
  • the network interface of the computer device is used to communicate with an external terminal through a network connection.
  • the computer-readable instructions when executed by the processor, implement a method for generating user portraits based on website registration.
  • FIG. 7 is only a block diagram of a partial structure related to the solution of the present application, and does not constitute a limitation on the computer equipment to which the solution of the present application is applied. Include more or fewer components than shown in the figures, or combine certain components, or have a different arrangement of components.
  • a computer device comprising a memory and one or more processors, the memory stores computer-readable instructions, and when the computer-readable instructions are executed by the processor, the one or more processors perform the following steps: obtaining user corresponding The registered website list, the registered website list is obtained by crawling the corresponding registration records including the registered user ID and the registration mark from the server of the preset website in advance, and classifying the registration records representing the registration by the registration mark according to the user ID; The registered websites in the website registration list are compared with the identifiers of the standard websites in the preset classification, so as to classify the registered websites; and count the number of registered websites in each classification; calculate the user according to the number of registered websites in each classification. portrait.
  • the user profile is calculated and obtained according to the number of registered websites in each category, including: acquiring a plurality of preset scenes, a plurality of tags corresponding to each scene, and Thresholds corresponding to multiple labels; obtain the currently registered website types corresponding to multiple scenarios; select the current registered website type corresponding to the current registered website type from the counted number of registered websites in each category; comparing with a threshold to obtain a label; and combining the obtained labels to obtain a user portrait.
  • the user profile is calculated and obtained according to the number of registered websites in each category, including: obtaining the current scene and the current registered website type corresponding to the current scene; From the number of registered websites in each category of the statistics, select the current registration number corresponding to the current registered website type; and perform model training according to the current registration number to obtain a user portrait model, and obtain a user portrait according to the user portrait model.
  • performing model training according to the current number of registrations to obtain a user portrait model includes: generating a first feature vector of a first preset dimension according to the current number of registrations; The second feature vector of the second preset dimension generated by the basic user information; the user portrait model is generated according to the first feature vector and the second feature vector; when the processor executes the computer-readable instruction, the user portrait is obtained according to the user portrait model, The method includes: obtaining user portraits representing product demand probability according to the user portrait model; and when the processor executes the computer-readable instructions, the processor further implements the following steps: sorting users according to the product demand probability, and pushing corresponding products to the users according to the sorting.
  • performing model training according to the current number of registrations to obtain a user portrait model includes: generating a user portrait model based on the scorecard model according to the current number of registrations; and the processor executing Obtaining the user portrait according to the user portrait model realized by the computer readable instruction includes: comparing the current registration number with the number of websites in each segment in the scorecard model to determine the user risk score; obtaining the corresponding user portrait according to the user risk score. .
  • One or more computer-readable storage media storing computer-readable instructions, when the computer-readable instructions are executed by one or more processors, cause one or more processors to perform the following steps: acquiring a list of registered websites corresponding to the user, The registered website list is obtained by crawling the corresponding registration records including the registered user ID and the registration mark from the server of the preset website in advance, and classifying the registration records representing the registration by the registration mark according to the user ID; Compare the registered website of the registered website with the identification of the standard website of the preset classification, so as to classify the registered website; count the number of registered websites in each classification; and calculate the user portrait according to the number of registered websites in each classification.
  • the computer-readable storage medium may be non-volatile or volatile.
  • the user profile is calculated and obtained according to the number of registered websites in each category, including: acquiring multiple preset scenes and multiple tags corresponding to each scene and the thresholds corresponding to multiple labels; obtain the current registered website types corresponding to multiple scenarios; select the current registered website type corresponding to the current registered website type from the counted number of registered websites in each category; The number is compared with the threshold to obtain a label; and the obtained labels are combined to obtain a user portrait.
  • the user profile is calculated and obtained according to the number of registered websites in each category, including: obtaining the current scene and the current registered website type corresponding to the current scene; From the counted number of registered websites in each category, select the current registered number corresponding to the current registered website type; and perform model training according to the current registered number to obtain a user portrait model, and obtain a user portrait according to the user portrait model.
  • performing model training according to the current number of registrations to obtain a user portrait model includes: generating a first feature vector of a first preset dimension according to the current number of registrations; obtaining The second feature vector of the second preset dimension generated according to the basic information of the user; the user portrait model is generated according to the first feature vector and the second feature vector; when the computer readable instruction is executed by the processor, the user is obtained according to the user portrait model.
  • the portrait includes: obtaining a user portrait representing the product demand probability according to the user portrait model; and when the processor executes the computer-readable instruction, the processor further implements the following steps: sorting the users according to the product demand probability, and pushing corresponding products to the users according to the sorting.
  • performing model training according to the current number of registrations to obtain a user portrait model includes: generating a user portrait model based on the scorecard model according to the current number of registrations; and the computer can Obtaining the user portrait according to the user portrait model realized when the read instruction is executed by the processor includes: comparing the current registration number with the number of websites in each segment in the scorecard model to determine the user risk score; obtaining the corresponding user risk score according to the user risk score.
  • the blockchain referred to in the present invention is a new application mode of computer technologies such as distributed data storage, point-to-point transmission, consensus mechanism, and encryption algorithm.
  • Blockchain essentially a decentralized database, is a series of data blocks associated with cryptographic methods. Each data block contains a batch of network transaction information to verify its Validity of information (anti-counterfeiting) and generation of the next block.
  • the blockchain can include the underlying platform of the blockchain, the platform product service layer, and the application service layer.
  • Nonvolatile memory may include read only memory (ROM), programmable ROM (PROM), electrically programmable ROM (EPROM), electrically erasable programmable ROM (EEPROM), or flash memory.
  • Volatile memory may include random access memory (RAM) or external cache memory.
  • RAM is available in various forms such as static RAM (SRAM), dynamic RAM (DRAM), synchronous DRAM (SDRAM), double data rate SDRAM (DDRSDRAM), enhanced SDRAM (ESDRAM), synchronous chain Road (Synchlink) DRAM (SLDRAM), memory bus (Rambus) direct RAM (RDRAM), direct memory bus dynamic RAM (DRDRAM), and memory bus dynamic RAM (RDRAM), etc.

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The present application relates to the technical field of big data, and in particular to a website registration-based user portrait generating method, comprising: acquiring a registered website list corresponding to a user, wherein the registered website list is obtained by crawling in advance, from a server of a preset website, a corresponding registration record that comprises a registered user identifier and a registration mark, and classifying the registration record which is characterized and registered with the registration mark according to the user identifier; comparing registered websites in the registered website list with an identifier of a preset classification standard website so as to classify the registered websites; counting the number of registered websites in each classification; and carrying out calculations to obtain a user portrait according to the number of registered websites in each classification.

Description

基于网站注册的用户画像生成方法、装置、设备和介质User portrait generation method, device, device and medium based on website registration
相关申请的交叉引用CROSS-REFERENCE TO RELATED APPLICATIONS
本申请要求于2020年12月15日提交中国专利局,申请号为202011473435X,申请名称为“基于网站注册的用户画像生成方法、装置、设备和介质”的中国专利申请的优先权,其全部内容通过引用结合在本申请中。This application claims the priority of the Chinese patent application filed on December 15, 2020 with the application number 202011473435X and the application name is "method, device, equipment and medium for generating user portraits based on website registration", the entire contents of which are Incorporated herein by reference.
技术领域technical field
本申请涉及一种基于网站注册的用户画像生成方法、装置、设备和介质。The present application relates to a method, device, device and medium for generating user portraits based on website registration.
背景技术Background technique
随着大数据技术的发展,出现了各种各样的场景,其中对于用户进行画像构建是一个比较重要的长江,对用户进行画像构建是通过对用户进行打标签、划分客群、构建画像,有助于加深企业对用户的理解,从而提供有针对性的服务和营销,减轻企业的营销成本,并提升实际业务上的质量和效率。With the development of big data technology, various scenarios have appeared. Among them, the construction of user portraits is a relatively important Yangtze River. The construction of user portraits is done by tagging users, dividing customer groups, and building portraits. It helps to deepen the enterprise's understanding of users, so as to provide targeted services and marketing, reduce the marketing cost of the enterprise, and improve the quality and efficiency of the actual business.
然而,发明人意识到,目前的用户画像需要抽取用户的属性标签(如学历、性别等),传统的用户画像方法根据某一平台上的用户的社交、使用习惯数据抽取用户的属性标签,容易因数据单一、数据缺陷导致抽取用户属性标签的准确率低。如何提升抽取用户属性标签的准确率成为亟待解决的问题。However, the inventor realized that the current user portrait needs to extract the user's attribute label (such as education, gender, etc.), and the traditional user portrait method extracts the user's attribute label according to the user's social and usage data on a certain platform, which is easy to Due to the single data and data defects, the accuracy of extracting user attribute labels is low. How to improve the accuracy of extracting user attribute labels has become an urgent problem to be solved.
发明内容SUMMARY OF THE INVENTION
根据本申请公开的各种实施例,提供一种基于网站注册的用户画像生成方法、装置、设备和介质。According to various embodiments disclosed in the present application, a method, apparatus, device and medium for generating user portraits based on website registration are provided.
一种基于网站注册的用户画像生成方法,包括:A method for generating user portraits based on website registration, comprising:
获取用户对应的注册网站列表,所述注册网站列表是预先从预设网站的服务器爬取对应的包括注册用户标识以及注册标志的注册记录,并根据所述用户标识对注册标志表征注册的注册记录进行分类得到的;Obtain a list of registered websites corresponding to the user, and the registered website list is to crawl the corresponding registration records including the registered user ID and the registration mark from the server of the preset website in advance, and according to the user ID, the registration mark represents the registered registration record. classified;
将所述网站注册列表中的注册网站与预设分类的标准网站的标识进行比较,以对所述注册网站进行分类;Comparing the registered website in the website registration list with the identification of the standard website of the preset classification, to classify the registered website;
统计每一分类中注册网站的数量;及Count the number of registered websites in each category; and
根据每一分类中注册网站的数量计算得到用户画像。User portraits are calculated based on the number of registered websites in each category.
一种基于网站注册的用户画像生成装置,包括:A device for generating user portraits based on website registration, comprising:
网站列表获取模块,用于获取用户对应的注册网站列表,所述注册网站列表是预先从预设网站的服务器爬取对应的包括注册用户标识以及注册标志的注册记录,并根据所述用户标识对注册标志表征注册的注册记录进行分类得到的;The website list acquisition module is used to obtain the list of registered websites corresponding to the user, and the registered website list is to crawl the corresponding registration records including the registered user ID and the registration mark from the server of the preset website in advance, and according to the user ID. The registration mark is obtained by classifying the registered registration records;
分类模块,用于将所述网站注册列表中的注册网站与预设分类的标准网站的标识进行 比较,以对所述注册网站进行分类;A classification module, for comparing the registered website in the website registration list with the identification of the standard website of the preset classification, to classify the registered website;
统计模块,用于统计每一分类中注册网站的数量;及Statistics module for counting the number of registered websites in each category; and
画像生成模块,用于根据每一分类中注册网站的数量计算得到用户画像。The portrait generation module is used to calculate the user portrait according to the number of registered websites in each category.
一种计算机设备,包括存储器和一个或多个处理器,所述存储器中储存有计算机可读指令,所述计算机可读指令被所述处理器执行时,使得所述一个或多个处理器执行以下步骤:A computer device comprising a memory and one or more processors, the memory having computer-readable instructions stored therein, the computer-readable instructions, when executed by the processor, cause the one or more processors to execute The following steps:
获取用户对应的注册网站列表,所述注册网站列表是预先从预设网站的服务器爬取对应的包括注册用户标识以及注册标志的注册记录,并根据所述用户标识对注册标志表征注册的注册记录进行分类得到的;Obtain a list of registered websites corresponding to the user, and the registered website list is to crawl the corresponding registration records including the registered user ID and the registration mark from the server of the preset website in advance, and according to the user ID, the registration mark represents the registered registration record. classified;
将所述网站注册列表中的注册网站与预设分类的标准网站的标识进行比较,以对所述注册网站进行分类;Comparing the registered website in the website registration list with the identification of the standard website of the preset classification, to classify the registered website;
统计每一分类中注册网站的数量;及Count the number of registered websites in each category; and
根据每一分类中注册网站的数量计算得到用户画像。User portraits are calculated based on the number of registered websites in each category.
一个或多个存储有计算机可读指令的计算机可读存储介质,计算机可读指令被一个或多个处理器执行时,使得一个或多个处理器执行以下步骤:One or more computer-readable storage media storing computer-readable instructions that, when executed by one or more processors, cause the one or more processors to perform the following steps:
获取用户对应的注册网站列表,所述注册网站列表是预先从预设网站的服务器爬取对应的包括注册用户标识以及注册标志的注册记录,并根据所述用户标识对注册标志表征注册的注册记录进行分类得到的;Obtain a list of registered websites corresponding to the user, and the registered website list is to crawl the corresponding registration records including the registered user ID and the registration mark from the server of the preset website in advance, and according to the user ID, the registration mark represents the registered registration record. classified;
将所述网站注册列表中的注册网站与预设分类的标准网站的标识进行比较,以对所述注册网站进行分类;Comparing the registered website in the website registration list with the identification of the standard website of the preset classification, to classify the registered website;
统计每一分类中注册网站的数量;及Count the number of registered websites in each category; and
根据每一分类中注册网站的数量计算得到用户画像。User portraits are calculated based on the number of registered websites in each category.
上述基于网站注册的用户画像生成方法、装置、设备和介质,充分考虑到用户的网站注册情况,将每个人的网站注册情况进行量化和结构化,从而可以根据用户的网站注册情况得到用户画像,提高了用户画像的准确性。The above-mentioned method, device, equipment and medium for generating user portraits based on website registration fully consider the user's website registration situation, and quantify and structure each person's website registration situation, so that user portraits can be obtained according to the user's website registration situation, Improves the accuracy of user portraits.
本申请的一个或多个实施例的细节在下面的附图和描述中提出。本申请的其它特征和优点将从说明书、附图以及权利要求书变得明显。The details of one or more embodiments of the application are set forth in the accompanying drawings and the description below. Other features and advantages of the present application will be apparent from the description, drawings, and claims.
附图说明Description of drawings
为了更清楚地说明本申请实施例中的技术方案,下面将对实施例中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图仅仅是本申请的一些实施例,对于本领域普通技术人员来讲,在不付出创造性劳动的前提下,还可以根据这些附图获得其它的附图。In order to illustrate the technical solutions in the embodiments of the present application more clearly, the following briefly introduces the drawings required in the embodiments. Obviously, the drawings in the following description are only some embodiments of the present application. For those of ordinary skill in the art, other drawings can also be obtained from these drawings without any creative effort.
图1为根据一个或多个实施例中基于网站注册的用户画像生成方法的应用场景图。FIG. 1 is an application scenario diagram of a method for generating user portraits based on website registration according to one or more embodiments.
图2为根据一个或多个实施例中基于网站注册的用户画像生成方法的流程示意图。FIG. 2 is a schematic flowchart of a method for generating user portraits based on website registration according to one or more embodiments.
图3为根据一个或多个实施例中的注册网站的分类示意图。FIG. 3 is a schematic diagram of classification of registration websites according to one or more embodiments.
图4为根据图2所示实施例中的步骤S208的一个或多个实施例的流程示意图。FIG. 4 is a schematic flowchart according to one or more embodiments of step S208 in the embodiment shown in FIG. 2 .
图5为根据图2所示实施例中的步骤S208的另一个或多个实施例的流程图。FIG. 5 is a flowchart according to another or more embodiments of step S208 in the embodiment shown in FIG. 2 .
图6为根据一个或多个实施例中基于网站注册的用户画像生成装置的框图。FIG. 6 is a block diagram of an apparatus for generating user portraits based on website registration according to one or more embodiments.
图7为根据一个或多个实施例中计算机设备的框图。7 is a block diagram of a computer device in accordance with one or more embodiments.
具体实施方式Detailed ways
为了使本申请的目的、技术方案及优点更加清楚明白,以下结合附图及实施例,对本申请进行进一步详细说明。应当理解,此处描述的具体实施例仅仅用以解释本申请,并不用于限定本申请。In order to make the purpose, technical solutions and advantages of the present application more clearly understood, the present application will be described in further detail below with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described herein are only used to explain the present application, but not to limit the present application.
本申请提供的基于网站注册的用户画像生成方法,可以应用于如图1所示的应用环境中。终端102通过网络与服务器104通过网络进行通信。服务器104可以从终端102获取到用户对应的注册网站列表,例如遍历终端102中安装的应用程序以获取到对应的注册网站列表,可选地,服务器104还可以基于终端102中安装的应用程序的使用时间以及安装时间来获取到对应的注册网站列表,这样服务器104可以将所述网站注册列表中的注册网站与预设分类的标准网站的标识进行比较,以对注册网站进行分类进行,并统计每一分类中注册网站的数量,从而根据每一分类中注册网站的数量计算得到用户画像。这样充分考虑到用户的网站注册情况,将每个人的网站注册情况进行量化和结构化,从而可以根据用户的网站注册情况得到用户画像,提高了用户画像的准确性。终端102可以但不限于是各种个人计算机、笔记本电脑、智能手机、平板电脑和便携式可穿戴设备,服务器104可以用独立的服务器或者是多个服务器组成的服务器集群来实现。The method for generating user portraits based on website registration provided by this application can be applied to the application environment shown in FIG. 1 . The terminal 102 communicates with the server 104 through the network. The server 104 can obtain the list of registered websites corresponding to the user from the terminal 102, for example, by traversing the applications installed in the terminal 102 to obtain the list of corresponding registered websites. Use time and installation time to obtain the corresponding registered website list, so that the server 104 can compare the registered website in the website registration list with the identification of the standard website of the preset classification, to classify the registered website, and count The number of registered websites in each category, so as to calculate the user portrait according to the number of registered websites in each category. In this way, the user's website registration situation is fully considered, and each person's website registration situation is quantified and structured, so that the user portrait can be obtained according to the user's website registration situation, which improves the accuracy of the user portrait. The terminal 102 can be, but is not limited to, various personal computers, notebook computers, smart phones, tablet computers and portable wearable devices, and the server 104 can be implemented by an independent server or a server cluster composed of multiple servers.
在其中一个实施例中,如图2所示,提供了一种基于网站注册的用户画像生成方法,以该方法应用于图1中的服务器为例进行说明,包括以下步骤:In one of the embodiments, as shown in FIG. 2 , a method for generating user portraits based on website registration is provided, and the method is applied to the server in FIG. 1 as an example to illustrate, including the following steps:
S202:获取用户对应的注册网站列表,注册网站列表是预先从预设网站的服务器爬取对应的包括注册用户标识以及注册标志的注册记录,并根据用户标识对注册标志表征注册的注册记录进行分类得到的。S202: Obtain a list of registered websites corresponding to the user. The list of registered websites is to crawl the corresponding registration records including the registered user ID and the registration logo from the server of the preset website in advance, and classify the registered registration records representing the registration according to the user ID. owned.
具体地,注册网站列表可以是根据预先设置的网站来进行获取的,例如从各个网站的服务器爬取对应的用户是否注册的信息,其中为了保证用户的隐私的安全性,本实施例中仅是获取都用户是否注册的信息,至于注册的具体信息则不会进行爬取,优选地,可以通过一标志位的方式来设置用户是否注册的信息,若是标志位为0,则用户没有注册,否则用户已经注册,并将用户已经注册的信息存储到对应的用户的注册网站列表。例如注册网站列表是预先从预设网站的服务器爬取对应的包括注册用户标识以及注册标志的注册记录,并根据用户标识对注册标志表征注册的注册记录进行分类得到的。Specifically, the list of registered websites may be obtained according to a preset website, for example, crawling information on whether the corresponding user is registered from the server of each website. In order to ensure the security of the user's privacy, in this embodiment only Obtain the information about whether the user is registered. As for the specific information of registration, it will not be crawled. Preferably, the information about whether the user is registered can be set by means of a flag. If the flag is 0, the user is not registered. Otherwise, The user has been registered, and the registered information of the user is stored in the corresponding user's registered website list. For example, the registered website list is obtained by crawling the corresponding registration records including the registered user ID and the registration mark from the server of the preset website in advance, and classifying the registration records representing the registration by the registration mark according to the user ID.
具体地,该获取用户对应的注册网站列表可以是与用户终端相连接,获取用户终端中已经安装的应用程序或者是与各个网站服务器相连接,读取各个网站服务器中的已经注册的用户,并生成注册网站列表。Specifically, the acquiring the list of registered websites corresponding to the user may be connected to the user terminal, acquiring the application programs that have been installed in the user terminal, or being connected to each website server, reading the registered users in each website server, and Generate a list of registered sites.
S204:将所述网站注册列表中的注册网站与预设分类的标准网站的标识进行比较,以 对注册网站进行分类进行。S204: Compare the registered website in the website registration list with the identification of the standard website of the preset classification, so as to classify the registered website.
具体地,结合图3,图3为一个实施例中的注册网站的分类示意图。其中服务器可以预设注册网站的类型,例如根据用户画像的维度,如财富、风险、兴趣、行业四个维度,每个维度包括若干个不同种类的网站集合,每个集合下又覆盖了若干个注册网站。其中网站的类别可以是预先进行设置,例如包括:保险从业者类网站、保险类网站、车友会网站、程序员类网站、电影类网站、儿童早教类网站、二次元类网站、法律类网站、高端酒店类网站、公考类网站、航空公司类网站、会计类网站、婚恋类网站、加油充电类网站、建筑类网站、健身类网站、教师类网站、金融投资服务/资讯类网站、境外游/品质类旅游网站、理财类网站、留学生类网站、论文期刊类网站、旅游类网站、美食类网站、美妆护肤类网站、萌宠类网站、母婴类网站、品质生活类网站、汽车保养类网站、汽车类网站、汽车综合门户类网站、奢侈品类网站、摄影类网站、网贷类网站、养生类网站、医生类网站、游戏类网站、中小学教育类网站、综合学历类网站、视频类网站。Specifically, with reference to FIG. 3 , FIG. 3 is a schematic diagram of classification of registration websites in one embodiment. The server can preset the type of registered website, for example, according to the dimensions of the user portrait, such as four dimensions of wealth, risk, interest, and industry, each dimension includes several different types of website collections, and each collection covers several Register the website. The categories of the websites can be preset, for example, including: insurance practitioner websites, insurance websites, car club websites, programmer websites, movie websites, early childhood education websites, two-dimensional websites, legal websites , high-end hotel websites, public exam websites, airline websites, accounting websites, marriage and love websites, refueling and charging websites, construction websites, fitness websites, teacher websites, financial investment service/information websites, overseas Travel/Quality Tourism Websites, Financial Management Websites, International Students Websites, Papers and Periodical Websites, Travel Websites, Food Websites, Beauty and Skin Care Websites, Cute Pet Websites, Maternal and Baby Websites, Quality Life Websites, Cars Maintenance websites, automobile websites, comprehensive automobile portal websites, luxury websites, photography websites, online loan websites, health websites, doctor websites, game websites, primary and secondary education websites, comprehensive education websites, video site.
服务器根据以上类别对注册网站进行分类。其中分类的时候,服务器可以将注册网站的标识与预设分类的标准网站的标识进行比较,以确定注册网站的分类,之所以采用网站标识是由于标识采用的是序列码的方式,而非复杂的自然语言,这样可以提高分类的效率。The server classifies registration sites according to the above categories. When classifying, the server can compare the logo of the registered website with the logo of the preset standard website to determine the classification of the registered website. The reason why the website logo is used is that the logo adopts the method of serial code, not complicated natural language, which can improve the efficiency of classification.
S206:统计每一分类中注册网站的数量。S206: Count the number of registered websites in each category.
具体地,服务器在对注册网站进行分类的时候,可以设置对应每一类型的计数器,当存在注册网站被分到该类时,则计数器的数量递增,且在同一用户的注册网站列表处理完成后,则对计数器进行清空处理,从而完成注册网站的数量的统计工作。Specifically, when classifying registered websites, the server can set a counter corresponding to each type. When there are registered websites that are classified into this type, the number of the counters is incremented, and after processing the registered website list of the same user is completed , the counter is cleared, so as to complete the statistical work of the number of registered websites.
S208:根据每一分类中注册网站的数量计算得到用户画像。S208: Calculate and obtain the user portrait according to the number of registered websites in each category.
具体地,由于不同规模网站的宣传力度不同,某些网站的注册人数基数要远大于其他网站,所以在网站之间进行注册数的横向对比没有太多意义,应对全部样本在相同类型网站的注册数进行对比,注册数目越大,说明在该维度的表现越为明显。在不同场景、不同模型中,数据/建模人员可以将不同类型网站的注册个数作为特征引入,也可以直接作为规则的阈值进行使用,具体的使用方法可视场景而定,具体可以参见下文。Specifically, due to the different propaganda efforts of websites of different scales, the number of registrations of some websites is much larger than that of other websites, so it is meaningless to compare the number of registrations between websites. The larger the number of registrations, the more obvious the performance in this dimension. In different scenarios and different models, the data/modeler can introduce the number of registrations of different types of websites as a feature, or use it directly as the threshold of the rule. The specific usage method depends on the scenario. For details, please refer to the following. .
需要强调的是,为进一步保证上述注册网站列表和用户画像的私密和安全性,上述注册网站列表和用户画像还可以存储于一区块链的节点中。It should be emphasized that, in order to further ensure the privacy and security of the above-mentioned list of registered websites and user portraits, the above-mentioned list of registered websites and user portraits can also be stored in a node of a blockchain.
上述基于网站注册的用户画像生成方法、装置、设备和介质,充分考虑到用户的网站注册情况,将每个人的网站注册情况进行量化和结构化,从而可以根据用户的网站注册情况得到用户画像,提高了用户画像的准确性。The above-mentioned method, device, equipment and medium for generating user portraits based on website registration fully consider the user's website registration situation, and quantify and structure each person's website registration situation, so that the user portrait can be obtained according to the user's website registration situation, Improves the accuracy of user portraits.
在其中一个实施例中,参见图4所示,图4为图2所示实施例中的步骤S208的流程示意图,在该实施例中,该步骤S208,即根据每一分类中注册网站的数量计算得到用户画像,包括:In one embodiment, referring to FIG. 4 , FIG. 4 is a schematic flowchart of step S208 in the embodiment shown in FIG. 2 . In this embodiment, step S208 is based on the number of registered websites in each category. Calculate the user portrait, including:
S402:获取预设的多个场景、各个场景对应的多个标签以及多个标签对应的阈值。S402: Acquire multiple preset scenes, multiple tags corresponding to each scene, and thresholds corresponding to the multiple tags.
S404:获取多个场景各自对应的当前注册网站类型。S404: Acquire currently registered website types corresponding to each of the multiple scenarios.
具体地,场景与用户画像的对应关系是基于各行业与场景的业务经验建立的。在本实施例中,应用场景仅设置客户价值、产品需求、权益、渠道四种,但其分别对应的细分场景是基于40个网站类别进行扩展,在其他实施例中,应用场景可以设置更多个,其中每个场景的类别标签可以包括多个,例如每一类型的网站对应一个,或者是多个相关类型的网站对应一个类别标签。Specifically, the corresponding relationship between scenarios and user portraits is established based on business experience in various industries and scenarios. In this embodiment, only four types of application scenarios are set: customer value, product demand, rights and interests, and channels, but the corresponding sub-scenarios are expanded based on 40 website categories. In other embodiments, the application scenarios can be set to more Multiple, wherein the category tags of each scene may include multiple categories, for example, each type of website corresponds to one, or multiple related types of websites correspond to one category tag.
S406:从所统计的每一分类中的注册网站的数量中,选取与当前注册网站类型对应的当前注册数量。S406: From the counted number of registered websites in each category, select the current registered number corresponding to the currently registered website type.
S408:将当前注册数量与阈值进行比较得到标签。S408: Compare the current registration number with a threshold to obtain a label.
具体地,用户画像的生成方法可以包括:如果若是某个用户的注册网站包括3个奢侈品类网站、2个留学生类网站,而基于所积累的历史数据统计,奢侈品类网站的平均注册数为1.5个,留学生类网站的平均注册数为0.3个,(每类网站的具体阈值也可以根据业务经验来设定,若对客户的净值水平要求较高,可适当调高阈值再作判断),而此人这两类网站的注册数均远高于平均水平,于是为该用户添加一个“潜在高净值客户”的标签。Specifically, the method for generating the user portrait may include: if a user's registered website includes 3 luxury websites and 2 international student websites, and based on the accumulated historical data statistics, the average number of registered luxury websites is 1.5 The average number of registrations for international student websites is 0.3. (The specific threshold of each type of website can also be set according to business experience. If the customer's net worth level is required to be higher, the threshold can be appropriately increased before making a judgment), and The person registered well above average for both types of sites, so he added a "Potential High Net Worth Client" label to the user.
S410:将所得到的标签进行组合得到用户画像。S410: Combine the obtained tags to obtain a user portrait.
具体地,根据上述判断,可以为用户添加多个标签,而多个标签的组合即为用户画像。Specifically, according to the above judgment, multiple tags can be added for the user, and the combination of the multiple tags is the user portrait.
上述实施例中,建立场景与用户画像的对应关系,从而场景可以对用用户的注册网站,进而可以通过用户的注册网站来得到用户标签,从而可以得到用户画像。In the above embodiment, the corresponding relationship between the scene and the user portrait is established, so that the scene can be used against the user's registration website, and then the user label can be obtained through the user's registration website, so that the user portrait can be obtained.
在其中一个实施例中,参见图5所示,图5为图2所示实施例中的步骤S208的另一实施例的流程图,在该实施例中,该步骤S208,即根据每一分类中注册网站的数量计算得到用户画像,包括:In one of the embodiments, referring to FIG. 5, FIG. 5 is a flowchart of another embodiment of step S208 in the embodiment shown in FIG. 2. In this embodiment, step S208, that is, according to each classification The number of registered websites in the user profile is calculated, including:
S502:获取当前场景,以及当前场景对应的当前注册网站类型。S502: Acquire the current scene and the currently registered website type corresponding to the current scene.
S504:从所统计的每一分类中的注册网站的数量中,选取与当前注册网站类型对应的当前注册数量。S504: From the counted number of registered websites in each category, select the current registered number corresponding to the currently registered website type.
S506:根据当前注册数量进行模型训练得到用户画像模型,根据用户画像模型得到用户画像。S506: Perform model training according to the current number of registrations to obtain a user portrait model, and obtain a user portrait according to the user portrait model.
具体地,当前场景可以是根据模型需要所设置的,例如营销场景或者是风控场景等,每种场景则对应关注有对应的注册网站类型,获取该种场景对应的注册网站类型对应的注册数量,从而可以根据所获取的数据来进行模型的训练得到用户画像模型,例如将所获取的注册网站类型的数量添加到模型训练的训练数据中,即包括了其他的类型的特征,还增加了网站类型这一特征,从而使得模型更加完善。最后根据训练得到的模型来进行用户画像的处理。Specifically, the current scenario can be set according to the needs of the model, such as a marketing scenario or a risk control scenario, etc. Each scenario corresponds to the corresponding registered website type, and the number of registrations corresponding to the registered website type corresponding to this scenario is obtained. , so that the user portrait model can be obtained by training the model according to the obtained data, for example, adding the obtained number of registered website types to the training data for model training, that is, including other types of features, and adding website Type this feature, which makes the model more complete. Finally, the user portrait is processed according to the model obtained by training.
在其中一个实施例中,根据当前注册数量进行模型训练得到用户画像模型,包括:根据当前注册数量生成第一预设维度的第一特征向量;获取根据用户基本信息生成的第二预设维度的第二特征向量;根据第一特征向量和第二特征向量生成用户画像模型;根据用户画像模型得到用户画像,包括:根据用户画像模型得到表征产品需求概率的用户画像;上 述方法还包括:根据产品需求概率对用户进行排序,按照排序向用户推送对应的产品。In one embodiment, performing model training according to the current number of registrations to obtain a user portrait model includes: generating a first feature vector of a first preset dimension according to the current number of registrations; acquiring a second preset dimension generated according to basic user information second feature vector; generating a user portrait model according to the first feature vector and the second feature vector; obtaining the user portrait according to the user portrait model, including: obtaining a user portrait representing the probability of product demand according to the user portrait model; the above method further includes: according to the product The demand probability sorts the users, and pushes the corresponding products to the users according to the sorting.
具体地,在该实施例中,该注册网站类型的数量作为特征引入到模型中,用于不同场景的预测/推荐。例如现有一个需求,需要找出一批客户样本中哪些人有视频会员需求,即预测给哪些人推送视频会员权益获得响应的概率更大,那么就可以将“视频类网站注册个数”作为特征引入,结合其他维度数据进行模型训练。比如通过决策树模型来预测每个人对该权益推送的响应概率,然后按概率大小进行排序,业务方可选择前X%客群进行重点营销。Specifically, in this embodiment, the number of registered website types is introduced into the model as a feature for prediction/recommendation in different scenarios. For example, if there is an existing demand, it is necessary to find out which people in a batch of customer samples have video membership requirements, that is, to predict who has a higher probability of getting a response by pushing video membership rights, then the "number of registered video websites" can be used as Features are introduced, and model training is performed in combination with other dimensional data. For example, a decision tree model is used to predict the probability of each person's response to the push of rights and interests, and then they are sorted by probability. The business side can choose the top X% of customers for key marketing.
在其中一个实施例中,根据当前注册数量进行模型训练得到用户画像模型,包括:根据当前注册数量生成基于评分卡模型的用户画像模型;根据用户画像模型得到用户画像,包括:将当前注册数量与评分卡模型中各个分段的网站数量进行比较以确定用户风险评分;根据用户风险评分得到对应的用户画像。In one embodiment, performing model training according to the current number of registrations to obtain a user portrait model includes: generating a user portrait model based on a scorecard model according to the current number of registrations; obtaining a user portrait according to the user portrait model, including: comparing the current number of registrations with The number of websites in each segment in the scorecard model is compared to determine the user risk score; the corresponding user portrait is obtained according to the user risk score.
具体地,上述实施例中是营销场景,营销场景用决策树模型偏多,而本实施例中是风控场景,涉及基于逻辑回归的评分卡模型更多。在评分卡模型的结果中,每个特征不同的枚举值会对应不同的分数,比如“网贷类网站注册个数”这个字段X,若X=0,则该项得分为10分;若0<X<=3,则该项得分为7.5分;若3<x<=5,则该项得分为5分;若8<x<=10,则该项得分为2.5分;若x>10,则该项得分为0分。而信用分的总分越大,表示该客户的信用越好,因此可以根据该设置来获取用户画像。Specifically, the above-mentioned embodiment is a marketing scenario, and the marketing scenario uses more decision tree models, while the present embodiment is a risk control scenario, involving more scorecard models based on logistic regression. In the results of the scorecard model, different enumeration values for each feature will correspond to different scores, such as the field X of "number of online loan website registrations". If X=0, the score for this item is 10 points; if 0<X<=3, the score for this item is 7.5 points; if 3<x<=5, the score for this item is 5 points; if 8<x<=10, the score for this item is 2.5 points; if x> 10, the score for this item is 0. The larger the total score of the credit score, the better the credit of the customer, so the user portrait can be obtained according to this setting.
应该理解的是,虽然图2、图4和图5的流程图中的各个步骤按照箭头的指示依次显示,但是这些步骤并不是必然按照箭头指示的顺序依次执行。除非本文中有明确的说明,这些步骤的执行并没有严格的顺序限制,这些步骤可以以其它的顺序执行。而且,图2、图4和图5中的至少一部分步骤可以包括多个子步骤或者多个阶段,这些子步骤或者阶段并不必然是在同一时刻执行完成,而是可以在不同的时刻执行,这些子步骤或者阶段的执行顺序也不必然是依次进行,而是可以与其它步骤或者其它步骤的子步骤或者阶段的至少一部分轮流或者交替地执行。It should be understood that although the steps in the flowcharts of FIGS. 2 , 4 and 5 are sequentially displayed in accordance with the arrows, these steps are not necessarily executed in the order indicated by the arrows. Unless explicitly stated herein, the execution of these steps is not strictly limited to the order, and these steps may be performed in other orders. Moreover, at least a part of the steps in FIG. 2 , FIG. 4 and FIG. 5 may include multiple sub-steps or multiple stages, and these sub-steps or stages are not necessarily executed at the same time, but may be executed at different times. The order of execution of the sub-steps or phases is also not necessarily sequential, but may be performed alternately or alternately with other steps or at least a portion of the sub-steps or phases of the other steps.
在其中一个实施例中,如图6所示,提供了一种基于网站注册的用户画像生成装置,包括:网站列表获取模块100、分类模块200、统计模块300和画像生成模块400,其中:In one embodiment, as shown in FIG. 6 , a device for generating user portraits based on website registration is provided, including: a website list acquisition module 100, a classification module 200, a statistics module 300 and a portrait generation module 400, wherein:
网站列表获取模块100,用于获取用户对应的注册网站列表,注册网站列表是预先从预设网站的服务器爬取对应的包括注册用户标识以及注册标志的注册记录,并根据用户标识对注册标志表征注册的注册记录进行分类得到的;The website list obtaining module 100 is used to obtain a list of registered websites corresponding to the user. The registered website list is to crawl the corresponding registration records including the registered user ID and the registered logo from the server of the preset website in advance, and to characterize the registered logo according to the user ID. The registered registration records are classified;
分类模块200,用于将所述网站注册列表中的注册网站与预设分类的标准网站的标识进行比较,以对注册网站进行分类进行;The classification module 200 is used to compare the registered website in the website registration list with the identifier of the standard website of the preset classification, so as to classify the registered website;
统计模块300,用于统计每一分类中注册网站的数量;及A statistics module 300 for counting the number of registered websites in each category; and
画像生成模块400,用于根据每一分类中注册网站的数量计算得到用户画像。The portrait generation module 400 is configured to calculate and obtain the user portrait according to the number of registered websites in each category.
在其中一个实施例中,上述的画像生成模块400包括:In one embodiment, the above-mentioned portrait generation module 400 includes:
第一场景获取单元,用于获取预设的多个场景、各个场景对应的多个标签以及多个标签对应的阈值;a first scene acquisition unit, configured to acquire multiple preset scenes, multiple tags corresponding to each scene, and thresholds corresponding to the multiple tags;
第一当前注册网站类型获取单元,用于获取多个场景各自对应的当前注册网站类型;The first currently registered website type acquiring unit is used to acquire the current registered website type corresponding to each of the multiple scenarios;
数量选取单元,用于从所统计的每一分类中的注册网站的数量中,选取与当前注册网站类型对应的当前注册数量;The quantity selection unit is used to select the current registration quantity corresponding to the current registration website type from the number of registered websites in each category of the statistics;
比较单元,用于将当前注册数量与阈值进行比较得到标签;及a comparison unit for comparing the current registration number with a threshold to obtain a label; and
第一画像生成单元,用于将所得到的标签进行组合得到用户画像。The first portrait generation unit is used for combining the obtained tags to obtain the user portrait.
在其中一个实施例中,上述的画像生成模块400包括:In one embodiment, the above-mentioned portrait generation module 400 includes:
第二场景获取单元,用于获取当前场景,以及当前场景对应的当前注册网站类型;The second scene obtaining unit is used to obtain the current scene and the currently registered website type corresponding to the current scene;
第二当前注册网站类型获取单元,用于从所统计的每一分类中的注册网站的数量中,选取与当前注册网站类型对应的当前注册数量;及The second current registered website type acquisition unit is used to select the current registered number corresponding to the current registered website type from the number of registered websites in each category of the statistics; and
模型生成单元,用于根据当前注册数量进行模型训练得到用户画像模型,根据用户画像模型得到用户画像。The model generation unit is used to perform model training according to the current number of registrations to obtain a user portrait model, and obtain a user portrait according to the user portrait model.
在其中一个实施例中,上述的模型生成单元可以包括:In one embodiment, the above-mentioned model generation unit may include:
第一特征向量生成子单元,用于根据当前注册数量生成第一预设维度的第一特征向量;The first feature vector generating subunit is used to generate the first feature vector of the first preset dimension according to the current registration quantity;
第二特征向量生成子单元,用于获取根据用户基本信息生成的第二预设维度的第二特征向量;The second feature vector generating subunit is used to obtain the second feature vector of the second preset dimension generated according to the basic information of the user;
第一模型生成子单元,用于根据第一特征向量和第二特征向量生成用户画像模型;The first model generation subunit is used for generating a user portrait model according to the first feature vector and the second feature vector;
上述模型生成单元还用于根据用户画像模型得到表征产品需求概率的用户画像;及The above-mentioned model generation unit is further configured to obtain a user portrait representing the probability of product demand according to the user portrait model; and
上述的基于网站注册的用户画像生成装置还可以包括:The above-mentioned device for generating user portraits based on website registration may also include:
推送模块,用于根据产品需求概率对用户进行排序,按照排序向用户推送对应的产品。The push module is used to sort users according to the probability of product demand, and push corresponding products to users according to the sorting.
在其中一个实施例中,上述的模型生成单元可以包括:In one embodiment, the above-mentioned model generation unit may include:
第二模型生成子单元,用于根据当前注册数量生成基于评分卡模型的用户画像模型;The second model generation subunit is used to generate a user portrait model based on the scorecard model according to the current registration number;
评分计算子单元,用于将当前注册数量与评分卡模型中各个分段的网站数量进行比较以确定用户风险评分;及A score calculation subunit for comparing the current number of registrations with the number of sites for each segment in the scorecard model to determine a user risk score; and
画像生成子单元,用于根据用户风险评分得到对应的用户画像。The portrait generation sub-unit is used to obtain the corresponding user portrait according to the user risk score.
关于基于网站注册的用户画像生成装置的具体限定可以参见上文中对于基于网站注册的用户画像生成方法的限定,在此不再赘述。上述基于网站注册的用户画像生成装置中的各个模块可全部或部分通过软件、硬件及其组合来实现。上述各模块可以硬件形式内嵌于或独立于计算机设备中的处理器中,也可以以软件形式存储于计算机设备中的存储器中,以便于处理器调用执行以上各个模块对应的操作。For specific limitations on the device for generating user portraits based on website registration, please refer to the above limitations on the method for generating user portraits based on website registration, which will not be repeated here. Each module in the above-mentioned device for generating user portrait based on website registration can be implemented in whole or in part by software, hardware and combinations thereof. The above modules can be embedded in or independent of the processor in the computer device in the form of hardware, or stored in the memory in the computer device in the form of software, so that the processor can call and execute the operations corresponding to the above modules.
在一个实施例中,提供了一种计算机设备,该计算机设备可以是服务器,其内部结构图可以如图7所示。该计算机设备包括通过系统总线连接的处理器、存储器、网络接口和数据库。其中,该计算机设备的处理器用于提供计算和控制能力。该计算机设备的存储器 包括非易失性存储介质、内存储器。该非易失性存储介质存储有操作系统、计算机可读指令和数据库。该内存储器为非易失性存储介质中的操作系统和计算机可读指令的运行提供环境。该计算机设备的数据库用于存储注册网站列表。该计算机设备的网络接口用于与外部的终端通过网络连接通信。该计算机可读指令被处理器执行时以实现一种基于网站注册的用户画像生成方法。In one embodiment, a computer device is provided, and the computer device can be a server, and its internal structure diagram can be as shown in FIG. 7 . The computer device includes a processor, memory, a network interface, and a database connected by a system bus. Among them, the processor of the computer device is used to provide computing and control capabilities. The memory of the computer device includes non-volatile storage media, internal memory. The non-volatile storage medium stores an operating system, computer readable instructions and a database. The internal memory provides an environment for the execution of the operating system and computer-readable instructions in the non-volatile storage medium. The computer device's database is used to store a list of registered websites. The network interface of the computer device is used to communicate with an external terminal through a network connection. The computer-readable instructions, when executed by the processor, implement a method for generating user portraits based on website registration.
本领域技术人员可以理解,图7中示出的结构,仅仅是与本申请方案相关的部分结构的框图,并不构成对本申请方案所应用于其上的计算机设备的限定,具体的计算机设备可以包括比图中所示更多或更少的部件,或者组合某些部件,或者具有不同的部件布置。Those skilled in the art can understand that the structure shown in FIG. 7 is only a block diagram of a partial structure related to the solution of the present application, and does not constitute a limitation on the computer equipment to which the solution of the present application is applied. Include more or fewer components than shown in the figures, or combine certain components, or have a different arrangement of components.
一种计算机设备,包括存储器和一个或多个处理器,存储器中储存有计算机可读指令,计算机可读指令被处理器执行时,使得一个或多个处理器执行以下步骤::获取用户对应的注册网站列表,注册网站列表是预先从预设网站的服务器爬取对应的包括注册用户标识以及注册标志的注册记录,并根据用户标识对注册标志表征注册的注册记录进行分类得到的;将所述网站注册列表中的注册网站与预设分类的标准网站的标识进行比较,以对注册网站进行分类进行;及统计每一分类中注册网站的数量;根据每一分类中注册网站的数量计算得到用户画像。A computer device, comprising a memory and one or more processors, the memory stores computer-readable instructions, and when the computer-readable instructions are executed by the processor, the one or more processors perform the following steps: obtaining user corresponding The registered website list, the registered website list is obtained by crawling the corresponding registration records including the registered user ID and the registration mark from the server of the preset website in advance, and classifying the registration records representing the registration by the registration mark according to the user ID; The registered websites in the website registration list are compared with the identifiers of the standard websites in the preset classification, so as to classify the registered websites; and count the number of registered websites in each classification; calculate the user according to the number of registered websites in each classification. portrait.
在其中一个实施例中,处理器执行计算机可读指令时所实现的根据每一分类中注册网站的数量计算得到用户画像,包括:获取预设的多个场景、各个场景对应的多个标签以及多个标签对应的阈值;获取多个场景各自对应的当前注册网站类型;从所统计的每一分类中的注册网站的数量中,选取与当前注册网站类型对应的当前注册数量;将当前注册数量与阈值进行比较得到标签;及将所得到的标签进行组合得到用户画像。In one embodiment, when the processor executes the computer-readable instructions, the user profile is calculated and obtained according to the number of registered websites in each category, including: acquiring a plurality of preset scenes, a plurality of tags corresponding to each scene, and Thresholds corresponding to multiple labels; obtain the currently registered website types corresponding to multiple scenarios; select the current registered website type corresponding to the current registered website type from the counted number of registered websites in each category; comparing with a threshold to obtain a label; and combining the obtained labels to obtain a user portrait.
在其中一个实施例中,处理器执行计算机可读指令时所实现的根据每一分类中注册网站的数量计算得到用户画像,包括:获取当前场景,以及当前场景对应的当前注册网站类型;从所统计的每一分类中的注册网站的数量中,选取与当前注册网站类型对应的当前注册数量;及根据当前注册数量进行模型训练得到用户画像模型,根据用户画像模型得到用户画像。In one embodiment, when the processor executes the computer-readable instructions, the user profile is calculated and obtained according to the number of registered websites in each category, including: obtaining the current scene and the current registered website type corresponding to the current scene; From the number of registered websites in each category of the statistics, select the current registration number corresponding to the current registered website type; and perform model training according to the current registration number to obtain a user portrait model, and obtain a user portrait according to the user portrait model.
在其中一个实施例中,处理器执行计算机可读指令时所实现的根据当前注册数量进行模型训练得到用户画像模型,包括:根据当前注册数量生成第一预设维度的第一特征向量;获取根据用户基本信息生成的第二预设维度的第二特征向量;根据第一特征向量和第二特征向量生成用户画像模型;处理器执行计算机可读指令时所实现的根据用户画像模型得到用户画像,包括:根据用户画像模型得到表征产品需求概率的用户画像;及处理器执行计算机可读指令时还实现以下步骤:根据产品需求概率对用户进行排序,按照排序向用户推送对应的产品。In one embodiment, when the processor executes the computer-readable instructions, performing model training according to the current number of registrations to obtain a user portrait model includes: generating a first feature vector of a first preset dimension according to the current number of registrations; The second feature vector of the second preset dimension generated by the basic user information; the user portrait model is generated according to the first feature vector and the second feature vector; when the processor executes the computer-readable instruction, the user portrait is obtained according to the user portrait model, The method includes: obtaining user portraits representing product demand probability according to the user portrait model; and when the processor executes the computer-readable instructions, the processor further implements the following steps: sorting users according to the product demand probability, and pushing corresponding products to the users according to the sorting.
在其中一个实施例中,处理器执行计算机可读指令时所实现的根据当前注册数量进行模型训练得到用户画像模型,包括:根据当前注册数量生成基于评分卡模型的用户画像模型;及处理器执行计算机可读指令时所实现的根据用户画像模型得到用户画像,包括:将 当前注册数量与评分卡模型中各个分段的网站数量进行比较以确定用户风险评分;根据用户风险评分得到对应的用户画像。In one embodiment, when the processor executes the computer-readable instructions, performing model training according to the current number of registrations to obtain a user portrait model includes: generating a user portrait model based on the scorecard model according to the current number of registrations; and the processor executing Obtaining the user portrait according to the user portrait model realized by the computer readable instruction includes: comparing the current registration number with the number of websites in each segment in the scorecard model to determine the user risk score; obtaining the corresponding user portrait according to the user risk score. .
一个或多个存储有计算机可读指令的计算机可读存储介质,计算机可读指令被一个或多个处理器执行时,使得一个或多个处理器执行以下步骤:获取用户对应的注册网站列表,注册网站列表是预先从预设网站的服务器爬取对应的包括注册用户标识以及注册标志的注册记录,并根据用户标识对注册标志表征注册的注册记录进行分类得到的;将所述网站注册列表中的注册网站与预设分类的标准网站的标识进行比较,以对注册网站进行分类进行;统计每一分类中注册网站的数量;及根据每一分类中注册网站的数量计算得到用户画像。One or more computer-readable storage media storing computer-readable instructions, when the computer-readable instructions are executed by one or more processors, cause one or more processors to perform the following steps: acquiring a list of registered websites corresponding to the user, The registered website list is obtained by crawling the corresponding registration records including the registered user ID and the registration mark from the server of the preset website in advance, and classifying the registration records representing the registration by the registration mark according to the user ID; Compare the registered website of the registered website with the identification of the standard website of the preset classification, so as to classify the registered website; count the number of registered websites in each classification; and calculate the user portrait according to the number of registered websites in each classification.
其中,该计算机可读存储介质可以是非易失性,也可以是易失性的。Wherein, the computer-readable storage medium may be non-volatile or volatile.
在其中一个实施例中,计算机可读指令被处理器执行时所实现的根据每一分类中注册网站的数量计算得到用户画像,包括:获取预设的多个场景、各个场景对应的多个标签以及多个标签对应的阈值;获取多个场景各自对应的当前注册网站类型;从所统计的每一分类中的注册网站的数量中,选取与当前注册网站类型对应的当前注册数量;将当前注册数量与阈值进行比较得到标签;及将所得到的标签进行组合得到用户画像。In one embodiment, when the computer-readable instructions are executed by the processor, the user profile is calculated and obtained according to the number of registered websites in each category, including: acquiring multiple preset scenes and multiple tags corresponding to each scene and the thresholds corresponding to multiple labels; obtain the current registered website types corresponding to multiple scenarios; select the current registered website type corresponding to the current registered website type from the counted number of registered websites in each category; The number is compared with the threshold to obtain a label; and the obtained labels are combined to obtain a user portrait.
在其中一个实施例中,计算机可读指令被处理器执行时所实现的根据每一分类中注册网站的数量计算得到用户画像,包括:获取当前场景,以及当前场景对应的当前注册网站类型;从所统计的每一分类中的注册网站的数量中,选取与当前注册网站类型对应的当前注册数量;及根据当前注册数量进行模型训练得到用户画像模型,根据用户画像模型得到用户画像。In one embodiment, when the computer-readable instructions are executed by the processor, the user profile is calculated and obtained according to the number of registered websites in each category, including: obtaining the current scene and the current registered website type corresponding to the current scene; From the counted number of registered websites in each category, select the current registered number corresponding to the current registered website type; and perform model training according to the current registered number to obtain a user portrait model, and obtain a user portrait according to the user portrait model.
在其中一个实施例中,计算机可读指令被处理器执行时所实现的根据当前注册数量进行模型训练得到用户画像模型,包括:根据当前注册数量生成第一预设维度的第一特征向量;获取根据用户基本信息生成的第二预设维度的第二特征向量;根据第一特征向量和第二特征向量生成用户画像模型;计算机可读指令被处理器执行时所实现的根据用户画像模型得到用户画像,包括:根据用户画像模型得到表征产品需求概率的用户画像;及处理器执行计算机可读指令时还实现以下步骤:根据产品需求概率对用户进行排序,按照排序向用户推送对应的产品。In one of the embodiments, when the computer-readable instructions are executed by the processor, performing model training according to the current number of registrations to obtain a user portrait model includes: generating a first feature vector of a first preset dimension according to the current number of registrations; obtaining The second feature vector of the second preset dimension generated according to the basic information of the user; the user portrait model is generated according to the first feature vector and the second feature vector; when the computer readable instruction is executed by the processor, the user is obtained according to the user portrait model. The portrait includes: obtaining a user portrait representing the product demand probability according to the user portrait model; and when the processor executes the computer-readable instruction, the processor further implements the following steps: sorting the users according to the product demand probability, and pushing corresponding products to the users according to the sorting.
在其中一个实施例中,计算机可读指令被处理器执行时所实现的根据当前注册数量进行模型训练得到用户画像模型,包括:根据当前注册数量生成基于评分卡模型的用户画像模型;及计算机可读指令被处理器执行时所实现的根据用户画像模型得到用户画像,包括:将当前注册数量与评分卡模型中各个分段的网站数量进行比较以确定用户风险评分;根据用户风险评分得到对应的用户画像。In one embodiment, when the computer-readable instructions are executed by the processor, performing model training according to the current number of registrations to obtain a user portrait model includes: generating a user portrait model based on the scorecard model according to the current number of registrations; and the computer can Obtaining the user portrait according to the user portrait model realized when the read instruction is executed by the processor includes: comparing the current registration number with the number of websites in each segment in the scorecard model to determine the user risk score; obtaining the corresponding user risk score according to the user risk score. User portrait.
本发明所指区块链是分布式数据存储、点对点传输、共识机制、加密算法等计算机技术的新型应用模式。区块链(Blockchain),本质上是一个去中心化的数据库,是一串使用密码学方法相关联产生的数据块,每一个数据块中包含了一批次网络交易的信息,用于验证 其信息的有效性(防伪)和生成下一个区块。区块链可以包括区块链底层平台、平台产品服务层以及应用服务层等。The blockchain referred to in the present invention is a new application mode of computer technologies such as distributed data storage, point-to-point transmission, consensus mechanism, and encryption algorithm. Blockchain, essentially a decentralized database, is a series of data blocks associated with cryptographic methods. Each data block contains a batch of network transaction information to verify its Validity of information (anti-counterfeiting) and generation of the next block. The blockchain can include the underlying platform of the blockchain, the platform product service layer, and the application service layer.
本领域普通技术人员可以理解实现上述实施例方法中的全部或部分流程,是可以通过计算机可读指令来指令相关的硬件来完成,所述的计算机可读指令可存储于一计算机可读取存储介质中,该计算机可读取存储介质可以为易失性的或非易失性的,该计算机可读指令在执行时,可包括如上述各方法的实施例的流程。其中,本申请所提供的各实施例中所使用的对存储器、存储、数据库或其它介质的任何引用,均可包括非易失性和/或易失性存储器。非易失性存储器可包括只读存储器(ROM)、可编程ROM(PROM)、电可编程ROM(EPROM)、电可擦除可编程ROM(EEPROM)或闪存。易失性存储器可包括随机存取存储器(RAM)或者外部高速缓冲存储器。作为说明而非局限,RAM以多种形式可得,诸如静态RAM(SRAM)、动态RAM(DRAM)、同步DRAM(SDRAM)、双数据率SDRAM(DDRSDRAM)、增强型SDRAM(ESDRAM)、同步链路(Synchlink)DRAM(SLDRAM)、存储器总线(Rambus)直接RAM(RDRAM)、直接存储器总线动态RAM(DRDRAM)、以及存储器总线动态RAM(RDRAM)等。Those of ordinary skill in the art can understand that all or part of the processes in the methods of the above embodiments can be implemented by instructing relevant hardware through computer-readable instructions, and the computer-readable instructions can be stored in a computer-readable storage In the medium, the computer-readable storage medium may be volatile or non-volatile, and when the computer-readable instructions are executed, they may include the processes of the foregoing method embodiments. Wherein, any reference to memory, storage, database or other medium used in the various embodiments provided in this application may include non-volatile and/or volatile memory. Nonvolatile memory may include read only memory (ROM), programmable ROM (PROM), electrically programmable ROM (EPROM), electrically erasable programmable ROM (EEPROM), or flash memory. Volatile memory may include random access memory (RAM) or external cache memory. By way of illustration and not limitation, RAM is available in various forms such as static RAM (SRAM), dynamic RAM (DRAM), synchronous DRAM (SDRAM), double data rate SDRAM (DDRSDRAM), enhanced SDRAM (ESDRAM), synchronous chain Road (Synchlink) DRAM (SLDRAM), memory bus (Rambus) direct RAM (RDRAM), direct memory bus dynamic RAM (DRDRAM), and memory bus dynamic RAM (RDRAM), etc.
以上实施例的各技术特征可以进行任意的组合,为使描述简洁,未对上述实施例中的各个技术特征所有可能的组合都进行描述,然而,只要这些技术特征的组合不存在矛盾,都应当认为是本说明书记载的范围。The technical features of the above embodiments can be combined arbitrarily. In order to make the description simple, all possible combinations of the technical features in the above embodiments are not described. However, as long as there is no contradiction in the combination of these technical features It is considered to be the range described in this specification.
以上所述实施例仅表达了本申请的几种实施方式,其描述较为具体和详细,但并不能因此而理解为对发明专利范围的限制。应当指出的是,对于本领域的普通技术人员来说,在不脱离本申请构思的前提下,还可以做出若干变形和改进,这些都属于本申请的保护范围。因此,本申请专利的保护范围应以所附权利要求为准。The above-mentioned embodiments only represent several embodiments of the present application, and the descriptions thereof are relatively specific and detailed, but should not be construed as a limitation on the scope of the invention patent. It should be pointed out that for those skilled in the art, without departing from the concept of the present application, several modifications and improvements can be made, which all belong to the protection scope of the present application. Therefore, the scope of protection of the patent of the present application shall be subject to the appended claims.

Claims (20)

  1. 一种基于网站注册的用户画像生成方法,包括:A method for generating user portraits based on website registration, comprising:
    获取用户对应的注册网站列表,所述注册网站列表是预先从预设网站的服务器爬取对应的包括注册用户标识以及注册标志的注册记录,并根据所述用户标识对注册标志表征注册的注册记录进行分类得到的;Obtain a list of registered websites corresponding to the user, and the registered website list is to crawl the corresponding registration records including the registered user ID and the registration mark from the server of the preset website in advance, and according to the user ID, the registration mark represents the registered registration record. classified;
    将所述网站注册列表中的注册网站与预设分类的标准网站的标识进行比较,以对所述注册网站进行分类;Comparing the registered website in the website registration list with the identification of the standard website of the preset classification, to classify the registered website;
    统计每一分类中注册网站的数量;及Count the number of registered websites in each category; and
    根据每一分类中注册网站的数量计算得到用户画像。User portraits are calculated based on the number of registered websites in each category.
  2. 根据权利要求1所述的方法,其中,所述根据每一分类中注册网站的数量计算得到用户画像,包括:The method according to claim 1, wherein the calculating and obtaining the user portrait according to the number of registered websites in each category comprises:
    获取预设的多个场景、各个场景对应的多个标签以及多个标签对应的阈值;Obtain multiple preset scenes, multiple tags corresponding to each scene, and thresholds corresponding to multiple tags;
    获取所述多个场景各自对应的当前注册网站类型;obtaining the current registered website type corresponding to each of the multiple scenarios;
    从所统计的每一分类中的注册网站的数量中,选取与所述当前注册网站类型对应的当前注册数量;From the counted number of registered websites in each category, select the current registered number corresponding to the type of the current registered website;
    将所述当前注册数量与所述阈值进行比较得到标签;及comparing the current number of registrations to the threshold to obtain a label; and
    将所得到的标签进行组合得到用户画像。The obtained tags are combined to obtain the user portrait.
  3. 根据权利要求1所述的方法,其中,所述根据每一分类中注册网站的数量计算得到用户画像,包括:The method according to claim 1, wherein the calculating and obtaining the user portrait according to the number of registered websites in each category comprises:
    获取当前场景,以及所述当前场景对应的当前注册网站类型;Obtain the current scene, and the current registered website type corresponding to the current scene;
    从所统计的每一分类中的注册网站的数量中,选取与所述当前注册网站类型对应的当前注册数量;及From the counted number of registered websites in each category, select the current registered number corresponding to the currently registered website type; and
    根据所述当前注册数量进行模型训练得到用户画像模型,根据所述用户画像模型得到用户画像。Perform model training according to the current registration number to obtain a user portrait model, and obtain a user portrait according to the user portrait model.
  4. 根据权利要求3所述的方法,其中,所述根据所述当前注册数量进行模型训练得到用户画像模型,包括:The method according to claim 3, wherein the user portrait model obtained by performing model training according to the current registration number comprises:
    根据所述当前注册数量生成第一预设维度的第一特征向量;Generate a first feature vector of a first preset dimension according to the current registration number;
    获取根据用户基本信息生成的第二预设维度的第二特征向量;obtaining the second feature vector of the second preset dimension generated according to the basic information of the user;
    根据所述第一特征向量和所述第二特征向量生成用户画像模型;及generating a user portrait model according to the first feature vector and the second feature vector; and
    所述根据所述用户画像模型得到用户画像,包括:The obtaining of the user portrait according to the user portrait model includes:
    根据所述用户画像模型得到表征产品需求概率的用户画像;及obtaining a user portrait representing the probability of product demand according to the user portrait model; and
    所述方法还包括:The method also includes:
    根据所述产品需求概率对用户进行排序,按照所述排序向用户推送对应的产品。The users are sorted according to the product demand probability, and corresponding products are pushed to the users according to the sorting.
  5. 根据权利要求3所述的方法,其中,所述根据所述当前注册数量进行模型训练得到用户画像模型,包括:The method according to claim 3, wherein the user portrait model obtained by performing model training according to the current registration number comprises:
    根据所述当前注册数量生成基于评分卡模型的用户画像模型;及generating a user profile model based on the scorecard model according to the current number of registrations; and
    所述根据所述用户画像模型得到用户画像,包括:The obtaining of the user portrait according to the user portrait model includes:
    将所述当前注册数量与所述评分卡模型中各个分段的网站数量进行比较以确定用户风险评分;comparing the current number of registrations to the number of websites for each segment in the scorecard model to determine a user risk score;
    根据所述用户风险评分得到对应的用户画像。A corresponding user portrait is obtained according to the user risk score.
  6. 一种基于网站注册的用户画像生成装置,其中,所述装置包括:A device for generating user portraits based on website registration, wherein the device comprises:
    网站列表获取模块,用于获取用户对应的注册网站列表,所述注册网站列表是预先从预设网站的服务器爬取对应的包括注册用户标识以及注册标志的注册记录,并根据所述用户标识对注册标志表征注册的注册记录进行分类得到的;The website list acquisition module is used to obtain the list of registered websites corresponding to the user, and the registered website list is to crawl the corresponding registration records including the registered user ID and the registration mark from the server of the preset website in advance, and according to the user ID. The registration mark is obtained by classifying the registered registration records;
    分类模块,用于将所述网站注册列表中的注册网站与预设分类的标准网站的标识进行比较,以对所述注册网站进行分类;A classification module, configured to compare the registered website in the website registration list with the identification of the standard website of the preset classification, so as to classify the registered website;
    统计模块,用于统计每一分类中注册网站的数量;及Statistics module for counting the number of registered websites in each category; and
    画像生成模块,用于根据每一分类中注册网站的数量计算得到用户画像。The portrait generation module is used to calculate the user portrait according to the number of registered websites in each category.
  7. 根据权利要求6所述的装置,其中,所述画像生成模块包括:The device according to claim 6, wherein the profile generation module comprises:
    第一场景获取单元,用于获取预设的多个场景、各个场景对应的多个标签以及多个标签对应的阈值;a first scene acquisition unit, configured to acquire multiple preset scenes, multiple tags corresponding to each scene, and thresholds corresponding to the multiple tags;
    第一当前注册网站类型获取单元,用于获取所述多个场景各自对应的当前注册网站类型;a first currently registered website type acquiring unit, configured to acquire the respective current registered website types corresponding to the multiple scenarios;
    数量选取单元,用于从所统计的每一分类中的注册网站的数量中,选取与所述当前注册网站类型对应的当前注册数量;A quantity selection unit, used for selecting the current registration quantity corresponding to the type of the current registration website from the number of registered websites in each category of the statistics;
    比较单元,用于将所述当前注册数量与所述阈值进行比较得到标签;及a comparison unit for comparing the current registration number with the threshold to obtain a label; and
    第一画像生成单元,用于将所得到的标签进行组合得到用户画像。The first portrait generation unit is used for combining the obtained tags to obtain the user portrait.
  8. 根据权利要求6所述的装置,其中,所述画像生成模块包括:The device according to claim 6, wherein the profile generation module comprises:
    第二场景获取单元,用于获取当前场景,以及所述当前场景对应的当前注册网站类型;a second scene acquisition unit, configured to acquire the current scene and the currently registered website type corresponding to the current scene;
    第二当前注册网站类型获取单元,用于从所统计的每一分类中的注册网站的数量中,选取与所述当前注册网站类型对应的当前注册数量;及The second current registered website type acquisition unit is used to select the current registered number corresponding to the currently registered website type from the counted number of registered websites in each category; and
    模型生成单元,用于根据所述当前注册数量进行模型训练得到用户画像模型,根据所述用户画像模型得到用户画像。A model generation unit, configured to perform model training according to the current registration number to obtain a user portrait model, and obtain a user portrait according to the user portrait model.
  9. 根据权利要求8所述的装置,其中,所述模型生成单元包括:The apparatus according to claim 8, wherein the model generating unit comprises:
    第一特征向量生成子单元,用于根据所述当前注册数量生成第一预设维度的第一特征向量;a first feature vector generating subunit, for generating a first feature vector of a first preset dimension according to the current registration number;
    第二特征向量生成子单元,用于获取根据用户基本信息生成的第二预设维度的第二特征向量;The second feature vector generating subunit is used to obtain the second feature vector of the second preset dimension generated according to the basic information of the user;
    第一模型生成子单元,用于根据所述第一特征向量和所述第二特征向量生成用户画像模型;及a first model generation subunit for generating a user portrait model according to the first feature vector and the second feature vector; and
    所述模型生成单元还用于根据所述用户画像模型得到表征产品需求概率的用户画像;及The model generation unit is further configured to obtain a user portrait representing the probability of product demand according to the user portrait model; and
    所述基于网站注册的用户画像生成装置还包括:The device for generating user portraits based on website registration also includes:
    推送模块,用于根据所述产品需求概率对用户进行排序,按照所述排序向用户推送对应的产品。A push module is configured to sort the users according to the product demand probability, and push corresponding products to the users according to the sorting.
  10. 根据权利要求8所述的装置,其中,所述模型生成单元包括:The apparatus according to claim 8, wherein the model generating unit comprises:
    第二模型生成子单元,用于根据所述当前注册数量生成基于评分卡模型的用户画像模型;及A second model generation subunit for generating a user portrait model based on the scorecard model according to the current registration number; and
    评分计算子单元,用于将所述当前注册数量与所述评分卡模型中各个分段的网站数量进行比较以确定用户风险评分;a score calculation subunit for comparing the current registration number with the number of websites in each segment in the scorecard model to determine a user risk score;
    画像生成子单元,用于根据所述用户风险评分得到对应的用户画像。The portrait generation subunit is used to obtain the corresponding user portrait according to the user risk score.
  11. 一种计算机设备,包括存储器及一个或多个处理器,所述存储器中储存有计算机可读指令,所述计算机可读指令被所述一个或多个处理器执行时,使得所述一个或多个处理器执行以下步骤:A computer device comprising a memory and one or more processors, the memory having computer-readable instructions stored in the memory that, when executed by the one or more processors, cause the one or more processors to Each processor performs the following steps:
    获取用户对应的注册网站列表,所述注册网站列表是预先从预设网站的服务器爬取对应的包括注册用户标识以及注册标志的注册记录,并根据所述用户标识对注册标志表征注册的注册记录进行分类得到的;Obtain a list of registered websites corresponding to the user, the registered website list is to crawl the corresponding registration records including the registered user ID and the registration mark from the server of the preset website in advance, and according to the user ID, the registration mark represents the registered registration record. classified;
    将所述网站注册列表中的注册网站与预设分类的标准网站的标识进行比较,以对所述注册网站进行分类;Comparing the registered website in the website registration list with the identification of the standard website of the preset classification, to classify the registered website;
    统计每一分类中注册网站的数量;及Count the number of registered websites in each category; and
    根据每一分类中注册网站的数量计算得到用户画像。User portraits are calculated based on the number of registered websites in each category.
  12. 根据权利要求10所述的计算机设备,其中,所述处理器执行所述计算机可读指令时所实现的所述根据每一分类中注册网站的数量计算得到用户画像,包括:The computer device according to claim 10, wherein the calculating and obtaining the user portrait according to the number of registered websites in each category, which is implemented when the processor executes the computer-readable instructions, comprises:
    获取预设的多个场景、各个场景对应的多个标签以及多个标签对应的阈值;Obtain multiple preset scenes, multiple tags corresponding to each scene, and thresholds corresponding to multiple tags;
    获取所述多个场景各自对应的当前注册网站类型;obtaining the current registered website type corresponding to each of the multiple scenarios;
    从所统计的每一分类中的注册网站的数量中,选取与所述当前注册网站类型对应的当前注册数量;From the counted number of registered websites in each category, select the current registered number corresponding to the type of the current registered website;
    将所述当前注册数量与所述阈值进行比较得到标签;及comparing the current number of registrations to the threshold to obtain a label; and
    将所得到的标签进行组合得到用户画像。The obtained tags are combined to obtain the user portrait.
  13. 根据权利要求11所述的计算机设备,其中,所述处理器执行所述计算机可读指令时所实现的所述根据每一分类中注册网站的数量计算得到用户画像,包括:The computer device according to claim 11, wherein the calculating and obtaining the user portrait according to the number of registered websites in each category, which is implemented when the processor executes the computer-readable instructions, comprises:
    获取当前场景,以及所述当前场景对应的当前注册网站类型;Obtain the current scene, and the current registered website type corresponding to the current scene;
    从所统计的每一分类中的注册网站的数量中,选取与所述当前注册网站类型对应的当前注册数量;及From the counted number of registered websites in each category, select the current registered number corresponding to the currently registered website type; and
    根据所述当前注册数量进行模型训练得到用户画像模型,根据所述用户画像模型得到用户画像。Perform model training according to the current registration number to obtain a user portrait model, and obtain a user portrait according to the user portrait model.
  14. 根据权利要求13所述的计算机设备,其中,所述处理器执行所述计算机可读指令时 所实现的所述根据所述当前注册数量进行模型训练得到用户画像模型,包括:computer equipment according to claim 13, wherein, when the processor executes the computer-readable instruction, the described carrying out model training according to the current number of registrations realized to obtain a user portrait model, comprising:
    根据所述当前注册数量生成第一预设维度的第一特征向量;Generate a first feature vector of a first preset dimension according to the current registration number;
    获取根据用户基本信息生成的第二预设维度的第二特征向量;obtaining the second feature vector of the second preset dimension generated according to the basic information of the user;
    根据所述第一特征向量和所述第二特征向量生成用户画像模型;及generating a user portrait model according to the first feature vector and the second feature vector; and
    所述处理器执行所述计算机可读指令时所实现的所述根据所述用户画像模型得到用户画像,包括:The obtaining of the user portrait according to the user portrait model, which is realized when the processor executes the computer-readable instructions, includes:
    根据所述用户画像模型得到表征产品需求概率的用户画像;及obtaining a user portrait representing the probability of product demand according to the user portrait model; and
    所述处理器执行所述计算机可读指令时还执行以下步骤:The processor also performs the following steps when executing the computer-readable instructions:
    根据所述产品需求概率对用户进行排序,按照所述排序向用户推送对应的产品。The users are sorted according to the product demand probability, and corresponding products are pushed to the users according to the sorting.
  15. 根据权利要求13所述的计算机设备,其中,所述处理器执行所述计算机可读指令时所实现的所述根据所述当前注册数量进行模型训练得到用户画像模型,包括:The computer device according to claim 13, wherein the user portrait model obtained by performing model training according to the current registration number, which is implemented when the processor executes the computer-readable instructions, comprises:
    根据所述当前注册数量生成基于评分卡模型的用户画像模型;及generating a user profile model based on the scorecard model according to the current number of registrations; and
    所述处理器执行所述计算机可读指令时所实现的所述根据所述用户画像模型得到用户画像,包括:The obtaining of the user portrait according to the user portrait model, which is realized when the processor executes the computer-readable instructions, includes:
    将所述当前注册数量与所述评分卡模型中各个分段的网站数量进行比较以确定用户风险评分;comparing the current number of registrations to the number of websites for each segment in the scorecard model to determine a user risk score;
    根据所述用户风险评分得到对应的用户画像。A corresponding user portrait is obtained according to the user risk score.
  16. 一个或多个存储有计算机可读指令的非易失性计算机可读存储介质,所述计算机可读指令被一个或多个处理器执行时,使得所述一个或多个处理器执行以下步骤:One or more non-volatile computer-readable storage media storing computer-readable instructions that, when executed by one or more processors, cause the one or more processors to perform the following steps:
    获取用户对应的注册网站列表,所述注册网站列表是预先从预设网站的服务器爬取对应的包括注册用户标识以及注册标志的注册记录,并根据所述用户标识对注册标志表征注册的注册记录进行分类得到的;Obtain a list of registered websites corresponding to the user, the registered website list is to crawl the corresponding registration records including the registered user ID and the registration mark from the server of the preset website in advance, and according to the user ID, the registration mark represents the registered registration record. classified;
    将所述网站注册列表中的注册网站与预设分类的标准网站的标识进行比较,以对所述注册网站进行分类;Comparing the registered website in the website registration list with the identification of the standard website of the preset classification, to classify the registered website;
    统计每一分类中注册网站的数量;及Count the number of registered websites in each category; and
    根据每一分类中注册网站的数量计算得到用户画像。User portraits are calculated based on the number of registered websites in each category.
  17. 根据权利要求16所述的存储介质,其中,所述计算机可读指令被所述处理器执行时所实现的所述根据每一分类中注册网站的数量计算得到用户画像,包括:The storage medium according to claim 16, wherein the calculating and obtaining the user portrait according to the number of registered websites in each category, which is implemented when the computer-readable instructions are executed by the processor, comprises:
    获取预设的多个场景、各个场景对应的多个标签以及多个标签对应的阈值;Obtain multiple preset scenes, multiple tags corresponding to each scene, and thresholds corresponding to multiple tags;
    获取所述多个场景各自对应的当前注册网站类型;obtaining the current registered website type corresponding to each of the multiple scenarios;
    从所统计的每一分类中的注册网站的数量中,选取与所述当前注册网站类型对应的当前注册数量;From the counted number of registered websites in each category, select the current registered number corresponding to the currently registered website type;
    将所述当前注册数量与所述阈值进行比较得到标签;及comparing the current number of registrations to the threshold to obtain a label; and
    将所得到的标签进行组合得到用户画像。The obtained tags are combined to obtain the user portrait.
  18. 根据权利要求16所述的存储介质,其中,所述计算机可读指令被所述处理器执行时 所实现的所述根据每一分类中注册网站的数量计算得到用户画像,包括:The storage medium according to claim 16, wherein, when the computer-readable instructions are executed by the processor, the user portrait is calculated according to the number of registered websites in each category, including:
    获取当前场景,以及所述当前场景对应的当前注册网站类型;Obtain the current scene, and the current registered website type corresponding to the current scene;
    从所统计的每一分类中的注册网站的数量中,选取与所述当前注册网站类型对应的当前注册数量;及From the counted number of registered websites in each category, select the current registered number corresponding to the currently registered website type; and
    根据所述当前注册数量进行模型训练得到用户画像模型,根据所述用户画像模型得到用户画像。Perform model training according to the current registration number to obtain a user portrait model, and obtain a user portrait according to the user portrait model.
  19. 根据权利要求18所述的存储介质,其中,所述计算机可读指令被所述处理器执行时所实现的所述根据所述当前注册数量进行模型训练得到用户画像模型,包括:The storage medium according to claim 18, wherein, when the computer-readable instructions are executed by the processor, the user portrait model obtained by performing model training according to the current registration number comprises:
    根据所述当前注册数量生成第一预设维度的第一特征向量;Generate a first feature vector of a first preset dimension according to the current registration number;
    获取根据用户基本信息生成的第二预设维度的第二特征向量;obtaining the second feature vector of the second preset dimension generated according to the basic information of the user;
    根据所述第一特征向量和所述第二特征向量生成用户画像模型;及generating a user portrait model according to the first feature vector and the second feature vector; and
    所述计算机可读指令被所述处理器执行时所实现的所述根据所述用户画像模型得到用户画像,包括:The obtaining the user portrait according to the user portrait model, which is realized when the computer-readable instructions are executed by the processor, includes:
    根据所述用户画像模型得到表征产品需求概率的用户画像;及obtaining a user portrait representing the probability of product demand according to the user portrait model; and
    所述计算机可读指令被所述处理器执行时还执行以下步骤:The computer-readable instructions, when executed by the processor, also perform the following steps:
    根据所述产品需求概率对用户进行排序,按照所述排序向用户推送对应的产品。The users are sorted according to the product demand probability, and corresponding products are pushed to the users according to the sorting.
  20. 根据权利要求18所述的存储介质,其中,所述计算机可读指令被所述处理器执行时所实现的所述根据所述当前注册数量进行模型训练得到用户画像模型,包括:The storage medium according to claim 18, wherein, when the computer-readable instructions are executed by the processor, the user portrait model obtained by performing model training according to the current registration number comprises:
    根据所述当前注册数量生成基于评分卡模型的用户画像模型;及generating a user profile model based on the scorecard model according to the current number of registrations; and
    所述计算机可读指令被所述处理器执行时所实现的所述根据所述用户画像模型得到用户画像,包括:The obtaining the user portrait according to the user portrait model, which is realized when the computer-readable instructions are executed by the processor, includes:
    将所述当前注册数量与所述评分卡模型中各个分段的网站数量进行比较以确定用户风险评分;comparing the current number of registrations to the number of websites for each segment in the scorecard model to determine a user risk score;
    根据所述用户风险评分得到对应的用户画像。A corresponding user portrait is obtained according to the user risk score.
PCT/CN2021/124602 2020-12-15 2021-10-19 Website registration-based user portrait generating method and apparatus, device and medium WO2022127339A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202011473435.X 2020-12-15
CN202011473435.XA CN112417315A (en) 2020-12-15 2020-12-15 User portrait generation method, device, equipment and medium based on website registration

Publications (1)

Publication Number Publication Date
WO2022127339A1 true WO2022127339A1 (en) 2022-06-23

Family

ID=74775153

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2021/124602 WO2022127339A1 (en) 2020-12-15 2021-10-19 Website registration-based user portrait generating method and apparatus, device and medium

Country Status (2)

Country Link
CN (1) CN112417315A (en)
WO (1) WO2022127339A1 (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115967580A (en) * 2023-01-06 2023-04-14 一汽解放汽车有限公司 Vehicle service message pushing method and device, computer equipment and storage medium
CN117749880A (en) * 2023-12-21 2024-03-22 北京易丰嘉诚科技有限公司 Webpage data pushing method and system
CN115967580B (en) * 2023-01-06 2024-06-04 一汽解放汽车有限公司 Vehicle service message pushing method, device, computer equipment and storage medium

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112417315A (en) * 2020-12-15 2021-02-26 深圳壹账通智能科技有限公司 User portrait generation method, device, equipment and medium based on website registration
CN113362162A (en) * 2021-06-29 2021-09-07 深圳壹账通智能科技有限公司 Wind control identification method and device based on network behavior data, electronic equipment and medium

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106202297A (en) * 2016-06-30 2016-12-07 北京奇虎科技有限公司 Identify the method and device of user interest
CN106874435A (en) * 2017-01-25 2017-06-20 北京航空航天大学 User portrait construction method and device
WO2017157146A1 (en) * 2016-03-15 2017-09-21 平安科技(深圳)有限公司 User portrait-based personalized recommendation method and apparatus, server, and storage medium
US20190147366A1 (en) * 2017-11-13 2019-05-16 International Business Machines Corporation Intelligent Recommendations Implemented by Modelling User Profile Through Deep Learning of Multimodal User Data
CN111091351A (en) * 2019-12-16 2020-05-01 北京政信1890智能科技有限公司 User portrait construction method and device, electronic equipment and readable storage medium
CN112417315A (en) * 2020-12-15 2021-02-26 深圳壹账通智能科技有限公司 User portrait generation method, device, equipment and medium based on website registration

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2017157146A1 (en) * 2016-03-15 2017-09-21 平安科技(深圳)有限公司 User portrait-based personalized recommendation method and apparatus, server, and storage medium
CN106202297A (en) * 2016-06-30 2016-12-07 北京奇虎科技有限公司 Identify the method and device of user interest
CN106874435A (en) * 2017-01-25 2017-06-20 北京航空航天大学 User portrait construction method and device
US20190147366A1 (en) * 2017-11-13 2019-05-16 International Business Machines Corporation Intelligent Recommendations Implemented by Modelling User Profile Through Deep Learning of Multimodal User Data
CN111091351A (en) * 2019-12-16 2020-05-01 北京政信1890智能科技有限公司 User portrait construction method and device, electronic equipment and readable storage medium
CN112417315A (en) * 2020-12-15 2021-02-26 深圳壹账通智能科技有限公司 User portrait generation method, device, equipment and medium based on website registration

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115967580A (en) * 2023-01-06 2023-04-14 一汽解放汽车有限公司 Vehicle service message pushing method and device, computer equipment and storage medium
CN115967580B (en) * 2023-01-06 2024-06-04 一汽解放汽车有限公司 Vehicle service message pushing method, device, computer equipment and storage medium
CN117749880A (en) * 2023-12-21 2024-03-22 北京易丰嘉诚科技有限公司 Webpage data pushing method and system

Also Published As

Publication number Publication date
CN112417315A (en) 2021-02-26

Similar Documents

Publication Publication Date Title
EP3985578A1 (en) Method and system for automatically training machine learning model
Rodrigues et al. Combining time-series and textual data for taxi demand prediction in event areas: A deep learning approach
WO2022127339A1 (en) Website registration-based user portrait generating method and apparatus, device and medium
WO2021027317A1 (en) Relationship network-based attribute information processing method and device, computer apparatus, and storage medium
CN109493199A (en) Products Show method, apparatus, computer equipment and storage medium
Chen et al. Predicting the influence of users’ posted information for eWOM advertising in social networks
CN111597348B (en) User image drawing method, device, computer equipment and storage medium
CN109934619A (en) User&#39;s portrait tag modeling method, apparatus, electronic equipment and readable storage medium storing program for executing
Sari Aslam et al. A high-precision heuristic model to detect home and work locations from smart card data
CN111275491A (en) Data processing method and device
CN111192153B (en) Crowd relation network construction method, device, computer equipment and storage medium
CN112861980B (en) Calendar task table mining method based on big data and computer equipment
CN107633257B (en) Data quality evaluation method and device, computer readable storage medium and terminal
CN110751285A (en) Training method and system and prediction method and system of neural network model
Rutskiy et al. Prospects for the Use of Artificial Intelligence to Combat Fraud in Bank Payments
CN114398557A (en) Information recommendation method and device based on double portraits, electronic equipment and storage medium
CN116029760A (en) Message pushing method, device, computer equipment and storage medium
CN112084408B (en) List data screening method, device, computer equipment and storage medium
CN115081447A (en) Method, device and equipment for constructing requirement document of software development and storage medium
CN109919811B (en) Insurance agent culture scheme generation method based on big data and related equipment
Mahalle et al. Data Acquisition and Preparation
CN111552827A (en) Labeling method and device, and behavior willingness prediction model training method and device
CN112581042B (en) Performance capability evaluation system and method and electronic equipment
CN117633594A (en) Verification method for portrait information of electric power user
CN113434660A (en) Product recommendation method, device, equipment and storage medium based on multi-domain classification

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 21905268

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 021023)

122 Ep: pct application non-entry in european phase

Ref document number: 21905268

Country of ref document: EP

Kind code of ref document: A1