CN111143689A - Method for constructing recommendation engine according to user requirements and user portrait - Google Patents

Method for constructing recommendation engine according to user requirements and user portrait Download PDF

Info

Publication number
CN111143689A
CN111143689A CN201911411250.3A CN201911411250A CN111143689A CN 111143689 A CN111143689 A CN 111143689A CN 201911411250 A CN201911411250 A CN 201911411250A CN 111143689 A CN111143689 A CN 111143689A
Authority
CN
China
Prior art keywords
user
information
house
source
recommended
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Withdrawn
Application number
CN201911411250.3A
Other languages
Chinese (zh)
Inventor
李昭
陈浩
高靖
崔岩
卢述奇
陈呈
张宵
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Qingwutong Co ltd
Original Assignee
Qingwutong Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Qingwutong Co ltd filed Critical Qingwutong Co ltd
Priority to CN201911411250.3A priority Critical patent/CN111143689A/en
Publication of CN111143689A publication Critical patent/CN111143689A/en
Withdrawn legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9535Search customisation based on user profiles and personalisation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9537Spatial or temporal dependent retrieval, e.g. spatiotemporal queries
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce
    • G06Q30/02Marketing; Price estimation or determination; Fundraising
    • G06Q30/0241Advertisements
    • G06Q30/0251Targeted advertisements
    • G06Q30/0255Targeted advertisements based on user history
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce
    • G06Q30/02Marketing; Price estimation or determination; Fundraising
    • G06Q30/0241Advertisements
    • G06Q30/0251Targeted advertisements
    • G06Q30/0255Targeted advertisements based on user history
    • G06Q30/0256User search
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce
    • G06Q30/02Marketing; Price estimation or determination; Fundraising
    • G06Q30/0241Advertisements
    • G06Q30/0251Targeted advertisements
    • G06Q30/0269Targeted advertisements based on user profile or attribute
    • G06Q30/0271Personalized advertisement
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/10Services
    • G06Q50/16Real estate

Landscapes

  • Engineering & Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • Theoretical Computer Science (AREA)
  • Strategic Management (AREA)
  • Development Economics (AREA)
  • Finance (AREA)
  • Databases & Information Systems (AREA)
  • Accounting & Taxation (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Economics (AREA)
  • Marketing (AREA)
  • General Business, Economics & Management (AREA)
  • Game Theory and Decision Science (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Tourism & Hospitality (AREA)
  • Data Mining & Analysis (AREA)
  • General Engineering & Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • Human Resources & Organizations (AREA)
  • Primary Health Care (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The application discloses a method for constructing a recommendation engine according to user requirements and user figures, which relates to the technical field of data statistics, and a specific implementation mode of the method comprises the following steps: collecting information of house resources browsed by a user and house resource information, acquiring user information, and determining user portrait information through keywords; the system also comprises user demand information, and the demand of the user on the house price, the distance working place or the subway station and the business circle; obtaining a first recall data source and a second recall data source to obtain a recommended data source; performing characteristic engineering processing through a Spark distributed framework to obtain a second recommended data source, and training to obtain a ranking model; inputting the house source information into a sequencing model to obtain a first recommended house source; meanwhile, generating a second recommended house source according to the house owner demand; and constructing a recommendation engine. The method and the device for determining the user portrait information according to the keywords can improve the accuracy and the integrity of the user portrait information and are beneficial to improving the accuracy of the sequencing model.

Description

Method for constructing recommendation engine according to user requirements and user portrait
Technical Field
The application relates to the technical field of data statistics, in particular to a method for constructing a recommendation engine according to user requirements and user portraits.
Background
With the development of the internet, the interests of users are more and more extensive, and with the change of the environment and living standard of users, the demands of users are also changing. At present, the LBS (Location Based Services) recommendation engine recall and sorting method has the problems in the applicability of the long-rented apartment field:
1. the current long-rent apartment collection user information is generally only according to the online record of the user, and the channel is single, so that the user information is omitted, and the sample information base is lost;
2. the recall mode industry is integrally inverted index, and does not consider the properties of bulk objects such as high-quality plot price and the like of high-quality public places;
3. the general recommendation rough arrangement stage in the rough arrangement layer is not optimized for a high-value sparse target scene, a long-rented apartment scene needs multilayer rough arrangement, LBS and value matching strategies need to be carried out for matching of strong intentions of clients and key attributes of target objects, and matching strategies need to be carried out for secondary requirements of users and secondary house states and district characteristics of houses;
4. the ranking layer industry generally bases on the user online behavior to do the learning ToRank model ranking of pair-wise or list-wise. However, in a long-rented apartment, the conditions of on-line feedback of users and the like need to be considered, the sequencing model is more complex, multi-level sample weight depiction needs to be performed on the basis of the sequencing model, and meanwhile, the scenes such as areas, house states and the like need to be deeply subdivided on the basis of the high sparsity of the samples and the large-volume attribute of transactions, so that the recommendation quality is improved;
5. in the interference adjustment layer industry, accurate advertisement insertion is generally performed based on a bidding ranking mode, modeling is not performed based on a specific LBS mode of a long rental apartment, and the recommendation effect is not ideal.
Disclosure of Invention
In view of the above, the present application discloses a method for constructing a recommendation engine according to user requirements and a user profile, comprising the steps of:
collecting information of house resources browsed by a user and house resource information, acquiring user information, wherein the user information comprises user portrait information, determining a browsing information set of the user to a house according to the information of the house resources browsed by the user, the browsing information set comprises a plurality of keywords, the keywords comprise preference values and consumption values, the preference values are used for identifying preference degrees of the user to the browsed house, the consumption values are used for identifying consumption receiving degrees of the user to the browsed house, the keywords are integrated to obtain probability information of at least one to-be-selected user portrait, and the user portrait information is determined according to the probability information of the at least one to-be-selected user portrait;
the user information also comprises user demand information, and the user demand information comprises the demand of a user on the room price, the distance working place or the subway station and the business circle;
the house source information is recalled according to the user demand information, and the first recall data source is obtained through filtering and formatting of a data collector;
the house source information is recalled according to the user portrait information, and a second recall data source is obtained by screening through an optical disc arranged in an offline data warehouse and LBS (location based service);
carrying out price matching and section feature matching on the first recall data source and the second recall data source according to LBS service to obtain corresponding matching degrees, and sequentially sequencing from high to low according to the matching degrees to generate a first recommended data source;
performing feature engineering processing on the first recommended data source through a Spark distributed framework, and grading to obtain a second recommended data source;
training the second recommended data source to obtain a ranking model;
inputting the house source information into the sequencing model to obtain the grading sequencing condition of the house source information;
setting a scoring threshold value, wherein the house source with the score of the house source information larger than the scoring threshold value is a first recommended house source;
meanwhile, generating a second recommended house source according to the house owner demand;
and constructing a recommendation engine according to the first recommended house source and the second recommended house source.
Preferably, the user information is collected by means of online, offline and electrical pinning.
Preferably, the first recommended house source and the second recommended house source are arranged in a crossed manner to construct a recommendation engine.
Preferably, the first recalled data source is recalled by way of a search.
Preferably, the feature engineering process includes sequentially performing aggregation, refining, screening, and completion on the first recommended data source to obtain the second recommended data source.
Preferably, the inputting the room source information into the ranking model to obtain the ranking condition of the scores of the room source information includes: and carrying out weight depiction on the house source information, and carrying out deep division on the scenes such as regions, house states and the like according to the sparsity and the transaction attribute of the house source information to obtain the grading and sequencing conditions of the house source information.
Preferably, the recommendation engine further comprises a hotspot source.
Compared with the prior art, the method for constructing the recommendation engine according to the user requirements and the user portrait provided by the invention has the following beneficial effects that:
1. according to the method for constructing the recommendation engine according to the user requirements and the user portrait, the first recommendation data source is subjected to feature engineering processing through the Spark distributed framework to obtain the second recommendation data source, and the response time of the obtained result is favorably shortened.
2. The method for constructing the recommendation engine according to the user requirements and the user portrait determines the user portrait information according to the probability information of at least one user portrait to be selected, and can improve the accuracy and the integrity of the user portrait information.
3. The method for constructing the recommendation engine according to the user requirements and the user portrait provided by the invention has rich information acquisition channels, including user online APP record, offline watching and communication and electric marketing modes; a large amount of user information is collected, an integral sample model is generated, and the bulk object attributes such as high-quality parcel prices of high-quality long-rent apartments are covered.
4. The method for constructing the recommendation engine according to the user requirements and the user portrait optimizes the personalized recommendation mode, and extracts apartments with high recommendation matching degree from the sample model library through comprehensive measurement and calculation of the user requirements and the portrait.
5. According to the method for constructing the recommendation engine according to the user requirements and the user portrait, the recommendation sequencing model is optimized, a new matching strategy is formulated for the conditions of online and offline watching, signing and the like, the sequencing model is subjected to multilevel sample weight portrayal, and the complexity of the existing sequencing model is reduced; meanwhile, scenes such as regions, house states and the like are deeply subdivided according to the high sparsity of samples and the large-amount attribute of transaction, and the quality of the recommended apartment is improved.
6. The method for constructing the recommendation engine according to the user requirements and the user portrait optimizes an intervention adjustment layer, models based on a specific LBS (location based service) mode of the long-rented apartment, and designs the characteristic recommendation engine of the long-rented apartment with bulk commodities with sparsely distributed landmass.
The technical solution of the present invention is further described in detail by the accompanying drawings and embodiments.
Drawings
The accompanying drawings, which are included to provide a further understanding of the application and are incorporated in and constitute a part of this application, illustrate embodiment(s) of the application and together with the description serve to explain the application and not to limit the application. In the drawings:
FIG. 1 is a flow diagram of a method of constructing a recommendation engine based on user needs and a user profile in accordance with the present invention;
FIG. 2 is a flowchart of another method for constructing a recommendation engine based on user requirements and user profiles in accordance with the present invention.
Detailed Description
The technical solution in the embodiments of the present invention will be clearly and completely described below with reference to the accompanying drawings in the embodiments of the present invention. It should be noted that the described embodiments are merely some embodiments, rather than all embodiments, of the invention and are merely illustrative in nature and in no way intended to limit the invention, its application, or uses. The protection scope of the present application shall be subject to the definitions of the appended claims.
Example 1:
fig. 1 is a flowchart of a method for building a recommendation engine according to user requirements and a user profile according to the present invention, and as shown in fig. 1, the method for building a recommendation engine according to user requirements and a user profile provided in this embodiment includes the steps of:
step 101, collecting information of house sources browsed by a user and house source information, acquiring user information, wherein the user information comprises user portrait information, determining a browsing information set of the house to the user according to the information of the house sources browsed by the user, the browsing information set comprises a plurality of keywords, the keywords comprise preference values and consumption values, the preference values are used for identifying preference degrees of the user to the browsed house, the consumption values are used for identifying consumption receiving degrees of the user to the browsed house, the keywords are integrated to obtain probability information of at least one to-be-selected user portrait, and the user portrait information is determined according to the probability information of at least one to-be-selected user portrait; the user information also comprises user demand information, and the user demand information comprises the demand of the user on the house price, the distance working place or the subway station and the business circle;
it can be understood that the probability information of at least one to-be-selected user portrait is obtained by integrating the keywords, the probability information can be obtained by using a neural network model, and the model is trained through a preset data set in advance, so that the first model has the processing capacity of generating the user portrait. By determining the user portrait information based on the method, the accuracy and the integrity of the information can be improved.
In step 101, user information may be collected by online, offline, and electrical pinning. It can be understood that, in the first online acquisition mode, according to browsing and searching records of a user on a webpage and an APP, user intention information is acquired; the second is an offline acquisition mode, which acquires the intention information of the user through the carrying, signing and communication of a specially-assigned person; the third is a mode of electric marketing, which is used for determining the intention of the user through communication with the user by telephone; the method is characterized by enriching an information acquisition channel, acquiring a large amount of user information, generating an integral sample model, and covering the attributes of a large number of targets such as the price of a high-quality land parcel owned by a high-quality long-rent apartment.
Step 102, recalling house source information according to user demand information, and filtering and formatting through a data collector Logstash to obtain a first recall data source;
and the house source information is recalled according to the user portrait information, and a second recall data source is obtained by screening through an optical disk UDF (Universal description framework) arranged in a Hive offline data warehouse and LBS (location based service).
In step 102, optionally, user requirement feedback is collected through user filling or inquiry, and the search mode is recalled, so that the technology can adopt an ELK framework to ensure the real-time performance of the index establishment and query function executing MySQL query and the expandability of the recommendation engine. The ELK framework is Elasticisearch, Logstash, and Kibana is a set of solution for real-time data collection, storage, indexing, retrieval, statistical analysis and visualization. MySQL is a relational database management system.
In step 102, optionally, the data is recalled by way of workplace, place of life, and point of interest, with the least importance. But as a cold start solution without insufficient collection of user demand. UDF is an english abbreviation of Universal Disc Format (Universal Disc Format), a Universal Disc file system established by the international organization for standardization in 1996, and standard Packet Writing technology (PW) is used to simplify the use of a recorder.
103, performing price matching and section feature matching on the first recall data source and the second recall data source according to LBS service to obtain corresponding matching degrees, and sequentially sequencing from high to low according to the matching degrees to generate a first recommended data source, wherein the first recommended data source comprises an online data source and an offline data source;
in step 103, the first recall data source and the second recall data source are sorted according to the LBS service price matching strategy and the segment characteristics matching strategy to generate a first recommended data source.
And further, an LBS and value matching strategy is carried out for matching the strong intention of the client and the key attributes of the target objects, and a matching strategy is carried out for the secondary requirements of the user and the characteristics of the secondary house state and the section of the house, so that a rough sequencing result is obtained. Optimizing an individualized recommendation mode, comprehensively measuring and calculating the requirements and images of the user, and extracting a high-quality apartment with high recommendation matching degree from a sample model library. Meanwhile, a recommended sequencing model can be optimized, a new matching strategy is formulated for the conditions of online and offline watching, signing and the like, multi-level sample weight depiction is carried out on the sequencing model, and the complexity of the existing sequencing model is reduced; meanwhile, scenes such as regions, house states and the like are deeply subdivided according to the high sparsity of samples and the large-amount attribute of transaction, and the quality of the recommended apartment is improved.
In the above steps, it can be understood that the user information correspondingly generates a first recall data source, a second recall data source and a third recall data source through the user demand information, the user behavior information and the user portrait information house source information, and processes the first recall data source, the second recall data source and the third recall data source to generate a first recommended data source; some user information with relevancy can be recalled from massive user information and roughly sorted, the subsequent fine sorting can be carried out, the workload of work is reduced, and the response time of the obtained result is favorably prolonged.
104, performing feature engineering processing on the first recommended data source through a Spark distributed framework, and grading to obtain a second recommended data source;
it is to be understood that since the first recommended data source includes an online data source and an offline data source; the Spark distributed framework comprises an offline calculation module and an online module calculation module, the online data source of the first recommended data source is processed by the Spark distributed framework comprising the online calculation module, and the offline data source of the first recommended data source is processed by the Spark distributed framework comprising the offline calculation module, so that the response time of the obtained result can be shortened.
105, training a second recommended data source to obtain a ranking model;
step 106, inputting the house source information into a ranking model to obtain the scoring ranking condition of the house source information;
in step 106, the house source information is further subjected to weight portrayal, and scenes such as areas, house states and the like are deeply divided according to the sparsity and the transaction attributes of the house source information to obtain the ranking score of the house source information.
Step 107, setting a score threshold value, wherein the house source with the score of the house source information larger than the score threshold value is a first recommended house source;
in steps 104-107, it is understood that the feature engineering process includes aggregating, refining, screening, and complementing the first recommended data source to obtain a second recommended data source. And further training a corresponding sorting model after the first recommended data source is collected and processed through characteristic engineering measurement and calculation of a Spark frame, performing weight portrayal on sample data, performing deep division on scenes such as regions, house states and the like according to the high sparsity of data information and the large-amount attribute of transaction, giving accurate sorting scores of house source information, and finally recommending high-quality long-rented apartment resources to users.
Step 108, generating a second recommended house source according to the demands of the homeowners;
and step 109, constructing a recommendation engine according to the first recommended house source and the second recommended house source. Optionally, the recommendation engine further includes a hotspot source. The hot house source can be a house source with a rental rate of more than 90%, and preferably, the hot house source can be a house source with a rental rate of more than 90% near a subway station, a commercial district or a city center.
In steps 108-109, it can be understood that after modeling based on LBS specific to the long rental apartment, the system inserts some specific house sources into specific positions in the recommendation list according to customized requirements of the company such as the out-of-house tilt policy, the operation activity policy, the advertisement promotion, etc., and recommends more selection spaces for the user; and if the recommendation result is insufficient, the system automatically recommends the completion of the related hot spot competitive house resources. And optimizing an intervention adjustment layer, modeling based on a specific LBS mode of the long-renting apartment, and designing a characteristic recommendation engine of the long-renting apartment with large commodities with sparsely distributed plots.
And the first recommended house source and the second recommended house source are arranged in a crossed mode to construct a recommendation engine. When a user searches for a house source by using the recommendation engine, mixed data of the first recommended house source and the second recommended house source appear, and the first recommended house source or the second recommended house source is not part of the mixed data, so that the user demand can be met, and the income of long-renting apartments can be improved.
Example 2:
FIG. 2 is a flowchart illustrating a method for building a recommendation engine according to user requirements and user profiles according to the present invention, and as shown in FIG. 2, the present embodiment provides a method for building a recommendation engine according to user requirements and user profiles, which includes the steps of:
step 201, user information acquisition:
the information collection method mainly includes three methods: the method comprises the steps that a first online acquisition mode is adopted, and user intention information is obtained according to browsing search records of a user on a webpage and an APP; the second is an offline acquisition mode, which acquires the intention information of the user through the carrying, signing and communication of a specially-assigned person; the third is a mode of electric marketing, which is used for determining the intention of the user through communication with the user by telephone; constructing a sample model of the information acquired by all channels, storing the sample model in a database and facilitating retrieval; enriching collected information channels, including on-line APP recording, off-line communication with watching and selling modes of a user; a large amount of user information is collected, an integral sample model is generated, and the attributes of large objects such as high-quality parcel prices of high-quality long-rent apartments are covered.
Step 202, recall:
after a user searches for information of a long rental apartment near 'XX prefecture', firstly, user requirements are acquired through a sample database: recalling according to the requirements of users: the system selects data information of a long rental apartment near the XX first house from a MySQL house source library, transmits the data to a data collector Logstash for filtering and formatting, and then stores the data in an ES database;
further, collected data information is collected and carefully selected, then multi-layer rough sequencing is carried out on the long-rent apartment scenes, LBS and value matching strategies are carried out aiming at the matching of the strong intentions of the clients and the key attributes of the target objects, and matching strategies are carried out on the secondary requirements of the users and the secondary house state and the district characteristics of the house, so that a rough sequencing result is obtained; a new matching strategy is formulated for the conditions of on-line and off-line watching, signing and the like, and a multi-level sample weight depiction is carried out on the sequencing model, so that the complexity of the existing sequencing model is reduced; meanwhile, scenes such as regions, house states and the like are deeply subdivided according to the high sparsity of samples and the large-amount attribute of transaction, and the quality of the recommended apartment is improved.
Step 203, sorting:
for recalled house source data rough ordering information, after comprehensively acquiring demand preference information of a user and house state plot business circle images and performing characteristic engineering measurement and calculation processing on a Spark distributed framework, respectively performing corresponding training on online data and offline data to construct an ordering model, performing weight portrayal on sample data, performing deep division on scenes such as regions, house states and the like according to the high sparsity of data information and the large-volume attribute of transaction, giving accurate ordering and scoring of house source information, and finally recommending high-quality long-rental apartment resources to the user;
step 204, intervention and adjustment:
after modeling based on the specific LBS of the long rental apartment, the system inserts some specific house sources into specific positions in a recommendation list according to customized requirements of a company such as a house-leaving inclined policy, an operation activity policy, advertisement promotion and the like, and recommends more selection spaces for users; and if the recommendation result is insufficient, the system automatically recommends the completion of the related hot spot competitive house resources. And optimizing an intervention adjustment layer, modeling based on a specific LBS mode of the long-renting apartment, and designing a characteristic recommendation engine of the long-renting apartment with large commodities with sparsely distributed plots.
It should be understood that the recall method of the present invention may be based on only one of the user requirement information and the image information, or may be based on both methods. The embodiment only shows the recall according to the requirements of the user, and the embodiment does not make specific requirements on the recall mode and can be set according to actual conditions.
Example 3:
with continued reference to FIG. 2, FIG. 2 is a flowchart illustrating a method for building a recommendation engine according to user requirements and a user profile according to the present invention; the method for constructing the recommendation engine according to the user requirements and the user portrait provided by the embodiment comprises the following steps:
step 301, user information acquisition:
the information collection method mainly includes three methods: the method comprises the steps that a first online acquisition mode is adopted, and user intention information is obtained according to browsing search records of a user on a webpage and an APP; the second is an offline acquisition mode, which acquires the intention information of the user through the carrying, signing and communication of a specially-assigned person; the third is a mode of electric marketing, which is used for determining the intention of the user through communication with the user by telephone; constructing a sample model of the information acquired by all channels, storing the sample model in a database and facilitating retrieval; enriching collected information channels, including on-line APP recording, off-line communication with watching and selling modes of a user; a large amount of user information is collected, an integral sample model is generated, and the attributes of large objects such as high-quality parcel prices of high-quality long-rent apartments are covered.
Step 302, recall:
after a user searches for information of a long rental apartment near 'XX house', user portrait information is first acquired through a sample database: recall from user profile: mining relevant information of a long-rented apartment near the apartment through a Hive interest business district, screening the information through content and LBS, and storing the information into a Redis recall list index base;
collecting and carefully selecting the acquired data information, then carrying out multilayer rough sequencing on the long-rent apartment scenes, carrying out LBS and value matching strategies aiming at the matching of the strong intentions of the clients and the key attributes of the target objects, and carrying out matching strategies on the secondary requirements of the users, the secondary house state and the district characteristics of the house to obtain a rough sequencing result; a new matching strategy is formulated for the conditions of on-line and off-line watching, signing and the like, and a multi-level sample weight depiction is carried out on the sequencing model, so that the complexity of the existing sequencing model is reduced; meanwhile, scenes such as regions, house states and the like are deeply subdivided according to the high sparsity of samples and the large-amount attribute of transaction, and the quality of the recommended apartment is improved.
Step 303, sorting:
for recalled house source data rough sorting information, comprehensively acquiring demand preference information of a user and house state plot business circle images, performing corresponding training on online data and offline data respectively to construct a sorting model after characteristic engineering measurement and calculation processing of a Spark distributed framework, performing weight portrayal on sample data, performing deep division on scenes such as regions, house states and the like according to the high sparsity of data information and the large-volume attribute of transaction, giving accurate sorting scoring of the house source information, and finally recommending high-quality long-renting apartment resources to the user;
step 304, intervention and adjustment:
after modeling based on the specific LBS of the long rental apartment, the system inserts some specific house sources into specific positions in a recommendation list according to customized requirements of a company such as a house-leaving inclined policy, an operation activity policy, advertisement promotion and the like, and recommends more selection spaces for users; and if the recommendation result is insufficient, the system automatically recommends the completion of the related hot spot competitive house resources. And optimizing an intervention adjustment layer, modeling based on a specific LBS mode of the long-renting apartment, and designing a characteristic recommendation engine of the long-renting apartment with large commodities with sparsely distributed plots.
It should be understood that the recall method of the present invention may be based on only one of the user requirement information and the image information, or may be based on both methods. The embodiment only shows that the recall is performed according to the user portrait, and the recall mode is not specifically required and can be set according to the actual situation.
According to the embodiments, the application has the following beneficial effects:
1. according to the method for constructing the recommendation engine according to the user requirements and the user portrait, the first recommendation data source is subjected to feature engineering processing through the Spark distributed framework to obtain the second recommendation data source, and the response time of the obtained result is favorably shortened.
2. The invention provides a method for constructing a recommendation engine according to user requirements and user portraits,
the user portrait information is determined according to the probability information of at least one user portrait to be selected, so that the accuracy and the integrity of the user portrait information can be improved.
3. The method for constructing the recommendation engine according to the user requirements and the user portrait provided by the invention has rich information acquisition channels, including user online APP record, offline watching and communication and electric marketing modes; a large amount of user information is collected, an integral sample model is generated, and the bulk object attributes such as high-quality parcel prices of high-quality long-rent apartments are covered.
4. The method for constructing the recommendation engine according to the user requirements and the user portrait optimizes the personalized recommendation mode, and extracts apartments with high recommendation matching degree from the sample model library through comprehensive measurement and calculation of the user requirements and the portrait.
5. According to the method for constructing the recommendation engine according to the user requirements and the user portrait, the recommendation sequencing model is optimized, a new matching strategy is formulated for the conditions of online and offline watching, signing and the like, the sequencing model is subjected to multilevel sample weight portrayal, and the complexity of the existing sequencing model is reduced; meanwhile, scenes such as regions, house states and the like are deeply subdivided according to the high sparsity of samples and the large-amount attribute of transaction, and the quality of the recommended apartment is improved.
6. The method for constructing the recommendation engine according to the user requirements and the user portrait optimizes an intervention adjustment layer, models based on a specific LBS (location based service) mode of the long-rented apartment, and designs the characteristic recommendation engine of the long-rented apartment with bulk commodities with sparsely distributed landmass.
While the present invention has been described in detail with reference to the drawings and examples, it is to be understood that the foregoing examples are for illustrative purposes only and are not intended to limit the scope of the present invention. Although the present invention has been described in detail with reference to the foregoing embodiments, it will be apparent to those skilled in the art that various changes may be made and equivalents may be substituted for elements thereof. Any modification, equivalent replacement, or improvement made within the spirit and principle of the present invention should be included in the protection scope of the present invention. The scope of the invention is defined by the appended claims.

Claims (7)

1. A method for constructing a recommendation engine according to user requirements and user portraits is characterized by comprising the following steps:
collecting information of house resources browsed by a user and house resource information, acquiring user information, wherein the user information comprises user portrait information, determining a browsing information set of the user to a house according to the information of the house resources browsed by the user, the browsing information set comprises a plurality of keywords, the keywords comprise preference values and consumption values, the preference values are used for identifying preference degrees of the user to the browsed house, the consumption values are used for identifying consumption receiving degrees of the user to the browsed house, the keywords are integrated to obtain probability information of at least one to-be-selected user portrait, and the user portrait information is determined according to the probability information of the at least one to-be-selected user portrait;
the user information also comprises user demand information, and the user demand information comprises the demand of a user on the room price, the distance working place or the subway station and the business circle;
the house source information is recalled according to the user demand information, and the first recall data source is obtained through filtering and formatting of a data collector;
the house source information is recalled according to the user portrait information, and a second recall data source is obtained by screening through an optical disc arranged in an offline data warehouse and LBS (location based service);
carrying out price matching and section feature matching on the first recall data source and the second recall data source according to LBS service to obtain corresponding matching degrees, and sequentially sequencing from high to low according to the matching degrees to generate a first recommended data source;
performing feature engineering processing on the first recommended data source through a Spark distributed framework, and grading to obtain a second recommended data source;
training the second recommended data source to obtain a ranking model;
inputting the house source information into the sequencing model to obtain the grading sequencing condition of the house source information;
setting a scoring threshold value, wherein the house source with the score of the house source information larger than the scoring threshold value is a first recommended house source;
meanwhile, generating a second recommended house source according to the house owner demand;
and constructing a recommendation engine according to the first recommended house source and the second recommended house source.
2. The method of claim 1, wherein the user information is collected online, offline, and telemarketing.
3. The method of claim 1, wherein the first recommended source and the second recommended source are arranged to intersect to construct the recommendation engine.
4. The method of claim 1, wherein the first recall data source is recalled by searching.
5. The method of claim 1, wherein the feature engineering process comprises sequentially aggregating, refining, screening, and complementing the first recommended data source to obtain the second recommended data source.
6. The method of claim 1, wherein said inputting said source information into said ranking model and obtaining a ranking score of said source information comprises: and carrying out weight depiction on the house source information, and carrying out deep division on the scenes such as regions, house states and the like according to the sparsity and the transaction attribute of the house source information to obtain the grading and sequencing conditions of the house source information.
7. The method of claim 1, wherein the recommendation engine further comprises a hotspot source.
CN201911411250.3A 2019-12-31 2019-12-31 Method for constructing recommendation engine according to user requirements and user portrait Withdrawn CN111143689A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201911411250.3A CN111143689A (en) 2019-12-31 2019-12-31 Method for constructing recommendation engine according to user requirements and user portrait

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201911411250.3A CN111143689A (en) 2019-12-31 2019-12-31 Method for constructing recommendation engine according to user requirements and user portrait

Publications (1)

Publication Number Publication Date
CN111143689A true CN111143689A (en) 2020-05-12

Family

ID=70522534

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201911411250.3A Withdrawn CN111143689A (en) 2019-12-31 2019-12-31 Method for constructing recommendation engine according to user requirements and user portrait

Country Status (1)

Country Link
CN (1) CN111143689A (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111708949A (en) * 2020-06-19 2020-09-25 微医云(杭州)控股有限公司 Medical resource recommendation method and device, electronic equipment and storage medium
CN111768232A (en) * 2020-06-24 2020-10-13 长春初唐网络科技有限公司 AI-based online and offline marketing tracking matching recommendation method for real estate
CN112232933A (en) * 2020-12-11 2021-01-15 深圳市房多多网络科技有限公司 House source information recommendation method, device, equipment and readable storage medium
CN112256943A (en) * 2020-10-22 2021-01-22 上海适享文化传播有限公司 Portal portrait extraction method based on combination of natural language processing and knowledge graph

Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20150032639A1 (en) * 2013-07-29 2015-01-29 Verizon Patent And Licensing Inc. System and method for providing notifications on product recalls
CN105869001A (en) * 2015-01-19 2016-08-17 苏宁云商集团股份有限公司 Customized product recommendation guiding method and system
CN107423442A (en) * 2017-08-07 2017-12-01 火烈鸟网络(广州)股份有限公司 Method and system, storage medium and computer equipment are recommended in application based on user's portrait behavioural analysis
US20180253780A1 (en) * 2017-05-05 2018-09-06 James Wang Smart matching for real estate transactions
CN109190024A (en) * 2018-08-20 2019-01-11 平安科技(深圳)有限公司 Information recommendation method, device, computer equipment and storage medium
CN109377329A (en) * 2018-12-25 2019-02-22 北京时光荏苒科技有限公司 A kind of source of houses recommended method, device, storage medium and electronic equipment
CN109615432A (en) * 2018-12-14 2019-04-12 成都德迈安科技有限公司 Consumer behaviour portrait tool based on big data
CN109658192A (en) * 2018-12-20 2019-04-19 重庆锐云科技有限公司 A kind of source of houses recommended method and server
CN109815386A (en) * 2018-12-21 2019-05-28 厦门市美亚柏科信息股份有限公司 A kind of construction method, device and storage medium based on user's portrait

Patent Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20150032639A1 (en) * 2013-07-29 2015-01-29 Verizon Patent And Licensing Inc. System and method for providing notifications on product recalls
CN105869001A (en) * 2015-01-19 2016-08-17 苏宁云商集团股份有限公司 Customized product recommendation guiding method and system
US20180253780A1 (en) * 2017-05-05 2018-09-06 James Wang Smart matching for real estate transactions
CN107423442A (en) * 2017-08-07 2017-12-01 火烈鸟网络(广州)股份有限公司 Method and system, storage medium and computer equipment are recommended in application based on user's portrait behavioural analysis
CN109190024A (en) * 2018-08-20 2019-01-11 平安科技(深圳)有限公司 Information recommendation method, device, computer equipment and storage medium
CN109615432A (en) * 2018-12-14 2019-04-12 成都德迈安科技有限公司 Consumer behaviour portrait tool based on big data
CN109658192A (en) * 2018-12-20 2019-04-19 重庆锐云科技有限公司 A kind of source of houses recommended method and server
CN109815386A (en) * 2018-12-21 2019-05-28 厦门市美亚柏科信息股份有限公司 A kind of construction method, device and storage medium based on user's portrait
CN109377329A (en) * 2018-12-25 2019-02-22 北京时光荏苒科技有限公司 A kind of source of houses recommended method, device, storage medium and electronic equipment

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111708949A (en) * 2020-06-19 2020-09-25 微医云(杭州)控股有限公司 Medical resource recommendation method and device, electronic equipment and storage medium
CN111708949B (en) * 2020-06-19 2023-07-25 微医云(杭州)控股有限公司 Medical resource recommendation method and device, electronic equipment and storage medium
CN111768232A (en) * 2020-06-24 2020-10-13 长春初唐网络科技有限公司 AI-based online and offline marketing tracking matching recommendation method for real estate
CN112256943A (en) * 2020-10-22 2021-01-22 上海适享文化传播有限公司 Portal portrait extraction method based on combination of natural language processing and knowledge graph
CN112256943B (en) * 2020-10-22 2024-01-23 上海适享文化传播有限公司 Portal store image extraction method based on natural language processing combined with knowledge graph
CN112232933A (en) * 2020-12-11 2021-01-15 深圳市房多多网络科技有限公司 House source information recommendation method, device, equipment and readable storage medium

Similar Documents

Publication Publication Date Title
CN110019396B (en) Data analysis system and method based on distributed multidimensional analysis
CN109241405B (en) Learning resource collaborative filtering recommendation method and system based on knowledge association
US10572565B2 (en) User behavior models based on source domain
CN111143689A (en) Method for constructing recommendation engine according to user requirements and user portrait
US9043302B1 (en) Campaign and competitive analysis and data visualization based on search interest data
CN107862022B (en) Culture resource recommendation system
CN111159561A (en) Method for constructing recommendation engine according to user behaviors and user portrait
CN108829652B (en) Picture labeling system based on crowdsourcing
CN110532351B (en) Recommendation word display method, device and equipment and computer readable storage medium
US7783636B2 (en) Personalized information retrieval search with backoff
CN111127105A (en) User hierarchical model construction method and system, and operation analysis method and system
CN111159559A (en) Method for constructing recommendation engine according to user requirements and user behaviors
CN110968801A (en) Real estate product searching method, storage medium and electronic device
CN112000889A (en) Information gathering and presenting system
CN111429161B (en) Feature extraction method, feature extraction device, storage medium and electronic equipment
WO2017143703A1 (en) Offline resource mining method and device
CN112669113A (en) Product recommendation method and device, storage medium and electronic device
KR100836877B1 (en) System and Method For Deduction About Future Signal And Issue Using R&D Environmental Information
CN116244513A (en) Random group POI recommendation method, system, equipment and storage medium
CN108304570B (en) Processing method and display method of search results, server and client
CN112765374A (en) Education resource screening system and method for information push
CN115408618B (en) Point-of-interest recommendation method based on social relation fusion position dynamic popularity and geographic features
TWI684147B (en) Cloud self-service analysis platform and analysis method thereof
CN116049543A (en) Comprehensive energy efficiency service business mixed recommendation method, system and storage medium
CN113971213A (en) Smart city management public information sharing system

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
WW01 Invention patent application withdrawn after publication
WW01 Invention patent application withdrawn after publication

Application publication date: 20200512