CN111143689A - Method for constructing recommendation engine according to user requirements and user portrait - Google Patents
Method for constructing recommendation engine according to user requirements and user portrait
- Publication number
- CN111143689A (application number CN201911411250.3A)
- Authority
- CN
- China
- Prior art keywords
- user
- information
- house
- source
- recommended
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Withdrawn
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/953—Querying, e.g. by the use of web search engines
- G06F16/9535—Search customisation based on user profiles and personalisation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/953—Querying, e.g. by the use of web search engines
- G06F16/9537—Spatial or temporal dependent retrieval, e.g. spatiotemporal queries
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q30/00—Commerce
- G06Q30/02—Marketing; Price estimation or determination; Fundraising
- G06Q30/0241—Advertisements
- G06Q30/0251—Targeted advertisements
- G06Q30/0255—Targeted advertisements based on user history
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q30/00—Commerce
- G06Q30/02—Marketing; Price estimation or determination; Fundraising
- G06Q30/0241—Advertisements
- G06Q30/0251—Targeted advertisements
- G06Q30/0255—Targeted advertisements based on user history
- G06Q30/0256—User search
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q30/00—Commerce
- G06Q30/02—Marketing; Price estimation or determination; Fundraising
- G06Q30/0241—Advertisements
- G06Q30/0251—Targeted advertisements
- G06Q30/0269—Targeted advertisements based on user profile or attribute
- G06Q30/0271—Personalized advertisement
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q50/00—Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
- G06Q50/10—Services
- G06Q50/16—Real estate
Landscapes
- Engineering & Computer Science (AREA)
- Business, Economics & Management (AREA)
- Theoretical Computer Science (AREA)
- Strategic Management (AREA)
- Development Economics (AREA)
- Finance (AREA)
- Databases & Information Systems (AREA)
- Accounting & Taxation (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Economics (AREA)
- Marketing (AREA)
- General Business, Economics & Management (AREA)
- Game Theory and Decision Science (AREA)
- Entrepreneurship & Innovation (AREA)
- Tourism & Hospitality (AREA)
- Data Mining & Analysis (AREA)
- General Engineering & Computer Science (AREA)
- Health & Medical Sciences (AREA)
- General Health & Medical Sciences (AREA)
- Human Resources & Organizations (AREA)
- Primary Health Care (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Abstract
The application discloses a method for constructing a recommendation engine according to user requirements and user portraits, relating to the technical field of data statistics. A specific implementation of the method comprises: collecting information on the house sources browsed by a user together with house source information, acquiring user information, and determining user portrait information from keywords; the user information further comprises user demand information, namely the user's requirements regarding house price, distance to the workplace or a subway station, and business circle; obtaining a first recall data source and a second recall data source, from which a first recommended data source is obtained; performing feature engineering processing through the Spark distributed framework to obtain a second recommended data source, and training on it to obtain a ranking model; inputting the house source information into the ranking model to obtain a first recommended house source; meanwhile, generating a second recommended house source according to homeowner demand; and constructing the recommendation engine. Determining the user portrait information from keywords can improve the accuracy and completeness of the user portrait information and helps improve the accuracy of the ranking model.
Description
Technical Field
The application relates to the technical field of data statistics, in particular to a method for constructing a recommendation engine according to user requirements and user portraits.
Background
With the development of the internet, users' interests have become increasingly diverse, and users' demands change as their environment and living standards change. At present, the recall and ranking methods of LBS (Location Based Services) recommendation engines have the following problems when applied to the long-term rental apartment field:
1. Long-term rental apartment operators currently collect user information mostly from users' online records only; with a single channel, user information is missed and the sample information base is incomplete;
2. Across the industry, recall is generally based on inverted indexes and does not take into account the attributes of big-ticket items, such as the premium land prices of high-quality apartments;
3. The coarse-ranking stage of general recommendation systems is not optimized for scenarios with high-value, sparse targets. The long-term rental apartment scenario requires multi-layer coarse ranking: an LBS and value matching strategy is needed to match the customer's strong intentions against the key attributes of the target item, and a further matching strategy is needed to match the user's secondary requirements against the secondary house status and district features of the house;
4. In the ranking layer, the industry generally performs pair-wise or list-wise learning-to-rank model ranking based on users' online behavior. For long-term rental apartments, however, conditions such as user feedback also need to be considered, which makes the ranking model more complex; multi-level sample weighting needs to be applied to the ranking model, and scenarios such as region and house status need to be finely subdivided in view of the high sparsity of samples and the big-ticket nature of transactions, so as to improve recommendation quality;
5. In the intervention adjustment layer, the industry generally performs precise advertisement insertion based on bid ranking and does not model the LBS pattern specific to long-term rental apartments, so the recommendation effect is not ideal.
Disclosure of Invention
In view of the above, the present application discloses a method for constructing a recommendation engine according to user requirements and a user portrait, comprising the steps of:
collecting information on the house sources browsed by a user together with house source information, and acquiring user information, wherein the user information comprises user portrait information; determining a browsing information set of the user for houses according to the information on the house sources browsed by the user, wherein the browsing information set comprises a plurality of keywords, each keyword comprises a preference value and a consumption value, the preference value identifies the user's degree of preference for a browsed house, and the consumption value identifies the user's degree of acceptance of the cost of a browsed house; integrating the keywords to obtain probability information for at least one candidate user portrait, and determining the user portrait information according to that probability information;
the user information further comprises user demand information, and the user demand information comprises the user's requirements regarding house price, distance to the workplace or a subway station, and business circle;
recalling the house source information according to the user demand information, and filtering and formatting it through a data collector to obtain a first recall data source;
recalling the house source information according to the user portrait information, and screening it through an optical disc arranged in an offline data warehouse and through LBS (location-based services) to obtain a second recall data source;
performing price matching and district feature matching on the first recall data source and the second recall data source according to the LBS service to obtain corresponding matching degrees, and sorting them from high to low by matching degree to generate a first recommended data source;
performing feature engineering processing on the first recommended data source through the Spark distributed framework, and scoring the result to obtain a second recommended data source;
training on the second recommended data source to obtain a ranking model;
inputting the house source information into the ranking model to obtain the score ranking of the house source information;
setting a score threshold, wherein a house source whose score is greater than the score threshold is a first recommended house source;
meanwhile, generating a second recommended house source according to homeowner demand;
and constructing a recommendation engine according to the first recommended house source and the second recommended house source.
Preferably, the user information is collected online, offline, and by telemarketing.
Preferably, the first recommended house source and the second recommended house source are arranged in a crossed manner to construct the recommendation engine.
Preferably, the first recall data source is recalled by way of search.
Preferably, the feature engineering processing includes sequentially performing aggregation, refining, screening, and completion on the first recommended data source to obtain the second recommended data source.
Preferably, inputting the house source information into the ranking model to obtain the score ranking of the house source information includes: weighting the house source information, and finely subdividing scenarios such as region and house status according to the sparsity and transaction attributes of the house source information, to obtain the score ranking of the house source information.
Preferably, the recommendation engine further comprises hot house sources.
Compared with the prior art, the method for constructing a recommendation engine according to user requirements and a user portrait provided by the invention has the following beneficial effects:
1. The method performs feature engineering processing on the first recommended data source through the Spark distributed framework to obtain the second recommended data source, which helps shorten the response time for obtaining results.
2. The method determines the user portrait information according to the probability information of at least one candidate user portrait, which can improve the accuracy and completeness of the user portrait information.
3. The method enriches the information collection channels, which include users' online APP records, offline viewings and communication, and telemarketing; a large amount of user information is collected and an overall sample model is generated, covering big-ticket item attributes such as the premium land prices of high-quality long-term rental apartments.
4. The method optimizes the personalized recommendation mode: apartments with a high recommendation matching degree are extracted from the sample model library through a comprehensive evaluation of the user's requirements and portrait.
5. The method optimizes the recommendation ranking model: a new matching strategy is formulated for conditions such as online and offline viewings and contract signing, and multi-level sample weighting is applied to the ranking model, reducing the complexity of the existing ranking model; meanwhile, scenarios such as region and house status are finely subdivided according to the high sparsity of samples and the big-ticket nature of transactions, improving the quality of the recommended apartments.
6. The method optimizes the intervention adjustment layer: it models the LBS (location-based services) pattern specific to long-term rental apartments and designs a recommendation engine tailored to long-term rental apartments, which are big-ticket goods sparsely distributed across districts.
The technical solution of the present invention is further described in detail by the accompanying drawings and embodiments.
Drawings
The accompanying drawings, which are included to provide a further understanding of the application and are incorporated in and constitute a part of this application, illustrate embodiment(s) of the application and together with the description serve to explain the application and not to limit the application. In the drawings:
FIG. 1 is a flow diagram of a method of constructing a recommendation engine based on user needs and a user profile in accordance with the present invention;
FIG. 2 is a flowchart of another method for constructing a recommendation engine based on user requirements and user profiles in accordance with the present invention.
Detailed Description
The technical solution in the embodiments of the present invention will be clearly and completely described below with reference to the accompanying drawings in the embodiments of the present invention. It should be noted that the described embodiments are merely some embodiments, rather than all embodiments, of the invention and are merely illustrative in nature and in no way intended to limit the invention, its application, or uses. The protection scope of the present application shall be subject to the definitions of the appended claims.
Example 1:
FIG. 1 is a flowchart of a method for constructing a recommendation engine according to user requirements and a user portrait according to the present invention. As shown in FIG. 1, the method provided in this embodiment includes the steps of:
Step 101, collecting information on the house sources browsed by a user together with house source information, and acquiring user information, wherein the user information comprises user portrait information; determining a browsing information set of the user for houses according to the information on the house sources browsed by the user, wherein the browsing information set comprises a plurality of keywords, each keyword comprises a preference value and a consumption value, the preference value identifies the user's degree of preference for a browsed house, and the consumption value identifies the user's degree of acceptance of the cost of a browsed house; integrating the keywords to obtain probability information for at least one candidate user portrait, and determining the user portrait information according to that probability information. The user information further comprises user demand information, and the user demand information comprises the user's requirements regarding house price, distance to the workplace or a subway station, and business circle.
It can be understood that the probability information of the at least one candidate user portrait is obtained by integrating the keywords. The probability information can be obtained using a neural network model that has been trained in advance on a preset data set, so that the model is capable of generating user portraits. Determining the user portrait information in this way can improve the accuracy and completeness of the information.
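As an illustration of how keyword preference and consumption values might be integrated into portrait probabilities, the following is a minimal sketch only. It swaps the neural network model mentioned above for a simple weighted sum followed by a softmax, and the keyword-to-portrait affinity table, the portrait names, and the product rule for combining the two values are all assumptions made for this example, not part of the application.

```python
import math
from collections import defaultdict

# Hypothetical affinity of each browsing keyword to each candidate portrait.
KEYWORD_AFFINITY = {
    "subway": {"commuter": 1.0, "student": 0.3},
    "studio": {"commuter": 0.5, "student": 0.8},
    "two-bedroom": {"family": 1.0},
}


def portrait_probabilities(keywords):
    """Turn (keyword, preference_value, consumption_value) triples into a
    probability for each candidate user portrait."""
    scores = defaultdict(float)
    for word, preference, consumption in keywords:
        for portrait, affinity in KEYWORD_AFFINITY.get(word, {}).items():
            # Assumed rule: a keyword counts more when the user both likes
            # the house and accepts its cost.
            scores[portrait] += affinity * preference * consumption
    if not scores:
        return {}
    # Softmax, shifted by the maximum score for numerical stability.
    peak = max(scores.values())
    exps = {p: math.exp(s - peak) for p, s in scores.items()}
    total = sum(exps.values())
    return {p: e / total for p, e in exps.items()}


def select_portrait(probabilities):
    """Pick the candidate portrait with the highest probability."""
    if not probabilities:
        return None
    return max(probabilities, key=probabilities.get)
```

Calling `portrait_probabilities` with a user's keyword triples and passing the result to `select_portrait` yields the portrait that would then drive the portrait-based recall.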
In step 101, user information may be collected online, offline, and by telemarketing. The first mode is online collection, in which user intention information is obtained from the user's browsing and search records on web pages and the APP. The second is offline collection, in which the user's intention information is obtained by dedicated staff who accompany viewings, handle contract signing, and communicate with the user. The third is telemarketing, in which the user's intention is determined by communicating with the user by telephone. This enriches the information collection channels: a large amount of user information is collected and an overall sample model is generated, covering big-ticket item attributes such as the premium land prices of high-quality long-term rental apartments.
Step 102, recalling the house source information according to the user demand information, and filtering and formatting it through the data collector Logstash to obtain a first recall data source;
the house source information is also recalled according to the user portrait information, and a second recall data source is obtained by screening through an optical disc UDF (Universal Disc Format) arranged in the Hive offline data warehouse and through LBS (location-based services).
In step 102, optionally, user requirement feedback is collected through user input or inquiry, and recall is performed by way of search. The technology can adopt the ELK framework together with MySQL queries to ensure real-time index building and querying and the scalability of the recommendation engine. The ELK framework (Elasticsearch, Logstash, and Kibana) is a set of solutions for real-time data collection, storage, indexing, retrieval, statistical analysis, and visualization. MySQL is a relational database management system.
In step 102, optionally, data is recalled by workplace, place of residence, and points of interest. This recall path has the lowest priority, but it serves as a cold-start solution when too little user demand information has been collected. UDF is the English abbreviation of Universal Disc Format, a universal disc file system established by the International Organization for Standardization in 1996, which uses standard Packet Writing (PW) technology to simplify the use of a recorder.
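The demand-based recall in step 102 can be pictured as a filter over house sources by price, distance, and business circle. The sketch below is a plain-Python stand-in, not the Logstash or Elasticsearch pipeline itself; the `Listing` fields, the parameter names, and the use of the haversine great-circle distance are assumptions for illustration.

```python
import math
from dataclasses import dataclass


@dataclass
class Listing:
    listing_id: str
    monthly_rent: float
    lat: float
    lon: float
    business_circle: str


def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance between two points in kilometres."""
    radius_km = 6371.0
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlam = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(phi1) * math.cos(phi2) * math.sin(dlam / 2) ** 2
    return 2 * radius_km * math.asin(math.sqrt(a))


def recall_by_demand(listings, max_rent, anchor, max_km, business_circle=None):
    """Keep listings within budget and within max_km of the anchor point
    (a workplace or subway station), optionally in one business circle."""
    anchor_lat, anchor_lon = anchor
    recalled = []
    for item in listings:
        if item.monthly_rent > max_rent:
            continue
        if haversine_km(anchor_lat, anchor_lon, item.lat, item.lon) > max_km:
            continue
        if business_circle is not None and item.business_circle != business_circle:
            continue
        recalled.append(item)
    return recalled
```

In a deployment along the lines described above, the same constraints would more likely be expressed as a range filter and a geo-distance filter in the search index; the function only makes the selection logic explicit.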
Step 103, performing price matching and district feature matching on the first recall data source and the second recall data source according to the LBS service to obtain corresponding matching degrees, and sorting them from high to low by matching degree to generate a first recommended data source, wherein the first recommended data source comprises an online data source and an offline data source;
In step 103, the first recall data source and the second recall data source are sorted according to the LBS price matching strategy and the district feature matching strategy to generate the first recommended data source.
Further, an LBS and value matching strategy is applied to match the customer's strong intentions against the key attributes of the target item, and a further matching strategy is applied to match the user's secondary requirements against the secondary house status and district features of the house, yielding a coarse ranking result. The personalized recommendation mode is optimized: the user's requirements and portrait are evaluated comprehensively, and high-quality apartments with a high recommendation matching degree are extracted from the sample model library. Meanwhile, the recommendation ranking model can be optimized: a new matching strategy is formulated for conditions such as online and offline viewings and contract signing, and multi-level sample weighting is applied to the ranking model, reducing the complexity of the existing ranking model; scenarios such as region and house status are also finely subdivided according to the high sparsity of samples and the big-ticket nature of transactions, improving the quality of the recommended apartments.
In the above steps, it can be understood that the house source information is recalled on the basis of the user demand information, the user behavior information, and the user portrait information to generate, correspondingly, a first recall data source, a second recall data source, and a third recall data source, which are processed to generate the first recommended data source. In this way, relevant information can be recalled from the massive body of information and coarsely ranked before the subsequent fine ranking, which reduces the workload and helps shorten the response time for obtaining results.
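The coarse ranking of step 103 can be sketched as follows. The application does not give formulas for the matching degrees, so the price-match rule, the feature-overlap rule, the 0.6/0.4 weighting, and the dictionary keys (`id`, `rent`, `features`) are all hypothetical choices made for this example.

```python
def price_match(rent, budget):
    """1.0 when the rent is within budget, decaying linearly to 0.0 as the
    rent approaches twice the budget (assumed rule)."""
    if budget <= 0:
        return 0.0
    if rent <= budget:
        return 1.0
    return max(0.0, 1.0 - (rent - budget) / budget)


def section_match(wanted, offered):
    """Share of the wanted district features that the house source offers."""
    wanted, offered = set(wanted), set(offered)
    if not wanted:
        return 1.0
    return len(wanted & offered) / len(wanted)


def coarse_rank(first_recall, second_recall, budget, wanted, price_weight=0.6):
    """Merge both recall data sources, score each house source, and sort by
    matching degree from high to low."""
    merged = {}
    for item in list(first_recall) + list(second_recall):
        merged.setdefault(item["id"], item)  # drop duplicates across sources
    scored = []
    for item in merged.values():
        degree = (price_weight * price_match(item["rent"], budget)
                  + (1.0 - price_weight) * section_match(wanted, item["features"]))
        scored.append((degree, item["id"]))
    # Highest matching degree first; ties broken by id for determinism.
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return scored
```

The returned list of `(matching_degree, id)` pairs plays the role of the first recommended data source that is handed on to feature engineering.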
Step 104, performing feature engineering processing on the first recommended data source through the Spark distributed framework, and scoring the result to obtain a second recommended data source;
It can be understood that the first recommended data source includes an online data source and an offline data source, and that the Spark distributed framework comprises an offline calculation module and an online calculation module. The online data source of the first recommended data source is processed by the online calculation module of the Spark distributed framework, and the offline data source is processed by the offline calculation module, so that the response time for obtaining results can be shortened.
Step 105, training on the second recommended data source to obtain a ranking model;
Step 106, inputting the house source information into the ranking model to obtain the score ranking of the house source information;
In step 106, the house source information is further weighted, and scenarios such as region and house status are finely subdivided according to the sparsity and transaction attributes of the house source information, to obtain the ranking score of the house source information.
Step 107, setting a score threshold value, wherein the house source with the score of the house source information larger than the score threshold value is a first recommended house source;
In steps 104-107, it can be understood that the feature engineering processing includes aggregating, refining, screening, and completing the first recommended data source to obtain the second recommended data source. After the first recommended data source has been aggregated and processed through feature engineering on the Spark framework, the corresponding ranking model is trained; the sample data is weighted, scenarios such as region and house status are finely subdivided according to the high sparsity of the data and the big-ticket nature of transactions, an accurate ranking score is given for the house source information, and high-quality long-term rental apartment resources are finally recommended to the user.
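The four feature engineering stages named above (aggregation, refining, screening, completion) and the score threshold of step 107 can be sketched in plain Python. This is a single-machine stand-in for what the application runs on Spark; the event fields, the derived `rent_per_m2` feature, the minimum view count, and the median fill are assumptions for illustration.

```python
from statistics import median


def aggregate(events):
    """Aggregation: collapse raw view events into one row per house source."""
    rows = {}
    for event in events:
        row = rows.setdefault(
            event["id"], {"id": event["id"], "views": 0, "rent": None, "area": None})
        row["views"] += 1
        for key in ("rent", "area"):
            if event.get(key) is not None:
                row[key] = event[key]
    return list(rows.values())


def refine(rows):
    """Refining: derive rent per square metre where both inputs are known."""
    for row in rows:
        if row["rent"] is not None and row["area"]:
            row["rent_per_m2"] = row["rent"] / row["area"]
        else:
            row["rent_per_m2"] = None
    return rows


def screen(rows, min_views=2):
    """Screening: drop house sources with too few views to be informative."""
    return [row for row in rows if row["views"] >= min_views]


def complete(rows):
    """Completion: fill missing derived features with the median of known values."""
    known = [row["rent_per_m2"] for row in rows if row["rent_per_m2"] is not None]
    fill = median(known) if known else 0.0
    for row in rows:
        if row["rent_per_m2"] is None:
            row["rent_per_m2"] = fill
    return rows


def select_first_recommended(scores, threshold):
    """Step 107: keep house sources whose score is greater than the threshold,
    ordered from the highest score down."""
    ordered = sorted(scores.items(), key=lambda kv: -kv[1])
    return [listing_id for listing_id, score in ordered if score > threshold]
```

The stages are chained in the order the description gives, `complete(screen(refine(aggregate(events))))`; the scores passed to `select_first_recommended` would come from the trained ranking model, which is outside the scope of this sketch.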
Step 108, generating a second recommended house source according to homeowner demand;
Step 109, constructing a recommendation engine according to the first recommended house source and the second recommended house source. Optionally, the recommendation engine further includes hot house sources. A hot house source may be a house source with a rental rate of more than 90%; preferably, it is a house source with a rental rate of more than 90% located near a subway station, a business district, or the city center.
In steps 108-109, it can be understood that, after modeling based on the LBS pattern specific to long-term rental apartments, the system inserts certain house sources at specific positions in the recommendation list according to the company's customized requirements, such as a house-letting preference policy, operational campaign policies, and advertisement promotion, giving the user more options. If the recommendation results are insufficient, the system automatically fills the list with related hot, high-quality house sources. The intervention adjustment layer is thus optimized: the LBS pattern specific to long-term rental apartments is modeled, and a recommendation engine is designed that is tailored to long-term rental apartments, which are big-ticket goods sparsely distributed across districts.
The first recommended house source and the second recommended house source are arranged in a crossed manner to construct the recommendation engine. When a user searches for house sources with the recommendation engine, mixed data drawn from both the first and the second recommended house sources is returned, rather than data from only one of them, which both meets user demand and improves the revenue of the long-term rental apartments.
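One simple reading of the crossed arrangement is an alternating interleave of the two lists. The sketch below assumes each house source is represented by a unique identifier and that duplicates appearing in both lists should be shown only once; neither detail is specified in the application.

```python
from itertools import zip_longest


def cross_arrange(first_recommended, second_recommended):
    """Alternate between the two recommended lists, keeping their internal
    order and skipping any house source that has already been placed."""
    seen = set()
    mixed = []
    for pair in zip_longest(first_recommended, second_recommended):
        for listing_id in pair:
            # zip_longest pads the shorter list with None.
            if listing_id is not None and listing_id not in seen:
                seen.add(listing_id)
                mixed.append(listing_id)
    return mixed
```

When one list is longer than the other, its remaining entries simply follow once the shorter list is exhausted.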
Example 2:
FIG. 2 is a flowchart illustrating a method for building a recommendation engine according to user requirements and user profiles according to the present invention, and as shown in FIG. 2, the present embodiment provides a method for building a recommendation engine according to user requirements and user profiles, which includes the steps of:
step 201, user information acquisition:
Information is collected in three main ways. The first is online collection, in which user intention information is obtained from the user's browsing and search records on web pages and the APP. The second is offline collection, in which the user's intention information is obtained by dedicated staff who accompany viewings, handle contract signing, and communicate with the user. The third is telemarketing, in which the user's intention is determined by communicating with the user by telephone. A sample model is built from the information collected through all channels and stored in a database for easy retrieval. The information collection channels are thereby enriched to include users' online APP records, offline viewings and communication, and telemarketing; a large amount of user information is collected and an overall sample model is generated, covering big-ticket item attributes such as the premium land prices of high-quality long-term rental apartments.
Step 202, recall:
After a user searches for long-term rental apartments near 'XX prefecture', the user requirements are first obtained from the sample database, and recall is performed according to those requirements: the system selects the data of long-term rental apartments near 'XX prefecture' from the MySQL house source library, sends the data to the data collector Logstash for filtering and formatting, and then stores it in the ES database.
Further, the collected data is aggregated and refined, and multi-layer coarse ranking is then performed for the long-term rental apartment scenario. LBS and value matching strategies match the user's strong intentions against the key attributes of the target house source, and further matching strategies match the user's secondary requirements against the secondary attributes of the house, such as house state and plot characteristics, yielding a coarse ranking result. A new matching strategy is formulated for online and offline viewing, contract signing, and similar events, and multi-level sample weights are assigned in the ranking model, which reduces the complexity of the existing ranking model. Meanwhile, scenarios such as region and house state are finely subdivided in view of the high sparsity of samples and the high-value nature of the transactions, improving the quality of the recommended apartments.
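The coarse ranking by LBS and value matching can be sketched as follows. This is a minimal illustration, not the patent's formula: the haversine distance, the linear decay of each score, the 0.6/0.4 weighting, the 5 km radius, and all field names (`lat`, `lon`, `rent`, `budget`) are assumptions introduced here.

```python
import math


def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance in kilometres between two latitude/longitude points."""
    r = 6371.0  # mean Earth radius in km
    p1, p2 = math.radians(lat1), math.radians(lat2)
    dphi = p2 - p1
    dlmb = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(p1) * math.cos(p2) * math.sin(dlmb / 2) ** 2
    return 2 * r * math.asin(math.sqrt(a))


def matching_degree(user, house, max_km=5.0):
    """Combine an LBS score and a price score into one matching degree in [0, 1]."""
    dist = haversine_km(user["lat"], user["lon"], house["lat"], house["lon"])
    lbs_score = max(0.0, 1.0 - dist / max_km)          # 1 at the search point, 0 beyond max_km
    over = max(0.0, house["rent"] - user["budget"])     # amount above the user's budget
    price_score = max(0.0, 1.0 - over / user["budget"])
    return 0.6 * lbs_score + 0.4 * price_score          # hypothetical weights


def coarse_rank(user, houses):
    """Sort recalled house sources from highest to lowest matching degree."""
    return sorted(houses, key=lambda h: matching_degree(user, h), reverse=True)


user = {"lat": 31.23, "lon": 121.47, "budget": 5000}
houses = [
    {"id": "far_cheap", "lat": 31.30, "lon": 121.47, "rent": 4500},
    {"id": "near_expensive", "lat": 31.23, "lon": 121.47, "rent": 7500},
    {"id": "near_cheap", "lat": 31.23, "lon": 121.47, "rent": 4500},
]
print([h["id"] for h in coarse_rank(user, houses)])
```

Secondary requirements (house state, plot characteristics) could be added as further weighted terms in `matching_degree` in the same way.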
Step 203, sorting:
For the coarsely ranked recalled house source data, the user's demand preference information and the portraits of house state, plot, and business circle are comprehensively acquired and processed by feature engineering on the Spark distributed framework. Online data and offline data are then each used for training to construct a ranking model; the sample data is weighted, and scenarios such as region and house state are finely divided according to the high sparsity of the data and the high-value nature of the transactions. The model gives an accurate ranking score for the house source information, and high-quality long-term rental apartment resources are finally recommended to the user.
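One way to picture the weighting of sample data during training is a weighted logistic regression, sketched below in plain Python. The patent does not name a model type, so the choice of logistic regression, the weight table `SAMPLE_WEIGHTS`, the two features (an LBS match and a price match), and the toy samples are all assumptions made for illustration; the Spark feature engineering step is not shown.

```python
import math

# Hypothetical weights: behaviour that signals stronger intent counts more.
SAMPLE_WEIGHTS = {"online": 1.0, "offline_viewing": 3.0, "signing": 10.0}


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def train(samples, epochs=500, lr=0.1):
    """Fit a weighted logistic regression by batch gradient descent.

    samples: list of (features, label, behaviour) tuples, where label is
    1 for a positive interaction and 0 for an ignored recommendation.
    """
    n = len(samples[0][0])
    w, b = [0.0] * n, 0.0
    total = sum(SAMPLE_WEIGHTS[kind] for _, _, kind in samples)
    for _ in range(epochs):
        gw, gb = [0.0] * n, 0.0
        for x, y, kind in samples:
            p = sigmoid(sum(wi * xi for wi, xi in zip(w, x)) + b)
            err = SAMPLE_WEIGHTS[kind] * (p - y)  # sample weight scales the gradient
            for i in range(n):
                gw[i] += err * x[i]
            gb += err
        w = [wi - lr * g / total for wi, g in zip(w, gw)]
        b -= lr * gb / total
    return w, b


def score(model, x):
    """Ranking score in (0, 1) for one house source's feature vector."""
    w, b = model
    return sigmoid(sum(wi * xi for wi, xi in zip(w, x)) + b)


samples = [
    ([0.9, 0.8], 1, "signing"),
    ([0.8, 0.9], 1, "offline_viewing"),
    ([0.7, 0.3], 1, "online"),
    ([0.2, 0.4], 0, "online"),
    ([0.1, 0.2], 0, "online"),
    ([0.3, 0.1], 0, "online"),
]
model = train(samples)
print(score(model, [0.9, 0.9]) > score(model, [0.1, 0.1]))
```

Training separate models on online and offline data, as the text describes, would amount to calling `train` once per data set.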
Step 204, intervention and adjustment:
After modeling based on the LBS characteristics specific to long-term rental apartments, the system inserts certain house sources at specific positions in the recommendation list according to the company's customized requirements, such as a policy favoring the renting out of particular houses, operational campaign policies, and advertising promotion, thereby offering users more choices. If the recommendation results are insufficient, the system automatically fills the list with related popular, premium house sources. The intervention and adjustment layer is thereby optimized: modeling is based on the LBS pattern specific to long-term rental apartments, yielding a recommendation engine tailored to long-term rental apartments as bulk commodities whose plots are sparsely distributed.
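The intervention step, inserting specified house sources at fixed positions and padding a short list with popular ones, might look like the sketch below. The function `intervene`, its parameters, and the identifiers are hypothetical; the patent does not describe how positions are chosen.

```python
def intervene(ranked, pinned, hot, size):
    """Apply business interventions to a ranked recommendation list.

    ranked: house source IDs ordered by the ranking model
    pinned: dict mapping list position (0-based) to a house source ID to force there
    hot:    popular house source IDs used to pad the list when it is too short
    size:   desired length of the final list
    """
    pinned_ids = set(pinned.values())
    # Drop pinned items from their model-assigned slots to avoid duplicates.
    result = [h for h in ranked if h not in pinned_ids]
    # Insert in ascending position order so earlier inserts are not shifted.
    for pos in sorted(pinned):
        result.insert(min(pos, len(result)), pinned[pos])
    # Pad with hot house sources if the recommendation result is insufficient.
    for h in hot:
        if len(result) >= size:
            break
        if h not in result:
            result.append(h)
    return result[:size]


print(intervene(["h1", "h2", "h3"], {1: "ad1"}, ["hot1", "h2", "hot2", "hot3"], 6))
```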
It should be understood that the recall in the present invention may be based on only one of the user requirement information and the user portrait information, or on both. This embodiment shows only recall according to user requirements; it places no specific restriction on the recall mode, which can be set according to actual conditions.
Example 3:
With continued reference to FIG. 2, which is a flowchart of the method for constructing a recommendation engine according to user requirements and a user portrait according to the present invention, the method provided by this embodiment includes the following steps:
Step 301, user information acquisition:
Information is collected mainly in three ways. The first is online collection, in which user intention information is obtained from the user's browsing and search records on the web page and the APP. The second is offline collection, in which user intention information is obtained through house viewings accompanied by dedicated staff, contract signing, and communication. The third is telemarketing, in which the user's intention is determined through telephone communication with the user. The information acquired through all channels is built into a sample model and stored in a database for easy retrieval. The collection channels are thus enriched to include the user's online APP records, offline accompanied viewings and communication, and telemarketing. A large amount of user information is collected to generate an overall sample model that covers the high-value-item attributes of quality long-term rental apartments, such as prime plot and price.
Step 302, recall:
After a user searches for long-term rental apartments near 'XX house', the user portrait information is first obtained from the sample database, and recall is performed according to the user portrait: relevant information on long-term rental apartments near that location is mined through the interest business circles in Hive, screened by content and by LBS, and stored in the Redis recall-list index library.
The collected data is aggregated and refined, and multi-layer coarse ranking is then performed for the long-term rental apartment scenario. LBS and value matching strategies match the user's strong intentions against the key attributes of the target house source, and further matching strategies match the user's secondary requirements against the secondary attributes of the house, such as house state and plot characteristics, yielding a coarse ranking result. A new matching strategy is formulated for online and offline viewing, contract signing, and similar events, and multi-level sample weights are assigned in the ranking model, which reduces the complexity of the existing ranking model. Meanwhile, scenarios such as region and house state are finely subdivided in view of the high sparsity of samples and the high-value nature of the transactions, improving the quality of the recommended apartments.
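Portrait-based recall with content and LBS screening can be sketched as below. Everything concrete here is an assumption: the portrait fields (`interest_circles`, `typical_rent`), the 1.2 rent tolerance, and the key format are invented for illustration, and a plain dictionary stands in for the Redis recall-list index library so that the sketch runs without external services. The Hive mining of interest business circles is not shown.

```python
from collections import defaultdict


def recall_by_portrait(portrait, houses, max_rent_ratio=1.2):
    """Return IDs of house sources that pass content and LBS screening.

    LBS screening: the house lies in one of the user's interest business circles.
    Content screening: the rent is within a tolerance of the user's typical rent.
    """
    hits = []
    for h in houses:
        in_circle = h["circle"] in portrait["interest_circles"]
        affordable = h["rent"] <= portrait["typical_rent"] * max_rent_ratio
        if in_circle and affordable:
            hits.append(h["id"])
    return hits


# Stand-in for the Redis recall-list index: key -> list of house source IDs.
recall_index = defaultdict(list)


def store_recall(user_id, house_ids):
    """Save the recall list under a per-user key (key format is hypothetical)."""
    recall_index[f"recall:{user_id}"] = list(house_ids)


portrait = {"interest_circles": {"circle_1"}, "typical_rent": 4000}
houses = [
    {"id": "a", "circle": "circle_1", "rent": 4500},
    {"id": "b", "circle": "circle_1", "rent": 6000},
    {"id": "c", "circle": "circle_2", "rent": 3000},
]
store_recall("u1", recall_by_portrait(portrait, houses))
print(recall_index["recall:u1"])
```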
Step 303, sorting:
For the coarsely ranked recalled house source data, the user's demand preference information and the portraits of house state, plot, and business circle are comprehensively acquired and processed by feature engineering on the Spark distributed framework. Online data and offline data are then each used for training to construct a ranking model; the sample data is weighted, and scenarios such as region and house state are finely divided according to the high sparsity of the data and the high-value nature of the transactions. The model gives an accurate ranking score for the house source information, and high-quality long-term rental apartment resources are finally recommended to the user.
Step 304, intervention and adjustment:
After modeling based on the LBS characteristics specific to long-term rental apartments, the system inserts certain house sources at specific positions in the recommendation list according to the company's customized requirements, such as a policy favoring the renting out of particular houses, operational campaign policies, and advertising promotion, thereby offering users more choices. If the recommendation results are insufficient, the system automatically fills the list with related popular, premium house sources. The intervention and adjustment layer is thereby optimized: modeling is based on the LBS pattern specific to long-term rental apartments, yielding a recommendation engine tailored to long-term rental apartments as bulk commodities whose plots are sparsely distributed.
It should be understood that the recall in the present invention may be based on only one of the user requirement information and the user portrait information, or on both. This embodiment shows only recall according to the user portrait; it places no specific restriction on the recall mode, which can be set according to actual conditions.
According to the embodiments, the application has the following beneficial effects:
1. In the method for constructing a recommendation engine according to user requirements and a user portrait provided by the invention, the first recommended data source undergoes feature engineering on the Spark distributed framework to obtain the second recommended data source, which helps shorten the response time for obtaining results.
2. In the method for constructing a recommendation engine according to user requirements and a user portrait provided by the invention, the user portrait information is determined according to the probability information of at least one candidate user portrait, which improves the accuracy and completeness of the user portrait information.
3. The method for constructing a recommendation engine according to user requirements and a user portrait provided by the invention has rich information collection channels, including the user's online APP records, offline accompanied viewings and communication, and telemarketing; a large amount of user information is collected to generate an overall sample model that covers the high-value-item attributes of quality long-term rental apartments, such as prime plot and price.
4. The method for constructing a recommendation engine according to user requirements and a user portrait optimizes personalized recommendation: apartments with a high matching degree are extracted from the sample model library through comprehensive calculation over the user's requirements and portrait.
5. The method for constructing a recommendation engine according to user requirements and a user portrait optimizes the recommendation ranking model: a new matching strategy is formulated for online and offline viewing, contract signing, and similar events, and multi-level sample weights are assigned in the ranking model, reducing the complexity of the existing ranking model; meanwhile, scenarios such as region and house state are finely subdivided in view of the high sparsity of samples and the high-value nature of the transactions, improving the quality of the recommended apartments.
6. The method for constructing a recommendation engine according to user requirements and a user portrait optimizes the intervention and adjustment layer: modeling is based on the LBS (location based service) pattern specific to long-term rental apartments, yielding a recommendation engine tailored to long-term rental apartments as bulk commodities whose plots are sparsely distributed.
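As an illustration of effect 2, the sketch below turns browsing keywords, each carrying a preference value and a consumption value, into probabilities over candidate user portraits and picks the most probable one. The patent does not give the integration formula, so the product of the two values, the normalization, the mapping from keyword to candidate portrait, and all sample data are assumptions.

```python
from collections import defaultdict


def portrait_probabilities(keywords):
    """Aggregate keyword evidence into a probability per candidate portrait.

    keywords: list of dicts with 'portrait' (candidate portrait the keyword
    supports), 'preference' and 'consumption' (both assumed to lie in [0, 1]).
    """
    totals = defaultdict(float)
    for kw in keywords:
        totals[kw["portrait"]] += kw["preference"] * kw["consumption"]
    norm = sum(totals.values())
    if norm == 0:
        return {}
    return {p: v / norm for p, v in totals.items()}


def pick_portrait(keywords):
    """Return the candidate portrait with the highest probability, or None."""
    probs = portrait_probabilities(keywords)
    return max(probs, key=probs.get) if probs else None


keywords = [
    {"keyword": "near subway", "portrait": "commuter", "preference": 0.8, "consumption": 0.5},
    {"keyword": "one bedroom", "portrait": "commuter", "preference": 0.6, "consumption": 0.5},
    {"keyword": "luxury fit-out", "portrait": "premium", "preference": 0.3, "consumption": 1.0},
]
print(pick_portrait(keywords))
```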
While the present invention has been described in detail with reference to the drawings and examples, it is to be understood that the foregoing examples are for illustrative purposes only and are not intended to limit the scope of the present invention. Although the present invention has been described in detail with reference to the foregoing embodiments, it will be apparent to those skilled in the art that various changes may be made and equivalents may be substituted for elements thereof. Any modification, equivalent replacement, or improvement made within the spirit and principle of the present invention should be included in the protection scope of the present invention. The scope of the invention is defined by the appended claims.
Claims (7)
1. A method for constructing a recommendation engine according to user requirements and user portraits is characterized by comprising the following steps:
collecting information of house resources browsed by a user and house resource information, acquiring user information, wherein the user information comprises user portrait information, determining a browsing information set of the user to a house according to the information of the house resources browsed by the user, the browsing information set comprises a plurality of keywords, the keywords comprise preference values and consumption values, the preference values are used for identifying preference degrees of the user to the browsed house, the consumption values are used for identifying consumption receiving degrees of the user to the browsed house, the keywords are integrated to obtain probability information of at least one to-be-selected user portrait, and the user portrait information is determined according to the probability information of the at least one to-be-selected user portrait;
the user information further comprises user demand information, and the user demand information comprises the user's requirements regarding the house price, the distance to the workplace or a subway station, and the business circle;
the house source information is recalled according to the user demand information, and the first recall data source is obtained through filtering and formatting of a data collector;
the house source information is recalled according to the user portrait information, and a second recall data source is obtained by screening through an optical disc arranged in an offline data warehouse and LBS (location based service);
carrying out price matching and section feature matching on the first recall data source and the second recall data source according to LBS service to obtain corresponding matching degrees, and sequentially sequencing from high to low according to the matching degrees to generate a first recommended data source;
performing feature engineering processing on the first recommended data source through a Spark distributed framework, and grading to obtain a second recommended data source;
training on the second recommended data source to obtain a ranking model;
inputting the house source information into the ranking model to obtain the score ranking of the house source information;
setting a scoring threshold value, wherein the house source with the score of the house source information larger than the scoring threshold value is a first recommended house source;
meanwhile, generating a second recommended house source according to the house owner demand;
and constructing a recommendation engine according to the first recommended house source and the second recommended house source.
2. The method of claim 1, wherein the user information is collected online, offline, and by telemarketing.
3. The method of claim 1, wherein the first recommended house source and the second recommended house source are arranged in a crossed manner to construct the recommendation engine.
4. The method of claim 1, wherein the first recall data source is recalled by searching.
5. The method of claim 1, wherein the feature engineering process comprises sequentially aggregating, refining, screening, and complementing the first recommended data source to obtain the second recommended data source.
6. The method of claim 1, wherein said inputting said house source information into said ranking model to obtain the score ranking of said house source information comprises: carrying out weight depiction on the house source information, and carrying out deep division of scenarios such as region and house state according to the sparsity and the transaction attribute of the house source information, to obtain the score ranking of the house source information.
7. The method of claim 1, wherein the recommendation engine further comprises a hotspot house source.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201911411250.3A CN111143689A (en) | 2019-12-31 | 2019-12-31 | Method for constructing recommendation engine according to user requirements and user portrait |
Publications (1)
Publication Number | Publication Date |
---|---|
CN111143689A true CN111143689A (en) | 2020-05-12 |
Family
ID=70522534
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201911411250.3A Withdrawn CN111143689A (en) | 2019-12-31 | 2019-12-31 | Method for constructing recommendation engine according to user requirements and user portrait |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN111143689A (en) |
Patent Citations (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20150032639A1 (en) * | 2013-07-29 | 2015-01-29 | Verizon Patent And Licensing Inc. | System and method for providing notifications on product recalls |
CN105869001A (en) * | 2015-01-19 | 2016-08-17 | 苏宁云商集团股份有限公司 | Customized product recommendation guiding method and system |
US20180253780A1 (en) * | 2017-05-05 | 2018-09-06 | James Wang | Smart matching for real estate transactions |
CN107423442A (en) * | 2017-08-07 | 2017-12-01 | 火烈鸟网络(广州)股份有限公司 | Method and system, storage medium and computer equipment are recommended in application based on user's portrait behavioural analysis |
CN109190024A (en) * | 2018-08-20 | 2019-01-11 | 平安科技(深圳)有限公司 | Information recommendation method, device, computer equipment and storage medium |
CN109615432A (en) * | 2018-12-14 | 2019-04-12 | 成都德迈安科技有限公司 | Consumer behaviour portrait tool based on big data |
CN109658192A (en) * | 2018-12-20 | 2019-04-19 | 重庆锐云科技有限公司 | A kind of source of houses recommended method and server |
CN109815386A (en) * | 2018-12-21 | 2019-05-28 | 厦门市美亚柏科信息股份有限公司 | A kind of construction method, device and storage medium based on user's portrait |
CN109377329A (en) * | 2018-12-25 | 2019-02-22 | 北京时光荏苒科技有限公司 | A kind of source of houses recommended method, device, storage medium and electronic equipment |
Cited By (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111708949A (en) * | 2020-06-19 | 2020-09-25 | 微医云(杭州)控股有限公司 | Medical resource recommendation method and device, electronic equipment and storage medium |
CN111708949B (en) * | 2020-06-19 | 2023-07-25 | 微医云(杭州)控股有限公司 | Medical resource recommendation method and device, electronic equipment and storage medium |
CN111768232A (en) * | 2020-06-24 | 2020-10-13 | 长春初唐网络科技有限公司 | AI-based online and offline marketing tracking matching recommendation method for real estate |
CN112256943A (en) * | 2020-10-22 | 2021-01-22 | 上海适享文化传播有限公司 | Portal portrait extraction method based on combination of natural language processing and knowledge graph |
CN112256943B (en) * | 2020-10-22 | 2024-01-23 | 上海适享文化传播有限公司 | Portal store image extraction method based on natural language processing combined with knowledge graph |
CN112232933A (en) * | 2020-12-11 | 2021-01-15 | 深圳市房多多网络科技有限公司 | House source information recommendation method, device, equipment and readable storage medium |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
WW01 | Invention patent application withdrawn after publication | Application publication date: 20200512 |