CN110245286B - travel recommendation method and device based on data mining - Google Patents

Travel recommendation method and device based on data mining

Info

Publication number
CN110245286B
Authority
CN
China
Prior art keywords
data set
webpage
basic
emotion
user
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201910380945.3A
Other languages
Chinese (zh)
Other versions
CN110245286A (en)
Inventor
余恒兵
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Terminus Beijing Technology Co Ltd
Original Assignee
Terminus Beijing Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Terminus Beijing Technology Co Ltd
Priority to CN201910380945.3A
Publication of CN110245286A
Application granted
Publication of CN110245286B
Legal status: Active

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90 Details of database functions independent of the retrieved data types
    • G06F16/95 Retrieval from the web
    • G06F16/953 Querying, e.g. by the use of web search engines
    • G06F16/9535 Search customisation based on user profiles and personalisation
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/30 Semantic analysis
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06Q INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00 Commerce
    • G06Q30/02 Marketing; Price estimation or determination; Fundraising
    • G06Q30/0201 Market modelling; Market analysis; Collecting market data
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06Q INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00 Commerce
    • G06Q30/02 Marketing; Price estimation or determination; Fundraising
    • G06Q30/0201 Market modelling; Market analysis; Collecting market data
    • G06Q30/0203 Market surveys; Market polls
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06Q INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00 Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/10 Services
    • G06Q50/14 Travel agencies

Landscapes

  • Engineering & Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • Theoretical Computer Science (AREA)
  • Strategic Management (AREA)
  • Development Economics (AREA)
  • Finance (AREA)
  • Accounting & Taxation (AREA)
  • General Physics & Mathematics (AREA)
  • Physics & Mathematics (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Economics (AREA)
  • Tourism & Hospitality (AREA)
  • Databases & Information Systems (AREA)
  • Marketing (AREA)
  • General Business, Economics & Management (AREA)
  • Data Mining & Analysis (AREA)
  • Health & Medical Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • General Engineering & Computer Science (AREA)
  • Game Theory and Decision Science (AREA)
  • Primary Health Care (AREA)
  • Human Resources & Organizations (AREA)
  • Artificial Intelligence (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The method comprises: obtaining basic data and comment data from a website; establishing a mapping relation among a basic data set, a scenery spot data set, and an emotion data set; building a travel recommendation big data analysis environment; cleaning the data sets and performing grouping processing on concepts; obtaining the user's browsing history; performing main subject scenic spot analysis on each jumped-to webpage; obtaining an emotion analysis assignment score for the comment data of the jumped-to webpage; and, according to the emotion data set, the user's webpage dwell time, and the main subject scenic spots of the jumped-to webpages, combining the basic scenic spots obtained by analyzing the basic data set and returning a travel recommendation result after sorting.

Description

travel recommendation method and device based on data mining
Technical Field
The application relates to the field of travel recommendation and data mining, and in particular to a travel recommendation method and device based on data mining.
Background
A tourism recommendation method integrates information on tourism resources, the tourism economy, tourism activities, tourists, and the like according to the actual circumstances of each tourist, and provides the tourist with the most suitable tour route, thereby improving the tourist's travel experience.
Therefore, a travel recommendation method and device can be designed that incorporates data mining technology.
Disclosure of Invention
In view of this, the present application aims to provide a travel recommendation method and device based on data mining that improve the accuracy of travel information recommendation by analyzing the sentiment of posts on a website.
In view of the above, the present application provides a data mining-based travel recommendation method, including:
acquiring basic data and comment data in a website, extracting user attribute information in the basic data through a rule matching algorithm to form a basic data set, extracting scenery spot information in the comment data through a named entity recognition algorithm to form a scenery spot data set, extracting emotion information in the comment data through an emotion analysis algorithm to form an emotion data set, establishing a mapping relation among the basic data set, the scenery spot data set and the emotion data set, importing the mapping relation into a data warehouse, and establishing a travel recommendation big data analysis environment;
cleaning the basic data set, the scenery spot data set and the emotion data set, performing grouping processing on concepts in the basic data set and the scenery spot data set, establishing an index after performing travel basic recommendation on the basic data set, and performing travel concept expansion on the scenery spot data set;
acquiring the user's browsing history, extracting the user's dwell time on each webpage and the jump sequence, recording the anchor text information clicked during the user's jumps, performing main subject scenic spot analysis on the jumped-to webpages, acquiring emotion analysis assignment scores for the comment data of the jumped-to webpages, and importing the results into the travel recommendation big data analysis environment;
and, according to the emotion data set, the user's webpage dwell time, and the main subject scenic spots of the webpages the user jumped to, combining the basic scenic spots obtained by analyzing the basic data set, and returning a travel recommendation result after sorting.
In some embodiments, extracting the scenic spot information in the comment data through the named entity recognition algorithm to form the scenery spot data set further includes:
obtaining the subject degree of each scenic spot in the webpage according to the font size, color and position of the scenic spot information in the webpage, and determining the main subject scenic spot of the webpage after sorting.
In some embodiments, establishing the mapping relation among the basic data set, the scenery spot data set, and the emotion data set includes:
establishing a first mapping relation between the basic data set and the scenery spot data set;
and establishing a second mapping relation between the scenery spot data set and the emotion data set.
In some embodiments, the subject degree of each scenic spot in the webpage is obtained according to the font size, color, and position of the scenic spot information in the webpage, and is calculated by the following formula:
$$D = \sum_{i} \omega_i \cdot P_i$$
where $D$ is the subject degree of the scenic spot, $\omega_i$ is the weighting coefficient of the $i$-th webpage attribute, and $P_i$ is the quantized value of the $i$-th webpage attribute in the webpage.
In some embodiments, establishing the index after performing travel basic recommendation on the basic data set and performing travel concept expansion on the scenery spot data set includes:
querying a preset travel basic recommendation model according to the basic user information entered at user registration to obtain a basic recommendation result, and establishing an index relation with the scenic spots in the basic recommendation result;
and, according to the name of a scenic spot, expanding to the scenic spots in the geographical area to which it belongs, and obtaining scenic spots with travel characteristics by the expansion.
In some embodiments, recording the anchor text information clicked during the user's jumps and performing main subject scenic spot analysis on the jumped-to webpages includes:
extracting the scenic spot information in the anchor text and performing semantic expansion to obtain a first main subject concept;
analyzing the main subject scenic spot of the jumped-to webpage to obtain a second main subject concept;
and performing an intersection operation on the first main subject concept and the second main subject concept to obtain the main subject and scenic spot concepts the user is concerned with.
In some embodiments, acquiring the emotion analysis assignment score for the comment data of the jumped-to webpage includes:
when the jumped-to webpage contains comment information from the logged-in user, performing emotion analysis directly on that comment information to determine the emotion analysis assignment score of the jumped-to webpage;
and when the jumped-to webpage contains no comment information from the logged-in user, querying the emotion data set to determine the emotion analysis assignment score of the jumped-to webpage.
In some embodiments, returning the travel recommendation result after sorting, according to the emotion data set, the user's webpage dwell time, and the main subject scenic spots of the webpages the user jumped to, in combination with the basic scenic spots obtained by analyzing the basic data set, includes:
when there is no intersection between the main subject scenic spots and the basic scenic spots, returning all of the main subject scenic spots and the basic scenic spots as the recommendation result.
In view of the above, the present application also provides a data mining-based travel recommendation device, including:
a building module, configured to acquire basic data and comment data in a website, extract user attribute information in the basic data through a rule matching algorithm to form a basic data set, extract scenery spot information in the comment data through a named entity recognition algorithm to form a scenery spot data set, extract emotion information in the comment data through an emotion analysis algorithm to form an emotion data set, establish a mapping relation among the basic data set, the scenery spot data set and the emotion data set, import the mapping relation into a data warehouse, and build a travel recommendation big data analysis environment;
an arrangement module, configured to clean the basic data set, the scenery spot data set and the emotion data set, perform grouping processing on concepts in the basic data set and the scenery spot data set, establish an index after performing travel basic recommendation on the basic data set, and perform travel concept expansion on the scenery spot data set;
a jump module, configured to acquire the user's browsing history, extract the user's dwell time on each webpage and the jump sequence, record the anchor text information clicked during the user's jumps, perform main subject scenic spot analysis on the jumped-to webpages, acquire emotion analysis assignment scores for the comment data of the jumped-to webpages, and import the results into the travel recommendation big data analysis environment;
and a return module, configured to combine the basic scenic spots obtained by analyzing the basic data set according to the emotion data set, the user's webpage dwell time, and the main subject scenic spots of the webpages the user jumped to, and to return the travel recommendation result after sorting.
In some embodiments, the building module includes:
a first mapping unit, configured to control task distribution and resource allocation and to establish a first mapping relation between the basic data set and the scenery spot data set;
and a second mapping unit, configured to establish a second mapping relation between the scenery spot data set and the emotion data set.
Drawings
In the drawings, like reference numerals refer to the same or similar parts or elements throughout the several views unless otherwise specified, and the drawings are not necessarily drawn to scale. It should be understood that these drawings depict only some embodiments in accordance with the present disclosure and are not to be considered limiting of the scope of the disclosure.
FIG. 1 shows a flow diagram of a data mining-based travel recommendation method according to an embodiment of the invention.
FIG. 2 shows a block diagram of a data mining-based travel recommendation device according to an embodiment of the invention.
FIG. 3 shows a structural diagram of the building module according to an embodiment of the invention.
Detailed Description
The present application is described in further detail below with reference to the drawings and the embodiments. It should be understood that the specific embodiments are set forth herein for the purpose of illustration and not as a definition of the limits of the invention.
It should be noted that the embodiments and features of the embodiments in the present application may be combined with each other without conflict. The present application will be described in detail below with reference to the attached drawings and in conjunction with the embodiments.
FIG. 1 shows a flow diagram of a data mining-based travel recommendation method according to an embodiment of the invention. As shown in FIG. 1, the data mining-based travel recommendation method includes:
Step S11: acquiring basic data and comment data in a website, extracting user attribute information in the basic data through a rule matching algorithm to form a basic data set, extracting scenery spot information in the comment data through a named entity recognition algorithm to form a scenery spot data set, extracting emotion information in the comment data through an emotion analysis algorithm to form an emotion data set, establishing a mapping relation among the basic data set, the scenery spot data set and the emotion data set, importing the mapping relation into a data warehouse, and building a travel recommendation big data analysis environment.
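The three extractions in step S11 can be illustrated with a minimal sketch. The patent names only the algorithm families (rule matching, named entity recognition, emotion analysis), so everything concrete here is an assumption for illustration: the regular-expression rules, the `KNOWN_SPOTS` gazetteer standing in for a trained entity recognizer, and the word lists standing in for an emotion analysis model.

```python
import re

# Hypothetical gazetteer and word lists; a real system would use trained
# named-entity-recognition and emotion-analysis models instead.
KNOWN_SPOTS = {"Great Wall", "Summer Palace", "West Lake"}
POSITIVE_WORDS = {"beautiful", "amazing", "lovely"}
NEGATIVE_WORDS = {"crowded", "dirty", "boring"}

# Rule matching: one regular expression per user attribute (assumed format).
ATTRIBUTE_RULES = {
    "age": re.compile(r"age:\s*(\d+)", re.IGNORECASE),
    "city": re.compile(r"city:\s*([A-Za-z ]+)", re.IGNORECASE),
}


def extract_user_attributes(profile_text):
    """Build one basic-data-set record from free-form profile text."""
    attributes = {}
    for name, pattern in ATTRIBUTE_RULES.items():
        match = pattern.search(profile_text)
        if match:
            attributes[name] = match.group(1).strip()
    return attributes


def extract_spots(comment):
    """Gazetteer lookup standing in for named entity recognition."""
    lowered = comment.lower()
    return sorted(spot for spot in KNOWN_SPOTS if spot.lower() in lowered)


def sentiment_score(comment):
    """Lexicon score: +1 per positive word, -1 per negative word."""
    words = re.findall(r"[a-z]+", comment.lower())
    return sum((w in POSITIVE_WORDS) - (w in NEGATIVE_WORDS) for w in words)
```

Each function yields the records of one data set; the mapping relations described next link those records by scenic spot and user.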
In some embodiments, extracting the scenic spot information in the comment data through the named entity recognition algorithm to form the scenery spot data set further includes:
obtaining the subject degree of each scenic spot in the webpage according to the font size, color and position of the scenic spot information in the webpage, and determining the main subject scenic spot of the webpage after sorting.
In some embodiments, establishing the mapping relation among the basic data set, the scenery spot data set, and the emotion data set includes:
establishing a first mapping relation between the basic data set and the scenery spot data set;
and establishing a second mapping relation between the scenery spot data set and the emotion data set.
Specifically, the basic data information of a scenic spot can be queried through the first mapping relation; similarly, the emotion data information of users for the scenic spot can be queried through the second mapping relation. The mappings can support fast data lookup by means such as building an index.
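As a minimal sketch of the two mapping relations, the following builds in-memory indexes keyed by scenic spot. The record layouts (`(user_id, spot)` pairs and `(spot, user_id, score)` triples) and the function names are assumptions made for illustration; the patent only states that the mappings exist and that an index can speed up lookup.

```python
from collections import defaultdict


def build_first_mapping(basic_records):
    """First mapping: scenic spot -> ids of users in the basic data set.

    basic_records is an assumed list of (user_id, spot) pairs.
    """
    index = defaultdict(set)
    for user_id, spot in basic_records:
        index[spot].add(user_id)
    return index


def build_second_mapping(emotion_records):
    """Second mapping: scenic spot -> {user id: emotion score}.

    emotion_records is an assumed list of (spot, user_id, score) triples.
    """
    index = defaultdict(dict)
    for spot, user_id, score in emotion_records:
        index[spot][user_id] = score
    return index


# Hypothetical sample data.
first = build_first_mapping([("u1", "Great Wall"), ("u2", "Great Wall"),
                             ("u2", "Summer Palace")])
second = build_second_mapping([("Great Wall", "u1", 0.8),
                               ("Summer Palace", "u2", -0.2)])
```

With both indexes keyed by scenic spot, a single dictionary lookup answers "which users relate to this spot" and "how do users feel about this spot".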
In some embodiments, the subject degree of each scenic spot in the webpage is obtained according to the font size, color, and position of the scenic spot information in the webpage, and is calculated by the following formula:
$$D = \sum_{i} \omega_i \cdot P_i$$
where $D$ is the subject degree of the scenic spot, $\omega_i$ is the weighting coefficient of the $i$-th webpage attribute, and $P_i$ is the quantized value of the $i$-th webpage attribute in the webpage.
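The weighted sum can be sketched directly. The weights and the quantized attribute values below are invented for illustration (the patent gives no concrete numbers or quantization scheme); the attribute order is assumed to be font size, color, position.

```python
def subject_degree(weights, values):
    """Compute D = sum_i(w_i * P_i) for one scenic spot."""
    if len(weights) != len(values):
        raise ValueError("weights and values must have the same length")
    return sum(w * p for w, p in zip(weights, values))


def rank_spots(spot_attributes, weights):
    """Sort scenic spots by subject degree, highest first."""
    scores = {spot: subject_degree(weights, values)
              for spot, values in spot_attributes.items()}
    return sorted(scores, key=scores.get, reverse=True)


# Hypothetical weights for (font size, color, position).
WEIGHTS = (0.5, 0.2, 0.3)
# Hypothetical quantized attribute values in [0, 1] for two spots on a page.
PAGE = {
    "Great Wall": (1.0, 0.0, 1.0),
    "Summer Palace": (0.5, 1.0, 0.0),
}
ranking = rank_spots(PAGE, WEIGHTS)
```

The first entry of `ranking` would be taken as the main subject scenic spot of the page.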
Step S12: cleaning the basic data set, the scenery spot data set and the emotion data set, performing grouping processing on concepts in the basic data set and the scenery spot data set, establishing an index after performing travel basic recommendation on the basic data set, and performing travel concept expansion on the scenery spot data set.
In some embodiments, a preset travel basic recommendation model is queried according to the basic user information entered at user registration to obtain a basic recommendation result, and an index relation with the scenic spots in the basic recommendation result is established.
In some embodiments, according to the name of a scenic spot, the scenic spots in the geographical area to which it belongs are expanded, and scenic spots with travel characteristics are obtained by the expansion.
Specifically, concept expansion may be performed by looking up public geographic information databases and travel databases.
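A minimal sketch of the basic recommendation lookup and the geographic concept expansion follows. The lookup tables are hypothetical stand-ins: `BASIC_MODEL` plays the role of the preset travel basic recommendation model, and `SPOT_REGION` plays the role of a public geographic information database. The age banding is likewise an assumption.

```python
# Hypothetical preset model: (age band, interest) -> recommended spots.
BASIC_MODEL = {
    ("young", "history"): ["Great Wall", "Forbidden City"],
    ("senior", "nature"): ["West Lake"],
}

# Hypothetical geographic database: scenic spot -> region.
SPOT_REGION = {
    "Great Wall": "Beijing",
    "Forbidden City": "Beijing",
    "Summer Palace": "Beijing",
    "West Lake": "Hangzhou",
}


def basic_recommendation(age, interest):
    """Query the preset model with registration data (assumed age bands)."""
    band = "young" if age < 40 else "senior"
    return BASIC_MODEL.get((band, interest), [])


def build_index(user_id, spots, index=None):
    """Index relation: scenic spot -> users it was recommended to."""
    index = {} if index is None else index
    for spot in spots:
        index.setdefault(spot, set()).add(user_id)
    return index


def expand_spot(spot):
    """Concept expansion: other spots in the same geographical area."""
    region = SPOT_REGION.get(spot)
    if region is None:
        return []
    return sorted(s for s, r in SPOT_REGION.items()
                  if r == region and s != spot)
```

For example, `expand_spot("Great Wall")` yields the other Beijing entries in the sample table, which is the kind of region-level expansion the step describes.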
Step S13: acquiring the user's browsing history, extracting the user's dwell time on each webpage and the jump sequence, recording the anchor text information clicked during the user's jumps, performing main subject scenic spot analysis on the jumped-to webpages, acquiring emotion analysis assignment scores for the comment data of the jumped-to webpages, and importing the results into the travel recommendation big data analysis environment.
For example, on a travel website, a webpage generally introduces a scenic spot or geographic area, which may be understood as the main subject scenic spot of that webpage.
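The browsing-history extraction in step S13 can be sketched as follows. The log format is an assumption: a time-ordered list of `(timestamp_seconds, url, anchor_text)` tuples, where `anchor_text` is the link text that led to the page (or `None` for the entry page), plus a known session end time.

```python
def dwell_times(visits, session_end):
    """Return (url, seconds) pairs for a time-ordered visit log.

    The dwell time on a page is the gap until the next page load; the
    last page is closed off by session_end.
    """
    times = []
    for index, (start, url, _anchor) in enumerate(visits):
        if index + 1 < len(visits):
            end = visits[index + 1][0]
        else:
            end = session_end
        times.append((url, end - start))
    return times


def jump_sequence(visits):
    """The order in which the user moved between pages."""
    return [url for _start, url, _anchor in visits]


def clicked_anchors(visits):
    """Anchor text clicked to reach each page; the entry page has none."""
    return [anchor for _start, _url, anchor in visits if anchor is not None]


# Hypothetical session log.
LOG = [
    (0, "/beijing", None),
    (30, "/great-wall", "Great Wall"),
    (90, "/badaling", "Badaling section"),
]
```

Here `dwell_times(LOG, session_end=100)` attributes 30, 60, and 10 seconds to the three pages respectively.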
In some embodiments, recording the anchor text information clicked during the user's jumps and performing main subject scenic spot analysis on the jumped-to webpages includes:
extracting the scenic spot information in the anchor text and performing semantic expansion to obtain a first main subject concept;
analyzing the main subject scenic spot of the jumped-to webpage to obtain a second main subject concept;
and performing an intersection operation on the first main subject concept and the second main subject concept to obtain the main subject and scenic spot concepts the user is concerned with.
For example, when the anchor text information contains the word 'Laogong', it can be expanded through the ontology into concepts related to 'Laogong', such as 'Beijing', 'Tianan field', 'great wall', and 'Saxiong'.
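The expansion-then-intersection step can be sketched with set operations. The `ONTOLOGY` table is a hypothetical stand-in for the ontology the text refers to, and treating each main subject concept as a set of related terms is an assumption made for illustration.

```python
# Hypothetical ontology: term -> related concepts (including itself).
ONTOLOGY = {
    "Great Wall": {"Great Wall", "Beijing", "Badaling"},
    "Badaling": {"Badaling", "Great Wall", "Beijing"},
    "West Lake": {"West Lake", "Hangzhou"},
}


def expand(terms):
    """Semantic expansion: union of the related concepts of each term.

    Terms missing from the ontology expand only to themselves.
    """
    concepts = set()
    for term in terms:
        concepts |= ONTOLOGY.get(term, {term})
    return concepts


def focus_concepts(anchor_spots, page_spots):
    """Intersect the first concept (from the anchor text) with the
    second concept (from the jumped-to page)."""
    first_concept = expand(anchor_spots)
    second_concept = expand(page_spots)
    return first_concept & second_concept
```

A click on an anchor mentioning one spot that lands on a page about a related spot therefore keeps only the concepts both share, which is read as what the user is actually interested in.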
In some embodiments, acquiring the emotion analysis assignment score for the comment data of the jumped-to webpage includes:
when the jumped-to webpage contains comment information from the logged-in user, performing emotion analysis directly on that comment information to determine the emotion analysis assignment score of the jumped-to webpage;
and when the jumped-to webpage contains no comment information from the logged-in user, querying the emotion data set to determine the emotion analysis assignment score of the jumped-to webpage.
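The two-branch scoring rule can be sketched as below. Several details are assumptions not stated in the patent: comments are dictionaries with `user` and `text` keys, multiple scores are averaged, the emotion data set is a `spot -> {user: score}` mapping, and a neutral 0.0 is returned when neither source has data. The `analyze` argument is whatever emotion analysis function the system uses.

```python
def page_sentiment_score(page_comments, user_id, spot, emotion_dataset, analyze):
    """Score a jumped-to page for one logged-in user.

    Branch 1: the user commented on this page, so analyze those comments.
    Branch 2: no own comments, so fall back to the stored emotion data set.
    """
    own_texts = [c["text"] for c in page_comments if c["user"] == user_id]
    if own_texts:
        scores = [analyze(text) for text in own_texts]
        return sum(scores) / len(scores)  # averaging is an assumption

    stored = emotion_dataset.get(spot, {})
    if stored:
        return sum(stored.values()) / len(stored)
    return 0.0  # assumed neutral default when nothing is known


def toy_analyze(text):
    """Hypothetical stand-in for a real emotion analysis model."""
    return 1.0 if "beautiful" in text.lower() else -1.0
```

Preferring the user's own comment keeps the score personal; the data-set fallback supplies a crowd estimate when the user has said nothing.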
Step S14: according to the emotion data set, the user's webpage dwell time, and the main subject scenic spots of the webpages the user jumped to, combining the basic scenic spots obtained by analyzing the basic data set, and returning a travel recommendation result after sorting.
For example, the longer the user stays on a webpage, the more closely the user is attending to the main subject scenic spot of that webpage; and the higher the overlap between the main subject scenic spot of the next webpage the user jumps to and that of the current webpage, the more the user wants detailed information about that scenic spot and the greater the user's interest in it.
In some embodiments, returning the travel recommendation result after sorting, according to the emotion data set, the user's webpage dwell time, and the main subject scenic spots of the webpages the user jumped to, in combination with the basic scenic spots obtained by analyzing the basic data set, includes:
when there is no intersection between the main subject scenic spots and the basic scenic spots, returning all of the main subject scenic spots and the basic scenic spots as the recommendation result.
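A minimal sketch of the final combination step follows. The patent specifies only the no-intersection case (return everything); the scoring formula (emotion score multiplied by dwell time) and the behavior when an intersection exists (return only the shared spots, ranked) are assumptions chosen for illustration, not the patented method.

```python
def spot_score(sentiment, dwell_seconds):
    """Assumed scoring rule: emotion score weighted by dwell time."""
    return sentiment * dwell_seconds


def recommend(main_subject_scores, basic_spots):
    """Combine browsing-derived spots with basic recommendation spots.

    main_subject_scores maps each main subject scenic spot to its score;
    basic_spots is the list from the basic recommendation model.
    """
    ranked = sorted(main_subject_scores, key=main_subject_scores.get,
                    reverse=True)
    basic_set = set(basic_spots)
    shared = [spot for spot in ranked if spot in basic_set]
    if shared:
        # Assumption: spots confirmed by both sources are returned alone.
        return shared
    # No intersection: return all main subject and basic spots.
    return ranked + list(basic_spots)


# Hypothetical scores from two browsed pages.
SCORES = {
    "Great Wall": spot_score(0.9, 60),
    "Summer Palace": spot_score(0.5, 30),
}
```

With these sample scores, a basic list containing "Summer Palace" yields just that spot, while a disjoint basic list yields the ranked browsing spots followed by the basic ones.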
FIG. 2 shows a block diagram of a data mining-based travel recommendation device according to an embodiment of the invention. As shown in FIG. 2, the data mining-based travel recommendation device may be divided into:
a building module 21, configured to acquire basic data and comment data in a website, extract user attribute information in the basic data through a rule matching algorithm to form a basic data set, extract scenery spot information in the comment data through a named entity recognition algorithm to form a scenery spot data set, extract emotion information in the comment data through an emotion analysis algorithm to form an emotion data set, establish a mapping relation among the basic data set, the scenery spot data set and the emotion data set, import the mapping relation into a data warehouse, and build a travel recommendation big data analysis environment;
an arrangement module 22, configured to clean the basic data set, the scenery spot data set, and the emotion data set, perform grouping processing on concepts in the basic data set and the scenery spot data set, establish an index after performing travel basic recommendation on the basic data set, and perform travel concept expansion on the scenery spot data set;
a jump module 23, configured to acquire the user's browsing history, extract the user's dwell time on each webpage and the jump sequence, record the anchor text information clicked during the user's jumps, perform main subject scenic spot analysis on the jumped-to webpages, acquire emotion analysis assignment scores for the comment data of the jumped-to webpages, and import the results into the travel recommendation big data analysis environment;
and a return module 24, configured to combine the basic scenic spots obtained by analyzing the basic data set according to the emotion data set, the user's webpage dwell time, and the main subject scenic spots of the webpages the user jumped to, and to return the travel recommendation result after sorting.
FIG. 3 shows a structural diagram of the building module according to an embodiment of the invention.
As can be seen in FIG. 3, the building module 21 comprises:
a first mapping unit 211, configured to control task distribution and resource allocation and to establish a first mapping relation between the basic data set and the scenery spot data set;
and a second mapping unit 212, configured to establish a second mapping relation between the scenery spot data set and the emotion data set.
In the description herein, reference to the terms "one embodiment," "some embodiments," "examples," "specific examples," or "some examples," etc., means that a particular feature, structure, material, or characteristic described in connection with the embodiment or example is included in at least one embodiment or example of the invention.
Any process or method descriptions in flow charts or otherwise described herein may be understood as representing modules, segments, or portions of code which include one or more executable instructions for implementing specific logical functions or steps in the process, and the scope of the preferred embodiments of the present invention includes other implementations in which functions may be executed out of order from that shown or discussed, including substantially concurrently or in reverse order, depending on the functionality involved, as would be understood by those reasonably skilled in the art of the present invention.
For the purposes of this description, a "computer-readable medium" can be any medium that can contain, store, communicate, propagate, or transport the program for use by or in connection with the instruction execution system, apparatus, or device (e.g., a computer-based system, or other system that can fetch the instructions from the instruction execution system, apparatus, or device and execute the instructions).
For example, if implemented in hardware, as in another embodiment, it may be implemented using any one or a combination of the following techniques known in the art: a discrete logic circuit having logic gates for implementing logic functions on data signals, an application specific integrated circuit having appropriate combinational logic gates, a programmable gate array (PGA), a field programmable gate array (FPGA), or the like.
It will be understood by those skilled in the art that all or part of the steps of the methods in the above embodiments may be implemented by a program instructing the associated hardware; the program may be stored in a computer-readable storage medium and, when executed, performs one or a combination of the steps of the method embodiments.
In addition, the functional units in each embodiment of the present invention may be integrated into one processing module, or each unit may exist alone physically, or two or more units may be integrated into one module.
The above description is only for the specific embodiment of the present invention, but the scope of the present invention is not limited thereto, and any person skilled in the art can easily conceive various changes or substitutions within the technical scope of the present invention, and these should be covered by the scope of the present invention. Therefore, the protection scope of the present invention shall be subject to the protection scope of the appended claims.

Claims (9)

1. A travel recommendation method based on data mining, comprising:
acquiring basic data and comment data in a website, extracting user attribute information in the basic data through a rule matching algorithm to form a basic data set, extracting scenery spot information in the comment data through a named entity recognition algorithm to form a scenery spot data set, extracting emotion information in the comment data through an emotion analysis algorithm to form an emotion data set, establishing a mapping relation among the basic data set, the scenery spot data set and the emotion data set, importing the mapping relation into a data warehouse, and establishing a travel recommendation big data analysis environment;
cleaning the basic data set, the scenery spot data set and the emotion data set, performing classification processing on concepts in the basic data set and the scenery spot data set, establishing an index after performing travel basic recommendation on the basic data set, and performing travel concept expansion on the scenery spot data set;
acquiring the user's browsing history, extracting the user's dwell time on each webpage and the jump sequence, recording the anchor text information clicked during the user's jumps, performing main subject scenic spot analysis on the jumped-to webpages, acquiring emotion analysis assignment scores for the comment data of the jumped-to webpages, and importing the results into the travel recommendation big data analysis environment;
according to the emotion data set, the user's webpage dwell time, and the main subject scenic spots of the webpages the user jumped to, combining the basic scenic spots obtained by analyzing the basic data set, and returning a travel recommendation result after sorting;
wherein recording the anchor text information clicked during the user's jumps and performing main subject scenic spot analysis on the jumped-to webpages comprises:
extracting the scenic spot information in the anchor text and performing semantic expansion to obtain a first main subject concept;
analyzing the main subject scenic spot of the jumped-to webpage to obtain a second main subject concept;
and performing an intersection operation on the first main subject concept and the second main subject concept to obtain the main subject and scenic spot concepts the user is concerned with.
2. The method of claim 1, wherein extracting the scenic spot information in the comment data through the named entity recognition algorithm to form the scenery spot data set further comprises:
obtaining the subject degree of each scenic spot in the webpage according to the font size, color and position of the scenic spot information in the webpage, and determining the main subject scenic spot of the webpage after sorting.
3. The method of claim 1, wherein establishing the mapping relation among the basic data set, the scenery spot data set, and the emotion data set comprises:
establishing a first mapping relation between the basic data set and the scenery spot data set;
and establishing a second mapping relation between the scenery spot data set and the emotion data set.
4. The method of claim 2, wherein the subject degree of each scenic spot in the webpage is obtained according to the font size, color and position of the scenic spot information in the webpage, and is calculated by the following formula:
$$D = \sum_{i} \omega_i \cdot P_i$$
where $D$ is the subject degree of the scenic spot, $\omega_i$ is the weighting coefficient of the $i$-th webpage attribute, and $P_i$ is the quantized value of the $i$-th webpage attribute in the webpage.
5. The method of claim 1, wherein establishing the index after performing travel basic recommendation on the basic data set and performing travel concept expansion on the scenery spot data set comprises:
querying a preset travel basic recommendation model according to the basic user information entered at user registration to obtain a basic recommendation result, and establishing an index relation with the scenic spots in the basic recommendation result;
and, according to the name of a scenic spot, expanding to the scenic spots in the geographical area to which it belongs, and obtaining scenic spots with travel characteristics by the expansion.
6. The method of claim 1, wherein obtaining emotion analysis assignment scores for the comment data of the jumped-to webpage comprises:
when comment information of the logged-in user exists in the jumped-to webpage, performing emotion analysis directly on the comment information, and determining the emotion analysis assignment score of the jumped-to webpage;
and when comment information of the logged-in user does not exist in the jumped-to webpage, querying the emotion data set, and determining the emotion analysis assignment score of the jumped-to webpage.
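The two branches of claim 6 might be sketched as below. The sketch assumes that the emotion data set can be looked up by page URL, that several comments by the same user are averaged, and that a scoring function `analyze` is supplied by the caller; none of these details is stated in the claims, and `toy_analyze` is a hypothetical word-list stand-in for a real emotion analysis algorithm.

```python
from typing import Callable, Mapping, Optional, Sequence

# Hypothetical word lists used only by the toy analyzer below.
POSITIVE = {"beautiful", "great"}
NEGATIVE = {"crowded", "dirty"}


def toy_analyze(comment: str) -> float:
    """Toy score: count of positive words minus count of negative words."""
    words = comment.lower().split()
    return float(sum(w in POSITIVE for w in words) - sum(w in NEGATIVE for w in words))


def page_emotion_score(page_url: str,
                       user_comments: Sequence[str],
                       emotion_data_set: Mapping[str, float],
                       analyze: Callable[[str], float] = toy_analyze) -> Optional[float]:
    """Score the user's own comments if present, otherwise query the emotion data set."""
    if user_comments:
        # Branch 1: the logged-in user commented on the page, so analyze directly.
        scores = [analyze(comment) for comment in user_comments]
        return sum(scores) / len(scores)  # averaging is an assumption
    # Branch 2: no comment by this user, so fall back to the stored data set.
    return emotion_data_set.get(page_url)
```

Returning `None` when neither source has a score leaves the decision about unscored pages to the caller, which is a choice of this sketch rather than something the claim prescribes.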
7. The method of claim 1, wherein returning the travel recommendation results after sorting, according to the emotion data set, the dwell time of the user on each webpage, and the subject scenic spots of the webpages the user jumped to, in combination with the basic scenic spots obtained by analyzing the basic data set, comprises:
and when no intersection exists between the main scenic spot and the basic scenic spot, returning all the main scenic spots and the basic scenic spots as recommendation results.
8. A travel recommendation device based on data mining, comprising:
a building module, used for acquiring basic data and comment data in a website, extracting user attribute information in the basic data through a rule matching algorithm to form a basic data set, extracting scenery spot information in the comment data through a named entity recognition algorithm to form a scenery spot data set, extracting emotion information in the comment data through an emotion analysis algorithm to form an emotion data set, establishing a mapping relation among the basic data set, the scenery spot data set and the emotion data set, importing the mapping relation into a data warehouse, and building a travel recommendation big data analysis environment;
the arrangement module is used for cleaning the basic data set, the scenery spot data set and the emotion data set, performing grouping processing on concepts in the basic data set and the scenery spot data set, establishing an index after travel basic recommendation is performed on the basic data set, and performing travel concept expansion on the scenery spot data set;
the jump module is used for acquiring the browsing history of a user, extracting the dwell time and the jump sequence of the user in each webpage, recording the anchor text information clicked during the user's jumps, performing subject scenery spot analysis on the jumped-to webpage, acquiring emotion analysis assignment scores of the comment data of the jumped-to webpage, and importing them into the travel recommendation big data analysis environment;
the return module is used for combining the basic scenic spots obtained by analyzing the basic data set according to the emotion data set, the stay time of the user webpage and the theme scenic spots of the user webpage, and returning a travel recommendation result after sequencing;
wherein recording the anchor text information clicked during the user's jumps and performing subject sight spot analysis on the jumped-to webpage comprises:
extracting the sight spot information in the anchor text, and performing semantic expansion to obtain a first subject concept;
analyzing the subject sight spots of the jumped-to webpage to obtain a second subject concept;
and performing an intersection operation on the first subject concept and the second subject concept to obtain the subject sight spot concept that the user is concerned with.
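The three steps that close claim 8 (semantic expansion of the anchor text, subject analysis of the jumped-to webpage, and intersection) might be sketched with plain sets. The `related` lookup table is a hypothetical stand-in for whatever semantic expansion resource is actually used, and the sight spot names are invented for the example.

```python
from typing import Iterable, Mapping, Set


def expand(spot_names: Iterable[str], related: Mapping[str, Set[str]]) -> Set[str]:
    """Semantic expansion: each sight spot plus its related concepts."""
    expanded: Set[str] = set()
    for name in spot_names:
        expanded.add(name)
        expanded.update(related.get(name, set()))
    return expanded


def user_subject_concepts(anchor_spots: Iterable[str],
                          page_subject_spots: Iterable[str],
                          related: Mapping[str, Set[str]]) -> Set[str]:
    """Intersect the anchor-text concepts with the page's subject sight spots."""
    first_concept = expand(anchor_spots, related)   # from the clicked anchor text
    second_concept = set(page_subject_spots)        # from the jumped-to webpage
    return first_concept & second_concept
```

With `related = {"West Lake": {"Leifeng Pagoda", "Su Causeway"}}`, a click on an anchor mentioning "West Lake" that leads to a page whose subject sight spots are "Leifeng Pagoda" and "Lingyin Temple" yields `{"Leifeng Pagoda"}` as the concept the user is concerned with.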
9. The apparatus of claim 8, wherein the building module comprises:
the first mapping unit is used for controlling the distribution and resource allocation of tasks, and establishing a first mapping relation between the basic data set and the scenery spot data set;
and the second mapping unit is used for establishing a second mapping relation between the sight spot data set and the emotion data set.
CN201910380945.3A 2019-05-08 2019-05-08 travel recommendation method and device based on data mining Active CN110245286B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910380945.3A CN110245286B (en) 2019-05-08 2019-05-08 travel recommendation method and device based on data mining

Publications (2)

Publication Number Publication Date
CN110245286A (en) 2019-09-17
CN110245286B (en) 2020-01-31

Family

ID=67883835

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910380945.3A Active CN110245286B (en) 2019-05-08 2019-05-08 travel recommendation method and device based on data mining

Country Status (1)

Country Link
CN (1) CN110245286B (en)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111191127B (en) * 2019-12-24 2023-02-03 重庆特斯联智慧科技股份有限公司 Travel recommendation method and system based on correlation analysis algorithm
CN111612590A (en) * 2020-03-19 2020-09-01 江苏智檬智能科技有限公司 Scenic spot recommendation method and device based on artificial intelligence big data
CN117077901B (en) * 2023-10-17 2024-01-05 北京铭洋商务服务有限公司 Travel data processing method and system

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105718184A (en) * 2014-12-05 2016-06-29 北京搜狗科技发展有限公司 Data processing method and apparatus
CN106202252A (en) * 2016-06-29 2016-12-07 厦门趣处网络科技有限公司 Travel recommendation method and system based on user emotion analysis
CN108681739A (en) * 2018-03-26 2018-10-19 安徽师范大学 Famous tourist city recommendation method based on user sentiment and temporal dynamics
CN109284443A (en) * 2018-11-28 2019-01-29 四川亨通网智科技有限公司 Tourism recommendation method and system based on crawler technology

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105427209A (en) * 2015-11-24 2016-03-23 余元辉 Panoramic smart travel system
US10332039B2 (en) * 2016-08-17 2019-06-25 International Business Machines Corporation Intelligent travel planning
CN107423837A (en) * 2017-04-12 2017-12-01 宁夏丝路风情旅游网络股份有限公司 The Intelligent planning method and system of tourism route

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
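Personalized scenic spot recommendation using online reviews; Wang Shaobing (王少兵) et al.; Journal of Huaqiao University (Natural Science) (《华侨大学学报(自然科学版)》); 2018-05-20; Vol. 39, No. 3; pp. 467-472 *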

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant