CN110245286A - Travel recommendation method and device based on data mining - Google Patents

Publication number
CN110245286A
Authority
CN
China
Prior art keywords
data collection
webpage
sight spot
data
basic
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201910380945.3A
Other languages
Chinese (zh)
Other versions
CN110245286B (en)
Inventor
余恒兵
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Terminus Beijing Technology Co Ltd
Original Assignee
Terminus Beijing Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Terminus Beijing Technology Co Ltd
Priority to CN201910380945.3A
Publication of CN110245286A
Application granted
Publication of CN110245286B
Legal status: Active
Anticipated expiration

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90 Details of database functions independent of the retrieved data types
    • G06F16/95 Retrieval from the web
    • G06F16/953 Querying, e.g. by the use of web search engines
    • G06F16/9535 Search customisation based on user profiles and personalisation
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/30 Semantic analysis
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06Q INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00 Commerce
    • G06Q30/02 Marketing; Price estimation or determination; Fundraising
    • G06Q30/0201 Market modelling; Market analysis; Collecting market data
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06Q INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00 Commerce
    • G06Q30/02 Marketing; Price estimation or determination; Fundraising
    • G06Q30/0201 Market modelling; Market analysis; Collecting market data
    • G06Q30/0203 Market surveys; Market polls
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06Q INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00 Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/10 Services
    • G06Q50/14 Travel agencies

Landscapes

  • Engineering & Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • Theoretical Computer Science (AREA)
  • Strategic Management (AREA)
  • Accounting & Taxation (AREA)
  • Development Economics (AREA)
  • Finance (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Entrepreneurship & Innovation (AREA)
  • General Business, Economics & Management (AREA)
  • Tourism & Hospitality (AREA)
  • Economics (AREA)
  • Marketing (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • General Health & Medical Sciences (AREA)
  • General Engineering & Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Game Theory and Decision Science (AREA)
  • Human Resources & Organizations (AREA)
  • Primary Health Care (AREA)
  • Artificial Intelligence (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

Embodiments of the present application provide a travel recommendation method and device based on data mining. The method comprises: obtaining basic data and comment data from websites, establishing mapping relations among a basic data set, a scenic spot data set, and a sentiment data set, and building a big data analysis environment for travel recommendation; cleaning the data sets and normalizing their concepts; obtaining the user's browsing history, performing theme scenic spot analysis on each destination web page the user jumps to, and obtaining a sentiment analysis score for the comment data of that page; and, according to the sentiment data set, the user's dwell time on each web page, and the theme scenic spots of the destination web pages, combined with the basic scenic spots obtained by analyzing the basic data set, returning travel recommendation results after sorting. By incorporating data mining methods, the travel recommendation method and device of the embodiments improve the accuracy of travel recommendation.

Description

Travel recommendation method and device based on data mining
Technical field
The present application relates to the fields of travel recommendation and data mining, and in particular to a travel recommendation method and device based on data mining.
Background
A tourist attraction is an area whose main function, or one of whose main functions, is tourism and related activities. It can meet tourist needs such as sightseeing, vacationing, and health and fitness, has the corresponding tourist facilities, and is an independently managed area that provides the corresponding tourist services. Travel recommendation means providing the tourist with the most suitable travel route according to the tourist's actual situation, by combining information on tourism resources, the tourism economy, tourism activities, and travelers, so as to improve the tourist's travel experience. Traditional travel recommendation methods generally match on only a single dimension of the tourist's characteristics. With the arrival of the Internet era, many tourists plan their trips by ordering online, so it is worth considering analyzing a network user's browsing process to provide targeted travel recommendation services. User log mining, a branch of data mining, applies data mining techniques to the log data generated when site users access a web server, in order to discover users' access patterns, interests, and the like. This information is potentially useful, understandable, and previously unknown knowledge for website construction, and can be used to analyze how a website is accessed and to assist site management and decision support.
Therefore, data mining technology can be incorporated to design a travel recommendation method and device based on data mining.
Summary of the invention
In view of this, the purpose of the present application is to propose a travel recommendation method and device based on data mining that improve travel recommendation precision: by analyzing the sentiment of posts on websites, they achieve the technical effect of improving the accuracy of travel information recommendation.
Based on the above purpose, the present application proposes a travel recommendation method based on data mining, comprising:
Obtaining basic data and comment data from websites; extracting user attribute information from the basic data with a rule matching algorithm to form a basic data set; extracting scenic spot information from the comment data with a named entity recognition algorithm to form a scenic spot data set; extracting sentiment information from the comment data with a sentiment analysis algorithm to form a sentiment data set; establishing mapping relations among the basic data set, the scenic spot data set, and the sentiment data set and importing them into a data warehouse; and building a big data analysis environment for travel recommendation;
Cleaning the basic data set, the scenic spot data set, and the sentiment data set; normalizing the concepts in the basic data set and the scenic spot data set; building a basic travel recommendation index on the basic data set; and performing travel concept expansion on the scenic spot data set;
Obtaining the user's browsing history; extracting the user's dwell time on each web page and the jump order; recording the anchor text clicked during the user's jumps; performing theme scenic spot analysis on each destination web page; obtaining a sentiment analysis score for the comment data of the destination web page; and importing the results into the big data analysis environment for travel recommendation;
According to the sentiment data set, the user's web page dwell time, and the theme scenic spots of the destination web pages, combined with the basic scenic spots obtained by analyzing the basic data set, returning travel recommendation results after sorting.
In some embodiments, extracting scenic spot information from the comment data with a named entity recognition algorithm to form a scenic spot data set further comprises:
Obtaining the theme degree of each scenic spot in a web page according to the font size, color, and position of the scenic spot information in the web page, and determining the theme scenic spot of the web page after sorting.
In some embodiments, establishing the mapping relations among the basic data set, the scenic spot data set, and the sentiment data set comprises:
Establishing a first mapping relation between the basic data set and the scenic spot data set;
Establishing a second mapping relation between the scenic spot data set and the sentiment data set.
In some embodiments, the importance of each scenic spot in the web page, obtained according to the font size, color, and position of the scenic spot information in the web page, is calculated by the following formula:
$D = \sum_{i} \omega_i \cdot P_i$,
where $D$ is the theme degree of the scenic spot, $\omega_i$ is the weighting coefficient of the i-th web page attribute, and $P_i$ is the quantized value of the i-th web page attribute in the web page.
In some embodiments, building a basic travel recommendation index on the basic data set and performing travel concept expansion on the scenic spot data set comprises:
Querying a preset basic travel recommendation model with the basic user information entered at registration to obtain basic recommendation results, and establishing an index relation with the scenic spots in the basic recommendation results;
According to the scenic spot name, expanding to the scenic spots in the geographic area to which the scenic spot belongs, and extending further to obtain scenic spots having travel features.
In some embodiments, recording the anchor text clicked during the user's jumps and performing theme scenic spot analysis on the destination web page comprises:
Extracting the scenic spot information in the anchor text and performing semantic extension to obtain a first theme concept;
Performing theme scenic spot analysis on the destination web page to obtain a second theme concept;
Intersecting the first theme concept with the second theme concept to obtain the theme scenic spot concept that the user is interested in.
In some embodiments, obtaining the sentiment analysis score for the comment data of the destination web page comprises:
When the destination web page contains comments by the logged-in user, performing sentiment analysis directly on those comments to determine the sentiment analysis score of the destination web page;
When the destination web page contains no comments by the logged-in user, querying the sentiment data set to determine the sentiment analysis score of the destination web page.
In some embodiments, returning travel recommendation results after sorting, according to the sentiment data set, the user's web page dwell time, and the theme scenic spots of the destination web pages, combined with the basic scenic spots obtained by analyzing the basic data set, comprises:
When the theme scenic spots and the basic scenic spots have no intersection, returning both the theme scenic spots and the basic scenic spots as recommendation results.
Based on the above purpose, the present application also proposes a travel recommendation device based on data mining, comprising:
A construction module, for obtaining basic data and comment data from websites; extracting user attribute information from the basic data with a rule matching algorithm to form a basic data set; extracting scenic spot information from the comment data with a named entity recognition algorithm to form a scenic spot data set; extracting sentiment information from the comment data with a sentiment analysis algorithm to form a sentiment data set; establishing mapping relations among the basic data set, the scenic spot data set, and the sentiment data set and importing them into a data warehouse; and building a big data analysis environment for travel recommendation;
A sorting module, for cleaning the basic data set, the scenic spot data set, and the sentiment data set; normalizing the concepts in the basic data set and the scenic spot data set; building a basic travel recommendation index on the basic data set; and performing travel concept expansion on the scenic spot data set;
A jump module, for obtaining the user's browsing history; extracting the user's dwell time on each web page and the jump order; recording the anchor text clicked during the user's jumps; performing theme scenic spot analysis on each destination web page; obtaining a sentiment analysis score for the comment data of the destination web page; and importing the results into the big data analysis environment for travel recommendation;
A return module, for returning travel recommendation results after sorting, according to the sentiment data set, the user's web page dwell time, and the theme scenic spots of the destination web pages, combined with the basic scenic spots obtained by analyzing the basic data set.
In some embodiments, the construction module comprises:
A first mapping unit, for controlling task distribution and resource allocation, and for establishing the first mapping relation between the basic data set and the scenic spot data set;
A second mapping unit, for establishing the second mapping relation between the scenic spot data set and the sentiment data set.
Brief description of the drawings
In the drawings, unless otherwise specified, the same reference numerals denote the same or similar parts or elements throughout the several figures. The drawings are not necessarily drawn to scale. It should be understood that the drawings depict only some embodiments disclosed in accordance with the present invention and should not be regarded as limiting the scope of the present invention.
Fig. 1 shows a flowchart of the travel recommendation method based on data mining according to an embodiment of the present invention.
Fig. 2 shows a structural diagram of the travel recommendation device based on data mining according to an embodiment of the present invention.
Fig. 3 shows a structural diagram of the construction module according to an embodiment of the present invention.
Detailed description of the embodiments
The present application is described in further detail below with reference to the drawings and embodiments. It should be understood that the specific embodiments described here serve only to explain the invention and do not limit it. It should also be noted that, for ease of description, the drawings show only the parts relevant to the invention.
It should be noted that, provided they do not conflict, the embodiments of the present application and the features in those embodiments may be combined with one another. The present application is described in detail below with reference to the drawings and in conjunction with the embodiments.
Fig. 1 shows a flowchart of the travel recommendation method based on data mining according to an embodiment of the present invention. As shown in Fig. 1, the travel recommendation method based on data mining comprises:
Step S11: obtain basic data and comment data from websites; extract user attribute information from the basic data with a rule matching algorithm to form a basic data set; extract scenic spot information from the comment data with a named entity recognition algorithm to form a scenic spot data set; extract sentiment information from the comment data with a sentiment analysis algorithm to form a sentiment data set; establish mapping relations among the basic data set, the scenic spot data set, and the sentiment data set and import them into a data warehouse; and build a big data analysis environment for travel recommendation.
In one embodiment, extracting scenic spot information from the comment data with a named entity recognition algorithm to form a scenic spot data set further comprises:
Obtaining the theme degree of each scenic spot in a web page according to the font size, color, and position of the scenic spot information in the web page, and determining the theme scenic spot of the web page after sorting.
In one embodiment, establishing the mapping relations among the basic data set, the scenic spot data set, and the sentiment data set comprises:
Establishing a first mapping relation between the basic data set and the scenic spot data set;
Establishing a second mapping relation between the scenic spot data set and the sentiment data set.
Specifically, the basic data information of a scenic spot can be queried through the first mapping relation, and the scenic spot together with users' sentiment data for that scenic spot can be queried through the second mapping relation. The mapping can be implemented by, for example, building an index so that data can be looked up quickly.
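The index-based lookup described above can be sketched as follows. This is only a minimal illustration that assumes in-memory dictionaries in place of a data warehouse; the record fields (`user_id`, `spot`, `score`) and the function names are hypothetical and do not come from the application.

```python
from collections import defaultdict

# Hypothetical records; the field names are illustrative assumptions.
basic_records = [
    {"user_id": "u1", "spot": "Forbidden City"},
    {"user_id": "u2", "spot": "Great Wall"},
    {"user_id": "u1", "spot": "Great Wall"},
]
sentiment_records = [
    {"user_id": "u1", "spot": "Forbidden City", "score": 0.9},
    {"user_id": "u2", "spot": "Great Wall", "score": 0.4},
]

def build_first_mapping(records):
    """Index basic data by scenic spot (stand-in for the first mapping relation)."""
    index = defaultdict(list)
    for rec in records:
        index[rec["spot"]].append(rec["user_id"])
    return index

def build_second_mapping(records):
    """Index per-user sentiment by scenic spot (stand-in for the second mapping relation)."""
    index = defaultdict(dict)
    for rec in records:
        index[rec["spot"]][rec["user_id"]] = rec["score"]
    return index

first_mapping = build_first_mapping(basic_records)
second_mapping = build_second_mapping(sentiment_records)
```

With both indexes built, a scenic spot name is enough to reach the related basic data and the per-user sentiment values without scanning the full data sets.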
In one embodiment, the importance of each scenic spot in the web page, obtained according to the font size, color, and position of the scenic spot information in the web page, is calculated by the following formula:
$D = \sum_{i} \omega_i \cdot P_i$,
where $D$ is the theme degree of the scenic spot, $\omega_i$ is the weighting coefficient of the i-th web page attribute, and $P_i$ is the quantized value of the i-th web page attribute in the web page.
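As a worked illustration of the weighted sum D, the sketch below scores two scenic spots on one page and sorts them. The weights, the attribute order (font size, color, position), and the scaling of each quantized value to the range 0 to 1 are assumptions made for the example; the application does not specify them.

```python
def theme_degree(weights, values):
    """Compute D = sum(w_i * P_i) over the web page attributes."""
    if len(weights) != len(values):
        raise ValueError("weights and values must have the same length")
    return sum(w * p for w, p in zip(weights, values))

def rank_spots(page_spots, weights):
    """Return scenic spot names sorted by descending theme degree."""
    scored = {name: theme_degree(weights, vals) for name, vals in page_spots.items()}
    return sorted(scored, key=scored.get, reverse=True)

# Assumed attribute order: font size, color prominence, position.
weights = [0.5, 0.2, 0.3]
page_spots = {
    "Forbidden City": [1.0, 0.8, 1.0],
    "Great Wall": [0.4, 0.2, 0.5],
}
ranking = rank_spots(page_spots, weights)
```

The first entry of `ranking` plays the role of the page's theme scenic spot in this sketch.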
Step S12: clean the basic data set, the scenic spot data set, and the sentiment data set; normalize the concepts in the basic data set and the scenic spot data set; build a basic travel recommendation index on the basic data set; and perform travel concept expansion on the scenic spot data set.
In one embodiment, a preset basic travel recommendation model is queried with the basic user information entered at registration to obtain basic recommendation results, and an index relation is established with the scenic spots in the basic recommendation results;
In one embodiment, according to the scenic spot name, the scenic spots in the geographic area to which the scenic spot belongs are expanded, and the expansion is extended further to obtain scenic spots having travel features.
Specifically, concept expansion can be carried out by looking up public geographic information databases and travel databases.
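Concept normalization and region-based expansion could look roughly like the sketch below. The alias table and the region table are hypothetical stand-ins for the public geographic and travel databases mentioned above, and the function names are illustrative only.

```python
# Hypothetical lookup tables standing in for public databases.
ALIASES = {"the palace museum": "Forbidden City", "gugong": "Forbidden City"}
REGION_OF = {
    "Forbidden City": "Beijing",
    "Summer Palace": "Beijing",
    "West Lake": "Hangzhou",
}

def normalize(name):
    """Map a raw scenic spot mention to one canonical concept name."""
    cleaned = name.strip()
    return ALIASES.get(cleaned.lower(), cleaned)

def expand_by_region(spot):
    """Return the other scenic spots in the same geographic area, sorted by name."""
    region = REGION_OF.get(spot)
    if region is None:
        return []
    return sorted(s for s, r in REGION_OF.items() if r == region and s != spot)
```

Normalizing before expanding keeps different spellings of one scenic spot from producing separate expansion results.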
Step S13: obtain the user's browsing history; extract the user's dwell time on each web page and the jump order; record the anchor text clicked during the user's jumps; perform theme scenic spot analysis on each destination web page; obtain a sentiment analysis score for the comment data of the destination web page; and import the results into the big data analysis environment for travel recommendation.
Specifically, the theme scenic spot is the scenic spot that a web page mainly introduces or describes. For example, on a travel website a web page generally introduces one scenic spot or one geographic area, and that scenic spot or geographic area can be understood as the theme scenic spot of the web page.
In one embodiment, recording the anchor text clicked during the user's jumps and performing theme scenic spot analysis on the destination web page comprises:
Extracting the scenic spot information in the anchor text and performing semantic extension to obtain a first theme concept;
Performing theme scenic spot analysis on the destination web page to obtain a second theme concept;
Intersecting the first theme concept with the second theme concept to obtain the theme scenic spot concept that the user is interested in.
Specifically, semantic extension can be carried out through generalized ontology concept expansion. For example, when the word "Forbidden City" appears in the anchor text, it can be extended through ontology concepts to concepts related to the Forbidden City such as "Beijing", "Tiananmen Square", "Great Wall", and "Chuxiu Palace".
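The semantic extension and intersection steps above map naturally onto set operations. The sketch below assumes a tiny hand-written ontology dictionary built from the Forbidden City example; a real system would query an ontology or knowledge base, and the function names are hypothetical.

```python
# Toy ontology built from the example in the text; an assumption for illustration.
ONTOLOGY = {
    "Forbidden City": {"Beijing", "Tiananmen Square", "Great Wall", "Chuxiu Palace"},
}

def expand_concepts(spots, ontology):
    """First theme concept: anchor-text scenic spots plus their related concepts."""
    expanded = set(spots)
    for spot in spots:
        expanded |= ontology.get(spot, set())
    return expanded

def focused_theme_concepts(anchor_spots, page_theme_concepts, ontology):
    """Intersect the first theme concept with the second (destination page) one."""
    first = expand_concepts(anchor_spots, ontology)
    second = set(page_theme_concepts)
    return first & second
```

For example, an anchor text mentioning "Forbidden City" that leads to a page about "Beijing" yields `{"Beijing"}`, because the expansion step adds "Beijing" to the first theme concept.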
In one embodiment, obtaining the sentiment analysis score for the comment data of the destination web page comprises:
When the destination web page contains comments by the logged-in user, performing sentiment analysis directly on those comments to determine the sentiment analysis score of the destination web page;
When the destination web page contains no comments by the logged-in user, querying the sentiment data set to determine the sentiment analysis score of the destination web page.
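The two branches above amount to a simple fallback, sketched below. The word-counting scorer is a toy stand-in for a real sentiment analysis algorithm, and averaging the per-comment scores is an assumption; the application does not say how multiple comments are combined.

```python
def toy_analyze(text):
    """Stand-in scorer: +1 per positive word, -1 per negative word."""
    positive, negative = {"great", "beautiful"}, {"crowded", "boring"}
    words = text.lower().split()
    return sum((w in positive) - (w in negative) for w in words)

def page_sentiment_score(user_comments, spot, sentiment_dataset, analyze):
    """Score the page from the logged-in user's comments, else from the data set."""
    if user_comments:
        scores = [analyze(text) for text in user_comments]
        return sum(scores) / len(scores)  # assumed aggregation: mean
    # No comments by the logged-in user: fall back to the stored sentiment value.
    return sentiment_dataset.get(spot)
```

Passing the scorer in as the `analyze` argument keeps the fallback logic independent of whichever sentiment model is actually used.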
Step S14: according to the sentiment data set, the user's web page dwell time, and the theme scenic spots of the destination web pages, combined with the basic scenic spots obtained by analyzing the basic data set, return travel recommendation results after sorting.
Specifically, the web page dwell time and the theme information of the destination web pages reflect how much attention the user pays to a scenic spot. For example, the longer the user stays on a web page, the more closely the user is examining the theme scenic spot of that page and the higher the degree of attention. Likewise, the more closely the theme scenic spot of the next web page the user jumps to matches the theme scenic spot of the current page, the more it indicates that the user wants details about that theme scenic spot, which also reflects a higher degree of attention.
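One way to turn dwell time and sentiment into a sortable attention value is sketched below. The choice to multiply dwell time by the sentiment score and to sum over repeated visits is purely an assumption for illustration; the application states only that longer dwell time and matching theme scenic spots indicate higher attention.

```python
from collections import defaultdict

def attention_scores(visits):
    """Accumulate an attention value per theme scenic spot.

    visits: iterable of (theme_spot, dwell_seconds, sentiment_score) tuples.
    """
    totals = defaultdict(float)
    for spot, dwell_seconds, sentiment in visits:
        # Assumed combination rule: dwell time weighted by sentiment.
        totals[spot] += dwell_seconds * sentiment
    return dict(totals)

visits = [
    ("Forbidden City", 60, 0.5),
    ("Great Wall", 10, 1.0),
    ("Forbidden City", 30, 1.0),
]
scores = attention_scores(visits)
```

Repeated visits to pages with the same theme scenic spot raise its total, which loosely mirrors the observation that consecutive pages with matching theme scenic spots indicate higher attention.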
In one embodiment, returning travel recommendation results after sorting, according to the sentiment data set, the user's web page dwell time, and the theme scenic spots of the destination web pages, combined with the basic scenic spots obtained by analyzing the basic data set, comprises:
When the theme scenic spots and the basic scenic spots have no intersection, returning both the theme scenic spots and the basic scenic spots as recommendation results.
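The merge rule can be sketched as below. Only the no-intersection case (return both groups) comes from the text; placing shared scenic spots first when an intersection does exist is an assumption, as are the function and parameter names.

```python
def recommend(theme_scores, basic_spots):
    """Merge theme scenic spots with basic scenic spots into one ordered list.

    theme_scores: dict mapping theme scenic spot to an attention score.
    basic_spots: list of scenic spots from the basic recommendation model.
    """
    ranked_theme = sorted(theme_scores, key=theme_scores.get, reverse=True)
    basic_set = set(basic_spots)
    # Assumed ordering when both sources agree: shared spots come first.
    shared = [s for s in ranked_theme if s in basic_set]
    theme_only = [s for s in ranked_theme if s not in basic_set]
    basic_only = [s for s in basic_spots if s not in theme_scores]
    # With no intersection, `shared` is empty and both groups are returned in full.
    return shared + theme_only + basic_only
```

When the two sources share no scenic spot, the result is simply the ranked theme scenic spots followed by the basic scenic spots.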
Fig. 2 shows a structural diagram of the travel recommendation device based on data mining according to an embodiment of the present invention. As shown in Fig. 2, the travel recommendation device based on data mining can be divided as a whole into:
A construction module 21, for obtaining basic data and comment data from websites; extracting user attribute information from the basic data with a rule matching algorithm to form a basic data set; extracting scenic spot information from the comment data with a named entity recognition algorithm to form a scenic spot data set; extracting sentiment information from the comment data with a sentiment analysis algorithm to form a sentiment data set; establishing mapping relations among the basic data set, the scenic spot data set, and the sentiment data set and importing them into a data warehouse; and building a big data analysis environment for travel recommendation;
A sorting module 22, for cleaning the basic data set, the scenic spot data set, and the sentiment data set; normalizing the concepts in the basic data set and the scenic spot data set; building a basic travel recommendation index on the basic data set; and performing travel concept expansion on the scenic spot data set;
A jump module 23, for obtaining the user's browsing history; extracting the user's dwell time on each web page and the jump order; recording the anchor text clicked during the user's jumps; performing theme scenic spot analysis on each destination web page; obtaining a sentiment analysis score for the comment data of the destination web page; and importing the results into the big data analysis environment for travel recommendation;
A return module 24, for returning travel recommendation results after sorting, according to the sentiment data set, the user's web page dwell time, and the theme scenic spots of the destination web pages, combined with the basic scenic spots obtained by analyzing the basic data set.
Fig. 3 shows a structural diagram of the construction module according to an embodiment of the present invention.
As can be seen from Fig. 3, the construction module 21 comprises:
A first mapping unit 211, for controlling task distribution and resource allocation, and for establishing the first mapping relation between the basic data set and the scenic spot data set;
A second mapping unit 212, for establishing the second mapping relation between the scenic spot data set and the sentiment data set.
In the description of this specification, reference term " one embodiment ", " some embodiments ", " example ", " specifically show The description of example " or " some examples " etc. means specific features, structure, material or spy described in conjunction with this embodiment or example Point is included at least one embodiment or example of the invention.Moreover, particular features, structures, materials, or characteristics described It may be combined in any suitable manner in any one or more of the embodiments or examples.In addition, without conflicting with each other, this The technical staff in field can be by the spy of different embodiments or examples described in this specification and different embodiments or examples Sign is combined.
Any process described otherwise above or method description are construed as in flow chart or herein, and expression includes It is one or more for realizing specific logical function or process the step of executable instruction code module, segment or portion Point, and the range of the preferred embodiment of the present invention includes other realization, wherein can not press shown or discussed suitable Sequence, including according to related function by it is basic simultaneously in the way of or in the opposite order, Lai Zhihang function, this should be of the invention Embodiment person of ordinary skill in the field understood.
Expression or logic and/or step described otherwise above herein in flow charts, for example, being considered use In the order list for the executable instruction for realizing logic function, may be embodied in any computer-readable medium, for Instruction execution system, device or equipment (such as computer based system, including the system of processor or other can be held from instruction The instruction fetch of row system, device or equipment and the system executed instruction) it uses, or combine these instruction execution systems, device or set It is standby and use.For the purpose of this specification, " computer-readable medium ", which can be, any may include, stores, communicates, propagates or pass Defeated program is for instruction execution system, device or equipment or the dress used in conjunction with these instruction execution systems, device or equipment It sets.The more specific example (non-exhaustive list) of computer-readable medium include the following: there is the electricity of one or more wirings Interconnecting piece (electronic device), portable computer diskette box (magnetic device), random access memory (RAM), read-only memory (ROM), erasable edit read-only storage (EPROM or flash memory), fiber device and portable read-only memory (CDROM).In addition, computer-readable medium can even is that the paper that can print described program on it or other suitable Jie Matter, because can then be edited, be interpreted or when necessary with other for example by carrying out optical scanner to paper or other media Suitable method is handled electronically to obtain described program, is then stored in computer storage.
It should be appreciated that each section of the invention can be realized with hardware, software, firmware or their combination.Above-mentioned In embodiment, software that multiple steps or method can be executed in memory and by suitable instruction execution system with storage Or firmware is realized.It, and in another embodiment, can be under well known in the art for example, if realized with hardware Any one of column technology or their combination are realized: having a logic gates for realizing logic function to data-signal Discrete logic, with suitable combinational logic gate circuit specific integrated circuit, programmable gate array (PGA), scene Programmable gate array (FPGA) etc..
Those skilled in the art are understood that realize all or part of step that above-described embodiment method carries It suddenly is that relevant hardware can be instructed to complete by program, the program can store in a kind of computer-readable storage medium In matter, which when being executed, includes the steps that one or a combination set of embodiment of the method.
It, can also be in addition, each functional unit in each embodiment of the present invention can integrate in a processing module It is that each unit physically exists alone, can also be integrated in two or more units in a module.Above-mentioned integrated mould Block both can take the form of hardware realization, can also be realized in the form of software function module.The integrated module is such as Fruit is realized and when sold or used as an independent product in the form of software function module, also can store in a computer In readable storage medium storing program for executing.The storage medium can be read-only memory, disk or CD etc..
The above are merely specific embodiments of the present invention, but the protection scope of the present invention is not limited thereto. Any change or substitution that a person skilled in the art can readily conceive within the technical scope disclosed by the present invention shall fall within the protection scope of the present invention. Therefore, the protection scope of the present invention shall be subject to the protection scope of the claims.

Claims (10)

1. A travel recommendation method based on data mining, characterized by comprising:
obtaining basic data and comment data from websites; extracting user attribute information from the basic data by a rule matching algorithm to form a basic data set; extracting scenic spot information from the comment data by a named entity recognition algorithm to form a scenic spot data set; extracting sentiment information from the comment data by a sentiment analysis algorithm to form a sentiment data set; establishing mapping relations among the basic data set, the scenic spot data set, and the sentiment data set; and importing them into a data warehouse to build a travel recommendation big data analysis environment;
cleaning the basic data set, the scenic spot data set, and the sentiment data set; normalizing the concepts in the basic data set and the scenic spot data set; building a basic travel recommendation index on the basic data set; and performing tourism concept expansion on the scenic spot data set;
obtaining a user's browsing history; extracting the user's dwell time on each webpage and the jump sequence; recording the anchor text clicked during the user's jumps; performing topic scenic spot analysis on each jump webpage (the page the user navigates to); obtaining a sentiment analysis score for the comment data of the jump webpage; and importing the results into the travel recommendation big data analysis environment;
ranking and returning travel recommendation results according to the sentiment data set, the user's webpage dwell time, and the topic scenic spots of the user's jump webpages, in combination with the basic scenic spots obtained by analyzing the basic data set.
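The browsing-history step of claim 1 can be illustrated with a minimal sketch. It assumes the history is available as timestamped page-entry events; the `PageVisit` record, its field names, and the `session_end` parameter are hypothetical and do not come from the patent.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class PageVisit:
    """One page-entry event (hypothetical record layout)."""
    url: str
    entered_at: float   # seconds since the start of the session
    anchor_text: str    # anchor text clicked to reach this page ("" for the first page)


def dwell_times_and_sequence(
    visits: List[PageVisit], session_end: float
) -> Tuple[List[Tuple[str, float]], List[str]]:
    """Return (url, dwell time) pairs and the jump sequence in visit order."""
    ordered = sorted(visits, key=lambda v: v.entered_at)
    dwell = []
    for i, visit in enumerate(ordered):
        # A page is left when the next page is entered, or when the session ends.
        left_at = ordered[i + 1].entered_at if i + 1 < len(ordered) else session_end
        dwell.append((visit.url, left_at - visit.entered_at))
    sequence = [v.url for v in ordered]
    return dwell, sequence
```

Treating the entry time of the next page as the exit time of the current one is a simplifying assumption; a real log might record explicit leave events.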
2. The method according to claim 1, characterized in that extracting the scenic spot information from the comment data by the named entity recognition algorithm to form the scenic spot data set further comprises:
obtaining the topic degree of each scenic spot in a webpage according to the font size, color, and position of the scenic spot information in the webpage, and determining the topic scenic spots of the webpage after ranking.
3. The method according to claim 1, characterized in that establishing the mapping relations among the basic data set, the scenic spot data set, and the sentiment data set comprises:
establishing a first mapping relation between the basic data set and the scenic spot data set;
establishing a second mapping relation between the scenic spot data set and the sentiment data set.
4. The method according to claim 2, characterized in that the topic degree of each scenic spot in the webpage, obtained according to the font size, color, and position of the scenic spot information in the webpage, is calculated by the following formula:
$$D = \sum_{i} \omega_i \cdot P_i,$$
where $D$ is the topic degree of the scenic spot, $\omega_i$ is the weighting coefficient of the $i$-th webpage attribute, and $P_i$ is the quantized value of the $i$-th webpage attribute in the webpage.
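The weighted sum in claim 4 can be sketched as follows. The attribute names (`font_size`, `color`, `position`), the weights, and the quantized values are illustrative assumptions; the patent does not specify how the attributes are quantized or weighted.

```python
from typing import Dict, List


def topic_degree(weights: Dict[str, float], values: Dict[str, float]) -> float:
    """Compute D = sum_i w_i * P_i; attributes missing from `values` count as 0."""
    return sum(w * values.get(name, 0.0) for name, w in weights.items())


def rank_spots(weights: Dict[str, float],
               spots: Dict[str, Dict[str, float]]) -> List[str]:
    """Rank scenic spot names by descending topic degree."""
    return sorted(spots, key=lambda name: topic_degree(weights, spots[name]),
                  reverse=True)


# Hypothetical weights and quantized attribute values in [0, 1].
weights = {"font_size": 0.5, "color": 0.2, "position": 0.3}
spots = {
    "Leifeng Pagoda": {"font_size": 0.5, "color": 0.0, "position": 0.5},
    "West Lake": {"font_size": 1.0, "color": 1.0, "position": 1.0},
}
ranking = rank_spots(weights, spots)
```

The first entries of `ranking` would then serve as the topic scenic spots of the webpage; how many entries to keep is left open here.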
5. The method according to claim 1, characterized in that building the basic travel recommendation index on the basic data set and performing tourism concept expansion on the scenic spot data set comprises:
querying a preset basic travel recommendation model according to the basic user information entered at user registration to obtain basic recommendation results, and establishing an index relation with the scenic spots in the basic recommendation results;
expanding, according to the scenic spot name, to the scenic spots in the geographic region to which the scenic spot belongs, and further expanding to obtain scenic spots having the travel feature.
6. The method according to claim 1, characterized in that recording the anchor text clicked during the user's jumps and performing topic scenic spot analysis on the jump webpage comprises:
extracting the scenic spot information from the anchor text and performing semantic expansion on it to obtain a first topic concept;
performing topic scenic spot analysis on the jump webpage to obtain a second topic concept;
performing an intersection operation on the first topic concept and the second topic concept to obtain the topic scenic spot concept that the user is interested in.
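The intersection step of claim 6 can be illustrated with plain set operations. This is a sketch under the assumption that semantic expansion is a lookup in a table of related scenic spots; the `related` table and the function names are hypothetical.

```python
from typing import Dict, Set


def expand(spots: Set[str], related: Dict[str, Set[str]]) -> Set[str]:
    """Semantic expansion: add every spot related to a spot named in the anchor text."""
    expanded = set(spots)
    for spot in spots:
        expanded |= related.get(spot, set())
    return expanded


def user_topic_concepts(anchor_spots: Set[str],
                        page_topic_spots: Set[str],
                        related: Dict[str, Set[str]]) -> Set[str]:
    """Intersect the expanded anchor-text concepts with the page's topic concepts."""
    first_concept = expand(anchor_spots, related)   # first topic concept
    second_concept = set(page_topic_spots)          # second topic concept
    return first_concept & second_concept
```

For instance, if the anchor text names "West Lake", the table relates it to "Leifeng Pagoda", and the jump webpage is mainly about "Leifeng Pagoda", the intersection contains "Leifeng Pagoda".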
7. The method according to claim 1, characterized in that obtaining the sentiment analysis score for the comment data of the jump webpage comprises:
when the jump webpage contains comment information from the logged-in user, performing sentiment analysis directly on the comment information to determine the sentiment analysis score of the jump webpage;
when the jump webpage contains no comment information from the logged-in user, querying the sentiment data set to determine the sentiment analysis score of the jump webpage.
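The two branches of claim 7 amount to a fallback lookup, sketched below. The toy lexicon analyzer, the averaging of several comments, and the dictionary layout of the sentiment data set are all assumptions made for illustration; the patent does not fix a particular sentiment analysis algorithm.

```python
from typing import Callable, Dict, List, Optional


def toy_analyze(text: str) -> float:
    """Hypothetical stand-in analyzer: positive word count minus negative word count."""
    positive = {"beautiful", "great"}
    negative = {"crowded", "dirty"}
    words = text.lower().split()
    return float(sum(w in positive for w in words) - sum(w in negative for w in words))


def page_sentiment_score(user_comments: List[str],
                         page_id: str,
                         sentiment_dataset: Dict[str, float],
                         analyze: Callable[[str], float] = toy_analyze) -> Optional[float]:
    """Score a jump webpage from the user's own comments, else from the data set."""
    if user_comments:
        # Branch 1: the logged-in user commented on this page; analyze directly.
        scores = [analyze(comment) for comment in user_comments]
        return sum(scores) / len(scores)  # averaging is an assumption
    # Branch 2: no user comment; fall back to the precomputed sentiment data set.
    return sentiment_dataset.get(page_id)
```

Returning `None` for a page that is absent from the data set leaves the handling of unknown pages to the caller.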
8. The method according to claim 1, characterized in that ranking and returning the travel recommendation results according to the sentiment data set, the user's webpage dwell time, and the topic scenic spots of the user's jump webpages, in combination with the basic scenic spots obtained by analyzing the basic data set, comprises:
when there is no intersection between the topic scenic spots and the basic scenic spots, returning both the topic scenic spots and the basic scenic spots as recommendation results.
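The disjoint case in claim 8 can be sketched as below. Claim 8 only covers the case where the two lists share no scenic spot; returning just the overlapping spots otherwise is an assumption of this sketch, as is the function name.

```python
from typing import List


def merge_recommendations(topic_spots: List[str], basic_spots: List[str]) -> List[str]:
    """Combine topic scenic spots with basic scenic spots into one result list."""
    basic = set(basic_spots)
    overlap = [spot for spot in topic_spots if spot in basic]
    if overlap:
        # ASSUMPTION: behavior for overlapping lists is not specified by claim 8.
        return overlap
    # Disjoint case from claim 8: return all topic spots and all basic spots.
    merged: List[str] = []
    seen = set()
    for spot in topic_spots + basic_spots:
        if spot not in seen:
            seen.add(spot)
            merged.append(spot)
    return merged
```

Listing topic spots before basic spots in the disjoint case is a design choice of the sketch; any ranking by sentiment score or dwell time would be applied on top of this list.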
9. A travel recommendation device based on data mining, characterized by comprising:
a construction module, configured to obtain basic data and comment data from websites; extract user attribute information from the basic data by a rule matching algorithm to form a basic data set; extract scenic spot information from the comment data by a named entity recognition algorithm to form a scenic spot data set; extract sentiment information from the comment data by a sentiment analysis algorithm to form a sentiment data set; establish mapping relations among the basic data set, the scenic spot data set, and the sentiment data set; and import them into a data warehouse to build a travel recommendation big data analysis environment;
a sorting module, configured to clean the basic data set, the scenic spot data set, and the sentiment data set; normalize the concepts in the basic data set and the scenic spot data set; build a basic travel recommendation index on the basic data set; and perform tourism concept expansion on the scenic spot data set;
a jump module, configured to obtain a user's browsing history; extract the user's dwell time on each webpage and the jump sequence; record the anchor text clicked during the user's jumps; perform topic scenic spot analysis on each jump webpage; obtain a sentiment analysis score for the comment data of the jump webpage; and import the results into the travel recommendation big data analysis environment;
a return module, configured to rank and return travel recommendation results according to the sentiment data set, the user's webpage dwell time, and the topic scenic spots of the user's jump webpages, in combination with the basic scenic spots obtained by analyzing the basic data set.
10. The device according to claim 9, wherein the construction module comprises:
a first mapping unit, configured to control task distribution and resource allocation, and to establish a first mapping relation between the basic data set and the scenic spot data set;
a second mapping unit, configured to establish a second mapping relation between the scenic spot data set and the sentiment data set.
CN201910380945.3A 2019-05-08 2019-05-08 travel recommendation method and device based on data mining Active CN110245286B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910380945.3A CN110245286B (en) 2019-05-08 2019-05-08 travel recommendation method and device based on data mining

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910380945.3A CN110245286B (en) 2019-05-08 2019-05-08 travel recommendation method and device based on data mining

Publications (2)

Publication Number Publication Date
CN110245286A true CN110245286A (en) 2019-09-17
CN110245286B CN110245286B (en) 2020-01-31

Family

ID=67883835

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910380945.3A Active CN110245286B (en) 2019-05-08 2019-05-08 travel recommendation method and device based on data mining

Country Status (1)

Country Link
CN (1) CN110245286B (en)

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105718184A (en) * 2014-12-05 2016-06-29 北京搜狗科技发展有限公司 Data processing method and apparatus
CN105427209A (en) * 2015-11-24 2016-03-23 余元辉 Panoramic smart travel system
CN106202252A (en) * 2016-06-29 2016-12-07 厦门趣处网络科技有限公司 Method, system are recommended in a kind of trip analyzed based on user emotion
US20180053121A1 (en) * 2016-08-17 2018-02-22 International Business Machines Corporation Intelligent travel planning
CN107423837A (en) * 2017-04-12 2017-12-01 宁夏丝路风情旅游网络股份有限公司 The Intelligent planning method and system of tourism route
CN108681739A (en) * 2018-03-26 2018-10-19 安徽师范大学 One kind recommending method based on user feeling and time dynamic tourist famous-city
CN109284443A (en) * 2018-11-28 2019-01-29 四川亨通网智科技有限公司 A kind of tourism recommended method and system based on crawler technology

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
王少兵 等: "采用在线评论的景点个性化推荐", 《华侨大学学报(自然科学版)》 *

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111191127A (en) * 2019-12-24 2020-05-22 重庆特斯联智慧科技股份有限公司 Travel recommendation method and system based on correlation analysis algorithm
CN111612590A (en) * 2020-03-19 2020-09-01 江苏智檬智能科技有限公司 Scenic spot recommendation method and device based on artificial intelligence big data
CN117077901A (en) * 2023-10-17 2023-11-17 北京铭洋商务服务有限公司 Travel data processing method and system
CN117077901B (en) * 2023-10-17 2024-01-05 北京铭洋商务服务有限公司 Travel data processing method and system

Also Published As

Publication number Publication date
CN110245286B (en) 2020-01-31

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant