CN110245286A - A kind of travelling recommended method and device based on data mining - Google Patents
A kind of travelling recommended method and device based on data mining Download PDFInfo
- Publication number
- CN110245286A CN110245286A CN201910380945.3A CN201910380945A CN110245286A CN 110245286 A CN110245286 A CN 110245286A CN 201910380945 A CN201910380945 A CN 201910380945A CN 110245286 A CN110245286 A CN 110245286A
- Authority
- CN
- China
- Prior art keywords
- data collection
- webpage
- sight spot
- data
- basic
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000000034 method Methods 0.000 title claims abstract description 42
- 238000007418 data mining Methods 0.000 title claims abstract description 19
- 238000013480 data collection Methods 0.000 claims abstract description 124
- 238000004458 analytical method Methods 0.000 claims abstract description 49
- 238000013507 mapping Methods 0.000 claims abstract description 23
- 230000009191 jumping Effects 0.000 claims abstract description 16
- 238000007405 data analysis Methods 0.000 claims abstract description 12
- 239000000284 extract Substances 0.000 claims description 12
- 230000008451 emotion Effects 0.000 claims description 9
- 230000008676 import Effects 0.000 claims description 8
- 230000015572 biosynthetic process Effects 0.000 claims description 6
- 238000013468 resource allocation Methods 0.000 claims description 3
- 230000006870 function Effects 0.000 description 9
- 230000008569 process Effects 0.000 description 6
- 239000000203 mixture Substances 0.000 description 4
- 101100100125 Mus musculus Traip gene Proteins 0.000 description 3
- 238000005516 engineering process Methods 0.000 description 3
- 230000000875 corresponding effect Effects 0.000 description 2
- 230000000694 effects Effects 0.000 description 2
- 239000000463 material Substances 0.000 description 2
- 230000004913 activation Effects 0.000 description 1
- 238000009412 basement excavation Methods 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 230000002596 correlated effect Effects 0.000 description 1
- 235000013399 edible fruits Nutrition 0.000 description 1
- 230000005611 electricity Effects 0.000 description 1
- 239000000835 fiber Substances 0.000 description 1
- 230000036541 health Effects 0.000 description 1
- 238000007726 management method Methods 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 230000000149 penetrating effect Effects 0.000 description 1
- 238000013439 planning Methods 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/953—Querying, e.g. by the use of web search engines
- G06F16/9535—Search customisation based on user profiles and personalisation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/30—Semantic analysis
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q30/00—Commerce
- G06Q30/02—Marketing; Price estimation or determination; Fundraising
- G06Q30/0201—Market modelling; Market analysis; Collecting market data
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q30/00—Commerce
- G06Q30/02—Marketing; Price estimation or determination; Fundraising
- G06Q30/0201—Market modelling; Market analysis; Collecting market data
- G06Q30/0203—Market surveys; Market polls
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q50/00—Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
- G06Q50/10—Services
- G06Q50/14—Travel agencies
Landscapes
- Engineering & Computer Science (AREA)
- Business, Economics & Management (AREA)
- Theoretical Computer Science (AREA)
- Strategic Management (AREA)
- Accounting & Taxation (AREA)
- Development Economics (AREA)
- Finance (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Entrepreneurship & Innovation (AREA)
- General Business, Economics & Management (AREA)
- Tourism & Hospitality (AREA)
- Economics (AREA)
- Marketing (AREA)
- Data Mining & Analysis (AREA)
- Databases & Information Systems (AREA)
- General Health & Medical Sciences (AREA)
- General Engineering & Computer Science (AREA)
- Health & Medical Sciences (AREA)
- Game Theory and Decision Science (AREA)
- Human Resources & Organizations (AREA)
- Primary Health Care (AREA)
- Artificial Intelligence (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Computational Linguistics (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
The embodiment of the present application provides a kind of travelling recommended method and device based on data mining.This method comprises: obtaining the basic data in website, comment data, basic data collection, scene data collection, the mapping relations between affection data collection are established, travelling is built and recommends big data analysis environment;Data set is cleaned, concept is normalized;It obtains user and browses history, carry out the analysis of purport sight spot to webpage is jumped, and obtain the sentiment analysis assignment scoring for jumping webpage comment data;The purport sight spot of webpage is jumped according to affection data collection, user's webpage residence time and user, the basic sight spot that binding analysis basic data collection obtains returns to travelling recommendation results after sequence.A kind of travelling recommended method and device based on data mining of the embodiment of the present application improves travelling and recommends accuracy by combined data method for digging.
Description
Technical field
This application involves travelling recommendation and the field of data mining more particularly to a kind of travelling recommendation sides based on data mining
Method and device.
Background technique
Tourist attractions refer to using tourism and its correlated activation as the region place of one of major function or major function, energy
Enough meet the tourism demands such as tourist's visit and sightseeing, vacation tourism, health and happiness body-building, have corresponding tourist facilities and provides corresponding
Area is managed in tourist service independently.Travelling is recommended, and refers to the actual conditions according to tourist, comprehensive tourist resources, Tourism Economy, trip
The information of trip activity, traveller etc. provides the touring line for being most suitable for tourist to tourist, to improve the tourist article of tourist
It tests.Tradition travelling recommended method is general only to be matched by the single dimensional characteristics of tourist.With the arrival of cybertimes,
Many tourists are that travelling planning is carried out by way of network order, and it can be considered to the browsings by the analysis network user
Process targetedly provides travelling recommendation service for tourist.User journal excavation in data mining refers to is dug using data
Pick technology is analyzed and processed the daily record data generated during site users access network server, to find network
Access module and hobby of user etc., these information are to the intelligible unknown message of web construction potentially useful and know
Know, for analyzing the accessed situation of website, secondary site management and decision support etc..
Therefore, it may be considered that fused data digging technology designs travelling recommended method and device based on data mining.
Summary of the invention
In view of this, the purpose of the application is to propose a kind of travelling recommended method and device based on data mining, mention
Precision is recommended in height travelling, by emotion of posting in analyzing web site, realizes and improves the technical effect that travel information recommends accuracy.
Based on above-mentioned purpose, the travelling recommended method based on data mining that present applicant proposes a kind of, comprising:
Basic data, the comment data in website are obtained, is extracted in the basic data by rule matching algorithm
Customer attribute information basis of formation data set is formed by naming entity identification algorithms to extract the sight spot information in comment data
Scene data collection extracts the emotion information in comment data by sentiment analysis algorithm and forms affection data collection, described in foundation
Basic data collection, scene data collection, the mapping relations between affection data collection, and import in data warehouse, it builds travelling and recommends
Big data analysis environment;
The basic data collection, the scene data collection, the affection data collection are cleaned, to the basic data
The concept that collection, scene data are concentrated is normalized, and carries out basis recommendation index of travelling to the basic data collection, right
The scene data collection carries out Concept of Tourism expansion;
It obtains user and browses history, extract user in the residence time of each webpage and jump sequence, record user jumps
The Anchor Text information clicked in the process carries out the analysis of purport sight spot to webpage is jumped, and obtains the feelings for jumping webpage comment data
Sense analysis assignment scoring, imports the travelling and recommends big data analysis environment;
The purport sight spot of webpage, binding analysis are jumped according to the affection data collection, user's webpage residence time and user
The basic sight spot that the basic data collection obtains returns to travelling recommendation results after sequence.
In some embodiments, described to be formed by naming entity identification algorithms to extract the sight spot information in comment data
Scene data collection, further includes:
According to font size, color, position of the sight spot information in webpage, the purport degree at each sight spot in webpage is obtained,
By the purport sight spot for determining webpage after sequence.
In some embodiments, described to establish the basic data collection, scene data collection, the mapping between affection data collection
Relationship, comprising:
Establish the first mapping relations between the basic data collection and the scene data collection;
Establish the second mapping relations between the scene data collection and the affection data collection.
In some embodiments, the font size according to sight spot information in webpage, color, position, obtain webpage
In each sight spot different degree, be calculated by the following formula:
D=∑ ωi·Pi,
Wherein D is the purport degree at sight spot, ωiFor the weighting coefficient of i-th of webpage attribute, PiFor i-th of net in the webpage
The quantized value of page attribute.
In some embodiments, described that basis recommendation index of travelling is carried out to the basic data collection, to the sight spot number
Concept of Tourism expansion is carried out according to collection, comprising:
The user base information inputted when according to user's registration inquires the basic recommended models of preset travelling, obtains basis
Recommendation results, and establish and establish index relative with the sight spot in basic recommendation results;
According to sight name, expand the sight spot in the affiliated geographic area in sight spot out, and extends and obtain with travelling feature
Sight spot.
In some embodiments, the Anchor Text information clicked in record user's jump procedure, carries out to webpage is jumped
The analysis of purport sight spot, comprising:
The sight spot information in Anchor Text is extracted, and carries out semantic extension, obtains the first purport concept;
The analysis of purport sight spot is carried out to webpage is jumped, obtains the second purport concept;
The first purport concept and the second purport concept are subjected to intersection operation, obtain the purport scape of user's concern
Point concept.
It is in some embodiments, described to obtain the sentiment analysis assignment scoring for jumping webpage comment data, comprising:
When jumping in webpage there are when the comment information of login user, sentiment analysis directly is carried out to comment information, is determined
Jump the sentiment analysis assignment scoring of webpage;
When jumping the comment information that login user is not present in webpage, the affection data collection is inquired, determination jumps net
The sentiment analysis assignment scoring of page.
In some embodiments, described that webpage is jumped according to the affection data collection, user's webpage residence time and user
Purport sight spot, the basic sight spot that basic data collection described in binding analysis obtains returns to travelling recommendation results after sequence, comprising:
When intersection is not present between the purport sight spot and the basic sight spot, by the purport sight spot and the basis
Sight spot is all used as recommendation results to return.
Based on above-mentioned purpose, the application also proposed a kind of travelling recommendation apparatus based on data mining, comprising:
Module is constructed, for obtaining the basic data in website, comment data, is extracted by rule matching algorithm described
Customer attribute information basis of formation data set in basic data, by naming entity identification algorithms to extract in comment data
Sight spot information forms scene data collection, extracts the emotion information in comment data by sentiment analysis algorithm and forms affection data
Collection is established the basic data collection, scene data collection, the mapping relations between affection data collection, and is imported in data warehouse, takes
It builds travelling and recommends big data analysis environment;
Sorting module, for being cleaned to the basic data collection, the scene data collection, the affection data collection,
The concept concentrated to the basic data collection, scene data is normalized, and travels to the basic data collection
Index is recommended on basis, carries out Concept of Tourism expansion to the scene data collection;
Jump module browses history for obtaining user, extracts user in the residence time of each webpage and jumps sequence,
The Anchor Text information clicked in record user's jump procedure carries out the analysis of purport sight spot to webpage is jumped, and obtains and jump webpage
The sentiment analysis assignment of comment data scores, and imports the travelling and recommends big data analysis environment;
Return module, for jumping the purport of webpage according to the affection data collection, user's webpage residence time and user
Sight spot, the basic sight spot that basic data collection described in binding analysis obtains return to travelling recommendation results after sequence.
In some embodiments, the building module, comprising:
First map unit, distribution, resource allocation for control task;Establish the basic data collection and the sight spot
The first mapping relations between data set;
Second map unit, the second mapping for establishing between the scene data collection and the affection data collection are closed
System.
Detailed description of the invention
In the accompanying drawings, unless specified otherwise herein, otherwise indicate the same or similar through the identical appended drawing reference of multiple attached drawings
Component or element.What these attached drawings were not necessarily to scale.It should be understood that these attached drawings depict only according to the present invention
Disclosed some embodiments, and should not serve to limit the scope of the present invention.
Fig. 1 shows the flow chart of the travelling recommended method according to an embodiment of the present invention based on data mining.
The composition figure for the travelling recommendation apparatus that Fig. 2 shows according to an embodiment of the present invention based on data mining.
Fig. 3 shows the composition figure of building module according to an embodiment of the present invention.
Specific embodiment
The application is described in further detail with reference to the accompanying drawings and examples.It is understood that this place is retouched
The specific embodiment stated is only used for explaining related invention, rather than the restriction to the invention.It also should be noted that in order to just
Part relevant to related invention is illustrated only in description, attached drawing.
It should be noted that in the absence of conflict, the features in the embodiments and the embodiments of the present application can phase
Mutually combination.The application is described in detail below with reference to the accompanying drawings and in conjunction with the embodiments.
Fig. 1 shows the flow chart of the travelling recommended method according to an embodiment of the present invention based on data mining.Such as Fig. 1 institute
Show, being somebody's turn to do the travelling recommended method based on data mining includes:
Step S11, basic data, the comment data in website are obtained, the basis is extracted by rule matching algorithm
Customer attribute information basis of formation data set in data, by naming entity identification algorithms to extract the sight spot in comment data
Information forms scene data collection, extracts the emotion information in comment data by sentiment analysis algorithm and forms affection data collection,
The basic data collection, scene data collection, the mapping relations between affection data collection are established, and are imported in data warehouse, are built
Big data analysis environment is recommended in travelling.
In one embodiment, described by naming entity identification algorithms to extract the sight spot information shape in comment data
At scene data collection, further includes:
According to font size, color, position of the sight spot information in webpage, the purport degree at each sight spot in webpage is obtained,
By the purport sight spot for determining webpage after sequence.
In one embodiment, described to establish the basic data collection, scene data collection, reflecting between affection data collection
Penetrate relationship, comprising:
Establish the first mapping relations between the basic data collection and the scene data collection;
Establish the second mapping relations between the scene data collection and the affection data collection.
Specifically, the basic data data information that sight spot can be inquired by the first mapping relations is reflected with various countries second
The relationship of penetrating can inquire sight spot and user for the affection data information at the sight spot.Mapping process can be by establishing the side such as index
The quick lookup of formula realization data.
In one embodiment, the font size according to sight spot information in webpage, color, position, obtain net
The different degree at each sight spot in page, is calculated by the following formula:
D=∑ ωi·Pi,
Wherein D is the purport degree at sight spot, ωiFor the weighting coefficient of i-th of webpage attribute, PiFor i-th of net in the webpage
The quantized value of page attribute.
Step S12, the basic data collection, the scene data collection, the affection data collection are cleaned, to described
The concept that basic data collection, scene data are concentrated is normalized, and carries out travelling basis to the basic data collection and push away
Index is recommended, Concept of Tourism expansion is carried out to the scene data collection.
In a kind of real-time mode, the user base information inputted when according to user's registration inquires preset travelling basis
Recommended models obtain basic recommendation results, and establish and establish index relative with the sight spot in basic recommendation results;
In one embodiment, according to sight name, expand the sight spot in the affiliated geographic area in sight spot out, and extend
To the sight spot with travelling feature.
Specifically, concept approximation can carry out by searching for disclosed geographic information database and travel database.
Step S13, it obtains user and browses history, extract user in the residence time of each webpage and jump sequence, record
The Anchor Text information clicked in user's jump procedure carries out the analysis of purport sight spot to webpage is jumped, and obtains and jump webpage comment
The sentiment analysis assignment of data scores, and imports the travelling and recommends big data analysis environment.
Specifically, purport sight spot refers to the sight spot mainly introduced or described in a webpage.For example, in travel site,
A general webpage can introduce a sight spot or a geographic area, the sight spot or geographic area and be construed as the webpage
Purport sight spot.
In one embodiment, the Anchor Text information clicked in record user's jump procedure, to jump webpage into
The analysis of row purport sight spot, comprising:
The sight spot information in Anchor Text is extracted, and carries out semantic extension, obtains the first purport concept;
The analysis of purport sight spot is carried out to webpage is jumped, obtains the second purport concept;
The first purport concept and the second purport concept are subjected to intersection operation, obtain the purport scape of user's concern
Point concept.
Specifically, semantic extension process can carry out in such a way that the Ontological concept of broad sense extends.For example, when anchor text
There is " the Forbidden City " two word in this information is that " Beijing ", " Tian'anmen Square ", " Great Wall ", " storage can be extended to by Ontological concept
The concept relevant to " the Forbidden City " such as elegant palace ".
It is in one embodiment, described to obtain the sentiment analysis assignment scoring for jumping webpage comment data, comprising:
When jumping in webpage there are when the comment information of login user, sentiment analysis directly is carried out to comment information, is determined
Jump the sentiment analysis assignment scoring of webpage;
When jumping the comment information that login user is not present in webpage, the affection data collection is inquired, determination jumps net
The sentiment analysis assignment scoring of page.
Step S14, the purport sight spot of webpage is jumped according to the affection data collection, user's webpage residence time and user,
The basic sight spot that basic data collection described in binding analysis obtains returns to travelling recommendation results after sequence.
Specifically, webpage residence time and the purport information for jumping webpage can embody concern of the user to the sight spot
Degree.For example, user pays close attention to more refinement for purport sight spot pointed by the webpage when user is longer in the residence time of webpage
It causes, degree of concern is also higher;User jumps to the purport sight spot of next webpage and the purport sight spot of the webpage by the webpage
The goodness of fit is higher, illustrates that user expects the details at the purport sight spot, it is higher also to embody its degree of concern.
In one embodiment, described that net is jumped according to the affection data collection, user's webpage residence time and user
The purport sight spot of page, the basic sight spot that basic data collection described in binding analysis obtains return to travelling recommendation results, packet after sequence
It includes:
When intersection is not present between the purport sight spot and the basic sight spot, by the purport sight spot and the basis
Sight spot is all used as recommendation results to return.
The composition figure of Fig. 2 travelling recommendation apparatus according to an embodiment of the present invention based on data mining.As shown in Fig. 2, should
Travelling recommendation apparatus based on data mining can integrally be divided into:
It constructs module 21 and institute is extracted by rule matching algorithm for obtaining the basic data in website, comment data
The customer attribute information basis of formation data set in basic data is stated, by naming entity identification algorithms to extract in comment data
Sight spot information formed scene data collection, by sentiment analysis algorithm extract the emotion information in comment data formed emotion number
According to collection, the basic data collection, scene data collection, the mapping relations between affection data collection are established, and import in data warehouse,
It builds travelling and recommends big data analysis environment;
Sorting module 22, it is clear for being carried out to the basic data collection, the scene data collection, the affection data collection
It washes, the concept concentrated to the basic data collection, scene data is normalized, and carries out trip to the basic data collection
Index is recommended on row basis, carries out Concept of Tourism expansion to the scene data collection;
Jump module 23 browses history for obtaining user, extracts user in the residence time of each webpage and jump suitable
Sequence records the Anchor Text information clicked in user's jump procedure, carries out the analysis of purport sight spot to webpage is jumped, and obtain and jump net
The sentiment analysis assignment scoring of page comment data, imports the travelling and recommends big data analysis environment;
Return module 24, for jumping the master of webpage according to the affection data collection, user's webpage residence time and user
Purport sight spot, the basic sight spot that basic data collection described in binding analysis obtains return to travelling recommendation results after sequence.
Fig. 3 shows the composition figure of building module according to an embodiment of the present invention.
From figure 3, it can be seen that building module 21, comprising:
First map unit 211, distribution, resource allocation for control task;Establish the basic data collection and described
The first mapping relations between scene data collection;
Second map unit 212, the second mapping for establishing between the scene data collection and the affection data collection
Relationship.
In the description of this specification, reference term " one embodiment ", " some embodiments ", " example ", " specifically show
The description of example " or " some examples " etc. means specific features, structure, material or spy described in conjunction with this embodiment or example
Point is included at least one embodiment or example of the invention.Moreover, particular features, structures, materials, or characteristics described
It may be combined in any suitable manner in any one or more of the embodiments or examples.In addition, without conflicting with each other, this
The technical staff in field can be by the spy of different embodiments or examples described in this specification and different embodiments or examples
Sign is combined.
Any process described otherwise above or method description are construed as in flow chart or herein, and expression includes
It is one or more for realizing specific logical function or process the step of executable instruction code module, segment or portion
Point, and the range of the preferred embodiment of the present invention includes other realization, wherein can not press shown or discussed suitable
Sequence, including according to related function by it is basic simultaneously in the way of or in the opposite order, Lai Zhihang function, this should be of the invention
Embodiment person of ordinary skill in the field understood.
Expression or logic and/or step described otherwise above herein in flow charts, for example, being considered use
In the order list for the executable instruction for realizing logic function, may be embodied in any computer-readable medium, for
Instruction execution system, device or equipment (such as computer based system, including the system of processor or other can be held from instruction
The instruction fetch of row system, device or equipment and the system executed instruction) it uses, or combine these instruction execution systems, device or set
It is standby and use.For the purpose of this specification, " computer-readable medium ", which can be, any may include, stores, communicates, propagates or pass
Defeated program is for instruction execution system, device or equipment or the dress used in conjunction with these instruction execution systems, device or equipment
It sets.The more specific example (non-exhaustive list) of computer-readable medium include the following: there is the electricity of one or more wirings
Interconnecting piece (electronic device), portable computer diskette box (magnetic device), random access memory (RAM), read-only memory
(ROM), erasable edit read-only storage (EPROM or flash memory), fiber device and portable read-only memory
(CDROM).In addition, computer-readable medium can even is that the paper that can print described program on it or other suitable Jie
Matter, because can then be edited, be interpreted or when necessary with other for example by carrying out optical scanner to paper or other media
Suitable method is handled electronically to obtain described program, is then stored in computer storage.
It should be appreciated that each section of the invention can be realized with hardware, software, firmware or their combination.Above-mentioned
In embodiment, software that multiple steps or method can be executed in memory and by suitable instruction execution system with storage
Or firmware is realized.It, and in another embodiment, can be under well known in the art for example, if realized with hardware
Any one of column technology or their combination are realized: having a logic gates for realizing logic function to data-signal
Discrete logic, with suitable combinational logic gate circuit specific integrated circuit, programmable gate array (PGA), scene
Programmable gate array (FPGA) etc..
Those skilled in the art are understood that realize all or part of step that above-described embodiment method carries
It suddenly is that relevant hardware can be instructed to complete by program, the program can store in a kind of computer-readable storage medium
In matter, which when being executed, includes the steps that one or a combination set of embodiment of the method.
It, can also be in addition, each functional unit in each embodiment of the present invention can integrate in a processing module
It is that each unit physically exists alone, can also be integrated in two or more units in a module.Above-mentioned integrated mould
Block both can take the form of hardware realization, can also be realized in the form of software function module.The integrated module is such as
Fruit is realized and when sold or used as an independent product in the form of software function module, also can store in a computer
In readable storage medium storing program for executing.The storage medium can be read-only memory, disk or CD etc..
The above description is merely a specific embodiment, but scope of protection of the present invention is not limited thereto, any
Those familiar with the art in the technical scope disclosed by the present invention, can readily occur in its various change or replacement,
These should be covered by the protection scope of the present invention.Therefore, protection scope of the present invention should be with the guarantor of the claim
It protects subject to range.
Claims (10)
1. a kind of travelling recommended method based on data mining characterized by comprising
Basic data, the comment data in website are obtained, the user in the basic data is extracted by rule matching algorithm
Attribute information basis of formation data set forms sight spot by naming entity identification algorithms to extract the sight spot information in comment data
Data set extracts the emotion information in comment data by sentiment analysis algorithm and forms affection data collection, establishes the basis
Data set, scene data collection, the mapping relations between affection data collection, and import in data warehouse, it builds travelling and recommends big number
According to analysis environment;
The basic data collection, the scene data collection, the affection data collection are cleaned, to the basic data collection,
The concept that scene data is concentrated is normalized, and carries out basis recommendation index of travelling to the basic data collection, to institute
It states scene data collection and carries out Concept of Tourism expansion;
It obtains user and browses history, extract user in the residence time of each webpage and jump sequence, record user's jump procedure
The Anchor Text information of middle click carries out the analysis of purport sight spot to webpage is jumped, and obtains the emotion point for jumping webpage comment data
Assignment scoring is analysed, the travelling is imported and recommends big data analysis environment;
The purport sight spot of webpage is jumped according to the affection data collection, user's webpage residence time and user, described in binding analysis
The basic sight spot that basic data collection obtains returns to travelling recommendation results after sequence.
2. the method according to claim 1, wherein described by naming entity identification algorithms to extract comment number
Sight spot information in forms scene data collection, further includes:
According to font size, color, position of the sight spot information in webpage, the purport degree at each sight spot in webpage is obtained, is passed through
The purport sight spot of webpage is determined after sequence.
3. the method according to claim 1, wherein described establish the basic data collection, scene data collection, feelings
Feel the mapping relations between data set, comprising:
Establish the first mapping relations between the basic data collection and the scene data collection;
Establish the second mapping relations between the scene data collection and the affection data collection.
4. according to the method described in claim 2, it is characterized in that, the font size according to sight spot information in webpage,
Color, position obtain the different degree at each sight spot in webpage, are calculated by the following formula:
D=∑ ωi·Pi,
Wherein D is the purport degree at sight spot, ωiFor the weighting coefficient of i-th of webpage attribute, PiFor i-th of webpage category in the webpage
The quantized value of property.
5. the method according to claim 1, wherein described carry out basis recommendation of travelling to the basic data collection
Index carries out Concept of Tourism expansion to the scene data collection, comprising:
The user base information inputted when according to user's registration inquires the basic recommended models of preset travelling, obtains basic recommendation
As a result, and establishing and establishing index relative with the sight spot in basic recommendation results;
According to sight name, expand the sight spot in the affiliated geographic area in sight spot out, and extend and obtain the sight spot with travelling feature.
6. the method according to claim 1, wherein the Anchor Text letter clicked in record user's jump procedure
Breath carries out the analysis of purport sight spot to webpage is jumped, comprising:
The sight spot information in Anchor Text is extracted, and carries out semantic extension, obtains the first purport concept;
The analysis of purport sight spot is carried out to webpage is jumped, obtains the second purport concept;
The first purport concept and the second purport concept are subjected to intersection operation, the purport sight spot for obtaining user's concern is general
It reads.
7. the method according to claim 1, wherein described obtain the sentiment analysis tax for jumping webpage comment data
Value scoring, comprising:
When jumping in webpage there are when the comment information of login user, sentiment analysis directly is carried out to comment information, determination jumps
The sentiment analysis assignment of webpage scores;
When jumping the comment information that login user is not present in webpage, the affection data collection is inquired, determination jumps webpage
The scoring of sentiment analysis assignment.
8. the method according to claim 1, wherein described stop according to the affection data collection, user's webpage
Time and user jump the purport sight spot of webpage, and the basic sight spot that basic data collection described in binding analysis obtains returns after sequence
Travelling recommendation results, comprising:
When intersection is not present between the purport sight spot and the basic sight spot, by the purport sight spot and the basic sight spot
All returned as recommendation results.
9. a kind of travelling recommendation apparatus based on data mining characterized by comprising
It constructs module and the basis is extracted by rule matching algorithm for obtaining the basic data in website, comment data
Customer attribute information basis of formation data set in data, by naming entity identification algorithms to extract the sight spot in comment data
Information forms scene data collection, extracts the emotion information in comment data by sentiment analysis algorithm and forms affection data collection,
The basic data collection, scene data collection, the mapping relations between affection data collection are established, and are imported in data warehouse, are built
Big data analysis environment is recommended in travelling;
Sorting module, for being cleaned to the basic data collection, the scene data collection, the affection data collection, to institute
State basic data collection, the concept that scene data is concentrated is normalized, and carries out basis of travelling to the basic data collection
Recommend index, Concept of Tourism expansion is carried out to the scene data collection;
Jump module browses history for obtaining user, extracts user in the residence time of each webpage and jumps sequence, record
The Anchor Text information clicked in user's jump procedure carries out the analysis of purport sight spot to webpage is jumped, and obtains and jump webpage comment
The sentiment analysis assignment of data scores, and imports the travelling and recommends big data analysis environment;
Return module, for jumping the purport sight spot of webpage according to the affection data collection, user's webpage residence time and user,
The basic sight spot that basic data collection described in binding analysis obtains returns to travelling recommendation results after sequence.
10. device according to claim 9, the building module, comprising:
First map unit, distribution, resource allocation for control task;Establish the basic data collection and the scene data
The first mapping relations between collection;
Second map unit, the second mapping relations for establishing between the scene data collection and the affection data collection.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910380945.3A CN110245286B (en) | 2019-05-08 | 2019-05-08 | travel recommendation method and device based on data mining |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910380945.3A CN110245286B (en) | 2019-05-08 | 2019-05-08 | travel recommendation method and device based on data mining |
Publications (2)
Publication Number | Publication Date |
---|---|
CN110245286A true CN110245286A (en) | 2019-09-17 |
CN110245286B CN110245286B (en) | 2020-01-31 |
Family
ID=67883835
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201910380945.3A Active CN110245286B (en) | 2019-05-08 | 2019-05-08 | travel recommendation method and device based on data mining |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN110245286B (en) |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111191127A (en) * | 2019-12-24 | 2020-05-22 | 重庆特斯联智慧科技股份有限公司 | Travel recommendation method and system based on correlation analysis algorithm |
CN111612590A (en) * | 2020-03-19 | 2020-09-01 | 江苏智檬智能科技有限公司 | Scenic spot recommendation method and device based on artificial intelligence big data |
CN117077901A (en) * | 2023-10-17 | 2023-11-17 | 北京铭洋商务服务有限公司 | Travel data processing method and system |
Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN105427209A (en) * | 2015-11-24 | 2016-03-23 | 余元辉 | Panoramic smart travel system |
CN105718184A (en) * | 2014-12-05 | 2016-06-29 | 北京搜狗科技发展有限公司 | Data processing method and apparatus |
CN106202252A (en) * | 2016-06-29 | 2016-12-07 | 厦门趣处网络科技有限公司 | Method, system are recommended in a kind of trip analyzed based on user emotion |
CN107423837A (en) * | 2017-04-12 | 2017-12-01 | 宁夏丝路风情旅游网络股份有限公司 | The Intelligent planning method and system of tourism route |
US20180053121A1 (en) * | 2016-08-17 | 2018-02-22 | International Business Machines Corporation | Intelligent travel planning |
CN108681739A (en) * | 2018-03-26 | 2018-10-19 | 安徽师范大学 | One kind recommending method based on user feeling and time dynamic tourist famous-city |
CN109284443A (en) * | 2018-11-28 | 2019-01-29 | 四川亨通网智科技有限公司 | A kind of tourism recommended method and system based on crawler technology |
-
2019
- 2019-05-08 CN CN201910380945.3A patent/CN110245286B/en active Active
Patent Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN105718184A (en) * | 2014-12-05 | 2016-06-29 | 北京搜狗科技发展有限公司 | Data processing method and apparatus |
CN105427209A (en) * | 2015-11-24 | 2016-03-23 | 余元辉 | Panoramic smart travel system |
CN106202252A (en) * | 2016-06-29 | 2016-12-07 | 厦门趣处网络科技有限公司 | Method, system are recommended in a kind of trip analyzed based on user emotion |
US20180053121A1 (en) * | 2016-08-17 | 2018-02-22 | International Business Machines Corporation | Intelligent travel planning |
CN107423837A (en) * | 2017-04-12 | 2017-12-01 | 宁夏丝路风情旅游网络股份有限公司 | The Intelligent planning method and system of tourism route |
CN108681739A (en) * | 2018-03-26 | 2018-10-19 | 安徽师范大学 | One kind recommending method based on user feeling and time dynamic tourist famous-city |
CN109284443A (en) * | 2018-11-28 | 2019-01-29 | 四川亨通网智科技有限公司 | A kind of tourism recommended method and system based on crawler technology |
Non-Patent Citations (1)
Title |
---|
王少兵 等: "采用在线评论的景点个性化推荐", 《华侨大学学报(自然科学版)》 * |
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111191127A (en) * | 2019-12-24 | 2020-05-22 | 重庆特斯联智慧科技股份有限公司 | Travel recommendation method and system based on correlation analysis algorithm |
CN111612590A (en) * | 2020-03-19 | 2020-09-01 | 江苏智檬智能科技有限公司 | Scenic spot recommendation method and device based on artificial intelligence big data |
CN117077901A (en) * | 2023-10-17 | 2023-11-17 | 北京铭洋商务服务有限公司 | Travel data processing method and system |
CN117077901B (en) * | 2023-10-17 | 2024-01-05 | 北京铭洋商务服务有限公司 | Travel data processing method and system |
Also Published As
Publication number | Publication date |
---|---|
CN110245286B (en) | 2020-01-31 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Wu et al. | Tourism knowledge domains: A keyword analysis | |
Touya et al. | Detecting level-of-detail inconsistencies in volunteered geographic information data sets | |
Shelton et al. | The nature of neighborhoods: Using big data to rethink the geographies of Atlanta’s neighborhood planning unit system | |
Perez-Encinas et al. | Factors influencing student mobility: a comparative European study | |
Khoshkar et al. | Planning for green qualities in the densification of suburban Stockholm–opportunities and challenges | |
CN110245286A (en) | A kind of travelling recommended method and device based on data mining | |
ES2394002T3 (en) | Search engine to perform a search referring to a place | |
Popescu et al. | Mining social media to create personalized recommendations for tourist visits | |
MXPA05010594A (en) | System and method for providing a geographic search function. | |
Highfield et al. | Challenges of tracking topical discussion networks online | |
KR20030003396A (en) | Method for Content Recommendation Service using Content Category-based Personal Profile structures | |
CN102314452A (en) | Method for navigation through input method platform and system | |
Durán-Sánchez et al. | Sustainability and quality of life in smart cities: analysis of scientific production | |
Omer et al. | Structural properties of the angular and metric street network's centralities and their implications for movement flows | |
Al-Rojaie | Mapping perceptions of linguistic variation in Qassim, Saudi Arabia, using GIS technology | |
Özen | Evaluation of tourist reviews on TripAdvisor for the protection of the world heritage sites: Text mining approach | |
CN107408125A (en) | For inquiring about the image of answer | |
CN111191127B (en) | Travel recommendation method and system based on correlation analysis algorithm | |
KR20210065773A (en) | Big data based emotional information analysis and evaluation system and Driving method of the Same | |
Mubaideen et al. | Using the EAMENA database to document modern heritage: the Amman Heritage Houses, Jordan, case study | |
Li et al. | Identifying urban form typologies in Seoul using a new Gaussian mixture model-based clustering framework | |
Li et al. | Cluster analysis of China’s inbound tourism market: A new multi-attribute approach based on association rule mining of tourist preferences at scenic spots | |
Christy et al. | Critical Analysis of Urban Livability Measures Based on the Perspective of Placemaking | |
Chaykina et al. | Destination brand personality of Portugal for the Russian-speaking market | |
CN110633890A (en) | Land utilization efficiency judgment method and system |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |