CN104008203A

CN104008203A - User interest discovering method with ontology situation blended in

Info

Publication number: CN104008203A
Application number: CN201410269562.6A
Authority: CN
Inventors: 陈庭贵; 周广澜; 许翀寰; 封毅
Original assignee: Zhejiang Gongshang University
Current assignee: Zhejiang Gongshang University
Priority date: 2014-06-17
Filing date: 2014-06-17
Publication date: 2014-08-27
Anticipated expiration: 2034-06-17
Also published as: CN104008203B

Abstract

A user interest discovery method incorporating ontology context comprises the following steps. First, a user interest feature extraction model based on a second-order hidden Markov model is constructed for the complex, multi-dimensional Web user interest behavior data of an e-commerce website. Second, the context information that reflects user interests is analyzed, including individual information, environment information, and device information. Third, a user interest model based on a context ontology is constructed, and the degree of interest implied by the user's individual information is measured and expressed using fuzzy logic. Finally, a user interest drift detection method based on a hidden semi-Markov model builds a model from user browsing paths and takes the mean of the average log-likelihoods of the sequences as the threshold for judging whether interest has drifted. The method builds an interest model that meets user needs so as to provide personalized recommendation services, offers an effective means of improving user satisfaction, and has good application value.

Description

A user interest discovery method incorporating ontology context

Technical field

The present invention relates to the fields of data mining and ontology, in particular to a user interest discovery method, and is especially suitable for personalized user information services.

Background technology

Network applications are becoming increasingly complex and data volumes keep growing, which makes work such as e-commerce and website design more complicated and burdensome. On the basis of existing user information, web page structure needs to be adjusted dynamically according to behavior such as the user's access interests, access times, and visit frequency, so that e-commerce can be targeted, user needs can be met, and personalized services can be provided. A personalized information service on the Internet organizes and adjusts information automatically according to each user's characteristics and interests, and addresses problems such as the homogeneity of user information by offering a fast, efficient, and accurate way of acquiring information. Accordingly, how to understand a user's information needs accurately amid rapidly expanding information, build a user model that characterizes the user's features, interests, goals, and behavioral preferences, and predict user behavior from it has become a difficult problem in providing better personalized services. At the same time, how to detect user interest drift promptly and accurately, and build a dynamically updated user interest model that meets the personalized information needs of different users, has become a key issue for personalized information services.

Summary of the invention

To overcome the deficiency that the interest models of existing data mining approaches cannot meet user needs when providing personalized recommendation services, the present invention provides a user interest discovery method incorporating ontology context. The method builds an interest model that meets user needs so as to provide personalized recommendation services, and is an effective means of improving user satisfaction.

The technical solution adopted by the present invention to solve this technical problem is:

A user interest discovery method incorporating ontology context, comprising the following steps:

1) Establish the user interest feature extraction model based on a second-order hidden Markov model:

First, the data that reflect user interest are collected as follows: user source data are obtained from the client, the server, and the proxy server; once obtained, these source data are preprocessed and stored in a set format for later mining of user interest.

Second, a second-order hidden Markov model (HMM) is used to extract user interest features; the procedure comprises a training part and an extraction part.

The training part preprocesses the feature information sequences that contain user interest to form a text document. The text is then scanned, and typographic marks such as separators, spaces, line breaks, and colons are used to convert the labeled text sequence into a sequence of labeled text segments. Finally, the following parameters of the second-order HMM are computed as shown in the formulas:

1. initial probability distribution vector

\pi_i = \frac{\mathrm{Init}(i)}{\sum_{j=1}^{N} \mathrm{Init}(j)}, \quad 1 \le i \le N \qquad (1)

where Init(i) is the number of sequences in the entire labeled training sample whose initial state is S_i, and \sum_{j=1}^{N} \mathrm{Init}(j) is the total number of sequences taken over all initial states;

2. State transition probabilities

a_{ij} = \frac{C_{ij}}{\sum_{k=1}^{N} C_{ik}}, \quad 1 \le i, j \le N \qquad (2)

a_{ijk} = \frac{C_{ijk}}{\sum_{u=1}^{N} C_{iju}}, \quad 1 \le i, j, k \le N \qquad (3)

where C_{ij} is the number of transitions from state S_i to state S_j, and C_{ijk} is the number of times the state is S_i at time t-1, S_j at time t, and S_k at time t+1. \sum_{k=1}^{N} C_{ik} is the total number of transitions from state S_i to all states, and \sum_{u=1}^{N} C_{iju} is the total number of transitions to all states given state S_i at time t-1 and state S_j at time t;

3. Observation emission probabilities

b_j(O_k) = \frac{E_j(O_k)}{\sum_{i=1}^{M} E_j(O_i)}, \quad 1 \le j \le N \qquad (4)

b_{ij}(O_k) = \frac{E_{ij}(O_k)}{\sum_{u=1}^{M} E_{ij}(O_u)}, \quad 1 \le i, j \le N, \ 1 \le k \le M \qquad (5)

where E_j(O_k) is the number of times observation O_k is emitted in state S_j, and E_{ij}(O_k) is the number of times observation O_k is emitted when the state is S_i at time t-1 and S_j at time t. \sum_{i=1}^{M} E_j(O_i) is the total number of emissions of all observations in state S_j, and \sum_{u=1}^{M} E_{ij}(O_u) is the total number of emissions of all observations when the state is S_i at time t-1 and S_j at time t;
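The counting estimates in formulas (1) to (5) can be sketched in Python as follows. This is a minimal illustration rather than the patented implementation: the function name `estimate_second_order_hmm`, the dictionary layout of the result, and the input format (each training sequence given as a list of `(state, observation)` pairs) are assumptions made for the example, and no smoothing is applied to unseen events.

```python
from collections import Counter, defaultdict


def estimate_second_order_hmm(sequences):
    """Estimate second-order HMM parameters by relative-frequency counting.

    sequences: list of labeled sequences, each a list of (state, observation)
    pairs (hypothetical input format chosen for this sketch).
    """
    init = Counter()             # Init(i)
    c2 = defaultdict(Counter)    # C_ij, keyed by i
    c3 = defaultdict(Counter)    # C_ijk, keyed by (i, j)
    e1 = defaultdict(Counter)    # E_j(O_k), keyed by j
    e2 = defaultdict(Counter)    # E_ij(O_k), keyed by (i, j)

    for seq in sequences:
        if not seq:
            continue
        states = [s for s, _ in seq]
        init[states[0]] += 1
        for t, (s, o) in enumerate(seq):
            e1[s][o] += 1
            if t >= 1:
                c2[states[t - 1]][s] += 1
                e2[(states[t - 1], s)][o] += 1
            if t >= 2:
                c3[(states[t - 2], states[t - 1])][s] += 1

    def normalise(counter):
        # Divide each count by the row total, as in formulas (1) to (5).
        total = sum(counter.values())
        return {key: value / total for key, value in counter.items()}

    return {
        "pi": normalise(init),
        "a1": {i: normalise(c) for i, c in c2.items()},
        "a2": {ij: normalise(c) for ij, c in c3.items()},
        "b1": {j: normalise(c) for j, c in e1.items()},
        "b2": {ij: normalise(c) for ij, c in e2.items()},
    }
```

In practice some form of smoothing or back-off from the second-order to the first-order estimates would probably be needed for sparse data; the source does not specify one, so none is shown.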

The extraction part comprises two steps: (a) the text from which features are to be extracted is preprocessed: it is scanned, and typographic marks such as separators, spaces, line breaks, and colons are used to convert the text sequence into a sequence of text segments; (b) using the second-order HMM output by the training part, the Viterbi algorithm is applied to extract user interest features. The observation sequence O = O_1 O_2 ... O_T obtained after preprocessing is taken as the model input, the state label sequence with the maximum probability is found, and the extracted user features are the observed text segments labeled with the target state label;
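Step (b) can be illustrated with a small Viterbi decoder for a second-order HMM. The sketch below is written under stated assumptions: the parameter dictionaries `pi`, `a1`, `a2`, `b1`, and `b2` follow a hypothetical layout keyed by state or state pair, the first observation uses the first-order emission b_j, the second uses a_ij with b_ij, and unseen events receive a small floor probability so that logarithms stay finite. None of these choices is specified in the source.

```python
import math


def _log(p, floor=1e-12):
    # Floor unseen events so that log() is always defined (assumption).
    return math.log(max(p, floor))


def viterbi_second_order(obs, states, pi, a1, a2, b1, b2):
    """Return the most probable state sequence for obs."""
    if not obs:
        return []
    # t = 0: initial probability and first-order emission.
    d0 = {j: _log(pi.get(j, 0.0)) + _log(b1.get(j, {}).get(obs[0], 0.0))
          for j in states}
    if len(obs) == 1:
        return [max(d0, key=d0.get)]
    # t = 1: scores are indexed by the pair (previous state, current state).
    delta = {(i, j): d0[i]
             + _log(a1.get(i, {}).get(j, 0.0))
             + _log(b2.get((i, j), {}).get(obs[1], 0.0))
             for i in states for j in states}
    back = []
    for o in obs[2:]:
        new, ptr = {}, {}
        for j in states:
            for k in states:
                def score(i):
                    return delta[(i, j)] + _log(a2.get((i, j), {}).get(k, 0.0))
                best_i = max(states, key=score)
                new[(j, k)] = score(best_i) + _log(b2.get((j, k), {}).get(o, 0.0))
                ptr[(j, k)] = best_i
        delta = new
        back.append(ptr)
    # Backtrack from the best final state pair.
    path = list(max(delta, key=delta.get))
    for ptr in reversed(back):
        path.insert(0, ptr[(path[0], path[1])])
    return path
```

The segments whose decoded label equals the target state would then be taken as the extracted interest features.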

2) Analyze the context information that reflects user interest: by analyzing the user's search and browsing behavior and purchase records, the user's true interests over a period of time are derived;

3) Build the user interest ontology model incorporating context: first, several key factors that affect user interest, namely region, gender, age, marital status, educational background, and income, are taken as background indicators, and fuzzy processing is applied in combination with the user's purchase history and behavioral features to obtain the degree of interest; then, the context ontology representation is adopted and the user interest ontology model is built through multi-granularity partitioning;

4) User interest drift detection based on a hidden semi-Markov model:

Two observations are chosen to describe the user's browsing behavior: a) the browsing path sequence of the web pages the user visits; b) the time interval from one web page to the next. The set of all states is denoted S = {S_1, S_2, ..., S_n}, the corresponding observation set is denoted V = {v_1, v_2, ..., v_n}, and the time intervals form the set I = {1, 2, ...}. For a given browsing behavior, the number of links in the browsing path is a random variable, so the number of observations emitted in a given state takes values in the set {1, ..., D}. The user's browsing path sequence is expressed as the two-dimensional observation sequence O = {(r_1, τ_1), ..., (r_T, τ_T)}, where r_t ∈ V is the web content object browsed by the user and τ_t ∈ I is the time interval between page r_{t-1} and page r_t. The output probability matrix of the model is B = {b_i(v, q)}: for a given state i ∈ S, b_i(v, q) is the probability that the user is on page r_t = v ∈ V with a time interval τ_t = q ∈ I from the previous page, and it satisfies ∑_{v,q} b_i(v, q) = 1. P = {p_i(d)} denotes the probability that d ∈ {1, ..., D} observations are emitted in a given state i; it is the state duration probability matrix of the hidden semi-Markov model and satisfies ∑_d p_i(d) = 1. The state transition probability matrix is A = {a_ij}, where a_ij is the probability of a transition from i ∈ S to j ∈ S. The initial probability vector is π = {π_i}, where π_i is the probability that the initial state is i ∈ S;

An interest behavior record of a user is defined as U_interest = {user, background, history, behavior, timestamp, content}, where user identifies the user, for example by ID; background denotes the user's specific context factors; history denotes the user's historical purchase records; behavior identifies the result of a specific interest behavior operation; timestamp denotes the time at which the behavior was performed; and content denotes the content of the interest topic;
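As an illustration only, the U_interest record could be held in a small Python data class. The field names come from the definition above; the field types and the class name `InterestRecord` are assumptions, since the source does not fix a storage format.

```python
from dataclasses import dataclass
from datetime import datetime


@dataclass(frozen=True)
class InterestRecord:
    """One U_interest record; all field types are assumed for this sketch."""
    user: str            # user identifier, e.g. an ID
    background: dict     # context factors such as region or age
    history: tuple       # historical purchase records
    behavior: str        # result of the interest behavior operation
    timestamp: datetime  # time at which the behavior was performed
    content: str         # interest topic content


record = InterestRecord(
    user="u001",
    background={"region": "Zhejiang", "age": 30},
    history=("book", "headphones"),
    behavior="add_to_cart",
    timestamp=datetime(2014, 6, 17, 10, 30),
    content="electronics",
)
```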

During a user's access session, an access transition probability P(q_i → q_j) exists between any two behavior operations; it expresses the interest weight as follows:

P(q_i \to q_j) = P(q_j \mid q_i) = \frac{P(q_i q_j)}{P(q_i)} = \begin{cases} \frac{\theta_1 W_B(q_i, q_j) + \theta_2 W_{HI}(q_i, q_j) + \theta_3 W_{IB}(q_i, q_j) + \theta_4 W_L(q_i, q_j)}{\theta_1 W_B(q_i) + \theta_2 W_{HI}(q_i) + \theta_3 W_{IB}(q_i) + \theta_4 W_L(q_i)}, & i \ne j \\ 0, & i = j \end{cases} \qquad (6)
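The weighted ratio in formula (6) can be sketched as below. The source does not define how the four weights W_B, W_HI, W_IB, and W_L are computed, so the sketch simply takes them as lookup tables supplied by the caller; the function name, the argument layout, and the choice to return 0 when the denominator is zero are all assumptions.

```python
def access_transition_probability(q_i, q_j, theta, pair_weights, node_weights):
    """Evaluate formula (6) for one pair of behavior operations.

    theta:        four coefficients (theta_1 .. theta_4)
    pair_weights: four dicts mapping (q_i, q_j) to W(q_i, q_j)
    node_weights: four dicts mapping q_i to W(q_i)
    """
    if q_i == q_j:
        return 0.0  # second case of formula (6)
    numerator = sum(t * w.get((q_i, q_j), 0.0)
                    for t, w in zip(theta, pair_weights))
    denominator = sum(t * w.get(q_i, 0.0)
                      for t, w in zip(theta, node_weights))
    # Guard against an empty denominator (assumed behavior).
    return numerator / denominator if denominator > 0 else 0.0
```

For example, with coefficients (0.4, 0.3, 0.2, 0.1), pair weights 1, 2, 3, 4 and node weights 2, 4, 6, 8, the numerator is 2.0 and the denominator is 4.0, giving a transition probability of 0.5.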

For each q_j and its corresponding concept there is an observation probability distribution, namely the interest probability for that concept over all of user u's accesses to q_j. Let the set of access nodes contained in the i-th access sequence be Q_i = {q'_1, ..., q'_f | q' ∈ IC}. Q_{i,j} denotes the set of all access nodes in that sequence from q_j onward, and a further set denotes those nodes of Q_{i,j} that contain the concept:

Q_{i,j} = \begin{cases} \{ q'_{k+l} \mid q'_k = q_j, \ l = 0, \ldots, (f - k) \}, & q_j \in Q_i \\ \text{Null}, & \text{otherwise} \end{cases} \qquad (7)
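Formula (7) selects the part of an access sequence that starts at q_j. A minimal Python sketch follows; representing the sequence as a list, returning `None` for the Null case, and using the first occurrence of q_j when it appears more than once are assumptions of this example.

```python
def nodes_from(access_sequence, q_j):
    """Return Q_{i,j}: the nodes of the sequence from q_j onward (l = 0 .. f-k)."""
    if q_j not in access_sequence:
        return None  # the Null case of formula (7)
    k = access_sequence.index(q_j)  # first occurrence (assumption)
    return access_sequence[k:]
```

For the sequence `["home", "list", "item", "cart"]` and q_j = `"list"`, the function returns `["list", "item", "cart"]`.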

The observation probability distribution of user u at q_j is defined as:

Then, among all the possible access sequences of user u, a state sequence with the maximum access probability is found, which establishes the hidden semi-Markov model of user interest behavior:

P_{\max}(\sigma_z^k) = \arg\max \prod P(q_k \to q_{k+1}) \, P(\sigma_z^k \mid q_k) \qquad (9)

To detect user interest drift, the observation sequence for the HSMM is first collected and the data are preprocessed before model training. After the model parameters have been determined, the HSMM algorithm is invoked to obtain the probability that the user's interest is unchanged, computed as the average log-likelihood. When the user's interest value falls within the normal range, the user's data are added to the training set to update the parameters of the hidden semi-Markov model; otherwise, the user's interest is considered to have drifted.
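The thresholding step can be sketched as follows. The sketch assumes that a trained model exposes a function returning the log-likelihood of an observation sequence; the HSMM itself is not implemented here, and `make_unigram_scorer` is only a toy stand-in used to make the example runnable. Treating a score below the threshold as drift is likewise an assumption about the direction of the comparison.

```python
import math
from statistics import mean


def average_log_likelihood(sequence, log_likelihood):
    """Log-likelihood of the sequence divided by its length."""
    return log_likelihood(sequence) / len(sequence)


def drift_threshold(training_sequences, log_likelihood):
    """Mean of the average log-likelihoods over the training sequences."""
    return mean(average_log_likelihood(s, log_likelihood)
                for s in training_sequences)


def has_drifted(sequence, threshold, log_likelihood):
    """Flag drift when the sequence scores below the threshold (assumption)."""
    return average_log_likelihood(sequence, log_likelihood) < threshold


def make_unigram_scorer(page_probs, floor=1e-6):
    """Toy stand-in for an HSMM scorer: independent page probabilities."""
    def score(sequence):
        return sum(math.log(page_probs.get(page, floor)) for page in sequence)
    return score
```

A sequence that is not flagged would be appended to the training set before the model parameters are re-estimated, as described above.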

Further, in step 1), user personalized information is obtained in two ways: (a) through online surveys in which the user participates directly; (b) by tracking user behavior to obtain the user's interest information, using feature extraction on user behavior data.

Further, in step 2), the user's behavior information comprises the user's search keywords, historical purchase records, and historical browsing behavior.

Still further, in step 3), when the user context ontology is built from the user's interest context information, user context is divided into user individual context, user environment context, and user device context. The ontology takes the form of a hierarchical concept tree in which each node represents one element of the user context, forming the context ontology tree.

The technical concept of the present invention is as follows: for the field of user-oriented personalized services, and in view of the concept drift involved in the method and the problem scenario, a user interest discovery method incorporating ontology context is proposed, which builds an interest model that meets user needs so as to provide personalized recommendation services and is an effective means of improving user satisfaction.

On this basis, the present invention takes personalized user information services as its research object, introduces data mining and ontology, takes full account of users' individual characteristics, and proposes a user interest discovery method incorporating ontology context that effectively meets users' personalized service needs.

Specifically, first, for the complex, multi-dimensional Web user interest behavior data of an e-commerce website, a user interest feature extraction model based on a second-order hidden Markov model (Second-Order Hidden Markov Model) is built; second, the context information that reflects user interest is analyzed, including the user's individual information, environment information, and device information; third, a user interest model based on the context ontology is built, and fuzzy logic is used to measure and express the degree of interest implied by the user's individual information; finally, a user interest drift detection method based on a hidden semi-Markov model (Hidden Semi-Markov Model, HSMM) builds a model from the user's browsing paths and takes the mean of the average log-likelihoods of the sequences as the threshold for judging whether interest drift has occurred.

The beneficial effects of the present invention are: it builds an interest model that meets user needs so as to provide personalized recommendation services, offers an effective means of improving user satisfaction, and has good application value.

Brief description of the drawings

Fig. 1 is the flowchart of interest feature extraction based on the second-order HMM.

Fig. 2 is the flowchart for building the user context ontology.

Fig. 3 is the block diagram of interest drift detection.

Embodiment

The invention is further described below with reference to the accompanying drawings.

With reference to Fig. 1, Fig. 2 and Fig. 3, a user interest discovery method incorporating ontology context comprises the following steps:

1) Establish the user interest feature extraction model based on a second-order hidden Markov model: Web information extraction (Web Information Extraction) belongs to the category of Web content mining; it is a class of information extraction methods that takes the Web as its information source and extracts data from semi-structured Web documents. This step comprises the collection of user data and the establishment of the user interest feature extraction model.

To build a user interest model, the data that reflect user interest must first be collected. User data are usually plentiful and include user registration information, log information, page text content, website topology, user behavior data, and page hyperlink information. These data can be obtained from sources such as the client, the server, and the proxy server; once obtained, they can be preprocessed and stored in a suitable format for later mining of user interest. In summary, user personalized information is obtained in two main ways: (a) through online surveys in which the user participates directly. This approach obtains the user's interests and information needs directly, but requires the user's active cooperation; (b) by tracking user behavior to obtain the user's interest information. Data obtained in the first way, such as registration information, are supplied directly by the user through forms and written to the back-end database, so extracting interest features from them is relatively easy. Data from which interest is inferred by tracking the user's implicit behavior cannot be obtained directly, so feature extraction on user behavior data is the main approach adopted here.

Second, user interest feature extraction belongs to the category of text information extraction. Information extraction has become an important direction in natural language processing, and its theoretical research continues to develop. Current information extraction models fall into three main classes: dictionary-based models; rule-based models, such as ontologies; and statistical models, such as the hidden Markov model (HMM). Because the HMM has a statistical foundation well suited to natural language processing, and offers strong robustness, high precision, ease of construction, and good adaptability, it is attracting increasing attention from researchers. Here a second-order hidden Markov model is used to extract user interest features; the flowchart is shown in Fig. 1. The procedure comprises two main parts: a training part and an extraction part.

The training part preprocesses the feature information sequences that contain user interest to form a text document. The text is then scanned, and typographic marks such as separators, spaces, line breaks, and colons are used to convert the labeled text sequence into a sequence of labeled text segments. Finally, the following parameters of the second-order HMM are computed as shown in the formulas:

1. Initial probability distribution vector

\pi_i = \frac{\mathrm{Init}(i)}{\sum_{j=1}^{N} \mathrm{Init}(j)}, \quad 1 \le i \le N \qquad (10)

where Init(i) is the number of sequences in the entire labeled training sample whose initial state is S_i, and \sum_{j=1}^{N} \mathrm{Init}(j) is the total number of sequences taken over all initial states.

2. State transition probabilities

a_{ij} = \frac{C_{ij}}{\sum_{k=1}^{N} C_{ik}}, \quad 1 \le i, j \le N \qquad (11)

a_{ijk} = \frac{C_{ijk}}{\sum_{u=1}^{N} C_{iju}}, \quad 1 \le i, j, k \le N \qquad (12)

where C_{ij} is the number of transitions from state S_i to state S_j, and C_{ijk} is the number of times the state is S_i at time t-1, S_j at time t, and S_k at time t+1. \sum_{k=1}^{N} C_{ik} is the total number of transitions from state S_i to all states, and \sum_{u=1}^{N} C_{iju} is the total number of transitions to all states given state S_i at time t-1 and state S_j at time t.

3. Observation emission probabilities

b_j(O_k) = \frac{E_j(O_k)}{\sum_{i=1}^{M} E_j(O_i)}, \quad 1 \le j \le N \qquad (13)

b_{ij}(O_k) = \frac{E_{ij}(O_k)}{\sum_{u=1}^{M} E_{ij}(O_u)}, \quad 1 \le i, j \le N, \ 1 \le k \le M \qquad (14)

where E_j(O_k) is the number of times observation O_k is emitted in state S_j, and E_{ij}(O_k) is the number of times observation O_k is emitted when the state is S_i at time t-1 and S_j at time t. \sum_{i=1}^{M} E_j(O_i) is the total number of emissions of all observations in state S_j, and \sum_{u=1}^{M} E_{ij}(O_u) is the total number of emissions of all observations when the state is S_i at time t-1 and S_j at time t.

The extraction part comprises two steps: (a) the text from which features are to be extracted is preprocessed: it is scanned, and typographic marks such as separators, spaces, line breaks, and colons are used to convert the text sequence into a sequence of text segments; (b) using the second-order HMM output by the training part, the Viterbi algorithm is applied, and the established HMM performs the user interest feature extraction. The observation sequence O = O_1 O_2 ... O_T obtained after preprocessing is taken as the model input, the state label sequence with the maximum probability is found, and the extracted user features are the observed text segments labeled with the target state label.
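The preprocessing in step (a) amounts to splitting the scanned text on typographic marks. A minimal Python sketch is given below; the exact set of separator characters (whitespace, colon, semicolon, comma, and vertical bar) and the function name `segment_text` are assumptions, since the source only names separators, spaces, line breaks, and colons as examples.

```python
import re

# Characters treated as segment boundaries (assumed set).
_BOUNDARY = re.compile(r"[\s:;,|]+")


def segment_text(text):
    """Split raw text into the segment sequence used as HMM observations."""
    return [segment for segment in _BOUNDARY.split(text.strip()) if segment]
```

For instance, `segment_text("Name: Alice\nInterest: hiking; cycling")` yields the five segments `Name`, `Alice`, `Interest`, `hiking`, and `cycling`.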

2) Analyze the context information that reflects user interest: the interest characteristics of network users are mainly affected by internal and external factors related to user interest. Internal factors include gender, age, occupation, personality, education, and income; external factors include cultural background, social environment, and family background. The many internal and external factors together give rise to the different behaviors of network users. For this reason, users differ in many respects, and their level of interest in and inclination toward commodities also differ.

A user's interests are usually reflected in the user's own behavior: when users are interested in something they show a certain tendency, and their needs and interests are recorded in their behavior information. Therefore, by analyzing information such as the user's search and browsing behavior and purchase records, the user's true interests over a period of time can be derived. Here the user's behavior information mainly comprises the user's search keywords, historical purchase records, and historical browsing behavior.

3) Build the user interest ontology model incorporating context: first, several key factors that affect user interest, namely region, gender, age, marital status, educational background, and income, are taken as background indicators, and fuzzy processing is applied in combination with the user's purchase history and behavioral features to obtain the degree of interest; then, the context ontology representation is adopted and the user interest ontology model is built through multi-granularity partitioning. The flowchart for building the user context ontology model is shown in Fig. 2.
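The source does not state which fuzzy membership functions are used to obtain the degree of interest. Purely as an illustration of the idea, the sketch below maps a normalized interest score in [0, 1] onto three linguistic levels with triangular membership functions; the level names, the breakpoints, and the function names are all assumptions.

```python
def triangular(x, a, b, c):
    """Triangular membership function with feet at a and c and peak at b."""
    if x <= a or x >= c:
        return 0.0
    if x == b:
        return 1.0
    return (x - a) / (b - a) if x < b else (c - x) / (c - b)


# Hypothetical interest levels; the outer feet lie outside [0, 1] so that
# the scores 0 and 1 receive full membership in "low" and "high".
LEVELS = {
    "low": (-0.5, 0.0, 0.5),
    "medium": (0.0, 0.5, 1.0),
    "high": (0.5, 1.0, 1.5),
}


def fuzzify_interest(score):
    """Return the membership degree of a score in each interest level."""
    if not 0.0 <= score <= 1.0:
        raise ValueError("score must lie in [0, 1]")
    return {name: triangular(score, *abc) for name, abc in LEVELS.items()}
```

A score of 0.25, for example, belongs equally to the low and medium levels with degree 0.5 each.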

When the user context ontology is built from the user's interest context information, user context is divided into user individual context, user environment context, and user device context. An ontology usually takes the form of a hierarchical concept tree; each node of the tree represents one element of the user context, and together the nodes form the context ontology tree.
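A hierarchical concept tree of this kind can be sketched with a small recursive data class. The three top-level branches follow the division described above, and the individual-context leaves reuse the background factors listed in step 3); the environment and device leaves, the node names, and the class name `ContextNode` are hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class ContextNode:
    """One concept in the context ontology tree."""
    name: str
    children: list = field(default_factory=list)

    def add(self, name):
        child = ContextNode(name)
        self.children.append(child)
        return child

    def paths(self, prefix=()):
        """Yield the root-to-leaf concept paths of the subtree."""
        here = prefix + (self.name,)
        if not self.children:
            yield here
        for child in self.children:
            yield from child.paths(here)


def build_user_context():
    root = ContextNode("UserContext")
    individual = root.add("IndividualContext")
    for factor in ("Region", "Gender", "Age", "MaritalStatus", "Education", "Income"):
        individual.add(factor)
    environment = root.add("EnvironmentContext")
    for factor in ("Time", "Location"):        # hypothetical leaves
        environment.add(factor)
    device = root.add("DeviceContext")
    device.add("DeviceType")                   # hypothetical leaf
    return root
```

Finer granularity can be modeled by adding further children to any leaf, which is one way to read the multi-granularity partitioning mentioned in the text.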

4) User interest drift detection based on a hidden semi-Markov model: a user's online shopping and browsing is a complex process affected by multiple individual factors such as browsing purpose, cultural background, and hobbies. The user's interest is considered jointly in terms of context factors, user behavior, and interest content, and a hidden semi-Markov model (HSMM) is established to detect whether the user's interest has drifted.

Suppose that the user's browsing behavior satisfies the Markov property while browsing web pages. The following two observations are chosen to describe the user's browsing behavior: a) the browsing path sequence of the web pages the user visits; b) the time interval from one web page to the next. The set of all states is denoted S = {S_1, S_2, ..., S_n}, the corresponding observation set is denoted V = {v_1, v_2, ..., v_n}, and the time intervals form the set I = {1, 2, ...}. For a given browsing behavior, the number of links in the browsing path is a random variable, so the number of observations emitted in a given state takes values in the set {1, ..., D}. The user's browsing path sequence is expressed as the two-dimensional observation sequence O = {(r_1, τ_1), ..., (r_T, τ_T)}, where r_t ∈ V is the web content object browsed by the user and τ_t ∈ I is the time interval between page r_{t-1} and page r_t. The output probability matrix of the model is B = {b_i(v, q)}: for a given state i ∈ S, b_i(v, q) is the probability that the user is on page r_t = v ∈ V with a time interval τ_t = q ∈ I from the previous page, and it satisfies ∑_{v,q} b_i(v, q) = 1. P = {p_i(d)} denotes the probability that d ∈ {1, ..., D} observations are emitted in a given state i; it is the state duration probability matrix of the hidden semi-Markov model and satisfies ∑_d p_i(d) = 1. The state transition probability matrix is A = {a_ij}, where a_ij is the probability of a transition from i ∈ S to j ∈ S. The initial probability vector is π = {π_i}, where π_i is the probability that the initial state is i ∈ S.
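The two-dimensional observation sequence O = {(r_t, τ_t)} can be derived from a timestamped click log roughly as follows. The input format (a list of `(page, seconds)` pairs), the discretization unit of 10 seconds, and the convention that the first page receives the interval 1 are assumptions of this sketch; the source only requires that τ_t take values in I = {1, 2, ...}.

```python
import math


def to_observations(visits, unit_seconds=10):
    """Convert (page, timestamp) visits into (r_t, tau_t) observations.

    tau_t is the elapsed time since the previous page, rounded up to whole
    units and never smaller than 1.
    """
    observations = []
    previous_time = None
    for page, timestamp in visits:
        if previous_time is None:
            tau = 1  # no previous page for the first visit (assumption)
        else:
            tau = max(1, math.ceil((timestamp - previous_time) / unit_seconds))
        observations.append((page, tau))
        previous_time = timestamp
    return observations
```

For example, visits to `home` at 0 s, `list` at 25 s, and `item` at 30 s give the observations `("home", 1)`, `("list", 3)`, and `("item", 1)`.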

An interest behavior record of a user is defined as U_interest = {user, background, history, behavior, timestamp, content}, where user identifies the user, for example by ID; background denotes the user's specific context factors; history denotes the user's historical purchase records; behavior identifies the result of a specific interest behavior operation; timestamp denotes the time at which the behavior was performed; and content denotes the content of the interest topic.

During a user's access session, an access transition probability P(q_i → q_j) exists between any two behavior operations; it can express the interest weight as follows:

P(q_i \to q_j) = P(q_j \mid q_i) = \frac{P(q_i q_j)}{P(q_i)} = \begin{cases} \frac{\theta_1 W_B(q_i, q_j) + \theta_2 W_{HI}(q_i, q_j) + \theta_3 W_{IB}(q_i, q_j) + \theta_4 W_L(q_i, q_j)}{\theta_1 W_B(q_i) + \theta_2 W_{HI}(q_i) + \theta_3 W_{IB}(q_i) + \theta_4 W_L(q_i)}, & i \ne j \\ 0, & i = j \end{cases} \qquad (15)

For each q_j and its corresponding concept there is an observation probability distribution, namely the interest probability for that concept over all of user u's accesses to q_j. Let the set of access nodes contained in the i-th access sequence be Q_i = {q'_1, ..., q'_f | q' ∈ IC}. Q_{i,j} denotes the set of all access nodes in that sequence from q_j onward, and a further set denotes those nodes of Q_{i,j} that contain the concept:

Q_{i,j} = \begin{cases} \{ q'_{k+l} \mid q'_k = q_j, \ l = 0, \ldots, (f - k) \}, & q_j \in Q_i \\ \text{Null}, & \text{otherwise} \end{cases} \qquad (16)

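The observation probability distribution of user u at q_j is defined as:

Then, among all the possible access sequences of user u, a state sequence with the maximum access probability is found, which establishes the hidden semi-Markov model of user interest behavior:

P_{\max}(\sigma_z^k) = \arg\max \prod P(q_k \to q_{k+1}) \, P(\sigma_z^k \mid q_k) \qquad (18)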

To detect user interest drift, the observation sequence for the HSMM is first collected; here the user's browsing behavior data serve as the observation sequence. The data are preprocessed before model training. After the model parameters have been determined, the HSMM algorithm is invoked to obtain the probability that the user's interest is unchanged, computed as the average log-likelihood. When the user's interest value falls within the normal range, the user's data are added to the training set to update the parameters of the hidden semi-Markov model; otherwise, the user's interest is considered to have drifted. The implementation of drift detection is shown in Fig. 3.

Claims

1. A user interest discovery method incorporating ontology context, characterized in that the method comprises the following steps:

1. Initial probability distribution vector

\pi_i = \frac{\mathrm{Init}(i)}{\sum_{j=1}^{N} \mathrm{Init}(j)}, \quad 1 \le i \le N \qquad (1)

2. State transition probabilities

a_{ij} = \frac{C_{ij}}{\sum_{k=1}^{N} C_{ik}}, \quad 1 \le i, j \le N \qquad (2)

a_{ijk} = \frac{C_{ijk}}{\sum_{u=1}^{N} C_{iju}}, \quad 1 \le i, j, k \le N \qquad (3)

3. Observation emission probabilities

b_j(O_k) = \frac{E_j(O_k)}{\sum_{i=1}^{M} E_j(O_i)}, \quad 1 \le j \le N \qquad (4)

b_{ij}(O_k) = \frac{E_{ij}(O_k)}{\sum_{u=1}^{M} E_{ij}(O_u)}, \quad 1 \le i, j \le N, \ 1 \le k \le M \qquad (5)

4) User interest drift detection based on a hidden semi-Markov model:

P(q_i \to q_j) = P(q_j \mid q_i) = \frac{P(q_i q_j)}{P(q_i)} = \begin{cases} \frac{\theta_1 W_B(q_i, q_j) + \theta_2 W_{HI}(q_i, q_j) + \theta_3 W_{IB}(q_i, q_j) + \theta_4 W_L(q_i, q_j)}{\theta_1 W_B(q_i) + \theta_2 W_{HI}(q_i) + \theta_3 W_{IB}(q_i) + \theta_4 W_L(q_i)}, & i \ne j \\ 0, & i = j \end{cases} \qquad (6)

Q_{i,j} = \begin{cases} \{ q'_{k+l} \mid q'_k = q_j, \ l = 0, \ldots, (f - k) \}, & q_j \in Q_i \\ \text{Null}, & \text{otherwise} \end{cases} \qquad (7)

The observation probability distribution of user u at q_j is defined as:

Then, among all the possible access sequences of user u, a state sequence with the maximum access probability is found, which establishes the hidden semi-Markov model of user interest behavior:

P_{\max}(\sigma_z^k) = \arg\max \prod P(q_k \to q_{k+1}) \, P(\sigma_z^k \mid q_k) \qquad (9)

2. The user interest discovery method incorporating ontology context as claimed in claim 1, characterized in that: in step 1), user personalized information is obtained in two ways: (a) through online surveys in which the user participates directly; (b) by tracking user behavior to obtain the user's interest information, using feature extraction on user behavior data.

3. The user interest discovery method incorporating ontology context as claimed in claim 1 or 2, characterized in that: in step 2), the user's behavior information comprises the user's search keywords, historical purchase records, and historical browsing behavior.

4. The user interest discovery method incorporating ontology context as claimed in claim 1 or 2, characterized in that: in step 3), when the user context ontology is built from the user's interest context information, user context is divided into user individual context, user environment context, and user device context. The ontology takes the form of a hierarchical concept tree in which each node represents one element of the user context, forming the context ontology tree.