CN111988744B

CN111988744B - Position prediction method based on user moving mode

Info

Publication number: CN111988744B
Application number: CN202010898332.1A
Authority: CN
Inventors: 苏畅; 严杨志; 谢显中
Original assignee: Chongqing University of Post and Telecommunications
Current assignee: Chongqing University of Post and Telecommunications
Priority date: 2020-08-31
Filing date: 2020-08-31
Publication date: 2022-04-01
Anticipated expiration: 2040-08-31
Also published as: CN111988744A

Abstract

The invention relates to a position prediction method based on a user movement mode, and belongs to the field of machine learning. The method comprises the following steps: adopting an Apriori algorithm to dig out an individual moving mode of each user and finding out internal factors influencing the sign-in of the user; calculating the similarity between the individual movement modes of the user by using a dynamic time warping algorithm DTW; grouping individual moving modes of users through clustering to obtain a central mode of each group, and finding out external factors influencing check-in; training a Markov model with the individual movement patterns and the overall movement patterns, respectively; training a Markov chain model based on IMP and AMP to predict a next location of the user; considering the influence of external weather, creating a weather general characteristic; calculating the similarity between the weather of the current place and the weather of other places by using a Gaussian kernel function, and correcting a predicted result; setting an evaluation standard and a reference method. The invention enables the predicted result to be more suitable for actual life.

Description

Position prediction method based on user moving mode

Technical Field

The invention belongs to the field of machine learning, and relates to a position prediction method based on a user movement mode.

Background

With the popularization of mobile terminals, human mobile data are more easily obtained, a social network platform based on the position collects a large amount of user check-in data, the research on the human mobile rule becomes a hotspot, the research on the mobile mode of people also becomes possible, and the position prediction is more common. Through position prediction, the mobile preference of the user can be known in advance, the mobile tendency of people flow can also be known, and not only can targeted service be provided for the user, but also benefits are brought to merchants. The existing research mainly analyzes the behavior of a user through the sign-in historical record of the user, finds the movement rule of the user and then predicts the place. Most of the considered factors have time, space, social contact and the like, and mainly aim at the preference of the user and ignore the connection between the positions. In addition, most studies are predicated on the individual movement patterns of the user, and if the user goes to a place that has never been visited, no data is available to train the model; however, the prediction based on the overall data is too coarse-grained, and if the users are currently located at the same place, the final prediction results are all the same place and are not in accordance with the actual situation.

Aiming at the problem that the traditional position prediction model based on the discrete state sequence cannot well predict the position, the invention respectively excavates the individual movement mode and the whole movement mode of the user from the historical sign-in data of the user by considering the association between different positions in the sign-in track of the user, namely, the internal cause and the external cause which influence the sign-in of the user. Wherein, the overall movement mode is mainly embodied by the group behaviors; the individual movement pattern takes into account the dynamic changes of the movement pattern within the individual user, taking into account the personalized location prediction. The time factors are also taken into consideration in the process of mining the movement patterns of the users, and the movement patterns of the users in different time periods in the week are mined aiming at the changes of the user movement patterns in two different time periods, namely a working day and a weekend, so that the actual transfer condition between the spatial positions of the users can be reflected, and the potential time law of the movement of the users is also included, and the predicted result is more suitable for the actual life. In addition, a weather general characteristic is created, and the result is corrected by calculating weather similarity by adopting a Gaussian kernel function.

Disclosure of Invention

In view of the above, the present invention provides a method for predicting a location based on a user movement pattern.

In order to achieve the purpose, the invention provides the following technical scheme:

a method for location prediction based on user movement patterns, the method comprising the steps of:

define the moving pattern MP as: a set of locations sequentially visited by a user in a continuous time; a pattern in which the user frequently moves is referred to as a user's movement pattern, and the movement pattern is denoted as MP ═ l₁,l₂,l₃,…,l_nN is the number of positions contained in the mobile mode of the user;

define the individual mobility pattern IMP as: a sequence of locations frequently occurring in a user's personal historical visit location, for a given user, whose movement pattern is a set of all movement patterns in the historical check-in record;

defining the support degree as: the frequency with which the user's movement pattern appears in its movement trajectory; in the historical check-in record of the user, including multiple movement tracks, the support degree of the movement mode may be calculated as:

define the global mobile mode AMP as: frequently occurring movement patterns in historical check-ins for all users in different groups; calculating the similarity of the user moving modes by adopting a DTW algorithm, then dividing the similarity into a plurality of groups through clustering, and finding a central mode according to the moving modes in the historical access positions of all users in each group to obtain the integral moving mode of all users in the group;

defining the general weather characteristics: combining the rainfall, temperature and wind speed into a new characteristic according to a weighting fusion mode;

s1: adopting an Apriori algorithm to dig out an individual moving mode of each user and finding out internal factors influencing the sign-in of the user;

s2: calculating the similarity between the individual movement modes of the user by using a dynamic time warping algorithm DTW;

s3: grouping individual moving modes of users through clustering to obtain a central mode of each group, namely an integral moving mode AMP, and finding out external factors influencing check-in;

s4: training a Markov model with the individual movement patterns and the overall movement patterns, respectively;

s5: training a Markov chain model based on IMP and AMP, combining probability vectors of the IMP and AMP, and predicting the next position of the user;

s6: considering the influence of external weather, creating a weather general characteristic;

s7: calculating the similarity between the weather of the current place and the weather of other places by using a Gaussian kernel function, and correcting a predicted result;

s8: setting an evaluation standard and a reference method.

Optionally, step S1 specifically includes:

s11: in a given time range, through analyzing Gowalla, finding out a moving mode with the length of 1 in the check-in record of the user;

s12: then finding out the mobile mode with the length of 2 in sequence, then calculating whether the support degree sigma meets the requirement, and circulating until the length of the mobile mode cannot be increased to obtain a candidate mobile mode;

s13: and finding out the movement mode with the support degree meeting the condition from the obtained candidate movement modes to obtain the individual movement mode of the user.

Optionally, in the step, the euclidean distance between the two points is not simply calculated for the similarity between the two movement modes, but the Haversine distance is calculated, and the coordinates of the two points are introduced to obtain the geospatial distance between the actual two points, which is specifically as follows:

wherein:

|M_pl represents the length of the movement pattern, i.e. the number of positions in the pattern; rest (M)_p) Indicating a movement pattern to remove the first position, d (l, l)_i) Representing the true distance between two locations.

Optionally, step S3 specifically includes:

s31: initializing a plurality of classes according to the personal movement mode of a user, and setting a distance threshold tau;

s32: for each movement pattern of each user, calculating the distance between each movement pattern and each class, and selecting the class with the minimum distance;

s33: then, calculating the distance between the moving mode and the class by adopting a DTW algorithm, and if the distance is smaller than a threshold value tau, adding the distance into the class and updating; otherwise, a new class is created for the mobile mode;

s34: and obtaining a clustering result, namely the overall movement pattern of each person.

Optionally, step S4 specifically includes:

s41: after clustering the individual movement patterns of the user, obtaining the overall movement pattern of the user, and combining the obtained overall movement pattern, wherein the next position to be reached is as follows:

s42: based on the personal movement pattern, the next location to go is:

wherein the content of the first and second substances,

is represented by containingMoving pattern of N positions, MP_cA set of classes of movement patterns is represented,

the expression sequence is

The moving mode of (A) appears in the MP_cThe number of times of (1) to (d),

is shown in MP_cIn position l_iThe number of subsequent occurrences.

Optionally, in step S5, for each person, there is a personal movement pattern and a global movement pattern, which are used for training the markov model respectively; finally, a predicted probability vector is obtained; the vector based on the whole is P^AMP＝(l₁,l₂,l₃,…,l_n) Vector based on individual movement pattern is P^IMP＝(l₁,l₂,l₃,…,l_n) Wherein n represents the number of predicted positions; then combining the two obtained results to obtain a final prediction result; the final combination is as follows:

P＝α·P^IMP+(1-α)P^AMP。

optionally, the step S6 includes:

s61: creating a weather Total feature X_weather＝[Temperature,Rain,Windspeed]；

S62: the three kinds of weather of the user check-in place are subjected to weighted summation, the influence of the three kinds of weather on the user check-in place is comprehensively considered, and the total weather characteristic of each user check-in place is obtained, and the total weather characteristic is specifically represented as follows:

X_weather＝ω₁Temperature+ω₂Windspeed+ω₃Rain

wherein, the weight of rainfall is calculated as follows:

refers to one of the places l where the user checks in_iThe total number of user check-ins in a given rainfall interval,

the total number of days of the rainfall interval in the corresponding time period; the weighting calculations for wind speed and temperature are also consistent.

Optionally, in the step S7, after calculating the weather preference of the user through the created weather total feature, calculating the current location X of the user by using a gaussian kernel function_lTo other locations

The similarity of weather is obtained to obtain a final prediction result; the specific calculation is as follows:

wherein, X_lIndicating the weather conditions of the location where the user is currently located,

the weather conditions of other locations.

Optionally, the step S8 includes:

s81: taking the Accuracy and the APR of the site prediction as evaluation standards of the experiment;

s82: accuracy: the index defines the proportion of the correct prediction place to the total prediction place in the prediction result list of the user; when the prediction result is consistent with the actual result, p (l) is 1;

s83: average percent ranking APR: the prediction problem also has a certain relation with the sequence, user u_iSign-in location l_jIn the prediction list PR is defined as:

obtaining APR values of all users by taking the average value of the sum of the PR values, wherein the larger the value is, the better the prediction effect is; the formula is as follows:

s84: in order to verify the effectiveness of the proposed location prediction method based on the user movement pattern, the following models are selected for comparison with the proposed model:

NextPlace: the method is a classical position prediction method, and is used for predicting user behaviors based on nonlinear time series analysis of arrival time and predicting by using similarity of time series;

SimPreT: associating the historical pattern with the current user trajectory, and determining the next position of the user by using pattern similarity;

HMM-based: the model simultaneously considers non-Gaussian and spatiotemporal characteristics in actual human check-in data by constructing a hybrid Markov model.

The invention has the beneficial effects that: according to the method, the relevance among different positions in the user sign-in track is considered, and the individual movement mode and the overall movement mode of the user are respectively mined from the historical sign-in data of the user, namely, the internal cause and the external cause of the user sign-in are influenced. Wherein, the overall movement mode is mainly embodied by the group behaviors; the individual movement pattern takes into account the dynamic changes of the movement pattern within the individual user, taking into account the personalized location prediction. The time factors are also taken into consideration in the process of mining the movement patterns of the users, and the movement patterns of the users in different time periods in the week are mined aiming at the changes of the user movement patterns in two different time periods, namely a working day and a weekend, so that the actual transfer condition between the spatial positions of the users can be reflected, and the potential time law of the movement of the users is also included, and the predicted result is more suitable for the actual life. In addition, a weather general characteristic is created, and the result is corrected by calculating weather similarity by adopting a Gaussian kernel function.

Additional advantages, objects, and features of the invention will be set forth in part in the description which follows and in part will become apparent to those having ordinary skill in the art upon examination of the following or may be learned from practice of the invention. The objectives and other advantages of the invention may be realized and attained by the means of the instrumentalities and combinations particularly pointed out hereinafter.

Drawings

For the purposes of promoting a better understanding of the objects, aspects and advantages of the invention, reference will now be made to the following detailed description taken in conjunction with the accompanying drawings in which:

FIG. 1 is a general flow diagram of the present invention;

FIG. 2 is check-in location information in a data set;

FIG. 3 is a comparison of prediction accuracy for different movement patterns;

FIG. 4 is a comparison of the accuracy of the model in two cities;

FIG. 5 is a comparison of model APR values.

Detailed Description

The embodiments of the present invention are described below with reference to specific embodiments, and other advantages and effects of the present invention will be easily understood by those skilled in the art from the disclosure of the present specification. The invention is capable of other and different embodiments and of being practiced or of being carried out in various ways, and its several details are capable of modification in various respects, all without departing from the spirit and scope of the present invention. It should be noted that the drawings provided in the following embodiments are only for illustrating the basic idea of the present invention in a schematic way, and the features in the following embodiments and examples may be combined with each other without conflict.

Wherein the showings are for the purpose of illustrating the invention only and not for the purpose of limiting the same, and in which there is shown by way of illustration only and not in the drawings in which there is no intention to limit the invention thereto; to better illustrate the embodiments of the present invention, some parts of the drawings may be omitted, enlarged or reduced, and do not represent the size of an actual product; it will be understood by those skilled in the art that certain well-known structures in the drawings and descriptions thereof may be omitted.

The same or similar reference numerals in the drawings of the embodiments of the present invention correspond to the same or similar components; in the description of the present invention, it should be understood that if there is an orientation or positional relationship indicated by terms such as "upper", "lower", "left", "right", "front", "rear", etc., based on the orientation or positional relationship shown in the drawings, it is only for convenience of description and simplification of description, but it is not an indication or suggestion that the referred device or element must have a specific orientation, be constructed in a specific orientation, and be operated, and therefore, the terms describing the positional relationship in the drawings are only used for illustrative purposes, and are not to be construed as limiting the present invention, and the specific meaning of the terms may be understood by those skilled in the art according to specific situations.

The invention relates to a position prediction method based on a user moving mode. Firstly, excavating a moving mode of each user; then clustering is carried out according to the similarity by utilizing a DTW algorithm to obtain an integral moving mode; next, a markov model is trained using the individual movement pattern and the entire movement pattern, respectively, to predict the position. Finally, the total weather characteristics are created, and the result is corrected by calculating weather similarity through a Gaussian kernel function.

In order to enable a more concise and clear description, some noun definitions are explained:

define the Movement Pattern (MP) as: a set of locations that a user visits in a sequential temporal order. A pattern in which the user frequently moves is referred to as a movement pattern of the user, and the movement pattern may be expressed as MP ═ l₁,l₂,l₃,…,l_nN is the movement of the userNumber of positions contained in the pattern.

Define an Individual Movement Pattern (IMP) as: the user's individual historical visit locations frequently appear in a sequence of locations whose movement patterns are a set of all movement patterns in the historical check-in record for a given user.

Defining the support degree as: the frequency with which the user's movement pattern appears in its movement trajectory. In the historical check-in record of the user, including multiple movement tracks, the support degree of the movement mode may be calculated as:

define the global movement pattern (AMP) as: all users of different groups have a pattern of movements that frequently occurs in historical check-ins. And calculating the similarity of the movement patterns of the users by adopting a DTW algorithm, then dividing the similarity into a plurality of groups through clustering, and finding a central pattern according to the movement patterns in the historical access positions of all the users in each group to obtain the integral movement pattern of all the users in the group.

Defining the general weather characteristics: the rainfall, temperature and wind speed weather features are combined into a new feature according to a weighting fusion mode.

As shown in fig. 1, the present invention is divided into the following steps:

s1: and excavating the individual movement mode of each user by adopting an Apriori algorithm, and finding out internal factors influencing the sign-in of the user.

S2: the similarity between individual movement patterns of the user is calculated using a Dynamic Time Warping (DTW) algorithm.

S3: and grouping the individual movement patterns of the users through clustering to obtain a central pattern of each group, namely an overall movement pattern (AMP), and finding out external factors influencing check-in.

S4: the markov model is trained with individual and global movement patterns, respectively.

S5: the Markov chain model is trained based on IMP and AMP, and the probability vectors of the two are combined to predict the next position of the user.

S6: the total weather characteristics are created taking into account the influence of the extrinsic weather.

S7: and calculating the similarity between the weather of the current place and the weather of other places by using the Gaussian kernel function, and correcting the predicted result.

S8: setting an evaluation standard and a reference method.

In step S1, an Apriori algorithm is used to mine the movement pattern of the user, where the Apriori algorithm is a data mining algorithm based on association rules and is subsequently applied to mine the movement pattern of the user. A time factor is added in the excavation process of the mobile mode, so that the excavated user mobile mode has a time rule, and the change situation of the user mobile mode along with the time can be known. First, in a given time range (weekday, weekend), Gowalla is analyzed to find out a moving pattern with length of 1 in the check-in record of the user, and the check-in information is shown in FIG. 2. And then finding out the moving mode with the length of 2 in sequence, calculating whether the support degree sigma meets the requirement, and circulating until the length of the moving mode cannot be increased, thus obtaining the candidate moving mode. And finding out the movement mode with the support degree meeting the condition from the obtained candidate movement modes to obtain the individual movement mode of the user.

In step S2, the similarity between two movement patterns is calculated by not simply calculating the euclidean distance between two points, but calculating the Haversine distance, and the geographical distance between two points can be obtained by introducing the coordinates of the two points, so that the calculated distance is more accurate. The method comprises the following specific steps:

wherein:

In step S3, the user' S individual movement pattern is obtained in step S1, and a plurality of classes are initialized and the distance threshold τ is set. For each movement pattern of each user, its distance from each class is calculated and the class with the smallest distance is selected. Then, the distance between the moving pattern and the class is calculated by adopting a DTW algorithm, if the distance is smaller than a threshold value tau, the distance is added into the class and updated, and otherwise, a new class is created for the moving pattern. And finally, obtaining a clustering result, namely the overall movement mode of each person.

In step S4, after clustering the personal movement patterns of the user according to step S3, we obtain the movement patterns of the whole user, and the next position to be reached by combining the obtained whole movement patterns is:

based on the personal mobile mode, the next location to go is:

wherein the content of the first and second substances,

representing a moving pattern, MP, having N positions_cA set of classes of movement patterns is represented,

the expression sequence is

The moving mode of (A) appears in the MP_cThe number of times of (1) to (d),

is shown in MP_cIn position l_iThe number of subsequent occurrences.

In step S5, for each individual, there is an individual movement pattern and a global movement pattern, and each of these movement patterns is used for training a markov model. A predicted probability vector is finally obtained. The vector based on the whole is P^AMP＝(l₁,l₂,l₃,…,l_n) Vector based on individual movement pattern is P^IMP＝(l₁,l₂,l₃,…,l_n) Wherein n represents the number of predicted positions; the two results are then combined to obtain the final predicted result. The final combination is therefore as follows:

P＝α·P^IMP+(1-α)P^AMP

in step S6, the weather total feature X is first created_weather＝[Temperature,Rain,Windspeed]. Then, carrying out weighted summation on the three kinds of weather of the user check-in place, and comprehensively considering the influence of the three kinds of weather on the user check-in place to obtain the total weather characteristics of each user check-in place, wherein the total weather characteristics are specifically represented as follows:

X_weather＝ω₁Temperature+ω₂Windspeed+ω₃Rain

wherein, the weight of rainfall is calculated as follows:

correspond toThe total number of days that the interval of rainfall occurred within the time period. The weights for wind speed and temperature are similarly calculated.

Wherein, in step S7, after calculating the weather preference of the user according to S6 by the created weather general characteristics, we adopt Gaussian kernel function to calculate the current location of the user (X)_l) To other locations

And (5) obtaining a final prediction result according to the similarity of the weather. The specific calculation is as follows:

the weather conditions of other locations.

In step S8, check-in records of two cities of the Gowalla dataset in one year, which are data of London (LON) and Los Angeles (LA), respectively, are selected as the dataset for the test. Dividing experimental data into a test set, a training set and a verification set, training the training set, then verifying the training set on the verification set, and finally testing the test set. The information of the data set is shown in fig. 2:

accuracy and APR for site prediction were used as evaluation criteria for the experiment. The definition is as follows:

accuracy: the index defines the proportion of the correctly predicted location to the total predicted location in the prediction result list of the user. When the prediction result coincides with reality, p (l) becomes 1.

Average Percent Ranking (APR): the prediction problem also has a certain relation with the sequence, user u_iSign-in location l_jIn advance ofThe measurement list PR is defined as:

and taking the average value of the sum of the PR values to obtain the APR values of all users, wherein the larger the value is, the better the prediction effect is. The formula is as follows:

secondly, in order to verify the effectiveness of the proposed location prediction method based on the user movement pattern, the following models are selected for comparison with the proposed model:

NextPlace: the method is a classical position prediction method, and predicts the user behavior based on the nonlinear time series analysis of the arrival time, and predicts by using the similarity of the time series.

SimPreT: historical patterns are associated with the current user trajectory, and pattern similarity is used to determine the next location of the user.

HMM-based (Hybrid Markov Model-based): the model simultaneously considers non-Gaussian and spatiotemporal characteristics in actual human check-in data by constructing a hybrid Markov model.

Fig. 3 shows the results of the comparison of the predicted performance of the Individual Movement Pattern (IMP) and the overall movement pattern (AMP). From the figure we can see that the predicted performance gradually decreases with increasing position when only the overall movement pattern is considered; the method based on the individual activity mode obviously encounters the cold start problem in the early stage, the prediction accuracy is low, but the prediction accuracy is greatly improved along with the accumulation of the individual historical activity information. Especially in the later stages, it performs better than methods based on individual movement patterns and overall activity patterns. Despite this, we still consider both individual and overall movement patterns in our approach. It is believed that preserving the user's overall pattern makes our prediction method more robust and can handle some cases that cannot be predicted by relying only on personal movement patterns.

The accuracy ratio of the proposed model to other models is shown in fig. 4. It can be seen from the figure that the accuracy of the model proposed by the invention is much higher than that of NextPlace and SimPreT algorithm, and is slightly higher than that of HMM-based algorithm. The data in LA data set are respectively increased by 15%, 5% and 2.1%; the improvement in the LON data set was 14%, 6.5%, 4.2%, respectively. The illustration of considering both the user's individual and overall movement patterns, as well as weather factors, helps to improve the predicted outcome.

As shown in FIG. 5, in LA, the performance of the APR of the proposed model is better than that of other models, the APR of the model is improved by 19% compared with NextPlace, 7% compared with SimPret model and 4% compared with HMM-based model, and meanwhile, on LON data set, the model of the invention is improved by 18% compared with NextPlace, 9.2% compared with SimPret model and 5% compared with HMM-based model.

Finally, the above embodiments are only intended to illustrate the technical solutions of the present invention and not to limit the present invention, and although the present invention has been described in detail with reference to the preferred embodiments, it will be understood by those skilled in the art that modifications or equivalent substitutions may be made on the technical solutions of the present invention without departing from the spirit and scope of the technical solutions, and all of them should be covered by the claims of the present invention.

Claims

1. A position prediction method based on a user moving mode is characterized in that: the method comprises the following steps:

defining the support degree as: the frequency with which the user's movement pattern appears in its movement trajectory; in the historical check-in record of the user, the history check-in record includes a plurality of movement tracks, and then the support degree of the movement mode is calculated as follows:

define the global mobile mode AMP as: frequently occurring movement patterns in historical check-ins for all users in different groups; calculating the similarity of the user moving modes by adopting a DTW algorithm, then dividing the user moving modes into a plurality of groups through clustering, and finding a central mode according to the moving modes in the historical access positions of all users in each group to obtain the integral moving mode of all users in the group;

s8: setting evaluation Standard and reference method'

The step S1 specifically includes:

s13: finding out a mobile mode with the support degree meeting the condition from the obtained candidate mobile modes to obtain an individual mobile mode of the user;

in the step, the Euclidean distance between two points is not simply calculated for the similarity of two moving modes, but the Haversine distance is calculated, and the coordinates of the two points are transmitted to obtain the geographic space distance between the two actual points, which is specifically as follows:

wherein:

2. The method of claim 1, wherein the location prediction based on the user's moving pattern is: the step S3 specifically includes:

3. The method of claim 2, wherein the location prediction based on the user's moving pattern is: the step S4 specifically includes:

s42: based on the personal movement pattern, the next location to go is:

wherein the content of the first and second substances,

the expression sequence is

The moving mode of (A) appears in the MP_cThe number of times of (1) to (d),

is shown in MP_cIn position l_iThe number of subsequent occurrences.

4. A method according to claim 3, wherein the method comprises: the step S5 includes, for each person, a personal movement pattern and a whole movement pattern, and each of the movement patterns is used for training a markov model; finally, a predicted probability vector is obtained; the vector based on the whole is P^AMP＝(l₁,l₂,l₃,…,l_n) Vector based on individual movement pattern is P^IMP＝(l₁,l₂,l₃,…,l_n) Wherein n represents the number of predicted positions; then combining the two obtained results to obtain a final prediction result; the final combination is as follows:

P＝α·P^IMP+(1-α)P^AMP。

5. the method of claim 4, wherein the location prediction based on the user's moving pattern is: the step S6 includes:

X_weather＝ω₁Rain+ω₂Windspeed+ω₃Temperature

wherein, the weight of rainfall is calculated as follows:

6. The method of claim 5, wherein the location prediction based on the user's moving pattern is: in the step S7, after calculating the weather preference of the user through the created weather total characteristics, the current location X of the user is calculated by using the gaussian kernel function_lTo other locations

the weather conditions of other locations.

7. The method of claim 6, wherein the location prediction based on the user's moving pattern is: the step S8 includes:

s83: average hundredRanking by ratio APR: the prediction problem also has a certain relation with the sequence, user u_iSign-in location l_jIn the prediction list PR is defined as: