CN107273913B

CN107273913B - Short text similarity calculation method based on multi-feature fusion

Info

Publication number: CN107273913B
Application number: CN201710328364.6A
Authority: CN
Inventors: 高曙; 周润; 王讷; 龚磊
Original assignee: Wuhan University of Technology WUT
Current assignee: Wuhan University of Technology WUT
Priority date: 2017-05-11
Filing date: 2017-05-11
Publication date: 2020-04-21
Anticipated expiration: 2037-05-11
Also published as: CN107273913A

Abstract

The invention discloses a short text similarity calculation method based on multi-feature fusion, which comprises the following steps: firstly, an HTI method is designed to extract word frequency characteristics of a short text, secondly, the existing Skip _ gram training model of word2vec is utilized to extract grammatical characteristics of the short text, then, an HSBM model is designed to organically fuse the word frequency and the grammatical characteristics in semantic dimension, and finally, an MFSM model is designed to calculate to vectorize the fusion result and calculate the similarity between the short texts. The method extracts the features of the short text from multiple dimensions, so that the calculation accuracy of the similarity of the short text can be effectively improved.

Description

Short text similarity calculation method based on multi-feature fusion

Technical Field

The invention relates to a natural language processing technology, in particular to a short text similarity calculation method based on multi-feature fusion.

Background

The space vector model (VSM) converts the feature terms in the short text into a numerical form that can be recognized by a computer and reflects the importance degree of the feature terms in the short text to a certain extent.

The feature extraction based on word frequency is a process of selecting a feature term set which can reflect the features of the short text most in the original term set according to the calculation of a specific feature evaluation function. The word frequency-inverse document frequency (TF-IDF) and Mutual Information (MI) are two common word frequency feature extraction methods. The concept of Information Entropy (IE) is derived from statistical thermodynamics and is used for measuring the chaos degree of a system, and the concept is not directly used for feature extraction of texts but is often fused into other short text word frequency feature extraction methods.

The characteristic extraction based on grammar can be directly investigated from the context environment of words by utilizing a language model so as to extract the grammatical characteristic of the short text; and modeling the distribution of subsequent words in the short text under a given context condition by using a neural network, namely extracting grammatical features of the short text by using a deep learning method. The Skip _ gram training model of word2vec is an implementation of a Neural Network Language Model (NNLM), omits a nonlinear hidden layer of the NNLM, quickly improves the prediction process of words by sacrificing the training precision, and compensates the training precision by increasing the training corpus, so that the training model can effectively and quickly generate word vectors. The Skip _ gram training model predicts the probability of the context generation through the current word to obtain the feature words with different probabilities, thereby keeping the grammatical relation among the feature words.

The word pair topic model (BTM) is a relatively common short text semantic feature extraction model, and is a perfect combination of a unary mixed model and a topic model: first, to solve the data sparseness problem, BTMs combine the advantages of unary mixture models: all short texts share a topic distribution; then, in order to eliminate the defect that each short text only has one theme, the BTM models the co-occurrence word pair on the whole corpus; and finally, mapping the short text to a corresponding semantic space (or a theme space), thereby analyzing and judging the short text semantics. If the method is described by a mathematical language, the theme represents the conditional probability distribution of the feature words in the feature word set, and the conditional probability value of the feature words reflects the degree of closeness of the relationship between the feature words and the theme.

The short text similarity calculation can be defined as: for a given short text set, on the basis of researching a short text structure, various short text features (such as word frequency, grammar and semantic features) are extracted and quantized, so that the same points and different points among the short texts are reflected by data, the more the same points are, the higher the similarity is, and conversely, the lower the similarity is. The JS distance is a commonly used short text similarity calculation method, is suitable for the situation that short text features are presented in a probability form, can reflect the difference situation of two probability distributions in the same probability space, is based on the KL distance, and improves the results of the KL distance to be not satisfied with the defects of nonnegativity, symmetry and the like.

Short text similarity calculation is a difficult point and a hot point in the fields of Natural Language Processing (NLP) and machine learning, is an important task in NLP, can be used as a separate task, and can be used as the basis of other NLP applications. In the field of short text similarity calculation, students mostly prefer to extract single-dimensional features of word frequency or semantics, and few short text features across dimensions are extracted and fused, so that the obtained features are one-sided and incomplete, and the similarity precision obtained by using the features is not high. In addition, in the aspect of word frequency dimension feature combination, most of the current researches are combined in a feature pool or two-dimensional feature space mode, and deep integration is lacked; in the aspect of semantic dimension feature extraction, the current research direction generally applies BTM directly on the original short text set, that is, feature extraction is performed on information by directly using rich words in the original short text set, which may amplify adverse effects caused by noise features.

Disclosure of Invention

The technical problem to be solved by the invention is to provide a short text similarity calculation method based on multi-feature fusion aiming at the defects in the prior art.

The technical scheme adopted by the invention for solving the technical problems is as follows: a short text similarity calculation method based on multi-feature fusion comprises the following steps:

(1) acquiring the text quantity | M | and the topic quantity | K | in the short text set D to be calculated, and acquiring a short text-topic matrix S through a three-dimensional feature extraction and fusion process (namely process (A)) of word frequency, grammar and semantics;

1.1) extracting the word frequency characteristics of the short texts from the short text set D;

1.2) extracting short text grammatical features of the short text set D;

1.3) short text semantic fusion:

firstly, acquiring a weight matrix W' with fused word frequency and grammatical features; then, modeling by using BTM on a co-occurrence binary pair consisting of ternary elements < feature item t, short text d and fusion weight W '> in W'; finally, calculating to obtain the theme distribution probability of the short text, and obtaining a semantic fusion result short text-theme matrix S of the short text;

(2) converting the short text-theme matrix S into a short text vector set Z according to a formula (10), and initializing a similarity calculation result matrix Y;

(3) non-repeated selection of short text vectors d in set Z₁If the set Z has no selectable short text, turning to the step (6);

(4) non-repeated selection of short text vectors d in set Z₂If the set Z has no selectable short text, turning to the step (3);

(5) computing short text d₁And d₂Recording the result into a similarity result matrix Y according to the similarity between the two groups, and turning to the step (4);

(6) and obtaining a short text similarity result matrix Y.

According to the scheme, the text quantity in the short text set D is set to be | M |, the quantity of the non-repetitive characteristic items in the dictionary is set to be | N |, and a weight matrix is calculated by adopting an HTI method in the step 1.1), and the specific steps are as follows:

1.1.1): initializing the values of the characteristic item index i and the short text index j to be 0, and initializing a weight matrix W to be a zero matrix;

1.1.2): statistically calculating the feature term t_iIn short text d_jFrequency of occurrence in and assigning to TF (t)_i,d_j)；

1.1.3): calculating local factors of the characteristic items, wherein the adopted calculation formula is as follows:

localT(t_i,d_j)＝log(TF(t_i,d_j)+β) (1)

wherein, TF (t)_i,d_j) Representing a feature item t_iIn short text d_jβ is a constant factor (typically taken to be an empirical value of 1).

1.1.4): computing a feature term t_iAnd short text d_jThe adopted calculation formula is as follows:

wherein，P(t_i,d_j) Representing a feature item t_iAnd short text d_jProbability of co-occurrence, P (t)_i) Representing a feature item t_iProbability of occurrence in short text sets, P (d)_j) Representing short text d_jProbability of occurrence in a short text set.

1.1.5): calculating the global factor of the feature item, wherein the adopted calculation formula is as follows:

where n is the total number of short texts, C (t)_i,d_j) Representing a feature item t_iAnd short text d_jα is a constant factor (typically taken to be an empirical value of 1).

1.1.6): computing feature item-short text pairs (t)_i,d_j) And assigning to W_ijHTI weight calculation formula:

HTI(t_i,d_j)＝localT(t_i,d_j)×globalT(t_i,d_j) (4)

wherein localT (t)_i,d_j) Representing a local factor of a feature item, globalT (t)_i,d_j) Representing a feature item global factor;

1.1.7): for each feature item-short text pair (t)_i,d_j) And repeating the operations from 1.1.2) to 1.1.6) to obtain an HTI weight matrix W of the short text set D.

According to the scheme, in the step 1.2), short text grammatical feature extraction is that a short text set D is trained by using a Skip _ gram model of word2vec to obtain a word vector set X:

X＝(x₁,x₂,...,x_i) (5)

wherein x is_iRepresenting a feature item t_iThe word vector of (2).

According to the scheme, the short text semantic fusion in the step 1.3) comprises the following specific steps:

1.3.1): each word vector X of the set of word vectors X obtained according to step 1.2)_iAnd calculating a word vector normalization factor:

where m denotes the dimension of a predetermined word vector and k denotes the word vector x_iThe value of the k-th dimension.

1.3.2): for each ternary element < feature t, text d, HTI weight W > in the HTI weight matrix W, a weight normalization factor is calculated:

wherein, HTI (t)_i,d_j) Representing short text d_jMiddle characteristic term t_iHTI weight w.

1.3.3): and calculating fusion weight by using the word vector normalization factor and the weight normalization factor, and replacing the HTI weight W of each ternary element in the matrix W by using the fusion weight to obtain a new word frequency and grammar fusion weight matrix W'. The fusion weight calculation formula:

NL(t_i,d_j)＝F(t_i,d_j)×G(i) (8)

1.3.4): generating corpus B (or called co-occurrence binary pair set B) by BTM on fusion weight matrix W')

1.3.5): for each co-occurrence binary pair B in set B ═ (c)_i,c_j) Randomly initializing a theme, wherein the initialization iteration number i is 0;

1.3.6): each co-occurring binary pair B in pair set B ═ c_i,c_j) Calculating the state transition probability:

1.3.7): step 1.3.6) is repeated while updating the frequency n in the state transition probability formula (9)_s、

And

until reaching the upper limit of iteration times;

1.3.8): calculating topic distribution theta of whole short text set by BTM_sAnd the distribution of the ternary element c under a specific topic_c|sThus, the topic probability distribution of each short text is obtained, namely, the short text-topic matrix S is obtained.

According to the scheme, the short text vector in the step 2) is calculated as follows:

after the short text set D passes through the HSBM model, a short text-topic distribution matrix S is obtained, each element in the S is a conditional probability, and each column in the matrix S is converted into a vector form of a short text:

d_i＝(P(s₁|d_i),P(s₂|d_i),P(s₃|d_i),...,P(s_|K||d_i)) (10)

wherein, P(s)_i|d_i) Representing short text d_iIs assigned to a topic s_iA conditional probability value, | K | represents the number of topics;

based on equation (10), the short text-topic distribution matrix S is converted into a set of short text vectors Z.

According to the scheme, the short text d is calculated in the step 6)₁And d₂The similarity between them is given by the following formula:

the calculation formula of the KL distance and the JS distance is as follows:

wherein d is₁、d₂As a probability distribution vector for short text, d₁(k)、d₂(k) Respectively representing probability distribution vectors d₁、d₂The k-th probability.

The invention has the following beneficial effects:

(1) the method is based on analyzing TF-IDF and mutual information two word frequency feature extraction methods, combines the concept of information entropy to effectively fuse the two word frequency feature extraction methods, provides a short text word frequency feature extraction method HTI, and realizes deep integration of multiple word frequency dimensionality features.

(2) According to the method, a semantic feature extraction model HSBM of a short text is constructed based on BTM, a word pair generation process in a short text corpus is not modeled directly, a short text-feature word fusion weight matrix W ' is obtained first, and then a co-occurrence binary pair formed by ternary elements < feature item t, text d and fusion weight W ' > in W ' is modeled, so that adverse effects caused by noise features are removed to a certain extent.

(3) The method extracts features from multiple dimensions of word frequency, grammar and semantics, and effectively improves the calculation accuracy of the similarity of the short text.

Drawings

The invention will be further described with reference to the accompanying drawings and examples, in which:

FIG. 1 is a model structure diagram of an HSBM of an embodiment of the present invention;

fig. 2 is a model structural diagram of an MFSM of an embodiment of the present invention;

fig. 3 is a flowchart of a short text similarity calculation method based on multi-feature fusion according to an embodiment of the present invention.

Detailed Description

In order to make the objects, technical solutions and advantages of the present invention more apparent, the present invention is further described in detail with reference to the following embodiments. It should be understood that the specific embodiments described herein are merely illustrative of the invention and are not intended to limit the invention.

As shown in FIG. 1, FIG. 1 is a Model structure diagram of HSBM (HTI-Skip _ gram-BTM fusion Model), in which the parameters are described as follows:

in process (I), rounded rectangles (e.g., "HTI") represent feature extraction methods or models, and hexagons represent short text collections; the circles represent the weight matrix: w is an HTI weight matrix obtained by an HTI method, X is a feature word vector set obtained by a Skip _ gram training model, and W' is a short text-feature word fusion weight matrix obtained by carrying out normalization operation (NL) on the HTI weight matrix W and the feature word vector set X; | M | represents the total text quantity in the short text set, and | N | represents the quantity of the feature terms.

The transparent circle (e.g., "θ") in Process (II) represents an implicit parameter, the shaded circle (e.g., "c)_i") represents variables that can be directly obtained by sampling, α is a hyper-parameter for the Dirichlet distribution theta, β is the Dirichlet distribution theta

S is the underlying topic distribution, c is the ternary element in the fusion weight matrix W<Feature item t, text d, fusion weight w'>，(c_i,c_j) Is a co-occurrence binary pair consisting of ternary elements c; | K | represents the number of topics and | B | represents the number of co-occurrence binary pairs.

As shown in fig. 3, the basic steps of the method of the present invention are as follows:

if the number of texts in the short text set D is | M |, the number of topics is | K |

(1) Obtaining a short text-topic matrix S through a three-dimensional feature extraction and fusion process (namely, process (A)) of word frequency, grammar and semantics;

(5) calculating short text d according to formula (11) and formula (12)₁And d₂Recording the result into a similarity result matrix Y according to the similarity between the two groups, and turning to the step (4);

(6) and obtaining a short text similarity result matrix Y.

Three-dimensional feature extraction and fusion process of word frequency, grammar and semantics

The three-dimensional feature extraction and fusion process of word frequency, grammar and semantics is mainly realized by using the HSBM model designed by the patent, and the three-dimensional feature extraction and fusion process mainly comprises the following basic steps: firstly, short text features are respectively extracted from two dimensions of word frequency and grammar, and then organic fusion is carried out in semantic dimension. Therefore, the implementation process is divided into three phases: extracting short text word frequency characteristics; short text grammatical feature extraction; and a short text semantic fusion stage. These three stages will be described separately below.

Short text word frequency characteristic extraction stage

The method is mainly realized by using an HTI (Hybrid TF-IDF) method designed by the patent at the short text word frequency feature extraction stage, improves the TF-IDF by using the concepts of MI and IE, retains the important function of the TF on short text feature extraction, and optimizes the structure of the IDF to more accurately reflect the distribution condition and the importance degree of feature words in all short texts, thereby more effectively adjusting the weight of the feature words and improving the precision of similarity calculation.

If the number of texts in the short text set D is | M |, and the number of unrepeated feature items in the dictionary is | N |, the basic steps of calculating the weight matrix by the HTI method are as follows:

the first step is as follows: initializing the values of the characteristic item index i and the short text index j to be 0, and initializing a weight matrix W to be a zero matrix;

the second step is that: statistically calculating the feature term t_iIn short text d_jFrequency of occurrence in and assigning to TF (t)_i,d_j)；

The third step: calculating a local factor of the characteristic item, wherein the calculation formula is as follows:

localT(t_i,d_j)＝log(TF(t_i,d_j)+β) (1)

The fourth step: computing a feature term t_iAnd short text d_jThe correlation factor is calculated by the formula:

wherein, P (t)_i,d_j) Representing a feature item t_iAnd short text d_jProbability of co-occurrence, P (t)_i) Representing a feature item t_iProbability of occurrence in short text sets, P (d)_j) Representing short text d_jProbability of occurrence in a short text set.

The fifth step: calculating a global factor of the feature item, wherein the calculation formula is as follows:

And a sixth step: computing feature item-short text pairs (t)_i,d_j) And assigning to W_ijHTI weight calculation formula:

HTI(t_i,d_j)＝localT(t_i,d_j)×globalT(t_i,d_j) (4)

wherein localT (t)_i,d_j) Representing a local factor of a feature item, globalT (t)_i,d_j) Representing a feature item global factor.

The seventh step: for each feature item-short text pair (t)_i,d_j) And repeating the operations of the second step and the sixth step to obtain an HTI weight matrix W of the short text set D.

Short text grammatical feature extraction stage

In the short text grammatical feature extraction stage, a short text set D is trained by mainly utilizing a Skip _ gram model of word2vec to obtain a word vector set X:

X＝(x₁,x₂,...,x_i) (5)

wherein x is_iRepresenting a feature item t_iThe word vector of (2).

Short text semantic fusion phase

In the HSBM model, the implementation of the short text semantic fusion stage is as follows: firstly, acquiring a weight matrix W' with fused word frequency and grammatical features; then, modeling by using BTM on a co-occurrence binary pair consisting of ternary elements < feature item t, short text d and fusion weight W '> in W'; and finally, calculating to obtain the topic distribution probability of the short text, and obtaining the semantic fusion result of the short text. The specific steps of this stage (in fig. 1, process (I) includes steps one to three, and process (II) includes steps four to eight):

the first step is as follows: for each word vector X of the set of word vectors X in equation (5)_iAnd calculating a word vector normalization factor:

The second step is that: for each ternary element < feature t, text d, HTI weight W > in the HTI weight matrix W, a weight normalization factor is calculated:

The third step: and calculating fusion weight by using the word vector normalization factor and the weight normalization factor, and replacing the HTI weight W of each ternary element in the matrix W by using the fusion weight to obtain a new word frequency and grammar fusion weight matrix W'. The fusion weight calculation formula:

NL(t_i,d_j)＝F(t_i,d_j)×G(i) (8)

the fourth step: generating corpus B (or called co-occurrence binary pair set B) by BTM on fusion weight matrix W')

The fifth step: for each co-occurrence binary pair B in set B ═ (c)_i,c_j) Randomly initializing a theme, wherein the initialization iteration number i is 0;

and a sixth step: each co-occurring binary pair B in pair set B ═ c_i,c_j) Calculating the state transition probability:

the seventh step: repeating the step six, and simultaneously updating the frequency n in the state transition probability formula (9)_s、

And

until reaching the upper limit of iteration times;

eighth step: calculating topic distribution theta of whole short text set by BTM_sAnd the distribution of the ternary element c under a specific topic_c|sAnd then, the topic probability distribution of each short text is obtained, namely, a short text-topic matrix S is obtained.

Implementation of short text similarity calculation method based on multi-feature fusion

The basic idea of the short text similarity calculation method based on multi-feature fusion is as follows: firstly, short text features are extracted from word frequency, grammar and semantic dimensions respectively, then the short text features are organically fused, and a fusion result is quantized, so that the similarity between short texts is calculated. The method is mainly implemented by using an MFSM Model designed by the present patent, as shown in fig. 2, fig. 2 is a Model structure diagram of an MFSM (Multi-Feature based Similarity-calculation Model), where S is a short text-topic distribution matrix obtained by a short text set through an HSBM Model, Z is a short text vector set, Y is a short text Similarity result matrix, | M | represents a text quantity in the short text set, | K | represents a topic quantity, and JS represents a Similarity calculation method (that is, JS distance) for processing the short text vector set Z. The specific implementation of the method is mainly divided into 3 processes: (A) extracting and fusing three-dimensional characteristics of word frequency, grammar and semantics; (B) calculating a short text vector; (C) and calculating the similarity of the short texts. Wherein process (a) has been carried out as hereinbefore described. The following first describes process (B) and process (C), and then the basic steps of the method are described.

Calculation of short text vectors

d_i＝(P(s₁|d_i),P(s₂|d_i),P(s₃|d_i),...,P(s_|K||d_i)) (10)

wherein, P(s)_i|d_i) Representing short text d_iIs assigned to a topic s_iThe conditional probability value of, | K | represents the number of topics.

Obviously, the short text has been mapped to the corresponding semantic space (i.e., topic space). Based on formula (10), the short text-topic distribution matrix S is converted into a short text vector set Z (i.e., process (B) in fig. 2), which can be used as an input for the short text similarity calculation in process (C).

Calculation of short text similarity

Because each short text vector in the short text vector set Z is presented in a probability form, the method utilizes the JS distance to calculate the similarity between the short texts, and the similarity is based on the KL distance, and the result of improving the KL distance does not meet the defects of nonnegativity, symmetry and the like. The calculation formula of the KL distance and the JS distance is as follows:

At present, short text features used for short text similarity calculation are single in dimensionality, most of the short text features are biased to extracting single-dimensionality features of word frequency or semantics, and cross-dimensionality short text features are rarely extracted and fused, so that the obtained features are one-sided and incomplete, and the similarity precision obtained by using the features is not high. The patent provides a short text similarity calculation method based on multi-feature fusion, which comprises the steps of firstly, designing an HTI method to extract word frequency features of short texts, secondly, extracting grammatical features of the short texts by using an existing Skip _ gram training model of word2vec, then designing an HSBM model to organically fuse the word frequency and the grammatical features in semantic dimension, and finally designing an MFSM model to calculate a fusion result and calculate the similarity between the short texts. The short text feature extraction method based on the multi-dimension extraction can effectively improve the short text similarity calculation accuracy.

It will be understood that modifications and variations can be made by persons skilled in the art in light of the above teachings and all such modifications and variations are intended to be included within the scope of the invention as defined in the appended claims.

Claims

1. A short text similarity calculation method based on multi-feature fusion is characterized by comprising the following steps:

(1) acquiring the text quantity | M |, the quantity of unrepeated feature items in a dictionary | N |, and the quantity of topics | K |, in the short text set D to be calculated, and acquiring a short text-topic matrix S through three-dimensional feature extraction and fusion processes of word frequency, grammar and semantics;

1.2) extracting the grammatical features of the short text from the short text set D to obtain a word vector set;

1.3) short text semantic fusion:

(2) converting the short text-theme matrix S into a short text vector set Z, and initializing a similarity calculation result matrix Y;

(6) and obtaining a short text similarity result matrix Y.

2. The method for calculating similarity of short texts according to claim 1, wherein the number of texts in the short text set D obtained in step 1.1) is | M |, the number of unrepeated feature items in the dictionary is | N |, and a weight matrix is calculated by using an HTI method, and the method specifically includes the following steps:

localT(t_i,d_j)＝log(TF(t_i,d_j)+β)

wherein, TF (t)_i,d_j) Representing a feature item t_iIn short text d_jβ is a constant factor;

1.1.4): computing feature itemst_iAnd short text d_jThe adopted calculation formula is as follows:

wherein, P (t)_i,d_j) Representing a feature item t_iAnd short text d_jProbability of co-occurrence, P (t)_i) Representing a feature item t_iProbability of occurrence in short text sets, P (d)_j) Representing short text d_jProbability of occurrence in a short text set;

where n is the total number of short texts, C (t)_i,d_j) Representing a feature item t_iAnd short text d_jα is a constant factor;

HTI(t_i,d_j)＝localT(t_i,d_j)×globalT(t_i,d_j)

3. The short text similarity calculation method according to claim 1, wherein the short text grammar feature extraction in step 1.2) is to train a short text set D to obtain a word vector set X by using a Skip _ gram model of word2 vec:

X＝(x₁,x₂,...,x_i)

wherein x is_iRepresenting a feature item t_iThe word vector of (2).

4. The short text similarity calculation method according to claim 2, wherein the short text semantic fusion in step 1.3) specifically comprises the following steps:

where m denotes the dimension of a predetermined word vector and k denotes the word vector x_iA value of the k-th dimension;

wherein, HTI (t)_i,d_j) Representing short text d_jMiddle characteristic term t_iHTI weight w of (a);

1.3.3): calculating a fusion weight by using the word vector normalization factor and the weight normalization factor, and replacing the HTI weight W of each ternary element in the matrix W by using the fusion weight to obtain a new word frequency and grammar fusion weight matrix W'; the fusion weight calculation formula:

NL(t_i,d_j)＝F(t_i,d_j)×G(i)

1.3.4): generating a corpus B by using BTM on the fusion weight matrix W';

1.3.6): (ii) for each co-occurring binary pair B in the set Bc_i,c_j) Calculating the state transition probability:

1.3.7): repeating step 1.3.6) while updating the frequency n in the state transition probability formula_s、

And

until reaching the upper limit of iteration times;

5. The method for calculating similarity of short texts according to claim 1, wherein the short text vector in step 2) is calculated as follows:

d_i＝(P(s₁|d_i),P(s₂|d_i),P(s₃|d_i),...,P(s_|K||d_i))

and converting the short text-topic distribution matrix S into a short text vector set Z based on the formula.

6. The short text similarity calculation method according to claim 1, wherein the short text similarity calculation method is applied to the short text similarity calculation methodCalculating short text d in step 6)₁And d₂The similarity between them is given by the following formula:

the calculation formula of the KL distance and the JS distance is as follows: