CN110047504A - Speaker recognition method under identity vector x-vector linear transformation - Google Patents
Speaker recognition method under identity vector x-vector linear transformation
- Publication number
- CN110047504A (application CN201910312097.2A)
- Authority
- CN
- China
- Prior art keywords
- vector
- identity
- speaker
- linear transformation
- training
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
- G10L25/18—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being spectral information of each sub-band
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
- G10L25/24—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being the cepstrum
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/27—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the analysis technique
- G10L25/30—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the analysis technique using neural networks
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
- G10L25/51—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
- G10L25/60—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination for measuring the quality of voice signals
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/02—Feature extraction for speech recognition; Selection of recognition unit
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/063—Training
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/063—Training
- G10L2015/0635—Training updating or merging of old and new templates; Mean values; Weighting
Abstract
The present invention relates to a speaker recognition method under a linear transformation of the identity vector x-vector. Its main steps are: perform feature extraction on the speech and extract its identity vectors x-vector and i-vector respectively; train a parallel factor analyzer using the x-vector and i-vector of the same speaker; select the parameters corresponding to the x-vector in the parallel factor analyzer and, on the basis of these parameters, apply a linear transformation to the identity vector x-vector to obtain the xl-vector; train a PLDA model on the new identity vector xl-vector; at test time, perform feature extraction and x-vector extraction on the speech to be tested, feed the x-vector into the linear transformer obtained in the training stage to obtain the new identity vector xl-vector, and finally feed it into the PLDA model obtained in the training stage to obtain the final result. The present invention improves speaker recognition performance while keeping memory requirements and computation speed similar to the baseline system.
Description
Technical field
The present invention relates to speaker recognition technology in biometric identification, and more specifically to a speaker recognition technique under a linear transformation of the identity vector x-vector.
Background art
Speech is the most direct and convenient way for humans to communicate; its convenience, economy, accuracy, and other advantages have attracted the attention of many research institutions. Research on speech signal processing is of great significance for advancing human-computer interaction and artificial intelligence. To this end, the related fields of speech signal processing, such as speech recognition, speech coding, speech synthesis, and speaker recognition, have received increasing attention and theoretical study. Speaker recognition, also known as voiceprint recognition, aims to perform identity authentication based on the unique pronunciation of each speaker. Each speaker's voice has its own unique personal character, because each speaker's innate vocal organs differ, and the voice is further shaped by the acquired environment in which the speaker grows up. It is precisely this difference that makes it possible to use voice, as a biometric characteristic, as a recognition target, and speaker recognition has gradually formed a fairly complete recognition framework of its own.
A speaker recognition system comprises preprocessing, feature extraction, and model training and matching components. The key technologies of speaker recognition include the feature parameter extraction algorithm, the choice of model, and the model matching algorithm, which directly determine the performance of the recognition system. Speaker models are divided into generative models and discriminative models. A generative model learns the characteristics of each class, i.e., one model per class; recognition data are mapped into each model to decide which class they belong to. A discriminative model learns a classification boundary that can be used to decide which class different data belong to. Among these, the identity vector i-vector based on total variability modeling (TVM) and the identity vector x-vector based on the time-delay deep neural network (TDNN) are representative, and are the two most widely used vector models at present.
The back ends of both x-vector and i-vector systems generally use probabilistic linear discriminant analysis (PLDA) as the scoring method. Under the x-vector model, results on long-duration speech are comparable to those of the i-vector, and results on short-duration speech are better. Various papers have studied how to improve system performance under the x-vector model; research shows that model combination of the i-vector and x-vector, or fusion of their PLDA scores, can improve system performance. However, such methods involve two systems, requiring a large amount of memory, and computation speed is also affected. Other research improves the robustness of the x-vector through data augmentation, but that approach is affected by the recognition environment.
Summary of the invention
The object of the present invention is to provide a speaker recognition method that takes into account the memory footprint and computation time of recognizing a target speaker online.
To achieve the above object, the technical solution of the present invention is to provide a speaker recognition method under a linear transformation of the identity vector x-vector, characterized by comprising the following steps:
Step 1: extract mel-frequency cepstral coefficients from the speaker's training speech as the speaker's features;
Step 2: using the features obtained in step 1, train an x-vector model with a deep neural network structure, establishing the identity vector x-vector model and obtaining the identity vector x-vector;
Step 3: using the features obtained in step 1, train an i-vector model based on the EM algorithm, establishing the identity vector i-vector model and obtaining the identity vector i-vector;
Step 4: assume that the i-vector and x-vector of the same speaker are projections of the same vector; train the parameters of a parallel factor analyzer based on the EM algorithm, completing the training of the parallel factor analyzer;
Step 5: in the linear transformer, retain the parameters corresponding to the x-vector from the parameters of the parallel factor analyzer; on the basis of the linear transformer, express the identity vector xl-vector as a linear transformation of the x-vector, establishing the identity vector xl-vector model and obtaining the identity vector xl-vector;
Step 6: using the identity vector xl-vector, update the parameter model of PLDA with the EM algorithm, completing the training of the PLDA model;
Step 7: speaker recognition in the test stage. Perform feature extraction on the enrollment speech and the corresponding speech to be identified, pass them through the identity vector x-vector model to obtain the identity vector x-vector, feed the identity vector x-vector into the trained linear transformer to obtain the new identity vector xl-vector, and finally input the identity vector xl-vector into the trained PLDA model to obtain the speaker recognition result.
Preferably, in step 4, considering that different identity vectors may map to the same vector space, the method of parallel factor analysis is used to obtain this common vector.
Preferably, in step 4, the identity vectors i-vector of the l-th speaker are denoted φ_i(l,1), ..., φ_i(l,k), and the identity vectors x-vector are denoted φ_x(l,1), ..., φ_x(l,k), where k is the number of input utterances of that speaker, φ_i(l,k) is the identity vector i-vector of the k-th utterance of the l-th speaker, and φ_x(l,k) is the identity vector x-vector of the k-th utterance of the l-th speaker. The identity vector i-vector and identity vector x-vector of the same speaker can be regarded as projections of the same vector, and can therefore be expressed as

φ_i(l,k) = μ_i + F_i h(l) + ε_i(l,k)
φ_x(l,k) = μ_x + F_x h(l) + ε_x(l,k)

where μ_i is the mean vector of the identity vector i-vector; μ_x is the mean vector of the identity vector x-vector; F_i is the projection matrix corresponding to the i-vector; F_x is the projection matrix corresponding to the x-vector; h(l) is the hidden variable of the l-th speaker; ε_i(l,k) is the residual vector of the identity vector i-vector of the k-th utterance of the l-th speaker after the linear transformation, with ε_i ~ N(0, Σ_i), where Σ_i is the covariance matrix of the i-vector and N(0, Σ_i) denotes a normal distribution with mean 0 and covariance Σ_i; ε_x(l,k) is the residual vector of the identity vector x-vector of the k-th utterance of the l-th speaker after the linear transformation, with ε_x ~ N(0, Σ_x), where Σ_x is the covariance matrix of the residual ε_x and N(0, Σ_x) denotes a normal distribution with mean 0 and covariance Σ_x. The parameters of the parallel factor analyzer, θ = {μ_i, F_i, Σ_i, μ_x, F_x, Σ_x}, are obtained by the EM algorithm.
Preferably, in step 6, based on the parameters θ_x = {μ_x, F_x, Σ_x} corresponding to the x-vector, the identity vector xl-vector after the linear transformation is expressed as

φ_xl = L_x⁻¹ F_xᵀ Σ_x⁻¹ (φ_x − μ_x)

where L_x⁻¹ denotes the posterior covariance of the xl-vector, with L_x = I + F_xᵀ Σ_x⁻¹ F_x. This can further be written in the form φ_xl = A φ_x − b, where A and b are linear parameters, so that the identity vector xl-vector is expressed as a linear transformation of the x-vector.
The present invention considers that the information of the i-vector generative model is helpful to the x-vector model system. In the training stage, the i-vector is introduced to obtain a linear transformation matrix suitable for the x-vector, and a speaker recognition method under a linear transformation of the x-vector is proposed.
In step 4 of the present invention, the parallel factor analyzer is trained using the x-vector and i-vector, so that the analyzer contains both the information of the x-vector and the information of the i-vector. The linear transformer of the x-vector obtained on the basis of this analyzer therefore better retains the information of the i-vector, so that the new identity vector xl-vector carries i-vector information, ultimately improving the recognition performance of the system.
After the training stage of steps 1-6 is completed, the test stage of step 7 of the present invention does not need to extract the i-vector identity vector again; meanwhile, after the parallel factor analyzer is obtained in the training stage, only the linear transformer of the x-vector needs to be retained. The memory requirement of the test stage therefore does not increase, and the linear transformation has little effect on actual computation.
The method of the present invention performs speaker recognition using the identity vector obtained after a linear transformation of the x-vector. By making reasonable use of i-vector information, the effect of improving recognition performance is achieved. Specifically, in the training stage, a parallel factor analyzer is trained using the x-vector and i-vector of the same speaker; the parameters corresponding to the x-vector in the parallel factor analyzer are selected, and on the basis of these parameters a linear transformation is applied to the identity vector x-vector to obtain the xl-vector. In the test stage, feature extraction and x-vector extraction are performed on the speech to be tested; the x-vector is fed into the linear transformer obtained in the training stage to obtain the new identity vector xl-vector, which is finally fed into the PLDA model obtained in the training stage to obtain the final result.
This produces the following beneficial effects:
(1) The parallel factor analyzer is trained using the x-vector and i-vector, so that the analyzer contains both the information of the x-vector and the information of the i-vector. The linear transformer of the x-vector obtained on the basis of this analyzer therefore better retains the information of the i-vector, so that the new identity vector xl-vector carries i-vector information, ultimately improving the recognition performance of the system;
(2) The test stage does not need to extract the i-vector identity vector again; meanwhile, after the parallel factor analyzer is obtained in the training stage, only the linear transformer of the x-vector needs to be retained. The memory requirement of the test stage therefore does not increase, and the linear transformation has little effect on actual computation.
Description of the drawings
Fig. 1 is the flow chart of speaker recognition under the identity vector x-vector linear transformation implemented by the present invention;
Fig. 2 shows the parameter settings of the frame-level layers in the x-vector neural network architecture.
Specific embodiment
The present invention will be further explained below with reference to specific embodiments. It should be understood that these embodiments are merely illustrative of the present invention and do not limit its scope. In addition, it should be understood that, after reading the teachings of the present invention, those skilled in the art may make various changes or modifications to the present invention, and such equivalent forms likewise fall within the scope defined by the claims appended to this application.
The speaker recognition method under a linear transformation of the identity vector x-vector disclosed by the embodiments of the present invention, as shown in Fig. 1, comprises the following steps:
Step 1, feature extraction. The present invention uses mel-frequency cepstral coefficients (MFCC) as the speaker features. The mel-frequency scale generally has a logarithmic relationship with the actual frequency: Mel(f) = 2595 lg(1 + f/700), where Mel(f) is the mel frequency and f is the ordinary frequency. The MFCC features are obtained as follows: (1) preprocessing, including pre-emphasis, framing and windowing, and endpoint detection; let the speech signal x(m) after preprocessing be x_i(m), where i is the frame index; (2) fast Fourier transform, X(i,k) = FFT[x_i(m)], where X(i,k) is the spectral signal; (3) spectral line energy, E(i,k) = |X(i,k)|²; (4) mel filter energy, S(i,m) = Σ_k E(i,k) H_m(k), where H_m(k) is the mel filter function and M is the number of filters; (5) DCT transform of the logarithm.
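As an illustration of substeps (1)-(3) above, the mel mapping and spectral line energy can be sketched in a few lines of NumPy. This is a minimal sketch under stated assumptions, not the patent's implementation: the function names, frame length, and hop size are illustrative, and pre-emphasis, windowing, endpoint detection, the mel filterbank, and the DCT are omitted.

```python
import numpy as np

def hz_to_mel(f):
    """Mel scale from the text: Mel(f) = 2595 * lg(1 + f / 700)."""
    return 2595.0 * np.log10(1.0 + np.asarray(f, dtype=float) / 700.0)

def frame_signal(x, frame_len, hop):
    """Split a 1-D signal into overlapping frames (windowing omitted)."""
    n = 1 + max(0, (len(x) - frame_len) // hop)
    return np.stack([x[i * hop : i * hop + frame_len] for i in range(n)])

def line_energy(frames):
    """Per-bin spectral line energy E(i, k) = |X(i, k)|^2 via the FFT of each frame."""
    spec = np.fft.rfft(frames, axis=1)
    return np.abs(spec) ** 2
```

The remaining substeps would apply a triangular mel filterbank H_m(k) to `line_energy`, take the logarithm, and apply a DCT.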
Step 2, identity vector x-vector model establishment. The x-vector model training is based on a deep neural network structure. The first five layers are frame-level; the overall input of the TDNN is a segment of speech, and each TDNN layer takes a fixed number of frames. The network parameters of the first five layers are shown in Fig. 2. A pooling layer then accumulates the output vectors of the TDNN layers and computes their mean and standard deviation as the pooling layer's output. The pooling layer is followed by two fully connected layers plus a final softmax output layer. The number of output neurons is consistent with the number of speakers in the training set, and the output of the neural network is a posterior probability. After the neural network is trained over multiple iterations, the output of the sixth layer is used as the x-vector.
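The statistics-pooling step described above (mean and standard deviation of the frame-level TDNN outputs, turning a variable-length sequence into a fixed-length vector) can be sketched as follows; `stats_pooling` is an illustrative name, and the frame-level outputs are assumed to be given as a NumPy array.

```python
import numpy as np

def stats_pooling(frame_outputs):
    """Statistics pooling: concatenate the per-dimension mean and standard
    deviation of the (T, D) frame-level outputs into one 2*D segment vector."""
    mu = frame_outputs.mean(axis=0)
    sigma = frame_outputs.std(axis=0)
    return np.concatenate([mu, sigma])
```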
Step 3, identity vector i-vector model establishment. Given the speech sequence O = {o_1, o_2, ..., o_T} of a speaker s, the c-th Gaussian component at time t in the speech sequence can be expressed as o_{c,t} = μ_c + T_c x + ε, where o_{c,t} denotes the speech sequence of the c-th Gaussian component at time t, μ_c is the mean of the c-th Gaussian component, T_c is the projection matrix of the c-th Gaussian component, x is the hidden variable of the speaker, and ε is the residual. The EM algorithm is selected to train the i-vector model. In the E step (computing the expectation), the first-order statistic F_c and the second-order statistic S_c are defined respectively as F_c = Σ_t γ_c(t)(o_{c,t} − μ_c) and S_c = Σ_t γ_c(t)(o_{c,t} − μ_c)(o_{c,t} − μ_c)ᵀ, where γ_c(t) is the occupancy of the c-th Gaussian component for the t-th frame of speech. The posterior mean of x is expressed as φ = L⁻¹ Tᵀ Σ⁻¹ F, where L⁻¹ is the posterior covariance of the identity vector i-vector, L = I + Σ_c N_c T_cᵀ Σ_c⁻¹ T_c, N_c is the zeroth-order statistic of the c-th Gaussian component, I is the identity matrix, T is the matrix composed of all the component matrices T_c, F is the first-order statistic, and Σ is the covariance matrix of the residual ε. The main purpose of the M step (maximization) is to optimize the matrices T and Σ; the optimal solutions of these two matrices are obtained by taking derivatives, where F(s) denotes the first-order statistic of the s-th speech segment, x(s) the hidden variable of the s-th segment, and N(s) the zeroth-order statistic of the s-th segment. The i-vector model is established by iteratively updating the E and M steps in turn.
Step 4, training of the parallel factor analyzer. The identity vectors i-vector of the l-th speaker are denoted φ_i(l,1), ..., φ_i(l,k), and the identity vectors x-vector are denoted φ_x(l,1), ..., φ_x(l,k), where k is the number of input utterances of that speaker, φ_i(l,k) is the identity vector i-vector of the k-th utterance of the l-th speaker, and φ_x(l,k) is the identity vector x-vector of the k-th utterance of the l-th speaker. The identity vector i-vector and identity vector x-vector of the same speaker can be regarded as projections of the same vector, and can therefore be expressed as

φ_i(l,k) = μ_i + F_i h(l) + ε_i(l,k)
φ_x(l,k) = μ_x + F_x h(l) + ε_x(l,k)

where μ_i is the mean vector of the identity vector i-vector; μ_x is the mean vector of the identity vector x-vector; F_i is the projection matrix corresponding to the i-vector; F_x is the projection matrix corresponding to the x-vector; h(l) is the hidden variable of the l-th speaker; ε_i(l,k) is the residual vector of the identity vector i-vector of the k-th utterance of the l-th speaker after the linear transformation, with ε_i ~ N(0, Σ_i), where Σ_i is the covariance matrix of the i-vector and N(0, Σ_i) denotes a normal distribution with mean 0 and covariance Σ_i; ε_x(l,k) is the residual vector of the identity vector x-vector of the k-th utterance of the l-th speaker after the linear transformation, with ε_x ~ N(0, Σ_x), where Σ_x is the covariance matrix of the residual ε_x and N(0, Σ_x) denotes a normal distribution with mean 0 and covariance Σ_x. The parameters of the parallel factor analyzer, θ = {μ_i, F_i, Σ_i, μ_x, F_x, Σ_x}, are obtained by the EM algorithm.
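The generative model of the parallel factor analyzer can be sketched by sampling an (i-vector, x-vector) pair from a shared hidden variable h(l). The toy dimensions and randomly initialized parameters below are illustrative, not trained values.

```python
import numpy as np

rng = np.random.default_rng(1)
d_i, d_x, r = 5, 4, 2          # toy dimensions (the embodiment uses 600, 512)
theta = {
    "mu_i": rng.normal(size=d_i), "F_i": rng.normal(size=(d_i, r)),
    "Sigma_i": np.diag(rng.uniform(0.1, 1.0, d_i)),
    "mu_x": rng.normal(size=d_x), "F_x": rng.normal(size=(d_x, r)),
    "Sigma_x": np.diag(rng.uniform(0.1, 1.0, d_x)),
}

def sample_pair(theta, rng):
    """Draw one (phi_i, phi_x) pair from the shared hidden variable h(l):
       phi_i = mu_i + F_i h + eps_i,  phi_x = mu_x + F_x h + eps_x."""
    h = rng.normal(size=theta["F_i"].shape[1])
    phi_i = (theta["mu_i"] + theta["F_i"] @ h
             + rng.multivariate_normal(np.zeros(d_i), theta["Sigma_i"]))
    phi_x = (theta["mu_x"] + theta["F_x"] @ h
             + rng.multivariate_normal(np.zeros(d_x), theta["Sigma_x"]))
    return phi_i, phi_x
```

EM training of θ then alternates between inferring h(l) from each speaker's stacked pairs and re-estimating the means, factor matrices, and covariances.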
Step 5, linear transformer. The parallel factor analyzer obtained in the training stage contains the parameters of both the i-vector and the x-vector; in actual online operation, only the parameters θ_x = {μ_x, F_x, Σ_x} corresponding to the x-vector are needed. On the basis of these parameters, the identity vector xl-vector model after the linear transformation is obtained.
Step 6, identity vector xl-vector model establishment. Based on the parameters θ_x = {μ_x, F_x, Σ_x} corresponding to the x-vector, the identity vector xl-vector after the linear transformation is expressed as

φ_xl = L_x⁻¹ F_xᵀ Σ_x⁻¹ (φ_x − μ_x)

where L_x⁻¹ denotes the posterior covariance of the xl-vector, with L_x = I + F_xᵀ Σ_x⁻¹ F_x. This is further written in the form φ_xl = A φ_x − b, where A and b are linear parameters, so that the identity vector xl-vector is expressed as a linear transformation of the x-vector.
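Under this factor-analysis reading of step 6, the linear transformer (A, b) follows directly from θ_x. The sketch below assumes that reading; the function name is illustrative.

```python
import numpy as np

def make_linear_transformer(mu_x, F_x, Sigma_x):
    """Build (A, b) such that phi_xl = A @ phi_x - b is the posterior mean of
    the shared hidden variable given only the x-vector:
       L_x = I + F_x^T Sigma_x^{-1} F_x
       phi_xl = L_x^{-1} F_x^T Sigma_x^{-1} (phi_x - mu_x)
    so A = L_x^{-1} F_x^T Sigma_x^{-1} and b = A @ mu_x."""
    Sinv = np.linalg.inv(Sigma_x)
    L = np.eye(F_x.shape[1]) + F_x.T @ Sinv @ F_x
    A = np.linalg.solve(L, F_x.T @ Sinv)
    return A, A @ mu_x
```

Only A and b need to be retained at test time, which is why the memory footprint does not grow over the x-vector baseline.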
Step 7, PLDA model training. Assume the training data consist of the speech of I speakers, each of whom has J different speech segments of their own. Define the j-th speech segment of the i-th speaker as x_ij. Then, according to the factor analysis definition, the generative model of x_ij is x_ij = μ + F h_i + G w_ij + ε_ij, where μ is the mean vector, F is the speaker information matrix, h_i is the hidden variable of the i-th speaker, G is the channel information matrix, w_ij is the channel hidden variable of the j-th speech segment of the i-th speaker, and ε_ij is the residual of the j-th speech segment of the i-th speaker. The parameter model of PLDA is updated using the EM algorithm.
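The PLDA generative model of step 7 can be sketched by sampling from it: the speaker factor h_i is shared within a speaker, while the channel factor w_ij and the residual vary per utterance. The function name and parameters below are illustrative, not trained values.

```python
import numpy as np

def sample_plda(mu, F, G, sigma2, n_speakers, n_utts, rng):
    """Draw utterance vectors x_ij = mu + F h_i + G w_ij + eps_ij, with an
    isotropic residual of variance sigma2. Returns (n_speakers, n_utts, d)."""
    d = mu.shape[0]
    X = np.empty((n_speakers, n_utts, d))
    for i in range(n_speakers):
        h = rng.normal(size=F.shape[1])            # shared within speaker i
        for j in range(n_utts):
            w = rng.normal(size=G.shape[1])        # per-utterance channel factor
            X[i, j] = mu + F @ h + G @ w + rng.normal(scale=np.sqrt(sigma2), size=d)
    return X
```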
Step 8, speaker recognition in the test stage. Perform feature extraction and x-vector extraction on the enrollment speech and the corresponding speech to be identified; feed the x-vectors into the linear transformer obtained in the training stage to obtain the new identity vectors xl-vector, which are finally fed into the PLDA model obtained in the training stage to obtain the final result.
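The test-stage flow above can be sketched as a small composition. Cosine similarity is used here only as a stand-in for the trained PLDA scorer, and all names are illustrative.

```python
import numpy as np

def recognize(phi_x_test, phi_x_enroll, A, b, score_fn):
    """Test-stage flow: apply the trained linear transformer
    phi_xl = A @ phi_x - b to both enrollment and test x-vectors, then score."""
    e = A @ phi_x_enroll - b
    t = A @ phi_x_test - b
    return score_fn(e, t)

def cosine_score(a, c):
    # stand-in for the trained PLDA log-likelihood-ratio scorer
    return float(a @ c / (np.linalg.norm(a) * np.linalg.norm(c)))
```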
The method of the present invention is simulated and analyzed below.
Simulation verification of speaker recognition under the linearly transformed x-vector identity vector is carried out on the NIST SRE 2010 test set. The test set comprises the test tasks of 9 scenarios (common conditions, CC), including interview, microphone, and telephone-channel data, where the telephone channel further incorporates different speaker vocal-effort styles, mainly high vocal effort, normal vocal effort, and low vocal effort. The present invention uses the 5th scenario (CC'5), i.e., the scenario based on different telephone channels under normal vocal effort. The equal error rate (EER) and the detection cost function (DCF) are used as evaluation criteria to measure the performance of the speaker recognition system.
Testing is performed on the three task test sets coreext-coreext, core-10sec, and 10sec-10sec of NIST SRE 2010, where coreext and core refer to long-duration speech and 10sec refers to short-duration speech. The simulation uses the speech data of Switchboard 2, Switchboard Cellular, and NIST SRE 2004-2008 as training data. The experiments use the x-vector and i-vector systems as baseline models. The UBM is trained on male and female speech together; the x-vector model uses 20-dimensional MFCC features as acoustic features, while the i-vector model uses the same 20-dimensional MFCC static feature parameters together with their first- and second-order differences, i.e., 60-dimensional features. For each speech segment, a 600-dimensional i-vector and a 512-dimensional x-vector are obtained. In the baseline systems the identity vector is reduced to 400 dimensions by LDA, and then a full-covariance PLDA model is trained with a speaker-space rank of 200 dimensions and a channel-space rank of 0 dimensions. The xl-vector proposed by the present invention already takes into account maximizing the between-class distance and minimizing the within-class distance of the speaker variables in its design, and therefore does not use the LDA step.
Table 1 compares the different systems under the EER and DCF evaluation criteria on the three tasks coreext-coreext, core-10sec, and 10sec-10sec; the dimension of the xl-vector is 512. Here i-vector and x-vector are the two baseline systems, and the fusion system adds the scores of the PLDA models of the i-vector and x-vector. On all three tasks, the proposed xl-vector outperforms the two baseline systems under the EER criterion; under the DCF criterion it is slightly worse than the x-vector system on the 10sec-10sec task, and better than the two baseline systems on the other two tasks. Compared with the fusion system, the xl-vector system has a clear EER advantage on the coreext-coreext task; the memory and computation speed required by the xl-vector are similar to those of the x-vector, whereas the fusion system must consider both the x-vector and the i-vector, and therefore needs more memory and its computation slows down at run time. In summary, the xl-vector of the present invention has a clear advantage over both the baseline systems and the fusion system.
Table 1
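The fused baseline above adds the scores of the two PLDA models. A minimal sketch of such score-level fusion is shown below; the z-normalisation of each system's scores before adding is an assumption, as the text only states that the scores are added.

```python
import numpy as np

def fuse_scores(scores_i, scores_x):
    # Score-level fusion of the two baseline systems: z-normalise each
    # system's PLDA trial scores, then add them. The normalisation step
    # is an assumption; the source only states that scores are added.
    z_i = (scores_i - scores_i.mean()) / scores_i.std()
    z_x = (scores_x - scores_x.mean()) / scores_x.std()
    return z_i + z_x

# Toy trial scores from the two systems (target, nontarget, target, nontarget):
si = np.array([2.1, -0.5, 1.7, -1.2])
sx = np.array([15.0, -3.0, 12.0, -8.0])
print(fuse_scores(si, sx))
```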
Table 2 compares the proposed identity vector xl-vector at different dimensions on the coreext-coreext, core-10sec and 10sec-10sec tasks under the EER and DCF evaluation criteria. On the coreext-coreext task, EER performance improves as the dimension increases, reaching its optimum at dimension 500 and remaining essentially optimal at dimension 512, while DCF performance stays essentially unchanged. On the core-10sec and 10sec-10sec tasks, EER performance degrades as the dimension increases and is optimal at dimension 200, while DCF varies by less than 10%. In summary, higher dimensions perform better when the test utterances are long, and lower dimensions perform better when the test utterances are short.
Table 2
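The EER criterion used in Tables 1 and 2 is the operating point at which the miss rate and the false-alarm rate are equal. A minimal sketch of computing it from target and nontarget trial scores (a standard threshold sweep, not the evaluation tooling used in the experiments):

```python
import numpy as np

def compute_eer(target_scores, nontarget_scores):
    # Equal error rate: sweep every score as a candidate threshold and
    # find the point where miss rate and false-alarm rate cross.
    scores = np.concatenate([target_scores, nontarget_scores])
    labels = np.concatenate([np.ones(len(target_scores)),
                             np.zeros(len(nontarget_scores))])
    order = np.argsort(scores)
    labels = labels[order]
    # Miss rate: fraction of targets at or below the threshold.
    fnr = np.cumsum(labels) / labels.sum()
    # False-alarm rate: fraction of nontargets above the threshold.
    fpr = 1.0 - np.cumsum(1 - labels) / (1 - labels).sum()
    idx = np.argmin(np.abs(fnr - fpr))
    return (fnr[idx] + fpr[idx]) / 2.0

# Well-separated toy scores give an EER of zero:
print(compute_eer(np.array([2.0, 3.0, 4.0]), np.array([-2.0, -3.0, -4.0])))
```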
It can be seen that the xl-vector model proposed by the inventors uses a parallel factor analyser over x-vector and i-vector at the training stage to obtain a linear transformation of the x-vector, improving the performance of the speaker recognition system while leaving its memory requirements and computation speed unaffected.
Claims (4)
1. A speaker recognition method under an identity vector x-vector linear transformation, characterized by comprising the following steps:
Step 1: extract Mel-frequency cepstrum coefficients from the speaker training speech as the speaker features;
Step 2: using the features obtained in step 1, train an x-vector model with a deep neural network structure, thereby establishing the identity vector x-vector model and obtaining the identity vector x-vector;
Step 3: using the features obtained in step 1, train an i-vector model based on the EM algorithm, thereby establishing the identity vector i-vector model and obtaining the identity vector i-vector;
Step 4: assuming that the i-vector and the x-vector of the same speaker project onto the same vector, train the parameters of a parallel factor analyser based on the EM algorithm, thereby completing the training of the parallel factor analyser;
Step 5: retain the parameters of the parallel factor analyser corresponding to the x-vector as a linear transformer, and on the basis of this linear transformer express the identity vector xl-vector as a linear transformation of the x-vector, thereby establishing the identity vector xl-vector model and obtaining the identity vector xl-vector;
Step 6: using the identity vector xl-vector, update the parameters of the PLDA model with the EM algorithm, completing the training of the PLDA model;
Step 7: speaker recognition at the test stage:
perform feature extraction on the enrolment speech and the speech to be recognized, obtain the identity vector x-vector through the identity vector x-vector model, input the identity vector x-vector into the trained linear transformer to obtain the new identity vector xl-vector, and finally input the identity vector xl-vector into the trained PLDA model to obtain the speaker recognition result.
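The test-stage flow of step 7 can be sketched as below. The affine form of the linear transformer (phi_xl = A·phi_x − b) follows claim 4; cosine similarity stands in here for the trained PLDA scoring, which is an assumption made only to keep the sketch self-contained.

```python
import numpy as np

def transform_xvector(phi_x, A, b):
    # Apply the trained linear transformer: phi_xl = A @ phi_x - b.
    return A @ phi_x - b

def verify(enroll_xvec, test_xvec, A, b, score_fn):
    # Step 7 sketch: transform both identity vectors with the trained
    # linear transformer, then score the pair. score_fn stands in for
    # the trained PLDA model (an assumption for this sketch).
    e = transform_xvector(enroll_xvec, A, b)
    t = transform_xvector(test_xvec, A, b)
    return score_fn(e, t)

# Toy usage with cosine similarity in place of PLDA scoring:
cos = lambda a, c: float(a @ c / (np.linalg.norm(a) * np.linalg.norm(c)))
A, b = np.eye(512), np.zeros(512)          # identity transform (toy values)
x1, x2 = np.random.randn(512), np.random.randn(512)
print(verify(x1, x2, A, b, cos))
```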
2. The speaker recognition method under the identity vector x-vector linear transformation according to claim 1, characterized in that: in step 4, considering that different identity vectors may map to the same vector space, this common vector is obtained by the method of parallel factor analysis.
3. The speaker recognition method under the identity vector x-vector linear transformation according to claim 1, characterized in that: in step 4, the identity vectors i-vector of the l-th speaker are expressed as φi(l,1),…,φi(l,k), and the identity vectors x-vector as φx(l,1),…,φx(l,k), where k denotes the number of input utterances of that speaker, φi(l,k) denotes the identity vector i-vector of the k-th utterance of the l-th speaker, and φx(l,k) denotes the identity vector x-vector of the k-th utterance of the l-th speaker; the identity vector i-vector and identity vector x-vector of the same speaker can be projected onto the same vector, and can therefore be expressed as
φi(l,k) = μi + Fi·h(l) + εi(l,k)
φx(l,k) = μx + Fx·h(l) + εx(l,k)
where μi denotes the mean vector of the identity vector i-vector; μx denotes the mean vector of the identity vector x-vector; Fi denotes the projection matrix corresponding to the i-vector; Fx denotes the projection matrix corresponding to the x-vector; h(l) denotes the hidden variable of the l-th speaker; εi(l,k) denotes the residual vector after the linear transformation of the identity vector i-vector of the k-th utterance of the l-th speaker, with εi ~ N(0, Σi), where Σi denotes the covariance matrix of the i-vector and N(0, Σi) denotes a normal distribution with mean vector 0 and covariance Σi; εx(l,k) denotes the residual vector after the linear transformation of the identity vector x-vector of the k-th utterance of the l-th speaker, with εx ~ N(0, Σx), where Σx denotes the covariance matrix of the residual εx and N(0, Σx) denotes a normal distribution with mean vector 0 and covariance Σx; the parameters θ = {μi, Fi, Σi, μx, Fx, Σx} of the parallel factor analyser are obtained by the EM algorithm.
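Under the two-view factor model of claim 3, the E-step posterior of the shared hidden variable h(l) has a closed form, sketched below under standard factor-analysis assumptions. The full EM loop (statistics accumulation over all utterances and M-step re-estimation of θ) is omitted.

```python
import numpy as np

def posterior_h(phi_i, phi_x, theta):
    # E-step of the parallel factor analyser: posterior mean of the shared
    # hidden variable h(l) given the i-vector and x-vector of one utterance,
    # under the model phi = mu + F h + eps of claim 3.
    mu_i, F_i, S_i, mu_x, F_x, S_x = theta
    Si_inv = np.linalg.inv(S_i)
    Sx_inv = np.linalg.inv(S_x)
    r = F_i.shape[1]
    # Posterior precision combines the prior N(0, I) with both observations.
    prec = np.eye(r) + F_i.T @ Si_inv @ F_i + F_x.T @ Sx_inv @ F_x
    rhs = F_i.T @ Si_inv @ (phi_i - mu_i) + F_x.T @ Sx_inv @ (phi_x - mu_x)
    return np.linalg.solve(prec, rhs)
```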
4. The speaker recognition method under the identity vector x-vector linear transformation according to claim 1, characterized in that: in step 6, based on the parameters θx = {μx, Fx, Σx} corresponding to the x-vector, the identity vector xl-vector after the linear transformation is expressed as
φxl = Λ·Fxᵀ·Σx⁻¹·(φx − μx), with Λ = (I + Fxᵀ·Σx⁻¹·Fx)⁻¹,
where Λ denotes the posterior covariance of the xl-vector; this is further written in the form φxl = A·φx − b, where A and b are linear parameters, so that the identity vector xl-vector is expressed as a linear transformation of the x-vector.
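The affine parameters A and b of claim 4 can be computed once from θx and then reused for every test utterance. A minimal sketch, assuming the posterior-covariance form Λ = (I + FxᵀΣx⁻¹Fx)⁻¹ reconstructed above:

```python
import numpy as np

def linear_transformer(mu_x, F_x, S_x):
    # Fold the x-vector branch of the parallel factor analyser into a
    # single affine map phi_xl = A @ phi_x - b (claim 4). Lam is the
    # posterior covariance (I + Fx' Sx^-1 Fx)^-1; this form reconstructs
    # the garbled equation in the source and should be read as a sketch.
    Sx_inv = np.linalg.inv(S_x)
    Lam = np.linalg.inv(np.eye(F_x.shape[1]) + F_x.T @ Sx_inv @ F_x)
    A = Lam @ F_x.T @ Sx_inv
    b = A @ mu_x
    return A, b
```

Because A and b depend only on the trained parameters, the test-stage cost per utterance is a single matrix-vector product, which is why the memory and speed of the xl-vector system match those of the x-vector baseline.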
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910312097.2A CN110047504B (en) | 2019-04-18 | 2019-04-18 | Speaker identification method under identity vector x-vector linear transformation |
Publications (2)
Publication Number | Publication Date |
---|---|
CN110047504A true CN110047504A (en) | 2019-07-23 |
CN110047504B CN110047504B (en) | 2021-08-20 |
Family
ID=67277768
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201910312097.2A Active CN110047504B (en) | 2019-04-18 | 2019-04-18 | Speaker identification method under identity vector x-vector linear transformation |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN110047504B (en) |
Citations (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN105139857A (en) * | 2015-09-02 | 2015-12-09 | 广东顺德中山大学卡内基梅隆大学国际联合研究院 | Countercheck method for automatically identifying speaker aiming to voice deception |
CN105575394A (en) * | 2016-01-04 | 2016-05-11 | 北京时代瑞朗科技有限公司 | Voiceprint identification method based on global change space and deep learning hybrid modeling |
US9685159B2 (en) * | 2009-11-12 | 2017-06-20 | Agnitio Sl | Speaker recognition from telephone calls |
US9792823B2 (en) * | 2014-09-15 | 2017-10-17 | Raytheon Bbn Technologies Corp. | Multi-view learning in detection of psychological states |
CN107274905A (en) * | 2016-04-08 | 2017-10-20 | 腾讯科技(深圳)有限公司 | A kind of method for recognizing sound-groove and system |
CN108922556A (en) * | 2018-07-16 | 2018-11-30 | 百度在线网络技术(北京)有限公司 | sound processing method, device and equipment |
CN109346084A (en) * | 2018-09-19 | 2019-02-15 | 湖北工业大学 | Speaker recognition method based on deep stacked autoencoder network |
CN109599091A (en) * | 2019-01-14 | 2019-04-09 | 南京邮电大学 | Multi-to-multi voice conversion method based on STARWGAN-GP and x vector |
CN109801634A (en) * | 2019-01-31 | 2019-05-24 | 北京声智科技有限公司 | A kind of fusion method and device of vocal print feature |
Non-Patent Citations (6)
Title |
---|
DEHAK N, KENNY P J, DEHAK R, ET AL: "Front-end factor analysis for speaker verification", 《IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING》 * |
LONGTING XU,BO REN,GUANGLIN ZHANG,JICHEN YANG: "Linear transformation on x-vector for text-independent speaker verification", 《ELECTRONICS LETTERS》 * |
SAON G, SOLTAU H, NAHAMOO D, ET AL.: "Speaker adaptation of neural network acoustic models using i-vectors", 《2013 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING》 * |
XU L, DAS R K, YILMAZ E, ET AL: "Generative x-vectors for text-independent speaker verification", 《2018 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP (SLT)》 * |
XU L, LEE K A, LI H, ET AL: "Generalizing I-vector estimation for rapid speaker recognition", 《IEEE/ACM TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING》 * |
XU LONGTING: "Research on speaker recognition technology based on sparse decomposition", 《China Doctoral Dissertations Full-text Database, Information Science and Technology》 * |
Cited By (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111081256A (en) * | 2019-12-31 | 2020-04-28 | 苏州思必驰信息科技有限公司 | Digital string voiceprint password verification method and system |
CN111462759A (en) * | 2020-04-01 | 2020-07-28 | 科大讯飞股份有限公司 | Speaker labeling method, device, equipment and storage medium |
CN111462759B (en) * | 2020-04-01 | 2024-02-13 | 科大讯飞股份有限公司 | Speaker labeling method, device, equipment and storage medium |
WO2021174883A1 (en) * | 2020-09-22 | 2021-09-10 | 平安科技(深圳)有限公司 | Voiceprint identity-verification model training method, apparatus, medium, and electronic device |
CN113689861A (en) * | 2021-08-10 | 2021-11-23 | 上海淇玥信息技术有限公司 | Intelligent track splitting method, device and system for single sound track call recording |
CN113689861B (en) * | 2021-08-10 | 2024-02-27 | 上海淇玥信息技术有限公司 | Intelligent track dividing method, device and system for mono call recording |
CN114974259A (en) * | 2021-12-23 | 2022-08-30 | 号百信息服务有限公司 | Voiceprint recognition method |
CN115273863A (en) * | 2022-06-13 | 2022-11-01 | 广东职业技术学院 | Compound network class attendance system and method based on voice recognition and face recognition |
Also Published As
Publication number | Publication date |
---|---|
CN110047504B (en) | 2021-08-20 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN110047504A (en) | Speaker recognition method under identity vector x-vector linear transformation | |
Chauhan et al. | Speaker recognition using LPC, MFCC, ZCR features with ANN and SVM classifier for large input database | |
CN102509547B (en) | Method and system for voiceprint recognition based on vector quantization based | |
CN102800316B (en) | Optimal codebook design method for voiceprint recognition system based on nerve network | |
CN109671442A (en) | Multi-to-multi voice conversion method based on STARGAN Yu x vector | |
CN109119072A (en) | Civil aviaton's land sky call acoustic model construction method based on DNN-HMM | |
Kekre et al. | Speaker identification by using vector quantization | |
US20140236593A1 (en) | Speaker recognition method through emotional model synthesis based on neighbors preserving principle | |
Irum et al. | Speaker verification using deep neural networks: A | |
CN102568476B (en) | Voice conversion method based on self-organizing feature map network cluster and radial basis network | |
CN109599091A (en) | Multi-to-multi voice conversion method based on STARWGAN-GP and x vector | |
CN109346084A (en) | Speaker recognition method based on deep stacked autoencoder network | |
CN107358947A (en) | Speaker recognition methods and system again | |
CN108986798A (en) | Processing method, device and the equipment of voice data | |
CN112735435A (en) | Voiceprint open set identification method with unknown class internal division capability | |
CN104464738B (en) | A kind of method for recognizing sound-groove towards Intelligent mobile equipment | |
Kheder et al. | A unified joint model to deal with nuisance variabilities in the i-vector space | |
Hong et al. | Combining deep embeddings of acoustic and articulatory features for speaker identification | |
Ng et al. | Teacher-student training for text-independent speaker recognition | |
Wang et al. | Robust speaker identification of iot based on stacked sparse denoising auto-encoders | |
Sekkate et al. | Speaker identification for OFDM-based aeronautical communication system | |
Lin et al. | Mixture representation learning for deep speaker embedding | |
CN117095669A (en) | Emotion voice synthesis method, system, equipment and medium based on variation automatic coding | |
Monteiro et al. | On the performance of time-pooling strategies for end-to-end spoken language identification | |
Koolagudi et al. | Speaker recognition in the case of emotional environment using transformation of speech features |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||