CN109273002A

CN109273002A - Vehicle configuration method, system, vehicle device and vehicle

Info

Publication number: CN109273002A
Application number: CN201811260569.6A
Authority: CN
Inventors: 徐曼; 孙珏
Original assignee: NIO Nextev Ltd
Current assignee: NIO Holding Co Ltd
Priority date: 2018-10-26
Filing date: 2018-10-26
Publication date: 2019-01-25
Anticipated expiration: 2038-10-26
Also published as: CN109273002B

Abstract

The invention belongs to technical field of voice recognition, a kind of vehicle configuration method, system, vehicle device and vehicle are specifically provided, it is intended to solve the problems, such as to need manual setting individual cultivation again when the prior art replaces driver and passenger.For this purpose, the present invention provides a kind of vehicle configuration method, including obtaining the voice messaging of user and identifying the corresponding user's voiceprint of voice messaging；According to preset multiple voiceprints User ID corresponding with the one-to-one relationship of multiple User ID acquisition user's voiceprint；Execute the associated vehicle configuration operation of User ID corresponding with user's voiceprint in advance.Based on above-mentioned steps, the method provided according to the present invention, vehicle can execute automatically according to the individual cultivation before user to be operated with the associated vehicle configuration of the User ID, improve the car experience of user, corresponding vehicle configuration operation can also be executed according to the ID of different user, improve the intelligence of vehicle.

Description

Vehicle configuration method, system, vehicle device and vehicle

Technical field

The invention belongs to technical field of voice recognition, and in particular to a kind of vehicle configuration method, system, vehicle device and vehicle ?.

Background technique

With the development of vehicle technology, vehicle has become essential a part in people's life, interior scene At an important place in for people's lives.Requirement with people to vehicle is higher and higher, and people are more than and want to Safe driving, under the premise of guaranteeing safe driving, it is desirable also to possess good driving experience by people.

When driving vehicle, different driver and passenger can carry out vehicle according to personalized preference and level of comfort personalized Configuration, such as adjustment height of seat, door mirror angle and seat back angle, but when replacement driver and passenger, new driving Personnel need manual setting individual cultivation again, cumbersome reduction user experience.

With the development of voice technology, user drive when can to vehicle carry out interactive voice to realize relevant operation, Therefore, how vehicle to be carried out individual cultivation to improve user's car experience being that those skilled in the art need at present by voice It solves the problems, such as.

Summary of the invention

In order to solve the above problem in the prior art, in order to be needed again when solving prior art replacement driver and passenger The problem of manual setting vehicle personalization configures, the first aspect of the present invention provides a kind of vehicle configuration method, comprising:

It obtains the voice messaging of user and identifies the corresponding user's voiceprint of the voice messaging；

User's voiceprint is obtained according to the one-to-one relationship of preset multiple voiceprints and multiple User ID Corresponding User ID；

Execute the associated vehicle configuration operation of User ID corresponding with user's voiceprint in advance.

Optionally, in provided vehicle configuration method, " identifying the corresponding user's voiceprint of the voice messaging " The step of include:

The mean value of each gauss of distribution function in the universal background model constructed in advance is obtained, and according to the mean value meter Calculate fixed reference feature；

The phonetic feature of the voice messaging is extracted, and the gauss of distribution function is carried out according to the phonetic feature Interpolation obtains the corresponding gauss of distribution function of the phonetic feature；

According to the mean value computation target signature of the corresponding gauss of distribution function of the phonetic feature；

Global variable space matrix is obtained according to expectation-maximization algorithm and the voice messaging；

Method shown according to the fixed reference feature, target signature and global variable space matrix and according to the following formula obtains The acoustic feature of the voice messaging:

W=(s-m)/T

Wherein, w indicates the acoustic feature, and s indicates the target signature, and m indicates the fixed reference feature, described in T expression Global variable space matrix；

Channel compensation is carried out to the acoustic feature, obtains the corresponding voiceprint of the voice messaging；

Wherein, the universal background model is constructed based on preset gauss hybrid models and preset voice training collection Model.

Optionally, it in provided vehicle configuration method, " is obtaining each high in the universal background model constructed in advance Before the step of mean value of this distribution function, and according to the mean value computation fixed reference feature ", the method also includes:

The phonetic feature that the voice training concentrates voice messaging is extracted, and utilizes speech detection algorithms by the voice The quiet data of feature is rejected, and universal background model training characteristics are obtained；

According to the universal background model training characteristics and utilize the expectation-maximization algorithm training general back Scape model.

Optionally, in provided vehicle configuration method, the voice training collection includes multiple preset sub- training sets；

The quantity of the universal background model is multiple and each universal background model and each son training Collection corresponds.

Optionally, in provided vehicle configuration method, " according to preset multiple voiceprints and multiple User ID One-to-one relationship obtains the corresponding User ID of user's voiceprint " the step of include:

Calculate separately the similarity between the multiple voiceprint and user's voiceprint；

Judge whether the multiple similarities being calculated are more than or equal to preset similarity threshold respectively:

A similarity more than or equal to the similarity threshold if it exists, the then voiceprint for obtaining the similarity are corresponding User ID；

Multiple similarities more than or equal to the similarity threshold if it exists then obtain maximum phase in the multiple similarity Like the corresponding User ID of voiceprint of degree.

Optionally, in provided vehicle configuration method, the vehicle configuration operation includes body configurations operation, amusement Configuration operation and driving configuration operation.

Optionally, in provided vehicle configuration method, the body configurations operation includes adjusting vehicle rearview pitch-angle Degree, seat back angle and height of seat, the entertainment configuration operation includes logging in multimedia corresponding with the User ID Account, the driving configuration operation include obtaining the User ID often to go to drive place.

The second aspect of the present invention provides a kind of vehicle configuration system, comprising:

User's voiceprint acquiring unit is configured to obtain the voice messaging of user and identifies that the voice messaging is corresponding User's voiceprint；

User ID acquiring unit is configured to the one-to-one relationship according to preset multiple voiceprints and multiple User ID Obtain the corresponding User ID of user's voiceprint；

Vehicle configuration operation execution unit is configured to execute User ID association corresponding with user's voiceprint in advance Vehicle configuration operation.

Optionally, in provided vehicle configuration system, user's voiceprint acquiring unit is further configured to Execute following operation:

W=(s-m)/T

Optionally, in provided vehicle configuration system, the vehicle configuration system further includes model training unit, institute Model training unit is stated to be configured to execute following operation:

The phonetic feature that the voice training concentrates voice messaging is extracted, by speech detection algorithms by the phonetic feature Quiet data reject, obtain universal background model training characteristics；

According to the universal background model training characteristics and utilize the expectation-maximization algorithm training common background mould Type.

Optionally, in provided vehicle configuration system, the voice training collection includes multiple preset sub- training sets；

Optionally, in provided vehicle configuration system, the User ID acquiring unit be further configured to execute with Lower operation:

Optionally, in provided vehicle configuration system, the vehicle configuration operation includes body configurations operation, amusement Configuration operation and driving configuration operation.

Optionally, in provided vehicle configuration system, the body configurations operation includes adjusting vehicle rearview pitch-angle Degree, seat back angle and height of seat, the entertainment configuration operation includes logging in and the associated multimedia of the User ID Account, the driving configuration operation, which includes that the acquisition User ID is corresponding, often goes to drive place.

Third aspect present invention provides a kind of vehicle device, including vehicle configuration system described in any of the above embodiments.

The fourth aspect of the present invention provides a kind of vehicle, including vehicle device described above.

The fifth aspect of the present invention provides a kind of storage device, wherein be stored with a plurality of program, described program be suitable for by Processor is loaded to execute vehicle configuration method described in any of the above embodiments.

The sixth aspect of the present invention provides a kind of control device, including processor and storage equipment；The storage equipment, Suitable for storing a plurality of program；Described program is suitable for being loaded by the processor to execute vehicle configuration side described in any of the above embodiments Method.

Compared with the immediate prior art, above-mentioned technical proposal is at least had the following beneficial effects:

1, vehicle configuration method provided by the invention, can be by obtaining the voice messaging of user and identifying voice messaging pair The user's voiceprint answered obtains user's vocal print according to the one-to-one relationship of preset multiple voiceprints and multiple User ID The corresponding User ID of information, executes User ID corresponding with voiceprint associated vehicle configuration operation in advance, user with vehicle Carry out interactive voice during, vehicle can execute automatically associated with the User ID according to the individual cultivation before user Vehicle configuration operation, improves the car experience of user.

2, vehicle configuration method provided by the invention can identify the corresponding voiceprint of voice messaging, according to multiple sound The one-to-one relationship of line information and multiple User ID obtains the corresponding User ID of voiceprint, uses when there is multiple vehicles to use When family, corresponding vehicle configuration operation can be executed according to the ID of different user, improves the intelligence of vehicle.

3, vehicle configuration method provided by the invention, can be according to the multiple common background moulds of the classification of voice training collection training Type obtains the voiceprint of voice messaging based on trained universal background model, and the frame alignment of voice messaging can be improved Accuracy, and then available and user's strong correlation voiceprint, improve the accuracy rate of identification.

Scheme 1, a kind of vehicle configuration method, characterized by comprising:

It is corresponding with the one-to-one relationship of multiple User ID acquisition user's voiceprint according to preset multiple voiceprints User ID；

Scheme 2, vehicle configuration method according to scheme 1, which is characterized in that " identify the corresponding user of the voice messaging The step of voiceprint " includes:

The mean value of each gauss of distribution function in the universal background model constructed in advance is obtained, and is joined according to the mean value computation Examine feature；

The phonetic feature of the voice messaging is extracted, and according to the phonetic feature gauss of distribution function is carried out slotting Value, obtains the corresponding gauss of distribution function of the phonetic feature；

Described in method shown according to the fixed reference feature, target signature and global variable space matrix and according to the following formula obtains The acoustic feature of voice messaging:

W=(s-m)/T

Wherein, w indicates that the acoustic feature, s indicate the target signature, and m indicates that the fixed reference feature, T indicate the overall situation Variable space matrix；

Wherein, the universal background model is the mould constructed based on preset gauss hybrid models and preset voice training collection Type.

Scheme 3, the vehicle configuration method according to scheme 2, which is characterized in that " obtaining the common background mould constructed in advance Before the step of mean value of each gauss of distribution function in type, and according to the mean value computation fixed reference feature ", the method is also Include:

The phonetic feature that the voice training concentrates voice messaging is extracted, and utilizes speech detection algorithms by the phonetic feature Quiet data reject, obtain universal background model training characteristics；

Scheme 4, vehicle configuration method according to scheme 3, which is characterized in that the voice training collection includes multiple preset Sub- training set；

The quantity of the universal background model is multiple and each universal background model and each sub- training set one One is corresponding.

Scheme 5, vehicle configuration method according to scheme 1, which is characterized in that " according to preset multiple voiceprints and more The one-to-one relationship of a User ID obtains the corresponding User ID of user's voiceprint " the step of include:

A similarity more than or equal to the similarity threshold if it exists, then obtain the corresponding use of voiceprint of the similarity Family ID；

Multiple similarities more than or equal to the similarity threshold if it exists, then obtain maximum similarity in the multiple similarity The corresponding User ID of voiceprint.

Scheme 6, vehicle configuration method according to scheme 1, which is characterized in that the vehicle configuration operation includes body configurations Operation, entertainment configuration operation and driving configuration operation.

Scheme 7, vehicle configuration method according to scheme 6, which is characterized in that the body configurations operation includes adjusting vehicle Door mirror angle, seat back angle and height of seat, the entertainment configuration operation include logging in be associated with the User ID Multimedia account, driving configuration operation includes obtaining that the User ID is corresponding often to go to drive place.

Scheme 8, a kind of vehicle configuration system, characterized by comprising:

User's voiceprint acquiring unit is configured to obtain the voice messaging of user and identifies the corresponding user of the voice messaging Voiceprint；

User ID acquiring unit is configured to be obtained according to the one-to-one relationship of preset multiple voiceprints and multiple User ID The corresponding User ID of user's voiceprint；

Vehicle configuration operation execution unit is configured to execute the associated vehicle of User ID corresponding with user's voiceprint in advance Configuration operation.

Scheme 9, the vehicle configuration system according to scheme 8, which is characterized in that user's voiceprint acquiring unit is into one Step is configured to execute following operation:

W=(s-m)/T

Scheme 10, vehicle configuration system according to scheme 9, which is characterized in that the vehicle configuration system further includes model Training unit, the model training unit are configured to execute following operation:

The phonetic feature that the voice training concentrates voice messaging is extracted, by speech detection algorithms by the quiet of the phonetic feature Sound data are rejected, and universal background model training characteristics are obtained；

According to the universal background model training characteristics and utilize the expectation-maximization algorithm training universal background model.

Scheme 11, vehicle configuration system according to scheme 10, which is characterized in that the voice training collection includes multiple default Sub- training set；

Scheme 12, the vehicle configuration system according to scheme 8, which is characterized in that the User ID acquiring unit is further matched It is set to the following operation of execution:

Scheme 13, the vehicle configuration system according to scheme 8, which is characterized in that the vehicle configuration operation is matched including car body It sets operation, entertainment configuration operation and drives configuration operation.

Scheme 14, vehicle configuration system according to scheme 13, which is characterized in that the body configurations operation includes adjusting vehicle Door mirror angle, seat back angle and height of seat, the entertainment configuration operation include logging in close with the User ID The multimedia account of connection, the driving configuration operation, which includes that the acquisition User ID is corresponding, often goes to drive place.

Scheme 15, a kind of vehicle device, which is characterized in that including vehicle configuration system described in any one of scheme 8-14.

Scheme 16, a kind of vehicle, which is characterized in that including vehicle device described in scheme 15.

Scheme 17, a kind of storage device, wherein being stored with a plurality of program, which is characterized in that described program is suitable for being added by processor It carries with vehicle configuration method described in any one of the 1-7 that carries into execution a plan.

Scheme 18, a kind of control device, including processor and storage equipment；The storage equipment is suitable for storing a plurality of program；Its It is characterized in that, described program is suitable for being loaded as the processor with vehicle configuration method described in any one of the 1-7 that carries into execution a plan.

Detailed description of the invention

Fig. 1 is the key step schematic diagram of the vehicle configuration method of an embodiment of the present invention；

Fig. 2 is the key step schematic diagram of the corresponding voiceprint of identification voice messaging of an embodiment of the present invention；

Fig. 3 is the primary structure schematic diagram of the vehicle configuration system of an embodiment of the present invention.

Specific embodiment

In order to make the object, technical scheme and advantages of the embodiment of the invention clearer, below in conjunction with the embodiment of the present invention In attached drawing, technical scheme in the embodiment of the invention is clearly and completely described, it is clear that described embodiment is A part of the embodiment of the present invention, instead of all the embodiments.Based on the embodiments of the present invention, those of ordinary skill in the art Every other embodiment obtained without making creative work, shall fall within the protection scope of the present invention.

The preferred embodiment of the present invention described with reference to the accompanying drawings.It will be apparent to a skilled person that this A little embodiments are used only for explaining technical principle of the invention, it is not intended that limit the scope of the invention.

Refering to attached drawing 1, Fig. 1 illustratively gives the key step of vehicle configuration method in the present embodiment.Such as Fig. 1 institute Show, vehicle configuration method includes the following steps: in the present embodiment

Step S101: the voice messaging of user is obtained.

User is during carrying out interactive voice with vehicle, the available user of vehicle voice messaging currently entered, Wherein it is possible to obtain the voice messaging of user by the voice acquisition device of vehicle, the voice messaging of user may include user With the interactive voice of vehicle and the registration voice of user.What the interactive voice of user and vehicle can send for user to vehicle Voice, such as " playing music " are controlled, the registration voice of user can be user and carry out specified number according to the prompt of onboard system The voice of specified duration typing is measured, onboard system can store the registration voice of multiple common users.

Step S102: the corresponding user's voiceprint of identification voice messaging.

The voice messaging of user may include the content and user's voiceprint of voice, can be with by user's voiceprint Differentiate the identity of user.

The identity that user is differentiated by user's voiceprint may include registration phase and cognitive phase.In registration phase, Several voice messagings can be collected or recorded for each user, and vehicle device is extracted from voice messaging embodies user personality spy The parameters,acoustic of sign, the parameters,acoustic for embodying the user personality feature can be user's voiceprint, further according to certain model Method for building up forms the template for being directed to the user, and is stored in user template library.In cognitive phase, system can with registration rank The identical method of section extracts the voiceprint of user speech information to be identified, and the body of user is differentiated according to certain method of discrimination Part.

Refering to attached drawing 2, Fig. 2, which illustratively gives, identifies the corresponding user's voiceprint of voice messaging in the present embodiment Key step.As shown in Fig. 2, identifying that the corresponding user's voiceprint of voice messaging may include following step in the present embodiment:

Step S1021: the mean value of each gauss of distribution function in the universal background model that acquisition constructs in advance, and according to Mean value computation fixed reference feature.

Universal background model (UBM, Universal Background Model) can pass through the language of a large amount of speaker Sound training forms, and has very high degree of mixing and reference background meaning, and careful statistical nature point can be carried out to voice messaging Cloth description.Wherein, universal background model can be based on preset gauss hybrid models (GMM, Gaussian Mixed Model) and preset voice training collection building model.

Gauss hybrid models are the feature distribution situations that speaker is described with the linear combination of multiple Gaussian Profiles, when identification The corresponding speaker of speaker model of tested speech feature can will most be generated as recognition result, same speaker is when different Even the phase says that also it occur frequently that biggish variation, but the personal characteristics of the speaker has from the angle of statistics in short There is consistency, the gauss hybrid models Speaker Identification effect unrelated for text is preferable.Gauss hybrid models can be high by M The summation of this probability density function linear weighted function is constituted, and when the M of gauss hybrid models is sufficiently high, can subtly approach speaker The spatial distribution of characteristic vector.Preset voice training collection can be the voice collection being made of the voice messaging of a large amount of speaker It closes.

Specifically, in available universal background model each gauss of distribution function mean value, according to mean value computation voice The fixed reference feature of information, fixed reference feature can be the Gaussian mean super vector of universal background model, the super vector with specifically The channel for talking about people and speaker's voice is unrelated, has good reference background meaning.

Before the step S1021 the step of, the method for the present embodiment further includes trained universal background model, specifically, instruction The method for practicing universal background model includes the following steps:

Step S10211: the phonetic feature that voice training concentrates voice messaging is extracted, and will using speech detection algorithms The quiet data of phonetic feature is rejected, and universal background model training characteristics are obtained.

The training data of universal background model, therefore can not in training data from a large amount of different backgrounds and noise circumstance Avoid there are quiet datas, and it is the characteristic of voice that Application on Voiceprint Recognition, which utilizes, unrelated with quiet data, subsequent in order to reduce Calculation amount can use the quiet data that speech detection algorithms detect voice messaging, and it rejected from voice messaging, obtain To universal background model training characteristics.Wherein, speech detection algorithms can be VAD (Voice Activity Detection, language Sound activation detection) algorithm etc., universal background model training characteristics can be MFCC (Mel Frequency Cepstral Coefficent, mel-frequency cepstrum coefficient) feature etc., MFCC feature is the perception in order to simulate human ear to different frequency voice Ability and the nonlinear characteristic parameters similar with human hearing characteristic proposed.It should be noted that protection scope of the present invention It is not limited to above-mentioned specific embodiment, other than vad algorithm and MFCC feature, in the premise without departing from the principle of the present invention Under, speech detection algorithms can also can reject the algorithm of quiet data for other, and the training characteristics of universal background model can also Think that other can train the feature of universal background model.

Step S10212: according to universal background model training characteristics and expectation-maximization algorithm training common background is utilized Model.

After obtaining universal background model training characteristics, expectation-maximization algorithm training universal background model can use.Tool Body, training universal background model can use unsupervised e-learning, and the acoustic feature by counting each speaker is had Some probability density functions construct the model of speaker, and voice training collection may include multiple preset sub- training sets, training When, a universal background model can be trained with all data sets, but be the need to ensure that the equilibrium of training data, can also train Multiple universal background models, then integrate them, for example, voice training collection can be divided into two son instructions according to gender Practice collection, recycles the two sub- training sets that two universal background models are respectively trained, lack of balance data control can be effectively utilized Make the universal background model generated.The quantity of universal background model can be multiple and each universal background model and every height Training set can correspond.

Step S1022: extracting the phonetic feature of voice messaging, and is carried out to gauss of distribution function according to phonetic feature slotting Value, obtains the corresponding gauss of distribution function of phonetic feature.

The phonetic feature for extracting voice messaging, according to phonetic feature and using maximum a posteriori probability adaptive algorithm to general The Gaussian function of background model carries out linear interpolation, obtains the gauss hybrid models of user corresponding with the voice messaging, in turn Obtain the corresponding gauss of distribution function of phonetic feature.On the basis of obtaining universal background model, in conjunction with height related to user This mixed model can remove the influence in phonetic feature with background data similar portion, retain larger with background data difference Part, increase to the robustness of noise, improve recognition effect.It is each general in the case where there is multiple universal background models The Gaussian Mixture degree of background model can correspond, and facilitate the complete modification of model.

Step S1023: according to the mean value computation target signature of the corresponding gauss of distribution function of phonetic feature.

After obtaining the corresponding gauss of distribution function of phonetic feature, the mean value of gauss of distribution function is calculated as target spy Sign.In practical applications, if universal background model has C Gaussian component, the dimension of feature is F, then the dimension of target signature is C*F.Target signature contains the most individual information of user, can as the user characteristics with strong distinction, but Be target signature dimension it is higher, a large amount of redundancy and scramble data related with channel are contained, in order to reduce the later period Target signature can be decomposed using global space and carry out dimensionality reduction, obtain more compact feature by calculation amount.

Step S1024: global variable space matrix is obtained according to expectation-maximization algorithm and voice messaging.

In practical applications, it can use the Baum-Welch statistic that universal background model extracts every section of voice of user, Pass through expectation-maximization algorithm (EM, Expectation Maximization Algorithm) iteration based on maximum-likelihood criterion Solve global variable space matrix.Maximum likelihood requires the distribution of model description that can approach training dataset to the maximum extent Distribution, therefore, training set is bigger and can more react the true distribution of user characteristics, and the performance of user's identifying system is also better. The E step of expectation-maximization algorithm can be according to the mean value and variance of Baum-Welch normalized set acoustic feature, it is expected that maximum The mean value and maximum variance for the acoustic feature that the M step for changing algorithm can be such that E step calculates, are walked by iterative solution E step and M, can To obtain global variable space matrix.

Global variable space matrix can reduce the dimension of mean value super vector, reduce unwanted redundancy, reservation and user The relevant low-dimensional vector of identity indicates.In embodiments of the present invention, preparatory trained DNN (Deep Neural can be passed through Networks, deep neural network) model calculating Baum-Welch statistic, the accuracy of the frame alignment of voice messaging is improved, And then the available voiceprint with user's strong correlation, improve the accuracy rate of identification.

Step S1025: according to fixed reference feature, target signature and global variable space matrix and according to shown in formula (1) Method obtain voice messaging acoustic feature:

W=(s-m)/T (1)

Wherein, w indicates that acoustic feature, s indicate target signature, and m indicates that fixed reference feature, T indicate global variable space matrix.

After obtaining fixed reference feature, target signature and global variable space matrix, voice messaging can be further obtained Acoustic feature, acoustic feature had both included the information of user, further included the channel information of voice, but the presence meeting of channel information Interference is generated to user identity identification.

Step S1026: channel compensation is carried out to acoustic feature, obtains the corresponding voiceprint of voice messaging.

It, can be using general since the acoustic feature of voice messaging includes that channel information can generate interference to user identity identification Rate linear discriminant analysis (PLDA, Probabilistic Linear Discriminant Analysis) method is to acoustic feature Carry out channel compensation.Specifically, the voiceprint of user speech information, each user can be obtained according to expectation-maximization algorithm Every words can generate a voiceprint, the voiceprint of the same user is averaging, it is unique that the user can be obtained Voiceprint.

Step S103: user's vocal print is obtained according to the one-to-one relationship of preset multiple voiceprints and multiple User ID The corresponding User ID of information.

When user specifies duration typing voice according to the prompt progress specified quantity of onboard system, onboard system can be deposited It stores up the registration voice of multiple common users and obtains the corresponding voiceprint of user's registration voice according to step S101-S102, And the one-to-one relationship of multiple voiceprints Yu multiple User ID is established, and the corresponding relationship is stored in onboard system. After identifying user speech information corresponding user's voiceprint, from the one of preset multiple voiceprints and multiple User ID User ID corresponding with user's voiceprint is obtained in one corresponding relationship, is associated with so as to subsequent according to User ID execution Vehicle configuration operation.

Specifically, the similarity between multiple voiceprints and user's voiceprint can be calculated separately, respectively judgement meter Whether obtained multiple similarities are more than or equal to preset similarity threshold:

A similarity more than or equal to similarity threshold if it exists, then obtain the corresponding use of voiceprint of the similarity Family ID；

Multiple similarities more than or equal to similarity threshold if it exists, then obtain the sound of maximum similarity in multiple similarities The corresponding User ID of line information.

In practical applications, it is understood that there may be the user for being not belonging to registration sound-groove model library interacts with vehicle, in order to mention The accuracy of height identification, can calculate the similarity between multiple voiceprints and user's voiceprint, and judgement calculates respectively Whether obtained multiple similarities are more than or equal to preset similarity threshold, if multiple similarities are respectively less than similarity threshold, Illustrate the voiceprint of the user not in registration sound-groove model library, it is multiple similar more than or equal to similarity threshold if it exists Degree, then using the corresponding User ID of the maximum voiceprint of similarity as the User ID of the user.In embodiments of the present invention, phase It can be Likelihood Score like degree.

Step S104: the associated vehicle configuration operation of User ID corresponding with voice messaging in advance is executed.

In practical applications, in order to improve the driving experience of user, onboard system can obtain the traveling number of user automatically According to driving data so that the later period provides intelligent service for user.Wherein, running data may include the driving that user often goes Place and point-of-interest etc., driving data may include that user adjusts height of seat and backrest angle etc..When onboard system obtains After taking the corresponding User ID of family voiceprint, the associated vehicle configuration behaviour of User ID corresponding with voice messaging in advance is executed Make, wherein vehicle configuration operation may include body configurations operation, entertainment configuration operation and driving configuration operation.Car body is matched Setting operation may include adjusting vehicle rearview mirror angle, seat back angle and height of seat, can also include the sky of vehicle Adjust set temperature etc..After user gets on the bus, vehicle can be adjusted to above-mentioned hardware facility to meet users ' individualized requirement automatically, In practical application scene, by taking height of seat as an example, if the height gap of a upper user and active user are larger, at upper one After user has used vehicle, active user gets on the bus after determining identity, and onboard system can be automatically according to the height of active user Height of seat is adjusted to suitable position by it.Entertainment configuration operation may include logging in and the associated multimedia account of User ID Deng, it specifically, can be with automated log on and the associated multimedia account of the User ID after determining user identity, acquisition user's sense The multimedia messages of interest, such as the music list of user's collection can obtain user when user wants to play music in time Interested music.Driving configuration operation, which may include that acquisition User ID is corresponding, often goes to driving place and point-of-interest etc., After determining user identity, available User ID is corresponding often to go to drive place, such as CompanyAddress, the home address of user, Further, it is also possible to obtain the point-of-interest of user, the power-up website gone as usual is rationally advised when vehicle needs to be powered on for user Draw route.

Vehicle configuration method provided by the invention, the voice messaging of available user simultaneously identify the corresponding use of voice messaging Family voiceprint obtains user's voiceprint pair according to the one-to-one relationship of preset multiple voiceprints and multiple User ID The User ID answered, executes User ID corresponding with user's voiceprint associated vehicle configuration operation in advance, user with vehicle During carrying out interactive voice, vehicle can execute automatically and the associated vehicle of the User ID according to the individual cultivation before user Configuration operation, improve the car experience of user.

Although each step is described in the way of above-mentioned precedence in above-described embodiment, this field Technical staff is appreciated that the effect in order to realize the present embodiment, executes between different steps not necessarily in such order, It (parallel) execution simultaneously or can be executed with reverse order, these simple variations all protection scope of the present invention it It is interior.

Refering to attached drawing 3, Fig. 3 illustratively gives the primary structure of vehicle configuration system in the present embodiment.Such as Fig. 3 institute Show, vehicle configuration system includes that user's voiceprint acquiring unit 1, User ID acquiring unit 2 and vehicle configuration operation execute Unit 3.

User's voiceprint acquiring unit 1 is configured to obtain the voice messaging of user and identifies the corresponding use of voice messaging Family voiceprint；

User ID acquiring unit 2 is configured to be closed according to preset multiple voiceprints and the one-to-one correspondence of multiple User ID System obtains the corresponding User ID of user's voiceprint；

It is associated to be configured to execute User ID corresponding with user's voiceprint in advance for vehicle configuration operation execution unit 3 Vehicle configuration operation.

Optionally, in provided vehicle configuration system, user's voiceprint acquiring unit 1 is further configured to execute It operates below:

The mean value of each gauss of distribution function in the universal background model constructed in advance is obtained, and is joined according to mean value computation Examine feature；

The phonetic feature of voice messaging is extracted, and interpolation is carried out to gauss of distribution function according to phonetic feature, obtains language The corresponding gauss of distribution function of sound feature；

According to the mean value computation target signature of the corresponding gauss of distribution function of phonetic feature；

Global variable space matrix is obtained according to expectation-maximization algorithm and voice messaging；

It is obtained according to fixed reference feature, target signature and global variable space matrix and according to method shown in formula (1) The acoustic feature of voice messaging:

Channel compensation is carried out to acoustic feature, obtains the corresponding voiceprint of voice messaging；

Wherein, universal background model is the mould constructed based on preset gauss hybrid models and preset voice training collection Type.

Optionally, in provided vehicle configuration system, vehicle configuration system further includes model training unit, model instruction Practice unit to be configured to execute following operation:

Firstly, the phonetic feature that voice training concentrates voice messaging is extracted, by speech detection algorithms by phonetic feature Quiet data is rejected, and universal background model training characteristics are obtained.

Then, universal background model is trained according to universal background model training characteristics and using expectation-maximization algorithm.

Optionally, in provided vehicle configuration system, voice training collection includes multiple preset sub- training sets, accordingly ,

The quantity of universal background model is that multiple and each universal background model and every sub- training set correspond.

Optionally, in provided vehicle configuration system, User ID acquiring unit 2 can be further configured to execute with Lower operation:

Firstly, calculating separately the similarity between multiple voiceprints and user's voiceprint.

Then, judge whether the multiple similarities being calculated are more than or equal to preset similarity threshold respectively:

Optionally, in provided vehicle configuration system, vehicle configuration operation may include body configurations operation, amusement Configuration operation and driving configuration operation.

Optionally, in provided vehicle configuration system, body configurations operation may include adjusting vehicle rearview pitch-angle Degree, seat back angle and height of seat, entertainment configuration operation may include log in the associated multimedia account of User ID, Driving configuration operation, which may include that acquisition User ID is corresponding, often goes to driving place.

Further, it is based on the above system embodiment, it may include above-mentioned vehicle that the present invention also provides a kind of vehicle devices Configuration system.In the present embodiment, vehicle device can be that by the control system of specific function, in some cases, may Directly refer to the controller of the control system.It should be noted that protection scope of the present invention is not limited to this specific implementation Mode.Under the premise of without departing from the principle of the present invention, it should be appreciated by the person skilled in the art that vehicle device can be vehicle-mounted joy Happy information system can also be vehicle-mounted central control system and carried-on-vehicle computer system etc., the technology after these changes or replacement Scheme will fall within the scope of protection of the present invention.

Still further, being based on above-mentioned vehicle device embodiment, it may include above-mentioned reality that the present invention also provides a kind of vehicles Apply vehicle device described in example.

Still further, being based on above method embodiment, the present invention also provides a kind of storage devices, wherein being stored with more Program, the program may be adapted to be loaded as processor to execute vehicle configuration method described in above method embodiment.

Still further, being based on above method embodiment, it may include place that the present invention also provides a kind of control devices Manage device, storage equipment；Storage equipment may be adapted to store a plurality of program, which may be adapted to be loaded by processor on to execute State vehicle configuration method described in embodiment of the method.

Person of ordinary skill in the field can be understood that, for convenience of description and succinctly, system of the present invention The specific work process and related description of system, vehicle device and vehicle embodiments, can be with reference to the correspondence in preceding method embodiment Process, and with above method beneficial effect having the same, details are not described herein.

Those skilled in the art should be able to recognize that, side described in conjunction with the examples disclosed in the embodiments of the present disclosure Method step, can be realized with electronic hardware, computer software, or a combination of the two, in order to clearly demonstrate electronic hardware and The interchangeability of software generally describes each exemplary composition and step according to function in the above description.These Function is executed actually with electronic hardware or software mode, specific application and design constraint depending on technical solution. Those skilled in the art can use different methods to achieve the described function each specific application, but this reality Now it should not be considered as beyond the scope of the present invention.

So far, it has been combined preferred embodiment shown in the drawings and describes technical solution of the present invention, still, this field Technical staff is it is easily understood that protection scope of the present invention is expressly not limited to these specific embodiments.Without departing from this Under the premise of the principle of invention, those skilled in the art can make equivalent change or replacement to the relevant technologies feature, these Technical solution after change or replacement will fall within the scope of protection of the present invention.

Claims

1. a kind of vehicle configuration method, characterized by comprising:

2. vehicle configuration method according to claim 1, which is characterized in that " identify the corresponding user of the voice messaging The step of voiceprint " includes:

W=(s-m)/T

3. vehicle configuration method according to claim 2, which is characterized in that " obtaining the common background mould constructed in advance Before the step of mean value of each gauss of distribution function in type, and according to the mean value computation fixed reference feature ", the method is also Include:

4. vehicle configuration method according to claim 3, which is characterized in that the voice training collection includes multiple preset Sub- training set；

5. vehicle configuration method according to claim 1, which is characterized in that " according to preset multiple voiceprints and more The one-to-one relationship of a User ID obtains the corresponding User ID of user's voiceprint " the step of include:

6. vehicle configuration method according to claim 1, which is characterized in that the vehicle configuration operation includes body configurations Operation, entertainment configuration operation and driving configuration operation.

7. vehicle configuration method according to claim 6, which is characterized in that the body configurations operation includes adjusting vehicle Door mirror angle, seat back angle and height of seat, the entertainment configuration operation include logging in be associated with the User ID Multimedia account, driving configuration operation includes obtaining that the User ID is corresponding often to go to drive place.

8. a kind of vehicle configuration system, characterized by comprising:

9. vehicle configuration system according to claim 8, which is characterized in that user's voiceprint acquiring unit is into one Step is configured to execute following operation:

W=(s-m)/T

10. vehicle configuration system according to claim 9, which is characterized in that the vehicle configuration system further includes model Training unit, the model training unit are configured to execute following operation: