CN105390136B - Vehicle arrangement control device and method for user's adaptive type service - Google Patents

Vehicle arrangement control device and method for user's adaptive type service Download PDF

Info

Publication number
CN105390136B
CN105390136B CN201510514457.9A CN201510514457A CN105390136B CN 105390136 B CN105390136 B CN 105390136B CN 201510514457 A CN201510514457 A CN 201510514457A CN 105390136 B CN105390136 B CN 105390136B
Authority
CN
China
Prior art keywords
user
adaptive type
service
information
vehicle arrangement
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201510514457.9A
Other languages
Chinese (zh)
Other versions
CN105390136A (en
Inventor
安恩贞
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Hyundai Mobis Co Ltd
Original Assignee
Hyundai Mobis Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Hyundai Mobis Co Ltd filed Critical Hyundai Mobis Co Ltd
Publication of CN105390136A publication Critical patent/CN105390136A/en
Application granted granted Critical
Publication of CN105390136B publication Critical patent/CN105390136B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • BPERFORMING OPERATIONS; TRANSPORTING
    • B60VEHICLES IN GENERAL
    • B60RVEHICLES, VEHICLE FITTINGS, OR VEHICLE PARTS, NOT OTHERWISE PROVIDED FOR
    • B60R16/00Electric or fluid circuits specially adapted for vehicles and not otherwise provided for; Arrangement of elements of electric or fluid circuits specially adapted for vehicles and not otherwise provided for
    • B60R16/02Electric or fluid circuits specially adapted for vehicles and not otherwise provided for; Arrangement of elements of electric or fluid circuits specially adapted for vehicles and not otherwise provided for electric constitutive elements
    • B60R16/037Electric or fluid circuits specially adapted for vehicles and not otherwise provided for; Arrangement of elements of electric or fluid circuits specially adapted for vehicles and not otherwise provided for electric constitutive elements for occupant comfort, e.g. for automatic adjustment of appliances according to personal settings, e.g. seats, mirrors, steering wheel
    • B60R16/0373Voice control
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/223Execution procedure of a spoken command

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Mechanical Engineering (AREA)
  • User Interface Of Digital Computer (AREA)
  • Navigation (AREA)

Abstract

The present invention provides a kind of voice pattern for driver being distinguished by speech recognition and the driver is analyzed by much information, guides the vehicle arrangement control device and method for the service of user's adaptive type of best-of-breed functionality from vehicle to driver first.Vehicle arrangement control device for the service of user's adaptive type of the invention includes: characteristic information generating unit, and the characteristic information of user is generated according to the voice messaging of user;Voice messaging analysis unit obtains meaning information by parsing voice messaging;Adaptive type services determining section, and the adaptive type service to user is determined according to characteristic information and meaning information;And vehicle arrangement control unit, it controls the control object equipment including vehicle arrangement and to execute adaptive type service.The present invention can be realized natural speech recognition system, use common function convenient for driver.

Description

Vehicle arrangement control device and method for user's adaptive type service
Technical field
The present invention relates to the device and methods of control vehicle arrangement more particularly to a kind of control vehicle arrangement to execute use The vehicle arrangement control device and method for the service of user's adaptive type of family adaptive type service.
Background technique
Speech recognition is that the acoustic information from voice extracts phoneme i.e. language message and makes the one of machine recognition and reaction Serial procedures.
Although generally believing that with voice dialogue be most natural, the simplest side in the information exchange medium of the mankind and machine Method, but human speech must be converted to the code that machine is capable of handling can use voice and machine to talk with.Speech recognition is just It is this process for being converted into code.
It is applicable in the speech recognition technology developed in recent years on vehicle at present, therefore only needs the voice command of driver It can be driven simple convenient means, such as lifting window, starting and stop wiper, open air-conditioning, open or close preceding photograph Lamp etc..
Illustrate current vehicle audio recognition methods below.
Current vehicle audio recognition methods includes that driver is connect when issuing equipment start command with voice by microphone The step of receiving driver's voice, the step of being pre-processed analog signal for digital signal by filtering and analog-to-digital conversion, by mentioning The step of taking eigen vector and classification voice pattern voice command recognition, and according to the voice command drive control object of identification The step of device.
Current speech recognition can use speech engine to identify a small amount of vocabulary even large capacity vocabulary, only by pressing I.e. logical (Push-to-Talk;PTT) speech identifying function activates when key.
But the currently used ability that is speech recognition system in the case where words person issues to speech recognition system and orders according to The order constitutes the one way system of corresponding scene, can not two-way exchange.
KR published patent the 2014-0051630th discloses one kind and passes through the audio-visual navigation of speech recognition controlled vehicle The method of system.But this method is also to provide speech identifying function by remote control speech recognition key, therefore can not solve above-mentioned ask Topic.
Summary of the invention
Technical problem
To solve the above problems, distinguishing driver by speech recognition the purpose of the present invention is to provide one kind and passing through Much information analyzes the voice pattern of the driver, is used for user's adaptive type from vehicle to driver's guidance best-of-breed functionality first The vehicle arrangement control device and method of service.
The purpose of the present invention is not limited to above-mentioned purpose, and those skilled in the art can be defined by following record Understand unmentioned other purposes.
Technical solution
To reach above-mentioned purpose, the present invention provides a kind of vehicle arrangement control device for user's adaptive type service, packet Include: characteristic information generating unit generates the characteristic information of the user according to the voice messaging of user;Voice messaging analysis unit, It obtains meaning information by parsing the voice messaging;Adaptive type service determining section, according to the characteristic information with it is described Meaning information determines the adaptive type service to the user;And vehicle arrangement control unit, it controls including vehicle arrangement Control object equipment to execute the adaptive type service.
Preferably, the characteristic information generating unit from the voice messaging extract formant (formant) value, frequency values, At least one in speech energy value and linear predictive coding (linear prediction coding, hereinafter referred to as ' LPC ') value A value, and the characteristic information is generated according at least one described value in real time.
Preferably, the characteristic information generating unit generates the age letter of the gender information of the user, the user in real time At least one of the emotion information of breath and user information is as the characteristic information.
Preferably, the vehicle arrangement control device further include: voice messaging selector is receiving at least two languages A voice messaging is selected when message ceases from multiple voice messagings.
Preferably, the voice messaging selector according to the size of voice messaging, input voice messaging be stored in advance Voice messaging between comparison result, the user position and multilayer perceptron (multilayer perceptron) in At least one select one voice messaging.
Preferably, the vehicle arrangement control device further include: voice messaging input unit is received from each seat of vehicle The voice messaging, the vehicle arrangement control unit control vehicle arrangement to execute the adaptive type respectively by each seat Service.
Preferably, the voice messaging input unit includes the directional microphone for being set to each seat (directional microphone)。
Preferably, the vehicle arrangement control device further include: adaptive type service execution judging part, according to the user The information of input judges whether to execute the adaptive type service;And alternative service determining section, it is not execute in judging result When the adaptive type services, the alternative service for substituting the adaptive type service is determined according to the information that the user inputs.
Preferably, the vehicle arrangement of the vehicle arrangement control unit control is audio-visual navigation (Audio Video Navigation;AVN) system.
Also, the present invention provides a kind of vehicle arrangement control method for user's adaptive type service, comprising: according to user Voice messaging the step of generating the characteristic information of the user;The step of meaning information is obtained by parsing the voice messaging Suddenly;The step of servicing the adaptive type of the user is determined according to the characteristic information and the meaning information;And control packet It includes the control object equipment including vehicle arrangement and makes the step of executing adaptive type service.
Preferably, the step of generation extracts formant (formant) value, frequency values, voice from the voice messaging At least one of energy value and linear predictive coding (linear prediction coding, hereinafter referred to as ' LPC ') value value, And the characteristic information is generated according at least one described value in real time.
Preferably, the step of generation generate in real time the gender information of the user, the user age information and At least one of the emotion information of user information is as the characteristic information.
Preferably, before the step of generation further include: when receiving at least two voice messagings from multiple described A step of voice messaging is selected in voice messaging.
Preferably, the step of selection is according to the size of voice messaging, the voice messaging of input and pre-stored language In the position of comparison result, the user between message breath and multilayer perceptron (multilayer perceptron) extremely A kind of few one voice messaging of selection.
Preferably, before the step of selection further include: the step of receiving the voice messaging from each seat of vehicle, The step of control is specifically to control vehicle arrangement to execute the adaptive type service respectively by each seat.
Preferably, the received step utilizes the directional microphone for being set to each seat.
Preferably, determination is also wrapped between the step of adaptive type service and the step of control of the user It includes: the step of executing adaptive type service is judged whether according to the information that the user inputs;And in judging result for not When executing adaptive type service, determine that the substitution for substituting the adaptive type service takes according to the information of user input The step of business.
Preferably, the vehicle arrangement of the step control of control is image and sound guidance system (Audio Video Navigation;AVN).
Technical effect
The present invention is distinguished driver by speech recognition and analyzes the voice pattern of the driver by much information, first Best-of-breed functionality first is guided from vehicle to driver, to have the following beneficial effects:
First, it can comply with and gradually change into two-way communication from one way system to the trend that two-way exchange mode changes, from And it can be realized natural speech recognition system.
Second, system uses common function according to driver's correspondingly recommendation function, therefore convenient for driver.
Detailed description of the invention
Fig. 1 is the concept map for showing vehicle Speaker identification system according to an embodiment of the invention;
Fig. 2 is the flow chart for showing the first embodiment of working method of vehicle Speaker identification system;
Fig. 3 is the flow chart for showing the second embodiment of working method of vehicle Speaker identification system;
Fig. 4 is to summarize the display vehicle arrangement control dress according to the preferred embodiment of the invention for the service of user's adaptive type The block diagram set;
Fig. 5 is to summarize to show the vehicle arrangement controlling party according to the preferred embodiment of the invention for the service of user's adaptive type The flow chart of method.
Specific embodiment
The preferred embodiments of the present invention are described in detail referring to the drawings.Firstly, it is necessary to it is to be noted that in the structure to each figure In terms of at element addition appended drawing reference, added as far as possible being appeared in even if identical constituent element on different attached drawings identical Appended drawing reference.And if think may be to this hair to illustrating for related known structure or function for judgement in explaining the present invention Bright theme causes to obscure, then omits related detailed description.In addition, will be described below the preferred embodiment of the present invention, but this hair Bright technical solution is simultaneously not restricted or limited to this, and person of ordinary skill in the field can do various deformation implementation.
The present invention is characterized in that according to the technology trends changed to two-way communication mode, it is this using speech recognition Speaker identification function distinguishing driver and the voice pattern for analyzing the driver recommend most suitable function first for driver, Artificial intelligence trend can be complied with.
Fig. 1 is the concept map for showing vehicle Speaker identification system according to an embodiment of the invention.
As shown in Figure 1, the present invention is the driver for the voice pattern for passing through speech recognition words person and analyzing each driver Friendly vehicle interior system.
Private car is generally shared by more people, and the present invention stores the language of the characteristics of speech sounds of each driver and each driver of analysis Sound pattern.The voice pattern of driver can be search ground, nearest call catalog, audio-frequency function etc. recently.After driver rides Distinguish who driver words person is, confirmation is suitble to the function of driver's voice pattern on vehicle when speaking by microphone, Guarantee faster to be easier access to the common function of driver.
The function of input unit 110 is the voice command for receiving driver.Input unit 110 can be microphone.
Identification part 120 is identified by the voice signal of the input of input unit 110.Identification part 120 turns text (Speech by sound To Text, hereinafter referred to as ' STT ') receive is any voice for operation judgement.
Analysis portion 130 distinguishes words person, gender and age bracket is analyzed by the database (Database) of study, by finding out Formant (formant) value identifies everyone characteristic.
Analysis portion 130 passes through formant (formant) value of words person's voice, basic frequency value, speech energy value, linear pre- The gender of the users of real time discriminatings statistics such as survey coding (linear prediction coding, hereinafter referred to as ' LPC ') value/ Age/mood/state etc..
The characteristic for each driver that the storage of storage unit 160 is collected by analysis portion 130.At this point, the storage identification of storage unit 160 Portion 120 turns the result of text to the voice command sound that driver says.
Processing unit 140 goes to next scene using the DB plan stored by driver, and whether inquiry driver will go to The corresponding scene that vehicle is recommended, so that vehicle guides driver's common function first.
For example, the purpose that the building of processing unit 140 recommends driver A often to go in special time period from vehicle to driver first Ground, the radio broadcasting often listened, digital media broadcast (Digital Media Broadcasting;DMB) the scene of channel etc., Or it is prioritized to the music that driver often listens in the music for playing vehicle storage, or grasp the age of driver and passing through clothes Business device plays age bracket people when listening to music likes the music listened.
Processing unit 140 searches for (searching) data, when ensuring the function needed for handling user, especially in user The convenient function of adaptive type for the user personality that most suitable analysis portion 130 is analyzed in real time is capable of providing in the case where not specified precise information It can information.
It is suitable that processing unit 140 provides selection music, recommendation radio broadcasting, recommending digital multimedia broadcasting, searching facility etc. Distribution type facilitating functions information.
Inquire whether driver will make the system in vehicle by the specific letter according to driver by loudspeaker in output section 150 Cease the scene work obtained.
Processing unit 140 is transmitted to user and handles the result obtained in output section 150.
Fig. 2 is the flow chart for showing the first embodiment of working method of vehicle Speaker identification system.
When user says " broadcast " by microphone (MIC), in step S210, input unit 110 obtains the voice signal.
Then in step S220, identification part 120 executes speech identifying function, is converted to broadcasting command extensively by STT Broadcast text.
Then in step S230, analysis portion 130 using study DB and resonance peak, basic frequency value, speech energy value, LPC value etc. analyzes the voice command of driver and storage.
Then in step S240, processing unit 140 confirms current time, and according to the recognition result of identification part 120, analysis The analysis result in portion 130, the information for being stored in storage unit 160 etc. generate waveform (wave) file, to guide time driver The TBS that the frequency listened to is FM95.1 is broadcasted.
Then in step s 250, output section 150 with loudspeaker export " listening to FM95.1 TBS Traffic Announcement? ".
When receiving the information for agreeing to the content exported by loudspeaker from user, in step S260, vehicle it is audio-visual (Audio Video Navigation, hereinafter referred to as ' the AVN ') system of navigation exports FM95.1 TBS Traffic Announcement.
On the contrary, if receiving the information for disagreeing the content exported by loudspeaker from user, in step S270, place Reason portion 140 exports " please say channel " by the loudspeaker of output section 150.Then in step S270, driver passes through input When portion 110 orders required frequency, processing unit 140 exports the broadcast of corresponding frequencies.
Speech recognition system differentiate after driver by driver characteristics recommend and in the case that driver refuses, as above to driving The person of sailing inquires required function and by the function operation.
Speech recognition system has the case where erroneous judgement driver, therefore adds driver's identification function in AVN function and open Open/close (ON/OFF) function, error when driver be set as close (OFF) make speech recognition system nonrecognition words person.
Speech recognition system and dependent of the invention carries out speech recognition, but by real with other module shared informations Vehicular system now more friendly to driver.
Fig. 3 is the flow chart for showing the second embodiment of working method of vehicle Speaker identification system.
When saying broadcast or DMB with voice in step s310, analysis portion 130 is extracted by the step S320 of speech analysis The characteristic of user.Then, processing unit 140 selects the broadcast for being most suitable for user personality.
With if, in the state connecting with server, analysis portion 130 is extracted by voice and is used when phonetic search music The characteristic at family is simultaneously selected from the music list that different sexes/age/state crowd that server provides likes.
Even if not connecting with server, analysis portion 130 can also will be about the general characteristic of the music file (property liked Not, age bracket, mood) it is stored in storage unit 160, most suitable music is thought according to the broadcasting of the voice status of user.
When passing through navigating search facility, if processing unit 140 searches for such as periphery dining room by the characteristics of speech sounds of user, The dining room that each gender/age bracket is liked then is shown at first.In the case where using Xi Li (SIRI), when with phonetic search periphery dining room Provide a user by YELP search as a result, the other informations such as gender/age bracket is also utilized to search for when searching for periphery dining room The dining room that more approximate people like with the user.
If there is network connection, processing unit 140 carries out web search using voice command and user personality information and will letter Breath is supplied to user.Congee shop is searched for when learning that health state of user is bad by speech analysis first, if when search periphery hospital Think that user's illnesses for flu, then first look for internal medicine by analyzing user speech.
And when with phonetic search concert or concert, processing unit 140 passes through web search ring approximate with the user The concert or concert that the people in border like.
Content described above is summarized below.
In step s310, input unit 110 obtains corresponding voice messaging when user gives orders or instructions.For example, user says " broadcast " Or input unit 110 obtains the voice messaging when " DMB ".
Then, analysis portion 130 analyzes voice messaging and the judgement property in step S330~S350 in real time in step s 320 Not/age/state etc..
Analysis portion 130 judges that words person is male or women by voice messaging in step S330.Then, analysis portion 130 judge words person's age bracket (for example, two teens, three teens, four teens etc.) in step S340.Then, analysis portion 130 judge that mood/state of words person is good bad in step S350.
Then in step S360, processing unit 140 searches for the electricity for being most suitable for user according to the analysis result of analysis portion 130 Platform.For example, processing unit 140, which searches for two good teens males of mood/state, likes the radio station listened, or search mood/state is not Three good teens males like the radio station listened, or search for the good teens women of mood/state and like the radio station listened, or search Two teens women of mood/be not in good state like the radio station listened.
Then in step S370, output section 150 (is executed corresponding by the processing result of loudspeaker output processing part 140 Service).
The present invention is not limited to the drivers in vehicle, and when not only considering all speech recognitions when searching for information Speech content, while also considering words person's characteristic.
Different from the prior art, the present invention is grasped the User Status that may change and is accordingly provided by analysis voice in real time User's adaptive type information.
Illustrate that speech recognition of the invention is applicable in back-propagation algorithm (Back Propagation Algorithm) below Method.
According to general noise filtering methods, opens speech recognition microphone and issued after the predetermined time and known for voice The signal that speech recognition advances into microphone is judged as noise in vehicle and the noise in trap signal by other voice.
Although there is the directional microphone being arranged towards driver, due to the time of short duration before voice is given orders or instructions in vehicle In the case that the signal of input is judged as noise, therefore speech recognition gives orders or instructions time point other seats are also given orders or instructions in addition to a driver Voice is mutually mixed, therefore phonetic recognization rate declines.
Therefore, the present invention four seating area being set to point to property microphones in vehicle respectively, with driver region On the basis of the input signal of microphone, by other regions, microphone signal is determined as noise and filters.During handling signal The characteristic of the driver in real time discriminating driver region, to ensure that multimedia equipment provides the information of suitable driver.
This is described in further details below, driver's seat is defined as a-quadrant in explanation below, passenger seat is defined For B area, the rear side seat of the back seat of driver's seat and passenger seat is respectively defined as the region C and the region D.
It the microphone in the region A, B, C, D while being opened when driver starts speech identifying function, passes through microphone and receive four The voice signal in a region.When the vehicle noise for not being human speech is input to the microphone in four regions, its input value is almost It is identical, therefore by a-quadrant filter vehicle noise figure.Also, analyze the voice in four regions.The expression in four regions is analyzed first The speech vector value of gender, on the basis of a-quadrant, when from B, C, D extracted region to the vector value indicated with a-quadrant different sexes When, a-quadrant filtering corresponds to the signal of the vector value.Age, mood/state are analyzed in the same way after having analyzed gender Deng.
Although maximum voice signal is the voice signal of driver in a-quadrant, when there are also the voices in the region B, C, D In the case where signal, the sound of a-quadrant driver can not be only extracted, therefore use this method.
Correlation (CORRELATION), independent component analysis (Independent Component can be passed through at this time Analysis;ICA) other algorithms except technology, Wave beam forming (BEAM FORMING) technology differentiate that signal is independent or has There is approximation.
It can be filtered by four microphones to grasp the individual characteristic of words person, can use grasp individual characteristic and obtain The information filtering noise arrived, so as to improve discrimination.
Multilayer perceptron (multilayer perceptron) is illustrated below.
Existing theoretical voice (judgement when receiving voice for identification of perceptron (perceptron) relevant to voice The content of voice) or differentiate people emotion.
Multilayer perceptron (multilayer perceptron) is that have among more than one between input layer and output layer The neural network of layer.Network is that there is no the connection in each layer and output layers by input layer, concealment layer, the connection of output layer direction To feedforward (Feedforward) network of input layer being directly connected to.
Vehicle generally has there are four seat, and the user of speech recognition system is usually driver in vehicle, makes in driver During with speech recognition system, the voice overlapping of more people when the passenger when other seats gives orders or instructions, therefore speech recognition system It can not identify the order of driver.The speech recognition system being commonly used is to be configured without voice before speech recognition section Section and the section is identified as noise and crosses the structure of noise filtering in voice input interval.
The present invention is that the characteristic of voice is extracted using perceptron theory to identify words person's characteristic, according to the data in real time to words Person provides the technology of suitable information.By perceptron, 1. adaptive type information can be provided by the characteristic of each words person, 2. can known Not words person position and function needed for person if the position is provided.Below to 1. and being 2. described in further details.
1. providing adaptive type information according to words person's characteristic
In the case where using multilayer perceptron composition system, driver can be extracted the voice of more people is superimposed Voice.This method is not limited to driver, can be also used for identifying other people.For example, the characteristics of speech sounds of a-quadrant is only extracted, Ignore the voice signal in remaining region B, C, D.
The major premise of perceptron is to be formed with to utilize backpropagation (BACK PROPAGATION) skill previously according to a large amount of DB The state of the algorithm of art training.
It models, such as is extracted by two teens of analysis and a large amount of voices of the good Soul women of state special about perceptron Property (formant, basic frequency, energy value, LPC value etc.) and by input terminal input, output (OUTPUT) object be more than 20 Year and when the good Soul women of state, perceptron inside configuration determines suitable by backpropagation (BACK PROPAGATION) process When weighting (WEIGHT) value.As above in the case where being trained to the people of multifrequency nature, no matter inputting any voice can Characteristic is found in trained structure.LPC value is linear prediction symbolism value, is the voice symbol based on mankind's generation model One of number change mode, the vectors with 26 dimensions.
When inputting formant, the basic frequency, 26 dimensional vector value of LPC model of a large amount of voices of special object, by anti- The specified operation of weighted value appropriate (two teens and the good Soul women of state, three are repeated to multiple objects to expansion process Teens and the road the Qing Shang area male ... being not in good state).
By above-mentioned training process, no matter receiving any voice can be by being input to the eigen vector to the voice The perceptron structure of modeling learns the characteristic of words person.
Benchmark is selected as seat with putting call through immediately after connection (Push-to-Talk, hereinafter referred to as ' PTT ').If there are four PTT key, It is the voice for needing to analyze by the phonetic decision that the microphone for receiving the position of corresponding PTT receives then according to its position, it will Remaining is judged as noise and filters.Identify by filtered voice and provides optimal information, such as words person for words person Command lookup periphery dining room is issued to media product, then searches out the periphery dining room of suitable words person's characteristic first.
Following characteristic can be exported by arranging content described above.
Firstly, differentiating the position PTT and extracting the vector for corresponding to each voice signal characteristic.
Then, the eigen vector of four kinds of signals is inputted to multilayer perceptron structure.
Then, the characteristic of each voice signal is extracted.
Then, when having other characteristics different from reference speech (A), judge other characteristics in A microphone signal Value is judged as noise and filters.
Then, the data execution speech recognition obtained using a-quadrant voice is only extracted, judges the meaning of voice.
Then, the information of the order of most suitable a-quadrant words person is provided.
2. function needed for person if identifying words person position and providing the position
Benchmark is selected as seat with putting call through immediately after connection (Push-to-Talk, hereinafter referred to as ' PTT ').If there are four PTT key, It is the voice for needing to analyze by the phonetic decision that the microphone for receiving the position of corresponding PTT receives then according to its position, it will Remaining is judged as noise and filters.It, can when the passenger for being sitting in the region D issues the order about air-conditioner temperature by taking air-conditioning as an example Only to change the air-conditioning gear of the air-conditioning device in the region D according to order.
It is explained above and distinguishes that the speech recognition system of words person provides the sheet of best information to driver by speech recognition One embodiment of invention.Illustrate the preferred embodiment of the present invention that can be reasoned out from these embodiments below.
Fig. 4 is to summarize the display vehicle arrangement control dress according to the preferred embodiment of the invention for the service of user's adaptive type The block diagram set.
Referring to Fig. 4, the vehicle arrangement control device 400 for user's adaptive type service include characteristic information generating unit 410, Voice messaging analysis unit 420, adaptive type service determining section 430, vehicle arrangement control unit 440, power supply unit 450 and main control unit 460。
The function of power supply unit 450 is to provide power supply to each composition for constituting vehicle arrangement control device 400.Main control unit 460 function is all working respectively constituted that control constitutes vehicle arrangement control device 400.In view of can be by vehicle arrangement Even control device 400 is arranged in AVN system, therefore the present embodiment does not have power supply unit 450 and main control unit 460.
The function of characteristic information generating unit 410 is the characteristic information that user is generated according to the voice messaging of user.
Characteristic information generating unit 410 from voice messaging extract formant (formant) value, frequency values, speech energy value and At least one value in LPC value, can be according to the real-time formation characteristic information of at least one value.
The characteristic information for the user that characteristic information generating unit 410 generates in real time can be the gender information of user, user In age information and the emotion information of user to planting a kind of information.
Characteristic information generating unit 410 corresponds to the concept of the analysis portion 130 in Fig. 1.
The function of voice messaging analysis unit 420 is to obtain meaning information by parsing the voice messaging of user.
The function of adaptive type service determining section 430 is the characteristic information and voice generated according to characteristic information generating unit 410 The meaning information that information analyzing section 420 obtains determines the adaptive type service to user.
The function of vehicle arrangement control unit 440 is that control object equipment of the control including vehicle arrangement makes execution suitable Distribution type services the adaptive type service that determining section 430 determines.
The vehicle arrangement that vehicle arrangement control unit 440 controls can be audio-visual navigation (Audio Video Navigation; AVN) system.
Vehicle arrangement control device 400 can also include voice messaging selector (not shown).
The function of voice messaging selector is selected from these voice messagings when receiving at least two voice messagings One voice messaging.
Voice messaging selector can be believed according to the size of voice messaging, the voice messaging of input and pre-stored voice The choosing of at least one of the position of comparison result, user between breath and multilayer perceptron (multilayer perceptron) Select a voice messaging.
In the case where multilayer perceptron, voice messaging selector can select voice messaging in the following order.
Firstly, each area voice of the vehicle interior received by microphone is input to housebroken perceptron model To extract driver information.
Then, if other regions have the voice of other characteristics of the characteristic in the region being different from the basis of driver, The signal in the voice signal for the microphone that then will enter into driver region is determined as noise and filters.
Then, according to the voice for being input to all positions be separately input to the result that perceptron model obtains be filtered with Obtain voice messaging.
Vehicle arrangement control device 400 can also include voice messaging input unit (not shown).
The function of voice messaging input unit is to receive at least one voice messaging.In particular, the function of voice messaging input unit It is to receive voice messaging at each seat of vehicle.Voice messaging input unit can be with directional microphone prominent form in each seat Position.
In this case, vehicle arrangement control unit 440 can control vehicle arrangement and to execute adaptive type clothes by each seat Business.
Vehicle arrangement control device 400 can also include that adaptive type service execution judging part (not shown) and alternative service are true Determine portion's (not shown).
The function of adaptive type service execution judging part is to be determined whether to execute adaptive type clothes according to the information of user's input Business.
The function of alternative service determining section is when judging result is not execute adaptive type service, according to the letter of user's input Breath determines the alternative service for substituting adaptive type service.
In this case, vehicle arrangement control unit 440 makes vehicle arrangement execute alternative service by control.
Working method of the explanation for the vehicle arrangement control device 400 of user's adaptive type service below.
Fig. 5 is to summarize to show the vehicle arrangement controlling party according to the preferred embodiment of the invention for the service of user's adaptive type The flow chart of method.It is illustrated referring to Fig. 5.
First in step S510, voice messaging input unit receives the voice messaging of user from each seat of vehicle.
Then in step S530, characteristic information generating unit 410 is believed according to the characteristic that the voice messaging of user generates user Breath.And in step S520, voice messaging analysis unit 420 obtains meaning information by the voice messaging of parsing user.Step S530 can be performed simultaneously with step S520, but can also be executed before step S520 or step S520 after execute.
Then in step S540, adaptive type services determining section 430 to be believed according to the characteristic that characteristic information generating unit 410 generates Breath determines the adaptive type service to user with the meaning information that voice messaging analysis unit 420 obtains.
Then in step S550, vehicle arrangement control unit 440 controls the control object equipment including vehicle arrangement So that executing the adaptive type service that adaptive type service determining section 430 determines.
In addition after step S510, voice messaging selector can be from these languages when receiving at least two voice messagings A voice messaging is selected in message breath.The above-mentioned steps of voice messaging selector can be between step S510 and step S520 It is executed between execution or step S510 and step S530.
In addition between step S540 and step S550, adaptive type service execution judging part is sentenced according to the information that user inputs It is disconnected whether to execute adaptive type service.Then, alternative service determining section judging result be do not execute adaptive type service when, according to The information of user's input determines the alternative service for substituting adaptive type service.
All constituent elements for describing the composition embodiment of the present invention above are combined into one or combine work, but the present invention It is not limited to these embodiments.I.e. within the scope of the purpose of the present invention, all constituent elements can be by more than one selectivity Ground combination work.Also, its all constituent element can occur in the form of an independent hardware respectively, but the also property of can choose Ground combines part or all of each component, by have for execute part that one or more hardware combinations are realized or The computer program of the program module of repertoire is realized.Also, this computer program can be stored in USB storage, CD The computer-readable recording medium such as disk, flash disk (Flash Memory) (Computer Readable Media), by counting Calculate it is machine-readable take and execute, to realize the embodiment of the present invention.The recording medium of computer program may include magnetic recording medium, light Recording medium, carrier wave (Carrier Wave) medium etc..
Also, include the case where all terms including technology or scientific words in illustrating without separately defining following table Show and is generally understood the identical meaning with general technical staff of the technical field of the invention.Usually used dictionary definition Term should be interpreted that the meaning consistent with the meaning of the context of the relevant technologies must not if undefined in the present invention It is construed to ideal or excessively formality the meaning.
The above description is only an example illustrates technical solution of the present invention, and those of ordinary skill in the art are not departing from this A variety of amendments, change and replacement can be carried out in the range of inventive nature characteristic.Therefore, disclosed embodiment of this invention and attached drawing And non-limiting technical solution of the present invention, but for illustrating, technical solution of the present invention be not limited to these embodiments and Attached drawing.Protection scope of the present invention is determined by technical solution, is both contained in the present invention with all technical solutions of its equivalency range Technical solution in.

Claims (11)

1. a kind of vehicle arrangement control device for user's adaptive type service characterized by comprising
Characteristic information generating unit generates the characteristic information of the user according to the voice messaging of user;
Voice messaging analysis unit obtains meaning information by parsing the voice messaging;
Adaptive type services determining section, is determined according to the characteristic information and the meaning information and is taken to the adaptive type of the user Business;
Vehicle arrangement control unit controls the control object equipment including vehicle arrangement and to execute the adaptive type clothes Business;
Adaptive type service execution judging part judges whether to execute the adaptive type service according to the information that the user inputs; And
Alternative service determining section, when judging result is not execute adaptive type service, according to the letter of user input Breath determines the alternative service for substituting the adaptive type service.
2. the vehicle arrangement control device according to claim 1 for the service of user's adaptive type, it is characterised in that:
The characteristic information generating unit extracts resonance peak, frequency values, speech energy value and linear prediction from the voice messaging At least one of encoded radio value, and the characteristic information is generated according at least one described value in real time.
3. the vehicle arrangement control device according to claim 1 for the service of user's adaptive type, it is characterised in that:
The characteristic information generating unit generates the gender information of the user, the age information of the user and the user in real time At least one of emotion information information as the characteristic information.
4. the vehicle arrangement control device according to claim 1 for the service of user's adaptive type, which is characterized in that also wrap It includes:
Voice messaging selector selects a language when receiving at least two voice messagings from multiple voice messagings Message breath.
5. the vehicle arrangement control device according to claim 4 for the service of user's adaptive type, it is characterised in that:
The voice messaging selector according to the size of voice messaging, the voice messaging of input and pre-stored voice messaging it Between comparison result, the user position and the one voice messaging of at least one of multilayer perceptron selection.
6. the vehicle arrangement control device according to claim 1 for the service of user's adaptive type, which is characterized in that also wrap It includes:
Voice messaging input unit receives the voice messaging from each seat of vehicle,
The vehicle arrangement control unit control vehicle arrangement to execute the adaptive type service respectively by each seat.
7. the vehicle arrangement control device according to claim 6 for the service of user's adaptive type, it is characterised in that:
The voice messaging input unit includes the directional microphone for being set to each seat.
8. the vehicle arrangement control device according to claim 1 for the service of user's adaptive type, it is characterised in that:
The vehicle arrangement of the vehicle arrangement control unit control is image and sound guidance system.
9. a kind of vehicle arrangement control method for user's adaptive type service characterized by comprising
The step of generating the characteristic information of the user according to the voice messaging of user;
By parsing the step of voice messaging obtains meaning information;
The step of servicing the adaptive type of the user is determined according to the characteristic information and the meaning information;
Control includes the steps that the control object equipment including vehicle arrangement to execute the adaptive type service;
Judge whether the step of executing adaptive type service according to the information that the user inputs;And
When judging result is not execute adaptive type service, determined according to the information of user input described for substituting The step of alternative service of adaptive type service.
10. the vehicle arrangement control method according to claim 9 for the service of user's adaptive type, it is characterised in that:
The step generated generates the sense of the gender information of the user, the age information of the user and the user in real time At least one of feelings information information is as the characteristic information.
11. the vehicle arrangement control method according to claim 9 for the service of user's adaptive type, which is characterized in that also Include:
The step of receiving the voice messaging from each seat of vehicle,
The step of control is specifically to control vehicle arrangement to execute the adaptive type service respectively by each seat.
CN201510514457.9A 2014-09-02 2015-08-20 Vehicle arrangement control device and method for user's adaptive type service Active CN105390136B (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
KR10-2014-0116184 2014-09-02
KR1020140116184A KR102249392B1 (en) 2014-09-02 2014-09-02 Apparatus and method for controlling device of vehicle for user customized service

Publications (2)

Publication Number Publication Date
CN105390136A CN105390136A (en) 2016-03-09
CN105390136B true CN105390136B (en) 2019-05-21

Family

ID=55422356

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201510514457.9A Active CN105390136B (en) 2014-09-02 2015-08-20 Vehicle arrangement control device and method for user's adaptive type service

Country Status (2)

Country Link
KR (1) KR102249392B1 (en)
CN (1) CN105390136B (en)

Families Citing this family (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105976815A (en) * 2016-04-22 2016-09-28 乐视控股(北京)有限公司 Vehicle voice recognition method and vehicle voice recognition device
KR102497299B1 (en) 2016-06-29 2023-02-08 삼성전자주식회사 Electronic apparatus and method for controlling the electronic apparatus
CN107590120A (en) * 2016-07-07 2018-01-16 深圳狗尾草智能科技有限公司 Artificial intelligence process method and device
KR102568143B1 (en) * 2016-09-23 2023-08-18 주식회사 케이티 Method and device for providing customized service mode
KR102329888B1 (en) 2017-01-09 2021-11-23 현대자동차주식회사 Speech recognition apparatus, vehicle having the same and controlling method of speech recognition apparatus
KR101883301B1 (en) 2017-01-11 2018-07-30 (주)파워보이스 Method for Providing Personalized Voice Recognition Service Using Artificial Intellignent Speaker Recognizing Method, and Service Providing Server Used Therein
KR20180106196A (en) 2017-03-17 2018-10-01 현대자동차주식회사 Apparatus and method for optimizing navigation performance
KR102437833B1 (en) 2017-06-13 2022-08-31 현대자동차주식회사 Apparatus for selecting at least one task based on voice command, a vehicle including the same and a method thereof
KR20190114325A (en) 2018-03-29 2019-10-10 삼성전자주식회사 The apparatus for processing user voice input
KR102562227B1 (en) * 2018-06-12 2023-08-02 현대자동차주식회사 Dialogue system, Vehicle and method for controlling the vehicle
CN110503947B (en) * 2018-05-17 2024-06-18 现代自动车株式会社 Dialogue system, vehicle including the same, and dialogue processing method
KR102114843B1 (en) * 2018-11-26 2020-05-26 한국생산기술연구원 Interactive Module Service System and Method for Custom Assembly Vehicle Industry based on emotion
KR102275873B1 (en) * 2018-12-18 2021-07-12 한국전자기술연구원 Apparatus and method for speaker recognition
KR102235091B1 (en) * 2019-02-21 2021-04-02 주식회사 에스디아이컴퍼니 Fabrics
JP7211856B2 (en) * 2019-03-11 2023-01-24 本田技研工業株式会社 AGENT DEVICE, AGENT SYSTEM, SERVER DEVICE, CONTROL METHOD FOR AGENT DEVICE, AND PROGRAM
CN114049894A (en) * 2022-01-11 2022-02-15 广州小鹏汽车科技有限公司 Voice interaction method and device, vehicle and storage medium
CN115170239A (en) * 2022-07-14 2022-10-11 艾象科技(深圳)股份有限公司 Commodity customization service system and commodity customization service method

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102802114A (en) * 2012-06-20 2012-11-28 北京语言大学 Method and system for screening seat by using voices
CN102800315A (en) * 2012-07-13 2012-11-28 上海博泰悦臻电子设备制造有限公司 Vehicle-mounted voice control method and system
CN103137125A (en) * 2011-11-30 2013-06-05 北京德信互动网络技术有限公司 Intelligent electronic device based on voice control and voice control method
CN103137043A (en) * 2011-11-23 2013-06-05 财团法人资讯工业策进会 Advertisement display system and advertisement display method in combination with search engine service
CN103324729A (en) * 2013-06-27 2013-09-25 北京小米科技有限责任公司 Method and device for recommending multimedia resources
CN103491411A (en) * 2013-09-26 2014-01-01 深圳Tcl新技术有限公司 Method and device based on language recommending channels

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP5152570B2 (en) * 2008-03-06 2013-02-27 株式会社デンソー Automotive user hospitality system
JP5182113B2 (en) * 2009-01-16 2013-04-10 三菱自動車工業株式会社 Control device for in-vehicle equipment
JP5972372B2 (en) 2012-06-25 2016-08-17 三菱電機株式会社 Car information system
KR101467298B1 (en) * 2012-11-16 2014-12-03 에스케이플래닛 주식회사 System and method for recommending contents in vehicle
KR20140067687A (en) * 2012-11-27 2014-06-05 현대자동차주식회사 Car system for interactive voice recognition

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103137043A (en) * 2011-11-23 2013-06-05 财团法人资讯工业策进会 Advertisement display system and advertisement display method in combination with search engine service
CN103137125A (en) * 2011-11-30 2013-06-05 北京德信互动网络技术有限公司 Intelligent electronic device based on voice control and voice control method
CN102802114A (en) * 2012-06-20 2012-11-28 北京语言大学 Method and system for screening seat by using voices
CN102800315A (en) * 2012-07-13 2012-11-28 上海博泰悦臻电子设备制造有限公司 Vehicle-mounted voice control method and system
CN103324729A (en) * 2013-06-27 2013-09-25 北京小米科技有限责任公司 Method and device for recommending multimedia resources
CN103491411A (en) * 2013-09-26 2014-01-01 深圳Tcl新技术有限公司 Method and device based on language recommending channels

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
《多类噪声环境下的语音端点检测》;汤霖等;《计算机工程与应用》;20121031;第48卷(第29期);第114-118、156页

Also Published As

Publication number Publication date
KR20160027728A (en) 2016-03-10
KR102249392B1 (en) 2021-05-07
CN105390136A (en) 2016-03-09

Similar Documents

Publication Publication Date Title
CN105390136B (en) Vehicle arrangement control device and method for user's adaptive type service
CN111508474B (en) Voice interruption method, electronic equipment and storage device
CN105765650B (en) With multidirectional decoded voice recognition
US11184412B1 (en) Modifying constraint-based communication sessions
EP3090429B1 (en) Modifying operations based on acoustic ambience classification
KR20220054602A (en) Systems and methods that support selective listening
CN109189980A (en) The method and electronic equipment of interactive voice are carried out with user
DE112021001064T5 (en) Device-directed utterance recognition
CN107819929A (en) It is preferred that the identification and generation of emoticon
CN113168832A (en) Alternating response generation
CN111145721A (en) Personalized prompt language generation method, device and equipment
DE102018125966A1 (en) SYSTEM AND METHOD FOR RECORDING KEYWORDS IN A ENTERTAINMENT
US11393473B1 (en) Device arbitration using audio characteristics
CN116417003A (en) Voice interaction system, method, electronic device and storage medium
CN110286745A (en) Dialog process system, the vehicle with dialog process system and dialog process method
CN109274922A (en) A kind of Video Conference Controlling System based on speech recognition
CN101867742A (en) Television system based on sound control
US20240062164A1 (en) Data ingestion and understanding for natural language processing systems
DE112022000504T5 (en) Interactive content delivery
CN109791764A (en) Communication based on speech
US20240071408A1 (en) Acoustic event detection
CN103390406A (en) Speaker authentication method, preparation method of speaker authentication and electronic device
CN117882131A (en) Multiple wake word detection
US11798538B1 (en) Answer prediction in a speech processing system
WO2021125037A1 (en) Signal processing device, signal processing method, program, and signal processing system

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant