CN102332265A - Method for improving voice recognition rate of automobile voice control system - Google Patents

Publication number
CN102332265A
CN102332265A CN201110164289A CN201110164289A CN102332265A CN 102332265 A CN102332265 A CN 102332265A CN 201110164289 A CN201110164289 A CN 201110164289A CN 201110164289 A CN201110164289 A CN 201110164289A CN 102332265 A CN102332265 A CN 102332265A
Authority
CN
China
Prior art keywords
voice
phonetic
order
keyword
level
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201110164289A
Other languages
Chinese (zh)
Other versions
CN102332265B (en)
Inventor
张方伟
邓健
陈冰
朱祝阳
丁武俊
熊想涛
陈文强
潘之杰
赵福全
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Zhejiang Geely Holding Group Co Ltd
Zhejiang Geely Automobile Research Institute Co Ltd
Original Assignee
Zhejiang Geely Holding Group Co Ltd
Zhejiang Geely Automobile Research Institute Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Zhejiang Geely Holding Group Co Ltd, Zhejiang Geely Automobile Research Institute Co Ltd
Priority to CN201110164289.7A
Publication of CN102332265A
Application granted
Publication of CN102332265B
Legal status: Active
Anticipated expiration

Landscapes

  • Machine Translation (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses a method for improving the voice recognition rate of an automobile voice control system. The method comprises the following steps: (1) configuring a voice prompt module for the automobile voice control system, whose voice prompt signal applies time-sharing control to the entertainment signals played by the automobile sound system; (2) configuring the voice recognition module of the automobile voice control system with a graded voice command flow that uses graded input and graded recognition; and (3) configuring the voice recognition module with a flow for robust recognition of the graded voice commands. The method effectively mitigates the pronunciation differences that lower the recognition rate of the automobile voice recognition system, so that the expected recognition rate is maintained and the reliability and stability of recognition are improved. The method can also be applied to the electrical appliance voice control systems of automobiles of different grades.

Description

A method for improving the speech recognition rate of an automobile voice control system
Technical field
The invention belongs to the technical field of automotive electrical appliance control and relates to the speech recognition system used for automobile voice control, in particular to a method for improving the speech recognition rate of an automobile voice control system.
Background
At present, methods for improving the recognition rate of speech recognition systems usually focus only on algorithms or hardware. With the rapid expansion of the automobile market, the population of car buyers keeps growing and its makeup has changed greatly: from prosperous cities to the vast countryside, from white-collar workers to manual laborers, and from the highly educated to those with only compulsory schooling, all of these consumers share the same desire for vehicle performance. Many car owners, however, do not speak with standard pronunciation. China is a vast, multi-ethnic country. Leaving aside minority languages and the wide variety of local dialects, even Mandarin as spoken in northern China differs considerably from Mandarin as spoken in the south under the influence of regional accents. The Beijing pronunciation itself, with its strong local colloquial flavor, differs noticeably from standard Mandarin. The differences are especially large for retroflex sounds, front nasal sounds and back nasal sounds. This is an unavoidable key factor affecting the recognition rate of automobile speech recognition systems. Beyond raising the recognition rate through algorithms or hardware, how to combine several methods to raise it further is an important topic that major automakers in China and worldwide follow closely.
An example of prior-art speech recognition is the invention patent with application number 200810103679.1, titled "Vehicle electrical appliance voice control method based on a command word list". That method comprises: generating a speech vocabulary and a noise vocabulary; recording the driver's speech with a speech recognition engine; merging the speech vocabulary and the noise vocabulary and loading the merged vocabulary into the speech recognition engine, which selects from the speech vocabulary the word whose pronunciation is closest to the driver's speech; recognizing the driver's intent from the word output by the engine, the recognition result being a definite appliance control command; and converting that command into a CAN bus control signal that is output on the CAN bus to control the appliance. Although this control method suppresses noise in the driving environment well, it has the following shortcomings: (1) the driver's pronunciation still affects the recognition rate of the automobile speech recognition system; (2) it cannot recognize the voices of passengers other than the driver; (3) because it merely selects the vocabulary word closest to the driver's speech, the reliability of recognizing the driver's intent is not substantially improved.
Some automobile brands currently on the market already use voice control to make the vehicle perform actions such as playing music or switching on the reading lamp. Automobile voice control systems have been developing steadily, as shown by prior-art patents of this group such as "Automobile entertainment voice control system", application number 200920315418. In the prior art, the reliability and recognition probability of the speech recognition system have always been the core technical problem of automobile voice control. Recognizing and confirming the keywords of a voice command is itself problematic. Take the air conditioner as the voice-controlled appliance: its two basic keywords are "open air conditioner" and "close air conditioner". These two keywords share two identical characters, which greatly reduces the reliability of recognition and can even cause a wrong operation.
Summary of the invention
The object of the invention is to overcome the shortcomings of the prior art by proposing a method for improving the recognition rate of an automobile speech recognition system. The method mitigates the pronunciation differences that affect the recognition rate, can recognize the voices of passengers other than the driver, and raises the recognition rate of the automobile speech recognition system.
The object of the invention is achieved through the following technical solution.
A method for improving the recognition rate of an automobile speech recognition system comprises the following steps:
Step 1: configure a voice prompt module for the automobile voice control system. The voice prompt module shares the playback components of the car audio system, and the voice prompt signal it outputs applies time-sharing control to the entertainment signal played by the car audio system.
Step 2: configure the speech recognition module of the automobile voice control system with a graded voice command flow that uses graded input and graded recognition. Grading the voice commands keeps identical or similar-sounding words among the commands of any one level to a minimum, which greatly improves recognition reliability.
Step 3: configure the speech recognition module of the automobile voice control system with a flow for robust recognition of the graded voice commands, so that the module keeps the expected recognition probability even when pronunciation is inexact, which greatly raises the recognition rate of the automobile voice control system.
In the described method, the graded voice command flow of step 2 comprises the following steps:
(1) Start the automobile voice control system with a button or by non-button means.
(2) The voice prompt module outputs the voice prompt signal "Welcome to use first-level voice commands", including "Mytip welcomes you"; the prompt plays the content of the first-level keyword group through the playback components of the audio system.
(3) The voice control system collects the first-level keyword voice signal spoken by a vehicle occupant.
(4) The speech recognition module recognizes the first-level keyword voice signal and confirms the first-level voice command. If the result is "no", the received speech is not a first-level voice command, and the flow returns to (3) to wait for a first-level keyword from an occupant. If the result is "yes", the received speech is a first-level voice command, and the flow continues to (5).
(5) Depending on which first-level keyword was recognized, the voice prompt module outputs the corresponding second-level prompt, for example "Second-level voice command, available commands: open, stop or close" or "Available commands: raise, stop or lower"; the prompt plays the corresponding content of the second-level keyword group through the playback components of the audio system.
(6) The voice control system collects the second-level keyword voice signal spoken by a vehicle occupant.
(7) The speech recognition module recognizes the second-level keyword and confirms the second-level voice command. If the result is "no", the received speech is not a second-level voice command, and the flow returns to (6) to wait for a second-level keyword from an occupant. If the result is "yes", the received speech is a second-level voice command, and the flow continues to (8).
(8) The voice control system outputs the control signal formed by combining the first-level and second-level voice commands, and the system control module controls the corresponding voice-controlled appliance.
(9) End of flow.
Grading the voice commands keeps identical or similar-sounding words among the commands of any one level to a minimum, which greatly improves recognition reliability.
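The two-level dialogue in steps (1) to (9) can be sketched as follows. This is a minimal illustration only, not the patented implementation: recognition is reduced to exact string comparison, and the names COMMAND_TREE, await_keyword and run_graded_dialogue, as well as the English keywords, are assumptions introduced for the example.

```python
from typing import Callable, Dict, Iterable, Iterator, Optional, Tuple

# Hypothetical two-level vocabulary: first-level control objects mapped to
# the second-level actions offered for each object.
COMMAND_TREE: Dict[str, Tuple[str, ...]] = {
    "air conditioner": ("open", "stop", "close"),
    "window": ("raise", "stop", "lower"),
}


def await_keyword(utterances: Iterator[str], allowed: Iterable[str]) -> Optional[str]:
    """Steps (3)-(4) and (6)-(7): keep collecting until a keyword of this level is heard."""
    allowed_set = set(allowed)
    for heard in utterances:
        if heard in allowed_set:
            return heard  # judged "yes": continue with the next step
        # judged "no": loop back and wait for the next utterance
    return None  # the input ended without a valid keyword


def run_graded_dialogue(
    utterances: Iterable[str],
    prompt: Callable[[str], None] = print,
) -> Optional[Tuple[str, str]]:
    """Return the (first-level, second-level) pair that step (8) would combine."""
    stream = iter(utterances)
    prompt("First-level voice commands: " + ", ".join(COMMAND_TREE))  # step (2)
    target = await_keyword(stream, COMMAND_TREE)
    if target is None:
        return None
    actions = COMMAND_TREE[target]
    prompt("Available commands: " + ", ".join(actions))  # step (5)
    action = await_keyword(stream, actions)
    if action is None:
        return None
    return target, action
```

Because each level only listens for its own short keyword list, an utterance such as "open" is rejected while the system waits for a control object, and "radio" is rejected because it is not in the hypothetical first-level list.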
In the described method, the robust recognition of graded voice commands comprises the following steps:
(1) Start the voice control system.
(2) Initialize the voice control system:
1) Define and build a database of robust, inexact phonetic (pinyin) models that are treated as equivalent to the exact phonetic model of each voice command, referred to below as the voice command robust phonetic model database. Taking into account the differences between accented pronunciations of common voice commands and standard pronunciation, the database is built on the exact phonetic models derived from the standard pronunciation of the command keywords. Its goal is to let the speech recognition module keep the expected recognition rate when a voice command is pronounced inexactly, and thereby improve the reliability and stability of speech recognition in the voice control system.
2) Determine the inexact proximity criteria for the robust phonetic model database, including:
A. a retroflex pinyin and its non-retroflex counterpart are judged to be close;
B. a front-nasal pinyin and its non-front-nasal counterpart are judged to be close;
C. a back-nasal pinyin and its non-back-nasal counterpart are judged to be close.
3) Define the first-level keyword group of the first-level voice commands, and construct for each keyword in the group the exact phonetic model of its standard pronunciation and its robust phonetic models.
Build the first-level keyword sub-library containing the first-level keyword group. The phonetic model of each first-level keyword comprises one exact phonetic model and several close robust phonetic models.
4) Define the second-level keyword group of the second-level voice commands, and construct for each keyword in the group the exact phonetic model of its standard pronunciation and its robust phonetic models.
Build the second-level keyword sub-library containing the second-level keyword group. The phonetic model of each second-level keyword comprises one exact phonetic model and several close robust phonetic models.
(3) The speech recognition module receives the first-level voice command issued by the speaker.
(4) The speech recognition module first calls the first-level keyword sub-library to perform robust matching. The input is compared with the exact phonetic model and the several close robust phonetic models of each first-level keyword, and agreement with any one of them (logical OR) counts as "matched". If the result is "no", return to step (3); if the result is "yes", go to step (5).
(5) Output the keyword code of the matched first-level voice command.
(6) The speech recognition module receives the second-level voice command issued by the speaker.
(7) The speech recognition module then calls the second-level keyword sub-library to perform robust matching. The input is compared with the exact phonetic model and the several close robust phonetic models of each second-level keyword, and agreement with any one of them (logical OR) counts as "matched". If the result is "no", return to step (6); if the result is "yes", go to step (8).
(8) Output the action keyword code of the matched second-level voice command, that is, the code value of the matched second-level keyword.
(9) Combine the matched first-level and second-level keywords: the code value of the first-level keyword and the code value of the second-level keyword together form the combined voice command code.
(10) Output the control signal of the matched combined voice command code.
(11) End of flow.
Consider a voice command containing a word with a retroflex, front nasal or back nasal sound, for example "raise", whose standard pronunciation "sheng" has a retroflex initial and a back nasal final. Its close non-standard pronunciations include the non-retroflex "sen" and "seng". Provided these non-standard pronunciations cannot be confused with any other voice command, the standard "sheng" and the non-standard "sen" and "seng" are gathered into the same voice command. Proceeding in the same way for the other commands builds an automobile voice command database that covers both northern and southern Mandarin. Each voice command in this database is a set of several close pronunciations (an OR set). The speech recognition system extracts the pronunciations in the set, tests them one by one, and ORs the individual results. This greatly raises the recognition rate of the speech recognition system.
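The OR-set idea can be sketched as follows, assuming the recognizer front end already yields a pinyin string. The function names, the command codes and the rule that the first command to claim a pronunciation keeps it are assumptions made for illustration; the source only requires that a variant must not be confusable with another command.

```python
from typing import Dict, Optional, Set


def build_robust_library(
    exact: Dict[str, str],
    variants: Dict[str, Set[str]],
) -> Dict[str, Set[str]]:
    """Map each command code to its exact pinyin plus the admissible close variants.

    A variant is admitted only if no other command already accepts the same
    pronunciation, so that it cannot be confused with another command.
    """
    library = {code: {pinyin} for code, pinyin in exact.items()}
    for code in exact:
        for candidate in sorted(variants.get(code, ())):
            clash = any(
                candidate in forms
                for other, forms in library.items()
                if other != code
            )
            if not clash:
                library[code].add(candidate)
    return library


def match(heard: str, library: Dict[str, Set[str]]) -> Optional[str]:
    """Test every pronunciation of every command and OR the individual results."""
    for code, forms in library.items():
        if any(heard == form for form in forms):
            return code
    return None
```

With a hypothetical library built from {"RAISE": "sheng", "STOP": "ting"} and the variants {"RAISE": {"sen", "seng"}}, the inputs "sheng", "sen" and "seng" all resolve to RAISE, while an unrelated syllable returns None and would send the flow back to the listening step.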
In the described method, the time-sharing control of step 1 generates a time-sharing control signal from the voice prompt signal output by the voice prompt module; the time-sharing control signal applies time-sharing enable control to the entertainment signal of the audio system.
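One possible reading of this time-sharing enable control is sketched below. It assumes that the entertainment signal is simply disabled while a prompt is playing, which the source does not state explicitly; the class and method names are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class AudioTimeSharing:
    """Shares one playback path between entertainment audio and voice prompts."""

    prompt_active: bool = False

    def begin_prompt(self) -> None:
        # The voice prompt signal asserts the time-sharing control signal.
        self.prompt_active = True

    def end_prompt(self) -> None:
        self.prompt_active = False

    @property
    def entertainment_enabled(self) -> bool:
        # Assumption: entertainment output is enabled only when no prompt plays.
        return not self.prompt_active

    def output(self, entertainment_sample: float, prompt_sample: float) -> float:
        """Return the sample that reaches the shared playback components."""
        if self.entertainment_enabled:
            return entertainment_sample
        return prompt_sample
```

Gating the entertainment signal in this way would also keep music out of the cabin while the system speaks, although the source does not say whether it stays gated while the occupant answers.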
In the described method, the first-level voice commands comprise the names of the control objects, namely DVD audio, air conditioner, reading lamp, window, sunroof, rearview mirror, trunk and others, and a code is assigned to each first-level voice command.
In the described method, the second-level voice commands comprise the action keywords that match the control object names of the first-level voice commands, and a code is assigned to each second-level voice command.
In the described method, the first-level voice command code is associated with the second-level voice command code; the combined two-level voice command code is formed by combining the two codes and is used to control the corresponding appliance.
In the described method, an action keyword of a second-level voice command may match several first-level voice commands.
In the described method, the action keywords of the second-level voice commands comprise: "open" and its synonyms (turn on, switch on, start, play, answer, navigate); "stop" and its synonyms (halt, pause, interrupt); "close" and its synonyms (turn off, shut, exit, disconnect).
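The code assignment and combination described above might look like the following sketch. All code values, the 4-bit packing and the English keywords are assumptions for illustration; the source only states that each keyword receives a code and that the two codes are combined.

```python
from typing import Dict, Optional

# Hypothetical first-level codes, one per control object.
OBJECT_CODES: Dict[str, int] = {
    "dvd audio": 0x1,
    "air conditioner": 0x2,
    "reading lamp": 0x3,
    "window": 0x4,
}

# Hypothetical second-level codes; one action code may serve several objects.
ACTION_CODES: Dict[str, int] = {"open": 0x1, "stop": 0x2, "close": 0x3}

# Synonyms resolve to the canonical action keyword before the code lookup.
ACTION_SYNONYMS: Dict[str, str] = {
    "play": "open",
    "answer": "open",
    "shut": "close",
    "exit": "close",
    "disconnect": "close",
}


def action_code(word: str) -> Optional[int]:
    """Look up the second-level code, accepting synonyms of the action keyword."""
    return ACTION_CODES.get(ACTION_SYNONYMS.get(word, word))


def combined_code(object_word: str, action_word: str) -> Optional[int]:
    """Pack the first-level code into the high nibble and the action into the low nibble."""
    obj = OBJECT_CODES.get(object_word)
    act = action_code(action_word)
    if obj is None or act is None:
        return None
    return (obj << 4) | act
```

Under these assumed values, "window" followed by "shut" yields 0x43, the same combined code as "window" followed by "close".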
In view of the differences between accented pronunciations of voice commands and standard pronunciation, the voice command robust phonetic model database is built on the exact phonetic models derived from the standard pronunciation of the command keywords. Its goal is to let the speech recognition module keep the expected recognition rate when a voice command is pronounced inexactly, and thereby improve the reliability and stability of speech recognition in the voice control system.
Substantive effects of the invention:
1. The method effectively mitigates the pronunciation differences that affect the recognition rate of the automobile speech recognition system.
2. The method uses the voice command robust phonetic model database as the reference samples for recognition, so the speech recognition module compensates for pronunciation differences, keeps the expected recognition rate, and improves the reliability and stability of the automobile speech recognition system.
3. The method allows several occupants, including the driver, to issue voice commands, which meets the practical needs of the broad population of car users.
4. The method can be applied to the electrical appliance voice control systems of automobiles of different grades.
Description of the drawings
Fig. 1 is a logic flowchart of the voice command grading method of the invention.
Fig. 2 is a logic flowchart of the robust recognition method for graded voice commands of the invention.
Fig. 3 is a flowchart of the voice command grading method in an embodiment of the invention.
Fig. 4 is a flowchart of the robust recognition method for graded voice-control commands in an embodiment of the invention.
Detailed description of the embodiments
The technical solution of the invention is described in further detail below through embodiments and with reference to the drawings.
An automobile speech recognition system usually comprises a voice sensor, a speech recognition module, a command execution module and a voice prompt module. The method of the invention for improving the recognition rate is built on such a prior-art automobile speech recognition system.
The voice command grading flow of the invention is shown in Fig. 1. It comprises the following steps:
S101 Start the voice control system with a button.
S102 The voice prompt module of the voice control system issues the voice prompt "Welcome to use first-level voice commands", followed by the first-level keyword group (the keyword group contains the names of all voice-controlled objects in the vehicle).
S103 The voice control system collects, via the voice sensor, the first-level keyword voice signal spoken by a vehicle occupant.
S104 The speech recognition module recognizes and confirms the first-level keyword of the voice command. If the result is "no", return to S103; if the result is "yes", go to S105.
S105 The voice prompt module of the voice control system continues with the voice prompt "Welcome to use second-level voice commands", followed by the second-level keyword group (the keyword group contains the names of the control actions for all voice-controlled objects in the vehicle).
S106 The voice control system continues to collect, via the voice sensor, the second-level keyword voice signal spoken by a vehicle occupant.
S107 The speech recognition module recognizes and confirms the second-level keyword of the voice command. If the result is "no", return to S106; if the result is "yes", go to S108.
S108 The voice control system combines the first-level and second-level voice commands and outputs the control signal word to the command execution module.
S109 End of flow.
The flow of the robust recognition method for graded voice commands is shown in Fig. 2. It comprises the following steps:
S201 Start the voice control system with a button.
S202 Initialize the voice control system, which comprises S203 to S206:
S203 Define and build the voice command robust phonetic model database.
S204 Determine the inexact proximity criteria for the robust phonetic model database.
S205 Define the first-level keyword group of the first-level voice commands and build the first-level keyword sub-library.
S206 Define the second-level keyword group of the second-level voice commands and build the second-level keyword sub-library.
S207 The speech recognition module receives the first-level voice command spoken by a vehicle occupant.
S208 The speech recognition module performs robust matching of the first-level voice command against the first-level keyword sub-library. If the result is "no", return to S207; if the result is "yes", go to S209 and, at the same time, to S210.
S209 Output the code value of the matched first-level voice command and pass it to S213.
S210 The speech recognition module continues by receiving the second-level voice command spoken by a vehicle occupant.
S211 The speech recognition module performs robust matching of the second-level voice command against the second-level keyword sub-library. If the result is "no", return to S210; if the result is "yes", go to S212.
S212 Output the code value of the matched second-level voice command and pass it to S213.
S213 The speech recognition module combines the matched first-level and second-level voice command codes.
S214 Output the control signal of the matched combined voice command code.
S215 End of flow.
Fig. 3 shows the flowchart of the voice command grading method in an embodiment of the invention.
Voice commands are graded according to certain principles so that, after grading, the commands within each level contain essentially no similar keywords. The invention divides commands into two levels, module and action. As shown in Fig. 3, the four keywords that originally sat on one level, "open air conditioner", "close air conditioner", "reading lamp on" and "reading lamp off", are split into two levels: the first level is "air conditioner" and "reading lamp"; the second level has two keywords for each first-level command, namely "open" or "close" and "on" or "off".
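The benefit of the split can be checked with a small sketch that counts, within one recognition level, the keyword pairs sharing a word. English words stand in here for the shared Chinese characters, and the function name and keyword spellings are assumptions made for the example.

```python
from itertools import combinations
from typing import Iterable, List, Tuple


def shared_word_pairs(keywords: Iterable[str]) -> List[Tuple[str, str]]:
    """Return the keyword pairs of one level that have at least one word in common."""
    pairs = []
    for first, second in combinations(list(keywords), 2):
        if set(first.split()) & set(second.split()):
            pairs.append((first, second))
    return pairs


# Ungraded vocabulary: all four commands compete in a single recognition pass.
flat = [
    "open air-conditioner",
    "close air-conditioner",
    "reading-lamp on",
    "reading-lamp off",
]

# Graded vocabulary: objects first, then the actions valid for the chosen object.
level_1 = ["air-conditioner", "reading-lamp"]
level_2 = {"air-conditioner": ["open", "close"], "reading-lamp": ["on", "off"]}
```

In the flat list, two pairs share a word ("air-conditioner" and "reading-lamp" respectively), whereas no pair within level_1 or within either level_2 list does.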
The voice commands and actions of other voice-controlled electrical modules can be graded with the same method.
Fig. 4 shows the flowchart of the robust recognition method for graded voice-control commands in an embodiment of the invention.
The retroflex phonetic symbols of Hanyu Pinyin are zh, ch, sh and r, which are pronounced with the tip of the tongue curled up.
The front nasal phonetic symbols are an, en, in and un; the back nasal phonetic symbols are ang, eng, ing and ong.
When a keyword in the commands of the voice control system has a retroflex initial and a back nasal final, such as "raise" ("sheng"), the close pronunciations that drop the retroflex or the back nasal, namely "shen", "sen" and "seng", are all recognized as the keyword "raise". Thus even if the user cannot distinguish retroflex, back nasal or front nasal sounds, the command is neither misrecognized nor left unrecognized, and the recognition rate of the speech recognition system improves.
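Close pronunciations of this kind can be generated mechanically, as in the sketch below. It is an illustration under simplifying assumptions: only the zh/z, ch/c and sh/s initial pairs and the ang/an, eng/en and ing/in final pairs are swapped, r, ong and un are left out because they have no obvious one-to-one counterpart, and the results are not checked against the inventory of valid pinyin syllables. All names are hypothetical.

```python
from typing import Set

# Assumed retroflex initial -> non-retroflex counterpart.
RETROFLEX_TO_FLAT = {"zh": "z", "ch": "c", "sh": "s"}
# Assumed back nasal final -> front nasal counterpart.
BACK_TO_FRONT = {"ang": "an", "eng": "en", "ing": "in"}


def swap_initial(syllable: str) -> Set[str]:
    """Return the syllable plus the form with the retroflex initial toggled."""
    forms = {syllable}
    for retroflex, flat in RETROFLEX_TO_FLAT.items():
        if syllable.startswith(retroflex):
            forms.add(flat + syllable[len(retroflex):])
        elif syllable.startswith(flat):
            forms.add(retroflex + syllable[len(flat):])
    return forms


def swap_final(syllable: str) -> Set[str]:
    """Return the syllable plus the form with the nasal final toggled."""
    forms = {syllable}
    for back, front in BACK_TO_FRONT.items():
        if syllable.endswith(back):
            forms.add(syllable[: -len(back)] + front)
        elif syllable.endswith(front):
            forms.add(syllable[: -len(front)] + back)
    return forms


def pinyin_variants(syllable: str) -> Set[str]:
    """Combine both proximity rules to get every close pronunciation."""
    variants: Set[str] = set()
    for form in swap_initial(syllable):
        variants |= swap_final(form)
    return variants
```

For "sheng" the sketch produces the standard form together with "shen", "sen" and "seng", the three close pronunciations named above; a syllable such as "kai", which has neither a retroflex initial nor a nasal final, is returned unchanged.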
The robust recognition flow of the embodiment is illustrated with "air conditioner" and "reading lamp" as first-level voice command keywords and "open" or "close" and "on" or "off" as second-level voice commands.
(1) Start the automobile voice control system with a button or by non-button means.
(2) The voice prompt module outputs the voice prompt signal "Welcome to use first-level voice commands: DVD audio, air conditioner, reading lamp, window, sunroof, rearview mirror and other control object names". A code is assigned to each first-level voice command, and the prompt is played through the playback components of the audio system.
(3) The speech recognition module of the system receives the first-level voice command spoken by a vehicle occupant ("air conditioner", "reading lamp", ...), recognizes and confirms it, and outputs a first-level voice command confirmation signal to the voice prompt module.
(4) The voice prompt module outputs the voice prompt signal listing the available second-level voice commands: "open" or "close", "on" or "off".
(5) The speech recognition module of the system receives the second-level voice command spoken by a vehicle occupant ("open" or "close", "on" or "off", ...), recognizes and confirms it, and outputs a second-level voice command confirmation signal to the speech recognition module.
(6) The speech recognition module outputs the combined first- and second-level voice command, "air conditioner open" / "air conditioner close", "reading lamp on" / "reading lamp off", ..., to the control execution module of the voice control system.
(7) Execute the combined first- and second-level voice command.
(8) Output the appliance control signal corresponding to the combined first- and second-level voice command.
Those skilled in the art will understand that changes may be made to the embodiments described above without departing from the broad scope of the invention. The invention is therefore not limited to the specific embodiments disclosed; its scope covers all changes within the spirit of the invention and the scope of protection defined by the appended claims.

Claims (9)

1. method that improves automobile voice activated control phonetic recognization rate, it may further comprise the steps:
Step 1, be automobile voice activated control configured voice reminding module, the playback parts of voice cue module shared motor vehicle sound system, voice cue module output voice suggestion signal can be implemented timesharing control to the entertain mem signal that car audio system is play;
The sound identification module of step 2, automobile voice activated control also disposes the flow process of the phonetic order stage division that adopts classification input and hierarchical identification; Through with the phonetic order classification, make in to occur identical word less or the close word that pronounces with the sound instruction of one-level as far as possible, so just can improve the speech recognition reliability greatly;
The sound identification module of step 3, automobile voice activated control also disposes the flow process that adopts the recognition methods of classification phonetic order robustness; Be used to realize under the coarse condition of voice, make sound identification module can keep the identification probability of expection, thereby can improve the discrimination of automobile voice activated control speech recognition greatly.
2. The method according to claim 1, characterized in that the voice instruction grading flow of said step 2 comprises the following steps:
(1) the automobile voice control system is started by a button or by non-button means;
(2) the voice prompt module outputs the voice prompt signal "Welcome, please use a first-level voice instruction", including "Mytip welcomes you"; the voice prompt signal plays the content of the "first-level keyword group" through the playback components of the audio system;
(3) the voice control system collects the "first-level keyword" voice signal uttered by a vehicle occupant;
(4) the speech recognition module performs speech recognition on the "first-level keyword" voice signal to confirm the first-level voice instruction; if the result is "no", the received speech is judged not to be a first-level voice instruction, and the flow returns to step (3) to wait for a "first-level keyword" utterance from a vehicle occupant; if the result is "yes", the received speech is judged to be a first-level voice instruction, and the flow proceeds to step (5);
(5) the voice prompt module outputs, depending on which first-level keyword was recognized, the corresponding voice prompt signal for the second-level voice instruction: "Please continue with a second-level voice instruction", including "available commands: open, stop or close" or "available commands: raise, stop or lower"; the voice prompt signal plays the corresponding content of the "second-level keyword group" through the playback components of the audio system;
(6) the voice control system collects the "second-level keyword" voice signal uttered by a vehicle occupant;
(7) the speech recognition module performs speech recognition on the "second-level keyword" voice signal to confirm the second-level voice instruction; if the result is "no", the received speech is judged not to be a second-level voice instruction, and the flow returns to step (6) to wait for a "second-level keyword" utterance from a vehicle occupant; if the result is "yes", the received speech is judged to be a second-level voice instruction, and the flow proceeds to step (8);
(8) the voice control system outputs a control signal formed from the combination of the first-level and second-level voice instructions, and the system control module carries out control of the corresponding voice-controlled appliance;
(9) the flow ends;
Grading the voice instructions ensures that instructions at the same level contain as few identical or similar-sounding words as possible, which greatly improves the reliability of speech recognition.
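The two-level flow of claim 2 can be sketched as a small loop over recognized utterances. This is only an illustration under stated assumptions: the function name `run_dialog`, the vocabulary table `LEVEL2_BY_OBJECT`, and the English prompt strings are hypothetical, and the utterances are assumed to arrive already recognized as text.

```python
from typing import Callable, Iterable, Optional, Tuple

# Hypothetical vocabulary: each first-level object and its allowed second-level actions.
LEVEL2_BY_OBJECT = {
    "window": ("raise", "stop", "lower"),
    "air conditioner": ("open", "stop", "close"),
}

def run_dialog(utterances: Iterable[str],
               prompt: Callable[[str], None]) -> Optional[Tuple[str, str]]:
    """Walk the graded flow; return (object, action), or None if input runs out."""
    stream = iter(utterances)
    prompt("Welcome, please use a first-level voice instruction")  # step (2)

    obj = None
    for heard in stream:                 # steps (3)-(4): retry until a first-level keyword
        if heard in LEVEL2_BY_OBJECT:
            obj = heard
            break
    if obj is None:
        return None

    actions = LEVEL2_BY_OBJECT[obj]
    prompt("Available commands: " + ", ".join(actions))            # step (5)

    for heard in stream:                 # steps (6)-(7): retry until a second-level keyword
        if heard in actions:
            return (obj, heard)          # step (8): the pair drives the control signal
    return None
```

Because the second prompt depends on the recognized object, only that object's small action vocabulary is active at the second level, which is the property the claim relies on to reduce confusable words.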
3. The method according to claim 1, characterized in that the graded voice instruction robust recognition method comprises the following steps:
(1) the voice control system is started;
(2) the voice control system is initialized:
1) define and establish a non-exact, robust pinyin model database comparable to the exact pinyin models of the voice instructions, abbreviated as the voice instruction robust pinyin model database;
2) determine the non-exact proximity criteria of the robust pinyin model database, including:
a. the pinyin of a retroflex sound and the pinyin of a non-retroflex sound are judged to be close;
b. the pinyin of a front nasal sound and the pinyin of a non-front-nasal sound are judged to be close;
c. the pinyin of a back nasal sound and the pinyin of a non-back-nasal sound are judged to be close;
3) define the first-level keyword group of the first-level voice instructions, and construct for each keyword of the first-level keyword group the exact pinyin model of its standard pronunciation and its robust pinyin models;
establish a first-level voice instruction keyword bank containing the first-level keyword group, abbreviated as the first-level keyword bank, in which each "first-level keyword" pinyin model comprises one exact pinyin model and several close robust pinyin models;
4) define the second-level keyword group of the second-level voice instructions, and construct for each keyword of the second-level keyword group the exact pinyin model of its standard pronunciation and its robust pinyin models;
establish a second-level voice instruction keyword bank containing the second-level keyword group, abbreviated as the second-level keyword bank, in which each "second-level keyword" pinyin model comprises one exact pinyin model and several close robust pinyin models;
(3) the speech recognition module receives the first-level voice instruction uttered by the speaker;
(4) the speech recognition module first calls the "first-level keyword bank" to perform robust matching of the voice instruction: the input is compared with the one exact pinyin model and the several close robust pinyin models of each first-level keyword, and agreement with any one of them (a logical "OR") is judged to be a "match"; if the result is "no", return to step (3); if the result is "yes", go to step (5);
(5) output the keyword code of the matched first-level voice instruction;
(6) the speech recognition module receives the second-level voice instruction uttered by the speaker;
(7) the speech recognition module first calls the "first-level keyword bank" to perform robust matching of the voice instruction: the input is compared with the one exact pinyin model and the several close robust pinyin models of each first-level keyword, and agreement with any one of them (a logical "OR") is judged to be a "match"; if the result is "no", return to step (6); if the result is "yes", go to step (8);
(8) output the action keyword code of the matched second-level voice instruction;
the speech recognition module then calls the "second-level keyword bank" to perform robust matching of the voice instruction: the input is compared with the one exact pinyin model and the several close robust pinyin models of each second-level keyword; agreement with any one of them (a logical "OR") is judged to be a "match", and the code value of that second-level keyword is output;
(9) combine the matched first-level and second-level voice instruction keywords: the code value of the first-level keyword and the code value of the second-level keyword are combined to form the combined voice instruction code;
(10) output the control signal of the matched combined voice instruction code;
(11) the flow ends.
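The robust matching of claim 3 can be sketched as follows: each keyword's exact pinyin is expanded into a set of near variants, and an utterance matches a keyword if it equals the exact model or any variant. The concrete confusion pairs are assumptions, since the claim names only the categories: retroflex versus flat initials are taken as zh/z, ch/c, sh/s, and the two nasal criteria are read as front/back nasal confusion (an/ang, en/eng, in/ing). The keyword codes, function names, and example keywords (che chuang for "vehicle window", tian chuang for "sunroof", sheng/ting/jiang for "raise/stop/lower") are likewise illustrative.

```python
from itertools import product

# Assumed confusion pairs; the source names the categories but does not list them.
RETROFLEX_TO_FLAT = {"zh": "z", "ch": "c", "sh": "s"}
FRONT_NASALS = ("an", "en", "in")  # back-nasal partners are these plus "g"

def initial_variants(syllable):
    """Syllable plus its retroflex/flat counterpart, if it has one (criterion a)."""
    for retro, flat in RETROFLEX_TO_FLAT.items():   # test "zh" before "z", etc.
        if syllable.startswith(retro):
            return {syllable, flat + syllable[len(retro):]}
    for retro, flat in RETROFLEX_TO_FLAT.items():
        if syllable.startswith(flat):
            return {syllable, retro + syllable[len(flat):]}
    return {syllable}

def nasal_variants(syllable):
    """Syllable plus its front/back nasal counterpart, if it has one (criteria b, c)."""
    for front in FRONT_NASALS:
        if syllable.endswith(front + "g"):
            return {syllable, syllable[:-1]}
        if syllable.endswith(front):
            return {syllable, syllable + "g"}
    return {syllable}

def all_models(exact):
    """Exact model plus every robust model, as tuples of syllables."""
    per_syllable = [
        {n for i in initial_variants(s) for n in nasal_variants(i)}
        for s in exact
    ]
    return set(product(*per_syllable))

def build_bank(keywords):
    """Map {code: exact pinyin tuple} to {code: set of exact and robust models}."""
    return {code: all_models(exact) for code, exact in keywords.items()}

def match(heard, bank):
    """Return the code of the first keyword with any matching model ("OR"), else None."""
    for code, models in bank.items():
        if tuple(heard) in models:
            return code
    return None

LEVEL1 = build_bank({1: ("che", "chuang"), 2: ("tian", "chuang")})
LEVEL2 = build_bank({1: ("sheng",), 2: ("ting",), 3: ("jiang",)})
```

With these banks, `match(("ce", "cuan"), LEVEL1)` resolves to the code for "che chuang" even though both syllables were pronounced without the retroflex initial and the second lost its back nasal. A real bank would also need a check that the robust models of two different keywords never collide; here the first matching keyword simply wins.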
4. The method according to claim 1, characterized in that the time-sharing control of said step 1 consists in generating a time-sharing control signal from the voice prompt signal output by the voice prompt module, the time-sharing control signal applying time-shared enable control to the entertainment signal of the audio system.
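A minimal sketch of this time-shared enable control, operating on lists of samples, is given below. It assumes that the entertainment signal is fully muted while a prompt is playing; the claim does not say whether the signal is muted or merely attenuated. The function name `time_share` and the use of `None` to mark "no prompt at this instant" are hypothetical conventions.

```python
from typing import List, Optional, Sequence

def time_share(entertainment: Sequence[float],
               prompt: Sequence[Optional[float]]) -> List[float]:
    """Return the speaker output: the prompt where it is active, else the entertainment."""
    output = []
    for music, voice in zip(entertainment, prompt):
        prompt_active = voice is not None   # time-sharing control signal derived from the prompt
        # The entertainment signal is enabled only while no prompt is being played.
        output.append(voice if prompt_active else music)
    return output
```

For example, `time_share([0.1, 0.2, 0.3, 0.4], [None, 0.9, 0.8, None])` passes the first and last entertainment samples through and substitutes the prompt samples in between.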
5. The method according to claim 2, characterized in that said first-level voice instructions comprise the names of the controlled objects, namely DVD audio system, air conditioner, reading lamp, window, sunroof, rearview mirror, trunk and others, and a code is assigned to each first-level voice instruction.
6. The method according to claim 2, characterized in that said second-level voice instructions comprise action keywords that match the controlled object names in the first-level voice instructions, and a code is assigned to each second-level voice instruction.
7. The method according to claim 5 or 6, characterized in that the first-level voice instruction code is associated with the second-level voice instruction code, and the combined first- and second-level voice instruction code is formed by combining the two codes; it is used to carry out the relevant control of the corresponding appliance.
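One possible way to combine the two codes is to pack them into a single integer. The 8-bit width of each code and the function names `combine` and `split` are assumptions made for illustration; the claims do not specify the code format.

```python
from typing import Tuple

def combine(first_code: int, second_code: int) -> int:
    """Pack a first-level and a second-level code into one 16-bit instruction code."""
    if not (0 <= first_code <= 0xFF and 0 <= second_code <= 0xFF):
        raise ValueError("each code must fit in 8 bits (assumed width)")
    return (first_code << 8) | second_code

def split(combined: int) -> Tuple[int, int]:
    """Recover (first_code, second_code) from a combined instruction code."""
    return combined >> 8, combined & 0xFF
```

Keeping the object code in the high byte means one action code, such as "open", can be reused with several objects and still yield a distinct combined code for each pairing.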
8. The method according to claim 7, characterized in that the action keyword of a second-level voice instruction is allowed to match a plurality of first-level voice instructions.
9. The method according to claim 6 or 7, characterized in that the action keywords of said second-level voice instructions comprise: "open" and its synonyms, namely turn on, switch on, start, play, answer and navigate; "stop" and its synonyms, namely pause, halt and interrupt; and "close" and its synonyms, namely turn off, shut, exit and disconnect.
CN201110164289.7A 2011-06-20 2011-06-20 Method for improving voice recognition rate of automobile voice control system Active CN102332265B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201110164289.7A CN102332265B (en) 2011-06-20 2011-06-20 Method for improving voice recognition rate of automobile voice control system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201110164289.7A CN102332265B (en) 2011-06-20 2011-06-20 Method for improving voice recognition rate of automobile voice control system

Publications (2)

Publication Number Publication Date
CN102332265A true CN102332265A (en) 2012-01-25
CN102332265B CN102332265B (en) 2014-04-16

Family

ID=45484021

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201110164289.7A Active CN102332265B (en) 2011-06-20 2011-06-20 Method for improving voice recognition rate of automobile voice control system

Country Status (1)

Country Link
CN (1) CN102332265B (en)

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1365487A (en) * 1999-06-24 2002-08-21 西门子公司 Voice recognition method and device
US20040172256A1 (en) * 2002-07-25 2004-09-02 Kunio Yokoi Voice control system
CN101269638A (en) * 2008-04-10 2008-09-24 清华大学 Vehicle electrical apparatus sound control method based on command word list
CN101323305A (en) * 2008-05-14 2008-12-17 奇瑞汽车股份有限公司 Vehicle-mounted speech recognition control system and control method
EP2028061A2 (en) * 2007-08-23 2009-02-25 Delphi Technologies, Inc. System and method of controlling personalized settings in a vehicle
CN201610132U (en) * 2009-11-23 2010-10-20 浙江吉利汽车研究院有限公司 Automobile voice control system for entertainment

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
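Wang Jingfan et al.: "Research and Implementation of a Fuzzy Matching Algorithm for Chinese Information Retrieval Systems", Journal of Chinese Information Processing (《中文信息学报》), vol. 21, no. 6, 30 November 2007 (2007-11-30) *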

Cited By (33)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102582523A (en) * 2012-03-09 2012-07-18 深圳市领华卫通数码科技有限公司 Interior rearview mirror with voice recognition function and voice recognition method
CN102582576A (en) * 2012-03-15 2012-07-18 福州海景科技开发有限公司 Vehicular burglary prevention and personal safety protection system based on voice recognition technique
CN102636171A (en) * 2012-04-27 2012-08-15 深圳市凯立德科技股份有限公司 Voice navigation method and device
CN102664008A (en) * 2012-04-27 2012-09-12 上海量明科技发展有限公司 Method, terminal and system for transmitting data
CN102664008B (en) * 2012-04-27 2014-11-19 上海量明科技发展有限公司 Method, terminal and system for transmitting data
CN102708858A (en) * 2012-06-27 2012-10-03 厦门思德电子科技有限公司 Voice bank realization voice recognition system and method based on organizing way
CN104603871A (en) * 2012-08-02 2015-05-06 宝马股份公司 Method and device for operating speech-controlled information system for vehicle
CN103903617A (en) * 2012-12-24 2014-07-02 联想(北京)有限公司 Voice recognition method and electronic device
CN104240701A (en) * 2013-06-10 2014-12-24 上海能感物联网有限公司 Method for controlling washing machine to work through voice of Chinese natural person
CN103398454B (en) * 2013-08-06 2016-04-13 四川长虹电器股份有限公司 A kind of air-conditioning system and control method
CN103398454A (en) * 2013-08-06 2013-11-20 四川长虹电器股份有限公司 Air conditioning system and control method
CN105332586B (en) * 2014-07-22 2017-07-04 比亚迪股份有限公司 Vehicle and its window lifting control system and control method
CN105332586A (en) * 2014-07-22 2016-02-17 比亚迪股份有限公司 Vehicle and vehicle window lifting control system and method thereof
CN107077847A (en) * 2014-11-03 2017-08-18 微软技术许可有限责任公司 The enhancing of key phrase user's identification
US11270695B2 (en) 2014-11-03 2022-03-08 Microsoft Technology Licensing, Llc Augmentation of key phrase user recognition
CN105101565A (en) * 2015-09-01 2015-11-25 广西南宁智翠科技咨询有限公司 Method for opening car atmosphere lamp
CN105741839A (en) * 2016-02-17 2016-07-06 陆玉正 Vehicle-mounted electric appliance voice auxiliary control device
CN105654953A (en) * 2016-03-22 2016-06-08 美的集团股份有限公司 Voice control method and system
CN105654953B (en) * 2016-03-22 2019-05-17 美的集团股份有限公司 Sound control method and system
WO2017201913A1 (en) * 2016-05-24 2017-11-30 深圳市敢为软件技术有限公司 Precise voice control method and device
CN107800895A (en) * 2016-08-31 2018-03-13 南京中兴软件有限责任公司 A kind of interactive voice answering method and device
CN106409294A (en) * 2016-10-18 2017-02-15 广州视源电子科技股份有限公司 Method and apparatus for preventing voice command misidentification
CN106409294B (en) * 2016-10-18 2019-07-16 广州视源电子科技股份有限公司 The method and apparatus for preventing voice command from misidentifying
CN107065679A (en) * 2017-05-15 2017-08-18 佛山市顺德区美的洗涤电器制造有限公司 Dish-washing machine and its control device and control method
CN107672547A (en) * 2017-10-10 2018-02-09 邓雪云 New-energy automobile sound control method, device, mobile terminal and storage medium
CN107672547B (en) * 2017-10-10 2020-09-18 新昌县捷庭科技有限公司 New energy automobile voice control method and device, mobile terminal and storage medium
CN108305625A (en) * 2018-01-29 2018-07-20 深圳春沐源控股有限公司 Voice control method and device, electronic equipment and computer readable storage medium
CN110895936A (en) * 2018-09-13 2020-03-20 珠海格力电器股份有限公司 Voice processing method and device based on household appliance
CN110895936B (en) * 2018-09-13 2020-09-25 珠海格力电器股份有限公司 Voice processing method and device based on household appliance
CN109741741A (en) * 2018-12-29 2019-05-10 深圳Tcl新技术有限公司 Control method, intelligent terminal and the computer readable storage medium of intelligent terminal
CN111128171A (en) * 2019-12-31 2020-05-08 云知声智能科技股份有限公司 Setting method and device based on voice recognition
CN112017665A (en) * 2020-08-20 2020-12-01 武汉理工大学 Voice sorting system and method based on LD3320 voice chip
CN113160810A (en) * 2021-01-13 2021-07-23 安徽师范大学 LD 3320-based voice recognition interaction method and system

Also Published As

Publication number Publication date
CN102332265B (en) 2014-04-16

Similar Documents

Publication Publication Date Title
CN102332265B (en) Method for improving voice recognition rate of automobile voice control system
CN101281745B (en) Interactive system for vehicle-mounted voice
US8106285B2 (en) Speech-driven selection of an audio file
Hazen et al. A comparison and combination of methods for OOV word detection and word confidence scoring
US20160057261A1 (en) Voice recognition apparatus, vehicle having the same, and method of controlling the vehicle
WO2002054033A3 (en) Hierarchical language models for speech recognition
CN206595039U (en) A kind of interactive system for vehicle-mounted voice
CN101118745B (en) Confidence degree quick acquiring method in speech identification system
US20100305947A1 (en) Speech Recognition Method for Selecting a Combination of List Elements via a Speech Input
CN108242236A (en) Dialog process device and its vehicle and dialog process method
CN105529026A (en) Speech recognition device and speech recognition method
WO2020123227A1 (en) Speech processing system
CN102693725A (en) Speech recognition dependent on text message content
CN101383150B (en) Control method of speech soft switch and its application in geographic information system
CN102097096A (en) Using pitch during speech recognition post-processing to improve recognition accuracy
CN103824557A (en) Audio detecting and classifying method with customization function
CN109887511A (en) A kind of voice wake-up optimization method based on cascade DNN
US20160111089A1 (en) Vehicle and control method thereof
CN101286317A (en) Speech recognition device, model training method and traffic information service platform
US11355112B1 (en) Speech-processing system
CN102693723A (en) Method and device for recognizing speaker-independent isolated word based on subspace
CN108628859A (en) A kind of real-time voice translation system
CN103065628A (en) Voice interaction control guide system and method thereof
DE112021000292T5 (en) VOICE PROCESSING SYSTEM
JP2003509705A (en) Voice recognition method and voice recognition device

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant