KR20100101986A - Telematics terminal, metohd for voice recognition and computer recordable medium - Google Patents
Telematics terminal, metohd for voice recognition and computer recordable medium Download PDFInfo
- Publication number
- KR20100101986A KR20100101986A KR1020090020461A KR20090020461A KR20100101986A KR 20100101986 A KR20100101986 A KR 20100101986A KR 1020090020461 A KR1020090020461 A KR 1020090020461A KR 20090020461 A KR20090020461 A KR 20090020461A KR 20100101986 A KR20100101986 A KR 20100101986A
- Authority
- KR
- South Korea
- Prior art keywords
- vehicle
- telematics terminal
- server
- information related
- acoustic model
- Prior art date
Links
- 238000000034 method Methods 0.000 claims abstract description 51
- 238000004891 communication Methods 0.000 claims abstract description 33
- 230000006870 function Effects 0.000 claims description 14
- 230000003213 activating effect Effects 0.000 claims description 7
- 238000005516 engineering process Methods 0.000 description 8
- 230000005236 sound signal Effects 0.000 description 7
- 230000000694 effects Effects 0.000 description 6
- 238000010295 mobile communication Methods 0.000 description 6
- 230000003247 decreasing effect Effects 0.000 description 4
- 230000005540 biological transmission Effects 0.000 description 3
- 238000010586 diagram Methods 0.000 description 2
- 239000004973 liquid crystal related substance Substances 0.000 description 2
- 238000007726 management method Methods 0.000 description 2
- 230000003287 optical effect Effects 0.000 description 2
- 238000012545 processing Methods 0.000 description 2
- 230000003068 static effect Effects 0.000 description 2
- 230000002159 abnormal effect Effects 0.000 description 1
- 230000005856 abnormality Effects 0.000 description 1
- 230000001133 acceleration Effects 0.000 description 1
- 238000003491 array Methods 0.000 description 1
- 150000001875 compounds Chemical class 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 238000013500 data storage Methods 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 238000003745 diagnosis Methods 0.000 description 1
- 230000005684 electric field Effects 0.000 description 1
- 239000000284 extract Substances 0.000 description 1
- 239000010408 film Substances 0.000 description 1
- 230000020169 heat generation Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 210000003205 muscle Anatomy 0.000 description 1
- 230000010355 oscillation Effects 0.000 description 1
- 238000009304 pastoral farming Methods 0.000 description 1
- 238000003909 pattern recognition Methods 0.000 description 1
- 230000015541 sensory perception of touch Effects 0.000 description 1
- 239000010409 thin film Substances 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G08—SIGNALLING
- G08G—TRAFFIC CONTROL SYSTEMS
- G08G1/00—Traffic control systems for road vehicles
- G08G1/09—Arrangements for giving variable traffic instructions
- G08G1/0962—Arrangements for giving variable traffic instructions having an indicator mounted inside the vehicle, e.g. giving voice messages
- G08G1/0968—Systems involving transmission of navigation instructions to the vehicle
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/20—Speech recognition techniques specially adapted for robustness in adverse environments, e.g. in noise, of stress induced speech
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Radar, Positioning & Navigation (AREA)
- Remote Sensing (AREA)
- General Physics & Mathematics (AREA)
- Navigation (AREA)
Abstract
Description
The present invention relates to a voice recognition method for recognizing the voice of a vehicle occupant in a vehicle telematics terminal and a telematics terminal using the same.
Telematics is a compound word of Telecommunications and Informatics, also known as Information and Communications Technology (ICT), which collectively refers to a technology for transmitting, receiving, and storing information through a communication device. do.
In recent years, telematics may be implemented in a form in which a navigation system using GPS technology and a mobile communications technology are combined.
These vehicle telematics include remote diagnostics of vehicles, fault diagnosis of electrical / mechanical components mounted on a vehicle, vehicle control, between a call center and a vehicle or between vehicles equipped with a vehicle telematics terminal. It can be used in various fields such as telecommunications, intelligent transportation systems, and user-vehicle interface implementation.
In the case of a telematics terminal equipped with a microphone, voice recognition technology may provide an alternative interface in addition to the traditional human-to-device interface (including input through a keyboard, a keypad, and a touchpad). For example, a telematics terminal equipped with a microphone may make a call to another terminal using voice recognition.
In a vehicle telematics terminal performing voice recognition, the noise environment is changed by various internal / external variables. Internal / external variables include vehicle speed, wind noise, and air conditioner module operation.
In the case of a vehicle, the noise environment varies more depending on internal and external variables than the noise environment in general speech recognition. Therefore, consideration of a method for improving the speech recognition rate in a condition in which the noise environment changes is required.
The present invention relates to a speech recognition method and a telematics terminal of a telematics terminal capable of obtaining an improved speech recognition rate compared to a conventional method even in a noise environment of a vehicle which varies according to a variable.
Voice recognition method of the telematics terminal according to an embodiment of the present invention for realizing the above object, the step of transmitting information related to the noise environment of the vehicle to the server, and the information related to the driving environment of the plurality of acoustic models Receiving information related to the acoustic model selected by the server using the server; and performing voice recognition using the selected acoustic model.
In one aspect of the present invention, the information related to the noise environment of the vehicle may include at least one of a driving speed of the vehicle, whether the air conditioner module is operated and the strength of the operation, whether the window is opened or closed, and the vehicle position information.
In another aspect of the present invention, the transmitting may be a step of periodically transmitting information related to the noise environment of the vehicle to a server.
In another aspect of the present invention, the transmitting may include transmitting information related to the noise environment of the vehicle to a server when an input for activating a voice recognition function is received from a passenger of the vehicle.
In another aspect of the present invention, the transmitting may be a step of transmitting, to the server when any event occurs, information related to the noise condition of the vehicle.
In another aspect of the present invention, the random event includes at least one of an increase or decrease in vehicle interior noise, an increase or decrease in vehicle running speed, an operation of an air conditioner module, a change in operation intensity and an end of operation, and an opening or closing of a window. It may include one.
In still another aspect of the present invention, the plurality of acoustic models may be stored in whole or in part in the telematics terminal, or in whole or in part in the server.
In another aspect of the present invention, if the selected acoustic model is not stored in the telematics terminal, the method may further include downloading the selected acoustic model from the server.
In another aspect of the present invention, the selected acoustic model may be an acoustic model selected from a plurality of acoustic models using information related to the noise environment of the vehicle received from the vehicle or information collected by a server.
In another aspect of the present invention, the information collected by the server may include at least one of a road state of a road on which the vehicle is running, a noise state around the vehicle, and weather around the vehicle.
In the computer-readable recording medium related to an embodiment of the present invention for realizing the above object, a program for performing a voice recognition method of the telematics terminal is recorded.
Telematics terminal according to an embodiment of the present invention for realizing the above object, the microphone for receiving the voice of the vehicle occupant, and transmits the information related to the noise environment of the vehicle to the server, the driving environment of the plurality of acoustic models It includes a wireless communication unit for receiving information related to the acoustic model selected by the server using the information associated with the control unit for performing voice recognition using the selected acoustic model.
In one aspect of the present invention, the information related to the noise environment of the vehicle may include at least one of a driving speed of the vehicle, whether the air conditioner module is operated and the strength of the operation, whether the window is opened or closed, and the position information of the vehicle.
In another aspect of the present invention, the wireless communication unit may periodically transmit information related to the noise environment of the vehicle to the server.
In another aspect of the invention, the user input unit for receiving an input for activating the voice recognition function from the occupant of the vehicle, the wireless communication unit, when the input is received, the information related to the noise environment of the vehicle You can send it to the server.
In another aspect of the present invention, the wireless communication unit may transmit information related to the noise environment of the vehicle to the server when any event occurs.
In another aspect of the present invention, the random event includes at least one of an increase or decrease in vehicle interior noise, an increase or decrease in vehicle running speed, an operation of an air conditioner module, a change in operation intensity and an end of operation, and an opening or closing of a window. It can contain one.
In still another aspect of the present invention, the apparatus may further include a memory configured to store all or part of the plurality of acoustic models.
In another aspect of the present invention, the controller determines whether the selected acoustic model is stored in the telematics terminal, and the wireless communication unit, if the selected acoustic model is not stored in the telematics terminal, The selected acoustic model can be downloaded from.
In another aspect of the present invention, the selected acoustic model may be an acoustic model selected from a plurality of acoustic models using information related to the noise environment of the vehicle received from the vehicle or information collected by a server.
In another aspect of the present invention, the information collected by the server may include at least one of a road state of a road on which the vehicle is running, a noise state around the vehicle, and weather around the vehicle.
In the speech recognition method of the telematics terminal according to an embodiment of the present invention, by selecting an acoustic model suitable for the noise environment of the moving object and performing the speech recognition, a high speech recognition rate may be obtained even in a rapidly changing noise environment.
In addition, since the selection of the acoustic model is made at the server, it is possible to reduce the traffic of the telematics terminal for selecting the appropriate acoustic model.
Hereinafter, a telematics terminal related to the present invention will be described in more detail with reference to the accompanying drawings. The suffixes "module" and "unit" for components used in the following description are given or used in consideration of ease of specification, and do not have distinct meanings or roles from each other.
1 is a block diagram of a telematics terminal related to an embodiment of the present invention.
The
Hereinafter, the components will be described in order.
The
The
The broadcast channel may include a satellite channel and a terrestrial channel. The broadcast management server may mean a server that generates and transmits a broadcast signal and / or broadcast related information or a server that receives a previously generated broadcast signal and / or broadcast related information and transmits the same to a terminal. The broadcast signal may include not only a TV broadcast signal, a radio broadcast signal, and a data broadcast signal, but also a broadcast signal having a data broadcast signal combined with a TV broadcast signal or a radio broadcast signal.
The broadcast related information may mean information related to a broadcast channel, a broadcast program, or a broadcast service provider. The broadcast related information may also be provided through a mobile communication network. In this case, it may be received by the
The broadcast related information may exist in various forms. For example, it may exist in the form of Electronic Program Guide (EPG) of Digital Multimedia Broadcasting (DMB) or Electronic Service Guide (ESG) of Digital Video Broadcast-Handheld (DVB-H).
The
The broadcast signal and / or broadcast related information received through the
The
The
The short
The
The GPS module receives a signal containing time information from at least three navigation satellites, and calculates a distance from each satellite using the GPS signal. Position information may be obtained by applying triangulation to the calculated distance. The GPS module may increase the accuracy of the calculated location information by further applying techniques such as map matching and dead reckoning to the location information obtained by applying the triangulation method.
The
Referring to FIG. 1, the A /
The image frame processed by the
The
The
A steering wheel, an acceleration pedal, a brake pedal, a gear shift lever, and the like mounted on the vehicle may configure the
The
The
The
Some of these displays can be configured to be transparent or light transmissive so that they can be seen from the outside. This may be referred to as a transparent display. A representative example of the transparent display is TOLED (Transparant OLED).
The
According to an embodiment of the
When the
The touch sensor may be configured to convert a change in pressure applied to a specific portion of the
If there is a touch input to the touch sensor, the corresponding signal (s) is sent to the touch controller. The touch controller processes the signal (s) and then transmits the corresponding data to the
Referring to FIG. 1, a
Examples of the proximity sensor include a transmission photoelectric sensor, a direct reflection photoelectric sensor, a mirror reflection photoelectric sensor, a high frequency oscillation proximity sensor, a capacitive proximity sensor, a magnetic proximity sensor, and an infrared proximity sensor. When the touch screen is capacitive, the touch screen is configured to detect the proximity of the pointer by a change in an electric field according to the proximity of the pointer. In this case, the touch screen (touch sensor) may be classified as a proximity sensor.
Hereinafter, for convenience of explanation, the act of allowing the pointer to be recognized without being in contact with the touch screen so that the pointer is located on the touch screen is referred to as a "proximity touch", and the touch The act of actually touching the pointer on the screen is called "contact touch." The position where the proximity touch is performed by the pointer on the touch screen refers to a position where the pointer is perpendicular to the touch screen when the pointer is in proximity proximity.
The proximity sensor detects a proximity touch and a proximity touch pattern (for example, a proximity touch distance, a proximity touch direction, a proximity touch speed, a proximity touch time, a proximity touch position, and a proximity touch movement state). Information corresponding to the sensed proximity touch operation and proximity touch pattern may be output on the touch screen.
The
The
The
In addition to the vibration, the
The
The
The
The
The
The identification module is a chip that stores various types of information for authenticating the usage rights of the
The
The
An example in which the
In general, two processes are required for speech recognition. The first step is to extract words from the user's voice using an acoustic model. The acoustic model extracts words having a high probability of matching statistically from the user's voice received using the
The second process is to determine the meaning of text using a language model or a grammar file. The language model is a file including probability information related to word order, and a grammar file is a file of a small size including a set of preset word combinations.
When a word is extracted using the voice, noise removal and noise compensation are performed from the voice. When the speech recognition is performed in a noise environment that is different from the noise characteristic assumed by the acoustic model, it is difficult to extract the correct speech feature in the process of noise removal and noise compensation.
The
The
Various embodiments described herein may be implemented in a recording medium readable by a computer or similar device using, for example, software, hardware or a combination thereof.
According to a hardware implementation, the embodiments described herein include application specific integrated circuits (ASICs), digital signal processors (DSPs), digital signal processing devices (DSPDs), programmable logic devices (PLDs), field programmable gate arrays (FPGAs), and the like. It may be implemented using at least one of processors, controllers, micro-controllers, microprocessors, and electrical units for performing other functions. The described embodiments may be implemented by the
According to the software implementation, embodiments such as the procedures and functions described herein may be implemented as separate software modules. Each of the software modules may perform one or more functions and operations described herein. Software code may be implemented in software applications written in a suitable programming language. The software code may be stored in the
As shown in FIG. 1, the
The
Information related to the noise environment of the vehicle may be transmitted to a server and used by the server to select an acoustic model suitable for the noise environment of the vehicle. In selecting the acoustic model, the server may use information collected by the server in addition to the information related to the driving environment received from the vehicle.
The information collected by the server may include at least one of a road state of a road on which the vehicle is running, a noise state around the vehicle, and weather around the vehicle.
For example, the server may receive location information of the vehicle from the vehicle and collect information such as road conditions, temperatures, and weather at a location where the vehicle is located. The server may select a suitable acoustic model using information related to the temperature, the weather of the location of the vehicle and the noise environment of the vehicle received from the vehicle.
The server may include information related to the plurality of acoustic models. The information related to the plurality of acoustic models refers to information related to a noise environment of a vehicle corresponding to the acoustic model. The server selects an acoustic model capable of showing the highest voice recognition rate under the noise environment, using the information collected by the server and the noise environment of the vehicle received from the vehicle.
The
The server may update a plurality of previously stored acoustic models and other new acoustic models. The
The
When the occupant of the vehicle receives an input for activating a voice recognition function, the
The
When the
The
The
2 is a flowchart illustrating a voice recognition method of a telematics terminal related to an embodiment of the present invention.
As shown in FIG. 2, the
The
The
In this case, voice recognition may be performed using the selected acoustic model (S140). If there is no code that matches the received code among the plurality of codes, the
3 is a flowchart illustrating a voice recognition method of a telematics terminal according to another embodiment of the present invention.
As shown in FIG. 3, the
4 is a flowchart illustrating a voice recognition method of a telematics terminal according to another embodiment of the present invention.
As shown in FIG. 4, the
Events that significantly change the information related to the noise environment may be set as an arbitrary event, and when the random event occurs, it may be determined that the noise environment is likely to be greatly changed. If the noise environment fluctuates significantly, it is necessary to select an acoustic model different from the acoustic model in use to increase the speech recognition rate. In this case, the
5 is a flowchart illustrating a voice recognition method of a telematics terminal according to another embodiment of the present invention.
As shown in FIG. 5, the
When the input for activating the voice recognition is received, the
When the
Therefore, when the
In addition, according to an embodiment of the present invention, the above-described method may be implemented as code that can be read by a processor in a medium in which a program is recorded. Examples of processor-readable media include ROM, RAM, CD-ROM, magnetic tape, floppy disk, optical data storage, and the like, and may be implemented in the form of a carrier wave (for example, transmission over the Internet). Include.
The telematics terminal having the voice recognition method described above is not limited to the configuration and method of the above-described embodiments, but the embodiments may be modified in whole or in part to enable various modifications. It may alternatively be configured in combination.
1 is a block diagram of a telematics terminal related to an embodiment of the present invention.
2 is a flowchart illustrating a voice recognition method of a telematics terminal related to an embodiment of the present invention.
3 is a flowchart illustrating a voice recognition method of a telematics terminal related to another embodiment of the present invention.
4 is a flowchart illustrating a voice recognition method of a telematics terminal related to another embodiment of the present invention.
5 is a flowchart illustrating a voice recognition method of a telematics terminal related to another embodiment of the present invention.
Claims (21)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR1020090020461A KR20100101986A (en) | 2009-03-10 | 2009-03-10 | Telematics terminal, metohd for voice recognition and computer recordable medium |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR1020090020461A KR20100101986A (en) | 2009-03-10 | 2009-03-10 | Telematics terminal, metohd for voice recognition and computer recordable medium |
Publications (1)
Publication Number | Publication Date |
---|---|
KR20100101986A true KR20100101986A (en) | 2010-09-20 |
Family
ID=43007286
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
KR1020090020461A KR20100101986A (en) | 2009-03-10 | 2009-03-10 | Telematics terminal, metohd for voice recognition and computer recordable medium |
Country Status (1)
Country | Link |
---|---|
KR (1) | KR20100101986A (en) |
Cited By (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR101502012B1 (en) * | 2008-10-06 | 2015-03-12 | 엘지전자 주식회사 | Telematics terminal and method for notifying emrergency condition using the same |
KR101628112B1 (en) * | 2014-12-08 | 2016-06-08 | 현대자동차 주식회사 | Apparatus and method for removing noise for voice recognition of vehicle |
US9626962B2 (en) | 2014-05-02 | 2017-04-18 | Samsung Electronics Co., Ltd. | Method and apparatus for recognizing speech, and method and apparatus for generating noise-speech recognition model |
US20170213549A1 (en) * | 2016-01-21 | 2017-07-27 | Ford Global Technologies, Llc | Dynamic Acoustic Model Switching to Improve Noisy Speech Recognition |
WO2018117532A1 (en) * | 2016-12-19 | 2018-06-28 | Samsung Electronics Co., Ltd. | Speech recognition method and apparatus |
US11804236B2 (en) | 2020-12-15 | 2023-10-31 | Apollo Intelligent Connectivity (Beijing) Technology Co., Ltd. | Method for debugging noise elimination algorithm, apparatus and electronic device |
-
2009
- 2009-03-10 KR KR1020090020461A patent/KR20100101986A/en not_active Application Discontinuation
Cited By (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR101502012B1 (en) * | 2008-10-06 | 2015-03-12 | 엘지전자 주식회사 | Telematics terminal and method for notifying emrergency condition using the same |
US9626962B2 (en) | 2014-05-02 | 2017-04-18 | Samsung Electronics Co., Ltd. | Method and apparatus for recognizing speech, and method and apparatus for generating noise-speech recognition model |
KR101628112B1 (en) * | 2014-12-08 | 2016-06-08 | 현대자동차 주식회사 | Apparatus and method for removing noise for voice recognition of vehicle |
US20170213549A1 (en) * | 2016-01-21 | 2017-07-27 | Ford Global Technologies, Llc | Dynamic Acoustic Model Switching to Improve Noisy Speech Recognition |
US10297251B2 (en) * | 2016-01-21 | 2019-05-21 | Ford Global Technologies, Llc | Vehicle having dynamic acoustic model switching to improve noisy speech recognition |
WO2018117532A1 (en) * | 2016-12-19 | 2018-06-28 | Samsung Electronics Co., Ltd. | Speech recognition method and apparatus |
CN110088833A (en) * | 2016-12-19 | 2019-08-02 | 三星电子株式会社 | Audio recognition method and device |
US10770065B2 (en) | 2016-12-19 | 2020-09-08 | Samsung Electronics Co., Ltd. | Speech recognition method and apparatus |
CN110088833B (en) * | 2016-12-19 | 2024-04-09 | 三星电子株式会社 | Speech recognition method and device |
US11804236B2 (en) | 2020-12-15 | 2023-10-31 | Apollo Intelligent Connectivity (Beijing) Technology Co., Ltd. | Method for debugging noise elimination algorithm, apparatus and electronic device |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
KR101570369B1 (en) | Telematics terminal and method for controlling vehicle by using thereof | |
KR102206383B1 (en) | Speech recognition apparatus and method thereof | |
EP2172917B1 (en) | Telematics terminal and method for notifing emergency conditions using the same | |
KR101502013B1 (en) | Mobile terminal and method for providing location based service thereof | |
KR101602221B1 (en) | Mobile terminal system and control method thereof | |
KR101537694B1 (en) | Navigation terminal, mobile terminal and method for guiding route thereof | |
KR101631959B1 (en) | Vehicle control system and method thereof | |
KR20100101986A (en) | Telematics terminal, metohd for voice recognition and computer recordable medium | |
KR101562581B1 (en) | Navigation apparatus and method thereof | |
KR20100064937A (en) | Navigation device and method for guiding route thereof | |
KR20100068974A (en) | Navigation device and method for guiding route thereof | |
KR20100072971A (en) | Navigation termninal and method for guiding route thereof | |
KR101667699B1 (en) | Navigation terminal and method for guiding movement thereof | |
KR20100131295A (en) | Signal processing apparatus and method thereof | |
KR20100052324A (en) | Navigation apparatus and method thereof | |
KR101537695B1 (en) | Navigation system and method thereof | |
KR101578426B1 (en) | Signal processing apparatus and method thereof | |
KR101695686B1 (en) | Mobile vehicle controlling apparatus and method thereof | |
KR101544454B1 (en) | Vehicle control system and method thereof | |
KR20100038692A (en) | Navigation apparatus and method thereof | |
KR101516638B1 (en) | Navigation apparatus and method thereof | |
KR20100100235A (en) | Mobile vehicle navigation method and apparatus thereof | |
KR20110010001A (en) | Apparatus for preventing the loss of a terminal and method thereof | |
KR20100043657A (en) | Navigation apparatus and method thereof | |
KR20140128807A (en) | An method for configuring of a vehichle and an appratus using it |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
A201 | Request for examination | ||
E902 | Notification of reason for refusal | ||
E601 | Decision to refuse application |