KR101650769B1 - The vehicle-mounted voice recognition system by using gesture recognition

Info

Publication number: KR101650769B1
Authority: KR (South Korea)
Prior art keywords: gesture, information, unit, search, media
Application number: KR1020150075318A
Other languages: Korean (ko)
Inventors: 김혜진, 손만식, 황지선, 송민규
Original Assignee: 미디어젠(주)
Application filed by: 미디어젠(주)
Priority to: KR1020150075318A
Status: Application granted
Publication of: KR101650769B1

Classifications

    • G: PHYSICS
    • G06: COMPUTING; CALCULATING OR COUNTING
    • G06F: ELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00: Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01: Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/017: Gesture based interaction, e.g. based on a set of recognized hand gestures
    • G: PHYSICS
    • G06: COMPUTING; CALCULATING OR COUNTING
    • G06F: ELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00: Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/16: Sound input; Sound output
    • G06F3/167: Audio in a user interface, e.g. using voice commands for navigating, audio feedback
    • G06K9/00389

Abstract

The present invention relates to a vehicle voice recognition system using gesture recognition. More particularly, the system recognizes a gesture using a motion sensor to execute a telephone dialing function, a navigation function, or a media playback function, and then carries out the detailed functions through voice recognition or gesture recognition, thereby maximizing user convenience while improving on conventional voice recognition performance.

Description

[0001] The present invention relates to a vehicle-mounted voice recognition system using gesture recognition.

More particularly, it relates to a vehicle voice recognition system that recognizes a gesture using a motion sensor to execute a telephone dialing function, a navigation function, or a media playback function, and that carries out the detailed functions through voice recognition or gesture recognition, thereby maximizing user convenience while improving on conventional voice recognition performance.

Speech recognition is a technology that analyzes the user's voice input through a microphone, extracts its features, takes as the result the previously registered word or sentence closest to the input, and performs the operation corresponding to the recognized command.

When the speech recognition technology is applied to a vehicle, a driver can easily operate a desired vehicle device by issuing an instruction only by voice without using a hand directly to operate the device.

However, the speech recognition technology applied to existing vehicles often fails to recognize the driver's voice accurately while the vehicle is running, because of noise inside or outside the vehicle.

As a result, the driver suffers the inconvenience of repeating a command until the module to which the voice recognition technology is applied recognizes the driver's voice.

FIG. 1 shows the voice recognition menu structure provided in a conventional vehicle. The information required for voice recognition is built from a contact database, a navigation database, and a media file database. To use the navigation function, for example, the user goes through voice recognition of the main menu and then of its sub-categories, such as address search, name search, and recent destination search.

Also, in order to use the telephone dialing function, the user's mobile phone should be connected to the vehicle with Bluetooth, so that the information, contact information and call history in the mobile phone can be transmitted to the vehicle and used for voice recognition.

In order to use the navigation function, an SD card containing navigation information must be connected, and the corresponding information must be stored so that the name or address can be referred to in order to set the destination by voice recognition.

In addition, for media playback, the music files must be stored on a USB device or another storage medium that the vehicle can load, and the sound source information must be stored so that a sound source can be played by voice recognition.

Because this large volume of information is searched and used all at once, the recognition rate drops and the overall response is delayed, so users are reluctant to use the voice recognition function.

As a result, there is a need for a technique capable of more quickly and accurately recognizing an instruction of a driver on board a vehicle.

Korean Patent No. 10-0948600 (Mar. 12, 2010)

SUMMARY OF THE INVENTION The present invention has been made in view of the above-mentioned problems of the prior art. Its object is to recognize a gesture using a motion sensor to execute a phone dialing function, a navigation function, or a media playback function, and to carry out the detailed functions through speech recognition or gesture recognition, thereby maximizing user convenience while improving on conventional speech recognition performance.

According to an aspect of the present invention, there is provided a vehicle voice recognition system using gesture recognition, comprising:

A smart phone (200) for providing at least one of contact information, call history information, and media information to the motion speech recognition processing means by Bluetooth communication with the motion speech recognition processing means;

A navigation unit (300) for receiving search request information for any one of an address search, a name search, and a recent destination search from the motion speech recognition processing means and providing the corresponding search window on a screen, or for receiving the destination information requested to be searched and providing that destination information on the screen;

A media playback unit (400) for acquiring a media playback command from the motion speech recognition processing means and playing back the media;

A motion sensor unit (500) for recognizing the gesture operation of the user;

A microphone unit (600) for acquiring voice information of the user; and

A motion speech recognition processing means (100) for acquiring at least one of contact information, call history information, and media information from the smartphone by Bluetooth communication and storing it in a memory unit; for acquiring voice information from the microphone unit, performing voice recognition, and providing search request information or destination information to the navigation unit, a media playback command signal to the media playback unit, or a phone call command signal to the smartphone; or for comparing a gesture operation recognized by the motion sensor unit with the preset gesture operations and, when the same gesture operation exists, providing the command set for that gesture to the corresponding one of the navigation unit, the media playback unit, and the smartphone so that the set command is performed.

According to the vehicle voice recognition system using gesture recognition of the present invention, a gesture is recognized using a motion sensor to execute the telephone dialing function, the navigation function, or the media playback function, and the detailed functions are carried out through speech recognition or gesture recognition, thereby maximizing user convenience while improving on conventional speech recognition performance.

In addition, the large amount of data (contacts, addresses, POI names, etc.) defined for the voice recognition function can be divided up and searched separately, because a service can be started by matching a predefined gesture. This improves recognition performance.

BRIEF DESCRIPTION OF THE DRAWINGS
FIG. 1 is a diagram showing a speech recognition menu structure provided in a conventional vehicle.
FIG. 2 is a diagram illustrating an example of connected devices required for using a general speech recognition function.
FIG. 3 is an overall configuration view of a vehicle voice recognition system using gesture recognition according to an embodiment of the present invention.
FIG. 4 is a block diagram of the motion speech recognition processing means of a vehicle voice recognition system using gesture recognition according to an embodiment of the present invention.
FIG. 5 is a block diagram of the gesture setting unit of a vehicle voice recognition system using gesture recognition according to an embodiment of the present invention.
FIG. 6 is a conceptual diagram illustrating the structure of the voice recognition menu after a main menu is invoked by a gesture in a vehicle voice recognition system using gesture recognition according to an embodiment of the present invention.
FIGS. 7 to 9 are examples of the use of a gesture.

The following merely illustrates the principles of the invention. Therefore, those skilled in the art will be able to devise various apparatuses which, although not explicitly described or illustrated herein, embody the principles of the invention and are included in the concept and scope of the invention.

Furthermore, all of the conditional terms and embodiments listed herein are, in principle, intended only to aid understanding of the concepts of the present invention, and the invention is not to be construed as limited to such specifically recited embodiments and conditions.

It is also to be understood that the detailed description, as well as the principles, aspects and embodiments of the invention, as well as specific embodiments thereof, are intended to cover structural and functional equivalents thereof.

It is also to be understood that such equivalents include all elements contemplated to perform the same function irrespective of currently known equivalents as well as equivalents to be developed in the future.

Thus, for example, it should be understood that the block diagrams herein illustrate conceptual aspects of exemplary circuits embodying the principles of the invention. Similarly, all flowcharts, state transition diagrams, pseudo code, and the like are representative of various processes that may be substantially represented on a computer-readable medium and executed by a computer or processor, whether or not the computer or processor is explicitly shown.

The functions of the various elements shown in the figures, including the functional blocks depicted in the processor or similar concept, may be provided by use of dedicated hardware as well as hardware capable of executing software in connection with appropriate software.

When provided by a processor, the functions may be provided by a single dedicated processor, a single shared processor, or a plurality of individual processors, some of which may be shared.

Also, the explicit use of terms such as processor, control, or similar concepts should not be interpreted as referring exclusively to hardware capable of running software, and may include, without limitation, digital signal processor (DSP) hardware, read-only memory (ROM) for storing software, random access memory (RAM), and non-volatile memory. Other hardware may also be included.

It is to be understood that the invention defined by the appended claims is not to be construed as encompassing any means capable of providing such functionality, as the functions provided by the various listed means are combined in the manner that the claims require.

Means for solving the problems of the present invention are as follows.

That is, the vehicle voice recognition system using gesture recognition of the present invention comprises:

A smart phone (200) for providing at least one of contact information, call history information, and media information to the motion speech recognition processing means through Bluetooth communication with the motion speech recognition processing means;

A navigation unit (300) for receiving search request information for any one of an address search, a name search, and a recent destination search from the motion speech recognition processing means and providing the corresponding search window on a screen, or for receiving the destination information requested to be searched and providing that destination information on the screen;

A media playback unit (400) for acquiring a media playback command from the motion speech recognition processing means and playing back the media;

A motion sensor unit (500) for recognizing the gesture operation of the user;

A microphone unit (600) for acquiring voice information of the user; and

A motion speech recognition processing means (100) for acquiring at least one of contact information, call history information, and media information from the smartphone by Bluetooth communication and storing it in a memory unit; for acquiring voice information from the microphone unit, performing voice recognition, and providing search request information or destination information to the navigation unit, a media playback command signal to the media playback unit, or a phone call command signal to the smartphone; or for comparing a gesture operation recognized by the motion sensor unit with the preset gesture operations and, when the same gesture operation exists, providing the command set for that gesture to the corresponding one of the navigation unit, the media playback unit, and the smartphone so that the set command is executed.

The motion speech recognition processing means (100) comprises:

A gesture setting unit (110) for setting a registered gesture to be compared with a gesture operation input from the motion sensor unit;

A gesture instruction executing unit (120) for comparing a gesture operation recognized by the motion sensor unit with the gestures set by the gesture setting unit and executing the command for the gesture when the same gesture exists;

An address information obtaining unit (130) for obtaining contact information and call history information from the smartphone by Bluetooth communication with the smartphone;

A voice recognition unit (140) for acquiring voice information from the user and performing voice recognition;

A navigation processing unit (150) for providing the search request information or the destination information to the navigation unit with reference to the information recognized by the voice recognition unit;

A sound source reproduction processing unit (160) for providing the corresponding media playback command to the media playback unit with reference to the information recognized by the voice recognition unit;

A telephone dialing processing unit (170) for providing a telephone dialing command to the smartphone over Bluetooth with reference to the information recognized by the voice recognition unit; and

A memory unit (180) for storing the contact information and call history information obtained by the address information obtaining unit, together with sound source information.

The gesture setting unit (110) sets a gesture to be matched with a menu entry command, and comprises:

A menu selection module (111) for selecting a menu to be executed by a gesture;

A phone gesture setting module (112) for setting a gesture matched to the menu when the menu selected by the menu selection module is the phone main menu or one of its sub-category menus, such as name dialing, number dialing, or redialing;

A media gesture setting module (113) for setting a gesture matched to the menu when the menu selected by the menu selection module is the media main menu or one of its sub-category menus, such as song name search, singer search, or genre search; and

A navigation gesture setting module (114) for setting a gesture matched to the menu when the menu selected by the menu selection module is the navigation main menu or one of its sub-category menus, such as address search, name search, recent destination search, or phone number search.

Hereinafter, embodiments of a vehicle voice recognition system using gesture recognition according to the present invention will be described in detail with reference to the drawings.

FIG. 2 is a diagram illustrating an example of connected devices required for using a general speech recognition function. As shown in FIG. 2, because the SD card contains navigation-related data, the name search and the address search become available.

Because a portable storage device such as a USB drive contains media data, search functions such as song name search, singer search, and genre search can be used by voice recognition.

The smartphone is connected to the vehicle voice recognition system by Bluetooth and provides contact information, call history, media information, and the like, which are converted into a format usable for voice recognition. Because the general speech recognition system shown in FIG. 1 searches and uses this large amount of information all at once, a lower recognition rate and a slower overall response in outputting the result are inevitable.

FIG. 3 is an overall configuration diagram of a vehicle voice recognition system using gesture recognition according to an embodiment of the present invention.

As shown in FIG. 3, the vehicle voice recognition system using gesture recognition of the present invention includes a motion speech recognition processing means (100), a smart phone (200), a navigation unit (300), a media playback unit (400), a motion sensor unit (500), and a microphone unit (600).

A feature of the present invention having the above-described configuration is that the main menu for executing a command is assigned to a gesture, while the sub-category commands of each menu are processed by speech recognition. The sub-categories of each menu can, however, also be assigned to gestures. That is, the gesture recognition function is interlocked with the voice recognition system so that the main menu is executed through recognition of the registered gesture and the sub-menu is executed through voice recognition.

As shown in FIG. 1, the telephone, the media, and the navigation correspond to the main menus; name dialing, number dialing, and redial correspond to sub-menus of the phone; and song name search, singer search, and genre search correspond to sub-menus of the media.

In the present invention having the above-described configuration, if the user has, for example, assigned the thumbs-up gesture to the navigation function, raising a thumb while driving launches navigation, and the sub-categories are then executed by voice recognition. Once the gesture is made, only the navigation information needs to be searched, so the result is output quickly. When a command is uttered in an ordinary speech recognition system, the system must search through its speech recognition search engine to determine which command was uttered, and this recognition takes considerable time. According to the present invention, however, the step of executing the main menu is handled by gesture recognition instead of voice recognition, which shortens the search time and therefore the overall function execution time.
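The search-space argument above can be illustrated with a minimal sketch. It assumes, purely for illustration, that each main menu has its own small phrase list (`VOCABULARY`, a hypothetical name) and uses string similarity from Python's `difflib` as a stand-in for acoustic matching; the patent does not specify the recognizer, so none of this should be read as its actual implementation.

```python
import difflib
from typing import Dict, List, Optional

# Hypothetical command phrases per main menu, taken from the sub-categories
# named in the text; a real system would hold far larger data sets.
VOCABULARY: Dict[str, List[str]] = {
    "phone": ["name dialing", "number dialing", "redial"],
    "media": ["song name search", "singer search", "genre search"],
    "navigation": ["address search", "name search", "recent destination search"],
}


def candidates(active_menu: Optional[str]) -> List[str]:
    """Return the phrases the recognizer has to consider."""
    if active_menu is None:
        # No gesture was made: every phrase of every menu is in play.
        return [p for phrases in VOCABULARY.values() for p in phrases]
    # A gesture already chose the main menu: only its phrases remain.
    return list(VOCABULARY[active_menu])


def recognize(utterance: str, active_menu: Optional[str]) -> Optional[str]:
    """Pick the closest phrase; string similarity stands in for acoustic matching."""
    matches = difflib.get_close_matches(
        utterance, candidates(active_menu), n=1, cutoff=0.6
    )
    return matches[0] if matches else None
```

With no gesture, `candidates(None)` returns all nine phrases; after a gesture selects navigation, `candidates("navigation")` returns three, which is the reduction in search effort the paragraph describes.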

After the navigation function is executed through the gesture, the navigation unit (300) receives search request information for an address search, a name search, or a recent destination search from the motion speech recognition processing means and provides the corresponding search window on the screen, or receives the destination information requested to be searched and provides it on the screen.

Likewise, if the user has assigned the fist gesture to the phone main menu, making a fist while driving executes that main menu, that is, the telephone function.

After executing the telephone function, the smart phone 200 communicates with the motion speech recognition processing means through Bluetooth to provide at least one of contact information, call history information, and media information to the motion speech recognition processing means.

In addition, the media playback unit (400) acquires a media playback command from the motion speech recognition processing means and plays the media.

The main feature of the present invention is that a main menu for executing a command is set as a gesture and commands of sub-categories according to each menu are processed by voice recognition, but sub-categories for each menu can also be set as gestures as needed.

For this purpose, the system includes a motion sensor unit (500) for recognizing the user's gesture operation and a microphone unit (600) for acquiring the user's voice command. The motion speech recognition processing means (100) compares the recognized gesture operation with the preset gesture operations, executes the main menu command set for that gesture when the same gesture operation exists, and then executes the sub-menu command corresponding to the subsequent voice command.
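The two-stage flow described here (gesture first, voice second) can be sketched as a small state holder. The names `MENU_TREE`, `GESTURE_TO_MENU`, and `Session` are hypothetical, the gesture labels are assumed to arrive already classified by the sensor, and the voice phrase is assumed to arrive already transcribed.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional

# Hypothetical menu tree built from the sub-categories named in the text.
MENU_TREE: Dict[str, List[str]] = {
    "phone": ["name dialing", "number dialing", "redial"],
    "media": ["song name search", "singer search", "genre search"],
    "navigation": ["address search", "name search", "recent destination search"],
}

# Hypothetical user settings: which gesture opens which main menu.
GESTURE_TO_MENU: Dict[str, str] = {"thumb_up": "navigation", "fist": "phone"}


@dataclass
class Session:
    """Tracks which main menu the last recognized gesture opened."""

    active_menu: Optional[str] = None

    def on_gesture(self, gesture: str) -> bool:
        """Stage 1: a registered gesture selects the main menu."""
        menu = GESTURE_TO_MENU.get(gesture)
        if menu is None:
            return False  # unregistered gesture, nothing happens
        self.active_menu = menu
        return True

    def on_voice(self, phrase: str) -> Optional[str]:
        """Stage 2: a voice command selects a sub-menu of the active menu."""
        if self.active_menu is None:
            return None  # no main menu has been opened yet
        if phrase in MENU_TREE[self.active_menu]:
            return f"{self.active_menu}/{phrase}"
        return None
```

For instance, after `on_gesture("thumb_up")`, the phrase "address search" resolves to `"navigation/address search"`, while "redial" is rejected because it belongs to a different main menu.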

FIG. 4 is a block diagram showing the operation characteristics of the motion-speech recognition processing means 100 of the vehicle voice recognition system using gesture recognition according to an embodiment of the present invention.

As shown in FIG. 4, the motion speech recognition processing means (100) includes a gesture setting unit (110), a gesture instruction execution unit (120), an address information acquisition unit (130), a voice recognition unit (140), a navigation processing unit (150), a sound source reproduction processing unit (160), a telephone dialing processing unit (170), and a memory unit (180).

The gesture setting unit 110 is a means for setting a registration gesture to be compared with a gesture operation input from the motion sensor unit, and performs a function of setting a gesture operation of the user as a main menu execution command.

For example, as shown in FIG. 7, if the thumbs-up gesture is registered and assigned to the navigation function, the user only needs to raise a thumb, instead of uttering a voice command, for the navigation main menu function to be executed.

For this, the gesture instruction executing unit 120 compares the gesture operation recognized by the motion sensor unit with the gesture set by the gesture setting unit, and executes a command for the gesture when the same gesture exists.
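The patent does not say how the recognized gesture is compared with the registered ones. As one possible reading, the sketch below assumes each gesture is reduced to a fixed-length feature vector and matches it to the nearest registered template by Euclidean distance, rejecting anything farther than a threshold. The feature values, the `max_distance` parameter, and the function name are all illustrative assumptions.

```python
import math
from typing import Dict, Optional, Sequence


def match_gesture(
    sample: Sequence[float],
    templates: Dict[str, Sequence[float]],
    max_distance: float = 0.5,
) -> Optional[str]:
    """Return the label of the nearest registered gesture, or None if none is close."""
    best_label: Optional[str] = None
    best_dist = float("inf")
    for label, template in templates.items():
        dist = math.dist(sample, template)  # Euclidean distance (Python 3.8+)
        if dist < best_dist:
            best_label, best_dist = label, dist
    # Only accept the match when it is close enough to a registered gesture.
    return best_label if best_dist <= max_distance else None


# Hypothetical registered templates (made-up feature values).
REGISTERED = {
    "thumb_up": (1.0, 0.0, 0.0),
    "fist": (0.0, 0.0, 0.0),
}
```

A sample such as `(0.9, 0.1, 0.0)` lands on `"thumb_up"`, whereas `(0.5, 0.9, 0.9)` is far from both templates and yields `None`, which corresponds to the case where no matching gesture exists and no command is executed.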

As shown in FIG. 8, when the driver raises a thumb, for example in order to go and refuel the vehicle, the navigation function starts and either a search window for setting a destination is provided on the screen or a message prompting the driver to say the destination is provided. The driver then selects the desired destination or gives the system a voice command.

As described above, if speech recognition is used from the first main menu selection onward, considerable time passes before the system interprets the speech. If the first main menu selection is handled by a gesture, however, the system operation time can be shortened.

The address information obtaining unit 130 obtains contact information and call log information in a smart phone by Bluetooth communication with a smart phone and stores the corresponding information in a memory unit of the system.

The voice recognition unit (140) recognizes the user's voice command input from the microphone unit.

The navigation processing unit (150) refers to the information recognized by the voice recognition unit and provides the search request information or the destination information to the navigation unit. The sound source reproduction processing unit (160) refers to the information recognized by the voice recognition unit and provides the corresponding media playback command to the media playback unit. The telephone dialing processing unit (170) refers to the information recognized by the voice recognition unit and provides a telephone dialing command to the smartphone over Bluetooth.

At this time, the memory unit 180 stores contact information, call history information, and sound source information obtained by the address information obtaining unit.

Meanwhile, the gesture setting unit 110 sets a gesture matching the main menu execution command.

The main menu entry command refers to a main menu that the driver can select in the vehicle, such as navigation setting, dialing, or media playback.

As shown in FIG. 6, the dialing main menu has sub-categories such as name dialing, number dialing, and redialing; the media playback main menu has sub-categories such as song name search, singer search, and genre search; and navigation has sub-categories such as address search, name search, and recent destination search.

At this time, the main menu entry command is set through the gesture, not through voice.

FIG. 5 is a block diagram illustrating operational characteristics of the gesture setting unit (110) of the voice recognition system for a vehicle using gesture recognition according to an embodiment of the present invention.

The description above covered setting the main menu execution command by gesture through the gesture setting unit (110) rather than by voice. The following describes how not only the main menu execution command but also the sub-category menus of each main menu can be set by gesture.

As shown in FIG. 5, the gesture setting unit (110) includes a menu selection module (111), a phone gesture setting module (112), a media gesture setting module (113), and a navigation gesture setting module (114).

The menu selection module (111) is a means for selecting a menu to be executed by a gesture. The user selects the menu to be executed by gesture; it may be a main menu such as phone, media, or navigation, or a sub-category menu such as name dialing, song name search, or address search.

The phone gesture setting module (112) sets a gesture matched to the menu when the menu selected by the menu selection module is the phone main menu or one of its sub-category menus, such as name dialing, number dialing, or redialing. The gesture to be set may be input through the motion sensor unit (500) of the present invention or through a separate motion input device.

For example, suppose that, through the menu selection module (111) and the phone gesture setting module (112), execution of the phone main menu is assigned to the thumbs-up gesture and name dialing is assigned to the fist gesture. Then raising a thumb executes the phone main menu, and making a fist executes the name dialing sub-category menu.

The media gesture setting module (113) sets a gesture matched to the menu when the menu selected by the menu selection module is the media main menu or one of its sub-category menus, such as song name search, singer search, or genre search. The gesture to be set may be input through the motion sensor unit (500) of the present invention or through a separate motion input device.

The navigation gesture setting module (114) sets a gesture matched to the menu when the menu selected by the menu selection module is the navigation main menu or one of its sub-category menus, such as address search, name search, recent destination search, or phone number search. The gesture to be set may be input through the motion sensor unit (500) of the present invention or through a separate motion input device.
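The registration step performed by the setting modules can be sketched as a small registry that binds a gesture label to a main menu or to one of its sub-category menus. The class name `GestureSettings`, the validation rules (unknown menus and already-used gestures are rejected), and the gesture labels are assumptions made for illustration; the patent only states that a gesture is matched to the selected menu.

```python
from typing import Dict, List, Optional, Tuple

# Menus and sub-category menus as named in the text.
MENUS: Dict[str, List[str]] = {
    "phone": ["name dialing", "number dialing", "redial"],
    "media": ["song name search", "singer search", "genre search"],
    "navigation": [
        "address search",
        "name search",
        "recent destination search",
        "phone number search",
    ],
}


class GestureSettings:
    """Hypothetical registry standing in for the gesture setting unit (110)."""

    def __init__(self) -> None:
        # gesture label -> (main menu, optional sub-category menu)
        self._by_gesture: Dict[str, Tuple[str, Optional[str]]] = {}

    def assign(self, gesture: str, main_menu: str, sub_menu: Optional[str] = None) -> None:
        """Bind a gesture to a main menu or to one of its sub-category menus."""
        if main_menu not in MENUS:
            raise ValueError(f"unknown main menu: {main_menu}")
        if sub_menu is not None and sub_menu not in MENUS[main_menu]:
            raise ValueError(f"{sub_menu!r} is not a sub-menu of {main_menu!r}")
        if gesture in self._by_gesture:
            raise ValueError(f"gesture already assigned: {gesture}")
        self._by_gesture[gesture] = (main_menu, sub_menu)

    def lookup(self, gesture: str) -> Optional[Tuple[str, Optional[str]]]:
        """Return the menu bound to a gesture, or None if it is unregistered."""
        return self._by_gesture.get(gesture)


# The example from the text: thumbs-up opens the phone menu, a fist opens name dialing.
settings = GestureSettings()
settings.assign("thumb_up", "phone")
settings.assign("fist", "phone", "name dialing")
```

Rejecting a second assignment of the same gesture is a design choice of this sketch, made so that one gesture never maps to two menus.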

Hereinafter, another embodiment of the present invention will be described with reference to FIG.

As shown in FIG. 9, a user-defined gesture enters a core function category directly and executes the corresponding function. Replacing the existing step-by-step menu entry commands with a single gesture removes unnecessary steps and, by omitting the intermediate search process, improves search efficiency. For this, the gesture setting unit (110) shown in FIG. 5 further comprises a core function gesture setting module (115).

For example, to call Hong Gil-dong's mobile phone in the embodiment described earlier, the user launches the phone main menu with a gesture, says "name dialing", and then, in response to the system's prompt, says "Hong Gil-dong"; the system searches for the name and dials the mobile phone of the Hong Gil-dong it finds. In the embodiment described with reference to FIG. 9, however, the fist gesture is set as the command to call that mobile phone, and the call is placed as soon as the user makes a fist. That is, the sub-category function is executed directly with a single registered gesture, without going through the gesture for entering the phone main menu, the voice command "name dialing", and the voice command "Hong Gil-dong".

That is, frequently used core function category menus (for example, calling home, guidance to a nearby movie theater, or the weather forecast) are set and registered as gestures through the core function gesture setting module (115), so that they can be executed directly.
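The saving from a core function gesture can be made concrete with a short sketch that contrasts the step-by-step flow with a single shortcut. The names `STEPWISE_FLOW`, `CORE_SHORTCUTS`, and `resolve` are hypothetical, and the stored path simply mirrors the Hong Gil-dong example in the text.

```python
from typing import Dict, List, Optional

# The step-by-step flow from the example: one gesture plus two voice commands.
STEPWISE_FLOW: List[str] = [
    "gesture: open phone menu",
    "voice: name dialing",
    "voice: Hong Gil-dong",
]

# Hypothetical core function shortcuts: one gesture stores the whole menu path.
CORE_SHORTCUTS: Dict[str, List[str]] = {
    "fist": ["phone", "name dialing", "Hong Gil-dong"],
}


def resolve(gesture: str) -> Optional[str]:
    """Expand a core function gesture into the full menu path it replaces."""
    path = CORE_SHORTCUTS.get(gesture)
    return None if path is None else " > ".join(path)


# Number of user inputs saved by the shortcut in this example.
inputs_saved = len(STEPWISE_FLOW) - 1
```

Here the shortcut replaces three user inputs with one, so two inputs, both of them voice recognition passes, are skipped.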

In addition, according to another embodiment of the present invention, a proximity sensor may be used instead of the motion sensor unit that recognizes the user's gesture operation, so that when the user's hand approaches the proximity sensor, a specific menu (main menu or sub-category menu) is entered and executed directly.

To this end, the present invention comprises:

A smart phone (200) for providing at least one of contact information, call history information, and media information to the motion speech recognition processing means through Bluetooth communication with the motion speech recognition processing means;

A navigation unit (300) for receiving search request information for any one of an address search, a name search, and a recent destination search from the motion speech recognition processing means and providing the corresponding search window on a screen, or for receiving the destination information requested to be searched and providing that destination information on the screen;

A media playback unit (400) for acquiring a media playback command from the motion speech recognition processing means and playing back the media;

A proximity sensor unit (not shown) for sensing the approach of a part of the user's body, such as a hand;

A microphone unit (600) for acquiring voice information of the user; and

A motion speech recognition processing means (100) for acquiring at least one of contact information, call history information, and media information from the smartphone by Bluetooth communication and storing it in a memory unit; for acquiring voice information from the microphone unit, performing voice recognition, and providing search request information or destination information to the navigation unit, a media playback command signal to the media playback unit, or a phone call command signal to the smartphone; or for providing, when the approach of a part of the body such as the user's hand is detected, the command set for that event to the corresponding one of the navigation unit, the media playback unit, and the smartphone so that the set command is performed.
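The proximity-sensor variant can be sketched as a simple threshold trigger. The sketch assumes the sensor reports a distance in centimetres and fires the assigned menu once each time the hand crosses into range; the threshold value, the unit, and the class name `ProximityTrigger` are illustrative assumptions, since the patent only states that an approach is detected.

```python
from typing import List, Optional


class ProximityTrigger:
    """Fires the assigned menu once each time a hand comes within range."""

    def __init__(self, threshold_cm: float = 10.0, menu: str = "navigation") -> None:
        self.threshold_cm = threshold_cm  # assumed detection range
        self.menu = menu                  # menu assigned to the approach event
        self._near = False                # whether the hand was in range last time

    def update(self, distance_cm: float) -> Optional[str]:
        """Process one sensor reading; return the menu on a far-to-near transition."""
        near = distance_cm <= self.threshold_cm
        fired = near and not self._near
        self._near = near
        return self.menu if fired else None


def run(readings: List[float]) -> List[Optional[str]]:
    """Feed a sequence of readings through a fresh trigger."""
    trigger = ProximityTrigger()
    return [trigger.update(r) for r in readings]
```

Triggering only on the far-to-near transition keeps a hand that lingers near the sensor from launching the menu repeatedly.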

In conclusion, the present invention recognizes gestures using a motion sensor and allows a gesture to execute a main menu such as telephone dialing, navigation, or media playback, a sub-category menu of a main menu such as name dialing, or a frequently used core function category menu, thereby maximizing user convenience while improving the performance of the conventional speech recognition system.

Meanwhile, the method according to various embodiments of the present invention may be stored in a computer-readable recording medium. The computer-readable recording medium may be a ROM, a RAM, a CD-ROM, a magnetic tape, a floppy disk, an optical data storage device, or the like, and may also take the form of a carrier wave (e.g., transmission over the Internet).

While the present invention has been particularly shown and described with reference to exemplary embodiments thereof, it is to be understood that the invention is not limited to the disclosed exemplary embodiments, and that various modifications may be made by those skilled in the art without departing from the spirit and scope of the present invention.

For example, although the embodiments above describe dialing, media playback, and navigation as the main menus executed by a gesture, the present invention is not limited thereto. Within a scope that does not depart from the gist of the present invention, services that can be provided in a vehicle voice recognition system, such as Internet radio, Internet favorites, weather, and mobile websites, may also be added as main menus or sub-category menus.

100: Motion speech recognition processing means
200: Smartphone
300: Navigation
400: Media playback unit
500: Motion sensor unit
600: Microphone unit

Claims (5)

A voice recognition system for a vehicle using gesture recognition,
A smart phone (200) for providing at least one of contact information, call history information, and media information to the motion speech recognition processing means by Bluetooth communication with the motion speech recognition processing means;
A navigation (300) that receives, from the motion speech recognition processing means, search request information for any one of an address search, a name search, and a recent destination search and provides a search window on a screen, or that receives destination information requested to be searched;
A media playback unit (400) for acquiring a media playback command from the motion speech recognition processing unit and playing back the media;
A motion sensor unit 500 for recognizing the gesture operation of the user;
A microphone unit 600 for acquiring voice information of a user;
And a motion speech recognition processing means (100) that acquires at least one of contact information, call history information, and media information from the smartphone through Bluetooth communication with the smartphone and stores the acquired information in a memory unit, acquires voice information from the microphone unit and performs voice recognition to provide search request information or destination information to the navigation, a media playback command signal to the media playback unit, or a phone call command signal to the smartphone, or compares a gesture operation recognized by the motion sensor unit with a preset gesture operation and provides the command set for that gesture operation to any one of the navigation, the media playback unit, and the smartphone,
The motion speech recognition processing means (100) comprises:
A gesture setting unit 110 for setting a registration gesture to be compared with a gesture operation input from the motion sensor unit,
A gesture instruction executing unit (120) for comparing a gesture operation recognized by the motion sensor unit and a gesture set by the gesture setting unit to execute a command for the gesture when the same gesture exists,
An address information obtaining unit 130 for obtaining contact information and call history information in a smart phone by Bluetooth communication with the smartphone,
A voice recognition unit 140 for acquiring voice information from a user and performing voice recognition;
A navigation processor (150) for providing the search request information or the destination information to the navigation with reference to the recognition information recognized by the voice recognition unit,
A sound source reproduction processing unit (160) for referring to the recognition information recognized by the speech recognition unit and providing the corresponding media reproduction command to the media reproduction unit;
A telephone dialing processing unit 170 for performing Bluetooth communication with the smartphone by referring to the information recognized by the voice recognition unit and providing a telephone dialing command,
And a memory unit (180) for storing contact information and call history information and sound source information obtained by the address information obtaining unit,
The gesture setting unit (110) comprises:
A menu selection module (111) for selecting a menu to be executed by a gesture,
A phone gesture setting module (112) for setting a gesture matched to the menus when the menu selected by the menu selection module is name dialing, number dialing, redialing, or the like,
A media gesture setting module (113) for setting a gesture matched to the menus when the menu selected by the menu selection module is media, which is a main menu, or a song name search, a searched name search, or a genre search, which are sub-category menus of media,
And a navigation gesture setting module (114) for setting a gesture matched to the menus when the menu selected by the menu selection module is navigation, which is a main menu, or an address search, a name search, a recent destination search, a phone number search, or the like, which are sub-category menus of navigation,
The voice recognition system for a vehicle using gesture recognition, characterized in that the gesture setting unit (110) further includes a core function gesture setting module (115) and sets gestures matched to frequently used core function menus through the core function gesture setting module (115).

(Deleted)

(Deleted)

(Deleted)

A voice recognition system for a vehicle using gesture recognition,
A smart phone (200) for providing at least one of contact information, call history information, and media information to the motion speech recognition processing means by Bluetooth communication with the motion speech recognition processing means;
A navigation (300) that receives, from the motion speech recognition processing means, search request information for any one of an address search, a name search, and a recent destination search and provides a search window on a screen, or that receives destination information requested to be searched;
A media playback unit (400) for acquiring a media playback command from the motion speech recognition processing unit and playing back the media;
A proximity sensor unit for sensing the approach of a part of the user's body, such as a hand;
A microphone unit 600 for acquiring voice information of a user;
And a motion speech recognition processing means (100) that acquires at least one of contact information, call history information, and media information from the smartphone through Bluetooth communication with the smartphone and stores the acquired information in a memory unit, acquires voice information from the microphone unit and performs voice recognition to provide search request information or destination information to the navigation, a media playback command signal to the media playback unit, or a phone call command signal to the smartphone, or, when the approach of a part of the user's body such as a hand is detected, provides the preset command to any one of the navigation, the media playback unit, and the smartphone so that the set command is performed,
The motion speech recognition processing means (100) comprises:
A gesture setting unit 110 for setting a registration gesture to be compared with a gesture operation input from the proximity sensor unit,
A gesture instruction executing unit (120) for comparing a gesture operation recognized by the proximity sensor unit and a gesture set by the gesture setting unit to execute a command for the gesture when the same gesture exists,
An address information obtaining unit 130 for obtaining contact information and call history information in a smart phone by Bluetooth communication with the smartphone,
A voice recognition unit 140 for acquiring voice information from a user and performing voice recognition;
A navigation processor (150) for providing the search request information or the destination information to the navigation with reference to the recognition information recognized by the voice recognition unit,
A sound source reproduction processing unit (160) for referring to the recognition information recognized by the speech recognition unit and providing the corresponding media reproduction command to the media reproduction unit;
A telephone dialing processing unit 170 for performing Bluetooth communication with the smartphone by referring to the information recognized by the voice recognition unit and providing a telephone dialing command,
And a memory unit (180) for storing contact information and call history information and sound source information obtained by the address information obtaining unit,
The gesture setting unit (110) comprises:
A menu selection module (111) for selecting a menu to be executed by a gesture,
A phone gesture setting module (112) for setting a gesture matched to the menus when the menu selected by the menu selection module is name dialing, number dialing, redialing, or the like,
A media gesture setting module (113) for setting a gesture matched to the menus when the menu selected by the menu selection module is media, which is a main menu, or a song name search, a searched name search, or a genre search, which are sub-category menus of media,
And a navigation gesture setting module (114) for setting a gesture matched to the menus when the menu selected by the menu selection module is navigation, which is a main menu, or an address search, a name search, a recent destination search, a phone number search, or the like, which are sub-category menus of navigation,
The voice recognition system for a vehicle using gesture recognition, characterized in that the gesture setting unit (110) further includes a core function gesture setting module (115) and sets gestures matched to frequently used core function menus through the core function gesture setting module (115).

Priority Applications (1)

Application Number Priority Date Filing Date Title
KR1020150075318A KR101650769B1 (en) 2015-05-28 2015-05-28 The vehicle-mounted voice recognition system by using gesture recognition

Publications (1)

Publication Number Publication Date
KR101650769B1 (en) 2016-08-25

Family

ID=56884821

Country Status (1)

Country Link
KR (1) KR101650769B1 (en)

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR100948600B1 (en) 2006-12-04 2010-03-24 한국전자통신연구원 System and method for integrating gesture and voice
KR101260053B1 (en) * 2011-11-17 2013-05-06 재단법인대구경북과학기술원 Intelligent vehicle controlling apparatus and method using fusion recognition of user's voice and hand gesture
KR20130062522A (en) * 2011-12-05 2013-06-13 현대자동차주식회사 Smart key system for vehicle
KR20140072475A (en) * 2012-12-05 2014-06-13 주식회사 에이치엠에스 System, method and computer readable recording medium for controlling a navigation by the recognition of a gesture according to the variation of a position

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2019194426A1 (en) * 2018-04-02 2019-10-10 Samsung Electronics Co., Ltd. Method for executing application and electronic device supporting the same
US11144175B2 (en) 2018-04-02 2021-10-12 Samsung Electronics Co., Ltd. Rule based application execution using multi-modal inputs
KR20230001968A (en) 2021-06-29 2023-01-05 혜윰기술 주식회사 Voice and gesture integrating device of vehicle

Legal Events

Date Code Title Description
E701 Decision to grant or registration of patent right
GRNT Written decision to grant
FPAY Annual fee payment
Payment date: 20190917
Year of fee payment: 4