TW202030624A

TW202030624A - Voice recognition control method and device using the same

Info

Publication number: TW202030624A
Application number: TW108103762A
Authority: TW
Inventors: 陳旻宏
Original assignee: 發條橘子雲端行銷股份有限公司
Priority date: 2019-01-31
Filing date: 2019-01-31
Publication date: 2020-08-16

Abstract

A voice recognition control method is provided, including the following steps of: using a voice transceiver to receive a voice signal; using a learning module to learn the voice information by interacting with at least one cloud searching server, and to translate the voice signal into at least one voice information; using a processor to analysis the voice signal according to the voice information to generate a control signal; using an infrared emitter to emit the control signal. A voice recognition control device is provided.

Description

Voice recognition control method and device

本發明係與語音控制有關，特別是有關於一種語音辨識控制方法及裝置。The present invention relates to voice control, and particularly relates to a voice recognition control method and device.

隨著科技的進步，居家設備逐漸的導向智慧型居家概念，以自動化系統調節居家環境，改善以往只能以一遙控器對應專一的電器設備的問題。With the advancement of science and technology, home equipment is gradually leading to the concept of smart home. The home environment is adjusted by an automated system to improve the problem of only one remote control corresponding to the specific electrical equipment.

習知技術係一控制器控制複數電器設備，該控制器通常為智慧型手機、平板電腦或其他遙控器，以有線或無線的方式控制該複數電器設備的運轉狀態。然而，習知技術的控制器須以手持操作才能達到控制的效果，如此繁複的操作方式對於老年人而言或其他使用者而言，實在有許多的不便。The conventional technology is that a controller controls a plurality of electrical devices, and the controller is usually a smart phone, a tablet computer or other remote controllers, and controls the operation status of the plurality of electrical devices in a wired or wireless manner. However, the conventional controller must be operated by hand to achieve the control effect. Such a complicated operation method is really inconvenient for the elderly or other users.

因此，有必要提供一種新穎且具有進步性之語音辨識控制方法及裝置，以解決上述之問題。Therefore, it is necessary to provide a novel and progressive voice recognition control method and device to solve the above-mentioned problems.

本發明之主要目的在於提供一種語音辨識控制方法及裝置，可聲控至少一電器設備。The main purpose of the present invention is to provide a voice recognition control method and device, which can voice control at least one electrical device.

為達成上述目的，本發明提供一種語音辨識控制方法，包括：以一語音收發器接收一語音信號；以一學習模塊至少與一雲端搜尋引擎伺服器互動學習該語音信號，再將該語音信號轉換為至少一語言資料；以一處理器依據各該語言資料解析該語音信號，以產生一控制訊號；及以一紅外線發射器發射該控制訊號，以供控制至少一電器設備。To achieve the above objective, the present invention provides a voice recognition control method, including: receiving a voice signal with a voice transceiver; using a learning module to interact with at least one cloud search engine server to learn the voice signal, and then converting the voice signal Is at least one language data; a processor analyzes the voice signal according to each of the language data to generate a control signal; and an infrared transmitter transmits the control signal for controlling at least one electrical device.

為達成上述目的，本發明另提供一種語音辨識控制裝置，包括：一紅外線發射器，其可發射至少一控制訊號，該至少一控制訊號供控制至少一電器設備；一語音收發器，其可接收一語音信號及可發射一依據該語音信號產生之語音回饋信號；一學習模塊，其至少將該語音信號轉換為至少一語言資料；一儲存單元，其儲存各該語言資料、至少一環境資料及至少一狀態資料至少其中一者；及一處理器，其依據各該語言資料解析該語音信號，並讀取該至少一環境資料及該至少一狀態資料而將該語音信號轉換為該控制訊號及該語音回饋信號至少其中一者。To achieve the above object, the present invention provides a voice recognition control device, including: an infrared transmitter, which can emit at least one control signal, the at least one control signal for controlling at least one electrical device; a voice transceiver, which can receive A voice signal and a voice feedback signal that can be transmitted based on the voice signal; a learning module that at least converts the voice signal into at least one language data; a storage unit that stores each of the language data, at least one environmental data, and At least one of at least one state data; and a processor, which parses the voice signal according to each of the language data, and reads the at least one environmental data and the at least one state data to convert the voice signal into the control signal and At least one of the voice feedback signals.

以下僅以實施例說明本發明可能之實施態樣，然並非用以限制本發明所欲保護之範疇，合先敘明。The following examples are only used to illustrate the possible implementation aspects of the present invention, but they are not intended to limit the scope of protection of the present invention, and are described first.

請參考圖1，其顯示本發明之一較佳實施例，本發明之語音辨識控制方法，包括以下步驟：步驟S1：以一語音收發器接收一語音信號；步驟S2：以一學習模塊至少與一雲端搜尋引擎伺服器互動學習該語音信號，再將該語音信號轉換為至少一語言資料，進一步說明，該學習模塊可以有線或無線的方式連接到雲端搜尋該語音信號的例如語義及文法等；步驟S3：以一處理器依據各該語言資料解析該語音信號，以產生一控制訊號及一語音回饋信號；及步驟S4：以一紅外線發射器發射該控制訊號，以供控制至少一電器設備。此外，該學習模塊亦可透過該語音回饋信號例如回答使用者的問題、或發出疑問等，並可透過與使用者互動學習該語言信號的語義及文法等。Please refer to FIG. 1, which shows a preferred embodiment of the present invention. The voice recognition control method of the present invention includes the following steps: Step S1: Receive a voice signal with a voice transceiver; Step S2: Use a learning module to at least interact with A cloud search engine server interactively learns the voice signal, and then converts the voice signal into at least one language data. It is further explained that the learning module can be connected to the cloud in a wired or wireless manner to search for the voice signal such as semantics and grammar; Step S3: Use a processor to analyze the voice signal according to each of the language data to generate a control signal and a voice feedback signal; and Step S4: Use an infrared transmitter to transmit the control signal for controlling at least one electrical device. In addition, the learning module can also use the voice feedback signal to answer the user's question or issue a question, and can learn the semantics and grammar of the language signal through interaction with the user.

請參考圖2至3，本發明之語音辨識控制裝置1包括一紅外線發射器10、一語音收發器20、一儲存單元30、一學習模塊40及一處理器50。2 to 3, the voice recognition control device 1 of the present invention includes an infrared transmitter 10, a voice transceiver 20, a storage unit 30, a learning module 40 and a processor 50.

該紅外線發射器10可發射至少一控制訊號，各該控制訊號為紅外線訊號，該至少一控制訊號供控制至少一電器設備2，於本實施例中該至少一電器設備2例如為可接收紅外線訊號之冷氣、燈具、電視或風扇等，當然亦可具有發送紅外線訊號之電器設備2；於其他實施例，該紅外線發射器亦可發射複數控制訊號，以同時控制複數電器設備。The infrared transmitter 10 can emit at least one control signal, each of the control signals is an infrared signal, and the at least one control signal is used to control at least one electrical device 2. In this embodiment, the at least one electrical device 2 can receive an infrared signal, for example. The air conditioner, lamp, TV or fan, etc., of course, can also have electrical equipment 2 that transmits infrared signals; in other embodiments, the infrared transmitter can also emit multiple control signals to simultaneously control multiple electrical equipment.

該語音收發器20可接收一語音信號及可發射一依據該語音信號產生之語音回饋信號；該學習模塊40至少將該語音信號轉換為至少一語言資料；該儲存單元30儲存各該語言資料、至少一環境資料及至少一狀態資料至少其中一者；及該處理器50依據各該語言資料解析該語音信號，並讀取該至少一環境資料及該至少一狀態資料至少其中一者而將該語音信號轉換為該控制訊號及該語音回饋信號至少其中一者。藉此，該語音辨識控制裝置1可與使用者4互動進而學習中文語法，並可透過該語音信號控制各該電器設備2，以提升操作便利性。The voice transceiver 20 can receive a voice signal and can transmit a voice feedback signal generated according to the voice signal; the learning module 40 at least converts the voice signal into at least one language data; the storage unit 30 stores each of the language data, At least one of at least one environmental data and at least one state data; and the processor 50 parses the voice signal according to each of the language data, and reads at least one of the at least one environmental data and the at least one state data, and then The voice signal is converted into at least one of the control signal and the voice feedback signal. Thereby, the voice recognition control device 1 can interact with the user 4 to learn Chinese grammar, and can control each of the electrical equipment 2 through the voice signal to improve the convenience of operation.

於本實施例中，至少一語言資料、該至少一環境資料及至少一狀態資料的數量分別為複數；該複數語言資料例如包括中文、英文、粵語、閩南語、泰語等多國語言的詞彙及文法；該複數環境資料例如包括複數環境名稱，該複數狀態資料例如包括環境溫度狀態、環境濕度狀態、該至少一電器設備2的運轉狀態等。In this embodiment, the quantities of the at least one language data, the at least one environment data, and the at least one status data are respectively plural; the plural language data includes, for example, Chinese, English, Cantonese, Hokkien, Thai and other multi-language vocabulary and Grammar; the plural environmental data includes, for example, plural environmental names, and the plural state data includes, for example, the environmental temperature state, the environmental humidity state, the operating state of the at least one electrical device 2 and so on.

較佳地，該學習模塊40可供連接一雲端搜尋引擎伺服器3，以有線或無線的方式連接該雲端搜尋引擎伺服器3。該雲端搜尋引擎伺服器3可例如為搜尋引擎(例如GOOGLE)、資訊資料庫(例如維基百科)等網路資訊，該學習模塊40可依據該語音信號至該雲端搜尋引擎伺服器3獲得該至少一語言資料，如此該學習模塊40可透過網路學習。Preferably, the learning module 40 can be connected to a cloud search engine server 3, which can be connected to the cloud search engine server 3 in a wired or wireless manner. The cloud search engine server 3 can be, for example, a search engine (such as GOOGLE), an information database (such as Wikipedia) and other network information, and the learning module 40 can send to the cloud search engine server 3 according to the voice signal to obtain the at least A language data, so the learning module 40 can be learned through the Internet.

詳細地說明，該處理器50另包括一語音辨識單元51及一語意識別單元52，該語音辨識單元51分析該語音信號為至少一文字訊號，該語意識別單元52將各該文字訊號依據該至少一語言資料、該至少一環境資料及該至少一狀態資料至少其中一者轉換為該控制訊號或該語音回饋信號至少其中一者，藉此可明確地分析及解讀該語音信號。此外，該語音辨識單元51可判斷不同發音、語調以與相近的文字配對。In detail, the processor 50 further includes a voice recognition unit 51 and a semantic recognition unit 52. The voice recognition unit 51 analyzes the voice signal as at least one text signal, and the semantic recognition unit 52 converts each text signal according to the At least one of the at least one language data, the at least one environment data, and the at least one state data is converted into at least one of the control signal or the voice feedback signal, so that the voice signal can be clearly analyzed and interpreted. In addition, the voice recognition unit 51 can determine different pronunciations and intonations to pair with similar characters.

其中，該儲存單元30另包括一語意資料庫31，該語意資料庫31儲存各該語言資料。此外，該儲存單元30另包括一環境資料庫32，該環境資料庫32儲存各該環境資料。進一步，該儲存單元30另包括一狀態資料庫33，該狀態資料庫33儲存各該狀態資料。The storage unit 30 further includes a semantic database 31, and the semantic database 31 stores each language data. In addition, the storage unit 30 further includes an environmental database 32, and the environmental database 32 stores various environmental data. Further, the storage unit 30 further includes a state database 33, and the state database 33 stores each state data.

該語音收發器20另包括一回音濾除模組21，當該語音信號中包含一回音訊號時，該回音濾除模組21可過濾該回音訊號。此外，該語音收發器20另包括一雜訊濾除模組22，當該語音信號中包括一雜音訊號時，該雜訊濾除模組22可過濾該雜音訊號；藉以提升該語音信號的清晰度。The voice transceiver 20 further includes an echo filter module 21. When the voice signal includes an echo signal, the echo filter module 21 can filter the echo signal. In addition, the voice transceiver 20 further includes a noise filter module 22. When the voice signal includes a noise signal, the noise filter module 22 can filter the noise signal; thereby improving the clarity of the voice signal degree.

該語音收發器20包括一收音模組23及一放音模組24，該收音模組23可接收該語音信號，該放音模組24可發射該語音回饋信號。該收音模組23可例如為一麥克風裝置；該放音模組24可例如為一揚聲裝置。此外，該語音收發器20另包括一播放模組25，該播放模組25連接該語意識別單元52及該放音模組24。The voice transceiver 20 includes a receiving module 23 and a playing module 24. The receiving module 23 can receive the voice signal, and the playing module 24 can transmit the voice feedback signal. The sound receiving module 23 may be, for example, a microphone device; the sound playback module 24 may be, for example, a speaker device. In addition, the voice transceiver 20 further includes a playing module 25, and the playing module 25 is connected to the semantic recognition unit 52 and the sound playing module 24.

於本實施例中，該語音辨識控制裝置1另包括一顯示器60，該顯示器60連接該處理器50，該顯示器60可供顯示至少一影像資訊，該至少一影像資訊可為多媒體、或遠端視訊影像，以跟使用者4以影像互動。In this embodiment, the voice recognition control device 1 further includes a display 60 connected to the processor 50. The display 60 can display at least one image information, and the at least one image information can be multimedia or remote Video images to interact with users 4 through images.

S1~S4:步驟 1:語音辨識控制裝置 2:電器設備 3:雲端搜尋引擎伺服器 4:使用者 10:紅外線發射器 20:語音收發器 21:回音濾除模組 22:雜訊濾除模組 23:收音模組 24:放音模組 25:播放模組 30:儲存單元 31:語意資料庫 32:環境資料庫 33:狀態資料庫 40:學習模塊 50:處理器 51:語音辨識單元 52:語意識別單元 60:顯示器 S1~S4: steps 1: Voice recognition control device 2: electrical equipment 3: Cloud search engine server 4: User 10: Infrared transmitter 20: Voice transceiver 21: Echo Filter Module 22: Noise filter module 23: Radio module 24: Playback module 25: Play module 30: storage unit 31: Semantic Database 32: Environmental Database 33: Status database 40: Learning Module 50: processor 51: Voice recognition unit 52: Semantic Recognition Unit 60: display

圖1為本發明一較佳實施例之步驟圖。圖2為本發明一較佳實施例之方塊圖。圖3為本發明一較佳實施例之使用狀態示意圖。Figure 1 is a step diagram of a preferred embodiment of the present invention. Figure 2 is a block diagram of a preferred embodiment of the present invention. Figure 3 is a schematic diagram of a preferred embodiment of the invention in use.

S1~S4:步驟 S1~S4: steps

Claims

A voice recognition control method includes the following steps: Receive a voice signal with a voice transceiver; Use a learning module to interact with at least one cloud search engine server to learn the voice signal, and then convert the voice signal into at least one language data; A processor analyzes the voice signal according to each of the language data to generate a control signal and a voice feedback signal; and An infrared transmitter is used to transmit the control signal for controlling at least one electrical device.

A voice recognition control device, including: An infrared transmitter capable of emitting at least one control signal, and the at least one control signal is used to control at least one electrical device; A voice transceiver that can receive a voice signal and can transmit a voice feedback signal generated based on the voice signal; A learning module, at least converting the voice signal into at least one language data; A storage unit storing at least one of each of the language data, at least one environmental data, and at least one status data; and A processor analyzes the voice signal according to each of the language data, and reads the at least one environmental data and the at least one state data to convert the voice signal into at least one of the control signal and the voice feedback signal.

The voice recognition control device according to claim 2, wherein the learning module can be connected to a cloud search engine server, and the learning module can obtain the at least one language data from the cloud search engine server according to the voice signal.

The voice recognition control device according to claim 2, wherein the processor further includes a voice recognition unit and a semantic recognition unit, the voice recognition unit analyzes the voice signal as at least one text signal, and the semantic recognition unit converts each of the The text signal is converted into at least one of the control signal or the voice feedback signal according to at least one of the at least one language data, the at least one environmental data, and the at least one state data.

The voice recognition control device according to claim 2, wherein the storage unit further includes a semantic database, and the semantic database stores each of the language data.

The voice recognition control device according to claim 2, wherein the storage unit further includes an environment database and a status database, the environment database stores each of the environmental data, and the state database stores each of the state data.

The voice recognition control device according to claim 2, wherein the voice transceiver further includes an echo filtering module, and when the voice signal includes an echo signal, the echo filtering module can filter the echo signal.

The voice recognition control device according to claim 2, wherein the voice transceiver further includes a noise filter module, and when the voice signal includes a noise signal, the noise filter module can filter the noise signal .

The voice recognition control device according to claim 2, wherein the voice transceiver includes a radio module and a sound playback module, the radio module can receive the voice signal, and the sound playback module can transmit the voice feedback signal .

The voice recognition control device according to claim 4, wherein the number of at least one language data, the at least one environment data, and the at least one status data are plural respectively; the learning module can be connected to a cloud search engine server, and the learning module can According to the voice signal to the cloud search engine server to obtain the at least one language data; the storage unit further includes a semantic database, the semantic database stores each of the language data; the storage unit further includes an environment database and a state A database, the environment database stores each of the environmental data, the state database stores each of the state data; the voice transceiver also includes an echo filter module, when the voice signal contains an echo signal, the echo filter The noise removal module can filter the echo signal; the voice transceiver also includes a noise filter module, when the voice signal includes a noise signal, the noise filter module can filter the noise signal; the voice transceiver The device includes a radio module and a playback module, the radio module can receive the voice signal, the playback module can transmit the voice feedback signal; the voice transceiver also includes a playback module, the playback module The semantic recognition unit and the sound module are connected; the voice recognition control device further includes a display connected to the processor, and the display can display at least one image information.