TWM610794U - Voice-controlled operating apparatus - Google Patents

Voice-controlled operating apparatus Download PDF

Info

Publication number
TWM610794U
TWM610794U TW109215601U TW109215601U TWM610794U TW M610794 U TWM610794 U TW M610794U TW 109215601 U TW109215601 U TW 109215601U TW 109215601 U TW109215601 U TW 109215601U TW M610794 U TWM610794 U TW M610794U
Authority
TW
Taiwan
Prior art keywords
voice
built
keywords
module
voiceprint
Prior art date
Application number
TW109215601U
Other languages
Chinese (zh)
Inventor
李金珠
Original Assignee
臺灣銀行股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 臺灣銀行股份有限公司 filed Critical 臺灣銀行股份有限公司
Priority to TW109215601U priority Critical patent/TWM610794U/en
Publication of TWM610794U publication Critical patent/TWM610794U/en

Links

Images

Landscapes

  • Telephone Function (AREA)

Abstract

本新型提供一種聲控操作裝置,可安裝於行動裝置,且包括資料庫、擷取模組、比對模組以及處理模組。資料庫係儲存一內建資料,內建資料包括內建聲紋特徵及內建關鍵詞。擷取模組係接收語音指令,以擷取語音指令的語音聲紋特徵及語音關鍵詞。比對模組係通訊連接資料庫,以擷取上述內建資料,且通訊連接擷取模組,以接收語音聲紋特徵及語音關鍵詞,並判斷語音聲紋特徵是否與任一內建聲紋特徵相似,且判斷任一語音關鍵詞是否與任一內建關鍵詞相似。處理模組係通訊連接比對模組,接收並執行語音指令,以輸出處理結果。The present model provides a voice control operation device, which can be installed in a mobile device and includes a database, an acquisition module, a comparison module and a processing module. The database stores a built-in data, which includes built-in voiceprint features and built-in keywords. The capturing module receives voice commands to capture voice features and voice keywords of the voice commands. The comparison module communicates with the database to capture the above-mentioned built-in data, and communicates with the capture module to receive voice voiceprint features and voice keywords, and determine whether the voice voiceprint feature matches any of the built-in voices. The pattern features are similar, and it is determined whether any voice keyword is similar to any built-in keyword. The processing module is a communication connection comparison module, receiving and executing voice commands to output processing results.

Description

聲控操作裝置Voice control device

本新型涉及一種聲控操作裝置,尤其是一種安裝於行動裝置中的聲控操作裝置。The model relates to a voice control operating device, in particular to a voice control operating device installed in a mobile device.

目前已知的行動裝置 (例如智慧型手機),雖能透過例如iPhone的Siri等應用程式來藉由聲音控制其開機、關機或進行自動搜尋等動作。但習知的行動裝置卻仍無法透過使用者所提供的語音,來進一步操作其他的應用程式 (app)。因此,使用者還需再透過眼盯螢幕、手動輸入文字或指令,才能繼續操作習知的行動裝置的應用程式。Currently known mobile devices (such as smart phones) can be controlled by voice through applications such as iPhone’s Siri to turn on, turn off, or perform automatic searches. However, the conventional mobile device still cannot use the voice provided by the user to further operate other applications (app). Therefore, the user still needs to stare at the screen and manually input text or commands to continue to operate the application of the conventional mobile device.

據此,如何能提供在不需透過使用者的視覺來接收螢幕資訊之情況下,便能操作的應用程式,即成為所屬技術領域中有待解決的問題。Accordingly, how to provide an application program that can be operated without receiving screen information through the user's vision has become a problem to be solved in the technical field.

為解決上述問題,本新型之實施例發展出一種聲控操作裝置,其可安裝於行動裝置中,以透過接收使用者的語音指令,擷取上述語音指令中的語音聲紋特徵以及語音關鍵詞。接著,透過比對使用者的語音聲紋特徵是否與資料庫中的內建聲紋特徵相符,以進一步判斷是否確為本人,取代習知係以手動鍵入文字或圖形密碼,或以指紋或人臉特徵來確認使用者身份的授權技術。而在當判斷確為使用者本人時,則比對語音關鍵詞是否與資料庫中的任一內建關鍵詞相符,以進一步執行語音指令並輸出處理結果。藉此,本新型之實施例確實能取代習知係需透過視覺及手動操作應用程式的技術,並提供更人性化、更方便且同時也能兼顧資料安全性的聲控操作裝置。In order to solve the above-mentioned problems, the embodiment of the present invention develops a voice-controlled operation device, which can be installed in a mobile device to capture the voiceprint features and voice keywords in the voice command by receiving the user's voice command. Then, by comparing whether the user’s voice voiceprint features are consistent with the built-in voiceprint features in the database, we can further determine whether he is the person, instead of manually typing in text or graphic passwords, or using fingerprints or people. Facial features to confirm the authorization technology of the user's identity. When it is determined that it is the user, it is compared whether the voice keyword matches any built-in keyword in the database, so as to further execute the voice command and output the processing result. In this way, the embodiments of the present invention can indeed replace the conventional technology that requires visual and manual operation of application programs, and provide a more user-friendly, more convenient, and at the same time, a voice-controlled operation device that can also take into account data security.

具體而言,本新型之實施例提供了一種聲控操作裝置,其可安裝於計算機或行動裝置中。上述聲控操作裝置包括資料庫、擷取模組、比對模組以及處理模組。上述資料庫係用以儲存一內建資料,而上述內建資料包括多個內建聲紋特徵及多個內建關鍵詞。上述擷取模組係用以接收使用者的語音指令,以擷取並輸出上述語音指令的語音聲紋特徵及語音關鍵詞。上述比對模組係用以通訊連接資料庫,以擷取上述內建資料,且比對模組係用以通訊連接擷取模組,以接收語音聲紋特徵及語音關鍵詞,並判斷上述語音聲紋特徵是否與任一內建聲紋特徵相同或相似,且判斷任一語音關鍵詞是否與任一內建關鍵詞相同或相似。上述處理模組係用以通訊連接比對模組,以接收並執行語音指令,以輸出一處理結果。Specifically, the embodiment of the present invention provides a voice control operation device, which can be installed in a computer or a mobile device. The voice control operation device includes a database, an acquisition module, a comparison module, and a processing module. The database is used to store a built-in data, and the built-in data includes a plurality of built-in voiceprint features and a plurality of built-in keywords. The capture module is used to receive a user's voice command to capture and output the voice voiceprint features and voice keywords of the voice command. The comparison module is used to communicate with the database to retrieve the above-mentioned built-in data, and the comparison module is used to communicate with the capture module to receive voice voiceprint features and voice keywords, and determine the above Whether the voice voiceprint feature is the same or similar to any built-in voiceprint feature, and whether any voice keyword is the same or similar to any built-in keyword is determined. The above-mentioned processing module is used to communicate with the comparison module to receive and execute voice commands to output a processing result.

依據一實施例,其中在當判斷語音聲紋特徵與任一內建聲紋特徵相同或相似時,上述比對模組係用以判斷語音關鍵詞是否與任一內建關鍵詞相同或相似。According to an embodiment, when it is determined that the voice voiceprint feature is the same or similar to any built-in voiceprint feature, the comparison module is used to determine whether the voice keyword is the same or similar to any built-in keyword.

依據另一實施例,其中在當判斷任一語音關鍵詞與任一內建關鍵詞相同或相似時,上述處理模組係用以執行語音指令。According to another embodiment, when it is determined that any voice keyword is the same or similar to any built-in keyword, the processing module is used to execute the voice command.

依據又一實施例,其中在當判斷該語音聲紋特徵與任一該些內建聲紋特徵相同或相似,且當判斷該語音關鍵詞與任一該些內建關鍵詞相同或相似時,上述比對模組係用以執行語音指令。According to another embodiment, when it is determined that the voice voiceprint feature is the same or similar to any one of the built-in voiceprint features, and when it is determined that the voice keyword is the same or similar to any one of the built-in keywords, The above-mentioned comparison module is used to execute voice commands.

依據又一實施例,其中上述比對模組230係比對語音聲紋特徵中之波形或頻譜,以判斷語音聲紋特徵是否與任一內建聲紋特徵相同或相似。According to another embodiment, the comparison module 230 compares the waveform or frequency spectrum of the voice voiceprint feature to determine whether the voice voiceprint feature is the same or similar to any built-in voiceprint feature.

綜合上述技術特徵,本新型之實施例因而具有以下功效:Based on the above technical features, the embodiments of the present invention have the following effects:

(1) 透過擷取使用者之語音指令中的語音聲紋特徵,並將其與資料庫中的內建聲紋特徵進行比對。在上述二者有聲紋特徵相符時,快速完成登入,即可順利進行操作。藉此,本新型之實施例確實可提供更方便且安全的聲控操作裝置,並可成功取代習知僅能透過視覺或手動來登入裝置的技術。(1) By capturing the voice voiceprint feature in the user's voice command, and comparing it with the built-in voiceprint feature in the database. When the above two have the same voiceprint characteristics, quickly complete the login and the operation can be carried out smoothly. In this way, the embodiments of the present invention can indeed provide a more convenient and safe voice-controlled operating device, and can successfully replace the conventional technology that can only log in to the device visually or manually.

(2) 透過擷取使用者之語音指令中的語音關鍵詞,並將其與資料庫中的內建關鍵詞進行比對。在上述二者有任一關鍵詞相符時,快速根據語音指令來執行對應的操作內容,並在完成執行後輸出處理結果。藉此,本新型之實施例確實可提供更直覺且人性化的聲控操作裝置,而無須再透過視覺或手動來操作裝置。據此,本新型之實施例更能讓視覺能力不佳、或當下無法即時使用視覺或手動操作的使用者,仍能無障礙地操作裝置。(2) By capturing the voice keywords in the user's voice commands, and comparing them with the built-in keywords in the database. When any of the above two keywords match, the corresponding operation content is quickly executed according to the voice command, and the processing result is output after the execution is completed. In this way, the embodiments of the present invention can indeed provide a more intuitive and user-friendly voice-controlled operating device, without the need to operate the device visually or manually. Accordingly, the embodiments of the present invention can further enable users who have poor visual ability, or who cannot use vision or manual operation in real time, can still operate the device without obstacles.

有鑑於上述待克服的問題,本新型之實施例發展出一種聲控操作裝置,其可安裝於各種計算機或行動裝置中。上述聲控操作裝置透過擷取模組來接收使用者的語音指令,以進一步擷取上述語音指令中的語音聲紋特徵以及語音關鍵詞。接著,再透過比對模組,分別將語音聲紋特徵以及語音關鍵詞,與資料庫中的內建資料進行比對。接著,在比對模組確認語音聲紋特徵確實與內建聲紋特徵相符,且語音關鍵詞也確實與任一內建關鍵詞相符時,則處理模組便會執行語音指令所對應的操作內容。In view of the above-mentioned problems to be overcome, the embodiments of the present invention develop a voice control operation device, which can be installed in various computers or mobile devices. The voice control operation device receives the user's voice command through the capturing module, so as to further capture the voice voiceprint features and voice keywords in the voice command. Then, through the comparison module, the speech voiceprint features and speech keywords are respectively compared with the built-in data in the database. Then, when the comparison module confirms that the voice voiceprint feature does match the built-in voiceprint feature, and the voice keyword does match any built-in keyword, the processing module will perform the operation corresponding to the voice command content.

為更具體說明本新型之各實施例,以下輔以附圖進行說明。In order to more specifically illustrate the various embodiments of the present invention, the following description is supplemented with the accompanying drawings.

請參照圖1,圖1所繪為根據本新型之一實施例之一種聲控操作方法之流程圖。在圖1中,依據本新型之一實施例,提供了一種聲控操作方法100,包括以下步驟。Please refer to FIG. 1. FIG. 1 is a flowchart of a voice control operation method according to an embodiment of the present invention. In FIG. 1, according to an embodiment of the present invention, a voice control operation method 100 is provided, which includes the following steps.

首先,如步驟110,接收來自一使用者的語音指令,其中上述語音指令可理解為任何係透過聲波傳遞的訊號。因此,更具體而言,上述語音指令可包括語音聲紋特徵及語音關鍵詞。First, in step 110, a voice command from a user is received, where the voice command can be understood as any signal transmitted through sound waves. Therefore, more specifically, the above-mentioned voice command may include voice voiceprint features and voice keywords.

接著,如步驟120,擷取上述語音指令的語音聲紋特徵,以供再進一步如步驟130,將上述語音聲紋特徵與內建聲紋特徵進行比較,並判斷語音聲紋特徵是否符合任一內建聲紋特徵。上述內建聲紋特徵例如可為使用者本人預先提供,以供後續比對基礎的聲紋特徵。藉此,即可判斷上述語音聲紋特徵是否係來自使用者本人。Then, in step 120, the voice voiceprint feature of the voice command is captured, so as to further compare the voice voiceprint feature with the built-in voiceprint feature in step 130, and determine whether the voice voiceprint feature matches any one of them. Built-in voiceprint feature. The above-mentioned built-in voiceprint features can be provided in advance by the user, for example, for subsequent comparison with the basic voiceprint features. In this way, it can be determined whether the voice voiceprint feature is from the user himself.

其中,例如以特定時間長度為單位,針對各語音指令進行切割以提取的語音波形,並將其視為一個音框 (frame)。當使用者發出語音時,就可以擷取對應的波形 (waveform) 或頻譜 (spectrum) 等語音指令。透過分析語音聲紋特徵及內建聲紋特徵在某些特定音框下的波形 (例如週期、波峰及/或波谷振幅,或整體波動形狀等),即可比對二者間是否具有相同的波形,以進一步判斷是否確實為使用者本人的聲紋特徵。Among them, for example, using a specific length of time as a unit, the voice waveform extracted by cutting for each voice command is regarded as a frame. When the user utters a voice, he can capture the corresponding waveform or spectrum and other voice commands. By analyzing the waveforms of voice voiceprint features and built-in voiceprint features under certain sound frames (such as period, peak and/or trough amplitude, or overall wave shape, etc.), you can compare whether the two have the same waveform , In order to further determine whether it is indeed the user's own voiceprint characteristics.

同時,或可透過分析語音聲紋特徵及內建聲紋特徵在某些特定音框下的頻譜 (例如基本頻率 [fundamental frequency]、諧振頻率 [harmonic frequency] 或音高 [pitch] 等),即可比對二者間是否具有相同的頻譜,以進一步判斷是否確實為使用者本人的聲紋特徵。At the same time, it may be possible to analyze the frequency spectrum of voice voiceprint features and built-in voiceprint features under certain sound frames (such as fundamental frequency, harmonic frequency, or pitch, etc.), that is It can be compared whether the two have the same frequency spectrum to further determine whether it is indeed the user's own voiceprint feature.

接著,如步驟131,當判斷語音聲紋特徵的波形或頻譜確實與內建語音特徵不符時,則停止任何後續動作,結束聲控操作方法。Then, in step 131, when it is determined that the waveform or frequency spectrum of the voice voiceprint feature does not match the built-in voice feature, any subsequent actions are stopped, and the voice control operation method is ended.

或接著,如步驟132,當判斷語音聲紋特徵的波形或頻譜確實與內建語音特徵相符時,亦即語音指令確實係來自使用者本人,則繼續擷取上述語音指令的語音關鍵詞。Or then, in step 132, when it is determined that the waveform or frequency spectrum of the voice voiceprint feature is indeed consistent with the built-in voice feature, that is, the voice command is indeed from the user himself, then continue to capture the voice keywords of the voice command.

接著,如步驟140,進一步將所擷取的語音關鍵詞,與內建關鍵詞進行比對,以判斷語音關鍵詞是否與內建關鍵詞相同。例如,上述內建關鍵詞可為預先設定的各種關鍵詞,例如操作動作 (例如匯款)、數字金額 (例如4000元)、銀行名稱 (例如臺銀或臺灣銀行、玉山或玉山銀行等)、帳戶號碼 (例如1234567890) 等。藉此,即可進一步判斷以執行使用者本人所欲執行的操作內容。Then, in step 140, the captured voice keywords are further compared with the built-in keywords to determine whether the voice keywords are the same as the built-in keywords. For example, the above built-in keywords can be various pre-set keywords, such as operation actions (such as remittance), digital amount (such as 4000 yuan), bank name (such as Bank of Taiwan or Bank of Taiwan, Yushan or Yushan Bank, etc.), account Number (for example, 1234567890), etc. In this way, it is possible to further determine to perform the operation content that the user wants to perform.

接著,如步驟131,當判斷內建關鍵詞不與任一內建關鍵詞相符時,則停止任何後續動作,結束聲控操作方法。Then, in step 131, when it is determined that the built-in keyword does not match any of the built-in keywords, any subsequent actions are stopped, and the voice control operation method is ended.

或接著,如步驟141,根據相符的任一內建關鍵詞,執行語音指令。接著,如步驟142,在執行完成後,例如,使用者輸入的語音指令為「哈囉臺銀,我要匯款4000元到玉山帳戶1234567890」,可擷取得到第一語音關鍵詞「匯款」、第二語音關鍵詞「4000元」、第三語音關鍵詞「玉山銀行」及第四語音關鍵詞「帳戶1234567890」。因此,根據內建關鍵詞中的「匯款」、「4000元」、「玉山銀行」、「帳戶」及「1234567890」,即可進行如語音指令「匯款4000元到玉山帳戶1234567890」的動作。接著,在執行例如「匯款4000元到玉山帳戶1234567890」的動作後,輸出例如「已完成匯款,匯款金額為4000元,匯入銀行為玉山銀行,匯入帳戶為1234567890」的處理結果,以直接通知使用者其語音指令的處理情況。或例如,在匯出帳戶餘額不足時,可輸出例如「抱歉,帳戶餘額不足」的處理結果,以直接通知使用者其語音指令的處理情況。Or then, in step 141, the voice command is executed according to any of the matching built-in keywords. Then, in step 142, after the execution is completed, for example, the voice command input by the user is "Hello Taiwan Bank, I want to remit 4000 yuan to Yushan account 1234567890", the first voice keyword "remittance" can be retrieved, The second voice keyword "4000 yuan", the third voice keyword "Yushan Bank" and the fourth voice keyword "account 1234567890". Therefore, according to the built-in keywords "remittance", "4000 yuan", "Yushan Bank", "account" and "1234567890", actions such as the voice command "remit 4000 yuan to Yushan account 1234567890" can be performed. Then, after performing an action such as "Remit 4000 RMB to Yushan Account 1234567890", output the processing result such as "The remittance has been completed, the remittance amount is 4000 RMB, the remittance bank is Yushan Bank, and the remittance account is 1234567890" to directly Notify users of the processing status of their voice commands. Or, for example, when the balance of the exported account is insufficient, the processing result such as "Sorry, the account balance is insufficient" can be output to directly notify the user of the processing status of the voice command.

此外,上述處理結果可為文字訊息或語音訊息,在此並不加以限制。而在上述處理結果係以聲音訊息的形式輸出時,上述聲控操作方法即可全程透過聲音控制完成,即無須再透過視覺或手動才能進行。In addition, the above-mentioned processing result can be a text message or a voice message, which is not limited here. When the above processing result is output in the form of voice messages, the above voice control operation method can be completed through voice control all the way, that is, it does not need to be visually or manually performed.

除了上述聲控操作方法之外,本新型之實施例另外再提供一種聲控操作裝置。請參照圖2,圖2所繪為根據本新型之一實施例之一種聲控操作裝置之示意圖。在圖2中,依據本新型之另一實施例,提供了一種聲控操作裝置200,其可安裝於各種計算機 (例如平板電腦) 或行動裝置 (例如智慧型手機或智慧型手錶等) 中。上述聲控操作裝置200包括資料庫220、擷取模組210、比對模組230及處理模組240。In addition to the above-mentioned voice control operation method, the embodiment of the present invention additionally provides a voice control operation device. Please refer to FIG. 2, which is a schematic diagram of a voice control operation device according to an embodiment of the present invention. In FIG. 2, according to another embodiment of the present invention, a voice control operation device 200 is provided, which can be installed in various computers (such as tablet computers) or mobile devices (such as smart phones or smart watches, etc.). The above-mentioned voice control operation device 200 includes a database 220, an acquisition module 210, a comparison module 230 and a processing module 240.

關於上述資料庫220,進一步說明如下。上述資料庫220係用以儲存一內建資料,上述內建資料包括多個內建聲紋特徵及多個內建關鍵詞。例如,上述內建聲紋特徵可為來自使用者300預先提供者,以儲存於資料庫200並供後續進一步比對、確認是否確為使用者300本人之用。詳細已如前所述,在此不再贅述。The above-mentioned database 220 is further explained as follows. The database 220 is used to store a built-in data. The built-in data includes a plurality of built-in voiceprint features and a plurality of built-in keywords. For example, the above-mentioned built-in voiceprint feature may be provided by the user 300 in advance, so as to be stored in the database 200 and used for subsequent further comparison to confirm whether it is the user 300 himself. The details are as mentioned before, so I won't repeat them here.

關於上述擷取模組210,進一步說明如下。上述擷取模組210係用以接收來自上述使用者300的語音指令,並在擷取上述語音指令後,擷取語音指令中的語音聲紋特徵及語音關鍵詞。接著,在分別順利擷取上述語音聲紋特徵及語音關鍵詞之後,上述擷取模組210則可分別輸出語音聲紋特徵及語音關鍵詞。詳細已如前所述,在此不再贅述。Regarding the aforementioned capturing module 210, further description is as follows. The capturing module 210 is used to receive a voice command from the user 300, and after capturing the voice command, capture voice features and voice keywords in the voice command. Then, after the voice voiceprint features and voice keywords are successfully captured, the capture module 210 can output the voice voiceprint features and voice keywords, respectively. The details are as mentioned before, so I won't repeat them here.

關於上述比對模組230,進一步說明如下。上述比對模組230係用以通訊連接資料庫220,以擷取內建資料 (包括內建聲紋特徵及內建關鍵詞)。此外,上述比對模組230更用以通訊連接擷取模組210,以接收語音聲紋特徵及語音關鍵詞。上述比對模組230則可用以比對並判斷語音聲紋特徵 (例如波形或頻譜) 是否與任一內建聲紋特徵 (例如波形或頻譜) 相同或相似,且比對並判斷任一語音關鍵詞是否與任一內建關鍵詞相同或相似。詳細已如前所述,在此不再贅述。Regarding the above-mentioned comparison module 230, further description is as follows. The comparison module 230 is used to communicate with the database 220 to retrieve built-in data (including built-in voiceprint features and built-in keywords). In addition, the above-mentioned comparison module 230 is further used to communicate with the capturing module 210 to receive voice voiceprint features and voice keywords. The comparison module 230 can be used to compare and determine whether the voice print feature (such as waveform or spectrum) is the same or similar to any built-in voice print feature (such as waveform or spectrum), and compare and determine any voice. Whether the keyword is the same or similar to any of the built-in keywords. The details are as mentioned before, so I won't repeat them here.

其中,依據一實施例,當判斷語音聲紋特徵與任一內建聲紋特徵相同或相似時,上述比對模組230則繼續用以判斷語音關鍵詞是否與任一內建關鍵詞相同或相似。According to an embodiment, when it is determined that the voice voiceprint feature is the same or similar to any built-in voiceprint feature, the comparison module 230 continues to determine whether the voice keyword is the same or similar to any built-in keyword. similar.

關於上述處理模組240,進一步說明如下。上述處理模組240係用以通訊連接比對模組230,以接收語音指令,並根據語音指令執行以輸出一處理結果。詳細已如前所述,在此不再贅述。The processing module 240 described above is further described as follows. The above-mentioned processing module 240 is used to communicate with the comparison module 230 to receive a voice command, and execute according to the voice command to output a processing result. The details are as mentioned before, so I won't repeat them here.

其中,依據又一實施例,當判斷任一語音關鍵詞與任一內建關鍵詞相同或相似時,上述處理模組240則繼續用以執行語音指令,以輸出一處理結果。詳細已如前所述,在此不再贅述。According to another embodiment, when it is determined that any voice keyword is the same or similar to any built-in keyword, the processing module 240 continues to execute the voice command to output a processing result. The details are as mentioned before, so I won't repeat them here.

綜合上述,本新型之實施例由於導入擷取模組,而可供擷取語音指令中的語音聲紋特徵以及語音關鍵詞。再藉由比對模組,將語音聲紋特徵與資料庫中的內建聲紋特徵進行比對,並將語音關鍵詞與資料庫中的內建關鍵詞進行比對。當語音聲紋特徵確認為使用者本人,且語音關鍵詞與內建關鍵詞相符,處理模組即可快速根據語音關鍵詞所對應的動作內容,執行使用者的語音指令。藉此,本新型之實施例確實能提供聲控操作裝置,而可讓使用者透過這樣的聲控操作裝置,在不需透過視覺來接收螢幕資訊、或透過手動來輸入指令的情況下,仍能快速且方便地操作,以完成所欲執行的動作內容。In summary, the embodiment of the present invention is capable of extracting voice features and voice keywords in voice commands due to the introduction of the capture module. Through the comparison module, the voice voiceprint feature is compared with the built-in voiceprint feature in the database, and the voice keywords are compared with the built-in keywords in the database. When the voice voiceprint feature is confirmed to be the user, and the voice keyword matches the built-in keyword, the processing module can quickly execute the user's voice command according to the action content corresponding to the voice keyword. As a result, the embodiments of the present invention can indeed provide a voice-controlled operating device, and the user can use such a voice-activated operating device to quickly receive screen information without visually or manually inputting instructions. And convenient operation to complete the content of the action you want to perform.

本新型在本文中僅以較佳實施例揭露,然任何熟習本技術領域者應能理解的是,上述實施例僅用於描述本新型,並非用以限定本新型所主張之專利權利範圍。舉凡與上述實施例均等或等效之變化或置換,皆應解讀為涵蓋於本新型之精神或範疇內。因此,本新型之保護範圍應以下述之申請專利範圍所界定者為準。The present invention is disclosed in the preferred embodiments in this text. However, anyone familiar with the technical field should understand that the above-mentioned embodiments are only used to describe the present invention and are not intended to limit the scope of the patent rights claimed by the present invention. Any changes or substitutions that are equal or equivalent to the above-mentioned embodiments should be interpreted as being covered by the spirit or scope of the present invention. Therefore, the scope of protection of this new model shall be subject to the scope of the following patent applications.

100:聲控操作方法 110-140、131-132、141-142:步驟 200:聲控操作裝置 210:擷取模組 220:資料庫 230:比對模組 240:處理模組 300:使用者 400:伺服器100: Voice control operation method 110-140, 131-132, 141-142: steps 200: Voice-activated operating device 210: Capture module 220: database 230: comparison module 240: Processing module 300: User 400: server

為讓本新型之上述和其他目的、特徵、優點與實施例能更明顯易懂,所附附圖之說明如下: 圖1所繪為根據本新型之一實施例之一種聲控操作方法之流程圖。 圖2所繪為根據本新型之一實施例之一種聲控操作裝置之示意圖。 In order to make the above and other objectives, features, advantages and embodiments of the present invention more comprehensible, the description of the attached drawings is as follows: FIG. 1 is a flowchart of a voice control operation method according to an embodiment of the present invention. FIG. 2 is a schematic diagram of a voice control operation device according to an embodiment of the present invention.

200:聲控操作裝置 200: Voice-activated operating device

210:擷取模組 210: Capture module

220:資料庫 220: database

230:比對模組 230: comparison module

240:處理模組 240: Processing module

300:使用者 300: User

400:伺服器 400: server

Claims (5)

一種聲控操作裝置,安裝於計算機或行動裝置中,包括: 一資料庫,儲存一內建資料,該內建資料包括複數個內建聲紋特徵及複數個內建關鍵詞; 一擷取模組,接收一使用者的一語音指令,以擷取並輸出該語音指令的一語音聲紋特徵及複數個語音關鍵詞; 一比對模組,通訊連接該資料庫,以擷取該內建資料,且通訊連接該擷取模組,以接收該語音聲紋特徵及該些語音關鍵詞,並判斷該語音聲紋特徵是否與任一該些內建聲紋特徵相同或相似,且判斷任一該些語音關鍵詞是否與任一該些內建關鍵詞相同或相似;以及 一處理模組,通訊連接該比對模組,以接收並執行該語音指令,以輸出一處理結果。 A voice control operating device installed in a computer or mobile device, including: A database, storing a built-in data, the built-in data includes a plurality of built-in voiceprint features and a plurality of built-in keywords; A capturing module that receives a voice command from a user to capture and output a voice voiceprint feature and a plurality of voice keywords of the voice command; A comparison module is communicatively connected to the database to capture the built-in data, and communicatively connected to the capture module to receive the voice voiceprint feature and the voice keywords, and determine the voice voiceprint feature Whether it is the same or similar to any of the built-in voiceprint features, and determine whether any of the speech keywords are the same or similar to any of the built-in keywords; and A processing module is communicatively connected to the comparison module to receive and execute the voice command to output a processing result. 如請求項1所述之聲控操作裝置,其中: 該比對模組判斷該些語音關鍵詞是否與任一該些內建關鍵詞相同或相似,當判斷該語音聲紋特徵與任一該些內建聲紋特徵相同或相似時。 The voice control operation device according to claim 1, wherein: The comparison module determines whether the voice keywords are the same or similar to any of the built-in keywords, when determining that the voice voiceprint feature is the same or similar to any of the built-in voiceprint features. 如請求項1所述之聲控操作裝置,其中: 該處理模組執行該語音指令,當判斷任一該些語音關鍵詞與任一該些內建關鍵詞相同或相似時。 The voice control operation device according to claim 1, wherein: The processing module executes the voice command when it is determined that any of the voice keywords is the same or similar to any of the built-in keywords. 如請求項1所述之聲控操作裝置,其中: 該處理模組執行該語音指令,當判斷該語音聲紋特徵與任一該些內建聲紋特徵相同或相似,且當判斷該語音關鍵詞與任一該些內建關鍵詞相同或相似時。 The voice control operation device according to claim 1, wherein: The processing module executes the voice command, when it is determined that the voice voiceprint feature is the same or similar to any of the built-in voiceprint features, and when it is determined that the voice keyword is the same or similar to any of the built-in keywords . 如請求項1-4中任一項所述之聲控操作裝置,其中: 該比對模組係比對該語音聲紋特徵中之波形或頻譜,以判斷該語音聲紋特徵是否與任一該些內建聲紋特徵相同或相似。 The voice control operation device according to any one of claims 1-4, wherein: The comparison module compares the waveform or frequency spectrum in the voice voiceprint feature to determine whether the voice voiceprint feature is the same or similar to any of the built-in voiceprint features.
TW109215601U 2020-11-26 2020-11-26 Voice-controlled operating apparatus TWM610794U (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
TW109215601U TWM610794U (en) 2020-11-26 2020-11-26 Voice-controlled operating apparatus

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
TW109215601U TWM610794U (en) 2020-11-26 2020-11-26 Voice-controlled operating apparatus

Publications (1)

Publication Number Publication Date
TWM610794U true TWM610794U (en) 2021-04-21

Family

ID=76606153

Family Applications (1)

Application Number Title Priority Date Filing Date
TW109215601U TWM610794U (en) 2020-11-26 2020-11-26 Voice-controlled operating apparatus

Country Status (1)

Country Link
TW (1) TWM610794U (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TWI789891B (en) * 2021-09-03 2023-01-11 中華大學學校財團法人中華大學 Condition-triggered feedback system and method thereof

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TWI789891B (en) * 2021-09-03 2023-01-11 中華大學學校財團法人中華大學 Condition-triggered feedback system and method thereof

Similar Documents

Publication Publication Date Title
US11600265B2 (en) Systems and methods for determining whether to trigger a voice capable device based on speaking cadence
WO2021159688A1 (en) Voiceprint recognition method and apparatus, and storage medium and electronic apparatus
US8793135B2 (en) System and method for auditory captchas
CN106653021B (en) Voice wake-up control method and device and terminal
US20170140750A1 (en) Method and device for speech recognition
TWI525532B (en) Set the name of the person to wake up the name for voice manipulation
WO2020119448A1 (en) Voice information verification
US10270736B2 (en) Account adding method, terminal, server, and computer storage medium
CN107153499A (en) The Voice command of interactive whiteboard equipment
US11404052B2 (en) Service data processing method and apparatus and related device
CN106356057A (en) Speech recognition system based on semantic understanding of computer application scenario
WO2018129869A1 (en) Voiceprint verification method and apparatus
EP4369272A2 (en) Techniques to provide sensitive information over a voice connection
US9311461B2 (en) Security system based on questions that do not publicly identify the speaker
WO2019228135A1 (en) Method and device for adjusting matching threshold, storage medium and electronic device
CN110992955A (en) Voice operation method, device, equipment and storage medium of intelligent equipment
CN108010513A (en) Method of speech processing and equipment
WO2019045816A1 (en) Graphical data selection and presentation of digital content
TWM610794U (en) Voice-controlled operating apparatus
EP3499502A1 (en) Voice information processing method and apparatus
CN109615391A (en) Payment system, method of payment and the second client terminal device
WO2018079294A1 (en) Information processing device and information processing method
WO2019041871A1 (en) Voice object recognition method and device
CN112863495A (en) Information processing method and device and electronic equipment
US11227610B1 (en) Computer-based systems for administering patterned passphrases