TW202001643A - System for determining identity through voiceprint and voice password and method thereof - Google Patents

Info

Publication number
TW202001643A
TW202001643A TW107120121A TW107120121A TW202001643A TW 202001643 A TW202001643 A TW 202001643A TW 107120121 A TW107120121 A TW 107120121A TW 107120121 A TW107120121 A TW 107120121A TW 202001643 A TW202001643 A TW 202001643A
Authority
TW
Taiwan
Prior art keywords
voice
voice signal
voiceprint
password
module
Prior art date
Application number
TW107120121A
Other languages
Chinese (zh)
Inventor
邱全成
Original Assignee
英業達股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 英業達股份有限公司
Priority to TW107120121A
Publication of TW202001643A

Landscapes

  • Telephonic Communication Services (AREA)

Abstract

A system and a method for determining identity through a voiceprint and a voice password are provided. A voice signal is received, and whether it passes verification is decided from both the voiceprint of the voice and the message it carries. The system and method can thereby confirm that a recognized voice was produced by a real person, which increases the security of identity recognition.

Description

System and method for determining identity through voiceprint and voice password

The present invention relates to an identity recognition system and method, and in particular to a system and method for determining identity through a voiceprint and a voice password.

A voiceprint is the sound-wave spectrum, displayed by an instrument, that carries speech information. Modern research indicates that whether a speaker whispers or deliberately imitates another person's voice and tone, the voiceprint remains different even when the imitation is very convincing. The vocal organs used in speech (tongue, teeth, larynx, lungs, and nasal cavity) differ in size and shape from person to person, so the voiceprints of any two people differ, and a voiceprint is therefore unique. In addition, a person's voice stays largely unchanged for a long time after adulthood, so a voiceprint is also relatively stable.

Depending on the application scenario, voiceprint recognition is currently divided into speaker identification (SI) and speaker verification (SV). Speaker verification means determining whether a segment of speech comes from a specific user; in other words, voiceprint recognition can determine a speaker's identity from the speaker's speech. However, although voiceprint recognition can determine the speaker's identity and cannot be defeated by imitation, it cannot tell whether the speech was uttered by a live person or played by an audio playback device. A malicious party may therefore obtain the speaker's voiceprint, for example by recording the speaker's voice, forge the speaker's speech, and thereby steal the speaker's identity.

In summary, the prior art has long suffered from the problem that voiceprint recognition cannot determine whether speech is uttered by a live person, so an improved technical means is needed to solve this problem.

In view of the problem in the prior art that voiceprint recognition cannot determine whether speech is uttered by a live person, the present invention discloses a system and method for determining identity through a voiceprint and a voice password, in which:

The system for determining identity through a voiceprint and a voice password disclosed by the present invention comprises at least: a voice receiving module for receiving a first voice signal and a second voice signal; a voiceprint recognition module for determining whether the voiceprint of the first voice signal matches a pre-established voiceprint; a speech recognition module for determining whether the first voice signal matches a predetermined passphrase and whether the second voice signal matches a voice password; a prompt module for prompting input of the second voice signal when the voiceprint of the first voice signal matches the pre-established voiceprint and the first voice signal matches the predetermined passphrase; and an operation execution module for determining, when the second voice signal matches the voice password, that verification has passed and executing the operation corresponding to the first voice signal.

The method for determining identity through a voiceprint and a voice password disclosed by the present invention comprises at least the steps of: receiving a first voice signal; prompting input of a second voice signal when the voiceprint of the first voice signal is determined to match a pre-established voiceprint and the first voice signal matches a specific passphrase; receiving the second voice signal; and, when the second voice signal is determined to match a voice password, determining that verification has passed and executing the operation corresponding to the first voice signal.

The system and method disclosed by the present invention are as described above. They differ from the prior art in that, after a voice signal is received, the present invention decides whether verification passes from both the voiceprint and the content of the voice signal, thereby solving the problem of the prior art and achieving the technical effect of increasing the security of identity recognition.

The features and embodiments of the present invention are described in detail below with reference to the drawings and examples, in sufficient detail that any person skilled in the relevant art can readily understand the technical means applied to solve the technical problem, implement them accordingly, and thereby achieve the effects attainable by the present invention.

The present invention can perform both voiceprint recognition and speech recognition on the voice signal input by a user: voiceprint recognition determines the user's identity, and speech recognition confirms that the user is the genuine person, which prevents the user's voiceprint from being forged or misappropriated.

The operation of the system is first described with reference to FIG. 1, the architecture diagram of the system for determining identity through a voiceprint and a voice password. As shown in FIG. 1, the system includes a voice receiving module 110, a voiceprint recognition module 131, a speech recognition module 132, a prompt module 150, and an operation execution module 160, and may additionally include an emotion evaluation module 133, an alert module 170, a permission judgment module 180, and a password generation module 190.

The voice receiving module 110 receives the first voice signal and also the second voice signal; the two are usually different voice signals input at different times. In some embodiments, the voice receiving module 110 may also receive a third voice signal.

In general, the voice receiving module 110 may receive the first voice signal, the second voice signal, the third voice signal, and a setup voice signal from an external sound input device such as a microphone (not shown), but the present invention is not limited thereto.

The voiceprint recognition module 131 determines whether the voiceprint of the first voice signal received by the voice receiving module 110 matches a pre-established target voiceprint. The voiceprint recognition module 131 can typically establish the target voiceprint through the voice receiving module 110, but the present invention is not limited thereto.

The speech recognition module 132 determines whether the first voice signal received by the voice receiving module 110 matches a predetermined passphrase. The predetermined passphrase is usually a pre-recorded voice signal: the speech recognition module 132 can convert the predetermined passphrase into passphrase text, convert the received first voice signal into speech text, and compare the two texts to determine whether the first voice signal matches the predetermined passphrase. The present invention is not limited thereto; for example, the predetermined passphrase may instead be text data, in which case the speech recognition module 132 compares the speech text converted from the first voice signal directly with the predetermined passphrase.
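
The text comparison described above can be sketched as follows. This is a minimal illustration under stated assumptions, not the patent's implementation: it assumes the first voice signal has already been transcribed by some speech-to-text engine, and the names `normalize` and `matches_passphrase`, together with the normalization rule (Unicode folding, case folding, dropping punctuation and spaces), are hypothetical choices that the source does not specify.

```python
import unicodedata


def normalize(text: str) -> str:
    """Fold Unicode forms and case, then keep only letters and digits.

    Assumption: small transcript differences (case, punctuation, spacing)
    should not cause a mismatch. The source does not define this rule.
    """
    folded = unicodedata.normalize("NFKC", text).casefold()
    return "".join(ch for ch in folded if ch.isalnum())


def matches_passphrase(transcript: str, passphrase_text: str) -> bool:
    """Compare the transcript of the first voice signal with the passphrase text."""
    return normalize(transcript) == normalize(passphrase_text)
```

With this rule, a transcript such as "Open, Sesame!" matches a stored passphrase text "open sesame", while "open the door" does not. A stricter or looser rule could be substituted without changing the surrounding flow.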

The speech recognition module 132 also determines whether the second voice signal received by the voice receiving module 110 matches the voice password. The voice password is usually text data: the speech recognition module 132 can convert the second voice signal into speech text and compare that text with the voice password, but the present invention is not limited thereto.

The emotion evaluation module 133 can determine an emotional state from the voiceprint of the first voice signal and/or the second voice signal received by the voice receiving module 110. The emotional state determined may be one regarded as normal, such as normal, pleased, angry, or sad, or one regarded as abnormal, such as fear or fright, but the emotional states of the present invention are not limited to these.

The emotion evaluation module 133 can also confirm the emotional state from the third voice signal received by the voice receiving module 110, that is, judge again from the third voice signal whether the emotional state is abnormal. If the state determined this time is still abnormal, the emotion evaluation module 133 can confirm that the emotional state is abnormal; if it is normal, the operation execution module 160 can execute normally. The emotion evaluation module 133 may judge whether the emotional state is abnormal from the voiceprint of the third voice signal or from its content.

The prompt module 150 prompts for input of the second voice signal when the voiceprint recognition module 131 determines that the voiceprint of the first voice signal matches the pre-established target voiceprint and the speech recognition module 132 determines that the first voice signal matches the predetermined passphrase. The prompt module 150 can play a pre-created voice message or display a pre-created text message to prompt for the second voice signal, but the present invention is not limited thereto.

The prompt module 150 can also generate an abnormality question when the emotion evaluation module 133 determines that the emotional state is abnormal, and prompt for input of a third voice signal answering that question. The abnormality question is usually drawn from a preset group of questions. Each question may correspond to one correct answer: the emotion evaluation module 133 can judge the emotional state to be normal when the third voice signal matches the correct answer and abnormal when it does not. In some embodiments, each question may instead have several corresponding answers, for example an answer indicating a normal emotional state and an answer indicating an abnormal emotional state: the emotion evaluation module 133 can judge the emotional state to be normal when the third voice signal matches the normal answer and abnormal when it matches the abnormal answer.
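
The multi-answer variant described above can be sketched as a small data structure plus a classifier. This is an illustrative sketch only: the names `AbnormalQuestion` and `classify_answer` are hypothetical, the third voice signal is assumed to be already transcribed, stored answers are assumed to be lower-case, and checking the abnormal set first is a design assumption rather than something the source requires.

```python
from dataclasses import dataclass
from typing import FrozenSet, Optional


@dataclass(frozen=True)
class AbnormalQuestion:
    """One preset question with answers for each emotional state (hypothetical type)."""
    prompt: str
    normal_answers: FrozenSet[str]    # answers taken to indicate a normal state
    abnormal_answers: FrozenSet[str]  # answers taken to indicate an abnormal state


def classify_answer(question: AbnormalQuestion, transcript: str) -> Optional[str]:
    """Return "normal", "abnormal", or None when the answer matches neither set."""
    answer = transcript.strip().casefold()
    # Assumption: an abnormal-state answer takes priority if the sets overlap.
    if answer in question.abnormal_answers:
        return "abnormal"
    if answer in question.normal_answers:
        return "normal"
    return None
```

An abnormal-state answer acts like a duress code: the owner can agree in advance that one reply means "all is well" and another means "I am being coerced". The `None` result leaves the unmatched case to whatever policy the caller adopts.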

The operation execution module 160 determines that verification has passed when the speech recognition module 132 determines that the second voice signal matches the voice password, and then executes the operation corresponding to the first voice signal, such as a specific function. If the speech recognition module 132 determines that the second voice signal does not match the voice password, the operation execution module 160 can determine that verification has failed and does not execute the operation corresponding to the first voice signal.

The alert module 170 can raise an alert when the emotion evaluation module 133 determines that the emotional state is abnormal, and can also choose whether to raise an alert according to the third voice signal received by the voice receiving module 110. The alert module 170 raises an alert by performing an alert operation, such as calling the police or sounding an alarm, but the present invention is not limited thereto.

The permission judgment module 180 can determine, from the voiceprint of the first voice signal received by the voice receiving module 110, whether the operation corresponding to the first voice signal may be executed by the operation execution module 160. In more detail, the permission judgment module 180 can determine the user's identity from the voiceprint of the first voice signal and read the permissions associated with that identity, and from those permissions decide whether the identified user is allowed to execute the operation corresponding to the first voice signal.
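
A minimal sketch of the permission lookup described above, assuming the voiceprint step has already resolved the speaker to an identity string (or to `None` when no enrolled voiceprint matches). The table contents, the identity names, and the function `is_operation_allowed` are hypothetical; the source does not say how permissions are stored.

```python
from typing import Dict, Optional, Set

# Hypothetical permission table: user identity -> operations that identity may trigger.
PERMISSIONS: Dict[str, Set[str]] = {
    "owner": {"unlock", "play_music", "open_safe"},
    "family_member": {"unlock", "play_music"},
}


def is_operation_allowed(
    user_id: Optional[str],
    operation: str,
    permissions: Dict[str, Set[str]] = PERMISSIONS,
) -> bool:
    """Return True only if the identified user holds a permission for the operation."""
    if user_id is None:  # no enrolled voiceprint matched the first voice signal
        return False
    return operation in permissions.get(user_id, set())
```

Unknown identities and unlisted operations both fall through to `False`, so the sketch denies by default.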

The password generation module 190 can generate the voice password required by the speech recognition module 132 and can transmit the generated voice password to a designated device (not shown). The designated device may be an email server, a mobile phone, or the like, but the present invention is not limited thereto. In general, the password generation module 190 can generate the voice password randomly or according to a specific rule; the present invention imposes no particular limitation.
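
One way to picture random generation and delivery is the sketch below. It is an assumption-laden illustration: the source does not fix the password format, so a numeric code is assumed here; `generate_voice_password`, `spoken_form`, and `issue_voice_password` are hypothetical names; and `send` stands in for whatever channel reaches the designated device.

```python
import secrets
from typing import Callable

DIGIT_WORDS = ("zero", "one", "two", "three", "four",
               "five", "six", "seven", "eight", "nine")


def generate_voice_password(length: int = 6) -> str:
    """Generate a random numeric voice password using a cryptographic RNG."""
    if length < 1:
        raise ValueError("length must be at least 1")
    return "".join(secrets.choice("0123456789") for _ in range(length))


def spoken_form(password: str) -> str:
    """Spell out a numeric password as the words a user would say aloud."""
    return " ".join(DIGIT_WORDS[int(digit)] for digit in password)


def issue_voice_password(send: Callable[[str], None], length: int = 6) -> str:
    """Generate a password and hand it to `send` (a stand-in for the delivery channel)."""
    password = generate_voice_password(length)
    send(password)
    return password
```

The standard-library `secrets` module is used instead of `random` because the password is a security credential. The returned value is what the speech recognition step would later compare against the transcript of the second voice signal.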

Next, an embodiment is used to explain the operation of the system and method of the present invention, with reference to FIG. 2A, the flowchart of the method for determining identity through a voiceprint and a voice password. In this embodiment, the present invention is assumed to be applied to an electronic device such as a media player, speaker, mobile phone, tablet, display, or safe.

First, the electronic device can store the owner's voiceprint. In this embodiment, the voice receiving module 110 receives a voice signal uttered by the owner, and the voiceprint recognition module 131 obtains the owner's voiceprint from that voice signal and uses it as the target voiceprint.

Afterwards, when a user wants to use the electronic device, the voice receiving module 110 can receive the first voice signal uttered by the user, whether or not the user is the owner (step 210). In this embodiment, the voice receiving module 110 is assumed to be connected to a microphone on the electronic device and to remain continuously ready to receive voice signals, so that it receives the first voice signal the user speaks into the microphone.

After the voice receiving module 110 receives the first voice signal (step 210), the voiceprint recognition module 131 can determine whether the voiceprint of the first voice signal matches the target voiceprint pre-established by the owner (step 220), and the speech recognition module 132 can determine whether the first voice signal matches the specific passphrase (step 230). In practice, steps 220 and 230 need not be performed in any particular order: the speech recognition module 132 may first determine whether the first voice signal matches the specific passphrase (step 230), and the voiceprint recognition module 131 may then determine whether the voiceprint of the first voice signal matches the owner's target voiceprint (step 220).
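
The order independence of steps 220 and 230 follows from the fact that the first stage is a logical AND of two checks. A minimal sketch, in which `first_stage_passes` and the `Check` alias are hypothetical names and each check stands in for a real voiceprint or passphrase comparison:

```python
from typing import Callable, Sequence

# A check takes the raw first voice signal and reports whether it is accepted.
Check = Callable[[bytes], bool]


def first_stage_passes(signal: bytes, checks: Sequence[Check]) -> bool:
    """Return True only if every check accepts the signal.

    all() stops at the first failing check, so the order of `checks`
    changes how much work is done but never the result.
    """
    return all(check(signal) for check in checks)
```

Because only the amount of work depends on the order, an implementation could run the cheaper check first, or run both concurrently, without changing the outcome of the first stage.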

If the voiceprint recognition module 131 determines that the voiceprint of the first voice signal does not match the owner's pre-established target voiceprint, or the speech recognition module 132 determines that the first voice signal does not match the specific passphrase, the operation execution module 160 can determine that verification has failed and does not execute the operation corresponding to the first voice signal (for example, unlocking the electronic device), so the user cannot use the electronic device. If instead the voiceprint recognition module 131 determines that the voiceprint of the first voice signal matches the target voiceprint, and the speech recognition module 132 determines that the first voice signal matches the specific passphrase, the prompt module 150 can prompt for input of the second voice signal (step 241), and the voice receiving module 110 can receive the second voice signal (step 245). In this embodiment, the prompt module 150 is assumed to play a prompt message asking the user to input the preset voice password, thereby prompting the user to say the voice password so that the voice receiving module 110 can receive the second voice signal.

After the voice receiving module 110 receives the second voice signal uttered by the user (step 245), the speech recognition module 132 can determine whether the second voice signal matches the voice password (step 250). If it matches, the operation execution module 160 can determine that verification has passed and execute the operation corresponding to the first voice signal (step 295), for example unlocking the electronic device, so that the user can use it. If it does not match, the operation execution module 160 can determine that verification has failed and does not execute the operation corresponding to the first voice signal, namely unlocking the electronic device, so the user cannot use the electronic device.
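
Steps 210 through 295 of FIG. 2A can be sketched end to end as below. This is a simplified sketch under stated assumptions, not the patented implementation: `run_verification` and `Outcome` are hypothetical names, the voiceprint comparison and the speech-to-text step are passed in as callables because the source does not specify them, and text comparison is exact for brevity.

```python
from enum import Enum
from typing import Callable


class Outcome(Enum):
    FAILED_FIRST_STAGE = "failed_first_stage"    # voiceprint or passphrase mismatch
    FAILED_SECOND_STAGE = "failed_second_stage"  # voice password mismatch
    VERIFIED = "verified"


def run_verification(
    first_signal: bytes,
    prompt_for_second_signal: Callable[[], bytes],
    voiceprint_matches: Callable[[bytes], bool],
    transcribe: Callable[[bytes], str],
    passphrase: str,
    voice_password: str,
    execute_operation: Callable[[], None],
) -> Outcome:
    """Two-stage check: voiceprint plus passphrase, then the voice password."""
    # Steps 220 and 230: both conditions must hold for the first voice signal.
    if not (voiceprint_matches(first_signal)
            and transcribe(first_signal) == passphrase):
        return Outcome.FAILED_FIRST_STAGE
    # Steps 241 and 245: prompt the user, then receive the second voice signal.
    second_signal = prompt_for_second_signal()
    # Step 250: compare the spoken content with the voice password.
    if transcribe(second_signal) != voice_password:
        return Outcome.FAILED_SECOND_STAGE
    # Step 295: verification passed, so run the requested operation.
    execute_operation()
    return Outcome.VERIFIED
```

Note that the user is prompted for the second voice signal only after the first stage succeeds, which mirrors the flow in the text: a failed voiceprint or passphrase ends the attempt without revealing that a voice password exists.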

In this way, after determining the user's identity from the voiceprint, the present invention additionally checks whether the user supplies the voice password in the second voice signal, which prevents a user from using the electronic device merely by misappropriating the owner's voiceprint.

In the above embodiment, if the electronic device includes the password generation module 190, the password generation module 190 can dynamically generate the voice password and transmit it to the designated device either before or after the prompt module 150 prompts for the second voice signal (step 241), but before the voice receiving module 110 receives the second voice signal (step 245). In this embodiment, the designated device is assumed to be the owner's mobile phone: the password generation module 190 can randomly generate the voice password and send it to the owner's mobile phone by SMS, email, instant messaging, short-range communication, or similar means, so that the user can obtain the voice password when the user is the owner.

In addition, in the above embodiment, if the electronic device includes the emotion evaluation module 133, the flow of FIG. 2B may apply. After the speech recognition module 132 determines that the second voice signal matches the voice password, and before the operation execution module 160 determines that verification has passed and executes the operation corresponding to the first voice signal (step 295), the emotion evaluation module 133 can determine from the first voice signal and/or the second voice signal whether the user's emotional state is abnormal (step 260). If the emotion evaluation module 133 determines that the user's emotional state is normal, the operation execution module 160 can determine that verification has passed and execute the operation corresponding to the first voice signal (step 295), which in this embodiment means releasing the locked state of the electronic device.

If the emotion evaluation module 133 determines that the user's emotional state is abnormal, the prompt module 150 can generate an abnormality question (step 271) and present it to the user, while the voice receiving module 110 prepares to receive the third voice signal produced when the user answers the question (step 275). In this embodiment, the abnormality question is assumed to be a question pre-recorded by the owner: the prompt module 150 plays the pre-recorded question, the user speaks an answer to it, and the voice receiving module 110 receives that answer as the third voice signal.

After the voice receiving module 110 receives the third voice signal (step 275), the emotion evaluation module 133 can confirm from the third voice signal whether the user's emotional state is abnormal (step 281). If, after this confirmation, the emotion evaluation module 133 determines that the user's emotional state is normal, the operation execution module 160 can determine that verification has passed and execute the operation corresponding to the first voice signal (step 295), which in this embodiment means releasing the locked state of the electronic device.

If, after this confirmation, the emotion evaluation module 133 still determines that the user's emotional state is abnormal, the alert module 170 can choose whether to raise an alert according to the content of the third voice signal received by the voice receiving module 110 (step 285). In this embodiment, the abnormality question is assumed to come from a preset group of questions, each having an answer that indicates a normal emotional state and an answer that indicates an abnormal emotional state. If the third voice signal matches the answer indicating a normal emotional state, the alert module 170 can choose not to raise an alert. If the third voice signal does not match the answer indicating a normal emotional state but its voiceprint indicates a normal emotional state, the alert module 170 can likewise choose not to raise an alert. If the third voice signal matches the answer indicating an abnormal emotional state, the alert module 170 can choose to raise an alert, in which case it may sound an alarm or notify an administrator, a police station, or the like.
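
The alert decision of step 285 can be summarized as a small decision function. This is a sketch under assumptions: `should_alert` is a hypothetical name, `answer_class` is assumed to be "normal", "abnormal", or `None` (the answer matched neither preset answer), and the final case, an unmatched answer whose voiceprint also indicates an abnormal state, is not covered by the source and is treated here as an alert.

```python
from typing import Optional


def should_alert(answer_class: Optional[str], voiceprint_emotion_normal: bool) -> bool:
    """Decide whether to raise an alert from the third voice signal (step 285)."""
    if answer_class == "abnormal":
        return True   # the content matches the abnormal-state answer
    if answer_class == "normal":
        return False  # the content matches the normal-state answer
    # The answer matches neither preset answer: rely on the voiceprint emotion.
    # Assumption: alert when the voiceprint also indicates an abnormal state.
    return not voiceprint_emotion_normal
```

In this sketch the spoken content takes precedence over the voiceprint emotion, which matches the text's statement that the choice is made according to the content of the third voice signal.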

In summary, the present invention differs from the prior art in that, after a voice signal is received, it decides whether verification passes from both the voiceprint and the content of the voice signal. This technical means solves the prior-art problem that voiceprint recognition cannot determine whether speech is uttered by a live person, and thereby achieves the technical effect of increasing the security of identity recognition.

Furthermore, the method of the present invention for determining identity through a voiceprint and a voice password can be implemented in hardware, software, or a combination of hardware and software, and can be implemented in a centralized manner in a single computer system or in a distributed manner in which different components are spread across several interconnected computer systems.

Although the embodiments of the present invention are disclosed as above, the content described is not intended to directly limit the scope of patent protection of the present invention. Minor changes and refinements in the form and details of implementation, made by any person of ordinary skill in the art to which the present invention belongs without departing from the spirit and scope disclosed herein, fall within the scope of patent protection of the present invention. The scope of patent protection of the present invention is still defined by the appended claims.

110‧‧‧Voice receiving module
131‧‧‧Voiceprint recognition module
132‧‧‧Speech recognition module
133‧‧‧Emotion evaluation module
150‧‧‧Prompt module
160‧‧‧Operation execution module
170‧‧‧Warning module
180‧‧‧Permission judgment module
190‧‧‧Password generation module
Step 210‧‧‧Receive the first voice signal
Step 220‧‧‧Determine whether the voiceprint of the first voice signal matches the target voiceprint
Step 230‧‧‧Determine whether the first voice signal matches the specific passphrase
Step 241‧‧‧Prompt for input of the second voice signal
Step 245‧‧‧Receive the second voice signal
Step 250‧‧‧Determine whether the second voice signal matches the voice password
Step 260‧‧‧Determine whether the emotional state is abnormal
Step 271‧‧‧Generate an abnormality question
Step 275‧‧‧Receive the third voice signal
Step 281‧‧‧Confirm whether the emotional state is abnormal according to the voiceprint of the third voice signal
Step 285‧‧‧Select whether to warn according to the content of the third voice signal
Step 295‧‧‧Perform the operation corresponding to the first voice signal

FIG. 1 is a system architecture diagram of the system for determining identity through a voiceprint and a voice password according to the present invention.
FIG. 2A is a flowchart of the method for determining identity through a voiceprint and a voice password according to the present invention.
FIG. 2B is a flowchart of the method for selectively issuing a warning according to the emotional state of a voice signal according to the present invention.


Claims (10)

1. A method for determining identity through a voiceprint and a voice password, the method comprising at least the following steps: receiving a first voice signal; when it is determined that the voiceprint of the first voice signal matches a pre-established target voiceprint and that the first voice signal matches a specific passphrase, prompting for input of a second voice signal; receiving the second voice signal; and when it is determined that the second voice signal matches a voice password, performing an operation corresponding to the first voice signal.

2. The method for determining identity through a voiceprint and a voice password of claim 1, further comprising, before the step of receiving the second voice signal, a step of generating the voice password and transmitting the voice password to a designated device.

3. The method for determining identity through a voiceprint and a voice password of claim 1, further comprising, before the step of performing the operation corresponding to the first voice signal, a step of determining an emotional state according to the voiceprint of the first voice signal and/or the second voice signal, and issuing a warning when the emotional state indicates an abnormality.

4. The method for determining identity through a voiceprint and a voice password of claim 3, wherein the step of issuing a warning when the emotional state indicates an abnormality further comprises generating an abnormality question, receiving a third voice signal corresponding to the abnormality question, and, when the emotional state is confirmed to indicate an abnormality according to the third voice signal, selecting whether to issue the warning according to the third voice signal.

5. The method for determining identity through a voiceprint and a voice password of claim 1, further comprising, before the step of performing the operation corresponding to the first voice signal, a step of determining, according to the voiceprint of the first voice signal, whether the operation corresponding to the first voice signal may be performed.

6. A system for determining identity through a voiceprint and a voice password, the system comprising at least: a voice receiving module for receiving a first voice signal and for receiving a second voice signal; a voiceprint recognition module for determining whether the voiceprint of the first voice signal matches a pre-established target voiceprint; a speech recognition module for determining whether the first voice signal matches a predetermined passphrase and for determining whether the second voice signal matches a voice password; a prompt module for prompting for input of the second voice signal when the voiceprint recognition module determines that the voiceprint of the first voice signal matches the pre-established voiceprint and the speech recognition module determines that the first voice signal matches the predetermined passphrase; and an operation execution module for performing an operation corresponding to the first voice signal when the second voice signal matches the voice password.

7. The system for determining identity through a voiceprint and a voice password of claim 6, further comprising a password generation module for generating the voice password and transmitting the voice password to a designated device.

8. The system for determining identity through a voiceprint and a voice password of claim 6, further comprising an emotion evaluation module and a warning module, wherein the emotion evaluation module determines an emotional state according to the voiceprint of the first voice signal and/or the second voice signal, and the warning module issues a warning when the emotional state is abnormal.

9. The system for determining identity through a voiceprint and a voice password of claim 8, wherein the prompt module further generates an abnormality question when the emotional state indicates an abnormality, the voice receiving module further receives a third voice signal corresponding to the abnormality question so that the emotion evaluation module confirms the emotional state according to the third voice signal, and the warning module selects whether to issue the warning according to the third voice signal when the emotion evaluation module confirms that the emotional state is abnormal.

10. The system for determining identity through a voiceprint and a voice password of claim 6, further comprising a permission judgment module for determining, according to the voiceprint of the first voice signal, whether the operation corresponding to the first voice signal may be performed.

Priority Applications (1)

Application number: TW107120121A
Publication: TW202001643A (en)
Priority date: 2018-06-12
Filing date: 2018-06-12
Title: System for determining identity through voiceprint and voice password and method thereof

Applications Claiming Priority (1)

Application number: TW107120121A
Publication: TW202001643A (en)
Priority date: 2018-06-12
Filing date: 2018-06-12
Title: System for determining identity through voiceprint and voice password and method thereof

Publications (1)

Publication number: TW202001643A
Publication date: 2020-01-01

Family

Family ID: 69942068

Family Applications (1)

Application number: TW107120121A
Publication: TW202001643A (en)
Priority date: 2018-06-12
Filing date: 2018-06-12
Title: System for determining identity through voiceprint and voice password and method thereof

Country Status (1)

Country: TW
Link: TW202001643A (en)
