TW201947359A - Method and system for terminal device accessing different cloud servers

Info

Publication number: TW201947359A
Application number: TW107116595A
Authority: TW
Inventors: 伍自強; 李永輝
Original assignee: 中孚創聯科技有限公司
Priority date: 2018-05-16
Filing date: 2018-05-16
Publication date: 2019-12-16
Also published as: TWI657355B

Abstract

The present invention provides a method and a system for a terminal device to access different cloud servers. The terminal device receives a voice message, converts the voice message into a text, and then checks the text. When the text includes a first wake-up command, the terminal device sends the voice message to a first cloud server over the Internet; when the text includes a second wake-up command, the terminal device sends the voice message to a second cloud server over the Internet.

Description

Voice control method and system for connecting different cloud servers

The present invention relates to voice control technology, and more particularly to a method for connecting to different cloud servers by voice control.

At the end of 2016, the Market Intelligence & Consulting Institute (MIC) of the Institute for Information Industry identified nine major development trends for the network communications industry in 2017: 5G acceleration, connected cars, cross-domain mergers and acquisitions, precision targeting in film and television, mobile Internet of Things, network alliances, human-machine dialogue, wearables and displays, and a recovery that remains fragile. The human-machine dialogue trend corresponds to the artificial intelligence voice assistant services that are drawing steadily growing attention in the market. Google Trends data for the period from 2012 to April 2017 shows that voice assistants have maintained a steady level of online discussion, with a clear upward trend in recent years. This indicates that the voice assistant market is taking shape, and vendors are actively expanding overseas to claim a share of the large global voice assistant market.

Artificial intelligence applications are currently developing along two main lines: speech recognition and image recognition. According to a forecast by International Data Corporation (hereinafter IDC), the artificial intelligence market will grow rapidly from USD 8 billion in 2016 to USD 47 billion in 2020, a compound annual growth rate of 55.1%. The Industrial Economics and Knowledge Center (IEK) of the Industrial Technology Research Institute (ITRI) noted that the artificial intelligence industry entered an accelerated phase in 2017 in areas such as machine vision, voice assistants, image recognition, and medical diagnosis, prompting changes across the related industry chains. From 2017 onward, related artificial intelligence terminal device products are expected to grow explosively.

IDC stated that almost every industry can expect a fresh boost to growth from the profits enabled by the data-processing capabilities of artificial intelligence. The industries that have adopted artificial intelligence systems so far are mainly finance, retail, healthcare, and discrete manufacturing. Together these four industries accounted for 50% of the 2016 market, and adoption is expected to spread more quickly into a wider range of fields.

IDC research shows that, by region, North America accounts for 78% of the global market with a market size of USD 6.2 billion. Europe, the Middle East, and Africa rank second. Although the Asia-Pacific region started later than other regions, its rapid growth is expected to make it the second-largest market by 2020. By growth rate, the Japanese market is forecast to lead with a compound annual growth rate of 114.9% from 2015 to 2020, followed by Asia-Pacific excluding Japan at 63.9%, South America at 56.2%, the United States at 54.5%, and Western Europe at 50.1%. Capturing the large market created by voice assistants will depend on whether a system can recognize multiple languages.

Voice devices are a market that major brands are competing for, and they are regarded as the next computing platform after computers and smartphones. The main vendors currently providing speech recognition services include Amazon.com with Alexa, Apple with Siri, Facebook with Facebook M, Google with Google Assistant, and Samsung with Bixby. Each vendor's voice assistant has its strengths and its shortcomings. However, most terminal devices on the market can use only one vendor's voice assistant and cannot work across platforms, which is not an ideal usage environment.

The main object of the present invention is to provide a method and a system for voice-controlled connection to different cloud servers, which allow a single terminal device to connect to different network servers by voice control.

To achieve the above object and effect, the present invention discloses a method for voice-controlled connection to different cloud servers, by which a user uses a terminal device to connect to a first cloud server or a second cloud server by voice control. The method includes the following steps:

1. Provide a voice message to a terminal device;

2. Convert the voice message into a text in the terminal device;

3. Check whether the text contains a first wake-up command or a second wake-up command;

4. If the text contains the first wake-up command, send the voice message to a first cloud server; if the text contains the second wake-up command, send the voice message to a second cloud server; if the text contains neither the first wake-up command nor the second wake-up command, repeat steps 1 to 4.
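The four steps above can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the patented implementation: `transcribe`, `select_server`, `run`, and the `send` callback are hypothetical names, and `transcribe` simply decodes bytes as a stand-in for real speech recognition. Note that the original voice message, not the text, is what gets forwarded.

```python
from typing import Callable, Dict, Iterable, Optional


def transcribe(voice_message: bytes) -> str:
    """Hypothetical stand-in for speech recognition (step 2)."""
    return voice_message.decode("utf-8")


def select_server(text: str, wake_commands: Dict[str, str]) -> Optional[str]:
    """Step 3: return the server mapped to the first wake-up command found."""
    lowered = text.lower()
    for wake_command, server in wake_commands.items():
        if wake_command.lower() in lowered:
            return server
    return None


def run(
    voice_messages: Iterable[bytes],
    wake_commands: Dict[str, str],
    send: Callable[[str, bytes], None],
) -> None:
    for voice_message in voice_messages:             # step 1: receive a voice message
        text = transcribe(voice_message)             # step 2: convert it to text
        server = select_server(text, wake_commands)  # step 3: look for a wake-up command
        if server is not None:                       # step 4: forward the voice message
            send(server, voice_message)
        # No wake-up command found: fall through and repeat steps 1 to 4.


sent = []
run(
    [b"hello", b"Alexa, play music", b"Hi Siri, navigate"],
    {"Alexa": "first", "Siri": "second"},
    lambda server, voice: sent.append((server, voice)),
)
```

After this call, `sent` holds one entry per message that contained a wake-up command; the message without one is silently skipped.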

In one embodiment, after receiving the voice message, the terminal device temporarily stores the voice message in a buffer unit.

In one embodiment, after the voice message is converted into the text, the text is temporarily stored in the buffer unit.

In one embodiment, the voice message is transmitted to the first cloud server or the second cloud server through a network.

The present invention further provides a system for voice-controlled connection to different cloud servers, including a terminal device, a network, a first cloud server, and a second cloud server.

The terminal device includes: an audio receiving unit that receives a voice message; an audio conversion unit that converts the voice message into a text; a comparison unit that checks whether the text contains a first wake-up command or a second wake-up command; and a network connection unit that transmits the voice message through the network to the first cloud server if the text contains the first wake-up command, and to the second cloud server if the text contains the second wake-up command.

In one embodiment, the terminal device further includes a buffer unit that stores the voice message and the text.

In one embodiment, the terminal device further includes a sleep wake-up unit and a central processing unit. When the comparison unit finds that the text contains the first wake-up command or the second wake-up command, the sleep wake-up unit wakes the central processing unit from sleep mode. The central processing unit then retrieves the voice message from the buffer unit, connects to the network through the network connection unit, and transmits the voice message to the first cloud server or the second cloud server.
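The interaction between the buffer unit, the sleep wake-up unit, and the central processing unit might be modeled as below. This is a hedged sketch: the class and method names (`Buffer`, `CentralProcessingUnit`, `TerminalDevice`, `on_voice`) are hypothetical, the recognized text is passed in directly rather than produced by an audio conversion unit, and the network upload is replaced by an in-memory list.

```python
from dataclasses import dataclass, field
from typing import List, Optional, Tuple


@dataclass
class Buffer:
    """Holds the most recent voice message and its recognized text."""
    voice: Optional[bytes] = None
    text: Optional[str] = None


@dataclass
class CentralProcessingUnit:
    asleep: bool = True
    uploads: List[Tuple[str, bytes]] = field(default_factory=list)

    def upload(self, server: str, voice: bytes) -> None:
        if self.asleep:
            raise RuntimeError("central processing unit must be woken first")
        self.uploads.append((server, voice))  # stand-in for a network transfer


@dataclass
class TerminalDevice:
    first_wake: str
    second_wake: str
    buffer: Buffer = field(default_factory=Buffer)
    cpu: CentralProcessingUnit = field(default_factory=CentralProcessingUnit)

    def on_voice(self, voice: bytes, text: str) -> bool:
        # Buffer both the raw voice message and the recognized text.
        self.buffer.voice = voice
        self.buffer.text = text
        # Comparison: decide which server, if any, the message is for.
        if self.first_wake in text:
            server = "first"
        elif self.second_wake in text:
            server = "second"
        else:
            return False  # no wake-up command: the CPU stays asleep
        self.cpu.asleep = False                      # sleep wake-up unit wakes the CPU
        self.cpu.upload(server, self.buffer.voice)   # CPU pulls the voice message from the buffer
        return True


device = TerminalDevice(first_wake="Alexa", second_wake="Siri")
ignored = device.on_voice(b"noise", "good morning")
still_asleep = device.cpu.asleep
handled = device.on_voice(b"raw-audio", "Hi Siri, navigate home")
```

The point of the design is that the central processing unit does no work until a wake-up command has been matched; the comparison happens while it is still asleep.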

The above objects and advantages of the present invention can be understood in depth from the following detailed description of selected embodiments and the accompanying drawings.

FIG. 1 is a schematic diagram of a voice control system according to a preferred embodiment of the present invention, which includes a terminal device 10, a network 12, and a plurality of cloud servers 14, 16, 18. The terminal device 10 may be an electronic device such as a smartphone, a tablet, a laptop computer, a desktop computer, a personal digital assistant (PDA), or another specific electronic product. Referring to FIG. 2, in this embodiment the terminal device 10 is a smartphone having at least an audio receiving unit 20, a buffer unit 22, an audio conversion unit 24, a comparison unit 26, a sleep wake-up unit 28, a central processing unit 30, and a network connection unit 32. In addition, an application (APP, not shown) is installed on the terminal device 10 to perform the actions described below.

In this embodiment, the audio receiving unit 20 may be a microphone or an equivalent device for receiving voice messages. A voice message received by the audio receiving unit 20 contains both ambient sound and the user's voice. The received voice message may first be stored temporarily in the buffer unit 22, or sent directly to the audio conversion unit 24 for processing.

The main function of the audio conversion unit 24 is to convert the voice message received by the audio receiving unit 20 into text. The voice message received by the audio receiving unit 20 is an analog signal. When the audio conversion unit 24 receives it, the unit first converts the analog voice message into a digital voice message; a digital filter then removes the ambient sound from the digital voice message, leaving the user's voice. Finally, the audio conversion unit 24 converts the filtered digital voice message into text and stores the text temporarily in the buffer unit 22. These operations are conventional speech recognition techniques well known to those of ordinary skill in the art and are not described in detail here.
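The digitization and filtering stages can be illustrated with a small Python sketch. The text does not specify the sample format or the type of digital filter, so both choices here are assumptions made only for illustration: samples are quantized to signed 16-bit integers, and a first-order high-pass filter stands in for the ambient-sound filter. The function names are hypothetical, and the final speech-to-text stage is not shown.

```python
from typing import List


def quantize_16bit(samples: List[float]) -> List[int]:
    """Map analog-style samples in [-1.0, 1.0] to signed 16-bit integers.

    Values outside the range are clipped before scaling.
    """
    quantized = []
    for sample in samples:
        clipped = max(-1.0, min(1.0, sample))
        quantized.append(int(round(clipped * 32767)))
    return quantized


def high_pass(samples: List[int], alpha: float = 0.95) -> List[float]:
    """First-order high-pass filter (an assumed stand-in for the digital filter).

    y[0] = 0 and y[n] = alpha * (y[n-1] + x[n] - x[n-1]), so a constant
    (DC) input is suppressed while rapid changes pass through.
    """
    if not samples:
        return []
    filtered = [0.0]
    for n in range(1, len(samples)):
        filtered.append(alpha * (filtered[-1] + samples[n] - samples[n - 1]))
    return filtered


digital = quantize_16bit([0.0, 1.0, -1.0, 2.0])
steady = high_pass([100, 100, 100, 100, 100])
step = high_pass([0, 100], alpha=0.5)
```

A real device would more likely use a noise-suppression front end tuned for speech; the filter above only shows where such a stage sits in the pipeline.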

The comparison unit 26 checks whether a specific wake-up command is present in the text. An executable voice message usually consists of two parts: a wake-up command at the beginning, followed by an execution command. Take "Siri, call Wang Datou" as an example. "Siri" is the wake-up command, which wakes the terminal device 10 from the sleep state into the working state; "call Wang Datou" is the execution command, which instructs the terminal device 10 to look up Wang Datou's telephone number in the contacts and dial it.
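The two-part structure of a voice message can be shown with a short Python sketch. This is only an illustration of the structure: in the described system the terminal device forwards the whole voice message and the cloud server interprets the execution command. The function name `split_command` and the set of stripped punctuation characters are assumptions.

```python
import re
from typing import Optional, Tuple


def split_command(text: str, wake_command: str) -> Optional[Tuple[str, str]]:
    """Split recognized text into (wake-up command, execution command).

    Returns None when the wake-up command does not occur in the text.
    Matching is case-insensitive; whatever follows the wake-up command,
    minus leading punctuation and spaces, is the execution command.
    """
    match = re.search(re.escape(wake_command), text, flags=re.IGNORECASE)
    if match is None:
        return None
    rest = text[match.end():]
    return match.group(0), rest.lstrip(" ,，:：!！").strip()


parts = split_command("Siri, call Wang Datou", "Siri")
with_greeting = split_command("Hi Siri, navigate home", "siri")
missing = split_command("call Wang Datou", "Siri")
```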

In this embodiment, three connectable cloud servers are configured: a first cloud server 14, a second cloud server 16, and a third cloud server 18. The audio conversion unit 24 stores three wake-up commands in advance: a first wake-up command, a second wake-up command, and a third wake-up command. The comparison unit 26 first checks whether the text contains the first wake-up command. If it does, the comparison unit 26 commands the sleep wake-up unit 28 to wake the central processing unit 30 from sleep mode. Having switched from sleep mode to working mode, the central processing unit 30 retrieves the voice message from the buffer unit 22, connects to the network 12 through the network connection unit 32, and transmits the voice message to the first cloud server 14. If the text does not contain the first wake-up command, the comparison unit 26 then checks whether the text contains the second wake-up command.

In the same way, if the text contains the second wake-up command, the comparison unit 26 commands the sleep wake-up unit 28 to wake the central processing unit 30 from sleep mode. Having switched from sleep mode to working mode, the central processing unit 30 retrieves the voice message from the buffer unit 22, connects to the network 12 through the network connection unit 32, and transmits the voice message to the second cloud server 16. If the text does not contain the second wake-up command, the comparison unit 26 then checks whether the text contains the third wake-up command. If it does, the comparison unit 26 commands the sleep wake-up unit 28 to wake the central processing unit 30 from sleep mode. Having switched from sleep mode to working mode, the central processing unit 30 retrieves the voice message from the buffer unit 22, connects to the network 12 through the network connection unit 32, and transmits the voice message to the third cloud server 18.

If the comparison unit 26 finds that the text contains none of the first, second, and third wake-up commands, no action is taken and the system remains in sleep mode, while the audio receiving unit 20 continues to receive voice messages and the steps described above are repeated.
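One consequence of checking the wake-up commands strictly in sequence is that the earliest entry wins when a text happens to contain more than one of them. A minimal Python sketch of that ordered check follows. The table pairs the example wake-up commands used in the embodiments with the reference numerals of the three cloud servers; the names `WAKE_TABLE` and `match_in_order` are hypothetical.

```python
from typing import List, Optional, Tuple

# Ordered table of (wake-up command, cloud server reference numeral).
WAKE_TABLE: List[Tuple[str, int]] = [
    ("Alexa", 14),           # first wake-up command -> first cloud server 14
    ("Siri", 16),            # second wake-up command -> second cloud server 16
    ("Skill platform", 18),  # third wake-up command -> third cloud server 18
]


def match_in_order(
    text: str, table: List[Tuple[str, int]] = WAKE_TABLE
) -> Optional[int]:
    """Check each wake-up command in turn and return the first matching server.

    Returns None when no wake-up command is present, in which case the
    device would stay in sleep mode and keep listening.
    """
    for wake_command, server in table:
        if wake_command in text:
            return server
    return None


third = match_in_order("Skill platform, turn on the air conditioner")
both = match_in_order("Siri, ask Alexa for the weather")
none_found = match_in_order("good morning")
```

In the second call the text contains both "Siri" and "Alexa", and the result is server 14 because the first wake-up command is checked first.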

It should be noted that the first, second, and third wake-up commands may be defined by the user, for example "Hello, smart speaker!", or specified by the cloud server. For example, if the first cloud server 14 is a cloud server of Apple Inc., the first wake-up command may be "Siri".

In one embodiment, the first cloud server 14 is a cloud server of Amazon.com, and the first wake-up command is "Alexa". Suppose a user says "Alexa, play Yesterday Once More by The Carpenters". After receiving the voice message, the audio receiving unit 20 passes it to the audio conversion unit 24, which converts it into a text. The comparison unit 26 then inspects the text and finds that it contains the first wake-up command (Alexa), so the sleep wake-up unit 28 wakes the central processing unit 30, switching it from sleep mode to working mode. The central processing unit 30 then retrieves the voice message from the buffer unit 22 and uploads it to the Amazon.com cloud server through the network connection unit 32. The server carries out the execution command (play "Yesterday Once More" by The Carpenters), obtains the audio file of the song, and returns it to the terminal device 10, which launches its player program to play the song.

In one embodiment, the second cloud server 16 is a cloud server of Apple Inc., and the second wake-up command is "Siri". Suppose a user says "Hi Siri, navigate to No. 185, Section 2, Xinhai Road, Da'an District, Taipei City". After receiving the voice message, the audio receiving unit 20 passes it to the audio conversion unit 24, which converts it into a text. The comparison unit 26 then inspects the text and finds that it contains the second wake-up command (Siri), so the sleep wake-up unit 28 wakes the central processing unit 30, switching it from sleep mode to working mode. The central processing unit 30 then retrieves the voice message from the buffer unit 22 and uploads it to the Apple cloud server through the network connection unit 32. The server carries out the execution command (navigate to No. 185, Section 2, Xinhai Road, Da'an District, Taipei City) and returns the corresponding map to the terminal device 10. Using its own GPS signal, the terminal device 10 marks its current location on the map and performs the navigation.

In one embodiment, the third cloud server 18 is a server for remotely controlling electrical appliances, which can control the appliances in the user's home through the network 12, and the third wake-up command is "Skill platform". Suppose a user says "Skill platform, turn on the air conditioner and set the temperature to 25°C". After receiving the voice message, the audio receiving unit 20 passes it to the audio conversion unit 24, which converts it into a text. The comparison unit 26 then inspects the text and finds that it contains the third wake-up command (Skill platform), so the sleep wake-up unit 28 wakes the central processing unit 30, switching it from sleep mode to working mode. The central processing unit 30 then retrieves the voice message from the buffer unit 22 and uploads it to the remote control server through the network connection unit 32. The server carries out the execution command (turn on the air conditioner and set the temperature to 25°C): it connects to the air conditioner 34 through the network 12, switches on the power of the air conditioner 34 so that it starts running, and sets the temperature to 25°C.

The above description only illustrates examples of the present invention and does not limit the present invention in any form. The scope of rights claimed by the present invention is defined by the claims and is not limited to the above embodiments. A person of ordinary skill in the art may use the technical content disclosed above to make minor changes or modifications that amount to equivalent embodiments without departing from the scope of the technical solution of the present invention, and any such content that does not depart from the technical solution of the present invention remains within its scope.

10‧‧‧Terminal device

12‧‧‧Network

14‧‧‧First cloud server

16‧‧‧Second cloud server

18‧‧‧Third cloud server

20‧‧‧Audio receiving unit

22‧‧‧Buffer unit

24‧‧‧Audio conversion unit

26‧‧‧Comparison unit

28‧‧‧Sleep wake-up unit

30‧‧‧Central processing unit

32‧‧‧Network connection unit

34‧‧‧Air conditioner

FIG. 1 is a schematic diagram of a preferred embodiment of the present invention.
FIG. 2 is a block diagram of the speech recognition system in a preferred embodiment of the present invention.
FIG. 3 is a flowchart of a preferred embodiment of the present invention.
FIG. 4 is a schematic diagram of a preferred embodiment of the present invention, showing voice operation of the first cloud server.
FIG. 5 is a schematic diagram of a preferred embodiment of the present invention, showing voice operation of the second cloud server.
FIG. 6 is a schematic diagram of a preferred embodiment of the present invention, showing voice operation of the third cloud server.

Claims

A method for voice-controlled connection to different cloud servers, by which a user uses a terminal device to connect to a first cloud server or a second cloud server by voice control, comprising the following steps: step 1, providing a voice message to a terminal device; step 2, converting the voice message into a text in the terminal device; step 3, checking whether the text contains a first wake-up command or a second wake-up command; step 4, if the text contains the first wake-up command, sending the voice message to a first cloud server; if the text contains the second wake-up command, sending the voice message to a second cloud server; and if the text contains neither the first wake-up command nor the second wake-up command, repeating steps 1 to 4.

The method for voice-controlled connection to different cloud servers according to claim 1, wherein in step 1, after receiving the voice message, the terminal device temporarily stores the voice message in a buffer unit.

The method for voice-controlled connection to different cloud servers according to claim 2, wherein in step 1, after the voice message is converted into the text, the text is temporarily stored in the buffer unit.

The method for voice-controlled connection to different cloud servers according to claim 1, wherein in step 4, the voice message is transmitted to the first cloud server or the second cloud server through a network.

A system for voice-controlled connection to different cloud servers, comprising: a terminal device, a network, a first cloud server, and a second cloud server, wherein the terminal device comprises: an audio receiving unit that receives a voice message; an audio conversion unit that converts the voice message into a text; a comparison unit that checks whether the text contains a first wake-up command or a second wake-up command; and a network connection unit that transmits the voice message through the network to the first cloud server if the text contains the first wake-up command, and to the second cloud server if the text contains the second wake-up command.

The system for voice-controlled connection to different cloud servers according to claim 5, wherein the terminal device further comprises a buffer unit that stores the voice message and the text.

The system for voice-controlled connection to different cloud servers according to claim 5, wherein the terminal device further comprises a sleep wake-up unit and a central processing unit; when the comparison unit finds that the text contains the first wake-up command or the second wake-up command, the sleep wake-up unit wakes the central processing unit from sleep mode, and the central processing unit retrieves the voice message from the buffer unit, connects to the network through the network connection unit, and transmits the voice message to the first cloud server or the second cloud server.