TWM515143U

TWM515143U - Speech translating system and translation processing apparatus

Info

Publication number: TWM515143U
Application number: TW104214073U
Authority: TW
Inventors: 高志宏
Original assignee: 大禹科技股份有限公司
Priority date: 2015-08-31
Filing date: 2015-08-31
Publication date: 2016-01-01

Abstract

A speech translating system and a translation processing apparatus are provided. The speech translating system contains speech transceiving apparatuses and the translation processing apparatus. Each speech transceiving apparatus obtains an audio stream. A target language of each speech transceiving apparatuses are configured by the translation processing apparatus. The audio stream would be transmitted wirelessly to the translation processing apparatus by the speech transceiving apparatus. Then, the translation processing apparatus performs translating with the audio stream to obtain speech translating results, and transmits these speech translating results wirelessly to the corresponding speech transceiving apparatus according to all target languages of the speech transceiving apparatuses. Therefore, a function of instant multi-person translation would be achieved.

Description

Speech translation system and translation processing device

本新型創作是有關於一種語音翻譯技術，且特別是有關於一種語音翻譯系統及翻譯處理裝置。The novel creation is related to a speech translation technology, and in particular to a speech translation system and a translation processing device.

隨著全球化的發展及交通的便利，地理區域間的距離縮小，人們有更多機會前往其他國家或城市工作或旅遊。近幾年來，前往我國旅遊之國外旅客人數亦不斷提昇。有此可知，除了國際商業往來之外，觀光旅遊業亦逐漸邁向國際化。With the development of globalization and the convenience of transportation, the distance between geographical regions is narrowing, and people have more opportunities to work or travel in other countries or cities. In recent years, the number of foreign tourists visiting China has also increased. It can be seen that in addition to international business contacts, tourism and tourism are gradually becoming international.

一般而言，不同國家的人相互溝通，無可避免地會遭遇到語言不通的問題。雖然市面上已經存在許多即時翻譯裝置或翻譯軟體，但是繁瑣的操作步驟對於實際使用而言相當不便。例如，使用者要在使用者介面上按下「翻譯」按鈕，方能進行一句話的翻譯作業。In general, people from different countries communicate with each other and inevitably encounter language problems. Although many instant translation devices or translation software exist on the market, the cumbersome steps are quite inconvenient for practical use. For example, the user has to press the "Translate" button on the user interface to perform a one-word translation.

本新型創作提供一種語音翻譯系統及翻譯處理裝置，其可提供多人語音翻譯功能，且結合免持（ hands free）模式以免除繁瑣的操作步驟。The novel creation provides a speech translation system and a translation processing device, which can provide a multi-person speech translation function, and combines a hands free mode to avoid cumbersome operation steps.

本新型創作提供一種語音翻譯系統。此語音翻譯系統包括語音收發裝置及翻譯處理裝置。各語音收發裝置取得聲音串流。翻譯處理裝置設定各語音收發裝置的目標語言，各語音收發裝置無線地傳送聲音串流至翻譯處理裝置，翻譯處理裝置依據所有語音收發裝置的目標語言對聲音串流進行翻譯以取得語音翻譯結果，且依據各語音收發裝置的目標語言分別將這些語音翻譯結果無線地傳送至對應的語音收發裝置。The novel creation provides a speech translation system. The speech translation system includes a voice transceiving device and a translation processing device. Each voice transmitting and receiving device acquires a voice stream. The translation processing device sets a target language of each of the voice transmitting and receiving devices, and each of the voice transmitting and receiving devices wirelessly transmits the audio stream to the translation processing device, and the translation processing device translates the voice stream according to the target language of all the voice transmitting and receiving devices to obtain a voice translation result. And the voice translation results are wirelessly transmitted to the corresponding voice transceiver device according to the target language of each voice transceiver device.

本新型創作提供一種翻譯處理裝置，而此翻譯處理裝置包括通訊單元、儲存單元及處理單元。通訊單元用以傳送即接收無線訊號。儲存單元紀錄模組。而處理單元耦接通訊單元及儲存單元，且存取並執行儲存單元所紀錄的模組。這些模組包括語音收發模組及翻譯模組。語音收發模組透過通訊單元無線地接收來自語音收發裝置其中一者的聲音串流。翻譯模組設定各語音收發裝置的目標語言，依據所有語音收發裝置的目標語言對聲音串流進行翻譯以取得語音翻譯結果，且依據各語音收發裝置的目標語言，語音收發模組透過通訊單元分別將語音翻譯結果無線地傳送至對應的語音收發裝置。The novel creation provides a translation processing device, and the translation processing device includes a communication unit, a storage unit, and a processing unit. The communication unit is used to transmit and receive wireless signals. Storage unit record module. The processing unit is coupled to the communication unit and the storage unit, and accesses and executes the module recorded by the storage unit. These modules include a voice transceiver module and a translation module. The voice transceiver module wirelessly receives a voice stream from one of the voice transceiver devices through the communication unit. The translation module sets the target language of each voice transceiver device, translates the voice stream according to the target language of all voice transceiver devices to obtain a voice translation result, and according to the target language of each voice transceiver device, the voice transceiver module transmits the voice transmission module through the communication unit respectively. The speech translation result is wirelessly transmitted to the corresponding voice transceiving device.

基於上述，本新型創作實施例提供一種語音翻譯系統及翻譯處理裝置，其透過語音收發裝置監聽使用者說話以取得聲音串流，翻譯處理裝置可無線地接收聲音串流，並將語音翻譯結果分配至對應的語音收發裝置。藉此，本新型創作實施例便能提供即時多人對話翻譯功能，進而可應用於各種需要與外國人溝通的情境。Based on the above, the present invention provides a voice translation system and a translation processing device, which listens to a user's speech through a voice transceiver device to obtain a voice stream, and the translation processing device can wirelessly receive the voice stream and distribute the voice translation result. To the corresponding voice transceiver. Thereby, the novel creation embodiment can provide an instant multi-person dialogue translation function, and can be applied to various situations that need to communicate with foreigners.

為讓本新型創作的上述特徵和優點能更明顯易懂，下文特舉實施例，並配合所附圖式作詳細說明如下。The above described features and advantages of the present invention will become more apparent and understood from the following description.

諸如藍芽（bluetooth）、WiFi等無線通訊技術，可結合現有的電子裝置（例如，智慧型手機、平板電腦、筆記型電腦等），以提供數種免持功能（例如，免持通話、音樂串流、影像串流等）。而電子裝置亦可同時透過前述無線通訊技術，同時與數台免持裝置連接。據此，本新型創作實施例便是將免持裝置作為語音收發裝置以接收使用者的對話內容，透過翻譯處理裝置將對話內容翻譯成設定的目標語言，並分別傳送至數台語音收發裝置，以讓所有使用者都能聆聽對應的語音翻譯結果。藉此，便能實現即時多人對話翻譯功能。以下提出符合本新型創作之精神的多個實施例，應用本實施例者可依其需求而對這些實施例進行適度調整，而不僅限於下述描述中的內容。Wireless communication technologies such as Bluetooth, WiFi, etc., can be combined with existing electronic devices (eg, smart phones, tablets, notebooks, etc.) to provide several hands-free features (eg, hands-free calling, music) Streaming, video streaming, etc.). The electronic device can also be connected to several hands-free devices through the aforementioned wireless communication technology. Accordingly, the novel creation embodiment is to use the hands-free device as a voice transceiver device to receive the conversation content of the user, and translate the conversation content into the set target language through the translation processing device, and respectively transmit the content to the plurality of voice transceiver devices. So that all users can listen to the corresponding speech translation results. In this way, the instant multi-person dialogue translation function can be realized. In the following, various embodiments in accordance with the spirit of the present invention are proposed, and those applying the present embodiment can appropriately adjust these embodiments according to their needs, and are not limited to the contents in the following description.

圖1是依據本新型創作一實施例說明語音翻譯系統的示意圖。請參照圖1，語音翻譯系統10包括一個或數台語音收發裝置110及翻譯處理裝置150。需說明的是，圖1中僅呈現兩台語音收發裝置110，但在實際應用上不以此為限。1 is a schematic diagram illustrating a speech translation system in accordance with an embodiment of the present invention. Referring to FIG. 1, the speech translation system 10 includes one or more voice transceiving devices 110 and a translation processing device 150. It should be noted that only two voice transceivers 110 are presented in FIG. 1 , but the actual application is not limited thereto.

語音收發裝置110例如是藍芽耳麥（headset）或其他可透過藍芽、WiFi等通訊技術傳送及接收聲音訊號的免持裝置。語音收發裝置110包括單聲道（mono）或立體聲（stereo）揚聲器（Speaker）模組、麥克風模組及通訊模組，其可透過麥克風模組接收使用者所發出的聲音，透過通訊模組（例如，藍芽模組、WiFi模組等）傳送及接收聲音串流，並透過揚聲器模組播放聲音。The voice transceiver 110 is, for example, a Bluetooth headset or other hands-free device that transmits and receives voice signals through communication technologies such as Bluetooth or WiFi. The voice transceiver device 110 includes a mono or stereo speaker module, a microphone module and a communication module, and can receive the sound emitted by the user through the microphone module through the communication module ( For example, Bluetooth modules, WiFi modules, etc. transmit and receive audio streams and play sound through the speaker module.

翻譯處理裝置150例如是智慧型手機、平板電腦或筆記型電腦等具備運算處理功能的電子裝置。以硬體觀點而言，圖2是依據本新型創作一實施例說明翻譯處理裝置150的方塊圖。請參照圖2，翻譯處理裝置150包括通訊單元151、第二通訊單元153、儲存單元155及處理單元157。The translation processing device 150 is, for example, an electronic device having an arithmetic processing function such as a smart phone, a tablet computer, or a notebook computer. In a hardware perspective, FIG. 2 is a block diagram illustrating a translation processing apparatus 150 in accordance with an embodiment of the present invention. Referring to FIG. 2, the translation processing device 150 includes a communication unit 151, a second communication unit 153, a storage unit 155, and a processing unit 157.

通訊單元151例如可支援藍芽標準或WiFi標準等無線通訊技術。在本實施例中，通訊單元710用以無線地接收來自語音收發裝置110的聲音串流，並無線地傳送語音翻譯結果至各語音收發裝置110。而關於聲音串流及語音翻譯結果的產生及處理待稍後實施例詳細說明。The communication unit 151 can support, for example, a wireless communication technology such as a Bluetooth standard or a WiFi standard. In this embodiment, the communication unit 710 is configured to wirelessly receive the voice stream from the voice transceiver device 110 and wirelessly transmit the voice translation result to each of the voice transceiver devices 110. The generation and processing of the sound stream and the speech translation result are described in detail later.

第二通訊單元153可以是支援WiFi標準、第三代無線通訊（3G）、第四代無線通訊（4G）或其他具備無線傳輸功能的任何類型無線網路介面模組。在本實施例中，翻譯處理裝置150可透過第二通訊單元153來經由網際網路（Internet）連接至翻譯伺服器，並將文本（text）資料傳送至翻譯伺服器，且自翻譯伺服器取得翻譯文本資料。而關於文本資料及翻譯文本資料的產生及處理待稍後實施例詳細說明。The second communication unit 153 may be a WiFi standard, third generation wireless communication (3G), fourth generation wireless communication (4G) or other wireless network interface module with wireless transmission function. In this embodiment, the translation processing device 150 can connect to the translation server via the Internet through the second communication unit 153, and transmit the text data to the translation server, and obtain the translation server. Translate text materials. The generation and processing of textual materials and translated textual materials are described in detail later.

儲存單元155可以是任何型態的固定或可移動隨機存取記憶體（random access memory，RAM）、唯讀記憶體（read-only memory，ROM）、快閃記憶體（flash memory）或類似元件或上述元件的組合。在本實施例中，儲存單元155係用以記錄語音收發模組155_1、語音處理模組155_3及翻譯模組155_5等軟體程式。本實施例中所述的儲存單元155並未限制是單一記憶體元件，上述之各軟體模組亦可以分開儲存在兩個或兩個以上相同或不同型態之記憶體元件中。The storage unit 155 can be any type of fixed or removable random access memory (RAM), read-only memory (ROM), flash memory or the like. Or a combination of the above elements. In this embodiment, the storage unit 155 is used to record software programs such as the voice transceiver module 155_1, the voice processing module 155_3, and the translation module 155_5. The storage unit 155 described in this embodiment is not limited to a single memory component, and each of the above software modules may be separately stored in two or more memory components of the same or different types.

語音收發模組155_1例如是基於通訊單元151所支援之通訊技術的管理程式，例如，低能量藍牙（Bluetooth Low Energy；BLE）多裝置（multi-Dev）、藍芽管理程式等，其可取得自通訊單元151取得來自語音收發裝置110的聲音串流，並透過通訊單元151將語音翻譯結果發送至語音收發裝置110。語音處理模組155_3具有自動話音識別（Automatic Speech Recognition；ASR）功能及文本到話音（Text to Speech；TTS）功能，例如，Google語音應用程式介面（Application Program Interface；API），其可辨識聲音串流以轉換成文本資料，亦可將文本資料轉換回聲音串流。而翻譯模組155_5用以對聲音串流經轉換的文本資料進行翻譯以取得語音翻譯結果，其詳細運作待稍後實施例說明。翻譯模組155_5的實施範例可以是支援至少兩種目標語言（例如，中文、英語、日語等）的翻譯軟體程式、引擎或應用程式介面（例如，Google翻譯、Bing翻譯等），且不受限於翻譯處理裝置150的作業系統平台（例如，Android、iOS等）。The voice transceiver module 155_1 is, for example, a management program based on a communication technology supported by the communication unit 151, for example, a Bluetooth low energy (BLE) multi-Dev, a Bluetooth management program, etc., which can be obtained from The communication unit 151 acquires the audio stream from the voice transmitting and receiving device 110, and transmits the voice translation result to the voice transmitting and receiving device 110 via the communication unit 151. The voice processing module 155_3 has an automatic speech recognition (ASR) function and a text to speech (TTS) function, for example, a Google Voice application interface (API), which is identifiable. The sound stream is converted into text data, and the text data can be converted back to the sound stream. The translation module 155_5 is used to translate the converted text data to obtain the speech translation result, and the detailed operation is described in the following embodiments. The implementation example of the translation module 155_5 may be a translation software program, an engine or an application interface (for example, Google translation, Bing translation, etc.) that supports at least two target languages (for example, Chinese, English, Japanese, etc.), and is not limited. The operating system platform of the translation processing device 150 (for example, Android, iOS, etc.).

處理單元157分別與通訊單元151、第二通訊單元153及儲存單元155連接，其可以是中央處理單元（Central Processing Unit，CPU），或是其他可程式化之一般用途或特殊用途的微處理器（Microprocessor）、數位信號處理器（Digital Signal Processor，DSP）、可程式化控制器、特殊應用積體電路（Application Specific Integrated Circuit，ASIC）或其他類似元件或上述元件的組合。在本實施例中，處理單元157係用以存取並執行上述儲存單元155中記錄的模組，藉以實現本新型創作的實施例。The processing unit 157 is respectively connected to the communication unit 151, the second communication unit 153, and the storage unit 155, and may be a central processing unit (CPU), or other programmable general purpose or special purpose microprocessor. (Microprocessor), Digital Signal Processor (DSP), Programmable Controller, Application Specific Integrated Circuit (ASIC) or other similar components or a combination of the above. In this embodiment, the processing unit 157 is configured to access and execute the modules recorded in the storage unit 155, thereby implementing the embodiment of the novel creation.

為了讓本領域具通常知識者能清楚明瞭本新型創作的實施例，下文中，將搭配語音收發裝置110及翻譯處理裝置150說明兩個實施情境，其分別是一對多翻譯及多對多翻譯情境。各個流程可依照實施情形而隨之調整，且並不僅限於此。In order to enable those skilled in the art to clarify the embodiments of the novel creation, in the following, two implementation scenarios will be described in conjunction with the voice transceiver 110 and the translation processing device 150, which are one-to-many translation and many-to-many translation. Situation. The various processes can be adjusted accordingly according to the implementation situation, and are not limited thereto.

一對多翻譯情境One-to-many translation scenario

圖3是依據本新型創作一實施例說明語音翻譯系統10的語音翻譯流程圖。假設一情境為，語音翻譯系統10包括語音收發裝置110_1～110_4及翻譯處理裝置150，操作語音收發裝置110_1～110_4的各使用者的目標語言分別是中文、英語、日語、法語。翻譯處理裝置150可能具有使用者介面（User Interface：UI）或實體按鈕設定介面等以供使用者設定其目標語言。在其他一些實施例中，語音收發裝置110_1～110_4亦可能具有目標語言設定介面，以接收使用者的語言設定操作（例如，選擇英語、日語等），並將目標語言的設定資訊傳送至翻譯處理裝置150。翻譯模組155_5便會紀錄所有語音收發裝置110_1～110_4的目標語言。3 is a flow chart showing the speech translation of the speech translation system 10 in accordance with an embodiment of the present invention. Assuming that the context is that the speech translation system 10 includes the speech transmitting and receiving devices 110_1 to 110_4 and the translation processing device 150, the target languages of the users operating the speech transmitting and receiving devices 110_1 to 110_4 are Chinese, English, Japanese, and French, respectively. The translation processing device 150 may have a user interface (User Interface: UI) or a physical button setting interface or the like for the user to set its target language. In some other embodiments, the voice transceivers 110_1 110 110_4 may also have a target language setting interface to receive a user's language setting operation (eg, select English, Japanese, etc.), and transmit the setting information of the target language to the translation processing. Device 150. The translation module 155_5 records the target languages of all of the voice transceiving devices 110_1 to 110_4.

在目標語言設定完成後，語音收發裝置110_1透過其麥克風模組接收使用者的聲音輸入（步驟S310）（例如，你好），並將聲音輸入所產生的聲音串流傳送至翻譯處理裝置150（步驟S315）。需說明的是，翻譯處理裝置150或語音收發裝置110_1～110_4可能具有啟動（實體或虛擬）按鍵，以接收使用者的啟動操作，進而起始本語音翻譯流程（即，進行步驟S310）。語音收發模組155_1（即，藍芽管理程式）透過通訊模組151接收聲音串流，並將聲音串流傳送至語音處理模組155_3（即，語音應用程式介面）（步驟S320）。語音應用程式介面辨識聲音串流對應的目標語言（本實施例為中文），並依據辨識的目標語言將該聲音串流（即，語音資料）轉換成文本資料（步驟S330）。需說明的是，此文本資料的格式是符合翻譯模組155_5所需之輸入格式。After the target language setting is completed, the voice transceiver 110_1 receives the user's voice input through its microphone module (step S310) (for example, hello), and transmits the voice stream generated by the voice input to the translation processing device 150 ( Step S315). It should be noted that the translation processing device 150 or the voice transceiving devices 110_1~110_4 may have a startup (physical or virtual) button to receive the user's startup operation, thereby starting the speech translation process (ie, proceeding to step S310). The voice transceiver module 155_1 (ie, the Bluetooth management program) receives the voice stream through the communication module 151, and transmits the voice stream to the voice processing module 155_3 (ie, the voice application interface) (step S320). The voice application interface identifies the target language corresponding to the voice stream (in this embodiment, Chinese), and converts the voice stream (ie, voice data) into text data according to the recognized target language (step S330). It should be noted that the format of the text data is in accordance with the input format required by the translation module 155_5.

接著，翻譯模組155_5（即，翻譯程式）判斷是否存在授權金鑰，且判斷使用字元數量是否超過上限門檻值（步驟S341）。此授權金鑰是相關於此翻譯程式的使用權限。而若翻譯模組155_5判斷存在授權金鑰（例如，儲存於儲存單元155），則進一步統計使用字元數量，檢查使用字元數量是否超過上限門檻值（例如，一天一百萬、兩百萬等個使用字元數量）。若使用字元數量超過上限門檻值，則處理單元157禁能翻譯模組155_5（例如，禁能部份或全部功能），或是提供付款操作業面（翻譯模組155_5可在確認付款後接續進行後續翻譯作業）。Next, the translation module 155_5 (ie, the translation program) determines whether or not the authorization key exists, and determines whether the number of used characters exceeds the upper threshold (step S341). This authorization key is used by the translator. If the translation module 155_5 determines that there is an authorization key (for example, stored in the storage unit 155), further counts the number of characters used, and checks whether the number of used characters exceeds the upper threshold (for example, one million, two million a day). Wait for the number of characters used). If the number of characters used exceeds the upper threshold, the processing unit 157 disables the translation module 155_5 (eg, disables some or all of the functions), or provides a payment operation floor (the translation module 155_5 can continue after confirming the payment) Perform subsequent translations).

反之，若使用字元數量未超過上限門檻值，則翻譯模組155_5可將文本資料輸入，透過第二通訊單元153將文本資料傳送至翻譯伺服器（步驟S343）。需說明的是，翻譯模組155_5會先將文本資料轉換成翻譯伺服器所需之輸入格式（例如，JSON（JavaScript Object Notation）），而翻譯伺服器回傳之格式亦為JSON格式。翻譯模組155_5可透過第二通訊單元153自翻譯伺服器取得JSON格式的翻譯結果，並將JSON格式的翻譯結果轉換成翻譯文本資料（步驟S345），以符合語音應用程式介面所需之輸入格式。需說明的是，這些翻譯文本資料可分別對應於語音收發裝置110_1～110_4所設定的目標語言（即，中文、英語、日語、法語）。On the other hand, if the number of used characters does not exceed the upper threshold, the translation module 155_5 can input the text data, and transmit the text data to the translation server through the second communication unit 153 (step S343). It should be noted that the translation module 155_5 first converts the text data into an input format required by the translation server (for example, JSON (JavaScript Object Notation)), and the format of the translation server backhaul is also the JSON format. The translation module 155_5 can obtain the translation result in the JSON format from the translation server through the second communication unit 153, and convert the translation result in the JSON format into the translated text data (step S345) to conform to the input format required by the voice application interface. . It should be noted that the translated text materials may correspond to the target languages (ie, Chinese, English, Japanese, and French) set by the voice transmitting and receiving devices 110_1 to 110_4, respectively.

接著，語音應用程式介面將翻譯文本資料轉換成語音翻譯結果（語音資料格式，例如是聲音原始（raw）資料）（步驟S350）。多媒體播放服務程式便對語音翻譯結果進行語音輸出處理（步驟S360）。而藍芽管理程式透過通訊單元151分別將各目標語言的語音翻譯結果至對應的語音收發裝置110_1～110_4（即，語音收發裝置110_1取得中文的語音翻譯結果（例如，你好），語音收發裝置110_2取得英語的語音翻譯結果（例如，hello）等，其餘依此類推）（步驟S370）。藉此，本新型創作實施例便能實現一對多即時語音翻譯之功能。Next, the voice application interface converts the translated text data into a voice translation result (a voice data format, such as a sound raw material) (step S350). The multimedia playback service program performs voice output processing on the voice translation result (step S360). The Bluetooth management program separately transmits the voice translation result of each target language to the corresponding voice transmitting and receiving device 110_1~110_4 through the communication unit 151 (that is, the voice transmitting device 110_1 obtains the Chinese voice translation result (for example, hello), the voice transceiver device. 110_2 obtains a speech translation result of English (for example, hello), and the like, and so on (step S370). Thereby, the novel creation embodiment can realize the function of one-to-many instant voice translation.

此外，語音處理模組155_3亦會辨識聲音串流中是否出現斷句，以作為翻譯句子的結束條件。例如，「今天天氣不錯」及「你下午到哪去？」可分別被判斷為出現斷句，「今天天氣不錯」及「你下午到哪去？」會分別作為兩組文本資料輸入至翻譯模組155_5。而語音翻譯結果輸出至語音收發裝置110後，藍芽程式會再自動開啟，以接收下一斷句對應的聲音串流。換言之，在實際操作上，使用者便無須反覆進行手動啟動操作，翻譯處理裝置150可接續進行翻譯作業。In addition, the voice processing module 155_3 also recognizes whether a sentence is present in the voice stream as an end condition for translating the sentence. For example, "Where is the weather today" and "Where are you going in the afternoon?" can be judged as a broken sentence, "Today's good weather" and "Where are you going in the afternoon?" will be entered into the translation module as two sets of text data respectively. 155_5. After the voice translation result is output to the voice transceiver 110, the Bluetooth program will be automatically turned on to receive the voice stream corresponding to the next sentence. In other words, in actual operation, the user does not need to manually perform the manual start operation, and the translation processing device 150 can continue the translation operation.

多對多翻譯情境Many-to-many translation scenarios

圖4是依據本新型創作另一實施例說明語音翻譯系統10的語音翻譯流程圖。假設一情境為，語音翻譯系統10包括語音收發裝置110_1～110_3及翻譯處理裝置150，操作語音收發裝置110_1～110_4的各使用者的目標語言分別是中文、英語、日語。步驟S430～S470的詳細說明可參照圖3中的步驟S330～S370，於此不再贅述。4 is a flow chart showing the speech translation of the speech translation system 10 in accordance with another embodiment of the present invention. Assuming that the context is that the speech translation system 10 includes the speech transmitting and receiving devices 110_1 to 110_3 and the translation processing device 150, the target languages of the users operating the speech transmitting and receiving devices 110_1 to 110_4 are Chinese, English, and Japanese, respectively. For details of the steps S430 to S470, refer to steps S330 to S370 in FIG. 3, and details are not described herein again.

圖4與圖3不同的地方在於，語音收發裝置110_1～110_3可同時或不同時分別接收三位使用者的聲音輸入（例如，早安、hello、arigatou）（步驟S410），並將分別三組聲音輸入所產生的三組聲音串流分別傳送至翻譯處理裝置150（步驟S415）。而藍芽管理程式可依序或同時接收三組聲音串流，並分別將三組聲音串流傳送至語音應用程式介面（步驟S430）。由於語音處理模組155_3可自動辨識三組聲音串流對應的目標語言，因此在步驟S445中翻譯模組155_5亦可對應取得三組聲音串流分別對應的三組翻譯文本資料（即，中文（早安、你好、謝謝）、英語（good morning、hello、thanks）及日語（ohayo gozaimasu、konnichiwa、arigatou））。藉此，本新型創作實施例便能實現多對多即時語音翻譯之功能。4 is different from FIG. 3 in that the voice transceiving devices 110_1 110 110_3 can respectively receive the voice inputs of three users (for example, good morning, hello, arigatou) at the same time or at different times (step S410), and will respectively be three groups. The three sets of sound streams generated by the sound input are respectively transmitted to the translation processing device 150 (step S415). The Bluetooth management program can receive three sets of sound streams sequentially or simultaneously, and respectively stream the three sets of sounds to the voice application interface (step S430). Since the voice processing module 155_3 can automatically recognize the target language corresponding to the three sets of voice streams, the translation module 155_5 can also obtain three sets of translated text data corresponding to the three sets of voice streams respectively in step S445 (ie, Chinese ( Good morning, hello, thank you), English (good morning, hello, thanks) and Japanese (ohayo gozaimasu, konnichiwa, arigatou)). Thereby, the novel authoring embodiment can realize the function of many-to-many instant voice translation.

需說明的是，前述實施例中各軟體模組（即，語音收發模組155_1、語音處理模組155_3及翻譯模組155_3）可經由單一軟體程式來分別控制其各別的操作，而此軟體程式是儲存於儲存單元155中，並可藉由處理單元157進行存取及執行。It should be noted that, in the foregoing embodiments, the software modules (ie, the voice transceiver module 155_1, the voice processing module 155_3, and the translation module 155_3) can respectively control their respective operations through a single software program, and the software is separately controlled. The program is stored in the storage unit 155 and can be accessed and executed by the processing unit 157.

綜上所述，本新型創作實施例提供一種語音翻譯系統及翻譯處理裝置，其僅需要單一翻譯處理裝置便能處理兩台以上語音收發裝置所接收到的對話內容，並將語音翻譯結果分別傳送至對應的語音收發裝置。而由於藍芽耳麥等免持裝置攜帶方便並已廣泛受大眾使用，且因此使用者僅需將本新型創作實施例所實現的軟體程式裝載於其電子裝置（例如，智慧型手機、平板電腦等），便能輕易實現一對多或多對多的即時語音翻譯。In summary, the present invention provides a voice translation system and a translation processing device that can process the conversation content received by two or more voice transceivers by a single translation processing device, and transmit the voice translation results separately. To the corresponding voice transceiver. Since the Bluetooth-free headset and the like are portable and widely used by the public, the user only needs to load the software program implemented by the novel creation embodiment on the electronic device (for example, a smart phone, a tablet, etc.) ), one-to-many or many-to-many instant voice translation can be easily implemented.

雖然本新型創作已以實施例揭露如上，然其並非用以限定本新型創作，任何所屬技術領域中具有通常知識者，在不脫離本新型創作的精神和範圍內，當可作些許的更動與潤飾，故本新型創作的保護範圍當視後附的申請專利範圍所界定者為準。Although the present invention has been disclosed in the above embodiments, it is not intended to limit the novel creation, and any person skilled in the art can make some changes without departing from the spirit and scope of the novel creation. Retouching, the scope of protection of this new creation is subject to the definition of the scope of the patent application attached.

10‧‧‧語音翻譯系統
110、110_1～ 110_4‧‧‧語音收發裝置
150‧‧‧翻譯處理裝置
151‧‧‧通訊單元
153‧‧‧第二通訊單元
155‧‧‧儲存單元
155_1‧‧‧語音收發模組
155_3‧‧‧語音處理模組
155_5‧‧‧翻譯模組
157‧‧‧處理單元
S310～S370、S410～S470‧‧‧步驟10‧‧‧Voice translation system
110, 110_1~110_4‧‧‧ voice transceiver
150‧‧‧Translation processing device
151‧‧‧Communication unit
153‧‧‧Second communication unit
155‧‧‧ storage unit
155_1‧‧‧Voice transceiver module
155_3‧‧‧Voice Processing Module
155_5‧‧‧Translation module
157‧‧‧Processing unit
S310～S370, S410～S470‧‧‧ steps

圖1 是依據本新型創作一實施例說明語音翻譯系統的示意圖。圖2 是依據本新型創作一實施例說明翻譯處理裝置的方塊圖。圖3 是依據本新型創作一實施例說明語音翻譯系統的語音翻譯流程圖。圖4 是依據本新型創作另一實施例說明語音翻譯系統的語音翻譯流程圖。1 is a schematic diagram illustrating a speech translation system in accordance with an embodiment of the present invention. 2 is a block diagram showing a translation processing apparatus in accordance with an embodiment of the present invention. FIG. 3 is a flow chart showing the speech translation of the speech translation system according to an embodiment of the present invention. 4 is a flow chart showing the speech translation of the speech translation system in accordance with another embodiment of the present invention.

150‧‧‧翻譯處理裝置 150‧‧‧Translation processing device

151‧‧‧通訊單元 151‧‧‧Communication unit

153‧‧‧第二通訊單元 153‧‧‧Second communication unit

155‧‧‧儲存單元 155‧‧‧ storage unit

155_1‧‧‧語音收發模組 155_1‧‧‧Voice transceiver module

155_3‧‧‧語音處理模組 155_3‧‧‧Voice Processing Module

155_5‧‧‧翻譯模組 155_5‧‧‧Translation module

157‧‧‧處理單元 157‧‧‧Processing unit

Claims

A voice translation system, comprising: a plurality of voice transceivers, wherein each of the voice transceivers obtains a voice stream; and a translation processing device, wherein the translation processing device sets a target language of each of the voice transceivers, each The voice transceivers wirelessly transmit the voice stream to the translation processing device, and the translation processing device translates the voice stream according to the target language of all of the voice transceiver devices to obtain a plurality of voice translation results, and The target language of each of the voice transceivers wirelessly transmits the voice translation results to the corresponding voice transceivers.

The speech translation system of claim 1, wherein the translation processing device recognizes the target language corresponding to the audio stream, and converts the audio stream into a text material according to the target language.

The speech translation system of claim 2, wherein the translation processing device translates the text data according to the target language of all of the voice transceiving devices to obtain a plurality of translated text materials, and the translated texts are The data is converted into the speech translation results.

The speech translation system of claim 2, wherein the translation processing device determines whether an authorization key exists and determines whether the number of used characters exceeds an upper threshold.

The speech translation system of claim 3, wherein the translation processing device transmits the text data to a translation server and retrieves the translated text data from the translation server.

A translation processing device includes: a communication unit that transmits and receives wireless signals; a storage unit that records a plurality of modules; and a processing unit that couples the communication unit and the storage unit and accesses and executes the storage unit Recording the modules, the modules include: a voice transceiver module, wirelessly receiving a voice stream from one of the plurality of voice transceivers through the communication unit; and a translation module, setting each a target language of the voice transceivers, the voice stream is translated according to the target language of all of the voice transceivers to obtain a plurality of voice translation results, and according to the target language of each of the voice transceivers, The voice transceiver module wirelessly transmits the voice translation results to the corresponding voice transceiver devices through the communication unit.

The translation processing device of claim 6, wherein the modules further comprise: a voice processing module, identifying the target language corresponding to the voice stream, and converting the voice stream according to the target language Into a text message.

The translation processing device of claim 7, wherein the translation module translates the text data according to the target language of all of the voice transceivers to obtain a plurality of translated text materials, and the voice processing module Converting the translated text data into the speech translation results.

The translation processing device of claim 7, wherein the translation module determines whether an authorization key exists and determines whether the number of used characters exceeds an upper threshold.

The translation processing device of claim 8, further comprising: a second communication unit, wherein the translation module transmits the text data to a translation server through the second communication unit, and the translation server Obtain the translated text data.