TWI383317B - Audio playing apparatus with an interactive function and method thereof - Google Patents

Audio playing apparatus with an interactive function and method thereof

Info

Publication number
TWI383317B
Authority
TW
Taiwan
Prior art keywords
prompt
audio
control information
response
voice
Prior art date
Application number
TW98101607A
Other languages
Chinese (zh)
Other versions
TW201028915A (en
Inventor
Hsiao Chung Chou
li zhang Huang
Chuan Hong Wang
Original Assignee
Hon Hai Prec Ind Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Hon Hai Prec Ind Co Ltd
Priority to TW98101607A
Publication of TW201028915A
Application granted
Publication of TWI383317B

Landscapes

  • User Interface Of Digital Computer (AREA)

Description

Audio playback device with an interactive function and interaction method thereof

The present invention relates to an audio playback device with an interactive function and an interaction method thereof, and more particularly to an audio playback device that can ask the user questions and an interaction method thereof.

Many current audio playback devices can tell stories, read texts aloud, and play music. These stories, texts, and music are pre-recorded audio files stored in a storage unit, typically in formats such as AAC, AC-3, ATRAC3plus, MP3, or WMA9. Users can generally control playback only through operations such as "next", "previous", "pause", and "play". Consequently, such devices offer no way to interact with the user during playback, for example by asking the user questions.

In view of this, there is a need for an audio playback device with an interactive function and an interaction method thereof.

The audio playback device with an interactive function includes a storage unit, an input unit, an audio decoding unit, and an audio output unit. The storage unit stores at least one interactive data item and a prompt voice library. Each interactive data item includes a main audio, at least one question audio, and control information, and the prompt voice library records at least one prompt voice. The control information includes main control information and question control information in one-to-one correspondence with the question audio. The main control information records how the main audio is obtained; each piece of question control information defines how its corresponding question audio is obtained and how the next question control information is obtained. The audio playback device further includes: a playback control module that, when an interactive data item is played, obtains the control information of that item and obtains the main audio and the question audio according to the acquisition methods recorded in the main control information and the question control information, the main audio and the question audio being decoded by the audio decoding unit and output by the audio output unit; a prompt module that selects a prompt voice from the prompt voice library, the prompt voice being decoded by the audio decoding unit and output by the audio output unit; and a question link module that obtains the next question control information according to the acquisition method recorded in the current question control information, after which the playback control module obtains and plays the corresponding question audio.

In the interaction method of the audio playback device with an interactive function, the audio playback device provides a storage unit that stores at least one interactive data item and a prompt voice library. Each interactive data item includes a main audio, at least one question audio, and control information, and the prompt voice library records at least one prompt voice. The control information includes main control information and question control information in one-to-one correspondence with the question audio. The main control information records how the main audio is obtained; each piece of question control information defines how its corresponding question audio is obtained and how the next question control information is obtained. The method includes: obtaining an interactive data item from the storage unit, thereby obtaining its control information, and obtaining the main audio according to the main audio acquisition method in the control information; sending the main audio to an audio decoding unit for decoding and then playing it through an audio output unit; obtaining the first question control information; playing the question audio, specifically by obtaining the corresponding question audio according to the question audio acquisition method recorded in the question control information, sending the question audio to the audio decoding unit for decoding, and then playing it through the audio output unit; selecting a prompt voice from the prompt voice library, decoding it with the audio decoding unit, and outputting it through the audio output unit; and obtaining the next question control information according to the acquisition method recorded in the current question control information, and then performing the step of playing the question audio again.

Compared with the prior art, the audio playback device with an interactive function and its interaction method ask the user a question after playing a piece of audio, then randomly obtain a prompt voice containing an answer from a preset voice library and give an indication according to whether that prompt voice is correct. This not only deepens the user's understanding and impression of the played audio but also creates interaction between the user and the audio playback device, making the device more engaging.

FIG. 1 is a functional module diagram of an audio playback device with an interactive function (hereinafter, the audio playback device) according to a first embodiment. The audio playback device 10 includes a storage unit 11, a central processing unit 12, an audio decoding unit 13, an audio output unit 14, an input unit 15, and an indicating device 16. After playing a piece of audio, the audio playback device 10 can interact with the user, for example by playing the question audio corresponding to that audio to ask the user a question. The storage unit 11 stores at least one interactive data item 20 and a prompt voice library 24. Each interactive data item 20 (shown in FIG. 2 and FIG. 3) includes control information 21, a main audio 22, and at least one question audio 23. The main audio 22 may be a story, a text, a piece of music, or other audio. Each question audio 23 is audio prepared around the content of the main audio 22 for interacting with the user. The prompt voice library 24 includes at least one prompt voice. Each prompt voice prompts the user to answer and represents one prompt answer, which may be correct or incorrect. FIG. 5 is a schematic diagram of the prompt voice library 24.

The interactive data can take one of two structures. In the first structure, shown in FIG. 2, each interactive data item is a single file, and the control information 21, the main audio 22, the question audios 23, and the prompt voice library 24 corresponding to each question audio each form part of that file. The main audio 22 and each question audio 23 have an audio header that records information about the audio, such as playback duration, encoding format, and version number.

In the second structure, shown in FIG. 3, the control information 21, the main audio 22, each question audio 23, and the prompt voice library 24 corresponding to each question of an interactive data item are stored as separate files.

The format of the control information 21 is essentially the same in the two structures shown in FIG. 2 and FIG. 3. The control information 21 includes main control information 211 and question control information 212 in one-to-one correspondence with the question audio. The main control information 211 records the main audio acquisition method: in the structure of FIG. 2 it is the offset of the main audio 22 within the file, whereas in the structure of FIG. 3 it is the file name of the main audio 22. As shown in FIG. 4, each piece of question control information 212 records the acquisition method of its corresponding question audio 23 (hereinafter, the question audio acquisition method), link information (that is, the acquisition method of the next question control information), and the correct answer to the corresponding question audio 23.
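The control information described above can be pictured as a small record structure. The following is a minimal sketch only, not the layout used by the patent: the class names, field names, and the END_OF_QUESTIONS sentinel are hypothetical, and a locator is assumed to be an integer offset for the single-file structure of FIG. 2 or a file name for the separate-file structure of FIG. 3.

```python
from dataclasses import dataclass
from typing import List, Union

# Hypothetical sentinel for "there is no next question" in the link field.
END_OF_QUESTIONS = -1

# Assumption: an int is an offset inside the single file (FIG. 2 structure),
# a str is the name of a separate file (FIG. 3 structure).
Locator = Union[int, str]


@dataclass
class QuestionControlInfo:
    audio_locator: Locator   # how to obtain the question audio
    next_question: int       # link information: index of the next entry, or END_OF_QUESTIONS
    correct_answer: str      # for example "yes" or "no"


@dataclass
class ControlInfo:
    main_audio_locator: Locator            # main control information
    questions: List[QuestionControlInfo]   # one entry per question audio


def describe_locator(locator: Locator) -> str:
    """Say which of the two storage structures a locator belongs to."""
    if isinstance(locator, int):
        return f"offset {locator} within the single interactive-data file"
    return f"separate file named {locator!r}"


info = ControlInfo(
    main_audio_locator=128,
    questions=[QuestionControlInfo("q1.mp3", END_OF_QUESTIONS, "yes")],
)
```

Keeping the locator as either an offset or a file name lets the same control-information format serve both storage structures, which matches the statement that the format is essentially the same in both.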

The central processing unit 12 includes a playback control module 121, a prompt module 122, a prompt judging module 123, an indication module 124, and a question link module 125.

When an interactive data item is played, the playback control module 121 obtains its control information 21, obtains the main audio 22 according to the main audio acquisition method recorded in the main control information 211, and obtains the corresponding question audio 23 according to the question audio acquisition method recorded in the question control information 212. The main audio 22 and the question audio 23 are decoded by the audio decoding unit 13 and output by the audio output unit 14.

The prompt module 122 randomly selects a prompt voice from the prompt voice library 24; the prompt voice is decoded by the audio decoding unit 13 and output by the audio output unit 14.

The prompt judging module 123 compares the prompt answer represented by the randomly selected prompt voice with the correct answer recorded in the question control information and judges whether the prompt voice is a correct prompt or a wrong prompt. For example, if the correct answer recorded in the question control information 212 is "yes" and the prompt answer is "no", the prompt is a wrong prompt; if the correct answer is "yes" and the prompt answer is "yes", the prompt is a correct prompt.
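The random selection and the comparison described above can be sketched as follows. This is an illustrative sketch under assumptions: the function names and the dictionary layout of a library entry (an "audio" field and an "answer" field) are hypothetical and do not come from the patent.

```python
import random


def select_prompt(prompt_library, rng=random):
    """Randomly pick one prompt voice entry from the library (role of the prompt module)."""
    return rng.choice(prompt_library)


def is_correct_prompt(prompt_answer, correct_answer):
    """Compare the prompt answer with the recorded correct answer (role of the prompt judging module)."""
    return prompt_answer == correct_answer


# Hypothetical library with one "yes" prompt and one "no" prompt.
library = [
    {"audio": "prompt_yes.mp3", "answer": "yes"},
    {"audio": "prompt_no.mp3", "answer": "no"},
]

chosen = select_prompt(library, random.Random(0))
verdict = "correct prompt" if is_correct_prompt(chosen["answer"], "yes") else "wrong prompt"
```

Passing the random source as a parameter is a design choice made here so that the selection can be seeded in tests; the patent only states that the selection is random.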

The indication module 124 controls an indicating device to give a correct-prompt indication when the prompt is a correct prompt, and a wrong-prompt indication when the prompt is a wrong prompt. In the present invention, the indicating device 16 may be a voice device, an LED lamp, or the like.

After the indication module 124 has given its indication, the question link module 125 checks whether the acquisition method of the next question control information, recorded in the link information of the current question control information, equals a preset value. If it does, playback of the interactive data item ends. If it does not, the question link module 125 notifies the playback control module 121 to obtain the next question control information according to that acquisition method.
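The link-following behaviour of the question link module resembles walking a linked list that ends at a sentinel. The sketch below assumes, purely for illustration, that each link is an integer index into a list of questions and that the preset value is -1; the patent does not specify either.

```python
END = -1  # hypothetical preset value meaning "no next question"


def walk_questions(links, first=0):
    """Return the order in which questions are visited.

    links[i] is the link information of question i: the index of the
    next question, or END when question i is the last one.
    """
    order = []
    current = first
    while current != END:
        if len(order) >= len(links):
            # Guard against a malformed link chain that loops forever.
            raise ValueError("link information contains a cycle")
        order.append(current)
        current = links[current]
    return order


visit_order = walk_questions([2, END, 1])
```

Here question 0 links to question 2, question 2 links to question 1, and question 1 carries the preset value, so playback would end after it.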

FIG. 6 is a flowchart of the interaction method applied to the audio playback device of the first embodiment. First, when playback of an interactive data item starts, the playback control module 121 obtains the control information 21 of the item and obtains the main audio 22 according to the main audio acquisition method recorded in the main control information 211 (step S601).

The playback control module 121 sends the main audio 22 to the audio decoding unit 13 for decoding, and the audio output unit 14 then plays it (step S602).

After the main audio has finished playing, the playback control module 121 obtains the first question control information 212 and obtains the corresponding question audio 23 according to the question audio acquisition method recorded in it (step S603).

The audio decoding unit 13 decodes the question audio 23, and the audio output unit 14 then plays it (step S604).

The prompt module 122 randomly selects a prompt voice from the prompt voice library 24; the prompt voice is decoded by the audio decoding unit 13 and output by the audio output unit 14 (step S605).

The prompt judging module 123 compares the answer represented by the randomly selected prompt voice with the correct answer recorded in the question control information and judges whether the prompt voice is a correct prompt or a wrong prompt (step S606).

If the prompt voice is judged to be a correct prompt, the indication module 124 controls the indicating device 16 to give a correct-prompt indication; if the prompt voice is judged to be a wrong prompt, the indication module 124 controls the indicating device 16 to give a wrong-prompt indication (step S607).

After the indication module 124 has given its indication, the question link module 125 obtains the acquisition method of the next question control information from the link information in the question control information 212 (step S608).

The question link module 125 checks whether the obtained acquisition method of the next question control information equals a preset value (step S609). If it does, the operation ends.

If it does not, the playback control module 121 obtains the next question control information 212 according to that acquisition method and obtains the corresponding question audio 23 according to the question audio acquisition method recorded in it, and the flow returns to step S604 (step S610).
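Steps S601 to S610 can be summarized as a single loop. The following is a rough sketch under assumptions, not the patented implementation: the dictionary layout, the END sentinel, and the play and indicate callbacks (standing in for the decode-and-output path and the indicating device) are all hypothetical.

```python
import random

END = -1  # hypothetical preset value that ends the question chain


def run_first_embodiment(item, prompt_library, play, indicate, rng=random):
    """Play the main audio, then each linked question followed by a random prompt."""
    play(item["main_audio"])                       # S601-S602: obtain and play main audio
    index = 0                                      # S603: start with the first question
    while index != END:                            # S609: stop at the preset value
        question = item["questions"][index]
        play(question["audio"])                    # S604: play the question audio
        prompt = rng.choice(prompt_library)        # S605: random prompt voice
        play(prompt["audio"])
        is_correct = prompt["answer"] == question["correct_answer"]  # S606
        indicate(is_correct)                       # S607: correct or wrong indication
        index = question["next"]                   # S608, S610: follow the link


played, indications = [], []
item = {
    "main_audio": "story",
    "questions": [
        {"audio": "q0", "correct_answer": "yes", "next": 1},
        {"audio": "q1", "correct_answer": "no", "next": END},
    ],
}
library = [{"audio": "p_yes", "answer": "yes"}]
run_first_embodiment(item, library, played.append, indications.append)
```

With a one-entry library the run is deterministic: the "yes" prompt is a correct prompt for the first question and a wrong prompt for the second.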

FIG. 7 is a functional module diagram of the audio playback device with an interactive function according to a second embodiment. In the second embodiment, the central processing unit 12 further includes a response receiving module 126 and a response judging module 127. The response receiving module 126 receives and recognizes the input signal generated by the input unit 15 to determine the response to the question audio. The input unit 15 may be keys, a touch sensor, or a sound input device such as a microphone. If the input unit 15 consists of keys, a key can be defined for each possible user response, for example one key for a "yes" response and one key for a "no" response. If the input unit 15 is a sound input device, an utterance can be defined for each possible user response, for example one utterance for a "yes" response and one for a "no" response.

The response judging module 127 compares the response with the correct answer recorded in the question control information 212 to determine whether it is a correct response or a wrong response. For example, if the correct answer recorded in the question control information 212 is "yes" and the response receiving module 126 recognizes the user's response as "no", the response is a wrong response; if the recorded correct answer is "no" and the recognized response is "no", the response is a correct response.
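For the key-based input case, recognizing and judging a response might look like the sketch below. The key codes "KEY_YES" and "KEY_NO" and the function names are hypothetical; the patent only says that a key can be defined for each possible response.

```python
# Hypothetical key codes, one per possible user response.
KEY_TO_RESPONSE = {"KEY_YES": "yes", "KEY_NO": "no"}


def recognize_response(key_code):
    """Map an input signal to a response, or None if the key is not defined."""
    return KEY_TO_RESPONSE.get(key_code)


def is_correct_response(response, correct_answer):
    """A response is correct only if it was recognized and matches the recorded answer."""
    return response is not None and response == correct_answer


response = recognize_response("KEY_NO")
```

Treating an unrecognized key as a wrong response is an assumption of this sketch; the source does not say how undefined inputs are handled.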

The indication module 124 combines the judgments of the prompt judging module 123 and the response judging module 127 into a judgment result and controls the indicating device 16 to give an indication according to that result. There are four possible results. In the first, the prompt judging module 123 judges the prompt voice to be a correct prompt and the response judging module 127 judges the user's response to be a correct response. In the second, the prompt voice is a correct prompt and the user's response is a wrong response. In the third, the prompt voice is a wrong prompt and the user's response is a correct response. In the fourth, the prompt voice is a wrong prompt and the user's response is a wrong response. The indication module 124 controls the indicating device 16 to give a different indication for each result. Taking a toy as an example of the audio playback device: for the first result, the indication module 124 controls the indicating device 16 to nod; for the second, to shake its head; for the third, to extend its nose; and for the fourth, to blink.
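The four judgment results and the toy's indications form a simple two-by-two lookup. The sketch below uses hypothetical action names for the toy example; only the pairing of results to actions is taken from the description.

```python
def choose_indication(prompt_is_correct, response_is_correct):
    """Return the toy's action for each of the four judgment results."""
    actions = {
        (True, True): "nod",            # first result: correct prompt, correct response
        (True, False): "shake head",    # second result: correct prompt, wrong response
        (False, True): "extend nose",   # third result: wrong prompt, correct response
        (False, False): "blink",        # fourth result: wrong prompt, wrong response
    }
    return actions[(prompt_is_correct, response_is_correct)]


action = choose_indication(False, True)
```

A lookup table keyed on both judgments keeps the mapping in one place, so a different device (for example LED colors instead of toy movements) would only need a different table.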

FIG. 8 is a flowchart of the interaction method applied to the audio playback device of the second embodiment. Steps S801 to S806 of this embodiment are the same as steps S601 to S606 of the interaction method of the first embodiment and are not described again here. The response receiving module 126 then receives and recognizes, through the input unit 15, the user's response to the question (step S807).

The response judging module 127 compares the recognized response with the correct answer recorded in the question control information. If they match, the user's response is a correct response; if they do not match, the user's response is a wrong response (step S808).

The indication module 124 combines the judgments of the prompt judging module 123 and the response judging module 127 to produce a judgment result (step S809).

According to that judgment result, the indication module 124 controls the indicating device to give an indication in reply to the user's response (step S810).

After the indication has been given, the question link module 125 obtains the acquisition method of the next question control information from the link information in the question control information 212 (step S811).

The question link module 125 checks whether the obtained acquisition method of the next question control information equals a preset value (step S812). If it does, the operation ends.

If it does not, the playback control module 121 obtains the next question control information 212 according to that acquisition method and obtains the corresponding question audio 23 according to the question audio acquisition method recorded in it, and the flow returns to step S804 (step S813).

11‧‧‧Storage unit

12‧‧‧Central processing unit

13‧‧‧Audio decoding unit

14‧‧‧Audio output unit

15‧‧‧Input unit

16‧‧‧Indicating device

20‧‧‧Interactive data

21‧‧‧Control information

22‧‧‧Main audio

23‧‧‧Question audio

24‧‧‧Prompt voice library

211‧‧‧Main control information

212‧‧‧Question control information

121‧‧‧Playback control module

122‧‧‧Prompt module

123‧‧‧Prompt judging module

124‧‧‧Indication module

125‧‧‧Question link module

126‧‧‧Response receiving module

127‧‧‧Response judging module

FIG. 1 is a functional module diagram of the audio playback device with an interactive function according to the first embodiment;
FIG. 2 is a schematic diagram of the first structure of the interactive data;
FIG. 3 is a schematic diagram of the second structure of the interactive data;
FIG. 4 is a schematic diagram of the prompt voice library;
FIG. 5 is a schematic diagram of the question control information;
FIG. 6 is a flowchart of the interaction method applied to the audio playback device of the first embodiment;
FIG. 7 is a functional module diagram of the audio playback device with an interactive function according to the second embodiment; and
FIG. 8 is a flowchart of the interaction method applied to the audio playback device of the second embodiment.

Claims (12)

一種具有互動功能的音頻播放裝置,其包括一存儲單元、一輸入單元、一音頻解碼單元及一音頻輸出單元,其改良在於:該存儲單元存儲有至少一互動資料及至少一提示語音庫,每一互動資料包括一控制資訊、一主音頻及至少一問題音頻,該提示語音庫記錄了至少一個提示語音,該控制資訊包括一主控制資訊及與該問題音頻一一對應的問題控制資訊,該主控制資訊記錄有該主音頻獲取方式,該問題控制資訊定義有其對應問題音頻的獲取方式及下一問題音頻控制資訊的獲取方式;一播放控制模組,用於在播放所述一互動資料時,獲得該互動資料的控制資訊,並根據主控制資訊中記錄的主音頻獲取方式及問題控制資訊中記錄的問題音頻的獲取方式獲取主音頻及問題音頻,所述主音頻及問題音頻經該音頻解碼單元進行解碼後,由該音頻輸出單元輸出;一提示模組,用於從該提示語音庫中隨機選擇一提示語音,並經該音頻解碼單元進行解碼後,由該音頻輸出單元輸出;一問題連接模組,用於從問題控制資訊的下一問題控制資訊的獲取方式處獲取下一問題控制資訊,由播放控制模組獲取對應問題音頻進行播放。 An audio playback device with an interactive function, comprising a storage unit, an input unit, an audio decoding unit and an audio output unit, wherein the storage unit stores at least one interactive data and at least one prompt voice library, each An interactive data includes a control information, a main audio, and at least one question audio, the prompt voice library records at least one prompt voice, the control information includes a main control information and problem control information corresponding to the question audio one by one, The main control information record has the main audio acquisition mode, and the problem control information defines the acquisition mode of the corresponding problem audio and the acquisition method of the next question audio control information; a play control module is configured to play the interactive data Obtaining control information of the interactive data, and acquiring main audio and problem audio according to the main audio acquisition manner recorded in the main control information and the method for obtaining the problem audio recorded in the problem control information, where the main audio and the problem audio are After the audio decoding unit performs decoding, it is output by the audio output unit; The module is configured to randomly select a prompt voice from the prompt voice library, and after being decoded by the audio decoding unit, output by the audio output unit; and a question connection module for using the next problem control information The method for 
obtaining the problem control information acquires the next problem control information, and the playback control module acquires the corresponding question audio for playing. 如申請專利範圍第1項所述的具有互動功能的音頻播放裝置,其中,該問題控制資訊中還記錄該問題音頻的正確答案,該提示語音中包含一提示答案,該音頻播放裝置還包 括一提示判斷模組及一指示模組,該提示判斷模組用於將提示語音所表示的提示答案與問題控制資訊中記錄的正確答案相比,判斷該提示語音為正確提示或錯誤提示,該指示模組用於在提示判斷模組判斷該提示語音為正確提示時,控制一指示裝置做出正確提示的指示,在提示判斷模組判斷提示語音為錯誤提示時,控制該指示裝置做出錯誤提示的指示。 The audio playback device with interactive function according to claim 1, wherein the problem control information further records a correct answer of the question audio, the prompt voice includes a prompt answer, and the audio playback device further includes The prompt determination module and an indication module are configured to compare the prompt answer represented by the prompt voice with the correct answer recorded in the problem control information, and determine that the prompt voice is a correct prompt or an error prompt. The indication module is configured to control an indication device to make a correct prompt when the prompt determination module determines that the prompt voice is a correct prompt, and control the pointing device when the prompt determination module determines that the prompt voice is an error prompt An indication of the error message. 
The audio playback device with an interactive function according to claim 1, wherein the question control information further records the correct answer to the question audio, the prompt voice contains a prompt answer, and the audio playback device further comprises a prompt judgment module, a response receiving module, a response judgment module, and an indication module; the prompt judgment module is configured to compare the prompt answer indicated by the prompt voice with the correct answer recorded in the question control information and to judge whether the prompt voice is a correct prompt or a wrong prompt; the response receiving module is configured to receive and recognize a response generated by an input unit, to compare the response with the correct answer recorded in the question control information, and to determine whether the response is a correct response or a wrong response; the indication module is configured to combine the judgments of the prompt judgment module and the response judgment module into a judgment result and to control the indicating device to give an indication according to that judgment result.
The audio playback device with an interactive function according to claim 3, wherein combining the judgments of the prompt judgment module and the response judgment module yields four possible judgment results: first, the prompt judgment module judges the prompt in the prompt voice to be a correct prompt and the response judgment module judges the user's response to be a correct response; second, the prompt judgment module judges the prompt in the prompt voice to be a correct prompt and the response judgment module judges the user's response to be a wrong response; third, the prompt judgment module judges the answer in the prompt voice to be a wrong prompt and the response judgment module judges the user's response to be a correct response; fourth, the prompt judgment module judges the answer in the prompt voice to be a wrong prompt and the response judgment module judges the user's response to be a wrong response.
The audio playback device with an interactive function according to claim 1, wherein each piece of interactive data is a single file, and the main audio acquisition mode and the question audio acquisition mode are the offset values of the main audio and the question audio within the interactive data.
The audio playback device with an interactive function according to claim 1, wherein the main audio, each question audio, and the control information of each piece of interactive data are stored as separate files, and the main audio acquisition mode and the question audio acquisition mode are the file names of the main audio and the question audio.
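The two storage layouts in the preceding two claims (offsets inside one file versus separate named files) can be illustrated with a minimal Python sketch. It is not taken from the patent: the names `OffsetRef`, `read_by_offset`, and `read_by_name` are hypothetical, and the `length` field is an assumption, since the claims mention only an offset value and do not say how the end of an audio segment is found.

```python
import io
from dataclasses import dataclass
from typing import BinaryIO, Mapping


@dataclass
class OffsetRef:
    """Hypothetical 'acquisition mode' for the single-file layout."""
    offset: int  # byte offset of the audio inside the interactive data file
    length: int  # assumed field: the claims only mention the offset value


def read_by_offset(container: BinaryIO, ref: OffsetRef) -> bytes:
    """Single-file layout: seek to the recorded offset and read the audio bytes."""
    container.seek(ref.offset)
    return container.read(ref.length)


def read_by_name(files: Mapping[str, bytes], name: str) -> bytes:
    """Separate-file layout: the acquisition mode is simply a file name."""
    return files[name]


# Usage: one container holding control data, main audio, and one question audio.
container = io.BytesIO(b"CTRLmainaudioQ1audio")
main_audio = read_by_offset(container, OffsetRef(offset=4, length=9))
question_audio = read_by_offset(container, OffsetRef(offset=13, length=7))

# Usage: the same content stored as independent files (an in-memory stand-in).
files = {"main.mp3": b"mainaudio", "q1.mp3": b"Q1audio"}
```

In both layouts the control information only has to carry a small locator, so the playback logic can stay the same while the storage format changes.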
An interactive method applied to an audio playback device, the audio playback device providing a storage unit that stores at least one piece of interactive data and a prompt voice library, each piece of interactive data comprising a main audio, at least one question audio, and control information, the prompt voice library recording at least one prompt voice, the control information comprising main control information and question control information in one-to-one correspondence with the question audio, the main control information recording the main audio acquisition mode, and each question control information defining the acquisition mode of its corresponding question audio and the acquisition mode of the next question control information; the improvement being that the method comprises: obtaining a piece of interactive data from the storage unit, thereby obtaining the control information of that interactive data, and obtaining the main audio according to the main audio acquisition mode in the control information; transmitting the main audio to an audio decoding unit for decoding, then playing it through an audio output unit; obtaining the first question control information; playing the question audio, specifically: obtaining the corresponding question audio according to the question audio acquisition mode recorded in the question control information, transmitting the question audio to the audio decoding unit for decoding, then playing it through the audio output unit; randomly selecting a prompt voice from the prompt voice library and, after it is decoded by the audio decoding unit, outputting it through the audio output unit; and obtaining the next question control information from the acquisition mode of the next question control information recorded in the question control information, then performing the step of playing the question audio.
The interactive method applied to an audio playback device according to claim 7, wherein the question control information further records the correct answer to the question audio, the prompt voice expresses a prompt answer, and the method further comprises the steps of: after the prompt voice is output, comparing the prompt answer contained in the prompt voice with the correct answer recorded in the question control information, and judging whether the answer in the prompt voice is a correct prompt or a wrong prompt; and, when the prompt voice is judged to be a correct prompt, controlling an indicating device to give a correct-prompt indication, and, when the prompt voice is a wrong prompt, controlling the indicating device to give a wrong-prompt indication.
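The playback sequence of the method claim (main audio, then for each question the question audio followed by a randomly chosen prompt voice, then a jump to the next question control information) can be sketched as follows. This is an illustration under stated assumptions, not the patented implementation: `QuestionControl`, `run_session`, and the `play` callback are hypothetical names, `play` stands in for the decode-then-output steps, and treating `None` as the end of the chain is an assumption because the claim does not say how the loop terminates.

```python
import random
from dataclasses import dataclass
from typing import Callable, Dict, List, Optional


@dataclass
class QuestionControl:
    audio_ref: str           # acquisition mode of this question's audio
    next_ref: Optional[str]  # acquisition mode of the next question control info;
                             # None ending the chain is an assumption


def run_session(
    main_audio_ref: str,
    first_ref: Optional[str],
    controls: Dict[str, QuestionControl],
    prompts: List[str],
    play: Callable[[str], None],
    rng: random.Random,
) -> None:
    """Play the main audio, then walk the chain of question control records."""
    play(main_audio_ref)               # decode and output the main audio
    ref = first_ref                    # obtain the first question control info
    while ref is not None:
        control = controls[ref]
        play(control.audio_ref)        # decode and output the question audio
        play(rng.choice(prompts))      # randomly selected prompt voice
        ref = control.next_ref         # follow the link to the next question


# Usage with in-memory stand-ins for the storage unit and the audio output.
played: List[str] = []
controls = {
    "qc1": QuestionControl(audio_ref="q1.mp3", next_ref="qc2"),
    "qc2": QuestionControl(audio_ref="q2.mp3", next_ref=None),
}
prompts = ["hint_a.mp3", "hint_b.mp3"]
run_session("main.mp3", "qc1", controls, prompts, played.append, random.Random(0))
```

Because each question control record carries the locator of its successor, the questions form a linked chain and the player never needs a separate index of questions.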
The interactive method applied to an audio playback device according to claim 7, wherein the question control information further records the correct answer to the question audio, the prompt voice expresses a prompt answer, and the method further comprises the steps of: comparing the prompt answer contained in the prompt voice with the correct answer recorded in the question control information, and judging whether the answer in the prompt voice is a correct prompt or a wrong prompt; receiving and recognizing a response to the question audio; comparing the response with the correct answer recorded in the question control information to determine whether the response is a correct response or a wrong response; and combining the judgment of whether the prompt voice is a correct or wrong prompt with the judgment of whether the response is a correct or wrong response to produce a judgment result, and controlling the indicating device to give an indication according to that judgment result.
The interactive method applied to an audio playback device according to claim 9, wherein there are four possible judgment results: first, the prompt voice is a correct prompt and the user's response is a correct response; second, the prompt voice is a correct prompt and the user's response is a wrong response; third, the prompt voice is a wrong prompt and the user's response is a correct response; fourth, the prompt voice is a wrong prompt and the user's response is a wrong response.
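The four judgment results listed above are simply the combinations of two yes/no comparisons against the recorded correct answer. A minimal Python sketch, with hypothetical names (`Outcome`, `judge`) and with plain string equality standing in for the comparison, which the claims do not specify in detail:

```python
from enum import Enum


class Outcome(Enum):
    PROMPT_RIGHT_RESPONSE_RIGHT = 1
    PROMPT_RIGHT_RESPONSE_WRONG = 2
    PROMPT_WRONG_RESPONSE_RIGHT = 3
    PROMPT_WRONG_RESPONSE_WRONG = 4


def judge(prompt_answer: str, response: str, correct_answer: str) -> Outcome:
    """Combine the prompt judgment and the response judgment into one result."""
    prompt_ok = prompt_answer == correct_answer  # assumed: exact-match comparison
    response_ok = response == correct_answer     # assumed: exact-match comparison
    if prompt_ok:
        if response_ok:
            return Outcome.PROMPT_RIGHT_RESPONSE_RIGHT
        return Outcome.PROMPT_RIGHT_RESPONSE_WRONG
    if response_ok:
        return Outcome.PROMPT_WRONG_RESPONSE_RIGHT
    return Outcome.PROMPT_WRONG_RESPONSE_WRONG
```

The claims leave open what the indicating device does for each result; a caller would map each `Outcome` member to whatever indication the device supports.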
The interactive method applied to an audio playback device according to claim 7, wherein each piece of interactive data is a single file, and the main audio acquisition mode, the question audio acquisition mode, and the acquisition mode of the prompt voice library are the offset values of the main audio, the question audio, and the prompt voice library within the interactive data.
The interactive method applied to an audio playback device according to claim 7, wherein the main audio, each question audio, the control information, and each prompt voice library of each piece of interactive data are stored as separate files, and the main audio acquisition mode, the question audio acquisition mode, and the acquisition mode of each prompt voice library are the file names of the main audio, the question audio, and the prompt voice library.
TW98101607A 2009-01-16 2009-01-16 Audio playing apparatus with an interactive function and method thereof TWI383317B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
TW98101607A TWI383317B (en) 2009-01-16 2009-01-16 Audio playing apparatus with an interactive function and method thereof

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
TW98101607A TWI383317B (en) 2009-01-16 2009-01-16 Audio playing apparatus with an interactive function and method thereof

Publications (2)

Publication Number Publication Date
TW201028915A TW201028915A (en) 2010-08-01
TWI383317B true TWI383317B (en) 2013-01-21

Family

ID=44853847

Family Applications (1)

Application Number Title Priority Date Filing Date
TW98101607A TWI383317B (en) 2009-01-16 2009-01-16 Audio playing apparatus with an interactive function and method thereof

Country Status (1)

Country Link
TW (1) TWI383317B (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TWI603293B (en) * 2014-10-13 2017-10-21 由田新技股份有限公司 Method and apparatus for detecting blink

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TW434524B (en) * 1999-12-16 2001-05-16 Mustek Systems Inc Correlative real-time sound teaching method
TW579490B (en) * 2002-05-17 2004-03-11 Inventec Besta Co Ltd Language teaching method using computer writing approach
US20040254749A1 (en) * 2003-06-16 2004-12-16 Canon Kabushiki Kaisha Insulation verification system, insulation verification method, and storage medium

Also Published As

Publication number Publication date
TW201028915A (en) 2010-08-01

Similar Documents

Publication Publication Date Title
WO2007061749A2 (en) Methods, systems, and computer program products for speech assessment
JP2008209640A (en) Karaoke sound effect output system
TWI383317B (en) Audio playing apparatus with an interactive function and method thereof
JP4487632B2 (en) Performance practice apparatus and performance practice computer program
JP2007108524A (en) Voice input evaluation apparatus and method, and program
JP4622728B2 (en) Audio reproduction device and audio reproduction processing program
JP4516944B2 (en) Karaoke singing assistance system
CN101770705B (en) Audio playing device with interaction function and interaction method thereof
JPWO2014087571A1 (en) Information processing apparatus and information processing method
WO2018101458A1 (en) Sound collection device, content playback device, and content playback system
TWI392983B (en) Robot apparatus control system using a tone and robot apparatus
TWI383306B (en) Audio playing apparatus with an interactive function and method thereof
TWI383305B (en) Audio playing apparatus with an interactive function and method thereof
JP4516943B2 (en) Karaoke singing assistance system
JP6705409B2 (en) Music reproduction control device, music reproduction control method, and music reproduction control program
JP2017070370A (en) Hearing test device, hearing test method, and hearing test program
JP2007188175A (en) Server device, terminal device, and program
JP6428436B2 (en) Karaoke system, karaoke device, and voice data processing program
KR101682076B1 (en) Method for a learning file section playback using dynamic button
JP6498346B1 (en) Foreign language learning support system, foreign language learning support method and program
JP2019045575A (en) Language learning device, language learning method, and language learning program
JP2019109321A (en) Karaoke system
WO2024228340A1 (en) Information processing device, sound source separation processing method, and program
KR20110118933A (en) System and method for studying of piano play with encouragement comment
Nogueira et al. Speech recognition technology in CI rehabilitation

Legal Events

Date Code Title Description
MM4A Annulment or lapse of patent due to non-payment of fees