WO2010133072A1 - Pronunciation evaluating device and method - Google Patents

Pronunciation evaluating device and method Download PDF

Info

Publication number
WO2010133072A1
WO2010133072A1 PCT/CN2009/075281 CN2009075281W WO2010133072A1 WO 2010133072 A1 WO2010133072 A1 WO 2010133072A1 CN 2009075281 W CN2009075281 W CN 2009075281W WO 2010133072 A1 WO2010133072 A1 WO 2010133072A1
Authority
WO
WIPO (PCT)
Prior art keywords
data
module
pronunciation
text data
evaluation
Prior art date
Application number
PCT/CN2009/075281
Other languages
French (fr)
Chinese (zh)
Inventor
陈淮琰
张斌
周骁
Original Assignee
无敌科技(西安)有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 无敌科技(西安)有限公司 filed Critical 无敌科技(西安)有限公司
Publication of WO2010133072A1 publication Critical patent/WO2010133072A1/en

Links

Classifications

    • GPHYSICS
    • G09EDUCATION; CRYPTOGRAPHY; DISPLAY; ADVERTISING; SEALS
    • G09BEDUCATIONAL OR DEMONSTRATION APPLIANCES; APPLIANCES FOR TEACHING, OR COMMUNICATING WITH, THE BLIND, DEAF OR MUTE; MODELS; PLANETARIA; GLOBES; MAPS; DIAGRAMS
    • G09B19/00Teaching not covered by other main groups of this subclass
    • G09B19/06Foreign languages
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/48Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use

Definitions

  • the invention relates to a sounding evaluation device and a method thereof, in particular to a sounding evaluation function.
  • Portable electronic consumer products such as electronic dictionaries, mobile phones or personal digital assistants
  • portable electronic consumer products are increasingly favored by many people, and the requirements for the functions of portable electronic consumer products are getting higher and higher.
  • voice learning is one of the most frequently used and needed services for many users.
  • the present invention has been made to solve the above-mentioned technical problems existing in the background art, and proposes a sounding evaluation apparatus and method thereof.
  • the present invention is a pronunciation evaluation device for evaluating user pronunciation, and the special feature is as follows:
  • the device comprises: a storage module, storing a plurality of text data and a sound information corresponding to the text data, The reading parameter and the phonetic symbol data;
  • the input module provides the user to select the text data;
  • the display module displays the selected text data;
  • the audio output module plays the pronunciation audio corresponding to the selected text data;
  • the recording module records the audio signal input by the user;
  • the audio analysis module analyzes the input audio signal, compares the analysis result with the pronunciation parameter and the phonetic data of the corresponding text data, generates an evaluation result based on the phonetic data, and compares the waveform of the input audio signal with the sound waveform of the corresponding text data.
  • the audio analysis module controls the display module to display the evaluation result and the waveform comparison result, and the input module, the recording module and the storage module are respectively connected to the audio analysis module, and the audio analysis module The audio output module and the display module are respectively connected.
  • the audio analysis module described above analyzes the input audio signal according to the LPC cepstrum technique.
  • the above phonetic data contains syllables, and the evaluation results include evaluation scores corresponding to each syllable.
  • the above text data contains words, words or sentences.
  • a pronunciation evaluation method for evaluating user pronunciation which is special in that: The method includes the following steps:
  • the input audio signal is analyzed according to the LPC cepstrum technique.
  • the above phonetic data includes syllables, and the evaluation results include evaluation scores corresponding to each syllable.
  • the above text data contains words, words or sentences.
  • the sounding evaluation device and the method thereof provided by the invention first display words to the user, analyze the user's pronunciation after recording the user's pronunciation, and compare the analysis result with the word pronunciation parameters and the phonetic data, and generate the basis
  • Figure 1 is a block diagram of an embodiment of the apparatus of the present invention
  • FIG. 2 is a schematic diagram 1 of a display interface of an embodiment of the device of the present invention.
  • FIG. 3 is a schematic diagram 2 of a display interface of an embodiment of the device of the present invention.
  • Figure 4 is a flow chart of the method of the present invention.
  • 1- pronunciation evaluation device 11-storage module, 111-character data, 112-audio audio, 113- pronunciation parameters, 114-phonetic data, 12-input module, 13 display module, 14 audio output module, 1-5 Recording module, 151-input audio signal, 16-audio analysis module, 171-test result, 172-waveform alignment result.
  • the sounding evaluation device 1 includes a storage module 11 , an input module 12 , a display module 13 , an audio output module 14 , a recording module 15 , and an audio analysis module 16 .
  • the storage module 11 stores a plurality of text data 111 and a read audio signal 112 corresponding to the text data 111 . , the pronunciation parameter 113 and the phonetic data 114.
  • Text data 111 contains words, words, or statements.
  • the pronunciation audio 112 is the correct pronunciation of the corresponding text data 111, and can be used as a reference for the user to learn the pronunciation.
  • the phonetic symbol data 114 contains syllables.
  • the phonetic data of the word “abbreviation” includes five syllables such as “money”, violent, “,”, “mu,” and “Sh ll".
  • the pronunciation parameter 113 may include a pronunciation audio.
  • the storage module 11 can be a built-in memory, a memory card or an optical storage medium.
  • the input module 12 provides the user with the choice of text data.
  • the input module 12 can be a keyboard, a button group, a cursor controller or a touch module.
  • the display module 13 displays the text data 111 selected by the user, and the audio output module 14 plays the pronunciation audio 112 corresponding to the selected text data 111, so that the user can first listen to the correct pronunciation of the selected text data.
  • the display module 13 displays the phonetic symbol data 114 and other related data of the selected text data.
  • the recording module 15 is activated to record the audio signal 151 input by the user.
  • the display module 13 can display a prompt message to remind the user to start reading the selected text data 111.
  • the audio analysis module 16 analyzes the input audio signal 151, and compares the analysis result with the pronunciation parameter 113 and the phonetic data 114 of the corresponding character data 111 to generate a result 171 based on the phonetic data 114.
  • the audio analysis module 16 compares the waveform of the input audio signal 151 with the waveform of the read audio 112 of the corresponding text data 111, and controls the display module 13 to display the evaluation result 171 and the waveform comparison result 172.
  • the evaluation result 171 includes the evaluation score of each syllable corresponding to the phonetic symbol data 114.
  • the user views the evaluation result 171 to know which syllable pronunciation has a problem, and further corrects the pronunciation. For example, when practicing the pronunciation of the word "abbreviation", if the evaluation score of ", ' ⁇ " is lower, the user should pay more attention to the pronunciation and practice more, and observe the waveform comparison result to understand their pronunciation and Where the difference in correct pronunciation is, the effect of effectively correcting the pronunciation has been achieved.
  • the audio analysis module 16 is implemented in a software manner in which the processor executes the related audio analysis program.
  • the pronunciation evaluation device displays the English word "abbreviation” selected by the user to practice the pronunciation, and the pronunciation evaluation device first plays the built-in pronunciation of the English word "abbreviation", so that the user can listen to the pronunciation first and learn against the phonetic data. . After that, the pronunciation evaluation device displays a prompt message to remind the user to read the word, and the pronunciation evaluation device records the user's voice.
  • the pronunciation evaluation device is turned on for audio analysis and waveform comparison, and the evaluation scores and waveform comparison results of each syllable are displayed. For example, the user's pronunciation has lower scores in the three syllables such as ",i,,, "i” and "sh weaving". After observing the waveform, it can be found that the users in the three syllables seem to be over-sounding, so Users can understand the shortcomings of their pronunciation.
  • the method can be applied to an electronic device having a signal processing function, such as a computer, a portable electronic dictionary, a mobile phone, or a personal digital assistant (PDA).
  • a signal processing function such as a computer, a portable electronic dictionary, a mobile phone, or a personal digital assistant (PDA).
  • PDA personal digital assistant
  • step S1 a plurality of character data and pronunciation audio, pronunciation parameters and phonetic data corresponding to the character data are provided, for example, the data is stored in advance in a storage module of the electronic device.
  • the phonetic data contains syllables, and the pronunciation parameters include LPC cepstral parameters of the audio tones or other audio analysis parameters.
  • the storage module is built-in memory, memory card or optical storage media.
  • step S2 the user selects the text data of the pronunciation to be practiced, displays the selected text data in step S3 and plays the pronunciation audio corresponding to the selected text data, and displays the phonetic symbol of the selected text data on the screen of the electronic device. Data or other relevant data.
  • the prompt information may be displayed on the screen of the electronic device to remind the user to read the selected text data, and the audio signal input by the user is recorded in step S4, and then the input is analyzed in step S5.
  • the audio signal is compared with the pronunciation parameter and the phonetic data of the corresponding text data to generate an evaluation result based on the phonetic data.
  • the evaluation result includes the evaluation score of each syllable corresponding to the phonetic data, and the evaluation score may be an absolute score, a relative score, or a weight score.
  • step S6 the waveform of the input audio signal and the waveform of the sound of the corresponding text data are compared.
  • the evaluation result and the waveform comparison result are displayed in step S7.
  • the user views the evaluation results to know which syllable pronunciation has a problem, and further corrects the pronunciation.
  • the waveform comparison results give the user a clearer picture of why the pronunciation is poor.

Landscapes

  • Engineering & Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Educational Administration (AREA)
  • Educational Technology (AREA)
  • General Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Electrically Operated Instructional Devices (AREA)

Abstract

A pronunciation evaluating device (1) and method are provided. The device (1) includes: a memorizing module (11) for memorizing a plurality of literal data(111), pronunciation voice information (112) corresponding to the literal data(111), pronunciation parameters (113) and phonetic symbol data (114); an inputting module (12) supplied for the user to select the literal data (111) by the user; a display module (13) for displaying the selected literal data (111); a pronunciation voice information output module (14) for playing the pronunciation voice information (112) corresponding to the selected literal data (111); a recording module (15) for recording the inputted voice information signal (151) by the user; an audio analyzing module (16) for analyzing the inputted voice information signal (151) and comparing the analyzed result with the pronunciation parameters (113) corresponding to the literal data (111) and the phonetic symbol data (114), generating the evaluating result based on the phonetic symbol data(114), and comparing the waveform of the inputted voice information signal (151) and the waveform of the pronunciation voice information (112) corresponding to the literal data (111); the display module (13) is controlled by the audio analyzing module (16) and displays the evaluated result (171) and the waveform compared result(172).

Description

发音评测装置及其方法 技术领域  Pronunciation evaluation device and method thereof
本发明涉及一种发音评测装置及其方法, 尤其是一种具有发音评测功能 背景技术  The invention relates to a sounding evaluation device and a method thereof, in particular to a sounding evaluation function.
目前, 电子信息化产业飞速发展, 便携式电子消费产品, 例如电子辞典、 手机或个人数字助理机等, 愈来愈受到许多人的青睐, 而人们对于便携式电 子消费产品功能的要求也是愈来愈高。 未来的便携式电子消费产品能否对于 使用者更好的服务, 已成为高科技产品技术发展的重点, 其中, 语音学习是 许多使用者最常使用且需要的服务之一。  At present, the electronic information industry is developing rapidly. Portable electronic consumer products, such as electronic dictionaries, mobile phones or personal digital assistants, are increasingly favored by many people, and the requirements for the functions of portable electronic consumer products are getting higher and higher. . Whether future portable electronic consumer products can better serve users has become the focus of high-tech product technology development. Among them, voice learning is one of the most frequently used and needed services for many users.
目前市面上的便携式电子消费产品中的英文学习功能虽然已经比较完 善, 可是英文学习的同步性和针对性的校正并不完善, 英语学习的发音问题 也一直都是很难攻克。 在先前技术中, 已有多种发音学习装置及方法, 例如 复读机等等, 通过让使用者跟读, 尽快提高口语学习。 然而, 对于使用者而 言, 仅仅跟读的学习效果有限。 因此, 如何让使用者对跟读中的错误有针对 性的去了解、 学习, 由此更快的掌握正确发音是一项亟待解决的问题。  Although the English learning function in the portable electronic consumer products currently on the market has been relatively complete, the synchronization and targeted correction of English learning are not perfect, and the pronunciation problem of English learning has always been difficult to overcome. In the prior art, there have been various pronunciation learning devices and methods, such as a repeater, etc., to improve oral learning as soon as possible by allowing the user to follow up. However, for the user, the learning effect of reading only is limited. Therefore, how to let the user understand and learn the mistakes in the reading, so that mastering the correct pronunciation faster is an urgent problem to be solved.
发明内容 Summary of the invention
本发明为解决背景技术中存在的上述技术问题, 而提出一种发音评测装 置及其方法。  The present invention has been made to solve the above-mentioned technical problems existing in the background art, and proposes a sounding evaluation apparatus and method thereof.
本发明的技术解决方案是: 本发明为一种发音评测装置, 用来评测使用 者发音, 其特殊之处在于: 该装置包含: 储存模块, 储存多个文字数据及对 应文字数据的读音音讯、 读音参数及音标数据; 输入模块, 提供使用者选择 文字数据; 显示模块, 显示所选文字数据; 音讯输出模块, 播放对应所选文 字数据的读音音讯; 录音模块, 记录使用者所输入音讯信号; 音频分析模块, 分析输入音讯信号, 并将分析结果与对应文字数据的读音参数及音标数据比 对, 产生基于音标数据的评测结果, 且比对输入音讯信号的波形及对应文字 数据的读音音讯波形, 音频分析模块控制显示模块显示评测结果及波形比对 结果, 输入模块、 录音模块和储存模块分别接音频分析模块, 音频分析模块 分别接音讯输出模块和显示模块。 The technical solution of the present invention is: The present invention is a pronunciation evaluation device for evaluating user pronunciation, and the special feature is as follows: The device comprises: a storage module, storing a plurality of text data and a sound information corresponding to the text data, The reading parameter and the phonetic symbol data; the input module provides the user to select the text data; the display module displays the selected text data; the audio output module plays the pronunciation audio corresponding to the selected text data; and the recording module records the audio signal input by the user; The audio analysis module analyzes the input audio signal, compares the analysis result with the pronunciation parameter and the phonetic data of the corresponding text data, generates an evaluation result based on the phonetic data, and compares the waveform of the input audio signal with the sound waveform of the corresponding text data. The audio analysis module controls the display module to display the evaluation result and the waveform comparison result, and the input module, the recording module and the storage module are respectively connected to the audio analysis module, and the audio analysis module The audio output module and the display module are respectively connected.
上述音频分析模块根据 LPC倒频谱技术分析输入音讯信号。  The audio analysis module described above analyzes the input audio signal according to the LPC cepstrum technique.
上述音标数据包含音节, 而评测结果包含对应每个音节的评测分数。 上述文字数据包含单字、 单词或语句。  The above phonetic data contains syllables, and the evaluation results include evaluation scores corresponding to each syllable. The above text data contains words, words or sentences.
一种发音评测方法, 用来评测使用者发音, 其特殊之处在于: 该方法包 含下列步骤:  A pronunciation evaluation method for evaluating user pronunciation, which is special in that: The method includes the following steps:
1 )提供多个文字数据及对应文字数据的读音音讯、读音参数及音标数据; 1) providing a plurality of text data and corresponding voice data, audio parameters, phonetic parameters and phonetic data;
2 ) 由使用者选择文字数据; 2) selecting text data by the user;
3 ) 显示所选文字数据并播放对应所选文字数据的读音音讯;  3) displaying the selected text data and playing the pronunciation audio corresponding to the selected text data;
4 ) 记录使用者所输入的音讯信号;  4) recording the audio signal input by the user;
5 )分析输入音讯信号, 并将分析结果与对应文字数据的读音参数及音标 数据比对, 产生基于音标数据的评测结果;  5) analyzing the input audio signal, and comparing the analysis result with the pronunciation parameter and the phonetic data of the corresponding text data to generate an evaluation result based on the phonetic data;
6 ) 比对输入音讯信号的波形及对应文字数据读音音讯的波形;  6) comparing the waveform of the input audio signal with the waveform of the corresponding text data reading audio;
7 ) 显示评测结果及波形比对结果。  7) Display the evaluation results and waveform comparison results.
上述步骤 5 ) 中是根据 LPC倒频谱技术分析输入音讯信号。  In the above step 5), the input audio signal is analyzed according to the LPC cepstrum technique.
上述音标数据包含音节, 评测结果包含对应每个音节的评测分数。  The above phonetic data includes syllables, and the evaluation results include evaluation scores corresponding to each syllable.
上述文字数据包含单字、 单词或语句。  The above text data contains words, words or sentences.
本发明提供的发音评测装置及其方法, 先显示字词让使用者念, 在记录 使用者发音后, 分析使用者的发音, 并将分析结果与字词读音参数及音标数 据比对, 产生基于音标数据的评测结果, 且比对使用者的发音波形及对应字 词的读音讯息的波形, 最后显示评测结果及波形比对结果。 由此, 让使用者 更清楚了解自己发音的问题, 提高使用者修正发音的效率。  The sounding evaluation device and the method thereof provided by the invention first display words to the user, analyze the user's pronunciation after recording the user's pronunciation, and compare the analysis result with the word pronunciation parameters and the phonetic data, and generate the basis The evaluation results of the phonetic data, and comparing the waveform of the user's pronunciation and the waveform of the pronunciation message of the corresponding word, finally display the evaluation result and the waveform comparison result. Thereby, the user is more clearly aware of the problem of his own pronunciation, and improves the efficiency of the user's correction of the pronunciation.
附图说明 DRAWINGS
图 1为本发明装置实施例的方块图;  Figure 1 is a block diagram of an embodiment of the apparatus of the present invention;
图 2为本发明装置实施例显示接口示意图一;  2 is a schematic diagram 1 of a display interface of an embodiment of the device of the present invention;
图 3为本发明装置实施例显示接口示意图二;  3 is a schematic diagram 2 of a display interface of an embodiment of the device of the present invention;
图 4为本发明方法流程图。  Figure 4 is a flow chart of the method of the present invention.
其中, 1-发音评测装置, 11-储存模块, 111-文字数据, 112-读音音讯, 113- 读音参数, 114-音标数据, 12输入模块, 13显示模块 , 14音讯输出模块, 1-5 录音模块, 151-输入音讯信号, 16-音频分析模块, 171-评测结果, 172-波形比 对结果。 Among them, 1- pronunciation evaluation device, 11-storage module, 111-character data, 112-audio audio, 113- pronunciation parameters, 114-phonetic data, 12-input module, 13 display module, 14 audio output module, 1-5 Recording module, 151-input audio signal, 16-audio analysis module, 171-test result, 172-waveform alignment result.
具体实施方式 detailed description
参见图 1, 发音评测装置 1包含储存模块 11、输入模块 12 显示模块 13 音讯输出模块 14、 录音模块 15及音频分析模块 16 储存模块 11储存多个文 字数据 111及对应文字数据 111的读音音讯 112、读音参数 113及音标数据 114。 文字数据 111 包含单字、 单词或语句。 读音音讯 112为对应的文字数据 111 的正确读音, 可作为使用者学习发音的参考。 音标数据 114包含音节, 例如, 单字" abbreviation" 的音标数据为,其包含, "錢" , 暴,, , , "慕,, 及 "Sh ll"等五个音节。 读音参数 113可包含读音音讯 112的 LPC倒频谱 参数或其它音频分析参数。 储存模块 11可为内建内存、 记忆卡或是光储存媒 体。  Referring to FIG. 1 , the sounding evaluation device 1 includes a storage module 11 , an input module 12 , a display module 13 , an audio output module 14 , a recording module 15 , and an audio analysis module 16 . The storage module 11 stores a plurality of text data 111 and a read audio signal 112 corresponding to the text data 111 . , the pronunciation parameter 113 and the phonetic data 114. Text data 111 contains words, words, or statements. The pronunciation audio 112 is the correct pronunciation of the corresponding text data 111, and can be used as a reference for the user to learn the pronunciation. The phonetic symbol data 114 contains syllables. For example, the phonetic data of the word "abbreviation" includes five syllables such as "money", violent, ",", "mu," and "Sh ll". The pronunciation parameter 113 may include a pronunciation audio. The LPC cepstrum parameter of 112 or other audio analysis parameters. The storage module 11 can be a built-in memory, a memory card or an optical storage medium.
输入模块 12提供使用者选择文字数据。 其中, 输入模块 12可为键盘、 按键组、 光标控制器或触控模块。 当使用者选择完毕后, 则显示模块 13显示 使用者所选文字数据 111, 且音讯输出模块 14播放对应所选文字数据 111的 读音音讯 112, 让使用者先聆听所选文字数据的正确发音, 而显示模块 13显 示所选文字数据的音标数据 114及其它相关数据。  The input module 12 provides the user with the choice of text data. The input module 12 can be a keyboard, a button group, a cursor controller or a touch module. After the user selects, the display module 13 displays the text data 111 selected by the user, and the audio output module 14 plays the pronunciation audio 112 corresponding to the selected text data 111, so that the user can first listen to the correct pronunciation of the selected text data. The display module 13 displays the phonetic symbol data 114 and other related data of the selected text data.
所选文字数据 111的读音音讯 112播放完毕后, 便启动录音模块 15来记 录使用者所输入音讯信号 151。 此时显示模块 13可显示提示信息来提醒使用 者可开始念所选文字数据 111。接着,音频分析模块 16分析输入音讯信号 151, 并将分析结果与对应文字数据 111的读音参数 113及音标数据 114比对,产生 基于音标数据 114的评测结果 171。 音频分析模块 16比对输入音讯信号 151 的波形及对应文字数据 111的读音音讯 112的波形, 并控制显示模块 13显示 评测结果 171及波形比对结果 172。 其中, 评测结果 171包含对应音标数据 114的每个音节的评测分数,使用者观看此评测结果 171来了解自己哪一个音 节发音有问题, 进一步仔细修正发音。 例如, 在练习单字" abbreviation" 的 发音时, 若 ",'§" 的评测分数较低, 则使用者需多注意 的发音并多 做练习, 并可观察波形比对结果来了解自己的发音和正确发音的差异在哪里, 已达到有效修正发音的功效。 其中, 上述音频分析模块 16以处理器执行相关音频分析程序的软件方式 实现。 After the read audio 112 of the selected text data 111 is played, the recording module 15 is activated to record the audio signal 151 input by the user. At this time, the display module 13 can display a prompt message to remind the user to start reading the selected text data 111. Next, the audio analysis module 16 analyzes the input audio signal 151, and compares the analysis result with the pronunciation parameter 113 and the phonetic data 114 of the corresponding character data 111 to generate a result 171 based on the phonetic data 114. The audio analysis module 16 compares the waveform of the input audio signal 151 with the waveform of the read audio 112 of the corresponding text data 111, and controls the display module 13 to display the evaluation result 171 and the waveform comparison result 172. The evaluation result 171 includes the evaluation score of each syllable corresponding to the phonetic symbol data 114. The user views the evaluation result 171 to know which syllable pronunciation has a problem, and further corrects the pronunciation. For example, when practicing the pronunciation of the word "abbreviation", if the evaluation score of ", '§" is lower, the user should pay more attention to the pronunciation and practice more, and observe the waveform comparison result to understand their pronunciation and Where the difference in correct pronunciation is, the effect of effectively correcting the pronunciation has been achieved. The audio analysis module 16 is implemented in a software manner in which the processor executes the related audio analysis program.
参见图 2, 发音评测装置显示使用者所选欲练习发音的英文单 字" abbreviation" , 发音评测装置会先播放英文单字" abbreviation" 的内建 发音, 让使用者先聆听发音并对照音标数据进行学习。 之后, 发音评测装置 显示提示信息, 提醒使用者可念此单字, 且发音评测装置会记录使用者的发 音。 参见图 3, 确认使用者输入发音完毕后, 发音评测装置便开启进行音频分 析及波形比对, 并显示每个音节的评测分数及波形比对结果。 例如, 使用者 的发音在 ",i,, , "i "及 "sh 織"等三个音节评测分数较低, 再观察波 形, 可发现在这三个音节部分使用者似乎发音过重, 因此使用者可了解自己 发音的缺点处。  Referring to FIG. 2, the pronunciation evaluation device displays the English word "abbreviation" selected by the user to practice the pronunciation, and the pronunciation evaluation device first plays the built-in pronunciation of the English word "abbreviation", so that the user can listen to the pronunciation first and learn against the phonetic data. . After that, the pronunciation evaluation device displays a prompt message to remind the user to read the word, and the pronunciation evaluation device records the user's voice. Referring to Figure 3, after confirming that the user has finished typing, the pronunciation evaluation device is turned on for audio analysis and waveform comparison, and the evaluation scores and waveform comparison results of each syllable are displayed. For example, the user's pronunciation has lower scores in the three syllables such as ",i,,, "i" and "sh weaving". After observing the waveform, it can be found that the users in the three syllables seem to be over-sounding, so Users can understand the shortcomings of their pronunciation.
参见图 4, 此方法可应用于具有信号处理功能的电子装置, 例如计算机、 可携式电子辞典、 手机或个人数字助理机 (PDA)。 此方法包含下列步骤。 在 步骤 S1 , 提供多个文字数据及对应这些文字数据的读音音讯、 读音参数及音 标数据, 例如, 在电子装置的储存模块中预先储存这些数据。 音标数据包含 音节, 读音参数包含读音音讯的 LPC倒频谱参数或其它音频分析参数。 储存 模块为内建内存、 记忆卡或是光储存媒体。 接着, 在步骤 S2由使用者选择欲 练习发音的文字数据, 在步骤 S3显示所选文字数据并播放对应所选文字数据 的读音音讯, 也可在电子装置的屏幕显示此所选文字数据的音标数据或其它 相关数据。  Referring to Fig. 4, the method can be applied to an electronic device having a signal processing function, such as a computer, a portable electronic dictionary, a mobile phone, or a personal digital assistant (PDA). This method contains the following steps. In step S1, a plurality of character data and pronunciation audio, pronunciation parameters and phonetic data corresponding to the character data are provided, for example, the data is stored in advance in a storage module of the electronic device. The phonetic data contains syllables, and the pronunciation parameters include LPC cepstral parameters of the audio tones or other audio analysis parameters. The storage module is built-in memory, memory card or optical storage media. Next, in step S2, the user selects the text data of the pronunciation to be practiced, displays the selected text data in step S3 and plays the pronunciation audio corresponding to the selected text data, and displays the phonetic symbol of the selected text data on the screen of the electronic device. Data or other relevant data.
经过预设时间让使用者阅读文字数据后, 可在电子装置的屏幕上显示提示信 息提醒使用者可念所选文字数据, 而在步骤 S4记录使用者所输入音讯信号, 接着在步骤 S5分析输入音讯信号, 并将分析结果与对应文字数据的读音参数 及音标数据比对, 产生基于音标数据的评测结果。 其中, 评测结果包含对应 音标数据的每个音节的评测分数, 评测分数可为绝对分数、 相对分数或权重 分数。在步骤 S6比对输入音讯信号的波形及对应文字数据的读音音讯的波形。 最后, 在步骤 S7显示评测结果及波形比对结果。 其中, 使用者观看此评测结 果来了解自己哪一个音节发音有问题, 进一步来仔细修正发音。 而波形比对 结果可使用者更清楚了解发音不佳的原因。 After the preset time allows the user to read the text data, the prompt information may be displayed on the screen of the electronic device to remind the user to read the selected text data, and the audio signal input by the user is recorded in step S4, and then the input is analyzed in step S5. The audio signal is compared with the pronunciation parameter and the phonetic data of the corresponding text data to generate an evaluation result based on the phonetic data. The evaluation result includes the evaluation score of each syllable corresponding to the phonetic data, and the evaluation score may be an absolute score, a relative score, or a weight score. In step S6, the waveform of the input audio signal and the waveform of the sound of the corresponding text data are compared. Finally, the evaluation result and the waveform comparison result are displayed in step S7. Among them, the user views the evaluation results to know which syllable pronunciation has a problem, and further corrects the pronunciation. The waveform comparison results give the user a clearer picture of why the pronunciation is poor.

Claims

权利要求书 Claim
1、 一种发音评测装置, 其特征在于: 该装置包含: 储存模块, 储存多个 文字数据及对应文字数据的读音音讯、 读音参数及音标数据; 输入模块, 提 供使用者选择文字数据; 显示模块, 显示所选文字数据; 音讯输出模块, 播 放对应所选文字数据的读音音讯; 录音模块, 记录使用者所输入音讯信号; 音频分析模块, 分析输入音讯信号, 并将分析结果与对应文字数据的读音参 数及音标数据比对, 产生基于音标数据的评测结果, 且比对输入音讯信号的 波形及对应文字数据的读音音讯波形, 音频分析模块控制显示模块显示评测 结果及波形比对结果, 所述输入模块、 录音模块和储存模块分别接音频分析 模块, 所述音频分析模块分别接音讯输出模块和显示模块。  A sounding evaluation device, comprising: a storage module, storing a plurality of text data and corresponding sound data, sound reading parameters and phonetic symbol data; an input module, providing a user to select text data; a display module , displaying the selected text data; an audio output module, playing the sound information corresponding to the selected text data; a recording module recording the audio signal input by the user; an audio analysis module, analyzing the input audio signal, and analyzing the result with the corresponding text data Comparing the pronunciation parameters and the phonetic data, generating the evaluation result based on the phonetic data, and comparing the waveform of the input audio signal with the sound waveform of the corresponding text data, the audio analysis module controls the display module to display the evaluation result and the waveform comparison result, The input module, the recording module and the storage module are respectively connected to the audio analysis module, and the audio analysis module is respectively connected to the audio output module and the display module.
2、 根据权利要求 1所述的发音评测装置, 其特征在于: 所述音频分析模 块根据 LPC倒频谱技术分析输入音讯信号。  2. The sounding evaluation apparatus according to claim 1, wherein: said audio analysis module analyzes an input audio signal according to an LPC cepstrum technique.
3、 根据权利要求 1所述的发音评测装置, 其特征在于: 所述音标数据包 含音节, 而评测结果包含对应每个音节的评测分数。  3. The sounding evaluation apparatus according to claim 1, wherein: said phonetic symbol data includes a syllable, and the evaluation result includes an evaluation score corresponding to each syllable.
4、 根据权利要求 1所述的发音评测装置, 其特征在于: 所述文字数据包 含单字、 单词或语句。  4. The sounding evaluation apparatus according to claim 1, wherein the character data includes a word, a word or a sentence.
5、 一种发音评测方法, 其特征在于: 该方法包含下列步骤:  5. A sounding evaluation method, characterized in that: the method comprises the following steps:
1 )提供多个文字数据及对应文字数据的读音音讯、读音参数及音标数据; 1) providing a plurality of text data and corresponding voice data, audio parameters, phonetic parameters and phonetic data;
2 ) 由使用者选择文字数据; 2) selecting text data by the user;
3 ) 显示所选文字数据并播放对应所选文字数据的读音音讯;  3) displaying the selected text data and playing the pronunciation audio corresponding to the selected text data;
4 ) 记录使用者所输入的音讯信号;  4) recording the audio signal input by the user;
5 )分析输入音讯信号, 并将分析结果与对应文字数据的读音参数及音标 数据比对, 产生基于音标数据的评测结果;  5) analyzing the input audio signal, and comparing the analysis result with the pronunciation parameter and the phonetic data of the corresponding text data to generate an evaluation result based on the phonetic data;
6 ) 比对输入音讯信号的波形及对应文字数据读音音讯的波形;  6) comparing the waveform of the input audio signal with the waveform of the corresponding text data reading audio;
7 ) 显示评测结果及波形比对结果。  7) Display the evaluation results and waveform comparison results.
6、 根据权利要求 5所述的发音评测方法, 其特征在于: 所述步骤 5 ) 中 是根据 LPC倒频谱技术分析输入音讯信号。  6. The sounding evaluation method according to claim 5, wherein: in the step 5), the input audio signal is analyzed according to the LPC cepstrum technique.
7、 根据权利要求 5所述的发音评测方法, 其特征在于: 所述音标数据包 含音节, 评测结果包含对应每个音节的评测分数。  7. The sounding evaluation method according to claim 5, wherein: the phonetic symbol data includes a syllable, and the evaluation result includes an evaluation score corresponding to each syllable.
8、 根据权利要求 5所述的发音评测方法, 其特征在于: 所述文字数据包 含单字、 单词或语句。  8. The sounding evaluation method according to claim 5, wherein the character data includes a word, a word or a sentence.
PCT/CN2009/075281 2009-05-21 2009-12-03 Pronunciation evaluating device and method WO2010133072A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN 200910022631 CN101551952A (en) 2009-05-21 2009-05-21 Device and method for evaluating pronunciation
CN200910022631.2 2009-05-21

Publications (1)

Publication Number Publication Date
WO2010133072A1 true WO2010133072A1 (en) 2010-11-25

Family

ID=41156176

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2009/075281 WO2010133072A1 (en) 2009-05-21 2009-12-03 Pronunciation evaluating device and method

Country Status (2)

Country Link
CN (1) CN101551952A (en)
WO (1) WO2010133072A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112230875A (en) * 2020-10-13 2021-01-15 华南师范大学 Artificial intelligence following reading method and following reading robot

Families Citing this family (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101551952A (en) * 2009-05-21 2009-10-07 无敌科技(西安)有限公司 Device and method for evaluating pronunciation
CN101739870B (en) * 2009-12-03 2012-07-04 深圳先进技术研究院 Interactive language learning system and method
CN104599680B (en) * 2013-10-30 2019-11-26 语冠信息技术(上海)有限公司 Real-time spoken evaluation system and method in mobile device
CN104485116B (en) * 2014-12-04 2019-05-14 上海流利说信息技术有限公司 Voice quality assessment equipment, method and system
CN105810211B (en) * 2015-07-13 2019-11-29 维沃移动通信有限公司 A kind of processing method and terminal of audio data
CN105118354A (en) * 2015-09-14 2015-12-02 百度在线网络技术(北京)有限公司 Data processing method for language learning and device thereof
CN107038910A (en) * 2016-02-04 2017-08-11 咸大根 Mat langue leaning system and method
CN107203539B (en) * 2016-03-17 2020-07-14 曾雅梅 Speech evaluating device of complex word learning machine and evaluating and continuous speech imaging method thereof
CN108039180B (en) * 2017-12-11 2021-03-12 广东小天才科技有限公司 Method for learning achievement of children language expression exercise and microphone equipment
CN110085260A (en) * 2019-05-16 2019-08-02 上海流利说信息技术有限公司 A kind of single syllable stress identification bearing calibration, device, equipment and medium
CN113838479B (en) * 2021-10-27 2023-10-24 海信集团控股股份有限公司 Word pronunciation evaluation method, server and system

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2002023613A (en) * 2000-07-05 2002-01-23 Tomoe Bosai Tsushin Kk Language learning system
JP2003162291A (en) * 2001-11-22 2003-06-06 Ricoh Co Ltd Language learning device
CN1510590A (en) * 2002-12-24 2004-07-07 英业达股份有限公司 Language learning system and method with visual prompting to pronunciaton
CN1512300A (en) * 2002-12-30 2004-07-14 艾尔科技股份有限公司 User's interface, system and method for automatically marking phonetic symbol to correct pronunciation
JP2004279799A (en) * 2003-03-17 2004-10-07 Univ Waseda Apparatus and method for speaking evaluation
JP2006084966A (en) * 2004-09-17 2006-03-30 Advanced Telecommunication Research Institute International Automatic evaluating device of uttered voice and computer program
CN101393694A (en) * 2008-10-21 2009-03-25 无敌科技(西安)有限公司 Chinese character pronunciation studying device with pronunciation correcting function of Chinese characters, and method therefor
CN101551952A (en) * 2009-05-21 2009-10-07 无敌科技(西安)有限公司 Device and method for evaluating pronunciation

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2002023613A (en) * 2000-07-05 2002-01-23 Tomoe Bosai Tsushin Kk Language learning system
JP2003162291A (en) * 2001-11-22 2003-06-06 Ricoh Co Ltd Language learning device
CN1510590A (en) * 2002-12-24 2004-07-07 英业达股份有限公司 Language learning system and method with visual prompting to pronunciaton
CN1512300A (en) * 2002-12-30 2004-07-14 艾尔科技股份有限公司 User's interface, system and method for automatically marking phonetic symbol to correct pronunciation
JP2004279799A (en) * 2003-03-17 2004-10-07 Univ Waseda Apparatus and method for speaking evaluation
JP2006084966A (en) * 2004-09-17 2006-03-30 Advanced Telecommunication Research Institute International Automatic evaluating device of uttered voice and computer program
CN101393694A (en) * 2008-10-21 2009-03-25 无敌科技(西安)有限公司 Chinese character pronunciation studying device with pronunciation correcting function of Chinese characters, and method therefor
CN101551952A (en) * 2009-05-21 2009-10-07 无敌科技(西安)有限公司 Device and method for evaluating pronunciation

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112230875A (en) * 2020-10-13 2021-01-15 华南师范大学 Artificial intelligence following reading method and following reading robot

Also Published As

Publication number Publication date
CN101551952A (en) 2009-10-07

Similar Documents

Publication Publication Date Title
WO2010133072A1 (en) Pronunciation evaluating device and method
KR101826714B1 (en) Foreign language learning system and foreign language learning method
RU2340007C2 (en) Device and method of pronouncing phoneme
US20060194181A1 (en) Method and apparatus for electronic books with enhanced educational features
CN108520650A (en) A kind of intelligent language training system and method
JP2014529771A (en) System and method for language learning
JP2007206317A (en) Authoring method and apparatus, and program
KR101164379B1 (en) Learning device available for user customized contents production and learning method thereof
CN111653265B (en) Speech synthesis method, device, storage medium and electronic equipment
US20100318346A1 (en) Second language pronunciation and spelling
JP2015036788A (en) Pronunciation learning device for foreign language
Kaiser Mobile-assisted pronunciation training: The iPhone pronunciation app project
JP7376071B2 (en) Computer program, pronunciation learning support method, and pronunciation learning support device
JP6166831B1 (en) Word learning support device, word learning support program, and word learning support method
WO2008035852A1 (en) Language traing method and apparatus by matching pronunciation and a character
KR20140078810A (en) Apparatus and method for learning rhythm pattern by using native speaker's pronunciation data and language data.
KR20140087956A (en) Apparatus and method for learning phonics by using native speaker's pronunciation data and word and sentence and image data
KR20140107067A (en) Apparatus and method for learning word by using native speakerpronunciation data and image data
KR20140028527A (en) Apparatus and method for learning word by using native speaker's pronunciation data and syllable of a word
KR20140079677A (en) Apparatus and method for learning sound connection by using native speaker's pronunciation data and language data.
KR100593590B1 (en) Automatic Content Generation Method and Language Learning Method
TWI281649B (en) System and method of dictation learning for correcting pronunciation
US20130149680A1 (en) Methods and systems for teaching a non-native language
CN102542854A (en) Method for learning pronunciation through role play
KR20140079245A (en) Apparatus and method for learning rhythm pattern by using native speaker's pronunciation data and language data.

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 09844824

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 09844824

Country of ref document: EP

Kind code of ref document: A1