CN105469656A

CN105469656A - Spoken language learning system and operating method of the system

Info

Publication number: CN105469656A
Application number: CN201510821973.6A
Authority: CN
Inventors: 于拾全; 卫亚东; 田学红
Original assignee: Dongguan Fandou Information Technology Co Ltd
Current assignee: Dongguan Fandou Information Technology Co Ltd
Priority date: 2015-11-23
Filing date: 2015-11-23
Publication date: 2016-04-06

Abstract

The invention relates to a spoken language learning system. The system comprises an audio decoding module used for audio file decoding, a voice breakpoint searching module used for automatically calculating and searching a voice breakpoint in an audio, an audio playing module used for carrying out playing and playback on audio data, an adaptive recording module used for adaptively recording a user voice and a record playback module used for carrying out playback on a record. The audio decoding module is connected to the voice breakpoint searching module. And the voice breakpoint searching module is connected to the audio playing module. By using the spoken language learning system, a problem of simultaneously and interactively training listening and speaking capabilities during English learning is effectively solved.

Description

The How It Works of a kind of verbal learning system and this system

Technical field

The present invention relates to a kind of verbal learning system, and the How It Works of this system.

Background technology

The Learning demands of Oral English Practice, by the training of listening and speaking repeatedly, just can improve learning efficiency.And people, for the English audio file bought or download, are generally use unidirectional audio player at present, user can only train the ability of oneself listening, the ability that can not oneself is trained timely to say.

In view of this, necessaryly provide a kind of system that user can be allowed to utilize conventional audio file or network audio to realize the combined training listened, say, confirm, to improve learning efficiency.

Summary of the invention

A kind of verbal learning system that the present invention provides to solve the problem, comprise: for the audio decoder module of audio file decoding, for automatically calculating the voice interruption point search module finding voice breakpoint in audio frequency, for playing the audio playing module with plays back audio data, for recording the self-adaptation recording module of the voice of user adaptively, and for playback recording recording playback module, described audio decoder module and described voice interruption point search model calling, described voice interruption point search module is connected with described audio playing module.

Preferably, described audio decoder module supports the decoding of the audio files such as MP3 or MVA or online audio stream.

Preferably, the decoded data of random length is read in the support of described audio decoder module at every turn.

Preferably, described self-adaptation recording module has the noise reduction module of support voice noise reduction process.

Preferably, voice are saved in a recording file MicFile by described self-adaptation recording module, and described recording playback module can trigger described recording file MicFile automatically.

The present invention also provides a kind of How It Works of upper predicate learning system, and described How It Works comprises:

Step 1, audio decoder module are decoded to audio file;

Step 2, voice interruption point search module calculate the voice interruption point found in audio frequency automatically;

Step 3, audio playing module are play and plays back audio data;

Step 4, self-adaptation recording module record the voice of user adaptively;

The recording of step 5, recording playback module replaying user.

Preferably, in described step 2, described voice interruption point search module is based on whole voice data buffer memory or automatically calculate the voice interruption point found inside decoded data stream based on partial data stream.

Preferably, in described step 2, described voice interruption point search module uses energy threshold voice breaking point detection algorithm.

Preferably, in described step 4, if continue not occur efficient voice in physique very first time length T1, then automatically terminate to record; If there is efficient voice in very first time length T1, then enter quiet section of judgement, if continue a second time span T2 to occur quiet section, then automatically terminate to record.

Preferably, after described step 5, further comprising the steps of:

Step 6, described audio decoder module and voice interruption point search module carry out follow-up data decode and breaking point detection.

Beneficial effect of the present invention is: this verbal learning system efficiently solves the problem of the ability of interactive training listening and speaking simultaneously in English study.As long as just can realize the circuit training sentence by sentence of listening to, repeating, confirming based on common audio file or network audio stream, support simple sentence repeat playing function in addition, spoken learning efficiency can be significantly improved.

Accompanying drawing explanation

The learning system block schematic illustration that Fig. 1 provides for the embodiment of the present invention.

Embodiment

Below in conjunction with accompanying drawing, the present invention is further elaborated:

The invention provides a kind of verbal learning system.The input object of this verbal learning system is audio file, and wherein mainly voice are main, do not comprise lasting background music.

As shown in Figure 1, this verbal learning system comprises audio decoder module, for the decoding of audio file; Voice interruption point search module, for automatically calculating the voice interruption point found in audio frequency; Audio playing module, for playing and plays back audio data; Self-adaptation recording module, for recording the voice of user adaptively; Recording playback module, for the recording of replaying user.

Audio decoder module and voice interruption point search model calling, transfer to voice interruption point search module by decoded decoded data stream.Voice interruption point search module is connected with audio playing module, and the data of sound bite are passed to audio playing module.

The present invention also provides the How It Works of above-mentioned verbal learning system, comprises the following steps:

Step 1, audio decoder module are decoded to audio file;

Step 3, audio playing module are play and plays back audio data;

Step 4, self-adaptation recording module record the voice of user adaptively;

The recording of step 5, recording playback module replaying user.

Audio decoder module supports the decoding process of the audio files such as MP3 or MVA, also supports the decoding of online audio stream, and supports each decoded data reading random length.For different platforms, suitable cache size can be selected, each decoded data PcmData reading appropriate length.

Voice interruption point search module can based on whole voice data buffer memory, also can automatically calculate based on partial data stream the voice interruption point found inside decoded data stream, use algorithm to include but not limited to the energy threshold voice breaking point detection scheduling algorithm commonly used.As: based on the decoded data PcmData obtained, in units of 20ms or 40ms frame, carry out the calculating of speech energy and zero-crossing rate above, then by sliding window and threshold judgement, judge whether to there is voice interruption point.If there is voice interruption point, then record breakpoint information, and start recording module after audio playing module plays sound bite.If there is no voice interruption point, then directly pass to audio playing module data and play voice.

Voice playing module is play-overed after receiving data above, if do not have data, automatically stops playing.Voice playing module can play the sound bite data that voice interruption point search module above exports; Also can certain sound bite of specifying of repeat playing.

Self-adaptation recording module adaptive control record length length can save as audio file user speech input recording, and self-adaptation recording module has noise reduction module, support voice noise reduction process simultaneously.Wherein the algorithm of adaptive control duration includes but not limited to the quiet segment length control of speech terminals detection, self-adaptation etc.After self-adaptation recording module receives enabled instruction, start recording process, the data MicData that self-adaptation recording module buffer memory microphone apparatus exports, be saved in a recording file MicFile, breaking point detection carried out to data MicData simultaneously.If continue not occur efficient voice in very first time length T1, automatically terminate to record.If there is efficient voice in very first time length T1, then enter quiet section of judgement, if continue the second time span T2 to occur quiet section, then automatically terminate to record.After recording receives, automatically start recording playback module.

Recording playback module can trigger the recording file MicFile playing user automatically, confirms the oneself oneself repeating voice for user.Start playback file MicFile after recording playback module receives instruction, after finishing, comprise the following steps: notification audio decoder module and voice interruption point search module carry out follow-up data decode and breaking point detection.

If period user input instruction, then notification audio decoder module decoded data from the point of interruption position of preserving above.

This verbal learning system efficiently solves the problem of the ability of interactive training listening and speaking simultaneously in English study.As long as just can realize the circuit training sentence by sentence of listening to, repeating, confirming based on common audio file or network audio stream, support simple sentence repeat playing function in addition, spoken learning efficiency can be significantly improved.

The above embodiment, just preferred embodiments of the present invention, be not limit practical range of the present invention, therefore all equivalences done according to structure, feature and the principle described in the present patent application the scope of the claims change or modify, and all should be included in patent claim of the present invention.

Claims

1. a verbal learning system, it is characterized in that, described verbal learning system comprises: for the audio decoder module of audio file decoding, for automatically calculating the voice interruption point search module finding voice breakpoint in audio frequency, for playing the audio playing module with plays back audio data, for recording the self-adaptation recording module of the voice of user adaptively, and for playback recording recording playback module

Described audio decoder module and described voice interruption point search model calling, described voice interruption point search module is connected with described audio playing module.

2. verbal learning system as claimed in claim 1, is characterized in that, described audio decoder module supports the decoding of the audio files such as MP3 or MVA or online audio stream.

3. verbal learning system as claimed in claim 1, it is characterized in that, the decoded data of random length is read in the support of described audio decoder module at every turn.

4. the verbal learning system as described in claim 1 or 2 or 3, it is characterized in that, described self-adaptation recording module has the noise reduction module of support voice noise reduction process.

5. verbal learning system as claimed in claim 4, it is characterized in that, voice are saved in a recording file MicFile by described self-adaptation recording module, and described recording playback module can trigger described recording file MicFile automatically.

6. a How It Works for the arbitrary verbal learning system as described in claim 1-5, it is characterized in that, described How It Works comprises:

Step 1, audio decoder module are decoded to audio file;

Step 3, audio playing module are play and plays back audio data;

Step 4, self-adaptation recording module record the voice of user adaptively;

The recording of step 5, recording playback module replaying user.

7. How It Works as claimed in claim 6, is characterized in that, in described step 2, described voice interruption point search module is based on whole voice data buffer memory or automatically calculate the voice interruption point found inside decoded data stream based on partial data stream.

8. How It Works as claimed in claim 6, is characterized in that, in described step 2, described voice interruption point search module uses energy threshold voice breaking point detection algorithm.

9. How It Works as claimed in claim 6, is characterized in that, in described step 4, if continue not occur efficient voice in physique very first time length T1, then automatically terminates to record; If there is efficient voice in very first time length T1, then enter quiet section of judgement, if continue a second time span T2 to occur quiet section, then automatically terminate to record.

10. How It Works as claimed in claim 6, is characterized in that, after described step 5, further comprising the steps of: