CN108735220A - Language-learning intelligent earphone, intelligent interactive system, and human-machine interaction method - Google Patents
Language-learning intelligent earphone, intelligent interactive system, and human-machine interaction method
- Publication number
- CN108735220A (application number CN201810319502.9A)
- Authority
- CN
- China
- Prior art keywords
- voice
- document
- text
- module
- error
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/26—Speech to text systems
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/232—Orthographic correction, e.g. spell checking or vowelisation
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/02—Methods for producing synthetic speech; Speech synthesisers
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04M—TELEPHONIC COMMUNICATION
- H04M1/00—Substation equipment, e.g. for use by subscribers
- H04M1/72—Mobile telephones; Cordless telephones, i.e. devices for establishing wireless links to base stations without route selection
- H04M1/724—User interfaces specially adapted for cordless or mobile telephones
- H04M1/72403—User interfaces specially adapted for cordless or mobile telephones with means for local support of applications that increase the functionality
- H04M1/7243—User interfaces specially adapted for cordless or mobile telephones with means for local support of applications that increase the functionality with interactive means for internal management of messages
- H04M1/72433—User interfaces specially adapted for cordless or mobile telephones with means for local support of applications that increase the functionality with interactive means for internal management of messages for voice messaging, e.g. dictaphones
Abstract
The present invention provides a language-learning intelligent earphone, an intelligent interactive system, and a human-machine interaction method. First, the learner's original voice file is obtained and converted into an original text file using speech recognition technology. The original text file is then checked for errors to produce an error-point list. The error-point list is fed back, and each error point in the original text file is replaced with the correct machine-language text to form a corrected text file. Finally, speech synthesis converts the corrected text file into a corrected voice file, which is played to the learner. Through this cycle of correction, feedback, and replacement, when the learner hears their own voice in the playback, that part of their speech was error-free; when they hear the machine voice, there was a mistake there, to be corrected against the correct machine pronunciation and practiced further as needed.
Description
Technical field
The present invention relates to the technical field of intelligent interaction, and in particular to a language-learning intelligent earphone, an intelligent interactive system, and a human-machine interaction method.
Background technology
Earphones have developed from wired models into wireless Bluetooth headsets, and from devices purely for listening to music into intelligent earphones capable of human-computer interaction; their types and functions grow ever richer.
Meanwhile, as exchange between countries increases, so does people's demand for foreign languages. Existing language-learning products, such as some language-learning mobile apps, only provide listening and reading functions; they cannot give the learner specific feedback on their performance, so the learner cannot identify their particular gaps and remedy them in a targeted way.
Invention content
The object of the present invention is to provide a language-learning intelligent earphone, an intelligent interactive system, and a human-machine interaction method that can pinpoint the mistakes in a learner's spoken language and feed them back, so the learner can study them repeatedly and reinforce their practice.
To achieve the above object, the present invention provides a language-learning intelligent earphone, comprising: a recording module, a speech recognition module, an error correction module, a feedback module, a text processing module, a speech synthesis module, and a playback module;
wherein the recording module records the learner's practice speech to obtain an original voice file;
the speech recognition module converts the original voice file into an original text file using speech recognition technology;
the error correction module checks the original text file for errors and obtains an error-point list;
the feedback module feeds back the error-point list;
the text processing module, based on the fed-back error-point list, replaces each error point in the original text file with the correct machine-language text to form a corrected text file;
the speech synthesis module converts the corrected text file into a corrected voice file using speech synthesis technology;
and the playback module plays the corrected voice file through the earphone.
Optionally, the error correction module corrects the original text file using artificial intelligence technology.
Optionally, based on the fed-back error-point list, the text processing module also forms a revised text file, which contains, for each error point, an error-type text file and the corresponding correct machine-language text file; the speech synthesis module also converts the revised text file into speech, obtaining each error point's error-type voice file and corresponding machine voice file; and the playback module also plays the error-type voice files and machine voice files through the earphone.
Optionally, the text processing module intersperses the revised text file within the corrected text file, or places it before or after the corrected text file.
Optionally, the earphone further comprises a voice storage module for storing the corrected voice.
Optionally, the earphone further comprises a voice storage module for storing each error point's error-type voice and the corresponding correct machine-language voice.
Optionally, the intelligent earphone is a translation earphone, and the speech recognition module, speech synthesis module, and playback module are served by the translation earphone's own speech recognition, speech synthesis, and playback modules.
The present invention also provides a language-learning intelligent interactive system, comprising:
the above intelligent earphone and a mobile terminal;
wherein the mobile terminal communicates interactively with the intelligent earphone, and an APP on the mobile terminal synchronously obtains the original voice file and the corrected voice file.
The present invention further provides a human-machine interaction method for an intelligent earphone, comprising:
recording the learner's practice speech to obtain an original voice file;
converting the original voice file into an original text file using speech recognition technology;
checking the original text file for errors to obtain an error-point list;
feeding back the error-point list and replacing each error point in the original text file with the correct machine-language text to form a corrected text file;
converting the corrected text file into a corrected voice file using speech synthesis technology;
and playing the corrected voice file through the earphone.
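The steps above can be sketched end-to-end as a toy program. This is a hypothetical illustration, not the patented implementation: `asr` and `tts` are stand-ins for real speech-recognition and speech-synthesis engines, and error correction here uses the pre-stored-reference comparison variant described later.

```python
def asr(audio: str) -> str:
    """Stand-in for a speech recognizer: for this demo the 'audio' is
    already the transcript text."""
    return audio

def find_errors(text: str, reference: str) -> list:
    """Error-correction step, pre-stored-reference variant: compare word by
    word against a standard text and collect an error-point list."""
    errors = []
    for i, (got, want) in enumerate(zip(text.split(), reference.split())):
        if got != want:
            errors.append({"position": i, "wrong": got, "correct": want})
    return errors

def tts(text: str) -> str:
    """Stand-in for a speech synthesizer: tag the text so the output shows
    what would be synthesized."""
    return f"<speech>{text}</speech>"

def practice_round(audio: str, reference: str):
    original_text = asr(audio)                            # speech -> text
    error_points = find_errors(original_text, reference)  # error-point list
    words = original_text.split()
    for ep in error_points:                               # replace error points
        words[ep["position"]] = ep["correct"]
    corrected_text = " ".join(words)
    return error_points, tts(corrected_text)              # text -> speech

errors, corrected = practice_round("I has a apple", "I have an apple")
print(errors)     # two error points: has -> have, a -> an
print(corrected)  # <speech>I have an apple</speech>
```

A real device would pass audio buffers rather than strings, but the control flow (record, recognize, correct, replace, synthesize, play) is the same.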
Optionally, artificial intelligence technology is used to correct the original text file.
Optionally, when the error-point list is fed back, a revised text file is also formed, containing each error point's error-type text file and corresponding correct machine-language text file; speech synthesis is also applied to the revised text file to obtain each error point's error-type voice file and corresponding machine voice file, which are played through the earphone.
Optionally, during playback the error-type voice files and machine voice files are interspersed within the corrected voice, or placed before or after it.
Optionally, the corrected voice is also stored so it can be recalled for repeated study.
Optionally, the error-type voice files and machine voice files are also stored so they can be recalled for repeated study.
Optionally, the different error-type text files of each error point are distinguished by different codes.
Optionally, the error types of each error point include: word or syllable pronunciation errors, tone or stress errors, and pause or fluency errors.
Optionally, the method further comprises synchronizing the original voice file and the corrected voice file to an APP on a mobile phone.
Optionally, the method further comprises synchronizing the original voice file, the corrected voice file, and each error point's error-type voice file and corresponding machine voice file to an APP on a mobile phone.
Compared with the prior art, the beneficial effects of the present invention are:
1) In the present invention, the learner's original voice file is first obtained and converted into an original text file by speech recognition; the original text file is then corrected to obtain an error-point list; the error-point list is fed back, and each error point in the original text file is replaced with the correct machine-language text to form a corrected text file; finally, speech synthesis produces a corrected voice file from the corrected text file, which is played to the learner. Through correction, feedback, and replacement, when the learner hears their own voice in the playback, that part of their speech was error-free; when they hear the machine voice, there was a mistake there, to be corrected against the correct machine pronunciation and practiced further as needed.
2) In an alternative, the error-correction step can a) compare against the representative standard text data in a data packet pre-stored in the earphone, or b) use artificial intelligence technology for big-data error correction over the internet.
3) In an alternative, besides being played, the corrected voice file can also be stored in the earphone for the learner to recall and listen to repeatedly.
Still further optionally, either the corrected voice file or the corrected text file can be stored; storing the text file has the advantage of occupying less space, so that for a given storage capacity more practice sections can be kept.
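The storage saving can be made concrete with a rough back-of-envelope comparison (the audio format below is an assumption for illustration; the patent does not specify one):

```python
# Text for one practice sentence vs. ~1 second of raw audio at a typical
# ASR sample rate (16 kHz, 16-bit, mono) -- an assumed format.
text = "I have an apple"
text_bytes = len(text.encode("utf-8"))     # 15 bytes
audio_bytes_per_second = 16000 * 2 * 1     # 32000 bytes per second of audio
print(text_bytes, audio_bytes_per_second)  # 15 32000
```

Even for a one-second utterance, the text is thousands of times smaller, which is why storing corrected text and re-synthesizing on demand fits many more sections into the same memory.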
Still further optionally, the storage area can be located in the earphone, or in a mobile terminal that can form a local area network with the earphone.
4) In an alternative, the learner can hear not only the complete corrected voice but also each error point's error-type voice and the corresponding correct machine-language voice. To this end, when the error-point list is fed back, a revised text file is also formed, containing the error-type text and corresponding correct machine-language text of each error point. The revised text file and the corrected text file are then synthesized and played, separately or together. During playback, each error point's error-type voice and corresponding correct machine-language voice can be interspersed within the corrected voice, or placed before or after it.
Still further optionally, besides being played, these can also be sent to the APP on the mobile terminal, so the learner can open the APP and study anytime, anywhere. The transmission can occur: a) when the learner actively synchronizes results in the mobile terminal's APP; b) when the learner opens the APP and the earphone sends actively; or c) whenever the mobile terminal and earphone are on the same local area network, with the earphone sending actively.
5) In an alternative, the error types of each error point include word or syllable pronunciation errors, tone or stress errors, and pause or fluency errors. Refining the error types in this way tells the learner what to focus on, so mistakes can be corrected in a targeted manner.
Description of the drawings
Fig. 1 is a block diagram of the language-learning intelligent earphone in one embodiment of the invention;
Fig. 2 is a flowchart of the human-machine interaction method of the intelligent earphone in Fig. 1;
Fig. 3 is a flowchart of the human-machine interaction method of the intelligent earphone in another embodiment of the invention.
Detailed description
To make the above objects, features, and advantages of the invention clearer and easier to understand, specific embodiments of the invention are described in detail below with reference to the accompanying drawings.
Fig. 1 is a block diagram of the language-learning intelligent earphone in one embodiment of the invention.
Referring to Fig. 1, the language-learning intelligent earphone includes: a recording module 11, a speech recognition module 12, an error correction module 13, a feedback module 14, a text processing module 15, a speech synthesis module 16, and a playback module 17.
The function of each module is introduced individually below.
The recording module 11 records the learner's practice speech to obtain the original voice file.
The learner's practice speech may be one or a few sentences, one or several phrases, or even one or several words. Its content may be improvised on the spot, based on the time, place, and the learner's mood, or it may follow the learning content presented in an APP on an intelligent terminal.
The recorded original voice file can be stored in a voice storage module.
The speech recognition module 12 converts the original voice file into the original text file using speech recognition technology. This can be an existing technology, for example the widely used deep neural networks (DNN) or recurrent neural networks (RNN). Converting to a text file, first, occupies little storage space, and second, facilitates subsequent comparison and error correction against big data.
The error correction module 13 corrects the original text file and obtains the error-point list.
Error correction can follow two approaches. First, when the learner's practice speech follows the learning content presented in the APP on the intelligent terminal, it can be compared with the representative standard text data in pre-stored data. Second, when the content is improvised based on the time, place, and mood, artificial intelligence technology is used, which here means fetching and comparing against big data on the internet. The earphone can connect to a mobile terminal over Bluetooth and obtain the big data with the mobile terminal as a hotspot, or it can form a local area network with the mobile terminal and a router and obtain the big data through the router.
The error-point list records, for each error, the correspondence between its position and its erroneous content.
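One possible shape for such a list (the field names are hypothetical; the patent only requires that position and erroneous content correspond, and acquisition mode (a) below additionally lists the correct machine-language text alongside them):

```python
# Each entry pairs an error's position with its erroneous content; the
# correct machine-language text may be listed alongside.
error_point_list = [
    {"position": 3, "wrong": "sheep", "correct": "ship"},
    {"position": 7, "wrong": "reCORD", "correct": "REcord"},
]

for ep in error_point_list:
    print(f"word {ep['position']}: '{ep['wrong']}' -> '{ep['correct']}'")
```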
The feedback module 14 feeds back the error-point list. This forms a feedback mechanism: the learner not only practices speaking but also learns where the mistakes are, so they can practice those spots in a targeted way.
Based on the fed-back error-point list, the text processing module 15 replaces each error point in the original text file with the correct machine-language text to form the corrected text file.
The correct machine-language text used in the replacement can be obtained in two ways: a) the error correction module 13 obtains not only each error point's position and content but also, by comparison against the data packet or the big data, the correct machine-language text, which can be listed in the error-point list together with the position and erroneous content; or b) the text processing module 15 calls machine-language text pre-stored in the earphone.
The speech synthesis module 16 converts the corrected text file into the corrected voice file using speech synthesis technology. This can be an existing technology, for example WaveNet.
The playback module 17 plays the corrected voice file through the earphone.
Besides being played, the corrected voice file can also be stored in the voice storage module for the learner to recall repeatedly. The learner can browse and recall the original voice file and/or the corrected voice file with the buttons on the earphone 1, or through the APP on the intelligent terminal.
When the learner hears their own voice in what the earphone 1 plays, that part of their speech was error-free; when they hear the machine voice, there was a mistake there, to be corrected against the correct machine pronunciation and practiced further as needed.
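One way to realize the effect just described is to assemble the playback segment by segment, reusing the learner's own recording for correct words and synthesized machine speech for the error points. This splicing is a hypothetical rendering: the patent itself specifies synthesizing the corrected text file, so treat the sketch below as one possible way the learner ends up hearing their own voice only where they were correct.

```python
def assemble_playback(words, error_positions):
    """Return a play plan: (word, audio source) for each word, where the
    audio source is the learner's recording or the machine voice."""
    return [(w, "machine" if i in error_positions else "learner")
            for i, w in enumerate(words)]

# Words 1 and 2 were error points, so they come from the machine voice.
plan = assemble_playback(["I", "have", "an", "apple"], {1, 2})
print(plan)
# [('I', 'learner'), ('have', 'machine'), ('an', 'machine'), ('apple', 'learner')]
```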
The intelligent earphone 1 in this embodiment can be a translation earphone. That is, the intelligent earphone can be built by extending the functions of an existing translation earphone, in which case the speech recognition module 12, speech synthesis module 16, and playback module 17 are served by the translation earphone's own speech recognition, speech synthesis, and playback modules.
One embodiment of the invention also provides a language-learning intelligent interactive system, comprising:
the above intelligent earphone 1 and a mobile terminal;
wherein the mobile terminal communicates interactively with the intelligent earphone 1, and the APP on the mobile terminal synchronously obtains the original voice file and the corrected voice file.
The intelligent earphone 1 can communicate with the mobile terminal over Bluetooth, or it can form a local area network with the mobile terminal and a router and communicate within that network.
Fig. 2 is a flowchart of the human-machine interaction method of the intelligent earphone in Fig. 1. Each step is introduced below.
Referring to Fig. 2, step S1 is performed first: record the learner's practice speech to obtain the original voice file.
The practice speech may be one or a few sentences, one or several phrases, or even one or several words; its content may be improvised based on the time, place, and mood, or follow the learning content presented in the APP on the intelligent terminal. The recorded original voice file can be stored in the voice storage module.
Step S2 is then executed: convert the original voice file into the original text file using speech recognition technology, which can be an existing technology. Converting to a text file, first, occupies little storage space, and second, facilitates subsequent comparison and error correction against big data.
Next, step S3 is executed: correct the original text file and obtain the error-point list.
Error correction can follow two approaches. First, when the learner's practice speech follows the learning content presented in the APP on the intelligent terminal, it can be compared with the representative standard text data in pre-stored data. Second, when the content is improvised based on the time, place, and mood, artificial intelligence technology is used, meaning fetching and comparing against big data on the internet; the earphone can connect to the mobile terminal over Bluetooth and obtain the big data with the mobile terminal as a hotspot, or form a local area network with the mobile terminal and a router and obtain the big data through the router.
The error-point list records, for each error, the correspondence between its position and its erroneous content.
Step S4 is executed next: feed back the error-point list and replace each error point in the original text file with the correct machine-language text to form the corrected text file.
This feedback step forms a feedback mechanism: the learner not only practices speaking but also learns where the mistakes are, so they can practice those spots in a targeted way.
The correct machine-language text can be obtained in two ways: a) error-correction step S3 obtains not only each error point's position and content but also, by comparison against the data packet or the big data, the correct machine-language text, which can be listed in the error-point list together with the position and erroneous content; or b) machine-language text pre-stored in the earphone is called.
Step S5 is executed: convert the corrected text file into the corrected voice file using speech synthesis technology, which can be an existing technology.
Step S6 is then executed: play the corrected voice file through the earphone, using existing playback technology.
The intelligent earphone in this embodiment can be a translation earphone, i.e., built by extending an existing translation earphone, in which case speech recognition step S2, speech synthesis step S5, and playback step S6 are completed by the translation earphone's speech recognition, speech synthesis, and playback modules.
Another embodiment of the invention provides an intelligent earphone for foreign-language speaking practice. Compared with the previous embodiment, it differs in that: 1) based on the fed-back error-point list, the text processing module 15 also forms a revised text file, containing each error point's error-type text file and corresponding correct machine-language text file; 2) the speech synthesis module 16 also converts the revised text file into speech, obtaining each error point's error-type voice file and corresponding machine voice file; and 3) the playback module 17 also plays the error-type voice files and machine voice files through the earphone.
As needed, the text processing module 15 can intersperse the revised text file within the corrected text file, or place it before or after the corrected text file.
The error types of each error point may include: word or syllable pronunciation errors, tone or stress errors, and pause or fluency errors. In the text file, different error types can be distinguished by different codes, or by different storage addresses. Word or syllable pronunciation errors, tone or stress errors, and fluency errors can be placed before or after the corrected text file. A pause error can be interspersed within the corrected text file, so that the earphone later plays "there should be no pause here" to prompt the user. The criterion for a fluency error can be that the errors account for a certain proportion of the whole passage, for example 80% or more, in which case "this passage is not fluent" is played to prompt the learner.
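The code-based distinction and the fluency criterion can be sketched as follows. The specific code values are assumptions for illustration; the patent only requires that the codes differ:

```python
# Assumed numeric codes for the three error types; only distinctness matters.
ERROR_CODES = {
    "pronunciation": 1,   # word or syllable pronunciation error
    "tone_or_stress": 2,  # tone or stress error
    "pause_or_fluency": 3,  # pause or fluency error
}

def passage_not_fluent(error_words, total_words, threshold=0.8):
    """Fluency criterion: flag a passage when errors cover e.g. 80% of it."""
    return error_words / total_words >= threshold

print(passage_not_fluent(9, 10))  # True  -> play "this passage is not fluent"
print(passage_not_fluent(3, 10))  # False -> handle errors individually
```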
Fig. 3 is a flowchart of the human-machine interaction method of the intelligent earphone in another embodiment of the invention. As can be seen from Fig. 3, the flow is roughly the same as in Fig. 2, differing in that:
1) in step S4', when the error-point list is fed back, a revised text file is also formed, containing each error point's error-type text file and corresponding correct machine-language text file;
2) in step S5', speech synthesis is also applied to the revised text file to obtain each error point's error-type voice file and corresponding machine voice file;
3) in step S6', the error-type voice files and machine voice files are also played through the earphone.
In the revised text file, the error types of each error point may include word or syllable pronunciation errors, tone or stress errors, and pause or fluency errors. In the text file, different error types can be distinguished by different codes, or by different storage addresses. Word or syllable pronunciation errors, tone or stress errors, and fluency errors can be placed before or after the corrected text file. A pause error can be interspersed within the corrected text file, so that the earphone later plays "there should be no pause here" to prompt the user. The criterion for a fluency error can be that the errors account for a certain proportion of the whole passage, for example 80% or more, in which case "this passage is not fluent" is played to prompt the learner.
Besides being played, each error point's error-type voice file and corresponding machine voice file can also be saved for the learner to recall. The learner can browse and recall them with the buttons on the earphone, or through the APP of the intelligent terminal.
The intelligent earphone in this embodiment can also be a translation earphone, in which case the new speech synthesis step S5' and playback step S6' can be completed by the translation earphone's speech synthesis and playback modules.
Although the present disclosure is as above, the invention is not limited thereto. Any person skilled in the art may make various changes and modifications without departing from the spirit and scope of the invention; the protection scope of the invention shall therefore be subject to the scope defined by the claims.
Claims (18)
1. A language-learning intelligent earphone, characterized by comprising: a recording module, a speech recognition module, an error correction module, a feedback module, a text processing module, a speech synthesis module, and a playback module;
wherein the recording module records the learner's practice speech to obtain an original voice file;
the speech recognition module converts the original voice file into an original text file using speech recognition technology;
the error correction module corrects the original text file and obtains an error-point list;
the feedback module feeds back the error-point list;
the text processing module, based on the fed-back error-point list, replaces each error point in the original text file with the correct machine-language text to form a corrected text file;
the speech synthesis module converts the corrected text file into a corrected voice file using speech synthesis technology;
and the playback module plays the corrected voice file through the earphone.
2. The language-learning intelligent earphone according to claim 1, characterized in that the correction module uses artificial intelligence technology to correct errors in the original text file.
3. The language-learning intelligent earphone according to claim 1, characterized in that the text processing module, based on the fed-back error-point list, also forms a revision text file; the revision text file comprises, for each error point, an error-type text file and a corresponding correct machine-language text file; the speech synthesis module also converts the revision text file into voice using speech synthesis technology, obtaining an error-type voice file and a corresponding machine voice file for each error point; and the playback module also plays the error-type voice file and the machine voice file through the earphone.
4. The language-learning intelligent earphone according to claim 3, characterized in that the text processing module intersperses the revision text file within the corrected text file, or places it before or after the corrected text file.
5. The language-learning intelligent earphone according to claim 1, characterized by further comprising a voice storage module for storing the corrected voice.
6. The language-learning intelligent earphone according to claim 3, characterized by further comprising a voice storage module for storing the error-type voice and the corresponding correct machine-language voice of each error point.
7. The language-learning intelligent earphone according to any one of claims 1 to 6, characterized in that the intelligent earphone is a translation earphone, and the speech recognition module, the speech synthesis module, and the playback module are served by the speech recognition module, speech synthesis module, and playback module of the translation earphone.
8. A language-learning intelligent interaction system, characterized by comprising:
the intelligent earphone according to any one of claims 1 to 7, and a mobile terminal;
wherein the mobile terminal is in interactive communication with the intelligent earphone, and the APP on the mobile terminal synchronously obtains the original voice file and the corrected voice file.
9. A human-machine interaction method for an intelligent earphone, characterized by comprising:
recording a learner's practice speech to obtain an original voice file;
converting the original voice file into text using speech recognition technology to obtain an original text file;
correcting errors in the original text file to obtain an error-point list;
feeding back the error-point list and replacing each error point in the original text file with correct machine-language text, forming a corrected text file;
converting the corrected text file into a voice file using speech synthesis technology to obtain a corrected voice file;
and playing the corrected voice file through the earphone.
10. The human-machine interaction method according to claim 9, characterized in that artificial intelligence technology is used to correct errors in the original text file.
11. The human-machine interaction method according to claim 9, characterized in that feeding back the error-point list also forms a revision text file; the revision text file comprises, for each error point, an error-type text file and a corresponding correct machine-language text file; the revision text file is also converted into voice using speech synthesis technology, obtaining an error-type voice file and a corresponding machine voice file for each error point; and the error-type voice file and the machine voice file are played through the earphone.
12. The human-machine interaction method according to claim 11, characterized in that, during playback, the error-type voice file and the machine voice file are interspersed within the corrected voice, or placed before or after the corrected voice.
13. The human-machine interaction method according to claim 9, characterized in that the corrected voice is also stored so that it can be retrieved repeatedly for study.
14. The human-machine interaction method according to claim 11, characterized in that the error-type voice file and the machine voice file are also stored so that they can be retrieved repeatedly for study.
15. The human-machine interaction method according to claim 11, characterized in that the different error-type text files of each error point are distinguished by different codes.
16. The human-machine interaction method according to claim 11 or 15, characterized in that the error types of each error point include: word or syllable pronunciation errors, tone or stress errors, and pause or unfamiliarity errors.
17. The human-machine interaction method according to claim 9, characterized by further comprising synchronizing the original voice file and the corrected voice file to the APP on a mobile phone.
18. The human-machine interaction method according to claim 11, characterized by further comprising synchronizing the original voice file, the corrected voice file, and the error-type voice file and corresponding machine voice file of each error point to the APP on a mobile phone.
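Claims 4, 12, 15, and 16 describe how the per-error revision material is coded (a distinct code per error type) and where it is placed relative to the corrected version (interspersed within it, or before/after it). Both ideas can be sketched as below; the concrete codes, names, and assembly logic are illustrative assumptions, not part of the claims, which only require that different error types carry different codes.

```python
from enum import Enum

class ErrorType(Enum):
    # Hypothetical codes: the claims only require that different
    # error types be distinguished by different codes.
    PRONUNCIATION = 1  # word or syllable pronounced incorrectly
    TONE_STRESS = 2    # wrong tone or stress
    PAUSE = 3          # improper pause / hesitation from unfamiliarity

def assemble(corrected_segments, notes, mode="intersperse"):
    """Combine corrected segments with per-error revision notes.
    mode: "intersperse" puts each note right after its segment;
    "before"/"after" group all notes before or after the whole text."""
    flat_notes = [n for n in notes if n is not None]
    if mode == "intersperse":
        out = []
        for seg, note in zip(corrected_segments, notes):
            out.append(seg)
            if note is not None:
                out.append(note)
        return out
    if mode == "before":
        return flat_notes + list(corrected_segments)
    return list(corrected_segments) + flat_notes

segments = ["I went", "to school"]
notes = [(ErrorType.PRONUNCIATION, "went"), None]
print(assemble(segments, notes, mode="after"))
```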
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810319502.9A CN108735220A (en) | 2018-04-11 | 2018-04-11 | A kind of language learning intelligent earphone, intelligent interactive system and man-machine interaction method |
Publications (1)
Publication Number | Publication Date |
---|---|
CN108735220A true CN108735220A (en) | 2018-11-02 |
Family
ID=63940747
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201810319502.9A Pending CN108735220A (en) | 2018-04-11 | 2018-04-11 | A kind of language learning intelligent earphone, intelligent interactive system and man-machine interaction method |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN108735220A (en) |
Cited By (3)

Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109462790A (en) * | 2018-11-14 | 2019-03-12 | 王美金 | A kind of artificial intelligence wears mill ear financial payment translation earphone cloud system and method |
CN114974221A (en) * | 2022-04-29 | 2022-08-30 | 中移互联网有限公司 | Speech recognition model training method and device and computer readable storage medium |
CN114974221B (en) * | 2022-04-29 | 2024-01-19 | 中移互联网有限公司 | Speech recognition model training method and device and computer readable storage medium |
Citations (13)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101739870A (en) * | 2009-12-03 | 2010-06-16 | 深圳先进技术研究院 | Interactive language learning system and method |
CN103000052A (en) * | 2011-09-16 | 2013-03-27 | 上海先先信息科技有限公司 | Man-machine interactive spoken dialogue system and realizing method thereof |
CN203761458U (en) * | 2014-02-24 | 2014-08-06 | 汉诚信息技术(上海)有限公司 | Interactive type automatic voice response system |
CN104123858A (en) * | 2014-07-30 | 2014-10-29 | 广东小天才科技有限公司 | Method and device for error detection and correction during back-reading lesson text |
CN105025178A (en) * | 2014-04-18 | 2015-11-04 | 北京艾沃信通讯技术有限公司 | Method and system for converting interactive voice response into interactive text response |
CN105304080A (en) * | 2015-09-22 | 2016-02-03 | 科大讯飞股份有限公司 | Speech synthesis device and speech synthesis method |
CN106528616A (en) * | 2016-09-30 | 2017-03-22 | 厦门快商通科技股份有限公司 | Language error correcting method and system for use in human-computer interaction process |
CN106847288A (en) * | 2017-02-17 | 2017-06-13 | 上海创米科技有限公司 | The error correction method and device of speech recognition text |
CN107133222A (en) * | 2017-04-17 | 2017-09-05 | 中译语通科技(北京)有限公司 | A kind of real-time language conversion equipment and conversion method based on heterogeneous framework |
CN107293284A (en) * | 2017-07-27 | 2017-10-24 | 上海传英信息技术有限公司 | A kind of phoneme synthesizing method and speech synthesis system based on intelligent terminal |
CN107622054A (en) * | 2017-09-26 | 2018-01-23 | 科大讯飞股份有限公司 | The error correction method and device of text data |
CN107644637A (en) * | 2017-03-13 | 2018-01-30 | 平安科技(深圳)有限公司 | Phoneme synthesizing method and device |
CN107798931A (en) * | 2017-12-05 | 2018-03-13 | 上海电机学院 | A kind of intelligent children education learning system and method |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US11627221B2 (en) | Semiautomated relay method and apparatus | |
US10255918B2 (en) | Command and control of devices and applications by voice using a communication base system | |
CN105869626B (en) | A kind of method and terminal of word speed automatic adjustment | |
US9547642B2 (en) | Voice to text to voice processing | |
US10217466B2 (en) | Voice data compensation with machine learning | |
US9924012B2 (en) | Apparatus for answering a phone call when a recipient of the phone call decides that it is inappropriate to talk, and related method | |
US20110270601A1 (en) | Universal translator | |
US20190130176A1 (en) | Performing artificial intelligence sign language translation services in a video relay service environment | |
US11528568B1 (en) | Assisted hearing aid with synthetic substitution | |
KR102044689B1 (en) | System and method for creating broadcast subtitle | |
CN108093653B (en) | Voice prompt method, recording medium and voice prompt system | |
KR101819459B1 (en) | Voice recognition system and apparatus supporting voice recognition error correction | |
CN110166729A (en) | Cloud video-meeting method, device, system, medium and calculating equipment | |
US20150287332A1 (en) | Distance Education Method and Server Device for Providing Distance Education | |
CN106981289A (en) | A kind of identification model training method and system and intelligent terminal | |
CN108735220A (en) | A kind of language learning intelligent earphone, intelligent interactive system and man-machine interaction method | |
Liao et al. | Formosa speech recognition challenge 2020 and Taiwanese across Taiwan corpus | |
CN107766333A (en) | A kind of intelligent translation apparatus, system and method | |
CN103646645A (en) | Method based on voice translation text output | |
CN110915239A (en) | On-line automatic audio transcription for hearing aid users | |
US20200075000A1 (en) | System and method for broadcasting from a group of speakers to a group of listeners | |
CN102104657A (en) | Alarm clock reminding method, device and mobile terminal | |
CN114154636A (en) | Data processing method, electronic device and computer program product | |
CN115171645A (en) | Dubbing method and device, electronic equipment and storage medium | |
JP2019159099A (en) | Music reproduction system |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | |
SE01 | Entry into force of request for substantive examination | |
WD01 | Invention patent application deemed withdrawn after publication | Application publication date: 20181102 |