CN110097874A

CN110097874A - A kind of pronunciation correction method, apparatus, equipment and storage medium

Info

Publication number: CN110097874A
Application number: CN201910406383.5A
Authority: CN
Inventors: 刘晨晨; 沈欣尧; 关普键; 杨晓飞; 蒋成林; 陈磊; 吴梦香; 林顺达; 戴政
Original assignee: SHANGHAI LIULISHUO INFORMATION TECHNOLOGY Co Ltd
Current assignee: SHANGHAI LIULISHUO INFORMATION TECHNOLOGY Co Ltd
Priority date: 2019-05-16
Filing date: 2019-05-16
Publication date: 2019-08-06

Abstract

The invention discloses a kind of pronunciation correction methods, and the audio data of predetermined word typing is directed to by obtaining；Audio data is analyzed, detects whether the case where increasing vowel there are syllable end；According to testing result, generating predetermined pronunciation of words, there are the feedback informations of syllable mistake.The application can automatically analyze the audio data of typing, detect whether there is a situation where syllable mistake, unification is in a manner of understanding syllable come the problem of correcting syllable end Canadian dollar sound, obtained feedback information can assist English learner to fully understand the concept of syllable, the repeated work that sound is corrected one by one is eliminated, waste of time is avoided.Also, teacher can not needed using the application and carry out true man's demonstration lesson or correction face to face, therefore overcome the limitation in learning time and space, user can carry out relevant practice whenever and wherever possible.In addition, present invention also provides a kind of pronunciation correction device, equipment and computer readable storage mediums having above-mentioned technique effect.

Description

A kind of pronunciation correction method, apparatus, equipment and storage medium

Technical field

The present invention relates to voice technology fields, more particularly to a kind of pronunciation correction method, apparatus, equipment and computer Readable storage medium storing program for executing.

Background technique

With the development of science and technology, language learning application Internet-based has also obtained quick development.Some In language learning application, application provider sends client for learning stuff by internet, and user obtains via client Learning stuff carries out corresponding study.For language learning, other than learning grammar with vocabulary, articulation ability is wherein most One of important ability.Under normal conditions, user can promote the articulation ability of itself by reading aloud, with modes such as readings.However, User can not learn whether itself pronunciation is accurate in most cases.

Since simple or compound vowel of a Chinese syllable most of in Chinese is all vowel, so thering is part learner habitual can increase in English equivalents Add a sound, such as the monosyllable end bed/bed/ Canadian dollar pronunciation at(be- " Tinkling "), has actually become a double-tone Save word.

Traditional scheme is by teaching explanation syllable concept, as the concept base for learning other skills (such as stress) Plinth not will do it special training.When there is syllable end Canadian dollar mail topic, traditional teaching method can be considered as phonetic symbol hair The problem of sound (such as the above problem, will be considered that be /pronunciation of d/ is not correct enough), need one by one sound corrected, cause to repeat It works more, time-consuming extremely long.

Summary of the invention

The object of the present invention is to provide a kind of pronunciation correction method, apparatus, equipment and computer readable storage medium, with Solve the problems, such as that existing scheme needs sound correction one by one to lead to that repeated work is more, takes a long time.

In order to solve the above technical problems, the present invention provides a kind of pronunciation correction method, comprising:

Obtain the audio data for being directed to predetermined word typing；

The audio data is analyzed, detects whether the case where increasing vowel there are syllable end；

According to testing result, there are the feedback informations of syllable mistake for the generation predetermined pronunciation of words.

Optionally, described that the audio data is analyzed, detect whether the case where increasing vowel there are syllable end Include:

The audio data is analyzed, the end consonant of each syllable in audio data is detected；

After detecting end consonant, whether the adjacent audio data after detecting end consonant has sound periodicity, If it is, determining the case where increasing vowel there are syllable end.

Optionally, described that the audio data is analyzed, detect the end consonant packet of each syllable in audio data It includes:

Whether the end that each syllable is determined according to the word content of the predetermined word is consonant；

If the end of each syllable is consonant in the predetermined word, carry out forcing cutting pair by speech recognition Together, the position for obtaining each phoneme determines the position of consonant, to detect the end consonant of each syllable in audio data.

Optionally, in the generation predetermined pronunciation of words, there are after the feedback information of syllable mistake further include:

Display is identified to the feedback information in display interface, and/or plays preset corresponding audio.

Optionally, the audio data is analyzed described, detects whether that there are the feelings that syllable end increases vowel After condition further include:

If it is, generating the prompt information of the included syllable quantity of practical pronunciation.

Present invention also provides a kind of pronunciation correction devices, comprising:

Module is obtained, for obtaining the audio data for being directed to predetermined word typing；

Detection module detects whether that there are the feelings that syllable end increases vowel for analyzing the audio data Condition；

Generation module, for according to testing result, generating the predetermined pronunciation of words, there are the feedback informations of syllable mistake.

Optionally, further includes:

Feedback module is being shown for generating the predetermined pronunciation of words there are after the feedback information of syllable mistake Interface is identified display to the feedback information, and/or plays preset corresponding audio.

Present invention also provides a kind of pronunciation correction equipment, are applied to server-side, and the equipment includes:

Memory, for storing computer program；

Processor realizes following steps when for executing the computer program: obtaining the sound for being directed to predetermined word typing Frequency evidence；The audio data is analyzed, detects whether the case where increasing vowel there are syllable end；It is tied according to detection Fruit, generating the predetermined pronunciation of words, there are the feedback informations of syllable mistake.

Present invention also provides a kind of pronunciation correction equipment, are applied to client, and the equipment includes:

Audio collecting device is directed to the audio data of predetermined word for typing；

Communication device, for the audio data to be sent to server-side, so that the server-side is to the audio data It is analyzed, detects whether the case where increasing vowel there are syllable end；According to testing result, the predetermined pronunciation of words is generated There are the feedback informations of syllable mistake；And receive the feedback information that the server-side is sent；

Display device, for showing the feedback information in the display interface.

Present invention also provides a kind of computer readable storage medium, meter is stored on the computer readable storage medium The step of calculation machine program, the computer program realizes any of the above-described kind of pronunciation correction method when being executed by processor.

Pronunciation correction method provided by the present invention is directed to the audio data of predetermined word typing by obtaining；To audio Data are analyzed, and detect whether the case where increasing vowel there are syllable end；According to testing result, predetermined pronunciation of words is generated There are the feedback informations of syllable mistake.The application can automatically analyze the audio data of typing, detect whether that there are sounds Save the situation of mistake, the unified feedback information energy in a manner of understanding syllable to obtain the problem of correcting syllable end Canadian dollar sound It enough assists English learner to fully understand the concept of syllable, eliminates the repeated work that sound is corrected one by one, avoid the wave of time Take.Also, teacher can not needed using the application and carry out true man's demonstration lesson or correction face to face, therefore overcome learning time With the limitation in space, user can carry out relevant practice whenever and wherever possible.In addition, present invention also provides one kind to have above-mentioned skill Pronunciation correction device, equipment and the computer readable storage medium of art effect.

Detailed description of the invention

It, below will be to embodiment or existing for the clearer technical solution for illustrating the embodiment of the present invention or the prior art Attached drawing needed in technical description is briefly described, it should be apparent that, the accompanying drawings in the following description is only this hair Bright some embodiments for those of ordinary skill in the art without creative efforts, can be with root Other attached drawings are obtained according to these attached drawings.

Fig. 1 is a kind of flow chart of specific embodiment of pronunciation correction method provided herein；

Fig. 2 is the process signal that the case where increasing vowel there are syllable end is detected whether provided by the embodiment of the present application Figure；

Fig. 3 is the flow chart of another specific embodiment of pronunciation correction method provided herein；

Fig. 4 is syllable exercise visual feedback example figure；

Fig. 5 is the structural block diagram of pronunciation correction device provided in an embodiment of the present invention；

Fig. 6 is pronunciation correction equipment application provided in an embodiment of the present invention in the structural block diagram of server-side；

Fig. 7 is pronunciation correction equipment application provided in an embodiment of the present invention in the structural block diagram of client；

Fig. 8 is the structural block diagram of pronunciation correction equipment provided in an embodiment of the present invention.

Specific embodiment

In order to enable those skilled in the art to better understand the solution of the present invention, with reference to the accompanying drawings and detailed description The present invention is described in further detail.Obviously, described embodiments are only a part of the embodiments of the present invention, rather than Whole embodiments.Based on the embodiments of the present invention, those of ordinary skill in the art are not making creative work premise Under every other embodiment obtained, shall fall within the protection scope of the present invention.

The description and claims of this application and term " first ", " second ", " third ", " in above-mentioned attached drawing The (if present)s such as four " are to be used to distinguish similar objects, without being used to describe a particular order or precedence order.It should manage The data that solution uses in this way are interchangeable under appropriate circumstances, so that the embodiments described herein can be in addition to illustrating herein Or the sequence other than the content of description is implemented.In addition, term " includes " and " having " and their any deformation, it is intended that Cover it is non-exclusive include, for example, containing the process, method, system, product or equipment of a series of steps or units need not limit In step or unit those of is clearly listed, but may include be not clearly listed or for these process, methods, produce The other step or units of product or equipment inherently.

It should be noted that the description for being related to " first ", " second " etc. in the present invention is used for description purposes only, and cannot It is interpreted as its relative importance of indication or suggestion or implicitly indicates the quantity of indicated technical characteristic.Define as a result, " the One ", the feature of " second " can explicitly or implicitly include at least one of the features.In addition, the skill between each embodiment Art scheme can be combined with each other, but must be based on can be realized by those of ordinary skill in the art, when technical solution Will be understood that the combination of this technical solution is not present in conjunction with there is conflicting or cannot achieve when, also not the present invention claims Protection scope within.

The embodiment of the present invention can be used for word pronunciation learning scene or hair in word pronunciation learning scene, especially language learning Sound corrects scene, and wherein language includes but is not limited to the foreign languages such as English, French, German, Japanese and mandarin, Guangdong language, Sichuan Hua Deng Chinese branch.The present embodiments relate to language learning scene for example to can be language learning software or language learning whole Pronunciation assessment scene, the scenes such as pronunciation correction scene in end, are also possible to other language learning scenes, in the embodiment of the present invention It does not limit.

The application scenarios of the embodiment of the present application are described in detail below, user can carry out phonetics by client It practises, client can show user's content to be learned in the display interface, and can also be played by audios such as loudspeakers Device exports the audio content of speech form to user.When user carries out the word pronunciation learning of voice, client can pass through sound Frequency acquisition device acquires audio data when user pronunciation, so as to subsequent progress pronunciation correction operation.It is understood that executing The main body of pronunciation correction operation can be client, or server-side, this does not influence the realization of the application.

Client can include but is not limited in the embodiment of the present invention: smart phone, tablet computer, MP4, MP3, PC, PDA, wearable device and wear display equipment etc.；Server-side can include but is not limited to: single network server, multiple networks The server group of server composition is based on cloud computing cloud consisting of a large number of computers or network servers.

In conjunction with above-mentioned application scenarios, a kind of flow chart of specific embodiment of pronunciation correction method provided herein As shown in Figure 1, this method specifically includes:

Step S101: the audio data for being directed to predetermined word typing is obtained；

User can read aloud the predetermined word, the voice of the word to be practiced is directed to by client typing, by audio The corresponding audio data of voice is obtained after acquisition device acquisition.Predetermined word can be single syllable words or multisyllable word, This is without limitation.

Step S102: analyzing the audio data, detects whether the case where increasing vowel there are syllable end；

Detect whether that the process for the case where there are syllable end increases vowel is shown referring to provided by Fig. 2 the embodiment of the present application It is intended to, detects whether that the process for the case where there are syllable end increases vowel can specifically include:

Step S1021: analyzing the audio data, detects the end consonant of each syllable in audio data；

Whether the end that each syllable is determined according to the word content of the predetermined word is consonant；If the booking list In word there are the end of syllable be consonant, then by speech recognition carry out force cutting be aligned, obtain the position of each phoneme, The position of consonant is determined, to detect the end consonant of each syllable in audio data.

Step S1022: after detecting end consonant, whether the adjacent audio data after detecting end consonant has sound Sound is periodical, if it is, determining the case where increasing vowel there are syllable end.

After detecting end consonant, it can be further analyzed in the audio data in subsequent prefixed time interval, Judge whether it has sound periodical.Prefixed time interval can be 50 milliseconds to 200 milliseconds.Detecting that end consonant opens After beginning in 50 milliseconds to 200 milliseconds, the periodicity of sound is detected.This is because vowel is periodically to shake, consonant does not have Periodically, therefore if it is detected that periodically being considered as more by force increasing vowel at syllable end, that is, there is syllable mistake.

The periodicity of sound can be calculated by the autocorrelation method of time domain.Related coefficient measurement refer to two not With the degree that influences each other between event；And auto-correlation coefficient measurement is same event between two different times Degree of correlation, vivid saying exactly measure oneself behavior over to oneself present influence.Determining by auto-correlation coefficient To the periodicity of sound.

Step S103: according to testing result, generating the predetermined pronunciation of words, there are the feedback informations of syllable mistake.

Pronunciation correction method provided by the present invention is directed to the audio data of predetermined word typing by obtaining；To audio Data are analyzed, and detect whether the case where increasing vowel there are syllable end；According to testing result, predetermined pronunciation of words is generated There are the feedback informations of syllable mistake.The application can automatically analyze the audio data of typing, detect whether that there are sounds Save the situation of mistake, the unified feedback information energy in a manner of understanding syllable to obtain the problem of correcting syllable end Canadian dollar sound It enough assists English learner to fully understand the concept of syllable, eliminates the repeated work that sound is corrected one by one, avoid the wave of time Take.Also, teacher can not needed using the application and carry out true man's demonstration lesson or correction face to face, therefore overcome learning time With the limitation in space, user can carry out relevant practice whenever and wherever possible.

Based on any of the above embodiments, pronunciation correction method provided herein can further include: The predetermined pronunciation of words is being generated there are after the feedback information of syllable mistake, the feedback information is being carried out in display interface Mark display, and/or play preset corresponding audio.

Referring to Fig. 3, another specific embodiment of pronunciation correction method provided herein can be specifically included:

Step S201: the audio data for being directed to predetermined word typing is obtained；

Step S202: analyzing the audio data, detects whether the case where increasing vowel there are syllable end；

Step S203: according to testing result, generating the predetermined pronunciation of words, there are the feedback informations of syllable mistake；

Step S204: display is identified to the feedback information in display interface, and/or plays preset correspondence Audio.

Further, the application is analyzed to the audio data, detects whether that there are syllable ends to increase vowel The case where after can also include: if it is, generate it is practical pronounce included syllable quantity prompt information.

The situation of correcting errors of syllable when feeding back by display interface.Such as Fig. 4 syllable exercise visual feedback example figure institute Show, by circle above interface indicate user it is practical pronounce syllable it is whether correct.When correct, the circle on interface shows green Color simultaneously plays corresponding audio；When mistake, the circle on interface is shaken.Further, it is also possible to be included according to practical pronunciation is generated The prompt information of syllable quantity, by voice and text prompt occur it is practical read several syllables, such as can be on display circle It tells personally and knows that the single syllable words of script have been read as 2 syllables by user.

Pronunciation correction device provided in an embodiment of the present invention is introduced below, pronunciation correction device described below with Above-described pronunciation correction method can correspond to each other reference.

Fig. 5 is the structural block diagram of pronunciation correction device provided in an embodiment of the present invention, can be with referring to Fig. 5 pronunciation correction device Include:

Module 100 is obtained, for obtaining the audio data for being directed to predetermined word typing；

Detection module 200 detects whether that there are syllable ends to increase vowel for analyzing the audio data Situation；

Generation module 300, for according to testing result, generating the predetermined pronunciation of words, there are the feedback letters of syllable mistake Breath.

As a kind of specific embodiment, detection module 200 is specifically used in pronunciation correction device provided herein: The audio data is analyzed, the end consonant of each syllable in audio data is detected；After detecting end consonant, inspection Whether the adjacent audio data after surveying end consonant has sound periodicity, if it is, determining that there are the increases of syllable end The case where vowel.

As a kind of specific embodiment, detection module 200 is specifically used in pronunciation correction device provided herein: Whether the end that each syllable is determined according to the word content of the predetermined word is consonant；If existed in the predetermined word The end of syllable is consonant, then carries out forcing cutting alignment by speech recognition, obtain the position of each phoneme, determine consonant Position, to detect the end consonant of each syllable in audio data.

Based on any of the above embodiments, pronunciation correction device provided herein can further include: Feedback module, for being identified display, and/or the preset diaphone of broadcasting to the feedback information in display interface Effect.

Based on any of the above embodiments, pronunciation correction device provided herein can further include: Cue module, for being analyzed to the audio data, after detecting whether the case where increasing vowel there are syllable end, If it is determined that the case where increasing vowel there are syllable end, then generate the prompt information for the syllable quantity that practical pronunciation is included.

The pronunciation correction device of the present embodiment is for realizing pronunciation correction method above-mentioned, therefore in pronunciation correction device The embodiment part of the visible pronunciation correction method hereinbefore of specific embodiment, for example, obtaining module 100, detection module 200, generation module 300 is respectively used to realize step S101, S102, S103 in above-mentioned pronunciation correction method, so, it is specific Embodiment is referred to the description of corresponding various pieces embodiment, and details are not described herein.

The application is directed to the audio data of predetermined word typing by obtaining；Audio data is analyzed, is detected whether The case where increasing vowel there are syllable end；According to testing result, generating predetermined pronunciation of words, there are the feedback letters of syllable mistake Breath.The application can automatically analyze the audio data of typing, detect whether there is a situation where syllable mistake, unified to manage The mode of syllable is solved come the problem of correcting syllable end Canadian dollar sound, obtained feedback information can assist English learner sufficiently to manage The concept for solving syllable eliminates the repeated work that sound is corrected one by one, avoids waste of time.Also, use the application can be with It does not need teacher and carries out true man's demonstration lesson or correction face to face, therefore overcome the limitation in learning time and space, user can be with Relevant practice is carried out whenever and wherever possible.

In addition, being applied to server-side 1, Fig. 6 mentions for the embodiment of the present invention present invention also provides a kind of pronunciation correction equipment The pronunciation correction equipment application of confession includes: in the structural block diagram of server-side, the equipment

Memory 11, for storing computer program；

Processor 12 realizes following steps when for executing the computer program: obtaining for predetermined word typing Audio data；The audio data is analyzed, detects whether the case where increasing vowel there are syllable end；It is tied according to detection Fruit, generating the predetermined pronunciation of words, there are the feedback informations of syllable mistake.

Wherein, memory 11 include at least a type of readable storage medium storing program for executing, the readable storage medium storing program for executing include flash memory, Hard disk, multimedia card, card-type memory (for example, SD or DX memory etc.), magnetic storage, disk, CD etc..Memory 11 It can be the internal storage unit of pronunciation correction equipment, such as hard disk in some embodiments.Memory 11 is in other implementations It is also possible to the External memory equipment of pronunciation correction equipment, such as plug-in type hard disk, intelligent memory card (Smart Media in example Card, SMC), secure digital (Secure Digital, SD) card, flash card (Flash Card) etc..Further, memory 11 can also both including pronunciation correction equipment internal storage unit and also including External memory equipment.Memory 11 can not only be used It is installed on the application software and Various types of data, such as the code of pronunciation correction program 01 etc. of pronunciation correction equipment in storage, may be used also For temporarily storing the data that has exported or will export.

Processor 12 can be in some embodiments a central processing unit (Central Processing Unit, CPU), controller, microcontroller, microprocessor or other data processing chips, the program for being stored in run memory 11 Code or processing data, such as execute pronunciation correction program 01 etc..

Optionally, the processor 12 is for being implemented as follows step when executing the computer program: to the sound Frequency detects the end consonant of each syllable in audio data according to being analyzed；After detecting end consonant, detection end is auxiliary Whether the adjacent audio data after sound has sound periodicity, if it is, determining that there are the feelings that syllable end increases vowel Condition.

Optionally, the processor 12 is for being implemented as follows step when executing the computer program: according to described The word content of predetermined word determines whether the end of each syllable is consonant；If in the predetermined word, there are the ends of syllable Tail is consonant, then carries out forcing cutting alignment by speech recognition, obtain the position of each phoneme, determine the position of consonant, To detect the end consonant of each syllable in audio data.

It is understood that server-side can include but is not limited in the embodiment of the present application: single network server, multiple The server group of network server composition is based on cloud computing cloud consisting of a large number of computers or network servers.

In addition, being applied to client 2, Fig. 7 mentions for the embodiment of the present invention present invention also provides a kind of pronunciation correction equipment The pronunciation correction equipment application of confession includes: in the structural block diagram of client, the equipment

Audio collecting device 21 is directed to the audio data of predetermined word for typing；

Communication device 22, for the audio data to be sent to server-side, so that the server-side is to the audio number According to being analyzed, the case where increasing vowel there are syllable end is detected whether；According to testing result, the predetermined word hair is generated There are the feedback informations of syllable mistake for sound；And receive the feedback information that the server-side is sent；

Display device 23, for showing the feedback information in the display interface.

Optionally, display device can be also used in pronunciation correction equipment provided by the embodiment of the present application: on display circle It is identified display in face of the feedback information, and/or plays preset corresponding audio.

Optionally, display device can be also used in pronunciation correction equipment provided by the embodiment of the present application: to described Audio data is analyzed, and after detecting whether the case where increasing vowel there are syllable end, is increased if there is syllable end The case where vowel, then generates the prompt information for the syllable quantity that practical pronunciation is included.

It is understood that client can include but is not limited in the embodiment of the present application: smart phone, tablet computer, MP4, MP3, PC, PDA, wearable device and wear display equipment etc..

Further, present invention also provides a kind of pronunciation correction systems, as shown in figure 8, the system includes any of the above-described Kind server-side 1 and any of the above-described kind of client 2.User can carry out word pronunciation learning by client, and client can be aobvious Show the content for showing that user is to be learned on interface, and voice can also be exported to user by audio playing apparatus such as loudspeakers The audio content of form, when user carries out the word pronunciation learning of voice, client can acquire user by audio collecting device Audio data when pronunciation, and audio data is sent to server-side, the process of pronunciation correction is carried out by server-side.In server-side After being analyzed to audio data and obtain feedback information, which is sent to client.Pass through the aobvious of client Showing device shows feedback information, provides a user vision auxiliary information.

In addition, being deposited on the computer readable storage medium present invention also provides a kind of computer readable storage medium Computer program is contained, the computer program realizes any of the above-described kind of pronunciation correction method when being executed by processor the step of.

Pronunciation correction equipment, pronunciation correction system, computer readable storage medium and preceding method provided herein It is corresponding.It is apparent to those skilled in the art that for convenience and simplicity of description, the system of foregoing description, The specific work process of device and unit, can refer to corresponding processes in the foregoing method embodiment, and details are not described herein.

To sum up, the application is directed to the audio data of predetermined word typing by obtaining；Audio data is analyzed, is detected The case where increasing vowel with the presence or absence of syllable end；According to testing result, generating predetermined pronunciation of words, there are the anti-of syllable mistake Feedforward information.The application can automatically analyze the audio data of typing, detect whether there is a situation where syllable mistake, unified The problem of syllable end Canadian dollar sound is corrected in a manner of understanding syllable, obtained feedback information can assist English learner to fill The concept of sub-argument solution syllable eliminates the repeated work that sound is corrected one by one, avoids waste of time.Also, use the application It can not need teacher and carry out true man's demonstration lesson or correction face to face, therefore overcome the limitation in learning time and space, user Relevant practice can be carried out whenever and wherever possible.

Each embodiment in this specification is described in a progressive manner, the highlights of each of the examples are with it is other The difference of embodiment, same or similar part may refer to each other between each embodiment.For being filled disclosed in embodiment For setting, since it is corresponded to the methods disclosed in the examples, so being described relatively simple, related place is referring to method part Explanation.

Professional further appreciates that, unit described in conjunction with the examples disclosed in the embodiments of the present disclosure And algorithm steps, can be realized with electronic hardware, computer software, or a combination of the two, in order to clearly demonstrate hardware and The interchangeability of software generally describes each exemplary composition and step according to function in the above description.These Function is implemented in hardware or software actually, the specific application and design constraint depending on technical solution.Profession Technical staff can use different methods to achieve the described function each specific application, but this realization is not answered Think beyond the scope of this invention.

The step of method described in conjunction with the examples disclosed in this document or algorithm, can directly be held with hardware, processor The combination of capable software module or the two is implemented.Software module can be placed in random access memory (RAM), memory, read-only deposit Reservoir (ROM), electrically programmable ROM, electrically erasable ROM, register, hard disk, moveable magnetic disc, CD-ROM or technology In any other form of storage medium well known in field.

Pronunciation correction method, apparatus provided by the present invention, equipment and computer readable storage medium are carried out above It is discussed in detail.Used herein a specific example illustrates the principle and implementation of the invention, above embodiments Explanation be merely used to help understand method and its core concept of the invention.It should be pointed out that for the common of the art , without departing from the principle of the present invention, can be with several improvements and modifications are made to the present invention for technical staff, these Improvement and modification are also fallen within the protection scope of the claims of the present invention.

Claims

1. a kind of pronunciation correction method characterized by comprising

Obtain the audio data for being directed to predetermined word typing；

2. pronunciation correction method as described in claim 1, which is characterized in that described to analyze the audio data, inspection Surveying the case where increasing vowel with the presence or absence of syllable end includes:

After detecting end consonant, whether the adjacent audio data after detecting end consonant has sound periodicity, if It is then to determine the case where increasing vowel there are syllable end.

3. pronunciation correction method as claimed in claim 2, which is characterized in that described to analyze the audio data, inspection The end consonant of each syllable includes: in survey audio data

If in the predetermined word there are the end of syllable be consonant, by speech recognition carry out force cutting be aligned, obtain To the position of each phoneme, the position of consonant is determined, to detect the end consonant of each syllable in audio data.

4. pronunciation correction method as described in any one of claims 1 to 3, which is characterized in that generate the booking list described There are after the feedback information of syllable mistake for word pronunciation further include:

5. pronunciation correction method as claimed in claim 4, which is characterized in that the audio data is analyzed described, Detect whether there are syllable end increase vowel the case where after further include:

6. a kind of pronunciation correction device characterized by comprising

Detection module detects whether the case where increasing vowel there are syllable end for analyzing the audio data；

7. pronunciation correction method as claimed in claim 6, which is characterized in that further include:

Feedback module, for generating the predetermined pronunciation of words there are after the feedback information of syllable mistake, in display interface Display is identified to the feedback information, and/or plays preset corresponding audio.

8. a kind of pronunciation correction equipment, which is characterized in that be applied to server-side, the equipment includes:

Memory, for storing computer program；

Processor realizes following steps when for executing the computer program: obtaining the audio number for being directed to predetermined word typing According to；The audio data is analyzed, detects whether the case where increasing vowel there are syllable end；According to testing result, raw At the predetermined pronunciation of words, there are the feedback informations of syllable mistake.

9. a kind of pronunciation correction equipment, which is characterized in that be applied to client, the equipment includes:

Communication device, for the audio data to be sent to server-side, so that the server-side carries out the audio data Analysis detects whether the case where increasing vowel there are syllable end；According to testing result, the predetermined pronunciation of words is generated to exist The feedback information of syllable mistake；And receive the feedback information that the server-side is sent；

Display device, for showing the feedback information in the display interface.

10. a kind of computer readable storage medium, which is characterized in that be stored with computer on the computer readable storage medium Program realizes the step of the pronunciation correction method as described in any one of claim 1 to 5 when the computer program is executed by processor Suddenly.