CN110097874A - A kind of pronunciation correction method, apparatus, equipment and storage medium - Google Patents

A pronunciation correction method, apparatus, equipment and storage medium

Info

Publication number
CN110097874A
Authority
CN
China
Prior art keywords
syllable
audio data
pronunciation
vowel
consonant
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201910406383.5A
Other languages
Chinese (zh)
Inventor
刘晨晨
沈欣尧
关普键
杨晓飞
蒋成林
陈磊
吴梦香
林顺达
戴政
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
SHANGHAI LIULISHUO INFORMATION TECHNOLOGY Co Ltd
Original Assignee
SHANGHAI LIULISHUO INFORMATION TECHNOLOGY Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by SHANGHAI LIULISHUO INFORMATION TECHNOLOGY Co Ltd
Priority to CN201910406383.5A
Publication of CN110097874A
Legal status: Pending

Classifications

    • G PHYSICS
    • G09 EDUCATION; CRYPTOGRAPHY; DISPLAY; ADVERTISING; SEALS
    • G09B EDUCATIONAL OR DEMONSTRATION APPLIANCES; APPLIANCES FOR TEACHING, OR COMMUNICATING WITH, THE BLIND, DEAF OR MUTE; MODELS; PLANETARIA; GLOBES; MAPS; DIAGRAMS
    • G09B5/00 Electrically-operated educational appliances
    • G PHYSICS
    • G09 EDUCATION; CRYPTOGRAPHY; DISPLAY; ADVERTISING; SEALS
    • G09B EDUCATIONAL OR DEMONSTRATION APPLIANCES; APPLIANCES FOR TEACHING, OR COMMUNICATING WITH, THE BLIND, DEAF OR MUTE; MODELS; PLANETARIA; GLOBES; MAPS; DIAGRAMS
    • G09B5/00 Electrically-operated educational appliances
    • G09B5/06 Electrically-operated educational appliances with both visual and audible presentation of the material to be studied
    • G09B5/065 Combinations of audio and video presentations, e.g. videotapes, videodiscs, television systems
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/02 Feature extraction for speech recognition; Selection of recognition unit
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/04 Segmentation; Word boundary detection
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/08 Speech classification or search
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/26 Speech to text systems
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/08 Speech classification or search
    • G10L2015/088 Word spotting

Abstract

The invention discloses a pronunciation correction method, which comprises: obtaining audio data recorded for a predetermined word; analyzing the audio data to detect whether a vowel has been added at the end of a syllable; and, according to the detection result, generating feedback information indicating that the pronunciation of the predetermined word contains a syllable error. The application can automatically analyze the recorded audio data and detect whether a syllable error exists. It corrects the problem of adding a vowel at the end of a syllable in a unified way, through an understanding of syllables, and the resulting feedback information helps English learners fully understand the concept of a syllable, which eliminates the repetitive work of correcting sounds one by one and avoids wasted time. Moreover, the application does not require a teacher to give live demonstration lessons or face-to-face correction, so it overcomes the limits of learning time and place, and users can practice anytime and anywhere. In addition, the application also provides a pronunciation correction device, equipment and computer-readable storage medium having the above technical effects.

Description

A pronunciation correction method, apparatus, equipment and storage medium
Technical field
The present invention relates to the field of speech technology, and in particular to a pronunciation correction method, apparatus, equipment and computer-readable storage medium.
Background
With the development of science and technology, Internet-based language learning applications have also developed rapidly. In some language learning applications, the application provider sends learning material to a client over the Internet, and the user obtains the material through the client and studies it. In language learning, besides grammar and vocabulary, pronunciation is one of the most important skills. Users usually improve their pronunciation by reading aloud, reading along with a recording, and similar methods. In most cases, however, users cannot tell whether their own pronunciation is accurate.
Because most finals in Chinese are vowels, some learners habitually add a sound when speaking English. For example, a learner may add a vowel at the end of the monosyllabic word bed /bed/, which in effect turns it into a two-syllable word.
The traditional approach explains the concept of a syllable in teaching only as a conceptual basis for learning other skills (such as stress) and does not train it specifically. When the problem of adding a vowel at the end of a syllable appears, traditional teaching treats it as a problem with the pronunciation of an individual phonetic symbol (in the example above, the pronunciation of /d/ would be considered inaccurate), so sounds have to be corrected one by one, which causes a great deal of repeated work and takes a very long time.
Summary of the invention
The object of the present invention is to provide a pronunciation correction method, apparatus, equipment and computer-readable storage medium, so as to solve the problem that existing schemes require sounds to be corrected one by one, which causes much repeated work and takes a long time.
In order to solve the above technical problem, the present invention provides a pronunciation correction method, comprising:
Obtaining audio data recorded for a predetermined word;
Analyzing the audio data to detect whether a vowel has been added at the end of a syllable;
According to the detection result, generating feedback information indicating that the pronunciation of the predetermined word contains a syllable error.
Optionally, analyzing the audio data to detect whether a vowel has been added at the end of a syllable comprises:
Analyzing the audio data to detect the final consonant of each syllable in the audio data;
After a final consonant is detected, detecting whether the audio data immediately following the final consonant is periodic and, if so, determining that a vowel has been added at the end of the syllable.
Optionally, analyzing the audio data to detect the final consonant of each syllable in the audio data comprises:
Determining, from the text of the predetermined word, whether each syllable ends in a consonant;
If a syllable of the predetermined word ends in a consonant, performing forced alignment by speech recognition to obtain the position of each phoneme and determine the position of the consonant, thereby detecting the final consonant of each syllable in the audio data.
Optionally, after generating the feedback information indicating that the pronunciation of the predetermined word contains a syllable error, the method further comprises:
Displaying the feedback information with a marker in the display interface, and/or playing a preset corresponding sound effect.
Optionally, after analyzing the audio data to detect whether a vowel has been added at the end of a syllable, the method further comprises:
If so, generating prompt information on the number of syllables contained in the actual pronunciation.
The present application also provides a pronunciation correction device, comprising:
An acquisition module, configured to obtain audio data recorded for a predetermined word;
A detection module, configured to analyze the audio data to detect whether a vowel has been added at the end of a syllable;
A generation module, configured to generate, according to the detection result, feedback information indicating that the pronunciation of the predetermined word contains a syllable error.
Optionally, the device further comprises:
A feedback module, configured to, after the feedback information indicating that the pronunciation of the predetermined word contains a syllable error is generated, display the feedback information with a marker in the display interface, and/or play a preset corresponding sound effect.
The present application also provides pronunciation correction equipment applied to a server, the equipment comprising:
A memory, configured to store a computer program;
A processor, configured to implement the following steps when executing the computer program: obtaining audio data recorded for a predetermined word; analyzing the audio data to detect whether a vowel has been added at the end of a syllable; and, according to the detection result, generating feedback information indicating that the pronunciation of the predetermined word contains a syllable error.
The present application also provides pronunciation correction equipment applied to a client, the equipment comprising:
An audio collecting device, configured to record audio data for a predetermined word;
A communication device, configured to send the audio data to a server, so that the server analyzes the audio data to detect whether a vowel has been added at the end of a syllable and, according to the detection result, generates feedback information indicating that the pronunciation of the predetermined word contains a syllable error; and to receive the feedback information sent by the server;
A display device, configured to show the feedback information in a display interface.
The present application also provides a computer-readable storage medium on which a computer program is stored; when executed by a processor, the computer program implements the steps of any of the pronunciation correction methods described above.
The pronunciation correction method provided by the present invention obtains audio data recorded for a predetermined word; analyzes the audio data to detect whether a vowel has been added at the end of a syllable; and, according to the detection result, generates feedback information indicating that the pronunciation of the predetermined word contains a syllable error. The application can automatically analyze the recorded audio data and detect whether a syllable error exists. It corrects the problem of adding a vowel at the end of a syllable in a unified way, through an understanding of syllables, and the resulting feedback information helps English learners fully understand the concept of a syllable, which eliminates the repetitive work of correcting sounds one by one and avoids wasted time. Moreover, the application does not require a teacher to give live demonstration lessons or face-to-face correction, so it overcomes the limits of learning time and place, and users can practice anytime and anywhere. In addition, the application also provides a pronunciation correction device, equipment and computer-readable storage medium having the above technical effects.
Brief description of the drawings
In order to explain the technical solutions of the embodiments of the present invention or of the prior art more clearly, the drawings needed in the description of the embodiments or the prior art are briefly introduced below. Obviously, the drawings described below are only some embodiments of the present invention, and those of ordinary skill in the art can obtain other drawings from them without creative effort.
Fig. 1 is a flow chart of one specific embodiment of the pronunciation correction method provided by the present application;
Fig. 2 is a schematic flow chart, provided by an embodiment of the present application, of detecting whether a vowel has been added at the end of a syllable;
Fig. 3 is a flow chart of another specific embodiment of the pronunciation correction method provided by the present application;
Fig. 4 is an example of visual feedback for a syllable exercise;
Fig. 5 is a structural block diagram of the pronunciation correction device provided by an embodiment of the present invention;
Fig. 6 is a structural block diagram of the pronunciation correction equipment provided by an embodiment of the present invention as applied to a server;
Fig. 7 is a structural block diagram of the pronunciation correction equipment provided by an embodiment of the present invention as applied to a client;
Fig. 8 is the structural block diagram of pronunciation correction equipment provided in an embodiment of the present invention.
Detailed description of the embodiments
In order that those skilled in the art may better understand the solution of the present invention, the present invention is described in further detail below with reference to the drawings and specific embodiments. Obviously, the described embodiments are only some of the embodiments of the present invention, not all of them. All other embodiments obtained by those of ordinary skill in the art on the basis of the embodiments of the present invention, without creative effort, fall within the protection scope of the present invention.
The terms "first", "second", "third", "fourth" and the like (if present) in the description, the claims and the above drawings of this application are used to distinguish similar objects and are not necessarily used to describe a particular order or sequence. It should be understood that data so used are interchangeable where appropriate, so that the embodiments described here can be implemented in an order other than that illustrated or described here. In addition, the terms "comprise" and "have" and any variants of them are intended to cover non-exclusive inclusion; for example, a process, method, system, product or equipment that contains a series of steps or units is not necessarily limited to the steps or units expressly listed, but may include other steps or units that are not expressly listed or that are inherent to the process, method, product or equipment.
It should be noted that descriptions involving "first", "second" and the like in the present invention are for descriptive purposes only and are not to be understood as indicating or implying relative importance or as implicitly indicating the number of the technical features referred to. A feature qualified by "first" or "second" may therefore explicitly or implicitly include at least one such feature. In addition, the technical solutions of the embodiments may be combined with one another, but only on the basis that the combination can be realized by those of ordinary skill in the art; when a combination of technical solutions is contradictory or cannot be realized, the combination should be considered not to exist and falls outside the protection scope claimed by the present invention.
The embodiments of the present invention can be used in pronunciation learning scenarios, in particular pronunciation learning or pronunciation correction scenarios in language learning, where the language includes but is not limited to foreign languages such as English, French, German and Japanese, and Chinese varieties such as Mandarin, Cantonese and Sichuanese. The language learning scenario involved in the embodiments may be, for example, a pronunciation assessment scenario or a pronunciation correction scenario in language learning software or a language learning terminal, or it may be another language learning scenario; the embodiments of the present invention place no limitation on this.
The application scenario of the embodiments of the present application is described in detail below. A user can learn pronunciation through a client. The client can show the content to be learned in its display interface and can also output audio content in spoken form to the user through an audio playback device such as a loudspeaker. While the user practices pronunciation, the client can capture the audio data of the user's pronunciation through an audio collecting device, so that the pronunciation correction operation can be carried out afterwards. It is understood that the pronunciation correction operation may be executed by either the client or the server; this does not affect the implementation of the application.
In the embodiments of the present invention, the client can include but is not limited to: a smartphone, tablet computer, MP4 player, MP3 player, PC, PDA, wearable device, head-mounted display and the like; the server can include but is not limited to: a single network server, a server group composed of multiple network servers, or a cloud, based on cloud computing, composed of a large number of computers or network servers.
In connection with the above application scenario, a flow chart of one specific embodiment of the pronunciation correction method provided by the present application is shown in Fig. 1. The method specifically includes:
Step S101: obtain audio data recorded for a predetermined word;
The user reads the predetermined word aloud, the client records the speech for the word to be practiced, and the corresponding audio data is obtained after capture by the audio collecting device. The predetermined word may be a monosyllabic word or a polysyllabic word; no limitation is placed on this.
Step S102: analyze the audio data to detect whether a vowel has been added at the end of a syllable;
Referring to Fig. 2, a schematic flow chart provided by an embodiment of the present application, the process of detecting whether a vowel has been added at the end of a syllable may specifically include:
Step S1021: analyze the audio data to detect the final consonant of each syllable in the audio data;
Whether each syllable ends in a consonant is determined from the text of the predetermined word. If a syllable of the predetermined word ends in a consonant, forced alignment is performed by speech recognition to obtain the position of each phoneme and determine the position of the consonant, thereby detecting the final consonant of each syllable in the audio data.
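Step S1021 can be sketched as follows, assuming a forced aligner has already produced time-stamped phoneme segments. The `Phone` record, the ARPAbet-style vowel set and the `syllable_final_consonants` helper are illustrative assumptions; the source does not name an aligner, a phoneme inventory or an output format.

```python
from dataclasses import dataclass

# Assumed ARPAbet-style vowel symbols; the source does not specify a phoneme set.
VOWELS = {"AA", "AE", "AH", "AO", "AW", "AY", "EH", "ER", "EY",
          "IH", "IY", "OW", "OY", "UH", "UW"}

@dataclass
class Phone:
    symbol: str    # phoneme label, e.g. "D"
    start: float   # start time in seconds
    end: float     # end time in seconds
    syllable: int  # index of the syllable this phoneme belongs to

def syllable_final_consonants(alignment):
    """Return the last phoneme of every syllable that ends in a consonant."""
    last_by_syllable = {}
    for phone in alignment:  # the alignment is expected in time order
        last_by_syllable[phone.syllable] = phone
    return [p for p in last_by_syllable.values() if p.symbol not in VOWELS]

# Hypothetical alignment for "bed" /bed/: one syllable, B EH D.
bed = [Phone("B", 0.10, 0.16, 0), Phone("EH", 0.16, 0.30, 0), Phone("D", 0.30, 0.36, 0)]
finals = syllable_final_consonants(bed)
```

For the hypothetical alignment above, `finals` holds the single segment labelled "D", whose `end` time marks where the periodicity check of step S1022 would begin.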
Step S1022: after a final consonant is detected, detect whether the audio data immediately following the final consonant is periodic and, if so, determine that a vowel has been added at the end of the syllable.
After the final consonant is detected, the audio data within a subsequent preset time interval can be analyzed further to judge whether it is periodic. The preset time interval may be 50 milliseconds to 200 milliseconds; that is, the periodicity of the sound is detected within 50 to 200 milliseconds after the final consonant is detected. Vowels are produced by periodic vibration, whereas consonants are not periodic, so if strong periodicity is detected, a vowel is considered to have been added at the end of the syllable, that is, a syllable error exists.
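Selecting the audio to inspect can be sketched as below. The function name, the parameters and the choice to start the window at the end time of the final consonant are assumptions made for illustration; the source only states that the preset interval may be 50 to 200 milliseconds.

```python
def window_after_consonant(samples, sample_rate, consonant_end_s, window_s=0.2):
    """Return the samples that fall in the preset interval after the final consonant."""
    # The source gives 50 ms to 200 ms as the preset interval; reject other values.
    if not 0.05 <= window_s <= 0.2:
        raise ValueError("window_s must be between 0.05 and 0.2 seconds")
    start = int(round(consonant_end_s * sample_rate))
    stop = start + int(round(window_s * sample_rate))
    # Slicing truncates safely if the recording ends before the window does.
    return samples[start:stop]
```

With a 16 kHz recording and a consonant ending at 0.5 s, a 0.1 s window covers sample indices 8000 up to, but not including, 9600.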
The periodicity of the sound can be computed with the time-domain autocorrelation method. A correlation coefficient measures the degree to which two different events influence each other, whereas an autocorrelation coefficient measures the degree of correlation of the same event between two different points in time; put vividly, it measures the influence of one's past behavior on one's present. The periodicity of the sound is determined from the autocorrelation coefficient.
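A minimal sketch of a time-domain autocorrelation check follows, written in plain Python for clarity rather than speed. The lag search range (75 to 400 Hz), the 0.5 decision threshold and the function names are assumptions for illustration; the source does not give concrete values.

```python
import math

def periodicity_strength(frame, sample_rate, f_min=75.0, f_max=400.0):
    """Peak of the normalized autocorrelation over lags in an assumed pitch range."""
    n = len(frame)
    if n == 0:
        return 0.0
    mean = sum(frame) / n
    x = [s - mean for s in frame]          # remove any DC offset
    energy = sum(v * v for v in x)         # autocorrelation at lag 0
    if energy == 0.0:
        return 0.0                         # silence has no periodicity
    lag_min = max(1, int(sample_rate / f_max))
    lag_max = min(int(sample_rate / f_min), n - 1)
    best = 0.0
    for lag in range(lag_min, lag_max + 1):
        r = sum(x[i] * x[i + lag] for i in range(n - lag)) / energy
        best = max(best, r)
    return best

def has_added_vowel(frame, sample_rate, threshold=0.5):
    """Treat strong periodicity after the final consonant as an added vowel."""
    return periodicity_strength(frame, sample_rate) >= threshold

# A 100 ms, 150 Hz sine at 8 kHz stands in for a voiced, vowel-like segment.
vowel_like = [math.sin(2 * math.pi * 150 * t / 8000) for t in range(800)]
detected = has_added_vowel(vowel_like, 8000)
```

A strongly periodic frame produces an autocorrelation peak close to 1 at a lag of one pitch period, while noise-like or silent frames stay near 0, so in practice the threshold would need tuning on real recordings.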
Step S103: according to the detection result, generate feedback information indicating that the pronunciation of the predetermined word contains a syllable error.
The pronunciation correction method provided by the present invention obtains audio data recorded for a predetermined word; analyzes the audio data to detect whether a vowel has been added at the end of a syllable; and, according to the detection result, generates feedback information indicating that the pronunciation of the predetermined word contains a syllable error. The application can automatically analyze the recorded audio data and detect whether a syllable error exists. It corrects the problem of adding a vowel at the end of a syllable in a unified way, through an understanding of syllables, and the resulting feedback information helps English learners fully understand the concept of a syllable, which eliminates the repetitive work of correcting sounds one by one and avoids wasted time. Moreover, the application does not require a teacher to give live demonstration lessons or face-to-face correction, so it overcomes the limits of learning time and place, and users can practice anytime and anywhere.
On the basis of any of the above embodiments, the pronunciation correction method provided by the present application may further include: after generating the feedback information indicating that the pronunciation of the predetermined word contains a syllable error, displaying the feedback information with a marker in the display interface, and/or playing a preset corresponding sound effect.
Referring to Fig. 3, another specific embodiment of the pronunciation correction method provided by the present application may specifically include:
Step S201: obtain audio data recorded for a predetermined word;
Step S202: analyze the audio data to detect whether a vowel has been added at the end of a syllable;
Step S203: according to the detection result, generate feedback information indicating that the pronunciation of the predetermined word contains a syllable error;
Step S204: display the feedback information with a marker in the display interface, and/or play a preset corresponding sound effect.
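The flow of steps S201 to S204 can be sketched as one function. The detector and the presentation callback are passed in as parameters, and the dictionary layout of the feedback is a hypothetical choice; neither is prescribed by the source.

```python
def run_correction(samples, word, detect_added_vowel, present):
    """Run steps S201 to S204 for one recording and return the feedback."""
    # S201: `samples` is the audio data already recorded for `word`.
    # S202: analyze the audio; the detector is injected so any method can be used.
    added_vowel = bool(detect_added_vowel(samples, word))
    # S203: generate feedback information from the detection result.
    feedback = {"word": word, "syllable_error": added_vowel}
    # S204: hand the feedback to the display and/or sound-effect layer.
    present(feedback)
    return feedback
```

Injecting `detect_added_vowel` and `present` keeps the sketch independent of any particular aligner, periodicity measure or user interface.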
Further, after analyzing the audio data to detect whether a vowel has been added at the end of a syllable, the method of the present application may also include: if so, generating prompt information on the number of syllables contained in the actual pronunciation.
Whether the syllables are correct is fed back through the display interface. As shown in the syllable exercise visual feedback example of Fig. 4, circles at the top of the interface indicate whether the syllables the user actually pronounced are correct. When the pronunciation is correct, the circles on the interface turn green and the corresponding sound effect is played; when it is wrong, the circles on the interface shake. In addition, based on the generated prompt information on the number of syllables contained in the actual pronunciation, voice and text prompts can state how many syllables were actually read; for example, the display interface can tell the user that a word that is originally monosyllabic was read as 2 syllables.
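The feedback described above can be sketched as a small record. The class name, its fields, the wording of the prompt and the assumption that each added vowel contributes exactly one extra syllable are all illustrative and do not come from the source.

```python
from dataclasses import dataclass

@dataclass
class SyllableFeedback:
    word: str
    expected_syllables: int  # syllables in the predetermined word
    added_vowels: int        # syllable-final vowels detected in the recording

    @property
    def spoken_syllables(self):
        # Assumption: every added vowel contributes exactly one extra syllable.
        return self.expected_syllables + self.added_vowels

    @property
    def circle_state(self):
        # Mirrors the described interface: green when correct, shaking when wrong.
        return "green" if self.added_vowels == 0 else "shake"

    def prompt(self):
        if self.added_vowels == 0:
            return f"Correct: '{self.word}' was read with {self.expected_syllables} syllable(s)."
        return (f"'{self.word}' has {self.expected_syllables} syllable(s), "
                f"but it was read as {self.spoken_syllables}.")

feedback = SyllableFeedback("bed", expected_syllables=1, added_vowels=1)
```

For the "bed" example in the background section, this record reports two spoken syllables and a shaking circle.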
The pronunciation correction device provided by an embodiment of the present invention is introduced below. The pronunciation correction device described below and the pronunciation correction method described above may be referred to in correspondence with each other.
Fig. 5 is a structural block diagram of the pronunciation correction device provided by an embodiment of the present invention. Referring to Fig. 5, the pronunciation correction device may include:
An acquisition module 100, configured to obtain audio data recorded for a predetermined word;
A detection module 200, configured to analyze the audio data to detect whether a vowel has been added at the end of a syllable;
A generation module 300, configured to generate, according to the detection result, feedback information indicating that the pronunciation of the predetermined word contains a syllable error.
As a specific embodiment, in the pronunciation correction device provided by the present application the detection module 200 is specifically configured to: analyze the audio data to detect the final consonant of each syllable in the audio data; and, after a final consonant is detected, detect whether the audio data immediately following the final consonant is periodic and, if so, determine that a vowel has been added at the end of the syllable.
As a specific embodiment, in the pronunciation correction device provided by the present application the detection module 200 is specifically configured to: determine, from the text of the predetermined word, whether each syllable ends in a consonant; and, if a syllable of the predetermined word ends in a consonant, perform forced alignment by speech recognition to obtain the position of each phoneme and determine the position of the consonant, thereby detecting the final consonant of each syllable in the audio data.
On the basis of any of the above embodiments, the pronunciation correction device provided by the present application may further include: a feedback module, configured to display the feedback information with a marker in the display interface, and/or play a preset corresponding sound effect.
On the basis of any of the above embodiments, the pronunciation correction device provided by the present application may further include: a prompt module, configured to, after the audio data is analyzed to detect whether a vowel has been added at the end of a syllable, generate prompt information on the number of syllables contained in the actual pronunciation if it is determined that a vowel has been added at the end of a syllable.
The pronunciation correction device of this embodiment is used to implement the pronunciation correction method described above, so for specific embodiments of the device, refer to the embodiments of the pronunciation correction method above. For example, the acquisition module 100, the detection module 200 and the generation module 300 are used to implement steps S101, S102 and S103 of the above pronunciation correction method respectively. Their specific embodiments can therefore be found in the descriptions of the corresponding parts, and details are not repeated here.
The present application obtains audio data recorded for a predetermined word; analyzes the audio data to detect whether a vowel has been added at the end of a syllable; and, according to the detection result, generates feedback information indicating that the pronunciation of the predetermined word contains a syllable error. The application can automatically analyze the recorded audio data and detect whether a syllable error exists. It corrects the problem of adding a vowel at the end of a syllable in a unified way, through an understanding of syllables, and the resulting feedback information helps English learners fully understand the concept of a syllable, which eliminates the repetitive work of correcting sounds one by one and avoids wasted time. Moreover, the application does not require a teacher to give live demonstration lessons or face-to-face correction, so it overcomes the limits of learning time and place, and users can practice anytime and anywhere.
In addition, the present application also provides pronunciation correction equipment applied to a server 1. Fig. 6 is a structural block diagram of the pronunciation correction equipment provided by an embodiment of the present invention as applied to a server. The equipment includes:
A memory 11, configured to store a computer program;
A processor 12, configured to implement the following steps when executing the computer program: obtaining audio data recorded for a predetermined word; analyzing the audio data to detect whether a vowel has been added at the end of a syllable; and, according to the detection result, generating feedback information indicating that the pronunciation of the predetermined word contains a syllable error.
The memory 11 includes at least one type of readable storage medium, including flash memory, hard disk, multimedia card, card-type memory (for example, SD or DX memory), magnetic memory, magnetic disk, optical disc and the like. In some embodiments the memory 11 may be an internal storage unit of the pronunciation correction equipment, such as its hard disk. In other embodiments the memory 11 may be an external storage device of the pronunciation correction equipment, such as a plug-in hard disk, Smart Media Card (SMC), Secure Digital (SD) card or flash card. Further, the memory 11 may include both an internal storage unit and an external storage device of the pronunciation correction equipment. The memory 11 can be used not only to store application software installed on the pronunciation correction equipment and various kinds of data, such as the code of the pronunciation correction program 01, but also to temporarily store data that has been output or is to be output.
In some embodiments the processor 12 may be a central processing unit (CPU), controller, microcontroller, microprocessor or other data processing chip, and is used to run program code stored in the memory 11 or to process data, for example to execute the pronunciation correction program 01.
Optionally, the processor 12 is configured to implement the following steps when executing the computer program: analyzing the audio data to detect the final consonant of each syllable in the audio data; and, after a final consonant is detected, detecting whether the audio data immediately following the final consonant is periodic and, if so, determining that a vowel has been added at the end of the syllable.
Optionally, the processor 12 is configured to implement the following steps when executing the computer program: determining, from the text of the predetermined word, whether each syllable ends in a consonant; and, if a syllable of the predetermined word ends in a consonant, performing forced alignment by speech recognition to obtain the position of each phoneme and determine the position of the consonant, thereby detecting the final consonant of each syllable in the audio data.
It is understood that in the embodiments of the present application the server can include but is not limited to: a single network server, a server group composed of multiple network servers, or a cloud, based on cloud computing, composed of a large number of computers or network servers.
In addition, the present application also provides a pronunciation correction equipment applied to a client 2. Fig. 7 is a structural block diagram of the pronunciation correction equipment applied to a client, as provided by an embodiment of the present invention. The equipment includes:
An audio collecting device 21, for recording audio data for a predetermined word;
A communication device 22, for sending the audio data to the server, so that the server analyzes the audio data, detects whether a vowel has been added at the end of a syllable, and, according to the detection result, generates feedback information indicating that the pronunciation of the predetermined word contains a syllable error; and for receiving the feedback information sent by the server;
A display device 23, for displaying the feedback information in a display interface.
Optionally, in the pronunciation correction equipment provided by the embodiments of the present application, the display device may also be used to: display the feedback information in a marked form in the display interface, and/or play preset corresponding audio.
Optionally, in the pronunciation correction equipment provided by the embodiments of the present application, the display device may also be used to: after the audio data has been analyzed and it has been detected whether a vowel has been added at the end of a syllable, generate, if such an added vowel exists, prompt information on the number of syllables contained in the actual pronunciation.
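The syllable-count prompt described above might be assembled as in the following sketch. It assumes that each vowel added after a final consonant contributes exactly one extra syllable, which the application does not state explicitly; the function name, field names, and message wording are hypothetical.

```python
def build_syllable_feedback(word, expected_syllables, added_vowels):
    """Build prompt information on the syllable count of the actual pronunciation.

    Assumption: every added vowel forms one extra syllable. Returns None
    when no added vowel was detected, so no prompt is needed.
    """
    if added_vowels <= 0:
        return None
    actual = expected_syllables + added_vowels
    return {
        "word": word,
        "expected_syllables": expected_syllables,
        "actual_syllables": actual,
        "message": (
            f"'{word}' has {expected_syllables} syllable(s), "
            f"but {actual} were pronounced."
        ),
    }
```

For example, if a learner pronounces the one-syllable word "good" with a vowel appended after the final "d", the prompt would report two pronounced syllables against one expected.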
It can be understood that, in the embodiments of the present application, the client may include, but is not limited to: a smartphone, a tablet computer, an MP4 player, an MP3 player, a PC, a PDA, a wearable device, a head-mounted display device, and the like.
Further, the present application also provides a pronunciation correction system. As shown in Fig. 8, the system includes any of the servers 1 described above and any of the clients 2 described above. A user can carry out pronunciation learning through the client. The client can show the content to be learned in its display interface, and can also output audio content in speech form to the user through an audio playing device such as a loudspeaker. When the user carries out spoken pronunciation learning, the client collects the audio data of the user's pronunciation through the audio collecting device and sends the audio data to the server, which carries out the pronunciation correction process. After the server has analyzed the audio data and obtained the feedback information, it sends the feedback information to the client. The display device of the client then displays the feedback information, providing the user with visual auxiliary information.
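The client and server exchange described above can be sketched in-process as follows. The application does not specify a transport or message format; the JSON payload, the field names, and the functions `server_handle` and `client_round_trip` are assumptions for illustration, and the detector is passed in as a placeholder rather than implemented here.

```python
import json


def server_handle(request_json, detect_added_vowel):
    """Server side: analyze the audio and return feedback as JSON.

    detect_added_vowel is a placeholder callable (word, samples) -> bool.
    """
    request = json.loads(request_json)
    has_error = detect_added_vowel(request["word"], request["samples"])
    return json.dumps({"word": request["word"], "syllable_error": has_error})


def client_round_trip(word, samples, send):
    """Client side: send recorded audio and return the text to display.

    send is a placeholder for the communication device: it takes the
    request JSON and returns the server's response JSON.
    """
    request_json = json.dumps({"word": word, "samples": samples})
    response = json.loads(send(request_json))
    if response["syllable_error"]:
        return f"'{word}': extra vowel detected at a syllable end"
    return f"'{word}': no syllable error detected"
```

In a real deployment `send` would wrap a network call; here it can simply call `server_handle` directly, which keeps the sketch runnable without any network dependency.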
In addition, the present application also provides a computer readable storage medium. A computer program is stored on the computer readable storage medium, and when the computer program is executed by a processor, the steps of any of the pronunciation correction methods described above are implemented.
The pronunciation correction equipment, pronunciation correction system, and computer readable storage medium provided herein correspond to the foregoing method. Those skilled in the art will readily understand that, for convenience and brevity of description, the specific working processes of the system, devices, and units described above can be found in the corresponding processes of the foregoing method embodiments and are not repeated here.
In summary, the present application obtains audio data recorded for a predetermined word; analyzes the audio data to detect whether a vowel has been added at the end of a syllable; and, according to the detection result, generates feedback information indicating that the pronunciation of the predetermined word contains a syllable error. The application can automatically analyze the recorded audio data and detect whether a syllable error exists. It corrects the problem of vowels added at syllable ends in a unified way, through an understanding of syllables. The resulting feedback information can help English learners fully understand the concept of a syllable, eliminates the repeated work of correcting sounds one by one, and avoids wasted time. Moreover, the application does not require a teacher to give live demonstrations or face-to-face correction, so it overcomes the limitations of learning time and place, and users can practice anytime and anywhere.
The embodiments in this specification are described in a progressive manner. Each embodiment focuses on its differences from the other embodiments, and the same or similar parts of the embodiments may be cross-referenced. Because the devices disclosed in the embodiments correspond to the methods disclosed in the embodiments, their description is relatively brief; for relevant details, refer to the description of the method.
Those skilled in the art will further appreciate that the units and algorithm steps of the examples described in connection with the embodiments disclosed herein can be implemented in electronic hardware, computer software, or a combination of the two. To clearly illustrate the interchangeability of hardware and software, the composition and steps of each example have been described above generally in terms of function. Whether these functions are implemented in hardware or software depends on the specific application and the design constraints of the technical solution. Those skilled in the art may use different methods to implement the described functions for each specific application, but such implementations should not be considered beyond the scope of the present invention.
The steps of a method or algorithm described in connection with the embodiments disclosed herein can be implemented directly in hardware, in a software module executed by a processor, or in a combination of the two. The software module can reside in random access memory (RAM), internal memory, read-only memory (ROM), electrically programmable ROM, electrically erasable programmable ROM, registers, a hard disk, a removable disk, a CD-ROM, or any other form of storage medium known in the technical field.
The pronunciation correction method, device, equipment, and computer readable storage medium provided by the present invention have been described in detail above. Specific examples are used herein to explain the principle and implementation of the invention, and the description of the above embodiments is intended only to help in understanding the method of the invention and its core idea. It should be pointed out that those of ordinary skill in the art can make several improvements and modifications to the present invention without departing from its principle, and these improvements and modifications also fall within the protection scope of the claims of the present invention.

Claims (10)

1. A pronunciation correction method, characterized by comprising:
Obtaining audio data recorded for a predetermined word;
Analyzing the audio data and detecting whether a vowel has been added at the end of a syllable;
According to the detection result, generating feedback information indicating that the pronunciation of the predetermined word contains a syllable error.
2. The pronunciation correction method according to claim 1, characterized in that said analyzing the audio data and detecting whether a vowel has been added at the end of a syllable comprises:
Analyzing the audio data and detecting the final consonant of each syllable in the audio data;
After a final consonant is detected, detecting whether the adjacent audio data following the final consonant has sound periodicity, and if so, determining that a vowel has been added at the end of the syllable.
3. The pronunciation correction method according to claim 2, characterized in that said analyzing the audio data and detecting the final consonant of each syllable in the audio data comprises:
Determining, according to the word content of the predetermined word, whether each syllable ends in a consonant;
If a syllable in the predetermined word ends in a consonant, performing forced alignment by speech recognition to obtain the position of each phoneme and determine the position of the consonant, thereby detecting the final consonant of each syllable in the audio data.
4. The pronunciation correction method according to any one of claims 1 to 3, characterized in that, after said generating feedback information indicating that the pronunciation of the predetermined word contains a syllable error, the method further comprises:
Displaying the feedback information in a marked form in a display interface, and/or playing preset corresponding audio.
5. The pronunciation correction method according to claim 4, characterized in that, after said analyzing the audio data and detecting whether a vowel has been added at the end of a syllable, the method further comprises:
If so, generating prompt information on the number of syllables contained in the actual pronunciation.
6. A pronunciation correction device, characterized by comprising:
An obtaining module, for obtaining audio data recorded for a predetermined word;
A detection module, for analyzing the audio data and detecting whether a vowel has been added at the end of a syllable;
A generation module, for generating, according to the detection result, feedback information indicating that the pronunciation of the predetermined word contains a syllable error.
7. The pronunciation correction device according to claim 6, characterized by further comprising:
A feedback module, for displaying the feedback information in a marked form in a display interface, and/or playing preset corresponding audio, after the feedback information indicating that the pronunciation of the predetermined word contains a syllable error has been generated.
8. A pronunciation correction equipment, characterized in that it is applied to a server, the equipment comprising:
A memory, for storing a computer program;
A processor, for implementing the following steps when executing the computer program: obtaining audio data recorded for a predetermined word; analyzing the audio data and detecting whether a vowel has been added at the end of a syllable; and, according to the detection result, generating feedback information indicating that the pronunciation of the predetermined word contains a syllable error.
9. A pronunciation correction equipment, characterized in that it is applied to a client, the equipment comprising:
An audio collecting device, for recording audio data for a predetermined word;
A communication device, for sending the audio data to a server, so that the server analyzes the audio data, detects whether a vowel has been added at the end of a syllable, and, according to the detection result, generates feedback information indicating that the pronunciation of the predetermined word contains a syllable error; and for receiving the feedback information sent by the server;
A display device, for displaying the feedback information in a display interface.
10. A computer readable storage medium, characterized in that a computer program is stored on the computer readable storage medium, and when the computer program is executed by a processor, the steps of the pronunciation correction method according to any one of claims 1 to 5 are implemented.
CN201910406383.5A 2019-05-16 2019-05-16 A kind of pronunciation correction method, apparatus, equipment and storage medium Pending CN110097874A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910406383.5A CN110097874A (en) 2019-05-16 2019-05-16 A kind of pronunciation correction method, apparatus, equipment and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910406383.5A CN110097874A (en) 2019-05-16 2019-05-16 A kind of pronunciation correction method, apparatus, equipment and storage medium

Publications (1)

Publication Number Publication Date
CN110097874A true CN110097874A (en) 2019-08-06

Family

ID=67448281

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910406383.5A Pending CN110097874A (en) 2019-05-16 2019-05-16 A kind of pronunciation correction method, apparatus, equipment and storage medium

Country Status (1)

Country Link
CN (1) CN110097874A (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111047922A (en) * 2019-12-27 2020-04-21 浙江工业大学之江学院 Pronunciation teaching method, device, system, computer equipment and storage medium
CN113920803A (en) * 2020-07-10 2022-01-11 上海流利说信息技术有限公司 Error feedback method, device, equipment and readable storage medium

Citations (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5675705A (en) * 1993-09-27 1997-10-07 Singhal; Tara Chand Spectrogram-feature-based speech syllable and word recognition using syllabic language dictionary
CN1372247A (en) * 2001-02-27 2002-10-02 三菱电机株式会社 Speech sound coding method and coder thereof
CN1658283A (en) * 2004-02-20 2005-08-24 索尼株式会社 Method and apparatus for separating sound-source signal and method and device for detecting pitch
CN101105939A (en) * 2007-09-04 2008-01-16 安徽科大讯飞信息科技股份有限公司 Sonification guiding method
CN101145346A (en) * 2006-09-13 2008-03-19 富士通株式会社 Speech enhancement apparatus, speech recording apparatus and method, and computer readable recording medium
US20080082333A1 (en) * 2006-09-29 2008-04-03 Nokia Corporation Prosody Conversion
CN101231848A (en) * 2007-11-06 2008-07-30 安徽科大讯飞信息科技股份有限公司 Method for performing pronunciation error detecting based on holding vector machine
US20090004633A1 (en) * 2007-06-29 2009-01-01 Alelo, Inc. Interactive language pronunciation teaching
CN101939784A (en) * 2009-01-29 2011-01-05 松下电器产业株式会社 Hearing aid and hearing-aid processing method
CN102222498A (en) * 2005-10-20 2011-10-19 日本电气株式会社 Voice judging system, voice judging method and program for voice judgment
CN102254553A (en) * 2010-05-17 2011-11-23 阿瓦雅公司 Automatic normalization of spoken syllable duration
CN103405217A (en) * 2013-07-08 2013-11-27 上海昭鸣投资管理有限责任公司 System and method for multi-dimensional measurement of dysarthria based on real-time articulation modeling technology
CN106327923A (en) * 2016-10-28 2017-01-11 北京优瑞特教育科技有限公司 Auxiliary teaching aid for English learning and confirmation method of letter pronunciation of English words
CN108091185A (en) * 2018-01-12 2018-05-29 李勤骞 The word learning system and its word learning method combined into syllables based on syllable
CN108648527A (en) * 2018-05-15 2018-10-12 郑州琼佩电子技术有限公司 A kind of pronunciation of English matching correcting method

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20190806