CN110097874A - A kind of pronunciation correction method, apparatus, equipment and storage medium - Google Patents
A kind of pronunciation correction method, apparatus, equipment and storage medium Download PDFInfo
- Publication number
- CN110097874A CN110097874A CN201910406383.5A CN201910406383A CN110097874A CN 110097874 A CN110097874 A CN 110097874A CN 201910406383 A CN201910406383 A CN 201910406383A CN 110097874 A CN110097874 A CN 110097874A
- Authority
- CN
- China
- Prior art keywords
- syllable
- audio data
- pronunciation
- vowel
- consonant
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Classifications
-
- G—PHYSICS
- G09—EDUCATION; CRYPTOGRAPHY; DISPLAY; ADVERTISING; SEALS
- G09B—EDUCATIONAL OR DEMONSTRATION APPLIANCES; APPLIANCES FOR TEACHING, OR COMMUNICATING WITH, THE BLIND, DEAF OR MUTE; MODELS; PLANETARIA; GLOBES; MAPS; DIAGRAMS
- G09B5/00—Electrically-operated educational appliances
-
- G—PHYSICS
- G09—EDUCATION; CRYPTOGRAPHY; DISPLAY; ADVERTISING; SEALS
- G09B—EDUCATIONAL OR DEMONSTRATION APPLIANCES; APPLIANCES FOR TEACHING, OR COMMUNICATING WITH, THE BLIND, DEAF OR MUTE; MODELS; PLANETARIA; GLOBES; MAPS; DIAGRAMS
- G09B5/00—Electrically-operated educational appliances
- G09B5/06—Electrically-operated educational appliances with both visual and audible presentation of the material to be studied
- G09B5/065—Combinations of audio and video presentations, e.g. videotapes, videodiscs, television systems
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/02—Feature extraction for speech recognition; Selection of recognition unit
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/04—Segmentation; Word boundary detection
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/26—Speech to text systems
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L2015/088—Word spotting
Abstract
The invention discloses a kind of pronunciation correction methods, and the audio data of predetermined word typing is directed to by obtaining;Audio data is analyzed, detects whether the case where increasing vowel there are syllable end;According to testing result, generating predetermined pronunciation of words, there are the feedback informations of syllable mistake.The application can automatically analyze the audio data of typing, detect whether there is a situation where syllable mistake, unification is in a manner of understanding syllable come the problem of correcting syllable end Canadian dollar sound, obtained feedback information can assist English learner to fully understand the concept of syllable, the repeated work that sound is corrected one by one is eliminated, waste of time is avoided.Also, teacher can not needed using the application and carry out true man's demonstration lesson or correction face to face, therefore overcome the limitation in learning time and space, user can carry out relevant practice whenever and wherever possible.In addition, present invention also provides a kind of pronunciation correction device, equipment and computer readable storage mediums having above-mentioned technique effect.
Description
Technical field
The present invention relates to voice technology fields, more particularly to a kind of pronunciation correction method, apparatus, equipment and computer
Readable storage medium storing program for executing.
Background technique
With the development of science and technology, language learning application Internet-based has also obtained quick development.Some
In language learning application, application provider sends client for learning stuff by internet, and user obtains via client
Learning stuff carries out corresponding study.For language learning, other than learning grammar with vocabulary, articulation ability is wherein most
One of important ability.Under normal conditions, user can promote the articulation ability of itself by reading aloud, with modes such as readings.However,
User can not learn whether itself pronunciation is accurate in most cases.
Since simple or compound vowel of a Chinese syllable most of in Chinese is all vowel, so thering is part learner habitual can increase in English equivalents
Add a sound, such as the monosyllable end bed/bed/ Canadian dollar pronunciation at(be- " Tinkling "), has actually become a double-tone
Save word.
Traditional scheme is by teaching explanation syllable concept, as the concept base for learning other skills (such as stress)
Plinth not will do it special training.When there is syllable end Canadian dollar mail topic, traditional teaching method can be considered as phonetic symbol hair
The problem of sound (such as the above problem, will be considered that be /pronunciation of d/ is not correct enough), need one by one sound corrected, cause to repeat
It works more, time-consuming extremely long.
Summary of the invention
The object of the present invention is to provide a kind of pronunciation correction method, apparatus, equipment and computer readable storage medium, with
Solve the problems, such as that existing scheme needs sound correction one by one to lead to that repeated work is more, takes a long time.
In order to solve the above technical problems, the present invention provides a kind of pronunciation correction method, comprising:
Obtain the audio data for being directed to predetermined word typing;
The audio data is analyzed, detects whether the case where increasing vowel there are syllable end;
According to testing result, there are the feedback informations of syllable mistake for the generation predetermined pronunciation of words.
Optionally, described that the audio data is analyzed, detect whether the case where increasing vowel there are syllable end
Include:
The audio data is analyzed, the end consonant of each syllable in audio data is detected;
After detecting end consonant, whether the adjacent audio data after detecting end consonant has sound periodicity,
If it is, determining the case where increasing vowel there are syllable end.
Optionally, described that the audio data is analyzed, detect the end consonant packet of each syllable in audio data
It includes:
Whether the end that each syllable is determined according to the word content of the predetermined word is consonant;
If the end of each syllable is consonant in the predetermined word, carry out forcing cutting pair by speech recognition
Together, the position for obtaining each phoneme determines the position of consonant, to detect the end consonant of each syllable in audio data.
Optionally, in the generation predetermined pronunciation of words, there are after the feedback information of syllable mistake further include:
Display is identified to the feedback information in display interface, and/or plays preset corresponding audio.
Optionally, the audio data is analyzed described, detects whether that there are the feelings that syllable end increases vowel
After condition further include:
If it is, generating the prompt information of the included syllable quantity of practical pronunciation.
Present invention also provides a kind of pronunciation correction devices, comprising:
Module is obtained, for obtaining the audio data for being directed to predetermined word typing;
Detection module detects whether that there are the feelings that syllable end increases vowel for analyzing the audio data
Condition;
Generation module, for according to testing result, generating the predetermined pronunciation of words, there are the feedback informations of syllable mistake.
Optionally, further includes:
Feedback module is being shown for generating the predetermined pronunciation of words there are after the feedback information of syllable mistake
Interface is identified display to the feedback information, and/or plays preset corresponding audio.
Present invention also provides a kind of pronunciation correction equipment, are applied to server-side, and the equipment includes:
Memory, for storing computer program;
Processor realizes following steps when for executing the computer program: obtaining the sound for being directed to predetermined word typing
Frequency evidence;The audio data is analyzed, detects whether the case where increasing vowel there are syllable end;It is tied according to detection
Fruit, generating the predetermined pronunciation of words, there are the feedback informations of syllable mistake.
Present invention also provides a kind of pronunciation correction equipment, are applied to client, and the equipment includes:
Audio collecting device is directed to the audio data of predetermined word for typing;
Communication device, for the audio data to be sent to server-side, so that the server-side is to the audio data
It is analyzed, detects whether the case where increasing vowel there are syllable end;According to testing result, the predetermined pronunciation of words is generated
There are the feedback informations of syllable mistake;And receive the feedback information that the server-side is sent;
Display device, for showing the feedback information in the display interface.
Present invention also provides a kind of computer readable storage medium, meter is stored on the computer readable storage medium
The step of calculation machine program, the computer program realizes any of the above-described kind of pronunciation correction method when being executed by processor.
Pronunciation correction method provided by the present invention is directed to the audio data of predetermined word typing by obtaining;To audio
Data are analyzed, and detect whether the case where increasing vowel there are syllable end;According to testing result, predetermined pronunciation of words is generated
There are the feedback informations of syllable mistake.The application can automatically analyze the audio data of typing, detect whether that there are sounds
Save the situation of mistake, the unified feedback information energy in a manner of understanding syllable to obtain the problem of correcting syllable end Canadian dollar sound
It enough assists English learner to fully understand the concept of syllable, eliminates the repeated work that sound is corrected one by one, avoid the wave of time
Take.Also, teacher can not needed using the application and carry out true man's demonstration lesson or correction face to face, therefore overcome learning time
With the limitation in space, user can carry out relevant practice whenever and wherever possible.In addition, present invention also provides one kind to have above-mentioned skill
Pronunciation correction device, equipment and the computer readable storage medium of art effect.
Detailed description of the invention
It, below will be to embodiment or existing for the clearer technical solution for illustrating the embodiment of the present invention or the prior art
Attached drawing needed in technical description is briefly described, it should be apparent that, the accompanying drawings in the following description is only this hair
Bright some embodiments for those of ordinary skill in the art without creative efforts, can be with root
Other attached drawings are obtained according to these attached drawings.
Fig. 1 is a kind of flow chart of specific embodiment of pronunciation correction method provided herein;
Fig. 2 is the process signal that the case where increasing vowel there are syllable end is detected whether provided by the embodiment of the present application
Figure;
Fig. 3 is the flow chart of another specific embodiment of pronunciation correction method provided herein;
Fig. 4 is syllable exercise visual feedback example figure;
Fig. 5 is the structural block diagram of pronunciation correction device provided in an embodiment of the present invention;
Fig. 6 is pronunciation correction equipment application provided in an embodiment of the present invention in the structural block diagram of server-side;
Fig. 7 is pronunciation correction equipment application provided in an embodiment of the present invention in the structural block diagram of client;
Fig. 8 is the structural block diagram of pronunciation correction equipment provided in an embodiment of the present invention.
Specific embodiment
In order to enable those skilled in the art to better understand the solution of the present invention, with reference to the accompanying drawings and detailed description
The present invention is described in further detail.Obviously, described embodiments are only a part of the embodiments of the present invention, rather than
Whole embodiments.Based on the embodiments of the present invention, those of ordinary skill in the art are not making creative work premise
Under every other embodiment obtained, shall fall within the protection scope of the present invention.
The description and claims of this application and term " first ", " second ", " third ", " in above-mentioned attached drawing
The (if present)s such as four " are to be used to distinguish similar objects, without being used to describe a particular order or precedence order.It should manage
The data that solution uses in this way are interchangeable under appropriate circumstances, so that the embodiments described herein can be in addition to illustrating herein
Or the sequence other than the content of description is implemented.In addition, term " includes " and " having " and their any deformation, it is intended that
Cover it is non-exclusive include, for example, containing the process, method, system, product or equipment of a series of steps or units need not limit
In step or unit those of is clearly listed, but may include be not clearly listed or for these process, methods, produce
The other step or units of product or equipment inherently.
It should be noted that the description for being related to " first ", " second " etc. in the present invention is used for description purposes only, and cannot
It is interpreted as its relative importance of indication or suggestion or implicitly indicates the quantity of indicated technical characteristic.Define as a result, " the
One ", the feature of " second " can explicitly or implicitly include at least one of the features.In addition, the skill between each embodiment
Art scheme can be combined with each other, but must be based on can be realized by those of ordinary skill in the art, when technical solution
Will be understood that the combination of this technical solution is not present in conjunction with there is conflicting or cannot achieve when, also not the present invention claims
Protection scope within.
The embodiment of the present invention can be used for word pronunciation learning scene or hair in word pronunciation learning scene, especially language learning
Sound corrects scene, and wherein language includes but is not limited to the foreign languages such as English, French, German, Japanese and mandarin, Guangdong language, Sichuan
Hua Deng Chinese branch.The present embodiments relate to language learning scene for example to can be language learning software or language learning whole
Pronunciation assessment scene, the scenes such as pronunciation correction scene in end, are also possible to other language learning scenes, in the embodiment of the present invention
It does not limit.
The application scenarios of the embodiment of the present application are described in detail below, user can carry out phonetics by client
It practises, client can show user's content to be learned in the display interface, and can also be played by audios such as loudspeakers
Device exports the audio content of speech form to user.When user carries out the word pronunciation learning of voice, client can pass through sound
Frequency acquisition device acquires audio data when user pronunciation, so as to subsequent progress pronunciation correction operation.It is understood that executing
The main body of pronunciation correction operation can be client, or server-side, this does not influence the realization of the application.
Client can include but is not limited in the embodiment of the present invention: smart phone, tablet computer, MP4, MP3, PC,
PDA, wearable device and wear display equipment etc.;Server-side can include but is not limited to: single network server, multiple networks
The server group of server composition is based on cloud computing cloud consisting of a large number of computers or network servers.
In conjunction with above-mentioned application scenarios, a kind of flow chart of specific embodiment of pronunciation correction method provided herein
As shown in Figure 1, this method specifically includes:
Step S101: the audio data for being directed to predetermined word typing is obtained;
User can read aloud the predetermined word, the voice of the word to be practiced is directed to by client typing, by audio
The corresponding audio data of voice is obtained after acquisition device acquisition.Predetermined word can be single syllable words or multisyllable word,
This is without limitation.
Step S102: analyzing the audio data, detects whether the case where increasing vowel there are syllable end;
Detect whether that the process for the case where there are syllable end increases vowel is shown referring to provided by Fig. 2 the embodiment of the present application
It is intended to, detects whether that the process for the case where there are syllable end increases vowel can specifically include:
Step S1021: analyzing the audio data, detects the end consonant of each syllable in audio data;
Whether the end that each syllable is determined according to the word content of the predetermined word is consonant;If the booking list
In word there are the end of syllable be consonant, then by speech recognition carry out force cutting be aligned, obtain the position of each phoneme,
The position of consonant is determined, to detect the end consonant of each syllable in audio data.
Step S1022: after detecting end consonant, whether the adjacent audio data after detecting end consonant has sound
Sound is periodical, if it is, determining the case where increasing vowel there are syllable end.
After detecting end consonant, it can be further analyzed in the audio data in subsequent prefixed time interval,
Judge whether it has sound periodical.Prefixed time interval can be 50 milliseconds to 200 milliseconds.Detecting that end consonant opens
After beginning in 50 milliseconds to 200 milliseconds, the periodicity of sound is detected.This is because vowel is periodically to shake, consonant does not have
Periodically, therefore if it is detected that periodically being considered as more by force increasing vowel at syllable end, that is, there is syllable mistake.
The periodicity of sound can be calculated by the autocorrelation method of time domain.Related coefficient measurement refer to two not
With the degree that influences each other between event;And auto-correlation coefficient measurement is same event between two different times
Degree of correlation, vivid saying exactly measure oneself behavior over to oneself present influence.Determining by auto-correlation coefficient
To the periodicity of sound.
Step S103: according to testing result, generating the predetermined pronunciation of words, there are the feedback informations of syllable mistake.
Pronunciation correction method provided by the present invention is directed to the audio data of predetermined word typing by obtaining;To audio
Data are analyzed, and detect whether the case where increasing vowel there are syllable end;According to testing result, predetermined pronunciation of words is generated
There are the feedback informations of syllable mistake.The application can automatically analyze the audio data of typing, detect whether that there are sounds
Save the situation of mistake, the unified feedback information energy in a manner of understanding syllable to obtain the problem of correcting syllable end Canadian dollar sound
It enough assists English learner to fully understand the concept of syllable, eliminates the repeated work that sound is corrected one by one, avoid the wave of time
Take.Also, teacher can not needed using the application and carry out true man's demonstration lesson or correction face to face, therefore overcome learning time
With the limitation in space, user can carry out relevant practice whenever and wherever possible.
Based on any of the above embodiments, pronunciation correction method provided herein can further include:
The predetermined pronunciation of words is being generated there are after the feedback information of syllable mistake, the feedback information is being carried out in display interface
Mark display, and/or play preset corresponding audio.
Referring to Fig. 3, another specific embodiment of pronunciation correction method provided herein can be specifically included:
Step S201: the audio data for being directed to predetermined word typing is obtained;
Step S202: analyzing the audio data, detects whether the case where increasing vowel there are syllable end;
Step S203: according to testing result, generating the predetermined pronunciation of words, there are the feedback informations of syllable mistake;
Step S204: display is identified to the feedback information in display interface, and/or plays preset correspondence
Audio.
Further, the application is analyzed to the audio data, detects whether that there are syllable ends to increase vowel
The case where after can also include: if it is, generate it is practical pronounce included syllable quantity prompt information.
The situation of correcting errors of syllable when feeding back by display interface.Such as Fig. 4 syllable exercise visual feedback example figure institute
Show, by circle above interface indicate user it is practical pronounce syllable it is whether correct.When correct, the circle on interface shows green
Color simultaneously plays corresponding audio;When mistake, the circle on interface is shaken.Further, it is also possible to be included according to practical pronunciation is generated
The prompt information of syllable quantity, by voice and text prompt occur it is practical read several syllables, such as can be on display circle
It tells personally and knows that the single syllable words of script have been read as 2 syllables by user.
Pronunciation correction device provided in an embodiment of the present invention is introduced below, pronunciation correction device described below with
Above-described pronunciation correction method can correspond to each other reference.
Fig. 5 is the structural block diagram of pronunciation correction device provided in an embodiment of the present invention, can be with referring to Fig. 5 pronunciation correction device
Include:
Module 100 is obtained, for obtaining the audio data for being directed to predetermined word typing;
Detection module 200 detects whether that there are syllable ends to increase vowel for analyzing the audio data
Situation;
Generation module 300, for according to testing result, generating the predetermined pronunciation of words, there are the feedback letters of syllable mistake
Breath.
As a kind of specific embodiment, detection module 200 is specifically used in pronunciation correction device provided herein:
The audio data is analyzed, the end consonant of each syllable in audio data is detected;After detecting end consonant, inspection
Whether the adjacent audio data after surveying end consonant has sound periodicity, if it is, determining that there are the increases of syllable end
The case where vowel.
As a kind of specific embodiment, detection module 200 is specifically used in pronunciation correction device provided herein:
Whether the end that each syllable is determined according to the word content of the predetermined word is consonant;If existed in the predetermined word
The end of syllable is consonant, then carries out forcing cutting alignment by speech recognition, obtain the position of each phoneme, determine consonant
Position, to detect the end consonant of each syllable in audio data.
Based on any of the above embodiments, pronunciation correction device provided herein can further include:
Feedback module, for being identified display, and/or the preset diaphone of broadcasting to the feedback information in display interface
Effect.
Based on any of the above embodiments, pronunciation correction device provided herein can further include:
Cue module, for being analyzed to the audio data, after detecting whether the case where increasing vowel there are syllable end,
If it is determined that the case where increasing vowel there are syllable end, then generate the prompt information for the syllable quantity that practical pronunciation is included.
The pronunciation correction device of the present embodiment is for realizing pronunciation correction method above-mentioned, therefore in pronunciation correction device
The embodiment part of the visible pronunciation correction method hereinbefore of specific embodiment, for example, obtaining module 100, detection module
200, generation module 300 is respectively used to realize step S101, S102, S103 in above-mentioned pronunciation correction method, so, it is specific
Embodiment is referred to the description of corresponding various pieces embodiment, and details are not described herein.
The application is directed to the audio data of predetermined word typing by obtaining;Audio data is analyzed, is detected whether
The case where increasing vowel there are syllable end;According to testing result, generating predetermined pronunciation of words, there are the feedback letters of syllable mistake
Breath.The application can automatically analyze the audio data of typing, detect whether there is a situation where syllable mistake, unified to manage
The mode of syllable is solved come the problem of correcting syllable end Canadian dollar sound, obtained feedback information can assist English learner sufficiently to manage
The concept for solving syllable eliminates the repeated work that sound is corrected one by one, avoids waste of time.Also, use the application can be with
It does not need teacher and carries out true man's demonstration lesson or correction face to face, therefore overcome the limitation in learning time and space, user can be with
Relevant practice is carried out whenever and wherever possible.
In addition, being applied to server-side 1, Fig. 6 mentions for the embodiment of the present invention present invention also provides a kind of pronunciation correction equipment
The pronunciation correction equipment application of confession includes: in the structural block diagram of server-side, the equipment
Memory 11, for storing computer program;
Processor 12 realizes following steps when for executing the computer program: obtaining for predetermined word typing
Audio data;The audio data is analyzed, detects whether the case where increasing vowel there are syllable end;It is tied according to detection
Fruit, generating the predetermined pronunciation of words, there are the feedback informations of syllable mistake.
Wherein, memory 11 include at least a type of readable storage medium storing program for executing, the readable storage medium storing program for executing include flash memory,
Hard disk, multimedia card, card-type memory (for example, SD or DX memory etc.), magnetic storage, disk, CD etc..Memory 11
It can be the internal storage unit of pronunciation correction equipment, such as hard disk in some embodiments.Memory 11 is in other implementations
It is also possible to the External memory equipment of pronunciation correction equipment, such as plug-in type hard disk, intelligent memory card (Smart Media in example
Card, SMC), secure digital (Secure Digital, SD) card, flash card (Flash Card) etc..Further, memory
11 can also both including pronunciation correction equipment internal storage unit and also including External memory equipment.Memory 11 can not only be used
It is installed on the application software and Various types of data, such as the code of pronunciation correction program 01 etc. of pronunciation correction equipment in storage, may be used also
For temporarily storing the data that has exported or will export.
Processor 12 can be in some embodiments a central processing unit (Central Processing Unit,
CPU), controller, microcontroller, microprocessor or other data processing chips, the program for being stored in run memory 11
Code or processing data, such as execute pronunciation correction program 01 etc..
Optionally, the processor 12 is for being implemented as follows step when executing the computer program: to the sound
Frequency detects the end consonant of each syllable in audio data according to being analyzed;After detecting end consonant, detection end is auxiliary
Whether the adjacent audio data after sound has sound periodicity, if it is, determining that there are the feelings that syllable end increases vowel
Condition.
Optionally, the processor 12 is for being implemented as follows step when executing the computer program: according to described
The word content of predetermined word determines whether the end of each syllable is consonant;If in the predetermined word, there are the ends of syllable
Tail is consonant, then carries out forcing cutting alignment by speech recognition, obtain the position of each phoneme, determine the position of consonant,
To detect the end consonant of each syllable in audio data.
It is understood that server-side can include but is not limited in the embodiment of the present application: single network server, multiple
The server group of network server composition is based on cloud computing cloud consisting of a large number of computers or network servers.
In addition, being applied to client 2, Fig. 7 mentions for the embodiment of the present invention present invention also provides a kind of pronunciation correction equipment
The pronunciation correction equipment application of confession includes: in the structural block diagram of client, the equipment
Audio collecting device 21 is directed to the audio data of predetermined word for typing;
Communication device 22, for the audio data to be sent to server-side, so that the server-side is to the audio number
According to being analyzed, the case where increasing vowel there are syllable end is detected whether;According to testing result, the predetermined word hair is generated
There are the feedback informations of syllable mistake for sound;And receive the feedback information that the server-side is sent;
Display device 23, for showing the feedback information in the display interface.
Optionally, display device can be also used in pronunciation correction equipment provided by the embodiment of the present application: on display circle
It is identified display in face of the feedback information, and/or plays preset corresponding audio.
Optionally, display device can be also used in pronunciation correction equipment provided by the embodiment of the present application: to described
Audio data is analyzed, and after detecting whether the case where increasing vowel there are syllable end, is increased if there is syllable end
The case where vowel, then generates the prompt information for the syllable quantity that practical pronunciation is included.
It is understood that client can include but is not limited in the embodiment of the present application: smart phone, tablet computer,
MP4, MP3, PC, PDA, wearable device and wear display equipment etc..
Further, present invention also provides a kind of pronunciation correction systems, as shown in figure 8, the system includes any of the above-described
Kind server-side 1 and any of the above-described kind of client 2.User can carry out word pronunciation learning by client, and client can be aobvious
Show the content for showing that user is to be learned on interface, and voice can also be exported to user by audio playing apparatus such as loudspeakers
The audio content of form, when user carries out the word pronunciation learning of voice, client can acquire user by audio collecting device
Audio data when pronunciation, and audio data is sent to server-side, the process of pronunciation correction is carried out by server-side.In server-side
After being analyzed to audio data and obtain feedback information, which is sent to client.Pass through the aobvious of client
Showing device shows feedback information, provides a user vision auxiliary information.
In addition, being deposited on the computer readable storage medium present invention also provides a kind of computer readable storage medium
Computer program is contained, the computer program realizes any of the above-described kind of pronunciation correction method when being executed by processor the step of.
Pronunciation correction equipment, pronunciation correction system, computer readable storage medium and preceding method provided herein
It is corresponding.It is apparent to those skilled in the art that for convenience and simplicity of description, the system of foregoing description,
The specific work process of device and unit, can refer to corresponding processes in the foregoing method embodiment, and details are not described herein.
To sum up, the application is directed to the audio data of predetermined word typing by obtaining;Audio data is analyzed, is detected
The case where increasing vowel with the presence or absence of syllable end;According to testing result, generating predetermined pronunciation of words, there are the anti-of syllable mistake
Feedforward information.The application can automatically analyze the audio data of typing, detect whether there is a situation where syllable mistake, unified
The problem of syllable end Canadian dollar sound is corrected in a manner of understanding syllable, obtained feedback information can assist English learner to fill
The concept of sub-argument solution syllable eliminates the repeated work that sound is corrected one by one, avoids waste of time.Also, use the application
It can not need teacher and carry out true man's demonstration lesson or correction face to face, therefore overcome the limitation in learning time and space, user
Relevant practice can be carried out whenever and wherever possible.
Each embodiment in this specification is described in a progressive manner, the highlights of each of the examples are with it is other
The difference of embodiment, same or similar part may refer to each other between each embodiment.For being filled disclosed in embodiment
For setting, since it is corresponded to the methods disclosed in the examples, so being described relatively simple, related place is referring to method part
Explanation.
Professional further appreciates that, unit described in conjunction with the examples disclosed in the embodiments of the present disclosure
And algorithm steps, can be realized with electronic hardware, computer software, or a combination of the two, in order to clearly demonstrate hardware and
The interchangeability of software generally describes each exemplary composition and step according to function in the above description.These
Function is implemented in hardware or software actually, the specific application and design constraint depending on technical solution.Profession
Technical staff can use different methods to achieve the described function each specific application, but this realization is not answered
Think beyond the scope of this invention.
The step of method described in conjunction with the examples disclosed in this document or algorithm, can directly be held with hardware, processor
The combination of capable software module or the two is implemented.Software module can be placed in random access memory (RAM), memory, read-only deposit
Reservoir (ROM), electrically programmable ROM, electrically erasable ROM, register, hard disk, moveable magnetic disc, CD-ROM or technology
In any other form of storage medium well known in field.
Pronunciation correction method, apparatus provided by the present invention, equipment and computer readable storage medium are carried out above
It is discussed in detail.Used herein a specific example illustrates the principle and implementation of the invention, above embodiments
Explanation be merely used to help understand method and its core concept of the invention.It should be pointed out that for the common of the art
, without departing from the principle of the present invention, can be with several improvements and modifications are made to the present invention for technical staff, these
Improvement and modification are also fallen within the protection scope of the claims of the present invention.
Claims (10)
1. a kind of pronunciation correction method characterized by comprising
Obtain the audio data for being directed to predetermined word typing;
The audio data is analyzed, detects whether the case where increasing vowel there are syllable end;
According to testing result, there are the feedback informations of syllable mistake for the generation predetermined pronunciation of words.
2. pronunciation correction method as described in claim 1, which is characterized in that described to analyze the audio data, inspection
Surveying the case where increasing vowel with the presence or absence of syllable end includes:
The audio data is analyzed, the end consonant of each syllable in audio data is detected;
After detecting end consonant, whether the adjacent audio data after detecting end consonant has sound periodicity, if
It is then to determine the case where increasing vowel there are syllable end.
3. pronunciation correction method as claimed in claim 2, which is characterized in that described to analyze the audio data, inspection
The end consonant of each syllable includes: in survey audio data
Whether the end that each syllable is determined according to the word content of the predetermined word is consonant;
If in the predetermined word there are the end of syllable be consonant, by speech recognition carry out force cutting be aligned, obtain
To the position of each phoneme, the position of consonant is determined, to detect the end consonant of each syllable in audio data.
4. pronunciation correction method as described in any one of claims 1 to 3, which is characterized in that generate the booking list described
There are after the feedback information of syllable mistake for word pronunciation further include:
Display is identified to the feedback information in display interface, and/or plays preset corresponding audio.
5. pronunciation correction method as claimed in claim 4, which is characterized in that the audio data is analyzed described,
Detect whether there are syllable end increase vowel the case where after further include:
If it is, generating the prompt information of the included syllable quantity of practical pronunciation.
6. a kind of pronunciation correction device characterized by comprising
Module is obtained, for obtaining the audio data for being directed to predetermined word typing;
Detection module detects whether the case where increasing vowel there are syllable end for analyzing the audio data;
Generation module, for according to testing result, generating the predetermined pronunciation of words, there are the feedback informations of syllable mistake.
7. pronunciation correction method as claimed in claim 6, which is characterized in that further include:
Feedback module, for generating the predetermined pronunciation of words there are after the feedback information of syllable mistake, in display interface
Display is identified to the feedback information, and/or plays preset corresponding audio.
8. a kind of pronunciation correction equipment, which is characterized in that be applied to server-side, the equipment includes:
Memory, for storing computer program;
Processor realizes following steps when for executing the computer program: obtaining the audio number for being directed to predetermined word typing
According to;The audio data is analyzed, detects whether the case where increasing vowel there are syllable end;According to testing result, raw
At the predetermined pronunciation of words, there are the feedback informations of syllable mistake.
9. a kind of pronunciation correction equipment, which is characterized in that be applied to client, the equipment includes:
Audio collecting device is directed to the audio data of predetermined word for typing;
Communication device, for the audio data to be sent to server-side, so that the server-side carries out the audio data
Analysis detects whether the case where increasing vowel there are syllable end;According to testing result, the predetermined pronunciation of words is generated to exist
The feedback information of syllable mistake;And receive the feedback information that the server-side is sent;
Display device, for showing the feedback information in the display interface.
10. a kind of computer readable storage medium, which is characterized in that be stored with computer on the computer readable storage medium
Program realizes the step of the pronunciation correction method as described in any one of claim 1 to 5 when the computer program is executed by processor
Suddenly.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910406383.5A CN110097874A (en) | 2019-05-16 | 2019-05-16 | A kind of pronunciation correction method, apparatus, equipment and storage medium |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910406383.5A CN110097874A (en) | 2019-05-16 | 2019-05-16 | A kind of pronunciation correction method, apparatus, equipment and storage medium |
Publications (1)
Publication Number | Publication Date |
---|---|
CN110097874A true CN110097874A (en) | 2019-08-06 |
Family
ID=67448281
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201910406383.5A Pending CN110097874A (en) | 2019-05-16 | 2019-05-16 | A kind of pronunciation correction method, apparatus, equipment and storage medium |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN110097874A (en) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111047922A (en) * | 2019-12-27 | 2020-04-21 | 浙江工业大学之江学院 | Pronunciation teaching method, device, system, computer equipment and storage medium |
CN113920803A (en) * | 2020-07-10 | 2022-01-11 | 上海流利说信息技术有限公司 | Error feedback method, device, equipment and readable storage medium |
Citations (15)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5675705A (en) * | 1993-09-27 | 1997-10-07 | Singhal; Tara Chand | Spectrogram-feature-based speech syllable and word recognition using syllabic language dictionary |
CN1372247A (en) * | 2001-02-27 | 2002-10-02 | 三菱电机株式会社 | Speech sound coding method and coder thereof |
CN1658283A (en) * | 2004-02-20 | 2005-08-24 | 索尼株式会社 | Method and apparatus for separating sound-source signal and method and device for detecting pitch |
CN101105939A (en) * | 2007-09-04 | 2008-01-16 | 安徽科大讯飞信息科技股份有限公司 | Sonification guiding method |
CN101145346A (en) * | 2006-09-13 | 2008-03-19 | 富士通株式会社 | Speech enhancement apparatus, speech recording apparatus and method, and computer readable recording medium |
US20080082333A1 (en) * | 2006-09-29 | 2008-04-03 | Nokia Corporation | Prosody Conversion |
CN101231848A (en) * | 2007-11-06 | 2008-07-30 | 安徽科大讯飞信息科技股份有限公司 | Method for performing pronunciation error detecting based on holding vector machine |
US20090004633A1 (en) * | 2007-06-29 | 2009-01-01 | Alelo, Inc. | Interactive language pronunciation teaching |
CN101939784A (en) * | 2009-01-29 | 2011-01-05 | 松下电器产业株式会社 | Hearing aid and hearing-aid processing method |
CN102222498A (en) * | 2005-10-20 | 2011-10-19 | 日本电气株式会社 | Voice judging system, voice judging method and program for voice judgment |
CN102254553A (en) * | 2010-05-17 | 2011-11-23 | 阿瓦雅公司 | Automatic normalization of spoken syllable duration |
CN103405217A (en) * | 2013-07-08 | 2013-11-27 | 上海昭鸣投资管理有限责任公司 | System and method for multi-dimensional measurement of dysarthria based on real-time articulation modeling technology |
CN106327923A (en) * | 2016-10-28 | 2017-01-11 | 北京优瑞特教育科技有限公司 | Auxiliary teaching aid for English learning and confirmation method of letter pronunciation of English words |
CN108091185A (en) * | 2018-01-12 | 2018-05-29 | 李勤骞 | The word learning system and its word learning method combined into syllables based on syllable |
CN108648527A (en) * | 2018-05-15 | 2018-10-12 | 郑州琼佩电子技术有限公司 | A kind of pronunciation of English matching correcting method |
-
2019
- 2019-05-16 CN CN201910406383.5A patent/CN110097874A/en active Pending
Patent Citations (15)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5675705A (en) * | 1993-09-27 | 1997-10-07 | Singhal; Tara Chand | Spectrogram-feature-based speech syllable and word recognition using syllabic language dictionary |
CN1372247A (en) * | 2001-02-27 | 2002-10-02 | 三菱电机株式会社 | Speech sound coding method and coder thereof |
CN1658283A (en) * | 2004-02-20 | 2005-08-24 | 索尼株式会社 | Method and apparatus for separating sound-source signal and method and device for detecting pitch |
CN102222498A (en) * | 2005-10-20 | 2011-10-19 | 日本电气株式会社 | Voice judging system, voice judging method and program for voice judgment |
CN101145346A (en) * | 2006-09-13 | 2008-03-19 | 富士通株式会社 | Speech enhancement apparatus, speech recording apparatus and method, and computer readable recording medium |
US20080082333A1 (en) * | 2006-09-29 | 2008-04-03 | Nokia Corporation | Prosody Conversion |
US20090004633A1 (en) * | 2007-06-29 | 2009-01-01 | Alelo, Inc. | Interactive language pronunciation teaching |
CN101105939A (en) * | 2007-09-04 | 2008-01-16 | 安徽科大讯飞信息科技股份有限公司 | Sonification guiding method |
CN101231848A (en) * | 2007-11-06 | 2008-07-30 | 安徽科大讯飞信息科技股份有限公司 | Method for performing pronunciation error detecting based on holding vector machine |
CN101939784A (en) * | 2009-01-29 | 2011-01-05 | 松下电器产业株式会社 | Hearing aid and hearing-aid processing method |
CN102254553A (en) * | 2010-05-17 | 2011-11-23 | 阿瓦雅公司 | Automatic normalization of spoken syllable duration |
CN103405217A (en) * | 2013-07-08 | 2013-11-27 | 上海昭鸣投资管理有限责任公司 | System and method for multi-dimensional measurement of dysarthria based on real-time articulation modeling technology |
CN106327923A (en) * | 2016-10-28 | 2017-01-11 | 北京优瑞特教育科技有限公司 | Auxiliary teaching aid for English learning and confirmation method of letter pronunciation of English words |
CN108091185A (en) * | 2018-01-12 | 2018-05-29 | 李勤骞 | The word learning system and its word learning method combined into syllables based on syllable |
CN108648527A (en) * | 2018-05-15 | 2018-10-12 | 郑州琼佩电子技术有限公司 | A kind of pronunciation of English matching correcting method |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111047922A (en) * | 2019-12-27 | 2020-04-21 | 浙江工业大学之江学院 | Pronunciation teaching method, device, system, computer equipment and storage medium |
CN113920803A (en) * | 2020-07-10 | 2022-01-11 | 上海流利说信息技术有限公司 | Error feedback method, device, equipment and readable storage medium |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN110085261B (en) | Pronunciation correction method, device, equipment and computer readable storage medium | |
CN107564511B (en) | Electronic device, phoneme synthesizing method and computer readable storage medium | |
CN110136747A (en) | A kind of method, apparatus, equipment and storage medium for evaluating phoneme of speech sound correctness | |
CN111951780B (en) | Multitasking model training method for speech synthesis and related equipment | |
CN110136748A (en) | A kind of rhythm identification bearing calibration, device, equipment and storage medium | |
CN108877764B (en) | Audio synthetic method, electronic equipment and the computer storage medium of talking e-book | |
US9489864B2 (en) | Systems and methods for an automated pronunciation assessment system for similar vowel pairs | |
CN109858038A (en) | A kind of text punctuate determines method and device | |
CN109697988B (en) | Voice evaluation method and device | |
CN109448704A (en) | Construction method, device, server and the storage medium of tone decoding figure | |
CN109166569B (en) | Detection method and device for phoneme mislabeling | |
CN101551952A (en) | Device and method for evaluating pronunciation | |
WO2019146753A1 (en) | Language proficiency assessment device using brain activity, and language proficiency assessment system | |
CN110097874A (en) | A kind of pronunciation correction method, apparatus, equipment and storage medium | |
CN110503941B (en) | Language ability evaluation method, device, system, computer equipment and storage medium | |
CN109448717B (en) | Speech word spelling recognition method, equipment and storage medium | |
CN109697975B (en) | Voice evaluation method and device | |
CN110085260A (en) | A kind of single syllable stress identification bearing calibration, device, equipment and medium | |
CN112309429A (en) | Method, device and equipment for explosion loss detection and computer readable storage medium | |
CN110349567B (en) | Speech signal recognition method and device, storage medium and electronic device | |
CN111951827B (en) | Continuous reading identification correction method, device, equipment and readable storage medium | |
CN115099222A (en) | Punctuation mark misuse detection and correction method, device, equipment and storage medium | |
CN108959163B (en) | Subtitle display method for audio electronic book, electronic device and computer storage medium | |
CN110428668B (en) | Data extraction method and device, computer system and readable storage medium | |
CN111026839B (en) | Method for detecting mastering degree of dictation word and electronic equipment |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20190806 |