KR101973791B1

KR101973791B1 - Method for correcting voice

Info

Publication number: KR101973791B1
Application number: KR1020170087238A
Authority: KR
Inventors: 조희정; 이지연
Original assignee: 조희정; 이지연
Priority date: 2017-07-10
Filing date: 2017-07-10
Publication date: 2019-04-29
Also published as: KR20190006348A

Abstract

An embodiment of the present invention is a speech evaluation and learning method in which a speech learning content app installed on a learner terminal trains a learner in speech. The method may include: a speech training content selection process in which the speech training content to be used for speech training is selected by the learner; a spontaneous speech test process in which a spontaneous speech test screen for the selected speech training content is provided so that a spontaneous speech test is performed; an imitative speech test process in which, when the spontaneous speech test result falls short of a preset spontaneous speech evaluation criterion, an imitative speech test screen is provided for testing imitative speech that follows the speech of the language model used in the speech training content, so that an imitative speech test is performed; a voice conversion test process in which, when the imitative speech test result falls short of a preset imitative speech evaluation criterion, a voice conversion test screen is provided for testing imitation of a voice-converted language model generated by reflecting the imitative speech test result, so that a voice conversion test is performed; and a speech learning process in which learner-customized learning material reflecting the results of the speech test, the imitative speech test, and the voice conversion test is provided so that the learner's speech learning takes place.

Description

{Method for correcting voice}

The present invention relates to a speech evaluation and learning method, and more particularly to a speech evaluation and learning method that provides speech learning.

In general methods of treating and educating people with developmental disabilities, treatment and education through specialized treatment institutions and experts offer professional treatment and can build trust, but suffer from a shortage of treatment specialists, high cost, poor accessibility, limited treatment time, and waiting times. Treatment and education through private treatment institutions and welfare facilities are inexpensive, but suffer from therapists' lack of expertise, poor facilities, weak links with specialized institutions, and waiting times. Treatment and education using PC-based therapy software reduce cost and improve accessibility and convenience, but suffer from weak links with specialized institutions and from a pronunciation-centered learning approach that does not take the learner's individual characteristics into account.

Despite the importance of early detection and early intensive intervention in language development, these conventional methods of treating and educating people with developmental disabilities suffer from poor accessibility, delayed treatment due to longer waiting times, insufficient speech therapy time due to the economic burden, and weak links with experts. As a result, immediate feedback on treatment results is difficult, and professional evaluation and advice are lacking.

Meanwhile, language education is needed for the children of multicultural families, whose number has been increasing rapidly in recent years, but in most cases it is available only by visiting a specialized institution. In addition, although the number of persons with disabilities increases every year, welfare facilities and treatment/education institutions remain markedly insufficient in proportion.

Because treatment/education institutions are so scarce, the quality of treatment/education services may decline, and insufficient treatment/education time delays the start of treatment/education or prolongs its duration.

Accordingly, language learning content has recently become available, but its effectiveness for language learning is limited. In conventional methods of producing language learning content and providing it to users, a given recording script is mostly produced with only one voice and one speaking rate, and the approach is limited to lecturing on and modeling the pronunciation of the language model.

Moreover, because the speaking rate mostly follows the normal pronunciation pattern and rate of a native speaker, providing content in this single form cannot be expected to be effective for language learning, even if the convenient repeat-playback functions of computers and the Internet are used.

Learners who need speech therapy and education may be unskilled at pronunciation, but they are also affected by insufficient auditory information processing ability. They lack sensitivity to language and the ability to discriminate the sounds of their native language, and they have limited auditory information processing abilities such as auditory attention span and auditory memory span, so their pronunciation accuracy may vary with the rate at which speech is presented. A system is therefore needed that, based on the language information processing process and on objective data about the learner's speech perception level, identifies and adjusts the characteristics of the speech stimuli the learner can hear accurately, thereby helping the learner recognize and speak the language accurately. Speaking practice also needs to be fun and to proceed naturally so that the learner can practice sufficiently until mastery. Various learning games driven by the learner's verbal responses can therefore be expected to improve language learning through repeated practice and to promote spontaneous speech.

Korean Patent Laid-Open Publication No. 10-2010-0005177

The technical problem addressed by the present invention is to provide a speech evaluation and learning method that enables learners who need language diagnosis and treatment (infants with slow expressive language development, people with language disorders including hearing impairment and articulation disorders, elderly people with language problems due to hearing loss, people with developmental disabilities, and the like) to learn speech.

The speech training content selection process may be characterized by selecting a learning target level of the speech to be used for speech training.

The spontaneous speech test may include: a spontaneous speech display process of displaying a picture or text that can elicit a phoneme, syllable, word, or sentence of spontaneous speech matching the learning target level; a spontaneous speech preprocessing process of receiving the voice of the learner uttering the spontaneous speech and removing noise; a spontaneous speech response evaluation process of performing a response evaluation on the noise-removed spontaneous speech; a spontaneous speech pronunciation evaluation process of performing a pronunciation evaluation and providing a feedback result when the response evaluation meets the spontaneous speech evaluation criterion; and a process of moving to the imitative speech test, in which feedback of spontaneous speech test failure is provided and the method proceeds to the imitative speech test when the response evaluation falls short of the spontaneous speech evaluation criterion.

The imitative speech test process may include: an imitative speech presentation process of displaying a picture or text that can elicit a phoneme, syllable, word, or sentence of imitative speech matching the learning target level and having the learner follow the speech of the language model; an imitative speech preprocessing process of receiving the voice of the learner uttering the imitative speech and removing noise; an imitative speech response evaluation process of performing a response evaluation on the noise-removed imitative speech; an imitative speech pronunciation evaluation process of performing a pronunciation evaluation and providing a feedback result when the response evaluation meets the imitative speech evaluation criterion; and a process of moving to the voice conversion test, in which feedback of imitative speech test failure is provided and the method proceeds to the voice conversion test when the response evaluation falls short of the imitative speech evaluation criterion.

The voice conversion test process may include: a voice conversion generation process of generating and presenting a voice conversion at the learning target level by reflecting the imitative speech test result; a voice conversion presentation process of presenting the speech of the voice-converted language model; a voice conversion preprocessing process of receiving the voice of the learner speaking along with the voice conversion and removing noise; a voice conversion pronunciation evaluation feedback process of performing a response evaluation on the noise-removed learner voice and, when it meets a preset voice conversion evaluation criterion, performing a pronunciation evaluation and providing a feedback result; and a voice conversion response evaluation feedback process of providing feedback of voice conversion test failure when the response evaluation of the learner voice following the voice conversion falls short of the voice conversion evaluation criterion.

The voice conversion test process may be characterized by repeating the voice conversion generation process, the voice conversion preprocessing process, the voice conversion pronunciation evaluation feedback process, and the voice conversion response evaluation feedback process until the learner's voice response meets a preset criterion.
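The repeat-until-criterion behavior described above can be sketched in Python as follows. This is only an illustration under stated assumptions: the function name `run_voice_conversion_test`, the two callbacks, the numeric score, and the `max_rounds` safety cap are all hypothetical and do not appear in the patent, which specifies neither how a conversion is generated nor how a response is scored.

```python
from typing import Callable, List


def run_voice_conversion_test(
    generate: Callable[[int], str],
    record_and_score: Callable[[str], float],
    criterion: float,
    max_rounds: int = 5,
) -> List[float]:
    """Repeat generate -> present -> evaluate until the criterion is met.

    `generate` stands in for the voice conversion generation process and
    `record_and_score` for preprocessing plus evaluation of the learner's
    attempt. `max_rounds` is an assumed safety cap, not part of the patent.
    """
    scores: List[float] = []
    for round_no in range(1, max_rounds + 1):
        stimulus = generate(round_no)        # new voice-converted model speech
        score = record_and_score(stimulus)   # learner's attempt, scored
        scores.append(score)
        if score >= criterion:               # preset criterion met: stop
            break
    return scores
```

Returning the full list of scores, rather than only the last one, keeps the per-round results available for whatever feedback is shown to the learner.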

The response evaluation may comprise a verbal response evaluation and a nonverbal response evaluation, the nonverbal response evaluation assessing response speed, screen switching, and gaze handling; and the pronunciation evaluation may comprise a language-model-based pronunciation evaluation at the phoneme, syllable, word, and sentence levels together with an evaluation of the learner's learning target criterion and learning level.

The speech learning process may include: a process of providing the learner-customized learning material; a process of receiving learner information, including the voice, mouth shape, tongue shape, and breathing of the learner studying the learner-customized learning material; a process of removing noise from the learner information; a process of performing a learning evaluation of whether the noise-removed learner information meets a preset learning evaluation criterion; a process of, when the learning evaluation falls short of the learning evaluation criterion, modifying the learner-customized learning material and repeating the learner information input and the learning evaluation; and a speech learning game process of, when the learning evaluation meets the learning evaluation criterion, running a speech learning game for further learning.
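The evaluate, revise, and repeat cycle of the speech learning process can be illustrated with a short Python sketch. Everything named here is an assumption made for illustration: the function `run_speech_learning`, the callbacks `evaluate` and `revise`, the numeric criterion, and the `max_attempts` cap are hypothetical, since the patent does not define how material is scored or modified.

```python
from typing import Any, Callable, Dict


def run_speech_learning(
    material: Any,
    evaluate: Callable[[Any], float],
    revise: Callable[[Any, float], Any],
    criterion: float,
    max_attempts: int = 10,
) -> Dict[str, Any]:
    """Present material, evaluate the learner, and revise until the criterion is met.

    On success the next step is the speech learning game; otherwise the
    sketch simply reports that practice should continue. `max_attempts`
    is an assumed safety cap.
    """
    for attempt in range(1, max_attempts + 1):
        score = evaluate(material)            # learning evaluation
        if score >= criterion:
            return {"material": material, "attempts": attempt,
                    "next": "speech_learning_game"}
        material = revise(material, score)    # modify the customized material
    return {"material": material, "attempts": max_attempts,
            "next": "continue_practice"}
```

As a usage example, the material could be a playback rate in percent that is lowered by 10 after each failed attempt, which mirrors the patent's point that accuracy can depend on the presented speech rate.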

The speech learning game may be characterized by being composed of speech sounds the learner has already learned or is currently learning, so that winning or losing the game is determined by the learner's speech responses.

According to embodiments of the present invention, providing speech training enables learners who need language diagnosis and treatment to train their speech efficiently.

FIG. 1 is a perspective view of a learner terminal capable of speech evaluation and learning according to an embodiment of the present invention.
FIG. 2 is a block diagram of the configuration of a learner terminal according to an embodiment of the present invention.
FIG. 3 is a flowchart illustrating a speech evaluation and learning process according to an embodiment of the present invention.
FIG. 4 is a flowchart illustrating a spontaneous speech test process according to an embodiment of the present invention.
FIG. 5 is a diagram illustrating a preprocessing module according to an embodiment of the present invention.
FIG. 6 is a diagram illustrating a feedback module according to an embodiment of the present invention, and FIG. 7 is a diagram illustrating a speech evaluation module according to an embodiment of the present invention.
FIG. 8 is a flowchart illustrating an imitative speech test process according to an embodiment of the present invention.
FIG. 9 is a diagram illustrating a voice conversion test process according to an embodiment of the present invention.
FIG. 10 is a diagram illustrating a voice conversion module according to an embodiment of the present invention.
FIG. 11 is a flowchart illustrating a speech learning process according to an embodiment of the present invention.
FIG. 12 is a diagram illustrating a speech learning module according to an embodiment of the present invention.
FIG. 13 is an example of a speech learning game screen according to an embodiment of the present invention.

The advantages and features of the present invention, and the methods of achieving them, will become apparent with reference to the embodiments described in detail below together with the accompanying drawings. However, the present invention is not limited to the embodiments disclosed below and may be implemented in various different forms; the embodiments are provided to fully convey the scope of the invention to those of ordinary skill in the art to which the invention pertains, and the invention is defined only by the scope of the claims. In describing the present invention, detailed descriptions of related known techniques are omitted where they could obscure the gist of the invention.

FIG. 1 is a perspective view of a learner terminal capable of speech evaluation and learning according to an embodiment of the present invention, and FIG. 2 is a block diagram of the configuration of the learner terminal according to an embodiment of the present invention.

The present invention is a language learning means in which, when the learner speaks, the learner terminal 10 picks out only the inaccurate pronunciations, modifies and amplifies them so that the learner can hear the correct pronunciation, and plays them back; it then listens to the learner's speech again and automatically and repeatedly indicates whether it is inaccurate, ultimately allowing the pronunciation problem to be corrected.

To this end, an application (app) that enables the training of learners who need speech learning, such as those with language disorders or developmental disabilities, is installed on the learner terminal 10 of the present invention, and speech training is carried out through this application.

Although FIG. 1 illustrates the learner terminal 10 as a smartphone, the terminal may equally be a desktop PC, tablet PC, slate PC, notebook computer, digital broadcasting terminal, PDA (Personal Digital Assistant), PMP (Portable Multimedia Player), navigation device, or the like. The terminals to which the present invention can be applied are of course not limited to these types and may include a wide variety of terminals.

As shown in FIG. 2, the learner terminal 10 of the present invention may include a terminal communication unit 11, a terminal memory 12, a terminal display unit 13, a terminal input unit 14, and a terminal control unit 15 on which the speech training content app is installed.

The terminal communication unit 11 is a module that communicates over a mobile communication network. For mobile communication such as 3G or 4G, it includes an RF transmitter (not shown) that up-converts and amplifies the frequency of the signal to be transmitted wirelessly, an RF receiver (not shown) that low-noise amplifies the received wireless signal and down-converts its frequency, and the like.

The terminal memory 12 is a storage medium that stores the speech training content of the present invention (spontaneous speech, imitative speech, voice conversion, and the like), graphical user interface (GUI) screen information, and the like. The memory is a module capable of information input and output, such as flash memory, a CF card (Compact Flash Card), or an SD card (Secure Digital Card), and may be provided inside the device or in a separate device.

The terminal display unit 13 is a module that displays the spontaneous speech test screen, the imitative speech test screen, the voice conversion test screen, and the learner-customized learning material.

The terminal input unit 14 is a module that receives input from the learner through the spontaneous speech test screen, imitative speech test screen, voice conversion test screen, learner-customized learning material, and the like displayed on the terminal display unit 13. The terminal input unit 14 and the terminal display unit 13 may be implemented together as a single touchscreen panel. The touchscreen panel is a display window provided on the front of the terminal that offers a touchscreen capable of input and display at the same time, shows the working screen, and displays a graphical user interface (GUI) for interaction with the learner.

The terminal control unit 15 is implemented as an MCU (Main Control Unit) that controls each functional module of the learner terminal 10, and is the module on which the speech learning content app of the present invention is installed. For reference, a learner terminal 10 implemented as a smartphone or the like lets the user install, add, or delete hundreds of different applications as desired; the user can even create a desired application directly and can build a suitable interface through the various applications. The speech learning content app can therefore be downloaded from Google Market, Apple Store, or the like and installed on the smartphone.

FIG. 3 is a flowchart illustrating a speech evaluation and learning process according to an embodiment of the present invention; FIG. 4 is a flowchart illustrating a spontaneous speech test process; FIG. 5 is a diagram illustrating a preprocessing module; FIG. 6 is a diagram illustrating a feedback module; FIG. 7 is a diagram illustrating a speech evaluation module; FIG. 8 is a flowchart illustrating an imitative speech test process; FIG. 9 is a diagram illustrating a voice conversion test process; FIG. 10 is a diagram illustrating a voice conversion module; FIG. 11 is a flowchart illustrating a speech learning process; FIG. 12 is a diagram illustrating a speech learning module; and FIG. 13 is an example of a speech learning game screen, each according to an embodiment of the present invention.

In a speech evaluation and learning method in which a speech learning content app installed on a learner terminal trains a learner in speech, the speech evaluation and learning process of the present invention may include a speech training content selection process (S100), a spontaneous speech test process (S200), an imitative speech test process (S300), a voice conversion test process (S400), and a speech learning process (S500).
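The way the test stages escalate can be sketched in Python as follows. This is a simplified reading, assuming that the imitative speech test (S300) runs only after a failed spontaneous speech test (S200), that the voice conversion test (S400) runs only after a failed imitative speech test, and that the speech learning process (S500) always follows. The `Stage` enum and `plan_stages` function are hypothetical names introduced for illustration only.

```python
from enum import Enum
from typing import List


class Stage(Enum):
    """Step labels taken from the flowchart numbering (S200 to S500)."""
    SPONTANEOUS = "S200"
    IMITATIVE = "S300"
    VOICE_CONVERSION = "S400"
    LEARNING = "S500"


def plan_stages(passed_spontaneous: bool, passed_imitative: bool) -> List[Stage]:
    """Return the stages a learner goes through after content selection (S100).

    Each later test is entered only when the previous one fell short of its
    evaluation criterion; the learning process is assumed to always follow.
    """
    stages = [Stage.SPONTANEOUS]
    if not passed_spontaneous:
        stages.append(Stage.IMITATIVE)
        if not passed_imitative:
            stages.append(Stage.VOICE_CONVERSION)
    stages.append(Stage.LEARNING)
    return stages
```

A learner who fails both the spontaneous and imitative tests therefore passes through all four stages, while a learner who passes the spontaneous test goes straight to the learning stage in this sketch.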

The speech training content selection process (S100) is a process in which the speech training content to be used for speech training is selected by the learner, including the learning target level of the speech to be used for training.

To describe the selection of speech training content in detail, speech sounds can be selected either at the vocabulary-topic level or at the pronunciation level. Selection by vocabulary topic suits infants and learners with limited vocabulary. In early language development, comprehension must precede expression, so a new word or pronunciation is easier to teach when the vocabulary topic is one the learner already knows or likes, because words the learner has often heard and spoken are easier to learn.

Selection at the pronunciation level suits learners whose focus is pronunciation. For these learners, presenting learning material grouped by similar pronunciation type is more effective for pronunciation training. The content is therefore presented in these two levels so that the learner can quickly find the speech sounds to be learned. The learner's search data is used, together with the speech evaluation results, to characterize the learner and to compose customized learning material. The test utterance level is then selected to determine the final speech sound. Depending on the learner's speech level, one of the phoneme, syllable, word, and sentence levels can be selected. At the syllable level, the position of the consonant within the syllable (initial, medial, or final) can be selected; at the word level, the number of syllables and the position of the target sound within the word can be selected. For example, for the same target sound 'ㄱ', a word such as '고추' (pepper), in which 'ㄱ' is the initial consonant of the first syllable, or a word such as '아기' (baby), in which 'ㄱ' is the initial consonant of the second syllable, may be selected. At the sentence level, the number of words in a sentence or the number of sentences can be selected. This is because even the same speech sound differs in pronunciation accuracy, sound intensity, and so on depending on its position. Because the learner's pronunciation accuracy for the same sound varies with the position of the target stimulus and the number of syllables uttered, pronunciation is analyzed per phoneme, syllable, word, and sentence unit. For example, depending on the position of '사' in '사탕' (candy) and '이사' (moving house), a learner may pronounce '사' correctly in '사탕' as /사탕/ yet fail to pronounce it correctly in '이사', producing /이아/. After the utterance level is selected, the target stimulus is determined; that is, the final speech stimulus, comprising the speech sound and the utterance level, is determined for evaluating and training the learner's speech.
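The idea of locating a target sound by syllable position can be made concrete with a short Python sketch. It relies on the standard arithmetic layout of precomposed Hangul syllables in Unicode (19 initial consonants, 21 vowels, 28 final slots, starting at U+AC00). The function names `onset_of` and `onset_positions` are hypothetical, and the patent does not say that its app selects words this way; the sketch only shows how the '고추' and '아기' examples could be distinguished programmatically.

```python
HANGUL_BASE = 0xAC00   # '가', the first precomposed Hangul syllable
HANGUL_LAST = 0xD7A3   # '힣', the last precomposed Hangul syllable
# The 19 initial consonants in Unicode order.
ONSETS = "ㄱㄲㄴㄷㄸㄹㅁㅂㅃㅅㅆㅇㅈㅉㅊㅋㅌㅍㅎ"


def is_hangul_syllable(ch: str) -> bool:
    """True if `ch` is a single precomposed Hangul syllable."""
    return len(ch) == 1 and HANGUL_BASE <= ord(ch) <= HANGUL_LAST


def onset_of(syllable: str) -> str:
    """Return the initial consonant of one precomposed Hangul syllable."""
    if not is_hangul_syllable(syllable):
        raise ValueError("not a precomposed Hangul syllable: %r" % syllable)
    # Each initial consonant spans 21 vowels * 28 final slots = 588 code points.
    return ONSETS[(ord(syllable) - HANGUL_BASE) // 588]


def onset_positions(word: str, target: str) -> list:
    """1-based syllable positions in `word` whose initial consonant is `target`."""
    return [
        index
        for index, ch in enumerate(word, start=1)
        if is_hangul_syllable(ch) and onset_of(ch) == target
    ]
```

With this sketch, `onset_positions("고추", "ㄱ")` gives `[1]` and `onset_positions("아기", "ㄱ")` gives `[2]`, matching the first-syllable and second-syllable cases described above.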

Meanwhile, the spontaneous speech test process (S200) provides the spontaneous speech test screen of the selected speech training content so that the spontaneous speech test is performed. Here, 'spontaneous speech' means, as is known, speech in which a person expresses his or her own thoughts, as opposed to speech that is prompted or learned by repetition. For example, when asked "How old are you?", the child answers "I'm four" unaided, without the mother or another person prompting "Say you're four. Four."; or the child expresses a wish in words, as in "I want to go to the amusement park." In other words, the speaker can communicate independently, without prompting or instruction.

As shown in FIG. 4, the spontaneous speech test may include: a spontaneous speech display process (S210) of displaying a picture or text that can elicit a phoneme, syllable, word, or sentence of the language matching the learning target level; a spontaneous speech preprocessing process (S230) of receiving (S220) the voice of the learner uttering the spontaneous speech and removing noise; a spontaneous speech response evaluation process (S240) of performing a response evaluation on the noise-removed spontaneous speech; a spontaneous speech pronunciation evaluation process of performing a pronunciation evaluation (S250) and providing a feedback result when the response evaluation meets the spontaneous speech evaluation criterion (S240a); and a process of moving to the imitative speech test, in which feedback of spontaneous speech test failure (S261) is provided and the method proceeds to the imitative speech test (S300) when the response evaluation (S240) falls short of the spontaneous speech evaluation criterion (S240b).

That is, the spontaneous speech test analyzes the learner's spontaneous speech response when a picture or text containing the test target pronunciation is presented. If the learner's speech response is correct in the response evaluation, the process proceeds to the pronunciation evaluation; if the pronunciation is accurate in that evaluation, the process proceeds to another test, and if there is a pronunciation error, it proceeds to the imitation speech test. If the learner's speech response is incorrect, the process likewise proceeds to the imitation speech test.
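The branching described above can be summarized in a short sketch. The enum and function names are hypothetical and do not appear in the specification, and the two boolean inputs stand in for the response evaluation and pronunciation evaluation results, whose criteria the text does not define.

```python
from enum import Enum


class NextStep(Enum):
    OTHER_TEST = "other test"
    IMITATION_TEST = "imitation speech test (S300)"


def route_spontaneous_result(response_correct: bool,
                             pronunciation_accurate: bool) -> NextStep:
    """Decide where to go after the spontaneous speech test (S200).

    Both flags are assumed to come from the response evaluation (S240)
    and the pronunciation evaluation (S250); how they are computed is
    outside this sketch.
    """
    if not response_correct:
        # Incorrect response (S240b): pronunciation is not evaluated.
        return NextStep.IMITATION_TEST
    if pronunciation_accurate:
        return NextStep.OTHER_TEST
    # Correct response, but a pronunciation error was found.
    return NextStep.IMITATION_TEST
```

Only a correct response with accurate pronunciation leaves the imitation speech path; every other combination leads to S300.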

In the above, the spontaneous speech preprocessing process (S230) receives the voice of the learner uttering spontaneous speech and removes noise. As shown in FIG. 5, the preprocessing covers noise removal, feedback removal, speech enhancement, specific vector extraction, lip image extraction, tongue movement image extraction, and speech-to-text conversion. That is, the preprocessing performs noise removal, which reduces ambient noise so that the learner's voice is suitable for evaluation; feedback removal, which amplifies only the speech in the input audio and removes other noise; speech enhancement; and extraction of a specific vector. From the learner's input video, only the mouth shape and tongue movement portions are extracted to obtain information suitable for evaluation. The learner's voice is also converted to text.

In addition, the feedback on the spontaneous speech response evaluation and the feedback on the spontaneous speech pronunciation evaluation may be provided by the feedback module shown in FIG. 6.

The feedback module shown in FIG. 6 consists of response evaluation feedback, pronunciation evaluation feedback, oral evaluation feedback, and reward feedback.

The response evaluation feedback may indicate whether the learner's response during evaluation and learning is correct, using visual graphic feedback, auditory feedback, and 3D character feedback.

The pronunciation evaluation feedback consists of pronunciation evaluation feedback and graphic feedback. The pronunciation evaluation feedback presents the result of the pronunciation evaluation of the learner's voice as visual feedback, and the graphic feedback presents the spectrum, speed, stress, and intonation of the learner's voice graphically. These may be presented in the spontaneous speech test, the imitation speech test, and the voice conversion test used for speech evaluation, and may also be presented as learning material in the learning stage. The oral evaluation feedback presents the evaluation result for the learner's mouth shape and tongue movement as a video, with the points that need correction shown graphically. The reward feedback presents feedback to the learner in a form such as a score. The response evaluation feedback, pronunciation evaluation feedback, oral evaluation feedback, and reward feedback may also be used selectively, so that only the feedback needed for each test and learning stage is presented. For example, when both the response and the pronunciation are correct in the spontaneous speech test, only the response evaluation feedback, pronunciation evaluation feedback, and reward feedback may be presented.

In addition, the spontaneous speech response evaluation and the spontaneous speech pronunciation evaluation may be performed by the speech evaluation module shown in FIG. 7. For reference, the response evaluation and pronunciation evaluation in this speech evaluation module may likewise be applied to the response evaluation and pronunciation evaluation for imitation speech and for voice conversion, both described below.

The speech evaluation module is described with reference to FIG. 7.

The speech evaluation module consists of a response evaluation module, a pronunciation evaluation module, and an oral evaluation module.

The response evaluation analyzes the verbal and non-verbal responses to the target material.

The verbal response evaluation assesses correct and incorrect responses. A response is correct when the learner produces a vocal response corresponding to the target speech sound, and the process then proceeds to the pronunciation evaluation, which analyzes the learner's pronunciation. For example, for a picture stimulus of 'candy' (사탕, satang), a vocal response corresponding to the word, such as /사탕/, /아탕/, or /탕/, is accepted as correct. A response is incorrect when the learner gives no response, failing to express the target stimulus, or gives a response entirely unrelated to the target stimulus; in this case the process proceeds to the imitation speech test.
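As a rough illustration of the correct/incorrect distinction above, the following sketch labels a transcribed response. The specification does not state how relatedness to the target is measured; the string similarity from Python's `difflib` and the threshold of 0.3 are assumptions chosen only so that the examples in the text (/사탕/, /아탕/, /탕/) are accepted.

```python
from difflib import SequenceMatcher


def classify_response(target: str, transcript: str,
                      threshold: float = 0.3) -> str:
    """Label a transcribed response as 'correct', 'no_response', or 'unrelated'.

    The similarity measure and threshold are illustrative assumptions,
    not criteria taken from the specification.
    """
    spoken = transcript.strip()
    if not spoken:
        return "no_response"
    # ratio() is 2 * matched syllables / total syllables in both strings.
    similarity = SequenceMatcher(None, target, spoken).ratio()
    return "correct" if similarity >= threshold else "unrelated"
```

Here 'correct' means only that the response is accepted for pronunciation evaluation; a mispronounced attempt such as /아탕/ still counts.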

A non-response or unrelated response is routed to the imitation speech test because it may result from the learner lacking the vocabulary for the target material or being unable to read the text. Pronunciation evaluation is therefore not performed on incorrect responses. The non-verbal response evaluation analyzes response speed, screen transition speed, and gaze. The response speed analysis stores and analyzes how quickly the learner responds.

The screen transition speed analysis measures the time the learner takes to move to the next screen by touch, much like turning the pages of a book, in order to analyze the learner's individual learning pace. The gaze analysis uses eye tracking technology to analyze the learner's pattern of attention to the learning task and the duration of sustained attention. The non-verbal response evaluation data are used to analyze the learner's individual learning style, to explore how to compose and present the stimuli and learning materials most effective for the learner, and to revise the learning materials.
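The screen transition measurement can be sketched as follows, assuming the app records a timestamp in seconds each time the learner touches the screen to advance. The function names and the use of a simple mean as the summary are illustrative assumptions.

```python
from statistics import mean
from typing import List, Optional


def transition_intervals(touch_times: List[float]) -> List[float]:
    """Seconds elapsed between consecutive screen-advancing touches."""
    return [later - earlier
            for earlier, later in zip(touch_times, touch_times[1:])]


def mean_transition_time(touch_times: List[float]) -> Optional[float]:
    """Average time per screen, or None when fewer than two touches exist."""
    intervals = transition_intervals(touch_times)
    return mean(intervals) if intervals else None
```

For touches at 0, 2, 5, and 9 seconds, the intervals are 2, 3, and 4 seconds, giving a mean of 3 seconds per screen.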

The pronunciation evaluation module uses speech therapy evaluation criteria to analyze the characteristics of the differences between the language model and the learner's pronunciation. To evaluate pronunciation accuracy at each language level (phoneme, syllable, word, sentence), the evaluation includes analysis of substitution, omission, distortion, addition, repetition, prolongation, speech rate, and the like, as used in the assessment of articulation disorders and fluency disorders, together with analysis of the sound intensity of each speech sound. For example, when shown 'candy' (사탕), a learner may substitute /따/ for /사/ as in /따탕/ or pronounce the word inaccurately, but other error types also occur, such as /사- - - -탕/ (prolongation), /사사사탕/ (repetition of the same speech sound), /사아탕/ (addition of a speech sound), and /-탕/ (omission of a speech sound).
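Some of the error types listed above can be detected from a transcript alone. The sketch below aligns the produced syllables with the target using Python's `difflib.SequenceMatcher` and labels each difference. This alignment approach is an assumption, not the method of the specification, and it covers only substitution, omission, addition, and repetition; distortion, prolongation, speech rate, and intensity would require acoustic analysis.

```python
from difflib import SequenceMatcher
from typing import List


def classify_errors(target: str, produced: str) -> List[str]:
    """Return error labels found when aligning produced syllables to the target."""
    errors = []
    matcher = SequenceMatcher(None, target, produced, autojunk=False)
    for tag, i1, i2, j1, j2 in matcher.get_opcodes():
        if tag == "replace":
            errors.append("substitution")
        elif tag == "delete":
            errors.append("omission")
        elif tag == "insert":
            inserted = produced[j1:j2]
            neighbours = set()
            if j1 > 0:
                neighbours.add(produced[j1 - 1])
            if j2 < len(produced):
                neighbours.add(produced[j2])
            # Extra copies of an adjacent syllable count as repetition;
            # any other extra syllable counts as addition.
            if len(set(inserted)) == 1 and inserted[0] in neighbours:
                errors.append("repetition")
            else:
                errors.append("addition")
    return errors
```

With the target '사탕', the productions '따탕', '탕', '사아탕', and '사사사탕' are labeled substitution, omission, addition, and repetition respectively.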

Furthermore, even a learner who can pronounce the 'ㅅ' of '사탕' as /스/ and the 'ㅏ' as /아/ at the phoneme level may show a variety of errors that differ by utterance level: a phoneme omission at the syllable level, producing /아/ for '사'; a substitution error /다탕/ or an addition error /살탕/ at the word level for '사탕'; and a speech sound omission at the sentence level, producing /어제 마트에서 -탕을 샀어/ for '어제 마트에서 사탕을 샀어' ('I bought candy at the mart yesterday'). In addition, when a person speaks, sound intensity can vary with the position of each speech sound and with the length of the utterance. This affects both hearing speech sounds accurately and pronouncing them. Analyzing the language model and the learner's pronunciation can therefore serve as a basis for judging the learner's pronunciation error characteristics and listening characteristics. The learner's pronunciation errors may also differ by speech stimulus level: a spontaneous speech test with no speech stimulus, an imitation speech test with an accurate speech stimulus from the model language, and a test with a speech stimulus in which the voice of the model language has been converted.

These differences across stimulus levels provide information about the characteristics of the learner's pronunciation errors and about the speech stimulus level suited to the learner's language learning.
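The phoneme-level comparison mentioned for '사' ('ㅅ' plus 'ㅏ') can be illustrated with the standard Unicode arithmetic for precomposed Hangul syllables. The decomposition itself follows the Unicode layout; using it to compare a target syllable with a produced syllable is an illustrative assumption, and the function names are hypothetical.

```python
from typing import List, Tuple

# Standard Unicode ordering of jamo within precomposed Hangul syllables.
CHOSEONG = "ㄱㄲㄴㄷㄸㄹㅁㅂㅃㅅㅆㅇㅈㅉㅊㅋㅌㅍㅎ"
JUNGSEONG = "ㅏㅐㅑㅒㅓㅔㅕㅖㅗㅘㅙㅚㅛㅜㅝㅞㅟㅠㅡㅢㅣ"
JONGSEONG = [""] + list("ㄱㄲㄳㄴㄵㄶㄷㄹㄺㄻㄼㄽㄾㄿㅀㅁㅂㅄㅅㅆㅇㅈㅊㅋㅌㅍㅎ")


def decompose(syllable: str) -> Tuple[str, str, str]:
    """Split one Hangul syllable into (initial, medial, final) jamo."""
    index = ord(syllable) - 0xAC00
    if not 0 <= index < 11172:
        raise ValueError(f"not a precomposed Hangul syllable: {syllable!r}")
    initial, rest = divmod(index, 588)   # 588 = 21 medials * 28 finals
    medial, final = divmod(rest, 28)
    return CHOSEONG[initial], JUNGSEONG[medial], JONGSEONG[final]


def phoneme_differences(target: str, produced: str) -> List[Tuple[str, str, str]]:
    """List (slot, target jamo, produced jamo) for each slot that differs."""
    slots = ("initial", "medial", "final")
    return [(slot, t, p)
            for slot, t, p in zip(slots, decompose(target), decompose(produced))
            if t != p]
```

Comparing '사' with '다' reports a difference only in the initial consonant (ㅅ versus ㄷ), which corresponds to the /다탕/ substitution example; comparing '사' with '살' reports an added final ㄹ, as in /살탕/.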

The oral evaluation module compares and analyzes the differences between the language model and the learner in mouth shape, tongue movement, and breathing during speech. Mouth shape and tongue movement differ from one speech sound to another, as does breathing, that is, inhalation and exhalation, and breath control is also important for speaking fluently. Because pronunciation error characteristics and the causes of inaccurate pronunciation differ from learner to learner, they need to be analyzed, and learner-customized learning material reflecting the analysis results needs to be composed.

The learner's mouth shape and tongue movement are captured by a camera while the learner speaks, and only the mouth shape and tongue movement data obtained through preprocessing are used for analysis. Breathing is measured with UWB radar and rendered graphically. A camera, wireless earphones, a wireless microphone, and a breathing sensor may be used to analyze the learner's pronunciation.

Meanwhile, when the spontaneous speech test result falls short of the preset spontaneous speech evaluation criterion, the imitation speech test process (S300) provides an imitation speech test screen for testing imitation speech, in which the learner repeats the speech of the language model used in the speech training content, so that an imitation speech test is performed. That is, as shown in FIG. 8, the process has: an imitation speech presentation process (S310) of displaying a picture or text that can elicit a phoneme, syllable, word, or sentence of the imitation speech matching the learning target level, and presenting the speech of the language model; an imitation speech preprocessing process (S330) of receiving (S320) the voice of the learner uttering the imitation speech and removing noise; an imitation speech response evaluation process (S340) of performing a response evaluation on the noise-removed imitation speech; an imitation speech pronunciation evaluation process of, when the result of the response evaluation meets the imitation speech evaluation criterion (S340a), performing a pronunciation evaluation (S350) and providing a feedback result; and a voice conversion test transition process of, when the result of the response evaluation (S340) falls short of the imitation speech evaluation criterion (S340b), providing feedback of imitation speech test failure (S360) and proceeding to the voice conversion test (S400).

That is, the imitation speech test begins when a target stimulus containing the learning target pronunciation is presented. The target stimulus contains the learning target pronunciation, has content suited to the level (phoneme, syllable, word, sentence), and may consist of a picture and the language model, or of a picture, the language model, and text. When the learner's voice is input in response to the target stimulus, the preprocessing stage enhances the voice through noise removal and the like and extracts each type of data suitable for evaluation. The response evaluation analyzes the vocal response of the learner imitating the speech of the language model; if the response is correct, the pronunciation evaluation is performed and feedback is then presented. If the response is incorrect, feedback is presented and the process proceeds to the voice conversion test.

Meanwhile, when the imitation speech test result falls short of the preset imitation speech evaluation criterion, the voice conversion test process (S400) provides a voice conversion test screen for testing a voice conversion generated by reflecting the imitation speech test result, so that a voice conversion test is performed. That is, as shown in FIG. 9, the voice conversion test process (S400) has: generating a voice conversion at the learning target level by reflecting the imitation speech test result (S410) and presenting it (S420); a voice conversion preprocessing process (S440) of receiving (S430) the voice of the learner speaking along with the voice conversion and removing noise; a voice conversion pronunciation evaluation feedback providing process of performing a response evaluation (S450) on the noise-removed learner voice and, when it meets the preset voice conversion evaluation criterion (S450a), performing a pronunciation evaluation (S460) and providing a feedback result; and a voice conversion response evaluation feedback providing process of, when the result of the response evaluation of the learner voice following the voice conversion falls short of the voice conversion evaluation criterion (S450b), providing feedback of voice conversion test failure (S471).

The voice conversion test process (S400) repeats the voice conversion generation process, the voice conversion preprocessing process, the voice conversion pronunciation evaluation feedback providing process, and the voice conversion response evaluation feedback providing process until the learner's vocal response meets a preset criterion.

In detail, the voice conversion test reflects the analysis result of the imitation speech test and amplifies and modifies the voice of the language model through voice conversion. A modified target stimulus containing the modified language model is presented, and the learner's vocal response is input.

In the voice conversion preprocessing process, the learner's input voice is enhanced through operations such as noise removal, and each type of data suitable for evaluation is extracted. The response evaluation analyzes the vocal response of the learner following the speech of the modified language model; if the response is correct, the pronunciation evaluation is performed, feedback is presented, and the process proceeds to the imitation speech test. If the response is incorrect, feedback is presented and the analysis result is reflected in the voice conversion stage of the language model.

The voice conversion process is repeated automatically until the learner's vocal response becomes accurate, so that the learner's speech error characteristics are analyzed and reflected in the treatment plan. For example, if the learner says '다탕' for '사탕' (candy), the voice of the language model is converted so that the '사' sound is louder and the '탕' sound is quieter, and the converted voice is presented. If the learner then pronounces '사탕' in response to this stimulus, the learner's speech error is of the substitution type among articulation errors, and that amplification level can be adopted as the means of resolving the learner's error. Likewise, if the learner's speech becomes accurate when the voice of the language model is presented with the volume amplified by two levels and the speed slowed by one level, that learner's speech learning material is set at voice amplification level 2 and speed adjustment level 1.
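The repeated search for a working conversion setting can be sketched as below. Everything specific here is an assumption for illustration: the 3 dB step per level, the level ranges, the mildest-first search order, and the `responds_correctly` callback, which stands in for presenting the converted stimulus and evaluating the learner's response.

```python
from typing import Callable, List, Optional, Tuple


def syllable_gains_db(num_syllables: int, error_index: int,
                      level: int, step_db: float = 3.0) -> List[float]:
    """Gain per syllable: boost the erroneous syllable, attenuate the rest."""
    return [level * step_db if i == error_index else -level * step_db
            for i in range(num_syllables)]


def find_conversion_setting(responds_correctly: Callable[[int, int], bool],
                            max_amp: int = 3,
                            max_slow: int = 2) -> Optional[Tuple[int, int]]:
    """Try (amplification, slowdown) levels from mildest to strongest.

    Returns the first setting at which the learner responds correctly,
    or None if no setting in the range works.
    """
    candidates = sorted(
        ((amp, slow)
         for amp in range(max_amp + 1)
         for slow in range(max_slow + 1)
         if (amp, slow) != (0, 0)),          # (0, 0) is the unconverted voice
        key=lambda s: (s[0] + s[1], s[1]),
    )
    for amp, slow in candidates:
        if responds_correctly(amp, slow):
            return amp, slow
    return None
```

For a learner who only responds correctly with at least two levels of amplification and one level of slowdown, the search returns `(2, 1)`, matching the example in the text; `syllable_gains_db(2, 0, 2)` gives `[6.0, -6.0]` for making '사' louder and '탕' quieter under the assumed step size.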

Meanwhile, the speech learning process (S500) provides learner-customized learning material reflecting the results of the speech test, the imitation speech test, and the voice conversion test so that the learner's speech learning is performed. As shown in FIG. 11, the speech learning process (S500) has: a process (S520) of providing learner-customized learning material after the speech sound to be learned is selected (S510); a process (S530) of receiving learner information including the voice, mouth shape, tongue shape, and breathing of the learner studying the learner-customized learning material; a preprocessing process (S540) of removing noise from the learner information; a process (S550) of performing a learning evaluation of whether the noise-removed learner information meets a preset learning evaluation criterion; a process of, when the learning evaluation result falls short of the learning evaluation criterion (S550b), deriving a feedback result (S560), revising the learner-customized learning material (S570), and repeating the learner information input and learning evaluation; and a speech learning game process (S590) of, when the learning evaluation result meets the learning evaluation criterion (S550a), providing feedback (S580) and running a speech learning game for practice.
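The evaluate-and-revise cycle of S520 to S590 can be sketched as a simple loop. The `evaluate` and `revise` callbacks are hypothetical placeholders for the learning evaluation (S550) and the material revision (S560 to S570), and the `max_rounds` limit is a safety assumption that the specification does not mention.

```python
from typing import Callable, Tuple, TypeVar

M = TypeVar("M")  # whatever type represents the learning material


def run_learning_loop(material: M,
                      evaluate: Callable[[M], bool],
                      revise: Callable[[M], M],
                      max_rounds: int = 5) -> Tuple[str, M, int]:
    """Repeat evaluation and revision until the criterion is met.

    Returns ("game", material, round) when the evaluation passes (S550a),
    so the caller can start the speech learning game (S590), or
    ("stopped", material, max_rounds) when the round limit is reached.
    """
    for round_no in range(1, max_rounds + 1):
        if evaluate(material):           # S550a: criterion met
            return "game", material, round_no
        material = revise(material)      # S550b -> S560 -> S570
    return "stopped", material, max_rounds
```

As a usage example, with the material modeled as an integer amplification level, `run_learning_loop(0, lambda m: m >= 2, lambda m: m + 1)` passes on the third round with the level revised to 2.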

In detail, in the speech learning process the speech sound to be learned is selected; the learner may select it directly, or it may be selected automatically based on the preceding evaluation results. Once a speech sound is selected, learning material containing a customized language model that reflects the learner's pronunciation analysis results is presented. As shown in FIG. 12, the learning material may include the language model together with mouth shape, oral structure material, and graphs as learning aids. Depending on the learner, learning may proceed with the voice of the customized language model alone, or with graphics presented alongside it. When the learner speaks after the lesson is delivered, the learner's voice is recorded, the facial expression is video-recorded by a camera, and breathing is measured wirelessly to input the learner's information. The learner information includes the learner's voice, mouth shape and tongue movement, and breathing. The pronunciation evaluation is performed after the preprocessing stage. According to the analysis results, the learning material is revised by amplifying and modifying the voice of the language model and by changing the level of guidance points for mouth shape and tongue movement and the learning content to be emphasized in the graphics. Earphones and a microphone may be used together during learning.

When the learner succeeds with accuracy above a certain criterion in the speech evaluation stage of the speech learning phase, a speech learning game is presented for repetition and fluency practice. The speech learning game (S590) is a learner-customized game composed of speech sounds the learner has already learned or is currently learning, in which winning or losing is determined by the learner's speech responses. It includes various types of language games that respond to the learner's speech so that speech sounds can be practiced enjoyably. For example, as shown in FIG. 13, when the learning target word is 'candy', the game has the learner quickly say 'candy' whenever a candy picture appears among objects falling into a trash can, so as to collect as many as possible. A 3D character introduces how to play and provides feedback to raise motivation for achievement and mastery. A language model or a voice-converted language model may also be presented to improve pronunciation accuracy. The 3D character gives feedback while the learner plays, acting as both teacher and competitor. Rendered in three dimensions, the 3D characters give feedback such as praise and encouragement and also act as competitors that stimulate motivation, increasing the fun and practice effect of game-based learning. Character reactions that correspond to the learner's responses give the learner the effect of learning through interaction, encouraging enjoyable repetition until the speech sounds are mastered.
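The candy-collecting example can be sketched as a scoring function. The event format, the two-second response window, and the rule that each utterance collects at most one item are assumptions made for illustration; the specification only states that the learner should say the word quickly when the picture appears.

```python
from typing import List, Tuple


def count_collected(appearances: List[Tuple[float, str]],
                    utterances: List[Tuple[float, str]],
                    target: str,
                    window: float = 2.0) -> int:
    """Count target items the learner named within `window` seconds of appearing.

    `appearances` holds (time, item shown); `utterances` holds
    (time, recognized word). Times are in seconds.
    """
    remaining = sorted(t for t, word in utterances if word == target)
    collected = 0
    for shown_at, item in sorted(appearances):
        if item != target:
            continue
        for spoken_at in remaining:
            if shown_at <= spoken_at <= shown_at + window:
                remaining.remove(spoken_at)  # each utterance is used once
                collected += 1
                break
    return collected
```

In this sketch an utterance that comes before the picture appears, or too long after it, collects nothing, which rewards the quick response the game is meant to practice.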

The embodiments described above were selected and presented as the most preferred of various possible examples in order to aid the understanding of those skilled in the art. The technical idea of the invention is not necessarily limited or restricted to these embodiments, and various changes, modifications, and other equivalent embodiments are possible without departing from the technical idea of the invention.

S100: Speech training content selection process
S200: Spontaneous speech test process
S300: Imitation speech test process
S400: Voice conversion test process
S500: Speech learning process

Claims

A method of correcting speech in which a speech learning content app installed on a learner terminal provides speech training to a learner, the method comprising:
a speech training content selection process in which speech training content to be used for speech training is selected by the learner;
a spontaneous speech test process of providing a spontaneous speech test screen of the selected speech training content so that a spontaneous speech test is performed;
an imitation speech test process of, when the spontaneous speech test result falls short of a preset spontaneous speech evaluation criterion, providing an imitation speech test screen for testing imitation speech that follows the speech of the language model used in the speech training content, so that an imitation speech test is performed;
a voice conversion test process of, when the imitation speech test result falls short of a preset imitation speech evaluation criterion, providing a voice conversion test screen for testing imitation of a voice-converted language model generated by reflecting the imitation speech test result, so that a voice conversion test is performed; and
a speech learning process of providing learner-customized learning material reflecting the results of the spontaneous speech test, the imitation speech test, and the voice conversion test so that the learner's speech learning is performed,
wherein the speech training content selection process
selects the learning target level to be used in the speech training,
wherein the spontaneous speech test process comprises:
a spontaneous speech display process of displaying a picture or text that can elicit a phoneme, syllable, word, or sentence of spontaneous speech matching the learning target level;
a spontaneous speech preprocessing process of receiving the voice of the learner uttering spontaneous speech and removing noise;
a spontaneous speech response evaluation process of performing a response evaluation on the noise-removed spontaneous speech;
a spontaneous speech pronunciation evaluation process of performing a pronunciation evaluation and providing a feedback result when the result of the spontaneous speech response evaluation meets the spontaneous speech evaluation criterion; and
an imitation speech test transition process of providing feedback of spontaneous speech test failure and proceeding to the imitation speech test when the result of the spontaneous speech response evaluation falls short of the spontaneous speech evaluation criterion,
wherein the imitation speech test process comprises:
an imitation speech presentation process of displaying a picture or text that can elicit a phoneme, syllable, word, or sentence of imitation speech matching the learning target level and presenting the speech of the language model to be followed;
an imitation speech preprocessing process of receiving the voice of the learner uttering imitation speech and removing noise;
an imitation speech response evaluation process of performing a response evaluation on the noise-removed imitation speech;
an imitation speech pronunciation evaluation process of performing a pronunciation evaluation and providing a feedback result when the result of the imitation speech response evaluation meets the imitation speech evaluation criterion; and
a voice conversion test transition process of providing feedback of imitation speech test failure and proceeding to the voice conversion test when the result of the imitation speech response evaluation falls short of the imitation speech evaluation criterion,
wherein the voice conversion test process comprises:
a voice conversion generation process of generating and presenting a voice conversion at the learning target level by reflecting the imitation speech test result;
a voice conversion presentation process of presenting the speech of the voice-converted language model;
a voice conversion preprocessing process of receiving the voice of the learner speaking along with the voice conversion and removing noise;
a voice conversion pronunciation evaluation feedback providing process of performing a response evaluation on the noise-removed learner voice and, when it meets a preset voice conversion evaluation criterion, performing a pronunciation evaluation and providing a feedback result; and
a voice conversion response evaluation feedback providing process of providing feedback of voice conversion test failure when the result of the response evaluation of the learner voice following the voice conversion falls short of the voice conversion evaluation criterion,
wherein the voice conversion response evaluation feedback providing process
analyzes the characteristics of the learner's vocal response to the voice-converted language model, converts the voice so that the syllable in which the learner's speech has an error is amplified to be louder and the syllables without error are made quieter, presents the converted voice to the learner, and, when the learner pronounces the presented language model correctly, determines the amplification level of that voice conversion as the means of correcting the error.

delete

The method according to claim 1,
wherein the voice conversion generation process, the voice conversion preprocessing process, the voice conversion pronunciation evaluation feedback providing process, and the voice conversion response evaluation feedback providing process are repeated until the learner's vocal response meets a preset criterion.

The method according to claim 1,
wherein the response evaluation performs a verbal response evaluation and a non-verbal response evaluation, the non-verbal response evaluation evaluating response speed, screen transition, and gaze, and
wherein the pronunciation evaluation is based on a language model at the phoneme level, syllable level, word level, and sentence level, and evaluates the learning target criterion and the learner's learning level.

The method according to claim 1, wherein the speech learning process comprises:
a process of providing the learner-customized learning material;
a process of receiving learner information including the voice, mouth shape, tongue shape, and breathing of the learner studying the learner-customized learning material;
a process of removing noise from the learner information;
a process of performing a learning evaluation of whether the noise-removed learner information meets a preset learning evaluation criterion;
a process of revising the learner-customized learning material and repeating the learner information input and the learning evaluation when the learning evaluation result falls short of the learning evaluation criterion; and
a speech learning game process of running a speech learning game for practice when the learning evaluation result meets the learning evaluation criterion.

The method according to claim 8,
wherein the speech learning game is composed of speech sounds the learner has already learned or is currently learning, and winning or losing the game is determined by the learner's speech response.