KR20230000175A

KR20230000175A - Method for evaluating pronunciation based on AI, method for providing study content for coaching pronunciation, and computing system performing the same

Info

Publication number: KR20230000175A
Application number: KR1020210082255A
Authority: KR
Inventors: 정성은
Original assignee: (주)위버스마인드
Priority date: 2021-06-24
Filing date: 2021-06-24
Publication date: 2023-01-02

Abstract

Disclosed are a method of collecting voice data which a learner pronounces on a word-by-word basis to determine the pronunciation accuracy of each phoneme, a method of providing learning content to allow learners to practice learners' weak pronunciation using the same, and a computing system performing the same. According to one aspect of the present invention, a method of evaluating phoneme-level pronunciation accuracy to calculate a phoneme-level score for each phoneme constituting the sound system of a given language, comprises the steps in which: an evaluation system acquires voice data of each of a plurality of words to be evaluated spoken by a person being evaluated; the evaluation system calculates, for each of the plurality of words to be evaluated, a word-level score corresponding to the word to be evaluated, based on the voice data of the word to be evaluated; and the evaluation system determines, for each of phonemes to be evaluated, at least part of the total phonemes constituting the sound system of the language, a phoneme-level score corresponding to the phoneme to be evaluated. In the step of determining the phoneme-level score corresponding to the phoneme to be evaluated includes a step of determining a phoneme-level score corresponding to the phoneme to be evaluated based on the word-level score of each word including the phoneme to be evaluated among the plurality of words to be evaluated spoken by the person being evaluated.

Description

Method for evaluating pronunciation based on AI, method for providing study content for coaching pronunciation, and computing system performing the same}

본 발명은 음소 단위 발음 정확도 평가 방법, 발음 코칭용 학습 컨텐츠 제공 방법 및 이를 수행하기 위한 컴퓨팅 시스템에 관한 것이다. 보다 상세하게는 학습자가 단어 단위로 발음하는 음성 데이터를 수집하여 이를 통해 음소 단위의 발음 정확도를 판단하는 방법 및 이를 이용하여 학습자의 취약한 발음을 연습할 수 있도록 하는 학습용 컨텐츠를 제공하는 방법 및 이를 수행하는 컴퓨팅 시스템에 관한 것이다.The present invention relates to a method for evaluating pronunciation accuracy in units of phonemes, a method for providing learning contents for pronunciation coaching, and a computing system for performing the same. More specifically, a method for determining pronunciation accuracy in phoneme units by collecting voice data that learners pronounce in word units, and a method for providing learning contents that enable learners to practice weak pronunciation using the method and performing the same It is about a computing system that

일반적으로 현대인들은 산업의 전문화와 국제화의 추세에 따라 외국어에 대한 관심이 많아지는 경향을 보이는데, 이러한 추세에 대응하기 위해 컴퓨터, 온라인 등을 활용한 외국어 학습 수단이 다양하게 출시되고 있다. 특히 비원어민이 발음한 외국어의 정확도를 평가하는 방법도 다양하게 개발 중에 있으며, 학습자가 발음하면 발음의 정확도를 알려주는 소프트웨어도 시판되고 있다. 외국어 발음 평가 방법의 경우, 일례로 음성신호 처리 기술을 이용한 발음 비교 방법이 있으며, 대표적으로 은닉 마코브 모델(Hidden Markov Model)을 이용하여 비원어민의 발음 음성에 대한 인식을 한 후, 원어민의 음성과 비교하는 알고리즘을 들 수 있다.In general, modern people tend to show an increased interest in foreign languages according to the trend of industrial specialization and internationalization. In particular, various methods for evaluating the accuracy of foreign language pronunciation by non-native speakers are being developed, and software that informs the accuracy of pronunciation when a learner pronounces it is also commercially available. In the case of a foreign language pronunciation evaluation method, there is, for example, a pronunciation comparison method using voice signal processing technology. Representatively, a hidden Markov model is used to recognize the pronunciation voice of a non-native speaker, and then the native speaker's voice and an algorithm that compares .

한편, 외국어의 발음을 평가함에 있어서, 단어 수준에서 발음의 정확도를 평가하는 것은 상대적으로 용이하며, 다양한 기술이 제안되고 있다. 이러한 기술의 예로 등록특허 10-2052031와 같은 것을 들 수 있다.Meanwhile, in evaluating the pronunciation of a foreign language, it is relatively easy to evaluate the accuracy of pronunciation at the word level, and various techniques have been proposed. An example of such technology may be mentioned such as Korean Patent Registration No. 10-2052031.

그런데 외국어의 발음을 학습하는데 있어서 학습자가 정확히 발음하지 못하는 음소를 판단하고, 그러한 음소를 중심으로 발음 연습을 행하도록 하는 것이 매우 효과적인 방법임에도 불구하고 음소 수준에서 발음의 정확도를 파악하는 것은 상당히 어려운 실정이다.However, in learning the pronunciation of a foreign language, it is quite difficult to determine the accuracy of pronunciation at the phoneme level, although it is a very effective method to determine the phonemes that learners cannot pronounce accurately and practice pronunciation with a focus on those phonemes. to be.

등록특허 10-2052031Registered Patent No. 10-2052031

본 발명이 이루고자 하는 기술적인 과제는 음소 단위 발음 정확도 평가 방법, 발음 코칭용 학습 컨텐츠 제공 방법 및 이를 수행하기 위한 컴퓨팅 시스템을 제공하는 것이다.A technical problem to be achieved by the present invention is to provide a method for evaluating pronunciation accuracy in units of phonemes, a method for providing learning contents for pronunciation coaching, and a computing system for performing the same.

본 발명의 일 측면에 따르면, 소정의 언어의 소리 체계를 구성하는 각 음소 별 음소 레벨 점수를 산출하기 위한 음소 단위 발음 정확도 평가 방법으로서, 평가 시스템이, 피평가자(appraise)가 발화한 복수의 평가 대상 단어 각각의 음성 데이터를 획득하는 단계; 상기 평가 시스템이, 상기 복수의 평가 대상 단어 각각에 대하여, 상기 평가 대상 단어의 음성 데이터에 기초하여, 상기 평가 대상 단어에 상응하는 단어 레벨 점수를 산출하는 단계; 및 상기 평가 시스템이, 상기 언어의 소리 체계를 구성하는 전체 음소 중에서 적어도 일부인 복수의 평가 대상 음소 각각에 대하여, 상기 평가 대상 음소에 상응하는 음소 레벨 점수를 결정하는 단계를 포함하되, 상기 평가 대상 음소에 상응하는 음소 레벨 점수를 결정하는 단계는, 상기 피평가자가 발화한 상기 복수의 평가 대상 단어 중에서 상기 평가 대상 음소를 포함하는 단어 각각의 단어 레벨 점수에 기초하여, 상기 평가 대상 음소에 상응하는 음소 레벨 점수를 결정하는 단계를 포함하는 음소 단위 발음 정확도 평가 방법이 제공된다.According to one aspect of the present invention, as a phoneme-unit pronunciation accuracy evaluation method for calculating phoneme level scores for each phoneme constituting the sound system of a predetermined language, the evaluation system comprises a plurality of evaluation targets uttered by an appraiser. acquiring voice data of each word; calculating, by the evaluation system, a word level score corresponding to the evaluation target word, based on voice data of the evaluation target word, for each of the plurality of evaluation target words; and determining, by the evaluation system, a phoneme level score corresponding to the phoneme to be evaluated for each of a plurality of phonemes to be evaluated, which are at least a part of all phonemes constituting the sound system of the language. In the step of determining a phoneme level score corresponding to the phoneme level corresponding to the phoneme to be evaluated, based on a word level score of each word including the phoneme to be evaluated among the plurality of words to be evaluated uttered by the subject, the phoneme level corresponding to the phoneme to be evaluated. A method for evaluating pronunciation accuracy in units of phonemes, including determining a score, is provided.

일 실시예에서, 상기 피평가자가 발화한 상기 복수의 평가 대상 단어 중에서 상기 평가 대상 음소를 포함하는 단어 각각의 단어 레벨 점수에 기초하여, 상기 평가 대상 음소에 상응하는 음소 레벨 점수를 결정하는 단계는, 상기 복수의 평가 대상 단어 중에서 상기 평가 대상 음소를 포함하는 단어 각각의 단어 레벨 점수의 평균 값을 산출하는 단계; 및 상기 복수의 평가 대상 단어 중에서 상기 평가 대상 음소를 포함하는 단어 각각의 단어 레벨 점수의 평균 값에 기초하여, 상기 평가 대상 음소에 상응하는 음소 레벨 점수를 결정하는 단계를 포함할 수 있다.In an embodiment, the step of determining a phoneme level score corresponding to the phoneme to be evaluated based on a word level score of each word including the phoneme to be evaluated among the plurality of words to be evaluated uttered by the person to be evaluated, calculating an average value of word level scores of each word including the phoneme to be evaluated among the plurality of words to be evaluated; and determining a phoneme level score corresponding to the phoneme to be evaluated based on an average value of word level scores of each word including the phoneme to be evaluated among the plurality of words to be evaluated.

일 실시예에서, 상기 평가 대상 단어에 상응하는 단어 레벨 점수를 산출하는 단계는, 상기 피평가자가 발화한 상기 평가 대상 단어의 발음의 정확도를 나타내는 발음 점수를 산출하는 단계; 상기 피평가자가 발화한 상기 평가 대상 단어의 억양의 정확도를 나타내는 억양 점수를 산출하는 단계; 상기 피평가자가 발화한 상기 평가 대상 단어의 볼륨의 정확도를 나타내는 볼륨 점수를 산출하는 단계; 상기 피평가자가 발화한 상기 평가 대상 단어의 길이의 장단의 정확도를 나타내는 길이 점수를 산출하는 단계; 및 상기 피평가자가 발화한 상기 평가 대상 단어의 강세의 정확도를 나타내는 강세 점수를 산출하는 단계 중 적어도 일부를 포함할 수 있다.In an embodiment, the calculating of the word level score corresponding to the evaluation target word may include calculating a pronunciation score indicating accuracy of pronunciation of the evaluation target word uttered by the evaluation target; calculating an intonation score indicating accuracy of intonation of the word to be evaluated uttered by the subject of evaluation; calculating a volume score representing accuracy of the volume of the evaluation target word uttered by the subject; Calculating a length score representing the accuracy of the long and short length of the word to be evaluated uttered by the subject of evaluation; and calculating a stress score representing the accuracy of the stress of the word to be evaluated uttered by the person to be evaluated.

일 실시예에서, 상기 피평가자가 발화한 복수의 평가 대상 단어 각각의 음성 데이터를 획득하는 단계는, 상기 피평가자가 발화한 복수의 평가 대상 문장의 음성 데이터를 획득하는 단계; 및 상기 복수의 평가 대상 문장 각각에 대하여, 상기 평가 대상 문장의 음성 데이터를 단어 별로 분할하는 단계를 포함하되, 상기 복수의 평가 대상 단어는, 상기 복수의 평가 대상 문장에 포함된 각 단어를 포함할 수 있다.In an embodiment, the obtaining of audio data of each of a plurality of evaluation target words uttered by the evaluation subject may include acquiring audio data of a plurality of evaluation target sentences uttered by the evaluation subject; and dividing the audio data of the evaluation target sentence into words for each of the plurality of evaluation target sentences, wherein the plurality of evaluation target words include each word included in the plurality of evaluation target sentences. can

본 발명의 다른 일 측면에 따르면, 소정의 언어의 소리 체계를 구성하는 각 음소 별 음소 레벨 점수를 산출하기 위한 음소 단위 발음 정확도 평가 방법으로서, 평가 시스템이, 피평가자(appraise)가 발화한 복수의 평가 대상 단어 각각의 음성 데이터를 획득하는 단계; 및 상기 복수의 평가 대상 단어 각각의 음성 데이터가 획득될 때마다, 상기 평가 시스템이, 상기 평가 대상 단어의 음성 데이터에 기초하여 상기 피평가자의 음소 레벨 점수를 갱신하는 단계를 포함하되, 상기 평가 대상 단어의 음성 데이터에 기초하여 상기 피평가자의 음소 레벨 점수를 갱신하는 단계는, 상기 평가 대상 단어의 음성 데이터에 기초하여, 상기 평가 대상 단어에 상응하는 단어 레벨 점수를 산출하는 단계; 상기 평가 대상 단어를 구성하는 적어도 하나의 평가 대상 음소를 판단하는 단계; 및 상기 평가 대상 음소 각각에 대하여, 기 산출된 상기 피평가자의 상기 평가 대상 음소에 상응하는 음소 레벨 점수에 상기 평가 대상 단어에 상응하는 단어 레벨 점수를 반영하여, 상기 피평가자의 상기 평가 대상 음소에 상응하는 음소 레벨 점수를 갱신하는 단계를 포함하는 음소 단위 발음 정확도 평가 방법이 제공된다.According to another aspect of the present invention, as a phoneme-unit pronunciation accuracy evaluation method for calculating a phoneme level score for each phoneme constituting the sound system of a predetermined language, the evaluation system comprises a plurality of evaluations uttered by an appraiser. obtaining voice data of each target word; and updating, by the evaluation system, a phoneme level score of the subject of evaluation based on the speech data of the word to be evaluated whenever voice data of each of the plurality of words to be evaluated is obtained. The step of updating the phoneme level score of the subject of evaluation based on the voice data of the evaluation target may include: calculating a word level score corresponding to the word to be evaluated based on the voice data of the word to be evaluated; determining at least one evaluation target phoneme constituting the evaluation target word; and for each of the phonemes to be evaluated, a word level score corresponding to the word to be evaluated is reflected in the pre-calculated phoneme level score corresponding to the phoneme to be evaluated of the subject to be evaluated. A method for evaluating phoneme-by-phoneme pronunciation accuracy, which includes updating a phoneme level score, is provided.

본 발명의 다른 일 측면에 따르면, 학습 시스템이, 상술한 음소 단위 발음 정확도 평가 방법에 의해 구축되는 각 음소 별 음소 레벨 점수에 기초하여 학습 대상 음소를 추출하는 단계; 상기 학습 시스템이, 상기 학습 대상 음소를 포함하는 단어를 상기 학습용 단어로 선정하는 단계; 및 상기 학습 시스템이, 상기 학습용 단어 또는 상기 학습용 단어가 포함된 학습용 컨텐츠를 학습자 단말로 제공하는 단계를 포함하는 학습용 컨텐츠 제공 방법이 제공된다.According to another aspect of the present invention, the learning system extracts a phoneme to be learned based on the phoneme level score for each phoneme established by the above-described method for evaluating pronunciation accuracy in units of phonemes; selecting, by the learning system, a word including the phoneme to be learned as the learning word; and providing, by the learning system, the words for learning or the contents for learning including the words for learning to a learner terminal.

일 실시예에서, 상기 학습용 컨텐츠 제공 방법은, 상기 학습 시스템이, 상기 학습 대상 음소와 대치되는 대치 음소를 포함하는 단어를 추가 학습용 단어로 선정하는 단계; 및 상기 학습 시스템이, 상기 추가 학습용 단어 또는 상기 추가 학습용 단어가 포함된 추가 학습용 컨텐츠를 상기 학습자 단말로 제공하는 단계를 더 포함할 수 있다.In an embodiment, the learning content providing method may include selecting, by the learning system, a word including a replacement phoneme that is replaced with the learning target phoneme as a word for additional learning; and providing, by the learning system, the additional learning words or additional learning contents including the additional learning words to the learner terminal.

본 발명의 다른 일 측면에 따르면, 데이터 처리장치에 설치되며 상술한 방법을 수행하기 위한 매체에 기록된 컴퓨터 프로그램이 제공된다.According to another aspect of the present invention, a computer program installed in a data processing device and recorded on a medium for performing the above method is provided.

본 발명의 다른 일 측면에 따르면, 상술한 방법을 수행하기 위한 컴퓨터 프로그램이 기록된 컴퓨터 판독 가능한 기록매체가 제공된다.According to another aspect of the present invention, a computer readable recording medium on which a computer program for performing the above method is recorded is provided.

본 발명의 다른 일 측면에 따르면, 컴퓨팅 시스템으로서, 프로세서; 및 컴퓨터 프로그램을 저장하는 메모리를 포함하고, 상기 컴퓨터 프로그램은, 상기 프로세서에 의해 실행되는 경우, 상기 컴퓨팅 시스템으로 하여금 상술한 방법을 수행하도록 하는 컴퓨팅 시스템.According to another aspect of the present invention, a computing system comprising: a processor; and a memory storing a computer program, wherein the computer program, when executed by the processor, causes the computing system to perform the above-described method.

본 발명의 다른 일 측면에 따르면, 소정의 언어의 소리 체계를 구성하는 각 음소 별 음소 레벨 점수를 산출하기 위한 음소 단위 발음 정확도 평가 시스템으로서, 피평가자(appraise)가 발화한 복수의 평가 대상 단어 각각의 음성 데이터를 획득하는 획득모듈; 상기 복수의 평가 대상 단어 각각에 대하여, 상기 평가 대상 단어의 음성 데이터에 기초하여 상기 평가 대상 단어에 상응하는 단어 레벨 점수를 산출하는 단어 레벨 평가모듈; 및 상기 언어의 소리 체계를 구성하는 전체 음소 중에서 적어도 일부인 복수의 평가 대상 음소 각각에 대하여, 상기 평가 대상 음소에 상응하는 음소 레벨 점수를 결정하는 음소 레벨 평가모듈을 포함하되, 상기 음소 레벨 평가모듈은, 상기 평가 대상 음소에 상응하는 음소 레벨 점수를 결정하기 위하여, 상기 피평가자가 발화한 상기 복수의 평가 대상 단어 중에서 상기 평가 대상 음소를 포함하는 단어 각각의 단어 레벨 점수에 기초하여, 상기 평가 대상 음소에 상응하는 음소 레벨 점수를 결정하는 음소 단위 발음 정확도 평가 시스템이 제공된다.According to another aspect of the present invention, a phoneme-unit pronunciation accuracy evaluation system for calculating phoneme level scores for each phoneme constituting the sound system of a predetermined language, wherein each of a plurality of evaluation target words uttered by an appraiser an acquisition module for obtaining voice data; a word level evaluation module for calculating a word level score corresponding to the evaluation target word based on voice data of the evaluation target word for each of the plurality of evaluation target words; and a phoneme level evaluation module for determining a phoneme level score corresponding to the phoneme to be evaluated for each of a plurality of phonemes to be evaluated, which are at least a part of all phonemes constituting the sound system of the language, wherein the phoneme level evaluation module comprises: , In order to determine a phoneme level score corresponding to the phoneme to be evaluated, based on the word level score of each word including the phoneme to be evaluated among the plurality of words to be evaluated uttered by the subject, A phoneme-by-phoneme pronunciation accuracy evaluation system is provided that determines a corresponding phoneme level score.

일 실시예에서, 상기 음소 레벨 평가모듈은, 상기 피평가자가 발화한 상기 복수의 평가 대상 단어 중에서 상기 평가 대상 음소를 포함하는 단어 각각의 단어 레벨 점수의 평균 값을 산출하고, 상기 복수의 평가 대상 단어 중에서 상기 평가 대상 음소를 포함하는 단어 각각의 단어 레벨 점수의 평균 값에 기초하여, 상기 평가 대상 음소에 상응하는 음소 레벨 점수를 결정할 수 있다.In one embodiment, the phoneme level evaluation module calculates an average value of word level scores of each of the words including the phoneme to be evaluated among the plurality of words to be evaluated, uttered by the subject, and the plurality of words to be evaluated. A phoneme level score corresponding to the phoneme to be evaluated may be determined based on an average value of word level scores of each word including the phoneme to be evaluated.

일 실시예에서, 상기 단어 레벨 평가모듈은, 상기 피평가자가 발화한 상기 평가 대상 단어의 발음의 정확도를 나타내는 발음 점수, 상기 피평가자가 발화한 상기 평가 대상 단어의 억양의 정확도를 나타내는 억양 점수, 상기 피평가자가 발화한 상기 평가 대상 단어의 볼륨의 정확도를 나타내는 볼륨 점수, 상기 피평가자가 발화한 상기 평가 대상 단어의 길이의 장단의 정확도를 나타내는 길이 점수, 상기 피평가자가 발화한 상기 평가 대상 단어의 강세의 정확도를 나타내는 강세 점수 중 적어도 일부를 산출할 수 있다.In an embodiment, the word level evaluation module may include a pronunciation score indicating accuracy of pronunciation of the word to be evaluated uttered by the subject, an intonation score indicating accuracy of intonation of the word to be evaluated uttered by the subject, and the subject to be evaluated. A volume score indicating the accuracy of the volume of the word to be evaluated uttered, a length score indicating the accuracy of the length of the word to be evaluated uttered by the subject, and an accuracy of stress of the word to be evaluated uttered by the subject At least some of the indicated stress scores may be calculated.

일 실시에에서, 상기 획득모듈은, 상기 피평가자가 발화한 복수의 평가 대상 문장의 음성 데이터를 획득하고, 상기 복수의 평가 대상 문장 각각에 대하여, 상기 평가 대상 문장의 음성 데이터를 단어 별로 분할하며, 상기 복수의 평가 대상 단어는, 상기 복수의 평가 대상 문장에 포함된 각 단어를 포함할 수 있다.In one embodiment, the acquisition module acquires audio data of a plurality of evaluation target sentences uttered by the person being evaluated, and divides the audio data of the evaluation target sentence by word for each of the plurality of evaluation target sentences; The plurality of evaluation target words may include each word included in the plurality of evaluation target sentences.

본 발명의 다른 일 측면에 따르면, 소정의 언어의 소리 체계를 구성하는 각 음소 별 음소 레벨 점수를 산출하기 위한 음소 단위 발음 정확도 평가 시스템으로서, 피평가자(appraise)가 발화한 복수의 평가 대상 단어 각각의 음성 데이터를 획득하는 획득모듈; 및 상기 복수의 평가 대상 단어 각각의 음성 데이터가 획득될 때마다, 상기 평가 시스템이, 상기 평가 대상 단어의 음성 데이터에 기초하여 상기 피평가자의 음소 레벨 점수를 갱신하는 평가모듈을 포함하되, 상기 평가모듈은, 상기 평가 대상 단어의 음성 데이터에 기초하여, 상기 평가 대상 단어에 상응하는 단어 레벨 점수를 산출하는 단어 레벨 평가모듈; 상기 평가 대상 단어를 구성하는 적어도 하나의 평가 대상 음소를 판단하는 음소 판단모듈; 및 상기 평가 대상 음소 각각에 대하여, 기 산출된 상기 피평가자의 상기 평가 대상 음소에 상응하는 음소 레벨 점수에 상기 평가 대상 단어에 상응하는 단어 레벨 점수를 반영하여, 상기 피평가자의 상기 평가 대상 음소에 상응하는 음소 레벨 점수를 갱신하는 음소 레벨 평가모듈을 포함하는 음소 단위 발음 정확도 평가 시스템이 제공된다.According to another aspect of the present invention, a phoneme-unit pronunciation accuracy evaluation system for calculating phoneme level scores for each phoneme constituting the sound system of a predetermined language, wherein each of a plurality of evaluation target words uttered by an appraiser an acquisition module for obtaining voice data; and an evaluation module configured to update a phoneme level score of the subject of evaluation based on the speech data of the evaluation target word whenever voice data of each of the plurality of evaluation target words is acquired. A word level evaluation module for calculating a word level score corresponding to the evaluation target word based on the voice data of the evaluation target word; a phoneme determination module for determining at least one evaluation target phoneme constituting the evaluation target word; and for each of the phonemes to be evaluated, a word level score corresponding to the word to be evaluated is reflected in the pre-calculated phoneme level score corresponding to the phoneme to be evaluated of the subject to be evaluated. A phoneme unit pronunciation accuracy evaluation system including a phoneme level evaluation module for updating a phoneme level score is provided.

본 발명의 다른 일 측면에 따르면, 상기 음소 단위 발음 정확도 평가 방법에 의해 구축되는 각 음소 별 음소 레벨 점수에 기초하여 학습 대상 음소를 추출하는 판단모듈; 상기 학습 대상 음소를 포함하는 단어를 상기 학습용 단어로 선정하는 선정모듈; 및 상기 학습용 단어 또는 상기 학습용 단어가 포함된 학습용 컨텐츠를 학습자 단말로 제공하는 제공모듈을 포함하는 학습용 컨텐츠 제공 시스템이 제공된다.According to another aspect of the present invention, a determination module for extracting a phoneme to be learned based on a phoneme level score for each phoneme established by the phoneme-by-phoneme pronunciation accuracy evaluation method; a selection module for selecting a word including the phoneme to be learned as the learning word; and a provision module for providing the learning words or the learning contents including the learning words to a learner terminal.

일 실시예에서, 상기 선정모듈은, 상기 학습 대상 음소와 대치되는 대치 음소를 포함하는 단어를 추가 학습용 단어로 더 선정하고, 상기 제공모듈은, 상기 추가 학습용 단어 또는 상기 추가 학습용 단어가 포함된 추가 학습용 컨텐츠를 상기 학습자 단말로 더 제공할 수 있다.In an embodiment, the selection module further selects a word including a replacement phoneme that is replaced with the target phoneme as an additional learning word, and the providing module further selects the additional learning word or the additional learning word including the additional learning word. Content for learning may be further provided to the learner terminal.

본 발명의 기술적 사상에 의하면, 학습자가 단어 단위로 발음하는 음성 데이터를 수집하여 이를 통해 음소 단위의 발음 정확도를 판단할 수 있다. 즉, 비교적 판단이 용이한 단어 수준의 평가 점수를 활용하여 음소 단위의 점수를 나타내는 점수를 산출함으로써, 피평가자의 음소 단위의 발음 정확도를 손쉽게 파악할 수 있는 효과가 있다.According to the technical idea of the present invention, it is possible to determine pronunciation accuracy in phoneme units by collecting voice data that a learner pronounces in word units. That is, by calculating a score representing a score of a phoneme unit using a word-level evaluation score that is relatively easy to judge, there is an effect of easily grasping the pronunciation accuracy of the subject to be evaluated.

또한 이를 이용하여 학습자의 취약한 발음을 연습할 수 있도록 하는 학습용 컨텐츠를 제공할 수 있다.In addition, it is possible to provide learning contents that enable learners to practice weak pronunciation by using this.

본 발명의 상세한 설명에서 인용되는 도면을 보다 충분히 이해하기 위하여 각 도면의 간단한 설명이 제공된다.
도 1은 본 발명의 기술적 사상에 따른 음소 단위 발음 정확도 평가 방법 및 발음 코칭용 학습 컨텐츠 제공 방법이 수행되는 환경을 개략적으로 도시한 도면이다.
도 2는 본 발명의 일 실시예에 따른 음소 단위 발음 정확도 평가 방법의 일 예를 도시한 흐름도이다.
도 3은 도 2의 S110 단계의 구체적인 예를 도시한 흐름도이다.
도 4a는 8개의 평가 대상 단어 W₁ 내지 W₈ 각각의 단어 레벨 점수 및 해당 단어를 구성하는 음소를 도시한 도표이다.
도 4b는 음소 P₁ 내지 P₅ 각각의 음소 레벨 점수를 산출하기 위한 식의 예를 나타낸 도표이다.
도 4c는 음소 P₁ 내지 P₅ 각각의 음소 레벨 점수를 산출하기 위한 다른 식의 예를 나타낸 도표이다.
도 5는 본 발명의 다른 일 실시예에 따른 음소 단위 발음 정확도 평가 방법의 일 예를 도시한 흐름도이다.
도 6은 도 5의 방법에 의해 갱신되는 평가 대상 음소 P₁ 내지 P₅의 음소 레벨 점수의 변화를 나타낸 도표이다.
도 7은 본 발명의 일 실시예에 따른 발음 코칭용 학습 컨텐츠 제공 방법의 일 예를 도시한 흐름도이다.
도 8은 본 발명의 일 실시예에 따른 평가 시스템의 개략적인 구성을 도시한 블록도이며, 도 9는 본 발명의 일 실시예에 따른 학습 시스템의 개략적인 구성을 도시한 블록도이다.In order to more fully understand the drawings cited in the detailed description of the present invention, a brief description of each drawing is provided.
1 is a diagram schematically illustrating an environment in which a method for evaluating pronunciation accuracy in phoneme units and a method for providing learning contents for pronunciation coaching according to the technical idea of the present invention are performed.
2 is a flowchart illustrating an example of a method for evaluating pronunciation accuracy in units of phonemes according to an embodiment of the present invention.
3 is a flowchart illustrating a specific example of step S110 of FIG. 2 .
FIG. 4A is a diagram showing word level scores of each of the eight evaluation target words W ₁ to W ₈ and phonemes constituting the corresponding words.
4B is a chart showing examples of equations for calculating phoneme level scores for each of the phonemes P ₁ to P ₅ .
4C is a chart showing an example of another equation for calculating phoneme level scores for each of the phonemes P ₁ to P ₅ .
5 is a flowchart illustrating an example of a method for evaluating pronunciation accuracy in units of phonemes according to another embodiment of the present invention.
FIG. 6 is a chart showing changes in phoneme level scores of evaluation target phones P ₁ to P ₅ updated by the method of FIG. 5 .
7 is a flowchart illustrating an example of a method of providing learning contents for pronunciation coaching according to an embodiment of the present invention.
8 is a block diagram showing a schematic configuration of an evaluation system according to an embodiment of the present invention, and FIG. 9 is a block diagram showing a schematic configuration of a learning system according to an embodiment of the present invention.

본 발명은 다양한 변환을 가할 수 있고 여러 가지 실시예를 가질 수 있는 바, 특정 실시예들을 도면에 예시하고 상세한 설명에 상세하게 설명하고자 한다. 그러나, 이는 본 발명을 특정한 실시 형태에 대해 한정하려는 것이 아니며, 본 발명의 사상 및 기술 범위에 포함되는 모든 변환, 균등물 내지 대체물을 포함하는 것으로 이해되어야 한다. 본 발명을 설명함에 있어서 관련된 공지 기술에 대한 구체적인 설명이 본 발명의 요지를 흐릴 수 있다고 판단되는 경우 그 상세한 설명을 생략한다.Since the present invention can apply various transformations and have various embodiments, specific embodiments will be illustrated in the drawings and described in detail in the detailed description. However, it should be understood that this is not intended to limit the present invention to specific embodiments, and includes all transformations, equivalents, and substitutes included in the spirit and scope of the present invention. In describing the present invention, if it is determined that a detailed description of related known technologies may obscure the gist of the present invention, the detailed description will be omitted.

제1, 제2 등의 용어는 다양한 구성요소들을 설명하는데 사용될 수 있지만, 상기 구성요소들은 상기 용어들에 의해 한정되어서는 안 된다. 제1, 제2 등의 용어는 특별한 순서를 나타내는 것이 아니며, 하나의 구성요소를 다른 구성요소로부터 구별하는 목적으로만 사용된다.Terms such as first and second may be used to describe various components, but the components should not be limited by the terms. Terms such as first and second do not indicate a particular order, and are used only for the purpose of distinguishing one element from another.

본 출원에서 사용한 용어는 단지 특정한 실시예를 설명하기 위해 사용된 것으로, 본 발명을 한정하려는 의도가 아니다. 단수의 표현은 문맥상 명백하게 다르게 뜻하지 않는 한, 복수의 표현을 포함한다. Terms used in this application are only used to describe specific embodiments, and are not intended to limit the present invention. Singular expressions include plural expressions unless the context clearly dictates otherwise.

본 명세서에 있어서, "포함하다" 또는 "가지다" 등의 용어는 명세서상에 기재된 특징, 숫자, 단계, 동작, 구성요소, 부품 또는 이들을 조합한 것이 존재함을 지정하려는 것이지, 하나 또는 그 이상의 다른 특징들이나 숫자, 단계, 동작, 구성요소, 부품 또는 이들을 조합한 것들의 존재 또는 부가 가능성을 미리 배제하지 않는 것으로 이해되어야 한다.In this specification, terms such as "include" or "have" are intended to designate that there is a feature, number, step, operation, component, part, or combination thereof described in the specification, but one or more other It should be understood that the presence or addition of features, numbers, steps, operations, components, parts, or combinations thereof is not precluded.

또한, 본 명세서에 있어서는 어느 하나의 구성요소가 다른 구성요소로 데이터를 '전송'하는 경우에는 상기 구성요소는 상기 다른 구성요소로 직접 상기 데이터를 전송할 수도 있고, 적어도 하나의 또 다른 구성요소를 통하여 상기 데이터를 상기 다른 구성요소로 전송할 수도 있는 것을 의미한다. 반대로 어느 하나의 구성요소가 다른 구성요소로 데이터를 '직접 전송'하는 경우에는 상기 구성요소에서 다른 구성요소를 통하지 않고 상기 다른 구성요소로 상기 데이터가 전송되는 것을 의미한다.In addition, in the present specification, when one component 'transmits' data to another component, the component may directly transmit the data to the other component, or through at least one other component. It means that the data can be transmitted to the other component. Conversely, when one component 'directly transmits' data to another component, it means that the data is transmitted from the component to the other component without going through the other component.

이하, 첨부된 도면들을 참조하여 본 발명의 실시예들을 중심으로 본 발명을 상세히 설명한다. 각 도면에 제시된 동일한 참조부호는 동일한 부재를 나타낸다.Hereinafter, the present invention will be described in detail with reference to the accompanying drawings, focusing on embodiments of the present invention. Like reference numerals in each figure indicate like elements.

도 1은 본 발명의 기술적 사상에 따른 음소 단위 발음 정확도 평가 방법 및 발음 코칭용 학습 컨텐츠 제공 방법이 수행되는 환경을 개략적으로 도시한 도면이다.1 is a diagram schematically illustrating an environment in which a method for evaluating pronunciation accuracy in phoneme units and a method for providing learning contents for pronunciation coaching according to the technical idea of the present invention are performed.

도 1을 참조하면, 본 발명의 일 실시예에 따른 음소 단위 발음 정확도 평가 방법은 음소 단위 발음 정확도 평가 시스템(100; 이하, '평가 시스템'이라고 함)에 의해 수행될 수 있으며, 본 발명의 일 실시예에 따른 발음 코칭용 학습 컨텐츠 제공 방법은 발음 코칭용 학습 컨텐츠 제공 시스템(200; 이하 '학습 시스템'이라고 함)에 의해 수행될 수 있다.Referring to FIG. 1 , a method for evaluating pronunciation accuracy in units of phonemes according to an embodiment of the present invention may be performed by a system for evaluating pronunciation accuracy in units of phonemes (100; hereinafter referred to as 'evaluation system'). A method of providing learning contents for pronunciation coaching according to an embodiment may be performed by a system 200 (hereinafter referred to as a 'learning system') for providing learning contents for pronunciation coaching.

상기 평가 시스템(100)은 소정의 언어의 소리 체계를 구성하는 각 음소 별로 음소 레벨 점수를 산출할 수 있다.The evaluation system 100 may calculate a phoneme level score for each phoneme constituting the sound system of a predetermined language.

상기 언어는 대표적으로 영어일 수 있으나, 이에 한정되는 것은 아니며 중국어, 일본어, 한국어, 불어, 스페인어 등 다양할 수 있다. 이하에서는 영어를 위주로 설명을 하지만 본 발명의 기술적 사상이 이에 한정되는 것은 아니다.The language may be typically English, but is not limited thereto and may be various such as Chinese, Japanese, Korean, French, Spanish, and the like. Hereinafter, description will be given mainly in English, but the technical idea of the present invention is not limited thereto.

한편 음소는 특정 언어의 소리 체계 내에서 다른 소리와 구별되어 대립적 기능을 하는 언어 사용자가 인식하는 소리의 최소 단위일 수 있다. 예를 들어 음소는 자음 및 모음을 포함할 수 있으며, 모음은 장모음, 단모음, 이중모음 등을 포함할 수 있다. 특정한 언어에서 발음될 수 있는 음소는 이미 널리 공지되어 있으므로 여기에서는 상세한 설명은 생략하기로 한다. 통상적으로 각각의 음소는 서로 구분되는 발음 기호로 표기될 수 있으며, 발음기호에는 대표적으로 국제음성기호(International Phonetic Alphabet)를 들 수 있다.On the other hand, a phoneme may be the smallest unit of sound recognized by a language user, which is distinguished from other sounds within the sound system of a specific language and plays an opposing function. For example, phonemes may include consonants and vowels, and vowels may include long vowels, short vowels, diphthongs, and the like. Since phonemes that can be pronounced in a specific language are already widely known, a detailed description thereof will be omitted. Conventionally, each phoneme may be represented by a phonetic symbol that is distinguished from each other, and the international phonetic alphabet is a representative example of the phonetic symbol.

상기 평가 시스템(100)은 피평가자의 사용자 단말(예를 들어, 20)로부터 상기 피평가자가 발화한 다수의 음성 데이터를 획득할 수 있으며, 획득한 음성 데이터에 기초하여 각 음소 별 음소 레벨 점수를 산출할 수 있다.The evaluation system 100 may obtain a plurality of voice data uttered by the subject from the user terminal (eg, 20) of the subject, and calculate a phoneme level score for each phoneme based on the obtained speech data. can

일 실시예에서 상기 평가 시스템(100)은 상기 언어의 소리 체계를 구성하는 각 음소와 해당 음소의 음소 레벨 점수를 매핑한 음소 별 점수 정보(예를 들어, 테이블)을 생성할 수 있다. 한편 상기 평가 시스템(100)은 사용자(예를 들어, 피평가자, 피학습자 등)마다 각각 음소 별 점수 정보를 별도로 구성할 수 있다.In an embodiment, the evaluation system 100 may generate score information (eg, a table) for each phoneme in which each phoneme constituting the sound system of the language is mapped to a phoneme level score of the corresponding phoneme. Meanwhile, the evaluation system 100 may separately configure score information for each phoneme for each user (eg, a subject to be evaluated, a student to be learned, etc.).

일 실시예예서, 상기 평가 시스템(100)은 생성한 음소 별 점수 정보를 데이터베이스(300; DB)에 저장할 수 있다. 상기 DB(300)는 음소 별 점수 정보 외에 본 발명의 기술적 사상을 구현하는데 필요한 각종 정보를 저장할 수 있다. 예를 들어, 상기 DB(300)는 음소 평가나 발음 학습에 이용될 각종 단어 혹은 문장을 저장할 수 있으며, 음소와 단어 혹은 문장 간의 매핑 관계를 저장할 수도 있다. 또한 상기 DB(300)는 특정 음소와 대응되는 대치 음소에 대한 정보를 더 저장할 수도 있다. 본 명세서에서 DB라 함은, 적어도 하나의 테이블로 구현될 수도 있으며, 상기 DB에 저장된 정보를 검색, 저장, 및 관리하기 위한 별도의 DBMS(Database Management System)을 더 포함하는 의미로 사용될 수도 있다. 또한, 링크드 리스트(linked-list), 트리(Tree), 관계형 DB의 형태 등 다양한 방식으로 구현될 수 있으며, 상기 DB에 저장될 정보를 저장할 수 있는 모든 데이터 저장매체 및 데이터 구조를 포함하는 의미로 사용될 수 있다.In one embodiment, the evaluation system 100 may store the generated score information for each phoneme in the database 300 (DB). The DB 300 may store various types of information required to implement the technical idea of the present invention in addition to score information for each phoneme. For example, the DB 300 may store various words or sentences to be used for phoneme evaluation or pronunciation learning, and may store mapping relationships between phonemes and words or sentences. In addition, the DB 300 may further store information on replacement phonemes corresponding to specific phonemes. In this specification, DB may be implemented as at least one table, and may be used to mean further including a separate database management system (DBMS) for searching, storing, and managing information stored in the DB. In addition, it can be implemented in various ways such as linked-list, tree, relational DB, etc., and includes all data storage media and data structures capable of storing information to be stored in the DB. can be used

한편 상기 학습 시스템(200)은 상기 평가 시스템(100)에 의해 구축된 음소 별 점수 정보를 이용하여 사용자가 취약한 발음을 학습할 수 있도록 하는 학습용 컨텐츠를 생성하고 이를 상기 학습자의 사용자 단말(예를 들어, 20)로 제공할 수 있다.Meanwhile, the learning system 200 uses the score information for each phoneme established by the evaluation system 100 to generate learning content that allows the user to learn weak pronunciation, and creates content for learning using the learner's user terminal (for example, , 20).

예를 들어 상기 학습 시스템(200)은 상기 평가 시스템(100)에 의해 구축된 음소 별 점수 정보를 이용하여 학습이 필요할 것으로 보이는 음소를 파악한 후 해당 음소가 포함된 학습 단어 또는 학습 문장을 상기 사용자 단말(20)로 제공할 수 있다. 그러면 상기 사용자 단말(20)은 사용자가 발화하는 상기 학습 단어 또는 학습 문장에 대한 음성 데이터를 획득하고 이를 상기 학습 시스템(200)에 전달할 수 있으며, 상기 학습 시스템(200)은 이를 평가할 수 있다.For example, the learning system 200 uses the score information for each phoneme established by the evaluation system 100 to determine a phoneme that is likely to require learning, and then sends a learning word or a learning sentence including the phoneme to the user terminal. (20) can be provided. Then, the user terminal 20 may acquire voice data for the learning word or learning sentence uttered by the user and transmit it to the learning system 200, and the learning system 200 may evaluate it.

한편, 상기 평가 시스템(100) 및/또는 상기 학습 시스템(200)은 본 발명의 기술적 사상을 구현하기 위한 연산능력을 가진 데이터 처리장치인 컴퓨팅 시스템일 수 있으며, 일반적으로 네트워크를 통해 클라이언트가 접속 가능한 데이터 처리 장치인 서버뿐만 아니라 개인용 컴퓨터나 휴대 단말 등과 같은 컴퓨팅 장치를 포함할 수 있다.On the other hand, the evaluation system 100 and/or the learning system 200 may be a computing system that is a data processing device having an arithmetic capability for implementing the technical concept of the present invention, and is generally connectable by a client through a network. It may include a computing device such as a personal computer or portable terminal as well as a server that is a data processing device.

상기 평가 시스템(100) 및/또는 상기 학습 시스템(200)은 어느 하나의 물리적 장치로 구현될 수도 있으나, 필요에 따라 복수의 물리적 장치가 유기적으로 결합되어 본 발명의 기술적 사상에 따른 상기 평가 시스템(100) 및/또는 상기 학습 시스템(200)을 구현할 수 있음을 본 발명의 기술분야의 평균적 전문가는 용이하게 추론할 수 있을 것이다.The evaluation system 100 and/or the learning system 200 may be implemented with any one physical device, but a plurality of physical devices are organically combined as needed to form the evaluation system according to the technical idea of the present invention ( 100) and/or the learning system 200 can be implemented easily by an average expert in the art of the present invention.

도 1에 도시된 바와 같이, 상기 평가 시스템(100) 및 상기 학습 시스템(200)은 소정의 모(母) 시스템(10)의 서브 시스템의 형태로 구현될 수도 있다. 상기 모 시스템(10)은 서버일 수 있다. 상기 서버(10)는 본 발명의 기술적 사상을 구현하기 위한 연산능력을 가진 데이터 처리장치를 의미하며, 일반적으로 네트워크를 통해 클라이언트가 접속 가능한 데이터 처리장치뿐만 아니라 개인용 컴퓨터, 휴대 단말 등과 같이 특정 서비스를 수행할 수 있는 어떠한 장치도 서버로 정의될 수 있음을 본 발명의 기술분야의 평균적 전문가는 용이하게 추론할 수 있을 것이다. As shown in FIG. 1 , the evaluation system 100 and the learning system 200 may be implemented as subsystems of a predetermined mother system 10 . The parent system 10 may be a server. The server 10 refers to a data processing device having computing power for implementing the technical idea of the present invention, and generally provides specific services such as personal computers and mobile terminals as well as data processing devices accessible to clients through a network. An average expert in the art of the present invention will readily infer that any device capable of performing can be defined as a server.

도 1에 도시된 바와 같이, 상기 평가 시스템(100) 및 상기 학습 시스템(200)은 서로 분리된 형태로 구현될 수도 있으나, 실시 예에 따라 상기 평가 시스템(100) 및 상기 학습 시스템(200)은 하나의 시스템으로 구현될 수도 있다.As shown in FIG. 1, the evaluation system 100 and the learning system 200 may be implemented in separate forms, but according to an embodiment, the evaluation system 100 and the learning system 200 It can also be implemented as a single system.

상기 사용자 단말(200)은 프로세싱 장치를 구비한 컴퓨팅 장치일 수 있다. 예를 들어 상기 사용자 단말(200)은 스마트폰, 휴대전화, PDA, 태블릿 PC와 같은 모바일 장치일 수 있으나 이에 한정되는 것은 아니며, 랩탑, 테스크탑 등을 포함할 수 있다. 한편 상기 사용자 단말(200)은 발화자의 음성을 입력받을 수 있는 마이크, 상기 학습 시스템(100)이 제공하는 학습용 컨텐츠를 출력할 수 있는 디스플레이 장치 및 스피커를 포함할 수 있다.The user terminal 200 may be a computing device having a processing device. For example, the user terminal 200 may be a mobile device such as a smart phone, a mobile phone, a PDA, or a tablet PC, but is not limited thereto, and may include a laptop, a desktop, and the like. Meanwhile, the user terminal 200 may include a microphone capable of receiving a speaker's voice, a display device capable of outputting contents for learning provided by the learning system 100, and a speaker.

한편 상기 평가 시스템(100) 및/또는 상기 학습 시스템(200)은 상기 사용자 단말(20)과 유/무선 네트워크(예를 들어, 인터넷)를 통해 연결되어 본 발명의 기술적 사상을 구현하는데 필요한 각종 정보, 데이터 및/또는 신호를 송수신할 수 있다.On the other hand, the evaluation system 100 and/or the learning system 200 are connected to the user terminal 20 through a wired/wireless network (eg, Internet), and various information required to implement the technical idea of the present invention. , data and / or signals can be transmitted and received.

도 2는 상기 평가 시스템(100)이 수행하는 음소 단위 발음 정확도 평가 방법의 일 예를 도시한 흐름도이다.2 is a flowchart illustrating an example of a method for evaluating pronunciation accuracy in units of phonemes performed by the evaluation system 100 .

도 2를 참조하면 상기 평가 시스템(100)은 피평가자가 발화한 복수의 평가 대상 단어 각각의 음성 데이터를 획득할 수 있다(S110).Referring to FIG. 2 , the evaluation system 100 may obtain voice data of each of a plurality of evaluation target words uttered by the subject (S110).

일 실시예에서, 상기 평가 시스템(100)은 피평가자의 사용자 단말(예를 들어, 20)로부터 상기 복수의 평가 대상 단어 각각의 음성 데이터를 획득할 수 있다. 예를 들어, 상기 평가 시스템(100)은 평가 대상 단어를 상기 사용자 단말(20)에 전송하고 상기 사용자 단말(20)은 상기 평가 대상 단어를 디스플레이할 수 있으며, 피평가자는 해당 단어를 발화하여 상기 사용자 단말(20)에 구비된 마이크를 통해 입력할 수 있다. 그러면 상기 사용자 단말(20)은 마이크를 통해 입력된 음성에 상응하는 음성 데이터를 상기 평가 시스템(100)으로 전송할 수 있다.In one embodiment, the evaluation system 100 may obtain voice data of each of the plurality of evaluation target words from the user terminal (eg, 20) of the subject of evaluation. For example, the evaluation system 100 may transmit a word to be evaluated to the user terminal 20, the user terminal 20 may display the word to be evaluated, and the subject to be evaluated utters the word to the user terminal 20. It can be input through a microphone provided in the terminal 20 . Then, the user terminal 20 may transmit voice data corresponding to the voice input through the microphone to the evaluation system 100 .

상기 평가 시스템(100)은 문장 단위로 입력된 음성 데이터를 획득할 수도 있는데, 그 예가 도 3에 도시되어 있다. 도 3은 도 2의 S110 단계의 구체적인 예를 도시한 흐름도이다.The evaluation system 100 may acquire voice data input in units of sentences, an example of which is shown in FIG. 3 . 3 is a flowchart illustrating a specific example of step S110 of FIG. 2 .

도 3을 참조하면 상기 평가 시스템(100)은 상기 사용자 단말(20)로 복수의 단어를 포함하는 평가 대상 문장을 제공할 수 있으며(S111), 상기 사용자 단말(20)은 상기 평가 대상 문장을 출력할 수 있고, 피평가자는 상기 평가 대상 문장을 발화하여 마이크를 통해 입력할 수 있다. 그러면 상기 사용자 단말은 피평가자가 발화한 평가 대상 문장의 음성에 상응하는 음성 데이터를 상기 평가 시스템(100)으로 전송할 수 있다(S112).Referring to FIG. 3 , the evaluation system 100 may provide an evaluation target sentence including a plurality of words to the user terminal 20 (S111), and the user terminal 20 outputs the evaluation target sentence. and the subject to be evaluated can utter the sentence to be evaluated and input it through a microphone. Then, the user terminal may transmit voice data corresponding to the voice of the evaluation target sentence uttered by the subject to the evaluation system 100 (S112).

이후 상기 평가 시스템(100)은 상기 평가 대상 문장의 음성 데이터를 단어 별로 분할할 수 있다(S113). 문장에 상응하는 음성 데이터를 단어 별로 구분하는 기술은 파형 분석이나 머신 러닝 기술을 통해 구현될 수 있다. 문장에 상응하는 음성 데이터를 단어 별로 구분하는 기술 자체는 널리 공지되어 있으므로 본 명세서에서는 이에 대한 상세한 설명은 생략하기로 한다.Thereafter, the evaluation system 100 may divide the speech data of the evaluation target sentence by word (S113). A technology for classifying voice data corresponding to a sentence into words may be implemented through waveform analysis or machine learning technology. Since the technology itself of classifying voice data corresponding to a sentence into words is widely known, a detailed description thereof will be omitted in the present specification.

상기 평가 시스템(100)은 복수의 평가 대상 문장 각각에 대하여 S111 내지 S113 단계를 수행할 수 있다. 따라서, 도 3의 실시예에서 상기 복수의 평가 대상 단어는, 상기 복수의 평가 대상 문장에 포함된 각 단어로 구성될 수 있다.The evaluation system 100 may perform steps S111 to S113 for each of a plurality of evaluation target sentences. Accordingly, in the embodiment of FIG. 3 , the plurality of evaluation target words may be composed of each word included in the plurality of evaluation target sentences.

예를 들어 상기 복수의 평가 대상 문장이 각각 T₁, T₂, T₃이고, T₁은 단어 W₁, W₂, W₃로 구성되어 있고, T₂는 단어 W₄, W₅, W₆로 구성되어 있고, T₃는 단어 W₇, W₈, W₉, W₁₀으로 구성되어 있는 경우, 상기 평가 대상 단어는 W₁ 내지 W₁₀일 수 있다.For example, the plurality of evaluation target sentences are T ₁ , T ₂ , and T ₃ , T ₁ is composed of words W ₁ , W ₂ , and W ₃ , and T ₂ is composed of words W ₄ , W ₅ , and W ₆ . , and T ₃ is composed of words W ₇ , W ₈ , W ₉ , and W ₁₀ , the words to be evaluated may be W ₁ to W ₁₀ .

다시 도 2를 참조하면, 상기 평가 시스템(100)은 복수의 평가 대상 단어 각각에 상응하는 단어 레벨 점수 획득할 수 있다(S120). 다시 말하면, 상기 평가 시스템은(100), 상기 복수의 평가 대상 단어 각각에 대하여, 상기 평가 대상 단어의 음성 데이터에 기초하여 상기 평가 대상 단어에 상응하는 단어 레벨 점수를 산출할 수 있다.Referring back to FIG. 2 , the evaluation system 100 may obtain word level scores corresponding to each of a plurality of words to be evaluated (S120). In other words, the evaluation system 100 may calculate a word level score corresponding to the evaluation target word based on voice data of the evaluation target word for each of the plurality of evaluation target words.

실시예에 따라 상기 단어 레벨 점수는 세부적인 항목의 점수를 포함할 수 있다. 예를 들어, 상기 평가 대상 단어의 단어 레벨 점수는 상기 피평가자가 발화한 상기 평가 대상 단어의 발음의 정확도를 나타내는 발음 점수, 상기 피평가자가 발화한 상기 평가 대상 단어의 억양의 정확도를 나타내는 억양 점수, 상기 피평가자가 발화한 상기 평가 대상 단어의 볼륨의 정확도를 나타내는 볼륨 점수, 상기 피평가자가 발화한 상기 평가 대상 단어의 길이의 장단의 정확도를 나타내는 길이 점수, 상기 피평가자가 발화한 상기 평가 대상 단어의 강세의 정확도를 나타내는 강세 점수 중 적어도 일부를 포함할 수 있으며, 상기 평가 시스템(100)은 각각의 세부 항목의 점수를 산출하는 과정을 별도로 수행할 수 있다.According to embodiments, the word level score may include scores of detailed items. For example, the word level score of the word to be evaluated includes a pronunciation score indicating accuracy of pronunciation of the word to be evaluated uttered by the subject, an intonation score indicating accuracy of intonation of the word to be evaluated uttered by the subject, the A volume score indicating the accuracy of the volume of the word to be evaluated uttered by the subject, a length score indicating the accuracy of the length of the word to be evaluated uttered by the subject, and an accuracy of stress of the word to be evaluated uttered by the subject It may include at least some of the stress scores representing , and the evaluation system 100 may separately perform a process of calculating scores for each subitem.

또는 상기 평가 대상 단어의 단어 레벨 점수는 발음 점수, 억양 점수, 볼륨 점수, 길이 점수 강세 점수 중 적어도 일부의 가중 평균으로 계산되는 값일 수도 있다.Alternatively, the word level score of the word to be evaluated may be a value calculated as a weighted average of at least some of pronunciation scores, intonation scores, volume scores, length scores, and stress scores.

상기 평가 시스템(100)이 특정한 단어의 음성 데이터를 이용하여 해당 단어에 상응하는 단어 레벨 점수를 산출하는 방법은 다양할 수 있다. 예를 들어, 상기 평가 시스템(100)은 해당 단어의 음성 데이터의 파형을 분석하고 소정의 정답 음성 데이터의 파형과 비교하여 일치도를 비교할 수 있다. 이 외에도 단어레벨 점수를 산출할 수 있는 다양한 방법이 있다. 음성 데이터에 기초하여 단어 레벨의 정확도 혹은 점수를 판단하는 기술 자체는 본 발명의 핵심적인 특징이 아니며 널리 공지된 기술이므로 본 명세서에서는 이에 대한 상세한 설명은 생략하기로 한다.A method in which the evaluation system 100 calculates a word level score corresponding to a specific word using voice data of a specific word may be various. For example, the evaluation system 100 may compare the degree of agreement by analyzing the waveform of voice data of the corresponding word and comparing it with the waveform of predetermined answer voice data. In addition to this, there are various methods that can calculate word level scores. Since the technology of determining word-level accuracy or score based on voice data is not a key feature of the present invention and is a well-known technology, a detailed description thereof will be omitted herein.

이후 상기 평가 시스템(100)은 상기 언어의 소리 체계를 구성하는 전체 음소 혹은 전체 음소 중 일부인 복수의 평가 대상 음소 각각에 대하여, 상기 평가 대상 음소에 상응하는 음소 레벨 점수를 결정할 수 있다(S130).Thereafter, the evaluation system 100 may determine a phoneme level score corresponding to the phoneme to be evaluated for each of a plurality of phonemes to be evaluated, which are all phonemes or a part of all phonemes constituting the sound system of the language (S130).

보다 상세하게는 상기 평가 시스템(100)은 상기 피평가자가 발화한 상기 복수의 평가 대상 단어 중에서 상기 평가 대상 음소를 포함하는 단어 각각의 단어 레벨 점수에 기초하여, 상기 평가 대상 음소에 상응하는 음소 레벨 점수를 결정할 수 있다(S140).More specifically, the evaluation system 100 determines the phoneme level score corresponding to the phoneme to be evaluated based on the word level score of each word including the phoneme to be evaluated among the plurality of words to be evaluated uttered by the person to be evaluated. It can be determined (S140).

일 실시예에서, 상기 평가 시스템(100)은 상기 복수의 평가 대상 단어 중에서 상기 평가 대상 음소를 포함하는 단어 각각의 단어 레벨 점수의 평균 값을 산출할 수 있으며, 산출한 평균 값을 상기 평가 대상 음소에 상응하는 음소 레벨 점수로 결정할 수 있는데, 이에 대하여 도 4a 및 도 4b를 참조하여 보다 상세하게 설명하기로 한다. 설명의 편의를 위하여 도 4a 및 도 4b에서는 언어의 소리 체계를 구성하는 전체 음소가 P₁ 내지 P5라고 가정한다.In an embodiment, the evaluation system 100 may calculate an average value of word level scores of each word including the phoneme to be evaluated among the plurality of words to be evaluated, and the calculated average value may be used as the phoneme to be evaluated. It can be determined as a phoneme level score corresponding to , which will be described in more detail with reference to FIGS. 4A and 4B. For convenience of explanation, it is assumed in FIGS. 4A and 4B that all phonemes constituting the sound system of language are _P1 to P5.

도 4a는 8개의 평가 대상 단어 W₁ 내지 W₈ 각각의 단어 레벨 점수 및 해당 단어를 구성하는 음소를 도시한 표이며, 도 4b는 음소 P₁ 내지 P₅ 각각의 음소 레벨 점수를 산출하기 위한 식을 나타낸 표이다.FIG. 4A is a table showing word level scores of each of eight evaluation target words W ₁ to W ₈ and phonemes constituting the word, and FIG. 4B is an equation for calculating phoneme level scores of each of phonemes P ₁ to P ₅ . is a table showing

도 4a에 도시된 바와 같이 각 평가 대상 단어 W₁ 내지 W₈는 음소 혹은 음소들로 구성될 수 있는데, 해당 평가 대상 단어를 구성하는 음소 혹은 음소들은 미리 정의되어 있다. 따라서, 상기 평가 시스템(100)은 평가 대상 단어가 될 수 있는 모든 후보 단어 각각에 대하여, 해당 후보 단어를 구성하는 음소를 미리 저장할 수 있으며, 후보 단어 및 이를 구성하는 음소들간의 매핑 관계는 상기 DB(300)에 미리 저장되어 있을 수 있다.As shown in FIG. 4A , each evaluation target word W ₁ to W ₈ may be composed of phonemes or phonemes, and the phonemes or phonemes constituting the evaluation target word are predefined. Therefore, the evaluation system 100 may store phonemes constituting the candidate word in advance for each candidate word that may be an evaluation target word, and the mapping relationship between the candidate word and the phonemes constituting the candidate word may be determined in the DB. (300) may be stored in advance.

도 4b를 참조하면, 음소 P₁의 경우에는 평가 대상 단어 W₁ 내지 W₈ 중 W₁, W₂, W₄에 포함되어 있으므로 상기 평가 시스템(100)은 W₁의 단어 레벨 점수 S₁, W₂의 단어 레벨 점수 S₂, 및 W₄의 단어 레벨 점수 S₄의 평균 값(즉, (S₁+S₂+S₄)/3)을 산출할 수 있으며 이를 P₁의 음소 레벨 점수로 결정할 수 있다.Referring to FIG. 4B , since the phoneme P ₁ is included in W ₁ , W ₂ , and W ₄ of words W ₁ to W ₈ to be evaluated, the evaluation system 100 determines the word level score S ₁ , W of W ₁ . The average value of the word-level score S ₂ of ₂ and the word-level score S ₄ of W ₄ (ie, (S ₁ +S ₂ +S ₄ )/3) can be calculated and determined as the phoneme level score of P ₁ . can

음소 P₂는 평가 대상 단어 W₁, W₅, W₆, W₇에 포함되어 있으므로 상기 평가 시스템(100)은 이들의 단어 레벨 점수의 평균 값인 [(S₁+S₅+S₆+S₇)/4]을 P₂의 음소 레벨 점수로 결정할 수 있다.Since the phoneme P ₂ is included in the words W ₁ , W ₅ , W ₆ , and W ₇ to be evaluated, the evaluation system 100 calculates the average value of their word level scores [(S ₁ +S ₅ +S ₆ +S ₇ )/4] can be determined as the phoneme level score of P ₂ .

음소 P₃는 평가 대상 단어 W₃, W₄, W₈ 에 포함되어 있으므로 상기 평가 시스템(100)은 이들의 단어 레벨 점수의 평균 값인 [(S₃+S₄+S₈)/3]을 P₃의 음소 레벨 점수로 결정할 수 있다.Since the phoneme P ₃ is included in the words W ₃ , W ₄ , and W ₈ to be evaluated, the evaluation system 100 calculates [(S ₃ +S ₄ +S ₈ )/3], the average value of their word level scores, as P . It can be determined with a phoneme level score of ₃ .

음소 P₄는 평가 대상 단어 W₂, W₃, W₅, W₇에 포함되어 있으므로 상기 평가 시스템(100)은 이들의 단어 레벨 점수의 평균 값인 [(S₂+S₃+S₅+S₇)/4]을 P₄의 음소 레벨 점수로 결정할 수 있다.Since the phoneme P ₄ is included in the words W ₂ , W ₃ , W ₅ , and W ₇ to be evaluated, the evaluation system 100 calculates the average value of their word level scores [(S ₂ +S ₃ +S ₅ +S ₇ )/4] can be determined as the phoneme level score of P ₄ .

음소 P₅는 평가 대상 단어 W₂, W₃, W₅, W₆, W₈에 포함되어 있으므로 상기 평가 시스템(100)은 이들의 단어 레벨 점수의 평균 값인 [(S₂+S₃+S₅+S₆+S₈)/5]을 P₅의 음소 레벨 점수로 결정할 수 있다.Since the phoneme P ₅ is included in the words W ₂ , W ₃ , W ₅ , W ₆ , and W ₈ to be evaluated, the evaluation system 100 calculates the average value of their word level scores [(S ₂ +S ₃ +S ₅ +S ₆ +S ₈ )/5] can be determined as the phoneme level score of P ₅ .

한편 도 4b에 도시된 바와 같이, 상기 평가 시스템(100)은 평가 대상 음소를 포함하는 단어 각각의 단어 레벨 점수의 평균 값 그 자체를 상기 평가 대상 음소에 상응하는 음소 레벨 점수로 결정할 수도 있으나, 실시예에 따라서는 상기 복수의 평가 대상 단어 중에서 상기 평가 대상 음소를 포함하는 단어 각각의 단어 레벨 점수의 평균 값에 기초하여, 상기 평가 대상 음소에 상응하는 음소 레벨 점수를 결정할 수도 있다. 예를 들어 상기 평가 시스템(100)은 상기 평가 대상 음소를 포함하는 단어 각각의 단어 레벨 점수의 평균 값이 특정한 수치 구간에 속하는 경우 해당 수치 구간에 상응하는 점수를 상기 평가 대상 음소에 상응하는 음소 레벨 점수로 결정할 수도 있다. 이 외에도 상기 평가 대상 음소에 상응하는 음소 레벨 점수를 결정하는 다양한 방법이 있을 수 있음은 물론이다.Meanwhile, as shown in FIG. 4B, the evaluation system 100 may determine the average value of word level scores of each word including the phoneme to be evaluated as the phoneme level score corresponding to the phoneme to be evaluated. According to an example, a phoneme level score corresponding to the phoneme to be evaluated may be determined based on an average value of word level scores of each word including the phoneme to be evaluated among the plurality of words to be evaluated. For example, if the average value of word level scores of each word including the phoneme to be evaluated belongs to a specific numerical range, the evaluation system 100 assigns a score corresponding to the corresponding numerical interval to the phoneme level corresponding to the phoneme to be evaluated. It can also be decided by score. In addition to this, of course, there may be various methods for determining a phoneme level score corresponding to the phoneme to be evaluated.

또한 실시예에 따라서는 상기 평가 대상 음소를 포함하는 단어 각각의 단어 레벨 점수의 평균 값을 산출하는 대신, 상기 평가 대상 음소를 포함하는 단어 각각의 단어 레벨 점수에 기초하여 산출될 수 있는 다양한 값을 기반으로 상기 평가 대상 음소에 상응하는 음소 레벨 점수를 결정할 수도 있다. 예를 들어 상기 평가 시스템(100)은 상기 평가 대상 음소를 포함하는 단어 각각에 포함된 음소의 개수의 역수를 가중치로 하는 가중 평균을 산출하고 이를 기반으로 상기 평가 대상 음소에 상응하는 음소 레벨 점수를 결정할 수도 있는데, 그 예가 도 4c에 도시되어 있다. 도 4c의 음소 레벨 점수는 평가 대상 단어와 그들의 단어 레벨 점수가 도 4a인 경우를 도시하고 있으며, 단어 각각에 포함된 음소의 개수의 역수를 가중치로 하는 가중 평균을 산출한다는 점을 제외하면 도 4b와 동일하므로 상세한 설명은 생략하기로 한다.In addition, depending on the embodiment, instead of calculating the average value of word level scores of each word including the phoneme to be evaluated, various values that can be calculated based on the word level score of each word including the phoneme to be evaluated are calculated. Based on this, a phoneme level score corresponding to the phoneme to be evaluated may be determined. For example, the evaluation system 100 calculates a weighted average in which the reciprocal of the number of phonemes included in each word including the phoneme to be evaluated is used as a weight, and based on this, a phoneme level score corresponding to the phoneme to be evaluated is calculated. It can also be determined, an example of which is shown in Figure 4c. The phoneme level score of FIG. 4C shows the case where the evaluation target words and their word level scores are in FIG. 4A, except that a weighted average is calculated with the reciprocal of the number of phonemes included in each word as a weight. Since it is the same as, detailed description will be omitted.

도 5는 본 발명의 다른 일 실시예에 따른 음소 단위 발음 정확도 평가 방법의 일 예를 도시한 흐름도이다. 도 5를 설명함에 있어, 앞서 설명한 내용과 중복되거나 앞서 설명한 내용으로부터 자명하게 도출할 수 있는 내용에 대해서는 상세한 설명을 생략하기로 한다.5 is a flowchart illustrating an example of a method for evaluating pronunciation accuracy in units of phonemes according to another embodiment of the present invention. In the description of FIG. 5 , detailed descriptions of contents that are redundant with the above-described contents or that can be obviously derived from the above-described contents will be omitted.

도 5를 참조하면, 상기 평가 시스템(100)은 복수의 평가 대상 단어 각각에 대하여, S210 단계 내지 S240 단계를 수행할 수 있다(S200).Referring to FIG. 5 , the evaluation system 100 may perform steps S210 to S240 for each of a plurality of words to be evaluated (S200).

상기 평가 시스템(100)은 피평가자가 발화한 상기 평가 대상 단어의 음성 데이터를 획득할 수 있다(S210).The evaluation system 100 may acquire voice data of the word to be evaluated uttered by the person to be evaluated (S210).

상기 평가 시스템(100)은 평가 대상 단어 각각의 음성 데이터가 획득될 때마다, 상기 평가 대상 단어의 음성 데이터에 기초하여 상기 피평가자의 음소 레벨 점수를 갱신할 수 있다(S220 내지 S240).The evaluation system 100 may update the phoneme level score of the subject of evaluation based on the speech data of each word to be evaluated whenever voice data of each word to be evaluated is obtained (S220 to S240).

보다 상세하게는 상기 평가 시스템(210)은 상기 평가 대상 단어의 음성 데이터에 기초하여, 상기 평가 대상 단어에 상응하는 단어 레벨 점수를 산출할 수 있다(S220).More specifically, the evaluation system 210 may calculate a word level score corresponding to the evaluation target word based on voice data of the evaluation target word (S220).

또한 상기 평가 시스템(210)은 상기 평가 대상 단어를 구성하는 적어도 하나의 평가 대상 음소를 판단할 수 있다(S230).In addition, the evaluation system 210 may determine at least one evaluation target phoneme constituting the evaluation target word (S230).

이후 상기 평가 시스템(210)은 상기 평가 대상 단어를 구성하는 적어도 하나의 평가 대상 음소 각각에 대하여, 기 산출된 상기 피평가자의 상기 평가 대상 음소에 상응하는 음소 레벨 점수에 상기 평가 대상 단어에 상응하는 단어 레벨 점수를 반영함으로써 상기 피평가자의 상기 평가 대상 음소에 상응하는 음소 레벨 점수를 갱신할 수 있다(S230, S240). 예를 들어 상기 평가 시스템(100)은 현재까지 입력된 평가 대상 단어 중에서 상기 평가 대상 음소를 포함하는 단어 각각의 단어 레벨 점수의 누적 평균 값을 산출할 수 있으며, 상기 평가 대상 음소에 상응하는 음소 레벨 점수를 산출된 누적 평균 값으로 갱신할 수 있다. 한편 각 평가 대상 음소의 초기 음소 레벨 점수는 0으로 설정되어 있을 수 있다.Thereafter, the evaluation system 210 calculates the phoneme level score corresponding to the phoneme to be evaluated for each of the at least one phoneme to be evaluated constituting the word to be evaluated, and the word corresponding to the word to be evaluated. By reflecting the level score, the phoneme level score corresponding to the evaluation target phoneme of the subject can be updated (S230 and S240). For example, the evaluation system 100 may calculate a cumulative average value of word level scores of each word including the phoneme to be evaluated among the words to be evaluated that have been input so far, and the phoneme level corresponding to the phoneme to be evaluated. The score can be updated with the calculated cumulative average value. Meanwhile, the initial phoneme level score of each phoneme to be evaluated may be set to 0.

도 4a의 각 평가 대상 단어 W₁ 내지 W₈의 음성 데이터가 차례로 획득되는 경우, 도 5의 방법에 의해 갱신되는 평가 대상 음소 P₁ 내지 P₅의 음소 레벨 점수의 변화가 도 6에 도시되어 있다. When the voice data of each evaluation target word W ₁ to W ₈ in FIG. 4A is obtained sequentially, FIG. 6 shows changes in phoneme level scores of evaluation target phones P ₁ to P ₅ updated by the method of FIG. 5 .

도 4a 및 도 6을 참조하면, 평가 대상 음소 P₁ 내지 P₅의 음소 레벨 점수가 소정의 초기 값(예를 들어, 0)으로 설정되어 있는 상태에서, 평가 대상 단어 W₁의 음성 데이터가 획득되면(S210), 상기 평가 시스템(100)은 평가 대상 단어 W₁의 단어 레벨 점수를 S₁으로 결정할 수 있으며(S220), 상기 평가 대상 단어를 구성하는 적어도 하나의 평가 대상 음소 P₁ 및 P₂를 판단할 수 있다(S230).Referring to FIGS. 4A and 6 , voice data of the word W ₁ to be evaluated is obtained in a state in which the phoneme level scores of the phonemes to be evaluated P ₁ to P ₅ are set to a predetermined initial value (eg, 0). (S210), the evaluation system 100 may determine the word level score of the evaluation target word W ₁ as S ₁ (S220), and at least one evaluation target phoneme P ₁ and P ₂ constituting the evaluation target word It is possible to determine (S230).

그러면 상기 평가 시스템(100)은 평가 대상 음소 P₁의 음소 레벨 점수를 지금까지 입력된 평가 대상 단어들(즉 W₁) 중 평가 대상 음소 P₁을 포함하는 단어들(즉 W₁)의 단어 레벨 점수의 누적 평균 값 S₁ = S₁/1으로 갱신할 수 있다(S240). 마찬가지로 상기 평가 시스템(100)은 평가 대상 음소 P₂의 음소 레벨 점수를 지금까지 입력된 평가 대상 단어들(즉 W₂) 중 평가 대상 음소 P₂를 포함하는 단어들(즉 W₁)의 단어 레벨 점수의 누적 평균 값 S₁ = S₁/1으로 갱신할 수 있다(S240).Then, the evaluation system 100 calculates the phoneme level score of the phoneme P ₁ to be evaluated as the word level of words (ie, W ₁ ) including the phoneme to be evaluated P ₁ among words to be evaluated (ie, W ₁ ) input so far. The cumulative average value of the score S ₁ = S ₁ /1 may be updated (S240). Similarly, the evaluation system 100 calculates the phoneme level score of the phoneme P ₂ to be evaluated as the word level of words (ie, W ₁ ) including the phoneme to be evaluated P ₂ among words to be evaluated (ie, W ₂ ) input so far. The cumulative average value of the score S ₁ = S ₁ /1 may be updated (S240).

다음 차례로 평가 대상 단어 W₂의 음성 데이터가 획득되면(S210), 상기 평가 시스템(100)은 평가 대상 단어 W₂의 단어 레벨 점수를 S₂로 결정할 수 있으며(S220), 상기 평가 대상 단어를 구성하는 적어도 하나의 평가 대상 음소 P₁, P₄ 및 P₅를 판단할 수 있다(S230).Next, when voice data of the word W ₂ to be evaluated is obtained (S210), the evaluation system 100 may determine the word level score of the word W ₂ to be evaluated as S ₂ (S220), and configure the word to be evaluated. At least one evaluation target phoneme P ₁ , P ₄ , and P ₅ may be determined (S230).

그러면 상기 평가 시스템(100)은 평가 대상 음소 P₁의 음소 레벨 점수를 지금까지 입력된 평가 대상 단어들(즉 W_1,W₂) 중 평가 대상 음소 P₁을 포함하는 단어들(즉 W_1,W₂)의 단어 레벨 점수의 누적 평균 값 (S₁₊S₂)/2으로 갱신할 수 있다(S240). 또한 상기 평가 시스템(100)은 평가 대상 음소 P₄의 음소 레벨 점수를 지금까지 입력된 평가 대상 단어들(즉 W₁, W₂) 중 평가 대상 음소 P₄를 포함하는 단어들(즉 W₂)의 단어 레벨 점수의 누적 평균 값 S₂ = S₂/1으로 갱신할 수 있다(S240). 또한 상기 평가 시스템(100)은 평가 대상 음소 P₅의 음소 레벨 점수를 지금까지 입력된 평가 대상 단어들(즉 W₁, W₂) 중 평가 대상 음소 P₅를 포함하는 단어들(즉 W₂)의 단어 레벨 점수의 누적 평균 값 S₂ = S₂/1으로 갱신할 수 있다(S240).Then, the evaluation system 100 assigns the phoneme level score of the phoneme to be evaluated P ₁ to words including _the phoneme to be evaluated P ₁ (ie, W ₁ _, W ₂ ) may be updated to the cumulative average value of word level scores (S _{1 +} S ₂ )/2 (S240). In addition, the evaluation system 100 assigns the phoneme level score of the phoneme P ₄ to be evaluated to words including the phoneme P ₄ to be evaluated (ie, W ₂ ) among words to be evaluated (ie, W ₁ and W ₂ ) input so far. The cumulative average value of word level scores of S ₂ = S ₂ /1 may be updated (S240). In addition, the evaluation system 100 assigns the phoneme level score of the phoneme P ₅ to be evaluated to words including the phoneme P ₅ to be evaluated (ie, W ₂ ) among words to be evaluated (ie, W ₁ and W ₂ ) input so far. The cumulative average value of word level scores of S ₂ = S ₂ /1 may be updated (S240).

다음 차례로 평가 대상 단어 W₃의 음성 데이터가 획득되면(S210), 상기 평가 시스템(100)은 평가 대상 단어 W₃의 단어 레벨 점수를 S₃로 결정할 수 있으며(S220), 상기 평가 대상 단어를 구성하는 적어도 하나의 평가 대상 음소 P₃, P₄ 및 P₅를 판단할 수 있다(S230).Next, when voice data of the word W ₃ to be evaluated is obtained (S210), the evaluation system 100 may determine the word level score of the word W ₃ to be evaluated as S ₃ (S220), and configure the word to be evaluated. At least one evaluation target phoneme P ₃ , P ₄ , and P ₅ may be determined (S230).

그러면 상기 평가 시스템(100)은 평가 대상 음소 P₃의 음소 레벨 점수를 지금까지 입력된 평가 대상 단어들(즉 W_1,W_2,W₃) 중 평가 대상 음소 P₃을 포함하는 단어들(즉 W₃)의 단어 레벨 점수의 누적 평균 값 S₃ = S₃/1으로 갱신할 수 있다(S240). 또한 상기 평가 시스템(100)은 평가 대상 음소 P₄의 음소 레벨 점수를 지금까지 입력된 평가 대상 단어들(즉 W₁, W_2,W₃) 중 평가 대상 음소 P₄를 포함하는 단어들(즉 W_2,W₃)의 단어 레벨 점수의 누적 평균 값 (S₂+S₃)/2로 갱신할 수 있다(S240). 또한 상기 평가 시스템(100)은 평가 대상 음소 P₅의 음소 레벨 점수를 지금까지 입력된 평가 대상 단어들(즉 W₁, W_2,W₃) 중 평가 대상 음소 P₅를 포함하는 단어들(즉 W_2,W₃)의 단어 레벨 점수의 누적 평균 값 (S₂+S₃)/2로 갱신할 수 있다(S240).Then, the evaluation system 100 assigns the phoneme level score of the phoneme P ₃ to be evaluated to words including the phoneme P ₃ to be evaluated (ie, W _{1 ,} W _{2 ,} W ₃ ) among words to be evaluated (ie, W 1 , W 2 , and W 3 ) input so far. The cumulative average value of word level scores of W ₃ ) may be updated to S ₃ = S ₃ /1 (S240). In addition, the evaluation system 100 assigns the phoneme level score of the phoneme P ₄ to be evaluated to words including the phoneme P ₄ to be evaluated (ie, W ₁ , W _{2 ,} W ₃ ) among words to be evaluated (ie, W 1 , W 2 , and W 3 ) input so far. The cumulative average value of word level scores of W _{2 and} W ₃ ) may be updated to (S ₂ +S ₃ )/2 (S240). In addition, the evaluation system 100 assigns the phoneme level score of the phoneme P ₅ to be evaluated to words including the phoneme P ₅ to be evaluated (ie, W ₁ , W _{2 ,} W ₃ ) among words to be evaluated (ie, W 1 , W 2 , and W 3 ) input so far. The cumulative average value of word level scores of W _{2 and} W ₃ ) may be updated to (S ₂ +S ₃ )/2 (S240).

마찬가지 방식으로 상기 평가 시스템(100)은 평가 대상 단어 W₄, W₅, W₆, W_7,W₈이 차례로 들어올 때마다 위 과정을 반복하여 도 6에 도시된 바와 같이 각 평가 대상 음소 P₁ 내지 P₅의 음소 레벨 점수를 갱신할 수 있다.In the same way, the evaluation system 100 repeats the above process whenever evaluation target words W ₄ , W ₅ , W ₆ , W _{7 ,} and W ₈ are sequentially entered, and as shown in FIG. 6 , each evaluation target phoneme P ₁ The phoneme level scores of through P ₅ may be updated.

도 7은 상기 학습 시스템(200)이 수행하는 발음 코칭용 학습 컨텐츠 제공 방법의 일 예를 도시한 흐름도이다.7 is a flowchart illustrating an example of a method of providing learning contents for pronunciation coaching performed by the learning system 200 .

도 7을 참조하면 상기 학습 시스템(200)은 상기 평가 시스템(100)이 생성한 각 음소 별 음소 레벨 점수에 기초하여 하나 혹은 그 이상의 학습 대상 음소를 추출할 수 있다(S310).Referring to FIG. 7 , the learning system 200 may extract one or more phonemes to be learned based on the phoneme level score for each phoneme generated by the evaluation system 100 (S310).

예를 들어 상기 학습 시스템(200)은 가장 낮은 음소 레벨 점수를 가지는 음소를 학습 대상 음소라고 판단할 수 있다. 또는 상기 학습 시스템(200)은 소정의 기준치 이하의 음소 레벨 점수를 가지는 하나 이상의 음소를 학습 대상 음소라고 판단할 수 있다.For example, the learning system 200 may determine a phoneme having the lowest phoneme level score as a learning target phoneme. Alternatively, the learning system 200 may determine one or more phonemes having a phoneme level score equal to or less than a predetermined reference value as a learning target phoneme.

이후 상기 학습 시스템(200)은 상기 학습 대상 음소를 포함하는 단어를 학습용 단어로 선정할 수 있다(S320). 상기 학습 시스템(200)은 예를 들어 상기 DB(300)에 다수의 학습용 후보 단어를 미리 저장할 수 있으며, 상기 학습 시스템(200)은 저장된 학습용 후보 단어들 중 상기 학습 대상 음소를 포함하는 단어를 학습용 단어로 선정할 수 있다. 실시예에 따라서는, 만약 상기 학습 대상 음소가 2 이상인 경우 상기 학습 시스템(200)은 상기 학습 대상 음소를 가능한 많이 포함하는 단어를 학습용 단어로 선정할 수 있다.Thereafter, the learning system 200 may select a word including the phoneme to be learned as a word for learning (S320). The learning system 200 may, for example, store a plurality of candidate words for learning in advance in the DB 300, and the learning system 200 selects a word including the phoneme to be learned from among the stored candidate words for learning. can be chosen by word. Depending on the embodiment, if the number of phonemes to be learned is two or more, the learning system 200 may select a word including as many phonemes to be learned as a learning word.

실시예에 따라 상기 학습 시스템(200)은 상기 학습용 단어를 포함하는 학습용 컨텐츠를 선정할 수도 있다. 예를 들어 상기 학습용 컨텐츠는 상기 학습용 단어를 포함하는 문장 혹은 문단일 수 있으며, 상기 학습용 단어에 상응하는 오디오 또는 비디오 데이터일 수도 있다. 또는 상기 학습용 단어를 포함하는 문장 또는 문단에 상응하는 오디오 데이터 또는 비디오 데이터일 수도 있다.Depending on the embodiment, the learning system 200 may select learning content including the learning word. For example, the contents for learning may be sentences or paragraphs including the words for learning, or may be audio or video data corresponding to the words for learning. Alternatively, it may be audio data or video data corresponding to a sentence or paragraph including the learning word.

상기 학습 시스템(200)에 의해 선정 가능한 모든 학습용 컨텐츠는 미리 생성되어 예를 들어 상기 DB(300)에 미리 저장되어 있을 수 있다. 만약 미리 저장된 컨텐츠 중에서 상기 학습용 단어를 포함하는 컨텐츠가 다수인 경우 상기 학습 시스템(200)은 학습자의 기존 학습 이력을 참고하여 학습용 컨텐츠를 선정할 수 있다.All contents for learning that can be selected by the learning system 200 may be created in advance and stored in the DB 300, for example. If there are many contents including the words for learning among the pre-stored contents, the learning system 200 may select the contents for learning by referring to the learner's previous learning history.

이후 상기 학습 시스템(200)은 상기 학습용 단어 또는 상기 학습용 단어가 포함된 학습용 컨텐츠를 학습자 단말(예를 들어, 20)로 제공할 수 있다(S330). 그러면 상기 학습자 단말(20)은 제공받은 상기 학습용 단어 또는 상기 학습용 단어가 포함된 학습용 컨텐츠에 기초하여 리스닝 테스트 또는 발음 테스트를 진행할 수 있다.Thereafter, the learning system 200 may provide the learning word or the learning content including the learning word to the learner terminal (eg, 20) (S330). Then, the learner terminal 20 may perform a listening test or a pronunciation test based on the provided learning word or learning contents including the learning word.

한편 실시예에 따라서 상기 학습 시스템(20)은 상기 학습 대상 음소에 대한 학습이 더욱 효과적으로 이루어질 수 있도록 상기 학습 대상 음소와 대비될 수 있는 대치 음소에 대한 학습 컨텐츠를 상기 학습자 단말로 더 제공할 수 있다. 각각의 음소에 상응하는 대치 음소는 미리 지정되어 있을 수 있으며, 예를 들어 음소 'r'의 대치 음소는 'l'로 지정되어 있을 수 이고, 음소 'f'의 대치 음소는 'p'로 지정되어 있을 수 있다.Meanwhile, according to an embodiment, the learning system 20 may further provide learning content for a replacement phoneme that can be compared with the target phoneme to the learner terminal so that learning of the target phoneme can be performed more effectively. . The replacement phoneme corresponding to each phoneme may be designated in advance, for example, the replacement phoneme of the phoneme 'r' may be designated as 'l', and the replacement phoneme of the phoneme 'f' may be designated as 'p' may have been

도 7을 계속해서 참조하면, 상기 학습 시스템(20)은 학습 대상 음소와 대치되는 대치 음소를 포함하는 추가 학습용 단어를 선정할 수 있으며(S340), 상기 추가 학습용 단어 또는 상기 추가 학습용 단어가 포함된 추가 학습용 컨텐츠를 상기 학습자 단말로 더 제공할 수 있다(S350).With continued reference to FIG. 7 , the learning system 20 may select a word for additional learning that includes a replacement phoneme that is replaced with a phoneme to be learned (S340), and the word for additional learning or the word for additional learning is included. Additional learning content may be further provided to the learner terminal (S350).

도 8은 본 발명의 일 실시예에 따른 평가 시스템(100)의 개략적인 구성을 도시한 블록도이며, 도 9는 본 발명의 일 실시예에 따른 학습 시스템(200)의 개략적인 구성을 도시한 블록도이다.8 is a block diagram showing a schematic configuration of an evaluation system 100 according to an embodiment of the present invention, and FIG. 9 shows a schematic configuration of a learning system 200 according to an embodiment of the present invention. It is a block diagram.

상기 평가 시스템(100) 및 상기 학습 시스템(200)은 본 발명의 기술적 사상을 구현하기 위해 필요한 하드웨어 리소스(resource) 및/또는 소프트웨어를 구비한 논리적인 구성을 의미할 수 있으며, 반드시 하나의 물리적인 구성요소를 의미하거나 하나의 장치를 의미하는 것은 아니다. 즉, 상기 평가 시스템(100) 및 상기 학습 시스템(200)은 본 발명의 기술적 사상을 구현하기 위해 구비되는 하드웨어 및/또는 소프트웨어의 논리적인 결합을 의미할 수 있으며, 필요한 경우에는 서로 이격된 장치에 설치되어 각각의 기능을 수행함으로써 본 발명의 기술적 사상을 구현하기 위한 논리적인 구성들의 집합으로 구현될 수도 있다. 또한, 상기 평가 시스템(100) 및 상기 학습 시스템(200)은 본 발명의 기술적 사상을 구현하기 위한 각각의 기능 또는 역할별로 별도로 구현되는 구성들의 집합을 의미할 수도 있다. 상기 평가 시스템(100) 및 상기 학습 시스템(200)의 각 구성은 서로 다른 물리적 장치에 위치할 수도 있고, 동일한 물리적 장치에 위치할 수도 있다. 또한, 구현 예에 따라서는 상기 평가 시스템(100) 및 상기 학습 시스템(200)의 구성 요소 각각을 구성하는 소프트웨어 및/또는 하드웨어의 결합 역시 서로 다른 물리적 장치에 위치하고, 서로 다른 물리적 장치에 위치한 구성들이 서로 유기적으로 결합되어 각각의 상기 모듈들을 구현할 수도 있다.The evaluation system 100 and the learning system 200 may refer to a logical configuration having hardware resources and/or software required to implement the technical idea of the present invention, and must be one physical It does not mean a component or a device. That is, the evaluation system 100 and the learning system 200 may refer to a logical combination of hardware and/or software provided to implement the technical idea of the present invention, and if necessary, may be installed in devices spaced apart from each other. It may be implemented as a set of logical components for implementing the technical idea of the present invention by being installed and performing each function. In addition, the evaluation system 100 and the learning system 200 may refer to a set of components implemented separately for each function or role to implement the technical idea of the present invention. Each component of the evaluation system 100 and the learning system 200 may be located in different physical devices or in the same physical device. In addition, depending on the implementation example, the combination of software and/or hardware constituting each of the components of the evaluation system 100 and the learning system 200 is also located in different physical devices, and components located in different physical devices Each of the modules may be organically combined with each other.

또한, 본 명세서에서 모듈이라 함은, 본 발명의 기술적 사상을 수행하기 위한 하드웨어 및 상기 하드웨어를 구동하기 위한 소프트웨어의 기능적, 구조적 결합을 의미할 수 있다. 예컨대, 상기 모듈은 소정의 코드와 상기 소정의 코드가 수행되기 위한 하드웨어 리소스(resource)의 논리적인 단위를 의미할 수 있으며, 반드시 물리적으로 연결된 코드를 의미하거나, 한 종류의 하드웨어를 의미하는 것은 아님은 본 발명의 기술분야의 평균적 전문가에게는 용이하게 추론될 수 있다.Also, in this specification, a module may mean a functional and structural combination of hardware for implementing the technical concept of the present invention and software for driving the hardware. For example, the module may mean a logical unit of a predetermined code and a hardware resource for executing the predetermined code, and does not necessarily mean a physically connected code or one type of hardware. can be easily deduced to an average expert in the art of the present invention.

도 8을 참조하면, 상기 평가 시스템(100)은 획득모듈(110), 및 평가모듈(115)을 포함할 수 있으며, 상기 평가모듈(115)은 단어레벨 평가모듈(120), 음소 판단모듈(130), 음소레벨 평가모듈(140)을 포함할 수 있다. 본 발명의 실시예에 따라서는, 상술한 구성요소들 중 일부 구성요소는 반드시 본 발명의 구현에 필수적으로 필요한 구성요소에 해당하지 않을 수도 있으며, 또한 실시예에 따라 상기 평가 시스템(100)은 이보다 더 많은 구성요소를 포함할 수도 있음은 물론이다. 예를 들어 상기 평가 시스템(100)은 외부 장치와 통신하기 위한 통신모듈(미도시), 상기 평가 시스템(100)의 구성요소 및 리소스를 제어하기 위한 제어모듈(미도시), 각종 정보 및 데이터를 저장하기 위한 저장모듈(미도시) 등을 더 포함할 수 있다.Referring to FIG. 8 , the evaluation system 100 may include an acquisition module 110 and an evaluation module 115, and the evaluation module 115 includes a word level evaluation module 120 and a phoneme determination module ( 130), and a phoneme level evaluation module 140. Depending on the embodiment of the present invention, some of the above-described components may not necessarily correspond to the components essential to the implementation of the present invention, and according to the embodiment, the evaluation system 100 may It goes without saying that more components may be included. For example, the evaluation system 100 includes a communication module (not shown) for communicating with an external device, a control module (not shown) for controlling components and resources of the evaluation system 100, and various information and data. A storage module (not shown) for storage may be further included.

본 발명의 일 실시예에 따르면, 상기 획득모듈(110)은 피평가자가 발화한 복수의 평가 대상 단어 각각의 음성 데이터를 획득할 수 있으며, 상기 단어 레벨 평가모듈(120)은 상기 복수의 평가 대상 단어 각각에 대하여, 상기 평가 대상 단어의 음성 데이터에 기초하여 상기 평가 대상 단어에 상응하는 단어 레벨 점수를 산출할 수 있으며, 상기 음소레벨 평가모듈(140)은 상기 언어의 소리 체계를 구성하는 전체 음소 중에서 적어도 일부인 복수의 평가 대상 음소 각각에 대하여, 상기 평가 대상 음소에 상응하는 음소 레벨 점수를 결정할 수 있다. 특히 상기 음소 레벨 평가모듈(140)은, 상기 평가 대상 음소에 상응하는 음소 레벨 점수를 결정하기 위하여, 상기 피평가자가 발화한 상기 복수의 평가 대상 단어 중에서 상기 평가 대상 음소를 포함하는 단어 각각의 단어 레벨 점수에 기초하여, 상기 평가 대상 음소에 상응하는 음소 레벨 점수를 결정할 수 있다.According to an embodiment of the present invention, the acquisition module 110 may acquire voice data of each of a plurality of evaluation target words uttered by a person to be evaluated, and the word level evaluation module 120 may obtain the plurality of evaluation target words. For each word, a word level score corresponding to the word to be evaluated may be calculated based on the voice data of the word to be evaluated, and the phoneme level evaluation module 140 may calculate a word level score among all phonemes constituting the sound system of the language. A phoneme level score corresponding to the phoneme to be evaluated may be determined for each of at least some of the plurality of phonemes to be evaluated. In particular, in order to determine a phoneme level score corresponding to the phoneme to be evaluated, the phoneme level evaluation module 140 determines the word level of each word including the phoneme to be evaluated among the plurality of words to be evaluated uttered by the subject to be evaluated. Based on the score, a phoneme level score corresponding to the phoneme to be evaluated may be determined.

일 실시예에서, 상기 음소 레벨 평가모듈(140)은 상기 피평가자가 발화한 상기 복수의 평가 대상 단어 중에서 상기 평가 대상 음소를 포함하는 단어 각각의 단어 레벨 점수의 평균 값을 산출하고, 상기 복수의 평가 대상 단어 중에서 상기 평가 대상 음소를 포함하는 단어 각각의 단어 레벨 점수의 평균 값에 기초하여, 상기 평가 대상 음소에 상응하는 음소 레벨 점수를 결정할 수 있다.In an embodiment, the phoneme level evaluation module 140 calculates an average value of word level scores of each word including the phoneme to be evaluated among the plurality of words to be evaluated uttered by the person to be evaluated, and the plurality of evaluation targets. A phoneme level score corresponding to the phoneme to be evaluated may be determined based on an average value of word level scores of each word including the phoneme to be evaluated among target words.

일 실시예에서, 상기 획득모듈은, 상기 피평가자가 발화한 복수의 평가 대상 문장의 음성 데이터를 획득하고, 상기 복수의 평가 대상 문장 각각에 대하여, 상기 평가 대상 문장의 음성 데이터를 단어 별로 분할하며, 상기 복수의 평가 대상 단어는, 상기 복수의 평가 대상 문장에 포함된 각 단어를 포함할 수 있다.In one embodiment, the acquisition module acquires voice data of a plurality of evaluation target sentences uttered by the subject, and divides the voice data of the evaluation target sentence by word for each of the plurality of evaluation target sentences; The plurality of evaluation target words may include each word included in the plurality of evaluation target sentences.

본 발명의 다른 일 실시예에 따르면, 상기 평가모듈(115)은 상기 복수의 평가 대상 단어 각각의 음성 데이터가 획득될 때마다, 상기 평가 시스템이, 상기 평가 대상 단어의 음성 데이터에 기초하여 상기 피평가자의 음소 레벨 점수를 갱신할 수 있다, 앞서 설명한 바와 같이 상기 평가모듈(115)은 단어레벨 평가모듈(120), 음소 판단모듈(130), 음소레벨 평가모듈(140)을 포함할 수 있으며, 상기 단어 레벨 평가모듈(120)은 상기 평가 대상 단어의 음성 데이터에 기초하여, 상기 평가 대상 단어에 상응하는 단어 레벨 점수를 산출하고, 상기 음소 판단모듈(130)은 상기 평가 대상 단어를 구성하는 적어도 하나의 평가 대상 음소를 판단하고, 상기 음소레벨 평가모듈(140)은 상기 평가 대상 음소 각각에 대하여, 기 산출된 상기 피평가자의 상기 평가 대상 음소에 상응하는 음소 레벨 점수에 상기 평가 대상 단어에 상응하는 단어 레벨 점수를 반영하여, 상기 피평가자의 상기 평가 대상 음소에 상응하는 음소 레벨 점수를 갱신할 수 있다.According to another embodiment of the present invention, the evaluation module 115 is configured so that, whenever voice data of each of the plurality of words to be evaluated is obtained, the evaluation system determines the person to be evaluated based on the voice data of the word to be evaluated. As described above, the evaluation module 115 may include a word level evaluation module 120, a phoneme determination module 130, and a phoneme level evaluation module 140. The word level evaluation module 120 calculates a word level score corresponding to the evaluation target word based on the voice data of the evaluation target word, and the phoneme determination module 130 calculates at least one word constituting the evaluation target word. The phoneme level evaluation module 140 determines the phoneme to be evaluated, and the phoneme level evaluation module 140 determines the phoneme level score corresponding to the phoneme to be evaluated for each of the phonemes to be evaluated, and the word corresponding to the word to be evaluated is calculated in advance. A phoneme level score corresponding to the phoneme to be evaluated may be updated by reflecting the level score.

도 9는 본 발명의 일 실시예에 따른 학습 시스템(200)의 구성을 도시한 블록이다.9 is a block diagram showing the configuration of a learning system 200 according to an embodiment of the present invention.

도 9를 참조하면 상기 학습 시스템(200)은 판단모듈(210), 선정모듈(220) 및 제공모듈(230)을 포함할 수 있다. 본 발명의 실시예에 따라서는, 상술한 구성요소들 중 일부 구성요소는 반드시 본 발명의 구현에 필수적으로 필요한 구성요소에 해당하지 않을 수도 있으며, 또한 실시예에 따라 상기 학습 시스템(200)은 이보다 더 많은 구성요소를 포함할 수도 있음은 물론이다. 예를 들어 상기 학습 시스템(200)은 외부 장치와 통신하기 위한 통신모듈(미도시), 상기 장초 추정 시스템(100)의 구성요소 및 리소스를 제어하기 위한 제어모듈(미도시), 각종 데이터 및 정보를 저장할 수 있는 저장모듈(미도시) 등을 더 포함할 수 있다.Referring to FIG. 9 , the learning system 200 may include a judgment module 210 , a selection module 220 and a provision module 230 . Depending on the embodiment of the present invention, some of the above-mentioned components may not necessarily correspond to the components necessary for the implementation of the present invention, and according to the embodiment, the learning system 200 has more It goes without saying that more components may be included. For example, the learning system 200 includes a communication module (not shown) for communicating with an external device, a control module (not shown) for controlling components and resources of the long frame estimation system 100, various data and information It may further include a storage module (not shown) capable of storing .

또한 실시예에 따라 학습 시스템(200)은 상기 평가 구축 시스템(100)과 통합된 형태로 구현될 수 있으며, 이 경우, 상기 학습 시스템(200)은 상술한 상기 평가 시스템(100)에 포함된 구성을 더 포함할 수 있다.In addition, according to embodiments, the learning system 200 may be implemented in an integrated form with the evaluation construction system 100. In this case, the learning system 200 is a component included in the evaluation system 100 described above. may further include.

상기 판단모듈(210)은 상기 평가 시스템(100)에 의해 구축되는 각 음소 별 음소 레벨 점수에 기초하여 학습 대상 음소를 추출할 수 있으며, 상기 선정모듈(220)은 상기 학습 대상 음소를 포함하는 단어를 상기 학습용 단어로 선정할 수 있다. 상기 제공모듈(230)은 상기 학습용 단어 또는 상기 학습용 단어가 포함된 학습용 컨텐츠를 학습자 단말(예를 들어, 20)로 제공할 수 있다.The determination module 210 may extract a phoneme to be learned based on the phoneme level score for each phoneme established by the evaluation system 100, and the selection module 220 may extract a word including the phoneme to be learned. can be selected as the learning word. The provision module 230 may provide the learning word or the learning content including the learning word to a learner terminal (eg, 20).

일 실시예에서, 상기 선정모듈(220)은 상기 학습 대상 음소와 대치되는 대치 음소를 포함하는 단어를 추가 학습용 단어로 더 선정하고, 상기 제공모듈(230)은 상기 추가 학습용 단어 또는 상기 추가 학습용 단어가 포함된 추가 학습용 컨텐츠를 상기 학습자 단말(20)로 더 제공할 수 있다.In an embodiment, the selection module 220 further selects a word including a replacement phoneme that is replaced with the target phoneme as a word for additional learning, and the providing module 230 further selects the word for additional learning or the word for additional learning. It is possible to further provide additional content for learning, which includes, to the learner terminal 20.

한편, 구현 예에 따라서, 상기 평가 시스템(100) 및 상기 학습 시스템(200)은 프로세서 및 상기 프로세서에 의해 실행되는 프로그램을 저장하는 메모리를 포함할 수 있다. 상기 프로세서는 싱글 코어 CPU혹은 멀티 코어 CPU를 포함할 수 있다. 메모리는 고속 랜덤 액세스 메모리를 포함할 수 있고 하나 이상의 자기 디스크 저장 장치, 플래시 메모리 장치, 또는 기타 비휘발성 고체상태 메모리 장치와 같은 비휘발성 메모리를 포함할 수도 있다. 프로세서 및 기타 구성 요소에 의한 메모리로의 액세스는 메모리 컨트롤러에 의해 제어될 수 있다. Meanwhile, according to an implementation example, the evaluation system 100 and the learning system 200 may include a processor and a memory storing a program executed by the processor. The processor may include a single-core CPU or a multi-core CPU. The memory may include high-speed random access memory and may also include non-volatile memory such as one or more magnetic disk storage devices, flash memory devices, or other non-volatile solid-state memory devices. Access to memory by processors and other components may be controlled by a memory controller.

한편, 본 발명의 실시예에 따른 방법은 컴퓨터가 읽을 수 있는 프로그램 명령 형태로 구현되어 컴퓨터로 읽을 수 있는 기록 매체에 저장될 수 있으며, 본 발명의 실시예에 따른 제어 프로그램 및 대상 프로그램도 컴퓨터로 판독 가능한 기록 매체에 저장될 수 있다. 컴퓨터가 읽을 수 있는 기록 매체는 컴퓨터 시스템에 의하여 읽혀질 수 있는 데이터가 저장되는 모든 종류의 기록 장치를 포함한다.On the other hand, the method according to the embodiment of the present invention may be implemented in the form of computer-readable program instructions and stored in a computer-readable recording medium, and the control program and target program according to the embodiment of the present invention are also computer-readable. It can be stored on a readable recording medium. The computer-readable recording medium includes all types of recording devices in which data that can be read by a computer system is stored.

기록 매체에 기록되는 프로그램 명령은 본 발명을 위하여 특별히 설계되고 구성된 것들이거나 소프트웨어 분야 당업자에게 공지되어 사용 가능한 것일 수도 있다.Program commands recorded on the recording medium may be specially designed and configured for the present invention, or may be known and usable to those skilled in the software field.

컴퓨터로 읽을 수 있는 기록 매체의 예에는 하드 디스크, 플로피 디스크 및 자기 테이프와 같은 자기 매체(magnetic media), CD-ROM, DVD와 같은 광기록 매체(optical media), 플롭티컬 디스크(floptical disk)와 같은 자기-광 매체(magneto-optical media) 및 롬(ROM), 램(RAM), 플래시 메모리 등과 같은 프로그램 명령을 저장하고 수행하도록 특별히 구성된 하드웨어 장치가 포함된다. 또한 컴퓨터가 읽을 수 있는 기록매체는 네트워크로 연결된 컴퓨터 시스템에 분산되어, 분산방식으로 컴퓨터가 읽을 수 있는 코드가 저장되고 실행될 수 있다.Examples of computer-readable recording media include magnetic media such as hard disks, floppy disks and magnetic tapes, optical media such as CD-ROMs and DVDs, floptical disks and hardware devices specially configured to store and execute program instructions, such as magneto-optical media and ROM, RAM, flash memory, and the like. In addition, the computer-readable recording medium is distributed in computer systems connected through a network, so that computer-readable codes can be stored and executed in a distributed manner.

프로그램 명령의 예에는 컴파일러에 의해 만들어지는 것과 같은 기계어 코드뿐만 아니라 인터프리터 등을 사용해서 전자적으로 정보를 처리하는 장치, 예를 들어, 컴퓨터에 의해서 실행될 수 있는 고급 언어 코드를 포함한다.Examples of program instructions include high-level language codes that can be executed by a device that electronically processes information using an interpreter, for example, a computer, as well as machine language codes generated by a compiler.

상술한 하드웨어 장치는 본 발명의 동작을 수행하기 위해 하나 이상의 소프트웨어 모듈로서 작동하도록 구성될 수 있으며, 그 역도 마찬가지이다.The hardware device described above may be configured to operate as one or more software modules to perform the operations of the present invention, and vice versa.

전술한 본 발명의 설명은 예시를 위한 것이며, 본 발명이 속하는 기술분야의 통상의 지식을 가진 자는 본 발명의 기술적 사상이나 필수적인 특징을 변경하지 않고서 다른 구체적인 형태로 쉽게 변형이 가능하다는 것을 이해할 수 있을 것이다. 그러므로 이상에서 기술한 실시 예들은 모든 면에서 예시적인 것이며 한정적이 아닌 것으로 이해해야만 한다. 예를 들어, 단일형으로 설명되어 있는 각 구성 요소는 분산되어 실시될 수도 있으며, 마찬가지로 분산된 것으로 설명되어 있는 구성요소들도 결합된 형태로 실시될 수 있다.The above description of the present invention is for illustrative purposes, and those skilled in the art can understand that it can be easily modified into other specific forms without changing the technical spirit or essential features of the present invention. will be. Therefore, the embodiments described above should be understood as illustrative in all respects and not limiting. For example, each component described as a single type may be implemented in a distributed manner, and similarly, components described as distributed may be implemented in a combined form.

본 발명의 범위는 상기 상세한 설명보다는 후술하는 특허청구범위에 의하여 나타나며, 특허청구범위의 의미 및 범위 그리고 그 균등 개념으로부터 도출되는 모든 변경 또는 변형된 형태가 본 발명의 범위에 포함되는 것으로 해석되어야 한다.The scope of the present invention is indicated by the following claims rather than the detailed description above, and all changes or modifications derived from the meaning and scope of the claims and equivalent concepts thereof should be construed as being included in the scope of the present invention. .

Claims

A phoneme-unit pronunciation accuracy evaluation method for calculating a phoneme level score for each phoneme constituting the sound system of a predetermined language,
acquiring voice data of each of a plurality of evaluation target words uttered by an appraiser, by an evaluation system;
The evaluation system, for each of the plurality of evaluation target words,
Calculating a word level score corresponding to the evaluation target word based on voice data of the evaluation target word; and
determining, by the evaluation system, a phoneme level score corresponding to the phoneme to be evaluated, for each of a plurality of phonemes to be evaluated, which are at least a part of all phonemes constituting the sound system of the language;
The step of determining a phoneme level score corresponding to the phoneme to be evaluated,
and determining a phoneme level score corresponding to the phoneme to be evaluated based on a word level score of each word including the phoneme to be evaluated among the plurality of words to be evaluated. Assessment Methods.

According to claim 1,
The step of determining a phoneme level score corresponding to the phoneme to be evaluated based on a word level score of each word including the phoneme to be evaluated among the plurality of words to be evaluated uttered by the subject,
calculating an average value of word level scores of each word including the phoneme to be evaluated among the plurality of words to be evaluated; and
and determining a phoneme level score corresponding to the phoneme to be evaluated based on an average value of word level scores of each word including the phoneme to be evaluated among the plurality of words to be evaluated. .

According to claim 1,
Calculating a word level score corresponding to the word to be evaluated,
Calculating a pronunciation score representing accuracy of pronunciation of the word to be evaluated uttered by the person to be evaluated;
calculating an intonation score indicating accuracy of intonation of the word to be evaluated uttered by the subject of evaluation;
calculating a volume score representing accuracy of the volume of the evaluation target word uttered by the subject;
Calculating a length score representing the accuracy of the long and short length of the word to be evaluated uttered by the subject of evaluation; and
The method of evaluating pronunciation accuracy in units of phonemes, comprising at least a part of calculating a stress score representing the accuracy of the stress of the word to be evaluated uttered by the person to be evaluated.

According to claim 1,
Acquiring voice data of each of a plurality of words to be evaluated uttered by the person to be evaluated includes:
obtaining voice data of a plurality of evaluation target sentences uttered by the subject of evaluation; and
For each of the plurality of evaluation target sentences, dividing the audio data of the evaluation target sentence by word,
The plurality of evaluation target words include each word included in the plurality of evaluation target sentences.

A phoneme-unit pronunciation accuracy evaluation method for calculating a phoneme level score for each phoneme constituting the sound system of a predetermined language,
acquiring voice data of each of a plurality of evaluation target words uttered by an appraiser, by an evaluation system; and
Whenever voice data of each of the plurality of words to be evaluated is obtained, the evaluation system updating a phoneme level score of the subject based on the voice data of the word to be evaluated;
Updating the phoneme level score of the subject based on the voice data of the word to be evaluated,
Calculating a word level score corresponding to the evaluation target word based on voice data of the evaluation target word;
determining at least one evaluation target phoneme constituting the evaluation target word; and
For each of the phonemes to be evaluated,
and updating a phoneme level score corresponding to the phoneme to be evaluated by reflecting a word level score corresponding to the word to be evaluated in the pre-calculated phoneme level score corresponding to the phoneme to be evaluated of the evaluated person. A method for evaluating phoneme-unit pronunciation accuracy.

extracting, by a learning system, a phoneme to be learned based on a phoneme level score for each phoneme constructed by the method according to claim 1 or 5;
selecting, by the learning system, a word including the phoneme to be learned as the learning word; and
The learning content providing method comprising providing, by the learning system, the learning word or the learning content including the learning word to a learner terminal.

According to claim 6,
selecting, by the learning system, a word including a replacement phoneme that is replaced with the learning target phoneme as a word for additional learning; and
The learning content providing method further comprising providing, by the learning system, the additional learning words or additional learning contents including the additional learning words to the learner terminal.

A computer program installed in a data processing device and recorded on a medium for performing the method according to any one of claims 1 to 5.

A computer readable recording medium on which a computer program for performing the method according to any one of claims 1 to 5 is recorded.

As a computing system,
processor; and a memory for storing a computer program;
The computer program, when executed by the processor, causes the computing system to perform the method according to any one of claims 1 to 5.

A phoneme-unit pronunciation accuracy evaluation system for calculating a phoneme level score for each phoneme constituting the sound system of a predetermined language,
an acquisition module for obtaining voice data of each of a plurality of evaluation target words uttered by an appraiser;
a word level evaluation module for calculating a word level score corresponding to the evaluation target word based on voice data of the evaluation target word for each of the plurality of evaluation target words; and
A phoneme level evaluation module for determining a phoneme level score corresponding to the phoneme to be evaluated for each of a plurality of phonemes to be evaluated, which are at least a part of all phonemes constituting the sound system of the language,
The phoneme level evaluation module determines a phoneme level score corresponding to the phoneme to be evaluated,
The phoneme-unit pronunciation accuracy evaluation system for determining a phoneme level score corresponding to the phoneme to be evaluated based on a word level score of each word including the phoneme to be evaluated among the plurality of words to be evaluated.

According to claim 11,
The phoneme level evaluation module,
Calculating an average value of word level scores of each word including the phoneme to be evaluated among the plurality of words to be evaluated uttered by the subject;
A phoneme-unit pronunciation accuracy evaluation system for determining a phoneme level score corresponding to the phoneme to be evaluated based on an average value of word level scores of each word including the phoneme to be evaluated among the plurality of words to be evaluated.

According to claim 1,
The word level evaluation module,
A pronunciation score indicating the accuracy of pronunciation of the word to be evaluated uttered by the subject, an intonation score indicating accuracy of intonation of the word to be evaluated uttered by the subject, and accuracy of volume of the word to be evaluated uttered by the subject Phoneme unit pronunciation that calculates at least a part of a volume score representing a volume score, a length score representing accuracy of length of the word to be evaluated uttered by the subject, and a stress score representing accuracy of stress of the word to be evaluated uttered by the subject. Accuracy evaluation system.

According to claim 11,
The acquisition module,
Acquiring audio data of a plurality of evaluation target sentences uttered by the subject, and dividing the audio data of the evaluation target sentence by word for each of the plurality of evaluation target sentences;
The plurality of evaluation target words include phoneme unit pronunciation accuracy evaluation system including each word included in the plurality of evaluation target sentences.

A phoneme-unit pronunciation accuracy evaluation system for calculating a phoneme level score for each phoneme constituting the sound system of a predetermined language,
an acquisition module for obtaining voice data of each of a plurality of evaluation target words uttered by an appraiser; and
Whenever voice data of each of the plurality of words to be evaluated is acquired, the evaluation system includes an evaluation module for updating a phoneme level score of the subject of evaluation based on the voice data of the word to be evaluated;
The evaluation module,
a word level evaluation module for calculating a word level score corresponding to the evaluation target word based on the voice data of the evaluation target word;
a phoneme determination module for determining at least one evaluation target phoneme constituting the evaluation target word; and
For each of the phonemes to be evaluated,
Phoneme level evaluation to update the phoneme level score corresponding to the phoneme to be evaluated by reflecting the word level score corresponding to the word to be evaluated to the pre-calculated phoneme level score corresponding to the phoneme to be evaluated A phoneme unit pronunciation accuracy evaluation system including a module.

a decision module for extracting a phoneme to be learned based on the phoneme level score for each phoneme constructed by the method according to claim 1 or 5;
a selection module for selecting a word including the phoneme to be learned as the learning word; and
A learning content providing system comprising a provision module for providing the learning word or the learning content including the learning word to a learner terminal.

According to claim 6,
The selection module,
Further selecting a word including a replacement phoneme that is replaced with the learning target phoneme as a word for additional learning,
The providing module,
A learning content providing system for further providing the additional learning words or additional learning contents including the additional learning words to the learner terminal.