JP2021157073A

JP2021157073A - Language learning system, language learning supporting method, and program

Info

Publication number: JP2021157073A
Application number: JP2020057824A
Authority: JP
Inventors: 核人川上; Kakuto Kawakami
Original assignee: Individual
Current assignee: Individual
Priority date: 2020-03-27
Filing date: 2020-03-27
Publication date: 2021-10-07

Abstract

To provide a language learning system and the like with which learners can do their learning without feeling embarrassment towards people nearby.SOLUTION: A language learning system 1 of the invention includes a conversation playing means 27 which plays information of a series of conversations including first audio information which is audio information of a first speaker and second audio information which is audio information of a second speaker in turn, with a silent period after either of the first audio information or the second audio information, so that a learner can do vocal exercises copying an immediately preceding line in the series of natural conversations. In a case where only the learner can hear the conversation audio, it sounds to people nearby as if the learner is making natural conversations with someone on the other side of the phone, therefore the user can be prevented from feeling embarrassment towards people nearby.SELECTED DRAWING: Figure 1

Description

本発明は、言語学習システム、言語学習支援方法及びプログラムに関し、特にリスニング及びスピーキング学習に適した言語学習システム等に関する。 The present invention relates to a language learning system, a language learning support method and a program, and particularly to a language learning system suitable for listening and speaking learning.

言語学習は、リーディング・ライティング・リスニング・スピーキングの４つに大別される。特に近年は、大学入学共通テストにおいて英語のスピーキング試験の導入が決まるなど、スピーキングへの意識が高まりつつあり、スピーキング学習に適した言語学習システムが求められている。 Language learning is broadly divided into four categories: reading, writing, listening, and speaking. In particular, in recent years, awareness of speaking has been increasing, such as the introduction of an English speaking test in the common test for university admissions, and a language learning system suitable for speaking learning is required.

特許文献１には、音声認識システムを用いて受講者の発音が正しいかどうかを判定し、受講者が正しく発音するまで反復練習させるスピーキング学習機能を有する通信教育システムが開示されている。 Patent Document 1 discloses a correspondence education system having a speaking learning function that determines whether or not a student's pronunciation is correct using a voice recognition system and repeatedly practices until the student pronounces correctly.

特許文献１に記載の通信教育システムは、通信ネットワークを介して相互に接続されている受講者端末と教育システムを備えていて、下記の流れで実施される。まず、教育システムから受講者端末に音声データＡが送信され、受講者端末が受信した音声データＡを再生する。そして、再生された音声データＡを聞いた受講者は、それと同じ音を発声する。受講者が発声した音声は、マイクによって音声データＡ‘に変換されて教育システムに送信される。教育システムは音声認識システムを用いて音声データＡ’を分析した後、受講者の発音が正しいかどうかの判定を行う。判定がＮＧであれば、受講者は再度、発声する。教育システムは、判定がＯＫであれば、次の問題である音声データＢを送信する。このように、特許文献１に記載の通信教育システムによれば、受講者は見本となる音声の後に続いて発声し、正しい発音を習得するまで反復練習を行うことができる。 The correspondence education system described in Patent Document 1 includes a student terminal and an education system that are interconnected via a communication network, and is implemented in the following flow. First, the voice data A is transmitted from the education system to the student terminal, and the voice data A received by the student terminal is reproduced. Then, the student who hears the reproduced voice data A utters the same sound. The voice uttered by the student is converted into voice data A'by the microphone and transmitted to the education system. The education system analyzes the voice data A'using the voice recognition system, and then determines whether or not the student's pronunciation is correct. If the judgment is NG, the student speaks again. If the judgment is OK, the education system transmits the voice data B, which is the next problem. As described above, according to the correspondence education system described in Patent Document 1, the student can utter after the sample voice and practice repeatedly until he / she learns the correct pronunciation.

特開２００１−２６５２０７号公報Japanese Unexamined Patent Publication No. 2001-265207

しかしながら、例えばカフェのような周囲に人がいる状況において、従来の通信教育システムによりスピーキング練習をしていると非常に目立ってしまう。学習者がヘッドホンやイヤホン等を用いて見本音声を聞くことで、見本音声が周囲の人に聞こえない状況にしたとしても、単語や何の脈絡もない一文を発声することで、周囲の人から学習者が言語学習をしていることが知られてしまい、羞恥を感じる学習者もいるという問題がある。 However, in a situation where there are people around, such as a cafe, speaking practice using a conventional distance learning system is very noticeable. Even if the learner listens to the sample voice using headphones or earphones so that the sample voice cannot be heard by the surrounding people, the learner can utter a word or a sentence without any connection from the surrounding people. There is a problem that some learners feel embarrassed because it is known that the learners are learning a language.

そこで、本発明は、学習者が周囲の人に対して羞恥を感じることなく学習できる言語学習システム等を提供することを目的とする。 Therefore, an object of the present invention is to provide a language learning system or the like in which learners can learn without feeling shame toward the people around them.

本発明は、学習者が情報端末からアクセスし、見本音声に続いて前記学習者が発声練習をするための言語学習システムであって、
第１話者の音声情報である第１音声情報及び第２話者の音声情報である第２音声情報を順に含む一連の会話情報を、前記第１音声情報又は第２音声情報のいずれかの後に無音時間を有しながら再生する会話再生手段を具えることを特徴とする言語学習システム、
又は、
学習者が情報端末からアクセスし、見本音声に続いて前記学習者が発声練習をするための言語学習システムを用いた言語学習支援方法であって、
前記言語学習システムは、
第１話者の音声情報である第１音声情報及び第２話者の音声情報である第２音声情報を順に含む一連の会話情報を、前記第１音声情報又は第２音声情報のいずれかの後に無音時間を有しながら再生する会話再生手段を具えることを特徴とする言語学習支援方法、
又は、
コンピュータに前記言語学習支援方法を実行させるためのプログラムによって前記課題を解決した。 The present invention is a language learning system for a learner to access from an information terminal and to practice vocalization by the learner following a sample voice.
A series of conversation information including the first voice information which is the voice information of the first speaker and the second voice information which is the voice information of the second speaker in order is either the first voice information or the second voice information. A language learning system characterized by having a conversation playback means that plays back while having a silent time later,
Or
It is a language learning support method using a language learning system for a learner to access from an information terminal and to practice vocalization by the learner following a sample voice.
The language learning system
A series of conversation information including the first voice information which is the voice information of the first speaker and the second voice information which is the voice information of the second speaker in order is either the first voice information or the second voice information. A language learning support method characterized by having a conversation playback means that plays back while having a silent time later,
Or
The problem was solved by a program for causing a computer to execute the language learning support method.

本発明の言語学習システム、言語学習支援方法又はプログラムによれば、以下の効果を奏することができる。複数人による会話の音声のうち第１話者又は第２話者のセリフの再生後に一定の無音時間を設けることにより、学習者は自然な一連の会話の中で、直前のセリフを真似て発声練習をすることができる。そのため、携帯電話、ヘッドホン又はイヤホン等を用いて学習者のみが会話音声を聞くことができる状況であれば、周囲の人にとっては、学習者が電話の相手と自然な会話を行っているように聞こえる。よって、学習者は人目を気にすることなく、むしろ堂々と言語学習を行うことができる。また、会話再生手段により再生される一連の会話情報を電話応対での会話情報とすることにより、周囲の人にとっては、まるで電話をしているかのように見えるため、学習者は、周囲の人に対して羞恥を感じることが少ない。 According to the language learning system, language learning support method or program of the present invention, the following effects can be achieved. By providing a certain period of silence after the first or second speaker's dialogue is played out of the voice of a conversation by multiple people, the learner utters by imitating the previous dialogue in a natural series of conversations. You can practice. Therefore, if only the learner can hear the conversation voice using a mobile phone, headphones, earphones, etc., it seems to the surrounding people that the learner is having a natural conversation with the other party. hear. Therefore, the learner can learn the language in a dignified manner without worrying about the eyes. In addition, by using a series of conversation information reproduced by the conversation reproduction means as conversation information for answering a telephone, the learner looks as if he / she is making a telephone call to the surrounding people. I don't feel ashamed of it.

さらに、特定音声検出手段が、例えば、「Ｓｏｒｒｙ」又は「Ｐａｒｄｏｎ」のような第一種特定音声を検出した場合に、直前のセリフに戻って再生される構成とすれば、自然な会話の流れで、学習者が再度聞きたいセリフに戻って聞くことが可能になる。 Further, if the specific voice detecting means detects, for example, a first-class specific voice such as "Sorry" or "Pardon", it is configured to return to the immediately preceding line and be played back, so that a natural conversation flow is achieved. Then, the learner can return to the line he / she wants to hear again and listen to it.

また、特定音声検出手段が、例えば、「Ｓｐｅａｋｍｏｒｅｓｌｏｗｌｙ」のような第二種特定音声を検出した場合に、直前のセリフの再生速度を落として再生する構成とすれば、自然な会話の流れで、学習者が再度聞きたいセリフを確実に聞くことができる。 Further, if the specific voice detecting means detects, for example, a second type specific voice such as "Speak more slowy", the playback speed of the immediately preceding dialogue is reduced and the specific voice is played back, so that a natural conversation flow occurs. Then, the learner can surely hear the line that he / she wants to hear again.

また、特定音声検出手段が、例えば、「Ｈｏｌｄｏｎ」や「Ｊｕｓｔａｓｅｃｏｎｄ」等の第三種特定音声を検出した場合に、会話再生手段による会話情報の再生を停止し、「Ｔｈａｎｋｙｏｕｆｏｒｈｏｌｄｉｎｇ」等の第四種特定音声が検出された場合に、会話再生手段による会話情報の再生を再開する構成とすれば、学習者が任意に学習を中断させることができる。 Further, when the specific voice detecting means detects, for example, a third type specific voice such as "Hold on" or "Just a second", the playback of the conversation information by the conversation playing means is stopped, and "Thank you for holding" is stopped. If the reproduction of the conversation information by the conversation reproduction means is resumed when the type 4 specific voice such as "" is detected, the learner can arbitrarily interrupt the learning.

また、無音時間をどの話者のセリフの再生後にするか設定できる構成とすれば、学習者が発声練習をする話者を選択することが可能になる。 In addition, if the silence time can be set after the dialogue of which speaker is played, the learner can select the speaker to practice vocalization.

さらに、学習者の情報端末に、会話再生手段の起動前には、第１話者又は第２話者の情報と通話開始ボタンを表示させ、会話再生手段の起動中には、第１話者又は第２話者の情報と通話終了ボタンを表示させる構成とすれば、学習者が学習を開始するときに電話を掛け、学習中は電話中であることを装うことができるので、学習者は一層人目を気にする必要がない。 Further, the learner's information terminal is displayed with the information of the first speaker or the second speaker and the call start button before the conversation reproduction means is activated, and the first speaker is displayed during the activation of the conversation reproduction means. Alternatively, if the information of the second speaker and the call end button are displayed, the learner can make a call when starting learning and pretend to be on the phone during learning, so that the learner can make a call. You don't have to worry about the eyes.

また、学習者端末が無音時間設定手段を有する構成とすれば、学習者が各自の話し方に合わせて自由に無音時間の長さを設定することが可能になるため、周囲の人に対しては、さらに自然な会話のように聞こえる。 In addition, if the learner terminal is configured to have a silence time setting means, the learner can freely set the length of the silence time according to his / her own speaking style. Sounds like a more natural conversation.

本発明の実施形態の言語学習システムの構成図。The block diagram of the language learning system of embodiment of this invention. 図１の言語学習システムにおけるサーバ及び学習者端末からから行う処理を示したフローチャート。The flowchart which showed the process performed from the server and the learner terminal in the language learning system of FIG. 図１の言語学習システムにおける学習者端末の画面。The screen of the learner terminal in the language learning system of FIG. 図１の言語学習システムにおける管理者端末の画面。The screen of the administrator terminal in the language learning system of FIG.

以下、本発明の実施形態を図１〜４を参照して説明する。但し、本発明はこの実施形態に限定されるものではない。 Hereinafter, embodiments of the present invention will be described with reference to FIGS. 1 to 4. However, the present invention is not limited to this embodiment.

図１には、本発明の一実施形態である言語学習システム１が示されている。言語学習システム１は、サーバ３と、学習者の情報端末５（以下、「学習者端末５」という。）と、管理者端末７で構成することができる。サーバ３と学習者端末５と管理者端末７は、インターネットや専用回線等の通信回線９を介して相互に通信可能である。 FIG. 1 shows a language learning system 1 which is an embodiment of the present invention. The language learning system 1 can be composed of a server 3, a learner's information terminal 5 (hereinafter, referred to as “learner terminal 5”), and an administrator terminal 7. The server 3, the learner terminal 5, and the administrator terminal 7 can communicate with each other via a communication line 9 such as the Internet or a dedicated line.

サーバ３は、第１話者と第２話者を含む複数人による会話であって、第１話者の音声情報である第１音声情報及び第２話者の音声情報である第２音声情報とを順に含む一連の会話情報を学習者端末５に送信する会話送信手段１１を具えている。また、一連の会話情報に対応するテキスト情報を学習者端末５に送信するテキスト送信手段１３と、学習者端末５から学習時間及び達成率を受信する学習結果受信手段１５と、学習者端末５から学習者音声情報を受信する学習者音声受信手段１７と、学習者音声情報の発音正確度を判定する発音正確度判定手段１９と、発音正確度を学習者端末５に送信する発音正確度送信手段２１と、第１話者又は第２話者の情報を学習者端末５に送信する話者情報送信手段２３を具えることができる。なお、一連の会話情報は、電話応対での会話情報であることが好ましい。すなわち、「Ｈｅｌｌｏ」や「Ｈｉ」等の電話の掛け方や受け方の決まり文句から始まり、「Ｔｈａｎｋｙｏｕｆｏｒｃａｌｌｉｎｇ」や「Ｈａｖｅａｎｉｃｅｄａｙ」等の電話の切り方の決まり文句で終わる会話情報とするのがよい。また、一連の会話情報は、第１話者と第２話者の１対１の会話に限られず、２人以上の話者による会話であればよい。 The server 3 is a conversation between a plurality of people including the first speaker and the second speaker, and is the first voice information which is the voice information of the first speaker and the second voice information which is the voice information of the second speaker. It is provided with a conversation transmission means 11 for transmitting a series of conversation information including the above to the learner terminal 5. Further, the text transmitting means 13 for transmitting the text information corresponding to the series of conversation information to the learner terminal 5, the learning result receiving means 15 for receiving the learning time and the achievement rate from the learner terminal 5, and the learner terminal 5 The learner voice receiving means 17 for receiving the learner voice information, the pronunciation accuracy determining means 19 for determining the pronunciation accuracy of the learner voice information, and the pronunciation accuracy transmitting means for transmitting the pronunciation accuracy to the learner terminal 5. The 21 and the speaker information transmitting means 23 for transmitting the information of the first speaker or the second speaker to the learner terminal 5 can be provided. In addition, it is preferable that the series of conversation information is the conversation information in the telephone response. That is, conversation information that begins with a cliché of how to make or receive a call such as "Hello" or "Hi" and ends with a cliché of how to hang up a call such as "Thank you for calling" or "Have a nice day". It is good to do. Further, the series of conversation information is not limited to the one-to-one conversation between the first speaker and the second speaker, and may be a conversation between two or more speakers.

学習者端末５は、サーバ３から一連の会話情報を受信する会話受信手段２５と、一連の会話情報を第１音声情報又は第２音声情報のいずれかの後に無音時間を有しながら再生する会話再生手段２７を具えている。また、無音時間の長さを設定する無音時間設定手段２９と、学習者による特定の音声を検出する特定音声検出手段３１と、リピート再生手段３２、スロー再生手段３３、一時停止手段３４と、第１音声情報と第２音声情報のいずれの再生後に無音時間を設けるかを設定する音声選択手段３５と、サーバ３からテキスト情報を受信するテキスト受信手段３７と、テキスト情報を表示するテキスト表示手段３９と、会話再生手段２７による会話情報の再生開始から再生終了までの時間を学習時間として記録する学習時間記録手段４１と、学習時間に基づいて学習の達成率を決定する達成率決定手段４３と、学習時間及び達成率を出力する学習結果出力手段４５と、学習時間及び達成率をサーバ３に送信する学習結果送信手段４７と、無音時間の間に学習者が発した音声を学習者音声情報として録音する録音手段４９と、学習者音声情報をサーバ３に送信する学習者音声送信手段５１と、サーバ３から発音正確度を受信する発音正確度受信手段５３と、発音正確度を出力する発音正確度出力手段５５と、サーバ３から第１話者又は第２話者の情報を受信する話者情報受信手段５７と、第１話者又は第２話者の情報等を表示する画面表示手段５９を具えることができる。 The learner terminal 5 has a conversation receiving means 25 that receives a series of conversation information from the server 3, and a conversation that reproduces the series of conversation information after either the first voice information or the second voice information while having a silent time. It is equipped with a reproduction means 27. Further, the silent time setting means 29 for setting the length of the silent time, the specific voice detecting means 31 for detecting a specific voice by the learner, the repeat playback means 32, the slow playback means 33, the pause means 34, and the like. A voice selection means 35 for setting which of the first voice information and the second voice information is played back to provide a silent time, a text receiving means 37 for receiving text information from the server 3, and a text displaying means 39 for displaying the text information. A learning time recording means 41 that records the time from the start of playback of conversation information to the end of playback by the conversation playback means 27 as a learning time, and an achievement rate determining means 43 that determines the learning achievement rate based on the learning time. The learning result output means 45 that outputs the learning time and the achievement rate, the learning result transmitting means 47 that transmits the learning time and the achievement rate to the server 3, and the voice emitted by the learner during the silent time are used as the learner voice information. The recording means 49 for recording, the learner voice transmitting means 51 for transmitting the learner voice information to the server 3, the pronunciation accuracy receiving means 53 for receiving the pronunciation accuracy from the server 3, and the pronunciation accuracy for outputting the pronunciation accuracy. The output means 55, the speaker information receiving means 57 that receives the information of the first speaker or the second speaker from the server 3, and the screen display means 59 that displays the information of the first speaker or the second speaker. Can be equipped.

ここで、学習者端末５は、スマートフォンの他、パーソナルコンピュータ、タブレット型端末又は携帯電話型端末であってもよい。すなわち、学習者端末５として専用の端末を用意する必要はなく、例えば、それぞれのＩＤ又はメールアドレスとパスワードでアプリケーション又はクラウドシステムにログインすることによって、言語学習システム１にアクセスすることができるようにすればよい。 Here, the learner terminal 5 may be a personal computer, a tablet type terminal, or a mobile phone type terminal in addition to the smartphone. That is, it is not necessary to prepare a dedicated terminal as the learner terminal 5, and the language learning system 1 can be accessed by logging in to the application or the cloud system with each ID or email address and password, for example. do it.

特定音声検出手段３１は、会話再生手段２７による会話情報の再生中又は無音時間の間に、学習者の発した音声の中から、特定音声を検出する。特定音声には、「Ｓｏｒｒｙ」や「Ｐａｒｄｏｎ」のような、相手の会話を聞き返す音声である第一種特定音声、「Ｓｐｅａｋｍｏｒｅｓｌｏｗｌｙ」のような、相手に速度の遅い発声を促す第二種特定音声、「Ｈｏｌｄｏｎ」や「Ｊｕｓｔａｓｅｃｏｎｄ」のような、会話を中断させる第三種特定音声、及び「Ｔｈａｎｋｙｏｕｆｏｒｈｏｌｄｉｎｇ」のような、会話を再開させる第四種特定音声が含まれる。 The specific voice detecting means 31 detects the specific voice from the voice uttered by the learner during the reproduction of the conversation information by the conversation reproducing means 27 or during the silent time. Specific voices include first-class specific voices such as "Sorry" and "Pardon" that listen back to the other party's conversation, and second-class specific voices that encourage the other party to speak slowly, such as "Speak more slowy". Includes specific voices, type 3 specific voices that interrupt the conversation, such as "Hold on" and "Just a second", and type 4 specific voices that resume the conversation, such as "Thank you for holding". ..

特定音声検出手段３１が第一種特定音声を検出した場合、リピート再生手段３２が起動し、第一種特定音声を検出した直前の第１音声情報又は／及び第２音声情報を再生する。本構成により、自然な会話の流れで、学習者が再度聞きたいセリフに戻って聞くことが可能になる。 When the specific voice detecting means 31 detects the first-class specific voice, the repeat playback means 32 is activated to reproduce the first voice information and / and the second voice information immediately before the first-class specific voice is detected. With this configuration, the learner can return to the dialogue he / she wants to hear again in a natural conversation flow.

また、特定音声検出手段３１が第二種特定音声を検出した場合、スロー再生手段３３が起動し、第二種特定音声を検出した直前の第１音声情報又は／及び第２音声情報の再生速度を落として再生する。本構成とすることにより、自然な会話の流れで、学習者が再度聞きたいセリフを確実に聞くことができる。 Further, when the specific voice detecting means 31 detects the second type specific voice, the slow playback means 33 is activated, and the reproduction speed of the first voice information and / and the second voice information immediately before the detection of the second type specific voice is performed. Drop and play. With this configuration, the learner can surely hear the lines he / she wants to hear again in a natural conversation flow.

また、特定音声検出手段３１が第三種特定音声を検出した場合、一次停止手段３４が起動し、会話再生手段２７による会話情報の再生を停止し、第四種特定音声が検出された場合に、会話再生手段２７による会話情報の再生を再開する。本構成とすれば、学習者が任意に学習を中断させることができる。 Further, when the specific voice detecting means 31 detects the third type specific voice, the primary stop means 34 is activated, the reproduction of the conversation information by the conversation reproduction means 27 is stopped, and the fourth type specific voice is detected. , The reproduction of the conversation information by the conversation reproduction means 27 is resumed. With this configuration, the learner can arbitrarily interrupt the learning.

さらに、音声選択手段３５によれば、第１音声情報と第２音声情報のいずれの再生後に無音時間を設けるかを設定することができるので、学習者は、発声練習をする話者を選択することが可能である。 Further, according to the voice selection means 35, it is possible to set which of the first voice information and the second voice information is to be provided with the silence time, so that the learner selects the speaker to practice speaking. It is possible.

サーバ３は、後述するように、通信回線９を介して、学習者端末５及び管理者端末７に各種画面を表示させ、各種データを取得する。なお、サーバ３が取得したデータは、サーバ３に保存されるように構成するのがよい。 As will be described later, the server 3 causes the learner terminal 5 and the administrator terminal 7 to display various screens via the communication line 9, and acquires various data. The data acquired by the server 3 may be configured to be stored in the server 3.

図２は、サーバ３及び学習者端末５から行う処理を示したフローチャートである。なお、サーバ３には、第１話者と第２話者を含む複数人による一連の会話の音声である会話情報が予め保存されている。 FIG. 2 is a flowchart showing the processing performed from the server 3 and the learner terminal 5. The server 3 stores in advance conversation information, which is the voice of a series of conversations by a plurality of people including the first speaker and the second speaker.

まず、サーバ３から学習端末５に会話情報及び会話情報に対応するテキスト情報が送信され（ステップＳ１）、学習者端末５が会話情報及びテキスト情報を受信する（ステップＳ２）。学習者端末５は、会話情報のうち第１話者のセリフを再生（ステップＳ３）した後に、無音時間を設ける（ステップＳ４）。無音時間の間に学習者が第１話者のセリフを真似て発した音声を学習者音声情報として録音することもできる（ステップＳ４）。学習者端末５は、無音時間が経過した後、第２話者のセリフを再生する（ステップＳ５）。全ての会話情報が再生され終わるまで、ステップＳ３からステップＳ５が繰り返される。会話情報の再生が終了したら（ステップＳ６）、学習者端末５からサーバ３に学習者音声情報が送信され（ステップＳ７）、サーバ３が学習者音声情報を受信する（ステップＳ８）。次に、サーバ３は、学習者音声情報の発音正確度を判定し（ステップＳ９）、発音正確度を学習者端末５に送信する（ステップＳ１０）。学習者端末５は、サーバ３から発音正確度を受信し（ステップＳ１１）、発音正確度を後述の（ｃ）ユーザ画面に表示する（ステップＳ１２）。 First, the conversation information and the text information corresponding to the conversation information are transmitted from the server 3 to the learning terminal 5 (step S1), and the learner terminal 5 receives the conversation information and the text information (step S2). The learner terminal 5 provides a silent time after reproducing the dialogue of the first speaker in the conversation information (step S3) (step S4). It is also possible to record the voice emitted by the learner imitating the dialogue of the first speaker during the silent time as the learner's voice information (step S4). The learner terminal 5 reproduces the dialogue of the second speaker after the silence time has elapsed (step S5). Steps S3 to S5 are repeated until all the conversation information has been reproduced. When the reproduction of the conversation information is completed (step S6), the learner voice information is transmitted from the learner terminal 5 to the server 3 (step S7), and the server 3 receives the learner voice information (step S8). Next, the server 3 determines the pronunciation accuracy of the learner's voice information (step S9), and transmits the pronunciation accuracy to the learner terminal 5 (step S10). The learner terminal 5 receives the pronunciation accuracy from the server 3 (step S11), and displays the pronunciation accuracy on the user screen (c) described later (step S12).

図３（ａ）〜（ｃ）は、画面表示手段５９によって学習者端末５の画面に表示されるスタート画面６１、機能画面６３、及びユーザ画面６５の一例である。スタート画面６１は、ステップＳ２の前に学習者端末５の画面に表示され、第１話者の話し相手である第２話者の情報、例えば、名前６７と、第２話者のイメージ画像６９等の話者情報と、通話開始ボタンである発信ボタン７１と、通話終了ボタンである切断ボタン７３が表示されている。学習者が発信ボタン７１を押すと、ステップＳ２からＳ６（図２参照）が実行される。学習者が切断ボタン７３を押すと、一連の会話情報の再生途中であっても再生が中断され、学習が終了され、ステップＳ７（図２参照）が実行される。このように、学習者端末５の画面に、会話再生手段２７の起動前には、第１話者又は第２話者の情報と通話開始ボタンを表示させ、会話再生手段２７の起動中には、第１話者又は第２話者の情報と通話終了ボタンを表示させることにより、学習者が学習を開始するときに電話を掛け、学習中は電話中であることを装うことができるので、学習者は一層人目を気にする必要がない。 3A to 3C are examples of a start screen 61, a function screen 63, and a user screen 65 displayed on the screen of the learner terminal 5 by the screen display means 59. The start screen 61 is displayed on the screen of the learner terminal 5 before step S2, and the information of the second speaker who is the talk partner of the first speaker, for example, the name 67 and the image image 69 of the second speaker, etc. The speaker information, the call start button 71, and the call end button 73 are displayed. When the learner presses the call button 71, steps S2 to S6 (see FIG. 2) are executed. When the learner presses the disconnect button 73, the reproduction is interrupted even during the reproduction of a series of conversation information, the learning is terminated, and step S7 (see FIG. 2) is executed. In this way, the information of the first speaker or the second speaker and the call start button are displayed on the screen of the learner terminal 5 before the conversation reproduction means 27 is activated, and during the activation of the conversation reproduction means 27, the information and the call start button are displayed. By displaying the information of the first speaker or the second speaker and the call end button, the learner can make a call when starting learning and pretend to be on the phone during learning. Learners don't have to worry more about the eyes.

機能画面６３は、ステップＳ３からＳ５の会話情報の再生中に学習者端末５の画面に表示され得るものであり、第２話者の名前６７と、テキスト情報の表示のＯＮ／ＯＦＦを選択するチェックボックス７５と、無音時間の長さを設定する無音時間設定手段である無音時間スライダー７７と、切断ボタン７３が表示されている。チェックボックス７５でＯＮを選択すると、再生中の会話情報に対応するテキスト情報を学習者端末の画面に表示することができ、ＯＦＦを選択すると当該テキスト情報を非表示にすることができる。会話情報に対応するテキスト情報を表示すれば、学習者は会話情報の原文又は訳文等を見ながら言語学習を行うことが可能になる。また、無音時間スライダー７７のポインタを動かすと、無音時間の長さを調整することができる。このように、学習者端末５が無音時間設定手段を有する構成とすれば、学習者が各自の話し方に合わせて自由に無音時間の長さを設定することが可能になるため、周囲の人に対しては、さらに自然な会話のように聞こえる。 The function screen 63 can be displayed on the screen of the learner terminal 5 during the reproduction of the conversation information in steps S3 to S5, and selects ON / OFF of the second speaker's name 67 and the display of the text information. A check box 75, a silence time slider 77 which is a silence time setting means for setting the length of the silence time, and a disconnect button 73 are displayed. When ON is selected in the check box 75, the text information corresponding to the conversation information being played can be displayed on the screen of the learner terminal, and when OFF is selected, the text information can be hidden. By displaying the text information corresponding to the conversation information, the learner can learn the language while looking at the original text or the translated text of the conversation information. Further, by moving the pointer of the silence time slider 77, the length of the silence time can be adjusted. In this way, if the learner terminal 5 is configured to have the silence time setting means, the learner can freely set the length of the silence time according to his / her own speaking style, so that the surrounding people can freely set the length of the silence time. On the other hand, it sounds like a more natural conversation.

ユーザ画面６５は、学習者による言語学習システム１へのログイン後や学習終了後等に学習者端末５の画面に表示され得るものであり、学習時間７９と、達成率８１と、発音正確度８３と、一連の会話情報の一覧８５が表示されている。学習時間７９は、学習者端末５により記録された学習者が学習した時間の合計であり、達成率８１は、一連の会話情報の一覧８５に対し、学習が完了した一連の会話情報の割合を表している。本構成とすることにより、学習者は、自身の学習度合いを簡便に知ることができる。 The user screen 65 can be displayed on the screen of the learner terminal 5 after the learner logs in to the language learning system 1 or after learning is completed, and has a learning time of 79, an achievement rate of 81, and a pronunciation accuracy of 83. , A list 85 of a series of conversation information is displayed. The learning time 79 is the total time learned by the learner recorded by the learner terminal 5, and the achievement rate 81 is the ratio of the series of conversation information for which learning has been completed to the list 85 of the series of conversation information. Represents. With this configuration, the learner can easily know his / her own learning degree.

図４は、管理者端末７に表示される管理者画面８７の一例である。管理者画面８７には、管理者端末７がサーバ３から受信した、各学習者の氏名８９、合計学習時間９１、達成率９３、発音正確度９５等の情報が表示される。本構成とすることにより、管理者は、各学習者の学習の進捗状況を簡便に把握することができる。 FIG. 4 is an example of the administrator screen 87 displayed on the administrator terminal 7. Information such as the name 89 of each learner, the total learning time 91, the achievement rate 93, and the pronunciation accuracy 95 received by the administrator terminal 7 from the server 3 is displayed on the administrator screen 87. With this configuration, the administrator can easily grasp the learning progress of each learner.

以上に説明した本発明の言語学習システム等によれば、学習者は自然な一連の会話の中で、直前のセリフを真似て発声練習をすることができるので、携帯電話、ヘッドホン又はイヤホン等を用いて学習者のみが会話音声を聞くことができる状況であれば、周囲の人にとっては、学習者が電話の相手と自然な会話を行っているように聞こえるため、学習者は、周囲の人に対して羞恥を感じることなく学習することができる。 According to the language learning system of the present invention described above, the learner can practice speaking by imitating the immediately preceding dialogue in a natural series of conversations. If the situation is such that only the learner can hear the conversation voice, the learner is the person around him because it sounds like the learner is having a natural conversation with the other person on the phone. You can learn without feeling ashamed.

１言語学習システム
５情報端末（学習者端末）
２７会話再生手段
２９無音時間設定手段
３１特定音声検出手段
３２リピート再生手段
３３スロー再生手段
３４一時停止手段
３５音声選択手段
５９画面表示手段
６７，６９第一話者又は第二話者の情報
７１通話開始ボタン
７３通話終了ボタン 1 Language learning system 5 Information terminal (learner terminal)
27 Conversation playback means 29 Silence time setting means 31 Specific voice detection means 32 Repeat playback means 33 Slow playback means 34 Pause means 35 Voice selection means 59 Screen display means 67, 69 Information of first speaker or second speaker 71 Call Start button 73 Call end button

Claims

It is a language learning system for the learner to access from an information terminal and to practice vocalization following the sample voice.
A series of conversation information including the first voice information which is the voice information of the first speaker and the second voice information which is the voice information of the second speaker in order is either the first voice information or the second voice information. It is characterized by having a conversation reproduction means for later reproducing while having a silent time.
Language learning system.

The language learning system according to claim 1, wherein the conversation information is conversation information for answering a telephone call.

A specific voice detecting means for detecting a first-class specific voice, which is a specific voice by the learner,
When the first-class specific voice is detected by the specific voice detecting means, the first voice information, the second voice information, or both of the first voice information and / or the second voice information reproduced immediately before by the conversation reproduction means are returned to the above. The language learning system according to claim 1 or 2, further comprising a repeat reproduction means for initiating reproduction of conversation information.

Specific voice detecting means for detecting the second type specific voice which is a specific voice by the learner, and
When the type 2 specific voice is detected by the specific voice detecting means, the first voice information, the second voice information, or both of the first voice information and / or the second voice information reproduced immediately before by the conversation reproduction means are returned to the above. The language learning system according to any one of claims 1 to 3, further comprising a slow reproduction means for slowing down the reproduction speed of the conversation information and starting the reproduction of the conversation information.

A specific voice detecting means for detecting a type 3 specific voice and a type 4 specific voice, which are specific voices by the learner, and
When the third type specific voice is detected by the specific voice detecting means, the reproduction of the conversation information by the conversation playing means is stopped, and the reproduction of the conversation information is stopped.
The language learning according to any one of claims 1 to 4, further comprising a pause means for resuming the reproduction of the conversation information by the conversation reproduction means when the type 4 specific voice is detected by the specific voice detection means. system.

The language learning system according to any one of claims 1 to 5, further comprising a voice selection means for setting which of the first voice information and the second voice information is to be provided with a silence time.

With more screen display means,
The screen display means is attached to the information terminal.
Before activating the conversation reproduction means, the information of the first speaker or the second speaker and the call start button are displayed.
While the conversation reproduction means is activated, the information of the first speaker or the second speaker and the call end button are displayed.
The language learning system according to any one of claims 1 to 6.

The language learning system according to any one of claims 1 to 7, further comprising a silence time setting means for setting the length of the silence time.

It is a language learning support method using a language learning system for a learner to access from an information terminal and to practice vocalization by the learner following a sample voice.
The language learning system
A series of conversation information including the first voice information which is the voice information of the first speaker and the second voice information which is the voice information of the second speaker in order is either the first voice information or the second voice information. A language learning support method characterized by having a conversation reproduction means for later reproducing while having a silent time.

A program for causing a computer to execute the language learning support method according to claim 9.