KR20110110382A - The method of using by subtitle of multimedia on voice recognition system for language learning - Google Patents

The method of using by subtitle of multimedia on voice recognition system for language learning Download PDF

Info

Publication number
KR20110110382A
KR20110110382A KR1020100029654A KR20100029654A KR20110110382A KR 20110110382 A KR20110110382 A KR 20110110382A KR 1020100029654 A KR1020100029654 A KR 1020100029654A KR 20100029654 A KR20100029654 A KR 20100029654A KR 20110110382 A KR20110110382 A KR 20110110382A
Authority
KR
South Korea
Prior art keywords
learning
video
subtitles
user
voice recognition
Prior art date
Application number
KR1020100029654A
Other languages
Korean (ko)
Inventor
이성기
Original Assignee
이성기
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 이성기 filed Critical 이성기
Priority to KR1020100029654A priority Critical patent/KR20110110382A/en
Publication of KR20110110382A publication Critical patent/KR20110110382A/en

Links

Images

Classifications

    • GPHYSICS
    • G09EDUCATION; CRYPTOGRAPHY; DISPLAY; ADVERTISING; SEALS
    • G09BEDUCATIONAL OR DEMONSTRATION APPLIANCES; APPLIANCES FOR TEACHING, OR COMMUNICATING WITH, THE BLIND, DEAF OR MUTE; MODELS; PLANETARIA; GLOBES; MAPS; DIAGRAMS
    • G09B19/00Teaching not covered by other main groups of this subclass
    • G09B19/06Foreign languages
    • GPHYSICS
    • G09EDUCATION; CRYPTOGRAPHY; DISPLAY; ADVERTISING; SEALS
    • G09BEDUCATIONAL OR DEMONSTRATION APPLIANCES; APPLIANCES FOR TEACHING, OR COMMUNICATING WITH, THE BLIND, DEAF OR MUTE; MODELS; PLANETARIA; GLOBES; MAPS; DIAGRAMS
    • G09B5/00Electrically-operated educational appliances
    • G09B5/06Electrically-operated educational appliances with both visual and audible presentation of the material to be studied
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition

Landscapes

  • Engineering & Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Educational Technology (AREA)
  • Educational Administration (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Human Computer Interaction (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Health & Medical Sciences (AREA)
  • Computational Linguistics (AREA)
  • Electrically Operated Instructional Devices (AREA)

Abstract

The present invention automatically converts the contents into contents capable of voice recognition learning using subtitles extracted or automatically generated from multimedia media data such as moving images and voice files, so that the user can simultaneously play the multimedia data through various playback media. The present invention relates to a speech recognition language learning system.
The present invention makes it possible to easily use a variety of multimedia data as a tool for language learning and at the same time enable language learning through speech recognition technology to increase its utilization and learning effect.
The user can enjoy multimedia data according to the user's taste through the desired playback media, and convert the contents into learning contents to enable speech recognition language learning to enhance the user's interest and learning effect.
In particular, since it can be used as learning data without violating the inherent rights of the work, there is little restriction on the data that can be learned and the scope of its use is vast.

Description

The method of using by subtitle of multimedia on voice recognition system for language learning}

It is a technology that uses a speech recognition engine as a learning tool by associating a subtitle system that is currently operated with media-related playback.

Speech recognition is a technology that finds the closest result from a list of pre-input recognitions by extracting and analyzing features from the voice of a person delivered to a computer or a voice recognition system through a telephone or a microphone. This technique is currently used in various forms, especially in the case of English learning is the best technique that can be applied to the learning of speaking.

Traditionally, the language learning method is a simple method of learning through voice, text, screen, and voice through subtitles provided with the video, and edits the video for educational purposes or uses a related authoring tool to combine voice recognition technology. In order to utilize video as learning content by producing and distributing the learning content by hand, it is an invention for solving the disadvantage that a lot of time and money are put into it.

The present invention provides a method that can easily and easily provide the content necessary for speech learning, which is most important in language learning, and provides a subtitle file consisting of the current playing time and subtitles in order to reduce the time and cost required for producing learning content. It was able to learn speech recognition language automatically.

Through the present invention, it is possible to recycle various types of images as learning contents by using subtitles of the images, and to stimulate the visual and auditory senses through the images and to speak and learn at the same time. Allow them to learn.

FIG. 1 is a flowchart illustrating a procedure of performing voice recognition learning using subtitle synchronization information for matching subtitles with corresponding subtitles, screens, and subtitles when a video file is played.

The present invention is based on a simple command such as play, pause, and stop on the screen playback in conjunction with tools and programs for video playback and playback of the video based on the play time information of the subtitles related to the video and pause for voice recognition language learning. After performing the voice recognition learning to learn the subtitle corresponding to the video and show the results immediately after the method to play the next video repeatedly to learn.

Claims (3)

Method of reconstructing speech recognition language learning content by using the form of generalized subtitle file composed of video subtitles and time information when subtitles are displayed on video when playing video files Using the reconstructed contents, the video corresponding to the voice recognition learning is played using the reconstructed contents, and the video is paused to learn the subtitles on the video, and then the subtitles of the video are displayed separately and the voice recognition is performed. How to display the evaluation score for similarity through voice recognition based on the user's pronunciation when the user pronounces the subtitles through the voice input device at the same time as the beep signal indicating the beginning The scope of use of the present invention is applicable to services through web pages through web browsers, portable devices and smartphone applications.
KR1020100029654A 2010-04-01 2010-04-01 The method of using by subtitle of multimedia on voice recognition system for language learning KR20110110382A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
KR1020100029654A KR20110110382A (en) 2010-04-01 2010-04-01 The method of using by subtitle of multimedia on voice recognition system for language learning

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
KR1020100029654A KR20110110382A (en) 2010-04-01 2010-04-01 The method of using by subtitle of multimedia on voice recognition system for language learning

Publications (1)

Publication Number Publication Date
KR20110110382A true KR20110110382A (en) 2011-10-07

Family

ID=45026941

Family Applications (1)

Application Number Title Priority Date Filing Date
KR1020100029654A KR20110110382A (en) 2010-04-01 2010-04-01 The method of using by subtitle of multimedia on voice recognition system for language learning

Country Status (1)

Country Link
KR (1) KR20110110382A (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR20190040891A (en) * 2017-10-11 2019-04-19 주식회사 산타 System and Method for Extracting Voice of Video Contents and Interpreting Machine Translation Thereof Using Cloud Service
KR20190040890A (en) * 2018-08-24 2019-04-19 주식회사 산타 Voice Extraction of Video Contents Using Cloud Service and Service Providing System for Interpreting Machine Translation
CN112017663A (en) * 2020-08-14 2020-12-01 博泰车联网(南京)有限公司 Voice generalization method and device and computer storage medium

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR20190040891A (en) * 2017-10-11 2019-04-19 주식회사 산타 System and Method for Extracting Voice of Video Contents and Interpreting Machine Translation Thereof Using Cloud Service
KR20190040890A (en) * 2018-08-24 2019-04-19 주식회사 산타 Voice Extraction of Video Contents Using Cloud Service and Service Providing System for Interpreting Machine Translation
CN112017663A (en) * 2020-08-14 2020-12-01 博泰车联网(南京)有限公司 Voice generalization method and device and computer storage medium
CN112017663B (en) * 2020-08-14 2024-04-30 博泰车联网(南京)有限公司 Voice generalization method and device and computer storage medium

Similar Documents

Publication Publication Date Title
US20200294487A1 (en) Hands-free annotations of audio text
Mirzaei et al. Partial and synchronized captioning: A new tool to assist learners in developing second language listening skill
US20130196292A1 (en) Method and system for multimedia-based language-learning, and computer program therefor
Perego Audio description: Evolving recommendations for usable, effective and enjoyable practices
US20200058288A1 (en) Timbre-selectable human voice playback system, playback method thereof and computer-readable recording medium
CN104252861A (en) Video voice conversion method, video voice conversion device and server
Moore et al. Word-level emotion recognition using high-level features
JP2022533310A (en) A system and method for simultaneously expressing content in a target language in two forms and improving listening comprehension of the target language
US20150213793A1 (en) Methods and systems for converting text to video
CN107403011A (en) Reality environment language learning implementation method and automatic recording control method
CN111462553A (en) Language learning method and system based on video dubbing and sound correction training
Che et al. Automatic online lecture highlighting based on multimedia analysis
KR20110110382A (en) The method of using by subtitle of multimedia on voice recognition system for language learning
Henrichsen et al. Predicting the attitude flow in dialogue based on multi-modal speech cues
TW201102836A (en) Content adaptive multimedia processing system and method for the same
US11537781B1 (en) System and method to support synchronization, closed captioning and highlight within a text document or a media file
KR20140078810A (en) Apparatus and method for learning rhythm pattern by using native speaker's pronunciation data and language data.
KR20140087956A (en) Apparatus and method for learning phonics by using native speaker's pronunciation data and word and sentence and image data
US10593366B2 (en) Substitution method and device for replacing a part of a video sequence
KR20140075994A (en) Apparatus and method for language education by using native speaker's pronunciation data and thought unit
KR20140087951A (en) Apparatus and method for learning english grammar by using native speaker's pronunciation data and image data.
KR20140107067A (en) Apparatus and method for learning word by using native speakerpronunciation data and image data
KR20140079677A (en) Apparatus and method for learning sound connection by using native speaker's pronunciation data and language data.
Wald Concurrent collaborative captioning
CN109903594A (en) Spoken language exercise householder method, device, equipment and storage medium

Legal Events

Date Code Title Description
A201 Request for examination
E902 Notification of reason for refusal
E601 Decision to refuse application