WO2020204256A1 - Automatic multimedia speech recognition evaluation system using a speech synthesis engine

Info

Publication number
WO2020204256A1
Authority
WO
WIPO (PCT)
Prior art keywords
speech
test
voice
unit
recognition
Prior art date
Application number
PCT/KR2019/006336
Other languages
English (en)
Korean (ko)
Inventor
이충재
이창재
송민규
Original Assignee
미디어젠 주식회사
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 미디어젠 주식회사
Publication of WO2020204256A1

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40 Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43 Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/439 Processing of audio elementary streams
    • H04N21/4398 Processing of audio elementary streams involving reformatting operations of audio signals
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/01 Assessment or evaluation of speech recognition systems
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/26 Speech to text systems
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40 Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43 Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/435 Processing of additional data, e.g. decrypting of additional data, reconstructing software from modules extracted from the transport stream
    • H04N21/4353 Processing of additional data, e.g. decrypting of additional data, reconstructing software from modules extracted from the transport stream involving decryption of additional data
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40 Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43 Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/435 Processing of additional data, e.g. decrypting of additional data, reconstructing software from modules extracted from the transport stream
    • H04N21/4355 Processing of additional data, e.g. decrypting of additional data, reconstructing software from modules extracted from the transport stream involving reformatting operations of additional data, e.g. HTML pages on a television screen
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40 Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43 Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/439 Processing of audio elementary streams
    • H04N21/4394 Processing of audio elementary streams involving operations for analysing the audio stream, e.g. detecting features or characteristics in audio streams
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/80 Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
    • H04N21/85 Assembly of content; Generation of multimedia applications
    • H04N21/854 Content authoring
    • H04N21/8543 Content authoring using a description language, e.g. Multimedia and Hypermedia information coding Expert Group [MHEG], eXtensible Markup Language [XML]

Abstract

The present invention relates to an automatic multimedia speech recognition evaluation system using a speech synthesis engine. More specifically, it relates to a system that uses a TTS (text-to-speech) engine to play back and evaluate new sentence patterns at any time, in real time and without recording. This addresses the problems of a conventional automatic speech recognition evaluation system, which sequentially plays back pre-recorded speech data to evaluate the recognition rate of a speech recognition device: the excessive time and cost of continuously building a recorded-speech database, and the possibility that other sentences with the same meaning are not recognized. The invention thus provides an automatic multimedia speech recognition evaluation system using a speech synthesis engine that can provide accurate and fast performance and functional verification results.
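The evaluation loop described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the patented implementation: the names `evaluate`, `recognition_rate`, `UtteranceResult`, `fake_tts`, and `fake_asr` are all hypothetical, the TTS engine and the recognizer under test are passed in as plain callables, and a pass is assumed to mean an exact match after simple text normalization.

```python
from dataclasses import dataclass
from typing import Callable, Iterable, List


@dataclass
class UtteranceResult:
    sentence: str    # text handed to the TTS engine
    recognized: str  # text returned by the recognizer under test
    passed: bool     # True when the normalized texts are equal


def normalize(text: str) -> str:
    """Lowercase and collapse whitespace so trivial differences are ignored."""
    return " ".join(text.lower().split())


def evaluate(
    sentences: Iterable[str],
    synthesize: Callable[[str], bytes],
    recognize: Callable[[bytes], str],
) -> List[UtteranceResult]:
    """Synthesize each sentence on the fly and score the recognizer's output."""
    results = []
    for sentence in sentences:
        audio = synthesize(sentence)   # no pre-recorded database is needed
        recognized = recognize(audio)  # device or engine under test
        passed = normalize(sentence) == normalize(recognized)
        results.append(UtteranceResult(sentence, recognized, passed))
    return results


def recognition_rate(results: List[UtteranceResult]) -> float:
    """Fraction of sentences recognized correctly (0.0 for an empty run)."""
    if not results:
        return 0.0
    return sum(r.passed for r in results) / len(results)


# Stand-ins for a real TTS engine and recognizer (assumptions for the demo).
def fake_tts(text: str) -> bytes:
    return text.encode("utf-8")


def fake_asr(audio: bytes) -> str:
    text = audio.decode("utf-8")
    # Simulate one misrecognition so the rate is not trivially 100%.
    return "play the music" if text == "Play the news" else text.upper()


if __name__ == "__main__":
    run = evaluate(["Turn on the radio", "Play the news"], fake_tts, fake_asr)
    for r in run:
        print(r.sentence, "->", r.recognized, "PASS" if r.passed else "FAIL")
    print("recognition rate:", recognition_rate(run))
```

Because the test sentences are ordinary strings, a new phrasing with the same meaning can be added to the list and evaluated immediately, which is the advantage over a recorded-speech database that the abstract claims.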
PCT/KR2019/006336 2019-04-04 2019-05-27 Automatic multimedia speech recognition evaluation system using a speech synthesis engine WO2020204256A1 (fr)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
KR10-2019-0039483 2019-04-04
KR1020190039483A KR102020773B1 (ko) 2019-04-04 2019-04-04 Automatic multimedia speech recognition evaluation system using a speech synthesis engine

Publications (1)

Publication Number Publication Date
WO2020204256A1 (fr) 2020-10-08

Family

ID=68578103

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/KR2019/006336 WO2020204256A1 (fr) 2019-04-04 2019-05-27 Automatic multimedia speech recognition evaluation system using a speech synthesis engine

Country Status (2)

Country Link
KR (1) KR102020773B1 (fr)
WO (1) WO2020204256A1 (fr)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
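CN113450768A (zh) * 2021-06-25 2021-09-28 平安科技(深圳)有限公司 Speech synthesis system evaluation method, apparatus, readable storage medium, and terminal device
CN113836010A (zh) * 2021-09-14 2021-12-24 招商银行股份有限公司 Automated testing method, system, and storage medium for intelligent voice customer service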

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112908298B (zh) * 2021-01-18 2022-12-09 杭州国芯科技股份有限公司 Automatic transcription and testing method for speech recognition test projects

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20050086055A1 (en) * 2003-09-04 2005-04-21 Masaru Sakai Voice recognition estimating apparatus, method and program
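KR20060062884A (ko) * 2004-12-06 2006-06-12 한국전자통신연구원 Speech recognition engine evaluation system and automation method in a noisy environment
KR20130029635A (ko) * 2011-09-15 2013-03-25 현대모비스 주식회사 Speech recognition performance evaluation module and method thereof
KR20130051278A (ko) * 2011-11-09 2013-05-20 엘지전자 주식회사 Apparatus for providing personalized TTS
KR101605848B1 (ko) * 2014-11-24 2016-04-01 하동경 Speech recognition performance evaluation method and apparatus thereof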

Also Published As

Publication number Publication date
KR102020773B1 (ko) 2019-11-04

Similar Documents

Publication Publication Date Title
US8219397B2 (en) Data processing system for autonomously building speech identification and tagging data
WO2020204256A1 (fr) Automatic multimedia speech recognition evaluation system using a speech synthesis engine
US8209169B2 (en) Synchronization of an input text of a speech with a recording of the speech
WO2020027619A1 (fr) Method, device, and computer-readable storage medium for speech synthesis using machine learning based on a sequential prosody feature
WO2019139428A1 (fr) Method for speech synthesis from multilingual text
WO2018151464A1 (fr) Coding system and coding method using speech recognition
KR20150014236A (ko) Interactive character-based foreign language learning apparatus and method
WO2019208860A1 (fr) Method for recording and outputting a conversation between multiple parties using speech recognition technology, and associated device
WO2021033865A1 (fr) Method and apparatus for learning written Korean
WO2021251539A1 (fr) Method for implementing an interactive message using an artificial neural network, and associated device
US20150254238A1 (en) System and Methods for Maintaining Speech-To-Speech Translation in the Field
Yarra et al. Indic TIMIT and Indic English lexicon: A speech database of Indian speakers using TIMIT stimuli and a lexicon from their mispronunciations
Shahriar et al. A communication platform between bangla and sign language
US20200320976A1 (en) Information processing apparatus, information processing method, and program
JP6605105B1 (ja) Sentence symbol insertion device and method thereof
KR101992370B1 (ko) Speaking learning method and learning system
US20210264812A1 (en) Language learning system and method
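WO2022169208A1 (fr) Voice visualization system for learning English, and associated method
WO2021154018A1 (fr) Electronic device and method for controlling the electronic device
CN114170856A (zh) Machine-implemented listening training method, device, and readable storage medium
CN114420159A (zh) Audio evaluation method and apparatus, and non-transitory storage medium
KR102107447B1 (ko) Text-to-speech apparatus providing a translation function based on application of a selective voice model, and operating method thereof
JP2003162524A (ja) Language processing device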
Tits et al. Flowchase: a Mobile Application for Pronunciation Training
WO2019156427A1 (fr) Method for identifying a speaker based on a spoken word and associated apparatus, and apparatus for managing a voice model based on context and associated method

Legal Events

Date Code Title Description
121 EP: the EPO has been informed by WIPO that EP was designated in this application

Ref document number: 19922597

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 EP: PCT application non-entry in European phase

Ref document number: 19922597

Country of ref document: EP

Kind code of ref document: A1

32PN EP: public notification in the EP bulletin as address of the addressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 15-06-2022)
