WO2020204256A1 - Système multimédia automatique d'évaluation de reconnaissance de parole utilisant un moteur de synthèse de parole - Google Patents
Système multimédia automatique d'évaluation de reconnaissance de parole utilisant un moteur de synthèse de parole Download PDFInfo
- Publication number
- WO2020204256A1 WO2020204256A1 PCT/KR2019/006336 KR2019006336W WO2020204256A1 WO 2020204256 A1 WO2020204256 A1 WO 2020204256A1 KR 2019006336 W KR2019006336 W KR 2019006336W WO 2020204256 A1 WO2020204256 A1 WO 2020204256A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- speech
- test
- voice
- unit
- recognition
- Prior art date
Links
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/43—Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
- H04N21/439—Processing of audio elementary streams
- H04N21/4398—Processing of audio elementary streams involving reformatting operations of audio signals
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/01—Assessment or evaluation of speech recognition systems
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/26—Speech to text systems
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/43—Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
- H04N21/435—Processing of additional data, e.g. decrypting of additional data, reconstructing software from modules extracted from the transport stream
- H04N21/4353—Processing of additional data, e.g. decrypting of additional data, reconstructing software from modules extracted from the transport stream involving decryption of additional data
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/43—Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
- H04N21/435—Processing of additional data, e.g. decrypting of additional data, reconstructing software from modules extracted from the transport stream
- H04N21/4355—Processing of additional data, e.g. decrypting of additional data, reconstructing software from modules extracted from the transport stream involving reformatting operations of additional data, e.g. HTML pages on a television screen
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/43—Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
- H04N21/439—Processing of audio elementary streams
- H04N21/4394—Processing of audio elementary streams involving operations for analysing the audio stream, e.g. detecting features or characteristics in audio streams
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/80—Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
- H04N21/85—Assembly of content; Generation of multimedia applications
- H04N21/854—Content authoring
- H04N21/8543—Content authoring using a description language, e.g. Multimedia and Hypermedia information coding Expert Group [MHEG], eXtensible Markup Language [XML]
Abstract
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR10-2019-0039483 | 2019-04-04 | ||
KR1020190039483A KR102020773B1 (ko) | 2019-04-04 | 2019-04-04 | 음성합성엔진을 이용한 멀티미디어 음성인식 자동 평가시스템 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2020204256A1 true WO2020204256A1 (fr) | 2020-10-08 |
Family
ID=68578103
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/KR2019/006336 WO2020204256A1 (fr) | 2019-04-04 | 2019-05-27 | Système multimédia automatique d'évaluation de reconnaissance de parole utilisant un moteur de synthèse de parole |
Country Status (2)
Country | Link |
---|---|
KR (1) | KR102020773B1 (fr) |
WO (1) | WO2020204256A1 (fr) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113450768A (zh) * | 2021-06-25 | 2021-09-28 | 平安科技(深圳)有限公司 | 语音合成系统评测方法、装置、可读存储介质及终端设备 |
CN113836010A (zh) * | 2021-09-14 | 2021-12-24 | 招商银行股份有限公司 | 语音智能客服自动化测试方法、系统及存储介质 |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112908298B (zh) * | 2021-01-18 | 2022-12-09 | 杭州国芯科技股份有限公司 | 一种语音识别测试项目中自动转录和测试方法 |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20050086055A1 (en) * | 2003-09-04 | 2005-04-21 | Masaru Sakai | Voice recognition estimating apparatus, method and program |
KR20060062884A (ko) * | 2004-12-06 | 2006-06-12 | 한국전자통신연구원 | 잡음환경하의 음성인식엔진 평가 시스템 및 자동화 방법 |
KR20130029635A (ko) * | 2011-09-15 | 2013-03-25 | 현대모비스 주식회사 | 음성인식 성능 평가 모듈 및 그 방법 |
KR20130051278A (ko) * | 2011-11-09 | 2013-05-20 | 엘지전자 주식회사 | 개인화된 tts 제공장치 |
KR101605848B1 (ko) * | 2014-11-24 | 2016-04-01 | 하동경 | 음성인식 성능 평가 방법 및 그 장치 |
-
2019
- 2019-04-04 KR KR1020190039483A patent/KR102020773B1/ko active IP Right Grant
- 2019-05-27 WO PCT/KR2019/006336 patent/WO2020204256A1/fr active Application Filing
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20050086055A1 (en) * | 2003-09-04 | 2005-04-21 | Masaru Sakai | Voice recognition estimating apparatus, method and program |
KR20060062884A (ko) * | 2004-12-06 | 2006-06-12 | 한국전자통신연구원 | 잡음환경하의 음성인식엔진 평가 시스템 및 자동화 방법 |
KR20130029635A (ko) * | 2011-09-15 | 2013-03-25 | 현대모비스 주식회사 | 음성인식 성능 평가 모듈 및 그 방법 |
KR20130051278A (ko) * | 2011-11-09 | 2013-05-20 | 엘지전자 주식회사 | 개인화된 tts 제공장치 |
KR101605848B1 (ko) * | 2014-11-24 | 2016-04-01 | 하동경 | 음성인식 성능 평가 방법 및 그 장치 |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113450768A (zh) * | 2021-06-25 | 2021-09-28 | 平安科技(深圳)有限公司 | 语音合成系统评测方法、装置、可读存储介质及终端设备 |
CN113836010A (zh) * | 2021-09-14 | 2021-12-24 | 招商银行股份有限公司 | 语音智能客服自动化测试方法、系统及存储介质 |
Also Published As
Publication number | Publication date |
---|---|
KR102020773B1 (ko) | 2019-11-04 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US8219397B2 (en) | Data processing system for autonomously building speech identification and tagging data | |
WO2020204256A1 (fr) | Système multimédia automatique d'évaluation de reconnaissance de parole utilisant un moteur de synthèse de parole | |
US8209169B2 (en) | Synchronization of an input text of a speech with a recording of the speech | |
WO2020027619A1 (fr) | Procédé, dispositif et support d'informations lisible par ordinateur pour la synthèse vocale à l'aide d'un apprentissage automatique sur la base d'une caractéristique de prosodie séquentielle | |
WO2019139428A1 (fr) | Procédé de synthèse vocale à partir de texte multilingue | |
WO2018151464A1 (fr) | Système de codage et procédé de codage utilisant la reconnaissance vocale | |
KR20150014236A (ko) | 인터랙티브 캐릭터 기반 외국어 학습 장치 및 방법 | |
WO2019208860A1 (fr) | Procédé d'enregistrement et de sortie de conversation entre de multiples parties au moyen d'une technologie de reconnaissance vocale, et dispositif associé | |
WO2021033865A1 (fr) | Procédé et appareil pour l'apprentissage du coréen écrit | |
WO2021251539A1 (fr) | Procédé permettant de mettre en œuvre un message interactif en utilisant un réseau neuronal artificiel et dispositif associé | |
US20150254238A1 (en) | System and Methods for Maintaining Speech-To-Speech Translation in the Field | |
Yarra et al. | Indic TIMIT and Indic English lexicon: A speech database of Indian speakers using TIMIT stimuli and a lexicon from their mispronunciations | |
Shahriar et al. | A communication platform between bangla and sign language | |
US20200320976A1 (en) | Information processing apparatus, information processing method, and program | |
JP6605105B1 (ja) | 文章記号挿入装置及びその方法 | |
KR101992370B1 (ko) | 말하기 학습방법 및 학습시스템 | |
US20210264812A1 (en) | Language learning system and method | |
WO2022169208A1 (fr) | Système de visualisation vocale pour apprentissage de l'anglais, et procédé associé | |
WO2021154018A1 (fr) | Dispositif électronique et procédé de commande du dispositif électronique | |
CN114170856A (zh) | 用机器实施的听力训练方法、设备及可读存储介质 | |
CN114420159A (zh) | 音频评测方法及装置、非瞬时性存储介质 | |
KR102107447B1 (ko) | 선택적 음성 모델의 적용에 기초한 번역 기능을 제공하는 텍스트 음성 변환 장치 및 그 동작 방법 | |
JP2003162524A (ja) | 言語処理装置 | |
Tits et al. | Flowchase: a Mobile Application for Pronunciation Training | |
WO2019156427A1 (fr) | Procédé d'identification d'un locuteur sur la base d'un mot prononcé et appareil associé, et appareil de gestion de modèle vocal sur la base d'un contexte et procédé associé |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 19922597 Country of ref document: EP Kind code of ref document: A1 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 19922597 Country of ref document: EP Kind code of ref document: A1 |
|
32PN | Ep: public notification in the ep bulletin as address of the adressee cannot be established |
Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 15-06-2022) |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 19922597 Country of ref document: EP Kind code of ref document: A1 |