WO2014148665A3 - Multimedia content editing device and method - Google Patents

Multimedia content editing device and method

Info

Publication number
WO2014148665A3
Authority
WO
WIPO (PCT)
Prior art keywords
voice
text
data
unit
text object
Prior art date
Application number
PCT/KR2013/002502
Other languages
English (en)
French (fr)
Other versions
WO2014148665A2 (ko)
Inventor
정찬의
Original Assignee
디노플러스(주)
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 디노플러스(주)
Publication of WO2014148665A2
Publication of WO2014148665A3

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/48 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
    • G10L25/51 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/04 Segmentation; Word boundary detection
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00 Speech synthesis; Text to speech systems
    • G10L13/02 Methods for producing synthetic speech; Speech synthesisers
    • G10L13/033 Voice editing, e.g. manipulating the voice of the synthesiser
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/02 Feature extraction for speech recognition; Selection of recognition unit

Abstract

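The invention relates to a multimedia content editing device and method that synchronize voice data with text data when multimedia content is produced. The device comprises: a text object generation unit that splits input text data sequentially into paragraphs, sentences, and words, in that order, and then generates word-level text objects; a voice recognition unit that designates the sentence-end positions in input voice data, detects phoneme segments, and then performs voice recognition; a voice object generation unit that generates voice text objects from the voice data recognized by the voice recognition unit; and an automatic synchronization unit that synchronizes voice and text by comparing the text objects with the voice text objects using template matching. Because voice data and text data can be synchronized automatically, the synchronization work takes less time than the existing manual process, and its efficiency and accuracy can be improved.
PCT/KR2013/002502 2013-03-21 2013-03-26 Multimedia content editing device and method WO2014148665A2 (ko)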
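
The pipeline in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the class names `TextObject` and `VoiceTextObject`, the regular expressions used for splitting, and the hand-written recognizer output are all assumptions made for illustration. The abstract does not say how its template matching works, so the standard-library `difflib.SequenceMatcher` is used here as a stand-in for the comparison step.

```python
import re
from dataclasses import dataclass
from difflib import SequenceMatcher
from typing import List, Optional


@dataclass
class TextObject:
    """Word-level text object (hypothetical layout, not from the patent)."""
    paragraph: int
    sentence: int
    word: str
    start: Optional[float] = None  # seconds; filled in by synchronize()
    end: Optional[float] = None


@dataclass
class VoiceTextObject:
    """One recognized word and its time span in the audio (hypothetical)."""
    word: str
    start: float
    end: float


def make_text_objects(text: str) -> List[TextObject]:
    """Split text into paragraphs, then sentences, then words."""
    objects: List[TextObject] = []
    paragraphs = [p for p in re.split(r"\n\s*\n", text) if p.strip()]
    for p_idx, paragraph in enumerate(paragraphs):
        sentences = [s for s in re.split(r"(?<=[.!?])\s+", paragraph.strip()) if s]
        for s_idx, sentence in enumerate(sentences):
            for word in sentence.split():
                objects.append(TextObject(p_idx, s_idx, word))
    return objects


def normalize(word: str) -> str:
    """Lower-case a word and drop punctuation so the comparison ignores both."""
    return re.sub(r"[^\w]", "", word).lower()


def synchronize(text_objs: List[TextObject],
                voice_objs: List[VoiceTextObject]) -> List[TextObject]:
    """Copy time spans from recognized words onto the matching text words.

    SequenceMatcher is a stand-in for the template matching named in the
    abstract; words without a match keep start/end set to None.
    """
    a = [normalize(t.word) for t in text_objs]
    b = [normalize(v.word) for v in voice_objs]
    matcher = SequenceMatcher(a=a, b=b, autojunk=False)
    for block in matcher.get_matching_blocks():
        for k in range(block.size):
            voice = voice_objs[block.b + k]
            text_objs[block.a + k].start = voice.start
            text_objs[block.a + k].end = voice.end
    return text_objs


# Hand-written recognizer output; "the" simulates a misrecognized "a".
script = "Hello world. This is a test.\n\nSecond paragraph here."
recognized = [
    VoiceTextObject("hello", 0.0, 0.4), VoiceTextObject("world", 0.4, 0.9),
    VoiceTextObject("this", 1.2, 1.4), VoiceTextObject("is", 1.4, 1.5),
    VoiceTextObject("the", 1.5, 1.6), VoiceTextObject("test", 1.6, 2.0),
    VoiceTextObject("second", 2.5, 2.9), VoiceTextObject("paragraph", 2.9, 3.5),
    VoiceTextObject("here", 3.5, 3.8),
]
synced = synchronize(make_text_objects(script), recognized)
```

In this sketch a misrecognized word is left without a time span rather than forced onto a neighbour, so an editor could review only the unmatched words by hand.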

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
KR20130030117A KR101493006B1 (ko) 2013-03-21 2013-03-21 Multimedia content editing device and method
KR10-2013-0030117 2013-03-21

Publications (2)

Publication Number Publication Date
WO2014148665A2 (ko) 2014-09-25
WO2014148665A3 (ko) 2015-05-07

Family

ID=51581569

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/KR2013/002502 WO2014148665A2 (ko) 2013-03-21 2013-03-26 Multimedia content editing device and method

Country Status (2)

Country Link
KR (1) KR101493006B1 (ko)
WO (1) WO2014148665A2 (ko)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
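CN107908743B (zh) * 2017-11-16 2021-12-03 百度在线网络技术(北京)有限公司 Method and apparatus for building artificial intelligence applications
CN110444199B (zh) * 2017-05-27 2022-01-07 腾讯科技(深圳)有限公司 Voice keyword recognition method, apparatus, terminal, and server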

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2017131028A1 (ja) 2016-01-26 2017-08-03 東レ株式会社 Polyphenylene sulfide resin composition and method for producing same
KR102642259B1 (ko) * 2023-06-22 2024-03-04 유니닥스 주식회사 Data processing device for AI training

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20060285654A1 (en) * 2003-04-14 2006-12-21 Nesvadba Jan Alexis D System and method for performing automatic dubbing on an audio-visual stream
US20120041758A1 (en) * 2007-06-28 2012-02-16 Nuance Communications, Inc. Synchronization of an input text of a speech with a recording of the speech
US20120245719A1 (en) * 2011-03-23 2012-09-27 Story Guy A Jr Managing playback of synchronized content
US20120265527A1 (en) * 2011-04-15 2012-10-18 Hon Hai Precision Industry Co., Ltd. Interactive voice recognition electronic device and method
KR20120129015A (ko) * 2011-05-18 2012-11-28 조성진 Method for generating language-learning content and terminal therefor

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB8817705D0 (en) * 1988-07-25 1988-09-01 British Telecomm Optical communications system

Also Published As

Publication number Publication date
KR20140115536A (ko) 2014-10-01
KR101493006B1 (ko) 2015-02-13
WO2014148665A2 (ko) 2014-09-25

Similar Documents

Publication Publication Date Title
GB2529564A (en) Method, apparatus and system for regenerating voice intonation in automatically dubbed videos
AU2019268131A1 (en) Speech recognition method, speech wakeup apparatus, speech recognition apparatus, and terminal
EP3767622A3 (en) Automatically determining language for speech recognition of spoken utterance received via an automated assistant interface
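WO2009078256A1 (ja) Pronunciation variation rule extraction device, pronunciation variation rule extraction method, and pronunciation variation rule extraction program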
WO2014209810A3 (en) Methods and apparatuses for mining synonymous phrases, and for searching related content
EP3001662A3 (en) Conference proceed apparatus and method for advancing conference
SG11201808360SA (en) Acoustic model training method, speech recognition method, apparatus, device and medium
WO2017218243A3 (en) Intent recognition and emotional text-to-speech learning system
MX2014010795A (es) Device for extracting information from a dialogue.
WO2014197334A3 (en) System and method for user-specified pronunciation of words for speech synthesis and recognition
WO2017033063A3 (en) Statistics-based machine translation method, apparatus and electronic device
EP3767620A3 (en) Speech endpointing based on word comparisons
EP2963643A3 (en) Entity name recognition
WO2013181158A3 (en) Synchronizing translated digital content
WO2009158581A3 (en) System and method for spoken topic or criterion recognition in digital media and contextual advertising
EP4235648A3 (en) Language model biasing
GB2542288A (en) Enhancing reading accuracy, efficiency and retention
WO2013192218A3 (en) Dynamic language model
WO2014148665A3 (ko) Multimedia content editing device and method
MX2014015611A (es) Method for correcting a voice recognition error and broadcast receiving apparatus applying the same.
WO2012169737A3 (en) Display apparatus and method for executing link and method for recognizing voice thereof
WO2009063445A3 (en) A method and apparatus for fast search in call-center monitoring
WO2012094422A3 (en) A voice based system and method for data input
NZ700273A (en) Negative example (anti-word) based performance improvement for speech recognition
GB2486038B (en) Speech-to-text conversion

Legal Events

Date Code Title Description
32PN EP: public notification in the EP bulletin as address of the addressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 03/03/2016)

122 Ep: pct application non-entry in european phase

Ref document number: 13878866

Country of ref document: EP

Kind code of ref document: A2