WO2014148665A3 - Apparatus and method for editing multimedia content - Google Patents
Apparatus and method for editing multimedia content
- Publication number
- WO2014148665A3 (application PCT/KR2013/002502)
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- voice
- text
- data
- unit
- text object
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
- G10L25/51—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/04—Segmentation; Word boundary detection
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/02—Methods for producing synthetic speech; Speech synthesisers
- G10L13/033—Voice editing, e.g. manipulating the voice of the synthesiser
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/02—Feature extraction for speech recognition; Selection of recognition unit
Abstract
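The invention relates to an apparatus and method for editing multimedia content that synchronize voice data with text data during multimedia content production. The apparatus comprises: a text object generation unit that splits input text data sequentially into paragraphs, sentences, and words, and then generates word-level text objects; a voice recognition unit that designates sentence end positions in the input voice data, detects phoneme segments, and then performs voice recognition; a voice object generation unit that generates voice text objects from the voice data recognized by the voice recognition unit; and an automatic synchronization unit that synchronizes voice and text by comparing the text objects with the voice text objects using template matching. Because voice data and text data are synchronized automatically, synchronization takes less time than the existing manual process, and the efficiency and accuracy of the synchronization work improve.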
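The data flow described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the class names `TextObject` and `VoiceTextObject`, the regex-based splitting rules, and the assumption that a recognizer returns words with start and end times are all hypothetical. The abstract's template matching step is not specified in detail, so the sketch substitutes the standard library's `difflib.SequenceMatcher` as a simple stand-in for comparing the two word sequences.

```python
import re
from dataclasses import dataclass
from difflib import SequenceMatcher
from typing import List, Optional


@dataclass
class TextObject:
    """Word-level text object (hypothetical structure, not from the patent)."""
    paragraph: int
    sentence: int
    word: str
    start: Optional[float] = None  # seconds; filled in by synchronization
    end: Optional[float] = None


@dataclass
class VoiceTextObject:
    """A recognized word with timing, as a recognizer might emit it (assumed)."""
    word: str
    start: float
    end: float


def build_text_objects(text: str) -> List[TextObject]:
    """Split text into paragraphs, then sentences, then words."""
    objects: List[TextObject] = []
    # Assumption: paragraphs are separated by blank lines.
    paragraphs = [p for p in re.split(r"\n\s*\n", text) if p.strip()]
    for p_idx, paragraph in enumerate(paragraphs):
        # Assumption: a sentence ends with '.', '!' or '?' followed by whitespace.
        sentences = [s for s in re.split(r"(?<=[.!?])\s+", paragraph.strip()) if s]
        for s_idx, sentence in enumerate(sentences):
            for word in sentence.split():
                objects.append(TextObject(p_idx, s_idx, word))
    return objects


def normalize(word: str) -> str:
    """Lower-case a word and drop punctuation so both sides compare equal."""
    return re.sub(r"\W+", "", word).lower()


def synchronize(text_objs: List[TextObject],
                voice_objs: List[VoiceTextObject]) -> List[TextObject]:
    """Copy timings from recognized words onto the matching text objects.

    SequenceMatcher is a stand-in for the template matching in the abstract.
    """
    a = [normalize(t.word) for t in text_objs]
    b = [normalize(v.word) for v in voice_objs]
    matcher = SequenceMatcher(a=a, b=b, autojunk=False)
    for block in matcher.get_matching_blocks():
        for k in range(block.size):
            t = text_objs[block.a + k]
            v = voice_objs[block.b + k]
            t.start, t.end = v.start, v.end
    return text_objs


if __name__ == "__main__":
    script = "Hello world. This is a test.\n\nSecond paragraph here."
    recognized = [
        VoiceTextObject("hello", 0.0, 0.4),
        VoiceTextObject("world", 0.5, 0.9),
        VoiceTextObject("this", 1.2, 1.4),
        VoiceTextObject("is", 1.4, 1.5),
        VoiceTextObject("test", 1.8, 2.1),  # the recognizer missed "a"
    ]
    for obj in synchronize(build_text_objects(script), recognized):
        print(obj)
```

Words the recognizer missed keep `start` and `end` as `None`, so an editor built on this sketch could flag them for manual correction instead of guessing a time.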
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR20130030117A KR101493006B1 (ko) | 2013-03-21 | 2013-03-21 | Apparatus and method for editing multimedia content |
KR10-2013-0030117 | 2013-03-21 |
Publications (2)
Publication Number | Publication Date |
---|---|
WO2014148665A2 (ko) | 2014-09-25 |
WO2014148665A3 (ko) | 2015-05-07 |
Family
ID=51581569
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/KR2013/002502 WO2014148665A2 (ko) | Apparatus and method for editing multimedia content | 2013-03-21 | 2013-03-26 |
Country Status (2)
Country | Link |
---|---|
KR (1) | KR101493006B1 (ko) |
WO (1) | WO2014148665A2 (ko) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
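CN107908743B (zh) * | 2017-11-16 | 2021-12-03 | 百度在线网络技术(北京)有限公司 | Method and apparatus for building artificial intelligence applications |
CN110444199B (zh) * | 2017-05-27 | 2022-01-07 | 腾讯科技(深圳)有限公司 | Speech keyword recognition method, apparatus, terminal, and server |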
Families Citing this family (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2017131028A1 (ja) | 2016-01-26 | 2017-08-03 | 東レ株式会社 | Polyphenylene sulfide resin composition and method for producing same |
KR102642259B1 (ko) * | 2023-06-22 | 2024-03-04 | 유니닥스 주식회사 | Data processing apparatus for AI training |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20060285654A1 (en) * | 2003-04-14 | 2006-12-21 | Nesvadba Jan Alexis D | System and method for performing automatic dubbing on an audio-visual stream |
US20120041758A1 (en) * | 2007-06-28 | 2012-02-16 | Nuance Communications, Inc. | Synchronization of an input text of a speech with a recording of the speech |
US20120245719A1 (en) * | 2011-03-23 | 2012-09-27 | Story Guy A Jr | Managing playback of synchronized content |
US20120265527A1 (en) * | 2011-04-15 | 2012-10-18 | Hon Hai Precision Industry Co., Ltd. | Interactive voice recognition electronic device and method |
KR20120129015A (ko) * | 2011-05-18 | 2012-11-28 | 조성진 | Method for generating language-learning content and terminal therefor |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
GB8817705D0 (en) * | 1988-07-25 | 1988-09-01 | British Telecomm | Optical communications system |
-
2013
- 2013-03-21 KR KR20130030117A patent/KR101493006B1/ko active IP Right Grant
- 2013-03-26 WO PCT/KR2013/002502 patent/WO2014148665A2/ko active Application Filing
Also Published As
Publication number | Publication date |
---|---|
KR20140115536A (ko) | 2014-10-01 |
KR101493006B1 (ko) | 2015-02-13 |
WO2014148665A2 (ko) | 2014-09-25 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
GB2529564A (en) | | Method, apparatus and system for regenerating voice intonation in automatically dubbed videos |
AU2019268131A1 (en) | | Speech recognition method, speech wakeup apparatus, speech recognition apparatus, and terminal |
EP3767622A3 (en) | | Automatically determining language for speech recognition of spoken utterance received via an automated assistant interface |
WO2009078256A1 (ja) | | Pronunciation variation rule extraction apparatus, pronunciation variation rule extraction method, and pronunciation variation rule extraction program |
WO2014209810A3 (en) | | Methods and apparatuses for mining synonymous phrases, and for searching related content |
EP3001662A3 (en) | | Conference proceed apparatus and method for advancing conference |
SG11201808360SA (en) | | Acoustic model training method, speech recognition method, apparatus, device and medium |
WO2017218243A3 (en) | | Intent recognition and emotional text-to-speech learning system |
MX2014010795A (es) | | Device for extracting information from a dialogue |
WO2014197334A3 (en) | | System and method for user-specified pronunciation of words for speech synthesis and recognition |
WO2017033063A3 (en) | | Statistics-based machine translation method, apparatus and electronic device |
EP3767620A3 (en) | | Speech endpointing based on word comparisons |
EP2963643A3 (en) | | Entity name recognition |
WO2013181158A3 (en) | | Synchronizing translated digital content |
WO2009158581A3 (en) | | System and method for spoken topic or criterion recognition in digital media and contextual advertising |
EP4235648A3 (en) | | Language model biasing |
GB2542288A (en) | | Enhancing reading accuracy, efficiency and retention |
WO2013192218A3 (en) | | Dynamic language model |
WO2014148665A3 (ko) | | Apparatus and method for editing multimedia content |
MX2014015611A (es) | | Method for correcting a speech recognition error and broadcast receiving apparatus applying the same |
WO2012169737A3 (en) | | Display apparatus and method for executing link and method for recognizing voice thereof |
WO2009063445A3 (en) | | A method and apparatus for fast search in call-center monitoring |
WO2012094422A3 (en) | | A voice based system and method for data input |
NZ700273A (en) | | Negative example (anti-word) based performance improvement for speech recognition |
GB2486038B (en) | | Speech-to-text conversion |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
 | 32PN | Ep: public notification in the EP bulletin as address of the addressee cannot be established | Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 03/03/2016) |
 | 122 | Ep: PCT application non-entry in European phase | Ref document number: 13878866; Country of ref document: EP; Kind code of ref document: A2 |