WO2012064110A3 - 스크립트 데이터 생성 방법 및 장치 - Google Patents
스크립트 데이터 생성 방법 및 장치 Download PDFInfo
- Publication number
- WO2012064110A3 WO2012064110A3 PCT/KR2011/008522 KR2011008522W WO2012064110A3 WO 2012064110 A3 WO2012064110 A3 WO 2012064110A3 KR 2011008522 W KR2011008522 W KR 2011008522W WO 2012064110 A3 WO2012064110 A3 WO 2012064110A3
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- syllable
- playback position
- prediction
- audio data
- time information
- Prior art date
Links
- 238000000034 method Methods 0.000 title abstract 2
- 238000004519 manufacturing process Methods 0.000 abstract 1
Classifications
-
- G—PHYSICS
- G11—INFORMATION STORAGE
- G11B—INFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
- G11B20/00—Signal processing not specific to the method of recording or reproducing; Circuits therefor
- G11B20/10—Digital recording or reproducing
- G11B20/10527—Audio or video recording; Data buffering arrangements
-
- G—PHYSICS
- G11—INFORMATION STORAGE
- G11B—INFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
- G11B27/00—Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
- G11B27/10—Indexing; Addressing; Timing or synchronising; Measuring tape travel
- G11B27/102—Programmed access in sequence to addressed parts of tracks of operating record carriers
- G11B27/105—Programmed access in sequence to addressed parts of tracks of operating record carriers of operating discs
-
- G—PHYSICS
- G11—INFORMATION STORAGE
- G11B—INFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
- G11B27/00—Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
- G11B27/10—Indexing; Addressing; Timing or synchronising; Measuring tape travel
- G11B27/34—Indicating arrangements
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/26—Speech to text systems
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/02—Feature extraction for speech recognition; Selection of recognition unit
- G10L2015/027—Syllables being the recognition units
-
- G—PHYSICS
- G11—INFORMATION STORAGE
- G11B—INFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
- G11B20/00—Signal processing not specific to the method of recording or reproducing; Circuits therefor
- G11B20/10—Digital recording or reproducing
- G11B20/10527—Audio or video recording; Data buffering arrangements
- G11B2020/10537—Audio or video recording
- G11B2020/10546—Audio or video recording specifically adapted for audio data
- G11B2020/10555—Audio or video recording specifically adapted for audio data wherein the frequency, the amplitude, or other characteristics of the audio signal is taken into account
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Document Processing Apparatus (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
본 발명은 오디오 데이터에 대한 스크립트 데이터를 생성하는 방법 및 장치에 관한 것으로, 오디오 데이터의 실제 소리 구간의 전체 시간 정보를 획득하는 단계와, 텍스트 데이터에 기초하여 소리 구간에 대한 전체 음절수 정보를 획득하는 단계와, 전체 시간 정보 및 전체 음절수 정보에 기초하여 한 음절에 대응하는 단위 음절 시간 정보를 산출하는 단계와, 텍스트 데이터에서 예측이 필요한 단어 또는 구절까지가 차지하는 소리 구간의 구간 음절수 정보와 단위 음절 시간 정보에 기초하여 오디오 데이터의 대응 소리 구간에 대한 예측 재생 위치 정보를 획득하는 단계와, 예측 재생 위치의 이전 또는 이후에 위치하는 오디오 데이터의 묵음 구간들 중 예측 재생 위치에 가장 인접한 묵음 구간을 실제 재생 위치 정보로 저장하는 단계를 포함하는 스크립트 데이터 생성 방법을 제공한다.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN2011800538470A CN103210447A (zh) | 2010-11-10 | 2011-11-09 | 脚本数据生成方法及装置 |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR1020100111615A KR101030777B1 (ko) | 2010-11-10 | 2010-11-10 | 스크립트 데이터 생성 방법 및 장치 |
KR10-2010-0111615 | 2010-11-10 |
Publications (2)
Publication Number | Publication Date |
---|---|
WO2012064110A2 WO2012064110A2 (ko) | 2012-05-18 |
WO2012064110A3 true WO2012064110A3 (ko) | 2012-07-12 |
Family
ID=44365384
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/KR2011/008522 WO2012064110A2 (ko) | 2010-11-10 | 2011-11-09 | 스크립트 데이터 생성 방법 및 장치 |
Country Status (3)
Country | Link |
---|---|
KR (1) | KR101030777B1 (ko) |
CN (1) | CN103210447A (ko) |
WO (1) | WO2012064110A2 (ko) |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN114282941A (zh) * | 2021-12-20 | 2022-04-05 | 咪咕音乐有限公司 | 广告插入位置的确定方法、装置、设备及存储介质 |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2002318580A (ja) * | 2001-04-23 | 2002-10-31 | Sony Corp | 音声再生装置、音声再生方法、音声再生プログラム、音声再生プログラム格納媒体、およびデータ格納媒体 |
JP2005115391A (ja) * | 2003-10-08 | 2005-04-28 | Agfa Inc | テキストのディスプレイとオーディオの再生とを同期させるためのシステム及び方法 |
JP2005189454A (ja) * | 2003-12-25 | 2005-07-14 | Casio Comput Co Ltd | テキスト同期音声再生制御装置及びプログラム |
JP2009008884A (ja) * | 2007-06-28 | 2009-01-15 | Internatl Business Mach Corp <Ibm> | 音声の再生に同期して音声の内容を表示させる技術 |
JP2010157816A (ja) * | 2008-12-26 | 2010-07-15 | Toshiba Corp | 字幕情報作成装置、字幕情報作成方法及びプログラム |
JP2010233019A (ja) * | 2009-03-27 | 2010-10-14 | Kddi Corp | 字幕ずれ補正装置、再生装置および放送装置 |
Family Cites Families (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2000058943A1 (fr) * | 1999-03-25 | 2000-10-05 | Matsushita Electric Industrial Co., Ltd. | Systeme et procede de synthese de la parole |
JP2005242231A (ja) * | 2004-02-27 | 2005-09-08 | Yamaha Corp | 音声合成装置、音声合成方法、及び音声合成プログラム |
-
2010
- 2010-11-10 KR KR1020100111615A patent/KR101030777B1/ko not_active IP Right Cessation
-
2011
- 2011-11-09 WO PCT/KR2011/008522 patent/WO2012064110A2/ko active Application Filing
- 2011-11-09 CN CN2011800538470A patent/CN103210447A/zh active Pending
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2002318580A (ja) * | 2001-04-23 | 2002-10-31 | Sony Corp | 音声再生装置、音声再生方法、音声再生プログラム、音声再生プログラム格納媒体、およびデータ格納媒体 |
JP2005115391A (ja) * | 2003-10-08 | 2005-04-28 | Agfa Inc | テキストのディスプレイとオーディオの再生とを同期させるためのシステム及び方法 |
JP2005189454A (ja) * | 2003-12-25 | 2005-07-14 | Casio Comput Co Ltd | テキスト同期音声再生制御装置及びプログラム |
JP2009008884A (ja) * | 2007-06-28 | 2009-01-15 | Internatl Business Mach Corp <Ibm> | 音声の再生に同期して音声の内容を表示させる技術 |
JP2010157816A (ja) * | 2008-12-26 | 2010-07-15 | Toshiba Corp | 字幕情報作成装置、字幕情報作成方法及びプログラム |
JP2010233019A (ja) * | 2009-03-27 | 2010-10-14 | Kddi Corp | 字幕ずれ補正装置、再生装置および放送装置 |
Also Published As
Publication number | Publication date |
---|---|
WO2012064110A2 (ko) | 2012-05-18 |
KR101030777B1 (ko) | 2011-05-25 |
CN103210447A (zh) | 2013-07-17 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2013144605A3 (en) | Transcription of speech | |
EP4047497A3 (en) | Speaker verification using co-location information | |
WO2014043027A3 (en) | Improving phonetic pronunciation | |
BRPI0802614A2 (pt) | métodos e aparelhos para codificação e decodificação de sinais de áudio baseados em objeto | |
WO2011152675A3 (en) | Method and apparatus for adaptive streaming based on plurality of elements for determining quality of content | |
WO2011115454A3 (en) | Method and apparatus for adaptively streaming content including plurality of chapters | |
EP2413301A4 (en) | Device and method for generating route restriction information of intersection, computer program for generating route restriction information of intersection, and recording medium for recording computer program | |
WO2011108893A3 (en) | Method and apparatus for generating and reproducing adaptive stream based on file format, and recording medium thereof | |
WO2010087614A3 (ko) | 오디오 신호의 부호화 및 복호화 방법 및 그 장치 | |
PL2491551T3 (pl) | Urządzenie do dostarczania reprezentacji sygnału upmixu w oparciu o reprezentację sygnału downmixu, urządzenie do dostarczania strumienia bitów reprezentującego wielokanałowy sygnał audio, sposoby, program komputerowy i strumień bitów wykorzystujący sygnalizację sterowania zniekształceniami | |
WO2010148141A3 (en) | Apparatus and method for speech analysis | |
PL2888737T3 (pl) | Urządzenie i sposób odtwarzania sygnału audio, urządzenie i sposób do generowania zakodowanego sygnału audio i odpowiadający program komputerowy | |
WO2011071290A3 (en) | Streaming method and apparatus operating by inserting other content into main content | |
PL2489038T3 (pl) | Urządzenie do dostarczania reprezentacji sygnału upmixu na bazie reprezentacji sygnału downmixu, urządzenie do dostarczania strumienia bitów reprezentującego wielokanałowy sygnał audio, sposoby, programy komputerowe i strumień bitów reprezentujący wielokanałowy sygnał audio z zastosowaniem parametru kombinacji liniowej | |
MX2009005159A (es) | Un metodo y un aparato para descodificar una señal de audio. | |
BR112012028272A2 (pt) | método de reprodução de som esterofônico, aparelho de reprodução de som estereofônico, e meio de gravação legível por computador não transitório | |
WO2009051091A1 (ja) | 画像符号化装置及び復号装置、画像符号化方法及び復号方法、それらのプログラム並びにプログラムを記録した記録媒体 | |
PL2380167T3 (pl) | Urządzenie, sposób i program komputerowy do realizacji upmixu sygnału audio downmixu | |
WO2010008232A3 (ko) | 실감 효과 표현 방법 및 그 장치 및 실감 효과 메타데이터가 기록된 컴퓨터로 읽을 수 있는 기록 매체 | |
WO2010143907A3 (ko) | 다객체 오디오 신호를 부호화하는 방법 및 부호화 장치, 복호화 방법 및 복호화 장치, 그리고 트랜스코딩 방법 및 트랜스코더 | |
WO2010008234A3 (ko) | 실감 효과 표현 방법 및 그 장치 및 실감 기기 성능 메타데이터가 기록된 컴퓨터로 읽을 수 있는 기록 매체 | |
MX2019010418A (es) | Dispositivo para codificacion predictiva de imagenes, metodo para codificacion predictiva de imagenes, programa informatico para codificacion predictiva de imagenes, dispositivo para decodificacion predictiva de imagenes, metodo para la decodificacion predictiva de imagenes y programa informatico para decodificacion predictiva de imagenes. | |
EP2290997A4 (en) | DATA STRUCTURE, RECORDING MEDIA, PLAYING DEVICE, PLAY PROCESS AND PROGRAM | |
NO20101150L (no) | Fremgangsmate og system for situasjonsstyrt spraktolkning | |
MX355008B (es) | Aparato de procesamiento de información, medio de grabación de información, método de procesamiento de información, y programa. |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 11839750 Country of ref document: EP Kind code of ref document: A2 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 11839750 Country of ref document: EP Kind code of ref document: A2 |