WO2012064110A3 - 스크립트 데이터 생성 방법 및 장치 - Google Patents

스크립트 데이터 생성 방법 및 장치 Download PDF

Info

Publication number
WO2012064110A3
WO2012064110A3 PCT/KR2011/008522 KR2011008522W WO2012064110A3 WO 2012064110 A3 WO2012064110 A3 WO 2012064110A3 KR 2011008522 W KR2011008522 W KR 2011008522W WO 2012064110 A3 WO2012064110 A3 WO 2012064110A3
Authority
WO
WIPO (PCT)
Prior art keywords
syllable
playback position
prediction
audio data
time information
Prior art date
Application number
PCT/KR2011/008522
Other languages
English (en)
French (fr)
Other versions
WO2012064110A2 (ko
Inventor
임광순
김인송
Original Assignee
Lim Kwang-Soon
Kim In-Song
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Lim Kwang-Soon, Kim In-Song filed Critical Lim Kwang-Soon
Priority to CN2011800538470A priority Critical patent/CN103210447A/zh
Publication of WO2012064110A2 publication Critical patent/WO2012064110A2/ko
Publication of WO2012064110A3 publication Critical patent/WO2012064110A3/ko

Links

Classifications

    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B20/00Signal processing not specific to the method of recording or reproducing; Circuits therefor
    • G11B20/10Digital recording or reproducing
    • G11B20/10527Audio or video recording; Data buffering arrangements
    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B27/00Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
    • G11B27/10Indexing; Addressing; Timing or synchronising; Measuring tape travel
    • G11B27/102Programmed access in sequence to addressed parts of tracks of operating record carriers
    • G11B27/105Programmed access in sequence to addressed parts of tracks of operating record carriers of operating discs
    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B27/00Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
    • G11B27/10Indexing; Addressing; Timing or synchronising; Measuring tape travel
    • G11B27/34Indicating arrangements 
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/26Speech to text systems
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/02Feature extraction for speech recognition; Selection of recognition unit
    • G10L2015/027Syllables being the recognition units
    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B20/00Signal processing not specific to the method of recording or reproducing; Circuits therefor
    • G11B20/10Digital recording or reproducing
    • G11B20/10527Audio or video recording; Data buffering arrangements
    • G11B2020/10537Audio or video recording
    • G11B2020/10546Audio or video recording specifically adapted for audio data
    • G11B2020/10555Audio or video recording specifically adapted for audio data wherein the frequency, the amplitude, or other characteristics of the audio signal is taken into account

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Document Processing Apparatus (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

본 발명은 오디오 데이터에 대한 스크립트 데이터를 생성하는 방법 및 장치에 관한 것으로, 오디오 데이터의 실제 소리 구간의 전체 시간 정보를 획득하는 단계와, 텍스트 데이터에 기초하여 소리 구간에 대한 전체 음절수 정보를 획득하는 단계와, 전체 시간 정보 및 전체 음절수 정보에 기초하여 한 음절에 대응하는 단위 음절 시간 정보를 산출하는 단계와, 텍스트 데이터에서 예측이 필요한 단어 또는 구절까지가 차지하는 소리 구간의 구간 음절수 정보와 단위 음절 시간 정보에 기초하여 오디오 데이터의 대응 소리 구간에 대한 예측 재생 위치 정보를 획득하는 단계와, 예측 재생 위치의 이전 또는 이후에 위치하는 오디오 데이터의 묵음 구간들 중 예측 재생 위치에 가장 인접한 묵음 구간을 실제 재생 위치 정보로 저장하는 단계를 포함하는 스크립트 데이터 생성 방법을 제공한다.
PCT/KR2011/008522 2010-11-10 2011-11-09 스크립트 데이터 생성 방법 및 장치 WO2012064110A2 (ko)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN2011800538470A CN103210447A (zh) 2010-11-10 2011-11-09 脚本数据生成方法及装置

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
KR1020100111615A KR101030777B1 (ko) 2010-11-10 2010-11-10 스크립트 데이터 생성 방법 및 장치
KR10-2010-0111615 2010-11-10

Publications (2)

Publication Number Publication Date
WO2012064110A2 WO2012064110A2 (ko) 2012-05-18
WO2012064110A3 true WO2012064110A3 (ko) 2012-07-12

Family

ID=44365384

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/KR2011/008522 WO2012064110A2 (ko) 2010-11-10 2011-11-09 스크립트 데이터 생성 방법 및 장치

Country Status (3)

Country Link
KR (1) KR101030777B1 (ko)
CN (1) CN103210447A (ko)
WO (1) WO2012064110A2 (ko)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114282941A (zh) * 2021-12-20 2022-04-05 咪咕音乐有限公司 广告插入位置的确定方法、装置、设备及存储介质

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2002318580A (ja) * 2001-04-23 2002-10-31 Sony Corp 音声再生装置、音声再生方法、音声再生プログラム、音声再生プログラム格納媒体、およびデータ格納媒体
JP2005115391A (ja) * 2003-10-08 2005-04-28 Agfa Inc テキストのディスプレイとオーディオの再生とを同期させるためのシステム及び方法
JP2005189454A (ja) * 2003-12-25 2005-07-14 Casio Comput Co Ltd テキスト同期音声再生制御装置及びプログラム
JP2009008884A (ja) * 2007-06-28 2009-01-15 Internatl Business Mach Corp <Ibm> 音声の再生に同期して音声の内容を表示させる技術
JP2010157816A (ja) * 2008-12-26 2010-07-15 Toshiba Corp 字幕情報作成装置、字幕情報作成方法及びプログラム
JP2010233019A (ja) * 2009-03-27 2010-10-14 Kddi Corp 字幕ずれ補正装置、再生装置および放送装置

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2000058943A1 (fr) * 1999-03-25 2000-10-05 Matsushita Electric Industrial Co., Ltd. Systeme et procede de synthese de la parole
JP2005242231A (ja) * 2004-02-27 2005-09-08 Yamaha Corp 音声合成装置、音声合成方法、及び音声合成プログラム

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2002318580A (ja) * 2001-04-23 2002-10-31 Sony Corp 音声再生装置、音声再生方法、音声再生プログラム、音声再生プログラム格納媒体、およびデータ格納媒体
JP2005115391A (ja) * 2003-10-08 2005-04-28 Agfa Inc テキストのディスプレイとオーディオの再生とを同期させるためのシステム及び方法
JP2005189454A (ja) * 2003-12-25 2005-07-14 Casio Comput Co Ltd テキスト同期音声再生制御装置及びプログラム
JP2009008884A (ja) * 2007-06-28 2009-01-15 Internatl Business Mach Corp <Ibm> 音声の再生に同期して音声の内容を表示させる技術
JP2010157816A (ja) * 2008-12-26 2010-07-15 Toshiba Corp 字幕情報作成装置、字幕情報作成方法及びプログラム
JP2010233019A (ja) * 2009-03-27 2010-10-14 Kddi Corp 字幕ずれ補正装置、再生装置および放送装置

Also Published As

Publication number Publication date
WO2012064110A2 (ko) 2012-05-18
KR101030777B1 (ko) 2011-05-25
CN103210447A (zh) 2013-07-17

Similar Documents

Publication Publication Date Title
WO2013144605A3 (en) Transcription of speech
EP4047497A3 (en) Speaker verification using co-location information
WO2014043027A3 (en) Improving phonetic pronunciation
BRPI0802614A2 (pt) métodos e aparelhos para codificação e decodificação de sinais de áudio baseados em objeto
WO2011152675A3 (en) Method and apparatus for adaptive streaming based on plurality of elements for determining quality of content
WO2011115454A3 (en) Method and apparatus for adaptively streaming content including plurality of chapters
EP2413301A4 (en) Device and method for generating route restriction information of intersection, computer program for generating route restriction information of intersection, and recording medium for recording computer program
WO2011108893A3 (en) Method and apparatus for generating and reproducing adaptive stream based on file format, and recording medium thereof
WO2010087614A3 (ko) 오디오 신호의 부호화 및 복호화 방법 및 그 장치
PL2491551T3 (pl) Urządzenie do dostarczania reprezentacji sygnału upmixu w oparciu o reprezentację sygnału downmixu, urządzenie do dostarczania strumienia bitów reprezentującego wielokanałowy sygnał audio, sposoby, program komputerowy i strumień bitów wykorzystujący sygnalizację sterowania zniekształceniami
WO2010148141A3 (en) Apparatus and method for speech analysis
PL2888737T3 (pl) Urządzenie i sposób odtwarzania sygnału audio, urządzenie i sposób do generowania zakodowanego sygnału audio i odpowiadający program komputerowy
WO2011071290A3 (en) Streaming method and apparatus operating by inserting other content into main content
PL2489038T3 (pl) Urządzenie do dostarczania reprezentacji sygnału upmixu na bazie reprezentacji sygnału downmixu, urządzenie do dostarczania strumienia bitów reprezentującego wielokanałowy sygnał audio, sposoby, programy komputerowe i strumień bitów reprezentujący wielokanałowy sygnał audio z zastosowaniem parametru kombinacji liniowej
MX2009005159A (es) Un metodo y un aparato para descodificar una señal de audio.
BR112012028272A2 (pt) método de reprodução de som esterofônico, aparelho de reprodução de som estereofônico, e meio de gravação legível por computador não transitório
WO2009051091A1 (ja) 画像符号化装置及び復号装置、画像符号化方法及び復号方法、それらのプログラム並びにプログラムを記録した記録媒体
PL2380167T3 (pl) Urządzenie, sposób i program komputerowy do realizacji upmixu sygnału audio downmixu
WO2010008232A3 (ko) 실감 효과 표현 방법 및 그 장치 및 실감 효과 메타데이터가 기록된 컴퓨터로 읽을 수 있는 기록 매체
WO2010143907A3 (ko) 다객체 오디오 신호를 부호화하는 방법 및 부호화 장치, 복호화 방법 및 복호화 장치, 그리고 트랜스코딩 방법 및 트랜스코더
WO2010008234A3 (ko) 실감 효과 표현 방법 및 그 장치 및 실감 기기 성능 메타데이터가 기록된 컴퓨터로 읽을 수 있는 기록 매체
MX2019010418A (es) Dispositivo para codificacion predictiva de imagenes, metodo para codificacion predictiva de imagenes, programa informatico para codificacion predictiva de imagenes, dispositivo para decodificacion predictiva de imagenes, metodo para la decodificacion predictiva de imagenes y programa informatico para decodificacion predictiva de imagenes.
EP2290997A4 (en) DATA STRUCTURE, RECORDING MEDIA, PLAYING DEVICE, PLAY PROCESS AND PROGRAM
NO20101150L (no) Fremgangsmate og system for situasjonsstyrt spraktolkning
MX355008B (es) Aparato de procesamiento de información, medio de grabación de información, método de procesamiento de información, y programa.

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 11839750

Country of ref document: EP

Kind code of ref document: A2

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 11839750

Country of ref document: EP

Kind code of ref document: A2