TWI227851B - A method for synchronizing audio and video - Google Patents

A method for synchronizing audio and video Download PDF

Info

Publication number
TWI227851B
TWI227851B TW92137538A TW92137538A TWI227851B TW I227851 B TWI227851 B TW I227851B TW 92137538 A TW92137538 A TW 92137538A TW 92137538 A TW92137538 A TW 92137538A TW I227851 B TWI227851 B TW I227851B
Authority
TW
Taiwan
Prior art keywords
data
sound
time
audio
played
Prior art date
Application number
TW92137538A
Other languages
Chinese (zh)
Other versions
TW200521826A (en
Inventor
Huang-Lin Yang
Original Assignee
Inventec Besta Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Inventec Besta Co Ltd filed Critical Inventec Besta Co Ltd
Priority to TW92137538A priority Critical patent/TWI227851B/en
Application granted granted Critical
Publication of TWI227851B publication Critical patent/TWI227851B/en
Publication of TW200521826A publication Critical patent/TW200521826A/en

Links

Landscapes

  • Television Signal Processing For Recording (AREA)
  • Signal Processing For Digital Recording And Reproducing (AREA)

Abstract

The present invention is in related to a method for synchronizing audio and video, wherein played audio information and video information being assembled by a plurality of audio information segments and video information segments. Before playing, the audio information segments and the video information segments may be processed and stored in an audio information buffer zone and a video information buffer zone temporarily and individually; a relative location of processed audio information segments is a played location at the moment and is picked up in the audio information segments during the processing before playing. The method for synchronizing audio and video is firstly to record an initial writing time of the audio information segments of the audio information buffer zone; secondly to accumulate a total writing amount of the audio information segments of the audio information buffer zone; thirdly to compile a total rest amount of the audio information segments, not played yet, in the audio information buffer zone. The total rest amount is equal to that the total writing amount subtracts a product of a sampling frequency and a passing time. The sampling frequency is defined as a information amount of played video information segments per time unit, and the passing time is described as a difference of a present time subtracting the initial writing time; a real played location is then a difference of a present played location subtracting the total rest amount; at last, a playing of the video information segments is capable of adjusting according to the real played location.

Description

12278511227851

【發明所屬之技術領域】 、本發明係關於一種聲音與影像的同步方法與記錄 =法之電腦可讀取之記錄媒體,特別關於一種藉由估算g 曰身料之實際播放位置調整影像資料之播放的聲音與影像 的同步方法與記錄有該方法之電腦可讀取之記錄媒體。 於現今的電腦應用中,多媒體無疑是最熱門與最重要 =領域之一,藉由多媒體豐富的影像與聲音效果,所欲傳 的内谷可以生動地呈現,並輕易地深植閱聽大眾的心中 於某些 現聲音與影 音的多媒體 料時,資料 衝器中,再 Converter〕 之聲音資料 ,但此時緩 際輸出的聲 致影像的顯 法同步的情 為解決 資料,於其 像無法 内容時 片段經 由數位 輸出 的資料 衝器中 音資料 示比聲 形。 上述問 他作業 業環境下播放多媒體資料時,有時會出 同步的情形,這種情形於播放隨字幕發 尤其顯得嚴重。其成因在於播放聲音資 解壓縮等前置處理後,係先行儲存於緩 類比轉換器(DAC,Digital to Analog 5而某些多媒體播放程式於計算所輸出 量時’係以傳送至緩衝器多少資料為準 可能餘有尚未輸出的資料片段,造成實 車父多媒體播放程式所估算為少,進而導 音的輸出還快,因而出現聲音與影像無 即正確估算實際上輸出了多少聲音 被境有二種作法,其一為由系統於前次[Technical field to which the invention belongs] The present invention relates to a computer-readable recording medium for a method and recording method for synchronizing sound and video, and particularly to a method for adjusting image data by estimating the actual playback position of the figure A method for synchronizing the played sound and video and a computer-readable recording medium recorded with the method. In today's computer applications, multimedia is undoubtedly one of the hottest and most important fields. With rich multimedia image and sound effects, the inner valley that you want to spread can be vividly presented, and it is easy to deeply read and listen to the public. In my mind, when some multimedia materials such as sound and video are present, the sound data in the data buffer, and then Converter], but at this time, the slow synchronization of the apparent synchronization of the sound-induced video output is to solve the data. The mid-range data of the time clip is digitally output through the data buffer. As mentioned above, when playing multimedia materials in an operating environment, synchronization sometimes occurs. This situation is particularly serious when playing with subtitles. The reason for this is that after pre-processing such as decompressing the playback audio data, it is first stored in a slow analog converter (DAC, Digital to Analog 5), and some multimedia player programs are used to calculate how much data is transmitted to the buffer. It may be that there are remaining data fragments that have not been output, resulting in a low estimate of the real car parent multimedia player program, and the output of the guide sound is also fast. Therefore, if there is no sound or video, it is correct to estimate how much sound is actually output. This method is based on the previous system

1227851 五、發明說明(2) :C : I : : t ί f結束後自動呼叫-函數指#,使用者 中I*算&二‘到自己的程式函數,並於此程式函數 -返回長度函纟,此::的長度,其二為由系統端提供 的資料量S 數可於被啤叫時回傳緩衝器中剩餘 於未ί:ί =系f;提供特定的:數’ 用者端難以自行實作,^ ^兄 目、於系統開發端之使 下,改善聲音與影像J 改;系統程式的情況 題。 “、、法冋步的情形,實乃亟需解決的問 【發明内 針對 述特定函 聲音與影 為達 法,其中 複數筆聲 聲音資料 別暫存於 而代表所 位置為何 段中取得 段中首先 容】 上述問 數之作 像無法 上述目 所播放 音資料 片段與 一聲音 處理之 之一目 題,本發 業環境下 同步的情 的,本發 之聲音資 片段與複 影像資料 資料緩衝 聲音資料 月1J播放位 與影像的 音資料緩 明之目 ’無需 形之方 明提供 料與影 數筆影 片段於 區域與 片段相 置可於 同步方 衝區域 的為提供一 改變系統程 法。 一種聲音與 像資料係分 像資料片段 播放前需經 一影像資料 對於聲音資 前置處理時 法首先將記 之聲音資料 種於未 式而能 影像的 別由相 所組成 前置處 緩衝區 料片段 由聲音 錄聲音 片段之 提供上 夠改善 同步方 對應之 ,上述 理並分 域中, 之相對 資料片 資料片 一啟始 第10頁 1227851 五、發明說明(3) 寫入時間,其 區域中之聲音 資料片段中已 資料片段之一 寫入資料量扣 得之差,而取 之資料量大小 間,再接著將 目前播放位置 際播放位置調 ^ 此外,本 π己錄〜聲音與 ,决所播放之 3聲音資料片 f料片段與影 子於〜聲音資 表所處理之聲 i何之一目前 知。此聲音 $'先寫入聲 :時間,其次 埤中之聲音資 :4片段中已寫 ;斗片段之一總 次累計聲音資料片 資料片 寫入聲 段之一總寫 音資料緩衝 總剩餘資料量,其 除一取樣頻率與一 樣頻率為每單位時 ,經過時間則為一 計算一 扣除總 整影像 發明亦 影像的 聲音資 段與複 像資料 料緩衝 音資料 播放位 與影像 音資料 累計聲 料片段 入聲音 剩餘資 實際播放位 剩餘資料量 資料片段之 提供一種電 同步方法, 料與影像資 數筆影像資 片段於播放 區域與一影 片段相對於 置可於前置 的同步方法 緩衝區域之 音資料片段 之一總寫入 資料緩衝區 料量,其中 段中已 入資料 區域中 中上述 經過時 間欲播 現在時 置,其 後所得 播放。 腦可讀 運用此 料係分 料片段 前需經 像資料 聲音資 處理時 首先將 聲音資 中已寫 資料量 域中而 上述總 寫入聲音 量,接著 而尚未播 總剩餘資 間二者之 出之聲音 間扣除啟 中實際播 之差,最 取之記錄 聲音與影 別由相對 所組成, 前置處理 緩衝區域 料片段之 由聲音資 記錄聲音 料片段之 入聲音資 ’接著計 尚未播放 剩餘資料 資料緩衝 計算聲音 放之聲音 料量為總 乘積後所 資料片段 始寫入時 放位置為 後依據實 媒體,其 像的同步 應之複數 上述聲音 並分別暫 中,而代 相對位置 料片段中 資料片段 一啟始寫 料緩衝區 算聲音資 之聲音資 量為總寫1227851 V. Description of the invention (2): C: I:: t 自动 Automatically call after the end-function refers to #, I * in the user calculates & two 'to his own program function, and in this program function-returns the length The length of the function: This is the length of the data. The second is the amount of data provided by the system. The number S can be left in the buffer when the call is made. It is difficult to implement it on your own, ^ ^ Brother, under the system development side, improve sound and video J; system program problems. "The situation where the method is pacing is really a question that needs to be resolved [the invention refers to the specific letter sounds and shadows as the method of expression, in which the plurality of pen sounds and sound data are not temporarily stored, which represents the location of the paragraph obtained in the paragraph First of all] The above-mentioned questions ca n’t be used as one of the audio data fragments and sound processing items in the above project. If the situation is synchronized under the environment of the industry, the audio material fragments and the complex image data buffer the sound data. On January 1J, the audio and video data of the playback position and the image are clear. "No need to form Fang Ming to provide material and shadows. Several video segments are placed in the area and the fragments are placed in the same area to provide a systematic method for changing the system. A sound and The image data is a segmented data segment. Before playing, it is necessary to pass an image data. For the preprocessing of sound data, the recorded sound data is first planted in a non-formable image. The provision of recorded audio clips is enough to improve the synchronization of the corresponding party. The above data is divided into domains. Page 10 1227851 V. Description of the invention (3) Writing time, the difference between the amount of written data in one of the data segments in the sound data segment in the area, and the difference between the amount of data taken, and then the current playback Inter-position playback position adjustment ^ In addition, this π has been recorded ~ sound and, the 3 sound data pieces f and clips played ~ ~ sounds are processed by the sound information table is currently known. This sound $ '先Written sound: time, followed by sound data: written in 4 clips; one of the bucket clips total cumulative sound data piece data piece written sound piece one total written data buffer total remaining data amount, divided by one When the sampling frequency and the same frequency are per unit, the elapsed time is calculated by deducting the total video image and the audio data segment of the video and the composite data. The buffered audio data playback bit and the video and audio data are accumulated. An actual synchronization method is provided for the remaining amount of data in the actual playback position. Data and image data. Several image data clips are placed in the playback area and a video segment. One of the audio data segments in the buffer area of the synchronization method is written into the data buffer volume, and the above-mentioned elapsed time in the data area in the segment is to be broadcasted to the current position, and then played back. Brain-readable use of this material system Before segmenting the clip, it is necessary to process the audio data of the video data first. The total written audio volume in the audio data field is first recorded, and then the remaining audio data between the two remaining data is not broadcasted. The difference between the broadcast sound and the best recorded sound is composed of the relative. The pre-processing buffer area material fragments are recorded by the sound data and the sound material fragments are included in the sound data. Then the remaining data is buffered and the sound is calculated. The material volume is the total product. When the data segment starts to be written, the playback position is based on the real media. The synchronization of the image should be the plural of the above sounds and temporarily suspended. Instead, the data segment in the relative position material segment starts the writing buffer. The volume of the sound data of the district calculation is the total write

1227851 五、發明說明(4) 入資料量扣除 之差,而取樣 資料量大小, ’再接著將計 前播放位置扣 播放位置調整 由於本發 資料之實際播 提供特定的函 音與影像的同 所有作業環境 的情形下,使 步的情形的問 一取樣頻 頻率為每 播放位置 資料量後 片段之播 與影像的 整影像資 需改變系 受特定作 底解決了 以自行實 率與一經 單位時間 經過時間則為一現 算一實際 除總剩餘 影像資料 明之聲音 放位置調 數’亦無 步方法不 。因而徹 用者端難 題0 二者之乘積後所得 之聲音資料片段之 扣除啟始寫入時間 實際播放位置為目 差,最後依據實際 法係藉由估算聲音 放,故無需系統端 ,因之本發明之聲 之限制,而適用於 端未提供特定函數 決聲音與影像不同 【實施方式】 以下將參照附隨的圖示, 二與影像的同步方法與電腦可讀取之4 J較佳實施例之聲 、兀:將以相同的參照符號加以說明媒體。其中相同 ㈡參照第1圖所示,本發明之簦土 rt ’、用T處理多媒體資料於播放時所產::影像的同步方法 壓缩i述多媒體資料同時包含聲音與影伤=同步的情形。 格式加以儲 像負料,诵奮在η 且聲音與影像資料可經如解屋縮等前置處理, 對應之複數筆聲立時間輸出的資料量分;為相 皁曰身料片段與影像資料 刀口】馮相 _____ 乃#又。刖置處理完 第12頁 1227851 五、發明說明(5) 料片段於實際播放前,將分㈣存於 數位類比轉換器(DAC,❿」:出,衝"’而後再由 輪出聲音資料片f以及ώ恶s舶 Anai〇g Converter ) ^片奴以及由螢幕顯示影像資料片段。 本發明之要求,未經饜縮夕觫立~这聲曰/、衫像負料並非 明之同步方法。、、 曰^影像資料亦可適用本發 請參照第2圖所示,本發明之聲 ?藉由計”音資料輸出緩衝器中剩; 里,間接估异實際輸出的聲音資料 #备貝料 至緩衝器中之聲音資料的資粗1 =的貝科里’避免以傳送 的資料量而產生的誤差:際輪出的聲音資料 。本發明之聲音與影像的聲音與影像同步的效果 寫入聲音資料片段(S2〇1)驟包含:判斷是否 聲音π輸出緩衝器之啟始寫u:日』:段。寫人 入聲音貝料輸出緩衝器中之聲音父=寫 (s2〇3)、判斷是否暫停、跳躍或重新播放、;二;料量 器中而尚未播*之聲音資料;| ” A聲音資料輸出緩衝 、計算實際播放位置(=段=餘資料量(㈣” 像資料片段之播放(S20 8 )、播二:際置調整影 衝器中而尚未播放之聲音f J、、入聲音資料輪出緩 料量歸零(S210)、判;;==)、將總寫入資 以及播放已寫入聲音資料聲音貝料…S2U ) 叶後衝為中而尚未播放之聲音資料 第13頁 1227851 五、發明說明(6) 牛古、i 2 1 2 ),以下將詳細說明本發明之聲立盥公你 V方法的各個步驟。 耸曰與衫像的同 = ,首先 續流程中,θ通劲钚‘ 早曰貝枓片段,於後 步驟S202。 則、、,°束本,瓜程,若是,則繼續執行 於步驟S202,將畔位叹立次μ " 輸出緩衝器之啟始寫入=間?二二$首先寫人聲音資料 輸出的聲音資料的資料旦二 ρ、〃彳的步驟於估算實際 上述啟始寫入時間ct經過的時間長度,其中 於步_3,將間。 聲音資料片段之總寫入資二,^二貝料輪出緩衝器中之 值為零,隨著聲音資料片ρ ^ ,寫入資料量之初始 累計。 片'又的寫入,總寫入資料量亦隨之 舄/主思者,總寫入資料量與聲音資粗认 存之聲音資料片段的資料量無Μ亦=輪出緩衝器中現 t:際由數位類比轉換器輸出而減少:。:者聲音資料片 料輸出緩衝器中之各筆聲立此外’寫入聲音資 同的資料量大小,但即便;音資又段::情形下具有相 不同亦無礙於本發明之實施。 波此間資料量大小 於步糊4,將判;;否暫停 是,則執行步驟S2 09 ’播放已寫入聲立次,重新播放,若 ί = $之聲音資料片段,接著再執行衝器中而尚 …歸零,以便以暫#、跳躍或重;=,將總寫入 輝放之目標始點為1227851 V. Description of the invention (4) The difference between the amount of input data and the amount of sampled data, and then the pre-calculated playback position is deducted from the playback position. Because the actual broadcast of this data provides a specific function and the same In the case of the operating environment, the frequency of the sampling is set to the frequency of the playback of the clip and the image after each amount of data in the playback position. The overall image data needs to be changed. It is resolved by a specific background. The time is a current calculation and the actual number of sound placement position adjustments except for the total remaining image data. Therefore, the user-side problem 0 is the product of the two. The deduction of the initial writing time and the actual playback position of the audio data segment is the difference. Finally, the sound is estimated by the actual method, so there is no need for the system side. The limitation of the sound of invention, but applicable to the end does not provide a specific function, the sound is different from the image [Implementation] The following will refer to the accompanying diagram, the second method of synchronization with the image and the computer-readable 4 J preferred embodiment Voice, Wu: The media will be explained with the same reference symbols. It is the same as shown in FIG. 1. According to the present invention, rt ′, which is used to process multimedia data during playback, is a method of synchronizing video: Compression of multimedia data including both sound and shadow damage = synchronization. Format is used to store image negative materials, chanting in η, and sound and image data can be pre-processed such as unpacking and shrinking, corresponding to the amount of data output by the time of multiple pen sounds; for body soap and image data Incision] Feng Xiang _____ 乃 # Also. Page 121227851 after finishing the processing. V. Description of the invention (5) Before the actual clip is played, the data will be stored in a digital analog converter (DAC, ❿): output, punch " ', and then the sound data will be rotated out. Movie f and Anai 0g Converter) ^ film slave and video data clips displayed on the screen. The requirements of the present invention do not stand without curling up-this sound is not a clear synchronization method. The image data is also applicable to this issue. Please refer to Figure 2. The sound of the present invention is left in the audio data output buffer; indirectly, the actual output audio data is indirectly estimated. # 备 贝 料The thickness of the sound data in the buffer is 1 = Beckory 'to avoid the error caused by the amount of data transmitted: the sound data of the international rotation. The sound and video of the present invention are written with the effect of sound and video synchronization The sound data segment (S2〇1) includes: determine whether the beginning of the sound π output buffer is written u: day :: segment. Write the sound into the sound shell output buffer. Parent = write (s2〇3), Determine whether to pause, skip, or replay; 2. Sound data in the gauge that has not yet been played *; | ”A sound data output buffer, calculate the actual playback position (= segment = remaining data amount (㈣)) Play (S20 8), play two: Intermediate adjustment of the unplayed sound f J, the input and output of the sound data roll out to zero (S210), judgment;; ==), write the total Information and playing the written sound data sound material ... S2U) Ye Houchong is a medium and still Unplayed sound data page 131227851 V. Description of the invention (6) Niu Gu, i 2 1 2), the following will explain in detail the steps of the V method of the voice of the present invention. = , Continued in the process, θ Tong Jin 钚 'earlier said the 枓 枓 fragment, and then step S202. Then ,,, ° Bunben, Gua Cheng, if it is, then continue to step S202, the next position will be sighed μ " The initial writing of the output buffer = time? 22 $ First write the data of the sound data output by the human voice data. The steps of ρ and 〃 彳 are to estimate the actual elapsed time of the above initial writing time ct. Among them, step _3, the time. The total written data of the audio data segment is two, and the value in the ^ 2 material wheel out buffer is zero. With the audio data piece ρ ^, the initial accumulated amount of written data. 'Writing again, the total amount of written data will also follow / thinker, the total amount of written data and the amount of sound data in the sound data segments that are roughly recognized by the sound data are not M = = now in the rotation buffer t: Reduced by the digital analog converter output :: each of the audio data sheet output buffer In addition, Shengli 'writes the same amount of data as the sound information, but even if the sound information is different: the situation is different without affecting the implementation of the present invention. The amount of data here will be judged in step 4; ; No Pause Yes, execute step S2 09 'Play the written sound immediately, and replay, if ί = $ 的 之 音 数据 段, and then execute the puncher and still ... return to zero, in order to temporarily #, jump or Heavy; =, the starting point of writing the total target is

IMIM

咖 第14頁 1227851 五、發明說明(7) 準,重新累計寫入聲音資料輪 之總寫入資料量,再接著執行 於步驟S204之判斷結果為否,'Page 14 1227851 V. Description of the invention (7) Yes, re-accumulate the total amount of written data written in the sound data wheel, and then execute the judgment result in step S204 is no,

於步驟S20 5,將判斷此時Y 行步驟S211,判斷是否還有聲^ ,則回到步驟S2 0 3,繼續於聲1 出緩衝器時累計總寫入資料量7 播放已寫入聲音資料緩衝器= ’然後結束本流程。 若於步驟S2 05之判斷結 ’進行下述計算聲音I' 枓η &々& 孝曰貝枓輪出緩 枓片敫之總剩餘資料量的步驟 _=驟82()6,將計算已寫i G播匕聲音資料片段之總剩 先為计异已經過的時間長产, 間扣除步驟S202所記錄之: 為系統時間;其次“ = 過時間内實際播出 片;樣頻率為每單位時 料^二4量大小;最後將步驟 里扣除於經過時間内 即為寫入聲音資料仏山’際播出 ^ 貝枓輸出緩衝哭中 又之總剩餘資料量。 口口 於步驟S2Q7,將計算實際播 緩衝器中之聲音資料片段 驟3201及其後續步驟。i 繼續執行步驟S20 5。 否顯示影像,*否,則執 資料片段等待輪出,若是 ::片段寫入聲音資料輸 右否,則執行步驟S21 2, 尚未播放之聲音資料片段 疋’則繼續執行步驟S 20 6 衝器中尚未播放之聲音資 聲音資 餘資料 經過時 入時間 過時間 之聲音 間欲播 S20 3 累 之聲音 而尚未 料輸出緩 量,其計 間長度可 而得,其 乘以取樣 資料片段 出之該等 計所得之 資料片段 播放之聲 衝器中而 算方式首 由現在時 中上述現 頻率,即 之資料量 聲音資料 總寫入資 之資料量 音資料片 放位置,其可由目前播放At step S205, it will be judged at this time that the line of step S211 is judged at this time, and it is judged whether there is still sound ^, then go back to step S203, and continue to accumulate the total amount of written data when the sound is out of the buffer 7 Play the written sound data buffer == Then end this process. If it is judged in step S205 that 'the following calculation sound I' is performed: 枓 η & 々 & Xiao Yue Bei's step out of the total remaining data amount of the relief film _ = step 82 () 6, will be calculated The total remaining of the i G broadcast sound data clips is the long time that has passed since the calculation of the difference, and the time recorded in step S202 is deducted: is the system time; secondly, "= the actual broadcast of the film in the past time; the sample frequency is every The unit time is ^ 2 and 4; the final deduction of the step within the elapsed time is the writing of the sound data. 仏 山 '际 播 出 枓 The total remaining data in the output buffer crying. In step S2Q7, the Calculate the audio data segments in the actual playback buffer. Step 3201 and subsequent steps. I Continue to step S20 5. If no image is displayed, * No, execute the data segment and wait for rotation. , Then step S21 2 is performed, and the unplayed sound data segment 疋 ′ is then continued to step S 20 6 the unplayed sound data in the puncher, the remaining data of the audio data, and the sound that has passed the time and time elapses, and S20 3 tired sound will be played. And not yet lost The amount of time is calculated by multiplying it by the sampled data fragments. The calculation method is based on the current frequency above the present frequency, that is, the amount of sound data. The position of the total volume of data and audio data, which can be played by the current

第15頁 1227851 五、發明說明(8) __ 位置扣除該總剩餘資料量後而得。其 係代表經前置處理的聲音資料中理播放位置 間最相近之聲音資料片段相對於聲間與目前時 音資靡,故目前播放位置未播放之聲 其誤差即為聲音資料輸出緩衝器中差’ 段的資料量。 ^之聲音資料片Page 15 1227851 V. Description of the invention (8) __ This position is obtained after deducting the total remaining data. It represents the most similar sound data segment between the pre-processed sound data and the sound playback position. Compared with the sound and the current time, the sound is very expensive. Therefore, the error of the sound that is not played at the current playback position is in the sound data output buffer. Poor 'amount of data. ^ Sound Expansion

舉例而言’請參照第3圖所示,假設 、B、C、D、E等五筆聲音資钮 卓曰貝料包含A 置之定義,目前播放位置為c、D:聲::::二前播放位 由於尚有B、c等二聲音資料片段暫存於聲立間,但 器中尚未播放,故實際播放 、:以:、緩衝 與實際播放位置間之誤差。 马目則播放位置 於伙頭播放多媒體資料且於 躍或重新播放的情形之下,實際:並無暫停、跳 以取樣頻率而直接求得,但於;2可由經過時間乘 於播放過矛呈中有暫停 戈^新播二#非從頭播放或 據上述之方式求取的置情形時’仍須依 位置調整影像資料片段之播放嶋 像資料“,則4捨料相對應之影 片段加以前置處理並輸出。完成本步驟後,更料For example, please refer to Figure 3. Assume that five strokes, B, C, D, E and other five-tone sound buttons include the definition of A, and the current playback position is c, D: sound :::: Because there are still two audio data segments such as B and c temporarily stored in the sound stand, but not yet played in the device, the actual playback, ::, buffer and the actual playback position error. Mamu's playing position is when the player plays multimedia data and jumps or replays. Actually: there is no pause, skip directly to the sampling frequency, but at; 2 can be multiplied by the elapsed time and played. There is a pause Ge ^ 新 播 二 # When the situation is not played from the beginning or obtained according to the above method, 'the image data of the video data segment must still be adjusted according to the position', then the video segment corresponding to the 4 material is added before Set processing and output. After completing this step, more material

1227851 五、發明說明(9) S211,判斷是否還有聲 本流程或循環執行上述 由於本方法預先於 寫入時間,並於播放流 器中之聲音資料片段之 時,得以依據經過時間 際播出之聲音資料片段 輸出緩衝器中剩餘之聲 料輸出緩衝器中剩餘之 位置校正為實際播放位 像資料片段之播放,達 又,本發明係於應 定的函數,亦無需改變 的同步方法不受特定作 環境。因而徹底解決了 ,使用者端難以自行實 的問題。 音資料片 步驟S203 聲音資料 程中累計 總寫入資 及取樣頻 之資料量 音資料的 聲音資料 置,最後 成聲音與 用程式端 系統程式 業環境之 於系統端 作以解決 #又荨待輸出 至步驟S208 開始播放時 已寫入聲音 料量,而於 率’計算出 ’再間接計 資料量,進 的資料量得 依據實際播 影像同步的 實作’無需 ,故本發明 限制,而適 未提供特定 聲音與影像 ,以決 之流程 ,記錄 資料輸 顯示影 經過時 算出聲 而藉由 以將目 放位置 效果。 系統端 之聲音 用於所 函數的 不同步 定結束 〇 下啟始 出緩衝 像資料 間内實 音資料 聲音資 前播放 調整影 提供特 與影像 有作業 情形下 的情形1227851 V. Description of the invention (9) S211, to determine whether there is still sound in this process or to execute the above loop. Because this method is pre-written at the time, and when the sound data segment in the streamer is played, it can be broadcast based on the elapsed time. The remaining positions in the sound material output buffer remaining in the sound data piece output buffer are corrected to the actual playback of the bit image data piece. In addition, the invention is a function that should be determined, and the synchronization method that does not need to be changed is not subject to specific Operating environment. So it completely solves the problem that it is difficult for the user to implement it by themselves. Step S203 of the audio data piece. The total volume of data and sampling frequency data accumulated in the audio data process is set to the audio data. Finally, the sound and the program-side system are used to solve the problem. To step S208, the volume of audio data has been written at the beginning of playback, and the data amount is calculated indirectly at the rate of 'calculation'. The amount of data entered can be based on the implementation of the actual broadcast image synchronization. Provide specific sounds and images to determine the process, record data to calculate the sound when the video is displayed, and use it to position the effect. The sound on the system side is used for the asynchronous synchronization of all functions. The end will start. The next buffer will be output. The image data will be real audio data. The audio data will be played before.

1227851 圖式簡單說明 【圖式簡單說明】 第1圖係顯示多媒體資料輸出流程之示意圖; 第2圖係顯示本發明之聲音與影像的同步方法之流程圖; 第3圖係顯示目前播放位置與實際播放位置的誤差之示意 圖。 【圖式符號說明】 S201 判斷是否寫入聲音資料片段 S20 2 記錄聲音資料片段首先寫入聲音資料輸 出緩衝器之啟始寫入時間1227851 Schematic illustration [Schematic description] Figure 1 shows the schematic diagram of the multimedia data output process; Figure 2 shows the flowchart of the method for synchronizing the sound and video of the present invention; Figure 3 shows the current playback position and Schematic diagram of the actual playback position error. [Symbol description] S201 Determines whether to write the audio data segment S20 2 Records the audio data segment first writes the start time of the audio data output buffer

5203 累計已寫入聲音資料輸出緩衝器中之聲 音資料片段之總寫入資料量 5204 判斷是否暫停、跳躍或重新播放 S20 5 判斷是否顯示影像 S20 6 計算已寫入聲音資料輸出緩衝器中而尚 未播放之聲音資料片段之總剩餘資料量 S20 7 計算實際播放位置 S208 依據實際播放位置調整影像資料片段之 播放5203 Accumulated total written data volume of sound data fragments written in the sound data output buffer 5204 Determine whether to pause, skip, or replay S20 5 Determine whether to display the image S20 6 Calculate the data written to the sound data output buffer but not yet Total remaining data volume of the audio data segment being played S20 7 Calculate the actual playback position S208 Adjust the playback of the video data segment according to the actual playback position

S20 9 播放已寫入聲音資料輸出緩衝器中而尚 未播放之聲音資料片段 5210 將總寫入資料量歸零 5211 判斷是否尚有聲音資料片段 5212 播放已寫入聲音資料緩衝器中而尚未播 放之聲音資料片段S20 9 Play the audio data fragments that have been written to the audio data output buffer but have not yet played 5210 Reset the total written data amount to 5211 Determine whether there are still audio data fragments 5212 Play the audio data buffer that has been written to the audio data buffer but has not yet played Audio clip

第18頁Page 18

Claims (1)

1227851 六、申請專利範圍 1 · 一種聲音 與影像資 複數筆影 等影像資 一聲音資 表所處理 相對位置 音資料片 記錄該 區域之聲 累計該 域中之聲 計算該 域中而尚 其中該總 率與一經 率為每單 大小,該 與影像的同 料係分別由 像資料片段 料片段於播 料緩衝區域 之聲音資料 為何之一目 段中取得, 荨聲音資料 音資料片段 專聲音資料 音資料片段 等聲音資料 未播放之聲 剩餘資料量 過時間二者 位時間欲播 經過時間為 #方法 才目對應 所組成 玫前需 與一影 片段相 前播放 聲音與 片段中 之一啟 片段中 之一總 ’其中所 之複數筆 ,該等聲 經前置處 像資料緩 對於該等 位置可於 影像的同 首先寫入 始寫入時 已寫入該 寫入資料 已寫入該 片段中 音資料片段之 為該總 之乘積 寫入資料 後所得之 出之該等聲音資 一現在時間扣除 播放之 聲音資 音資料 理並分 衝區域 聲音資 前置處 步方法 該聲音 間; 聲音資 量; 聲音資 總剩餘 量扣除差,而 料片段 該啟始 聲音資料 料片段與 片段與該 別暫存於 中,而代 料片段之 理時由聲 包含: 資料緩衝 料緩衝 [^r 料緩衝區 資料量, 一取樣頻 該取樣頻 之資料量 寫入時間 义什异一貫際播放位置,其中該實際播放位置為該 則播放位置扣除該總剩餘資料量後所得之差;以^目 依據該實際播放位置調整該等影像資料片段 。 心播放 如申請專利範圍第1項所述之聲音與影像的同步方去1227851 VI. Scope of patent application1. A sound and video data, such as multiple strokes, and other video data. The relative position of the audio data sheet records the sound in the area. The sound in the domain is accumulated. The sound in the domain is calculated. The rate and the time-lapse rate are the size of each order. The same material with the image is obtained from the audio data of the audio data in the buffer area of the broadcast data segment. The audio data is the audio data segment. Wait until the sound data is not played. The remaining data volume is over time. Both time is to be played. The elapsed time is #. 'The plural of them, these sounds are pre-processed by the image data. For these positions, they can be written at the same time as the first write of the image. The sound information obtained after writing the data for the total product is the current time minus the sound information data that is played. And divide the area of the sound resources in front of the sound method; the sound resources; the total remaining amount of sound resources minus the difference, and the material fragments should be stored in the beginning of the sound data material fragments and fragments, and substitute materials The rationale of the clip includes: data buffer data buffer [^ r data buffer data amount, a sampling frequency of the data amount of the sampling frequency writing time meaning is very consistent consistent playback position, where the actual playback position is the playback The difference between the position after deducting the total remaining data amount; adjust the image data fragments according to the actual playback position with a mesh. Heart playback as described in item 1 of the scope of patent application 第19頁 1227851 六、申請專利範圍 ____ 〉其中該聲音資料緩衝區域為聲音 (audio play buffer )。 貧钭輸出緩衝盗 圍;ir述之聲音與影像的同步方* 4 “ : ί 時間取自於系統時間。 4.如2專利範圍第工項所述 ,其中該現在時間取自於系統時間:、-像的问步方法 5 .如申請專利範圍第i項所述之 ,更包含播放該等聲音資料 二/、衫像的同步方法 緩衝區域中而尚未播放聲立寫入該聲音資料 6. 如申請專利範圍第5項所以 ,更包含於該等聲音資料 【^衫像的同步方法 衝區域中而尚未播放之聲片已;:該聲音資料緩 其後該等聲音資料片段中首先 放元畢後,將 域之聲音資料片段的寫 音資料緩衝區 ,並將該總寫入資料量歸^門6又疋為該啟始寫入時間 7. 如申請專利範圍第丄項心聲音 ’更包含清除該等影像資料 /已=像的同步方法 8 緩衝區域中而尚未播放之二人該景Μ象資料 腦可讀取之記錄媒體=記:= 同步方法,運用該聲音盥旦‘耳曰與影像的 音資料與影像資料係八=像的同ν方法所播放之聲 片段與複數筆影像資二片相對,之複數筆聲音資料 段與該等影像資料片段,該等聲音資料片 暫存於-聲音資料緩衝::;:= = =並分別 貝科綾衝區域中Page 19 1227851 VI. Scope of Patent Application ____〉 The audio data buffer area is audio play buffer. The poor output buffers stolen; the synchronizing side of the sound and the video described by ir * 4 ": ί time is taken from the system time. 4. As described in item 2 of the patent scope, where the current time is taken from the system time: 、 -Image step method 5. As described in item i of the scope of the patent application, it also includes the synchronization method to play these sound data II /, shirt image synchronization buffer area and has not yet played the sound to write the sound data 6. If the scope of the patent application is the fifth item, it is included in the sound data [^ shirt image synchronization method and has not yet played the sound film ;: the sound data is later put in the sound data fragment first yuan After the completion, the voice data buffer of the voice data segment of the domain is returned, and the total written data volume is returned to the gate 6 and the initial writing time 7. If the patent application scope item 丄 heart sound 'more Contains a method of synchronizing the erasing of such image data / already = 8. The two persons in the buffer area who have not yet played the scene image data can be read by the brain = note: = synchronization method, using this sound Audio The sound clips played by the same ν method as the image data are in contrast to the two pieces of image data. The plurality of sound data pieces and these image data pieces are temporarily stored in the-sound data buffer. ::;: = = = 六、申請專利範圍 ,而指示所處理之聲音資料片段相 片=對位置為何之一目前播敌 :料片段中取得’該聲音與影像的同心 記錄該等聲音資料片段中首先寫入該聲音資 區域之聲音資料片段之一啟始寫入時間; 緩衝 累計該等聲音資料片段中已寫入該聲音資料緩 域中之聲音資料片段之一總寫入資料量; οσ 計算該等聲音資料片段中已寫入該聲音資料緩衝區 域中而尚未播放之聲音資料片段之一總剩餘資料量, 其中該總剩餘資料量為該總寫入資料量扣除二取樣頻 率與一,過時間二者之乘積後所得之差,而該取樣頻 率為每t位時間欲播出之該等聲音資料片段之數目, 該經,時=為一現在時間扣除該啟始寫入時間; 計异一實際播放位置,其中該實際播放位置為該目 前播放位置扣除該總剩餘資料量後所得之差;以及 依據該實際播放位置調整該等影像資料片段之播放 9 ·如:ΐ: 8項所述之電腦可讀取之記錄媒體 :〇以:衝區域為聲音資料輪出緩衝器 (audio Play buffer )。 山及 j 口 10.如第8項所述之電腦可讀取之記錄媒體 ,寫入時間取自於系統時間。 η·如申Μ 8項所述之電腦可讀取之記錄媒體 上2278516. The scope of the patent application, and the photo of the audio data segment that is being processed is indicated as to which one is currently playing: the material segment is obtained from the concentric record of the sound and video. The audio data segment is first written into the audio data area. The initial writing time of one of the sound data fragments; buffering and accumulating the total amount of written data of one of the sound data fragments in the sound data buffer domain that has been written into the sound data fragments; οσ Calculating the The total remaining data amount of one of the sound data fragments written into the sound data buffer area but not yet played, wherein the total remaining data amount is obtained by subtracting the product of two sampling frequencies and one from the total written data amount and the time two Difference, and the sampling frequency is the number of the audio data segments to be broadcasted every t-bit time, the time, hour = is the current time minus the initial writing time; calculate the actual playback position, where the The actual playback position is the difference between the current playback position and the total remaining data amount; and adjusting the video data based on the actual playback position The playback period of 9.3: ΐ: 8 of the computer readable recording medium of: square to: a sound data region round punch-out buffers (audio Play buffer). Mountain and J port 10. The computer-readable recording medium as described in item 8, the writing time is taken from the system time. η · A computer-readable recording medium as described in Item 8 on 227851 第22頁Page 22
TW92137538A 2003-12-30 2003-12-30 A method for synchronizing audio and video TWI227851B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
TW92137538A TWI227851B (en) 2003-12-30 2003-12-30 A method for synchronizing audio and video

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
TW92137538A TWI227851B (en) 2003-12-30 2003-12-30 A method for synchronizing audio and video

Publications (2)

Publication Number Publication Date
TWI227851B true TWI227851B (en) 2005-02-11
TW200521826A TW200521826A (en) 2005-07-01

Family

ID=35667163

Family Applications (1)

Application Number Title Priority Date Filing Date
TW92137538A TWI227851B (en) 2003-12-30 2003-12-30 A method for synchronizing audio and video

Country Status (1)

Country Link
TW (1) TWI227851B (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7787578B2 (en) 2005-07-12 2010-08-31 Realtek Semiconductor Corp. Method and apparatus for synchronizing multimedia data stream

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7787578B2 (en) 2005-07-12 2010-08-31 Realtek Semiconductor Corp. Method and apparatus for synchronizing multimedia data stream
US8040991B2 (en) 2005-07-12 2011-10-18 Realtek Semiconductor Corp. Method and apparatus for synchronizing multimedia data stream

Also Published As

Publication number Publication date
TW200521826A (en) 2005-07-01

Similar Documents

Publication Publication Date Title
WO2018149175A1 (en) Video-recording method and apparatus, and electronic device
TW556154B (en) Real-time control of playback rates in presentations
JP6367334B2 (en) Video processing method, apparatus, and playback apparatus
CN106412645B (en) To the method and apparatus of multimedia server uploaded videos file
US7820901B2 (en) Information management method, information management program, and information management device
JP2007188597A (en) Content reproduction device and content reproduction method, and program
WO2016202176A1 (en) Method, device and apparatus for synthesizing media file
JP2008166914A (en) Method and apparatus for synchronizing data stream of content with meta data
WO2017076304A1 (en) Audio data processing method and device
KR20090039408A (en) Apparatus and method for providing the thread of a contents
CN111435600B (en) Method and apparatus for processing audio
US20140089797A1 (en) System and method for generating event distribution information
WO2023160288A1 (en) Conference summary generation method and apparatus, electronic device, and readable storage medium
CN108429931A (en) A kind of method for broadcasting multimedia file and device
US20190019533A1 (en) Methods for efficient annotation of audiovisual media
JP2017512434A (en) Apparatus and method for playing an interactive audiovisual movie
CN104104901B (en) A kind of data playing method and device
TWI227851B (en) A method for synchronizing audio and video
US20100141838A1 (en) Presentation synchronization system and method
US20140380162A1 (en) Variable audiovisual playing method
CN108831510A (en) Method, apparatus, terminal and the storage medium that audio-video document is got ready
CN113535116A (en) Audio file playing method and device, terminal and storage medium
CN107679068B (en) Information importing and displaying method of multimedia file, mobile terminal and storage device
WO2020038290A1 (en) Bookmark display method and device
WO2022079853A1 (en) Music data processing method

Legal Events

Date Code Title Description
MM4A Annulment or lapse of patent due to non-payment of fees