TWI276961B - System, method and machine-readable storage medium for synchronization of still image and audio - Google Patents

System, method and machine-readable storage medium for synchronization of still image and audio Download PDF

Info

Publication number
TWI276961B
TWI276961B TW93118490A TW93118490A TWI276961B TW I276961 B TWI276961 B TW I276961B TW 93118490 A TW93118490 A TW 93118490A TW 93118490 A TW93118490 A TW 93118490A TW I276961 B TWI276961 B TW I276961B
Authority
TW
Taiwan
Prior art keywords
image
discontinuity
music
point
sound
Prior art date
Application number
TW93118490A
Other languages
Chinese (zh)
Other versions
TW200601048A (en
Inventor
Yi-Kai Chen
Original Assignee
Ulead Systems Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Ulead Systems Inc filed Critical Ulead Systems Inc
Priority to TW93118490A priority Critical patent/TWI276961B/en
Publication of TW200601048A publication Critical patent/TW200601048A/en
Application granted granted Critical
Publication of TWI276961B publication Critical patent/TWI276961B/en

Links

Landscapes

  • Studio Circuits (AREA)
  • Television Signal Processing For Recording (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)

Abstract

A method is employed for synchronization of still image and audio. The method first displays a still image, plays the audio, and partially analyzes the audio in the sampling intervals. The method determines a transition point in the sampling interval using a musical content analysis method and displays the next still image when the transition point is achieved.

Description

1276961 五、發明說明(l) —- 發明所屬之技術領域 此發明是一種影像顯示技術,特別是一種協調 (synchronize)聲音之影像顯示系統及方法。 先前技術 傳統之影像顯示系統,除了可播放一連串之靜態影像 (a sequence 〇f stiU images)之外,亦可搭配播放數位 音樂(digital audio)。靜態影像包括各式各樣影像格式 之檔案,例如,GIF、JPEG、SVG、PNG、JPEG 2 0 0 0 或°/他 之衫像格式。數位音樂亦包括各式各樣聲音格式之標案, 例如,MP3、MP4、AAC、VGF、0GG、WAV或其他之聲音格 式。然而,一般之影像顯示系統,大多獨立播放一連串之 靜態影像以及數位音樂。 為協調(synchronize)數位音樂以及靜態影像,美國 第5, 50 8, 470號專利中揭露一Karode裝置,讓影像的改變 可以與預設的音樂節拍(mus i c b ea t)協調。另外,美國第 6,6 3 9,6 4 9號專利中另揭露一音樂與影像協調裝置,用以 分析一整段音樂,並根據顯著的循環特徵(recurring f eature),產生影像切換訊號,讓影像顯示器可依據影像 切換訊號,播放下一張靜態數位影像。 雖然’先如之技術可自動產生影像切換訊號,但亦存 在若干缺點。首先,在播放影像之前,必須先分析一整段 音樂’耗費大量時間與計算資源。因此,需要一影像暨音 樂播放系統與方法,用以節省分析時間與計算資源。 發明内容1276961 V. INSTRUCTION DESCRIPTION (l) - TECHNICAL FIELD OF THE INVENTION This invention is an image display technology, and more particularly, an image display system and method for synchronizing sound. Prior Art Conventional image display systems, in addition to playing a series of still images (a sequence 〇f stiU images), can also be used to play digital audio. Still images include files in a variety of image formats, such as GIF, JPEG, SVG, PNG, JPEG 2 0 0 or °/Hi shirt format. Digital music also includes a variety of sound formats, such as MP3, MP4, AAC, VGF, 0GG, WAV or other sound formats. However, in general image display systems, most of the images are played independently of a series of still images and digital music. In order to synchronize digital music and still images, a Karode device is disclosed in U.S. Patent No. 5,508,470, the disclosure of which is incorporated herein by reference. In addition, a music and video coordination device is disclosed in the U.S. Patent No. 6,6, 9, 094, for analyzing a whole piece of music and generating a video switching signal according to a significant recurring feature. Let the image display switch the signal according to the image to play the next static digital image. Although the prior art technology can automatically generate image switching signals, there are also several disadvantages. First of all, before playing an image, you must analyze a whole piece of music, which takes a lot of time and computing resources. Therefore, an image and music playback system and method are needed to save analysis time and computing resources. Summary of the invention

五、發明說明(2) 首先有ί於此,本實施例揭露 内容分張f態影像,播音樂播放方法。 的部分八針對一段聲音串=9串流,使用一個音樂 於上诚:,,在上述取樣區間t汍的取樣區間做音樂内容 換時間點時,切 、定一個影像切換時間點, 點,此方法更包括決定尚未t下-張靜態影像。 产為段點為基礎,决定包人t聲音串流令之一個分段 /瓜為取,區尸日 1 。 &匕含分段點前後一部份聲音串 本實施例更揭露一電 « 電腦程式,該電腦程式==取館存媒體,用以儲存一 该電腦系統執行如上所述之方2至—電腦系統中並且使得 此外,本實施例揭露一二二 :儲存裝置、一顯示裝置、_::暨音樂播放系統,包括 元。儲存裝置用以儲存—聲立 3凌置、以及一處理單 音震置用以播放聲音串流。以及多張靜態影像。聲 態影像至顯示裝置,決定聲音串;t r用以用以輸出—張靜 用—個音樂内容分析方法以二對I之一段取樣區間,使 換時間點’於切換時間點時,於二取樣區間決定-個切 裝置。在決定取樣區間時,處二至顯示 一個分段點,並以分段點為基礎 ;串流中之 區間。 、疋匕έ力&點之取樣 聲音串流(audio stream)可為MP3、MP4、A 〇GG或WAV格式之數位聲音;而靜態影像可ff 、 厕、SVG、PNG或JPEG 2 0 0 0格式之數位影像。音&容 0599-A20372TWF(Nl);2004-03;snowball.ptd 第5頁 ----,111, 1276961V. Description of the Invention (2) First of all, the present embodiment discloses a method for playing a music and a method for playing music. Part 8 is for a string of sound = 9 streams, using a piece of music in Shangcheng:, when the music interval is changed at the sampling interval of the sampling interval t汍, cut and set an image switching time point, point, this The method further includes determining that the still image has not been t-down. The production is based on the segment point, and it is decided that the packager's voice stream is ordered by a segment/melon, and the corpse day is 1 . & 一 contains a part of the sound string before and after the segmentation point. This embodiment further discloses a computer program, the computer program == accessing the library media, for storing a computer system to perform the above 2 In the system and in addition, the embodiment discloses a two-two: a storage device, a display device, a _:: cum music playing system, including a meta. The storage device is used for storing the sound, and the processing of the sound is used to play the sound stream. And multiple still images. The sound image is displayed to the display device, and the sound string is determined; tr is used for outputting - Zhang Jing use - a music content analysis method to sample the interval of one of the two pairs of I, so that the time point is changed at the time of switching, and the second sampling is performed. Interval decision - a cutting device. When determining the sampling interval, the second to the display of a segmentation point, based on the segmentation point; the interval in the stream. , 疋匕έ && point sample audio stream (audio stream) can be MP3, MP4, A 〇 GG or WAV format digital sound; while still image can be ff, toilet, SVG, PNG or JPEG 2 0 0 0 Digital image of the format. Sound & 0599-A20372TWF (Nl); 2004-03; snowball.ptd Page 5 ----, 111, 1276961

五、發明說明(3) _ 分析方法用以取得相應於一樂器起奏點(attack time f〇r an instrument)、一旋律不連續點(mel〇dy discontinuity)、一 合聲不連續點(harm〇nic discontinui ty)、一 節拍不連續點(beat discontinuity)、一 音高不連續點(pitch discontinuity)、一最大表面峰值㈦”^龍 r〇ugh peak value)或一最大表面谷值(minimum r〇ugh valley value) 之上述切換時間點。 實施方式 第1圖係表示依據本發明實施例之影像暨音樂播放系 統之系統架構圖。依據本發明實施例之影像暨音樂播放系 統10包括一處理單元11、一記憶體丨2、一儲存裝置13、一 顯示裝置14、一聲音裝置(audi〇 device)15,並使用匯流 排1 6將其連結再一起。除此之外,熟習此技藝人士也可將 本發明實施於其他電腦系統樣態(conf igurati〇n)上,例 如,手持式設備(hand-held de vices)、多處理器系統、 以微處理器為基礎或可程式化之消費性電子產品 (microprocessor-based or programmable consumer electronics)、網路電腦、迷你電腦、大型主機以及類似 之設備。處理單元11可包含一單一中央處理單元 (central-processing unit; CPU)或者是關連於平行運算 環境(parallel processing environment)之複數平行處 理單元。記憶體12包含唯讀記憶體(read only mem〇ry ; ROM)、快閃記憶體(fiash ROM)以及/或動態存取記憶體V. Description of invention (3) _ Analytical method is used to obtain an attack time f〇r an instrument, a mel〇dy discontinuity, a coma discontinuity (harm) 〇nic discontinui ty), beat discontinuity, pitch discontinuity, a maximum surface peak (seven) "long r〇ugh peak value" or a maximum surface valley (minimum r The above-mentioned switching time point of the 〇ugh valley value. Embodiment 1 is a system architecture diagram of an image and music playing system according to an embodiment of the present invention. The video and music playing system 10 according to an embodiment of the present invention includes a processing unit. 11. A memory device 2, a storage device 13, a display device 14, and an audio device 15 are connected together by a bus bar 16. In addition, those skilled in the art also The present invention can be implemented on other computer system configurations, such as hand-held de vices, multi-processor systems, microprocessor-based or Processing unit 11 may include a single central processing unit (CPU) or a microprocessor-based or programmable consumer electronics Is a parallel parallel processing unit associated with a parallel processing environment. The memory 12 includes a read only memory (ROM), a flash memory, and/or a dynamic access memory. body

0599-A20372TWF(Nl);2004-03;snowball.ptd 第6頁 1276961 五、發明說明(4) (random access memory; RAM),田、 ^ , 用以儲存可供處理單元 11執订之私式核組。一般而言,未 r + · Λ , 私式模組包含常序 (routines)、程式(program)、物 / , λ _ 〜件 Cob ject)、兀件 (component)等,用以執行影傻既立 介π丨、,與#从\ 丁〜1豕旦音樂播放功能。本發明 亦可以貫施於分散式運笪提p ^ 1+ j & 1 兄,其運算工作被一連結於通 訊網路之遂端處理設備所勃并 立谢Μ姑μ 在分散式環境中,影像暨 音樂播放功能之執行也許由太从 n ^ _ . ^ f由本地从及多部遠端電腦系統共 同元成。儲存叙置1 3包含廊雄爿士里 + π & ^ $ 硬碟叙置、軟碟裝置、光碟裝置 ,:以讀取硬碟、軟碟、光碟、隨身碟中儲 Λ二=、且、音樂棺案(亦即是聲音串流)、組態播案 (、、且悲、、、己錄)以及/或靜態影像檔案。 第一實施例 ,:實施例揭露一種即時處理之影像暨音樂播放方 / 一 f由處理單兀1 1所執行。第2圖係表示依據本發 明弟一貫施例之影像暨音樂播放方法之方法流程圖。 如二驟S 2 11 ’接收一個聲音串流以及一串之靜態影 ,此茸曰串流(audio stream)可為 Mp3、Mp4、AAC、 VGF、0GG、WAV或发仙獻立 ”他卓曰格式之數位聲音。如步驟 S212,it過聲音裝置15,播放聲音串流。如步驟s22i,透 ;顯示裝置14,顯示序列之靜態影像中之第-張靜態影 像。 如步驟S 2 3 1,從起私w / I始點或上一個分段點(p a r t i t i ο η po l n t)以後,於聲音电、、* a t 串中決定一個分段點,而此分段點 距離起始點或上一個分與 刀奴點之長度,可為一固定長度或一0599-A20372TWF(Nl);2004-03;snowball.ptd Page 61276961 V. Invention Description (4) (random access memory; RAM), Tian, ^, used to store the private unit available for processing unit 11 Nuclear group. In general, not r + · Λ , the private module contains routines, programs, objects / , λ _ ~ Cob ject, components, etc. Lisuke π丨,, and # from \丁~1豕dan music playback function. The present invention can also be applied to the decentralized operation to raise the p ^ 1+ j & 1 brother, whose operation is connected to the end of the communication network by the processing equipment of the communication network. The execution of the cum music playback function may be made up of n ^ _ . ^ f from local and multiple remote computer systems. Storage Description 1 3 contains the 爿 爿 爿 + + π & ^ $ hard disk description, floppy device, CD device, to read hard disk, floppy disk, CD, flash drive, storage 2 =, and , music files (that is, sound streaming), configuration broadcasts (,, and sad, and, recorded) and / or static image files. The first embodiment, the embodiment discloses that an instant processing image and music player/f is executed by the processing unit 11. Figure 2 is a flow chart showing the method of video and music playback according to the consistent embodiment of the present invention. If the second step S 2 11 'receives a stream of sound and a string of static shadows, the audio stream can be Mp3, Mp4, AAC, VGF, 0GG, WAV or send a fairy. The digital sound of the format. In step S212, it passes the sound device 15 to play the sound stream. If the step s22i, the display device 14 displays the first still image in the still image of the sequence. Step S 2 3 1. After starting the w/I starting point or the last segment point (partiti ο η po lnt), a segmentation point is determined in the sound power, * at string, and the segment point is from the starting point or the previous point The length of the knife slave point can be a fixed length or one

0599-A20372TWF(Nl);2004-03;snowbal1.ptd 第7頁 1276961 五、發明說明(5) 變動長度。舉例言之,此長度可由公式(1 )計算而得’0599-A20372TWF(Nl);2004-03;snowbal1.ptd Page 7 1276961 V. Description of invention (5) Length of change. For example, this length can be calculated from equation (1).

Lseg = Ltotal/(Nimg-1)..............................公式(1), 其中,L t 〇 t a 1表示音樂串流之總長度;N i mg表示靜態影像 之數目。除此之外,此分段點距離起始點或上一個分段點 之長度也可預先設定並儲存於組態槽或組態紀錄中。如步 驟S23 2,以分段點為基礎,決定一取樣區間(sampling interval)。於較佳之情況下,取樣區間之長度可預先設Lseg = Ltotal/(Nimg-1)........................Formula (1), where L t 〇ta 1 represents the total length of the music stream; N i mg represents the number of still images. In addition to this, the length of the segment point from the starting point or the last segment point can also be preset and stored in the configuration slot or configuration record. In step S23 2, a sampling interval is determined based on the segmentation point. In the preferred case, the length of the sampling interval can be preset

定並儲存於組態檔或組態紀錄中。如步驟S233,使用各式 各樣之音樂内容分析方法(musical content analysis methods),從取樣區間中,決定一切換時間點(image transi t ion point )。音樂内容分析方法為熟習此技藝人 士所習知’可用以決定一樂益起奏點(attacktime for an instrument)、旋律不連續點(mei〇dy discontinuity)、合聲不連續點(harmonic discontinuity)、節拍不連續點(beat discontinuity)、 音高不連續點(pitch discontinuity)、最大表面峰值 (maximum rough peak value)、最大矣而欠括,· · 取八不1囬合值(minimum rough valley value)以及其他聲音特徵中之一者。如牛Set and save in the configuration file or configuration record. In step S233, a variety of image content points are used to determine an image transi t ion point from the sampling interval. The music content analysis method is known to those skilled in the art to determine the "attacktime for an instrument", the mel〇dy discontinuity, the harmonic discontinuity, Beat discontinuity, pitch discontinuity, maximum rough peak value, maximum 矣 欠 , , · · · · · · · · · · · · · · · · · · · · · · · · · · And one of the other sound features. Cow

驟S234,影像閒置一段時間至步驟S233所決定之切換1夺"間 點。如步驟S 2 3 5,於切換時間點,透過顯示襄置丨4,N :厂、 下一張靜態影像。於較佳之情況下,使用一 ^技 ’卜不 ^ m 個轉場效果 (transition effect),諸如放大(zoom in、 u 111 1 n)、淡入 f f 只 H p in)、飛入(fly in)或其他,用以顯示下— 張静態影後。 如步驟S241,判斷是否存在未顯示過^〜1豕 ❿之知態影像,是In step S234, the image is idle for a period of time until the switch 1 is determined by the step S233. In step S 2 3 5, at the switching time point, through the display device 丨 4, N: factory, the next still image. In a preferred case, a transition effect is used, such as zooming in (zoom in, u 111 1 n), fade in ff, only H p in), flying in (fly in) Or other, to display the next - static image. In step S241, it is determined whether there is a known image that does not display ^~1豕 ,,

1276961 五、發明說明(6) 則進行步驟S 2 3 1 ;否則結束整個影像暨音樂播放方法。 第3圖係表示依據本發明第一實施例之範例影像暨音 樂播放示意圖。參考步驟S21 1,先接收一聲音串流AS以及 序列之靜態影像11至I 4。參考步驟S 2 1 2,透過聲音裝置j 5 播放聲音串流A S。參考步驟S 2 2 1,顯示靜態影像11。 之後,參考步驟S231至S235,決定分段點al ;以分段 點為基礎,決定一個取樣區間,al’至al’,;使用一個音 樂内容分析方法’在此取樣區間中,決定出切換時間點 s 1,最後當音樂播放至切換時間點s 1時,將靜能旦彡禮]· 9顧 示於顯示裝置14上。參考步驟S241,由於還存; 之靜態影像I 3以及I 4,因此整個流程進行步驟$ 2 3 1繼續下 一階段之處理。 、 參考 礎,決定 分析方法 當音樂播 裝置1 4上 像14,因 參考 礎,決定 分析方法 樂播放至 1 4上。參 像,因此 步驟S231至S235 ’決定分段點a2 ;以分段點為基 一個取樣區間’ a2’至a2’ ’ ;使用—個音樂内容 ’在此取樣區間中,決定出切換時間點以;最後 放至切換時間點s2時,將靜態影像13顯示於顯示 。參考步驟S241,由於還存在未顯示過之靜態影 此整個流程進行步驟S2 3 1繼續下一階段之卢理。 步驟S231至S235,決定分段點^ · 处 /订这广日日〇, 又』a3,以分段點為基 一個取樣區間,a3至a3’,;使用—個音半 ,在此取樣區間中’決定出切換時間㈣、.舍立 切換時間點S3時,將靜態影像於 :: 考步驟S24卜由於不存在任何夫;於顯不裝置 整個流程結束。 縣饤未顯不過之靜態影1276961 V. Inventive Note (6) Then proceed to step S 2 3 1; otherwise, the entire image and music playing method is ended. Figure 3 is a diagram showing an exemplary video and music playback in accordance with a first embodiment of the present invention. Referring to step S21 1, a sound stream AS and a sequence of still images 11 to I4 are first received. Referring to step S 2 1 2, the sound stream A S is played through the sound device j 5 . Referring to step S 2 2 1, a still image 11 is displayed. After that, referring to steps S231 to S235, the segmentation point a1 is determined; based on the segmentation point, a sampling interval, al' to al', is determined; using a music content analysis method, in this sampling interval, the switching time is determined. Point s 1, and finally when the music is played to the switching time point s 1, the static energy is displayed on the display device 14. Referring to step S241, since the still images I 3 and I 4 are still stored, the entire flow proceeds to step $2 3 1 to continue the processing of the next stage. , reference basis, decision analysis method When the music broadcast device 1 4 like 14, because of the reference, decide the analysis method to play to 14. Reference, therefore steps S231 to S235 'determine segmentation point a2; segmentation segment based on a sampling interval 'a2' to a2''; use - music content 'in this sampling interval, determine the switching time point to Finally, when the switching time point s2 is placed, the still image 13 is displayed on the display. Referring to step S241, since there is still a static shadow that has not been displayed, the entire process proceeds to step S2 3 1 to continue to the next stage. Steps S231 to S235, determining the segmentation point ^ · at/receiving the wide day and the sun, and then a3, taking a segmentation point as a sampling interval, a3 to a3',; using a halftone, in this sampling interval When 'deciding the switching time (4), and turning the switching time point S3, the static image is:: test step S24 because there is no such thing; the entire process ends. The static shadow of the county

0599-A20372TWF(N1);2004-03;s nowba11.p t d 第9頁 1276961 —一 五、發明說明(7) 第二實施例 第二實施例揭露一種批次處理之影像暨音樂播放方 法,本方法由處理單元1 1所執行。第4圖係表示依據本發 明實施例之影像暨音樂播放方法之方法流程圖。 如步驟S 4 1 1,接收一聲音串流,此聲音串流可為 MP3、MP4、AAC、VGF、OGG、WAV或其他聲音格式之數位聲 音。如步驟S 4 2 1,於聲音串流中決定至少一分段點 (partition point),而分段點間之長度可為一固定長度 或一變動長度。於較佳之情況下,每一分段點間之長度由 上述之公式(1 )計算而得。除此之外,兩分段點間之每一 長度也可預先設定並儲存於組態槽或組態紀錄中。為如步 驟S422,以每一分段點為基礎,決定一取樣區間 (sampling interval)。於較佳之情況下,每一取樣區間 之長度為預先設定並儲存於組態檔或組態紀錄中。如步驟 S423,使用各式各樣之音樂内容分析方法(musiCal content analysis methods),從每一取樣區間中,決定 一切換時間點(image transition point)。音樂内容分析 方法為熟習此技藝人士所習知,可用以決定一樂器起奏 點、旋律不連續點、合聲不連續點、節拍不連續點、A言 不連續點、最大表面峰值、最大表面谷值以及其他聲^ 徵中之一者。如步驟S431,接收序列之靜熊影 —= 靜態影像之格式,可為TIFF、GIF、ϊρπ〜 張 麗2_以及其他影像格式中J:EG、SVG、PNG、 過聲音裝置15,播放聲音串流如—牛者_如步驟,逯 τ机如步驟S433,依據先前產0599-A20372TWF(N1);2004-03;s nowba11.ptdpage 91276961 -15, invention description (7) second embodiment second embodiment discloses a batch processing image and music playing method, the method It is executed by the processing unit 11. Figure 4 is a flow chart showing a method of video and music playback method in accordance with an embodiment of the present invention. In step S 4 1 1, a stream of sounds is received, which may be digital sounds of MP3, MP4, AAC, VGF, OGG, WAV or other sound formats. In step S 4 2 1, at least one partition point is determined in the sound stream, and the length between the segment points may be a fixed length or a variable length. In the preferred case, the length between each segment point is calculated by the above formula (1). In addition to this, each length between the two segment points can also be pre-set and stored in the configuration slot or configuration record. For example, in step S422, a sampling interval is determined based on each segment point. In the preferred case, the length of each sampling interval is preset and stored in the configuration file or configuration record. In step S423, an image transition point is determined from each sampling interval using a variety of music content analysis methods (musiCal content analysis methods). The music content analysis method is known to those skilled in the art and can be used to determine an instrument starting point, a melody discontinuity point, a chorus discontinuity point, a beat discontinuity point, an A word discontinuity point, a maximum surface peak value, a maximum surface. One of the valleys and other sounds. In step S431, the sequence of the static bear shadow-=still image is received, which can be TIFF, GIF, ϊρπ~ Zhang Li 2_ and other image formats: J: EG, SVG, PNG, over-the-sound device 15, playing the sound string If the flow is like a cow, such as a step, the ττ machine is as in step S433, based on the previous production.

1276961 五、發明說明(8) ^之切換時間點,透過顯示裝置丨4, 象。於較佳之情況下,可於顯示新的;::::靜態影 個轉場效果,諸如放大、$人、φ的聍悲衫像時,使用一 木渚如放大淡入飛入或其他。 弟0圖係表示依據本發明第二實施 ::放示意圖。參考步驟S4U,先 "串影1暨音 考步驟S42l ’將聲音串流仏切分為四$土,。參 至a3。參考步驟S422,以各個分 ::“又點ai 樣&間,分別為“,至al,’、a2,, 疋一個取 參考步驟S423,使用一音樂内容分析2 ;及:3至:3,’。 區間中,決定出切換時間點Sl、S2以及的取樣 S43 1,接收序列之靜態影像11、I 2、;[ 3以及。= S4 32 ’透過聲音裝置15播放聲音串流ς步驟 S433 ’當音樂播放至切換時間點s 靜: 於顯示裝置14上;者立*播放至將静恶影像12顯示 与推τ q祐 田曰本播放至切換時間點S2時,將靜能 〜像13顯不於顯示裝置14上;以及當立鉍撼 =心 點S3,,將靜態影像丨4顯示於顯示装置^上。刀、日、間 加第6立圖係表示依據本發明第一以及第二實施例之軟 °影像暨音樂播放系統之軟體41被處理單元11 植41 ^二’^中包括聲音串流分析模組411以及控制模 、、 耳曰串^分析模組4 1 1由儲存裝置1 5輸入聲音串 流,於—聲音串流中決定至少一分段點。聲音串流分析模組 4 11以母一分段點為基礎,決定一取樣區間,並且使用一 音樂内容分析方法,從每一取樣區間中,決定一切換時間 點。聲音串流分析模組4 1 1更將決定之切換時間點傳送給1276961 V. Description of the invention (8) ^ The switching time point is transmitted through the display device 丨4, image. In the preferred case, a new ;:::: static shadow transition effect, such as a magnified, $human, φ sorrowful shirt image, can be used to zoom in or out. The figure 0 shows a second embodiment of the present invention. Referring to step S4U, the first "cross-image 1 cum sound test step S42l ' divides the sound stream into four $ soils. Go to a3. Referring to step S422, each of the points: "and ai-like &, respectively, ", to al, ', a2, 疋 a reference step S423, using a music content analysis 2; and: 3 to: 3 , '. In the interval, the sampling time S1, S2 and the sampling S43 are determined, and the still image of the sequence is received, I2; [3 and . = S4 32 'Playing the sound stream through the sound device 15, step S433 'When the music is played to the switching time point s static: on the display device 14; the player* plays until the static image 12 is displayed and pushed τ q You Tianyu When the playback to the switching time point S2, the static image ~ image 13 is not displayed on the display device 14; and when the vertical position = the heart point S3, the still image 丨 4 is displayed on the display device ^. The knives, the day, and the sixth image show that the soft body 41 of the soft image and music playing system according to the first and second embodiments of the present invention is processed by the processing unit 11 to include a sound stream analysis module. The group 411 and the control module, the deaf string analysis module 4 1 1 input the audio stream from the storage device 15, and determine at least one segment point in the audio stream. The sound stream analysis module 4 11 determines a sampling interval based on the parent-segment point, and uses a music content analysis method to determine a switching time point from each sampling interval. The voice stream analysis module 4 1 1 further transmits the determined switching time point to

0599-A20372TWF(N1);2004-03;snowball.ptd0599-A20372TWF(N1);2004-03;snowball.ptd

12769611276961

控制模組412。控制模組412由儲存署 音串流以及序列之靜態影像,驅動聲罝接收上述之聲 流,將第一張靜態影像輸出至顯示掌^裝置15播放聲音串 在偵測到聲音串流播放至切換時門士 1 4 °控制模組4 1 2 像輪出至顯示裝置1 4。 ”日守’將下一張靜態影 再者,本發明提出一種電腦可 存一電腦程式,上述電腦程式用以^現第f媒體々,用以儲 例之影像暨音樂播放方法,此方法二 _ 一以及第二實施 驟。第7圖係表示依據本發明第一以曰仃_如上所述之步 暨音樂播放方法之電腦可讀取儲存媒體^=咅實施例之影像 式包含七個邏輯,分別為接收聲立电、ώ θ ”電腦程 年曰串流邏輯5 2 1、、、叔+ \ 段點邏輯5 2 2、決定取樣區間邏輟R 9 q ^ ’、疋为 胸、接收存列之靜態影像邏輯52::播=;=點邏 526與顯示靜態影像邏輯527。 曰串w邏輯 法,僅 ’大量 因此,藉由本發明所提供之影像播放系統及 針對取樣區段之聲音串流進行部分的音樂内容分 節省分析時間與計算資源’亦可實現即時協調 (synchronize)聲音以及靜態影像。 雖然本發明之實施例揭露如上,然其並非用以 發明,任何熟悉此項技藝者,在不脫離本發明之2疋本 圍内,當可做些許更動與潤飾,因此本發明之保^二和範 視後附之申請專利範圍所界定者為準。 執圍當Control module 412. The control module 412 drives the sonar stream and the sequence of still images to drive the sonar to receive the sound stream, and outputs the first still image to the display device 15 to play the sound string and detects the sound stream to be played to When switching, the gate 1 4 ° control module 4 1 2 turns out to the display device 14 . "Shou Shou" will be the next static image. The present invention proposes a computer program for storing a computer program. The computer program is used to display the image and music playing method of the storage case. _ I and the second embodiment. Figure 7 shows a computer readable storage medium according to the first step of the present invention as described above. The image format of the embodiment includes seven logics. , respectively, receive sound power, ώ θ ” computer program year 曰 stream logic 5 2 1 , , 叔 + \ segment point logic 5 2 2, determine the sampling interval logic R 9 q ^ ', 疋 for chest, receive The static image logic 52:: broadcast =; = point logic 526 and display static image logic 527.曰string w logic method, only a large number, therefore, the video playback system provided by the present invention and the music stream segmentation for the sampling segment can save the analysis time and computing resources' Sound and still images. Although the embodiments of the present invention are disclosed above, they are not intended to be invented, and any one skilled in the art can make some changes and retouchings without departing from the scope of the present invention. And as defined in the scope of the patent application attached to Fanshi. Behave

1276961 圖式簡單說明 第1圖係表示依據本發明實施例之影像暨音樂播放系 統之系統架構圖。 第2圖係表示依據本發明實施例之影像暨音樂播放方 法之方法流程圖。 第3圖係表示依據本發明實施例之範例影像暨音樂播 放不意圖。 第4圖係表示依據本發明實施例之軟體架構示意圖。 第5圖係表示依據本發明實施例之影像暨音樂播放方 法之電腦可讀取儲存媒體示意圖。 第6圖係表示依據本發明第一以及第二實施例之軟體 架構不意圖。 第7圖係表示依據本發明第一以及第二實施例之影像 暨音樂播放方法之電腦可讀取儲存媒體示意圖。 符號說明 1 0〜影像暨音樂播放系統; 11〜處理單元; 1 2〜記憶體, 1 3〜儲存裝置; 1 4〜顯示裝置; 1 5〜聲音裝置; 1 6〜匯流排; S211、S221.....S 233〜方法操作步驟; AS〜聲音串流;1276961 BRIEF DESCRIPTION OF THE DRAWINGS Fig. 1 is a system architecture diagram of an image and music playback system in accordance with an embodiment of the present invention. Figure 2 is a flow chart showing a method of video and music playback in accordance with an embodiment of the present invention. Figure 3 is a diagram showing exemplary image and music playback in accordance with an embodiment of the present invention. Figure 4 is a block diagram showing the software architecture in accordance with an embodiment of the present invention. Figure 5 is a diagram showing a computer readable storage medium in accordance with an image and music playing method according to an embodiment of the present invention. Fig. 6 is a view showing a software architecture in accordance with the first and second embodiments of the present invention. Figure 7 is a diagram showing a computer readable storage medium in accordance with the image and music playing method of the first and second embodiments of the present invention. Symbol Description 1 0 ~ video and music playback system; 11 ~ processing unit; 1 2 ~ memory, 1 3 ~ storage device; 1 4 ~ display device; 1 5 ~ sound device; 1 6 ~ bus bar; S211, S221. ....S 233~ method operation steps; AS~sound stream;

0599-A20372TWF(Nl);2004-03;snowball.ptd 第13頁 1276961 圖式簡單說明 a 1、a 2、a 3〜分段點; al,、al,,、a2,、a2,,、a3,、a3,,〜取樣參考點; s 1、s 2、s 3〜切換時間點; 11、I 2、I 3、I 4〜靜態影像; 4 1 1〜聲音串流分析模組; 4 1 2〜控制模組; 5 2 0〜影像播放電腦程式; 5 2 1〜接收聲音串流邏輯; 5 2 2〜決定分段點邏輯; 5 2 3〜決定取樣區間邏輯; 5 2 4〜決定切換時間點邏輯; 52 5〜接收序列之靜態影像邏輯; 5 2 6〜播放聲音串流邏輯; 5 2 7〜顯示靜態影像邏輯。0599-A20372TWF(Nl);2004-03;snowball.ptd Page 131276961 Schematic description a 1, a 2, a 3 ~ segmentation point; al,, al,,, a2, a2,,, a3 , a3,, ~ sampling reference point; s 1, s 2, s 3 ~ switching time point; 11, I 2, I 3, I 4 ~ static image; 4 1 1 ~ sound stream analysis module; 4 1 2~ control module; 5 2 0~ video playback computer program; 5 2 1~ receive sound stream logic; 5 2 2~ decide segmentation point logic; 5 2 3~ decide sampling interval logic; 5 2 4~ decide to switch Time point logic; 52 5~ Receive sequence static image logic; 5 2 6~ play sound stream logic; 5 2 7~ display still image logic.

0599-A20372TWF(N1);2004-03;snowba11.p t d 第14頁0599-A20372TWF(N1);2004-03;snowba11.p t d第14页

Claims (1)

1276961 __ 案號 93118490 六、申請專利範圍 1 · 一種影像暨音樂播放方法,被一處理單元載入並執 行,其方法包括下列步驟: 顯示一張靜態影像; 播放聲音串流; .使用一個音樂内容分析方法’針對一段聲音串流之一 段取樣區間做音樂内容的部分分析; 於上述取樣區間中決定〆個切換時間點;以及 於上述切換時間點時,切換顯示下一張靜態影像。 2 ·如申請專利範圍第1項所述之影像暨音樂播放方 法,其中上述聲音串流(audi0 stream)為MP3、MP4、 AAC、VGF、OGG或WAV格式之數位聲音。 3 ·如申請專利範圍第1項所述之影像暨音樂播放方 法,其中上述靜態影像為TIFF、GIF、JPEG、SVG、PNG或 JPEG 2000格式之數位影像。 4 ·如申請專利範圍第1項所述之影像暨音樂播放方 法,其中上述音樂内容分析方法用以取得相應於一樂器起 奏點(attack time f 〇r an instrument)、一旋律不連續 點(melody discontinuity)、一合聲不連續點(harmonic discontinuity)、一 節拍不連續點(beat discontinuity)、一 音高不連續點(pi tch discontinuity)、一 最大表面冷值(maximum rough peak value)或一最大表面谷值rough valley value) 之上述切換時間點。 5.如申請專利範圍第1項所述之影像暨音樂播放方1276961 __ Case No. 93118490 VI. Patent Application No. 1 · An image and music playing method is loaded and executed by a processing unit, and the method comprises the following steps: displaying a still image; playing a sound stream; using a music content The analysis method 'partial analysis of the music content for a sampling interval of a segment of the sound stream; determining a switching time point in the sampling interval; and switching to display the next still image at the switching time point. 2. The video and music playing method according to claim 1, wherein the audio stream (audio stream) is a digital sound in an MP3, MP4, AAC, VGF, OGG or WAV format. 3. The image and music playing method as described in claim 1, wherein the still image is a digital image in a TIFF, GIF, JPEG, SVG, PNG or JPEG 2000 format. 4. The image and music playing method according to claim 1, wherein the music content analyzing method is used to obtain an attack time f 〇r an instrument and a melody discontinuity point ( Melody discontinuity), a harmonic discontinuity, a beat discontinuity, a pi tch discontinuity, a maximum rough peak value, or a The above switching time point of the maximum valley value. 5. The video and music player as described in item 1 of the patent application scope 1276961 Ρ . ν _案號 93ii^irg : — L1 正 __ 六、申請專利範圍1—一 —: 法,更包括下列步驟: 決定尚未播放之上述聲音串流中之一個分段點;以及 以上述分段點為基礎,決定包含上述分段點之上述取樣區 間、 6 ·如申請專利範圍第5項所述之影像暨音樂播放方 法,其中上述分段點被使用以將上述聲音串流切分為相同 等份。 7 ·如申請專利範圍第6項所述之影像暨音樂播放方 法,其中上述音樂内容分析用以取得相應於一樂器起奏點 (attack time for an instrument)、一旋律不連續點 (melody discontinuity)、一合聲不連續點(harmonic discontinuity)、一 節拍不連續點(beat discontinuity)、一 音高不連續點(pitch discontinuity)、一最大表面峰值(maximum peak value)或一最大表面谷值(minimum r〇Ugh vaney vaiue) 之上述切換時間點。 8· —種電腦可讀取儲存媒體,用以儲存一電腦程式, 該電腦程式用以載入至一電腦系統中並且使得該電腦系統 執行如申請專利範圍第1至7項中任一者所述之方法。 9. 一種影像暨音樂播放系統,包括: 一儲存裝置’用以儲存‘一段聲音串流以及複數靜態影 像; 一顯示裝置; 一聲音裝置,用以播放上述聲音串流;以及1276961 Ρ . ν _ case number 93ii^irg : — L1 正 __ VI. Patent application 1-1—: The method further includes the following steps: determining a segmentation point in the above-mentioned sound stream that has not been played; Based on the segmentation point, the sampling interval including the segmentation point is determined, and the image and music playing method described in claim 5, wherein the segmentation point is used to cut the sound stream Divided into the same aliquot. 7. The image and music playing method according to claim 6, wherein the music content analysis is used to obtain an attack time for an instrument and a melody discontinuity. , a harmonic discontinuity, a beat discontinuity, a pitch discontinuity, a maximum peak value, or a maximum surface valley (minimum) r〇Ugh vaney vaiue) The above switching time point. 8. A computer readable storage medium for storing a computer program for loading into a computer system and causing the computer system to perform any of claims 1 to 7 The method described. 9. An image and music playback system comprising: a storage device </ RTI> for storing 'a stream of sounds and a plurality of still images; a display device; a sound device for playing the stream of sounds; 0599-A20372TWF1CN1);2004-03;snowbal1.ptc 第16頁 1276961 r 月.,曰 修正 i 號 93]!|你為〇6 六、申請專利範圍 _ 一處理單疋’用以輪出一張靜態影像至上述顯示裝 X,使Μ個音樂内容分析方法,針對一段聲音串流之一 f取樣區間做音樂内容的部分分析,於上述取樣區間中決 疋個切換日可間點,於上述切換時間點時,輸出下一張靜 態影像至上述顯示裝置。 I 0 ·如申请專利範圍第9項所述之影像暨音樂播放系 、、充八中上述聲日串流(audio stream)為MP3、MP4、 AAC、VGF、0GG或WAV格式之數位聲音。 II ·如申請專利範圍第9項所述之影像暨音樂播放系 統,其中上述靜態影像gTlFF、GI F、jpEG、SVG、pNG或 JPEG 20 0 0格式之數位影像。 1 2 ·如申請專利範圍第9項所述之影像暨音樂播放系 統,其中上述音樂内容分析方法用以取得相應於一樂器起 奏點(attack time for an instrument)、一旋律不連續 點(melody discontinuity)、一合聲不連續點(harmonic discontinuity)、一 節拍不連續點(beat discontinuity)、一 音高不連續點(Pitch discontinuity) 一 隶大表面峰值(iflaximum rough peak value)或 袁大表面合值(iniiiiniuni rough valley value) 之上述切換時間點。 1 3.如申請專利範圍第9項所述之影像暨音樂播放系 統,其中上述處理單元用以決定尚未播放之上述聲音串流 中之一個分段點,以上述分段點為基礎,決定包含上述分 段點之上述取樣區間。0599-A20372TWF1CN1);2004-03;snowbal1.ptc Page 161276961 r Month.,曰修正i号93]!|你为〇6 VI. Patent scope _ 一处理单疋' used to rotate a static The image is sent to the display device X, so that a music content analysis method performs partial analysis of the music content for a sampling interval of one piece of the audio stream, and in the sampling interval, the switching time interval is determined in the above switching time. When the point is output, the next still image is output to the above display device. I 0 · The video and music playing system described in claim 9 of the patent application, and the audio stream of the above-mentioned audio stream is a digital sound of the MP3, MP4, AAC, VGF, 0GG or WAV format. II. The video and music playback system of claim 9, wherein the still image is a digital image of the gTlFF, GI F, jpEG, SVG, pNG or JPEG 200 format. 1 2. The image and music playing system according to claim 9, wherein the music content analysis method is used to obtain an attack time for an instrument and a melody discontinuity (melody) Discontinuity), harmonic discontinuity, beat discontinuity, pitch discontinuity, orlaximum rough peak value or Yuan Da surface The above switching time point of the value (iniiiiniuni rough valley value). The image and music playing system of claim 9, wherein the processing unit is configured to determine a segment point of the sound stream that has not been played, based on the segmentation point, and determine to include The above sampling interval of the above segmentation point. 0599-A20372TWF1(N1);2004-03;snowba11.pt c 第17頁0599-A20372TWF1(N1);2004-03;snowba11.pt c第17页
TW93118490A 2004-06-25 2004-06-25 System, method and machine-readable storage medium for synchronization of still image and audio TWI276961B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
TW93118490A TWI276961B (en) 2004-06-25 2004-06-25 System, method and machine-readable storage medium for synchronization of still image and audio

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
TW93118490A TWI276961B (en) 2004-06-25 2004-06-25 System, method and machine-readable storage medium for synchronization of still image and audio

Publications (2)

Publication Number Publication Date
TW200601048A TW200601048A (en) 2006-01-01
TWI276961B true TWI276961B (en) 2007-03-21

Family

ID=38646396

Family Applications (1)

Application Number Title Priority Date Filing Date
TW93118490A TWI276961B (en) 2004-06-25 2004-06-25 System, method and machine-readable storage medium for synchronization of still image and audio

Country Status (1)

Country Link
TW (1) TWI276961B (en)

Also Published As

Publication number Publication date
TW200601048A (en) 2006-01-01

Similar Documents

Publication Publication Date Title
WO2018059342A1 (en) Method and device for processing dual-source audio data
US9601029B2 (en) Method of presenting a piece of music to a user of an electronic device
EP3489946A1 (en) Real-time jamming assistance for groups of musicians
CN110675886A (en) Audio signal processing method, audio signal processing device, electronic equipment and storage medium
JP2011516907A (en) Music learning and mixing system
WO2017076304A1 (en) Audio data processing method and device
TWI731382B (en) Method, device and equipment for speech synthesis
WO2016202176A1 (en) Method, device and apparatus for synthesizing media file
JP2019219638A (en) Music synthesis method, system, terminal and computer-readable storage medium
CN113035162B (en) Ethnic music generation method, device, equipment and storage medium
KR950004253A (en) Karaoke device back chorus playback device
Müller et al. Interactive fundamental frequency estimation with applications to ethnomusicological research
WO2023051246A1 (en) Video recording method and apparatus, device, and storage medium
CN108257588A (en) One kind is set a song to music method and device
JP2012088402A (en) Information processor, information processing method, and program
JP2006260644A (en) Data processing method, electronic device, program and recording medium
WO2009010009A1 (en) Prompting message forming method and device for mobile terminal
JP2010237257A (en) Evaluation device
TWI276961B (en) System, method and machine-readable storage medium for synchronization of still image and audio
US20060020880A1 (en) System and method for synchronization of music and images
US20130339349A1 (en) Method and apparatus for music searching
JP2009230007A (en) Performance information display and program
JP2009205039A (en) Audio data conversion/reproduction system, audio data conversion device and audio data reproduction device
JP2008216681A (en) Karaoke device wherein recorded singer&#39;s singing can strictly be compared with model singing
JP4413643B2 (en) Music search and playback device

Legal Events

Date Code Title Description
MM4A Annulment or lapse of patent due to non-payment of fees