TWI305347B - Method and apparatus for encoding and decoding an audio signal - Google Patents

Method and apparatus for encoding and decoding an audio signal Download PDF

Info

Publication number
TWI305347B
TWI305347B TW95124070A TW95124070A TWI305347B TW I305347 B TWI305347 B TW I305347B TW 95124070 A TW95124070 A TW 95124070A TW 95124070 A TW95124070 A TW 95124070A TW I305347 B TWI305347 B TW I305347B
Authority
TW
Taiwan
Prior art keywords
signal
audio signal
information
header
audio
Prior art date
Application number
TW95124070A
Other languages
Chinese (zh)
Other versions
TW200707406A (en
Inventor
Hee Suk Pang
Hyen O Oh
Dong Soo Kim
Jae Hyun Lin
Yang Won Jung
Original Assignee
Lg Electronics Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from KR1020060056480A external-priority patent/KR20070003574A/en
Application filed by Lg Electronics Inc filed Critical Lg Electronics Inc
Publication of TW200707406A publication Critical patent/TW200707406A/en
Application granted granted Critical
Publication of TWI305347B publication Critical patent/TWI305347B/en

Links

Landscapes

  • Time-Division Multiplex Systems (AREA)
  • Stereophonic System (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)

Description

1305347 .九'發明說明: 【發明所屬之技術領域】 本發明係關於一種音頻訊號處理技術,特別係關於一種音頻 訊號之編碼及解碼裝置及其方法。 、 【先前技術】 • 通常,音頻訊號編碼裝置係壓縮一音頻訊號至一單聲道 ‘ (mono)或身歷聲(stereo)類型之下混(d〇wnmix)訊號,而非 鲁壓縮多頻道音頻訊號之各個頻道。音頻訊號編碼裝置傳輸壓縮之 下混訊號連同一空間資訊訊號(或者辅助資料訊號)至一解碼袭 置,或者儲存此壓縮下混訊號及此空間資訊訊號於儲存媒介中。 這種况下,於下混多頻道音頻訊號中提取之空間資訊气穿 係用於自壓縮下混訊號復原初始多頻道音頻訊號。 空間資訊訊號包含一標頭(header)及空間資訊。並且,組離 資訊係包含於此標頭中。標頭為用以解釋空間資訊之資訊。 ® 音頻訊號解碼裝置利用包含於標頭中之組態資訊解碼空間資 • 訊。包含於標頭中之組態資訊被傳輸至一解碼裝置,或者連同空 間資訊一同儲存於儲存媒介中。 音頻訊號編碼裝置將一編碼下混訊號及此空間資訊訊號一起 多工(multiplex)為一位元流形式,並接著傳輸此多工訊號至解碼 裝置。由於組態資訊通常為不變的,因此包含有組態資訊之標頭 被插入位元流中一次。既然初始插入音頻訊號中一次之紐態資訊 被傳輸,因此假如自一任意時間點複製音頻訊號時,由於不存在 1305347 、組態資訊,則音頻訊號解碼裝置在解碼空間資訊時存在問題。也 就是§兒,假如在廣播、Ik選視訊(video on demand)或類似情況 下,由於音頻訊號經由制者之請求自―特定時間點複製而不是 自初始部複製,目此無法使用包含於音頻訊號中傳輪之組態資 訊。由此,不能夠解碼空間資訊。 【發明内容】 « 鲁 鑒於以上的問題,本發明的主要目的在於提供-種音頻訊號 之編瑪及解碼方法及其裝置,透過令標頭選擇性地包含於空間資 訊訊號的框中以解碼音頻訊號。 本發明之另-目的在於提供—種音頻訊號之編碼及解碼方法 及其裝置’其透錢複數個標頭包含於空間資訊訊射進而即使 音頻訊號自隨機點複製也能夠解碼音頻訊號。 , @此’為耻述目的’本發明所揭露之-種音頻訊號解碼方 •法包3以下步驟.接收包含有音麵述符之音頻訊號,藉由音頻 描述符識腕含下混訊號和空間資訊訊號之音頻訊號,及藉由空 ,間資誠號賴下混鶴為多頻道訊號,其中空㈣訊訊號包含 、標頭,每個標頭均預先設置具有時間或空間間隔。 【實施方式】 有關本發_槪與實作,統合圖式作最佳實施例詳細說 明如下。 為了理解本發明’於音頻職解碼裝置及方法之前,首先對 1305347 '、、^之、扁辦置及方法作出轉。然㊉,本發明之解瑪裝置 、、二不局限於下文之解碼裝置及其方法。並且,本發明適用 ;利用工間貝5fl產生多頻道之音頻柄方案,以及網(碰那 1/2_層110及進階音頻編碼(AAC)等。 立「弟1圖」為本發明實施例之音頻訊號之構造示意圖,其中 曰頻訊號自3触魏魏置雜至音細贿碼裝置。 明參考「第1圖」,音頻訊號包含音頻描述符101、下混訊號 103以及空間資訊訊號105。 假如使用編碼方案㈣驗顧或_物之音賴號,則音 頻訊號可以包含_她及音頻描述符而和下混訊號ι〇3。本 ^明:以包含空間資訊訊號耶做為輔助資料。為了使音頻訊號 解碼裝置無需分析音頻訊號而瞭解音狀編碼解碼^⑺㈣之 基本貧訊’音敏號可鱗顯地包含音頻描鱗而。音頻描述 由_碼必須之少量_她成,例如傳輸音頻訊號 =解、縣錄、龍聽之取樣_、細#前使用編碼 #石馬益之識別碼及其它類似物等等。 音頻訊號解瑪裝置能夠利用音頻描述符朗瞭解音頻訊號使 用之心_||之_。特观,制音_麟1G1,音頻訊號 解碼妓關_減之音親綠妓彻0 及下混訊號㈣原的多頻道訊號。這時,_=1 二_繞(__d)減瓣賴。糾三維職技術,透過 1305347 或兩個頻道可聽見具有空間資訊訊號ι〇5與下混訊號⑻相結 合之音頻訊號。 曰頻描述付101之定位係獨立於音頻訊號所包含之下混訊號 或空間資訊訊號1〇5,例如,音頻描述符則位於代表音頻訊 號之一獨立區域中。 如果標頭沒有提供至下混訊號103,音頻訊號解碼裝置則能夠 利用音頻描述符而來解妈下混訊號腿。 下%訊號103為下混多頻道產生的訊號。下混訊號m能夠 產生自曰頻錢解碼裝置(圖未示)所包含之下混單元(圖未示), 或者由人工產生。 、下混訊號103可被分類為包含空間資訊訊號1〇5的一種情 況,或者不包含標頭的一種情況。 如果下混訊號1G3包含標頭,標酬藉由-框單元包含於每 「框中。如果下混訊號1〇3不包含標頭,如上文所描述,下混訊 諕_ _透過一音頻訊號解碼裝置使用音頻描述符則進行解 碼。下混訊號1〇3或者具有每一框均包含標頭之形式,或者具有 不包含標頭之形式。並且,下混訊號1G3以相同方式包含於二音 頻訊號中,直至内容結束。 曰 空間資訊訊號105也可以分類為包含有標頭及空間資訊的一 種情況,狀健含找資财包含標__種軌。空間資訊 訊號K)5之標頭區別於下混訊號1(B之標頭,這是由於其不必要 1305347 等同地插入各個框中。特別地,空間資訊訊號105能夠一起使用 包3有標碩之框及不包含標頭之框。包含於空間資訊訊號105之 標頭中的大多數資訊為組態資訊,其透過解釋空間資訊來解碼空 間資訊。 、 「第2圖」為本發明另一實施例之音頻訊號之構造示意圖, • 其中此音頻訊號自一音頻訊號編碼裝置傳輸至一音頻訊號解碼裝 置。 5奢參考「第2圖」,音頻訊號包含下混訊號103及空間資訊訊 號105。並且’音頻訊號以框排列之基本位元流(ES)形式存在。 各個下混訊號103及空間資訊訊號1〇5以獨立基本位元流形 式偶爾傳輸至音頻訊號解碼裝置。並且如「第2圖」顯示,下混 訊號1〇3及空間資訊信號1〇5能夠結合為一個基本位元流形式, 以傳輸至音頻訊號解碼裝置。 Φ 饭如下混訊號103及空間資訊訊號105結合為-基本位元流 形式且傳輸至音頻訊波解碼裝置資訊訊號挪能夠包 '含於下混訊號103之部份辅助資料或附加資料(額外資料)之位 , 置。 並且,音頻訊號可包含訊號識別資訊,以顯示空間資訊訊號 105是否與下混訊號1〇3結合。 空間資訊訊號105之框可分類為包含有標頭201及空間資訊 203的-種情況’以及僅包含空間資訊2〇3的一種情況。特別地, 1305347 空間資訊訊號105能夠一起使用包含標頭2〇1之框以及不包含標 頭201之框。 在本發明中,標頭201至少被插入空間資訊訊號1〇5中一次。 特別地,音頻訊號編碼裝置可以將標頭2〇1插入空間資訊訊號1〇5 之各每一框中,週期性地將標頭2〇1插入空間資訊訊號1〇5的框 的各個固定_巾’或者_雛地將_遍插人空間資訊訊 號105的框的每一任意間隔中。 音頻訊號可包含用以指示標頭2〇1是否包含於框中的資訊。 如果標頭2〇1係包含於空間資訊訊號1〇5巾,音頻訊號解碼 裝置則自標頭2Q1中提取組態資訊應,並接著依照此組態資訊 ⑽解碼於標頭201之後傳輸之空間資訊2〇3。由於標頭為遂 過轉空間資訊203用來解碼之資訊,因此標頭加則於音頻訊 號傳輸之較早階段被傳輸。 # 如果標頭2〇1不包含於空間資訊訊號1〇5中,音頻訊號解石馬 裝置則利用較早階段傳輸之才_2〇1來解碼空間資訊挪。 :如果標頭逝在音頻訊麵音頻編碼裝置傳輸至音頻解碼菜 =丢失或者如糾傳輸之音賴⑽其中間部份 ,料訊鱗碼錢财能夠使用先 中之先前標頭201,然後音頻訊號解碼裝 中2〇1。讀情況下,音頻訊號解碼震置自標頭· I貝訊2〇5,其中此標頭2〇1係區別於最初插入音頻訊號 置能夠利用提取之組態資 10 1305347 訊205解碼此音頻訊號。這時,自插入音頻訊號中之標頭2〇1提 取之組態資訊205可以等同於或區別於先前之組態資訊2 〇 5,其中 此先别組癌資訊205提取自較早階段傳輸之標頭。 如果標頭201為變化的,組態資訊2〇5可提取自一新的標頭 201 ’提取的組態資訊205被解碼,然後於標頭2〇1之後傳輸之空 間資訊203被解碼。如果標頭201為不變的,則需要判斷新標頭 201是否與之前傳輸之舊標頭201相同。如果上述兩個標頭2〇1 係互相不同的,則能夠偵測音頻訊號傳輸路徑上之音頻訊號中出 現了錯誤。 自空間資訊訊號105之標頭201提取的組態資訊205為用於 解釋空間資訊203之資訊。 空間資訊訊號105能夠包含下面實訊(下文中稱作為時間對 位資訊(time align information)),即用以在透過音頻訊號解碼裝 置利用下混訊號103及空間資訊訊號1〇5產生多頻道的過程中區 別兩個訊號的時序延遲偏差。 可透過一解多工單元(demultiplexing unit)(圖未示)分析自 音頻訊號編碼裝置傳輸至音頻訊號解碼裝置之音頻訊號,接著將 音頻訊號分離至下混訊號103及空間資訊訊號105中。 由解多工單元分離之下混訊號1〇3被解碼。被解碼之下混訊 號103利用空間資訊訊號105產生一多頻道。在透過結合下混訊 號103及空間資訊訊號105產生多頻道時,音頻訊號解碼裝置能 11 1305347 -夠藉由包含於組態資訊205中之時間對位資訊(圖未示)調整兩 .訊號之同步,並結合兩訊號之起點位置等,其中組態資訊2〇5係 提取自空間資訊訊號105之標頭201。 時間槽(time slot)之位置資訊207係包含於空間資訊訊號ι〇5 之空間資訊203中,其中一參數將應用於此時間槽。作為一空間 ••參數(空間尾接(spatial·)),其中存在有代表音頻訊號間之能 量偏差之頻道級別(channel level)偏差、代表音頻訊號之緊密性 及相似度之頻道間關係(interciiannel correlations)和代表利用其 它訊號預測音頻訊雜之係數之頻道預測係數(channd脾細⑽ coeffic她)。下文中,各個空間尾接或空間尾接束將被稱為夕來 數' " 假如N個參數存在於空間資訊訊號1〇5所包含之框中,則n 個參數被分臟麟框的特定時間槽位置。如果代表參數即將應 φ甩於框中包含之其中一個時間槽的資訊被命名為時間槽之位置資 訊207,則音頻訊號解碼裝置利用將應用參數之時間槽之位置資訊 ,207解碼空間資訊2〇3。這時,此參數係包含於空間資訊2〇3中。 , 第3圖」為本發明實施例之音頻訊號解碼裝置之方塊圖。 凊芩考「第3圖」’依照本發明之實施例之音頻訊號解碼裝置 係包含接收單元301和提取單元3〇3。 音頻訊號解碼裝置之接收單元3〇1藉由一輸入終端而透過 音頻訊號解碼裝置來接收以基本位元流形式傳輸之音頻訊號。 12 1305347 由音頻訊號解碼裝置接收之音頻訊號包含一音頻描述符1〇1 與一下混訊號103,並可以更包含空間資訊訊號105作為輔助資料 或附加資料(延伸資料)。 音頻訊號解碼裝置之提取單元303從接收到的音頻訊號包含 之標頭201提取組態資訊2〇5,並隨後藉由一輸出終端〇xjti輸出 提取之組態資訊205。 音頻訊號可包含標頭識別資訊,用以識別標頭2〇1是否包含 於^^"~才匡中〇 音頻訊號解碼裝置藉由包含於音頻訊號中之標頭識別資訊識 別標頭201是否包含於此框中。如果標頭2〇1包含於其中,則音 頻訊號解碼裝置自標頭2〇1提取組態資訊2〇5。在本發明中,至少 一標頭201包含於空間資訊訊號1〇5中。 「第4圖」為本發明另一實施例之音頻訊號解碼裝置之方塊 圖。 明蒼考第4圖」’本發明之另一實施例之音頻訊號解碼裝置 下包含魏單元30卜解多工單元、核心解碼單元4〇3、多頻 道生成單元405,空間資訊解碼單元·及提取單元3〇3。 曰頻。fl號解碼裝置之接收單元3〇1藉由一輸出終端腿接收 =音頻__裝置以位元流形式傳輸之音頻訊號。並且,接收 早7L 301發送接收到的音頻訊號至解多工單元彻。 解夕工早元401分離接收單亓狄、,, 又早兀301發迗之音頻訊號為一編碼 1305347 下混訊號103及一編碼空間資訊訊號1〇5 ^解多工單元4〇1傳輸自 位元流分離之編碼下混訊號1〇3至核心解碼單元4〇3,並傳輸自位 元流分離之編碼空間資訊訊號1〇5至提取單元3〇:3。 編碼下5孔戒103由核心解碼單元403解碼,然後傳輸至多 頻道生成單元405。編碼空間資訊訊號105包含標頭2〇1及空間資 訊 203。 如果標頭201包含於編碼空間資訊訊號1〇5中,提取單元3〇3 則從標頭201提取組態資訊205。提取單元303藉由音頻訊號包含 之標頭識別資訊區別標頭2〇1之存在。特別地,標頭識別資訊可 以表不標頭201是否包含於空間資訊訊號1〇5所包含之框中。標 頭識別資訊可以代表框的順序或者音頻訊號之位元序列,如果標 頭201包含於框中,則自標頭201提取之組態資訊2〇5包含於此 框順序或位元序列中。 如果藉由標頭識別資訊判斷標頭2〇1包含於框中,則提取單 兀303自包含於框中之標頭2〇1提取組態資訊2〇5。然後,解瑪提 取之組態資訊205。 空間資訊解碼單元407依照解碼之組態資訊205解碼包含於 框中之空間資訊203。 並且,多頻道生成單元405利用解碼下混訊號1〇3及解竭办 間貧訊203產生一多頻道訊號,然後藉由一輸出終端〇UT2輪出 產生多頻道訊號。 14 1305347 「第5圖」為本發明實施例之音頻訊號解碼方法之流程圖。 明苓考「第5圖」,音頻訊號解碼裝置接收空間資訊訊號1〇5, 其中空間魏峨1〇5係由音親號編碼裝置以位元流形式傳輸 (步驟501)。 如上文所述,空間資訊訊號1〇5可分為以獨立於下混訊號1〇3 之位元流傳輸的-麟況,或者結合下混訊號1G3 —起傳輸的一 種情況。 音頻訊號讀乡工單元係分賴接收之音頻訊號為編碼 下此訊5虎103及編碼空間資訊訊號1〇5。編碼空間資訊訊號1〇5 包含“頭201及空間資訊2〇3。如果標頭2〇1包含於空間資訊訊號 105之框中,則音頻訊號解碼裝置可識別標頭2〇1 (步驟5〇3)。 曰頻訊號解碼裝置從標頭2〇1提取組態資訊2〇5(步驟505)。 並且,音頻訊號解碼裝置利用提取之組態資訊205解碼空間 資訊203 (步驟507) 〇 「第6圖」為本發明另一實施例之音頻訊號解碼方法之流程 圖。 請參考「第6圖」’音頻訊號解碼裝置接收空間資訊訊號105, 其中空間資訊訊號105係由音頻訊號編碼裝置以位元流形式傳輸 (步驟501)。 如上文所述’空間資訊訊號105可分為以獨立於下混訊號1〇3 之位元流傳輸的一種情況,或者包含於下混訊號1〇3之輔助資料 15 1305347 或延伸資料巾—起傳輸的—種情況。 曰頻訊號之解多工單元仙分離接收之音頻訊號為編碼下混 。 及編碼二間資訊訊號105。編碼空間資訊訊號105包含標 頭201及二間:貝汛2〇3。音頻訊號解碼裝置判斷標頭是否包含 於框中(步驟601)。 如果祐頭201包含於框中,則音頻訊號解碼裝置識別標頭201 (步驟503 )。 然後音頻訊號解碼農置從標頭施提取组態資訊2〇5 (步驟 505)0 音頻訊號解碼裝置判斷自標頭2〇1提取之組態資訊2〇5是否 為從包含於空間資訊訊號1〇5中之第一標頭2()1提取的組態資訊 205 (步驟 603 )。 如果組怨資訊205提取自首先從音頻訊號提取的標頭2〇1,音 頻訊號解碼裝置則解碼組態資訊2〇5 (步驟611),並依照此解碼 組態資訊205解碼於組態資訊2〇5之後傳輸之空間 資訊203。 如果自音頻訊號提取之標頭2〇1不是首先從空間資訊訊號 105提取之標頭201 ’音頻訊號解碼裝置則判斷自標頭2〇1提取之 組態資訊205是否與自第一標頭2〇1提取之組態資訊2〇5相同(步 驟 605)。 如果此組態資訊205與提取自第一標頭201之組態資訊2〇5 相同,音頻訊號解碼裝置則利用此提取自標頭201之解碼組態資 16 1305347 訊205解碼空間資訊203。 如果提取的組態資訊205不同於從第一標頭2〇1提取之电能 資訊挪,音·號解絲觸音頻訊號從音継號編碼震置 至音頻訊麟碼裝置之傳輸雜上是㈣現錯誤(步驟6〇7)。 如果此組態資訊2〇5為可變的,那麼即使組態資訊2〇5不同 於自第-標頭20】提取之組態資訊施,上述錯誤也不會出現。因 此,音頻訊號解碼裝置更新標頭201為新的標頭2〇1 (步驟_)。 音頻訊號解碼裝置然後解碼提取自此更新標頭施之組態資訊 205 (步驟 611)。 ' 依照此解碼組態資訊205,音頻訊號解碼裳置解碼於此組態資 訊205之後傳輸之空間資訊203。 如果組恝貧訊205為不變的,且不同於提取自第一標頭2〇1 之組怨貧訊205,則表示錯誤會出現於音頻訊號傳輸路徑上。因 此,音頻訊號解鱗置去除包含於框中之空間資訊2(B,或者修正 空間貧訊203之錯誤,其中此空間資訊2〇3中包含有錯誤組態資 訊205 (步驟613 )。 「第7圖」為本發明又一實施例之音頻訊號解碼方法之流程 圖。 請參考「第7圖」,音頻訊號解碼裝置接收空間資訊訊號105, 其中空間貢訊訊號105係由音頻訊號編碼裝置以位元流形式傳輸 (步驟501)。 17 1305347 =。曰頻磁之解多工單元·分離所毅音頻峨為編媽下現 §就1〇3及解碼空間資訊訊號⑽。這時,即將參數表示之時間 之位置資訊207係包含於此空間資訊訊號 105 中。 ^ 音頻訊號解石馬裝置從空間資訊2〇3中提取時間槽 2〇7(步驟70])。 置貝訊 音頻訊緣i裝置彻提取之關槽之位置資訊,透過 :應财-參數之時職位置·此參數至對鱗間槽(步^ 「第8圖」為本發明實施例之麟表科間槽數量之位置次 蚊方法流_。表科_數量之位置資訊為分配肋表貝 時間槽之位置資訊2〇7之位元數量。 '、 —表示時間槽數量之位置資訊能夠透過以下步驟而發現,其中 弟一參數健詩此_槽:自此時_數量減去此參數數旦, 增加1至此減法結果,對此增加舞以2為底之對數,並庫用一 ㈣函數至此對數值。特別地,表示應用有第一參數之時間槽數旦 德置_賊過公式,娜吻崎得,射〃 k,,與Γ i分別表示時間槽數量及參數婁欠量。 、 假設’’ 1ST’為一自铁數,目丨|主_也 奴θ …數則表不應用有_产參數之時間样 數置之Μ育訊被表示為制有Ν,參數之時間槽之位置資^ 207。這時,應用有ν111參數之時間样 、 ]钇之位置資訊207能夠透過 下步驟發現,即將位於應用有Ν、數之時間槽及應用雜心 18 1305347 '數之時間槽之間的時間槽的數量加上由财參數應用之時間槽 • 的位置資訊,以及辦、,广, 曰力1至此增加值(步驟80J)。特別地,"+# 户數應用之時間槽之位置資訊可透過公式綱+《_+1獲得,其 中(N 1)係代表存在於應用有_产參數之時間槽及應用有# .參數之時間槽之間的時間槽數量。 * /曰果Ν >數應用之時間槽之位置資訊2〇7被發現,則能夠獲 • Λ數里之—間槽位置資訊,以表示應用有阶1)&參數之時間 2位置。特別地’透過自時間微量中減去應用於一框之參數數 里及應用有Ν翏數之時間槽的位置資訊,以及增加(而)至此減 法值’迫夠發現表示數量之時間槽位置資訊,以表示_广參數 應用之時間槽位置(步驟8〇3 )。特別地,透過公式 (g2(kl谓+1_胸))能约發現應用有(Ν+1)&參數之時間槽數量 之位置貝訊,其巾k”、〃 i〃及〃卿〃分別表示時間槽數量、 #參數數量及應用參數之時間槽之位置資訊2〇5。 如果:上述方式麟絲時職數量之位置flTL,則表示應 用有(N 1)參數之時間槽數量之位置資訊具有與",成反比之 .力配位凡數I。即代表應用有參數之時間槽數量之位置資訊為-依照夕N”的變化值。 第9圖」為本發明再—實施例之音頻訊號解碼方法之流程 圖。 曰頻訊處解碼裝置從音頻訊號編碼裝置接收音頻訊號(步驟 19 1305347 曰頻訊號匕S音頻描述符101、下混訊號103及空間資訊訊 號 105 〇 曰頻Λ號解碼|置提取音頻訊號包含之音頻描述符皿(步驟 9〇3)代表a頻編碼解碼器㈤⑷之—識別符係包含於此音頻 描述101中。 ' .…音頻訊號解碼襄置藉由音頻描述符1〇1識別音頻訊號,音頻 • 虎包3有下混_u 1()3及空㈣訊訊號應。特別地,音頻訊號 解碼裝置利用工間貪訊訊號1〇5區別傳輸之音頻訊號為用以產生 多頻道之訊號(步驟905)。 η並且θ頻訊5虎解碼裝置藉由空間資訊訊號廳哺換下混 訊號1〇3為—多頻道訊號。如上文所描述,標頭2〇1能夠以各個 預定間隔包含於空間資訊訊號105中。 工業應用 • 正如上文所述,本發明之音頻訊號編碼及解碼方法及其裝置 能夠使得_選擇性地包含於空間資訊訊號中。 亚且,如果複數個標頭包含於空間資訊訊號中,本發明之立 、頻織編碼及解财法及其裝置能_碼㈣魏,即使是透二 曰頻訊麟碼裝置自任意點複製此音娜號。 〜雖然本發明以前述之較佳實施例揭露如上,然其並非用以限 疋本發明^相像技藝者,在不麟本發明之精神和範圍 田可作—許之更動麵飾,因此本發明之專利保護範圍須視 20 1305347 本說明書所附之申請專利翻所界定者為準 【圖式簡單說明】 第1圖為本㈣實施例之音頻訊號之構造示意圖; f2圖為本發明另—實施例之音頻訊號之構造示意圖; ' $3圖為本發明實施例之音頻訊號解碼裝置之方掩圖; .第4®為本剌另—實施例之音頻訊號解碼裝置之方塊圖; _ 第5圖為本發明貫施例之音頻訊號解碼方法之流程圖; 第6圖為本發明另一實施例之音頻訊號解碼方法之流程圖; ,7圖為本發明又一實施例之音頻訊號解碼方法之流程圖; 第8圖為本發明實施例之獲得表示數量之位置資訊之方法流 程圖;以及 第9圖為本發明再一實施例之音頻訊號解碼方法之流程圖 【主要元件符號說明】 101 103 105 201 203 205 207 301 音頻描述符 下混訊號 空間資訊訊號 標頭 空間資訊 組態資訊 時間槽之位置資訊 接收單元 21 1305347 303 提取單元 401 解多工單元 403 核心解碼單元 405 多頻道生成單元 407 空間資訊解碼單元 INI 輸入終端 IN2 輸入終端 OUT1 輸出終端 OUT2 輸出終端 步驟501 接收空間資訊訊號 步驟503 識別標頭 步驟505 .提取組態資訊 步驟507 解碼空間資訊 步驟601 存在標頭? 步驟603 組態資訊取自第一標頭? 步驟605 態資訊? 提取之組態資訊是否等同於自第一標頭提取之組 步驟607 偵測到錯誤出現? 步驟609 更新標頭 步驟611 解碼組態育訊 步驟613 去除空間資訊或者修正空間資訊之錯誤 22 1305347 步驟701 提取時間槽之位置資訊 步驟703 應用參數至時間槽 步驟801 獲得應用有Nth參數之時間槽之位置資訊 步驟803 置資訊 獲得表示應用有(N+l)th參數之時間槽數量之位 步驟901 接收音頻訊號 步驟903 提取音頻描述符 步驟905 藉由空間資訊訊號識別音頻訊號是否產生多頻道 231305347. 九的发明说明: Technical Field of the Invention The present invention relates to an audio signal processing technique, and more particularly to an audio signal encoding and decoding apparatus and method thereof. [Prior Art] • Generally, an audio signal encoding device compresses an audio signal to a mono (mono) or stereo type (d〇wnmix) signal instead of a compact multi-channel audio. Each channel of the signal. The audio signal encoding device transmits the compressed downmix signal to the same spatial information signal (or auxiliary data signal) to a decoding attack, or stores the compressed downmix signal and the spatial information signal in the storage medium. In this case, the spatial information extracted in the downmix multi-channel audio signal is used to restore the initial multi-channel audio signal from the compressed downmix signal. The spatial information signal contains a header and spatial information. Also, the group information is included in this header. The header is information used to explain spatial information. The ® audio signal decoding device decodes the spatial information using the configuration information contained in the header. The configuration information contained in the header is transmitted to a decoding device or stored in a storage medium along with the spatial information. The audio signal encoding device multiplexes a coded downmix signal and the spatial information signal into a one-bit stream form, and then transmits the multiplex signal to the decoding device. Since the configuration information is usually constant, the header containing the configuration information is inserted into the bit stream once. Since the information of the initial state of the initial insertion of the audio signal is transmitted, if the audio signal is copied from an arbitrary time point, since the 1305347 and the configuration information do not exist, the audio signal decoding apparatus has a problem in decoding the spatial information. That is, if it is in the broadcast, video on demand or the like, since the audio signal is copied from the specific time point by the maker's request instead of being copied from the initial part, it cannot be used in the audio. The configuration information of the transmission wheel in the signal. Thus, spatial information cannot be decoded. SUMMARY OF THE INVENTION « In view of the above problems, the main object of the present invention is to provide a method and apparatus for encoding and decoding audio signals, and to decode audio by selectively including a header in a frame of a spatial information signal. Signal. Another object of the present invention is to provide a method and apparatus for encoding and decoding audio signals, which have a plurality of headers included in the spatial information transmission and which are capable of decoding audio signals even if the audio signals are copied from random points. , @本'for the purpose of shame", the audio signal decoding method disclosed in the present invention is the following steps: receiving an audio signal containing a voiced facet, and the audio descriptor includes a downmix signal and The audio signal of the spatial information signal, and the multi-channel signal by the vacant space, the space (4) signal contains and the header, each header is preset with a time or space interval. [Embodiment] The present invention is described in detail with reference to the preferred embodiment of the present invention. In order to understand the present invention, prior to the audio job decoding apparatus and method, first, the transfer of the 1305347', the ^, the flat and the method is made. However, the apparatus and apparatus of the present invention are not limited to the following decoding apparatus and method. Furthermore, the present invention is applicable to an audio-handle scheme for generating a multi-channel by using the studio 5fl, and a network (touching the 1/2_layer 110 and advanced audio coding (AAC), etc. The schematic diagram of the structure of the audio signal, wherein the frequency signal is from the 3 touch Wei Wei to the sound bribe code device. Referring to "1", the audio signal includes the audio descriptor 101, the downmix signal 103 and the spatial information signal. 105. If you use the coding scheme (4) to check or _ the voice of the object, the audio signal can include _ her and the audio descriptor and the downmix signal ι〇3. This ^: with the aid of the spatial information signal In order to make the audio signal decoding device not need to analyze the audio signal, the audio-visual codec can be understood. (7) (4) The basic poor sense 'sound sensitive number can be included in the audio scale. The audio description is made up of a small amount of _ code. For example, the transmission audio signal = solution, county record, dragon listening sample _, fine #前前编码#Shi Mayi's identification code and other analogs, etc. The audio signal decoding device can use the audio descriptor to understand the audio signal usage. heart of_ _ _. Special, vocal _ _ 1G1, audio signal decoding _ _ 减 亲 亲 亲 及 及 及 及 及 及 及 及 及 及 及 及 及 及 及 及 及 及 及 及 及 及 及 及 及 及 及 及 及 及 及 及 及 及 及 及 及 及 及 及 及 及The 3D job technology can be used to hear the audio signal with the spatial information signal ι〇5 and the downmix signal (8) through 1305347 or two channels. The frequency description of the 101 is independent of the audio signal. The downmix signal or the spatial information signal 1〇5, for example, the audio descriptor is located in a separate area representing one of the audio signals. If the header is not provided to the downmix signal 103, the audio signal decoding device can utilize the audio descriptor The lower part signal 103 is a signal generated by the downmix multi-channel. The downmix signal m can generate a sub-mixing unit (not shown) included in the self-frequency decoding device (not shown), or Manually generated. The downmix signal 103 can be classified into a case including the spatial information signal 1〇5, or a case where the header is not included. If the downmix signal 1G3 includes a header, the reward is included by the -frame unit In every box If the downmix signal 1〇3 does not contain a header, as described above, the downmix 諕__ is decoded by an audio signal decoding device using an audio descriptor. The downmix signal 1〇3 or has each frame included The form of the header, or has the form of not including the header. Moreover, the downmix signal 1G3 is included in the second audio signal in the same manner until the content ends. The spatial information signal 105 can also be classified into the header and space information. In one case, the health of the resource contains the standard __ type of track. The header of the spatial information signal K) 5 is different from the header of the downmix signal 1 (B, because it is unnecessary to insert each frame equally into 1305347) In particular, the spatial information signal 105 can be used together with the box 3 and the frame without the header. Most of the information contained in the header of spatial information signal 105 is configuration information, which interprets spatial information to decode spatial information. 2 is a schematic structural diagram of an audio signal according to another embodiment of the present invention, wherein the audio signal is transmitted from an audio signal encoding device to an audio signal decoding device. 5 Luxury reference "Figure 2", the audio signal includes a downmix signal 103 and a spatial information signal 105. And the 'audio signal' exists in the form of a basic bit stream (ES) arranged in a frame. Each downmix signal 103 and spatial information signal 1〇5 are occasionally transmitted to the audio signal decoding device in a separate basic bit stream. And as shown in "Fig. 2", the downmix signal 1〇3 and the spatial information signal 1〇5 can be combined into one basic bit stream form for transmission to the audio signal decoding apparatus. Φ The following mixing number 103 and spatial information signal 105 are combined into a basic bit stream form and transmitted to the audio signal decoding device. The information signal can be packaged as part of the auxiliary data or additional data contained in the downmix signal 103 (additional data) ), position. Moreover, the audio signal may include signal identification information to indicate whether the spatial information signal 105 is combined with the downmix signal 1〇3. The frame of the spatial information signal 105 can be classified into a case where the header 201 and the space information 203 are included, and a case where only the spatial information 2〇3 is included. In particular, the 1305347 spatial information signal 105 can use a box containing the header 2〇1 and a frame containing no header 201. In the present invention, the header 201 is inserted into the spatial information signal 1〇5 at least once. In particular, the audio signal encoding apparatus can insert the header 2〇1 into each of the spatial information signals 1〇5, and periodically insert the header 2〇1 into each fixed frame of the spatial information signal 1〇5. The towel 'or _ nesting _ is inserted into every arbitrary interval of the frame of the spatial information signal 105. The audio signal may include information indicating whether the header 2〇1 is included in the frame. If the header 2〇1 is included in the spatial information signal 1〇5 towel, the audio signal decoding device extracts the configuration information from the header 2Q1, and then decodes the space transmitted after the header 201 according to the configuration information (10). Information 2〇3. Since the header is the information used by the over-space information 203 for decoding, the header addition is transmitted at an earlier stage of the audio signal transmission. # If the header 2〇1 is not included in the spatial information signal 1〇5, the audio signal solution device uses the transmission of the earlier stage to decode the spatial information. : If the header is lost in the audio interface audio encoding device to the audio decoding dish = lost or the middle part of the sound transmission (10), the data scale money can use the previous header 201, then the audio The signal decoding device is 2〇1. In the case of reading, the audio signal decoding is set from the header · I bei 2 〇 5, wherein the header 2 〇 1 is different from the initial insertion of the audio signal and can be decoded by the extracted configuration 10 10305347 205 . At this time, the configuration information 205 extracted from the header 2〇1 in the inserted audio signal may be equal to or different from the previous configuration information 2 〇5, wherein the first group cancer information 205 is extracted from the label transmitted at an earlier stage. head. If the header 201 is changed, the configuration information 2〇5 can extract the configuration information 205 extracted from a new header 201' and the spatial information 203 transmitted after the header 2〇1 is decoded. If the header 201 is unchanged, it is necessary to determine if the new header 201 is identical to the old header 201 previously transmitted. If the two headers 2〇1 are different from each other, it is possible to detect an error in the audio signal on the audio signal transmission path. The configuration information 205 extracted from the header 201 of the spatial information signal 105 is information for interpreting the spatial information 203. The spatial information signal 105 can include the following actual information (hereinafter referred to as time align information), that is, the multi-channel is generated by the audio signal decoding device using the downmix signal 103 and the spatial information signal 1〇5. The timing delay deviation of the two signals is distinguished in the process. The audio signal transmitted from the audio signal encoding device to the audio signal decoding device can be analyzed by a demultiplexing unit (not shown), and then the audio signal is separated into the downmix signal 103 and the spatial information signal 105. The mixed signal 1〇3 is decoded by the demultiplexing unit. The decoded mixed signal 103 uses the spatial information signal 105 to generate a multi-channel. When the multi-channel is generated by combining the downmix signal 103 and the spatial information signal 105, the audio signal decoding device can 11 1305347 - can adjust the two signals by the time alignment information (not shown) included in the configuration information 205. Synchronization, combined with the starting position of the two signals, etc., wherein the configuration information 2〇5 is extracted from the header 201 of the spatial information signal 105. The location information 207 of the time slot is included in the spatial information 203 of the spatial information signal ι〇5, and one parameter will be applied to this time slot. As a space ••parameter (spatial·), there are channel level deviations representing energy deviations between audio signals, inter-channel relationships representing the closeness and similarity of audio signals (interciiannel) Correlations) and channel prediction coefficients (channd spleen (10) coeffic her) that represent the coefficients of the audio signal using other signals. In the following, each space tail or space tail bundle will be referred to as the number of eves. If the N parameters exist in the frame contained in the spatial information signal 1〇5, then n parameters are divided into dirty boxes. Specific time slot location. If the information indicating that one of the time slots is included in the box is named as the position information 207 of the time slot, the audio signal decoding device decodes the spatial information by using the position information of the time slot in which the parameter is applied. 3. At this time, this parameter is included in the spatial information 2〇3. Figure 3 is a block diagram of an audio signal decoding apparatus according to an embodiment of the present invention. Referring to "Fig. 3", an audio signal decoding apparatus according to an embodiment of the present invention includes a receiving unit 301 and an extracting unit 3〇3. The receiving unit 301 of the audio signal decoding device receives the audio signal transmitted in the form of a basic bit stream through the audio signal decoding device through an input terminal. 12 1305347 The audio signal received by the audio signal decoding device includes an audio descriptor 1〇1 and a next mixed signal 103, and may further include a spatial information signal 105 as auxiliary data or additional data (extended data). The extracting unit 303 of the audio signal decoding device extracts the configuration information 2〇5 from the header 201 of the received audio signal, and then outputs the extracted configuration information 205 through an output terminal 〇xjti. The audio signal may include header identification information for identifying whether the header 2〇1 is included in the ^^"~ 匡 〇 〇 audio signal decoding device identifies whether the header 201 is identified by the header identification information included in the audio signal Included in this box. If the header 2〇1 is included therein, the audio signal decoding apparatus extracts the configuration information 2〇5 from the header 2〇1. In the present invention, at least one header 201 is included in the spatial information signal 1〇5. Fig. 4 is a block diagram of an audio signal decoding apparatus according to another embodiment of the present invention. 4: "The audio signal decoding apparatus of another embodiment of the present invention includes a Wei unit 30 demultiplexing unit, a core decoding unit 4〇3, a multichannel generating unit 405, a spatial information decoding unit, and The extraction unit 3〇3.曰 frequency. The receiving unit 〇1 of the fl-number decoding device receives the audio signal transmitted by the audio__device in the form of a bit stream by an output terminal leg. Moreover, the received early 7L 301 transmits the received audio signal to the demultiplexing unit. The XI XI MU 401 separates and receives the 亓 、 、,, and the 301 迗 迗 迗 迗 音频 130 130 130 130 130 130 130 130 130 130 130 130 130 130 130 130 130 130 130 130 130 130 130 130 130 130 130 130 130 130 130 130 130 130 130 130 130 The bit stream separates the encoded downmix signal 1〇3 to the core decoding unit 4〇3, and transmits the coded spatial information signal 1〇5 separated from the bit stream to the extracting unit 3〇:3. The coded lower 5 or ring 103 is decoded by the core decoding unit 403 and then transmitted to the multichannel generating unit 405. The code space information signal 105 includes a header 2〇1 and a spatial resource 203. If the header 201 is included in the code space information signal 1〇5, the extraction unit 3〇3 extracts the configuration information 205 from the header 201. The extracting unit 303 distinguishes the existence of the header 2〇1 by the header identifying information included in the audio signal. In particular, the header identification information can indicate whether the header 201 is included in the frame included in the spatial information signal 1〇5. The header identification information may represent the order of the frames or the sequence of bits of the audio signal. If the header 201 is included in the frame, the configuration information 2〇5 extracted from the header 201 is included in the frame order or the sequence of bits. If it is judged by the header identification information that the header 2〇1 is included in the frame, the extraction unit 303 extracts the configuration information 2〇5 from the header 2〇1 included in the frame. Then, the configuration information 205 of the solution is extracted. The spatial information decoding unit 407 decodes the spatial information 203 contained in the frame in accordance with the decoded configuration information 205. Moreover, the multi-channel generation unit 405 generates a multi-channel signal by decoding the downmix signal 1〇3 and the depletion of the inter-office message 203, and then generating a multi-channel signal by rotating the output terminal 〇UT2. 14 1305347 "FIG. 5" is a flowchart of an audio signal decoding method according to an embodiment of the present invention. In the "Picture 5", the audio signal decoding apparatus receives the spatial information signal 1〇5, wherein the space Wei峨1〇5 is transmitted by the phono encoding apparatus in the form of a bit stream (step 501). As described above, the spatial information signal 1〇5 can be divided into a case where the bit stream is transmitted independently of the downmix signal 1〇3, or a case where the downmix signal 1G3 is transmitted together. The audio signal reading the rural unit is based on the received audio signal. The code is 5 Tiger 103 and the code space information signal 1〇5. The code space information signal 1〇5 includes “head 201 and space information 2〇3. If the header 2〇1 is included in the frame of the spatial information signal 105, the audio signal decoding device can recognize the header 2〇1 (step 5〇 3) The 曰 frequency signal decoding device extracts the configuration information 2〇5 from the header 2〇1 (step 505). Further, the audio signal decoding device decodes the spatial information 203 by using the extracted configuration information 205 (step 507) 〇 6 is a flow chart of an audio signal decoding method according to another embodiment of the present invention. Please refer to the "Fig. 6" audio signal decoding device for receiving the spatial information signal 105, wherein the spatial information signal 105 is transmitted by the audio signal encoding device in the form of a bit stream (step 501). As described above, the 'spatial information signal 105 can be divided into a case where the bit stream is transmitted independently of the downmix signal 1〇3, or the auxiliary data 15 1305347 or the extended data towel included in the downmix signal 1〇3. The case of transmission. The multiplexed signal is separated from the received audio signal by the code downmix. And encoding two information signals 105. The coded space information signal 105 includes a header 201 and two spaces: Bellow 2〇3. The audio signal decoding means judges whether or not the header is included in the frame (step 601). If the head 201 is included in the frame, the audio signal decoding apparatus recognizes the header 201 (step 503). Then the audio signal decoding farmer extracts the configuration information from the header 2〇5 (step 505). 0 The audio signal decoding device determines whether the configuration information extracted from the header 2〇1 is 2 from the spatial information signal 1 The configuration information 205 extracted by the first header 2() 1 of 〇5 (step 603). If the grievance information 205 is extracted from the header 2〇1 first extracted from the audio signal, the audio signal decoding device decodes the configuration information 2〇5 (step 611), and decodes the configuration information 205 according to the decoding configuration information 205. Space information 203 transmitted after 〇5. If the header 2〇1 extracted from the audio signal is not the header 201' audio signal decoding device first extracted from the spatial information signal 105, it is determined whether the configuration information 205 extracted from the header 2〇1 is from the first header 2 The configuration information extracted by 〇1 is the same as 2〇5 (step 605). If the configuration information 205 is the same as the configuration information 2〇5 extracted from the first header 201, the audio signal decoding apparatus uses the decoding configuration information extracted from the header 201 to decode the spatial information 203. If the extracted configuration information 205 is different from the power information extracted from the first header 2〇1, the transmission of the audio signal from the audio code to the audio device is (4) Error now (step 6〇7). If this configuration information 2〇5 is variable, the above error will not occur even if the configuration information 2〇5 is different from the configuration information extracted from the first-header 20]. Therefore, the audio signal decoding apparatus updates the header 201 to a new header 2〇1 (step_). The audio signal decoding device then decodes the configuration information 205 extracted from this update header (step 611). According to this decoding configuration information 205, the audio signal decoding field decodes the spatial information 203 transmitted after the configuration information 205. If the group 205 is constant and different from the group 205 extracted from the first header 2 〇 1, it indicates that an error will occur on the audio signal transmission path. Therefore, the audio signal de-scaling removes the spatial information 2 (B, or corrects the error of the spatial 203, which is included in the frame, wherein the spatial information 2〇3 contains the error configuration information 205 (step 613). 7 is a flowchart of an audio signal decoding method according to still another embodiment of the present invention. Referring to FIG. 7, the audio signal decoding apparatus receives a spatial information signal 105, wherein the spatial tribute signal 105 is composed of an audio signal encoding apparatus. Bit stream form transmission (step 501). 17 1305347 =. 曰 frequency magnetic solution multiplex unit · separation of the sound of the 峨 编 编 下 § § § § 1 〇 3 and decoding spatial information signal (10). At this time, the parameter is represented The location information 207 of the time is included in the spatial information signal 105. ^ The audio signal solution device extracts the time slot 2〇7 from the spatial information 2〇3 (step 70)). The position information of the clearing groove is obtained through: the position of the position of the financial account - the parameter to the interval between the scales (step ^ "8th figure" is the position of the number of slots in the branch table of the embodiment of the present invention Mosquito method flow _. Table _ number The location information is the number of bits of the position information of the ribbed time slot. 2, and the position information indicating the number of time slots can be found through the following steps, wherein the parameter one is _ slot: since this time _ number minus this parameter for a few days, increase 1 to the result of this subtraction, increase the base 2 logarithm of the dance, and use the (4) function to the logarithm. In particular, the time slot of the application with the first parameter is several Deji _ thief over the formula, Na Kissaki, shot 〃 k,, and Γ i respectively indicate the number of time slots and parameters 娄 。., assuming ''1ST' is a self-iron number, witness 主 _ _ slave The θ ... number is not applied to the time sample of the _ production parameter. The Μ Μ Μ 被 Μ Μ Μ Μ Μ Μ Ν Ν Ν Ν Ν Ν Ν Ν Ν Ν Ν Ν Ν Ν Ν 参数 参数 参数 参数 参数 参数 参数 参数 参数 参数 参数 参数 参数 参数 参数 参数 参数 参数The location information 207 can be found through the following steps, that is, the number of time slots between the application time slot and the time slot of the application miscellaneous 18 1305347's plus the time slot of the time slot applied by the financial parameter. , and do, and wide, 曰力1 to this added value (Step 80J). In particular, the position information of the time slot of the "+# household application can be obtained by the formula + "_+1, where (N 1) represents the time slot in which the application has the parameter of production and The number of time slots between the time slots of the #. parameter is applied. * /曰果Ν > The number of time slots of the application time slot 2〇7 is found, then the information about the position of the slot can be obtained. To indicate the time 2 position of the application with the order 1) & parameter, in particular, by subtracting the position information of the time slot applied to the frame and applying the time slot of the application from the time trace, and increasing (and At this point, the subtraction value is enough to find the time slot position information indicating the number to indicate the time slot position of the _ wide parameter application (step 8〇3). In particular, through the formula (g2 (kl is said to be +1_thir)), it is possible to find the position of the number of time slots to which the (Ν+1)& parameter is applied, and the towel k", 〃i〃, and 〃卿〃 Indicates the number of time slots, the number of # parameters, and the position information of the time slot of the application parameters. 2 如果5. If: the position of the number of positions in the above method is flTL, it indicates the position of the number of time slots to which the (N 1) parameter is applied. The information has an inverse relationship with ", and the force is assigned to the number I. That is, the position information indicating the number of time slots in which the parameter is applied is - the change value according to the evening N". Fig. 9 is a flow chart showing an audio signal decoding method according to still another embodiment of the present invention.曰The frequency decoding device receives the audio signal from the audio signal encoding device (step 19 1305347 曰frequency signal 匕S audio descriptor 101, downmix signal 103 and spatial information signal 105 Λ frequency 解码 decoding | set extract audio signal includes The audio descriptor dish (step 9〇3) represents the a-frequency codec (5) (4)--the identifier is included in the audio description 101. '...the audio signal decoding device identifies the audio signal by the audio descriptor 1〇1, Audio • Tiger Bag 3 has a downmix _u 1 () 3 and an empty (4) signal. In particular, the audio signal decoding device uses the inter-work greedy signal 1 〇 5 to distinguish the transmitted audio signal as a signal for generating multiple channels. (Step 905) η and θ 频 5 The tiger decoding device feeds the downmix signal 1 〇 3 into a multi-channel signal by the spatial information signal chamber. As described above, the header 2 〇 1 can be included at each predetermined interval. In the spatial information signal 105. Industrial Application • As described above, the audio signal encoding and decoding method and apparatus of the present invention enable _ to be selectively included in a spatial information signal. The header is included in the spatial information signal, and the vertical, frequency-coding and de-funding method of the present invention and its device can be _code (four) Wei, even if the trans-secondary frequency device is copied from any point. Although the present invention has been disclosed above in the above preferred embodiments, it is not intended to limit the invention to those skilled in the art, and the present invention can be used in the spirit and scope of the present invention. The scope of patent protection shall be subject to the definition of the patent application attached to the specification of the present application. [FIG. 1 is a schematic diagram of the structure of the audio signal of the embodiment of the present invention; FIG. 2 is a schematic diagram of the audio signal of the embodiment of the present invention; FIG. 5 is a block diagram of an audio signal decoding apparatus according to an embodiment of the present invention; and FIG. 5 is a block diagram of an audio signal decoding apparatus according to an embodiment of the present invention; A flowchart of an audio signal decoding method according to an embodiment of the present invention; FIG. 6 is a flowchart of an audio signal decoding method according to another embodiment of the present invention; and FIG. 7 is a flowchart of an audio signal decoding method according to still another embodiment of the present invention. Figure 8 is a flowchart of a method for obtaining position information indicating a quantity according to an embodiment of the present invention; and FIG. 9 is a flowchart of an audio signal decoding method according to still another embodiment of the present invention. [Representation of main component symbols] 101 103 105 201 203 205 207 301 Audio Descriptor Downmix Signal Space Information Signal Header Space Information Configuration Information Time Slot Position Information Receiving Unit 21 1305347 303 Extraction Unit 401 Demultiplexing Unit 403 Core Decoding Unit 405 Multichannel Generation Unit 407 Spatial Information Decoding Unit INI Input Terminal IN2 Input Terminal OUT1 Output Terminal OUT2 Output Terminal Step 501 Receive Spatial Information Signal Step 503 Identify Header Step 505. Extract Configuration Information Step 507 Decode Spatial Information Step 601 Is there a header? Step 603 Is the configuration information taken from the first header? Step 605 Status information? Is the extracted configuration information equivalent to the group extracted from the first header? Step 607 Is an error detected? Step 609 Update Header Step 611 Decode Configuration Mind Step 613 Remove Spatial Information or Correct Spatial Information Error 22 1305347 Step 701 Extract Time Slot Location Information Step 703 Apply Parameter to Time Slot Step 801 Obtain Time Slot with Nth Parameter Applied The location information step 803 sets the information to obtain the number of time slots indicating that the (N+l)th parameter is applied. Step 901: Receive audio signal. Step 903: Extract audio descriptor. Step 905: Determine whether the audio signal generates multi-channel 23 by using the spatial information signal.

Claims (1)

1305347 十、申請專利範圍: 1. 一種音頻訊號解方法,包含有. 接收包含有-音頻描述符之一音頻訊號; 藉由該音頻描述符識別包含—下混訊號和—空間資訊訊 號之該音頻訊號;及 ' ° 齡該㈣資訊職轉換該下混喊為_多頻道訊號; 其中該空間資訊職包含—獅,每_標购 • 财時間或㈣間隔。 ' 2. 如申請專利範圍第1項所述之音頻訊號解碼方法,其中該多頻 道訊號包含一虛擬身歷聲多頻道訊號。 人、 3. 如申請專利範圍第1項所述之音頻訊號解碼方法,其中更包含 找m減號包含該铜時,糊該胸巾包含的组態資 訊變換該下混訊號為該多頻道訊號。 ' 鲁4·如申請專利範圍第1項所述之音頻訊號解碼方法,其中更包含 當該空間資訊訊號不包含該標頭時,利用之前提取之組態資訊 - 變換該下混訊號為該多頻道訊號。 ,5_如中請專利範圍第3項所述之音頻訊號解碼方法,其中更包含 當該音頻訊號包含之兩個該標頭彼此不同時,侧該標頭中產 生之一錯誤。 6.如申請專利範圍第3項或第4項所述之音頻訊號解碼方法,其 中更包含: 利用該組態資訊包含的-時序對位資訊調整該下混訊號 24 1305347 和該空間資訊訊號之起始點;及 利用該空間資訊訊號中包含的一時間槽的一位置資訊調 整應用該空間資訊訊號中包含之一參數的該時間槽的一位置。 7.如申請專利範圍第6項所述之音頻訊號解碼方法,其中代表該 時間槽數量的該位置資訊係可變。 8·如申請專利範圍第6項所述之音頻訊號解碼方法,其中代表應 用有一 N&參數的該時間槽的數量的該位置資訊係透過該時間 槽的數量、該參數的數量、N和應用有一(N-;! ) th參數的一時 間槽的一位置資訊組成的群組中至少其一獲得。 9.如申請專利範圍第8項所述之音頻訊號解碼方法,其中代表應 用有在一第二參數之後的該參數的該時間槽的數量的該位置 資訊透過公式ceil (^(k-i+N-jCN-l)))獲得,其中y、,,广 和j(N-l)分別表不該時間槽數量、該參數數量和應用有該 (N-1)參數的該時間槽的該位置資訊。 10·如申請專利細第8項所述之音頻訊號解财法,其中代表應 用有一第—參數之該時間槽數量的該位«訊透過公式㈣ (log2(k_i+l))獲得,其中,/ T k 1刀別表示該時間槽數量及 該參數數量。 11.如申請專利範圍第9 g _9項所述之音頻訊號解瑪方法,其中應用; =:)時::的該位置資訊透過公式~- 糸不存在於應用有該Nth參數之該時間4 25 1305347 及應用有該(^1产參數之該時間槽之間的時間槽數量。 12.如申請專利範圍第1項所述之音頻訊號解碼方法,其中該音頻 訊號包含一訊號識別資訊及一標頭識別資訊所組成的群組中 至少其一,該訊號識別資訊係代表該空間資訊訊號是否結合於 該下混訊號,該標頭識別資訊係代表該標頭是否包含於該空間 資訊訊號中。1305347 X. Patent application scope: 1. An audio signal solution method, comprising: receiving an audio signal containing an audio descriptor; and identifying, by the audio descriptor, the audio including a downmix signal and a space information signal The signal; and ' ° age (4) information job conversion, the downmix is called _ multi-channel signal; where the space information job includes - lion, every _ purchase time / (four) interval. 2. The audio signal decoding method of claim 1, wherein the multi-channel signal comprises a virtual human voice multi-channel signal. 3. The audio signal decoding method according to claim 1, wherein the method further comprises: when the m minus sign includes the copper, the configuration information included in the breast towel is changed to the multi-channel signal. . [4] The method for decoding an audio signal according to claim 1, wherein the method further comprises: when the spatial information signal does not include the header, using the previously extracted configuration information - converting the downmix signal to the Channel signal. The audio signal decoding method of claim 3, wherein the audio signal includes two of the headers different from each other, and one of the headers generates an error. 6. The audio signal decoding method of claim 3, wherein the method further comprises: adjusting the downmix signal 24 1305347 and the spatial information signal by using the timing alignment information included in the configuration information. a starting point; and using a position information of a time slot included in the spatial information signal to adjust a position of the time slot in which the parameter is included in the spatial information signal. 7. The audio signal decoding method of claim 6, wherein the location information representative of the number of time slots is variable. 8. The audio signal decoding method according to claim 6, wherein the position information indicating the number of time slots of the application having an N& parameter is the number of the time slots, the number of the parameters, the N, and the application. At least one of the group consisting of a positional information of a time slot having a (N-;!) th parameter is obtained. 9. The audio signal decoding method according to claim 8, wherein the position information indicating the number of the time slots of the parameter after the second parameter is applied by the formula ceil (^(k-i+ N-jCN-l))), wherein y,,, and j (Nl) respectively indicate the number of time slots, the number of the parameters, and the location information of the time slot to which the (N-1) parameter is applied. . 10. The audio signal unsolving method as described in claim 8 of the patent application, wherein the bit of the time slot of the first parameter is applied by the formula (4) (log2(k_i+l)), wherein / T k 1 knife indicates the number of time slots and the number of parameters. 11. The audio signal decoding method described in claim 9 g _9 of the patent application, wherein: =:):: The position information of the position: through the formula ~- 糸 does not exist at the time when the application has the Nth parameter 4 25 1305347 and the application of the time slot between the time slots of the (1) production parameter. 12. The audio signal decoding method according to claim 1, wherein the audio signal includes a signal identification information and a At least one of the groups of header identification information, the signal identification information indicates whether the spatial information signal is combined with the downmix signal, and the header identification information indicates whether the header is included in the spatial information signal. . 2626
TW95124070A 2005-06-30 2006-06-30 Method and apparatus for encoding and decoding an audio signal TWI305347B (en)

Applications Claiming Priority (13)

Application Number Priority Date Filing Date Title
US69500705P 2005-06-30 2005-06-30
US71211905P 2005-08-30 2005-08-30
US71920205P 2005-09-22 2005-09-22
US72300705P 2005-10-04 2005-10-04
US72622805P 2005-10-14 2005-10-14
US72922505P 2005-10-24 2005-10-24
US73562805P 2005-11-12 2005-11-12
KR20060004065 2006-01-13
KR20060004055 2006-01-13
KR20060004056 2006-01-13
US78674006P 2006-03-29 2006-03-29
US79232906P 2006-04-17 2006-04-17
KR1020060056480A KR20070003574A (en) 2005-06-30 2006-06-22 Method and apparatus for encoding and decoding an audio signal

Publications (2)

Publication Number Publication Date
TW200707406A TW200707406A (en) 2007-02-16
TWI305347B true TWI305347B (en) 2009-01-11

Family

ID=43352677

Family Applications (4)

Application Number Title Priority Date Filing Date
TW98104741A TWI412021B (en) 2005-06-30 2006-06-30 Method and apparatus for encoding and decoding an audio signal
TW95124070A TWI305347B (en) 2005-06-30 2006-06-30 Method and apparatus for encoding and decoding an audio signal
TW95124069A TWI333641B (en) 2005-06-30 2006-06-30 Method and apparatus for encoding and decoding an audio signal
TW098104744A TWI459373B (en) 2005-06-30 2006-06-30 Method and apparatus for encoding and decoding an audio signal

Family Applications Before (1)

Application Number Title Priority Date Filing Date
TW98104741A TWI412021B (en) 2005-06-30 2006-06-30 Method and apparatus for encoding and decoding an audio signal

Family Applications After (2)

Application Number Title Priority Date Filing Date
TW95124069A TWI333641B (en) 2005-06-30 2006-06-30 Method and apparatus for encoding and decoding an audio signal
TW098104744A TWI459373B (en) 2005-06-30 2006-06-30 Method and apparatus for encoding and decoding an audio signal

Country Status (2)

Country Link
BR (1) BRPI0613076A2 (en)
TW (4) TWI412021B (en)

Family Cites Families (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4621862A (en) * 1984-10-22 1986-11-11 The Coca-Cola Company Closing means for trucks
US5893066A (en) * 1996-10-15 1999-04-06 Samsung Electronics Co. Ltd. Fast requantization apparatus and method for MPEG audio decoding
TW405328B (en) * 1997-04-11 2000-09-11 Matsushita Electric Ind Co Ltd Audio decoding apparatus, signal processing device, sound image localization device, sound image control method, audio signal processing device, and audio signal high-rate reproduction method used for audio visual equipment
JPH11330980A (en) * 1998-05-13 1999-11-30 Matsushita Electric Ind Co Ltd Decoding device and method and recording medium recording decoding procedure
US6631352B1 (en) * 1999-01-08 2003-10-07 Matushita Electric Industrial Co. Ltd. Decoding circuit and reproduction apparatus which mutes audio after header parameter changes
TW569550B (en) * 2001-12-28 2004-01-01 Univ Nat Central Method of inverse-modified discrete cosine transform and overlap-add for MPEG layer 3 voice signal decoding and apparatus thereof
US20040199276A1 (en) * 2003-04-03 2004-10-07 Wai-Leong Poon Method and apparatus for audio synchronization

Also Published As

Publication number Publication date
TW200707406A (en) 2007-02-16
BRPI0613076A2 (en) 2010-12-21
TW200947418A (en) 2009-11-16
TWI412021B (en) 2013-10-11
TWI459373B (en) 2014-11-01
TWI333641B (en) 2010-11-21
TW200707401A (en) 2007-02-16
TW200939207A (en) 2009-09-16

Similar Documents

Publication Publication Date Title
JP5006315B2 (en) Audio signal encoding and decoding method and apparatus
CA2603027C (en) Device and method for generating a data stream and for generating a multi-channel representation
CN101243490A (en) Method and apparatus for encoding and decoding an audio signal
JP5394753B2 (en) Audio signal processing method and apparatus
TWI451401B (en) Method for encoding and decoding multi-channel audio signal and apparatus thereof
CZ20003235A3 (en) Process and apparatus for encoding digital information signal, decoding apparatus and record carrier
KR100880642B1 (en) Method and apparatus for decoding an audio signal
US20080288263A1 (en) Method and Apparatus for Encoding/Decoding
TWI305347B (en) Method and apparatus for encoding and decoding an audio signal
US8615021B2 (en) Method and apparatus to multiplex and demultiplex asynchronous bitstreams
KR20070061280A (en) Method and apparatus for decoding an audio signal
RU2383942C2 (en) Method and device for audio signal decoding
JPH05236009A (en) Multi-medium multiplexing system
KR20070003574A (en) Method and apparatus for encoding and decoding an audio signal
ES2389130T3 (en) Method and apparatus for encoding and decoding an audio signal
KR20080010980A (en) Method and apparatus for encoding/decoding

Legal Events

Date Code Title Description
MM4A Annulment or lapse of patent due to non-payment of fees