JP2005072708A

JP2005072708A - Apparatus and method for frame format conversion

Info

Publication number: JP2005072708A
Application number: JP2003209383A
Authority: JP
Inventors: Hiroshi Ishimaru; 浩石丸; Harumi Aoyama; 春巳青山; Atsuhito Miyata; 篤人宮田
Original assignee: NTT Docomo Inc
Current assignee: NTT Docomo Inc
Priority date: 2003-08-28
Filing date: 2003-08-28
Publication date: 2005-03-17
Anticipated expiration: 2023-08-28
Also published as: JP4209733B2

Abstract

PROBLEM TO BE SOLVED: To provide a frame format conversion apparatus and a frame format conversion method that convert a frame containing speech coded bits, encoded by a speech coding scheme such as AMR (Adaptive Multi-Rate) in which the coding bit rate can be switched frame by frame, into another frame format of the same speech coding scheme.

SOLUTION: The frame format conversion apparatus 100 comprises: a frame extraction section 102 that extracts continuously input frames one frame at a time; an FT detection/storage section 103 that detects the frame type bits in each extracted frame; an unnecessary bit deletion section 104 that deletes the frame type bits and the additional information bits from the extracted frame; a bit shift section 105 and a bit inversion section 106 that move the speech coded bits to predetermined positions in the extracted frame; and an FT placement section 107 that places the frame type bits at a predetermined position in the extracted frame.

COPYRIGHT: (C)2005,JPO&NCIPI

Description

【０００１】
【発明の属する技術分野】
本発明は、符号化ビットレートをフレーム単位で切り替えることができる所定の音声圧縮符号化方式、例えば、ＡｄａｐｔｉｖｅＭｕｌｔｉ−Ｒａｔｅ（ＡＭＲ）に基づいて符号化された音声符号化ビットが含まれるフレームの構成を変換するフレーム構成変換装置及びフレーム構成変換方法に関する。
【０００２】
【従来の技術】
従来、第３世代移動通信システムにおいて利用される標準の音声符号化方式として、フレーム単位で符号化ビットレートを切り替えることができるＡｄａｐｔｉｖｅＭｕｌｔｉ−Ｒａｔｅ（ＡＭＲ）が、標準化団体である３ＧＰＰ（３ｒｄＧｅｎｅｒａｔｉｏｎＰａｒｔｎｅｒｓｈｉｐＰｒｏｊｅｃｔ）において策定されている（例えば、非特許文献１参照）。
【０００３】
ＡＭＲは、第３世代移動通信システムにおいて、映像配信サービスによって映像とともに配信される音声コンテンツを符号化する場合や、ＴＶ電話サービスにおいて伝送される音声を符号化する場合に用いられている。
【０００４】
また、ＡＭＲは、インターネット上などにおいて、ＲＴＰ（Ｒｅａｌ−ｔｉｍｅＴｒａｎｓｐｏｒｔＰｒｏｔｏｃｏｌ）を使用してストリーミングデータとして配信される音声コンテンツを符号化する場合の音声符号化方式としても普及してきている。
【０００５】
【非特許文献１】
“Ｍａｎｄａｔｏｒｙｓｐｅｅｃｈｃｏｄｅｃｓｐｅｅｃｈｐｒｏｃｅｓｓｉｎｇｆｕｎｃｔｉｏｎｓ；ＡｄａｐｔｉｖｅＭｕｌｔｉ−Ｒａｔｅ（ＡＭＲ）ｓｐｅｅｃｈｃｏｄｅｃｆｒａｍｅｓｔｒｕｃｔｕｒｅ − ＴＳ２６．１０１ｒｅｌｅａｓｅ５”、Ｔｈｅ３ｒｄＧｅｎｅｒａｔｉｏｎＰａｒｔｎｅｒｓｈｉｐＰｒｏｊｅｃｔ，２００２年６月
【０００６】
【発明が解決しようとする課題】
しかしながら、ＡＭＲには、その用途、すなわち使用される通信プロトコルに応じて、複数のフレーム構成（フレームフォーマット）が存在し、かつ、それぞれのフレームフォーマットは、他のフレームフォーマットと互換性がないという問題があった。
【０００７】
例えば、ＲＴＰを使用してＡＭＲによって符号化されたＡＭＲデータを配信する場合のフレーム構成（ＲＴＰＰａｙｌｏａｄＦｏｒｍａｔ）は、第３世代移動通信システムにおけるＴＶ電話通信プロトコル（３Ｇ−３２４Ｍ）を使用してＡＭＲデータを配信する場合のフレーム構成（ＩｎｔｅｒｆａｃｅＦｏｒｍａｔ２）とは互換性がないため、ＲＴＰＰａｙｌｏａｄＦｏｒｍａｔを有するＡＭＲデータを、第３世代移動通信システムにおいて使用される移動電話端末などに直接配信することができないという問題があった。
【０００８】
そこで、本発明は、上述した問題点を解決すべくなされたものであり、ＡＭＲなど、符号化ビットレートをフレーム単位で切り替えることができる音声符号化方式によって符号化された音声符号化ビットが含まれるフレームを、当該音声符号化方式の他のフレームに変換することができるフレーム構成変換装置及びフレーム構成変換方法を提供することをその目的とする。
【０００９】
【課題を解決するための手段】
上述した課題を解決するため、本発明は、次のような特徴を有している。まず、本発明の第１の特徴は、符号化ビットレートをフレーム単位で切り替えることができる所定の音声圧縮符号化方式、例えば、ＡｄａｐｔｉｖｅＭｕｌｔｉ−Ｒａｔｅ（ＡＭＲ）に基づいて符号化された音声符号化ビットと、フレームの種別を示すフレーム種別ビット（ＦｒａｍｅＴｙｐｅ）と、付加情報ビット（ＣｈａｎｇｅｍｏｄｅＲｅｑｕｅｓｔビット、フレーム品質インジケータなど）とが、所定の順序で配列された第１のフレーム（例えば、ＲＴＰＰａｙｌｏａｄＦｏｒｍａｔ）を、第２のフレーム（例えば、ＩｎｔｅｒｆａｃｅＦｏｒｍａｔ２）に変換するフレーム構成変換装置であって、連続して入力された前記第１のフレームをフレーム単位で抜き出すフレーム抜出部（フレーム抜出部１０２）と、抜き出された抜出フレームの中から、前記フレーム種別ビットを検出するフレーム種別検出部（ＦＴ検出・格納部１０３）と、前記第２のフレームの構成に基づいて、前記抜出フレームに含まれている前記音声符号化ビットを前記抜出フレーム上の所定の位置に変更する処理部（不要ビット削除部１０４、ビットシフト部１０５及びビット反転部１０６）と、前記第２のフレームの構成に基づいて、前記フレーム種別検出部によって検出された前記フレーム種別ビットを前記抜出フレーム上の所定の位置に配置するフレーム種別ビット配置部（ＦＴ配置部１０７）とを備えることを要旨とする。
【００１０】
かかる特徴によれば、処理部が、第２のフレーム構成に基づいて、音声符号化ビットをフレーム抜出部によって抜き出された抜出フレームの所定の位置に変更し、フレーム種別ビット配置部が、フレーム種別ビットを抜出フレーム上の所定の位置に配置するため、符号化ビットレートをフレーム単位で切り替えることができる所定の音声圧縮符号化方式に基づいて符号化された音声符号化ビットを含むフレームの構成を変換することができる。
【００１１】
本発明の第２の特徴は、本発明の第１の特徴において、前記処理部が、前記抜出フレームに含まれている前記フレーム種別ビットと、前記付加情報ビットとを削除し、前記抜出フレーム上の前記音声符号化ビットの位置をシフトさせることにより、前記音声符号化ビットを前記所定の位置に変更することを要旨とする。
【００１２】
本発明の第３の特徴は、本発明の第２の特徴において、前記処理部が、前記抜出フレーム上に順次配置されている所定ビット数の前記音声符号化ビットの順序を反転させることにより、前記音声符号化ビットを前記所定の位置に変更することを要旨とする。
【００１３】
かかる特徴によれば、ビットシフト部が、音声符号化ビットの抜出フレーム上の位置をシフトさせ、ビット反転部が、抜出フレーム上に順次配置されている所定ビット数の音声符号化ビットの順序を反転させるため、より少ない処理ステップ数でフレーム構成を変換することができる。
【００１４】
すなわち、かかる特徴によれば、第１のフレーム構成から第２のフレーム構成に変換するために必要な処理の内容を、予めビット削除部、ビットシフト部及びビット反転部に登録しておくことにより、少ない処理ステップ数によるフレーム構成の変換が図れ、フレーム構成の変換に係る処理速度を向上させることができる。
【００１５】
本発明の第４の特徴は、本発明の第１乃至第３の特徴において、前記所定の音声圧縮符号化方式として、ＡｄａｐｔｉｖｅＭｕｌｔｉ−Ｒａｔｅにより符号化された前記音声符号化ビットが含まれるフレームの構成を変換することを要旨とする。
【００１６】
かかる特徴によれば、ＡＭＲによって符号化されたＡＭＲデータを、例えば、ＲＴＰＰａｙｌｏａｄＦｏｒｍａｔから、ＩｎｔｅｒｆａｃｅＦｏｒｍａｔ２に変換することができる。この結果、本発明に係るフレーム構成変換装置を用いることにより、例えば、インターネット上において公開されているＲＴＰＰａｙｌｏａｄＦｏｒｍａｔを有するＡＭＲデータを、第３世代移動通信システムにおいて使用される移動電話端末などに配信することが可能となる。
【００１７】
本発明の第５の特徴は、符号化ビットレートをフレーム単位で切り替えることができる所定の音声圧縮符号化方式に基づいて符号化された音声符号化ビットと、フレームの種別を示すフレーム種別ビットと、付加情報ビットとが、所定の順序で配列された第１のフレームを、第２のフレームに変換するフレーム構成変換方法であって、連続して入力された前記第１のフレームをフレーム単位で抜き出すステップと、抜き出された抜出フレームの中から、前記フレーム種別ビットを検出するステップと、前記第２のフレームの構成とに基づいて、前記抜出フレームに含まれている前記音声符号化ビットを前記抜出フレーム上の所定の位置に変更するステップと、前記第２のフレームの構成に基づいて、前記フレーム種別ビットを検出するステップによって検出された前記フレーム種別ビットを前記抜出フレーム上の所定の位置に配置するステップとを備えることを要旨とする。
【００１８】
【発明の実施の形態】
（フレーム構成変換装置の構成）
本発明の実施形態について図１乃至図３を参照しながら説明する。図１は、本実施形態に係るフレーム構成変換装置の論理ブロック構成を示している。
【００１９】
同図に示すように、フレーム構成変換装置１００は、データ入力部１０１と、フレーム抜出部１０２と、ＦＴ検出・格納部１０３と、不要ビット削除部１０４と、ビットシフト部１０５と、ビット反転部１０６と、ＦＴ配置部１０７と、データ出力部１０８とを備えている。
【００２０】
データ入力部１０１は、外部から入力されたＡＭＲデータをフレーム抜出部１０２に送出するものであり、本実施形態では、ＡＭＲ（ＡｄａｐｔｉｖｅＭｕｌｔｉ−Ｒａｔｅ）によって符号化され、ＩＥＴＦＲＦＣ３２６７において規定されるＲＴＰＰａｙｌｏａｄＦｏｒｍａｔ（第１のフレーム）の構成を有するＡＭＲデータが、データ入力部１０１に入力される。
【００２１】
なお、データ入力部１０１が具備する入力インターフェースとしては、通信ネットワークを介してＡＭＲデータを取得する１００ＢＡＳＥ−ＴＸなどのＬＡＮカードや、ＣＤ−ＲＯＭ及びＤＶＤ−ＲＯＭなどの外部記憶媒体に記憶されたＡＭＲデータを読み込む外部記憶媒体アクセス装置などを用いることができる。
【００２２】
フレーム抜出部１０２は、データ入力部１０１に連続して入力されたＲＴＰＰａｙｌｏａｄＦｏｒｍａｔを有するＡＭＲデータ・フレームをフレーム単位で抜き出すものである。
【００２３】
具体的には、フレーム抜出部１０２は、データ入力部１０１から送出されたＲＴＰＰａｙｌｏａｄＦｏｒｍａｔを有するＡＭＲデータを、ＣＭＲ（ＣｈａｎｇｅＭｏｄｅＲｅｑｕｅｓｔ）ビットを最上位ビット（ＭＳＢ：ＭｏｓｔＳｉｇｎｉｆｉｃａｎｔＢｉｔ）として、フレーム単位で抜き出し、抜き出したＡＭＲデータ・フレームをＦＴ検出・格納部１０３に送出する。
【００２４】
ここで、図２は、フレーム抜出部１０２によって抜き出されたＲＴＰＰａｙｌｏａｄＦｏｒｍａｔを有するＡＭＲデータのフレーム構成を示している。
【００２５】
同図に示すように、ＲＴＰＰａｙｌｏａｄＦｏｒｍａｔを有するＡＭＲデータは、ＡＭＲによって符号化された音声符号化ビット（Ｄ）と、フレーム種別を示すフレーム種別ビット、つまり、符号化ビットレートなどを示す情報（ＦＴ：フレームタイプ）と、符号化ビットレートなどを変更する際に用いられるＣＭＲビット（ＣＭＲ）や、後に続くフレームの有無を示す確認ビット（Ｆ）などの付加情報ビットとから構成されている。
【００２６】
ＦＴ検出・格納部１０３は、フレーム抜出部１０２によって抜き出されたＡＭＲデータ・フレームの中から、フレーム種別ビット（ＦＴ）を検出するものであり、本実施形態では、フレーム種別検出部を構成する。
【００２７】
具体的には、ＦＴ検出・格納部１０３は、図２に示したＯｃｔｅｔ１の第２〜５ビットに位置するフレーム種別ビット（ＦＴ）を検出するとともに、検出したフレーム種別ビット（ＦＴ）の内容を格納し、その内容をＦＴ配置部１０７に転送する。
【００２８】
また、ＦＴ検出・格納部１０３は、フレーム種別ビット（ＦＴ）を検出後、フレーム抜出部１０２から送出されたＡＭＲデータ・フレームを不要ビット削除部１０４に送出する。
【００２９】
不要ビット削除部１０４は、出力すべきＩｎｔｅｒｆａｃｅＦｏｒｍａｔ２（第２のフレーム）の構成に基づいて、ＦＴ検出・格納部１０３から送出されたＡＭＲデータ・フレームの中から、フレーム種別ビット（ＦＴ）と、ＣＭＲビットや確認ビット（Ｆ）などの付加情報ビットとを削除するものである。
【００３０】
具体的には、不要ビット削除部１０４は、図２に示したＯｃｔｅｔ０及びＯｃｔｅｔ１に位置する情報を削除する。
【００３１】
また、不要ビット削除部１０４は、フレーム種別ビット（ＦＴ）と、ＣＭＲビットや確認ビット（Ｆ）などの付加情報ビットとを削除したＡＭＲデータ・フレームをビットシフト部１０５に送出する。
【００３２】
ビットシフト部１０５は、ＲＴＰＰａｙｌｏａｄＦｏｒｍａｔの構成と、ＩｎｔｅｒｆａｃｅＦｏｒｍａｔ２の構成とに基づいて、不要ビット削除部１０４から送出されたＡＭＲデータ・フレーム上の音声符号化ビット（Ｄ）の位置をシフトさせるものである。
【００３３】
具体的には、ビットシフト部１０５は、音声符号化ビット（Ｄ）をＬＳＢ側に４ビットシフトさせる。さらに、ビットシフト部１０５は、音声符号化ビット（Ｄ）が位置する最終オクテットに含まれているパディングビット（Ｐ）を削除する。
【００３４】
また、ビットシフト部１０５は、音声符号化ビット（Ｄ）をシフトさせ、パディングビット（Ｐ）を削除したＡＭＲデータ・フレームをビット反転部１０６に送出する。
【００３５】
ビット反転部１０６は、ＲＴＰＰａｙｌｏａｄＦｏｒｍａｔの構成と、ＩｎｔｅｒｆａｃｅＦｏｒｍａｔ２の構成とに基づいて、ビットシフト部１０５から送出されたＡＭＲデータ・フレーム上に順次配置されている所定ビット数の音声符号化ビット（Ｄ）の順序を反転させるものである。
【００３６】
具体的には、ビット反転部１０６は、ビットシフト部１０５から送出されたＡＭＲデータ・フレーム上において、オクテット毎、つまり８ビット単位で音声符号化ビット（Ｄ）の順序を反転させる。
【００３７】
例えば、ＡＭＲデータ・フレーム上のあるオクテットに、Ｄ（２３６）−Ｄ（２３７）−Ｄ（２３８）−Ｄ（２３９）−Ｄ（２４０）−Ｄ（２４１）−Ｄ（２４２）−Ｄ（２４３）と、音声符号化ビットがＭＳＢ側からＬＳＢ（ＬｅａｓｔＳｉｇｎｉｆｉｃａｎｔＢｉｔ）側へ順次配置されていた場合、音声符号化ビットの当該オクテット上の配列を、Ｄ（２４３）−Ｄ（２４２）−Ｄ（２４１）−Ｄ（２４０）−Ｄ（２３９）−Ｄ（２３８）−Ｄ（２３７）−Ｄ（２３６）に反転させる。すなわち、ビット反転部１０６は、音声符号化ビット（Ｄ）をオクテット（８ビット）単位で、ＭＳＢｆｉｒｓｔからＬＳＢｆｉｒｓｔに変更する。
【００３８】
また、ビット反転部１０６は、音声符号化ビット（Ｄ）の順序を反転させたＡＭＲデータ・フレームをＦＴ配置部１０７に送出する。
【００３９】
なお、本実施形態では、不要ビット削除部１０４と、ビットシフト部１０５と、ビット反転部１０６とによって、処理部を構成する。また、不要ビット削除部１０４と、ビットシフト部１０５と、ビット反転部１０６による、より具体的なフレーム構成の変換方法については、後述する。
【００４０】
ＦＴ配置部１０７は、ＩｎｔｅｒｆａｃｅＦｏｒｍａｔ２の構成に基づいて、ＦＴ検出・格納部１０３によって検出されたフレーム種別ビット（ＦＴ）を、ビット反転部１０６から送出されたＡＭＲデータ・フレーム上の所定位置に配置するものであり、本実施形態では、フレーム種別ビット配置部を構成する。
【００４１】
具体的には、ＦＴ配置部１０７は、ＦＴ検出・格納部１０３から転送されたフレーム種別ビット（ＦＴ）を、ビット反転部１０６から送出されたＡＭＲデータ・フレームのＯｃｔｅｔ０の第５ビットから、ＬＳＢ側へ順次配置、つまりＭＳＢｆｉｒｓｔで配置する。
【００４２】
また、ＦＴ配置部１０７は、フレーム種別ビット（ＦＴ）を配置したＡＭＲデータ・フレームをデータ出力部１０８に送出する。
【００４３】
データ出力部１０８は、不要ビット削除部１０４と、ビットシフト部１０５と、ビット反転部１０６と、ＦＴ配置部１０７とによって処理されたＡＭＲデータを、ＩｎｔｅｒｆａｃｅＦｏｒｍａｔ２を有するＡＭＲデータとして出力するものであり、本実施形態では、出力部を構成する。
【００４４】
具体的には、データ出力部１０８は、ＦＴ配置部１０７から送出されたＡＭＲデータをＩｎｔｅｒｆａｃｅＦｏｒｍａｔ２を有するＡＭＲデータとして、外部に出力する。ここで、図３は、データ出力部１０８から出力されるＩｎｔｅｒｆａｃｅＦｏｒｍａｔ２を有するＡＭＲデータ・フレームの構成を示している。
【００４５】
同図に示すように、ＩｎｔｅｒｆａｃｅＦｏｒｍａｔ２を有するＡＭＲデータは、ＡＭＲによって符号化された音声符号化ビット（Ｄ）と、フレーム種別を示すフレーム種別ビット（ＦＴ）とから構成されている。
【００４６】
さらに、図２に示したＲＴＰＰａｙｌｏａｄＦｏｒｍａｔを有するＡＭＲデータと比較すると、音声符号化ビット（Ｄ）は、オクテット毎に、ＬＳＢからＭＳＢ側へ順次配置、つまりＬＳＢｆｉｒｓｔで配置されている。
【００４７】
また、データ出力部１０８が具備する出力インターフェースとしては、通信ネットワークを介してＡＭＲデータを出力する１００ＢＡＳＥ−ＴＸなどのＬＡＮカードなどを用いることができる。なお、かかる場合、データ入力部１０１と、データ出力部１０８とは、同一のＬＡＮカードによって構成することも勿論可能である。
【００４８】
（フレーム構成変換方法）
次に、上述した本実施形態に係るフレーム構成変換装置を用いたＡＭＲデータのフレーム構成の変換方法について説明する。
【００４９】
図４は、ＡＭＲデータ・フレームをＲＴＰＰａｙｌｏａｄＦｏｒｍａｔからＩｎｔｅｒｆａｃｅＦｏｒｍａｔ２に変換する場合におけるフレーム構成変換装置１００の処理フローを示している。
【００５０】
同図に示すように、ステップＳ１０において、フレーム構成変換装置１００は、入力されたＲＴＰＰａｙｌｏａｄＦｏｒｍａｔを有するＡＭＲデータをフレーム単位で抜き出す。ステップＳ１０において抜き出されたＡＭＲデータ・フレームは、上述したように、図２に示したフレーム構成を有している。
【００５１】
ステップＳ２０において、フレーム構成変換装置１００は、ステップＳ１０において抜き出したＡＭＲデータ・フレームの中から、フレーム種別ビット（ＦＴ）を検出するとともに、検出したフレーム種別ビット（ＦＴ）の内容を格納する。
【００５２】
ここで、図５は、ステップＳ２０において検出されるフレーム種別ビット（ＦＴ）のＡＭＲデータ・フレーム上の位置を示している。同図に示すように、ＲＴＰＰａｙｌｏａｄＦｏｒｍａｔでは、フレーム種別ビット（ＦＴ）は、Ｏｃｔｅｔ１の第２〜５ビットに配置されている。
【００５３】
ステップＳ３０において、フレーム構成変換装置１００は、ステップＳ１０において抜き出されたＡＭＲデータ・フレームの中から、フレーム種別ビット（ＦＴ）と、ＣＭＲビットや確認ビット（Ｆ）などの付加情報ビットとを削除する。
【００５４】
ここで、図６は、ステップＳ３０において削除される、フレーム種別ビット（ＦＴ）と、ＣＭＲビットや確認ビット（Ｆ）などの付加情報ビットとのＡＭＲデータ・フレーム上の位置を示している。同図に示すように、フレーム構成変換装置１００は、ステップＳ１０において抜き出されたＡＭＲデータ・フレームの中から、Ｏｃｔｅｔ０及びＯｃｔｅｔ１に位置するフレーム種別ビット（ＦＴ）と、ＣＭＲビットや確認ビット（Ｆ）などの付加情報ビットを削除する。
【００５５】
ステップＳ４０において、フレーム構成変換装置１００は、ＡＭＲデータ・フレームの中に含まれている音声符号化ビット（Ｄ）をＬＳＢ側に４ビットシフトさせる。
【００５６】
ステップＳ５０において、フレーム構成変換装置１００は、音声符号化ビット（Ｄ）が位置する最終オクテットに含まれているパディングビット（Ｐ）を削除する。
【００５７】
ここで、図７（ａ）は、ステップＳ４０における音声符号化ビット（Ｄ）のビットシフト処理が実行される前のＡＭＲデータ・フレームの構成を示している。また、図７（ｂ）は、ステップＳ４０及びＳ５０における処理、すなわち音声符号化ビット（Ｄ）をＬＳＢ側に４ビットシフトさせ、パディングビット（Ｐ）を削除した後のＡＭＲデータ・フレームの構成を示している。同図（ｂ）に示すように、Ｏｃｔｅｔ０のＭＳＢから４ビットは、ビットシフト処理の結果、何も情報が配置されていない状態となっている。
【００５８】
ステップＳ６０において、フレーム構成変換装置１００は、ステップＳ４０及びＳ５０の処理が実行されたＡＭＲデータ・フレーム上において、オクテット（８ビット）単位で音声符号化ビット（Ｄ）の順序を反転させる、すなわち、音声符号化ビット（Ｄ）をオクテット単位で、ＭＳＢｆｉｒｓｔからＬＳＢｆｉｒｓｔに変更する。
【００５９】
ここで、図８（ａ）は、音声符号化ビット（Ｄ）の順序を反転させる前のＡＭＲデータ・フレームの構成を示している。また、図８（ｂ）は、音声符号化ビット（Ｄ）の順序を反転させた後のＡＭＲデータ・フレームの構成を示している。
【００６０】
例えば、反転前にＯｃｔｅｔ３０のＭＳＢに位置するＤ（２３６）は、ステップＳ６０の処理により反転させられることにより、Ｏｃｔｅｔ３０のＬＳＢに配置、つまり反転前のＤ（２４２）の位置に配置される。以下、同図（ａ）の矢印で示すように、Ｄ（２３７）〜Ｄ（２４２）の位置が反転させられるとともに、他の音声符号化ビット（Ｄ）についても同様にオクテット単位で、順序が反転させられる。
【００６１】
ステップＳ７０において、フレーム構成変換装置１００は、ステップＳ２０において検出したフレーム種別ビット（ＦＴ）をＡＭＲデータ・フレーム上の所定の位置に配置する。
【００６２】
ここで、図９は、ステップＳ７０において、フレーム種別ビット（ＦＴ）が配置される位置を示している。同図に示すように、フレーム構成変換装置１００は、フレーム種別ビット（ＦＴ）を、ＡＭＲデータ・フレームのＯｃｔｅｔ０の第５ビットからＬＳＢ方向へ順次配置、つまりＭＳＢｆｉｒｓｔで配置する。
【００６３】
ステップＳ８０において、フレーム構成変換装置１００は、ステップＳ７０においてフレーム種別ビット（ＦＴ）が配置されたＡＭＲデータを、ＩｎｔｅｒｆａｃｅＦｏｒｍａｔ２を有するＡＭＲデータとして出力する。
【００６４】
（作用・効果）
本実施形態によれば、不要ビット削除部１０４と、ビットシフト部１０５と、ビット反転部１０６とが、ＩｎｔｅｒｆａｃｅＦｏｒｍａｔ２の構成に基づいて、音声符号化ビット（Ｄ）をフレーム抜出部１０２によって抜き出されたＡＭＲデータ・フレームの所定の位置に変更し、ＦＴ配置部１０７が、フレーム種別ビット（ＦＴ）を抜き出されたＡＭＲデータ・フレーム上の所定の位置に配置するため、ＡＭＲなど、符号化ビットレートをフレーム単位で切り替えることができる所定の音声圧縮符号化方式に基づいて符号化された音声符号化ビットを含むフレームの構成を変換することができる。
【００６５】
本実施形態によれば、ビットシフト部１０５が、音声符号化ビット（Ｄ）のＡＭＲデータ・フレーム上の位置をシフトさせ、ビット反転部１０６が、当該フレーム上に順次配置されている所定ビット数の音声符号化ビット（Ｄ）の順序を反転させるため、より少ない処理ステップ数でＲＴＰＰａｙｌｏａｄＦｏｒｍａｔからＩｎｔｅｒｆａｃｅＦｏｒｍａｔ２に変換することができる。
【００６６】
すなわち、本実施形態によれば、ＲＴＰＰａｙｌｏａｄＦｏｒｍａｔからＩｎｔｅｒｆａｃｅＦｏｒｍａｔ２に変換するために必要な処理の内容を、予め不要ビット削除部１０４、ビットシフト部１０５及びビット反転部１０６に登録しておくことにより、少ない処理ステップ数によるフレーム構成の変換が図れ、フレーム構成の変換に係る処理速度を向上させることができる。
【００６７】
さらに、本実施形態によれば、フレーム構成変換装置１００を用いることにより、例えば、インターネット上において公開されているＲＴＰＰａｙｌｏａｄＦｏｒｍａｔを有するＡＭＲデータを、第３世代移動通信システムにおいて使用される移動電話端末などに配信することが可能となる。
【００６８】
（変更例）
上述した本発明の実施形態においては、ＲＴＰＰａｙｌｏａｄＦｏｒｍａｔを有するＡＭＲデータをＩｎｔｅｒｆａｃｅＦｏｒｍａｔ２を有するＡＭＲデータに変換する形態を例として説明したが、フレーム構成変換装置１００に入力されるフレームの構成から出力すべきフレームの構成に変換するための処理の内容を、不要ビット削除部１０４、ビットシフト部１０５、ビット反転部１０６などに予め登録することにより、本発明は、他のフレーム構成にも適用することができる。
【００６９】
【発明の効果】
以上説明したように本発明によれば、ＡＭＲ（ＡｄａｐｔｉｖｅＭｕｌｔｉ−Ｒａｔｅ）など、符号化ビットレートをフレーム単位で切り替えることができる音声符号化方式によって符号化された音声符号化ビットが含まれるフレームを、当該音声符号化方式の他のフレームに変換することができるフレーム構成変換装置及びフレーム構成変換方法を提供することができる。
【図面の簡単な説明】
【図１】本発明の実施形態に係るフレーム構成変換装置の論理ブロック構成を示す図である。
【図２】本発明の実施形態に係るフレーム構成変換装置に入力されるデータのフレーム構成を示す図である。
【図３】本発明の実施形態に係るフレーム構成変換装置から出力されるデータのフレーム構成を示す図である。
【図４】本発明の実施形態に係るフレーム構成変換装置によるフレーム構成の変換方法を示す図である。
【図５】本発明の実施形態に係るフレーム構成変換装置によるフレーム構成の変換途中におけるフレーム上のデータの配置状態を示す図である。
【図６】本発明の実施形態に係るフレーム構成変換装置によるフレーム構成の変換途中におけるフレーム上のデータの配置状態を示す図である。
【図７】本発明の実施形態に係るフレーム構成変換装置によるフレーム構成の変換途中におけるフレーム上のデータの配置状態を示す図である。
【図８】本発明の実施形態に係るフレーム構成変換装置によるフレーム構成の変換途中におけるフレーム上のデータの配置状態を示す図である。
【図９】本発明の実施形態に係るフレーム構成変換装置によるフレーム構成の変換途中におけるフレーム上のデータの配置状態を示す図である。
【符号の説明】
１００…フレーム構成変換装置、１０１…データ入力部、１０２…フレーム抜出部、１０３…ＦＴ検出・格納部、１０４…不要ビット削除部、１０５…ビットシフト部、１０６…ビット反転部、１０７…ＦＴ配置部、１０８…データ出力部
[0001]
BACKGROUND OF THE INVENTION
The present invention relates to a frame configuration conversion apparatus and a frame configuration conversion method that convert the configuration of a frame containing speech coded bits encoded with a predetermined speech compression coding scheme in which the coding bit rate can be switched frame by frame, for example, Adaptive Multi-Rate (AMR).
[0002]
[Prior art]
Adaptive Multi-Rate (AMR), in which the coding bit rate can be switched frame by frame, has been specified by the standards body 3GPP (3rd Generation Partnership Project) as the standard speech coding scheme used in the third-generation mobile communication system (see, for example, Non-Patent Document 1).
[0003]
In the third-generation mobile communication system, AMR is used to encode audio content distributed together with video by a video distribution service and to encode the audio transmitted in a videophone service.
[0004]
AMR has also become widespread as an audio encoding method for encoding audio content distributed as streaming data using RTP (Real-time Transport Protocol) on the Internet and the like.
[0005]
[Non-Patent Document 1]
“Mandatory speech codec speech processing functions; Adaptive Multi-Rate (AMR) speech codec frame structure - TS 26.101 release 5”, The 3rd Generation Partnership Project, June 2002
[0006]
[Problems to be solved by the invention]
However, AMR has a plurality of frame configurations (frame formats) depending on its use, that is, on the communication protocol used, and each frame format is incompatible with the others.
[0007]
For example, the frame configuration used when distributing AMR-encoded data (AMR data) over RTP (RTP Payload Format) is incompatible with the frame configuration used when distributing AMR data over the videophone communication protocol of the third-generation mobile communication system, 3G-324M (Interface Format 2). As a result, AMR data in RTP Payload Format cannot be delivered directly to mobile phone terminals and similar devices used in the third-generation mobile communication system.
[0008]
The present invention has been made to solve the above-described problems. Its object is to provide a frame configuration conversion apparatus and a frame configuration conversion method that can convert a frame containing speech coded bits, encoded by a speech coding scheme such as AMR in which the coding bit rate can be switched frame by frame, into another frame of the same speech coding scheme.
[0009]
[Means for Solving the Problems]
In order to solve the above-described problems, the present invention has the following features. A first feature of the present invention is a frame configuration conversion apparatus that converts a first frame (for example, RTP Payload Format) into a second frame (for example, Interface Format 2), the first frame containing, arranged in a predetermined order, speech coded bits encoded with a predetermined speech compression coding scheme in which the coding bit rate can be switched frame by frame (for example, Adaptive Multi-Rate (AMR)), frame type bits (Frame Type) indicating the type of the frame, and additional information bits (Change Mode Request bits, a frame quality indicator, and the like). The apparatus comprises: a frame extraction unit (frame extraction unit 102) that extracts the continuously input first frames one frame at a time; a frame type detection unit (FT detection/storage unit 103) that detects the frame type bits in the extracted frame; a processing unit (unnecessary bit deletion unit 104, bit shift unit 105, and bit inversion unit 106) that, based on the configuration of the second frame, moves the speech coded bits contained in the extracted frame to predetermined positions in the extracted frame; and a frame type bit placement unit (FT placement unit 107) that, based on the configuration of the second frame, places the frame type bits detected by the frame type detection unit at a predetermined position in the extracted frame.
[0010]
According to this feature, the processing unit moves the speech coded bits to predetermined positions in the frame extracted by the frame extraction unit, based on the configuration of the second frame, and the frame type bit placement unit places the frame type bits at a predetermined position in the extracted frame. The configuration of a frame containing speech coded bits encoded with a predetermined speech compression coding scheme in which the coding bit rate can be switched frame by frame can therefore be converted.
[0011]
A second feature of the present invention is that, in the first feature, the processing unit moves the speech coded bits to the predetermined positions by deleting the frame type bits and the additional information bits contained in the extracted frame and shifting the position of the speech coded bits in the extracted frame.
[0012]
A third feature of the present invention is that, in the second feature, the processing unit moves the speech coded bits to the predetermined positions by reversing the order of each group of a predetermined number of speech coded bits arranged sequentially in the extracted frame.
[0013]
According to this feature, the bit shift unit shifts the position of the speech coded bits in the extracted frame, and the bit inversion unit reverses the order of each group of a predetermined number of speech coded bits arranged sequentially in the extracted frame, so the frame configuration can be converted in fewer processing steps.
[0014]
That is, by registering in advance in the bit deletion unit, the bit shift unit, and the bit inversion unit the processing needed to convert the first frame configuration into the second frame configuration, the frame configuration can be converted in few processing steps, and the processing speed of the conversion can be improved.
[0015]
A fourth feature of the present invention is that, in any of the first to third features, the frame whose configuration is converted contains speech coded bits encoded with Adaptive Multi-Rate as the predetermined speech compression coding scheme.
[0016]
According to this feature, AMR data encoded by AMR can be converted, for example, from RTP Payload Format to Interface Format 2. As a result, using the frame configuration conversion apparatus according to the present invention makes it possible, for example, to deliver AMR data in RTP Payload Format published on the Internet to mobile phone terminals and similar devices used in the third-generation mobile communication system.
[0017]
A fifth feature of the present invention is a frame configuration conversion method that converts a first frame into a second frame, the first frame containing, arranged in a predetermined order, speech coded bits encoded with a predetermined speech compression coding scheme in which the coding bit rate can be switched frame by frame, frame type bits indicating the type of the frame, and additional information bits. The method comprises the steps of: extracting the continuously input first frames one frame at a time; detecting the frame type bits in the extracted frame; moving, based on the configuration of the second frame, the speech coded bits contained in the extracted frame to predetermined positions in the extracted frame; and placing, based on the configuration of the second frame, the frame type bits detected in the detecting step at a predetermined position in the extracted frame.
[0018]
DETAILED DESCRIPTION OF THE INVENTION
(Configuration of frame configuration conversion device)
An embodiment of the present invention will be described with reference to FIGS. 1 to 3. FIG. 1 shows the logical block configuration of the frame configuration conversion apparatus according to the present embodiment.
[0019]
As shown in the figure, the frame configuration conversion apparatus 100 includes a data input unit 101, a frame extraction unit 102, an FT detection/storage unit 103, an unnecessary bit deletion unit 104, a bit shift unit 105, a bit inversion unit 106, an FT placement unit 107, and a data output unit 108.
[0020]
The data input unit 101 sends AMR data input from the outside to the frame extraction unit 102. In the present embodiment, AMR data encoded by AMR (Adaptive Multi-Rate) and having the RTP Payload Format (first frame) configuration defined in IETF RFC 3267 is input to the data input unit 101.
[0021]
As the input interface of the data input unit 101, a LAN card such as 100BASE-TX that acquires AMR data via a communication network, or an external storage medium access device that reads AMR data stored on an external storage medium such as a CD-ROM or DVD-ROM, can be used.
[0022]
The frame extraction unit 102 extracts, one frame at a time, the AMR data frames in RTP Payload Format that are continuously input to the data input unit 101.
[0023]
Specifically, the frame extraction unit 102 extracts the AMR data in RTP Payload Format sent from the data input unit 101 one frame at a time, with the CMR (Change Mode Request) bits at the most significant bit (MSB) position, and sends each extracted AMR data frame to the FT detection/storage unit 103.
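The frame-by-frame extraction can be illustrated with a minimal Python sketch. It rests on several assumptions that are not stated in the text: every frame in the input starts with its own Octet 0 and Octet 1, the frame type sits in bits 2 to 5 of Octet 1 counted from the MSB, only the eight AMR speech modes occur, and the bits-per-frame table is quoted from the AMR specification. The names `AMR_SPEECH_BITS`, `frame_length_octets`, and `split_frames` are hypothetical.

```python
# Speech bits per frame for the eight AMR speech modes (FT 0 to 7).
# Assumption: values quoted from the AMR specification, not from this text.
AMR_SPEECH_BITS = {0: 95, 1: 103, 2: 118, 3: 134, 4: 148, 5: 159, 6: 204, 7: 244}


def frame_length_octets(ft: int) -> int:
    """Length of one frame: Octet 0, Octet 1, then the padded speech octets."""
    if ft not in AMR_SPEECH_BITS:
        raise ValueError(f"unsupported frame type in this sketch: {ft}")
    speech_octets = (AMR_SPEECH_BITS[ft] + 7) // 8  # round up; the rest is padding (P)
    return 2 + speech_octets


def split_frames(stream: bytes):
    """Yield complete frames one at a time from a concatenation of frames."""
    pos = 0
    while pos + 2 <= len(stream):
        ft = (stream[pos + 1] >> 3) & 0x0F  # FT: bits 2 to 5 of Octet 1 (bit 1 = MSB)
        end = pos + frame_length_octets(ft)
        if end > len(stream):
            break  # incomplete trailing frame; leave it for the caller
        yield stream[pos:end]
        pos = end
```

Under these assumptions a 12.2 kbit/s frame (FT = 7, 244 speech bits) occupies 33 octets, which matches the Octet 30 that appears later once Octet 0 and Octet 1 have been removed.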
[0024]
Here, FIG. 2 shows a frame configuration of AMR data having the RTP payload format extracted by the frame extraction unit 102.
[0025]
As shown in the figure, AMR data in RTP Payload Format consists of speech coded bits (D) encoded by AMR; frame type bits (FT: frame type) that indicate the frame type, that is, information such as the coding bit rate; and additional information bits such as the CMR bits (CMR), which are used to change the coding bit rate and the like, and a confirmation bit (F), which indicates whether another frame follows.
[0026]
The FT detection/storage unit 103 detects the frame type bits (FT) in the AMR data frame extracted by the frame extraction unit 102. In this embodiment, it constitutes the frame type detection unit.
[0027]
Specifically, the FT detection/storage unit 103 detects the frame type bits (FT) located in the second to fifth bits of Octet 1 shown in FIG. 2, stores the content of the detected frame type bits (FT), and transfers that content to the FT placement unit 107.
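As an illustration, the detection of the frame type bits can be sketched in Python as a shift and a mask. The sketch assumes that "second to fifth bits" counts from the MSB with the MSB as bit 1, so that the first bit of Octet 1 is the confirmation bit (F); the function name `detect_frame_type` is hypothetical.

```python
def detect_frame_type(frame: bytes) -> int:
    """Return the 4-bit FT value held in bits 2 to 5 of Octet 1 (bit 1 = MSB)."""
    if len(frame) < 2:
        raise ValueError("frame must contain at least Octet 0 and Octet 1")
    # Assumed Octet 1 layout: F | FT FT FT FT | three remaining bits
    return (frame[1] >> 3) & 0x0F


# Octet 1 = 0b0_0111_100: F = 0, FT = 0b0111
print(detect_frame_type(bytes([0xF0, 0b0_0111_100])))  # 7
```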
[0028]
Further, after detecting the frame type bit (FT), the FT detection / storage unit 103 sends the AMR data frame sent from the frame extraction unit 102 to the unnecessary bit deletion unit 104.
[0029]
Based on the configuration of the Interface Format 2 (second frame) to be output, the unnecessary bit deletion unit 104 deletes the frame type bits (FT) and the additional information bits, such as the CMR bits and the confirmation bit (F), from the AMR data frame sent from the FT detection/storage unit 103.
[0030]
Specifically, the unnecessary bit deletion unit 104 deletes the information located in Octet 0 and Octet 1 shown in FIG. 2.
[0031]
The unnecessary bit deletion unit 104 then sends the AMR data frame, from which the frame type bits (FT) and the additional information bits such as the CMR bits and the confirmation bit (F) have been deleted, to the bit shift unit 105.
[0032]
Based on the configuration of RTP Payload Format and the configuration of Interface Format 2, the bit shift unit 105 shifts the position of the speech coded bits (D) in the AMR data frame sent from the unnecessary bit deletion unit 104.
[0033]
Specifically, the bit shift unit 105 shifts the speech coded bits (D) by 4 bits toward the LSB side. The bit shift unit 105 also deletes the padding bits (P) contained in the last octet in which speech coded bits (D) are located.
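The shift and the padding removal can be sketched in Python on an explicit list of bits. This is only an illustration under stated assumptions: the input is the frame with Octet 0 and Octet 1 already deleted, the number of speech coded bits is known to the caller, and the four vacated positions are represented as zeros. The sketch drops the padding before shifting, which gives the same bit sequence as shifting first. The names `to_bits` and `shift_and_strip` are hypothetical.

```python
from typing import List


def to_bits(data: bytes) -> List[int]:
    """Unpack octets into a flat list of bits, MSB first within each octet."""
    return [(octet >> (7 - i)) & 1 for octet in data for i in range(8)]


def shift_and_strip(speech_octets: bytes, n_speech_bits: int) -> List[int]:
    """Drop the padding bits (P) and shift the speech bits 4 positions toward the LSB."""
    bits = to_bits(speech_octets)
    if n_speech_bits > len(bits):
        raise ValueError("fewer bits available than n_speech_bits")
    speech = bits[:n_speech_bits]  # everything after the speech bits is padding
    return [0, 0, 0, 0] + speech   # four vacated positions at the MSB side of Octet 0


# Four speech bits 1,0,1,1 followed by four padding bits
print(shift_and_strip(bytes([0b10110000]), 4))  # [0, 0, 0, 0, 1, 0, 1, 1]
```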
[0034]
The bit shift unit 105 then sends the AMR data frame, in which the speech coded bits (D) have been shifted and the padding bits (P) deleted, to the bit inversion unit 106.
[0035]
Based on the configuration of RTP Payload Format and the configuration of Interface Format 2, the bit inversion unit 106 reverses the order of each group of a predetermined number of speech coded bits (D) arranged sequentially in the AMR data frame sent from the bit shift unit 105.
[0036]
Specifically, the bit inversion unit 106 reverses the order of the speech coded bits (D) octet by octet, that is, in units of 8 bits, in the AMR data frame sent from the bit shift unit 105.
[0037]
For example, if an octet in the AMR data frame holds the speech coded bits in the order D(236)-D(237)-D(238)-D(239)-D(240)-D(241)-D(242)-D(243) from the MSB side to the LSB (Least Significant Bit) side, the bit inversion unit 106 reverses the arrangement in that octet to D(243)-D(242)-D(241)-D(240)-D(239)-D(238)-D(237)-D(236). That is, the bit inversion unit 106 changes the speech coded bits (D) from MSB first to LSB first in units of octets (8 bits).
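The per-octet reversal can be sketched in Python as follows. The function names `reverse_bits_in_octet` and `reverse_each_octet` are hypothetical and chosen for illustration only.

```python
def reverse_bits_in_octet(value: int) -> int:
    """Reverse the bit order of one octet, so the MSB becomes the LSB."""
    result = 0
    for _ in range(8):
        result = (result << 1) | (value & 1)  # move the current LSB into the result
        value >>= 1
    return result


def reverse_each_octet(data: bytes) -> bytes:
    """Apply the reversal to every octet; the order of the octets is unchanged."""
    return bytes(reverse_bits_in_octet(octet) for octet in data)


print(reverse_each_octet(bytes([0b10000000, 0b00001111])).hex())  # 01f0
```

A 256-entry lookup table would be a common alternative where speed matters; the loop is used here only because it makes the operation explicit.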
[0038]
The bit inversion unit 106 then sends the AMR data frame in which the order of the speech coded bits (D) has been reversed to the FT placement unit 107.
[0039]
In the present embodiment, the unnecessary bit deletion unit 104, the bit shift unit 105, and the bit inversion unit 106 constitute a processing unit. A more specific frame configuration conversion method by the unnecessary bit deleting unit 104, the bit shift unit 105, and the bit inversion unit 106 will be described later.
[0040]
Based on the configuration of Interface Format 2, the FT placement unit 107 places the frame type bits (FT) detected by the FT detection/storage unit 103 at a predetermined position in the AMR data frame sent from the bit inversion unit 106. In this embodiment, it constitutes the frame type bit placement unit.
[0041]
Specifically, the FT placement unit 107 places the frame type bits (FT) transferred from the FT detection/storage unit 103 in Octet 0 of the AMR data frame sent from the bit inversion unit 106, starting at the fifth bit and proceeding toward the LSB side, that is, MSB first.
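A minimal Python sketch of this placement follows. It assumes that the "fifth bit" is counted from the MSB with the MSB as bit 1, so the four FT bits fill the lower half of Octet 0 with the most significant FT bit first; the function name `place_frame_type` is hypothetical.

```python
def place_frame_type(frame: bytes, ft: int) -> bytes:
    """Write the 4-bit FT value into bits 5 to 8 of Octet 0 (bit 1 = MSB)."""
    if not frame:
        raise ValueError("frame is empty")
    if not 0 <= ft <= 0x0F:
        raise ValueError("FT must fit in 4 bits")
    # Keep the upper four bits of Octet 0 and overwrite the lower four with FT
    first_octet = (frame[0] & 0xF0) | ft
    return bytes([first_octet]) + frame[1:]


print(place_frame_type(bytes([0xA0, 0x55]), 7).hex())  # a755
```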
[0042]
Also, the FT placement unit 107 sends the AMR data frame in which the frame type bit (FT) is placed to the data output unit 108.
[0043]
The data output unit 108 outputs the AMR data processed by the unnecessary bit deletion unit 104, the bit shift unit 105, the bit inversion unit 106, and the FT placement unit 107 as AMR data in Interface Format 2. In this embodiment, it constitutes the output unit.
[0044]
Specifically, the data output unit 108 outputs AMR data sent from the FT placement unit 107 to the outside as AMR data having Interface Format 2. Here, FIG. 3 shows a configuration of an AMR data frame having Interface Format 2 output from the data output unit 108.
[0045]
As shown in the figure, AMR data having Interface Format 2 is composed of speech coding bits (D) encoded by AMR and frame type bits (FT) indicating the frame type.
[0046]
Further, compared with the AMR data having the RTP Payload Format shown in FIG. 2, the speech coding bits (D) are sequentially arranged from the LSB to the MSB side for each octet, that is, LSB first.
[0047]
As an output interface provided in the data output unit 108, a LAN card such as 100BASE-TX that outputs AMR data via a communication network can be used. In such a case, the data input unit 101 and the data output unit 108 can of course be configured by the same LAN card.
[0048]
(Frame structure conversion method)
Next, a method for converting the frame configuration of AMR data using the above-described frame configuration conversion apparatus according to the present embodiment will be described.
[0049]
FIG. 4 shows a processing flow of the frame configuration conversion apparatus 100 when converting the AMR data frame from the RTP Payload Format to the Interface Format 2.
[0050]
As shown in the figure, in step S10, the frame configuration conversion apparatus 100 extracts the input AMR data in RTP Payload Format one frame at a time. As described above, the AMR data frame extracted in step S10 has the frame configuration shown in FIG. 2.
[0051]
In step S20, the frame configuration conversion apparatus 100 detects the frame type bit (FT) from the AMR data frame extracted in step S10, and stores the content of the detected frame type bit (FT).
[0052]
Here, FIG. 5 shows the position of the frame type bit (FT) detected in step S20 on the AMR data frame. As shown in the figure, in the RTP Payload Format, the frame type bit (FT) is arranged in the second to fifth bits of Octet 1.
[0053]
In step S30, the frame configuration conversion apparatus 100 deletes the frame type bits (FT) and the additional information bits, such as the CMR bits and the confirmation bit (F), from the AMR data frame extracted in step S10.
[0054]
Here, FIG. 6 shows the positions in the AMR data frame of the frame type bits (FT) and the additional information bits, such as the CMR bits and the confirmation bit (F), that are deleted in step S30. As shown in the figure, the frame configuration conversion apparatus 100 deletes the frame type bits (FT) and the additional information bits, such as the CMR bits and the confirmation bit (F), located in Octet 0 and Octet 1 of the AMR data frame extracted in step S10.
[0055]
In step S40, the frame configuration conversion apparatus 100 shifts the speech coding bits (D) included in the AMR data frame by 4 bits to the LSB side.
[0056]
In step S50, the frame configuration conversion apparatus 100 deletes the padding bit (P) included in the last octet in which the speech encoded bit (D) is located.
[0057]
Here, FIG. 7A shows the configuration of the AMR data frame before the bit shift of the speech coded bits (D) in step S40 is executed. FIG. 7B shows the configuration of the AMR data frame after the processing in steps S40 and S50, that is, after the speech coded bits (D) have been shifted by 4 bits toward the LSB side and the padding bits (P) deleted. As shown in FIG. 7B, the 4 bits from the MSB of Octet 0 hold no information as a result of the bit shift.
[0058]
In step S60, the frame configuration conversion apparatus 100 reverses the order of the speech coded bits (D) in units of octets (8 bits) in the AMR data frame on which steps S40 and S50 have been performed. That is, the speech coded bits (D) are changed from MSB first to LSB first octet by octet.
[0059]
Here, FIG. 8A shows the structure of the AMR data frame before the order of the speech coding bits (D) is reversed. FIG. 8B shows the structure of the AMR data frame after the order of the speech coding bits (D) is reversed.
[0060]
For example, D(236), located at the MSB of Octet 30 before the reversal, is moved by the processing of step S60 to the LSB of Octet 30, that is, to the position occupied by D(242) before the reversal. As indicated by the arrows in FIG. 8A, the positions of D(237) to D(242) are likewise reversed, and the order of the other speech coded bits (D) is reversed octet by octet in the same way.
[0061]
In step S70, the frame configuration conversion apparatus 100 arranges the frame type bit (FT) detected in step S20 at a predetermined position on the AMR data frame.
[0062]
Here, FIG. 9 shows the position where the frame type bit (FT) is arranged in step S70. As shown in the figure, the frame configuration conversion apparatus 100 sequentially arranges the frame type bits (FT) from the fifth bit of Octet 0 of the AMR data frame in the LSB direction, that is, MSB first.
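Step S70 can be sketched as writing the 4-bit FT value into the lower half of Octet 0. The sketch assumes that "the fifth bit" is counted from the MSB, so that the FT bits fill the four LSB-side bits with the FT MSB highest; the function name `place_frame_type` is illustrative.

```python
def place_frame_type(frame: bytes, ft: int) -> bytes:
    """Write the 4-bit frame type into the LSB-side nibble of Octet 0."""
    if not frame:
        raise ValueError("empty frame")
    if not 0 <= ft <= 0x0F:
        raise ValueError("FT must fit in 4 bits")
    # Keep the speech bits in the upper nibble, overwrite the lower nibble.
    first = (frame[0] & 0xF0) | ft
    return bytes([first]) + frame[1:]
```

Masking with `0xF0` before the OR ensures that stale bits left in the lower nibble do not corrupt the FT value.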
[0063]
In step S80, the frame configuration conversion apparatus 100 outputs the AMR data in which the frame type bit (FT) is arranged in step S70 as AMR data having the Interface Format 2.
[0064]
(Action / Effect)
According to the present embodiment, the unnecessary bit deletion unit 104, the bit shift unit 105, and the bit inversion unit 106 move the speech coding bits (D) to predetermined positions on the AMR data frame extracted by the frame extraction unit 102, based on the configuration of Interface Format 2, and the FT arrangement unit 107 places the frame type bits (FT) at a predetermined position on the extracted AMR data frame. It is therefore possible to convert the configuration of a frame containing speech coding bits encoded with a predetermined speech compression coding scheme in which the coding bit rate can be switched in units of frames.
[0065]
According to the present embodiment, the bit shift unit 105 shifts the position of the speech coding bits (D) on the AMR data frame, and the bit inversion unit 106 inverts the order of a predetermined number of speech coding bits (D) arranged sequentially on the frame. Therefore, RTP Payload Format can be converted to Interface Format 2 with fewer processing steps.
[0066]
That is, according to the present embodiment, the contents of processing necessary for conversion from RTP Payload Format to Interface Format 2 are registered in advance in the unnecessary bit deletion unit 104, the bit shift unit 105, and the bit inversion unit 106. Thus, the frame configuration can be converted with a small number of processing steps, and the processing speed related to the frame configuration conversion can be improved.
[0067]
Furthermore, according to the present embodiment, using the frame configuration conversion apparatus 100 makes it possible, for example, to deliver AMR data in RTP Payload Format published on the Internet to mobile telephone terminals used in the third-generation mobile communication system.
[0068]
(Modification)
In the embodiment of the present invention described above, an example in which AMR data having RTP Payload Format is converted to AMR data having Interface Format 2 has been described. However, the present invention can also be applied to other frame configurations by registering in advance, in the unnecessary bit deletion unit 104, the bit shift unit 105, the bit inversion unit 106, and the like, the processing required to convert the configuration of the frames input to the frame configuration conversion apparatus 100 into the configuration of the frames to be output.
[0069]
[Effect of the invention]
As described above, according to the present invention, it is possible to provide a frame configuration conversion apparatus and a frame configuration conversion method that can convert a frame containing speech coding bits encoded with a speech coding scheme in which the coding bit rate can be switched in units of frames, such as AMR (Adaptive Multi-Rate), into another frame of that speech coding scheme.
[Brief description of the drawings]
FIG. 1 is a diagram showing a logical block configuration of a frame configuration conversion apparatus according to an embodiment of the present invention.
FIG. 2 is a diagram showing a frame configuration of data input to the frame configuration conversion apparatus according to the embodiment of the present invention.
FIG. 3 is a diagram showing a frame configuration of data output from the frame configuration conversion apparatus according to the embodiment of the present invention.
FIG. 4 is a diagram illustrating a frame configuration conversion method by the frame configuration conversion apparatus according to the embodiment of the present invention.
FIG. 5 is a diagram showing an arrangement state of data on a frame in the middle of frame configuration conversion by the frame configuration conversion apparatus according to the embodiment of the present invention.
FIG. 6 is a diagram showing an arrangement state of data on a frame in the middle of frame configuration conversion by the frame configuration conversion apparatus according to the embodiment of the present invention.
FIG. 7 is a diagram showing an arrangement state of data on a frame in the middle of frame configuration conversion by the frame configuration conversion apparatus according to the embodiment of the present invention.
FIG. 8 is a diagram showing an arrangement state of data on a frame in the middle of frame configuration conversion by the frame configuration conversion apparatus according to the embodiment of the present invention.
FIG. 9 is a diagram showing an arrangement state of data on a frame in the middle of frame configuration conversion by the frame configuration conversion apparatus according to the embodiment of the present invention.
[Explanation of symbols]
100 ... Frame configuration conversion apparatus, 101 ... Data input unit, 102 ... Frame extraction unit, 103 ... FT detection/storage unit, 104 ... Unnecessary bit deletion unit, 105 ... Bit shift unit, 106 ... Bit inversion unit, 107 ... FT arrangement unit, 108 ... Data output unit

Claims

A frame configuration conversion device that converts a first frame, in which speech coding bits encoded based on a predetermined speech compression coding scheme in which the coding bit rate can be switched in units of frames, frame type bits indicating a frame type, and additional information bits are arranged in a predetermined order, into a second frame, the device comprising:
a frame extraction unit that extracts the continuously input first frames in units of frames;
a frame type detection unit that detects the frame type bits from the extracted frame;
a processing unit that, based on the configuration of the second frame, moves the speech coding bits contained in the extracted frame to a predetermined position on the extracted frame; and
a frame type bit arrangement unit that, based on the configuration of the second frame, arranges the frame type bits detected by the frame type detection unit at a predetermined position on the extracted frame.

The frame configuration conversion device according to claim 1, wherein the processing unit deletes the frame type bits and the additional information bits contained in the extracted frame, and moves the speech coding bits to the predetermined position by shifting the position of the speech coding bits on the extracted frame.

The frame configuration conversion device according to claim 2, wherein the processing unit moves the speech coding bits to the predetermined position by inverting the order of a predetermined number of the speech coding bits arranged sequentially on the extracted frame.

The frame configuration conversion device according to claim 1, wherein the configuration of a frame containing the speech coding bits encoded by Adaptive Multi-Rate, as the predetermined speech compression coding scheme, is converted.

A frame configuration conversion method for converting a first frame, in which speech coding bits encoded based on a predetermined speech compression coding scheme in which the coding bit rate can be switched in units of frames, frame type bits indicating a frame type, and additional information bits are arranged in a predetermined order, into a second frame, the method comprising:
extracting the continuously input first frames in units of frames;
detecting the frame type bits from the extracted frame;
moving, based on the configuration of the second frame, the speech coding bits contained in the extracted frame to a predetermined position on the extracted frame; and
arranging, based on the configuration of the second frame, the frame type bits detected in the detecting step at a predetermined position on the extracted frame.