JP3852890B2

JP3852890B2 - Recording / playback device

Info

Publication number: JP3852890B2
Application number: JP27678098A
Authority: JP
Inventors: 浩三林
Original assignee: Sharp Corp
Current assignee: Sharp Corp
Priority date: 1998-09-30
Filing date: 1998-09-30
Publication date: 2006-12-06
Anticipated expiration: 2018-09-30
Also published as: JP2000105600A

Description

【０００１】
【発明の属する技術分野】
本発明は、情報の記録再生装置に関する発明である。
【０００２】
【従来の技術】
近年、音楽や映画などのアナログ情報をデジタル情報化して、記録、再生する様々な装置が開発されている。また記録媒体の有効利用のためにさらに情報に圧縮を加えて記録し、またこれを伸長して再生する装置も数多く開発されている。
【０００３】
多くの情報圧縮の手法も規格化、または提案されており、多くの機器においては装置に組み込まれたデジタルシグナルプロセッサ（以降ＤＳＰと呼ぶ）に演算処理をさせている。
【０００４】
一般的に圧縮、伸長のアルゴリズムは人間の聴覚の特性を利用し、単位時間内のアナロク情報をデジタル化した数値をもとに、パターンの分析や周波数の分析を行ない不要な部分を削除することで情報の圧縮を行っている。
【０００５】
最近ではこれらの圧縮手法を応用したテープ系の装置や、ディスクメディア系のポータブルオーディオ装置や、ビデオ装置が実用化されている。
【０００６】
またこれらの圧縮手法を半導体メモリに応用した記録再生装置も開発されつつあり、機器の小型化、省電力化に伴いその情報記録媒体として、大容量の半導体メモリが注目されている。
【０００７】
中でも一括消去型メモリであるフラッシュメモリはバックアップ電源の不要なことや大容量化が可能なことからこの情報記録媒体として注目されている。
【０００８】
一方、機能面から見ると、再生開始位置や録音開始位置を頭出しするために音声を早く再生したり（以降、早聞き再生と呼ぶ）、また正確なポジショニングの為にわざと遅く再生（以降、遅聞き再生と呼ぶ）するなど、利用者にとって便利な様々な機能も提供されている。
【０００９】
【発明が解決しようとする課題】
前述したように多くの装置は早聞き再生や遅聞き再生の機能を持っているが、音声を圧縮して録音する装置においては、可変速再生時に次のような問題が発生する。
【００１０】
まず早聞き時における問題について説明する。
【００１１】
音声圧縮は例えば図３にあるように、一定時間の区切り（図３では１０ｍｓｅｃ）に含まれる情報をＤＳＰで圧縮し記録される。
【００１２】
通常の再生であれば逆に、これを順番に読み出しＤＳＰにより伸長し録音時と同じ時間経過で波形再生すればよいのであるが、早聞きの場合はこれを何らかの手段で時間経過を短縮する処理が必要となる。
【００１３】
原理的には、２倍で早聞きしようとすれば、録音時における実時間の１／２の時間より早くＤＳＰの伸長処理を済ませる必要がある。
【００１４】
実際にはそれから、時間軸方向の短縮補正やシステム上の付随処理もあるので、録音時における実時間の１／４程度で伸長処理を行う必要がある。
【００１５】
しかしながらＤＳＰの処理速度にはプロセッサ自体の能力限界がある。たとえプロセッサ自体の能力限界以内であったとしても、処理速度を早くすればするほど、消費電流もおおきくなり、特に電池を用いた携帯機器においては、致命的な欠点となる。
【００１６】
このような理由から圧縮情報を再生する現在の多くのポータブル機器においては、一定時間の区切りを間引くことで早聞き処理を行っている。
【００１７】
したがって間引かれた一定時間の区切りに含まれる音声は捨てられ再生されないので、早聞き時に音声の連続性が損なわれてしまい、早聞きを速くすればするほど、聞き苦しい再生音に成ってしまうという問題がある。
【００１８】
次に遅聞き時における問題について説明する。
【００１９】
遅聞き時には逆に早聞き時よりもＤＳＰの処理速度には有利に働くのでプロセッサ自体の能力に問題はなくなる。
【００２０】
しかし時間経過を引き伸ばす処理は必要で、そのために同じ一定時間の区切りの音声も何度かリピートすることになる。
【００２１】
このように圧縮した情報を伸長した波形だけを利用して遅聞き再生をすると音声に連続性がなくなり、聞き苦しい再生になってしまうという問題が生じる。
【００２２】
次にメモリの有効利用に関する課題について説明する。
【００２３】
一方では、メモリの利用効率や利用方法に関する課題もある。多くのディスク系記録媒体がトラックという物理的な制約からセクタに代表される固定記録単位を有している。
【００２４】
ところが、図３の固定記録単位５１２バイトの例にあるように圧縮された情報は必ずしも固定記録単位の記録可能バイト数とは一致しない。
【００２５】
よってあまった部分は未使用のままになっていることが多い。その理由は次の通りである。
【００２６】
音声情報や動画情報などの時間軸をもった情報を圧縮して記録する場合には、リアルタイム性がないと実現できないので、トラックやセクタをまたがって圧縮情報単位を書くことは、記録場所のサーチ方法が複雑になり処理に限界があり現実的でないこと。
【００２７】
もうひとつはバッファフラッシュ（固定記録単位のバイト数が蓄積されたときにバッファから記録媒体に物理的に書き込みを実行すること）のために、通常より多くの時間を必要とするため、時間的な制約から、バッファサイズを大きくとってその時間を吸収する必要があること。
【００２８】
さらには、部分書き換え等の処理を行う場合にも、トラックやセクタをまたがって圧縮情報単位を書いてあると、処理が非常に複雑化し処理能力上、効率が悪いこと等が理由として挙げられる。
【００２９】
このように複雑な処理をするためには高機能の制御回路とＳＲＡＭ等のバッファを必要とする。ところがそれはコストアップにつながるので、メイン情報の配分によっては少々の未使用部分はあっても安価な大容量のメモリを使用したほうが、コスト的にも信頼性や機能面でも有利であるという側面もある。
【００３０】
上記のことは、ディスク系の記録メディアをエミュレートした半導体メモリを用いた記録メディアにおいても同じことが言える。
【００３１】
実際にある種のフラッシュメモリはチップの物理的書き込み仕様がディスク系の記録単位と同じにしてあるものもある。
【００３２】
ある意味ではディスク系に記録された情報との互換性を考えると半導体に記録する場合も同じにしておいた方がよいという側面もある。
【００３３】
以上のように、記録メディアを問わず、剰余部分はあまり有効に利用されないでいるのが現状である。
【００３４】
本発明はこれらの課題を鑑み、聞き苦しい音声の問題とメモリの有効利用とを同時に解決し、さらにこれを有効に利用し装置の機能を向上させようとするものである。
【００３５】
【課題を解決するための手段】
本発明は、上記課題を解決する事を目的に、以下の機能を組み込んだものである。
【００３６】
本発明では、アナログデータを圧縮して記録するにはまずアナログ情報をデジタル情報に変換する必要がある。これはＡ／Ｄコンバータで実現できる。さらにその情報をＤＳＰにより圧縮を行う。さらに、Ａ／Ｄコンバータの出力をＤＳＰへ入力すると同時に別系統の回路で圧縮を施さない情報として処理する回路を設ける。また、圧縮した情報を音声として出力する回路と、圧縮を行わず間引いた情報を音声として出力する回路が設けられ、更にどちらの回路を動作させるかを選ぶ操作キーが設けられる。
【００３７】
さらに、圧縮を行うアナログ情報のサンプリングレートで取得したＡ／Ｄコンバータの出力情報を、別系統のこれよりも少ないサンプリングレートを基準に間引く回路を設ける。
【００３９】
また、圧縮した情報と圧縮を行わず間引いた情報とを同時に記録媒体の同じ記録単位内に記録するには、記録単位にと同じサイズのバッファ内これを書き、圧縮情報が出力された時点でこれを書き込むよう回路を設ける。
【００４０】
また、圧縮を行わず間引いた情報だけを読み込む回路を設け、記録時のサンプリングレートで再生を行うＤ／Ａ回路を設ける。
【００４１】
また、圧縮を行わず間引いた情報をもとに再生を行う際にＤ／Ａの基準クロックを可変にする回路を設けレビュー、キュー等の可変速再生処理を行う。
【００４２】
また、レビュー、キュー等を加速再生を行う際に、音程を変えないで可変速再生処理を行う。
【００４３】
また、低サンプリングレートでの録音処理のときにサンプリングした情報を元に音程を変えない可変速再生の処理アルゴリズムを応用して、繰り返しパターンの検出を行いその繰り返し回数を記録することでさらに低ビットレートする。
【００４４】
また、低サンプリングレートでの再生と、高サンプリングレートでの圧縮、記録動作を別系統で同時に処理する回路を設ける。
【００４５】
また、音韻の繰り返しパターンの検出回路を応用しフレーズの切れ目を検出する手段を設ける。
【００４６】
また、圧縮を行わず間引いた情報と圧縮をおこなった情報とを別々に管理し、他の装置へ転送するインターフェース回路を設ける。
【００４７】
また、記録媒体にフラッシュメモリに代表される固定長記録単位書き込み型の半導体メモリの書き込み読み込み制御に則ったアルゴリズムで書き込み、読み込みを行う。またはメモリカードを使用し脱着可能なカードスロットとカードインターフェース回路を設ける。
【００４８】
本発明の作用は、以下の通りである。
【００４９】
本発明による作用は、圧縮情報と非圧縮情報を利用した処理が可能になるとともに、それらを選択的に音声再生できることである。
【００５０】
また、圧縮情報と非圧縮情報を利用した処理が可能になる上に、非圧縮情報の情報量を少なくすることができることである。
【００５２】
また、圧縮情報と非圧縮情報を同時に処理が可能になる上にこれらを同じ領域に記録することができるので領域を有効に、また高速に利用することができることである。
【００５３】
また、非圧縮情報を元に再生するのでＤＳＰの伸長回路を動作させなくとも再生ができることである。
【００５４】
また、可変速再生処理に非圧縮情報を利用できるようになることである。
【００５５】
また、可変速再生処理に非圧縮情報を利用できるようになる上に低ビットレートもできることである。
【００５６】
また、ＤＳＰが録音モードで動いていても録音中に再生ができるようになることである。
【００５７】
また、逆方向で意味のある（聞き取れる）再生ができるようになることである。
【００５８】
また、別の装置に非圧縮情報だけをまたは圧縮情報だけを転送または受信することができることである。
【００５９】
また、フラッシュメモリの書き込み方法を利用し、空き領域を有効に利用した高機能の携帯型の半導体記録再生装置が可能となることである。
【００６０】
【発明の実施の形態】
以下、本発明の実施の形態について図面を参照して説明する。
【００６１】
まず実施の形態における録音の場合を説明する。本発明の基本構成を図１のブロック図で説明する。ここでは音楽の録音再生機を例にサンプリングレート＝４８ＫＨｚ、分解能＝１６ビットとして説明する。
【００６２】
キー入力手段（Ｋ１）により録音が制御回路（Ｍ０）に指示されると、制御回路（Ｍ０）はＤＳＰである音声圧縮伸長回路（Ｄ１）を圧縮モードに設定し、またＡ／Ｄ変換回路（Ａ３）をアクティブにする。
【００６３】
これによりアナログ入力回路（Ａ１）から音声情報が入力され、Ａ／Ｄ変換回路（Ａ３）により、１６ビットのデジタル情報が４８ＫＨｚごとに出力される。
【００６４】
従来であればここで１６ビットのデジタル情報は音声圧縮伸長回路（Ｄ１）に送られ圧縮されたデータがシステム制御回路（Ｍ０）により読まれるという経路だけであるが、本発明においては、１６ビットのデジタル情報をサンプリング変換回路（Ｄ２）にも同時に送る。
【００６５】
サンプリング変換回路（Ｄ２）は例えばこれを４８ＫＨｚから８ＫＨｚに間引く処理を行う。
【００６６】
さらに１６ビットの情報は人間の聴覚特性を利用したμ―ｌｏｗ則等によりさらに８ビットのデータにすることができる。これはハードウエアで行ってもよいし、制御回路（Ｍ０）内のソフトウエアで行ってもよい。
【００６７】
これらにより、録音時には制御回路（Ｍ０）には圧縮された情報と非圧縮の低サンプリングの情報とが同時に入力されることになる。
【００６８】
次に制御回路（Ｍ０）はこれらのデータをメモリチップ群（Ｍ１からＭｎ）に順次書き込みを行う。
【００６９】
この例では圧縮された情報を４２４バイト（Ｌチャンネル＝２１２バイト・Ｒチャンネル＝２１２バイト）非圧縮の情報を８０バイトとして説明する。
【００７０】
書き込みに際しては図２のメイン領域に圧縮情報を記録する。
【００７１】
図２の未使用領域の８４バイトを利用し、非圧縮の情報の８０バイトを記録する。
【００７２】
図２でいうページはこの半導体メモリとしてページ書き込み型のフラッシュメモリを使用した例である。
【００７３】
このページが多くのディスク系記録媒体でいうセクタに代表される固定記録単位に相当する。
【００７４】
もちろんメインの圧縮情報と非圧縮情報はこれ以外の規則、法則で記録してもよい。
【００７５】
次に、実施の形態における通常再生について説明する。
【００７６】
キー入力手段（Ｋ１）により再生が制御回路（Ｍ０）に指示されると、制御回路（Ｍ０）はＤＳＰである音声圧縮伸長回路（Ｄ１）を伸長モードに設定し、またＤ／Ａ変換回路（Ａ４）をアクティブにする。
【００７７】
次にメモリチップ群（Ｍ１からＭｎ）のメイン領域（図２に示す）に書かれている圧縮された情報は制御回路（Ｍ０）により読み込まれる。
【００７８】
音声圧縮伸長回路（Ｄ１）は定期的に制御回路（Ｍ０）から圧縮情報を要求し、図３の例で言えば２１２バイト（ＬまたはＲチャンネル）を受け取り終ると伸長処理を行う。
【００７９】
伸長されたデータは録音時のサンプリングレート（図３の例では４８ＫＨｚ）に従いＤ／Ａ変換回路（Ａ４）に送られアナログ出力回路（Ａ２）から再生される。
【００８０】
次に、実施の形態における非圧縮情報の再生について説明する。
【００８１】
早聞き機能は使用者が頭出しを行いたい場所へ音声を確認しながら位置を移動させる機能で、例えば、曲やフレーズの頭出しや曲を分割したい場合の分割位置を特定するリハーサル機能、録音用件を録音実時間よりも早く聞きたい場合などで利用される機能である。
【００８２】
ところが、発明が解決しようとする課題の項でも述べたように、従来の方法によれば、一定時間の区切りを間引くことで早聞き処理を行っているために音声に連続性がなくなっている。
【００８３】
そこで本実施の形態においては、これを解決すべく非圧縮の情報を用いた早聞きの例を説明する。
【００８４】
非圧縮の情報は通常録音の実施の形態で説明したように圧縮情報と同時に録音されている。
【００８５】
まず、キー入力手段（Ｋ１）により早聞き再生が制御回路（Ｍ０）に指示されると、制御回路（Ｍ０）はＤＳＰである音声圧縮伸長回路（Ｄ１）を切り替え信号により休止させる。
【００８６】
これは早聞き再生には低サンプリングレートの非圧縮情報を使用するため伸長は必要ないからである。
【００８７】
ただしサンプリング変換回路（Ｄ３）とＤ／Ａ変換回路（Ａ４）はアクティブにする。
【００８８】
次にメモリチップ群（Ｍ１からＭｎ）のメイン領域（図２に示す）に書かれている非圧縮情報は制御回路（Ｍ０）により読み込まれる。
【００８９】
サンプリング変換回路（Ｄ２）は定期的に制御回路（Ｍ０）から情報を要求し、Ｄ／Ａ変換回路（Ａ４）に送られアナログ出力回路（Ａ２）から再生される。
【００９０】
次に、実施の形態における非圧縮情報の早聞き再生について説明する。
【００９１】
基本的には前述した非圧縮情報を再生する実施の形態と同じ処理の流れである。
【００９２】
違うのはサンプリング変換回路（Ｄ３）により時間軸方向の波形の短縮が行われるため、音程は高くなるもののテープ録音装置でテープを早送りしながら再生するいわゆる「キュー再生」と同じ効果が得られる。
【００９３】
次に、実施の形態における音程を変えない非圧縮情報の早聞き再生の第１例を説明する。
【００９４】
基本的には前述した非圧縮情報を早聞を再生する実施の形態と同じ処理の流れである。
【００９５】
違うのはサンプリング変換回路（Ｄ３）内でより時間軸方向の波形の削除が行われるため、音程を変えないで早送り再生するいわゆる「可変速再生」が得られる。
【００９６】
より具体的には、再生された音声波形をメモリに記憶させておき、基本単位毎に交互にサンプリングと棄却が繰り返され、サンプリングされた音声波形を棄却期間の時間長さ分だけ時間的に伸長することによって音程を下げ、これによって聞き手に聞き取りやすくするものである。
【００９７】
なお、この種の技術の一例としては、特許番号第２６２４８２６号公報に開示されている。
【００９８】
次に、実施の形態における音程を変えない非圧縮情報の早聞き再生の第２例を説明する。
【００９９】
基本的には前述した非圧縮情報を早聞を再生する実施の形態１と同じ処理の流れである。
【０１００】
違うのは、再生すべき波形情報を時間軸方向で繰り返しパターンを分析して同じ波形が繰り返されている部分を削除して時間を短縮するという手法を適用することである。
【０１０１】
これを応用すれば、より自然に近い発生音で音程を変えない早聞き再生が可能である。本実施の形態の場合には、すでに、非圧縮の波形情報は記録されているので、圧縮情報を伸長しなくとも分析の元となる波形が得られる。
【０１０２】
この分析は、その精度に応じて音声圧縮伸長回路（Ｄ１）ＤＳＰで行ってもよいし、サンプリング変換回路にその機能を組み込んでもよい。制御回路（Ｍ０）のソフトウエア処理でこれを行うことも可能である。
【０１０３】
またより自然な発音に近づけるためには、複数フレームの時間幅をもって音韻の繰り返しの分析を行う必要があるので、この場合には複数のフレームの非圧縮情報を一度に読むようにする。
【０１０４】
次に、実施の形態における遅聞き再生を説明する。
【０１０５】
前述した非圧縮情報を早聞を再生する実施の形態１、２と同じことが言える。
【０１０６】
違うのは、再生すべき波形情報を時間軸方向で繰り返しパターンを分析して同じ波形が繰り返されている部分を削除するのではなく逆に何度か実時間よりも多く繰り返す手法を適用することである。
【０１０７】
これをにより、より自然に近い発生音で音程を変えない遅聞き再生が可能である。
【０１０８】
次に、実施の形態における録音非圧縮情報の低ビットレート化を説明する。
【０１０９】
音程を変えない非圧縮情報の早聞き再生の実施の形態の第２例で説明したように、記録された非圧縮の情報は、音程を変えない早聞きに利用されるが、それに利用されることが決まっているのであれば、録音時に前もって再生すべき波形情報を時間軸方向で繰り返しパターンを分析して同じ波形が繰り返されている部分を削除して時間を短縮するという手法もある。
【０１１０】
これは前述した録音の実施の形態と基本的には同じ処理の流れであが、違うのは録音と同時に入力された波形情報を時間軸方向で繰り返しパターンを分析して同じ波形が繰り返されている部分を削除してからメモリ記録するという処理がはいることである。
【０１１１】
これにより残った波形だけを記録すれば録音時の処理は増加するものの再生時は簡単な処理で音程を変えない早聞き再生が可能となる。
【０１１２】
次に、実施の形態の同時録音再生を説明する。
【０１１３】
非圧縮情報の再生の実施の形態や早聞きの実施の形態にあるように、非圧縮の情報を再生するときは、ＤＳＰである音声圧縮伸長回路音声圧縮伸長回路（Ｄ１）は動作しなくともよい。
【０１１４】
従って録音の実施の形態にあるように録音中であっても、サンプリング変換回路（Ｄ３）Ｄ／Ａコンバータ（Ａ４）を稼動させることにより、再生が可能となる。
【０１１５】
次に、実施の形態の非圧縮情報のコメント録音について説明する。
【０１１６】
上記までの実施の形態の場合、非圧縮情報の録音は圧縮情報録音と同時に行っているが、他にも、圧縮情報を録音するとき非圧縮情報は録音しないように使用者が設定できるようにしてもよい。
【０１１７】
この場合、あとから圧縮情報を再生中に例えばコメント録音ボタンを押すと、その内容を現在位置に非圧縮情報として記録するようにすればよい。
【０１１８】
ちょうど同時録音再生の実施の形態にある録音中の非圧縮情報の再生と逆の処理になるわけである。
【０１１９】
次に、実施の形態の転送について説明する。
【０１２０】
図３にあるように低サンプリングの非圧縮情報はメイン情報の隙間にかいてあるので、例えば、この部分だけを切り出して読み出しすることが可能である。
【０１２１】
次に、実施の形態のカードについて説明する。
【０１２２】
図１の半導体メモリ群（Ｍ１からＭｎ）は制御回路（Ｍ０）から着脱式で切り離すことも可能である。
【０１２３】
次に、実施の形態の混在書きの第１例について説明する。
【０１２４】
図３には、ページ内の領域を２分割して記録する概念を説明してあるが、もちろんこの配置を圧縮情報と非圧縮情報を混在して書くことももちろん可能である。
【０１２５】
例えば、圧縮情報は偶数アドレスに非圧縮情報は奇数アドレスに交互に書くように決めてもよい。
【０１２６】
次に、実施の形態の混在書きの第２例について説明する。
【０１２７】
混在書きの実施の形態の第１例では同じページ内に混在する例を示したが、メモリ空間のまったく別の領域を利用して非圧縮情報を書くことも、もちろん可能である。
【０１２８】
ただしこの場合はページ内の未使用領域は別の用途（テキスト情報を入れる、曲分割等のスペアバッファ等）に使用すればよい。
【０１２９】
【発明の効果】
本発明によれば、これまで未使用であった領域を利用し圧縮情報と非圧縮情報の両方を記録できるので、それぞれの情報を有効に利用することで、より価値の高い装置を提供できる。
【０１３０】
第１の効果は、非圧縮情報の再生機能に関し、再生を簡単に提供できる点にある。圧縮情報だけ記録された記録媒体から再生をしようとすると、伸長回路を稼動させる必要がある。
【０１３１】
ところが、本発明によれば非圧縮の情報も同時に記録してあるので、これを再生するだけで簡単に再生ができる。伸長回路は時間当たりの処理量が一般の制御回路に比べると格段に多く、動作クロックを早くしてリアルタイムに伸長処理を行っているため、消費電流も多い。
【０１３２】
これは電池を使用する携帯型装置にとっては、致命的な欠陥となりかねない。そこで、少しでも消費電流を減らすには、実際に高音質で再生を行う以外の場面では本発明による非圧縮の情報を再生するようにすればよい。応用例には次のようなものがある。
【０１３３】
第２の効果は、音楽再生ほど音質をあまり問われない口述録音の再生には非圧縮情報を再生する方法も考えられる。もちろんこの場合、高音質の再生を指定し切り替えることもできる。
【０１３４】
例えば会議を録音したものを再生している場面を想定しよう。最初は非圧縮情報として再生を開始する。使用者はこれを聞き取りながら、ある発言者がなにか重要な発言をした場合や、聞き取りにくかった場合には、高音質を指定するキーを押すとすぐに高音質の再生を始めることができる。
【０１３５】
聞き取り終わったらまた非圧縮情報の再生にすればよい。これにより消費電流は格段に節約できることになる。またミュート再生中は音質を落としても目立たないので非圧縮情報の再生を適用してもよい。
【０１３６】
第３の効果は、音楽情報をライブで３０分連続して５曲を録音したとする。ところが装置側は連続して録音しているので５曲はいっていても１曲とみなしてしまう。
【０１３７】
そこで適当な曲の切れ目に曲の切れ目であることを示すフラグをなんらかの手段で指定してやる必要がある。いわゆる「ディバイド」処理が必要となるのである。
【０１３８】
こうすることで曲のスキップ等が可能になる。ところがこのディバイド位置を決めるのは極めて微妙なタイミングでのキー操作が必要となる。使用者がディバイドしたつもりの位置と実際の位置とが微妙にずれるからである。
【０１３９】
へたをすれば、有音部分でディバイドしてしまう可能性もある。そこでリハーサル機能を搭載した装置が提供されている。リハーサル機能は仮にディバイド位置を決めて、本当にこの位置でディバイドしてよいかどうかを使用者に確認させる機能である。
【０１４０】
このリハーサル中に本発明による非圧縮情報の再生にすれば、消費電流は格段に節約できる。
【０１４１】
第４の効果は、上記第３の効果に関連し、ドット表示器を持った装置においては、ディバイドする前後の波形を時間経過軸にグラフィカルに表示し、表示波形を確認してディバイドを実行するというアプリケーションが考えられる。
【０１４２】
従来であれば、波形を表示するには一旦、圧縮情報を伸長する必要があるのだが、本発明においては非圧縮情報が記録されているのでこれを読んで表示すればよい。
【０１４３】
ドット解像度の関係から低サンプリングレートの情報であってもグラフィック表示には充分である。伸長しないので、消費電流も減少するし、伸長に要する時間も必要ないので、波形を高速に表示することができ、格段に利便性が向上する。もちろんディバイド時以外でも波形のグラフィック表示を行ってもよい。
【０１４４】
第５の効果は、非圧縮情報の可変速再生機能に関し、本発明によれば、可変速再生を簡単に提供できる。圧縮情報だけ記録された記録媒体から可変速再生をしようとすると、発明が解決しようとする課題で述べたように、伸長回路を稼動し、かつ時間軸方向の調整をする必要がある。
【０１４５】
しかも、早く聞く場合には伸長に要する時間が長すぎて間に合わないので、やむおえず間引き再生をするため聞き苦しい早聞きになってしまうという欠点がある。
【０１４６】
ところが、本発明によれば非圧縮の情報も同時に記録してあり、これを利用するので伸長回路を稼動させる必要はない。さらに伸長回路は時間当たりの処理量が一般の制御回路に比べると格段に多く、動作クロックを早くしてリアルタイムに伸長処理を行っているため、消費電流も多い。
【０１４７】
これは電池を使用する携帯型装置にとっては、致命的な欠陥となりかねない。そこで、少しでも消費電流を減らすには、実際に高音質で再生を行う以外の場面では本発明による非圧縮の情報を利用するようにすればよい。
【０１４８】
応用例には次のようなものがある。
【０１４９】
第６の効果は、音楽再生ほど音質をあまり問われない口述録音の再生には可変速再生で早聞きする方法も考えられる。これはあまり重要でない部分は早く聞き流すためである。
【０１５０】
もちろんこの場合、高音質の再生を指定し切り替えることもできる。例えば会議を録音したものを再生している場面を想定しよう。最初は非圧縮情報として再生を開始する。
【０１５１】
使用者はこれを聞き取流しながら、ある発言者がなにか重要な発言をした場合や、聞き取りにくかった場合には高音質を指定するキーを押すとすぐに高音質の再生を始めることができる。
【０１５２】
聞き取り終わったらまた早聞き再生にすればよい。これにより消費電流は格段に節約できることになる。
【０１５３】
また、そしてなにより可変速再生であっても音声の連続性がそこなわれず、利用者にとって聞き取りやすいというメリットがある。
【０１５４】
第７の効果は、上記第３の効果で述べたようにリハーサル時にこの機能を利用することもできる。また、微妙な位置の指定には逆に遅く再生する方法もある。さらには遅聞きの場合には微妙な位置あわせが可能になるという利点もある。
【０１５５】
またリハーサル時だけでなく、曲のある部分を頭出ししたい場合の早送り再生時にもこれを適用すればよい。前述したように伸長回路は稼動しないので、処理時間に無理は無くなるし、消費電流も少なくてすむという利点がある。
【０１５６】
また可変速再生であっても音声の連続性がそこなわれず、利用者にとって聞き取りやすいというメリットがある。
【０１５７】
第８の効果は、音程を変えない早聞き再生２の実施の形態で述べたように、音韻の繰り返しパターンを分析して早聞き再生をさせる場合には、この分析にある程度の時間幅をもって分析する必要があるが、本発明によれば、伸長を行うことなく数ページに渡って非圧縮情報を読むことができるので、非常に好適である。
【０１５８】
第９の効果は、音節ごとの逆方向再生機能に関し、第１の効果乃至第８の効果で述べた順方向の処理における利点はそのままに、さらに逆方向でも再生を行うことができる。
【０１５９】
この場合でも半導体メモリに非圧縮の情報がデジタル化されて記録されているので、様々な加工が可能となる。
【０１６０】
第１０の効果は、逆方向での再生はそのままでは意味のない音声になるが、ある単位で順方向に再生すれば、ある程度の識別が可能である。
【０１６１】
第１１の効果は、さらに無音部分とか音声の繰り返し部分の検出手段を備えれば、その部分を区切りに逆方向へスキップしながら順方向の再生をすることを繰り返すことにより、音節ごとの逆方向再生が可能となる。
【０１６２】
これは、あらかじめ低サンプリングレートではあるものの非圧縮のデータが記録されているからこそ、簡単にできる処理である。これによりよりわかりやすい逆方向再生が可能となる。
【０１６３】
音節ごとの区切りは録音時にあらかじめ非圧縮情報と共にフラグとして記録しておくことも可能である。
【０１６４】
第１２の効果は、上記第１０の効果及び上記第１１の効果のような逆方向再生中でなく、単純に時系列方向に逆に再生することも、もちろん可能である。この場合、音声は意味のあるものとしては聞き取れない。
【０１６５】
しかしある程度の識別は可能である。またそのとき第４の効果にあるような波形表示をすれば、効果的に識別が可能となる。
【０１６６】
前述したように、これらの処理が非圧縮情報を用いることで消費電流の観点からも処理速度の観点からも有利に働くことは言うまでもない。
【０１６７】
第１３の効果は、同時録音再生に関し、例えば、会議録音をしていたとする。ところが、その録音中に、急用で以前録音しておいた電話番号のメモ録音を聞く必要が生じたとする。
【０１６８】
会議は録音しなければならないし、メモ録音も再生しなければならないという状況が発生する。ところが音声圧縮伸長回路はひとつしかなくしかも録音モード中に伸長はできないのが普通である。
【０１６９】
大規模なシステムは別として、特に、限られた資源を限られた消費電流で動作させなければならない電池駆動の携帯型の装置においては、音声圧縮伸長回路を圧縮用と伸長用にふたつ搭載することはあまり現実的ではない。
【０１７０】
第１４の効果は、録音中であっても、非圧縮の情報を再生できるので、録音中に再生ができ利便性が向上するばかりでなく、携帯型の機器であっても消費電流を押さえて、ハードウエアの追加もなしに、この便利な機能が実現できるという効果がある。
【０１７１】
第１５の効果は、コメント機能に関し、上記実施の形態でも述べたように、録音時に非圧縮の情報を同時に記録しない設定もできるので、圧縮情報だけを先に録音することができる。
【０１７２】
または非圧縮情報だけを先に録音しておいてあとから圧縮情報を記録することもできる。例えば曲をメイン領域に録音して、あとから、その歌手名などのコメントを非圧縮情報として録音するようにすることができる。
【０１７３】
また先に歌手名を非圧縮情報として録音し、あとから曲をメイン領域に録音してもよい。
【０１７４】
こうすることで、頭出しの操作中には、非圧縮情報を再生するのでコメントが再生され、圧縮情報に切り替えることでその曲を聞くことができるので、利便性が向上する。
【０１７５】
第１６の効果は、図２における半導体メモリの同一ページ内のメイン領域に圧縮情報を書いた残りの未使用領域に非圧縮情報を書くことができる。
【０１７６】
ここで、一定区間の音声情報を圧縮するためには時間がかかるため、情報の出力タイミングの流れに従って書き込みを行うと、同じページ内に書かれた圧縮情報と非圧縮情報とは時間的に非圧縮情報のほうが過去側の情報になっている。
【０１７７】
もちろんバッファ処理で時間的に同じ情報を同じページ内に書くことは容易であるが、ここではバッファ処理を行わず、わざと時間をずらして録音するものとする。
【０１７８】
こうすることで、不要なバッファ処理が必要なくなることもさることながら、再生中に前もって次ぎに伸長すべき情報の概要が分かるという利点がある。
【０１７９】
例えば、次ぎに再生すべき情報が殆ど無音である場合には伸長回路を休止させれば、消費電力が削減できる。
【０１８０】
また何らかの原因で次のページの読み込みに失敗した場合には非圧縮の情報を代わりに出力させるようにすれば、瞬間的に音質は変わるものの連続性は確保することができるので音韻が欠落することはなくなり信頼性が向上する。
【０１８１】
第１７の効果としては、これまでは、音声の録音、再生、の利便性や消費電流に主眼をおいて発明の効果を説明したが、次にメモリの有効利用という視点から見ると、メモリの有効利用に関する課題で説明したように、もともと利用されないであまった領域を有効に利用できるだけでなく、さらにそれ以上の利便性や低消費電力化が実現でき、装置の価値を格段に向上させることができる。
【０１８２】
第１８の効果は、混在書きの実施の形態にあるようにひとつのページ内に圧縮情報と非圧縮情報を混在して書くことももちろん可能である。
【０１８３】
例えば、圧縮情報は偶数アドレスに非圧縮情報は奇数アドレスに交互に書くように決めれば、最下位のアドレス制御信号を別の制御信号で切り替えることでホスト側からみると同じアドレス管理で非圧縮情報と圧縮情報とを瞬時に切り替えて読み込むことが可能となり処理を簡略化することができる。
【０１８４】
また１６ビットのデータバスをもった装置においては、圧縮情報と非圧縮情報が同時に読むことができるのでデータ処理を効率的に行うことができる。
【０１８５】
さらに、この方法であれば、いわゆる「ディバイド」処理において、ページを分割するような位置でディバイドする場合であっても圧縮情報も非圧縮情報も同じように分割されるので、処理を簡略化することができる。
【０１８６】
第１９の効果は、転送機能に関し、非圧縮の情報は別領域で管理するのでその部分だけを取り出して外部へ転送することが可能である。
【０１８７】
これにより、膨大なメイン情報に対応した非圧縮情報を取り出すことにより、音声の見出しを瞬時に作ることができる。
【０１８８】
例えばパーソナルコンピューターに転送しその情報をもとに編集等のレイアウトをこの情報で作成し、必要なメイン情報（圧縮情報）だけを取り出せばよいので、作業効率が格段に向上するという利点がある。
【０１８９】
また脱着可能なカードを使えばパーソナルコンピューターへ情報を直接受け渡すことも可能である。
【図面の簡単な説明】
【図１】本発明のブロック図である。
【図２】本発明のページの概念図である。
【図３】本発明の圧縮情報と非圧縮情報の概念図である。
【符号の説明】
Ｋ１キー入力手段
Ｍ０制御回路
Ｄ１音声圧縮伸長回路
Ｄ２サンプリング変換回路
Ｍ１〜Ｍｎメモリチップ
Ａ１アナログ入力回路
Ａ３Ａ／Ｄ変換回路[0001]
BACKGROUND OF THE INVENTION
The present invention relates to an information recording / reproducing apparatus.
[0002]
[Prior art]
In recent years, various apparatuses for recording and reproducing analog information such as music and movies into digital information have been developed. In addition, many devices have been developed to record information by further compressing it for effective use of the recording medium, and to decompress and reproduce the information.
[0003]
Many information compression methods have also been standardized or proposed, and in many devices, a digital signal processor (hereinafter referred to as DSP) incorporated in the apparatus performs arithmetic processing.
[0004]
In general, compression and decompression algorithms use human auditory characteristics, and perform unnecessary pattern analysis and frequency analysis based on digitized analog information within a unit time. The information is compressed in
[0005]
Recently, tape-type devices, disk media-type portable audio devices, and video devices applying these compression techniques have been put into practical use.
[0006]
In addition, recording / reproducing apparatuses that apply these compression techniques to semiconductor memories are being developed, and a large-capacity semiconductor memory is attracting attention as an information recording medium in accordance with miniaturization of devices and power saving.
[0007]
In particular, a flash memory, which is a batch erasure type memory, has attracted attention as an information recording medium because it does not require a backup power source and can be increased in capacity.
[0008]
On the other hand, from the functional aspect, the audio is played quickly in order to cue the playback start position and the recording start position (hereinafter referred to as fast playback), or it is intentionally played back slowly for accurate positioning (hereinafter referred to as Various functions that are convenient for the user are also provided.
[0009]
[Problems to be solved by the invention]
As described above, many devices have a fast listening playback function and a slow listening playback function. However, in a device that compresses and records audio, the following problems occur during variable speed playback.
[0010]
First, I will explain the problem when listening quickly.
[0011]
For example, as shown in FIG. 3, the audio compression is performed by compressing and recording information included in a predetermined time interval (10 msec in FIG. 3) using a DSP.
[0012]
In the case of normal playback, on the contrary, it is only necessary to read out the data in order and reproduce the waveform with the same time lapse as when recording, but in the case of fast listening, this is a process for shortening the time lapse by some means. Is required.
[0013]
In principle, if the user wants to listen twice as fast as possible, it is necessary to complete the DSP expansion process earlier than half the real time during recording.
[0014]
Actually, since there is also a shortening correction in the time axis direction and an accompanying process on the system, it is necessary to perform the expansion process in about 1/4 of the real time at the time of recording.
[0015]
However, the processing speed of the DSP has a capacity limit of the processor itself. Even if it is within the capacity limit of the processor itself, the faster the processing speed, the larger the current consumption, which is a fatal defect especially in portable devices using batteries.
[0016]
For this reason, many current portable devices that reproduce compressed information perform fast listening processing by thinning out a predetermined time interval.
[0017]
Therefore, since the audio included in the delimited period of time is discarded and is not played back, the continuity of the audio is lost during fast listening, and the faster the fast listening, the harder it is to hear There's a problem.
[0018]
Next, the problem during slow listening will be described.
[0019]
On the contrary, at the time of slow listening, the DSP processing speed works more favorably than at the time of fast listening, so there is no problem in the capability of the processor itself.
[0020]
However, the process of extending the passage of time is necessary, and for this reason, the voice at the same fixed time interval is repeated several times.
[0021]
If the slow listening reproduction is performed using only the waveform obtained by decompressing the compressed information as described above, there is a problem that the voice is not continuous and the reproduction is difficult to hear.
[0022]
Next, problems related to effective use of the memory will be described.
[0023]
On the other hand, there are also issues regarding the memory utilization efficiency and utilization method. Many disk-based recording media have fixed recording units represented by sectors due to physical limitations of tracks.
[0024]
However, the compressed information as shown in the example of the fixed recording unit 512 bytes in FIG. 3 does not necessarily match the number of recordable bytes in the fixed recording unit.
[0025]
Therefore, the remaining part is often left unused. The reason is as follows.
[0026]
When compressing and recording information with a time axis such as audio information and video information, it cannot be realized without real-time capability, so writing compressed information units across tracks and sectors is a search for the recording location. The method is complicated and the processing is limited, so it is not realistic.
[0027]
The other is a buffer flush (physical writing from the buffer to the recording medium when the number of bytes of a fixed recording unit is accumulated), which requires more time than usual. Due to restrictions, it is necessary to increase the buffer size to absorb the time.
[0028]
Furthermore, even when processing such as partial rewriting is performed, if the compressed information unit is written across tracks and sectors, the processing becomes very complicated and the processing efficiency is inefficient.
[0029]
In order to perform such complicated processing, a highly functional control circuit and a buffer such as SRAM are required. However, this leads to an increase in cost. Depending on the distribution of main information, there is an aspect that it is advantageous in terms of cost, reliability, and function to use a cheap large-capacity memory even though there are a few unused parts. is there.
[0030]
The same can be said for a recording medium using a semiconductor memory emulating a disk recording medium.
[0031]
In fact, some flash memories have the same physical write specifications of the chip as the recording unit of the disk system.
[0032]
In a sense, considering compatibility with information recorded on a disk system, there is also an aspect that it is better to keep the same when recording on a semiconductor.
[0033]
As described above, the present situation is that the surplus portion is not so effectively used regardless of the recording medium.
[0034]
In view of these problems, the present invention is intended to simultaneously solve the problem of unpleasant speech and the effective use of memory, and further to effectively use this to improve the function of the apparatus.
[0035]
[Means for Solving the Problems]
The present invention incorporates the following functions for the purpose of solving the above problems.
[0036]
The present invention Then, in order to compress and record analog data, it is necessary to convert analog information into digital information. This can be realized by an A / D converter. Further, the information is compressed by the DSP. In addition, a circuit is provided that processes the output of the A / D converter as information that is not compressed by another circuit simultaneously with the input to the DSP. There are also provided a circuit for outputting the compressed information as sound, a circuit for outputting the thinned information as sound without performing compression, and an operation key for selecting which circuit to operate.
[0037]
further, A circuit is provided for thinning out the output information of the A / D converter acquired at the sampling rate of the analog information to be compressed with reference to a sampling rate lower than that of another system.
[0039]
Also, To record the compressed information and the information thinned out without compression in the same recording unit of the recording medium, write it in the buffer of the same size as the recording unit, and write it when the compressed information is output. A circuit is provided for writing.
[0040]
Also, A circuit for reading only the thinned information without compression is provided, and a D / A circuit for reproducing at the sampling rate at the time of recording is provided.
[0041]
Also, A circuit for changing the D / A reference clock when performing reproduction based on the information thinned out without compression is provided to perform variable speed reproduction processing such as review and cue.
[0042]
Also When performing accelerated playback of reviews, cues, etc., variable speed playback processing is performed without changing the pitch.
[0043]
Also By applying a variable speed playback processing algorithm that does not change the pitch based on the information sampled during recording processing at a low sampling rate, the repetition rate is detected and the number of repetitions is recorded to further reduce the bit rate. To do.
[0044]
Also A circuit is provided that simultaneously processes reproduction at a low sampling rate and compression and recording operations at a high sampling rate in different systems.
[0045]
Also A means for detecting a break in a phrase is provided by applying a circuit for detecting a repeated pattern of phonemes.
[0046]
Also An interface circuit is provided that separately manages information that has been thinned out without compression and information that has been compressed and transfers the information to another device.
[0047]
Also Then, writing and reading are performed on the recording medium by an algorithm in accordance with the writing and reading control of the fixed-length recording unit writing type semiconductor memory represented by the flash memory. Alternatively, a card slot and a card interface circuit that are removable using a memory card are provided.
[0048]
The operation of the present invention is as follows.
[0049]
The present invention The effect of is that processing using compressed information and non-compressed information becomes possible, and those can be selectively played back as audio.
[0050]
Also In addition to being able to perform processing using compressed information and non-compressed information, the amount of information of non-compressed information can be reduced.
[0052]
Also The compressed information and the uncompressed information can be processed simultaneously, and these can be recorded in the same area, so that the area can be used effectively and at high speed.
[0053]
Also Since reproduction is performed based on uncompressed information, reproduction is possible without operating a DSP decompression circuit.
[0054]
Also In other words, uncompressed information can be used for variable-speed playback processing.
[0055]
Also In addition, non-compressed information can be used for variable-speed playback processing, and a low bit rate can be achieved.
[0056]
Also Even if the DSP is operating in the recording mode, playback can be performed during recording.
[0057]
Also It is possible to perform meaningful (audible) playback in the reverse direction.
[0058]
Also It is possible to transfer or receive only uncompressed information or only compressed information to another device.
[0059]
Also Therefore, it is possible to provide a high-functional portable semiconductor recording / reproducing apparatus that uses a flash memory writing method and effectively uses a free space.
[0060]
DETAILED DESCRIPTION OF THE INVENTION
Hereinafter, embodiments of the present invention will be described with reference to the drawings.
[0061]
First, the case of recording in the embodiment will be described. The basic configuration of the present invention will be described with reference to the block diagram of FIG. Here, a music recording / reproducing apparatus will be described as an example with sampling rate = 48 KHz and resolution = 16 bits.
[0062]
When recording is instructed to the control circuit (M0) by the key input means (K1), the control circuit (M0) sets the voice compression / expansion circuit (D1), which is a DSP, to the compression mode, and also the A / D conversion circuit ( Activate A3).
[0063]
As a result, audio information is input from the analog input circuit (A1), and 16-bit digital information is output every 48 KHz by the A / D conversion circuit (A3).
[0064]
Conventionally, the 16-bit digital information here is only the path through which the compressed data is sent to the audio compression / decompression circuit (D1) and read by the system control circuit (M0). Are simultaneously sent to the sampling conversion circuit (D2).
[0065]
For example, the sampling conversion circuit (D2) performs a process of thinning the sampling conversion circuit from 48 KHz to 8 KHz.
[0066]
Further, 16-bit information can be further converted into 8-bit data by the μ-low rule using human auditory characteristics. This may be performed by hardware or by software in the control circuit (M0).
[0067]
As a result, compressed information and uncompressed low sampling information are simultaneously input to the control circuit (M0) during recording.
[0068]
Next, the control circuit (M0) sequentially writes these data to the memory chip group (M1 to Mn).
[0069]
In this example, it is assumed that the compressed information is 424 bytes (L channel = 212 bytes, R channel = 212 bytes), and the uncompressed information is 80 bytes.
[0070]
When writing, the compressed information is recorded in the main area of FIG.
[0071]
Using the 84 bytes of the unused area in FIG. 2, 80 bytes of uncompressed information are recorded.
[0072]
The page in FIG. 2 is an example in which a page write type flash memory is used as the semiconductor memory.
[0073]
This page corresponds to a fixed recording unit represented by a sector in many disk recording media.
[0074]
Of course, the main compressed information and uncompressed information may be recorded according to other rules and laws.
[0075]
Next, normal reproduction in the embodiment will be described.
[0076]
When reproduction is instructed to the control circuit (M0) by the key input means (K1), the control circuit (M0) sets the audio compression / expansion circuit (D1), which is a DSP, to the expansion mode, and the D / A conversion circuit ( Activate A4).
[0077]
Next, the compressed information written in the main area (shown in FIG. 2) of the memory chip group (M1 to Mn) is read by the control circuit (M0).
[0078]
The audio compression / decompression circuit (D1) periodically requests compression information from the control circuit (M0), and in the example of FIG. 3, when the 212 bytes (L or R channel) are received, the decompression process is performed.
[0079]
The expanded data is sent to the D / A conversion circuit (A4) according to the sampling rate at the time of recording (48 KHz in the example of FIG. 3) and reproduced from the analog output circuit (A2).
[0080]
Next, reproduction of uncompressed information in the embodiment will be described.
[0081]
The fast listening function is a function that allows the user to move the position while confirming the sound to the place where the user wants to cue. This function is used when you want to hear a message earlier than the actual recording time.
[0082]
However, as described in the section of the problem to be solved by the invention, according to the conventional method, since the fast listening process is performed by thinning out a predetermined time interval, the continuity is lost in the voice.
[0083]
Therefore, in this embodiment, an example of fast listening using uncompressed information will be described in order to solve this problem.
[0084]
The uncompressed information is recorded simultaneously with the compressed information as described in the normal recording embodiment.
[0085]
First, when fast playback is instructed by the key input means (K1) to the control circuit (M0), the control circuit (M0) pauses the voice compression / decompression circuit (D1), which is a DSP, by a switching signal.
[0086]
This is because decompression is not required for fast playback because uncompressed information with a low sampling rate is used.
[0087]
However, the sampling conversion circuit (D3) and the D / A conversion circuit (A4) are activated.
[0088]
Next, uncompressed information written in the main area (shown in FIG. 2) of the memory chip group (M1 to Mn) is read by the control circuit (M0).
[0089]
The sampling conversion circuit (D2) periodically requests information from the control circuit (M0), is sent to the D / A conversion circuit (A4), and is reproduced from the analog output circuit (A2).
[0090]
Next, fast playback of uncompressed information in the embodiment will be described.
[0091]
This is basically the same processing flow as that of the embodiment for reproducing the above-described uncompressed information.
[0092]
The difference is that the sampling conversion circuit (D3) shortens the waveform in the time axis direction, so that the same effect as the so-called “cue reproduction” in which the tape recording device reproduces while fast-forwarding the tape is obtained although the pitch is increased.
[0093]
Next, a first example of quick playback of uncompressed information without changing the pitch in the embodiment will be described.
[0094]
Basically, the processing flow is the same as that of the embodiment for reproducing the uncompressed information described above.
[0095]
The difference is that the waveform in the time axis direction is deleted in the sampling conversion circuit (D3), so that so-called “variable speed reproduction” in which fast-forward reproduction is performed without changing the pitch is obtained.
[0096]
More specifically, the reproduced speech waveform is stored in the memory, and sampling and rejection are repeated alternately for each basic unit, and the sampled speech waveform is temporally extended by the length of the rejection period. This lowers the pitch and makes it easier for the listener to hear.
[0097]
An example of this type of technology is disclosed in Japanese Patent No. 2624826.
[0098]
Next, a second example of quick playback of uncompressed information without changing the pitch in the embodiment will be described.
[0099]
Basically, it is the same processing flow as in the first embodiment for reproducing the uncompressed information described above.
[0100]
The difference lies in applying a method of shortening the time by analyzing a repeated pattern of waveform information to be reproduced in the time axis direction and deleting a portion where the same waveform is repeated.
[0101]
If this is applied, it is possible to perform fast playback without changing the pitch with a more natural generated sound. In the case of the present embodiment, uncompressed waveform information has already been recorded, so that a waveform as a source of analysis can be obtained without decompressing the compressed information.
[0102]
This analysis may be performed by the audio compression / decompression circuit (D1) DSP depending on the accuracy, or the function may be incorporated in the sampling conversion circuit. It is also possible to do this by software processing of the control circuit (M0).
[0103]
In order to approximate more natural pronunciation, it is necessary to analyze phoneme repetition with a time width of a plurality of frames. In this case, uncompressed information of a plurality of frames is read at a time.
[0104]
Next, slow listening reproduction in the embodiment will be described.
[0105]
The same can be said as in the first and second embodiments in which the above-mentioned uncompressed information is reproduced.
[0106]
The difference is that instead of analyzing the repeating pattern of the waveform information to be reproduced in the time axis direction and deleting the part where the same waveform is repeated, a method is applied that repeats several times more than real time. It is.
[0107]
As a result, it is possible to perform slow listening reproduction without changing the pitch with a more natural generated sound.
[0108]
Next, a description will be given of a reduction in the bit rate of recorded uncompressed information in the embodiment.
[0109]
As described in the second example of the quick playback of the uncompressed information without changing the pitch, the recorded non-compressed information is used for the quick listening without changing the pitch. If it is determined, there is a method of shortening the time by analyzing a repetitive pattern in the time axis direction of waveform information to be reproduced in advance during recording and deleting a portion where the same waveform is repeated.
[0110]
This is basically the same processing flow as the recording embodiment described above, but the difference is that the waveform information input simultaneously with the recording is analyzed in the time axis direction and the same waveform is repeated. That is, the process of recording in the memory after deleting the existing part is included.
[0111]
As a result, if only the remaining waveform is recorded, the processing at the time of recording increases, but at the time of reproduction, it is possible to perform quick playback without changing the pitch with simple processing.
[0112]
Next, simultaneous recording and reproduction according to the embodiment will be described.
[0113]
As in the embodiment of playing back uncompressed information and the embodiment of fast listening, when playing back uncompressed information, the voice compression / decompression circuit (D1), which is a DSP, does not operate. Good.
[0114]
Accordingly, even during recording as in the recording embodiment, reproduction is possible by operating the sampling conversion circuit (D3) and D / A converter (A4).
[0115]
Next, comment recording of uncompressed information according to the embodiment will be described.
[0116]
In the above embodiments, the recording of the uncompressed information is performed simultaneously with the recording of the compressed information. In addition, when the compressed information is recorded, the user can be set not to record the uncompressed information. May be.
[0117]
In this case, if, for example, a comment recording button is pressed later while the compressed information is being reproduced, the content may be recorded as uncompressed information at the current position.
[0118]
This is just the reverse of the playback of uncompressed information during recording in the embodiment of simultaneous recording and playback.
[0119]
Next, transfer according to the embodiment will be described.
[0120]
As shown in FIG. 3, the low-sampled uncompressed information is in the gap between the main information. For example, only this portion can be cut out and read out.
[0121]
Next, the card of the embodiment will be described.
[0122]
The semiconductor memory group (M1 to Mn) in FIG. 1 can be detached from the control circuit (M0) in a detachable manner.
[0123]
Next, a first example of mixed writing according to the embodiment will be described.
[0124]
FIG. 3 illustrates the concept of recording an area in a page by dividing it into two. Of course, it is possible to write this arrangement in a mixture of compressed information and uncompressed information.
[0125]
For example, it may be determined that the compressed information is alternately written at even addresses and the uncompressed information is alternately written at odd addresses.
[0126]
Next, a second example of mixed writing according to the embodiment will be described.
[0127]
In the first example of the mixed writing embodiment, an example in which the same page is mixed is shown. However, it is of course possible to write uncompressed information using a completely different area of the memory space.
[0128]
However, in this case, the unused area in the page may be used for another purpose (for storing text information, spare buffer for music division, etc.).
[0129]
【The invention's effect】
According to the present invention, since both compressed information and non-compressed information can be recorded using an area that has not been used so far, a more valuable device can be provided by using each information effectively.
[0130]
The first effect relates to a function for playing back uncompressed information, and is that playback can be easily provided. If playback is to be performed from a recording medium on which only compressed information is recorded, the decompression circuit must be operated.
[0131]
However, according to the present invention, since uncompressed information is recorded at the same time, it can be easily reproduced simply by reproducing it. The decompression circuit has a much larger amount of processing per time than a general control circuit, and consumes a large amount of current because the decompression process is performed in real time by increasing the operating clock.
[0132]
This can be a fatal defect for portable devices using batteries. Therefore, in order to reduce the current consumption as much as possible, the uncompressed information according to the present invention may be reproduced in a scene other than the actual reproduction with high sound quality. Application examples include the following.
[0133]
As a second effect, a method of reproducing uncompressed information is also conceivable for reproduction of dictation recording that is not so much questionable as music reproduction. Of course, in this case, playback with high sound quality can be designated and switched.
[0134]
For example, suppose you are playing a recorded meeting. Initially, playback starts as uncompressed information. The user can start playback of high sound quality as soon as a certain speaker makes an important remark while listening to this, or when it is difficult to hear, by pressing a key specifying high sound quality.
[0135]
When listening is complete, uncompressed information can be replayed. As a result, the current consumption can be significantly saved. Further, during the mute reproduction, reproduction of uncompressed information may be applied because the sound quality is not noticeable even when the sound quality is lowered.
[0136]
The third effect is that five pieces of music information are recorded continuously for 30 minutes live. However, since the device side is recording continuously, even if it says 5 music, it will be regarded as 1 music.
[0137]
Therefore, it is necessary to designate a flag indicating that there is a song break at an appropriate song break by some means. A so-called “divide” process is required.
[0138]
This makes it possible to skip songs. However, determining the divide position requires key operations at extremely delicate timing. This is because the position where the user intended to divide is slightly different from the actual position.
[0139]
If you do it, there is a possibility that it will divide in the sound part. Therefore, a device equipped with a rehearsal function is provided. The rehearsal function is a function that determines the divide position and allows the user to confirm whether or not the divide is actually possible at this position.
[0140]
If the non-compressed information is reproduced according to the present invention during this rehearsal, the current consumption can be remarkably saved.
[0141]
The fourth effect is related to the third effect, and in a device having a dot display, the waveform before and after the divide is graphically displayed on the time lapse axis, the display waveform is confirmed, and the divide is executed. An application can be considered.
[0142]
Conventionally, in order to display the waveform, it is necessary to expand the compressed information once. However, in the present invention, since the uncompressed information is recorded, it may be read and displayed.
[0143]
Due to the dot resolution, even low sampling rate information is sufficient for graphic display. Since the current is not expanded, the current consumption is reduced and the time required for the expansion is not required, so that the waveform can be displayed at a high speed and the convenience is remarkably improved. Of course, the graphic display of the waveform may be performed even when the divide is not performed.
[0144]
The fifth effect relates to a variable speed reproduction function for uncompressed information. According to the present invention, variable speed reproduction can be easily provided. If variable speed playback is to be performed from a recording medium on which only compressed information is recorded, it is necessary to operate the decompression circuit and adjust the time axis direction as described in the problem to be solved by the invention.
[0145]
In addition, when listening quickly, the time required for decompression is too long, so there is a drawback that it is difficult to hear quickly because thinning reproduction is unavoidable.
[0146]
However, according to the present invention, uncompressed information is recorded at the same time, and since this is used, there is no need to operate the decompression circuit. Further, the decompression circuit has a much larger amount of processing per time than a general control circuit, and consumes a large amount of current because the decompression process is performed in real time by increasing the operating clock.
[0147]
This can be a fatal defect for portable devices using batteries. Therefore, in order to reduce the current consumption as much as possible, the uncompressed information according to the present invention may be used in a scene other than the actual reproduction with high sound quality.
[0148]
Application examples include the following.
[0149]
As a sixth effect, a method of listening quickly by variable-speed playback is also conceivable for the reproduction of oral recordings that are not as questionable as music playback. This is because parts that are not very important are heard quickly.
[0150]
Of course, in this case, playback with high sound quality can be designated and switched. For example, suppose you are playing a recorded meeting. Initially, playback starts as uncompressed information.
[0151]
The user can start playing the high-quality sound immediately by pressing a key for designating a high-quality sound when a certain speaker makes an important remark while listening to this, or when it is difficult to hear.
[0152]
When you have finished listening, you can start listening again. As a result, the current consumption can be significantly saved.
[0153]
In addition, there is a merit that the continuity of voice is not lost even in variable speed playback, and it is easy for the user to hear.
[0154]
As described in the third effect, the seventh effect can use this function during rehearsal. On the other hand, there is also a method of slow playback for specifying a subtle position. Furthermore, there is an advantage that fine alignment is possible in the case of slow listening.
[0155]
This can be applied not only during rehearsal but also during fast-forward playback when you want to cue up a part of a song. As described above, since the decompression circuit does not operate, there are advantages that the processing time is not excessive and the current consumption is small.
[0156]
Also, even with variable speed playback, there is a merit that the continuity of voice is not lost and it is easy for the user to hear.
[0157]
The eighth effect is that, as described in the embodiment of the fast listening playback 2 in which the pitch is not changed, when analyzing the repeated pattern of phonemes and playing the fast listening playback, this analysis is performed with a certain time width. However, according to the present invention, uncompressed information can be read over several pages without decompression, which is very suitable.
[0158]
The ninth effect relates to the backward reproduction function for each syllable, and the reproduction in the backward direction can be performed while maintaining the advantages in the forward processing described in the first to eighth effects.
[0159]
Even in this case, since non-compressed information is digitized and recorded in the semiconductor memory, various processes are possible.
[0160]
The tenth effect is that the reproduction in the reverse direction is meaningless as it is, but if it is reproduced in a certain unit in the forward direction, it can be distinguished to some extent.
[0161]
The eleventh effect is that, if a means for detecting a silent portion or a repeated portion of speech is further provided, the reproduction in the forward direction is repeated while skipping in the reverse direction with the portion as a break, thereby reversing the direction of each syllable. Playback is possible.
[0162]
This is a process that can be easily performed because uncompressed data is recorded in advance although the sampling rate is low. This makes it possible to perform reverse playback more easily.
[0163]
It is also possible to record the syllable break as a flag together with uncompressed information at the time of recording.
[0164]
Of course, the twelfth effect can be reproduced in the reverse direction in the time-series direction, not during the reverse reproduction as in the tenth effect and the eleventh effect. In this case, the voice cannot be heard as meaningful.
[0165]
However, a certain degree of identification is possible. At that time, if the waveform display as in the fourth effect is performed, the identification can be effectively performed.
[0166]
As described above, it goes without saying that these processes work advantageously from the viewpoint of current consumption and the processing speed by using uncompressed information.
[0167]
The thirteenth effect relates to simultaneous recording / playback. For example, it is assumed that conference recording is being performed. However, suppose that during the recording, it becomes necessary to listen to a memo recording of the telephone number that was previously recorded in an emergency.
[0168]
The situation arises that the meeting must be recorded and the memo recording must be played back. However, there is usually only one voice compression / decompression circuit, and decompression cannot be performed during the recording mode.
[0169]
Apart from large-scale systems, especially in battery-powered portable devices that have to operate limited resources with limited current consumption, two audio compression / decompression circuits are installed for compression and decompression. That is not very realistic.
[0170]
The fourteenth effect is that non-compressed information can be reproduced even during recording, so that not only can it be reproduced during recording, improving convenience but also reducing current consumption even in portable devices. There is an effect that this convenient function can be realized without adding hardware.
[0171]
The fifteenth effect relates to the comment function, as described in the above embodiment, since it is possible to set not to record uncompressed information at the time of recording, only compressed information can be recorded first.
[0172]
Alternatively, only the uncompressed information can be recorded first, and the compressed information can be recorded later. For example, a song can be recorded in the main area, and a comment such as the singer's name can be recorded as uncompressed information later.
[0173]
Alternatively, the singer's name may be recorded as uncompressed information first, and the song may be recorded later in the main area.
[0174]
By doing so, during the cueing operation, the uncompressed information is reproduced, so that the comment is reproduced, and the music can be heard by switching to the compressed information, so that convenience is improved.
[0175]
The sixteenth effect is that non-compressed information can be written in the remaining unused area in which compressed information is written in the main area in the same page of the semiconductor memory in FIG.
[0176]
Here, since it takes time to compress audio information in a certain section, if writing is performed according to the flow of information output timing, compressed information and non-compressed information written in the same page are not temporally non-compressed. The compressed information is the information on the past side.
[0177]
Of course, it is easy to write the same information in time on the same page by buffer processing. However, here, buffer processing is not performed, and recording is performed by deliberately shifting the time.
[0178]
In this way, there is an advantage that an outline of information to be decompressed in advance can be understood in advance during reproduction, as well as unnecessary buffer processing is not necessary.
[0179]
For example, if the information to be reproduced next is almost silent, the power consumption can be reduced by pausing the decompression circuit.
[0180]
Also, if the next page fails to load for some reason, if uncompressed information is output instead, the sound quality will change instantaneously, but continuity can be ensured, so the phoneme will be lost. The reliability is improved.
[0181]
As the seventeenth effect, the effects of the invention have been described so far, focusing on the convenience of audio recording and playback and the current consumption. From the viewpoint of effective use of the memory, As explained in the issues related to effective use, not only can the area that was not used originally be used effectively, but it is possible to achieve greater convenience and lower power consumption, and to significantly improve the value of the device. it can.
[0182]
As an eighteenth effect, it is of course possible to mix compressed information and non-compressed information in one page as in the mixed writing embodiment.
[0183]
For example, if it is decided that the compressed information is alternately written to the even address and the uncompressed information is written to the odd address, the non-compressed information is managed by the same address management from the host side by switching the lowest address control signal with another control signal. And compressed information can be instantaneously switched and read, and the processing can be simplified.
[0184]
Further, in a device having a 16-bit data bus, compressed information and non-compressed information can be read simultaneously, so that data processing can be performed efficiently.
[0185]
Further, with this method, in the so-called “divide” process, even when the page is divided at a position where the page is divided, the compressed information and the non-compressed information are divided in the same manner, so that the process is simplified. be able to.
[0186]
The nineteenth effect relates to the transfer function. Since uncompressed information is managed in a separate area, only that portion can be taken out and transferred to the outside.
[0187]
Thus, by extracting uncompressed information corresponding to a large amount of main information, a voice headline can be created instantaneously.
[0188]
For example, it is only necessary to transfer to a personal computer, create a layout for editing based on the information, and extract only necessary main information (compressed information). Thus, there is an advantage that the working efficiency is remarkably improved.
[0189]
Information can also be passed directly to a personal computer using a removable card.
[Brief description of the drawings]
FIG. 1 is a block diagram of the present invention.
FIG. 2 is a conceptual diagram of a page of the present invention.
FIG. 3 is a conceptual diagram of compressed information and uncompressed information according to the present invention.
[Explanation of symbols]
K1 key input means
M0 control circuit
D1 Audio compression / decompression circuit
D2 Sampling conversion circuit
M1-Mn memory chip
A1 Analog input circuit
A3 A / D conversion circuit

Claims

In a recording / reproducing apparatus that records and reproduces audio data using a recording medium that reads and writes information in a fixed-length recording unit,
Means for simultaneously recording the information obtained by digitizing and compressing audio analog data having a time axis in the recording unit and the information thinned out without compression into the same recording unit of the recording medium;
First reproducing means for reproducing the compressed information as sound;
Second reproduction means for reproducing the thinned information as audio without performing the compression;
A means for thinning out the AD information acquired at the sampling rate of the analog information to be compressed to a sampling rate lower than this when performing the thinning, and one of the reproducing means is configured to operate. A characteristic recording / reproducing apparatus.

2. The recording / reproducing apparatus according to claim 1, wherein the recording medium is a page write type semiconductor memory, the compressed information is recorded in a main area of the page, and the compression is performed in an unused area of the page. A recording / reproducing apparatus in which thinned information is recorded without being performed.

3. The recording / reproducing apparatus according to claim 1 , further comprising variable speed reproduction processing means such as a review and a cue for reproducing the thinned information without performing compression.

A recording and reproducing apparatus as claimed in any one of claims 1 to 3, review, when performing variable speed reproduction of the queue, etc., a recording and reproducing apparatus characterized in that it comprises a variable-speed reproduction processing means without changing the pitch .

A recording and reproducing apparatus as claimed in any one of claims 1 to 4, the recording and reproducing apparatus characterized by comprising means for further applying a low bit rate to the information of the low sampling rate in the recording process.

A recording and reproducing apparatus as claimed in any one of claims 1 to 5, a recording and reproducing apparatus characterized in that it comprises a means for processing and reproducing at a low sampling rate, compression at a high sampling rate, the recording operation at the same time .

A recording and reproducing apparatus as claimed in any one of claims 1 to 6, when information is reproduced low sampling rate in the opposite direction, means for reproducing in a direction back in time while playing a forward for each phrase A recording / reproducing apparatus comprising:

A recording and reproducing apparatus as claimed in any one of claims 1 to 7, manages and thinned without compression information was carried out compressed information separately comprising means for transferring to other apparatus A recording / reproducing apparatus.

Claim 1, a device according to any one of claims 3 to 8, reading and writing using the fixed-length recording unit writing type semiconductor memory or a memory card, typified by a flash memory in a recording medium And a recording / reproducing apparatus comprising means for writing compressed information and non-compressed information using an in-chip page buffer of a page writing type flash memory.