JP2000252832A

JP2000252832A - Data compressing device and recording medium with data compressing program is recorded therein

Info

Publication number: JP2000252832A
Application number: JP11048627A
Authority: JP
Inventors: Katsuya Miyoshi; 勝也三好; Takeshi Kishinami; 健史岸浪
Original assignee: Nikon Corp
Current assignee: Nikon Corp
Priority date: 1999-02-25
Filing date: 1999-02-25
Publication date: 2000-09-14

Abstract

PROBLEM TO BE SOLVED: To compress physical quantities as survey and statistic results with high compressibility by compressing data except images measured or calculated by sample positions obtained from outside, while removing the redundancy of data with the spatial correlate of a data array corresponding to the sample positions. SOLUTION: An MPU 12 opens a data file to be compressed, as a data compressing program is started, decides the kind of data from the extension or header information, and refers to a corresponding table on the basis of the data kind to automatically determine a compression system and compression parameters. When the MPU 12 can discriminate the delimiters of data and data lines according to the data kind and contents, sample space reproduction is possible as is, so that a multidimensional array corresponding to the sample positions is prepared in a memory 17. Then when the MPU 12 discriminates that the data are in text format and in floating-point format, the difference between the predicted value of following data and the actual data is calculated, on the basis of previously compressed data on the condition such that a compression system which was automatically determined is a reversible compression system, and then entropy encoding is carried out to compress the data amount.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、測量結果や統計結
果などの物理量を効率よく圧縮するデータ圧縮装置に関
する。また、本発明は、このような処理を実行するデー
タ圧縮プログラムを記録した機械読み取り可能な記録媒
体に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a data compression device for efficiently compressing physical quantities such as survey results and statistical results. Further, the present invention relates to a machine-readable recording medium on which a data compression program for executing such processing is recorded.

【０００２】[0002]

【従来の技術】従来、地勢データ（標高、地質、地理情
報など）、統計データおよび気象データ（気圧、温度、
湿度など）などの離散ディジタルデータは、非圧縮状態
で記録媒体に格納される。例えば、日本の国土地理院
（GeographicalSurveyInstitute）では、５０ｍもしく
は２５０ｍのメッシュ間隔ごとに測量された標高データ
を、非圧縮のテキスト形式で記録媒体に記録している。2. Description of the Related Art Conventionally, terrain data (altitude, geology, geographic information, etc.), statistical data and weather data (barometric pressure, temperature,
Discrete digital data such as humidity is stored in a recording medium in an uncompressed state. For example, the Geographical Survey Institute in Japan records elevation data measured at intervals of 50 m or 250 m on a recording medium in an uncompressed text format.

【０００３】また、財団法人気象業務支援センターで
は、気象庁による観測平均値などを、標本位置（観測
点）の位置情報と併せて、非圧縮のテキスト形式で記録
媒体に記録している。従来、このようなデータを圧縮す
る際には、データファイルの符号冗長性を排除する圧縮
方式（いわゆる、ＺＩＰ、ＴＡＲ、ＬＨＡなどの圧縮方
式）が一般に使用されていた。In addition, the Japan Meteorological Service Support Center records the average value observed by the Japan Meteorological Agency together with the position information of the sample position (observation point) on a recording medium in an uncompressed text format. Conventionally, when compressing such data, a compression method (so-called compression method such as ZIP, TAR, LHA, etc.) for eliminating code redundancy of a data file has been generally used.

【０００４】[0004]

【発明が解決しようとする課題】ところで、上記のよう
なデータは、一般的に観測点の数が多く、データ量が膨
大になる。特に、近年では、地球規模のデータを処理す
る機会も多く、さらに多くのデータ量を処理しなければ
ならない。このようにデータ量が多いため、以下のような問題が生
じる。（１）データを記録するため、大容量の記録媒体が必要
になる。（２）データを伝送する際に、伝送時間が長くなる。（３）データを処理する際に、大容量のメモリが必要と
なるため、大規模なコンピュータシステムが必要とな
る。（４）データファイルが大きいため、メモリ上や記録媒
体上に同時に収容することなどが困難となり、多種類の
データを効率よく管理することができない。Incidentally, the above-mentioned data generally has a large number of observation points and a large amount of data. In particular, in recent years, there are many opportunities to process global-scale data, and more data volumes must be processed. Since the data amount is large as described above, the following problem occurs. (1) To record data, a large-capacity recording medium is required. (2) When transmitting data, the transmission time becomes longer. (3) When processing data, a large-capacity memory is required, so a large-scale computer system is required. (4) Since the data file is large, it is difficult to simultaneously store it on a memory or a recording medium, and it is not possible to efficiently manage various types of data.

【０００５】そこで、請求項１〜８に記載の発明では、
上記の問題点を解決するために、測量結果や統計結果な
どの物理量を、高い圧縮率で圧縮可能なデータ圧縮装置
を提供することを目的とする。また、請求項９に記載の
発明では、このデータ圧縮装置をコンピュータ上で実現
するためのプログラムを記録した記録媒体を提供するこ
とを目的とする。Therefore, in the inventions according to claims 1 to 8,
In order to solve the above problems, it is an object to provide a data compression device capable of compressing a physical quantity such as a survey result or a statistical result at a high compression rate. Another object of the present invention is to provide a recording medium on which a program for realizing the data compression apparatus on a computer is recorded.

【０００６】[0006]

【課題を解決するための手段】（請求項１）請求項１に
記載の発明は、複数の標本位置ごとに測定または算出さ
れたデータ（ただし画像データは除く）を外部から取得
するデータ取得部と、「データを標本位置に応じて配置
したデータ配列」における空間的な相関性に基づいてデ
ータの冗長を排除し、データの圧縮を行うデータ圧縮部
とを備えて、データ圧縮装置を構成する。According to a first aspect of the present invention, there is provided a data acquisition unit for acquiring data (excluding image data) measured or calculated for each of a plurality of sample positions from outside. And a data compression unit that eliminates data redundancy based on the spatial correlation in the “data array in which data is arranged according to the sample position” and compresses data. .

【０００７】一般に、従来の圧縮方式では、データファ
イル内のシリアル化されたデータ列を適当な長さに区切
った上で、圧縮符号化（ハフマン符号化およびランレン
グス符号化など）を施す。そのため、シリアル化された
データ列上において、ある程度離れたデータについては
相互間の冗長性は判断できず、冗長性の排除は充分にな
されない。In general, in a conventional compression method, a serialized data string in a data file is divided into appropriate lengths, and then compression encoding (Huffman encoding, run-length encoding, etc.) is performed. For this reason, in the serialized data string, the redundancy between the data cannot be determined for the data separated to some extent, and the redundancy is not sufficiently eliminated.

【０００８】一方、本発明では、測定もしくは計算され
たデータを、標本位置に対応付けて配置することによ
り、データ配列の空間を概念上構成する。例えば、標本
位置が二次元であれば、データ配列の空間は二次元の空
間となる。一般に、標本位置がｎ次元であれば、データ
配列の空間はｎ次元の空間となる（ｎ≧３）。On the other hand, in the present invention, the space of the data array is conceptually configured by arranging the measured or calculated data in association with the sample position. For example, if the sample position is two-dimensional, the space of the data array is a two-dimensional space. In general, if the sample position is n-dimensional, the space of the data array is an n-dimensional space (n ≧ 3).

【０００９】このように構成されるデータ配列空間内で
は、近傍するデータ間において、値が類似したり、値が
連続したり、値が周期変動するなどの相関性が強く発生
する。このときの相関性（すなわち冗長性）は、従来の
ようなシリアル化の一方向にのみ生じるものとは異な
り、データ配列空間内のあるゆる方向に生じる。このよ
うに多方向に生じる空間的な相関性の中には、従来排除
できなかった、シリアル化データ列上で離れたデータ間
の冗長性も当然含まれる。In the data array space configured as described above, there is strong correlation between neighboring data, such as similar values, continuous values, and periodic fluctuations of values. The correlation (that is, redundancy) at this time occurs in any direction in the data array space, unlike the conventional case where the serialization occurs only in one direction. Such spatial correlations occurring in multiple directions naturally include redundancy between data separated on a serialized data sequence, which could not be eliminated conventionally.

【００１０】本発明のデータ圧縮部は、このような空間
的な相関性を排除するので、従来以上に効率よく、高い
圧縮率を達成することができる。なお、上述した標本位
置の次元の一つとして、時間軸を追加してもよい。この
ような場合は、単一のデータ配列空間内に、時系列デー
タをまとめて配置することが可能となる。その結果、時
間軸方向の冗長性も、データ配列空間内の空間的な相関
性とみなして排除することが可能となる。[0010] The data compression section of the present invention eliminates such spatial correlation, so that a higher compression ratio can be achieved more efficiently than in the past. Note that a time axis may be added as one of the dimensions of the sample position described above. In such a case, it is possible to arrange the time-series data collectively in a single data array space. As a result, it is possible to eliminate the redundancy in the time axis direction as a spatial correlation in the data array space.

【００１１】また、本発明では、圧縮対象のデータがテ
キスト形式であった場合、空間的な冗長性を排除する前
に、データをテキスト形式からバイナリ形式に変換する
ことが好ましい。この場合、バイナリ形式の方がテキス
ト形式よりもデータの相関性が強く現れるため、より効
率よく、高い圧縮率を達成することができる。さらに、
本発明では、圧縮対象のデータが浮動小数点形式であっ
た場合、空間的な冗長性を排除する前に、圧縮するデー
タブロックの単位で、データを浮動小数点形式から固定
小数点形式に変換したり、もしくは、圧縮するデータブ
ロックの単位で、指数部の値を一様に揃えることが好ま
しい。このような変換により、データの桁がそろって相
関性が強く現れるため、より効率よく、高い圧縮率を達
成することができる。In the present invention, if the data to be compressed is in text format, it is preferable to convert the data from text format to binary format before eliminating spatial redundancy. In this case, the binary format has higher data correlation than the text format, so that a high compression rate can be achieved more efficiently. further,
According to the present invention, when data to be compressed is in a floating-point format, before eliminating spatial redundancy, data is converted from a floating-point format to a fixed-point format in units of data blocks to be compressed, Alternatively, it is preferable that the value of the exponent part is uniformly set in units of data blocks to be compressed. By such conversion, the data digits are aligned and the correlation appears strongly, so that a high compression rate can be achieved more efficiently.

【００１２】（請求項２）請求項２に記載の発明は、請
求項１に記載のデータ圧縮装置において、データ圧縮部
は、予め定められた対応関係に基づいて、可逆圧縮方式
または非可逆圧縮方式のどちらかをデータの種類から決
定し、決定した圧縮方式を用いてデータの圧縮を行うこ
とを特徴とする。According to a second aspect of the present invention, in the data compression device according to the first aspect, the data compression unit is configured to perform a lossless compression method or a lossy compression method based on a predetermined correspondence. One of the methods is determined from the type of data, and the data is compressed using the determined compression method.

【００１３】本発明は、複数種類のデータを圧縮対象と
することができる。この場合、データの種類によって
は、データを完全に保存する必要があるものと、ある程
度データが消滅してもかまわないものとがある。請求項
２に記載の発明では、データの種類に対応して、圧縮の
仕様が予め定められる。データ圧縮部は、この対応関係
に基づいて、圧縮対象のデータの種類から使用すべき圧
縮方式（可逆圧縮か非可逆圧縮か）を自動決定する。According to the present invention, a plurality of types of data can be compressed. In this case, depending on the type of data, there are data that needs to be completely stored and data that can be deleted to some extent. According to the second aspect of the present invention, the specification of compression is predetermined in accordance with the type of data. The data compression unit automatically determines a compression method (lossless compression or lossy compression) to be used from the type of data to be compressed based on the correspondence.

【００１４】したがって、予め予想される種類のデータ
であれば、そのデータに合わせた適正な圧縮方式を手間
なく決定し、適切なデータ圧縮を施すことが可能とな
る。なお、予想されない種類のデータであった場合、本
発明品が、ユーザーに圧縮仕様を質問し、その回答結果
を上記対応関係に追加記録するようにしてもよい。この
ような構成では、対応可能なデータの種類を柔軟かつご
く自然に拡張することが可能となる。Therefore, if the data is of the type expected in advance, it is possible to easily determine an appropriate compression method in accordance with the data and perform appropriate data compression. If the data is of an unexpected type, the product of the present invention may ask the user about the compression specification, and may additionally record the result of the inquiry in the correspondence. With such a configuration, it is possible to flexibly and naturally expand the types of data that can be handled.

【００１５】（請求項３）請求項３に記載の発明は、請
求項１または請求項２に記載のデータ圧縮装置におい
て、データ圧縮部は、「標本空間の共通する複数の圧縮
済みデータ」を、１ファイルにまとめて、もしくは関連
付けられた複数ファイルとして格納することを特徴とす
る。According to a third aspect of the present invention, in the data compression apparatus according to the first or second aspect, the data compression section includes “a plurality of pieces of compressed data having a common sample space”. And is stored in a single file or as a plurality of associated files.

【００１６】従来は、データファイルのサイズが大きい
ため、個別にデータファイルを管理することが多かっ
た。そのため、例えば、データ全体をまとめて更新した
り、伝送するような場合も、一つ一つのデータファイル
について個別に処理しなければならず、データの管理が
非常に煩雑であった。しかしながら、請求項３に記載の
発明では、標本空間の共通する複数の圧縮済みデータ
を、次のいずれかの形式で格納する。（１）１ファイルにまとめて格納する。（２）関連付けられた複数ファイルとして格納する。このような形式で格納することにより、これらの標本空
間の共通するデータをひとまとめに管理および操作する
ことが容易となる。Conventionally, data files are often individually managed because of the large size of the data files. For this reason, for example, in the case where the entire data is collectively updated or transmitted, each data file must be individually processed, and data management is very complicated. However, according to the third aspect of the present invention, a plurality of compressed data having a common sample space are stored in one of the following formats. (1) Collectively store in one file. (2) Store as a plurality of associated files. By storing in such a format, it becomes easy to manage and operate the data common to these sample spaces collectively.

【００１７】（請求項４）請求項４に記載の発明は、請
求項１ないし請求項３のいずれか１項に記載のデータ圧
縮装置において、データ圧縮部は、データ配列をブロッ
ク分けし、各ブロックごとの直流成分と交流成分とを算
出し、該直流成分および該交流成分を圧縮して格納する
圧縮格納手段と、圧縮格納手段から圧縮済みデータを読
み出し、該圧縮済みデータから各ブロックごとの直流成
分を抽出して、データ処理に供する平均データ抽出手段
とを含むことを特徴とする。According to a fourth aspect of the present invention, in the data compression device according to any one of the first to third aspects, the data compression unit divides the data array into blocks, A compression storage unit that calculates a DC component and an AC component for each block, compresses and stores the DC component and the AC component, reads out compressed data from the compression storage unit, and reads each compressed block from the compressed data. Average data extracting means for extracting a DC component and providing the data to data processing.

【００１８】一般に、データ処理に使用される表計算ソ
フトなどでは、例えば、最大２５６列までのデータしか
処理できない。また、データ解析やグラフ表示などの用
途では、データの標本間隔が適当に広い方が全体傾向を
把握する上で好ましい場合も多い。このような事情か
ら、標本数の膨大なデータについては、データ処理に先
だって、データの標本数を適度に減少させる処理（以下
「減数処理」という）が、高い頻度で生じる。In general, for example, spreadsheet software used for data processing can process only up to 256 columns of data. Also, in applications such as data analysis and graph display, it is often preferable that the data sampling interval be appropriately wide in order to grasp the overall tendency. Under such circumstances, for data with a large number of samples, a process of appropriately reducing the number of data samples (hereinafter referred to as “subtraction process”) occurs at a high frequency prior to data processing.

【００１９】そのため従来は、圧縮ファイルを復号化
（解凍）したのち、データの減数処理を実行することが
多かった。この場合、復号化された膨大なデータ量をメ
モリ上で扱わなければならず、データの参照順序によっ
てはハードディスクとの間でスワップ処理が頻繁に発生
するなど、処理速度が遅くなるという不具合が生じてい
た。For this reason, conventionally, data reduction processing is often performed after decoding (decompressing) a compressed file. In this case, a huge amount of decrypted data must be handled in the memory, and depending on the data reference order, there is a problem that the processing speed is slow, such as frequent swap processing with the hard disk. I was

【００２０】ところが、請求項４に記載の発明では、圧
縮済みデータから直流成分をじかに抽出する。この直流
成分は圧縮ブロック数まで既に減数されているので、上
記の減数処理を省略または軽減することが可能となる。
この場合、膨大なデータをメモリ上に展開する必要がな
く、処理速度が遅くなるなどの不具合を回避することが
できる。その上、直流成分は、ブロックの平均値もしく
は中間値なので、標本数を低減する際に生じる折り返し
雑音が含まれない。したがって、このような直流成分
は、データ解析やグラフ表示を用いて全体傾向を把握す
る際に、特に適したデータとなる。However, in the fourth aspect of the present invention, a DC component is directly extracted from the compressed data. Since the DC component has already been reduced to the number of compressed blocks, the above-described reduction processing can be omitted or reduced.
In this case, it is not necessary to expand a huge amount of data on the memory, and it is possible to avoid a problem such as a slow processing speed. In addition, since the DC component is an average value or an intermediate value of blocks, aliasing noise generated when the number of samples is reduced is not included. Therefore, such a DC component becomes data particularly suitable for grasping the overall tendency using data analysis and graph display.

【００２１】（請求項５）請求項５に記載の発明は、請
求項１ないし請求項４のいずれか１項に記載のデータ圧
縮装置において、データ圧縮部は、データ配列を直交変
換により周波数空間上に変換し、変換後データの低域周
波数成分への偏りに基づいて冗長を排除し、データの圧
縮を行うことを特徴とする。(Claim 5) According to a fifth aspect of the present invention, in the data compression apparatus according to any one of the first to fourth aspects, the data compression section converts a data array into a frequency space by orthogonal transform. Up conversion is performed, and data is compressed by removing redundancy based on the bias of the converted data toward low frequency components.

【００２２】従来、地勢データや統計データなどを扱う
分野では、個々の値のまま処理されることが多く、周波
数空間に変換するなどの処理はおよそ誰も考えなかっ
た。しかしながら、本発明者は、これらのデータをデー
タ配列空間上にデータ配列することにより、データ配列
空間から周波数空間への直交変換を可能とした。この場
合、データ配列空間上におけるデータの連続性は、周波
数空間上において、低域周波数成分への偏りとなって現
れる。したがって、この偏りに基づく冗長を排除するこ
とにより、データ圧縮を合理的かつ能率的に実行するこ
とが可能となる。Conventionally, in the field of handling terrain data, statistical data, and the like, individual values are often processed as they are, and almost nobody considered processing such as conversion to frequency space. However, the present inventor has made it possible to perform orthogonal transformation from the data array space to the frequency space by arranging these data in the data array space. In this case, the continuity of data in the data array space appears as a bias toward low frequency components in the frequency space. Therefore, by eliminating the redundancy based on this bias, it is possible to execute data compression rationally and efficiently.

【００２３】（請求項６）請求項６に記載の発明は、請
求項１ないし請求項５のいずれか１項に記載のデータ圧
縮装置において、標本位置は、地理的な位置であり、デ
ータは、地勢データ、国勢データ、または気象データの
少なくとも一つであることを特徴とする。(Claim 6) According to a sixth aspect of the present invention, in the data compression apparatus according to any one of the first to fifth aspects, the sample position is a geographical position, and the data is , Terrain data, national census data, or weather data.

【００２４】（請求項７）請求項７に記載の発明は、請
求項６に記載のデータ圧縮装置において、地勢データ
は、標高データ、地質データ、または地理的情報を示す
地理データの少なくとも一つであることを特徴とする。According to a seventh aspect of the present invention, in the data compression apparatus according to the sixth aspect, the terrain data is at least one of elevation data, geological data, or geographic data indicating geographic information. It is characterized by being.

【００２５】（請求項８）請求項８に記載の発明は、請
求項６に記載のデータ圧縮装置において、国勢データ
は、人口データ、交通量データその他の社会統計データ
の少なくとも一つであることを特徴とする。(Claim 8) The invention according to claim 8 is the data compression device according to claim 6, wherein the national census data is at least one of population data, traffic data and other social statistics data. It is characterized by.

【００２６】（請求項９）請求項９に記載の記録媒体に
は、コンピュータを「請求項１ないし請求項８のいずれ
か１項に記載のデータ取得部およびデータ圧縮部」とし
て機能させるためのデータ圧縮プログラムが記録され
る。このデータ圧縮プログラムを実行することにより、
コンピュータ上で、「請求項１ないし請求項８のいずれ
か１項に記載のデータ圧縮装置」が実現される。(Claim 9) The recording medium according to claim 9 has a computer for causing the computer to function as the "data acquisition unit and data compression unit according to any one of claims 1 to 8". A data compression program is recorded. By running this data compression program,
On a computer, a “data compression device according to any one of claims 1 to 8” is realized.

【００２７】[0027]

【発明の実施の形態】以下、図面に基づいて本発明にお
ける実施の形態を説明する。本実施形態は、請求項１〜
９に記載の発明に対応した実施形態である。Embodiments of the present invention will be described below with reference to the drawings. This embodiment is defined by claims 1 to
This is an embodiment corresponding to the invention described in Item 9.

【００２８】（実施形態の構成）図１は、コンピュータ
１１を使用したデータ圧縮装置の全体構成を示す図であ
る。図１において、コンピュータ１１の内部には、ＭＰ
Ｕ（マイクロプロセッサ）１２が設けられる。このＭＰ
Ｕ１２には、キーボードやマウスなどからなる入力装置
１３、ハードディスク１６、メモリ１７、画像処理ボー
ド１８、並びにインターフェースボード２０が接続され
る。この画像処理ボード１８の画像出力端子には、モニ
タ１９が接続される。一方、インターフェースボード２
０には、外部記録媒体などの外部機器２１が接続され
る。(Configuration of Embodiment) FIG. 1 is a diagram showing the overall configuration of a data compression apparatus using a computer 11. In FIG. 1, an MP
A U (microprocessor) 12 is provided. This MP
The input device 13 including a keyboard and a mouse, a hard disk 16, a memory 17, an image processing board 18, and an interface board 20 are connected to the U12. A monitor 19 is connected to an image output terminal of the image processing board 18. On the other hand, interface board 2
0 is connected to an external device 21 such as an external recording medium.

【００２９】また一方、ＭＰＵ１２にはＣＤ−ＲＯＭド
ライブ装置２２が接続される。このＣＤ−ＲＯＭドライ
ブ装置２２には、データ圧縮プログラム、およびそのイ
ンストールプログラムを記録したＣＤ−ＲＯＭ２３が挿
入される。このＣＤ−ＲＯＭ２３内のインストールプロ
グラムにより、ＭＰＵ１２は、ＣＤ−ＲＯＭ２３内のデ
ータ圧縮プログラムを展開し、ハードディスク１６に実
行可能な状態で格納する。On the other hand, a CD-ROM drive device 22 is connected to the MPU 12. The CD-ROM drive 22 has a CD-ROM 23 in which a data compression program and an installation program thereof are recorded. With the installation program in the CD-ROM 23, the MPU 12 expands the data compression program in the CD-ROM 23 and stores it in the hard disk 16 in an executable state.

【００３０】（本発明と実施形態との対応関係につい
て）請求項１に記載の発明と実施形態との対応関係につ
いては、データ取得部は、ＭＰＵ１２の「外部機器２１
やＣＤ−ＲＯＭドライブ装置２２などからデータを読み
出す機能」に対応し、データ圧縮部は、ＭＰＵ１２の
「データをデータ配列空間上に配列し、そのデータ配列
空間の冗長性を排除してデータ圧縮を行う機能」に対応
する。(Regarding Correspondence Between the Present Invention and the Embodiment) Regarding the correspondence between the invention described in claim 1 and the embodiment, the data acquisition unit is configured to execute the “
And a function of reading data from the CD-ROM drive device 22 or the like, and the data compression unit of the MPU 12 arranges data in a data array space, eliminates redundancy in the data array space, and performs data compression. Function to perform ".

【００３１】請求項２に記載の発明と実施形態との対応
関係については、データ圧縮部は、ＭＰＵ１２の「デー
タの種類を判別して、データの圧縮方式を自動決定する
機能」に対応する。請求項３に記載の発明と実施形態と
の対応関係については、ＭＰＵ１２の「標本空間の共通
するデータを１ファイルもしくは関連ファイルとして記
録する機能」に対応する。Regarding the correspondence between the invention described in claim 2 and the embodiment, the data compression unit corresponds to the “function of determining the type of data and automatically determining the data compression method” of the MPU 12. The correspondence between the invention described in claim 3 and the embodiment corresponds to the “function of recording data common to a sample space as one file or a related file” of the MPU 12.

【００３２】請求項４に記載の発明と実施形態との対応
関係については、圧縮格納手段は、ＭＰＵ１２の「デー
タ配列をブロック単位で直流成分と交流成分とに分け、
それぞれに圧縮格納する機能」に対応し、平均データ抽
出手段は、ＭＰＵ１２の「圧縮済みファイルから直流成
分を抽出してデータ処理に供する機能」に対応する。請
求項５に記載の発明と実施形態との対応関係について
は、データ圧縮部は、ＭＰＵ１２の「データ配列をＤＣ
Ｔ変換して圧縮する機能」に対応する。請求項９に記載
の発明と実施形態との対応関係については、機械読み取
り可能な記録媒体が、ＣＤ−ＲＯＭ２３、ハードディス
ク１６またはメモリ１７に対応する。As for the correspondence between the invention described in claim 4 and the embodiment, the compression storage means is configured to divide the data array of the MPU 12 into a DC component and an AC component in block units,
The average data extracting means corresponds to the “function of extracting a DC component from a compressed file and providing the data processing” of the MPU 12. Regarding the correspondence between the invention described in claim 5 and the embodiment, the data compression unit determines whether the data array of the MPU 12
T-conversion and compression function ". Regarding the correspondence between the invention described in claim 9 and the embodiment, the machine-readable recording medium corresponds to the CD-ROM 23, the hard disk 16, or the memory 17.

【００３３】（圧縮処理の動作説明）図２は、本実施形
態における圧縮処理ルーチンを示す図である。以下、図
２に示すステップ番号を参照しながら、圧縮処理の動作
を説明する。まず、ユーザーによりデータ圧縮プログラ
ムが起動されると、ＭＰＵ１２は、圧縮対象のデータフ
ァイルをオープンする（ステップＳ１）。次に、ＭＰＵ
１２は、このデータファイルの拡張子またはヘッダ情報
などからデータの種類を判別する（ステップＳ２）。(Explanation of Operation of Compression Processing) FIG. 2 is a diagram showing a compression processing routine in this embodiment. Hereinafter, the operation of the compression process will be described with reference to the step numbers shown in FIG. First, when a data compression program is started by a user, the MPU 12 opens a data file to be compressed (step S1). Next, MPU
12 determines the type of data from the extension of the data file or the header information (step S2).

【００３４】ＭＰＵ１２は、判別したデータの種類に基
づいて対応テーブルを参照する。この対応テーブルに
は、データの種類と、それに対応する「圧縮方式や圧縮
パラメータなどの圧縮仕様」が予め登録されている。デ
ータの種類が未登録であれば、ＭＰＵ１２は、モニタ１
９上に入力ダイアログを表示し、圧縮仕様をユーザーに
新規入力してもらう。この新規の圧縮仕様は、データの
種類に対応付けて対応テーブルに追加記録される。一
方、データの種類が登録済みであれば、対応テーブルか
ら圧縮仕様を読み出し、圧縮方式と圧縮バラメータとを
自動決定する（ステップＳ３，Ｓ４）。The MPU 12 refers to the correspondence table based on the determined data type. In the correspondence table, data types and corresponding “compression specifications such as compression method and compression parameter” are registered in advance. If the data type has not been registered, the MPU 12
9 to display an input dialog and ask the user to input a new compression specification. This new compression specification is additionally recorded in the correspondence table in association with the type of data. On the other hand, if the data type has been registered, the compression specification is read from the correspondence table, and the compression method and compression parameter are automatically determined (steps S3 and S4).

【００３５】次に、ＭＰＵ１２は、データの種類やデー
タの内容から、データやデータ行の区切りが識別可能か
否かを判定する（ステップＳ５）。例えば、データ区切
りが可能なデータとしては、コンマ・改行区切りのテキ
ストデータや、縦横の標本数をファイルヘッダ部などに
格納した固定長データなどがある。Next, the MPU 12 determines from the type of data and the contents of the data whether or not data or a data line break can be identified (step S5). For example, data that can be delimited include comma-separated / new-line delimited text data and fixed-length data in which the number of vertical and horizontal samples is stored in a file header or the like.

【００３６】このようなデータについては、標本空間の
再現がそのまま可能なので、ＭＰＵ１２は、ステップＳ
７に動作を移す。一方、データやデータ行の区切りを識
別できないデータの場合（ステップＳ５のＮＯ側）、Ｍ
ＰＵ１２は、モニタ１９上に入力ダイアログを表示し、（１）標本空間の次元数（２）標本空間上における各軸方向の標本数などの情報をユーザーに要求する（ステップＳ６）。With respect to such data, the reproduction of the sample space can be performed as it is.
Move the operation to 7. On the other hand, if the data or the data line cannot be separated (NO in step S5), M
The PU 12 displays an input dialog on the monitor 19, and requests information such as (1) the number of dimensions of the sample space and (2) the number of samples in each axial direction on the sample space (step S6).

【００３７】このような処理の後、ＭＰＵ１２は、標本
位置に各対応した多次元配列をメモリ１７上に用意する
（ステップＳ７）。次に、ＭＰＵ１２は、データの種類
やデータの内容から、データがテキスト形式か否かを判
別する（ステップＳ８）。もしも、データがバイナリ形
式の場合（ステップＳ８のＮＯ側）、ＭＰＵ１２は、ス
テップＳ１０に動作を移行する。After such processing, the MPU 12 prepares a multidimensional array corresponding to each sample position on the memory 17 (step S7). Next, the MPU 12 determines whether the data is in a text format based on the type of the data and the content of the data (Step S8). If the data is in binary format (NO in step S8), the MPU 12 shifts the operation to step S10.

【００３８】一方、データがテキスト形式の場合（ステ
ップＳ８のＹＥＳ側）、ＭＰＵ１２は、テキストデータ
をバイナリデータに変換する（ステップＳ９）。次に、
ＭＰＵ１２は、データの種類やデータの内容から、デー
タが浮動小数点形式か否かを判別する（ステップＳ１
０）。もしも、データが浮動小数点形式でなければ（ス
テップＳ１０のＮＯ側）、ＭＰＵ１２は、ステップＳ１
２に動作を移行する。On the other hand, if the data is in a text format (YES in step S8), the MPU 12 converts the text data into binary data (step S9). next,
The MPU 12 determines whether the data is in a floating-point format based on the type of the data and the content of the data (Step S1).
0). If the data is not in the floating-point format (NO in step S10), the MPU 12 proceeds to step S1
The operation is shifted to 2.

【００３９】一方、データが浮動小数点形式であった場
合（ステップＳ１０のＹＥＳ側）、ＭＰＵ１２は、後述
する圧縮用ブロックの単位ごとに、データの指数部を揃
える（ステップＳ１１）。このような処理の後、ＭＰＵ
１２は、標本位置に応じて、データを多次元配列に格納
する（ステップＳ１２）。On the other hand, if the data is in the floating-point format (YES in step S10), the MPU 12 aligns the exponent part of the data for each compression block unit described later (step S11). After such processing, the MPU
12 stores the data in a multidimensional array according to the sample position (step S12).

【００４０】次に、ＭＰＵ１２は、上記のステップＳ３
で決定した圧縮方式に応じて、データ圧縮を開始する
（ステップＳ１３）。例えば、圧縮方式として可逆圧縮
方式が選択されていた場合、ＭＰＵ１２は、先に圧縮さ
れたデータを基準にして、後から圧縮されるデータの予
測値を算出する。続いて、ＭＰＵ１２は、この予測値と
実際のデータとの差分（予測誤差）を算出する（ステッ
プＳ１４）。Next, the MPU 12 executes the above step S3
Data compression is started in accordance with the compression method determined in (1) (step S13). For example, when the reversible compression method is selected as the compression method, the MPU 12 calculates a predicted value of data to be compressed later based on the data compressed earlier. Subsequently, the MPU 12 calculates a difference (prediction error) between the predicted value and actual data (Step S14).

【００４１】このように求めた予測誤差をエントロピー
符号化して、データ量を圧縮する（ステップＳ１５）。
また一方、圧縮方式として非可逆圧縮方式が選択された
場合、ＭＰＵ１２は、多次元配列を８×８程度の圧縮用
ブロックに分割し、ブロック単位にＤＣＴ（離散コサイ
ン）変換を実行する（ステップＳ１６）。なお、指数デ
ータの場合は、仮数部についてＤＣＴ変換を実行する。The thus obtained prediction error is entropy-coded to compress the data amount (step S15).
On the other hand, when the irreversible compression method is selected as the compression method, the MPU 12 divides the multidimensional array into approximately 8 × 8 compression blocks, and performs DCT (discrete cosine) transform on a block basis (step S16). ). In the case of exponential data, DCT is performed on the mantissa.

【００４２】ＭＰＵ１２は、このように求めたＤＣＴ係
数を、圧縮仕様により決定された量子化テーブルを用い
て、量子化する（ステップＳ１７）。ＭＰＵ１２は、量
子化ＤＣＴ係数の直流成分については、直前ブロックの
直流成分との差分をとってエントロピー符号化する（ス
テップＳ１８）。なお、指数データの場合、ＭＰＵ１２
は、データの指数部に対しても、直流成分と同様の圧縮
処理を施す。The MPU 12 quantizes the DCT coefficients thus obtained by using the quantization table determined by the compression specification (Step S17). The MPU 12 entropy-encodes the DC component of the quantized DCT coefficient by taking the difference from the DC component of the immediately preceding block (step S18). In the case of index data, MPU12
Performs the same compression processing on the exponent part of the data as the DC component.

【００４３】ＭＰＵ１２は、量子化ＤＣＴ係数の交流成
分については、空間周波数順にジグザグスキャンして、
エントロピー符号化する（ステップＳ１９）。以上の圧
縮処理を完了した後、ＭＰＵ１２は、圧縮処理されたデ
ータを新規の圧縮済みファイルに格納し、全ファイルを
クローズする（ステップＳ２０）。このような一連の動
作により、データファイルの圧縮が完了する。The MPU 12 performs a zigzag scan on the AC component of the quantized DCT coefficient in the order of the spatial frequency.
Entropy coding is performed (step S19). After completing the above compression processing, the MPU 12 stores the compressed data in a new compressed file, and closes all the files (Step S20). Through such a series of operations, compression of the data file is completed.

【００４４】（標本空間の共通する圧縮済みデータの格
納について）次に、図３を用いて、圧縮済みデータの格
納形態について数例説明する。図３（ａ）は、標本空間
の共通する複数種類の圧縮済みデータ（ここでは標高デ
ータ，地質データ，土地利用データ）を模式的に示した
図である。図３（ｂ）は、インターリーブ式の格納形態
を示す図である。この格納形態の場合、ＭＰＵ１２は複
数種類の圧縮済みデータを、１ファイル内に交互に格納
する。このような格納形態では、複数種類のデータを標
本位置ごとにひとまとめに読み出すことが可能となる。
したがって、複数種類のデータを相関処理するなどの解
析用途に特に適した格納形態となる。(Storage of Compressed Data Common to Sample Space) Next, several examples of the storage form of compressed data will be described with reference to FIG. FIG. 3A is a diagram schematically illustrating a plurality of types of compressed data (here, elevation data, geological data, and land use data) having a common sample space. FIG. 3B is a diagram showing an interleaved storage format. In the case of this storage mode, the MPU 12 stores a plurality of types of compressed data alternately in one file. In such a storage form, a plurality of types of data can be collectively read for each sample position.
Therefore, the storage mode is particularly suitable for analysis such as performing correlation processing on a plurality of types of data.

【００４５】図３（ｃ）は、ノンインターリーブ式の格
納形態を示す図である。この格納形態の場合、ＭＰＵ１
２は、複数種類の圧縮済みデータを、１ファイル内の独
立した領域に別々に格納する。このような格納形態で
は、複数種類のデータを個別に読み出すことが容易とな
る。したがって、複数種類のデータを個別に処理・解析
する用途に特に適した格納形態となる。FIG. 3C shows a non-interleaved storage format. In the case of this storage mode, MPU1
2 separately stores a plurality of types of compressed data in independent areas within one file. In such a storage form, it becomes easy to individually read a plurality of types of data. Therefore, the storage mode is particularly suitable for the purpose of individually processing and analyzing a plurality of types of data.

【００４６】図３（ｄ）は、関連付けファイルによる格
納形態を示す図である。この格納形態の場合、ＭＰＵ１
２は、複数種類の圧縮済みデータを、個別のファイルに
それぞれに格納する。このとき、ＭＰＵ１２は、各ファ
イルのヘッダ部（もしくは別の情報ファイル）に、標本
空間が共通することを示す関連付け情報を記録する。こ
のような格納形態では、個々のファイルを個別に管理し
たり、関連付けを辿って管理するなど、柔軟なデータ管
理が可能となる。FIG. 3D is a diagram showing a storage form using an association file. In the case of this storage mode, MPU1
2 stores a plurality of types of compressed data in individual files. At this time, the MPU 12 records the association information indicating that the sample space is common in the header part (or another information file) of each file. In such a storage form, flexible data management is possible, such as managing individual files individually or managing them by associating them.

【００４７】（直流成分の抽出動作について）図４は、
ＭＰＵ１２による直流成分の抽出ルーチンを示す図であ
る。次に、図４に基づいて、圧縮済みファイルから直流
成分を抽出する場合の動作について説明する。まず、ユ
ーザーが直流成分の抽出動作を指示すると、ＭＰＵ１２
は、圧縮済みファイルをオープンする（ステップＳ３
１）。(Regarding DC Component Extraction Operation) FIG.
FIG. 4 is a diagram showing a DC component extraction routine by the MPU 12. Next, an operation for extracting a DC component from a compressed file will be described with reference to FIG. First, when the user instructs a DC component extraction operation, the MPU 12
Opens the compressed file (step S3
1).

【００４８】ＭＰＵ１２は、この圧縮済みファイルから
直流成分のみを選択的に読み出す（ステップＳ３２）。
次に、ＭＰＵ１２は、圧縮符号化されている直流成分を
復号化する（ステップＳ３３）。続いて、ＭＰＵ１２
は、復号化された直流成分を、カンマ区切りテキスト形
式に変換して、テキストファイルを作成する（ステップ
Ｓ３４）。以上のような一連の直流成分抽出動作によ
り、表計算ソフトなどで処理するに適したカンマ区切り
形式のテキストファイルが迅速に作成できる。The MPU 12 selectively reads only the DC component from the compressed file (step S32).
Next, the MPU 12 decodes the DC component that has been compression-encoded (Step S33). Then, MPU12
Converts the decoded DC component into a comma-separated text format to create a text file (step S34). By a series of DC component extraction operations as described above, a comma-delimited text file suitable for processing with spreadsheet software or the like can be quickly created.

【００４９】（圧縮処理結果の一例について）図５
（ａ）は、圧縮前のデータをグラフ表示したものであ
る。このデータは、北海道の層雲峡の２次元位置におけ
る標高データを４バイト整数で表現したものであり、２
５．６ｋバイト程度のデータ容量である。一方、図５
（ｂ）は、この標高データを、上記した非可逆圧縮方式
の手順で一旦圧縮した後、解凍してグラフ表示したもの
である。圧縮済みファイルのデータ容量は、３．４ｋバ
イト程度と少なくなっている。また、図６は、この標高
データの圧縮誤差をグラフ表示したものである。この場
合、量子化テーブルやスケールファクタの設定によって
も異なるが、圧縮誤差はおおむね１５ｍ以下に制限され
る。(Example of Compression Process Result) FIG.
(A) is a graph showing data before compression. This data expresses the altitude data at the two-dimensional position of Sounkyo in Hokkaido as a 4-byte integer.
The data capacity is about 5.6 kbytes. On the other hand, FIG.
(B) is a graph obtained by temporarily compressing the altitude data according to the procedure of the irreversible compression method, decompressing the altitude data, and displaying the data in a graph. The data capacity of the compressed file is as small as about 3.4 kbytes. FIG. 6 is a graph showing the compression error of the elevation data. In this case, the compression error is generally limited to 15 m or less, depending on the setting of the quantization table and the scale factor.

【００５０】（本実施形態の効果）以上説明した動作に
より、本実施形態では、データ配列空間上において効率
よく空間的な冗長性を排除することが可能となり、高い
圧縮率を達成することができる。また、本実施形態で
は、そのデータに合わせた適正な圧縮方式を自動決定す
るので、多様なデータに対して適切なデータ圧縮を施す
ことができる。(Effect of the present embodiment) By the operation described above, in the present embodiment, it is possible to efficiently eliminate the spatial redundancy in the data array space and achieve a high compression ratio. . Further, in the present embodiment, since an appropriate compression method according to the data is automatically determined, appropriate data compression can be performed on various data.

【００５１】さらに、本実施形態では、標本空間の共通
する複数の圧縮済みデータを、図３に示すような格納形
態で格納する。したがって、標本空間の共通するデータ
をひとまとめに管理・操作することが容易となる。ま
た、本実施形態では、圧縮済みデータから直流成分をじ
かに抽出することにより、表計算ソフトに適した入力デ
ータを迅速に作成することができる。さらに、本実施形
態では、直交変換を利用して周波数空間上でデータ圧縮
を行うので、データの連続性、周期性などに着目して、
データ圧縮を合理的かつ能率的に実行することが可能と
なる。ところで、本実施形態に適する圧縮対象として
は、次のものが特に挙げられる。Further, in this embodiment, a plurality of compressed data having a common sample space are stored in a storage form as shown in FIG. Therefore, it becomes easy to collectively manage and operate data common to the sample space. In the present embodiment, by directly extracting the DC component from the compressed data, input data suitable for spreadsheet software can be quickly created. Furthermore, in the present embodiment, since data compression is performed on the frequency space using orthogonal transformation, attention is paid to continuity, periodicity, and the like of data.
Data compression can be performed reasonably and efficiently. By the way, the following are particularly mentioned as compression targets suitable for the present embodiment.

【００５２】 ◎地勢データ（特に、標高データ、地質データ、地理的
情報） ◎国勢データ（特に、人口データ、交通量データその他
の社会統計データ） ◎気象データこれらのデータは、いずれもデータの標本数が特に膨大
であり、かつデータ配列空間に冗長性が生じやすい。し
たがって、本実施形態のデータ圧縮対象として特に好適
である。◎ Terrain data (especially elevation data, geological data, geographic information) ◎ Census data (especially population data, traffic data and other social statistical data) ◎ Meteorological data These data are all data samples The number is particularly large, and the data array space is likely to have redundancy. Therefore, it is particularly suitable as a data compression target of the present embodiment.

【００５３】（実施形態の補足事項）なお、上述したデ
ータの種類に限らず、一般に、２次元空間や３次元空間
やｎ次元空間を標本空間とする物理量データを圧縮対象
としてもよい。さらに具体的には、海洋、人体、固体サ
ンプル面などを標本空間とするデータを圧縮対象として
もよい。(Supplementary Items of the Embodiment) Note that, not limited to the types of data described above, in general, physical quantity data using a two-dimensional space, a three-dimensional space, or an n-dimensional space as a sample space may be set as a compression target. More specifically, data having a sample space of the ocean, the human body, the solid sample surface, or the like may be the compression target.

【００５４】また、個々のデータの種類としては、電
界、磁界、温度、湿度、重量、弾性率、水分％、異常細
胞量、血液量、（人体の各位置に対する）脂肪量、蛋白
質量、海流の速度、海流の速度ベトクル、プランクトン
量、海水の成分などのデータを圧縮対象としてもよい。
さらに、［課題を解決するための手段］の欄で述べたよ
うに、データ配列空間の次元に時間軸を追加してもよ
い。このような一例としては、２次元の測定点ごとに測
定した月単位の気温データの圧縮処理が挙げられる。こ
の場合、データ配列空間は、２本の空間軸と、１本の時
間軸とから構成され、３次元空間となる。このデータ配
列空間上では、これら３軸の方向に強い相関性を生じる
ため、本発明のデータ圧縮を適用して高い圧縮率を得る
ことが可能となる。The types of individual data include electric field, magnetic field, temperature, humidity, weight, elastic modulus, moisture%, abnormal cell amount, blood amount, fat amount (for each position of the human body), protein mass, and ocean current. Data such as the velocity of the sea current, the velocity vector of the ocean current, the amount of plankton, and the composition of the seawater may be used as the compression target.
Further, as described in the section of [Means for Solving the Problem], a time axis may be added to the dimension of the data array space. An example of such an example is a process of compressing monthly temperature data measured for each two-dimensional measurement point. In this case, the data array space is composed of two spatial axes and one time axis, and is a three-dimensional space. In this data array space, since a strong correlation is generated in the directions of these three axes, a high compression ratio can be obtained by applying the data compression of the present invention.

【００５５】また、上述した実施形態では、以上のよう
な多様な物理量データを、適当な圧縮方式で圧縮し、適
切にまとめて管理することが容易に可能となる。したが
って、これら物理量間の相関関係を解析する研究環境を
提供する上で、本実施形態は非常に有効である。なお、
上述した実施形態では、ＤＣＴ変換を行う場合について
説明したが、これに限定されるものではない。ＤＣＴ変
換の代わりに、その他の直交変換を行ってもよい。Further, in the above-described embodiment, it is possible to easily compress the various physical quantity data as described above by an appropriate compression method and appropriately manage them collectively. Therefore, the present embodiment is very effective in providing a research environment for analyzing the correlation between these physical quantities. In addition,
In the embodiment described above, the case where the DCT transform is performed has been described, but the present invention is not limited to this. Other orthogonal transform may be performed instead of the DCT transform.

【００５６】さらに、上述した実施形態では、標本間隔
が等間隔であるデータについて説明したが、これに限定
されるものではない。一般に、データ配列空間上のデー
タ相関性を利用できれば本発明の効果が得られるので、
標本間隔は必ずしも等間隔である必要はない。例えば、
不等間隔な町単位のデータを町のおおまかな位置関係に
応じて配列することにより、データ配列空間を構成して
もよい。このような場合でも、データ配列空間上にデー
タ相関性が存在すれば本発明の効果を充分得ることが可
能である。Further, in the above-described embodiment, data in which the sample intervals are equal are described. However, the present invention is not limited to this. In general, if the data correlation on the data array space can be used, the effect of the present invention can be obtained.
The sample intervals need not necessarily be equal. For example,
The data arrangement space may be configured by arranging unequally-spaced town-unit data in accordance with the approximate positional relationship of the town. Even in such a case, if the data correlation exists in the data array space, the effect of the present invention can be sufficiently obtained.

【００５７】[0057]

【発明の効果】（請求項１）請求項１に記載の発明で
は、データ配列空間内の空間的な冗長性を排除するの
で、高い圧縮率を達成することができる。According to the first aspect of the present invention, since the spatial redundancy in the data array space is eliminated, a high compression ratio can be achieved.

【００５８】（請求項２）請求項２に記載の発明では、
予め予想される種類のデータであれば、そのデータに合
わせた適正な圧縮方式を手間なく自動決定して、適切な
データ圧縮を確実に施すことが可能となる。(Claim 2) In the invention according to claim 2,
If the data is of the type expected in advance, it is possible to automatically determine an appropriate compression method according to the data without any hassle, and to appropriately perform appropriate data compression.

【００５９】（請求項３）請求項３に記載の発明では、
標本空間の共通する複数の圧縮済みデータを、次のいず
れかの形式で格納する。(Claim 3) In the invention according to claim 3,
A plurality of compressed data having a common sample space is stored in one of the following formats.

【００６０】（１）１ファイルにまとめて格納する（２）関連付けられた複数ファイルとして格納するこのような形式で格納することにより、これらの標本空
間の共通するデータをひとまとめに管理および操作する
ことが容易となる。(1) Collectively store in one file (2) Store as associated multiple files By storing in such a format, data common to these sample spaces can be managed and operated collectively. Becomes easier.

【００６１】（請求項４）請求項４に記載の発明では、
圧縮済みデータから直流成分をじかに抽出するので、デ
ータの減数処理を改めて行う必要が特にない。したがっ
て、その分だけデータ処理時間を短縮することができ
る。(Claim 4) In the invention according to claim 4,
Since the DC component is directly extracted from the compressed data, it is not particularly necessary to perform the data reduction process again. Therefore, the data processing time can be shortened accordingly.

【００６２】（請求項５）請求項５に記載の発明では、
データ配列空間から周波数空間への直交変換を行うこと
により、データ圧縮を合理的かつ能率的に実行すること
が容易となる。(Claim 5) In the invention according to claim 5,
Performing orthogonal transform from the data array space to the frequency space facilitates rational and efficient data compression.

【００６３】（請求項６、７，８）請求項６に記載の発
明では、地勢データ、国勢データ、または気象データを
圧縮する。請求項７に記載の発明では、標高データ、地
質データ、または地理的情報を圧縮する。(Claims 6, 7, and 8) In the invention described in claim 6, terrain data, national census data, or weather data is compressed. According to the seventh aspect of the present invention, altitude data, geological data, or geographic information is compressed.

【００６４】請求項８に記載の発明では、人口データ、
交通量データその他の社会統計データを圧縮する。これ
らのデータは、いずれもデータの標本数が一般に膨大で
あり、かつデータ配列空間に冗長性が発生しやすい。し
たがって、本発明のデータ圧縮対象として特に好適であ
る。したがって、これらの膨大なデータを小規模のコン
ピュータで処理する上で、本発明は特に重要な技術とな
る。In the invention according to claim 8, the population data,
Compress traffic data and other social statistics data. Each of these data generally has an enormous number of data samples, and redundancy tends to occur in the data array space. Therefore, it is particularly suitable as a data compression target of the present invention. Therefore, the present invention is a particularly important technique when processing such a large amount of data with a small computer.

【００６５】（請求項９）請求項９に記載の記録媒体を
使用することにより、コンピュータ上において、「請求
項１ないし請求項８のいずれか１項に記載のデータ圧縮
装置」を実現することが可能となる。(Claim 9) By using the recording medium according to claim 9, a "data compression device according to any one of claims 1 to 8" is realized on a computer. Becomes possible.

[Brief description of the drawings]

【図１】コンピュータ１１を使用したデータ圧縮装置の
全体構成を示す図である。FIG. 1 is a diagram showing an overall configuration of a data compression device using a computer 11;

【図２】本実施形態における圧縮処理を説明する流れ図
である。FIG. 2 is a flowchart illustrating a compression process according to the embodiment.

【図３】標本空間の共通するデータの格納形態を説明す
る図である。FIG. 3 is a diagram illustrating a storage form of data common to a sample space.

【図４】直流成分の抽出ルーチンを示す図である。FIG. 4 is a diagram showing a DC component extraction routine.

【図５】圧縮処理結果の一例を示す図である。FIG. 5 is a diagram illustrating an example of a compression processing result.

【図６】圧縮誤差の一例を示す図である。FIG. 6 is a diagram illustrating an example of a compression error.

[Explanation of symbols]

１１コンピュータ１２ＭＰＵ１３入力装置１６ハードディスク１７メモリ１８画像処理ボード１９モニタ２０インターフェースボード２１外部機器２２ＣＤ−ＲＯＭドライブ装置２３ＣＤ−ＲＯＭ 11 Computer 12 MPU 13 Input Device 16 Hard Disk 17 Memory 18 Image Processing Board 19 Monitor 20 Interface Board 21 External Equipment 22 CD-ROM Drive Device 23 CD-ROM

Claims

[Claims]

A data acquisition unit for acquiring data measured or calculated for each of a plurality of sample positions (excluding image data) from outside, and a data array in which the data is arranged according to the sample positions. A data compression device, comprising: a data compression unit that eliminates redundancy of the data based on spatial correlation and compresses the data.

2. The data compression device according to claim 1, wherein the data compression unit determines one of a lossless compression method and a lossy compression method from the type of the data based on a predetermined correspondence relationship. A data compression device that compresses the data by using the determined compression method.

3. The data compression device according to claim 1, wherein the data compression unit collects a plurality of pieces of compressed data having a common sample space into one file or as a plurality of associated files. A data compression device characterized by storing.

4. The data compression device according to claim 1, wherein the data compression unit divides the data array into blocks, and converts a DC component and an AC component for each block. Calculated, compressed storage means for compressing and storing the DC component and the AC component, reading compressed data from the compression storage means, extracting a DC component for each block from the compressed data, And a mean data extracting means for processing.

5. The data compression device according to claim 1, wherein the data compression unit transforms the data array into a frequency space by orthogonal transformation, and reduces a data after conversion. A data compression apparatus, wherein the data compression is performed by eliminating the redundancy based on a bias toward a band frequency component.

6. The data compression apparatus according to claim 1, wherein the sample position is a geographical position, and the data is terrain data, national census data, or weather data. A data compression device characterized by at least one of the following.

7. The data compression device according to claim 6, wherein the terrain data is at least one of elevation data, geological data, and geographic data indicating geographic information.

8. The data compression device according to claim 6, wherein the national census data is at least one of population data, traffic volume data, and other social statistics data.

9. A machine-readable recording medium recording a data compression program for causing a computer to function as the “data acquisition unit and data compression unit according to any one of claims 1 to 8”.