JP2003248750A

JP2003248750A - Purchase information processing device, purchase information clustering method and program

Info

Publication number: JP2003248750A
Application number: JP2002046625A
Authority: JP
Inventors: Shinsuke Azuma; 辰輔東
Original assignee: Mitsubishi Electric Corp
Current assignee: Mitsubishi Electric Corp
Priority date: 2002-02-22
Filing date: 2002-02-22
Publication date: 2003-09-05

Abstract

<P>PROBLEM TO BE SOLVED: To provide a purchase information processing device that easily and quickly extracts a cluster regulating a relation between a customer and a commodity from the purchase history information. <P>SOLUTION: A matrix forming section 2 allocates the customer and the commodity respectively to one of rows and columns on the basis of the purchase history information indicating commodity purchase history of the respective customer stored in a purchase history storing section 1 and forms a matrix table of which predetermined indicators respectively different from each other are respectively added to a matrix element of the commodity with the history of purchase of the customer and to that of the commodity without the history of purchase of the customer. A matrix rearranging section 4 rearranges the rows of the matrix table on the basis of summation of the indicators for the respective rows and rearranges the columns on the basis of summation of the indicators for the respective columns, and extracts the cluster regulated by distribution of the indicators on the matrix table. <P>COPYRIGHT: (C)2003,JPO

Description

Detailed Description of the Invention

【０００１】[0001]

【発明の属する技術分野】この発明は、小売業等におい
て顧客の趣味及び嗜好に応じて商品の提案をする際に用
いられ、顧客の分類及び商品の分類を行う購買情報処理
装置に係り、特に顧客の購買履歴データに応じて予め規
定された商品群の購入可能性の高低によって顧客群を抽
出する購買情報処理装置、顧客情報クラスタリング方法
及びプログラムに関するものである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a purchase information processing apparatus for classifying customers and classifying products, which is used for suggesting products in accordance with a customer's hobbies and tastes in a retail business. The present invention relates to a purchase information processing apparatus, a customer information clustering method, and a program for extracting a customer group according to the possibility of purchasing a product group defined in advance according to customer purchase history data.

【０００２】[0002]

【従来の技術】近年、情報通信システムの発展に伴っ
て、スーパー、コンビニエンスストア、及びデパート等
では、個々の商品についてその売り上げ記録をＰＯＳ
（ＰｏｉｎｔｏｆＳａｌｅｓ）端末装置によって入
力し、各店舗毎、各地域毎、又は本部において、コンピ
ュータに売り上げ記録を蓄積するようにしている。具体
的には、各商品にバーコードに代表される識別子を付与
して、ＰＯＳ端末装置によってこの識別子を読み取って
商品の特定を行っている。さらに、顧客側には、例えば
ポイントカードと呼ばれるカードを配布する。このカー
ドは、商品購入時にＰＯＳ端末装置によって顧客ＩＤが
読み取られる。これにより、顧客の特定が行われる。2. Description of the Related Art In recent years, with the development of information and communication systems, at supermarkets, convenience stores, department stores, etc., POS records of sales of individual products have been made.
(Point of Sales) It is input by the terminal device, and the sales record is stored in the computer at each store, each region, or the head office. Specifically, each product is provided with an identifier represented by a bar code, and the POS terminal device reads the identifier to specify the product. Furthermore, for example, a card called a point card is distributed to the customer side. The customer ID of this card is read by the POS terminal device when the product is purchased. Thereby, the customer is specified.

【０００３】加えて、近年、インターネット上のＷｅｂ
サイトを商店（仮想店舗）とした商取引が普及してい
る。このような仮想店舗では、商品の特定はもちろんの
こと、ログインＩＤによって顧客の特定も行うことがで
きる。In addition, in recent years, the Web on the Internet
Commercial transactions where the site is a store (virtual store) are widespread. In such a virtual store, not only the product can be specified, but also the customer can be specified by the login ID.

【０００４】一方、小売業者などは、売り上げを拡大す
るために顧客にダイレクトメールを送付したり、あるい
は販売促進のためのキャンペーンを実施する。一般にダ
イレクトメールなどは送付先数に比例した費用がかかる
ため、購入してもらえそうな商品を購入してくれそうな
顧客に推薦、提案する必要がある。On the other hand, retailers and the like send direct mails to customers in order to increase sales, or carry out campaigns for sales promotion. In general, direct mail, etc., is costly in proportion to the number of recipients, so it is necessary to recommend and propose to customers who are likely to purchase products that they are likely to purchase.

【０００５】上述のような商品の推薦・提案を行うに当
たって、各顧客について過去の購買履歴に基づいて購買
パターンの類似する顧客の集合（クラスタ）を抽出して
おけば、商品の推薦・提案を容易に行うことができる。In the above-described product recommendation / suggestion, the product recommendation / suggestion can be made by extracting a set (cluster) of customers having similar purchase patterns based on the past purchase history for each customer. It can be done easily.

【０００６】ある情報から所定の要素ごとのクラスタを
抽出するクラスタリング分析には、種々の手法がある。
例えば、マイケル・Ｊ・Ａ・ベリー、ゴードン・リノフ
著「データマイニング手法」（海文堂）には、Ｋ−ｍｅ
ａｎｓ法及び凝集法が紹介されている。これらの手法を
顧客の分類及び商品の分類に適用した場合、全顧客につ
いて顧客間の類似度を計算する必要がある。There are various methods for clustering analysis for extracting a cluster for each predetermined element from certain information.
For example, in "Data Mining Techniques" by Michael JA Berry and Gordon Rinoff (Kaibundou), K-me
The ans method and the agglomeration method are introduced. When these methods are applied to customer classification and product classification, it is necessary to calculate the similarity between customers for all customers.

【０００７】また、情報処理学会論文誌Ｖｏｌ．４２
Ｎｏ．８（２００１年８月）において、小柳、久保田、
仲瀬によって寄稿された「ＭａｔｒｉｘＣｌｕｓｔｅ
ｒｉｎｇ：ＣＲＭ向けの新しいデータマイニング手法」
には、マトリックス・クラスタリングという手法が提案
されている。この手法を顧客の分類及び商品の分類に適
用すると、１つのクラスタを発見するために複数の反復
処理が必要である。The IPSJ journal Vol. 42
No. 8 (August 2001), Koyanagi, Kubota,
Contributed by Nakase, "Matrix Cluster
ring: A new data mining method for CRM ”
Has proposed a method called matrix clustering. Applying this approach to customer classification and product classification requires multiple iterations to find a cluster.

【０００８】[0008]

【発明が解決しようとする課題】従来のクラスタリング
方法は以上のようになされているので、Ｋ−ｍｅａｎｓ
法及び凝集法では、全顧客について顧客間の類似度を計
算する必要がある。このため、顧客の組み合せが顧客数
の２乗に比例することを考慮すると、顧客数が増加する
につれて、その計算量が膨大なものとなってしまうとい
う課題があった。Since the conventional clustering method is performed as described above, K-means is used.
In the method and the aggregation method, it is necessary to calculate the similarity between customers for all customers. Therefore, considering that the combination of customers is proportional to the square of the number of customers, there has been a problem that the amount of calculation becomes enormous as the number of customers increases.

【０００９】また、マトリックス・クラスタリングで
は、１つのクラスタを発見する際に反復処理を行なう関
係上、処理時間が長くなってしまうという課題があっ
た。Further, in the matrix clustering, there is a problem that the processing time becomes long because iterative processing is performed when finding one cluster.

【００１０】この発明は上記のような問題点を解決する
ためになされたもので、顧客情報のクラスタ抽出に要す
る計算量及び処理時間を低減させることができる購買情
報処理装置、顧客情報クラスタリング方法及びプログラ
ムを得ることを目的とする。The present invention has been made to solve the above problems, and a purchase information processing apparatus, a customer information clustering method, and a customer information clustering method capable of reducing the amount of calculation and processing time required for extracting clusters of customer information. The purpose is to get the program.

【００１１】[0011]

【課題を解決するための手段】この発明に係る購買情報
処理装置は、顧客ごとの商品の購買履歴データを格納す
る購買履歴記憶手段と、購買履歴データにある顧客と商
品をそれぞれ行及び列の一方の項目として当てはめ、顧
客が購入した履歴がある商品の行列要素と購入した履歴
がない商品の行列要素とに互いに異なる所定の指標値を
それぞれ付与してなる行列テーブルを生成する購買情報
生成手段と、行列テーブルについて、行ごとの指標値の
総和に基づいて行を並び替えると共に、列ごとの指標値
の総和に基づいて列を並び替えて、当該行列テーブル上
の指標値の分布にて規定されるクラスタを抽出する購買
情報処理手段とを備えるものである。A purchase information processing apparatus according to the present invention includes a purchase history storage unit for storing purchase history data of products for each customer, and a customer and a product in the purchase history data in rows and columns, respectively. Purchasing information generating means which is applied as one item and generates a matrix table in which predetermined matrix index values different from each other are respectively given to the matrix elements of products for which the customer has a history of purchase and the matrix elements of products for which the customer has no history of purchase. For the matrix table, the rows are sorted based on the sum of the index values for each row, and the columns are sorted based on the sum of the index values for each column, and the distribution of the index values on the matrix table is specified. And a purchase information processing means for extracting the clusters.

【００１２】この発明に係る購買情報処理装置は、購買
情報生成手段が、顧客が購入した履歴がある商品の行列
要素に付与する指標値を１とし、購入した履歴がない商
品の行列要素に付与する指標値を０とするものである。In the purchase information processing apparatus according to the present invention, the purchase information generation means sets the index value given to the matrix element of the product having a history of customer purchase to 1 and gives it to the matrix element of the product having no purchase history. The index value to be set is 0.

【００１３】この発明に係る購買情報処理装置は、購買
情報処理手段が、行列テーブルについて、行ごとの指標
値の総和の昇順又は降順に行を並び替えると共に、列ご
との指標値の総和の昇順又は降順に列を並び替えて、当
該行列テーブル上の指標値の分布にて規定されるクラス
タを抽出するものである。In the purchase information processing apparatus according to the present invention, the purchase information processing means rearranges the rows in the matrix table in ascending or descending order of the sum of the index values for each row, and in ascending order of the sum of the index values for each column. Alternatively, the columns are rearranged in descending order to extract clusters defined by the distribution of index values on the matrix table.

【００１４】この発明に係る購買情報処理装置は、購買
情報処理手段が、行列テーブルの行ごとに指標値の総和
を各行要素の指標値に乗じてなる行重み付け行列テーブ
ルと、行列テーブルの列ごとに指標値の総和を各列要素
の指標値に乗じてなる列重み付け行列テーブルとを生成
し、元の行列テーブルについて、行重み付け行列テーブ
ルの列ごとの指標値の総和の昇順又は降順にそれぞれ対
応する列を並び替えると共に、列重み付け行列テーブル
の行ごとの指標値の総和の昇順又は降順にそれぞれ対応
する行を並び替えて、当該行列テーブル上の指標値の分
布にて規定されるクラスタを抽出するものである。In the purchase information processing apparatus according to the present invention, the purchase information processing means multiplies the index value of each row element by the sum of the index values for each row of the matrix table, and a row weighting matrix table for each column of the matrix table. And a column weighting matrix table that is obtained by multiplying the index values of each column element by the sum of the index values, and for the original matrix table, it corresponds to the ascending or descending order of the sum of the index values for each column of the row weighting matrix table. The columns to be sorted are sorted, and the rows corresponding to the ascending or descending order of the sum of the index values for each row of the column weighting matrix table are sorted to extract the clusters defined by the distribution of the index values on the matrix table. To do.

【００１５】この発明に係る購買情報処理装置は、購買
情報処理手段が、行列テーブルの行ごとに指標値の総和
が所定の閾値を越えているものには当該総和を行要素の
指標値に乗じ、当該閾値以下のものには行要素に予め規
定した値を付与してなる行重み付け行列テーブルと、行
列テーブルの列ごとに指標値の総和が所定の閾値を越え
ているものには当該総和を列要素の指標値に乗じ、当該
閾値以下のものには列要素に予め規定した値を付与して
なる列重み付け行列テーブルとを生成し、元の行列テー
ブルについて、行重み付け行列テーブルの列ごとの指標
値の総和の昇順又は降順にそれぞれ対応する列を並び替
えると共に、列重み付け行列テーブルの行ごとの指標値
の総和の昇順又は降順にそれぞれ対応する行を並び替え
て、当該行列テーブル上の指標値の分布にて規定される
クラスタを抽出するものである。In the purchase information processing apparatus according to the present invention, the purchase information processing means multiplies the index value of the row element by the sum of the index values for each row of the matrix table when the sum of the index values exceeds a predetermined threshold value. , A row weighting matrix table in which a row element is given a predetermined value for a threshold value or less, and a summation index value for each column of the matrix table exceeds the predetermined threshold value A column weighting matrix table that is obtained by multiplying the index value of a column element and assigning a predetermined value to the column element for those that are less than or equal to the threshold value is generated, and the original matrix table for each column of the row weighting matrix table is generated. The columns corresponding to the ascending order or the descending order of the sum of the index values are sorted, and the rows corresponding to the ascending order or the descending order of the sum of the index values for each row of the column weighting matrix table are sorted, respectively. And extracts the cluster defined by the distribution of the index values on Le.

【００１６】この発明に係る購買情報処理装置は、購買
情報処理手段が、予め規定した値を０として、行重み付
け行列テーブル及び列重み付け行列テーブルを生成する
ものである。In the purchase information processing apparatus according to the present invention, the purchase information processing means sets the predetermined value to 0 and generates the row weighting matrix table and the column weighting matrix table.

【００１７】この発明に係る購買情報処理装置は、購買
情報処理手段が、クラスタを抽出するごとにこれを規定
する指標値を０とした行列テーブルを生成し、当該行列
テーブルについてクラスタ抽出処理を繰り返すものであ
る。In the purchase information processing apparatus according to the present invention, each time the purchase information processing means extracts a cluster, it creates a matrix table in which the index value defining the cluster is set to 0, and the cluster extraction processing is repeated for the matrix table. It is a thing.

【００１８】この発明に係る購買情報クラスタリング方
法は、顧客ごとの商品の購買履歴データをメモリ領域に
形成する購買履歴形成ステップと、購買履歴データにあ
る顧客と商品をそれぞれ行及び列の一方の項目として当
てはめ、顧客が購入した履歴がある商品の行列要素と購
入した履歴がない商品の行列要素とに互いに異なる所定
の指標値をそれぞれ付与してなる行列テーブルをメモリ
領域に形成する行列テーブル形成ステップと、行列テー
ブルについて、行ごとの指標値の総和に基づいて行を並
び替えると共に、列ごとの指標値の総和に基づいて列を
並び替えて、当該行列テーブル上の指標値の分布にて規
定されるクラスタを抽出するクラスタリングステップと
を備えるものである。The purchase information clustering method according to the present invention includes a purchase history forming step of forming purchase history data of products for each customer in a memory area, and one item in a row or column for each customer and product in the purchase history data. A matrix table forming step in which a matrix table is formed in the memory area, each matrix element having a history element purchased by a customer and a matrix element of a commodity having no history of purchase are respectively given different index values. For the matrix table, the rows are sorted based on the sum of the index values for each row, and the columns are sorted based on the sum of the index values for each column, and the distribution of the index values on the matrix table is specified. And a clustering step for extracting the clusters.

【００１９】この発明に係るプログラムは、顧客ごとの
商品の購買履歴データを格納する購買履歴記憶手段、購
買履歴データにある顧客と商品をそれぞれ行及び列の一
方の項目として当てはめ、顧客が購入した履歴がある商
品の行列要素と購入した履歴がない商品の行列要素とに
互いに異なる所定の指標値をそれぞれ付与してなる行列
テーブルを生成する購買情報生成手段、行列テーブルに
ついて、行ごとの指標値の総和に基づいて行を並び替え
ると共に、列ごとの指標値の総和に基づいて列を並び替
えて、当該行列テーブル上の指標値の分布にて規定され
るクラスタを抽出する購買情報処理手段としてコンピュ
ータを機能させるものである。The program according to the present invention applies the purchase history storage means for storing the purchase history data of the product for each customer, the customer and the product in the purchase history data are applied as one of the items in the row and the column, and the customer purchases them. Purchasing information generating means for generating a matrix table in which a predetermined index value different from each other is assigned to the matrix element of the product having a history and the matrix element of the product having no history of purchase, and the index value of each row for the matrix table As a purchase information processing means that sorts the rows based on the sum of the index values of each column and sorts the columns based on the sum of the index values of each column, and extracts the clusters defined by the distribution of the index values on the matrix table. It makes a computer work.

【００２０】[0020]

【発明の実施の形態】以下、この発明の実施の一形態を
説明する。実施の形態１．図１はこの発明の実施の形態１による購
買情報処理装置の構成を示す図である。図１において、
１は購買履歴記録部（購買履歴記憶手段）であって、全
顧客について顧客の購買履歴を表す購買履歴情報が格納
される。２は行列生成部（購買情報生成手段）で、購買
履歴情報に基づいて各顧客と購買された商品（商品名）
との関係を表す行列（顧客−商品の行列テーブル）を生
成する。３は行列記憶部（購買情報生成手段）であっ
て、行列生成部２が生成した行列が格納される。４は行
列並び替え部（購買情報処理手段）で、行列記録部３に
格納された各行列について、各行毎の和及び各列毎の和
に応じて商品の並び替えを行う。なお、購買履歴記録部
１や行列記憶部３は、例えば本発明の購買情報処理装置
として機能するコンピュータ装置に搭載されるハードデ
ィスク装置や、他の外部記憶装置に対して購買履歴情報
を形成し格納するプログラムによって具現化される。ま
た、行列生成部２や行列並び替え部４は、例えば本発明
の購買情報処理装置として機能するコンピュータ装置の
ＣＰＵなどの演算手段が実行するプログラムによって具
現化される。BEST MODE FOR CARRYING OUT THE INVENTION An embodiment of the present invention will be described below. Embodiment 1. 1 is a diagram showing the configuration of a purchase information processing apparatus according to Embodiment 1 of the present invention. In FIG.
Reference numeral 1 denotes a purchase history recording unit (purchase history storage unit), which stores purchase history information indicating the purchase history of all customers. Reference numeral 2 denotes a matrix generation unit (purchase information generation means), which is a product (product name) purchased with each customer based on the purchase history information.
A matrix (customer-commodity matrix table) representing the relationship with is generated. A matrix storage unit (purchasing information generating means) 3 stores the matrix generated by the matrix generating unit 2. A matrix rearrangement unit (purchasing information processing means) 4 rearranges the products stored in the matrix recording unit 3 according to the sum of each row and the sum of each column. The purchase history recording unit 1 and the queue storage unit 3 form and store purchase history information in, for example, a hard disk device installed in a computer device functioning as a purchase information processing device of the present invention or another external storage device. It is embodied by the program. Further, the matrix generation unit 2 and the matrix rearrangement unit 4 are embodied by a program executed by a calculation unit such as a CPU of a computer device that functions as a purchase information processing device of the present invention.

【００２１】次に動作について説明する。先ず、ＰＯＳ
端末装置（図示せず）等から入力される売り上げデータ
は、購買履歴情報として購買履歴記憶部１に蓄積される
（購買履歴形成ステップ）。具体的には、購買履歴記憶
部１として機能するプログラムにより、入力データから
購買履歴情報が作成され、コンピュータ装置のメモリ領
域に記録される。図２は図１中の購買履歴記憶部に記憶
された購買履歴情報の一例を示す図である。図に示すよ
うに、購買履歴情報は、少なくとも顧客を特定するＩＤ
（顧客ＩＤ）、商品を特定するＩＤ（商品ＩＤ）を有し
ている。また、商品の購入した日時、単価、購入数、及
び金額（合計金額）などの情報も含まれている。Next, the operation will be described. First, POS
Sales data input from a terminal device (not shown) or the like is accumulated in the purchase history storage unit 1 as purchase history information (purchase history forming step). Specifically, the program functioning as the purchase history storage unit 1 creates purchase history information from the input data and records it in the memory area of the computer device. FIG. 2 is a diagram showing an example of purchase history information stored in the purchase history storage unit in FIG. As shown in the figure, the purchase history information is at least an ID that identifies a customer.
It has a (customer ID) and an ID (product ID) that identifies a product. It also includes information such as the date and time when the product was purchased, the unit price, the number of purchases, and the amount (total amount).

【００２２】行列生成部２は、購買履歴記憶部１から購
買履歴情報を所定の順序（例えば、古い順）に読みだし
て、購買履歴情報中の顧客（顧客ＩＤ）と商品（商品Ｉ
Ｄ）との組み合わせに基づいて顧客（顧客ＩＤ）と商品
（商品ＩＤ）との関係を表す行列（顧客−商品の行列テ
ーブル）を生成して行列記憶部３に格納する（行列テー
ブル形成ステップ）。顧客−商品の行列テーブルは、各
行列要素を２次元座標値に対応させて特定することがで
きる情報である。つまり、顧客−商品の行列テーブルに
対応する２次元座標面上での座標値によって、行や列の
各項目、これらが交差する行列要素の指標値が特定され
る。The matrix generation unit 2 reads the purchase history information from the purchase history storage unit 1 in a predetermined order (for example, oldest first), and the customer (customer ID) and product (product I) in the purchase history information.
A matrix (customer-product matrix table) representing the relationship between the customer (customer ID) and the product (product ID) is generated based on the combination with D) and stored in the matrix storage unit 3 (matrix table forming step). . The customer-commodity matrix table is information that allows each matrix element to be specified in association with a two-dimensional coordinate value. That is, the coordinate values on the two-dimensional coordinate plane corresponding to the customer-commodity matrix table specify the row and column items and the index value of the matrix element at which they intersect.

【００２３】図３は図１中の行列生成部によって生成さ
れた行列の一例を示す図である。図に示すように、この
行列は列方向が商品、行方向が顧客を表している。ここ
で、行列中の各要素（行列要素）に、指標値“１”が付
与されていると、当該要素の行に対応する顧客が列に対
応する商品を購入したことを示している。一方、行列中
の各要素に、指標値“０”が付与されていると、当該要
素の行に対応する顧客が列に対応する商品を購入したこ
とがないことを示している。例えば、顧客Ｂは商品
“ａ”，“ｃ”，及び“ｅ〜ｈ”は購入した実績がある
が、商品“ｂ”，“ｄ”，“ｉ”，及び“ｊ”の購入実
績はないことになる。なお、上記行列要素に付与される
指標値は、１や０に限らず、互いに異なる値であればよ
い。FIG. 3 is a diagram showing an example of a matrix generated by the matrix generator in FIG. As shown in the figure, this matrix represents products in the column direction and customers in the row direction. Here, when the index value “1” is given to each element (matrix element) in the matrix, it indicates that the customer corresponding to the row of the element purchases the product corresponding to the column. On the other hand, when the index value “0” is given to each element in the matrix, it indicates that the customer corresponding to the row of the element has never purchased the product corresponding to the column. For example, the customer B has a history of purchasing the products “a”, “c”, and “e to h”, but has no history of purchasing the products “b”, “d”, “i”, and “j”. It will be. The index values given to the matrix elements are not limited to 1 and 0, and may be different values.

【００２４】次に、行列生成部２は、上述した行列を生
成すると、各行毎及び各列毎にその総和をカウントす
る。例えば、顧客“Ａ”が商品“ａ”を購入したという
購買履歴情報を購買履歴記憶部１から読み出すと、顧客
−商品行列において該当する要素（顧客が“Ａ”で商品
が“ａ”である要素）を“１”とするとともに、行（顧
客）“Ａ”の総和に１を加算し、列（商品）“ａ”の総
和に１を加算する。Next, when the matrix generator 2 generates the above-mentioned matrix, it counts the total sum for each row and each column. For example, when the purchase history information that the customer “A” has purchased the product “a” is read from the purchase history storage unit 1, the corresponding element in the customer-product matrix (the customer is “A” and the product is “a”). The element is set to "1", 1 is added to the sum of rows (customers) "A", and 1 is added to the sum of columns (products) "a".

【００２５】行列記憶部３に上述のようにして生成およ
び格納された行列に対して、行列並び替え部４は、行毎
の総和の順に行を並び替えるとともに、列毎の総和の順
に列を並び替えて、並び替え行列を生成する。With respect to the matrix generated and stored in the matrix storage unit 3 as described above, the matrix rearrangement unit 4 rearranges the rows in the order of the total sum for each row and the columns in the order of the total sum for each column. Sorting is performed to generate a sorting matrix.

【００２６】図４はこの発明の実施の形態１による購買
情報処理装置によって得られた並び替え行列の一例を示
す図である。図に示すように、行列並び替え部４は、行
を並び替える際には、例えばその総和が大きい方が上に
なるように並び替えを行い、列を並び替える際には、そ
の総和が大きい方が左になるように並び替えを行う。な
お、行の並び替え及び列の並び替えはどちらを先に行っ
てもよい。FIG. 4 is a diagram showing an example of the rearrangement matrix obtained by the purchase information processing apparatus according to the first embodiment of the present invention. As shown in the figure, the matrix rearrangement unit 4 rearranges the rows so that, for example, the one with the larger total sum becomes higher, and the rearrangement with the column has the larger total sum. Rearrange so that the left side is on the left side. Either of the rearrangement of rows and the rearrangement of columns may be performed first.

【００２７】このようにして並び替えが行われた行列で
は、図に示すように、左上側に行く程、各要素が“１”
である割合が高くなり、右下側に行く程、各要素が
“０”である割合が高くなる。これを利用して、行列並
び替え部４は、上記行列において、左上側に位置する部
分をクラスタとして抽出すれば、購入可能性の高い商品
と購入意欲の高い顧客とを選択することができる（クラ
スタリングステップ）。上述したように、顧客−商品の
行列テーブルに対応する２次元座標面上での座標値によ
って、行や列の各項目、これらが交差する行列要素の指
標値が特定される。これにより、指標値に注目して、顧
客−商品の行列テーブルに対応する２次元座標面上での
座標値を求めれば、クラスタに関する座標値群を求める
ことができる。In the matrix thus rearranged, as shown in the figure, each element is "1" toward the upper left side.
Is higher, and the ratio of each element is “0” is higher toward the lower right side. Utilizing this, the matrix rearrangement unit 4 can select a product with a high purchase possibility and a customer with a high purchase intention by extracting the portion located on the upper left side in the above matrix as a cluster ( Clustering step). As described above, the coordinate values on the two-dimensional coordinate plane corresponding to the customer-commodity matrix table specify the row and column items and the index value of the matrix element at which they intersect. Thereby, if the coordinate values on the two-dimensional coordinate plane corresponding to the customer-product matrix table are obtained while paying attention to the index value, the coordinate value group regarding the cluster can be obtained.

【００２８】このあと、行列並び替え部４によって抽出
された購入可能性の高い商品と購入意欲の高い顧客とを
規定するクラスタを元にして購買情報の分類データを生
成し、不図示の表示装置などでユーザに提示するように
してもよい。Thereafter, the classification data of the purchase information is generated based on the clusters defining the products with high purchase possibility and the customers with high purchase intention, which are extracted by the matrix rearrangement unit 4, and a display device (not shown) is generated. It may be presented to the user by, for example.

【００２９】以上のように、この実施の形態１によれ
ば、購買履歴情報から商品と顧客との関係を示す行列
（顧客−商品の行列テーブル）を生成して、行ごと及び
列ごとの総和に基づいて行及び列を並び替えて、行列上
の行列要素の指標値の分布にて規定されるクラスタを抽
出するので、複雑な計算を要することなく、クラスタ抽
出を容易にしかも高速に行うことができる。As described above, according to the first embodiment, a matrix (customer-product matrix table) showing the relationship between products and customers is generated from the purchase history information, and the sum total for each row and each column is generated. The rows and columns are rearranged based on, and the clusters defined by the distribution of the index values of the matrix elements on the matrix are extracted, so cluster extraction can be performed easily and at high speed without requiring complicated calculations. You can

【００３０】実施の形態２．上記実施の形態１では、各
顧客及び各商品を平等に、つまり、均等に扱って並び替
え行列を生成する例を示したが、この実施の形態２は顧
客毎及び商品毎に重み付けを行って、並び替え行列を生
成するものである。Embodiment 2. In the above-described first embodiment, an example in which each customer and each product are treated equally, that is, the rearrangement matrix is generated evenly, is shown, but in the second embodiment, each customer and each product are weighted. , To generate a rearrangement matrix.

【００３１】この実施の形態２による購買情報処理装置
の基本的な構成は、上記実施の形態１と同様であるが、
行列並び替え部４の機能が上記実施の形態１で説明した
機能とは異なる。また、行列記憶部３には、上記実施の
形態１で説明した行列（顧客−商品の行列テーブル）が
格納されているものとする。The basic configuration of the purchase information processing apparatus according to the second embodiment is the same as that of the first embodiment,
The function of the matrix rearrangement unit 4 is different from the function described in the first embodiment. Further, it is assumed that the matrix storage unit 3 stores the matrix (customer-product matrix table) described in the first embodiment.

【００３２】次に動作について説明する。ここでは、上
記実施の形態１で説明した機能とは異なる行列並び替え
処理について説明し、他の動作は上記実施の形態１と同
様であるので説明を省略する。先ず、行列並び替え部４
は、行列記憶部３に格納された行列（顧客−商品の行列
テーブル：図３参照）に対して行及び列についての重み
付け行列を生成する。Next, the operation will be described. Here, a matrix rearrangement process different from the function described in the first embodiment will be described, and the other operations are the same as those in the first embodiment, and thus the description thereof will be omitted. First, the matrix rearrangement unit 4
Generates a weighting matrix for rows and columns for the matrix (customer-commodity matrix table: see FIG. 3) stored in the matrix storage unit 3.

【００３３】図５はこの発明の実施の形態２による購買
情報処理装置によって得られた並び替え行列を説明する
説明図であり、（ａ）は行重み付け行列を示し、（ｂ）
は列重み付け行列を示し、（ｃ）は並び替え後の行列を
示している。行列並び替え部４は、図５（ａ）に示すよ
うに、行列記憶部３に格納された行列について、行ごと
に指標値の総和を各行要素に乗じた行重み付け行列を生
成する。同様にして、図５（ｂ）に示すように、列ごと
に指標値の総和を各列要素に乗じた列重み付け行列を生
成する。FIG. 5 is an explanatory view for explaining a rearrangement matrix obtained by the purchase information processing apparatus according to the second embodiment of the present invention, where (a) shows a row weighting matrix and (b).
Shows a column weighting matrix, and (c) shows the matrix after rearrangement. As shown in FIG. 5A, the matrix rearrangement unit 4 generates a row weighting matrix in which each row element is multiplied by the sum of the index values for each row of the matrix stored in the matrix storage unit 3. Similarly, as shown in FIG. 5B, a column weighting matrix in which each column element is multiplied by the sum of the index values for each column is generated.

【００３４】次に、行列並び替え部４では、行重み付け
行列及び列重み付け行列を参照して、顧客−商品の行列
テーブルについて、行重み付け行列における各列の指標
値の総和の順に、これに対応する列を並び替えるととも
に、列重み付け行列における各行の指標値の総和の順
に、これに対応する行を並び替えて、図５（ｃ）に示す
ような並び替え後の行列を生成する。なお、行の並び替
え及び列の並び替えはどちらを先に行ってもよい。Next, the matrix rearrangement unit 4 refers to the row weighting matrix and the column weighting matrix and, in the matrix table of the customer-commodity, responds in the order of the sum of the index values of each column in the row weighting matrix. In addition to rearranging the columns, the corresponding rows are rearranged in the order of the sum of the index values of each row in the column weighting matrix to generate a rearranged matrix as shown in FIG. 5C. Either of the rearrangement of rows and the rearrangement of columns may be performed first.

【００３５】ところで、行列記憶部３に格納された顧客
−商品の行列テーブルにおいて、その行毎又は列毎の総
和は、それぞれ商品毎又は顧客毎の重要性を示してい
る。これを鑑みて、これら行の総和又は列の総和を重み
係数として各要素に乗算する。このようにして、並び替
えを行うことで、同数の商品を購入した顧客であって
も、例えばより重要な商品（つまり、並び替え後の行列
の左側に位置して、多数の顧客に購入されている商品）
を購入した顧客が図中上側に位置づけられることにな
る。この結果、図中左上側に行く程、各要素の指標値が
“１”である割合が高くなる。この指標値の分布を元に
クラスタを抽出する。By the way, in the customer-commodity matrix table stored in the matrix storage unit 3, the total sum for each row or each column indicates the importance for each commodity or each customer. In consideration of this, each element is multiplied by the sum of these rows or the sum of columns as a weighting coefficient. By sorting in this way, even customers who have purchased the same number of products can purchase, for example, more important products (i.e., they can be purchased by a large number of customers by being located on the left side of the sorted queue). Products)
The customer who has purchased is positioned on the upper side in the figure. As a result, the ratio of the index value of each element being “1” becomes higher toward the upper left side in the figure. Clusters are extracted based on this distribution of index values.

【００３６】以上のように、この実施の形態２によれ
ば、購買履歴情報から商品と顧客との関係を示す行列
（顧客−商品の行列テーブル）を生成して、この行列の
各要素について行毎及び列毎の総和を重み係数として重
み付けして行毎及び列毎に重み付き総和を求め、これら
重み付き総和に応じて顧客−商品行列について行及び列
を並び替えて、行列上の行列要素の指標値の分布にて規
定されるクラスタを抽出するので、商品及び顧客につい
て重み付けを行ってしかもクラスタ抽出を容易に高速に
行うことができる。As described above, according to the second embodiment, a matrix (customer-product matrix table) showing the relationship between the product and the customer is generated from the purchase history information, and a row is created for each element of this matrix. The weighted sum is calculated for each row and each column, and the weighted sum is obtained for each row and each column. The rows and columns of the customer-product matrix are rearranged according to these weighted sums, and matrix elements on the matrix are arranged. Since the clusters defined by the distribution of the index values are extracted, it is possible to weight the products and customers and to easily and quickly extract the clusters.

【００３７】実施の形態３．上記実施の形態２では、行
毎の総和を顧客に対する重み係数（顧客重み係数）と
し、列毎の総和を商品に対する重み係数（商品重み係
数）とする例を示したが、この実施の形態３は顧客重み
係数及び商品重み係数に対して閾値を設定するものであ
る。Embodiment 3. In the above-described second embodiment, an example is shown in which the sum of each row is used as a weighting coefficient for customers (customer weighting coefficient) and the sum of each column is used as a weighting coefficient for products (product weighting coefficient). Is to set a threshold for the customer weight coefficient and the product weight coefficient.

【００３８】この実施の形態２による購買情報処理装置
の基本的な構成は、上記実施の形態１と同様であるが、
行列並び替え部４の機能が上記実施の形態１，２で説明
した機能とは異なる。また、行列記憶部３には、上記実
施の形態１で説明した行列（顧客−商品行列テーブル）
が格納されているものとする。The basic structure of the purchase information processing apparatus according to the second embodiment is the same as that of the first embodiment,
The function of the matrix rearrangement unit 4 is different from the function described in the first and second embodiments. Further, the matrix storage unit 3 stores the matrix (customer-product matrix table) described in the first embodiment.
Is stored.

【００３９】次に動作について説明する。ここでは、上
記実施の形態１，２で説明した機能とは異なる行列並び
替え処理について説明し、他の動作は上記実施の形態１
と同様であるので説明を省略する。先ず、行列並び替え
部４は、行列記憶部３に格納された顧客−商品行列に対
して、行列の各要素に対応する行又は列の指標値の総和
が予め設定された閾値を越える際には、当該総和を行又
は列要素の指標値に乗じ、各要素の指標値の総和が閾値
以下である際には、要素を０として重み付け行列を生成
する。Next, the operation will be described. Here, a matrix rearrangement process different from the functions described in the first and second embodiments will be described, and other operations will be described in the first embodiment.
The description is omitted because it is similar to the above. First, the matrix rearrangement unit 4 determines whether the sum of index values of rows or columns corresponding to each element of the matrix exceeds a preset threshold value with respect to the customer-product matrix stored in the matrix storage unit 3. Multiplies the sum total by the index value of the row or column element, and when the sum total of the index values of the respective elements is less than or equal to the threshold value, sets the element to 0 and generates a weighting matrix.

【００４０】図６はこの発明の実施の形態３による購買
情報処理装置によって得られた並び替え行列を説明する
説明図であり、（ａ）は行重み付け行列を示し、（ｂ）
は列重み付け行列を示し、（ｃ）は並び替え後の行列を
示している。行列並び替え部４は、図６（ａ）に示すよ
うに、行列記憶部３に格納された行列について、行列の
各要素に対応する行の指標値の総和が予め設定された閾
値（図示の例では、４である）を越える際には、当該行
の総和を要素に乗じ、行列の各要素に対応する行の指標
値の総和が閾値以下である際には、要素を０として行重
み付け行列を生成する。同様にして、図６（ｂ）に示す
ように、行列の各要素に対応する列の指標値の総和が予
め設定された閾値（図示の例では、２である）を越える
際には、当該列の総和を要素に乗じ、行列の各要素に対
応する列の指標値の総和が閾値以下である際には、要素
を０として列重み付け行列を生成する。FIG. 6 is an explanatory view for explaining a rearrangement matrix obtained by the purchase information processing apparatus according to the third embodiment of the present invention, where (a) shows a row weighting matrix and (b).
Shows a column weighting matrix, and (c) shows the matrix after rearrangement. As shown in FIG. 6A, the matrix rearrangement unit 4 sets a threshold value (shown in the figure) in which the sum of the index values of the rows corresponding to the elements of the matrix is preset for the matrix stored in the matrix storage unit 3. When it exceeds 4, the element is multiplied by the total sum of the row, and when the total sum of the index values of the row corresponding to each element of the matrix is less than or equal to the threshold value, the element is set to 0 and the row weighting is performed. Generate a matrix. Similarly, as shown in FIG. 6B, when the sum of the index values of the columns corresponding to the respective elements of the matrix exceeds a preset threshold value (2 in the illustrated example), An element is multiplied by the total sum of columns, and when the total sum of the index values of the column corresponding to each element of the matrix is equal to or less than the threshold value, the element is set to 0 to generate a column weighting matrix.

【００４１】次に、行列並び替え部４では、行重み付け
行列及び列重み付け行列を参照して、顧客−商品の行列
テーブルについて、行重み付け行列における各列の指標
値の総和の順に、これに対応する列を並び替えるととも
に、列重み付け行列における各行の指標値の総和の順
に、これに対応する行を並び替えて、図６（ｃ）に示す
ような並び替え後の行列を生成する。なお、行の並び替
え及び列の並び替えはどちらを先に行ってもよい。Next, the matrix rearrangement unit 4 refers to the row weighting matrix and the column weighting matrix and, in the matrix table of the customer-commodity, corresponds to the sum of the index values of each column in the row weighting matrix in this order. The columns to be sorted are rearranged, and the corresponding rows are rearranged in the order of the sum of the index values of each row in the column weighting matrix to generate a rearranged matrix as shown in FIG. 6C. Either of the rearrangement of rows and the rearrangement of columns may be performed first.

【００４２】このようにして、顧客−商品行列におい
て、行の総和及び列の総和についてそれぞれ閾値を設定
して、行の総和及び列の総和がそれぞれの閾値を越えた
か否かによって、重み付けを行うか零とするかを決定す
るので、例えば重要でない商品（つまり、右側に位置し
て、少数の顧客にしか購入されていない商品）を多数購
入した顧客が図中上側に位置づけられることを避けるこ
とができる。この結果、図中左上側に行く程、各要素の
指標値が“１”である割合が高くなる。この指標値の分
布を元にクラスタを抽出する。Thus, in the customer-commodity matrix, thresholds are set for the sum of rows and the sum of columns, and weighting is performed depending on whether the sum of rows and the sum of columns exceed their respective thresholds. Since it decides whether to set it to zero or zero, avoid placing the customer who purchased many unimportant products (that is, products located on the right side and purchased by only a few customers) in the upper part of the figure. You can As a result, the ratio of the index value of each element being “1” becomes higher toward the upper left side in the figure. Clusters are extracted based on this distribution of index values.

【００４３】以上のように、この実施の形態３によれ
ば、顧客−商品行列について、行の総和及び列の総和に
ついてそれぞれ閾値を設定して、行の総和及び列の総和
が閾値を越えたか否かによって、各要素に重み付けを行
うか零とするかを決定するので、クラスタ抽出を容易に
高速に行うことができるばかりでなく、精度よくクラス
タ抽出を行うことができる。As described above, according to the third embodiment, with respect to the customer-commodity matrix, thresholds are set for the sum of rows and the sum of columns, respectively, and whether the sum of rows and the sum of columns exceed the threshold. Depending on whether or not each element is weighted or set to zero, it is possible not only to easily perform cluster extraction at high speed, but also to perform cluster extraction with high accuracy.

【００４４】実施の形態４．上記実施の形態では、顧客
−商品行列の要素が“１”である割合が多い領域、つま
り、クラスタを行列の左上側部分に抽出する例を示した
が、この実施の形態４は抽出したクラスタ内の要素を全
て０として再度同様の手法で処理を繰り返すことで、逐
次クラスタの抽出を行うものである。Fourth Embodiment In the above-described embodiment, an example in which an area in which the ratio of the elements of the customer-product matrix is “1” is high, that is, a cluster is extracted in the upper left part of the matrix, is shown. All the elements inside are set to 0, and the process is repeated again in the same manner to sequentially extract clusters.

【００４５】次に動作について説明する。先ず、行列並
び替え部４は、上記実施の形態で説明した操作によって
得られた並び替え後の行列についてクラスタを抽出す
る。そして、このクラスタ内の全要素を０として、再び
上記実施の形態で説明した操作で並び替えを実行する。Next, the operation will be described. First, the matrix rearrangement unit 4 extracts clusters from the rearranged matrix obtained by the operation described in the above embodiment. Then, with all the elements in this cluster set to 0, the sorting is executed again by the operation described in the above embodiment.

【００４６】図７はこの発明の実施の形態４による購買
情報処理装置によって得られた並び替え行列を説明する
説明図であり、（ａ）は行重み付け行列を示し、（ｂ）
は列重み付け行列を示し、（ｃ）は並び替え後の行列を
示している。図示の例では、行列並び替え部４は、上記
実施の形態２で説明した操作によって得られた並び替え
後の行列について、左上側に位置する４行５列を第１の
クラスタとして抽出する。そして、この第１のクラスタ
内の全要素を０として、再び実施の形態２で説明した手
法で並び替えを実行する。つまり、図７（ａ）に示すよ
うに、第１のクラスタ内の全要素を０とした並び替え後
の行列の各要素に対応する行の総和を乗じた行列を行重
み付け行列として生成する。このとき、行重み付け行列
の列毎にその総和を求める。FIG. 7 is an explanatory view for explaining a rearrangement matrix obtained by the purchase information processing apparatus according to the fourth embodiment of the present invention, where (a) shows a row weighting matrix and (b).
Shows a column weighting matrix, and (c) shows the matrix after rearrangement. In the illustrated example, the matrix rearrangement unit 4 extracts 4 rows and 5 columns located on the upper left side as a first cluster from the rearranged matrix obtained by the operation described in the second embodiment. Then, with all the elements in the first cluster set to 0, the sorting is executed again by the method described in the second embodiment. That is, as shown in FIG. 7A, a matrix obtained by multiplying the total sum of rows corresponding to each element of the rearranged matrix in which all the elements in the first cluster are 0 is generated as a row weighting matrix. At this time, the sum is calculated for each column of the row weighting matrix.

【００４７】同様にして、図７（ｂ）に示すように、第
１のクラスタ内の全要素を零とした重み付け並び替え行
列の各要素に対応する列の総和を乗じた行列を列重み付
け行列として生成する。このときも、列重み付け行列の
行毎にその総和を求める。Similarly, as shown in FIG. 7B, a column weighting matrix is obtained by multiplying the sum of the columns corresponding to each element of the weighting rearrangement matrix in which all the elements in the first cluster are zero. Generate as. Also at this time, the sum is obtained for each row of the column weighting matrix.

【００４８】次に、行及び列の重み付け行列を参照し
て、顧客−商品行列について行の重み付け総和の順に行
を並び替えるとともに、列重み付け総和の順に列を並び
替えて、再び重み付け並び替え行列を生成する（図７
（ｃ）参照）。なお、図７（ｃ）において、斜字体で示
す“１”は零と見なして並び替えが行われている。Next, referring to the row and column weighting matrix, the rows are rearranged in the order of the weighted sum of the customer-product matrix, the columns are rearranged in the order of the column weighted sum, and the weighted rearrangement matrix is again displayed. Is generated (Fig. 7
(See (c)). In FIG. 7C, rearrangement is performed by regarding “1” shown in italics as zero.

【００４９】以上のように、この実施の形態４によれ
ば、第１のクラスタを抽出した後、第１のクラスタ内の
全要素を零として、再び並び替えを実行するようにした
から、第２のクラスタを容易に抽出できる。このように
すれば、第（Ｎ−１）のクラスタを抽出した後、第（Ｎ
−１）のクラスタ内の全要素を零として、再び並び替え
を実行して、第Ｎのクラスタを抽出することができる。As described above, according to the fourth embodiment, after the first cluster is extracted, all the elements in the first cluster are set to zero, and the rearrangement is executed again. Two clusters can be easily extracted. In this way, after extracting the (N-1) th cluster, the (N-1) th cluster is extracted.
It is possible to extract the Nth cluster by executing the rearrangement again by setting all the elements in the cluster of -1) to zero.

【００５０】なお、上述した実施の形態では、顧客が商
品を購入して得られた購買履歴情報に応じてクラスタを
抽出する例について説明したが、例えば顧客がＷｅｂペ
ージを閲覧する場合やインターネット上で動画又は音楽
等のコンテンツを閲覧しダウンロードする際に得られる
顧客履歴情報についても同様にしてクラスタ抽出を行う
ことができる。つまり、本発明の商品は、Ｗｅｂページ
や上記コンテンツなどの情報も含む概念で定義される。In the above-described embodiment, an example has been described in which the cluster is extracted according to the purchase history information obtained by the customer purchasing the product. However, for example, when the customer browses a Web page or on the Internet. With respect to customer history information obtained when browsing and downloading contents such as moving images or music, cluster extraction can be similarly performed. That is, the product of the present invention is defined by a concept including information such as a Web page and the above content.

【００５１】[0051]

【発明の効果】以上のように、この発明によれば、顧客
ごとの商品の購買履歴データを格納しておき、購買履歴
データにある顧客と商品をそれぞれ行及び列の一方の項
目として当てはめ、顧客が購入した履歴がある商品の行
列要素と購入した履歴がない商品の行列要素とに互いに
異なる所定の指標値をそれぞれ付与してなる行列テーブ
ルを生成し、この行列テーブルについて、行ごとの指標
値の総和に基づいて行を並び替えると共に、列ごとの指
標値の総和に基づいて列を並び替えて、当該行列テーブ
ル上の指標値の分布にて規定されるクラスタを抽出する
ので、クラスタ抽出を容易にしかも高速に行うことがで
きるという効果がある。As described above, according to the present invention, the purchase history data of the product for each customer is stored, and the customer and the product in the purchase history data are applied as one item of the row and column, respectively. A matrix table is created by assigning different index values to the matrix elements of products that have a history of customer purchases and the matrix elements of products that do not have a history of purchases. The rows are sorted based on the sum of the values, and the columns are sorted based on the sum of the index values for each column to extract the clusters defined by the distribution of the index values on the matrix table. There is an effect that can be performed easily and at high speed.

【００５２】この発明によれば、顧客が購入した履歴が
ある商品の行列要素に付与する指標値を１とし、購入し
た履歴がない商品の行列要素に付与する指標値を０とす
るので、クラスタ抽出の際の演算量が少なくなって、高
速にクラスタ抽出を行うことができるという効果があ
る。According to the present invention, the index value given to the matrix element of the product having a history of purchase by the customer is set to 1, and the index value given to the matrix element of the product having no purchase history is set to 0. There is an effect that the amount of calculation at the time of extraction is reduced and cluster extraction can be performed at high speed.

【００５３】この発明によれば、行列テーブルについ
て、行ごとの指標値の総和の昇順又は降順に行を並び替
えると共に、列ごとの指標値の総和の昇順又は降順に列
を並び替えて、当該行列テーブル上の指標値の分布にて
規定されるクラスタを抽出するので、クラスタ抽出の際
の演算が簡単になるという効果がある。According to the present invention, in the matrix table, the rows are sorted in the ascending or descending order of the sum of the index values for each row, and the columns are sorted in the ascending or descending order of the sum of the index values for each column. Since the clusters defined by the distribution of the index values on the matrix table are extracted, there is an effect that the calculation at the time of cluster extraction becomes simple.

【００５４】この発明によれば、行列テーブルの行ごと
に指標値の総和を各行要素の指標値に乗じてなる行重み
付け行列テーブルと、行列テーブルの列ごとに指標値の
総和を各列要素の指標値に乗じてなる列重み付け行列テ
ーブルとを生成し、元の行列テーブルについて、行重み
付け行列テーブルの列ごとの指標値の総和の昇順又は降
順にそれぞれ対応する列を並び替えると共に、列重み付
け行列テーブルの行ごとの指標値の総和の昇順又は降順
にそれぞれ対応する行を並び替えて、当該行列テーブル
上の指標値の分布にて規定されるクラスタを抽出するの
で、商品及び顧客について重み付けを行ってしかもクラ
スタ抽出を容易に高速に行うことができるという効果が
ある。According to the present invention, the row weighting matrix table formed by multiplying the sum of the index values for each row of the matrix table by the index value of each row element, and the sum of the index values for each column of the matrix table of each column element And a column weighting matrix table that is obtained by multiplying the index values, and rearranges the corresponding columns in the ascending or descending order of the sum of the index values for each column of the row weighting matrix table with respect to the original matrix table, and the column weighting matrix The corresponding rows are sorted in ascending or descending order of the sum of the index values for each row in the table, and the clusters defined by the distribution of the index values on the matrix table are extracted, so weighting is applied to products and customers. Moreover, there is an effect that cluster extraction can be performed easily and at high speed.

【００５５】この発明によれば、行列テーブルの行ごと
に指標値の総和が所定の閾値を越えているものには当該
総和を行要素の指標値に乗じ、当該閾値以下のものには
行要素に予め規定した値を付与してなる行重み付け行列
テーブルと、行列テーブルの列ごとに指標値の総和が所
定の閾値を越えているものには当該総和を列要素の指標
値に乗じ、当該閾値以下のものには列要素に予め規定し
た値を付与してなる列重み付け行列テーブルとを生成
し、元の行列テーブルについて、行重み付け行列テーブ
ルの列ごとの指標値の総和の昇順又は降順にそれぞれ対
応する列を並び替えると共に、列重み付け行列テーブル
の行ごとの指標値の総和の昇順又は降順にそれぞれ対応
する行を並び替えて、当該行列テーブル上の指標値の分
布にて規定されるクラスタを抽出するので、クラスタ抽
出を容易に高速に行うことができるばかりでなく、閾値
に応じて精度よくクラスタ抽出を行うことができるとい
う効果がある。According to the present invention, when the sum of the index values for each row of the matrix table exceeds a predetermined threshold value, the sum of the index values of the row elements is multiplied by the sum. Row weighting matrix table in which a predetermined value is added to the matrix table, and the sum of the index values for each column of the matrix table exceeds a predetermined threshold, the sum is multiplied by the index value of the column element For the following, a column weighting matrix table in which column elements are given predetermined values is generated, and the original matrix table is sorted in ascending or descending order of the sum of the index values for each column of the row weighting matrix table. In addition to rearranging the corresponding columns, rearranging the corresponding rows in ascending or descending order of the sum of the index values for each row of the column weighting matrix table, the columns specified by the distribution of the index values in the matrix table are sorted. Since extracting the static, it is possible not only to perform the cluster extraction easily at high speed, there is an effect that it is possible to accurately perform cluster extraction according to the threshold.

【００５６】この発明によれば、予め規定した値を０と
して、行重み付け行列テーブル及び列重み付け行列テー
ブルを生成するので、簡単にしかも精度良くクラスタ抽
出を行うことができるという効果がある。According to the present invention, the row weighting matrix table and the column weighting matrix table are generated by setting the predetermined value to 0, so that there is an effect that cluster extraction can be performed easily and accurately.

【００５７】この発明によれば、クラスタを抽出するご
とにこれを規定する指標値を０とした行列テーブルを生
成し、当該行列テーブルについてクラスタ抽出処理を繰
り返すので、連続してクラスタ抽出を容易に行うことが
できるという効果がある。According to the present invention, each time a cluster is extracted, a matrix table in which the index value defining this is set to 0 is generated, and the cluster extraction process is repeated for the matrix table. The effect is that it can be done.

[Brief description of drawings]

【図１】この発明の実施の形態１による購買情報処理
装置の構成を示す図である。FIG. 1 is a diagram showing a configuration of a purchase information processing apparatus according to a first embodiment of the present invention.

【図２】図１中の購買履歴記憶部に記憶された購買履
歴情報の一例を示す図である。FIG. 2 is a diagram showing an example of purchase history information stored in a purchase history storage unit in FIG.

【図３】図１中の行列生成部によって生成された行列
の一例を示す図である。FIG. 3 is a diagram showing an example of a matrix generated by a matrix generator in FIG.

【図４】この発明の実施の形態１による購買情報処理
装置によって得られた並び替え行列の一例を示す図であ
る。FIG. 4 is a diagram showing an example of a rearrangement matrix obtained by the purchase information processing apparatus according to the first embodiment of the present invention.

【図５】この発明の実施の形態２による購買情報処理
装置によって得られた並び替え行列を説明する説明図で
ある。FIG. 5 is an explanatory diagram illustrating a rearrangement matrix obtained by the purchase information processing apparatus according to the second embodiment of the present invention.

【図６】この発明の実施の形態３による購買情報処理
装置によって得られた並び替え行列を説明する説明図で
ある。FIG. 6 is an explanatory diagram illustrating a rearrangement matrix obtained by the purchase information processing apparatus according to the third embodiment of the present invention.

【図７】この発明の実施の形態４による購買情報処理
装置によって得られた並び替え行列を説明する説明図で
ある。FIG. 7 is an explanatory diagram illustrating a sorting matrix obtained by a purchase information processing apparatus according to Embodiment 4 of the present invention.

[Explanation of symbols]

１購買履歴記憶部（購買履歴記憶手段）、２行列生
成部（購買情報生成手段）、３行列記憶部（購買情報
生成手段）、４行列並び替え部（購買情報処理手
段）。1 purchase history storage unit (purchase history storage unit), 2 matrix generation unit (purchase information generation unit), 3 matrix storage unit (purchase information generation unit), 4 matrix rearrangement unit (purchase information processing unit).

Claims

[Claims]

1. A purchase history storage means for storing purchase history data of a product for each customer, and a history of purchase by a customer by applying the customer and the product in the purchase history data as one of items in rows and columns, respectively. Purchasing information generating means for generating a matrix table in which predetermined matrix index values that are different from each other are assigned to the matrix elements of products and the matrix elements of products for which there is no history of purchase; A purchase information processing unit that sorts the rows based on the total sum, sorts the columns based on the total sum of the index values for each column, and extracts the clusters defined by the distribution of the index values on the matrix table. Purchasing information processing device equipped.

2. The purchase information generating means sets an index value of 1 to a matrix element of a product that a customer has a history of purchasing, and sets an index value of 0 to a matrix element of a product that has no history of purchase by a customer.
The purchase information processing apparatus according to claim 1, wherein:

3. The purchase information processing means sorts the rows of the matrix table in ascending or descending order of the sum of index values for each row, and sorts the columns in ascending or descending order of the sum of index values for each column. The customer information processing apparatus according to claim 1 or 2, wherein a cluster defined by a distribution of index values on the matrix table is extracted.

4. The purchase information processing means calculates a sum of index values for each row of the matrix table by multiplying the index value of each row element by a row weighting matrix table, and a sum of index values for each column of the matrix table. A column weighting matrix table generated by multiplying the index values of the column elements is generated, and the columns corresponding to the sum of the index values of each row of the row weighting matrix table are rearranged in the original matrix table, respectively, and the corresponding columns are rearranged. Sorting the corresponding rows in ascending or descending order of the sum of the index values for each row of the column weighting matrix table,
The cluster defined by the distribution of index values on the matrix table is extracted.
Customer information processing device described.

5. The purchase information processing means multiplies the sum of the index values of row elements by the sum of the index values for each row of the matrix table when the sum of the index values exceeds a predetermined threshold value. A row weighting matrix table in which a row element is given a predetermined value, and when the sum of index values for each column of the matrix table exceeds a predetermined threshold value, the index value of the column element is multiplied by the sum. , A column weighting matrix table in which column elements are given the above-specified values for those below the threshold, and the sum of the index values for each column of the row weighting matrix table is generated for the original matrix table. The columns corresponding to each of the ascending order and the descending order are rearranged, and the rows corresponding to the sum of the index values for each row of the column weighting matrix table are rearranged in the ascending order or the descending order, respectively, and the corresponding index value on the matrix table is divided. The customer information processing apparatus according to claim 1 or 2, wherein clusters defined by cloth are extracted.

6. The customer information processing apparatus according to claim 5, wherein the purchase information processing means sets a predetermined value to 0 and generates a row weighting matrix table and a column weighting matrix table.

7. The purchase information processing means generates a matrix table in which an index value defining the cluster is set to 0 every time a cluster is extracted, and the cluster extraction processing is repeated for the matrix table. 7. The customer information processing device according to claim 6.

8. A purchase history forming step of forming purchase history data of a product for each customer in a memory area, and applying the customer and the product in the purchase history data as one item of row and column, respectively, and purchasing by the customer A matrix table forming step of forming in the memory area a matrix table in which a predetermined index value different from each other is assigned to the matrix element of the product having a history and the matrix element of the product having no history of purchase, and the matrix table, The rows are sorted based on the sum of the index values for each row, and the columns are sorted according to the sum of the index values for each column to extract the clusters defined by the distribution of the index values on the matrix table. A purchasing information clustering method comprising a clustering step.

9. A purchase history storage means for storing purchase history data of a product for each customer, a product having a history of purchase by applying the customer and the product in the purchase history data as one of items in a row and a column, respectively. Purchase information generating means for generating a matrix table in which predetermined matrix index values that are different from each other are assigned to the matrix elements of and the matrix elements of products for which there is no history of purchase. The computer functions as a purchase information processing unit that sorts the rows based on the sum of the index values for each column, and extracts the clusters defined by the distribution of the index values on the matrix table. Program to let.