CN108804430A

CN108804430A - A kind of SEM launches data sorting system and its sorting technique

Info

Publication number: CN108804430A
Application number: CN201710281438.5A
Authority: CN
Inventors: 林正春; 梁文庆
Original assignee: Guangdong Original Mdt Infotech Ltd
Current assignee: Guangdong Original Mdt Infotech Ltd
Priority date: 2017-04-26
Filing date: 2017-04-26
Publication date: 2018-11-13

Abstract

The invention discloses a kind of SEM to launch data sorting system, including database module, and data information is launched for storing；Data launch analog module, for analogue data launch process and collect dispensing feedback information；Characteristic extracting module, for carrying out feature extraction in the feedback information of collection；Feature processing block, for being associated property of the feature processing to extraction；Sort module, for feature to classify to data according to treated；Mark module establishes classification map relationship for sorted data to be marked.The present invention can improve the deficiencies in the prior art, improve the accuracy rate and speed of data classification.

Description

A kind of SEM launches data sorting system and its sorting technique

Technical field

The present invention relates to SEM to launch analysis technical field, and especially a kind of SEM launches data sorting system and its classification side Method.

Background technology

Search engine refers to a kind of information query system based on Internet, including Information Access, information management and Information retrieval.Search engine marketing is exactly the mode that search engine is used according to consumer, passes through a whole set of technology and strategy Marketing message is passed to target group by system as far as possible using the chance of user search information.Search engine marketing (SEM) is made For a kind of Network Marketing Mode, it is therefore intended that promote website, enhance the reputation, obtained more by the result of search engine return Good sale or channels has more to moulding Network brand, website promotion, online sales and product promotion etc. Apparent effect.In order to improve the effect of SEM impression informations, need to classify to launching data, existing SEM launches data Categorizing system is directly compared both for some (or multiple) feature of data, is then classified, this mode classification It is not only computationally intensive, but also the relevance between data different characteristic can not be excavated, cause classification accuracy low.

Invention content

The technical problem to be solved in the present invention is to provide a kind of SEM to launch data sorting system and its sorting technique, can The deficiencies in the prior art are solved, the accuracy rate and speed of data classification are improved.

In order to solve the above technical problems, the technical solution used in the present invention is as follows.

A kind of SEM dispensings data sorting system, including,

Database module launches data information for storing；

Data launch analog module, for analogue data launch process and collect dispensing feedback information；

Characteristic extracting module, for carrying out feature extraction in the feedback information of collection；

Feature processing block, for being associated property of the feature processing to extraction；

Sort module, for feature to classify to data according to treated；

Mark module establishes classification map relationship for sorted data to be marked.

A kind of above-mentioned SEM launches the sorting technique of data sorting system, includes the following steps：

A, it will be stored in database module for the data of dispensing, and launches analog module by data and carries out dispensing simulation；

B, characteristic extracting module extracts the feature in feedback information；

C, feature processing block handles the being associated property of feature of extraction；

D, according to treated, feature classifies to data to sort module；

E, classification map relationship is established, and deposit as a result, sorted data are marked in mark module according to classification Storage is in database module.

Preferably, in the step A, dispensing simulation is carried out using several keywords at random, to the key simulated Word is recorded in data launch analog module, and is deleted and recorded keyword relevance and wait for that simulation is crucial more than threshold value Word；For the degree of correlation of keyword on the basis of keyword fields, the degree of correlation is directly proportional to fields degree of overlapping.

Preferably, in step B, the feature of extraction includes showing cost, the amount of showing, click volume, conversion ratio, launching area Domain, release time, frequency.

Preferably, in step B, the inhomogeneity characteristic value for belonging to same feedback information is extracted, is formed based on anti- The fisrt feature matrix of feedforward information；The same class characteristic value belonged in different feedback informations is extracted, feature based is formed The second characteristic matrix of value.

Preferably, in step C, being associated property of the feature processing of extraction is included the following steps,

C1, the transposed matrix for seeking second characteristic matrix successively, by second characteristic matrix transposed matrix institute corresponding with its The linear space of representative is compared, and the weighted value of this corresponding category feature value of similarity of two linear space is at just Than；

C2, corresponding characteristic value in fisrt feature matrix is modified using the weighted value of each category feature value；

C3, the feature vector for seeking revised fisrt feature matrix, using non-linear transform function to feature vector group At vector space carry out nonlinear transformation processing the eigenfunction for the dimension that disappears is returned with reducing vector space dimension One changes, and is merged into remaining feature vector.

Preferably, in step D, classification is carried out to data and is included the following steps,

D1, projection of the vector space after the corresponding dimensionality reduction of data in each dimension is sought；

D2, classified to data according to the similarity in same dimension upslide shadow.

It is using advantageous effect caused by above-mentioned technical proposal：The present invention can reduce the operation of data assorting process Amount increases the excavation of the degree of correlation pair different characteristic between, improves the accuracy of data classification, improvement SEM impression informations Effect.

Description of the drawings

Fig. 1 is the hardware elementary diagram of the present invention.

In figure：1, database module；2, data launch analog module；3, characteristic extracting module；4, feature processing block；5, Sort module；6, mark module.

Specific implementation mode

Referring to Fig.1, a kind of SEM launches data sorting system, it is characterised in that：Including,

Database module 1 launches data information for storing；

Data launch analog module 2, for analogue data launch process and collect dispensing feedback information；

Characteristic extracting module 3, for carrying out feature extraction in the feedback information of collection；

Feature processing block 4, for being associated property of the feature processing to extraction；

Sort module 5, for feature to classify to data according to treated；

Mark module 6 establishes classification map relationship for sorted data to be marked.

A, it will be stored in database module 1 for the data of dispensing, and launches analog module 2 by data and carries out dispensing mould It is quasi-；

B, characteristic extracting module 3 extracts the feature in feedback information；

C, feature processing block 4 handles the being associated property of feature of extraction；

D, according to treated, feature classifies to data to sort module 5；

E, classification map relationship is established as a result, sorted data are marked in mark module 6 according to classification, and It is stored in database module 1.

In step A, dispensing simulation is carried out using several keywords at random, mould is launched in data to the keyword simulated It is recorded in quasi- module 2, and deletes and recorded the keyword to be simulated that keyword relevance is more than threshold value；The phase of keyword For Guan Du on the basis of keyword fields, the degree of correlation is directly proportional to fields degree of overlapping.Threshold value is averaged according to keyword The degree of correlation and dispensing require to be specifically chosen.

In step B, the feature of extraction includes when showing cost, the amount of showing, click volume, conversion ratio, dispensing region, dispensing Between, frequency.

In step B, the inhomogeneity characteristic value for belonging to same feedback information is extracted, forms the based on feedback information One eigenmatrix；The same class characteristic value belonged in different feedback informations is extracted, form feature based value second is special Levy matrix.

In step C, being associated property of the feature processing of extraction is included the following steps,

In step D, classification is carried out to data and is included the following steps,

Wherein, in step D1, first choice is iterated processing to vector space,

T '=TK

T is vector space, and K is Iterative Matrix, and K is non-non-singular matrix.By iteration, can be effectively increased in vector space The difference degree of characteristic information and non-characteristic information, convenient for distinguishing.

In step E, secondary classification is carried out to the classification map relationship of foundation, the path counted between different mappings intersects Point, the classification map relationship that crosspoint quantity is more than to threshold value are classified as one kind.Threshold value is according to the average in crosspoint between mapping Amount and dispensing require specific determine.By the secondary classification to classification map relationship, can improve for launching data after classification Retrieval rate.

The present invention is suitable for the classification of the SEM dispensing data of big data quantity, and classification speed is fast, and it is good to launch effect.

In the description of the present invention, it is to be understood that, term " longitudinal direction ", " transverse direction ", "upper", "lower", "front", "rear", The orientation or positional relationship of the instructions such as "left", "right", "vertical", "horizontal", "top", "bottom", "inner", "outside" is based on attached drawing institute The orientation or positional relationship shown is merely for convenience of the description present invention, does not indicate or imply the indicated device or element must There must be specific orientation, with specific azimuth configuration and operation, therefore be not considered as limiting the invention.

The above shows and describes the basic principles and main features of the present invention and the advantages of the present invention.The technology of the industry Personnel are it should be appreciated that the present invention is not limited to the above embodiments, and the above embodiments and description only describe this The principle of invention, without departing from the spirit and scope of the present invention, various changes and improvements may be made to the invention, these changes Change and improvement all fall within the protetion scope of the claimed invention.The claimed scope of the invention by appended claims and its Equivalent thereof.

Claims

1. a kind of SEM launches data sorting system, it is characterised in that：Including,

Database module (1) launches data information for storing；

Data launch analog module (2), for analogue data launch process and collect dispensing feedback information；

Characteristic extracting module (3), for carrying out feature extraction in the feedback information of collection；

Feature processing block (4), for being associated property of the feature processing to extraction；

Sort module (5), for feature to classify to data according to treated；

Mark module (6) establishes classification map relationship for sorted data to be marked.

2. a kind of SEM described in claim 1 launches the sorting technique of data sorting system, it is characterised in that including following step Suddenly：

A, dispensing mould will be carried out for the data of dispensing deposit database module (1), and by data dispensing analog module (2) It is quasi-；

B, characteristic extracting module (3) extracts the feature in feedback information；

C, feature processing block (4) handles the being associated property of feature of extraction；

D, according to treated, feature classifies to data to sort module (5)；

E, classification map relationship is established, and deposit as a result, sorted data are marked in mark module (6) according to classification Storage is in database module (1).

3. SEM according to claim 2 launches the sorting technique of data sorting system, it is characterised in that：In step A, with Machine carries out dispensing simulation using several keywords, remembers in data launch analog module (2) to the keyword simulated Record, and delete and recorded the keyword to be simulated that keyword relevance is more than threshold value；The degree of correlation of keyword is with keyword institute On the basis of category field, the degree of correlation is directly proportional to fields degree of overlapping.

4. SEM according to claim 2 launches the sorting technique of data sorting system, it is characterised in that：In step B, carry The feature taken includes showing cost, the amount of showing, click volume, conversion ratio, launching region, release time, frequency.

5. SEM according to claim 4 launches the sorting technique of data sorting system, it is characterised in that：It, will in step B The inhomogeneity characteristic value for belonging to same feedback information extracts, and forms the fisrt feature matrix based on feedback information；It will belong to Same class characteristic value in different feedback informations extracts, and forms the second characteristic matrix of feature based value.

6. SEM according to claim 5 launches the sorting technique of data sorting system, it is characterised in that：It is right in step C The being associated property of feature of extraction, which is handled, to be included the following steps,

C1, the transposed matrix for seeking second characteristic matrix successively, representated by second characteristic matrix transposed matrix corresponding with its Linear space compared, the weighted value of this corresponding category feature value of similarity of two linear space is directly proportional；

C3, the feature vector for seeking revised fisrt feature matrix, form feature vector using non-linear transform function Vector space carries out nonlinear transformation processing and the eigenfunction for the dimension that disappears is normalized with reducing vector space dimension, It is merged into remaining feature vector.

7. SEM according to claim 6 launches the sorting technique of data sorting system, it is characterised in that：It is right in step D Data carry out classification and include the following steps,