CN106649523A - Commodity resource data processing method - Google Patents

Commodity resource data processing method Download PDF

Info

Publication number
CN106649523A
CN106649523A CN201610910500.8A CN201610910500A CN106649523A CN 106649523 A CN106649523 A CN 106649523A CN 201610910500 A CN201610910500 A CN 201610910500A CN 106649523 A CN106649523 A CN 106649523A
Authority
CN
China
Prior art keywords
data
commodity
processing method
data processing
format
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201610910500.8A
Other languages
Chinese (zh)
Inventor
李让剑
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Anhui Tianda Network Technology Co Ltd
Original Assignee
Anhui Tianda Network Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Anhui Tianda Network Technology Co Ltd filed Critical Anhui Tianda Network Technology Co Ltd
Priority to CN201610910500.8A priority Critical patent/CN106649523A/en
Publication of CN106649523A publication Critical patent/CN106649523A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9535Search customisation based on user profiles and personalisation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce
    • G06Q30/06Buying, selling or leasing transactions
    • G06Q30/0601Electronic shopping [e-shopping]
    • G06Q30/0623Item investigation
    • G06Q30/0625Directed, with specific intent or strategy
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F2216/00Indexing scheme relating to additional aspects of information retrieval not explicitly covered by G06F16/00 and subgroups
    • G06F2216/03Data mining

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Business, Economics & Management (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Accounting & Taxation (AREA)
  • Finance (AREA)
  • Physics & Mathematics (AREA)
  • Economics (AREA)
  • General Business, Economics & Management (AREA)
  • Strategic Management (AREA)
  • Marketing (AREA)
  • Development Economics (AREA)
  • Data Mining & Analysis (AREA)
  • General Engineering & Computer Science (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The invention provides a commodity resource data processing method. The method comprises the steps that screened-out commodity data are cleaned; the screened-out commodity data are classified and stored according to the keywords of each commodity; different layouts of data stored in different data storage devices are transformed into a uniform layout; data with the uniform layout are examined, data with noise and redundancies are deleted, missing data are filled in, and data are identified using a binary data coding; the characteristic values of the target data are determined; data are processed based on a specific characteristic value of the target data using a mining algorithm, the mined data are attached with an ID and are exported. Data screening in a cloud server of a network are completed through the search of commodity-related keywords, and relevant optimization processes and layout transformations are performed on the data so that the efficiency is even higher when data information is searched. The method provides very good sources of data for e-commerce websites.

Description

A kind of commodity resource data processing method
Technical field
The present invention relates to technical field of data processing, particularly a kind of commodity resource data processing method.
Background technology
In recent years, the development of internet is more and more rapider, is also increasingly popularized using the people of internet, and people are using mutual When networking carries out daily movable, program, information are checked in such as net purchase, and commodity can all produce substantial amounts of data, and these Data are very valuable for e-commerce website or the Internet media class website, using the process of these big datas Analysis can obtain very valuable commercial value.
Big data is widely used in the every application in internet, great to the significance of website, by mass data point Analysis and the realization of cloud computing, can maximize help the Internet media class advertiser web site system and ecommerce class website big data Commodity supplying system obtains maximized lifting.The big data advertisement of the Internet media class website is read preference and is pushed according to user, For the cloud computing of mass data, website browsing user is pushed to by various advertisement forms, for example, is applied in chamber of commerce's net;Electronics Commercial class website big data commodity are pushed to on-line purchase person, and by analyzing user behavior, buying behavior, product correlation are clicked on Property, preference and use time rule push corresponding commodity and sales promotion information, for example apply and obtain store in product.
Prior art there is presently no a kind of commodity resource big data processing method for business website.
The content of the invention
To solve above-mentioned technical problem, the invention provides a kind of commodity resource data processing method, it includes following step Suddenly:
Commodity data is cleaned:Commodity data is filtered out according to commodity keyword in each network Cloud Server, to filtering out Commodity data cleaned;
Data are classified:The commodity data for filtering out is classified according to each commodity keyword, and is individually stored in In different data storages;
Data form is unified:The data of the different-format being stored in different data storages are converted into unification Form;
Data prediction:The data of the consolidation form are checked, the data containing noise data, redundancy is rejected, to lacking Save data to be supplemented, while being identified data by binary data coding;
Data search:It is determined that the data critical word to be found, data name, storage date, data length are used as number of targets According to characteristic value;
Data mining:According to the specific features value of target data data are processed using mining algorithm, will be excavated Data affix mark after derive.
It is preferred that the unified concrete grammar of the data form is:
Storage size according to occupied by the data of different-format sorts successively, will occupy the lattice of maximum memory space The data unification of extended formatting is converted into the object format by formula data as object format.
It is preferred that the mining algorithm is k-means clustering algorithms or the cluster algorithm based on level.
It is preferred that the supplemental content of the default data includes data extension and system store path.
It is preferred that the data of the cleaning include referring to data in origin system not in given scope or for actual industry Business is meaningless, and data form is illegal, and there are the data of nonstandard coding and ambiguous service logic in origin system.
The invention has the advantages that:
The present invention in the relevant keyword search of commodity by completing to the data screening in network Cloud Server, and logarithm Make data in hgher efficiency in search with format conversion according to corresponding optimization processing is carried out, the present invention is by network cloud service Big data in device provides good data source through screening as source data for business website.
Certainly, the arbitrary product for implementing the present invention it is not absolutely required to while reaching all the above advantage.
Specific embodiment
The technical scheme in the present invention is clearly and completely described below in conjunction with the embodiment of the present invention, it is clear that institute The embodiment of description is only a part of embodiment of the invention, rather than the embodiment of whole.Based on the embodiment in the present invention, All other embodiment that those of ordinary skill in the art are obtained under the premise of creative work is not made, belongs to this The scope of bright protection.
A kind of commodity resource data processing method is embodiments provided, it is comprised the following steps:
Commodity data is cleaned:Commodity data is filtered out according to commodity keyword in each network Cloud Server, to filtering out Commodity data cleaned;
Data are classified:The commodity data for filtering out is classified according to each commodity keyword, and is individually stored in In different data storages;
Data form is unified:The data of the different-format being stored in different data storages are converted into unification Form;
Data prediction:The data of the consolidation form are checked, the data containing noise data, redundancy is rejected, to lacking Save data to be supplemented, while being identified data by binary data coding;
Data search:It is determined that the data critical word to be found, data name, storage date, data length are used as number of targets According to characteristic value;
Data mining:According to the specific features value of target data data are processed using mining algorithm, will be excavated Data affix mark after derive.
The unified concrete grammar of data form described in the present embodiment is:
Storage size according to occupied by the data of different-format sorts successively, will occupy the lattice of maximum memory space The data unification of extended formatting is converted into the object format by formula data as object format.
Wherein described mining algorithm is k-means clustering algorithms or the cluster algorithm based on level.
The supplemental content of the default data includes data extension and system store path.
The data of the cleaning are including the data referred in origin system not in given scope or for practical business has no Meaning, data form is illegal, and there are the data of nonstandard coding and ambiguous service logic in origin system.
The present invention is completed to the data screening in network Cloud Server by the keyword search relevant to commodity, and logarithm Make data in hgher efficiency in search with format conversion according to corresponding optimization processing is carried out, the present invention is by network cloud service Big data in device provides good data source through screening as source data for business website.
Present invention disclosed above preferred embodiment is only intended to help and illustrates the present invention.Preferred embodiment is not detailed All of details is described, it is only described specific embodiment also not limit the invention.Obviously, according to the content of this specification, Can make many modifications and variations.These embodiments are chosen and specifically described to this specification, is to preferably explain the present invention Principle and practical application so that skilled artisan can be best understood by and utilize the present invention.The present invention is only Limited by claims and its four corner and equivalent.

Claims (5)

1. a kind of commodity resource data processing method, it is characterised in that comprise the following steps:
Commodity data is cleaned:Commodity data is filtered out according to commodity keyword in each network Cloud Server, to the business for filtering out Product data are cleaned;
Data are classified:The commodity data for filtering out is classified according to each commodity keyword, and is individually stored in difference Data storage in;
Data form is unified:The data of the different-format being stored in different data storages are converted into unified lattice Formula;
Data prediction:The data of the consolidation form are checked, the data containing noise data, redundancy is rejected, to default number According to being supplemented, while being identified by binary data coding to data;
Data search:It is determined that the data critical word to be found, data name, storage date, data length are used as target data Characteristic value;
Data mining:According to the specific features value of target data data are processed using mining algorithm, by the number excavated According to derivation after affix mark.
2. commodity resource data processing method as claimed in claim 1, it is characterised in that the data form unification it is concrete Method is:
Storage size according to occupied by the data of different-format sorts successively, will occupy the form number of maximum memory space According to as object format, and the data unification of extended formatting is converted into the object format.
3. commodity resource data processing method as claimed in claim 1, it is characterised in that the mining algorithm is k-means Clustering algorithm or the cluster algorithm based on level.
4. commodity resource data processing method as claimed in claim 1, it is characterised in that the supplemental content of the default data Including data extension and system store path.
5. commodity resource data processing method as claimed in claim 1, it is characterised in that the data of the cleaning include finger source Data in system are not in given scope or meaningless for practical business, and data form is illegal, and in origin system The middle data that there is nonstandard coding and ambiguous service logic.
CN201610910500.8A 2016-10-18 2016-10-18 Commodity resource data processing method Pending CN106649523A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201610910500.8A CN106649523A (en) 2016-10-18 2016-10-18 Commodity resource data processing method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201610910500.8A CN106649523A (en) 2016-10-18 2016-10-18 Commodity resource data processing method

Publications (1)

Publication Number Publication Date
CN106649523A true CN106649523A (en) 2017-05-10

Family

ID=58855353

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201610910500.8A Pending CN106649523A (en) 2016-10-18 2016-10-18 Commodity resource data processing method

Country Status (1)

Country Link
CN (1) CN106649523A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110782263A (en) * 2019-11-04 2020-02-11 中国电子信息产业发展研究院 Method for capturing, removing duplicate and repairing tracing data

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101599161A (en) * 2009-07-17 2009-12-09 用友软件股份有限公司 Marketing support system
CN104750813A (en) * 2015-03-30 2015-07-01 浪潮集团有限公司 Data cleaning method based on data reduction model
CN105930466A (en) * 2016-04-21 2016-09-07 成都数联铭品科技有限公司 Massive data processing method

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101599161A (en) * 2009-07-17 2009-12-09 用友软件股份有限公司 Marketing support system
CN104750813A (en) * 2015-03-30 2015-07-01 浪潮集团有限公司 Data cleaning method based on data reduction model
CN105930466A (en) * 2016-04-21 2016-09-07 成都数联铭品科技有限公司 Massive data processing method

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110782263A (en) * 2019-11-04 2020-02-11 中国电子信息产业发展研究院 Method for capturing, removing duplicate and repairing tracing data

Similar Documents

Publication Publication Date Title
CN106649516A (en) A large data processing method for educational resources
TWI522942B (en) User favorites data processing method and device, user favorite data searching method and device, and user favorite system
TWI508011B (en) Category information providing method and device
JP5897019B2 (en) Method and apparatus for determining linked list of candidate products
JP6646931B2 (en) Method and apparatus for providing recommendation information
US11836778B2 (en) Product and content association
CN110827112B (en) Deep learning commodity recommendation method and device, computer equipment and storage medium
CN109241403B (en) Project recommendation method and device, machine equipment and computer-readable storage medium
CN105404699A (en) Method, device and server for searching articles of finance and economics
CN106991175B (en) Customer information mining method, device, equipment and storage medium
CN110352427B (en) System and method for collecting data associated with fraudulent content in a networked environment
US10346496B2 (en) Information category obtaining method and apparatus
EP2836978A1 (en) Searching supplier information based on transaction platform
JP2015525418A (en) Search method and apparatus
CN103020128B (en) With the method and apparatus of data interaction with terminal device
CN106372956B (en) Method and system for identifying intention entity based on user search log
CN112818230B (en) Content recommendation method, device, electronic equipment and storage medium
JP2018537768A (en) Identifying users with social business characteristics
CN105335386A (en) Method and apparatus for providing navigation tag
CN106649523A (en) Commodity resource data processing method
CN106503114A (en) Commodity resource data obtains system
US9378277B1 (en) Search query segmentation
CN104715374A (en) Method and system for governing repetition products of e-commerce platform
CN105224547A (en) The disposal route of object set and satisfaction thereof and device
CN104050174B (en) A kind of personal page generation method and device

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20170510