CN106649523A - Commodity resource data processing method - Google Patents
Commodity resource data processing method Download PDFInfo
- Publication number
- CN106649523A CN106649523A CN201610910500.8A CN201610910500A CN106649523A CN 106649523 A CN106649523 A CN 106649523A CN 201610910500 A CN201610910500 A CN 201610910500A CN 106649523 A CN106649523 A CN 106649523A
- Authority
- CN
- China
- Prior art keywords
- data
- commodity
- processing method
- data processing
- format
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/953—Querying, e.g. by the use of web search engines
- G06F16/9535—Search customisation based on user profiles and personalisation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q30/00—Commerce
- G06Q30/06—Buying, selling or leasing transactions
- G06Q30/0601—Electronic shopping [e-shopping]
- G06Q30/0623—Item investigation
- G06Q30/0625—Directed, with specific intent or strategy
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F2216/00—Indexing scheme relating to additional aspects of information retrieval not explicitly covered by G06F16/00 and subgroups
- G06F2216/03—Data mining
Landscapes
- Engineering & Computer Science (AREA)
- Databases & Information Systems (AREA)
- Business, Economics & Management (AREA)
- Theoretical Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Accounting & Taxation (AREA)
- Finance (AREA)
- Physics & Mathematics (AREA)
- Economics (AREA)
- General Business, Economics & Management (AREA)
- Strategic Management (AREA)
- Marketing (AREA)
- Development Economics (AREA)
- Data Mining & Analysis (AREA)
- General Engineering & Computer Science (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Abstract
The invention provides a commodity resource data processing method. The method comprises the steps that screened-out commodity data are cleaned; the screened-out commodity data are classified and stored according to the keywords of each commodity; different layouts of data stored in different data storage devices are transformed into a uniform layout; data with the uniform layout are examined, data with noise and redundancies are deleted, missing data are filled in, and data are identified using a binary data coding; the characteristic values of the target data are determined; data are processed based on a specific characteristic value of the target data using a mining algorithm, the mined data are attached with an ID and are exported. Data screening in a cloud server of a network are completed through the search of commodity-related keywords, and relevant optimization processes and layout transformations are performed on the data so that the efficiency is even higher when data information is searched. The method provides very good sources of data for e-commerce websites.
Description
Technical field
The present invention relates to technical field of data processing, particularly a kind of commodity resource data processing method.
Background technology
In recent years, the development of internet is more and more rapider, is also increasingly popularized using the people of internet, and people are using mutual
When networking carries out daily movable, program, information are checked in such as net purchase, and commodity can all produce substantial amounts of data, and these
Data are very valuable for e-commerce website or the Internet media class website, using the process of these big datas
Analysis can obtain very valuable commercial value.
Big data is widely used in the every application in internet, great to the significance of website, by mass data point
Analysis and the realization of cloud computing, can maximize help the Internet media class advertiser web site system and ecommerce class website big data
Commodity supplying system obtains maximized lifting.The big data advertisement of the Internet media class website is read preference and is pushed according to user,
For the cloud computing of mass data, website browsing user is pushed to by various advertisement forms, for example, is applied in chamber of commerce's net;Electronics
Commercial class website big data commodity are pushed to on-line purchase person, and by analyzing user behavior, buying behavior, product correlation are clicked on
Property, preference and use time rule push corresponding commodity and sales promotion information, for example apply and obtain store in product.
Prior art there is presently no a kind of commodity resource big data processing method for business website.
The content of the invention
To solve above-mentioned technical problem, the invention provides a kind of commodity resource data processing method, it includes following step
Suddenly:
Commodity data is cleaned:Commodity data is filtered out according to commodity keyword in each network Cloud Server, to filtering out
Commodity data cleaned;
Data are classified:The commodity data for filtering out is classified according to each commodity keyword, and is individually stored in
In different data storages;
Data form is unified:The data of the different-format being stored in different data storages are converted into unification
Form;
Data prediction:The data of the consolidation form are checked, the data containing noise data, redundancy is rejected, to lacking
Save data to be supplemented, while being identified data by binary data coding;
Data search:It is determined that the data critical word to be found, data name, storage date, data length are used as number of targets
According to characteristic value;
Data mining:According to the specific features value of target data data are processed using mining algorithm, will be excavated
Data affix mark after derive.
It is preferred that the unified concrete grammar of the data form is:
Storage size according to occupied by the data of different-format sorts successively, will occupy the lattice of maximum memory space
The data unification of extended formatting is converted into the object format by formula data as object format.
It is preferred that the mining algorithm is k-means clustering algorithms or the cluster algorithm based on level.
It is preferred that the supplemental content of the default data includes data extension and system store path.
It is preferred that the data of the cleaning include referring to data in origin system not in given scope or for actual industry
Business is meaningless, and data form is illegal, and there are the data of nonstandard coding and ambiguous service logic in origin system.
The invention has the advantages that:
The present invention in the relevant keyword search of commodity by completing to the data screening in network Cloud Server, and logarithm
Make data in hgher efficiency in search with format conversion according to corresponding optimization processing is carried out, the present invention is by network cloud service
Big data in device provides good data source through screening as source data for business website.
Certainly, the arbitrary product for implementing the present invention it is not absolutely required to while reaching all the above advantage.
Specific embodiment
The technical scheme in the present invention is clearly and completely described below in conjunction with the embodiment of the present invention, it is clear that institute
The embodiment of description is only a part of embodiment of the invention, rather than the embodiment of whole.Based on the embodiment in the present invention,
All other embodiment that those of ordinary skill in the art are obtained under the premise of creative work is not made, belongs to this
The scope of bright protection.
A kind of commodity resource data processing method is embodiments provided, it is comprised the following steps:
Commodity data is cleaned:Commodity data is filtered out according to commodity keyword in each network Cloud Server, to filtering out
Commodity data cleaned;
Data are classified:The commodity data for filtering out is classified according to each commodity keyword, and is individually stored in
In different data storages;
Data form is unified:The data of the different-format being stored in different data storages are converted into unification
Form;
Data prediction:The data of the consolidation form are checked, the data containing noise data, redundancy is rejected, to lacking
Save data to be supplemented, while being identified data by binary data coding;
Data search:It is determined that the data critical word to be found, data name, storage date, data length are used as number of targets
According to characteristic value;
Data mining:According to the specific features value of target data data are processed using mining algorithm, will be excavated
Data affix mark after derive.
The unified concrete grammar of data form described in the present embodiment is:
Storage size according to occupied by the data of different-format sorts successively, will occupy the lattice of maximum memory space
The data unification of extended formatting is converted into the object format by formula data as object format.
Wherein described mining algorithm is k-means clustering algorithms or the cluster algorithm based on level.
The supplemental content of the default data includes data extension and system store path.
The data of the cleaning are including the data referred in origin system not in given scope or for practical business has no
Meaning, data form is illegal, and there are the data of nonstandard coding and ambiguous service logic in origin system.
The present invention is completed to the data screening in network Cloud Server by the keyword search relevant to commodity, and logarithm
Make data in hgher efficiency in search with format conversion according to corresponding optimization processing is carried out, the present invention is by network cloud service
Big data in device provides good data source through screening as source data for business website.
Present invention disclosed above preferred embodiment is only intended to help and illustrates the present invention.Preferred embodiment is not detailed
All of details is described, it is only described specific embodiment also not limit the invention.Obviously, according to the content of this specification,
Can make many modifications and variations.These embodiments are chosen and specifically described to this specification, is to preferably explain the present invention
Principle and practical application so that skilled artisan can be best understood by and utilize the present invention.The present invention is only
Limited by claims and its four corner and equivalent.
Claims (5)
1. a kind of commodity resource data processing method, it is characterised in that comprise the following steps:
Commodity data is cleaned:Commodity data is filtered out according to commodity keyword in each network Cloud Server, to the business for filtering out
Product data are cleaned;
Data are classified:The commodity data for filtering out is classified according to each commodity keyword, and is individually stored in difference
Data storage in;
Data form is unified:The data of the different-format being stored in different data storages are converted into unified lattice
Formula;
Data prediction:The data of the consolidation form are checked, the data containing noise data, redundancy is rejected, to default number
According to being supplemented, while being identified by binary data coding to data;
Data search:It is determined that the data critical word to be found, data name, storage date, data length are used as target data
Characteristic value;
Data mining:According to the specific features value of target data data are processed using mining algorithm, by the number excavated
According to derivation after affix mark.
2. commodity resource data processing method as claimed in claim 1, it is characterised in that the data form unification it is concrete
Method is:
Storage size according to occupied by the data of different-format sorts successively, will occupy the form number of maximum memory space
According to as object format, and the data unification of extended formatting is converted into the object format.
3. commodity resource data processing method as claimed in claim 1, it is characterised in that the mining algorithm is k-means
Clustering algorithm or the cluster algorithm based on level.
4. commodity resource data processing method as claimed in claim 1, it is characterised in that the supplemental content of the default data
Including data extension and system store path.
5. commodity resource data processing method as claimed in claim 1, it is characterised in that the data of the cleaning include finger source
Data in system are not in given scope or meaningless for practical business, and data form is illegal, and in origin system
The middle data that there is nonstandard coding and ambiguous service logic.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610910500.8A CN106649523A (en) | 2016-10-18 | 2016-10-18 | Commodity resource data processing method |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610910500.8A CN106649523A (en) | 2016-10-18 | 2016-10-18 | Commodity resource data processing method |
Publications (1)
Publication Number | Publication Date |
---|---|
CN106649523A true CN106649523A (en) | 2017-05-10 |
Family
ID=58855353
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201610910500.8A Pending CN106649523A (en) | 2016-10-18 | 2016-10-18 | Commodity resource data processing method |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN106649523A (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110782263A (en) * | 2019-11-04 | 2020-02-11 | 中国电子信息产业发展研究院 | Method for capturing, removing duplicate and repairing tracing data |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101599161A (en) * | 2009-07-17 | 2009-12-09 | 用友软件股份有限公司 | Marketing support system |
CN104750813A (en) * | 2015-03-30 | 2015-07-01 | 浪潮集团有限公司 | Data cleaning method based on data reduction model |
CN105930466A (en) * | 2016-04-21 | 2016-09-07 | 成都数联铭品科技有限公司 | Massive data processing method |
-
2016
- 2016-10-18 CN CN201610910500.8A patent/CN106649523A/en active Pending
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101599161A (en) * | 2009-07-17 | 2009-12-09 | 用友软件股份有限公司 | Marketing support system |
CN104750813A (en) * | 2015-03-30 | 2015-07-01 | 浪潮集团有限公司 | Data cleaning method based on data reduction model |
CN105930466A (en) * | 2016-04-21 | 2016-09-07 | 成都数联铭品科技有限公司 | Massive data processing method |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110782263A (en) * | 2019-11-04 | 2020-02-11 | 中国电子信息产业发展研究院 | Method for capturing, removing duplicate and repairing tracing data |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN106649516A (en) | A large data processing method for educational resources | |
TWI522942B (en) | User favorites data processing method and device, user favorite data searching method and device, and user favorite system | |
TWI508011B (en) | Category information providing method and device | |
JP5897019B2 (en) | Method and apparatus for determining linked list of candidate products | |
JP6646931B2 (en) | Method and apparatus for providing recommendation information | |
US11836778B2 (en) | Product and content association | |
CN110827112B (en) | Deep learning commodity recommendation method and device, computer equipment and storage medium | |
CN109241403B (en) | Project recommendation method and device, machine equipment and computer-readable storage medium | |
CN105404699A (en) | Method, device and server for searching articles of finance and economics | |
CN106991175B (en) | Customer information mining method, device, equipment and storage medium | |
CN110352427B (en) | System and method for collecting data associated with fraudulent content in a networked environment | |
US10346496B2 (en) | Information category obtaining method and apparatus | |
EP2836978A1 (en) | Searching supplier information based on transaction platform | |
JP2015525418A (en) | Search method and apparatus | |
CN103020128B (en) | With the method and apparatus of data interaction with terminal device | |
CN106372956B (en) | Method and system for identifying intention entity based on user search log | |
CN112818230B (en) | Content recommendation method, device, electronic equipment and storage medium | |
JP2018537768A (en) | Identifying users with social business characteristics | |
CN105335386A (en) | Method and apparatus for providing navigation tag | |
CN106649523A (en) | Commodity resource data processing method | |
CN106503114A (en) | Commodity resource data obtains system | |
US9378277B1 (en) | Search query segmentation | |
CN104715374A (en) | Method and system for governing repetition products of e-commerce platform | |
CN105224547A (en) | The disposal route of object set and satisfaction thereof and device | |
CN104050174B (en) | A kind of personal page generation method and device |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20170510 |