CN114463053A - Enterprise attribution classification method and system - Google Patents

Enterprise attribution classification method and system Download PDF

Info

Publication number
CN114463053A
CN114463053A CN202210070106.3A CN202210070106A CN114463053A CN 114463053 A CN114463053 A CN 114463053A CN 202210070106 A CN202210070106 A CN 202210070106A CN 114463053 A CN114463053 A CN 114463053A
Authority
CN
China
Prior art keywords
data
enterprise
classification
attribution
enterprises
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202210070106.3A
Other languages
Chinese (zh)
Inventor
吴呈良
郑敏
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Chaozhou Zhuoshu Big Data Industry Development Co Ltd
Original Assignee
Chaozhou Zhuoshu Big Data Industry Development Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Chaozhou Zhuoshu Big Data Industry Development Co Ltd filed Critical Chaozhou Zhuoshu Big Data Industry Development Co Ltd
Priority to CN202210070106.3A priority Critical patent/CN114463053A/en
Publication of CN114463053A publication Critical patent/CN114463053A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce
    • G06Q30/02Marketing; Price estimation or determination; Fundraising
    • G06Q30/0201Market modelling; Market analysis; Collecting market data
    • G06Q30/0204Market segmentation
    • G06Q30/0205Location or geographical consideration
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • G06F18/241Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches

Landscapes

  • Engineering & Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • Strategic Management (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Finance (AREA)
  • Development Economics (AREA)
  • Accounting & Taxation (AREA)
  • Entrepreneurship & Innovation (AREA)
  • General Physics & Mathematics (AREA)
  • Physics & Mathematics (AREA)
  • Evolutionary Computation (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • General Engineering & Computer Science (AREA)
  • Artificial Intelligence (AREA)
  • Evolutionary Biology (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Game Theory and Decision Science (AREA)
  • Economics (AREA)
  • Marketing (AREA)
  • General Business, Economics & Management (AREA)
  • Life Sciences & Earth Sciences (AREA)

Abstract

The invention provides a method and a system for classifying enterprise affiliation, which belong to the field of data processing.

Description

Enterprise attribution classification method and system
Technical Field
The invention relates to the technical fields of statistical analysis, data processing and the like, in particular to a method and a system for enterprise attribution classification.
Background
Statistical analysis refers to a process of analyzing a large amount of collected data by using an appropriate statistical analysis method, extracting useful information, forming a conclusion, and performing detailed research and summary on the data. By carrying out statistical analysis on the data, corresponding conclusions can be obtained, and the conclusions can help people to make judgment so as to take proper action.
Data processing is the collection, storage, retrieval, processing, transformation and transmission of data, and is the extraction and derivation of valuable and meaningful data for certain people from a large amount of data that may be cluttered and unintelligible. Data processing is a basic link of system engineering and automatic control, runs through various fields of social production and social life, and greatly influences the development process of the human society.
Enterprises play a very important role as participants in marketing activities, both online and offline. How to accurately and effectively divide the attribution of an enterprise on a regional dimension according to the enterprise business registration information so as to monitor and analyze the sales condition of the enterprise on the regional dimension and the enterprise sales condition summarized on the regional dimension becomes a key point and a difficult point for researching the current business activities.
Disclosure of Invention
In order to solve the technical problems, the invention provides a method for classifying the enterprise attribution, which achieves the purposes of effectively utilizing the acquired enterprise data, accurately and effectively classifying the enterprise attribution and further summarizing regional sales hotspots and particularly performs regional standardized processing on the enterprise.
The technical scheme of the invention is as follows:
a method of enterprise attribution classification, comprising:
firstly, data acquisition, namely acquiring public information of an online enterprise, and storing acquired original data into a database;
secondly, cleaning data, namely cleaning and denoising the acquired original data;
thirdly, data verification, namely, checking and verifying the cleaned data;
fourthly, classifying the data, namely after the acquired data are cleaned and verified, inputting the enterprise related data into a data classification device, and classifying the attributions of the enterprises;
fifthly, data supplementary collection, namely repeating the steps for reclassification after the data supplementary collection is carried out on the enterprises which are not classified;
and sixthly, displaying data, namely performing visual display on the enterprises which finish the enterprise attribution classification, wherein the enterprises comprise maps, tables and documents.
And seventhly, analyzing data, namely combining other related data to analyze and monitor the enterprises which finish the administrative division attribution classification in the subsequent region dimension.
Further, in the above-mentioned case,
the data cleaning process includes: removing irrelevant and wrong characters (blank space, messy codes and redundant characters), converting the characters into English semi-angle symbols, removing repeated data and supplementing null data.
The data check comprises check on the aspects of integrity, uniqueness and non-nullability; and sampling the data, and comparing the sampled data with the original data displayed by the webpage to ensure the accuracy of the data.
The data classification divides the enterprise into attributions and comprises the following steps:
1) inputting a classification item, which is a standard geographical administrative division table;
2) inputting data to be classified, including enterprise names, unified social credit codes, registration authorities and addresses;
3) classifying the attribution of the enterprise, and sequentially adopting the following methods:
3.1) acquiring three-level attribution information of the administrative district according to an enterprise registration authority;
3.2) acquiring three-level attribution information of the administrative division according to the enterprise registration address;
3.3) acquiring three-level attribution information of the administrative division according to the uniform social credit code of the enterprise;
3.4) acquiring three-level attribution information of the administrative division according to the enterprise name;
and 3.5) the special enterprises comprehensively acquire the three-level attribution information of the administrative district according to the enterprise name, the unified social credit code, the registration authority, the address and other information.
4) And outputting the classification result of the administrative division affiliation of the enterprise.
The invention also provides a system for classifying the enterprise affiliation, which comprises the following steps:
the method comprises the following steps:
the data acquisition device is used for acquiring the public information of the online enterprise and storing the acquired original data into the database;
the data cleaning device is used for cleaning and denoising the acquired original data;
the data checking device is used for checking and checking the cleaned data;
and after the acquired data is cleaned and verified, the data classification device inputs the part of enterprise related data into the data classification device, and the enterprise is divided according to the attribution:
the data supplementing and collecting device is used for classifying the data classification device again after data supplementing and collecting is carried out on the enterprises which are not classified by the data classification device;
the data display device is used for visually displaying the enterprises which finish the enterprise attribution classification, and comprises a map, a table and a document;
and the data analysis device is used for performing subsequent analysis and monitoring on regional dimensions on the enterprises which finish the administrative division attribution classification by combining with other related data.
Further, in the above-mentioned case,
the data cleaning process includes: removing irrelevant and wrong characters (blank space, messy codes and redundant characters), converting the characters into English semi-angle symbols, removing repeated data and supplementing null data.
The data check comprises check on the aspects of integrity, uniqueness and non-nullability; and sampling the data, and comparing the sampled data with the original data displayed by the webpage to ensure the accuracy of the data.
The data classification divides the enterprise into attributions and comprises the following steps:
1) inputting a classification item, which is a standard geographical administrative division table;
2) inputting data to be classified, including enterprise names, unified social credit codes, registration authorities and addresses;
3) classifying the attribution of the enterprise, and sequentially adopting the following methods:
3.1) acquiring three-level attribution information of the administrative district according to an enterprise registration authority;
3.2) acquiring three-level attribution information of the administrative division according to the enterprise registration address;
3.3) acquiring three-level attribution information of the administrative division according to the uniform social credit code of the enterprise;
3.4) acquiring three-level attribution information of the administrative division according to the enterprise name;
and 3.5) the special enterprises comprehensively acquire the three-level attribution information of the administrative district according to the enterprise name, the unified social credit code, the registration authority, the address and other information.
4) And outputting the classification result of the administrative division affiliation of the enterprise.
The invention has the advantages that
After the enterprise is classified according to the three-level attribution of the region, the operation and sales data of the enterprise can be classified according to the region, so that the difficulty and the complexity of analysis of the later-stage enterprise data in the region dimension are greatly reduced, the sales hotspots and conclusions of the enterprise in the region dimension can be conveniently summarized, and a large amount of time cost and labor and material cost can be saved for the analysis and the monitoring of the enterprise data.
Drawings
FIG. 1 is a schematic workflow diagram of the present invention.
Detailed Description
In order to make the objects, technical solutions and advantages of the embodiments of the present invention clearer and more complete, the technical solutions in the embodiments of the present invention will be described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are some, but not all, embodiments of the present invention, and based on the embodiments of the present invention, all other embodiments obtained by a person of ordinary skill in the art without creative efforts belong to the scope of the present invention.
According to the invention, the enterprise data is standardized by carrying out method and process design on the acquired enterprise public data, province, city and district three-level region labels are marked for the enterprise, the enterprise is clearly classified in the region of ownership, and further the statistics and analysis of the subsequent enterprise sales data in the region dimension aspect are facilitated.
The method mainly comprises the following steps:
the first step is as follows: a data acquisition device. Collecting public information of the online enterprises, and storing the collected original data in a database;
the second step is that: and a data cleaning device. Cleaning and denoising the collected original data, including but not limited to removing irrelevant and wrong characters (blank spaces, messy codes, redundant characters and the like), converting the characters into English semi-angle symbols, eliminating repeated data, supplementing null data and the like;
the third step: and a data checking device. Checking and checking the cleaned data, including but not limited to checking and checking in terms of integrity, uniqueness, non-nullability and the like; sampling the data, and comparing the sampled data with original data displayed by a webpage to ensure the accuracy of the data;
the fourth step: and a data classification device. The acquired data is cleaned and verified, and high accuracy is achieved. Inputting the accurate enterprise related data into a data classification device, and classifying the enterprise as belonging places:
1. inputting a classification item, which is a standard geographical administrative division table;
2. inputting data to be classified, including enterprise names, unified social credit codes, registration organs, addresses and other data;
3. classifying the attribution of the enterprise, and sequentially adopting the following methods:
(1) acquiring three-level attribution information of an administrative district according to an enterprise registration authority;
(2) acquiring three-level attribution information of an administrative district according to an enterprise registration address;
(3) acquiring three-level attribution information of an administrative division according to the unified social credit code of the enterprise;
(4) acquiring three-level attribution information of an administrative division according to the name of an enterprise;
(5) the special enterprises comprehensively acquire the three-level attribution information of the administrative division according to the enterprise name, the unified social credit code, the registration authority, the address and the like.
4. And outputting the classification result of the administrative division affiliation of the enterprise.
The fifth step: the data supplement and collection device is used for repeating the steps to classify the enterprises which are not classified in the fourth step again after data supplement and collection;
and a sixth step: a data presentation device. Carrying out visual display on the enterprises which finish the enterprise attribution classification, wherein the visual display comprises but is not limited to maps, tables, documents and the like;
the seventh step: and a data analysis device. And performing subsequent analysis and monitoring on regional dimensions on the enterprises which finish the administrative region attribution classification by combining with other related data, such as sales data.
The above description is only a preferred embodiment of the present invention, and is only used to illustrate the technical solutions of the present invention, and not to limit the protection scope of the present invention. Any modification, equivalent replacement, or improvement made within the spirit and principle of the present invention shall fall within the protection scope of the present invention.

Claims (10)

1. A method for classifying enterprise affiliations,
the method comprises the following steps:
firstly, data acquisition, namely acquiring public information of an online enterprise, and storing acquired original data into a database;
secondly, cleaning data, namely cleaning and denoising the acquired original data;
thirdly, data verification, namely, checking and verifying the cleaned data;
fourthly, classifying the data, namely inputting the relevant data of the enterprise into a data classification device after the acquired data is cleaned and verified, and classifying the attributions of the enterprises;
fifthly, data supplementary collection, namely repeating the steps for reclassification after the data supplementary collection is carried out on the enterprises which are not classified;
sixthly, displaying data, namely visually displaying the enterprises which finish the enterprise attribution classification,
and seventhly, analyzing data, namely combining other related data to analyze and monitor the enterprises which finish the administrative division attribution classification in the subsequent region dimension.
2. The method of claim 1,
the data cleaning process includes: irrelevant and wrong characters (blank spaces, messy codes and redundant characters) are removed, the characters are converted into English half-corner symbols, repeated data are removed, and null value data are supplemented.
3. The method of claim 1,
the data check comprises check on the aspects of integrity, uniqueness and non-nullability; and sampling the data, and comparing the sampled data with the original data displayed on the webpage to ensure the accuracy of the data.
4. The method of claim 1,
the data classification divides the enterprise into attributions and comprises the following steps:
1) inputting a classification item, which is a standard geographical administrative division table;
2) inputting data to be classified, including enterprise names, unified social credit codes, registration authorities and addresses;
3) classifying the attribution of the enterprise, and sequentially adopting the following methods:
3.1) acquiring three-level attribution information of the administrative district according to an enterprise registration authority;
3.2) acquiring three-level attribution information of the administrative division according to the enterprise registration address;
3.3) acquiring three-level attribution information of the administrative division according to the uniform social credit code of the enterprise;
3.4) acquiring three-level attribution information of the administrative division according to the enterprise name;
and 3.5) the special enterprises comprehensively acquire the three-level attribution information of the administrative district according to the enterprise name, the unified social credit code, the registration authority, the address and other information.
4) And outputting the classification result of the administrative division affiliation of the enterprise.
5. The method of claim 1,
the data display content comprises maps, tables and documents.
6. A system for classifying affiliations of an enterprise,
the method comprises the following steps:
the data acquisition device is used for acquiring the public information of the online enterprises and storing the acquired original data into the database;
the data cleaning device is used for cleaning and denoising the acquired original data;
the data checking device is used for checking and checking the cleaned data;
and after the acquired data is cleaned and verified, the data classification device inputs the part of enterprise related data into the data classification device, and the enterprise is divided according to the attribution:
the data supplementing and collecting device is used for classifying the data classification device again after data supplementing and collecting is carried out on the enterprises which are not classified by the data classification device;
the data display device is used for visually displaying the enterprises which finish the enterprise attribution classification;
and the data analysis device is used for performing subsequent analysis and monitoring on regional dimensions on the enterprises which finish the administrative division attribution classification by combining with other related data.
7. The system of claim 6,
the data cleaning process includes: removing irrelevant and wrong characters (blank space, messy codes and redundant characters), converting the characters into English semi-angle symbols, removing repeated data and supplementing null data.
8. The system of claim 6,
the data check comprises check on the aspects of integrity, uniqueness and non-nullability; and sampling the data, and comparing the sampled data with the original data displayed on the webpage to ensure the accuracy of the data.
9. The system of claim 6,
the data classification divides the enterprise into attributions and comprises the following steps:
1) inputting a classification item, which is a standard geographical administrative division table;
2) inputting data to be classified, including enterprise names, unified social credit codes, registration authorities and addresses;
3) classifying the attribution of the enterprise, and sequentially adopting the following methods:
3.1) acquiring three-level attribution information of the administrative district according to an enterprise registration authority;
3.2) acquiring three-level attribution information of the administrative division according to the enterprise registration address;
3.3) acquiring three-level attribution information of the administrative division according to the uniform social credit code of the enterprise;
3.4) acquiring three-level attribution information of the administrative division according to the enterprise name;
and 3.5) the special enterprises comprehensively acquire the three-level attribution information of the administrative district according to the enterprise name, the unified social credit code, the registration authority, the address and other information.
4) And outputting the classification result of the administrative division affiliation of the enterprise.
10. The system of claim 6,
the data display content comprises maps, tables and documents.
CN202210070106.3A 2022-01-21 2022-01-21 Enterprise attribution classification method and system Pending CN114463053A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202210070106.3A CN114463053A (en) 2022-01-21 2022-01-21 Enterprise attribution classification method and system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202210070106.3A CN114463053A (en) 2022-01-21 2022-01-21 Enterprise attribution classification method and system

Publications (1)

Publication Number Publication Date
CN114463053A true CN114463053A (en) 2022-05-10

Family

ID=81410135

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202210070106.3A Pending CN114463053A (en) 2022-01-21 2022-01-21 Enterprise attribution classification method and system

Country Status (1)

Country Link
CN (1) CN114463053A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117689349A (en) * 2024-01-31 2024-03-12 江苏荣泽信息科技股份有限公司 Office personnel-oriented enterprise data rapid splitting and sharing method

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109961324A (en) * 2019-03-19 2019-07-02 山东浪潮云信息技术有限公司 A kind of electric business enterprise stamps the standardization processing method and system of region label
CN110569322A (en) * 2019-07-26 2019-12-13 苏宁云计算有限公司 Address information analysis method, device and system and data acquisition method
CN111523853A (en) * 2020-04-14 2020-08-11 上海资信有限公司 Management method for processing, sorting and storing enterprise credit information
CN112418688A (en) * 2020-11-26 2021-02-26 深圳市中博科创信息技术有限公司 Enterprise data management method based on enterprise service portal

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109961324A (en) * 2019-03-19 2019-07-02 山东浪潮云信息技术有限公司 A kind of electric business enterprise stamps the standardization processing method and system of region label
CN110569322A (en) * 2019-07-26 2019-12-13 苏宁云计算有限公司 Address information analysis method, device and system and data acquisition method
WO2021017679A1 (en) * 2019-07-26 2021-02-04 苏宁易购集团股份有限公司 Address information parsing method and apparatus, system and data acquisition method
CN111523853A (en) * 2020-04-14 2020-08-11 上海资信有限公司 Management method for processing, sorting and storing enterprise credit information
CN112418688A (en) * 2020-11-26 2021-02-26 深圳市中博科创信息技术有限公司 Enterprise data management method based on enterprise service portal

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117689349A (en) * 2024-01-31 2024-03-12 江苏荣泽信息科技股份有限公司 Office personnel-oriented enterprise data rapid splitting and sharing method
CN117689349B (en) * 2024-01-31 2024-04-16 江苏荣泽信息科技股份有限公司 Office personnel-oriented enterprise data rapid splitting and sharing method

Similar Documents

Publication Publication Date Title
CN110781246A (en) Enterprise association relationship construction method and system
CN103605651A (en) Data processing showing method based on on-line analytical processing (OLAP) multi-dimensional analysis
CN110109908B (en) Analysis system and method for mining potential relationship of person based on social basic information
CN104572615A (en) Method and system for on-line case investigation processing
CN112288247B (en) Soil heavy metal risk identification method based on space interaction relationship
CN110825805B (en) Data visualization method and device
CN113032403B (en) Data insight method, device, electronic equipment and storage medium
CN112860905A (en) Text information extraction method, device and equipment and readable storage medium
CN111125284A (en) Integrated space-time big data and geographic information public service cloud platform
CN114463053A (en) Enterprise attribution classification method and system
CN112363996B (en) Method, system and medium for establishing physical model of power grid knowledge graph
CN115759640A (en) Public service information processing system and method for smart city
CN111651432A (en) Suspected contaminated site space-time information identification method
CN113220875A (en) Internet information classification method and system based on industry label and electronic equipment
CN107562605B (en) Information verification system and information verification method for new technology product
CN115718807A (en) Personnel relationship analysis method, device, equipment and storage medium
CN113806311A (en) Deep learning-based file classification method and device, electronic equipment and medium
CN112597763A (en) Method and device for extracting and displaying judicial literature information in association manner and storage medium
CN112364177B (en) Method, system and medium for establishing logic model of power grid knowledge graph
CN113435701B (en) Method and device for processing consumption quality information
CN115577099B (en) Polluted land block boundary identification method, system, medium and equipment
CN113239105B (en) Method for automatically detecting and storing head, head and tail of observation data in field surgery
CN111258865B (en) Processor, alarm data management system and method of multi-informatization system
CN112559844A (en) Natural disaster public opinion analysis method and device
CN113077115A (en) Intelligent financial data analysis system and method

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination