CN114463053A - Enterprise attribution classification method and system - Google Patents
Enterprise attribution classification method and system Download PDFInfo
- Publication number
- CN114463053A CN114463053A CN202210070106.3A CN202210070106A CN114463053A CN 114463053 A CN114463053 A CN 114463053A CN 202210070106 A CN202210070106 A CN 202210070106A CN 114463053 A CN114463053 A CN 114463053A
- Authority
- CN
- China
- Prior art keywords
- data
- enterprise
- classification
- attribution
- enterprises
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q30/00—Commerce
- G06Q30/02—Marketing; Price estimation or determination; Fundraising
- G06Q30/0201—Market modelling; Market analysis; Collecting market data
- G06Q30/0204—Market segmentation
- G06Q30/0205—Location or geographical consideration
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/24—Classification techniques
- G06F18/241—Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
Landscapes
- Engineering & Computer Science (AREA)
- Business, Economics & Management (AREA)
- Strategic Management (AREA)
- Theoretical Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Finance (AREA)
- Development Economics (AREA)
- Accounting & Taxation (AREA)
- Entrepreneurship & Innovation (AREA)
- General Physics & Mathematics (AREA)
- Physics & Mathematics (AREA)
- Evolutionary Computation (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Engineering & Computer Science (AREA)
- Artificial Intelligence (AREA)
- Evolutionary Biology (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Bioinformatics & Computational Biology (AREA)
- Game Theory and Decision Science (AREA)
- Economics (AREA)
- Marketing (AREA)
- General Business, Economics & Management (AREA)
- Life Sciences & Earth Sciences (AREA)
Abstract
The invention provides a method and a system for classifying enterprise affiliation, which belong to the field of data processing.
Description
Technical Field
The invention relates to the technical fields of statistical analysis, data processing and the like, in particular to a method and a system for enterprise attribution classification.
Background
Statistical analysis refers to a process of analyzing a large amount of collected data by using an appropriate statistical analysis method, extracting useful information, forming a conclusion, and performing detailed research and summary on the data. By carrying out statistical analysis on the data, corresponding conclusions can be obtained, and the conclusions can help people to make judgment so as to take proper action.
Data processing is the collection, storage, retrieval, processing, transformation and transmission of data, and is the extraction and derivation of valuable and meaningful data for certain people from a large amount of data that may be cluttered and unintelligible. Data processing is a basic link of system engineering and automatic control, runs through various fields of social production and social life, and greatly influences the development process of the human society.
Enterprises play a very important role as participants in marketing activities, both online and offline. How to accurately and effectively divide the attribution of an enterprise on a regional dimension according to the enterprise business registration information so as to monitor and analyze the sales condition of the enterprise on the regional dimension and the enterprise sales condition summarized on the regional dimension becomes a key point and a difficult point for researching the current business activities.
Disclosure of Invention
In order to solve the technical problems, the invention provides a method for classifying the enterprise attribution, which achieves the purposes of effectively utilizing the acquired enterprise data, accurately and effectively classifying the enterprise attribution and further summarizing regional sales hotspots and particularly performs regional standardized processing on the enterprise.
The technical scheme of the invention is as follows:
a method of enterprise attribution classification, comprising:
firstly, data acquisition, namely acquiring public information of an online enterprise, and storing acquired original data into a database;
secondly, cleaning data, namely cleaning and denoising the acquired original data;
thirdly, data verification, namely, checking and verifying the cleaned data;
fourthly, classifying the data, namely after the acquired data are cleaned and verified, inputting the enterprise related data into a data classification device, and classifying the attributions of the enterprises;
fifthly, data supplementary collection, namely repeating the steps for reclassification after the data supplementary collection is carried out on the enterprises which are not classified;
and sixthly, displaying data, namely performing visual display on the enterprises which finish the enterprise attribution classification, wherein the enterprises comprise maps, tables and documents.
And seventhly, analyzing data, namely combining other related data to analyze and monitor the enterprises which finish the administrative division attribution classification in the subsequent region dimension.
Further, in the above-mentioned case,
the data cleaning process includes: removing irrelevant and wrong characters (blank space, messy codes and redundant characters), converting the characters into English semi-angle symbols, removing repeated data and supplementing null data.
The data check comprises check on the aspects of integrity, uniqueness and non-nullability; and sampling the data, and comparing the sampled data with the original data displayed by the webpage to ensure the accuracy of the data.
The data classification divides the enterprise into attributions and comprises the following steps:
1) inputting a classification item, which is a standard geographical administrative division table;
2) inputting data to be classified, including enterprise names, unified social credit codes, registration authorities and addresses;
3) classifying the attribution of the enterprise, and sequentially adopting the following methods:
3.1) acquiring three-level attribution information of the administrative district according to an enterprise registration authority;
3.2) acquiring three-level attribution information of the administrative division according to the enterprise registration address;
3.3) acquiring three-level attribution information of the administrative division according to the uniform social credit code of the enterprise;
3.4) acquiring three-level attribution information of the administrative division according to the enterprise name;
and 3.5) the special enterprises comprehensively acquire the three-level attribution information of the administrative district according to the enterprise name, the unified social credit code, the registration authority, the address and other information.
4) And outputting the classification result of the administrative division affiliation of the enterprise.
The invention also provides a system for classifying the enterprise affiliation, which comprises the following steps:
the method comprises the following steps:
the data acquisition device is used for acquiring the public information of the online enterprise and storing the acquired original data into the database;
the data cleaning device is used for cleaning and denoising the acquired original data;
the data checking device is used for checking and checking the cleaned data;
and after the acquired data is cleaned and verified, the data classification device inputs the part of enterprise related data into the data classification device, and the enterprise is divided according to the attribution:
the data supplementing and collecting device is used for classifying the data classification device again after data supplementing and collecting is carried out on the enterprises which are not classified by the data classification device;
the data display device is used for visually displaying the enterprises which finish the enterprise attribution classification, and comprises a map, a table and a document;
and the data analysis device is used for performing subsequent analysis and monitoring on regional dimensions on the enterprises which finish the administrative division attribution classification by combining with other related data.
Further, in the above-mentioned case,
the data cleaning process includes: removing irrelevant and wrong characters (blank space, messy codes and redundant characters), converting the characters into English semi-angle symbols, removing repeated data and supplementing null data.
The data check comprises check on the aspects of integrity, uniqueness and non-nullability; and sampling the data, and comparing the sampled data with the original data displayed by the webpage to ensure the accuracy of the data.
The data classification divides the enterprise into attributions and comprises the following steps:
1) inputting a classification item, which is a standard geographical administrative division table;
2) inputting data to be classified, including enterprise names, unified social credit codes, registration authorities and addresses;
3) classifying the attribution of the enterprise, and sequentially adopting the following methods:
3.1) acquiring three-level attribution information of the administrative district according to an enterprise registration authority;
3.2) acquiring three-level attribution information of the administrative division according to the enterprise registration address;
3.3) acquiring three-level attribution information of the administrative division according to the uniform social credit code of the enterprise;
3.4) acquiring three-level attribution information of the administrative division according to the enterprise name;
and 3.5) the special enterprises comprehensively acquire the three-level attribution information of the administrative district according to the enterprise name, the unified social credit code, the registration authority, the address and other information.
4) And outputting the classification result of the administrative division affiliation of the enterprise.
The invention has the advantages that
After the enterprise is classified according to the three-level attribution of the region, the operation and sales data of the enterprise can be classified according to the region, so that the difficulty and the complexity of analysis of the later-stage enterprise data in the region dimension are greatly reduced, the sales hotspots and conclusions of the enterprise in the region dimension can be conveniently summarized, and a large amount of time cost and labor and material cost can be saved for the analysis and the monitoring of the enterprise data.
Drawings
FIG. 1 is a schematic workflow diagram of the present invention.
Detailed Description
In order to make the objects, technical solutions and advantages of the embodiments of the present invention clearer and more complete, the technical solutions in the embodiments of the present invention will be described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are some, but not all, embodiments of the present invention, and based on the embodiments of the present invention, all other embodiments obtained by a person of ordinary skill in the art without creative efforts belong to the scope of the present invention.
According to the invention, the enterprise data is standardized by carrying out method and process design on the acquired enterprise public data, province, city and district three-level region labels are marked for the enterprise, the enterprise is clearly classified in the region of ownership, and further the statistics and analysis of the subsequent enterprise sales data in the region dimension aspect are facilitated.
The method mainly comprises the following steps:
the first step is as follows: a data acquisition device. Collecting public information of the online enterprises, and storing the collected original data in a database;
the second step is that: and a data cleaning device. Cleaning and denoising the collected original data, including but not limited to removing irrelevant and wrong characters (blank spaces, messy codes, redundant characters and the like), converting the characters into English semi-angle symbols, eliminating repeated data, supplementing null data and the like;
the third step: and a data checking device. Checking and checking the cleaned data, including but not limited to checking and checking in terms of integrity, uniqueness, non-nullability and the like; sampling the data, and comparing the sampled data with original data displayed by a webpage to ensure the accuracy of the data;
the fourth step: and a data classification device. The acquired data is cleaned and verified, and high accuracy is achieved. Inputting the accurate enterprise related data into a data classification device, and classifying the enterprise as belonging places:
1. inputting a classification item, which is a standard geographical administrative division table;
2. inputting data to be classified, including enterprise names, unified social credit codes, registration organs, addresses and other data;
3. classifying the attribution of the enterprise, and sequentially adopting the following methods:
(1) acquiring three-level attribution information of an administrative district according to an enterprise registration authority;
(2) acquiring three-level attribution information of an administrative district according to an enterprise registration address;
(3) acquiring three-level attribution information of an administrative division according to the unified social credit code of the enterprise;
(4) acquiring three-level attribution information of an administrative division according to the name of an enterprise;
(5) the special enterprises comprehensively acquire the three-level attribution information of the administrative division according to the enterprise name, the unified social credit code, the registration authority, the address and the like.
4. And outputting the classification result of the administrative division affiliation of the enterprise.
The fifth step: the data supplement and collection device is used for repeating the steps to classify the enterprises which are not classified in the fourth step again after data supplement and collection;
and a sixth step: a data presentation device. Carrying out visual display on the enterprises which finish the enterprise attribution classification, wherein the visual display comprises but is not limited to maps, tables, documents and the like;
the seventh step: and a data analysis device. And performing subsequent analysis and monitoring on regional dimensions on the enterprises which finish the administrative region attribution classification by combining with other related data, such as sales data.
The above description is only a preferred embodiment of the present invention, and is only used to illustrate the technical solutions of the present invention, and not to limit the protection scope of the present invention. Any modification, equivalent replacement, or improvement made within the spirit and principle of the present invention shall fall within the protection scope of the present invention.
Claims (10)
1. A method for classifying enterprise affiliations,
the method comprises the following steps:
firstly, data acquisition, namely acquiring public information of an online enterprise, and storing acquired original data into a database;
secondly, cleaning data, namely cleaning and denoising the acquired original data;
thirdly, data verification, namely, checking and verifying the cleaned data;
fourthly, classifying the data, namely inputting the relevant data of the enterprise into a data classification device after the acquired data is cleaned and verified, and classifying the attributions of the enterprises;
fifthly, data supplementary collection, namely repeating the steps for reclassification after the data supplementary collection is carried out on the enterprises which are not classified;
sixthly, displaying data, namely visually displaying the enterprises which finish the enterprise attribution classification,
and seventhly, analyzing data, namely combining other related data to analyze and monitor the enterprises which finish the administrative division attribution classification in the subsequent region dimension.
2. The method of claim 1,
the data cleaning process includes: irrelevant and wrong characters (blank spaces, messy codes and redundant characters) are removed, the characters are converted into English half-corner symbols, repeated data are removed, and null value data are supplemented.
3. The method of claim 1,
the data check comprises check on the aspects of integrity, uniqueness and non-nullability; and sampling the data, and comparing the sampled data with the original data displayed on the webpage to ensure the accuracy of the data.
4. The method of claim 1,
the data classification divides the enterprise into attributions and comprises the following steps:
1) inputting a classification item, which is a standard geographical administrative division table;
2) inputting data to be classified, including enterprise names, unified social credit codes, registration authorities and addresses;
3) classifying the attribution of the enterprise, and sequentially adopting the following methods:
3.1) acquiring three-level attribution information of the administrative district according to an enterprise registration authority;
3.2) acquiring three-level attribution information of the administrative division according to the enterprise registration address;
3.3) acquiring three-level attribution information of the administrative division according to the uniform social credit code of the enterprise;
3.4) acquiring three-level attribution information of the administrative division according to the enterprise name;
and 3.5) the special enterprises comprehensively acquire the three-level attribution information of the administrative district according to the enterprise name, the unified social credit code, the registration authority, the address and other information.
4) And outputting the classification result of the administrative division affiliation of the enterprise.
5. The method of claim 1,
the data display content comprises maps, tables and documents.
6. A system for classifying affiliations of an enterprise,
the method comprises the following steps:
the data acquisition device is used for acquiring the public information of the online enterprises and storing the acquired original data into the database;
the data cleaning device is used for cleaning and denoising the acquired original data;
the data checking device is used for checking and checking the cleaned data;
and after the acquired data is cleaned and verified, the data classification device inputs the part of enterprise related data into the data classification device, and the enterprise is divided according to the attribution:
the data supplementing and collecting device is used for classifying the data classification device again after data supplementing and collecting is carried out on the enterprises which are not classified by the data classification device;
the data display device is used for visually displaying the enterprises which finish the enterprise attribution classification;
and the data analysis device is used for performing subsequent analysis and monitoring on regional dimensions on the enterprises which finish the administrative division attribution classification by combining with other related data.
7. The system of claim 6,
the data cleaning process includes: removing irrelevant and wrong characters (blank space, messy codes and redundant characters), converting the characters into English semi-angle symbols, removing repeated data and supplementing null data.
8. The system of claim 6,
the data check comprises check on the aspects of integrity, uniqueness and non-nullability; and sampling the data, and comparing the sampled data with the original data displayed on the webpage to ensure the accuracy of the data.
9. The system of claim 6,
the data classification divides the enterprise into attributions and comprises the following steps:
1) inputting a classification item, which is a standard geographical administrative division table;
2) inputting data to be classified, including enterprise names, unified social credit codes, registration authorities and addresses;
3) classifying the attribution of the enterprise, and sequentially adopting the following methods:
3.1) acquiring three-level attribution information of the administrative district according to an enterprise registration authority;
3.2) acquiring three-level attribution information of the administrative division according to the enterprise registration address;
3.3) acquiring three-level attribution information of the administrative division according to the uniform social credit code of the enterprise;
3.4) acquiring three-level attribution information of the administrative division according to the enterprise name;
and 3.5) the special enterprises comprehensively acquire the three-level attribution information of the administrative district according to the enterprise name, the unified social credit code, the registration authority, the address and other information.
4) And outputting the classification result of the administrative division affiliation of the enterprise.
10. The system of claim 6,
the data display content comprises maps, tables and documents.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210070106.3A CN114463053A (en) | 2022-01-21 | 2022-01-21 | Enterprise attribution classification method and system |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210070106.3A CN114463053A (en) | 2022-01-21 | 2022-01-21 | Enterprise attribution classification method and system |
Publications (1)
Publication Number | Publication Date |
---|---|
CN114463053A true CN114463053A (en) | 2022-05-10 |
Family
ID=81410135
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202210070106.3A Pending CN114463053A (en) | 2022-01-21 | 2022-01-21 | Enterprise attribution classification method and system |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN114463053A (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN117689349A (en) * | 2024-01-31 | 2024-03-12 | 江苏荣泽信息科技股份有限公司 | Office personnel-oriented enterprise data rapid splitting and sharing method |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109961324A (en) * | 2019-03-19 | 2019-07-02 | 山东浪潮云信息技术有限公司 | A kind of electric business enterprise stamps the standardization processing method and system of region label |
CN110569322A (en) * | 2019-07-26 | 2019-12-13 | 苏宁云计算有限公司 | Address information analysis method, device and system and data acquisition method |
CN111523853A (en) * | 2020-04-14 | 2020-08-11 | 上海资信有限公司 | Management method for processing, sorting and storing enterprise credit information |
CN112418688A (en) * | 2020-11-26 | 2021-02-26 | 深圳市中博科创信息技术有限公司 | Enterprise data management method based on enterprise service portal |
-
2022
- 2022-01-21 CN CN202210070106.3A patent/CN114463053A/en active Pending
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109961324A (en) * | 2019-03-19 | 2019-07-02 | 山东浪潮云信息技术有限公司 | A kind of electric business enterprise stamps the standardization processing method and system of region label |
CN110569322A (en) * | 2019-07-26 | 2019-12-13 | 苏宁云计算有限公司 | Address information analysis method, device and system and data acquisition method |
WO2021017679A1 (en) * | 2019-07-26 | 2021-02-04 | 苏宁易购集团股份有限公司 | Address information parsing method and apparatus, system and data acquisition method |
CN111523853A (en) * | 2020-04-14 | 2020-08-11 | 上海资信有限公司 | Management method for processing, sorting and storing enterprise credit information |
CN112418688A (en) * | 2020-11-26 | 2021-02-26 | 深圳市中博科创信息技术有限公司 | Enterprise data management method based on enterprise service portal |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN117689349A (en) * | 2024-01-31 | 2024-03-12 | 江苏荣泽信息科技股份有限公司 | Office personnel-oriented enterprise data rapid splitting and sharing method |
CN117689349B (en) * | 2024-01-31 | 2024-04-16 | 江苏荣泽信息科技股份有限公司 | Office personnel-oriented enterprise data rapid splitting and sharing method |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN110781246A (en) | Enterprise association relationship construction method and system | |
CN103605651A (en) | Data processing showing method based on on-line analytical processing (OLAP) multi-dimensional analysis | |
CN110109908B (en) | Analysis system and method for mining potential relationship of person based on social basic information | |
CN104572615A (en) | Method and system for on-line case investigation processing | |
CN112288247B (en) | Soil heavy metal risk identification method based on space interaction relationship | |
CN110825805B (en) | Data visualization method and device | |
CN113032403B (en) | Data insight method, device, electronic equipment and storage medium | |
CN112860905A (en) | Text information extraction method, device and equipment and readable storage medium | |
CN111125284A (en) | Integrated space-time big data and geographic information public service cloud platform | |
CN114463053A (en) | Enterprise attribution classification method and system | |
CN112363996B (en) | Method, system and medium for establishing physical model of power grid knowledge graph | |
CN115759640A (en) | Public service information processing system and method for smart city | |
CN111651432A (en) | Suspected contaminated site space-time information identification method | |
CN113220875A (en) | Internet information classification method and system based on industry label and electronic equipment | |
CN107562605B (en) | Information verification system and information verification method for new technology product | |
CN115718807A (en) | Personnel relationship analysis method, device, equipment and storage medium | |
CN113806311A (en) | Deep learning-based file classification method and device, electronic equipment and medium | |
CN112597763A (en) | Method and device for extracting and displaying judicial literature information in association manner and storage medium | |
CN112364177B (en) | Method, system and medium for establishing logic model of power grid knowledge graph | |
CN113435701B (en) | Method and device for processing consumption quality information | |
CN115577099B (en) | Polluted land block boundary identification method, system, medium and equipment | |
CN113239105B (en) | Method for automatically detecting and storing head, head and tail of observation data in field surgery | |
CN111258865B (en) | Processor, alarm data management system and method of multi-informatization system | |
CN112559844A (en) | Natural disaster public opinion analysis method and device | |
CN113077115A (en) | Intelligent financial data analysis system and method |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |