CN113283972A - System and method for constructing tax big data model - Google Patents

System and method for constructing tax big data model Download PDF

Info

Publication number
CN113283972A
CN113283972A CN202110492123.1A CN202110492123A CN113283972A CN 113283972 A CN113283972 A CN 113283972A CN 202110492123 A CN202110492123 A CN 202110492123A CN 113283972 A CN113283972 A CN 113283972A
Authority
CN
China
Prior art keywords
tax
data
service
data model
business
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202110492123.1A
Other languages
Chinese (zh)
Inventor
胡立禄
黄小蕊
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Individual filed Critical Individual
Priority to CN202110492123.1A priority Critical patent/CN113283972A/en
Publication of CN113283972A publication Critical patent/CN113283972A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q40/00Finance; Insurance; Tax strategies; Processing of corporate or income taxes
    • G06Q40/12Accounting
    • G06Q40/123Tax preparation or submission
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/21Design, administration or maintenance of databases
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Systems or methods specially adapted for specific business sectors, e.g. utilities or tourism
    • G06Q50/10Services
    • G06Q50/26Government or public services

Abstract

The embodiment of the invention relates to the field of tax, and particularly discloses a system and a method for constructing a tax big data model. The embodiment of the invention stores the tax business data into the NoSQL database in a centralized way; extracting effective tax service data in the NoSQL database; performing data collision on the effective tax service data to obtain standard tax service data; extracting influence molecules in the standard tax business data, and constructing a tax big data model according to the influence molecules; carrying out accuracy verification on the tax big data model; and constructing a service point tax data model according to the tax big data model passing the verification and the tax service data of each service point. The embodiment of the invention can carry out standardized processing on tax business data of different departments, different service points and different types, thereby completing the construction of a tax big data model, and further promoting the management big data, the refinement of tax handling service and the continuity of business improvement of the tax authority.

Description

System and method for constructing tax big data model
Technical Field
The invention belongs to the field of tax, and particularly relates to a system and a method for constructing a tax big data model.
Background
Tax is the abbreviation of tax affairs work. It is divided into broad and narrow terms. The generalized tax is tax based on national political power, and participates in all works in the whole process of national income distribution, including research, formulation, propaganda, implementation and execution of tax policy; establishing, adjusting, revising, reforming, perfecting, publicizing, explaining, consulting and executing tax legal system; tax in the narrow sense generally refers to the collection and management of tax.
Big data is a basic resource in China, and has very important significance for developing overall data analysis and decision-making by tax departments. Along with the development of social economy, the number of taxpayers is also increasing continuously, and the trend of enterprises towards intellectualization and internationalization is more obvious. In this situation, the difficulty factor of tax risk management is also increasing. Therefore, it is necessary to construct a tax big data model to promote tax authorities to manage big data, refine tax services and keep business improvement. However, due to the problems of division of tax business bars, extensive management and the like, most business systems of all departments are independently constructed and self-made systems, and the tax authority systems of all the regions are respectively administrative and are not uniform in standard, so that the businesses are difficult to cooperate, data cannot be uniformly processed, and a tax big data model is difficult to construct.
Disclosure of Invention
The embodiment of the invention aims to provide a system and a method for constructing a tax big data model, and aims to solve the problems in the background art.
In order to achieve the above purpose, the embodiments of the present invention provide the following technical solutions:
a construction system of a tax big data model comprises a data processing terminal and at least two data acquisition terminals, wherein:
the data acquisition terminal is used for acquiring tax service data of different tax service systems and sending the tax service data to the data processing terminal;
the data processing terminal is used for storing the tax business data into a NoSQL database in a centralized way; extracting effective tax service data in the NoSQL database; performing data collision on the effective tax service data to obtain standard tax service data; extracting influence molecules in the standard tax business data, and constructing a tax big data model according to the influence molecules; carrying out accuracy verification on the tax big data model; and constructing a service point tax data model according to the tax big data model passing the verification and the tax service data of each service point.
As a further limitation of the preferred embodiment of the present invention, the data processing terminal specifically includes:
the NoSQL database is used for intensively storing the tax business data;
the effective tax business data extraction unit is used for extracting effective tax business data in the NoSQL database;
the standard tax business data acquisition unit is used for carrying out data collision on the effective tax business data to obtain standard tax business data;
the tax big data model building unit is used for extracting influence molecules in the standard tax business data and building a tax big data model according to the influence molecules;
the accuracy verification unit is used for verifying the accuracy of the tax big data model;
and the service point tax data model building unit is used for building a service point tax data model according to the tax big data model passing the verification and the tax service data of each service point.
As a further limitation of the preferred embodiment of the present invention, the effective tax service data extraction unit specifically includes:
the data length calculation module is used for calculating the data length of the tax service data in the NoSQL database;
the data length comparison module is used for comparing the data length of the tax business data with the size of the defined effective data length;
and the effective data extraction module is used for extracting the tax service data of which the data length is not less than the effective data length.
As a further limitation of the preferred embodiment of the present invention, the standard tax service data acquiring unit specifically includes:
the data collision module is used for carrying out data collision on the effective tax business data;
and the standard tax business data acquisition module is used for sorting the effective tax business data after the data collision to obtain the standard tax business data.
As a further limitation of the preferred embodiment of the present invention, the tax big data model building unit specifically includes:
the influencing molecule extracting module is used for extracting influencing molecules in the standard tax business data;
the training set generating module is used for generating a training set according to the influence molecules;
and the tax big data model building module is used for building a tax big data model according to the training set.
As a further limitation of the preferred embodiment of the present invention, the accuracy verification unit specifically includes:
the test set generating module is used for generating a test set according to the influencing molecules;
and the accuracy verification module is used for verifying the accuracy of the tax big data model according to the test set.
As a further limitation of the preferred embodiment of the present invention, the service point tax data model building unit specifically includes:
the tax business data classification module is used for classifying the tax business data according to different service points;
and the service point tax data model building module is used for building a corresponding service point tax data model according to the tax service data and the tax big data model of the corresponding service point.
A construction method of a tax big data model is characterized by being applied to a data processing terminal, and specifically comprises the following steps:
storing the tax business data into a NoSQL database in a centralized way;
extracting effective tax service data in the NoSQL database;
performing data collision on the effective tax service data to obtain standard tax service data;
extracting influence molecules in the standard tax business data, and constructing a tax big data model according to the influence molecules;
carrying out accuracy verification on the tax big data model;
and constructing a service point tax data model according to the tax big data model passing the verification and the tax service data of each service point.
As a further limitation of the preferred embodiment of the present invention, the extracting effective tax business data in the NoSQL database specifically includes:
calculating the data length of the tax service data in the NoSQL database;
comparing the data length of the tax business data with the size of the defined effective data length;
and extracting the tax service data with the data length not less than the effective data length.
As a further limitation of the preferred embodiment of the present invention, the extracting the influencing factors from the standard tax business data, and constructing the tax big data model according to the influencing factors specifically includes:
extracting influencing molecules in the standard tax business data;
generating a training set according to the influencing molecules;
and constructing a tax big data model according to the training set.
Compared with the prior art, the invention has the beneficial effects that:
the embodiment of the invention stores the tax business data into the NoSQL database in a centralized way; extracting effective tax service data in the NoSQL database; performing data collision on the effective tax service data to obtain standard tax service data; extracting influence molecules in the standard tax business data, and constructing a tax big data model according to the influence molecules; carrying out accuracy verification on the tax big data model; and constructing a service point tax data model according to the tax big data model passing the verification and the tax service data of each service point. The embodiment of the invention can carry out standardized processing on tax business data of different departments, different service points and different types, thereby completing the construction of a tax big data model and promoting the management big data, the tax handling service refinement and the business improvement persistence of the tax authority.
Drawings
In order to more clearly illustrate the technical solutions in the embodiments of the present invention, the drawings used in the description of the embodiments or the prior art will be briefly described below, and it is obvious that the drawings in the following description are only some embodiments of the present invention.
Fig. 1 shows an application architecture diagram of a system provided by an embodiment of the present invention.
Fig. 2 shows a block diagram of a data processing terminal according to an embodiment of the present invention.
Fig. 3 shows a block diagram of an effective tax service data extraction unit in the system according to the embodiment of the present invention.
Fig. 4 shows a block diagram of a standard tax service data obtaining unit in the system according to the embodiment of the present invention.
Fig. 5 shows a structural block diagram of a tax big data model building unit in the system according to the embodiment of the present invention.
Fig. 6 shows a block diagram of an accuracy verification unit in the system according to the embodiment of the present invention.
Fig. 7 shows a flowchart of a service point tax data model building unit in the system according to the embodiment of the present invention.
Fig. 8 shows a flow chart of a method provided by an embodiment of the invention.
Fig. 9 shows a specific flowchart one of the method provided by the embodiment of the present invention.
Fig. 10 shows a specific flowchart two of the method provided by the embodiment of the present invention.
Detailed Description
In order to make the objects, technical solutions and advantages of the present invention more apparent, the present invention is described in further detail below with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described herein are merely illustrative of the invention and are not intended to limit the invention.
It can be understood that in the prior art, with the development of socio-economic, the number of taxpayers is increasing, and the trend of enterprises towards intellectualization and internationalization is more obvious. In this situation, the difficulty factor of tax risk management is also increasing. Therefore, it is necessary to construct a tax big data model to promote tax authorities to manage big data, refine tax services and keep business improvement. However, due to the problems of division of tax business bars, extensive management and the like, most business systems of all departments are independently constructed and self-made systems, and the tax authority systems of all the regions are respectively administrative and are not uniform in standard, so that the businesses are difficult to cooperate, data cannot be uniformly processed, and a tax big data model is difficult to construct.
In order to solve the above problems, in the embodiments of the present invention, tax business data is centrally stored in a NoSQL database; extracting effective tax service data in the NoSQL database; performing data collision on the effective tax service data to obtain standard tax service data; extracting influence molecules in the standard tax business data, and constructing a tax big data model according to the influence molecules; carrying out accuracy verification on the tax big data model; and constructing a service point tax data model according to the tax big data model passing the verification and the tax service data of each service point. The embodiment of the invention can carry out standardized processing on tax business data of different departments, different service points and different types, thereby completing the construction of a tax big data model and promoting the management big data, the tax handling service refinement and the business improvement persistence of the tax authority.
Specifically, as shown in fig. 1, fig. 1 is a diagram illustrating an application architecture of a system provided by an embodiment of the present invention.
Specifically, the system for constructing the tax big data model comprises a data processing terminal 101 and at least two data acquisition terminals 102 and 103.
It should be understood that the number of data collection terminals in fig. 1 is merely illustrative. According to the implementation requirement, any number of data acquisition terminals can be provided, the number of the data acquisition terminals is at least two, and the data acquired by the data acquisition terminals are analyzed and processed through one data processing terminal.
Specifically, in a preferred embodiment of the present invention, the embodiment provides a system, wherein:
the data acquisition terminal 102 is configured to acquire tax service data of different tax service systems, and send the tax service data to a data processing terminal. The data acquisition terminal 102 is arranged in a service database of each tax transaction system, comprises a Jinsan system, an online transaction hall, a self-service transaction terminal and the like, can acquire tax transaction data of different types, and uploads the data to the data processing terminal 101.
The data processing terminal 101 is configured to store the tax service data into a NoSQL database in a centralized manner; extracting effective tax service data in the NoSQL database; performing data collision on the effective tax service data to obtain standard tax service data; extracting influence molecules in the standard tax business data, and constructing a tax big data model according to the influence molecules; carrying out accuracy verification on the tax big data model; and constructing a service point tax data model according to the tax big data model passing the verification and the tax service data of each service point. The data processing terminal 101 may be an independent physical server or terminal, may also be a server cluster formed by a plurality of physical servers, and may be a cloud server providing basic cloud computing services such as a cloud server, a cloud database, a cloud storage, and a CDN.
FIG. 2 is a block diagram of a data processing terminal 101 according to an embodiment of the present invention
In a preferred embodiment provided by the present invention, the data processing terminal 101 specifically includes:
and the NoSQL database 1011 is used for intensively storing the tax business data.
Specifically, a NoSQL database 1011 is built on a cloud platform in the data processing terminal 101, and various tax business data collected by the data collection terminal 102 are stored.
It should be understood that in the embodiment of the present invention, the NoSQL database refers to a non-relational database, which is different from a relational database, and they do not guarantee the ACID property of the relational data, and there is no relation between the data, so that the expansion is very easy. Intangibles also bring scalability at the architectural level. The NoSQL database has very high read-write performance, and especially has excellent performance under the condition of large data volume.
An effective tax service data extracting unit 1012, configured to extract effective tax service data in the NoSQL database.
It can be understood that, in the embodiment of the present invention, the tax business data stored in the NoSQL database are huge and disordered, many tax business data are incomplete, or the effectiveness is not high, and all information in the whole tax business data process cannot be contained, so the effective tax business data in the NoSQL database needs to be extracted.
Specifically, fig. 3 shows a block diagram of an effective tax service data extraction unit 1012 in the system according to the embodiment of the present invention.
In an embodiment of the present invention, the effective tax service data extracting unit 1012 specifically includes:
a data length calculating module 10121, configured to calculate a data length of the tax service data in the NoSQL database.
Specifically, the data length represents the integrity of the data to a certain extent, and the data length calculating module 10121 calculates the data length of each piece of tax service data in the NoSQL database.
The data length comparison module 10122 is configured to compare the data length of the tax service data with a size that defines the valid data length.
Specifically, a data length defined as valid data is preset, and the data length comparison module 10122 compares the data length of each piece of tax service data calculated by the data length calculation module 10121 with the data length defined as valid data.
The valid data extraction module 10123 is configured to extract tax service data with a data length not less than the valid data length.
Specifically, the valid data extraction module 10123 determines whether the data length of each piece of tax service data is not less than the valid data length, and extracts the tax service data not less than the valid data length to obtain valid tax service data.
Further, in a preferred embodiment provided by the present invention, the data processing terminal 101 further includes:
and a standard tax business data obtaining unit 1013, configured to perform data collision on the valid tax business data to obtain standard tax business data.
Specifically, the standard tax business data may include data in various forms such as files, geographic information, logs, pictures, and the like, where corresponding information such as tax type, time, place, and task is disordered, and therefore, effective tax business data needs to be converted into standard tax business data including the whole tax process.
Specifically, fig. 4 shows a block diagram of a standard tax service data acquiring unit 1013 in the system according to the embodiment of the present invention.
In a preferred embodiment of the present invention, the standard tax service data acquiring unit 1013 specifically includes:
the data collision module 10131 is configured to perform data collision on the valid tax service data.
Specifically, the data collision module 10131 performs processing such as cleaning, conversion, collision and the like on each effective tax service data, so that the effective tax service data can restore the information of the tax service transaction process, including tax type, time, place, task and the like.
A standard tax business data obtaining module 10132, configured to sort the valid tax business data after the data collision, to obtain the standard tax business data.
Specifically, the standard tax business data obtaining module 10132 performs data collision on the valid tax business data and then sorts the valid tax business data to obtain the standard tax business data.
Further, in a preferred embodiment provided by the present invention, the data processing terminal 101 further includes:
the tax big data model building unit 1014 is configured to extract an influencing molecule in the standard tax business data, and build a tax big data model according to the influencing molecule.
Specifically, the influence molecules in the standard tax business data can be screened through business experience or intelligent judgment of a system, and a tax big data model is constructed according to the influence molecules.
It can be understood that some of the tax transaction items have obvious periodicity, and some have strong randomness, for example, some are related to holidays and tax periods, and the tax handling number of each tax service point is different, so that influencing molecules in some standard tax service data need to be screened, and a tax big data model is constructed according to the influencing molecules to predict changes behind the influencing molecules.
Specifically, fig. 5 shows a block diagram of a tax big data model building unit 1014 in the system according to the embodiment of the present invention.
In an embodiment of the present invention, the tax big data model building unit 1014 specifically includes:
an influence molecule extracting module 10141, configured to extract influence molecules in the standard tax business data;
specifically, the influencer module 10141 can automatically or according to business experience select influencer, such as tax, holidays, etc.
A training set generating module 10142, configured to generate a training set according to the influencing molecules;
specifically, the training set generating module 10142 generates a training set according to the selected influencing molecules, and the training set is randomly extracted to account for more than 75% of the influencing molecule samples and used for estimating the model.
A tax big data model building module 10143, configured to build a tax big data model according to the training set.
Specifically, the tax big data model building module 10143 builds the tax big data model according to the training set extracted randomly.
Further, in a preferred embodiment provided by the present invention, the data processing terminal 101 further includes:
and an accuracy verification unit 1015, configured to perform accuracy verification on the tax big data model.
Specifically, the accuracy of the tax big data model can be verified through new data or some data remained before in the tax process, and whether the tax big data model achieves the expected effect is checked.
Specifically, fig. 6 shows a block diagram of an accuracy verification unit 1015 in the system according to the embodiment of the present invention.
In a preferred embodiment provided by the present invention, the accuracy verification unit 1015 specifically includes:
a test set generating module 10151, configured to generate a test set according to the influencing molecules.
Specifically, the remaining data less than 25% of the molecular samples are affected, plus new data from the tax process, to create a test set that can verify how well the final selected optimal model performs.
And the accuracy verification module 10152 is used for verifying the accuracy of the tax big data model according to the test set.
Specifically, the accuracy verification module 10152 checks the accuracy of the tax big data model according to the test set generated by the test set generation module 10151.
Further, in a preferred embodiment provided by the present invention, the data processing terminal 101 further includes:
the service point tax data model building unit 1016 is configured to build a service point tax data model according to the verified tax big data model and the tax data of each service point.
Specifically, after the tax big data model is verified to achieve the expected effect, the tax data model of each service point can be constructed through the independent tax business data of each tax point based on the tax big data model.
Specifically, fig. 7 shows a block diagram of a service point tax data model building unit 1016 in the system according to the embodiment of the present invention.
In an embodiment of the present invention, the service point tax data model building unit 1016 includes:
the tax business data classification module 10161 is configured to classify the tax business data according to different service points.
Specifically, the tax business data classification module 10161 may classify the tax business data according to different service points, so as to classify the standard tax business data according to different service points.
A service point tax data model building module 10162, configured to build a corresponding service point tax data model according to the tax service data and the tax big data model of the corresponding service point.
Specifically, the service point tax data model building module 10162 builds a tax data model of each service point according to the independent tax business data of each tax point based on the tax big data model.
It can be understood that, for each service point, because the faced regions are different, the corresponding services and the service time are also different, and after the tax big data model is built and checked, the tax data model of each service point can be built through the independent tax service data of each tax point based on the tax big data model so as to guide the tax services of the corresponding service point.
Further, fig. 8 shows a flowchart of a method provided by the embodiment of the present invention.
In another preferred embodiment provided by the present invention, a method for constructing a tax big data model is applied to a data processing terminal 101, and the method specifically includes:
and step S101, storing the tax business data into a NoSQL database in a centralized manner.
Specifically, the tax service data collected by at least two data collection terminals 102 and 103 is centrally stored in the NoSQL database.
And step S102, extracting effective tax service data in the NoSQL database.
Specifically, the tax business data stored in the NoSQL database are huge and messy, so that valid tax business data in the NoSQL database needs to be extracted.
Specifically, fig. 9 shows a specific flowchart one of the method provided in the embodiment of the present invention.
In a preferred embodiment of the present invention, the extracting effective tax service data from the NoSQL database specifically includes:
and step S1021, calculating the data length of the tax business data in the NoSQL database.
Specifically, the data length represents the integrity of the data to some extent, so that the data length of each piece of tax business data in the NoSQL database needs to be calculated.
In step S1022, the data length of the tax service data is compared with the size of the defined effective data length.
Specifically, a data length defined as valid data is preset, and the calculated data length of each piece of tax service data is compared with the defined valid data length.
Step S1023, the tax business data with the data length not less than the effective data length is extracted.
Specifically, whether the data length of each piece of tax service data is not less than the effective data length is judged, and the tax service data not less than the effective data length is extracted to obtain the effective tax service data.
Further, in a preferred embodiment provided by the present invention, the method specifically includes:
and step S103, performing data collision on the effective tax service data to obtain standard tax service data.
Specifically, each effective tax business data is cleaned, converted, collided and the like, so that the effective tax business data can restore the information of the tax business handling process, including tax type, time, place, task and the like.
And step S104, extracting influence molecules in the standard tax business data, and constructing a tax big data model according to the influence molecules.
Specifically, through business experience or intelligent judgment of a system, influence molecules in standard tax business data are screened, and a tax big data model is constructed according to the influence molecules.
Specifically, fig. 10 shows a specific flowchart of a method provided in the embodiment of the present invention.
In a preferred embodiment provided by the present invention, the extracting an influencing molecule from the standard tax business data, and constructing a tax big data model according to the influencing molecule specifically includes:
and step S1041, extracting influencing molecules in the standard tax business data.
Specifically, the selection of influencing molecules, such as tax period, holidays, etc., is performed automatically or based on business experience.
Step S1042, generating a training set according to the influencing molecules.
Specifically, a training set is generated according to the selected influencing molecules, and the training set is randomly extracted to account for more than 75% of influencing molecule samples and is used for estimating the model.
And S1043, constructing a tax big data model according to the training set.
Further, in a preferred embodiment provided by the present invention, the method specifically includes:
and step S105, carrying out accuracy verification on the tax big data model.
Specifically, according to new data or some data remained before in the tax process, the accuracy of the tax big data model is verified, and whether the tax big data model achieves the expected effect is checked.
And step S106, constructing a service point tax data model according to the tax big data model passing the verification and the tax service data of each service point.
Specifically, after the tax big data model is verified to achieve the expected effect, the tax data model of each service point is constructed through the independent tax business data of each tax point based on the tax big data model.
In summary, in the embodiment of the present invention, the tax business data is stored in the NoSQL database in a centralized manner; extracting effective tax service data in the NoSQL database; performing data collision on the effective tax service data to obtain standard tax service data; extracting influence molecules in the standard tax business data, and constructing a tax big data model according to the influence molecules; carrying out accuracy verification on the tax big data model; and constructing a service point tax data model according to the tax big data model passing the verification and the tax service data of each service point. The embodiment of the invention can carry out standardized processing on tax business data of different departments, different service points and different types, thereby completing the construction of a tax big data model, and further promoting the management big data, the refinement of tax handling service and the continuity of business improvement of the tax authority.
It should be understood that, although the steps in the flowcharts of the embodiments of the present invention are shown in sequence as indicated by the arrows, the steps are not necessarily performed in sequence as indicated by the arrows. The steps are not performed in the exact order shown and described, and may be performed in other orders, unless explicitly stated otherwise. Moreover, at least a portion of the steps in various embodiments may include multiple sub-steps or multiple stages that are not necessarily performed at the same time, but may be performed at different times, and the order of performance of the sub-steps or stages is not necessarily sequential, but may be performed in turn or alternately with other steps or at least a portion of the sub-steps or stages of other steps.
It will be understood by those skilled in the art that all or part of the processes of the methods of the embodiments described above can be implemented by a computer program, which can be stored in a non-volatile computer-readable storage medium, and can include the processes of the embodiments of the methods described above when the program is executed. Any reference to memory, storage, database, or other medium used in the embodiments provided herein may include non-volatile and/or volatile memory, among others. Non-volatile memory can include read-only memory (ROM), Programmable ROM (PROM), Electrically Programmable ROM (EPROM), Electrically Erasable Programmable ROM (EEPROM), or flash memory. Volatile memory can include Random Access Memory (RAM) or external cache memory. By way of illustration and not limitation, RAM is available in a variety of forms such as Static RAM (SRAM), Dynamic RAM (DRAM), Synchronous DRAM (SDRAM), Double Data Rate SDRAM (DDRSDRAM), Enhanced SDRAM (ESDRAM), Synchronous Link DRAM (SLDRAM), Rambus Direct RAM (RDRAM), direct bus dynamic RAM (DRDRAM), and memory bus dynamic RAM (RDRAM).
The technical features of the embodiments described above may be arbitrarily combined, and for the sake of brevity, all possible combinations of the technical features in the embodiments described above are not described, but should be considered as being within the scope of the present specification as long as there is no contradiction between the combinations of the technical features.
The above-mentioned embodiments only express several embodiments of the present invention, and the description thereof is more specific and detailed, but not construed as limiting the scope of the present invention. It should be noted that, for a person skilled in the art, several variations and modifications can be made without departing from the inventive concept, which falls within the scope of the present invention. Therefore, the protection scope of the present patent shall be subject to the appended claims.
The above description is only for the purpose of illustrating the preferred embodiments of the present invention and is not to be construed as limiting the invention, and any modifications, equivalents and improvements made within the spirit and principle of the present invention are intended to be included within the scope of the present invention.

Claims (10)

1. The construction system of the tax big data model is characterized by comprising a data processing terminal and at least two data acquisition terminals, wherein:
the data acquisition terminal is used for acquiring tax service data of different tax service systems and sending the tax service data to the data processing terminal;
the data processing terminal is used for storing the tax business data into a NoSQL database in a centralized way; extracting effective tax service data in the NoSQL database; performing data collision on the effective tax service data to obtain standard tax service data; extracting influence molecules in the standard tax business data, and constructing a tax big data model according to the influence molecules; carrying out accuracy verification on the tax big data model; and constructing a service point tax data model according to the tax big data model passing the verification and the tax service data of each service point.
2. The tax big data model building system according to claim 1, wherein the data processing terminal specifically comprises:
the NoSQL database is used for intensively storing the tax business data;
the effective tax business data extraction unit is used for extracting effective tax business data in the NoSQL database;
the standard tax business data acquisition unit is used for carrying out data collision on the effective tax business data to obtain standard tax business data;
the tax big data model building unit is used for extracting influence molecules in the standard tax business data and building a tax big data model according to the influence molecules;
the accuracy verification unit is used for verifying the accuracy of the tax big data model;
and the service point tax data model building unit is used for building a service point tax data model according to the tax big data model passing the verification and the tax service data of each service point.
3. The system for constructing the tax big data model according to claim 2, wherein the effective tax business data extraction unit specifically comprises:
the data length calculation module is used for calculating the data length of the tax service data in the NoSQL database;
the data length comparison module is used for comparing the data length of the tax business data with the size of the defined effective data length;
and the effective data extraction module is used for extracting the tax service data of which the data length is not less than the effective data length.
4. The system for constructing the tax big data model according to claim 2, wherein the standard tax business data obtaining unit specifically comprises:
the data collision module is used for carrying out data collision on the effective tax business data;
and the standard tax business data acquisition module is used for sorting the effective tax business data after the data collision to obtain the standard tax business data.
5. The tax big data model building system according to claim 2, wherein the tax big data model building unit specifically comprises:
the influencing molecule extracting module is used for extracting influencing molecules in the standard tax business data;
the training set generating module is used for generating a training set according to the influence molecules;
and the tax big data model building module is used for building a tax big data model according to the training set.
6. The tax big data model building system according to claim 2, wherein the accuracy verification unit specifically comprises:
the test set generating module is used for generating a test set according to the influencing molecules;
and the accuracy verification module is used for verifying the accuracy of the tax big data model according to the test set.
7. The tax big data model building system according to claim 2, wherein the service point tax data model building unit specifically comprises:
the tax business data classification module is used for classifying the tax business data according to different service points;
and the service point tax data model building module is used for building a corresponding service point tax data model according to the tax service data and the tax big data model of the corresponding service point.
8. A construction method of a tax big data model is characterized by being applied to a data processing terminal, and specifically comprises the following steps:
storing the tax business data into a NoSQL database in a centralized way;
extracting effective tax service data in the NoSQL database;
performing data collision on the effective tax service data to obtain standard tax service data;
extracting influence molecules in the standard tax business data, and constructing a tax big data model according to the influence molecules;
carrying out accuracy verification on the tax big data model;
and constructing a service point tax data model according to the tax big data model passing the verification and the tax service data of each service point.
9. The method of claim 8, wherein the extracting valid tax business data in the NoSQL database specifically comprises:
calculating the data length of the tax service data in the NoSQL database;
comparing the data length of the tax business data with the size of the defined effective data length;
and extracting the tax service data with the data length not less than the effective data length.
10. The method of claim 8, wherein the extracting the influencing factors from the standard tax data and constructing the tax big data model according to the influencing factors specifically comprises:
extracting influencing molecules in the standard tax business data;
generating a training set according to the influencing molecules;
and constructing a tax big data model according to the training set.
CN202110492123.1A 2021-05-06 2021-05-06 System and method for constructing tax big data model Pending CN113283972A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110492123.1A CN113283972A (en) 2021-05-06 2021-05-06 System and method for constructing tax big data model

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110492123.1A CN113283972A (en) 2021-05-06 2021-05-06 System and method for constructing tax big data model

Publications (1)

Publication Number Publication Date
CN113283972A true CN113283972A (en) 2021-08-20

Family

ID=77278136

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110492123.1A Pending CN113283972A (en) 2021-05-06 2021-05-06 System and method for constructing tax big data model

Country Status (1)

Country Link
CN (1) CN113283972A (en)

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105139079A (en) * 2015-07-30 2015-12-09 广州时韵信息科技有限公司 Tax revenue prediction method and device based on hybrid model
CN109165249A (en) * 2018-08-07 2019-01-08 阿里巴巴集团控股有限公司 Data processing model construction method, device, server and user terminal
CN109657947A (en) * 2018-12-06 2019-04-19 西安交通大学 A kind of method for detecting abnormality towards enterprises ' industry classification
EP3567539A1 (en) * 2018-05-09 2019-11-13 Volvo Car Corporation Method and system for orchestrating multi-party services using semi-cooperative nash equilibrium based on artificial intelligence, neural network models, reinforcement learning and finite-state automata
CN111047423A (en) * 2019-11-01 2020-04-21 支付宝(杭州)信息技术有限公司 Risk determination method and device and electronic equipment
CN111931991A (en) * 2020-07-14 2020-11-13 上海眼控科技股份有限公司 Weather nowcast method, weather nowcast device, computer equipment and storage medium
CN112257842A (en) * 2020-09-23 2021-01-22 河北航天信息技术有限公司 LSTM-based intelligent tax leading model construction method and device
CN112633998A (en) * 2020-12-23 2021-04-09 航天信息股份有限公司海外业务分公司 Tax big data decision analysis method and system

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105139079A (en) * 2015-07-30 2015-12-09 广州时韵信息科技有限公司 Tax revenue prediction method and device based on hybrid model
EP3567539A1 (en) * 2018-05-09 2019-11-13 Volvo Car Corporation Method and system for orchestrating multi-party services using semi-cooperative nash equilibrium based on artificial intelligence, neural network models, reinforcement learning and finite-state automata
CN109165249A (en) * 2018-08-07 2019-01-08 阿里巴巴集团控股有限公司 Data processing model construction method, device, server and user terminal
CN109657947A (en) * 2018-12-06 2019-04-19 西安交通大学 A kind of method for detecting abnormality towards enterprises ' industry classification
CN111047423A (en) * 2019-11-01 2020-04-21 支付宝(杭州)信息技术有限公司 Risk determination method and device and electronic equipment
CN111931991A (en) * 2020-07-14 2020-11-13 上海眼控科技股份有限公司 Weather nowcast method, weather nowcast device, computer equipment and storage medium
CN112257842A (en) * 2020-09-23 2021-01-22 河北航天信息技术有限公司 LSTM-based intelligent tax leading model construction method and device
CN112633998A (en) * 2020-12-23 2021-04-09 航天信息股份有限公司海外业务分公司 Tax big data decision analysis method and system

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
杭州市数据资源管理局编著: "《数据资源管理》", 30 November 2019, 浙江大学出版社 *

Similar Documents

Publication Publication Date Title
CN109598095B (en) Method and device for establishing scoring card model, computer equipment and storage medium
CN110009225B (en) Risk assessment system construction method, risk assessment system construction device, computer equipment and storage medium
CN110929879A (en) Business decision logic updating method based on decision engine and model platform
WO2020073727A1 (en) Risk forecast method, device, computer apparatus, and storage medium
CN109726737B (en) Track-based abnormal behavior detection method and device
CN115563477B (en) Harmonic data identification method, device, computer equipment and storage medium
CN112035775B (en) User identification method and device based on random forest model and computer equipment
CN111325255B (en) Specific crowd delineating method and device, electronic equipment and storage medium
CN113283973A (en) Account checking difference data processing method and device, computer equipment and storage medium
CN113283972A (en) System and method for constructing tax big data model
CN116227989A (en) Multidimensional business informatization supervision method and system
CN113205442A (en) E-government data feedback management method and device based on block chain
CN115170304A (en) Method and device for extracting risk feature description
CN110175181B (en) Information updating method based on data analysis and related equipment
CN114519445A (en) Prediction method and device of service interaction network
CN112434471A (en) Method, system, electronic device and storage medium for improving model generalization capability
US11829866B1 (en) System and method for hierarchical deep semi-supervised embeddings for dynamic targeted anomaly detection
CN111724048A (en) Characteristic extraction method for finished product library scheduling system performance data based on characteristic engineering
CN112035366A (en) Test case generation method, device and equipment
CN111652597A (en) Archive management method, device, computer equipment and storage medium
CN109359866B (en) Risk hidden danger monitoring method and device based on leasing equipment and computer equipment
CN112395277B (en) Vehicle information screening method, device, equipment and storage medium
CN117573951B (en) Target user screening method, device, medium and equipment
CN109726898B (en) Personnel access management method, device, equipment and storage medium
van Delden ISSUES WHEN INTEGRATING DATA SETS WITH

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20210820

RJ01 Rejection of invention patent application after publication