CN106780001A - A kind of invoice writes out falsely enterprise supervision recognition methods and system - Google Patents

A kind of invoice writes out falsely enterprise supervision recognition methods and system Download PDF

Info

Publication number
CN106780001A
CN106780001A CN201611220015.4A CN201611220015A CN106780001A CN 106780001 A CN106780001 A CN 106780001A CN 201611220015 A CN201611220015 A CN 201611220015A CN 106780001 A CN106780001 A CN 106780001A
Authority
CN
China
Prior art keywords
enterprise
invoice
business
type
month
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201611220015.4A
Other languages
Chinese (zh)
Inventor
蔡燕
刘勇
王培勇
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
SERVYOU SOFTWARE GROUP Co Ltd
Original Assignee
SERVYOU SOFTWARE GROUP Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by SERVYOU SOFTWARE GROUP Co Ltd filed Critical SERVYOU SOFTWARE GROUP Co Ltd
Priority to CN201611220015.4A priority Critical patent/CN106780001A/en
Publication of CN106780001A publication Critical patent/CN106780001A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q40/00Finance; Insurance; Tax strategies; Processing of corporate or income taxes
    • G06Q40/12Accounting
    • G06Q40/123Tax preparation or submission

Landscapes

  • Business, Economics & Management (AREA)
  • Accounting & Taxation (AREA)
  • Finance (AREA)
  • Development Economics (AREA)
  • Engineering & Computer Science (AREA)
  • Marketing (AREA)
  • Economics (AREA)
  • Strategic Management (AREA)
  • Technology Law (AREA)
  • Physics & Mathematics (AREA)
  • General Business, Economics & Management (AREA)
  • General Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

Enterprise supervision recognition methods is write out falsely this application discloses a kind of invoice, including:The characteristic information of Target Enterprise is extracted, target signature information is obtained;Target signature information is input into the type of business forecast model being pre-created;Type of business forecast model is, using default machine learning algorithm and sample set, to carry out the discrimination model obtained after corresponding model training;Every part of sample data in sample set includes the type of business of the type of business characteristic information of clear and definite enterprise and the enterprise;Wherein, the type of business of enterprise is voiding invoice enterprise or legal enterprise of paying taxes;Obtain type of business forecast model to be predicted the outcome according to the type of business that target signature information export, whether the type of business for determining Target Enterprise to predict the outcome according to the type of business is voiding invoice enterprise.The application improves the monitoring recognition effect and recognition efficiency that enterprise is write out falsely to invoice.In addition, the application further correspondingly discloses a kind of invoice writes out falsely enterprise supervision identifying system.

Description

A kind of invoice writes out falsely enterprise supervision recognition methods and system
Technical field
The present invention relates to enterprise's regulation technique field, more particularly to a kind of invoice writes out falsely enterprise supervision recognition methods and is System.
Background technology
As tax system examines and approves continuous development of the reform, while simplification does tax program, excites the market vitality, The tax jurisdiction risk of portion link also be increased therewith, and particularly lawless person is write out falsely by setting up ghost commerce and trade enterprise Invoice phenomenon come back in indivedual areas, very disruptive normal tax revenue and economic order.
It is still to rely on tax analysis in the real work of the existing tax risk prevention and control that enterprise is write out falsely for invoice The business experience of personnel, cause to pay taxes defer to risk model subjective composition it is heavier, accuracy is not strong enough.And due to people's work point Analysis it is less efficient the reason for, cause be difficult comprehensively identify in the presence of write out falsely invoice enterprise.
In sum as can be seen that how to be lifted to the monitoring recognition effect and recognition efficiency of invoice voiding enterprise is mesh It is preceding to also have problem to be solved.
The content of the invention
In view of this, enterprise supervision recognition methods and system, lifting are write out falsely it is an object of the invention to provide a kind of invoice The monitoring recognition effect and recognition efficiency of enterprise are write out falsely to invoice.Its concrete scheme is as follows:
A kind of invoice writes out falsely enterprise supervision recognition methods, including:
The characteristic information of Target Enterprise is extracted, target signature information is obtained;
The target signature information is input into the type of business forecast model being pre-created;The type of business predicts mould Type is, using default machine learning algorithm and sample set, to carry out the discrimination model obtained after corresponding model training;The sample Every part of sample data of this concentration includes the type of business of the type of business characteristic information of clear and definite enterprise and the enterprise; Wherein, the type of business of enterprise is voiding invoice enterprise or legal enterprise of paying taxes;
The type of business forecast model is obtained to be predicted the outcome according to the type of business that the target signature information is exported, with Predicted the outcome according to the type of business and determine whether the type of business of the Target Enterprise is to write out falsely invoice enterprise.
Optionally, the process of the characteristic information for extracting Target Enterprise, including:
The first kind characteristic information of the Target Enterprise is extracted, Enterprise Age, the enterprises registration of the Target Enterprise is obtained Registered number of the enterprise, enterprises registration address and the Enterprise Law that registered number of the enterprise corresponding to address, business entity possess The age of improper pay taxes status information and the business entity of otherness, business entity between the personal household register ground of people.
Optionally, the establishment process of the type of business forecast model, including:
First group of sample data and second group of sample data are obtained, the sample set is obtained;Wherein, first group of sample Data include N1The corresponding first kind characteristic information of enterprise of family and the type of business, second group of sample data include M1 The corresponding first kind characteristic information of enterprise of family and the type of business, N1And M1It is positive integer, also, the N1Appoint in enterprise of family The type of business of one enterprise is voiding invoice enterprise, the M1The type of business of any enterprise is legal paying taxes in enterprise of family Enterprise;
According to default first division proportion, the sample set is divided into two parts, obtains corresponding first training set With the first test set;
Using the machine learning algorithm, first training set and first test set, corresponding model instruction is carried out Practice and model measurement, obtain the type of business forecast model.
Optionally, the process of the characteristic information for extracting Target Enterprise, including:
The Equations of The Second Kind characteristic information of the Target Enterprise is extracted, the Enterprise Age of the Target Enterprise is obtained, is continuously issued The quantity information and time interval information of invoice, evening this month amount of money of making out an invoice account for this month to date and make out an invoice ratio, this month of the amount of money Evening number of making out an invoice accounts for this month to date ratio of number, this month amount of money of making out an invoice of making out an invoice and is accounted for more than the invoice amount of preset cost threshold value This month to date ratio of the amount of money, this month amount of money of making out an invoice of making out an invoice accounts for this month and tires out more than the invoice number of the default amount of money threshold value of making out an invoice Meter make out an invoice the ratio of number, this month to the number that same purchaser issues invoice account for this month to date make out an invoice number ratio and This month, accounts for this month to date and makes out an invoice the ratio of the amount of money to the invoice amount that same purchaser issues invoice.
Optionally, the process of the characteristic information for extracting Target Enterprise, including:
The 3rd group of sample data and the 4th group of sample data are obtained, the sample set is obtained;Wherein, the 3rd group of sample Data include N2The corresponding Equations of The Second Kind characteristic information of enterprise of family and the type of business, the 4th group of sample data include M2 The corresponding Equations of The Second Kind characteristic information of enterprise of family and the type of business, N2And M2It is positive integer, also, the N2Appoint in enterprise of family The type of business of one enterprise is voiding invoice enterprise, the M2The type of business of any enterprise is legal paying taxes in enterprise of family Enterprise;
According to default second division proportion, the sample set is divided into two parts, obtains corresponding second training set With the second test set;
Using the machine learning algorithm, second training set and second test set, corresponding model instruction is carried out Practice and model measurement, obtain the type of business forecast model.
Optionally, the machine learning algorithm is decision Tree algorithms.
Enterprise supervision identifying system is write out falsely the invention also discloses a kind of invoice, including:
Information extraction modules, the characteristic information for extracting Target Enterprise, obtain target signature information;
MIM message input module, for the target signature information to be input into the type of business forecast model being pre-created; The type of business forecast model is, using default machine learning algorithm and sample set, to be obtained after carrying out corresponding model training Discrimination model;Every part of sample data in the sample set include the type of business the characteristic information of clear and definite enterprise and The type of business of the enterprise;Wherein, the type of business of enterprise is voiding invoice enterprise or legal enterprise of paying taxes;
As a result acquisition module, for obtaining the enterprise that the type of business forecast model is exported according to the target signature information Industry type prediction result, whether the type of business that the Target Enterprise is determined to predict the outcome according to the type of business is voiding Invoice enterprise.
Optionally, described information extraction module, including:
First kind feature extraction submodule, the first kind characteristic information for extracting the Target Enterprise, obtains the mesh Mark the Enterprise Age of enterprise, the registered enterprise that the registered number of the enterprise corresponding to enterprises registration address, business entity possess The improper state of paying taxes of otherness, business entity between the personal household register ground of quantity, enterprises registration address and business entity Information and the age of business entity.
Optionally, described information extraction module, including:
Equations of The Second Kind feature extraction submodule, the Equations of The Second Kind characteristic information for extracting the Target Enterprise, obtains the mesh Mark the Enterprise Age of enterprise, the continuous quantity information for issuing invoice and time interval information, evening this month amount of money of making out an invoice and account for this The make out an invoice ratio of the amount of money, evening this month of month to date makes out an invoice that to account for this month to date ratio of number, this month amount of money of making out an invoice of making out an invoice big for number This month to date ratio of the amount of money, this month amount of money of making out an invoice of making out an invoice is accounted in the invoice amount of preset cost threshold value default to be made out an invoice more than described The invoice number of amount of money threshold value accounts for the make out an invoice ratio of number, this month of this month to date and accounts for this to the number that same purchaser issues invoice Month to date makes out an invoice the ratio of number and this month accounts for this month to date and makes out an invoice the amount of money to the invoice amount that same purchaser issues invoice Ratio.
Optionally, the machine learning algorithm is decision Tree algorithms.
In the present invention, invoice writes out falsely enterprise supervision recognition methods, including:The characteristic information of Target Enterprise is extracted, mesh is obtained Mark characteristic information;Target signature information is input into the type of business forecast model being pre-created;Type of business forecast model is Using default machine learning algorithm and sample set, the discrimination model obtained after corresponding model training is carried out;In sample set Every part of sample data includes the type of business of the type of business characteristic information of clear and definite enterprise and the enterprise;Wherein, look forward to The type of business of industry is voiding invoice enterprise or legal enterprise of paying taxes;Type of business forecast model is obtained according to target signature information The type of business of output predicts the outcome, with according to the type of business predict the outcome determine Target Enterprise the type of business whether be voiding Invoice enterprise.
It can be seen that, the present invention advances with machine learning algorithm and the sample comprising enterprise characteristic information and the type of business Collection, creates the forecast model for being predicted to the type of business, when needing to be monitored identification to Target Enterprise, can be right The Target Enterprise carries out corresponding feature extraction, and the target signature information for extracting then is delivered to above-mentioned forecast model, from And the type of business of above-mentioned Target Enterprise can be obtained, actually or above-mentioned Target Enterprise voiding invoice enterprise is predicted with this Legal enterprise of paying taxes.Therefore, the present invention is to be monitored identification to enterprise based on machine learning algorithm, to determine enterprise Whether industry be to write out falsely invoice enterprise, it is to avoid the various defects that manual analysis is brought, and thus improves and writes out falsely enterprise to invoice Monitoring recognition effect and recognition efficiency.
Brief description of the drawings
In order to illustrate more clearly about the embodiment of the present invention or technical scheme of the prior art, below will be to embodiment or existing The accompanying drawing to be used needed for having technology description is briefly described, it should be apparent that, drawings in the following description are only this Inventive embodiment, for those of ordinary skill in the art, on the premise of not paying creative work, can also basis The accompanying drawing of offer obtains other accompanying drawings.
Fig. 1 is that a kind of invoice writes out falsely enterprise supervision recognition methods flow chart disclosed in the embodiment of the present invention;
Fig. 2 is that a kind of specific invoice writes out falsely enterprise supervision recognition methods flow chart disclosed in the embodiment of the present invention;
Fig. 3 is that a kind of specific invoice writes out falsely enterprise supervision recognition methods flow chart disclosed in the embodiment of the present invention;
Fig. 4 is that a kind of invoice writes out falsely enterprise supervision identifying system structural representation disclosed in the embodiment of the present invention.
Specific embodiment
Below in conjunction with the accompanying drawing in the embodiment of the present invention, the technical scheme in the embodiment of the present invention is carried out clear, complete Site preparation is described, it is clear that described embodiment is only a part of embodiment of the invention, rather than whole embodiments.It is based on Embodiment in the present invention, it is every other that those of ordinary skill in the art are obtained under the premise of creative work is not made Embodiment, belongs to the scope of protection of the invention.
Enterprise supervision recognition methods, shown in Figure 1, the method bag are write out falsely the embodiment of the invention discloses a kind of invoice Include:
Step S11:The characteristic information of Target Enterprise is extracted, target signature information is obtained.
It should be noted that in the present embodiment the characteristic information of enterprise be specifically including but not limited to enterprises registration information and/ Or enterprise's invoice issuing behavior characteristic information.
Step S12:Target signature information is input into the type of business forecast model being pre-created;Wherein, the type of business Forecast model is, using default machine learning algorithm and sample set, to carry out the discrimination model obtained after corresponding model training; Every part of sample data in above-mentioned sample set includes the enterprise of the type of business characteristic information of clear and definite enterprise and the enterprise Industry type;In the present embodiment, the type of business of enterprise is voiding invoice enterprise or legal enterprise of paying taxes.
In the present embodiment, above-mentioned machine learning algorithm can be disclosed various machine in normal service study in the prior art Algorithm.Wherein, the present embodiment is preferentially using decision Tree algorithms as above-mentioned machine learning algorithm.
Step S13:Type of business forecast model is obtained to be predicted the outcome according to the type of business that target signature information is exported, with Predicted the outcome according to the type of business and determine whether the type of business of Target Enterprise is to write out falsely invoice enterprise.
It can be seen that, the present invention advances with machine learning algorithm and the sample comprising enterprise characteristic information and the type of business Collection, creates the forecast model for being predicted to the type of business, when needing to be monitored identification to Target Enterprise, can be right The Target Enterprise carries out corresponding feature extraction, and the target signature information for extracting then is delivered to above-mentioned forecast model, from And the type of business of above-mentioned Target Enterprise can be obtained, actually or above-mentioned Target Enterprise voiding invoice enterprise is predicted with this Legal enterprise of paying taxes.Therefore, the present invention is to be monitored identification to enterprise based on machine learning algorithm, to determine enterprise Whether industry be to write out falsely invoice enterprise, it is to avoid the various defects that manual analysis is brought, and thus improves and writes out falsely enterprise to invoice Monitoring recognition effect and recognition efficiency.
It is shown in Figure 2, enterprise supervision recognition methods is write out falsely the embodiment of the invention discloses a kind of specific invoice, including Following steps:
Step S21:The first kind characteristic information of Target Enterprise is extracted, Enterprise Age, the enterprises registration of Target Enterprise is obtained Registered number of the enterprise, enterprises registration address and the Enterprise Law that registered number of the enterprise corresponding to address, business entity possess The age of improper pay taxes status information and the business entity of otherness, business entity between the personal household register ground of people.
That is, in the present embodiment, the first kind characteristic information of Target Enterprise can be based on, whether the Target Enterprise is judged To write out falsely invoice enterprise.Wherein, the Enterprise Age of the first kind characteristic information of enterprise including enterprise, corresponding to enterprises registration address Registered number of the enterprise, the personal family of registered number of the enterprise, enterprises registration address and business entity that possesses of business entity The age of improper pay taxes status information and the business entity of otherness, business entity between nationality ground.
Invoice behavior is write out falsely due to existing now with many newly-set-up commerce and trade enterprises, so the embodiment of the present invention can be preferential Identification is monitored to newly-set-up commerce and trade enterprise.So-called newly-set-up commerce and trade enterprise refers to wholesale less than or equal to 2 years Enterprise Age Retailer.
In the present embodiment, between the Enterprise Age of above-mentioned enterprise refers to the time between current time and enterprises registration date of registration Every, wherein, if the Enterprise Age of Target Enterprise is less than or equal to 2 years, mean that the Target Enterprise possesses voiding higher The suspicion of invoice.Registered number of the enterprise corresponding to above-mentioned enterprises registration address refers on the enterprises registration address where enterprise The quantity of the registered enterprise for possessing.If on the enterprises registration address that Target Enterprise is currently located, while possessing two families or two Taxpayer more than family, then the Target Enterprise possess the suspicion of voiding invoice higher.It is registered that above-mentioned business entity possesses Number of the enterprise refers to the quantity of registered enterprise that possesses of same juristic person of enterprise.If the same Enterprise Law person of Target Enterprise To that should have taxpayers more than two families or two families, then the Target Enterprise possesses the suspicion of voiding invoice higher to part card number.It is above-mentioned Otherness between enterprises registration address and the personal household register ground of business entity refers to enterprises registration address and the Enterprise Law of enterprise Difference between the personal household register ground of people.If the individual of the business entity of the enterprises registration address of Target Enterprise and Target Enterprise Differ, then the Target Enterprise possesses the suspicion of voiding invoice higher household register.The improper shape of paying taxes of above-mentioned business entity State information refers to whether the business entity of enterprise once had improper family criminal record.If the business entity of Target Enterprise was once In the presence of excessively improper family criminal record, then the Target Enterprise possesses the suspicion of voiding invoice higher.The age of above-mentioned business entity is Refer to the actual age of the business entity of enterprise.If the age of the business entity of Target Enterprise is less than 25 years old or more than 60 years old, Then the Target Enterprise possesses the suspicion of voiding invoice higher.
Step S22:The first kind characteristic information of Target Enterprise is input into the type of business forecast model being pre-created.
Specifically, the establishment process of above-mentioned type of business forecast model, including below step S201 to step S203:
Step S201:First group of sample data and second group of sample data are obtained, sample set is obtained;Wherein, first group of sample Notebook data includes N1The corresponding first kind characteristic information of enterprise of family and the type of business, second group of sample data include M1Family The corresponding first kind characteristic information of enterprise and the type of business, N1And M1It is positive integer, also, N1Any enterprise in enterprise of family The type of business be voiding invoice enterprise, M1The type of business of any enterprise is legal enterprise of paying taxes in enterprise of family.
In the present embodiment, in order to ensure the uniformity of sample, above-mentioned N1Value and M1Value is preferentially taken as identical numerical value.In addition, In the present embodiment, above-mentioned N1Value and M1Value sum is preferentially more than or equal to 10000.
Step S202:According to default first division proportion, sample set is divided into two parts, obtains corresponding first instruction Practice collection and the first test set.
Wherein, above-mentioned first division proportion can need specifically to be set according to practical application, for example, can be set to 7:3 Ratio, the ratio between the sample size in sample size and the first test set in the first training set for so finally giving is 7:3.
Step S203:Using machine learning algorithm, the first training set and the first test set, carry out corresponding model training with And model measurement, obtain above-mentioned type of business forecast model.
Wherein, the machine learning algorithm in above-mentioned steps S203 is specially decision Tree algorithms.In addition, in the present embodiment, base The model depth of the type of business forecast model obtained in decision Tree algorithms training is preferentially set to 5.
Step S23:Obtain the enterprise that above-mentioned type of business forecast model is exported according to the first kind characteristic information of Target Enterprise Industry type prediction result, with according to the type of business predict the outcome determine Target Enterprise the type of business whether be write out falsely invoice look forward to Industry.
Technical scheme disclosed in the present embodiment is applied to invoice and leads the purchase stage, in the invoice neck purchase Structure Stage of enterprise, By the technical scheme disclosed in the embodiment of the present invention, identification can be monitored to the potential invoice enterprise that writes out falsely, so that real Present invoice voiding event finds potential voiding invoice enterprise in time before occurring.
It is shown in Figure 3, enterprise supervision recognition methods is write out falsely the embodiment of the invention discloses a kind of specific invoice, including Following steps:
Step S31:The Equations of The Second Kind characteristic information of Target Enterprise is extracted, the Enterprise Age of Target Enterprise is obtained, is continuously issued The quantity information and time interval information of invoice, evening this month amount of money of making out an invoice account for this month to date and make out an invoice ratio, this month of the amount of money Evening number of making out an invoice accounts for this month to date ratio of number, this month amount of money of making out an invoice of making out an invoice and is accounted for more than the invoice amount of preset cost threshold value This month to date ratio of the amount of money, this month amount of money of making out an invoice of making out an invoice accounts for this month to date and opens more than the invoice number of default amount of money threshold value of making out an invoice The ratio of ticket number, this month to the number that same purchaser issues invoice account for this month to date make out an invoice number ratio and this month The invoice amount issued invoice to same purchaser accounts for this month to date and makes out an invoice the ratio of the amount of money.
That is, in the present embodiment, the Equations of The Second Kind characteristic information of Target Enterprise can be based on, whether the Target Enterprise is judged To write out falsely invoice enterprise.Wherein, the Equations of The Second Kind characteristic information of enterprise includes the Enterprise Age of enterprise, the continuous quantity for issuing invoice Information and time interval information, evening this month amount of money of making out an invoice account for the make out an invoice ratio of the amount of money, evening this month of this month to date and make out an invoice part Number accounts for this month to date ratio of number, this month amount of money of making out an invoice of making out an invoice and accounts for this month to date more than the invoice amount of preset cost threshold value and open The ratio of the ticket amount of money, this month amount of money of making out an invoice account for this month to date and make out an invoice the ratio of number more than the invoice number of default amount of money threshold value of making out an invoice Example, this month to the number that same purchaser issues invoice account for this month to date make out an invoice number ratio and this month to same purchase The invoice amount that issues invoice of side accounts for this month to date and makes out an invoice the ratio of the amount of money.
Specifically, in the present embodiment, if the Enterprise Age of Target Enterprise is less than or equal to 2 years, meaning the target Enterprise possesses the suspicion of voiding invoice higher.If Target Enterprise issued a large amount of invoices in 30 minutes, and issued invoice Time interval be no more than 3 minutes, then mean that the Target Enterprise possesses the suspicion of voiding invoice higher.If this month target Enterprise issue at night special invoice without tax volume and this month to date issue special invoice without the ratio between tax volume Example is more than 50%, then mean that the Target Enterprise possesses the suspicion of voiding invoice higher.If this month Target Enterprise is at night The ratio between the special invoice number that special invoice number and this month to date are issued is issued more than 20%, then means the target Enterprise possesses the suspicion of voiding invoice higher.If all top special invoices issued of lattice of this month Target Enterprise without tax Ratio between the amount of money that volume and this month to date issue invoice is more than 20%, then mean that the Target Enterprise possesses voiding higher The suspicion of invoice.Wherein, the special invoice that above-mentioned top lattice are issued refer to make out an invoice the amount of money more than maximum issue limit 99% it is special Use invoice.The ratio if quantity and this month to date of the special invoice that all top lattice of this month Target Enterprise are issued are made out an invoice between number Example is more than 20%, then mean that the Target Enterprise possesses the suspicion of voiding invoice higher.If this month Target Enterprise is outwards The invoice number issued of same purchaser and the ratio made out an invoice between number of this month to date more than 80%, then mean the target Enterprise possesses the suspicion of voiding invoice higher.If all invoices that this month Target Enterprise same purchaser outwards issues Invoice amount and the amount of money that issues invoice of this month to date between ratio more than 20%, then mean the Target Enterprise possess compared with The suspicion of voiding invoice high.
Step S32:The Equations of The Second Kind characteristic information of Target Enterprise is input into the type of business forecast model being pre-created.
Specifically, the establishment process of above-mentioned type of business forecast model, including below step S301 to step S303:
Step S301:The 3rd group of sample data and the 4th group of sample data are obtained, sample set is obtained;Wherein, the 3rd group of sample Notebook data includes N2The corresponding Equations of The Second Kind characteristic information of enterprise of family and the type of business, the 4th group of sample data include M2Family The corresponding Equations of The Second Kind characteristic information of enterprise and the type of business, N2And M2It is positive integer, also, N2Any enterprise in enterprise of family The type of business be voiding invoice enterprise, M2The type of business of any enterprise is legal enterprise of paying taxes in enterprise of family.
In the present embodiment, in order to ensure the uniformity of sample, above-mentioned N2Value and M2Value is preferentially taken as identical numerical value.In addition, In the present embodiment, above-mentioned N2Value and M2Value sum is preferentially more than or equal to 10000.
Step S302:According to default second division proportion, sample set is divided into two parts, obtains corresponding second instruction Practice collection and the second test set.
Wherein, above-mentioned second division proportion can need specifically to be set according to practical application, for example, can be set to 7:3 Ratio, the ratio between the sample size in sample size and the second test set in the second training set for so finally giving is 7:3.
Step S303:Using machine learning algorithm, the second training set and the second test set, carry out corresponding model training with And model measurement, obtain type of business forecast model.
Wherein, the machine learning algorithm in above-mentioned steps S303 is specially decision Tree algorithms.In addition, in the present embodiment, base The model depth of the type of business forecast model obtained in decision Tree algorithms training is preferentially set to 5.
Step S33:Obtain the enterprise that above-mentioned type of business forecast model is exported according to the Equations of The Second Kind characteristic information of Target Enterprise Industry type prediction result, with according to the type of business predict the outcome determine Target Enterprise the type of business whether be write out falsely invoice look forward to Industry.
Technical scheme disclosed in the present embodiment is applied to invoice issuing monitor stages, and rank is monitored in the invoice issuing of enterprise Section, by the technical scheme disclosed in the embodiment of the present invention, can be to invoice that may be present during the invoice issuing of enterprise Voiding behavior, so as to identify whether the enterprise is to write out falsely invoice enterprise.
Accordingly, enterprise supervision identifying system is write out falsely the embodiment of the invention also discloses a kind of invoice, it is shown in Figure 4, The system includes:
Information extraction modules 11, the characteristic information for extracting Target Enterprise, obtain target signature information;
MIM message input module 12, for target signature information to be input into the type of business forecast model being pre-created;Enterprise Industry type prediction model is, using default machine learning algorithm and sample set, to carry out the differentiation obtained after corresponding model training Model;Every part of sample data in sample set includes the enterprise of the type of business characteristic information of clear and definite enterprise and the enterprise Industry type;Wherein, the type of business of enterprise is voiding invoice enterprise or legal enterprise of paying taxes;
As a result acquisition module 13, for obtaining the type of business that type of business forecast model is exported according to target signature information Predict the outcome, whether the type of business that Target Enterprise is determined to predict the outcome according to the type of business is to write out falsely invoice enterprise.
It can be seen that, the present invention advances with machine learning algorithm and the sample comprising enterprise characteristic information and the type of business Collection, creates the forecast model for being predicted to the type of business, when needing to be monitored identification to Target Enterprise, can be right The Target Enterprise carries out corresponding feature extraction, and the target signature information for extracting then is delivered to above-mentioned forecast model, from And the type of business of above-mentioned Target Enterprise can be obtained, actually or above-mentioned Target Enterprise voiding invoice enterprise is predicted with this Legal enterprise of paying taxes.Therefore, the present invention is to be monitored identification to enterprise based on machine learning algorithm, to determine enterprise Whether industry be to write out falsely invoice enterprise, it is to avoid the various defects that manual analysis is brought, and thus improves and writes out falsely enterprise to invoice Monitoring recognition effect and recognition efficiency.
Specifically, above- mentioned information extraction module, can include first kind feature extraction submodule, for extracting Target Enterprise First kind characteristic information, obtain the Enterprise Age of Target Enterprise, registered number of the enterprise, enterprise corresponding to enterprises registration address Otherness, enterprise between the personal household register ground of registered number of the enterprise, enterprises registration address and business entity that industry legal person possesses The age of improper pay taxes status information and the business entity of industry legal person.
Corresponding with above-mentioned first kind feature extraction submodule, the type of business forecast model in the present embodiment is specifically profit The sample set of the first kind characteristic information with default machine learning algorithm and comprising enterprise and the type of business carries out model instruction The model obtained after white silk.
In addition, the information extraction modules in the present embodiment, it is also possible to including:
Equations of The Second Kind feature extraction submodule, the Equations of The Second Kind characteristic information for extracting Target Enterprise, obtains Target Enterprise Enterprise Age, the continuous quantity information for issuing invoice and time interval information, evening this month amount of money of making out an invoice account for this month to date and open The ratio of the ticket amount of money, evening this month number of making out an invoice account for the make out an invoice ratio of number, this month of this month to date and make out an invoice the amount of money more than default gold The invoice amount of volume threshold value accounts for the make out an invoice ratio of the amount of money, this month of this month to date and makes out an invoice hair of the amount of money more than default amount of money threshold value of making out an invoice Ticket number accounts for the make out an invoice ratio of number, this month of this month to date and accounts for this month to date to the number that same purchaser issues invoice and make out an invoice part Several ratio and this month, account for this month to date and make out an invoice the ratio of the amount of money to the invoice amount that same purchaser issues invoice.
Corresponding with above-mentioned Equations of The Second Kind feature extraction submodule, the type of business forecast model in the present embodiment is specifically profit The sample set of the Equations of The Second Kind characteristic information with default machine learning algorithm and comprising enterprise and the type of business carries out model instruction The model obtained after white silk.
Specifically, above-mentioned default machine learning algorithm is preferably decision Tree algorithms.
Finally, in addition it is also necessary to explanation, herein, such as first and second or the like relational terms be used merely to by One entity or operation make a distinction with another entity or operation, and not necessarily require or imply these entities or operation Between there is any this actual relation or order.And, term " including ", "comprising" or its any other variant meaning Covering including for nonexcludability, so that process, method, article or equipment including a series of key elements not only include that A little key elements, but also other key elements including being not expressly set out, or also include for this process, method, article or The intrinsic key element of equipment.In the absence of more restrictions, the key element limited by sentence "including a ...", does not arrange Except also there is other identical element in the process including the key element, method, article or equipment.
Enterprise supervision recognition methods is write out falsely to a kind of invoice provided by the present invention above and system is described in detail, Specific case used herein is set forth to principle of the invention and implementation method, and the explanation of above example is use Understand the method for the present invention and its core concept in help;Simultaneously for those of ordinary skill in the art, according to of the invention Thought, will change in specific embodiments and applications, and in sum, this specification content should not be construed as Limitation of the present invention.

Claims (10)

1. a kind of invoice writes out falsely enterprise supervision recognition methods, it is characterised in that including:
The characteristic information of Target Enterprise is extracted, target signature information is obtained;
The target signature information is input into the type of business forecast model being pre-created;The type of business forecast model is Using default machine learning algorithm and sample set, the discrimination model obtained after corresponding model training is carried out;The sample set In every part of sample data include the type of business of the type of business characteristic information of clear and definite enterprise and the enterprise;Its In, the type of business of enterprise is voiding invoice enterprise or legal enterprise of paying taxes;
Obtain the type of business forecast model to be predicted the outcome according to the type of business that the target signature information is exported, with basis The type of business predicts the outcome and determines whether the type of business of the Target Enterprise is to write out falsely invoice enterprise.
2. invoice according to claim 1 writes out falsely enterprise supervision recognition methods, it is characterised in that the extraction Target Enterprise Characteristic information process, including:
The first kind characteristic information of the Target Enterprise is extracted, Enterprise Age, the enterprises registration address of the Target Enterprise is obtained Registered number of the enterprise, enterprises registration address and the business entity's that corresponding registered number of the enterprise, business entity possess The age of improper pay taxes status information and the business entity of otherness, business entity between personal household register ground.
3. invoice according to claim 2 writes out falsely enterprise supervision recognition methods, it is characterised in that the type of business prediction The establishment process of model, including:
First group of sample data and second group of sample data are obtained, the sample set is obtained;Wherein, first group of sample data Include N1The corresponding first kind characteristic information of enterprise of family and the type of business, second group of sample data include M1Enterprise of family The corresponding first kind characteristic information of industry and the type of business, N1And M1It is positive integer, also, the N1Any enterprise in enterprise of family The type of business of industry is voiding invoice enterprise, the M1The type of business of any enterprise is legal enterprise of paying taxes in enterprise of family Industry;
According to default first division proportion, the sample set is divided into two parts, obtains corresponding first training set and One test set;
Using the machine learning algorithm, first training set and first test set, carry out corresponding model training with And model measurement, obtain the type of business forecast model.
4. invoice according to claim 1 writes out falsely enterprise supervision recognition methods, it is characterised in that the extraction Target Enterprise Characteristic information process, including:
The Equations of The Second Kind characteristic information of the Target Enterprise is extracted, the Enterprise Age of the Target Enterprise is obtained, is continuously issued invoice Quantity information and time interval information, evening this month amount of money of making out an invoice account for this month to date and make out an invoice ratio, evening this month of the amount of money Number of making out an invoice accounts for this month to date ratio of number, this month amount of money of making out an invoice of making out an invoice and accounts for this month more than the invoice amount of preset cost threshold value The accumulative ratio of the amount of money of making out an invoice, this month amount of money of making out an invoice account for this month to date and open more than the invoice number of the default amount of money threshold value of making out an invoice The ratio of ticket number, this month to the number that same purchaser issues invoice account for this month to date make out an invoice number ratio and this month The invoice amount issued invoice to same purchaser accounts for this month to date and makes out an invoice the ratio of the amount of money.
5. invoice according to claim 4 writes out falsely enterprise supervision recognition methods, it is characterised in that the extraction Target Enterprise Characteristic information process, including:
The 3rd group of sample data and the 4th group of sample data are obtained, the sample set is obtained;Wherein, the 3rd group of sample data Include N2The corresponding Equations of The Second Kind characteristic information of enterprise of family and the type of business, the 4th group of sample data include M2Enterprise of family The corresponding Equations of The Second Kind characteristic information of industry and the type of business, N2And M2It is positive integer, also, the N2Any enterprise in enterprise of family The type of business of industry is voiding invoice enterprise, the M2The type of business of any enterprise is legal enterprise of paying taxes in enterprise of family Industry;
According to default second division proportion, the sample set is divided into two parts, obtains corresponding second training set and Two test sets;
Using the machine learning algorithm, second training set and second test set, carry out corresponding model training with And model measurement, obtain the type of business forecast model.
6. the invoice according to any one of claim 1 to 5 writes out falsely enterprise supervision recognition methods, it is characterised in that the machine Device learning algorithm is decision Tree algorithms.
7. a kind of invoice writes out falsely enterprise supervision identifying system, it is characterised in that including:
Information extraction modules, the characteristic information for extracting Target Enterprise, obtain target signature information;
MIM message input module, for the target signature information to be input into the type of business forecast model being pre-created;It is described Type of business forecast model is that, using default machine learning algorithm and sample set, what is obtain after corresponding model training sentences Other model;Every part of sample data in the sample set includes the type of business characteristic information of clear and definite enterprise and the enterprise The type of business of industry;Wherein, the type of business of enterprise is voiding invoice enterprise or legal enterprise of paying taxes;
As a result acquisition module, for obtaining the enterprise-class that the type of business forecast model is exported according to the target signature information Type predicts the outcome, and whether the type of business that the Target Enterprise is determined to predict the outcome according to the type of business is to write out falsely invoice Enterprise.
8. invoice according to claim 7 writes out falsely enterprise supervision identifying system, it is characterised in that described information extracts mould Block, including:
First kind feature extraction submodule, the first kind characteristic information for extracting the Target Enterprise obtains the target enterprise Registered number of the enterprise that registered number of the enterprise, business entity corresponding to the Enterprise Age of industry, enterprises registration address possess, The improper status information of paying taxes of otherness, business entity between enterprises registration address and the personal household register ground of business entity, And the age of business entity.
9. invoice according to claim 7 writes out falsely enterprise supervision identifying system, it is characterised in that described information extracts mould Block, including:
Equations of The Second Kind feature extraction submodule, the Equations of The Second Kind characteristic information for extracting the Target Enterprise obtains the target enterprise The Enterprise Age of industry, the continuous quantity information for issuing invoice and time interval information, evening this month amount of money of making out an invoice account for this month and tire out The meter ratio of the amount of money, evening this month number of making out an invoice of making out an invoice accounts for the make out an invoice ratio of number, this month of this month to date and makes out an invoice the amount of money more than pre- If the invoice amount of amount of money threshold value accounts for this month to date, the make out an invoice ratio of the amount of money, this month, makes out an invoice the amount of money more than the default amount of money of making out an invoice The invoice number of threshold value accounts for the make out an invoice ratio of number, this month of this month to date and accounts for this month to the number that same purchaser issues invoice and tire out Meter makes out an invoice the ratio of number and this month accounts for this month to date and makes out an invoice the ratio of the amount of money to the invoice amount that same purchaser issues invoice Example.
10. the invoice according to any one of claim 7 to 9 writes out falsely enterprise supervision identifying system, it is characterised in that the machine Device learning algorithm is decision Tree algorithms.
CN201611220015.4A 2016-12-26 2016-12-26 A kind of invoice writes out falsely enterprise supervision recognition methods and system Pending CN106780001A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201611220015.4A CN106780001A (en) 2016-12-26 2016-12-26 A kind of invoice writes out falsely enterprise supervision recognition methods and system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201611220015.4A CN106780001A (en) 2016-12-26 2016-12-26 A kind of invoice writes out falsely enterprise supervision recognition methods and system

Publications (1)

Publication Number Publication Date
CN106780001A true CN106780001A (en) 2017-05-31

Family

ID=58926284

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201611220015.4A Pending CN106780001A (en) 2016-12-26 2016-12-26 A kind of invoice writes out falsely enterprise supervision recognition methods and system

Country Status (1)

Country Link
CN (1) CN106780001A (en)

Cited By (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108038490A (en) * 2017-10-30 2018-05-15 上海思贤信息技术股份有限公司 A kind of P2P enterprises automatic identifying method and system based on internet data
CN108595621A (en) * 2018-04-23 2018-09-28 泰华智慧产业集团股份有限公司 A kind of early warning analysis method and system write false value added tax invoice
CN109409969A (en) * 2018-10-24 2019-03-01 仲恺农业工程学院 A kind of business taxation fraud detection method, electronic equipment and storage medium
CN109583978A (en) * 2018-11-30 2019-04-05 税友软件集团股份有限公司 The method, device and equipment of invoice enterprise is write out falsely in a kind of identification
CN109636036A (en) * 2018-12-12 2019-04-16 税友软件集团股份有限公司 A kind of method, system and the equipment of the prediction of enterprise's invoiced amount
CN109657887A (en) * 2017-10-10 2019-04-19 爱信诺征信有限公司 A kind of enterprise operation health Evaluation method and system
CN109727111A (en) * 2018-12-29 2019-05-07 航天信息股份有限公司 Method, system and the computer readable storage medium of invoice behavior are write out falsely in monitoring
CN109993641A (en) * 2017-12-28 2019-07-09 航天信息股份有限公司 A kind of invoice writes out falsely method for early warning and system
CN110532542A (en) * 2019-07-15 2019-12-03 西安交通大学 It is a kind of that recognition methods and system are write out falsely with the invoice for not marking study based on positive example
CN110852856A (en) * 2019-11-04 2020-02-28 西安交通大学 Invoice false invoice identification method based on dynamic network representation
CN110866536A (en) * 2019-09-25 2020-03-06 西安交通大学 Cross-regional enterprise tax evasion identification method based on PU learning
CN111178219A (en) * 2019-12-24 2020-05-19 泰康保险集团股份有限公司 Bill identification management method and device, storage medium and electronic equipment
CN111192121A (en) * 2019-12-17 2020-05-22 航天信息股份有限公司 ANN-based automatic risk taxpayer early warning method and system
CN111242422A (en) * 2019-12-31 2020-06-05 航天信息股份有限公司 Method and device for identifying false invoice of oil product
CN111311340A (en) * 2018-12-10 2020-06-19 航天信息股份有限公司 Method and device for identifying false invoice issuing behaviors
CN113159874A (en) * 2021-05-25 2021-07-23 北京中科闻歌科技股份有限公司 Method and device for detecting value-added tax invoice and readable storage medium
CN113869383A (en) * 2021-09-17 2021-12-31 北京中科闻歌科技股份有限公司 Ticket-changing entity detection method and system

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103646335A (en) * 2013-12-27 2014-03-19 税友软件集团股份有限公司 Electronic invoice detecting method and device
CN104700304A (en) * 2013-12-05 2015-06-10 航天信息股份有限公司 Method and system for supervising enterprise tax evasion through values of plain invoices of value-added tax

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104700304A (en) * 2013-12-05 2015-06-10 航天信息股份有限公司 Method and system for supervising enterprise tax evasion through values of plain invoices of value-added tax
CN103646335A (en) * 2013-12-27 2014-03-19 税友软件集团股份有限公司 Electronic invoice detecting method and device

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
王宏争 等: "决策树生成算法在税务稽查选案中的应用", 《科技情报开发与经济》 *

Cited By (23)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109657887A (en) * 2017-10-10 2019-04-19 爱信诺征信有限公司 A kind of enterprise operation health Evaluation method and system
CN108038490A (en) * 2017-10-30 2018-05-15 上海思贤信息技术股份有限公司 A kind of P2P enterprises automatic identifying method and system based on internet data
CN109993641A (en) * 2017-12-28 2019-07-09 航天信息股份有限公司 A kind of invoice writes out falsely method for early warning and system
CN108595621B (en) * 2018-04-23 2020-10-30 泰华智慧产业集团股份有限公司 Early warning analysis method and system for false value-added tax invoice
CN108595621A (en) * 2018-04-23 2018-09-28 泰华智慧产业集团股份有限公司 A kind of early warning analysis method and system write false value added tax invoice
CN109409969A (en) * 2018-10-24 2019-03-01 仲恺农业工程学院 A kind of business taxation fraud detection method, electronic equipment and storage medium
CN109583978A (en) * 2018-11-30 2019-04-05 税友软件集团股份有限公司 The method, device and equipment of invoice enterprise is write out falsely in a kind of identification
CN111311340A (en) * 2018-12-10 2020-06-19 航天信息股份有限公司 Method and device for identifying false invoice issuing behaviors
CN111311340B (en) * 2018-12-10 2024-02-13 航天信息股份有限公司 Method and device for identifying virtual invoice behavior
CN109636036A (en) * 2018-12-12 2019-04-16 税友软件集团股份有限公司 A kind of method, system and the equipment of the prediction of enterprise's invoiced amount
CN109636036B (en) * 2018-12-12 2021-03-26 亿企赢网络科技有限公司 Method, system and equipment for enterprise invoice quantity prediction
CN109727111A (en) * 2018-12-29 2019-05-07 航天信息股份有限公司 Method, system and the computer readable storage medium of invoice behavior are write out falsely in monitoring
CN110532542A (en) * 2019-07-15 2019-12-03 西安交通大学 It is a kind of that recognition methods and system are write out falsely with the invoice for not marking study based on positive example
CN110532542B (en) * 2019-07-15 2021-07-13 西安交通大学 Invoice false invoice identification method and system based on positive case and unmarked learning
CN110866536A (en) * 2019-09-25 2020-03-06 西安交通大学 Cross-regional enterprise tax evasion identification method based on PU learning
WO2021057427A1 (en) * 2019-09-25 2021-04-01 西安交通大学 Pu learning based cross-regional enterprise tax evasion recognition method and system
CN110866536B (en) * 2019-09-25 2022-06-07 西安交通大学 Cross-regional enterprise tax evasion identification method based on PU learning
CN110852856A (en) * 2019-11-04 2020-02-28 西安交通大学 Invoice false invoice identification method based on dynamic network representation
CN111192121A (en) * 2019-12-17 2020-05-22 航天信息股份有限公司 ANN-based automatic risk taxpayer early warning method and system
CN111178219A (en) * 2019-12-24 2020-05-19 泰康保险集团股份有限公司 Bill identification management method and device, storage medium and electronic equipment
CN111242422A (en) * 2019-12-31 2020-06-05 航天信息股份有限公司 Method and device for identifying false invoice of oil product
CN113159874A (en) * 2021-05-25 2021-07-23 北京中科闻歌科技股份有限公司 Method and device for detecting value-added tax invoice and readable storage medium
CN113869383A (en) * 2021-09-17 2021-12-31 北京中科闻歌科技股份有限公司 Ticket-changing entity detection method and system

Similar Documents

Publication Publication Date Title
CN106780001A (en) A kind of invoice writes out falsely enterprise supervision recognition methods and system
CN111352971A (en) Bank system monitoring data anomaly detection method and system
CN108053310A (en) Credit scoring method, apparatus, computer equipment and storage medium
CN106557882A (en) Power consumer screening technique and system based on various dimensions Risk Evaluation Factors
CN106067088A (en) E-bank accesses detection method and the device of behavior
CN110895758B (en) Screening method, device and system for credit card account with cheating transaction
CN107507038A (en) A kind of electricity charge sensitive users analysis method based on stacking and bagging algorithms
CN109102396A (en) A kind of user credit ranking method, computer equipment and readable medium
CN112613977A (en) Personal credit loan admission credit granting method and system based on government affair data
CN106570753A (en) Method and server for controlling default risk
CN104036367A (en) Intelligent customer credit optimizing processing method based on ERP system
CN112907356A (en) Overdue collection method, device and system and computer readable storage medium
CN109242165A (en) A kind of model training and prediction technique and device based on model training
CN110097443A (en) A kind of accounting processing method and its system based on economic affairs classification
CN107194536A (en) The power supply reliability negative benefit appraisal procedure and device of a kind of project investment
CN107679862B (en) Method and device for determining characteristic value of fraud transaction model
CN107291922B (en) A kind of data distributing method and device
Banihashemi Is horizontal curvature a significant factor of safety in rural multilane highways?
CN106372969A (en) Power user feature identification method and system
CN114757596B (en) Power business monitoring and early warning method and system and readable storage medium
CN114998022A (en) Compliance wind control method and system
CN106022915A (en) Enterprise credit risk assessment method and apparatus
CN109272412A (en) A kind of vehicle insurance Claims Resolution air control maintenance system
CN108009271A (en) With the screening method and device of the car fee evasion of falling card vehicle
CN106960019A (en) A kind of data Growth Rate Calculation method and apparatus

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20170531

RJ01 Rejection of invention patent application after publication