CN116628283A - Manual data verification method based on big data - Google Patents
Manual data verification method based on big data Download PDFInfo
- Publication number
- CN116628283A CN116628283A CN202310722078.3A CN202310722078A CN116628283A CN 116628283 A CN116628283 A CN 116628283A CN 202310722078 A CN202310722078 A CN 202310722078A CN 116628283 A CN116628283 A CN 116628283A
- Authority
- CN
- China
- Prior art keywords
- data
- subset
- manual
- subsets
- feedback
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000000034 method Methods 0.000 title claims abstract description 24
- 238000013524 data verification Methods 0.000 title claims abstract description 17
- 238000013075 data extraction Methods 0.000 claims description 6
- 238000001514 detection method Methods 0.000 description 2
- 238000012795 verification Methods 0.000 description 2
- 230000009286 beneficial effect Effects 0.000 description 1
- 238000013479 data entry Methods 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 238000007689 inspection Methods 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/903—Querying
- G06F16/90335—Query processing
- G06F16/90348—Query processing by searching ordered data, e.g. alpha-numerically ordered data
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q40/00—Finance; Insurance; Tax strategies; Processing of corporate or income taxes
- G06Q40/12—Accounting
- G06Q40/125—Finance or payroll
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q50/00—Systems or methods specially adapted for specific business sectors, e.g. utilities or tourism
- G06Q50/10—Services
- G06Q50/26—Government or public services
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02D—CLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
- Y02D10/00—Energy efficient computing, e.g. low power processors, power management or thermal management
Landscapes
- Engineering & Computer Science (AREA)
- Business, Economics & Management (AREA)
- Theoretical Computer Science (AREA)
- Finance (AREA)
- Accounting & Taxation (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Strategic Management (AREA)
- Databases & Information Systems (AREA)
- Development Economics (AREA)
- General Business, Economics & Management (AREA)
- Tourism & Hospitality (AREA)
- Marketing (AREA)
- Economics (AREA)
- General Health & Medical Sciences (AREA)
- Human Resources & Organizations (AREA)
- Health & Medical Sciences (AREA)
- Primary Health Care (AREA)
- Data Mining & Analysis (AREA)
- Educational Administration (AREA)
- General Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Technology Law (AREA)
- Debugging And Monitoring (AREA)
Abstract
The invention discloses a manual data verification method based on big data in the field of data verification, which can divide standard data and entered data after manual entry into a plurality of first data subsets and second data subsets respectively, and inquire the first data subsets and the second data subsets respectively.
Description
Technical Field
The invention relates to the field of data verification, in particular to a manual data verification method based on big data.
Background
In the customs inspection process, enterprises need to provide own electronic accounts for customs departments, various influences are caused in the process of uploading electronic account data, and errors can be caused in the data due to the influence. Therefore, the uploaded data needs to be checked and compared, however, although the traditional data comparison method has simpler comparison logic, the comparison and operation speed are slower, and the comparison efficiency is not high, and the specific measure is that one piece of comparison data is extracted from the comparison data, and then the comparison data is compared with the compared data one by one and fed back to the comparison result one by one, and the comparison method has extremely huge operand and low comparison operation efficiency.
Disclosure of Invention
Technical problem to be solved
Aiming at the problems in the prior art, the invention provides a manual data verification method based on big data.
Technical proposal
The invention is realized by the following technical scheme:
a manual data verification method based on big data comprises the following steps: s1: acquiring manual data and defining the manual data as a first data set, acquiring input data of the manual, defining the input data as a second data set, wherein the sequence positions of the data in the first data set and the second data set are the same in one-to-one correspondence; splitting manual data and entered data into a plurality of first data subsets and second data subsets respectively; by adopting a splitting and inquiring mode, the area with unequal data before and after the manual input can be quickly positioned, and then the reasons of the unequal data can be further inquired.
S2: splitting data in a first data set into a plurality of first data subsets according to a fixed length, and splitting data in a second data set into a plurality of second data subsets according to a fixed length;
s3: respectively inquiring the first data subset and the second data subset to obtain a first data feedback subset and a second data feedback subset which are obtained in response to the search command;
s4: the first data feedback subset and the second data feedback subset are equal in number of target data obtained by responding to the search command, and the next first data feedback subset and the second data feedback subset are sequentially inquired; the method specifically comprises the steps that when target data in a first data feedback subset and target data in a second data feedback subset obtained by corresponding search commands are equal, the probability of occurrence of problems in the first data subset and the second data subset is smaller, and therefore the next first data subset and the next second data subset are queried sequentially;
s5: if the number of target data obtained by the response search command of the first data feedback subset is not equal to that of target data obtained by the response search command of the second data feedback subset, the first data subset and the second data subset are extracted through the data extraction module, and then the data in the first data subset and the second data subset are compared through the data comparison module; the method specifically comprises the steps of comparing the first data subset and the second data subset of the part preferentially to acquire the reasons of unequal target data when the target data in the first data feedback subset and the second data feedback subset obtained by corresponding search commands are unequal.
Further, the data sequence of the plurality of first data subsets is spliced and arranged and then is the same as the data sequence in the first data set.
Further, the data sequence of the plurality of second data subsets is spliced and arranged in the same order as the data sequence of the second data sets.
Further, in S5, the target data amounts obtained by the response search command of the first data feedback subset and the second data feedback subset are equal, and then the data in the first data subset and the second data subset are sequentially compared by the data comparison module.
Further, in S5, the data comparison module marks data corresponding to inconsistencies in the first subset of data and the second subset of data.
Further, the data, corresponding to the inconsistent data in the first data subsets and the second data subsets, are uniformly extracted by the data extraction module and are collected into a list for display.
Advantageous effects
Compared with the known public technology, the technical scheme provided by the invention has the following beneficial effects:
the invention provides a manual data verification method based on big data, which can divide manual data and entered data into a plurality of first data subsets and second data subsets respectively, and inquire the first data subsets and the second data subsets respectively, when target data in the first data subsets and the second data subsets obtained by corresponding search commands are equal, the probability of occurrence of problems in the first data subsets and the second data subsets is smaller, so that the next first data subsets and the second data subsets are inquired sequentially, when target data in the first data subsets and the second data subsets obtained by corresponding search commands are not equal, the first data subsets and the second data subsets of the part are preferentially compared, and the reasons of the unequal target data are obtained. The invention adopts a splitting and inquiring mode, can rapidly locate the area of unequal data before and after the manual input, then further inquire the reasons of the unequal data, can rapidly find the phenomenon of data loss and locate the specific area, and has high verification efficiency and high detection speed.
Drawings
In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly described below. It is evident that the drawings in the following description are only some embodiments of the present invention and that other drawings may be obtained from these drawings without inventive effort for a person of ordinary skill in the art.
FIG. 1 is a unitary frame diagram of the present invention;
Detailed Description
In order to make the objects, technical solutions and advantages of the embodiments of the present invention more clear, the technical solutions of the embodiments of the present invention will be clearly and completely described below with reference to the accompanying drawings in the embodiments of the present invention. It will be apparent that the described embodiments are some, but not all, embodiments of the invention. All other embodiments, which can be made by those skilled in the art based on the embodiments of the invention without making any inventive effort, are intended to be within the scope of the invention.
The invention is further described below with reference to examples.
Embodiment 1, in combination with fig. 1, a manual data verification method based on big data includes the steps of: s1: acquiring manual data and defining the manual data as a first data set, acquiring input data of the manual, defining the input data as a second data set, wherein the sequence positions of the data in the first data set and the second data set are the same in one-to-one correspondence;
s2: splitting data in a first data set into a plurality of first data subsets according to a fixed length, and splitting data in a second data set into a plurality of second data subsets according to a fixed length;
s3: respectively inquiring the first data subset and the second data subset to obtain a first data feedback subset and a second data feedback subset which are obtained in response to the search command;
s4: the first data feedback subset and the second data feedback subset are equal in number of target data obtained by responding to the search command, and the next first data feedback subset and the second data feedback subset are sequentially inquired;
s5: and if the target data quantity obtained by the response search command of the first data feedback subset and the second data feedback subset is unequal, the first data subset and the second data subset are extracted through the data extraction module, and then the data in the first data subset and the second data subset are compared through the data comparison module.
The invention provides a manual data verification method based on big data, which can divide manual data and entered data into a plurality of first data subsets and second data subsets respectively, and inquire the first data subsets and the second data subsets respectively, when target data in the first data subsets and the second data subsets obtained by corresponding search commands are equal, the probability of occurrence of problems in the first data subsets and the second data subsets is smaller, so that the next first data subsets and the second data subsets are inquired sequentially, when target data in the first data subsets and the second data subsets obtained by corresponding search commands are not equal, the first data subsets and the second data subsets of the part are preferentially compared, and the reasons of the unequal target data are obtained. The invention adopts a splitting and inquiring mode, can rapidly locate the area of unequal data before and after the manual input, then further inquire the reasons of the unequal data, can rapidly find the phenomenon of data loss and locate the specific area, and has high verification efficiency and high detection speed.
Further, the data sequence of the plurality of first data subsets is spliced and arranged and then is the same as the data sequence in the first data set.
Further, the data sequence of the plurality of second data subsets is spliced and arranged in the same order as the data sequence of the second data sets.
Further, in S5, the target data amounts obtained by the response search command of the first data feedback subset and the second data feedback subset are equal, and then the data in the first data subset and the second data subset are sequentially compared by the data comparison module.
Further, in S5, the data comparison module marks data corresponding to inconsistencies in the first subset of data and the second subset of data. The data which are inconsistent in correspondence among the plurality of first data subsets and the plurality of second data subsets are uniformly extracted by the data extraction module and are collected into a list for display, so that the data difference before and after manual data entry can be displayed clearly.
In the description of the present specification, the descriptions of the terms "one embodiment," "example," "specific example," and the like, mean that a particular feature, structure, material, or characteristic described in connection with the embodiment or example is included in at least one embodiment or example of the present invention. In this specification, schematic representations of the above terms do not necessarily refer to the same embodiments or examples. Furthermore, the particular features, structures, materials, or characteristics described may be combined in any suitable manner in any one or more embodiments or examples.
The preferred embodiments of the invention disclosed above are intended only to assist in the explanation of the invention. The preferred embodiments are not intended to be exhaustive or to limit the invention to the precise form disclosed. Obviously, many modifications and variations are possible in light of the above teaching. The embodiments were chosen and described in order to best explain the principles of the invention and the practical application, to thereby enable others skilled in the art to best understand and utilize the invention. The invention is limited only by the claims and the full scope and equivalents thereof.
Claims (6)
1. A manual data verification method based on big data is characterized by comprising the following steps: the method comprises the following steps: s1: acquiring manual data and defining the manual data as a first data set, acquiring input data of the manual, defining the input data as a second data set, wherein the sequence positions of the data in the first data set and the second data set are the same in one-to-one correspondence; splitting manual data and entered data into a plurality of first data subsets and second data subsets respectively; by adopting a splitting and inquiring mode, the area with unequal data before and after the manual input can be quickly positioned, and then the reasons of the unequal data can be further inquired.
S2: splitting data in a first data set into a plurality of first data subsets according to a fixed length, and splitting data in a second data set into a plurality of second data subsets according to a fixed length;
s3: respectively inquiring the first data subset and the second data subset to obtain a first data feedback subset and a second data feedback subset which are obtained in response to the search command;
s4: the first data feedback subset and the second data feedback subset are equal in number of target data obtained by responding to the search command, and the next first data feedback subset and the second data feedback subset are sequentially inquired; the method specifically comprises the steps that when target data in a first data feedback subset and target data in a second data feedback subset obtained by corresponding search commands are equal, the probability of occurrence of problems in the first data subset and the second data subset is smaller, and therefore the next first data subset and the next second data subset are queried sequentially;
s5: if the number of target data obtained by the response search command of the first data feedback subset is not equal to that of target data obtained by the response search command of the second data feedback subset, the first data subset and the second data subset are extracted through the data extraction module, and then the data in the first data subset and the second data subset are compared through the data comparison module; the method specifically comprises the steps of comparing the first data subset and the second data subset of the part preferentially to acquire the reasons of unequal target data when the target data in the first data feedback subset and the second data feedback subset obtained by corresponding search commands are unequal.
2. The big data-based manual data verification method according to claim 1, wherein: the data sequence of the plurality of first data subsets is spliced and arranged and then is the same as the data arrangement sequence in the first data set.
3. The big data-based manual data verification method according to claim 1, wherein: the data sequence of the plurality of second data subsets is spliced and arranged and then is the same as the data arrangement sequence in the second data set.
4. The big data-based manual data verification method according to claim 1, wherein: in S5, the number of target data obtained by the first data feedback subset and the second data feedback subset in response to the search command is equal, and then the data in the first data subset and the second data subset are sequentially compared by the data comparison module.
5. The big data-based manual data verification method according to claim 1, wherein: in S5, the data comparison module marks data corresponding to inconsistencies in the first subset of data and the second subset of data.
6. The big data based manual data verification method according to claim 5, wherein: and uniformly extracting and converging the data which are inconsistent in correspondence among the plurality of first data subsets and the plurality of second data subsets into a list for display by adopting a data extraction module.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202310722078.3A CN116628283A (en) | 2019-06-04 | 2019-06-04 | Manual data verification method based on big data |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202310722078.3A CN116628283A (en) | 2019-06-04 | 2019-06-04 | Manual data verification method based on big data |
CN201910482758.6A CN110362722A (en) | 2019-06-04 | 2019-06-04 | A kind of handbook data method of calibration based on big data |
Related Parent Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201910482758.6A Division CN110362722A (en) | 2019-06-04 | 2019-06-04 | A kind of handbook data method of calibration based on big data |
Publications (1)
Publication Number | Publication Date |
---|---|
CN116628283A true CN116628283A (en) | 2023-08-22 |
Family
ID=68215151
Family Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202310722078.3A Pending CN116628283A (en) | 2019-06-04 | 2019-06-04 | Manual data verification method based on big data |
CN201910482758.6A Pending CN110362722A (en) | 2019-06-04 | 2019-06-04 | A kind of handbook data method of calibration based on big data |
Family Applications After (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201910482758.6A Pending CN110362722A (en) | 2019-06-04 | 2019-06-04 | A kind of handbook data method of calibration based on big data |
Country Status (1)
Country | Link |
---|---|
CN (2) | CN116628283A (en) |
Family Cites Families (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2010006416A1 (en) * | 2008-06-30 | 2010-01-21 | Ali Davar | System and method for interacting with a plurality of search engines |
CN101639849B (en) * | 2009-06-01 | 2011-06-01 | 北京四维图新科技股份有限公司 | Method for processing data and device therefor |
CN104298736B (en) * | 2014-09-30 | 2017-10-17 | 华为软件技术有限公司 | Data acquisition system connection method, device and Database Systems |
CN107291716B (en) * | 2016-03-30 | 2020-07-21 | 阿里巴巴集团控股有限公司 | Link data checking method and device |
CN108268496A (en) * | 2016-12-30 | 2018-07-10 | 北京国双科技有限公司 | The method of calibration and device of data |
CN109344151A (en) * | 2018-08-08 | 2019-02-15 | 航天信息股份有限公司 | The method of batch registration and modification user information |
CN109308299B (en) * | 2018-09-12 | 2020-01-14 | 北京字节跳动网络技术有限公司 | Method and apparatus for searching information |
CN109635256B (en) * | 2018-12-20 | 2023-07-11 | 上海掌门科技有限公司 | Method and device for verifying data |
-
2019
- 2019-06-04 CN CN202310722078.3A patent/CN116628283A/en active Pending
- 2019-06-04 CN CN201910482758.6A patent/CN110362722A/en active Pending
Also Published As
Publication number | Publication date |
---|---|
CN110362722A (en) | 2019-10-22 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
DE112016005292T5 (en) | Automatic deviation detection service on heterogeneous protocol streams | |
CN109858476B (en) | Tag expansion method and electronic equipment | |
RU2016124134A (en) | MAKING REQUESTS FOR PERFORMANCE ON MULTIDIMENSIONAL DATA STRUCTURES | |
JP2007148677A5 (en) | ||
CN102207857B (en) | Method, device and system for identifying graphical user interface (GUI) element | |
WO2022116107A1 (en) | Data management platform, intelligent defect analysis system, intelligent defect analysis method, computer-program product, and method for defect analysis | |
US11106650B2 (en) | Data selection system and data selection method | |
CN115797808A (en) | Unmanned aerial vehicle inspection defect image identification method, system, device and medium | |
CN105354138A (en) | Method for determining specific control attribute of mobile phone page | |
CN116628283A (en) | Manual data verification method based on big data | |
CN111696095B (en) | Method and device for detecting surface defects of object | |
CN111104418B (en) | Text description processing method and device and electronic equipment | |
CN112711542B (en) | Method for switching testing environment of mobile phone client | |
WO2016046729A1 (en) | Inspection tool for manufactured components | |
CN105320603B (en) | The alignment system and method and its automated testing method of browser page element | |
CN110955603A (en) | Automatic testing method and device, electronic equipment and computer readable storage medium | |
KR102434442B1 (en) | Method of performing defect inspection of inspection object at high speed and apparatuses performing the same | |
CN107105328A (en) | A kind of control method of set top box, device and system | |
CN111783891B (en) | Customized object detection method | |
CN103995895A (en) | Image recognition method and device based on image | |
CN103034719A (en) | CPU (Central Processing Unit) type identification method, CPU type identification device and hardware detection system | |
CN106446902B (en) | Non-legible image-recognizing method and device | |
CN111475997A (en) | Bit number arranging method, system, equipment and medium for printed circuit board diagram | |
CN111460821A (en) | Entity identification and linking method and device | |
US11829336B1 (en) | Systems and methods for automated data dictionary generation and validation |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |