CN104317850B - Data processing method and device - Google Patents

Data processing method and device Download PDF

Info

Publication number
CN104317850B
CN104317850B CN201410542598.7A CN201410542598A CN104317850B CN 104317850 B CN104317850 B CN 104317850B CN 201410542598 A CN201410542598 A CN 201410542598A CN 104317850 B CN104317850 B CN 104317850B
Authority
CN
China
Prior art keywords
affairs
mode bit
data
run
data processing
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201410542598.7A
Other languages
Chinese (zh)
Other versions
CN104317850A (en
Inventor
戴培林
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Gridsum Technology Co Ltd
Original Assignee
Beijing Gridsum Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Gridsum Technology Co Ltd filed Critical Beijing Gridsum Technology Co Ltd
Priority to CN201410542598.7A priority Critical patent/CN104317850B/en
Publication of CN104317850A publication Critical patent/CN104317850A/en
Application granted granted Critical
Publication of CN104317850B publication Critical patent/CN104317850B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/28Databases characterised by their database models, e.g. relational or object models
    • G06F16/284Relational databases

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses a kind of data processing method and device.Wherein, the method for data processing includes:Obtain pending data;Multiple affairs are separately operable to handle pending data, wherein, multiple affairs are the mutual independent multiple affairs split out in advance from default affairs, default affairs are the affairs for handling pending data, default affairs include multiple subtransactions, and each affairs in multiple affairs correspond to a subtransaction in multiple subtransactions;Judge in multiple affairs with the presence or absence of the affairs of operation exception;And if it is judged that the affairs of operation exception in multiple affairs be present, then rerun the affairs of the operation exception.By the present invention, solve the problems, such as that data-handling efficiency is low in the prior art, reached the effect for the efficiency for improving data processing.

Description

Data processing method and device
Technical field
The present invention relates to data processing field, in particular to a kind of data processing method and device.
Background technology
Affairs are the elementary cells that database program performs, and can unify submission after an affairs content has all performed holds Row result, in program any step perform it is abnormal data mode can be caused to roll back to before affairs perform, and then cause to hold again The row affairs.Affairs can be a SQL (Structured in relational database (such as SQL server) Query Language, referred to as SQL) sentence or a program.
Multiple subtransactions can generally be included in affairs, the separate part of function can be divided into an affairs Multiple subtransactions, these subtransactions are performed parallel, so as to effectively improve data processing performance.Subtransaction refers to be included in one Affairs in affairs, such as a SQL statement in some transaction program.
Current is to ensure program parallelization and not broken ring complete function, and database journey is realized using the method for parallel subtransaction The parallel processing of sequence.I.e. in the complete affairs of One function, the separate part of built-in function is divided into multiple subtransactions It is parallel to perform.Such as in the data warehouse ablation process of Conceptual Modeling engineering, when the dimension table data in data warehouse has been filled Bi Hou, the data of true table can be filled parallel, data are separate between each true table, but all rely in dimension table Data.Will whole filling data warehouse process as a complete affairs, then can be the processing of dimension data and each Subtransaction is regarded in the processing of individual true table data as, and each true table data can be written in parallel to, and is multiple parallel sub- things Business.The write efficiency of data warehouse can be improved by the parallel processing of subtransaction.
However, because affairs have the characteristics of atomicity, that is, need etc. unified after the completion of all performing to submit implementing result. In the meantime any step occur it is abnormal can cause data rewind start to affairs before state.For example, in parallel subtransaction In, although subtransaction function is separate, if occurring processing exception in any one subtransaction, then other institutes can be influenceed There is parallel subtransaction, it is necessary to handle all subtransactions in affairs again when some subtransaction is handled again, and not only handle There is abnormal subtransaction.So, due to causing the affairs belonging to the subtransaction all to re-execute after certain subtransaction exception, It is low to ultimately result in data-handling efficiency.
For the problem of data-handling efficiency is low in the prior art, effective solution is not yet proposed at present.
The content of the invention
It is low to solve data-handling efficiency it is a primary object of the present invention to provide a kind of data processing method and device Problem.
To achieve these goals, a kind of one side according to embodiments of the present invention, there is provided data processing method.Root Method according to the data processing of the present invention includes:Obtain pending data;Multiple affairs are separately operable to handle pending data, Wherein, multiple affairs are mutual independent multiple affairs for being split out in advance from default affairs, preset affairs be for Handle the affairs of pending data, default affairs include multiple subtransactions, and each affairs in multiple affairs correspond to more height A subtransaction in affairs;Judge in multiple affairs with the presence or absence of the affairs of operation exception;And if it is judged that multiple things The affairs of operation exception in business be present, then rerun the affairs of the operation exception.
Further, multiple affairs are being separately operable come before handling pending data, data processing method also includes:Obtain Take multiple subtransactions in default affairs;And multiple subtransactions are extracted as multiple independent affairs, obtain multiple affairs.
Further, judge that the affairs in multiple affairs with the presence or absence of operation exception include:Record each in multiple affairs Mode bit corresponding to affairs, mode bit are used for the running status for reflecting each affairs in multiple affairs, wherein, when mode bit is pre- When bidding is known, affairs operation exception corresponding to mode bit;By judging whether mode bit is default identify to judge multiple affairs In whether there is operation exception affairs.
Further, multiple affairs are being separately operable come before handling pending data, data processing method also includes:Wound Build the state table for flag state position;The mode bit updated in state table is operation starting time, and operation starting time is more The time that individual affairs bring into operation, wherein, when each affairs end of run in multiple affairs, by end of run on state table Affairs mode bit be updated to end of run affairs the end of run time, judge whether mode bit is default mark bag Include:Judge whether mode bit is operation starting time, wherein, if it is judged that mode bit is operation starting time, it is determined that shape Affairs operation exception corresponding to state position.
Further, mode bit is digital state position, wherein, when digital state position is the first numerical value, digital state position Corresponding affairs end of run, when digital state position is second value, affairs operation exception corresponding to digital state position, judge Affairs in multiple affairs with the presence or absence of operation exception include:Judge that digital state position is the first numerical value or second value, its In, if it is judged that when digital state position is the first numerical value, determine affairs end of run corresponding to digital state position;If it is determined that When to go out digital state position be second value, affairs operation exception corresponding to digital state position is determined.
To achieve these goals, a kind of another aspect according to embodiments of the present invention, there is provided data processing equipment.Root Data processing equipment according to the present invention includes:First acquisition unit, for obtaining pending data;Processing unit, for respectively Multiple affairs are run to handle pending data, wherein, multiple affairs are mutual to be split out in advance from default affairs Independent multiple affairs, default affairs are the affairs for handling pending data, and default affairs include multiple subtransactions, multiple Each affairs in affairs correspond to a subtransaction in multiple subtransactions;Judging unit, it is in multiple affairs for judging The no affairs that operation exception be present;And running unit, for if it is judged that the affairs of operation exception in multiple affairs be present, Then rerun the affairs of the operation exception.
Further, data processing equipment also includes:Second acquisition unit, for being handled being separately operable multiple affairs Before pending data, multiple subtransactions in default affairs are obtained;And extraction unit, for multiple subtransactions to be extracted as Multiple independent affairs, obtain multiple affairs.
Further, judging unit includes:Logging modle, for recording each state corresponding to affairs in multiple affairs Position, mode bit are used for the running status for reflecting each affairs in multiple affairs, wherein, when mode bit is presets mark, state Affairs operation exception corresponding to position;First judge module, for by judging whether mode bit is that default mark is multiple to judge It whether there is the affairs of operation exception in affairs.
Further, data processing equipment also includes:Creating unit, for waiting to locate to handle being separately operable multiple affairs Before managing data, the state table for flag state position is created;Updating block, for updating the mode bit in state table for beginning Run time, operation starting time are the time that multiple affairs bring into operation, wherein, when each affairs operation knot in multiple affairs Shu Shi, the mode bit of the affairs of end of run is updated to the end of run time of the affairs of end of run on state table, sentenced Disconnected module is additionally operable to judge whether mode bit is operation starting time, wherein, if it is judged that mode bit is operation starting time, Then determine affairs operation exception corresponding to mode bit.
Further, mode bit is digital state position, wherein, when digital state position is the first numerical value, digital state position Corresponding affairs end of run, when digital state position is second value, affairs operation exception corresponding to digital state position, judge Unit includes:Second judge module, for judging that digital state position is the first numerical value or second value, wherein, if it is determined that When to go out digital state position be the first numerical value, affairs end of run corresponding to digital state position is determined;If it is judged that digital state When position is second value, affairs operation exception corresponding to digital state position is determined.
According to embodiments of the present invention, pending data is handled by being separately operable multiple affairs, wherein, multiple affairs are The mutual independent multiple affairs split out in advance from default affairs, it is for handling pending data to preset affairs Affairs, default affairs include multiple parallel subtransactions, and each affairs in multiple affairs correspond to one in multiple subtransactions Individual subtransaction, when the affairs of operation exception in judging multiple affairs be present, then it need to only rerun the abnormal affairs, nothing Other affairs need to be run.And when using an affairs, such as default affairs are to handle pending data in the prior art, when wherein There is exception in some subtransaction, because affairs have atomicity, it is necessary to all subtransactions be reruned, in the embodiment of the present invention In, it need to only rerun and abnormal affairs occur, so substantially increase the efficiency of data processing, solve and count in the prior art According to treatment effeciency it is low the problem of, reached improve data processing efficiency effect.
Brief description of the drawings
The accompanying drawing for forming the part of the application is used for providing a further understanding of the present invention, schematic reality of the invention Apply example and its illustrate to be used to explain the present invention, do not form inappropriate limitation of the present invention.In the accompanying drawings:
Fig. 1 is the flow chart of data processing method according to embodiments of the present invention;
Fig. 2 is the flow chart of preferable data processing method according to embodiments of the present invention;
Fig. 3 is the flow chart of another preferable data processing method according to embodiments of the present invention;
Fig. 4 is the schematic diagram of multiple affairs according to embodiments of the present invention;And
Fig. 5 is the schematic diagram of data processing equipment according to embodiments of the present invention.
Embodiment
It should be noted that in the case where not conflicting, the feature in embodiment and embodiment in the application can phase Mutually combination.Describe the present invention in detail below with reference to the accompanying drawings and in conjunction with the embodiments.
In order that those skilled in the art more fully understand the present invention program, below in conjunction with the embodiment of the present invention Accompanying drawing, the technical scheme in the embodiment of the present invention is clearly and completely described, it is clear that described embodiment is only The embodiment of a part of the invention, rather than whole embodiments.Based on the embodiment in the present invention, ordinary skill people The every other embodiment that member is obtained under the premise of creative work is not made, it should all belong to the model that the present invention protects Enclose.
It should be noted that term " first " in description and claims of this specification and above-mentioned accompanying drawing, " Two " etc. be for distinguishing similar object, without for describing specific order or precedence.It should be appreciated that so use Data can exchange in the appropriate case, so as to embodiments of the invention described herein.In addition, term " comprising " and " tool Have " and their any deformation, it is intended that cover it is non-exclusive include, for example, containing series of steps or unit Process, method, system, product or equipment are not necessarily limited to those steps clearly listed or unit, but may include without clear It is listing to Chu or for the intrinsic other steps of these processes, method, product or equipment or unit.
The embodiments of the invention provide a kind of data processing method.
Fig. 1 is the flow chart of data processing method according to embodiments of the present invention.As shown in figure 1, the data processing method It is as follows including step:
Step S102, obtain pending data.
Pending data can need data to be processed, for example, dimension data (such as time data) and/or true number According to (such as production marketing data).
Step S104, multiple affairs are separately operable to handle pending data, wherein, multiple affairs are from default affairs The mutual independent multiple affairs split out in advance, default affairs are the affairs for handling pending data, preset thing Business includes multiple parallel subtransactions, and each affairs in multiple affairs correspond to a subtransaction in multiple subtransactions.
Wherein, affairs are a program execution units in database, can in relational database (such as SQL server) To be a SQL statement or a program.In processing data, an affairs are divided into more height of functional independence Affairs, and these subtransactions are run, data processing performance can be effectively improved.Multiple affairs can be:Write dimension data Affairs with write-in factual data affairs.
Default affairs can be to standalone transaction corresponding to pending data, and the default affairs are for handling pending number According to affairs, for example, being the data for being written to data warehouse in pending data, then, preset affairs be then right therewith That answers writes pending data the affairs of data warehouse.Specifically, default affairs can be by SSIS (SQL Server Integration Services) for engineering by initial data write-in data warehouse, data warehouse is to be used to deposit in Conceptual Modeling The database of multi-dimensional data is stored up, is generally divided into dimension table and true table.Dimension table stores the dimension of all kinds of factual datas Information is spent, true table stores achievement data, and associated with dimension table.If dimension table is timetable, true table is product pin Table is sold, then has achievement data in production marketing table, and join with time correlation, then dimension table when being associated with.
Default affairs can include multiple subtransactions, there may be parallel subtransaction in the plurality of subtransaction.The present invention Multiple affairs of embodiment are that the multiple affairs split out in affairs according to the function of multiple subtransactions are preset from this, be that is to say, Each affairs have identical function with a subtransaction in default affairs in multiple affairs, for example, default transaction packet enclosed tool Affairs A ', subtransaction B ' and subtransaction C ', then the multiple affairs split into include:Affairs A, affairs B and affairs C, wherein, thing Business A and subtransaction A ' has identical function, and affairs B and subtransaction B ' has identical function, and affairs C has with subtransaction C ' Identical function.Affairs A, affairs B and affairs C are independent affairs, and each affairs are respectively provided with atomicity.So, use is utilized Multiple affairs handle pending data, a part for each issued transaction pending data, due to each affairs in multiple affairs Independence is respectively provided with, during pending data is handled, when some affairs appearance exception, for example, when affairs perform failure, then The abnormal affairs can only be re-executed.
Step S106, judge to whether there is the affairs of operation exception in multiple affairs.
Step S108, if it is judged that the affairs of operation exception in multiple affairs be present, then rerun the operation exception Affairs.
Judge with the presence or absence of the affairs of operation exception in multiple affairs, if it is present the abnormal affairs are reruned, Because independently of each other, other affairs in multiple affairs need not then rerun between multiple affairs;If there is no abnormal thing Business, then pending data terminates.Specifically, judge that the affairs in multiple affairs with the presence or absence of operation exception can pass through detection The mark of affairs running status is recorded, judges that this identifies whether the mode of display affairs operation exception to judge.
According to embodiments of the present invention, pending data is handled by being separately operable multiple affairs, wherein, multiple affairs are The mutual independent multiple affairs split out in advance from default affairs, it is for handling pending data to preset affairs Affairs, default affairs include multiple parallel subtransactions, and each affairs in multiple affairs correspond to one in multiple subtransactions Individual subtransaction, when the affairs of operation exception in judging multiple affairs be present, then it need to only rerun the abnormal affairs, nothing Other affairs need to be run.And when using an affairs, such as default affairs are to handle pending data in the prior art, when wherein There is exception in some subtransaction, because affairs have atomicity, it is necessary to all subtransactions be reruned, in the embodiment of the present invention In, it need to only rerun and abnormal affairs occur, so substantially increase the efficiency of data processing, solve and count in the prior art According to treatment effeciency it is low the problem of, reached improve data processing efficiency effect.
Preferably, multiple affairs are being separately operable come before handling pending data, the data processing of the embodiment of the present invention Method also includes:Obtain multiple subtransactions in default affairs;Multiple subtransactions are extracted as multiple independent affairs, obtained more Individual affairs.
Here default affairs include multiple subtransactions, i.e., the separate part of function are divided into an affairs more Individual subtransaction.To ensure program parallelization and not destroying complete function, database program is realized using the method for parallel subtransaction Parallel processing, can so improve the write efficiency of data warehouse.For example, the data warehouse in Conceptual Modeling engineering write Cheng Zhong, after the dimension data table data filling of data warehouse, the data of true table, each true table can be filled parallel Between data it is separate, but all rely on dimension table in data.Process using whole filling data warehouse is complete as one Whole affairs, then the processing of dimension data and the processing of each true table data can be regarded as subtransaction, and each thing Real table data can be written in parallel to, and be multiple parallel subtransactions.But because affairs have the characteristics of atomicity, that is, need Deng could unify after the completion of all operationss to submit.Exception, which occurs, in any step in the meantime can all cause data rewind to be held to affairs State before beginning.In parallel subtransaction, although subtransaction function is separate, if gone out in any one subtransaction Now processing is abnormal, then can influence other all parallel subtransactions.When handling again, it is necessary to handle whole mistakes in affairs again Journey, rather than just the branch for exception occur.In the present invention, parallel subtransaction is extracted as parallel independent thing by function Business, wherein, there is exception in any one subtransaction, all only can rerun the affairs in there is abnormal subtransaction, and often Individual subtransaction is all submitted to be independent, with this avoid interactional situation between parallel subtransaction (if desired for etc. whole affairs grasp It could unify to submit after the completion of work).
The embodiment of the present invention, multiple word subtransactions in default affairs are obtained, multiple subtransactions are extracted as of the invention real Multiple affairs of example are applied, specifically, each subtransaction completed in same affairs is extracted as into multiple independent affairs can be with By the way that a SQL program is split as into multiple programs or in data warehouse write-in by the Sequence in SSIS engineerings Container instruments, a Container is split as multiple Container by function and completed.So, by original multiple Subtransaction obtains multiple separate affairs so that when occurring the affairs of operation exception in multiple affairs, need to only rerun The abnormal affairs.
Fig. 2 is the flow chart of preferable data processing method according to embodiments of the present invention.The data processing side of the embodiment Method can be a kind of preferred embodiment of the data processing method of above-described embodiment.As shown in Fig. 2 the data processing method bag It is as follows to include step:
Step S202, obtain pending data.
Step S204, multiple affairs are separately operable to handle pending data.
Step S202 and step S204 is identical with the step S102 shown in Fig. 1 and step S104 successively, does not repeat here.
Step S206, records each mode bit corresponding to affairs in multiple affairs, and mode bit is used to reflect in multiple affairs The running status of each affairs.
It should be noted that mode bit can be used to indicate that the running status of multiple affairs, wherein, it is each in multiple affairs The corresponding mode bit of affairs, the mode bit can be intended to indicate that the timestamp of affairs operation, can be used for representing affairs The numerical value of operation.For example, when affairs are run successfully, the mode bit of the affairs becomes the first mark, when affairs operation exception (or Failure) when, the mode bit is changed into the second mark, in this manner it is possible to judge the affairs by detecting mode bit corresponding to affairs Whether operation is successful or abnormal.
Affairs corresponding to the mode bit judge whether multiple affairs run completion by record status bit.By recording shape The mode of state position, the operation conditions for the affairs that can be quickly detected corresponding to mode bit, so as to be handled, effectively increase Data processing performance.
Step S208, by judging whether mode bit is default identify to judge to whether there is operation exception in multiple affairs Affairs.Wherein, when mode bit for default mark when, then affairs operation exception corresponding to mode bit, i.e. default mark is in thing Business runs one set before is used for that a affairs normal operation whether mark judged.
Step S210, if judging, existence position is the affairs of default mark in affairs, it is determined that affairs operation is different Often.
Step S212, rerun and abnormal affairs occur.
According to embodiments of the present invention, by recording each mode bit corresponding to affairs in multiple affairs, multiple affairs are judged Mode bit whether be default mark, if default mark, it is determined that affairs corresponding to the mode bit occur abnormal, rerun Abnormal affairs.The affairs of operation exception are determined using mode bit, the detection speed of abnormal transaction is improved, further improves number According to the efficiency of processing.
Fig. 3 is the flow chart of another preferable data processing method according to embodiments of the present invention.At the data of the embodiment Reason method can be a kind of preferred embodiment of the data processing method of above-described embodiment.As shown in figure 3, the data processing side It is as follows that method includes step:
Step S302, obtain pending data.
Step S304, create the state table for flag state position.
Step S306, it is operation starting time to update the mode bit in state table, and operation starting time is that multiple affairs are opened Begin the time run.
Step S308, multiple affairs are separately operable to handle pending data.
Step S310, when each affairs end of run in multiple affairs, by the affairs of end of run on state table Mode bit is updated to the end of run time of the affairs of end of run.
Step S312, judge whether mode bit is operation starting time.
Step S314, if it is judged that mode bit is operation starting time, it is determined that affairs operation is different corresponding to mode bit Often, the affairs are reruned.
Specifically, the mark of running status can be used as by the use of timestamp in embodiments of the present invention.Record affairs operation Timestamp, wherein it is possible to be when affairs are run successfully, change the end of run time corresponding with the affairs in state table;Such as Fruit operation failure, then the mode bit in state table is still operation starting time, so, can by judge the mode bit whether be Operation starting time judges the state whether to be default mark, so that it is determined that affairs whether operation exception.
For example, as shown in figure 4, the data in embodiments of the invention are written with 5 affairs includes:Affairs A (write-in dimensions Data), affairs B (write-in factual data), affairs C (write-in factual data), affairs D (write-in factual data) and affairs E (write-in Factual data), wherein, share the parallel branch that 4 (affairs B, affairs C, affairs D and affairs E) writes factual data.
State table is as shown in table 1, and when multiple affairs bring into operation, the time in more row state table is when bringing into operation Between, for example, first time operation starting time is arranged to 0:00:00, the paralleling transaction first time end of run time is recorded, if the The end time of paralleling transaction normal operation is 15:00:00, then may determine that the paralleling transaction end of run time whether phase Etc. being 15:00:00, if equal, then it is assumed that this time issued transaction success, and by this end of run time 15:00:00 makees The time to be brought into operation for affairs next time.Program enters before parallel each branch, when first determining whether corresponding to the branch Between stab whether be equal to this end of run time 15:00:00, if unequal, branch's affairs operation exception, hold again The affairs of the row branch, other affairs need not rerun;If equal, branch is directly jumped out, and this end of run Time 15:00:00 conduct runs the time started next time.This step judges to be used for ensureing if a certain parallel branch appearance is different before Often, then the branch is only handled after reruning, without normal branch between processing.The effect handled again is improved with this Rate.
After branch's affairs end of run of the operation exception, timestamp corresponding to renewal is the end of run time 15:00: 00.Run and complete when whole branch's affairs, when judging whether the operation starting time of 4 paralleling transactions is equal to end of run again Between, if equal, the operation of this affairs is fully completed.In the present embodiment if operation starting time and end of run time All it is 15:00:00, then this affairs operation is fully completed.
Table 1:
Alternatively, mode bit can also be digital state position.Wherein, when digital state position is the first numerical value, digital state Affairs end of run corresponding to position, when digital state position is second value, affairs operation exception corresponding to digital state position.
First numerical value and second value can be the data for representing different running statuses, and the first numerical value can be 1, the second number Value can be 0, for example, it is assumed that when mode bit is 1, affairs corresponding to mode bit 1 terminate, that is, judge affairs normal operation;Assuming that When mode bit is 0, affairs operation exception corresponding to mode bit 0.
Specifically, before multiple paralleling transactions are run first, the mode bit of the plurality of affairs is arranged to 0, first During end of run, judge multiple affairs respectively corresponding to mode bit, if mode bit is changed into 1, multiple affairs whole normal operations, The directly affairs of operation next time;If it is 0 to have mode bit corresponding at least one affairs between multiple affairs, the corresponding shape The affairs that state position is 0 are reruned, and the affairs of other normal operations are not influenceed by the affairs of the operation exception, when the operation is different When normal affairs are completed, mode bit corresponding to renewal is 1.When whole affairs, which are run, to be completed, judge again the plurality of and act Whether mode bit corresponding to business is 1, if corresponding mode bit is all 1, the operation of this affairs is fully completed.
The embodiment of the present invention additionally provides a kind of data processing equipment.The device can realize its work(by computer equipment Energy.It should be noted that the data processing equipment of the embodiment of the present invention can be used for performing the number that the embodiment of the present invention is provided According to processing method, the data processing method of the embodiment of the present invention can also be filled by the data processing that the embodiment of the present invention is provided Put to perform.
Fig. 5 is the schematic diagram of data processing equipment according to embodiments of the present invention.As shown in figure 5, the data processing equipment Including:First acquisition unit 10, processing unit 20, judging unit 30 and running unit 40.
First acquisition unit 10 is used to obtain pending data.
Pending data can need data to be processed, for example, dimension data (such as time data) and/or true number According to (such as production marketing data).
Processing unit 20 is used to be separately operable multiple affairs to handle pending data, wherein, multiple affairs are from default The mutual independent multiple affairs split out in advance in affairs, default affairs are the affairs for handling pending data, Default affairs include multiple subtransactions, and each affairs in multiple affairs correspond to a subtransaction in multiple subtransactions.
Wherein, affairs are a program execution units in database, can in relational database (such as SQL server) To be a SQL statement or a program.In processing data, an affairs are divided into more height of functional independence Affairs, and these subtransactions are run, data processing performance can be effectively improved.Multiple affairs can be:Write dimension data Affairs with write-in factual data affairs.
Default affairs can be to standalone transaction corresponding to pending data, and the default affairs are for handling pending number According to affairs, for example, being the data for being written to data warehouse in pending data, then, preset affairs be then right therewith That answers writes pending data the affairs of data warehouse.Specifically, default affairs can be by SSIS (SQL Server Integration Services) for engineering by initial data write-in data warehouse, data warehouse is to be used to deposit in Conceptual Modeling The database of multi-dimensional data is stored up, is generally divided into dimension table and true table.Dimension table stores the dimension of all kinds of factual datas Information is spent, true table stores achievement data, and associated with dimension table.If dimension table is timetable, true table is product pin Table is sold, then has achievement data in production marketing table, and join with time correlation, then dimension table when being associated with.
Default affairs can include multiple subtransactions, there may be parallel subtransaction in the plurality of subtransaction.The present invention Multiple affairs of embodiment are that the multiple affairs split out in affairs according to the function of multiple subtransactions are preset from this, be that is to say, Each affairs have identical function with a subtransaction in default affairs in multiple affairs, for example, default transaction packet enclosed tool Affairs A ', subtransaction B ' and subtransaction C ', then the multiple affairs split into include:Affairs A, affairs B and affairs C, wherein, thing Business A and subtransaction A ' has identical function, and affairs B and subtransaction B ' has identical function, and affairs C has with subtransaction C ' Identical function.Affairs A, affairs B and affairs C are independent affairs, and each affairs are respectively provided with atomicity.So, using more Individual affairs handle pending data, a part for each issued transaction pending data, because each affairs are equal in multiple affairs With independence, during pending data is handled, when some affairs appearance exception, for example, when affairs perform failure, then may be used Only to re-execute the abnormal affairs.
Judging unit 30 is used to judge the affairs that whether there is operation exception in multiple affairs.
Running unit 40 is used for if it is judged that the affairs of operation exception in multiple affairs be present, then it is different to rerun operation Normal affairs.
Judge with the presence or absence of the affairs of operation exception in multiple affairs, if it is present the abnormal affairs are reruned, Because independently of each other, other affairs in multiple affairs need not then rerun between multiple affairs;If there is no abnormal thing Business, then pending data terminates.Specifically, judge that the affairs in multiple affairs with the presence or absence of operation exception can pass through detection The mark of affairs running status is recorded, judges that this identifies whether the mode of display affairs operation exception to judge.
According to embodiments of the present invention, pending data is handled by being separately operable multiple affairs, wherein, multiple affairs are The mutual independent multiple affairs split out in advance from default affairs, it is for handling pending data to preset affairs Affairs, default affairs include multiple parallel subtransactions, and each affairs in multiple affairs correspond to one in multiple subtransactions Individual subtransaction, when the affairs of operation exception in judging multiple affairs be present, then it need to only rerun the abnormal affairs, nothing Other affairs need to be run.And when using an affairs, such as default affairs are to handle pending data in the prior art, when wherein There is exception in some subtransaction, because affairs have atomicity, it is necessary to all subtransactions be reruned, in the embodiment of the present invention In, it need to only rerun and abnormal affairs occur, so substantially increase the efficiency of data processing, solve and count in the prior art According to treatment effeciency it is low the problem of, reached improve data processing efficiency effect.
Preferably, data processing equipment also includes:Second acquisition unit, for being treated to handle being separately operable multiple affairs Before processing data, multiple subtransactions in default affairs are obtained;And extraction unit, it is more for multiple subtransactions to be extracted as Individual independent affairs, obtain multiple affairs.
Here default affairs include multiple subtransactions, i.e., the separate part of function are divided into an affairs more Individual subtransaction.To ensure program parallelization and not destroying complete function, database program is realized using the method for parallel subtransaction Parallel processing, can so improve the write efficiency of data warehouse.For example, the data warehouse in Conceptual Modeling engineering write Cheng Zhong, after the dimension data table data filling of data warehouse, the data of true table, each true table can be filled parallel Between data it is separate, but all rely on dimension table in data.Process using whole filling data warehouse is complete as one Whole affairs, then the processing of dimension data and the processing of each true table data can be regarded as subtransaction, and each thing Real table data can be written in parallel to, and be multiple parallel subtransactions.But because affairs have the characteristics of atomicity, that is, need Deng could unify after the completion of all operationss to submit.Exception, which occurs, in any step in the meantime can all cause data rewind to be held to affairs State before beginning.In parallel subtransaction, although subtransaction function is separate, if gone out in any one subtransaction Now processing is abnormal, then can influence other all parallel subtransactions.When handling again, it is necessary to handle whole mistakes in affairs again Journey, rather than just the branch for exception occur.In the present invention, parallel subtransaction is extracted as parallel independent thing by function Business, wherein, there is exception in any one subtransaction, all only can rerun the affairs in there is abnormal subtransaction, and often Individual subtransaction is all submitted to be independent, with this avoid interactional situation between parallel subtransaction (if desired for etc. whole affairs grasp It could unify to submit after the completion of work).
The embodiment of the present invention, multiple word subtransactions in default affairs are obtained, multiple subtransactions are extracted as of the invention real Multiple affairs of example are applied, specifically, each subtransaction completed in same affairs is extracted as into multiple independent affairs can be with By the way that a SQL program is split as into multiple programs or in data warehouse write-in by the Sequence in SSIS engineerings Container instruments, a Container is split as multiple Container by function and completed.So, by original multiple Subtransaction obtains multiple separate affairs so that when occurring the affairs of operation exception in multiple affairs, need to only rerun The abnormal affairs.
Preferably, judging unit includes:Logging modle, for recording each mode bit corresponding to affairs in multiple affairs, Mode bit is used for the running status for reflecting each affairs in multiple affairs, wherein, when mode bit is presets mark, mode bit pair The affairs operation exception answered;First judge module, for by judging whether mode bit is default identify to judge multiple affairs In whether there is operation exception affairs.
It should be noted that mode bit can be used to indicate that the running status of multiple affairs, wherein, it is each in multiple affairs The corresponding mode bit of affairs, the mode bit can be intended to indicate that the timestamp of affairs operation, can be used for representing affairs The numerical value of operation.For example, when affairs are run successfully, the mode bit of the affairs becomes the first mark, when affairs operation exception (or Failure) when, the mode bit is changed into the second mark, in this manner it is possible to judge the affairs by detecting mode bit corresponding to affairs Whether operation is successful or abnormal.
Affairs corresponding to the mode bit judge whether multiple affairs run completion by record status bit.By recording shape The mode of state position, the operation conditions for the affairs that can be quickly detected corresponding to mode bit, so as to be handled, effectively increase Data processing performance.
When mode bit for default mark when, then affairs operation exception corresponding to mode bit, i.e. default mark is transported in affairs One set before that goes is used for that a affairs normal operation whether mark judged.
According to embodiments of the present invention, by recording each mode bit corresponding to affairs in multiple affairs, multiple affairs are judged Mode bit whether be default mark, if default mark, it is determined that affairs corresponding to the mode bit occur abnormal, rerun Abnormal affairs.The affairs of operation exception are determined using mode bit, the detection speed of abnormal transaction is improved, further improves number According to the efficiency of processing.
Preferably, data processing equipment also includes:Creating unit, for be separately operable multiple affairs pending to handle Before data, the state table for flag state position is created;Updating block, it is to start to transport for updating the mode bit in state table Row time, operation starting time are the time that multiple affairs bring into operation, wherein, when each affairs end of run in multiple affairs When, the mode bit of the affairs of end of run is updated to the end of run time of the affairs of end of run on state table, judged Module is additionally operable to judge whether mode bit is operation starting time, wherein, if it is judged that mode bit is operation starting time, then Determine affairs operation exception corresponding to mode bit.
Specifically, the mark of running status can be used as by the use of timestamp in embodiments of the present invention.Record affairs operation Timestamp, wherein it is possible to be when affairs are run successfully, change the end of run time corresponding with the affairs in state table;Such as Fruit operation failure, then the mode bit in state table is still operation starting time, so, can by judge the mode bit whether be Operation starting time judges the state whether to be default mark, so that it is determined that affairs whether operation exception.
For example, as shown in figure 4, the data in embodiments of the invention are written with 5 affairs includes:Affairs A (write-in dimensions Data), affairs B (write-in factual data), affairs C (write-in factual data), affairs D (write-in factual data) and affairs E (write-in Factual data), wherein, share the parallel branch that 4 (affairs B, affairs C, affairs D and affairs E) writes factual data.
State table is as shown in table 1, and when multiple affairs bring into operation, the time in more row state table is when bringing into operation Between, for example, first time operation starting time is arranged to 0:00:00, the paralleling transaction first time end of run time is recorded, if the The end time of paralleling transaction normal operation is 15:00:00, then may determine that the paralleling transaction end of run time whether phase Etc. being 15:00:00, if equal, then it is assumed that this time issued transaction success, and by this end of run time 15:00:00 makees The time to be brought into operation for affairs next time.Program enters before parallel each branch, when first determining whether corresponding to the branch Between stab whether be equal to this end of run time 15:00:00, if unequal, branch's affairs operation exception, hold again The affairs of the row branch, other affairs need not rerun;If equal, branch is directly jumped out, and this end of run Time 15:00:00 conduct runs the time started next time.This step judges to be used for ensureing if a certain parallel branch appearance is different before Often, then the branch is only handled after reruning, without normal branch between processing.The effect handled again is improved with this Rate.
After branch's affairs end of run of the operation exception, timestamp corresponding to renewal is the end of run time 15:00: 00.Run and complete when whole branch's affairs, when judging whether the operation starting time of 4 paralleling transactions is equal to end of run again Between, if equal, the operation of this affairs is fully completed.In the present embodiment if operation starting time and end of run time All it is 15:00:00, then this affairs operation is fully completed.
Alternatively, mode bit is digital state position, wherein, when digital state position is the first numerical value, digital state position is right The affairs end of run answered, when digital state position is second value, affairs operation exception, judges list corresponding to digital state position Member includes:Second judge module, for judging that digital state position is the first numerical value or second value, wherein, if it is judged that When digital state position is the first numerical value, affairs end of run corresponding to digital state position is determined;If it is judged that digital state position When being second value, affairs operation exception corresponding to digital state position is determined.
First numerical value and second value can be the data for representing different running statuses, and the first numerical value can be 1, the second number Value can be 0, for example, it is assumed that when mode bit is 1, affairs corresponding to mode bit 1 terminate, that is, judge affairs normal operation;Assuming that When mode bit is 0, affairs operation exception corresponding to mode bit 0.
Specifically, before multiple paralleling transactions are run first, the mode bit of the plurality of affairs is arranged to 0, first During end of run, judge multiple affairs respectively corresponding to mode bit, if mode bit is changed into 1, multiple affairs whole normal operations, The directly affairs of operation next time;If it is 0 to have mode bit corresponding at least one affairs between multiple affairs, the corresponding shape The affairs that state position is 0 are reruned, and the affairs of other normal operations are not influenceed by the affairs of the operation exception, when the operation is different When normal affairs are completed, mode bit corresponding to renewal is 1.When whole affairs, which are run, to be completed, judge again the plurality of and act Whether mode bit corresponding to business is 1, if corresponding mode bit is all 1, the operation of this affairs is fully completed.
It should be noted that for foregoing each method embodiment, in order to be briefly described, therefore it is all expressed as a series of Combination of actions, but those skilled in the art should know, the present invention is not limited by described sequence of movement because According to the present invention, some steps can use other orders or carry out simultaneously.Secondly, those skilled in the art should also know Know, embodiment described in this description belongs to preferred embodiment, and involved action and module are not necessarily of the invention It is necessary.
In the above-described embodiments, the description to each embodiment all emphasizes particularly on different fields, and does not have the portion being described in detail in some embodiment Point, it may refer to the associated description of other embodiment.
In several embodiments provided herein, it should be understood that disclosed device, can be by another way Realize.For example, device embodiment described above is only schematical, such as the division of the unit, it is only one kind Division of logic function, can there is an other dividing mode when actually realizing, such as multiple units or component can combine or can To be integrated into another system, or some features can be ignored, or not perform.Another, shown or discussed is mutual Coupling direct-coupling or communication connection can be by some interfaces, the INDIRECT COUPLING or communication connection of device or unit, Can be electrical or other forms.
The unit illustrated as separating component can be or may not be physically separate, show as unit The part shown can be or may not be physical location, you can with positioned at a place, or can also be distributed to multiple On NE.Some or all of unit therein can be selected to realize the mesh of this embodiment scheme according to the actual needs 's.
In addition, each functional unit in each embodiment of the present invention can be integrated in a processing unit, can also That unit is individually physically present, can also two or more units it is integrated in a unit.Above-mentioned integrated list Member can both be realized in the form of hardware, can also be realized in the form of SFU software functional unit.
If the integrated unit is realized in the form of SFU software functional unit and is used as independent production marketing or use When, it can be stored in a computer read/write memory medium.Based on such understanding, technical scheme is substantially The part to be contributed in other words to prior art or all or part of the technical scheme can be in the form of software products Embody, the computer software product is stored in a storage medium, including some instructions are causing a computer Equipment (can be personal computer, mobile terminal, server or network equipment etc.) performs side described in each embodiment of the present invention The all or part of step of method.And foregoing storage medium includes:USB flash disk, read-only storage (ROM, Read-Only Memory), Random access memory (RAM, Random Access Memory), mobile hard disk, magnetic disc or CD etc. are various to be stored The medium of program code.
The preferred embodiments of the present invention are the foregoing is only, are not intended to limit the invention, for the skill of this area For art personnel, the present invention can have various modifications and variations.Within the spirit and principles of the invention, that is made any repaiies Change, equivalent substitution, improvement etc., should be included in the scope of the protection.

Claims (10)

  1. A kind of 1. data processing method, it is characterised in that including:
    Pending data is obtained, wherein, the pending data includes dimension data and/or factual data;
    Multiple affairs are separately operable to handle the pending data, wherein, the multiple affairs are advance from default affairs The mutual independent multiple affairs split out, the default affairs are the affairs for handling the pending data, institute Stating default affairs includes multiple subtransactions, and each affairs in the multiple affairs correspond to one in the multiple subtransaction Subtransaction, wherein, the multiple affairs include:Write affairs of the affairs of the dimension data with writing the factual data;
    Judge in the multiple affairs with the presence or absence of the affairs of operation exception;And
    If it is judged that the affairs of operation exception in the multiple affairs be present, then the affairs of the operation exception are reruned.
  2. 2. data processing method according to claim 1, it is characterised in that handled being separately operable multiple affairs described Before pending data, the data processing method also includes:
    Obtain multiple subtransactions in the default affairs;And
    The multiple subtransaction is extracted as multiple independent affairs, obtains the multiple affairs.
  3. 3. data processing method according to claim 1, it is characterised in that judge in the multiple affairs with the presence or absence of fortune The abnormal affairs of row include:
    Each mode bit corresponding to affairs in the multiple affairs is recorded, the mode bit is used to reflect every in the multiple affairs The running status of individual affairs, wherein, when the mode bit is presets mark, affairs operation exception corresponding to the mode bit;
    By judging whether the mode bit is that the default mark judges to whether there is operation exception in the multiple affairs Affairs.
  4. 4. data processing method according to claim 3, it is characterised in that
    Multiple affairs are being separately operable come before handling the pending data, the data processing method also includes:Create and use In the state table for marking the mode bit;It is operation starting time to update the mode bit in the state table, described to bring into operation Time is the time that the multiple affairs bring into operation, wherein, when each affairs end of run in the multiple affairs, in institute The end of run time that the mode bit of the affairs of end of run is updated to the affairs of the end of run on state table is stated,
    Judge whether the mode bit is that the default mark includes:When whether judge the mode bit be described bring into operation Between, wherein, if it is judged that the mode bit is the operation starting time, it is determined that affairs corresponding to the mode bit are run It is abnormal.
  5. 5. data processing method according to claim 3, it is characterised in that the mode bit is digital state position, wherein, When the digital state position is the first numerical value, affairs end of run corresponding to the digital state position, when the digital state When position is second value, affairs operation exception corresponding to the digital state position,
    Judge that the affairs in the multiple affairs with the presence or absence of operation exception include:It is described first to judge the digital state position Numerical value or the second value, wherein, if it is judged that when the digital state position is first numerical value, determine the number Affairs end of run corresponding to word state position;If it is judged that when the digital state position is the second value, it is determined that described Affairs operation exception corresponding to digital state position.
  6. A kind of 6. data processing equipment, it is characterised in that including:
    First acquisition unit, for obtaining pending data, wherein, the pending data includes dimension data and/or the fact Data;
    Processing unit, the pending data is handled for being separately operable multiple affairs, wherein, the multiple affairs are from pre- If the mutual independent multiple affairs split out in advance in affairs, the default affairs are for handling the pending number According to affairs, the default affairs include multiple subtransactions, and each affairs in the multiple affairs correspond to the multiple son A subtransaction in affairs, wherein, the multiple affairs include:The affairs of the dimension data are write with writing the fact The affairs of data;
    Judging unit, for judging in the multiple affairs with the presence or absence of the affairs of operation exception;And
    Running unit, for if it is judged that the affairs of operation exception in the multiple affairs be present, then reruning the fortune The abnormal affairs of row.
  7. 7. data processing equipment according to claim 6, it is characterised in that the data processing equipment also includes:
    Second acquisition unit, it is described default for before handling the pending data, being obtained being separately operable multiple affairs Multiple subtransactions in affairs;And
    Extraction unit, for the multiple subtransaction to be extracted as into multiple independent affairs, obtain the multiple affairs.
  8. 8. data processing equipment according to claim 6, it is characterised in that the judging unit includes:
    Logging modle, for recording each mode bit corresponding to affairs in the multiple affairs, the mode bit is used to reflect institute The running status of each affairs in multiple affairs is stated, wherein, when the mode bit is presets mark, corresponding to the mode bit Affairs operation exception;
    First judge module, for by judging whether the mode bit is that the default mark judges in the multiple affairs With the presence or absence of the affairs of operation exception.
  9. 9. data processing equipment according to claim 8, it is characterised in that
    The data processing equipment also includes:Creating unit, for handling the pending number being separately operable multiple affairs According to the state table before, created for marking the mode bit;Updating block, it is for updating the mode bit in the state table Operation starting time, the operation starting time are the time that the multiple affairs bring into operation, wherein, when the multiple affairs In each affairs end of run when, the mode bit of the affairs of end of run is updated to the end of run on the state table Affairs the end of run time,
    The judge module is additionally operable to judge whether the mode bit is the operation starting time, wherein, if it is judged that institute It is the operation starting time to state mode bit, it is determined that affairs operation exception corresponding to the mode bit.
  10. 10. data processing equipment according to claim 8, it is characterised in that the mode bit is digital state position, its In, when the digital state position is the first numerical value, affairs end of run corresponding to the digital state position, when the digital shape When state position is second value, affairs operation exception corresponding to the digital state position,
    The judging unit includes:Second judge module, for judging that the digital state position is first numerical value or institute Second value is stated, wherein, if it is judged that when the digital state position is first numerical value, determine that the digital state position is right The affairs end of run answered;If it is judged that when the digital state position is the second value, the digital state position is determined Corresponding affairs operation exception.
CN201410542598.7A 2014-10-14 2014-10-14 Data processing method and device Active CN104317850B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201410542598.7A CN104317850B (en) 2014-10-14 2014-10-14 Data processing method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201410542598.7A CN104317850B (en) 2014-10-14 2014-10-14 Data processing method and device

Publications (2)

Publication Number Publication Date
CN104317850A CN104317850A (en) 2015-01-28
CN104317850B true CN104317850B (en) 2017-11-14

Family

ID=52373082

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201410542598.7A Active CN104317850B (en) 2014-10-14 2014-10-14 Data processing method and device

Country Status (1)

Country Link
CN (1) CN104317850B (en)

Families Citing this family (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105183791A (en) * 2015-08-21 2015-12-23 中国人民解放军装备学院 Transaction-based data integration method
CN106445687A (en) * 2016-09-27 2017-02-22 金蝶软件(中国)有限公司 Large transaction execution method and system
CN108205464B (en) * 2016-12-20 2022-05-06 阿里云计算有限公司 Database deadlock processing method and device and database system
CN109033301B (en) * 2018-07-16 2021-07-06 中国科学技术大学 Database transaction execution method based on graphic processor
CN110990182B (en) * 2019-12-03 2021-06-11 腾讯科技(深圳)有限公司 Transaction processing method, device, equipment and storage medium
US11544245B2 (en) 2019-12-03 2023-01-03 Tencent Technology (Shenzhen) Company Limited Transaction processing method, apparatus, and device and computer storage medium
CN112883045B (en) * 2021-03-31 2024-05-17 中国工商银行股份有限公司 Database transaction splitting execution method and device

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101089857A (en) * 2007-07-24 2007-12-19 中兴通讯股份有限公司 Internal store data base transaction method and system
CN102073540A (en) * 2010-12-15 2011-05-25 北京新媒传信科技有限公司 Distributed affair submitting method and device thereof
CN102782644A (en) * 2010-03-01 2012-11-14 国际商业机器公司 Performing aggressive code optimization with an ability to rollback changes made by the aggressive optimizations
CN103077006A (en) * 2012-12-27 2013-05-01 浙江工业大学 Multithreading-based parallel executing method for long transaction

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8024714B2 (en) * 2006-11-17 2011-09-20 Microsoft Corporation Parallelizing sequential frameworks using transactions

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101089857A (en) * 2007-07-24 2007-12-19 中兴通讯股份有限公司 Internal store data base transaction method and system
CN102782644A (en) * 2010-03-01 2012-11-14 国际商业机器公司 Performing aggressive code optimization with an ability to rollback changes made by the aggressive optimizations
CN102073540A (en) * 2010-12-15 2011-05-25 北京新媒传信科技有限公司 Distributed affair submitting method and device thereof
CN103077006A (en) * 2012-12-27 2013-05-01 浙江工业大学 Multithreading-based parallel executing method for long transaction

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
"PARAWARE中事务管理的研究和设计";李盛恩 等;《计算机科学》;20011231;第315页第3段-第318页第5段,附图2-3 *
"事务管理器软件构架及调度优化方法研究";"廖正新";《中国优秀硕士学位论文全文数据库 信息科技辑》;20110615;论文正文第37页第1段-第41页第2段,附图4.1、4.2,表4.1、4.2 *

Also Published As

Publication number Publication date
CN104317850A (en) 2015-01-28

Similar Documents

Publication Publication Date Title
CN104317850B (en) Data processing method and device
CN107704539B (en) Method and device for large-scale text information batch structuring
CN104021043B (en) The interruption re-access method and system of batch application program
CN109285076A (en) Intelligent core protects processing method, server and storage medium
WO2014021978A4 (en) Aggregating data in a mediation system
CN105740462A (en) Method for supporting data migration between different environments
CN103577455A (en) Data processing method and system for database aggregating operation
CN109983459A (en) Method and apparatus for identifying the counting of the N-GRAM occurred in corpus
US9639587B2 (en) Social network analyzer
CN108182595A (en) A kind of formulation migration efficiency method and device
CN104298570B (en) Data processing method and device
JP2023553220A (en) Process mining for multi-instance processes
WO2016119508A1 (en) Method for recognizing large-scale objects based on spark system
CN110647845A (en) Invoice data identification device, related method and related device
JP6652141B2 (en) Item name association processing method, item name association processing program, and information processing apparatus
JP2010165141A (en) Method for extracting specific location from text log, and program
WO2017114455A1 (en) Data processing method and system based on graph
CN114066331A (en) Shareholder investment information acquisition method and device, electronic equipment and storage medium
CN107153651A (en) A kind of multidimensional intersects data processing method and processing device
CN112989823B (en) Log processing method, device, equipment and storage medium
CN110188069A (en) A kind of csv file storage method, device and computer equipment
CN106940698A (en) A kind of dimension data processing method and processing device
CN115063187B (en) Electronic commerce data processing method, system, electronic device and medium
CN109558756B (en) EMV message analysis tool
CN111125830B (en) Long-period data storage inspection method based on model definition

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
PE01 Entry into force of the registration of the contract for pledge of patent right

Denomination of invention: Parallel processed data prcessing method and apparatus thereof

Effective date of registration: 20190531

Granted publication date: 20171114

Pledgee: Shenzhen Black Horse World Investment Consulting Co., Ltd.

Pledgor: Beijing Guoshuang Technology Co.,Ltd.

Registration number: 2019990000503

PE01 Entry into force of the registration of the contract for pledge of patent right
CP02 Change in the address of a patent holder

Address after: 100083 No. 401, 4th Floor, Haitai Building, 229 North Fourth Ring Road, Haidian District, Beijing

Patentee after: Beijing Guoshuang Technology Co.,Ltd.

Address before: 100086 Beijing city Haidian District Shuangyushu Area No. 76 Zhichun Road cuigongfandian 8 layer A

Patentee before: Beijing Guoshuang Technology Co.,Ltd.

CP02 Change in the address of a patent holder