Summary of the invention
The purpose of this invention is to provide a kind of processing method for service stream that can improve the polycaryon processor reliability, and can realize the polycaryon processor of said method.
For achieving the above object, the invention provides a kind of processing method for service stream of polycaryon processor, comprising:
Shunting nuclear is examined first traffic flow assignment according to the traffic flow assignment principle to first work;
The first work nuclear is replied response signal according to predefined response latent period to shunting nuclear in the processing procedure to first Business Stream;
Shunting is examined in each response latent period and is judged whether to receive the response signal that comes from the first work nuclear, when not receiving this response signal, first traffic flow assignment is examined to second work.
For achieving the above object, the present invention also provides a kind of polycaryon processor, comprises shunting nuclear and a plurality of work nuclear, wherein,
Described shunting is examined and is comprised,
The traffic flow assignment unit is used for according to the traffic flow assignment principle and/or comes from the judged result of response signal judging unit, and the control Business Stream is in the internuclear distribution of work;
The response signal judging unit is used for judging whether to receive the response signal that comes from work nuclear in the latent period in each response, when judging when not receiving response signal, judged result is sent to the traffic flow assignment unit;
Described work is examined and is comprised,
The Business Stream processing unit is used for the Business Stream that comes from shunting nuclear is handled;
Response signal is replied the unit, is used for the process handled to the Business Stream that comes from shunting nuclear, replys response signal according to predefined response latent period to shunting nuclear.
Therefore based on the present invention, because the duty by shunting checking work nuclear is monitored, when certain work nuclear generation is unusual, the traffic flow assignment of originally distributing to this work nuclear is given the work nuclear of other operate as normal, therefore can not interrupt processing, thereby improve the reliability of polycaryon processor Business Stream.Because this method is based on the existing existing internal resource of polycaryon processor, need not increase other alternate devices in networking, therefore reduced production cost; And owing to do not need to establish in addition spare core, do not need to take service port yet, therefore improved the service efficiency of system resource.In addition,, can not produce in the time of therefore can guaranteeing to redistribute Business Stream, thereby further reduce influence that Business Stream is handled than long time delay because the described method of present embodiment realizes that the speed of internal bus is very high in that polycaryon processor is inner.
Below by drawings and Examples, technical scheme of the present invention is described in further detail.
Embodiment
Embodiment 1
It is a kind of by being monitored by the duty of shunting checking work nuclear that present embodiment provides, to improve the processing method for service stream of polycaryon processor reliability, as shown in Figure 1,
Step 101, shunting nuclear is examined first traffic flow assignment according to the traffic flow assignment principle to first work.Wherein, shunting nuclear is meant in existing multi-core parallel concurrent is managed business, and is arranged on the Business Stream porch, the nuclear that a plurality of Business Streams that are used for receiving distribute between the work nuclear of a plurality of concurrent workings.The traffic flow assignment principle can be determined according to prior art, as distributing according to the five-tuple information in the Business Stream, or distribute according to the busy-idle condition of different operating nuclear etc., to make a plurality of Business Streams distribute to work nuclear uniformly in a word, to give full play to the work efficiency of all working nuclear as far as possible.In this step, first Business Stream is meant the Business Stream of distributing to the first work nuclear according to existing business flow distribution principle.Shunting nuclear and work nuclear phase ratio, because the function of shunting nuclear is fairly simple, it is very little to occur unusual possibility, so can ignore to the influence of the reliability of whole polycaryon processor.
Step 102, first work is checked first Business Stream and is handled, and in processing procedure, replys response signal according to predefined response latent period to shunting nuclear.If the enough operate as normal of the first work nuclear energy do not take place unusually, then periodically reply response signal to shunting nuclear, shunt nuclear with notice, this first work nuclear is in normal operating conditions.Otherwise unusual if the first work nuclear has taken place, unusual or hardware fault is unusual etc. as running software, just this first work nuclear can't be replied response signal within the scheduled time.In addition, if the service data information of this first Business Stream as results of intermediate calculations, formula variable etc., need be shared with other work nuclear, these service data information can also be kept in the shared drive of polycaryon processor.
Step 103, shunting nuclear judges whether to receive the response signal that comes from the first work nuclear in each response latent period, when judging when not receiving this response signal, with first traffic flow assignment to the second work nuclear, and no longer to the first unusual work nuclear distribution services stream has taken place.Wherein, the second work nuclear is meant another work nuclear that is different from the first work nuclear that is in normal operating conditions.If shunting nuclear is not received response signal, it is unusual to illustrate that the first work nuclear has taken place when processing first Business Stream, if this first Business Stream still has been untreated, then shunting nuclear is examined this first traffic flow assignment to second work.When if second work is checked first Business Stream and is handled, need use the related service data message that produces when the first work nuclear is handled first Business Stream, then can obtain these service data information by the shared drive described in the accessing step 102.
In addition, need to prove, in some cases, owing to reasons such as software processes time-delay or communication failures, though work nuclear is in normal operating conditions, but failing in the response latent period response signal to be replied to shunting in time examines, perhaps the response signal of Hui Fuing fails in time to arrive shunting nuclear, and it is unusual to make shunting nuclear think that this work nuclear has taken place by mistake, just interrupts to this work nuclear distribution services stream, processing to Business Stream can cause unnecessary influence, also causes the waste of resource.In order to reduce the erroneous judgement of shunting checking work nuclear state as far as possible, desensitising, can preestablish one and judge number of times, have only when shunting nuclear and judge when not receiving response signal according to the inferior number average of this judgements, just first traffic flow assignment is examined to second work, and no longer to the first work nuclear distribution services stream.Only for example can preestablish all judges when not receiving response signal in continuous three response latent periods, just first Business Stream is redistributed, and no longer distribute other Business Streams to this first work nuclear, prevented influence as much as possible to work nuclear operate as normal.
In addition, also need to prove, in the polycaryon processor abnormality detection hardware cell can also be set, work nuclear is monitored, take place when unusual, authorize to shunting and send look-at-me when detecting work nuclear by the abnormality detection hardware cell; After shunting nuclear is received this look-at-me, first traffic flow assignment is examined to second work, thereby guaranteed reliability of processor.Realize that by hardware the advantage that the work of work nuclear is monitored is that response speed is faster.Combine with step 101 to 103 described methods, when the response latent period is longer, can detect the unusual of work nuclear quickly, to prevent in the Business Stream implementation, owing to waits for too long impacts.
By the described method of present embodiment, because the duty by shunting checking work nuclear is monitored, when certain work nuclear generation is unusual, the traffic flow assignment of originally distributing to this work nuclear is given the work nuclear of other operate as normal, therefore can not interrupt processing, thereby improve the reliability of polycaryon processor Business Stream.Because this method is based on the existing existing internal resource of polycaryon processor, need not increase other alternate devices in networking, therefore reduced production cost; And owing to do not need to establish in addition spare core, do not need to take service port yet, therefore improved the service efficiency of system resource.In addition,, can not produce in the time of therefore can guaranteeing to redistribute Business Stream, thereby further reduce influence that Business Stream is handled than long time delay because the described method of present embodiment realizes that the speed of internal bus is very high in that polycaryon processor is inner.
Embodiment 2
Embodiment 1 described method has improved the reliability of whole polycaryon processor, but unusual work nuclear has taken place can't resume operation automatically, caused the waste of resource to a certain extent, utilization ratio for raising work nuclear, it is a kind of by resetting to unusual work nuclear has taken place that present embodiment provides, and makes it recover the processing method for service stream of operate as normal.As shown in Figure 2,
Step 101,102 and 103 identical with embodiment 1 described step repeats no more herein.
Step 104, shunting nuclear are also authorized to first work and are sent reset instruction signal after first traffic flow assignment is examined to second work, make the first work nuclear carry out reset operation.Particularly, shunting is endorsed to authorize and send reset instruction signal to first work in person, perhaps can also authorize to first work and send reset instruction signal by being arranged on the abnormality detection hardware cell in the polycaryon processor or unusual operate as normal nuclear not taking place.Wherein, the abnormality detection hardware cell is arranged on being used in the polycaryon processor and detects the hardware cell that unusual work nuclear has taken place, and can reset to work nuclear by this hardware cell.In addition, by prior art, also can be can the reset function of other work nuclears of work nuclear design, for example in this step, the first work nuclear has taken place unusual, can check this first work nuclear by other operate as normal and reset.
Step 105 after the first work nuclear is finished reset operation, is authorized to shunting and to be sent the confirmation signal that resets.Finish reset operation if fail, also just can't send this signal certainly.
Step 106, shunting nuclear judge whether to receive the confirmation signal that resets that comes from the first work nuclear in the predefined stand-by period that resets, receive when resetting confirmation signal execution in step 111, otherwise execution in step 112 when judging.
Step 111 after shunting nuclear is received this confirmation signal that resets, continues according to the traffic flow assignment principle to the first work nuclear distribution services stream.Receive the confirmation signal that resets that comes from the first work nuclear, the success that resetted of this work nuclear is described, recovered normal operating conditions, therefore continue to its distribution services stream, particularly, still uncompleted first Business Stream can be redistributed to the first work nuclear, perhaps first work of giving is examined and is handled with other new traffic flow assignment, distributes according to existing traffic assignments principle in a word to get final product.
Step 112 sends caution signal to output device.Work nuclear takes place to have software reason or hardware reason unusually, generally speaking, by the software reason cause unusually can be by the solution that resets, and hardware hardware reason such as be damaged can't be solved by resetting.Therefore, if in the stand-by period that resets, receive the confirmation signal that resets that comes from the first work nuclear failure that resets of this work nuclear is not described, and hardware damage has taken place probably, therefore send warning signal and carry out respective handling with the prompting user.
Though it is pointed out that herein by the software reason cause unusually can be by the solution that resets, the success that not necessarily can once reset, perhaps resetting need be for a long time.If surpassed the predefined stand-by period that resets its actual reset time, then shunting nuclear can't be judged this first work nuclear energy enough by the reparation that resets, but can judge it mistakenly hardware damage has taken place.In order to reduce the generation of erroneous judgement as far as possible, judge first in the predefined stand-by period that resets when shunting nuclear and not receive when resetting confirmation signal, do not send caution signal to output device immediately, but according to the described method of step 104, authorize to first work once more and send reset instruction signal, and in the predefined stand-by period that resets, judge whether to receive the confirmation signal that resets that comes from the first work nuclear again, have only when shunting nuclear according to predefined judgement number of times, all judge and do not receive when resetting confirmation signal, just send caution signal to output device, show the first work nuclear because hardware reason has caused unusually, notify the user to solve.
By the described method of present embodiment, the first work nuclear that has stopped normal operation has in time recovered running status by reset operation, has therefore improved the utilization factor of work nuclear.And, even the hardware damage fault has taken place, also can in time notify the user to solve, do not influence the operate as normal of other work nuclears, guaranteed the reliability of whole polycaryon processor.
In addition, it is pointed out that in the step 106 of present embodiment, also can not preestablish the stand-by period that resets,, then continue to its distribution services stream if shunting nuclear is received the confirmation signal that resets that comes from the first work nuclear; Do not receive else if, can be left intact yet, but notify the user to solve by abnormality detection hardware cell or other operate as normal nuclear that the first work nuclear is resetted described in the step 104.
Embodiment 3
Present embodiment provides a kind of polycaryon processor that can improve reliability.As shown in Figure 3, be this polycaryon processor inner structure synoptic diagram.Polycaryon processor 10 comprises: shunting nuclear 20, a plurality of work nuclears are as the nuclear 30,40 etc. of working.Wherein, shunting nuclear 20 comprises response signal judging unit 22 and traffic flow assignment unit 21; Each work nuclear includes response signal and replys unit and Business Stream processing unit.
When work, traffic flow assignment unit 21 is examined according to the traffic flow assignment principle Business Stream that receives to each work.The traffic flow assignment principle can be determined according to prior art, as distributing according to the five-tuple information in the Business Stream, or distribute according to the busy-idle condition of different operating nuclear etc., to make a plurality of Business Streams distribute to work nuclear uniformly in a word, to give full play to the work efficiency of all working nuclear as far as possible.Comprise one first Business Stream in the Business Stream of supposing to receive traffic flow assignment unit 21, this first Business Stream is meant the Business Stream of distributing to work nuclear 30 according to existing business flow distribution principle.Shunting nuclear 20 and several work nuclear phase ratios, because the function of shunting nuclear 20 is fairly simple, it is very little to occur unusual possibility, so can ignore to the influence of the reliability of whole polycaryon processor 10.
32 pairs of the Business Stream processing units of work nuclear 30 come from first Business Stream of shunting nuclear 20 to be handled.In processing procedure, response signal is replied unit 31 and is replied response signal with predefined response latent period to shunting nuclear 20.If work nuclear 30 can operate as normal, do not take place unusually, then can periodically reply response signals to shunting nuclear 20, with notice shunting nuclear 20, this work nuclear 30 is in normal operating conditions.Otherwise unusual if work nuclear 30 has taken place, unusual or hardware fault is unusual etc. as running software, can not reply response signal within the scheduled time just the response signal of this work nuclear 30 is replied unit 31.In addition, if the service data information of this first Business Stream as results of intermediate calculations, formula variable etc., need be shared with other work nuclear, these service data information can also be kept in the shared drive (not marking among the figure) of polycaryon processor 10.
The response signal judging unit 22 of shunting nuclear 20 judges whether to receive the response signal that comes from work nuclear 30 in each response latent period, when judging when not receiving this response signal, judged result is sent to traffic flow assignment unit 21 by internal bus, taken place unusual with informing business stream assignment unit 21 work nuclears 30.Traffic assignments unit 21 examines 40 with first traffic flow assignment to work after receiving this judged result, and no longer examines 30 distribution services stream to unusual work has taken place.Wherein, work nuclear 40 is meant another work nuclear of the work that the is different from nuclear 30 that is in normal operating conditions.If shunting nuclear 20 is not received response signal, explanation work is examined 30 and has been taken place when handling first Business Stream unusually, if this first Business Stream still has been untreated, then shunting nuclear 20 examines 40 with this first traffic flow assignment to work.When 42 pairs of these first Business Streams of Business Stream processing unit of work nuclear 40 are handled, also to reply unit 41 and reply response signal, make the duty of 20 pairs of work nuclears 40 of shunting nuclear keep monitoring to shunting nuclear 20 by response signal.When if 40 pairs first Business Streams of work nuclear are handled, need use the related service data message that produces when work nuclear 30 is handled first Business Stream, then can obtain these service data information by the shared drive of visit polycaryon processor 10.
In addition, need to prove, in some cases, owing to reasons such as software processes time-delay or communication failures, though work nuclear 30 is in normal operating conditions, but fail in the response latent period, response signal to be replied to shunting in time and examine 20, perhaps the response signal of Hui Fuing fails in time to arrive shunting nuclear 20, make shunting nuclear 20 think that this work nuclear 30 has taken place unusual by mistake, just interrupt to work nuclear 30 distribution services stream, processing to Business Stream can become unnecessary influence, also causes the waste of resource.In order to reduce the erroneous judgement of 20 pairs of work nuclear of shunting nuclear, 30 states as far as possible, desensitising, can preestablish one and judge number of times, has only the response signal judging unit 22 of working as shunting nuclear 20 according to this judgement number of times, for example continuous three times, all judge when not receiving response signal, just judged result is sent to traffic flow assignment unit 21, by traffic flow assignment unit 21 first traffic flow assignment is examined 40 to work, and no longer to work nuclear 30 distribution services stream.Prevent influence as much as possible to the operate as normal of work nuclear 30.
By the described method of present embodiment, because shunting nuclear 20 can be monitored the duty of all working nuclear, when certain work nuclear generation is unusual, the traffic flow assignment of originally distributing to this work nuclear is given the work nuclear of other operate as normal, therefore can not interrupt processing, thereby improve the reliability of polycaryon processor Business Stream.Because this method is based on the existing existing internal resource of polycaryon processor, need not increase other alternate devices in networking, therefore reduced production cost; And owing to do not need to establish in addition spare core, do not need to take service port yet, therefore improved the service efficiency of system resource.In addition,, can not produce in the time of therefore can guaranteeing to redistribute Business Stream, thereby further reduce influence that Business Stream handled than long time delay because the described method of present embodiment realizes that the speed of internal bus is very high in that polycaryon processor is inner.And therefore the work nuclear that has stopped normal operation improved the utilization factor of work nuclear by the reset operation state that can in time resume operation.And, even the hardware damage fault has taken place, also can in time notify the user to solve, do not influence the operate as normal of other work nuclears, guaranteed the reliability of whole polycaryon processor.
Embodiment 4
Present embodiment provides a kind of can making on the basis of embodiment 3 that the polycaryon processor that unusual work nuclear resumes operation takes place.As shown in Figure 4, shunting nuclear 20 also comprises reset instruction unit 23, and every work nuclear also comprises the control module that resets, as the control module 33,43 etc. that resets.
When work, response signal judging unit 22 is judged when not receiving the response signal that comes from work nuclear 30, when judged result is sent to traffic flow assignment unit 21 by internal bus, also this judged result is sent to reset instruction unit 23, taken place unusual with notice reset instruction unit 23 work nuclears 30.After this judged result is received in reset instruction unit 23, send reset instruction signal to work nuclear 30.After the control module 33 that resets of work nuclear 30 is received this reset instruction signal, work nuclear 30 is carried out reset operation.Reset after the success, the control module 33 that resets sends the confirmation signal that resets to shunting nuclear 20.Finish reset operation if fail, also just can't send this signal certainly.The reset instruction unit 23 of shunting nuclear 20 judges whether to receive the confirmation signal that resets that comes from work nuclear 30 in the predefined stand-by period that resets, receive when resetting confirmation signal when judging, the result that will reset sends to traffic flow assignment unit 21, is continued according to the traffic flow assignment principle to work nuclear 30 distribution services stream by traffic flow assignment unit 21.Otherwise send caution signal to output device (not marking among the figure).Work nuclear takes place to have software reason or hardware reason unusually, generally speaking, by the software reason cause unusually can be by the solution that resets, and hardware hardware reason such as be damaged can't be solved by resetting.Therefore, if work nuclear 30 failure that resets is judged in reset instruction unit 23, hardware damage has taken place probably then, therefore sending warning signal solves with the prompting user.
Though it is pointed out that herein by the software reason cause unusually can be by the solution that resets, the success that not necessarily can once reset, perhaps resetting need be for a long time.If surpassed the predefined stand-by period that resets its actual reset time, then reset instruction unit 23 can't be judged this work nuclear 30 and can but can judge it mistakenly hardware damage take place by the reparation that resets.In order to reduce the generation of erroneous judgement as far as possible, judge first in the predefined stand-by period that resets when reset instruction unit 23 and not receive when resetting confirmation signal, do not send caution signal to output device immediately, but send reset instruction signal to work nuclear 30 once more, and in the predefined stand-by period that resets, judge whether to receive that coming from work examines 30 the confirmation signal that resets again, has only the reset instruction unit 23 of working as according to predefined judgement number of times, do not receive when resetting confirmation signal as all judging for continuous three times, just send caution signal to output device, show work nuclear 30 because hardware reason has caused unusually, notify the user to solve.
It should be noted last that, above embodiment is only unrestricted in order to technical scheme of the present invention to be described, although the present invention is had been described in detail with reference to preferred embodiment, those of ordinary skill in the art is to be understood that, can make amendment or be equal to replacement technical scheme of the present invention, and not break away from the spirit and scope of technical solution of the present invention.