CN109448792A - Choosing method, device and the electronic equipment of interpreter's gene - Google Patents

Choosing method, device and the electronic equipment of interpreter's gene Download PDF

Info

Publication number
CN109448792A
CN109448792A CN201811096578.6A CN201811096578A CN109448792A CN 109448792 A CN109448792 A CN 109448792A CN 201811096578 A CN201811096578 A CN 201811096578A CN 109448792 A CN109448792 A CN 109448792A
Authority
CN
China
Prior art keywords
interpreter
genome
gene
maximum
value
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201811096578.6A
Other languages
Chinese (zh)
Other versions
CN109448792B (en
Inventor
张芃
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Language Network (wuhan) Information Technology Co Ltd
Original Assignee
Language Network (wuhan) Information Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Language Network (wuhan) Information Technology Co Ltd filed Critical Language Network (wuhan) Information Technology Co Ltd
Priority to CN201811096578.6A priority Critical patent/CN109448792B/en
Publication of CN109448792A publication Critical patent/CN109448792A/en
Application granted granted Critical
Publication of CN109448792B publication Critical patent/CN109448792B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Landscapes

  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)

Abstract

The embodiment of the present invention provides choosing method, device and the electronic equipment of a kind of interpreter's gene, this method comprises: choosing multiple interpreter's genomes respectively;For each interpreter's genome, multiple successful match rate samples are obtained, and calculate the mean value and standard deviation of the corresponding successful match rate of interpreter's genome accordingly;The corresponding contribution genome of the maximum in all mean values is chosen as maximum contribution genome, and its mean value and standard deviation are respectively defined as Largest Mean and maximum standard deviation;Its corresponding Z value is calculated based on its corresponding mean value and standard deviation and Largest Mean and maximum standard deviation for each contribution genome in addition to maximum contribution genome;Based on the corresponding Z value of each interpreter's genome, the gene met in the gene in the interpreter's genome to impose a condition and maximum interpreter's genome is merged, the interpreter's gene finally chosen is obtained.The embodiment of the present invention enables to the interpreter's gene selected that can preferably embody the otherness between interpreter.

Description

Choosing method, device and the electronic equipment of interpreter's gene
Technical field
The present embodiments relate to technical field of data processing, more particularly, to a kind of interpreter's gene choosing method, Device and electronic equipment.
Background technique
Information age and networking have changed a lot translation mode.Platform is managed using translation flow, Talent's data is stored according to different objects, to match most suitable interpreter according to contribution to be translated.Different interpreters, institute The key message for including is not quite similar, then according to these key messages, can match most suitable translation contribution for interpreter, thus Effectively improve translation efficiency and translation accuracy.
Interpreter matches with the gene of contribution to be referred to contribution gene and interpreter's gene under set strategy through Matching Model, It is embodied as the process that contribution finds best interpreter.Selected is used to carry out the matched interpreter's gene of gene and other interpreter's genes It compares, it should can preferably embody the otherness of interpreter, could be so that contribution to be translated is matched to the interpreter being more suitable for.
Interpreter's gene is referred mainly to by carrying out analytical calculation, quantification treatment, accessed presence to interpreter's characteristic attribute It is combined in key message specific interpreter, that be different from other interpreters, unique.There are many sources of interpreter's gene, In the social epoch, all data of the every act and every move of interpreter can extract gene.
Interpreter's gene is present in all interpreters of management platform, and different interpreters have different interpreter's genes.Due to tool Body application difference, presently, there are interpreter/manuscript gene matching algorithm selection interpreter gene to be matched carry out matching meter When calculation, the corresponding assortment of genes is often rule of thumb selected.
But in interpreter's course of work, gene can with the promotion of ability, the increase of time, accumulation of knowledge and send out Raw corresponding variation.I.e. with the processing of task, examine and revise evaluation with QC, accumulation, the community activity of history corpus participation with And the activities such as test of interpreter's ability, interpreter's gene will be constantly updated.Therefore, above-mentioned interpreter's gene selects mode empirically It can have some limitations, cause the interpreter's gene selected that cannot embody the otherness between interpreter well.
Summary of the invention
In order to overcome the above problem or at least be partially solved the above problem, the embodiment of the present invention provides a kind of interpreter's base Choosing method, device and the electronic equipment of cause, with so that the interpreter's gene selected can preferably embody the difference between interpreter It is anisotropic.
In a first aspect, the embodiment of the present invention provides a kind of choosing method of interpreter's gene, comprising: arranged from alternative interpreter's gene In table, the different gene of multiple groups is chosen respectively, constitutes multiple interpreter's genomes;For interpreter's genome described in each, carry out Multiple matching result sampling, obtains multiple successful match rate samples, and be based on the multiple successful match rate sample, calculates this and translate The mean value and standard deviation of the corresponding successful match rate of member's genome;Choose the corresponding interpreter's base of the maximum in all mean values Because of group, it is defined as maximum interpreter's genome, and the mean value of maximum interpreter's genome is defined as Largest Mean, by institute The standard deviation for stating maximum interpreter's genome is defined as maximum standard deviation;For in all interpreter's genomes except it is described most Each described interpreter's genome except big interpreter's genome, is based on the corresponding mean value of interpreter's genome and the mark The quasi- poor and Largest Mean and the maximum standard deviation, calculate the corresponding Z value of interpreter's genome;Based on all described The corresponding Z value of each described interpreter's genome in interpreter's genome in addition to maximum interpreter's genome, from institute There are the interpreter's genome for choosing in interpreter's genome and meeting and imposing a condition, and interpreter's gene that the satisfaction is imposed a condition The gene in gene and maximum interpreter's genome in group merges, and obtains the interpreter's gene finally chosen;Wherein, the Z Value indicates Z value in the verifying of large sample otherness.
Second aspect, the embodiment of the present invention provide a kind of selecting device of interpreter's gene, comprising: initial gene chooses mould Block, for choosing the different gene of multiple groups respectively, constituting multiple interpreter's genomes from alternative interpreter's list of genes;First meter Module is calculated, for multiple matching result sampling being carried out, obtaining multiple successful match rate samples for interpreter's genome described in each This, and it is based on the multiple successful match rate sample, calculate the mean value and standard of the corresponding successful match rate of interpreter's genome Difference;Maximum genome is chosen module and is defined as most for choosing the corresponding interpreter's genome of the maximum in all mean values Big interpreter's genome, and the mean value of maximum interpreter's genome is defined as Largest Mean, by maximum interpreter's base Because the standard deviation of group is defined as maximum standard deviation;Second computing module, for for being removed in all interpreter's genomes Each described interpreter's genome except the maximum interpreter genome, based on the corresponding mean value of interpreter's genome and The standard deviation and the Largest Mean and the maximum standard deviation calculate the corresponding Z value of interpreter's genome;Final base Because choosing module, for based on being translated described in each in all interpreter's genomes in addition to maximum interpreter's genome The corresponding Z value of member's genome chooses the interpreter's genome for meeting and imposing a condition from all interpreter's genomes, and will Gene in the gene met in the interpreter's genome to impose a condition and maximum interpreter's genome merges, and obtains final Interpreter's gene of selection;Wherein, the Z value indicates Z value in the verifying of large sample otherness.
The third aspect, the embodiment of the present invention provide a kind of electronic equipment, comprising: at least one processor, at least one Manage device, communication interface and bus;The memory, the processor and the communication interface are completed mutual by the bus Communication, the communication interface between the electronic equipment and interpreter's information equipment information transmission;In the memory It is stored with the computer program that can be run on the processor, when the processor executes the computer program, is realized such as The choosing method of interpreter's gene described in upper first aspect.
Fourth aspect, the embodiment of the present invention provide a kind of non-transient computer readable storage medium, the non-transient calculating Machine readable storage medium storing program for executing stores computer instruction, and the computer instruction executes the computer described in first aspect as above The choosing method of interpreter's gene.
Choosing method, device and the electronic equipment of interpreter's gene provided in an embodiment of the present invention, by being translated in advance from all Multiple groups interpreter genome is chosen in interpreter's gene pool of member, and by calculating Z value corresponding to these interpreter's genomes, to choose Z value meets the interpreter's genome to impose a condition, to choose as final as a result, enabling the interpreter's gene selected more preferable Embody interpreter between otherness.
Detailed description of the invention
In order to more clearly explain the embodiment of the invention or the technical proposal in the existing technology, to embodiment or will show below There is attached drawing needed in technical description to be briefly described, it should be apparent that, the accompanying drawings in the following description is this hair Bright some embodiments for those of ordinary skill in the art without creative efforts, can be with root Other attached drawings are obtained according to these attached drawings.
Fig. 1 is the flow diagram of the choosing method of interpreter's gene provided in an embodiment of the present invention;
Fig. 2 is the pass according to interpreter's feature and interpreter's gene in the choosing method of interpreter's gene provided in an embodiment of the present invention It is schematic diagram;
Fig. 3 is the structural schematic diagram of the selecting device of interpreter's gene provided in an embodiment of the present invention;
Fig. 4 is the entity structure schematic diagram of electronic equipment provided in an embodiment of the present invention.
Specific embodiment
In order to make the object, technical scheme and advantages of the embodiment of the invention clearer, below in conjunction with the embodiment of the present invention In attached drawing, technical scheme in the embodiment of the invention is clearly and completely described, it is clear that described embodiment is A part of the embodiment of the embodiment of the present invention, instead of all the embodiments.Based on the embodiment in the embodiment of the present invention, ability Domain those of ordinary skill every other embodiment obtained without making creative work, belongs to the present invention The range of embodiment protection.
There are many sources of interpreter's gene, and in the social epoch, all data of the every act and every move of interpreter can be extracted Gene out.Due to the difference of concrete application, presently, there are interpreter/manuscript gene matching algorithm in the to be matched of selection interpreter When gene carries out matching primitives, the corresponding assortment of genes is often rule of thumb selected.But conventional method has certain limitation Property, cause the interpreter's gene selected that cannot embody the otherness of interpreter well.
In view of the above-mentioned problems, the embodiment of the present invention from interpreter's gene pool of interpreter by choosing multiple groups interpreter gene in advance Group, and by calculating Z value corresponding to these interpreter's genomes, meet the interpreter's genome to impose a condition to choose Z value, to make It chooses for final as a result, the interpreter's gene selected is enabled preferably to embody the otherness between interpreter.Wherein, Z value table Show Z value in the verifying of large sample otherness.
As the one aspect of the embodiment of the present invention, the present embodiment provides a kind of choosing methods of interpreter's gene, with reference to figure 1, it is the flow diagram of the choosing method of interpreter's gene provided in an embodiment of the present invention, comprising:
S101 chooses the different gene of multiple groups respectively, constitutes multiple interpreter's genomes from alternative interpreter's list of genes.
It is to be understood that before the interpreter's gene for carrying out the present embodiment is chosen, it in advance can be according to all properties of interpreter Information establishes alternative interpreter's list of genes, may include in alternative interpreter's list of genes relevant to interpreter's particular community All genes.Specifically, alternative interpreter's list of genes may be considered a gene pool, with gene for singly in the gene pool Position storage has the gene relevant to interpreter's information extracted from all interpreters, i.e. interpreter's gene.Interpreter's gene refers mainly to pass through Analytical calculation, quantification treatment carried out to interpreter's characteristic attribute, it is accessed be present in specific interpreter, be different from other interpreters , the combination of unique key message.
According to alternative interpreter's list of genes in this step, multiple groups interpreter gene is chosen respectively, and translate respectively with each group Member's gene constitutes a genome, and as interpreter's genome, which is the interpreter's genome selected.It is understood that , can be from multiple interpreters in alternative interpreter's list of genes in random selection table when carrying out the selection of each group interpreter gene Gene, then the interpreter's gene randomly selected using these may be constructed a genome, as interpreter's genome.
It is of course also possible to which predefined decimation rule, e.g., while extracting or successively extracting, interlacing is extracted or specified line number It extracts, according to different interpreter's information extractions of gene characterization, quantity of extraction, etc..Carrying out actual extraction process later When, the extraction for each group of interpreter's gene extracts phase from alternative interpreter's list of genes according to the decimation rule predetermined The multiple genes answered.
For example, randomly selecting 3-5 different genes from alternative interpreter's list of genes, as one group of gene, one is constituted A interpreter's genome.It then adopts in a like fashion, can choose or successively choose simultaneously multiple groups gene respectively, constitute multiple Interpreter's genome, the embodiment of the present invention to this with no restriction.
S102 carries out multiple matching result sampling for each interpreter's genome, obtains multiple successful match rate samples This, and multiple successful match rate samples are based on, calculate the mean value and standard deviation of the corresponding successful match rate of interpreter's genome.
It is to be understood that for the interpreter's genome selected for each group, it is thus necessary to determine that the matching of itself and contribution is imitated Fruit, so that selection is more suitable for the matched interpreter's gene of gene.Meanwhile in order to without loss of generality, for each group of interpreter's genome, Interpreter's genome can be inputted to given Matching Model, carry out multiple matching result sampling using given Matching Model, every time Sampling can obtain a successful match rate sample.
It is understood that carrying out successful match rate sample using Matching Model for each group of interpreter's genome When acquisition, the gene in this group of interpreter's genome is input in Matching Model, which can be according to the original text itself provided Part gene calculates the successful match rate score of gene in interpreter's genome and contribution gene automatically and exports, then matches mould The successful match rate score of type output can be used as a successful match rate sample.For same interpreter's genome, carry out more Secondary above-mentioned matching result sampling process, then available multiple successful match rate samples.
Later, it for each interpreter selected genome, is obtained according to above-mentioned multiple matching result sampling Multiple successful match rate samples calculate the comprehensive matching success rate of interpreter's genome, that is, calculate separately interpreter's genome pair The mean value and standard deviation for the successful match rate answered.It is understood that each successful match rate sample, actually primary The successful match rate score sampled with result.
For example, it is assumed that carrying out matching result sampling according to some interpreter's genome, n successful match rate sample difference is obtained For p1,p2,...pn.The mean value of the corresponding successful match rate of interpreter's genome is then calculated according to it are as follows:
In formula, E (p) indicates the mean value of the corresponding successful match rate of interpreter's genome, piIndicate i-th of interpreter's genome Successful match rate sample, n indicate the total number of the successful match rate sample for interpreter's genome acquisition.
On this basis, the standard deviation for calculating the corresponding successful match rate of interpreter's genome is as follows:
In formula, S indicates the standard deviation of the corresponding successful match rate of interpreter's genome, and E (p) indicates that interpreter's genome is corresponding The mean value of successful match rate, piIndicate that i-th of successful match rate sample of interpreter's genome, n indicate to be directed to interpreter's genome The total number of the successful match rate sample of acquisition.
Wherein, in one embodiment, multiple matching result sampling is being carried out, is obtaining the step of multiple successful match rate samples Before rapid, further includes: according to the demand of the gene matching primitives precision with contribution to be translated, be set for matching result sampling Total degree threshold value, i.e. given threshold.Then accordingly in actual samples, the number of acquisition successful match rate sample is total not less than this Frequency threshold value.For example, for each interpreter's genome, it is desirable that the number of the successful match rate sample of extraction is no less than 50, then The data 50 are preset total degree threshold value.
S103 chooses the corresponding interpreter's genome of the maximum in all mean values, is defined as maximum interpreter's genome, and will The mean value of maximum interpreter's genome is defined as Largest Mean, and the standard deviation of maximum interpreter's genome is defined as maximum standard deviation.
It can be understood as, it is assumed that total group of number of interpreter's genome of selection is that m group can calculate then according to above-mentioned steps To the corresponding mean value of m group and standard deviation.The embodiment of the present invention chooses value the maximum first from m mean value, and by the maximum Interpreter's genome corresponding to person is defined as maximum interpreter's genome.Then correspondingly, the mean value of maximum interpreter's genome is determined Justice is Largest Mean, with variable EmaxIt indicates, the standard deviation of maximum interpreter's genome is defined as maximum standard deviation, uses variable SmaxIt indicates.
S104 is based on each interpreter's genome in all interpreter's genomes in addition to maximum interpreter's genome It is corresponding to calculate interpreter's genome for the corresponding mean value of interpreter's genome and standard deviation and Largest Mean and maximum standard deviation Z value;Wherein, Z value indicates Z value in the verifying of large sample otherness.
It is to be understood that in all interpreter's genomes are calculated according to above-mentioned steps in addition to maximum interpreter's genome Each of on the basis of the mean value and standard deviation of the corresponding successful match rate of interpreter's genome that select, these are selected Interpreter's genome, calculate its Z value.Specifically, for each of these interpreter's genomes interpreter's genome, according to it The standard deviation and mean value of corresponding successful match rate, in conjunction with the corresponding Largest Mean E of maximum interpreter's genomemaxIt is marked with maximum Quasi- difference Smax, calculate separately the corresponding Z value of interpreter's genome.
It is understood that the concept of Z value therein is the verifying of large sample otherness, the i.e. concept of Z value in Z verifying.Z Inspection is the method for being generally used for large sample (i.e. sample size is greater than 30) mean difference and examining.It is with standard normal point The theory of cloth come infer difference occur probability, so that whether the difference for comparing two average significant.When known standard deviation, Whether the mean value for verifying one group of number is equal with a certain desired value.Translating of selecting is measured in the embodiment of the present invention using Z verifying The matching difference verifying of member's genome, therefore the calculating of Z value is carried out to interpreter's genome that each is selected.
S105, it is corresponding based on each interpreter's genome in all interpreter's genomes in addition to maximum interpreter's genome Z value chooses the interpreter's genome for meeting and imposing a condition, and interpreter's base that the satisfaction is imposed a condition from all interpreter's genomes Because the gene in the gene and maximum interpreter's genome in group merges, the interpreter's gene finally chosen is obtained.
It is to be understood that the Z except the extragenic each interpreter's genome of maximum interpreter can be calculated according to above-mentioned steps Value, may determine that otherness performance of each corresponding interpreter's genome when carrying out gene matching according to the Z value.Therefore, according to every The corresponding Z value of a interpreter's genome, can use preset setting condition, judges that the corresponding interpreter's genome of the Z value is The no otherness requirement for meeting setting.If conditions are not met, then reject it from each interpreter's genome selected, it is final remaining All interpreter's genomes not being removed are satisfactory interpreter's genome, include that Z value is full in these interpreter's genomes The interpreter's genome and maximum interpreter's genome that foot setting otherness requires.Gene in remaining all interpreter's genomes is taken Out, and after removing the duplicate factor in these genes, one group of new gene is formed, i.e., as the interpreter's gene finally chosen.
For example, it is assumed that acquiring n successful match rate sample, these successful match rates in total for some interpreter's genome Sample meets normal distribution.Meanwhile it presetting and having selected the setting condition of interpreter's gene for the confidence level for the gene selected is not Lower than 95%, the Z value which corresponds to interpreter's genome is 1.96.Then, for each the interpreter's genome selected, Its corresponding Z value is compared with 1.96, if Z value is greater than 1.96, the corresponding interpreter's genome of the Z value is rejected, otherwise, Retain the corresponding interpreter's genome of the Z value.
Assuming that eliminating p according to above-mentioned treatment process from a interpreter's genome selected of all n and being unsatisfactory for setting Interpreter's genome of condition, remaining n-p interpreter genome are to meet to impose a condition.Then, in this n-p interpreter's genome In, may there are two or more than two interpreter's genomes in contain some interpreter's gene simultaneously.Therefore this n-p are translated Whole interpreter's genes in member's genome take out, and are put into a new gene pool, multiple for occurring in the gene pool Each interpreter's gene, rejects extra and only retains interpreter's gene.It is included in this final new gene pool Multiple non-repetitive interpreter's genes, using these genes as the interpreter's gene finally chosen.
The choosing method of interpreter's gene provided in an embodiment of the present invention, by advance from interpreter's gene pool of all interpreters Multiple groups interpreter genome is chosen, and by calculating Z value corresponding to these interpreter's genomes, meets setting condition to choose Z value Interpreter's genome, as between final choose as a result, the interpreter's gene selected is enabled preferably to embody interpreter Otherness.In addition, the interpreter chosen accordingly and contribution to be translated can be made to carry out more reasonable in gene matching application Match, to effectively improve translation efficiency and translation accuracy rate.
Wherein, in one embodiment, from alternative interpreter's list of genes, the step of the different gene of multiple groups is chosen respectively Before rapid, the method for the embodiment of the present invention further include:
Corresponding gene is extracted from the basic information of all interpreters, ability information, credit information and posterior infromation respectively, And it is correspondingly formed basic information gene, ability information gene, credit information gene and the posterior infromation gene of interpreter;
Based on basic information gene, ability information gene, credit information gene and posterior infromation gene, alternative interpreter is constituted List of genes.
It is to be understood that there are many sources of interpreter's gene, and in the social epoch, all data of the every act and every move of interpreter Gene can be extracted, by the sources of interpreter's gene, the present embodiment extracts interpreter's gene from the following aspects, Constitute alternative interpreter's list of genes:
Basic information, the personal relevant information of interpreter, such as name, age, location and contact information;
Ability information, the translation ability information that interpreter possesses, languages direction, industry field and the translation speed being such as good at Deng;
Credit information, the credit information that interpreter accumulates during undertaking translations such as hand over original text rate and midway in time Rate of sending back the manuscript etc.;
Posterior infromation, the correlation experience that interpreter accumulates during long campaigns translation, such as translate total number of word and Total amount etc..
Above- mentioned information based on interpreter extract the corresponding corresponding gene of interpreter respectively, and according to above-mentioned various aspects, formation pair Basic information gene, ability information gene, credit information gene and the posterior infromation gene answered.Later, above-mentioned various aspects are based on Gene, constitute alternative interpreter's list of genes.For example, it is corresponding standby that basic information can be constructed for the basic information of interpreter The person's of translating selectively list of genes is as shown in table 1, for according to a kind of alternative interpreter's list of genes of basic information of the embodiment of the present invention.
Table 1, a kind of alternative interpreter's list of genes of basic information according to an embodiment of the present invention
Then, when carrying out the selection of multiple interpreter's genomes according to table 1, multiple points in each data item can be randomly selected Not corresponding interpreter's gene, such as selection to " institute learns profession " corresponding gene " oil exploitation " and " abroad works and learns to pass through Go through " corresponding gene " having ", then interpreter's genome is constituted with the two.Using same treatment process, can also choose not Multiple and different interpreter's genomes.
If can choose likewise, decimation rule has been previously set to choose gene relevant to interpreter's qualification information The corresponding gene such as " IM ", " learning profession ", " date of birth " and " overseas work experience " in table 1 constitutes interpreter's genome.
The choosing method of interpreter's gene provided in an embodiment of the present invention, by from the basic information of interpreter, ability information, letter With four aspects of information and posterior infromation, the gene of interpreter is extracted respectively, and constitute alternative interpreter's list of genes accordingly, to carry out The selection and matching of more excellent interpreter's gene can more fully consider the specific information of interpreter's different aspect, for more reasonably into The matching of row gene provides reliable basis.
Wherein, according to the above embodiments optionally, respectively from all basic informations of interpreter, ability information, credit letter The step of extracting corresponding gene in breath and posterior infromation further comprises:
Obtain interpreter all basic informations, ability information, credit information and posterior infromation, and respectively from basic information, Interpreter's feature is obtained in ability information, credit information and posterior infromation;
Based on interpreter's feature, interpreter's direct gene of interpreter is extracted.
It is to be understood that interpreter's gene is present in interpreter, different interpreters have different genes, there is general character but more important Be otherness to be extracted gene, can just be treated in this way with differentiation, match best interpreter.
But gene is not feature, simply can not explicitly recognize, extract so needing step.Gene and spy Sign is characterized in taking out a certain concept to characteristic common to object there are essential distinction.Include segment attribute in feature, and belongs to Most basic information --- the gene of object included in property.
Therefore the present embodiment is when carrying out the extraction of interpreter's gene, first according to the four of the interpreter of above-described embodiment aspects Information extracts corresponding characteristic information, as interpreter's feature.Later, according to different interpreter's features, interpreter is extracted most respectively Basic information constitutes interpreter's direct gene.For example, as shown in Fig. 2, for according to the choosing of interpreter's gene provided in an embodiment of the present invention Take the relation schematic diagram of interpreter's feature and interpreter's gene in method.
The choosing method of interpreter's gene provided in an embodiment of the present invention is further extracted by the extraction to interpreter's feature Interpreter's gene, what can be got is present in specific interpreter, be different from other interpreters, unique key message.
Wherein, according to the above embodiments optionally, multiple matching result sampling is carried out, multiple successful match rate samples are obtained This step of, further comprises:
Matching result sampling multiple for any wheel, executes following process flow:
The initial value of the successful match rate of all interpreter's genomes is initially set;
Interpreter's genome is randomly selected from all interpreter's genomes, and interpreter's genome of selection is matched Test, and based on to interpreter's genome this match test successful match rate result and history match success rate as a result, more The current successful match rate value of new interpreter's genome;
It repeats and randomly selects to the step of update, until the number to the match test of any interpreter's genome reaches First given threshold stops the match test to interpreter's genome, and records the current successful match rate of interpreter's genome Value;
To interpreter's genome other than the interpreter's genome for stopping match test, the step randomly selected to record is repeated Suddenly, until reaching the second given threshold to the total degree of the match test of all interpreter's genomes, then each interpreter's gene is recorded The current successful match rate value of group, and terminate the multiple matching result sampling of epicycle, into the multiple matching result sampling of next round, directly Reach third given threshold to the total wheel number for executing multiple matching result sampling, the quantity for obtaining each interpreter's genome is third The successful match rate sample of given threshold.
It is to be understood that according to the above embodiments, for interpreter's genome that each group selects, it is thus necessary to determine that The matching effect of itself and contribution, so that selection is more suitable for the matched interpreter's gene of gene.Meanwhile in order to without loss of generality, for Each group of interpreter's genome carries out multiple matching result sampling.And specifically carrying out each group of interpreter's genome selected When matching result samples, carried out using above-mentioned Matching Model.
Specifically, can use given Matching Model, carry out the multiple matching results sampling of more wheels, obtain multiple matchings at When power sample, it can be assumed that have chosen m group interpreter's genome according to the above embodiments, then it can be to each interpreter's genome Successful match rate sampled, (no less than general 30 times) matching experiment is carried out repeatedly based on the above m genome, it is every to take turns It is as follows with test process:
Step 1, initializing set, such as Initialize installation are carried out to the value of the successful match rate of each interpreter's genome It is 0.
Step 2, interpreter's genome is randomly choosed, successful match rate result is carried out in given Matching Model and calculates, Obtain the successful match rate result of this match test.Meanwhile in conjunction in epicycle multiple matching result sampling historical record it The successful match rate of preceding match test for several times is as a result, i.e. history match success rate is as a result, calculate the interpreter's genome chosen Current successful match rate value.
Step 3, repeatedly circulation executes above-mentioned steps 1 and 2, is from all interpreter's bases due to choosing interpreter's genome every time all Because randomly selecting in group, therefore each genome may be different by the number of carry out match test, then when to some interpreter's gene When the number of the match test of group reaches the first given threshold, that is, stop the epicycle match test to interpreter's genome, And when recording stopping test, the current successful match rate value of interpreter's genome.
Step 4, remaining interpreter's genome except interpreter's genome of the first given threshold is reached for removing, continues to hold The process flow of row above-mentioned steps 1-3 stops epicycle matching until the total degree of epicycle match test reaches the second given threshold Test.At this point for each interpreter's genome, there is a successful match rate value to be corresponding to it, as the multiple matching result of epicycle M successful match rate sample can be obtained then for m interpreter's genome by sampling obtained successful match rate sample.
So, for all interpreter's genomes, the above-mentioned multiple matching of more wheels (such as reaching third given threshold) is carried out As a result sample, it can obtain multiple successful match rate samples of each interpreter's genome, such as wheel number is set as n, then matching at Power sample number is n (n is typically no less than 50).
For example, it is assumed that having selected a1、a2And a3Totally three interpreter's genomes, and preset the first given threshold, second Given threshold and third given threshold are respectively 3,8 and 5.Then, in the multiple matching result sampling of each round:
First time selection is carried out first, from a1、a2And a3In randomly select one, such as choose and arrive a1, then to a1Progress With test, test result is successful match, then obtains a1Successful match rate value be 100%.
It is chosen followed by second, it is assumed that choose and arrive a2, match test is carried out to it, obtains test result as matching It is unsuccessful, then obtain a2Successful match rate value be 0%.
Next third time selection is carried out again, it is assumed that and choose and arrive a1, and match test result is to match unsuccessful, then root According to a1In total twice match test as a result, obtaining a1Current successful match rate value is 50%.
Next the 4th selection is carried out again, it is assumed that is chosen and is arrived a3, and match test result is successful match, then obtains a3 Successful match rate value be 100%.
Next the 5th selection is carried out again, it is assumed that and choose and arrive a1, and match test result is successful match, then basis To a1In total three times match test as a result, obtaining a1Current successful match rate value is 66.6%.At this point, to a1Matching examination It tests number and has had reached the first given threshold 3, then stop continuing to a1Carry out match test, and export its current matching at Interpreter's genome a in the multiple matching result sampling of performance number 66.6%, as epicycle1Successful match rate sample.
Next the 6th selection is carried out again, due to a13 match tests are had reached, then only in a2And a3Middle progress Match test is randomly selected and carries out, specific selection and match test process are similar with above-mentioned steps.In this way, until total matching The number of test, i.e., to a1、a2And a3The total degree of match test when reaching the second given threshold 8 times, terminate epicycle multiple It is sampled with result.At this point, having obtained a successful match rate sample all in accordance with above-mentioned match test for each interpreter's genome This.
So, to three interpreter's genome a1、a2And a3, repeat more wheels and carry out above-mentioned multiple matching result sampling, then often One wheel can obtain a1、a2And a3Corresponding one group of successful match rate sample.Until duplicate discussion reaches third given threshold 5, then available a1、a2And a3Respectively corresponding 5 successful match rate sample.
The choosing method of interpreter's gene provided in an embodiment of the present invention carries out each interpreter's genome using given Matching Model Multiple successful match rate calculate, and accordingly choose the higher interpreter's genome of successful match rate, calculated result reliability can be made It is higher.
Wherein, according to the above embodiments optionally, it is based on the corresponding mean value of interpreter's genome and standard deviation, and most Big mean value and maximum standard deviation, the step of calculating interpreter's genome corresponding Z value, further comprise:
It using following calculation formula, calculates in all interpreter's genomes, each in addition to maximum interpreter's genome is translated The corresponding Z value of member's genome:
In formula, ZiIndicate the corresponding Z value of i-th of interpreter's genome, n indicates the corresponding successful match of each interpreter's genome The number of rate sample, EiIndicate the corresponding mean value of i-th of interpreter's genome, SiIndicate the corresponding standard of i-th of interpreter's genome Difference, EmaxIndicate Largest Mean, SmaxIndicate maximum standard deviation.
It is to be understood that being translated in conjunction with each is calculated in addition to maximum interpreter's genome according to the above embodiments The standard deviation and mean value of the corresponding successful match rate of member's genome, and the corresponding Largest Mean E of maximum interpreter's genomemaxWith Maximum standard deviation Smax, using Z value calculation formula given herein above, can correspond to calculate select except maximum interpreter's gene Each interpreter's genome Z value except group.
The choosing method of interpreter's gene provided in an embodiment of the present invention is distinguished using each the interpreter's genome selected Corresponding mean value and standard deviation calculate the Z value of each interpreter's genome in combination with Largest Mean and maximum standard deviation, can The successful match rate situation of each interpreter's genome is more accurately characterized, is come and original text so as to more accurately choose interpreter's gene Part gene is matched, and matching effect is improved.
In addition, meeting translating for setting condition choosing from all interpreter's genomes on the basis of the various embodiments described above After the step of member's genome, the method for the embodiment of the present invention can also include following processing step: if all except maximum interpreter In interpreter's genome except genome, the Z value of none interpreter's genome can satisfy the setting condition of above-described embodiment, Step S101 is then returned to, the different interpreter's genome of multiple groups is chosen again from alternative interpreter's list of genes, re-starts above-mentioned The calculating and selection process of embodiment.
For example, the case where successful match rate sample for sampling meets normal distribution, to obtain 95% confidence level, I.e. preset setting condition is that the confidence level of interpreter's genome meets 95%, then the Z value calculated for interpreter's genome is answered No more than 1.96.And in practical application, it, may be due to being when choosing multiple groups interpreter genome from alternative interpreter's list of genes It the reasons such as randomly selects, causes when calculating Z value to the interpreter's genome selected, Z value is not able to satisfy above-mentioned standard, then needs Again other interpreter's genome is selected in alternative interpreter's list of genes, and is recalculated and chosen.
The choosing method of interpreter's gene provided in an embodiment of the present invention, by the judgement to calculated result and to selecting step Circulating repetition execute, can guarantee that the high quality gene met the requirements can be selected, it is to be translated for more accurately matching Contribution is of great significance.
As the other side of the embodiment of the present invention, the embodiment of the present invention provides a kind of interpreter according to the above embodiments The selecting device of gene, the device are used to realize the selection to final interpreter's gene in the above embodiments.Therefore, above-mentioned Description and definition in the choosing method of interpreter's gene of each embodiment can be used for each execution module in the embodiment of the present invention Understanding, specifically refer to above-described embodiment, do not repeating herein.
One embodiment of present aspect embodiment according to the present invention, the structure of the selecting device of interpreter's gene as shown in figure 3, For the structural schematic diagram of the selecting device of interpreter's gene provided in an embodiment of the present invention, which can be used for above-mentioned each method The selection of interpreter's gene in embodiment, the device include: that initial gene chooses module 301, the first computing module 302, maximum base Because a group selection module 303, the second computing module 304 and final gene choose module 305.
Wherein, initial gene is chosen module 301 and is used for from alternative interpreter's list of genes, chooses the different base of multiple groups respectively Cause constitutes multiple interpreter's genomes;First computing module 302 is used to carry out repeatedly matching knot for each interpreter's genome Fruit sampling obtains multiple successful match rate samples, and is based on multiple successful match rate samples, and it is corresponding to calculate interpreter's genome The mean value and standard deviation of successful match rate;It is corresponding that maximum genome chooses the maximum that module 303 is used to choose in all mean values Interpreter's genome, be defined as maximum interpreter's genome, and the mean value of maximum interpreter's genome is defined as Largest Mean, will most The standard deviation of big interpreter's genome is defined as maximum standard deviation;Second computing module 304 is used for in all interpreter's genomes Each interpreter's genome in addition to maximum interpreter's genome is based on the corresponding mean value of interpreter's genome and standard deviation, with And Largest Mean and maximum standard deviation, calculate the corresponding Z value of interpreter's genome;Final gene chooses module 305 for being based on The corresponding Z value of each interpreter's genome in all interpreter's genomes in addition to maximum interpreter's genome, from all interpreter's bases Because choosing the interpreter's genome for meeting and imposing a condition in group, and gene in interpreter's genome that the satisfaction is imposed a condition and most Gene in big interpreter's genome merges, and obtains the interpreter's gene finally chosen;Wherein, the Z value indicates large sample otherness Z value in verifying.
Specifically, initial gene chooses module 301 can select respectively according to the alternative interpreter's list of genes pre-established Multiple groups interpreter's gene is taken, and a genome is constituted with each group of interpreter's gene respectively, as interpreter's genome, interpreter's gene Group is the interpreter's genome selected.For example, initial gene chooses module 301 can when carrying out the selection of each group interpreter gene With from multiple interpreter's genes in alternative interpreter's list of genes in random selection table, and the interpreter's gene randomly selected using these Constitute a genome, as interpreter's genome.
Later, for the interpreter's genome selected for each group, it is thus necessary to determine that the matching effect of itself and contribution, thus Selection is more suitable for the matched interpreter's gene of gene.Meanwhile in order to without loss of generality, be counted for each group of interpreter's genome, first Calculating module 302 can repeatedly be matched by the way that interpreter's genome is inputted given Matching Model using given Matching Model As a result it samples, sampling can obtain a successful match rate sample every time.It is understood that each successful match rate sample This, the successful match rate score that an actually matching result samples.
In addition, the first computing module 302 is according to above-mentioned multiple matching for each interpreter selected genome As a result the multiple successful match rate samples for sampling acquisition, calculate the comprehensive matching success rate of interpreter's genome, that is, calculate separately The mean value and standard deviation of the corresponding successful match rate of interpreter's genome.
Later, maximum genome chooses module 303 and chooses value the maximum first from all mean values of above-mentioned calculating, and Interpreter's genome corresponding to the maximum is defined as maximum interpreter's genome.Then correspondingly, maximum genome chooses module The mean value of maximum interpreter's genome is also defined as Largest Mean by 303, with variable EmaxIt indicates, by maximum interpreter's genome Standard deviation be defined as maximum standard deviation, with variable SmaxIt indicates.
Later, the second computing module 304 is according to the above-mentioned all mean values and standard deviation being calculated, maximum gene at calculating The Z value of each genome except group.Specifically, for each of above-mentioned interpreter's genome interpreter's genome, the second meter The standard deviation and mean value for calculating successful match rate of the module 304 according to corresponding to it, in conjunction with the corresponding maximum of maximum interpreter's genome Mean value and maximum standard deviation calculate separately the corresponding Z value of interpreter's genome.
Finally, may determine that each corresponding interpreter's genome according to the Z value of each interpreter's genome on the basis of above-mentioned calculating Otherness performance when carrying out gene matching.Therefore, according to the corresponding Z value of each interpreter's genome, final gene chooses mould Block 305 can use preset setting condition, judge whether the corresponding interpreter's genome of the Z value meets the otherness of setting It is required that.If conditions are not met, then reject it from each interpreter's genome selected, what final residue was not removed all is translated Member's genome is satisfactory interpreter's genome, includes that Z value meets setting otherness requirement in these interpreter's genomes Interpreter's genome and maximum interpreter's genome.Finally, final gene selection module 305 will be in remaining all interpreter's genomes Gene takes out, and after removing the duplicate factor in these genes, forms one group of new gene, i.e., as the interpreter finally chosen Gene.
Further, on the basis of the above embodiments, the device of the embodiment of the present invention further includes alternative interpreter's gene column Table constructs module, is used for: extracting phase from all basic informations of interpreter, ability information, credit information and posterior infromation respectively The gene answered, and it is correspondingly formed basic information gene, ability information gene, credit information gene and the posterior infromation base of interpreter Cause;Based on the basic information gene, ability information gene, credit information gene and posterior infromation gene, constitute described alternative Interpreter's list of genes.
Wherein optional, alternative interpreter's list of genes building module is specifically used for: obtaining all basic informations, the energy of interpreter Force information, credit information and posterior infromation, and obtained from basic information, ability information, credit information and posterior infromation respectively Interpreter's feature;Based on interpreter's feature, interpreter's direct gene of interpreter is extracted.
Wherein optional, the second computing module is specifically used for: utilizing following formula, calculates in all interpreter's genomes, remove The corresponding Z value of each interpreter's genome except maximum interpreter's genome:
In formula, ZiIndicate the corresponding Z value of i-th of interpreter's genome, n indicates the corresponding successful match of each interpreter's genome The number of rate sample, EiIndicate the corresponding mean value of i-th of interpreter's genome, SiIndicate the corresponding standard of i-th of interpreter's genome Difference, EmaxIndicate Largest Mean, SmaxIndicate maximum standard deviation.
Wherein optional, the first computing module is specifically used for: any multiple matching result of wheel being sampled, following place is executed Reason process: the initial value of the successful match rate of all interpreter's genomes is initially set;From all interpreter's genomes with Machine chooses interpreter's genome, carries out match test to interpreter's genome of selection, and based on to interpreter's genome sheet The successful match rate result and history match success rate of secondary match test are as a result, update the current successful match of interpreter's genome Rate value;It repeats and randomly selects to the step of update, until reaching the to the number of the match test of any interpreter's genome One given threshold stops the match test to interpreter's genome, and records the current successful match rate value of interpreter's genome; To stop match test interpreter's genome other than interpreter's genome, repeat randomly select to record the step of, until Second given threshold is reached to the total degree of the match test of all interpreter's genomes, then it is current to record each interpreter's genome Successful match rate value, and terminate the multiple matching result sampling of epicycle, into the multiple matching result sampling of next round, until executing more Total wheel number of secondary matching result sampling reaches third given threshold, and the quantity for obtaining each interpreter's genome is third given threshold Successful match rate sample.
Wherein optional, the first computing module is specifically used for, for each interpreter's genome, the successful match rate of extraction The number of sample is no less than given threshold.
It is understood that can be by hardware processor (hardware processor) come real in the embodiment of the present invention Each relative program module in the device of existing the various embodiments described above.Also, the selecting device of each interpreter's gene of the embodiment of the present invention When for the selection of interpreter's gene in above-mentioned each method embodiment, the beneficial effect of generation and corresponding above-mentioned each method are real It is identical to apply example, above-mentioned each method embodiment can be referred to, details are not described herein again.
As the another aspect of the embodiment of the present invention, the present embodiment provides a kind of electronics according to the above embodiments and sets It is standby, it is the entity structure schematic diagram of electronic equipment provided in an embodiment of the present invention, comprising: at least one processor with reference to Fig. 4 401, at least one processor 402, communication interface 403 and bus 404.
Wherein, memory 401, processor 402 and communication interface 403 complete mutual communication by bus 404, communicate Interface 403 is for the information transmission between the electronic equipment and interpreter's information equipment;Being stored in memory 401 can be in processor The computer program run on 402 when processor 402 executes the computer program, realizes translating as described in the various embodiments described above The choosing method of member's gene.
It is to be understood that including at least memory 401, processor 402, communication interface 403 and bus in the electronic equipment 404, and memory 401, processor 402 and communication interface 403 form mutual communication connection by bus 404, and can be complete The program instruction of the choosing method of interpreter's gene is read from memory 401 at mutual communication, such as processor 402.Separately Outside, communication interface 403 can also realize the communication connection between the electronic equipment and interpreter's information equipment, and achievable mutual Information transmission, such as the selection to interpreter's gene is realized by communication interface 403.
When electronic equipment is run, processor 402 calls the program instruction in memory 401, real to execute above-mentioned each method Apply method provided by example, for example, from alternative interpreter's list of genes, choose the different gene of multiple groups respectively, constitute more A interpreter's genome;For each interpreter's genome, multiple matching result sampling is carried out, obtains multiple successful match rate samples This, and multiple successful match rate samples are based on, calculate the mean value and standard deviation of the corresponding successful match rate of interpreter's genome;Choosing The corresponding interpreter's genome of the maximum in all mean values is taken, is defined as maximum interpreter's genome, and by maximum interpreter's genome Mean value be defined as Largest Mean, the standard deviation of maximum interpreter's genome is defined as maximum standard deviation;For all interpreter's bases Because of each interpreter's genome in group in addition to maximum interpreter's genome, it is based on the corresponding mean value of interpreter's genome and standard Difference and Largest Mean and maximum standard deviation calculate the corresponding Z value of interpreter's genome;Based on being removed in all interpreter's genomes The corresponding Z value of each interpreter's genome except maximum interpreter's genome is chosen from all interpreter's genomes and meets setting Interpreter's genome of condition, and the base in the gene in interpreter's genome that the satisfaction is imposed a condition and maximum interpreter's genome Because merging, the interpreter's gene finally chosen is obtained;Wherein, the Z value indicates Z value etc. in the verifying of large sample otherness.
Program instruction in above-mentioned memory 401 can be realized and as independent by way of SFU software functional unit Product when selling or using, can store in a computer readable storage medium.Alternatively, realizing that above-mentioned each method is implemented This can be accomplished by hardware associated with program instructions for all or part of the steps of example, and program above-mentioned can store to be calculated in one In machine read/write memory medium, when being executed, execution includes the steps that above-mentioned each method embodiment to the program;And storage above-mentioned Medium includes: USB flash disk, mobile hard disk, read-only memory (Read-Only Memory, ROM), random access memory (Random Access Memory, RAM), the various media that can store program code such as magnetic or disk.
The embodiment of the present invention provides a kind of non-transient computer readable storage medium also according to the various embodiments described above, this is non-temporarily State computer-readable recording medium storage computer instruction, the computer instruction execute computer as described in the various embodiments described above Interpreter's gene choosing method, for example, from alternative interpreter's list of genes, choose the different gene of multiple groups, structure respectively At multiple interpreter's genomes;For each interpreter's genome, multiple matching result sampling is carried out, obtains multiple successful match rates Sample, and multiple successful match rate samples are based on, calculate the mean value and standard deviation of the corresponding successful match rate of interpreter's genome; The corresponding interpreter's genome of the maximum in all mean values is chosen, is defined as maximum interpreter's genome, and by maximum interpreter's gene The mean value of group is defined as Largest Mean, and the standard deviation of maximum interpreter's genome is defined as maximum standard deviation;For all interpreters Each interpreter's genome in genome in addition to maximum interpreter's genome is based on the corresponding mean value of interpreter's genome and mark Quasi- poor and Largest Mean and maximum standard deviation, calculate the corresponding Z value of interpreter's genome;Based in all interpreter's genomes The corresponding Z value of each interpreter's genome in addition to maximum interpreter's genome is chosen satisfaction from all interpreter's genomes and is set Interpreter's genome of fixed condition, and in the gene in interpreter's genome that the satisfaction is imposed a condition and maximum interpreter's genome Gene merges, and obtains the interpreter's gene finally chosen;Wherein, the Z value indicates Z value etc. in the verifying of large sample otherness.
Electronic equipment provided in an embodiment of the present invention and non-transient computer readable storage medium, by executing above-mentioned each reality The choosing method of interpreter's gene described in example is applied, chooses multiple groups interpreter genome from interpreter's gene pool of all interpreters in advance, And by calculate these interpreter's genomes corresponding to Z value, come choose Z value meet impose a condition interpreter's genome, using as Final chooses as a result, the interpreter's gene selected is enabled preferably to embody the otherness between interpreter.In addition, in gene With in application, the interpreter chosen accordingly can be made more reasonably to be matched with contribution to be translated, to effectively improve translation effect Rate and translation accuracy rate.
It is understood that the embodiment of device described above, electronic equipment and storage medium is only schematic , wherein unit may or may not be physically separated as illustrated by the separation member, it can both be located at one Place, or may be distributed on heterogeneous networks unit.Some or all of modules can be selected according to actual needs To achieve the purpose of the solution of this embodiment.Those of ordinary skill in the art are without paying creative labor To understand and implement.
By the description of embodiment of above, those skilled in the art is it will be clearly understood that each embodiment can borrow Help software that the mode of required general hardware platform is added to realize, naturally it is also possible to pass through hardware.Based on this understanding, above-mentioned Substantially the part that contributes to existing technology can be embodied in the form of software products technical solution in other words, the meter Calculation machine software product may be stored in a computer readable storage medium, such as USB flash disk, mobile hard disk, ROM, RAM, magnetic disk or light Disk etc., including some instructions, with so that a computer equipment (such as personal computer, server or network equipment etc.) Execute method described in certain parts of above-mentioned each method embodiment or embodiment of the method.
In addition, those skilled in the art are it should be understood that in the application documents of the embodiment of the present invention, term "include", "comprise" or any other variant thereof is intended to cover non-exclusive inclusion, so that including a series of elements Process, method, article or equipment not only include those elements, but also including other elements that are not explicitly listed, or Person is to further include for elements inherent to such a process, method, article, or device.In the absence of more restrictions, by The element that sentence "including a ..." limits, it is not excluded that in the process, method, article or apparatus that includes the element There is also other identical elements.
In the specification of the embodiment of the present invention, numerous specific details are set forth.It should be understood, however, that the present invention is implemented The embodiment of example can be practiced without these specific details.In some instances, it is not been shown in detail well known Methods, structures and technologies, so as not to obscure the understanding of this specification.Similarly, it should be understood that in order to simplify implementation of the present invention Example is open and helps to understand one or more of the various inventive aspects, above to the exemplary embodiment of the embodiment of the present invention Description in, each feature of the embodiment of the present invention is grouped together into single embodiment, figure or descriptions thereof sometimes In.
However, the disclosed method should not be interpreted as reflecting the following intention: i.e. the claimed invention is implemented Example requires features more more than feature expressly recited in each claim.More precisely, such as claims institute As reflection, inventive aspect is all features less than single embodiment disclosed above.Therefore, it then follows specific embodiment party Thus claims of formula are expressly incorporated in the specific embodiment, wherein each claim itself is real as the present invention Apply the separate embodiments of example.
Finally, it should be noted that above embodiments are only to illustrate the technical solution of the embodiment of the present invention, rather than it is limited System;Although the embodiment of the present invention is described in detail with reference to the foregoing embodiments, those skilled in the art it is understood that It is still possible to modify the technical solutions described in the foregoing embodiments, or part of technical characteristic is carried out etc. With replacement;And these are modified or replaceed, each embodiment skill of the embodiment of the present invention that it does not separate the essence of the corresponding technical solution The spirit and scope of art scheme.

Claims (8)

1. a kind of choosing method of interpreter's gene characterized by comprising
From alternative interpreter's list of genes, the different gene of multiple groups is chosen respectively, constitutes multiple interpreter's genomes;
For interpreter's genome described in each, multiple matching result sampling is carried out, obtains multiple successful match rate samples, and base In the multiple successful match rate sample, the mean value and standard deviation of the corresponding successful match rate of interpreter's genome are calculated;
The corresponding interpreter's genome of the maximum in all mean values is chosen, is defined as maximum interpreter's genome, and will be described The mean value of maximum interpreter's genome is defined as Largest Mean, and the standard deviation of maximum interpreter's genome is defined as Maximum standard deviation;
For each described interpreter's genome in all interpreter's genomes in addition to maximum interpreter's genome, base In the corresponding mean value of interpreter's genome and the standard deviation and the Largest Mean and the maximum standard deviation, meter Calculate the corresponding Z value of interpreter's genome;
Based on each described interpreter's genome pair in all interpreter's genomes in addition to maximum interpreter's genome The Z value answered chooses the interpreter's genome for meeting and imposing a condition from all interpreter's genomes, and the satisfaction is set The gene in gene and maximum interpreter's genome in interpreter's genome of fixed condition merges, and obtains the interpreter finally chosen Gene;
Wherein, the Z value indicates Z value in the verifying of large sample otherness.
2. the method according to claim 1, wherein being chosen respectively from alternative interpreter's list of genes described Before the step of multiple groups different gene, further includes:
Corresponding gene is extracted from all basic informations of interpreter, ability information, credit information and posterior infromation respectively, and right Basic information gene, ability information gene, credit information gene and the posterior infromation gene of interpreter should be formed;
Based on the basic information gene, ability information gene, credit information gene and posterior infromation gene, constitute described alternative Interpreter's list of genes.
3. the method according to claim 1, wherein it is described based on the corresponding mean value of interpreter's genome and The standard deviation and the Largest Mean and the maximum standard deviation, the step of calculating interpreter's genome corresponding Z value into One step includes:
Using following calculation formula, calculate in all interpreter's genomes, it is each in addition to maximum interpreter's genome The corresponding Z value of a interpreter's genome:
In formula, ZiIndicate the corresponding Z value of i-th of interpreter's genome, n indicates the corresponding matching of each interpreter's genome The number of success rate sample, EiIndicate the corresponding mean value of i-th of interpreter's genome, SiIndicate that i-th of interpreter's genome is corresponding The standard deviation, EmaxIndicate the Largest Mean, SmaxIndicate the maximum standard deviation.
4. obtaining multiple the method according to claim 1, wherein described carry out multiple matching result sampling The step of being made into power sample further comprises:
The multiple matching result sampling described for any wheel, executes following process flow:
The initial value of the successful match rate of all interpreter's genomes is initially set;
Interpreter's genome is randomly selected from all interpreter's genomes, and interpreter's genome of selection is carried out Match test, and based on the successful match rate result and history match success rate knot to this match test of interpreter's genome Fruit updates the current successful match rate value of interpreter's genome;
Repeat it is described randomly select to the step of the update, until to the match test of any interpreter's genome Number reaches the first given threshold, stops the match test to interpreter's genome, and records current of interpreter's genome With success ratio values;
To interpreter's genome other than the interpreter's genome for stopping match test, described randomly select to the record is repeated The step of, until reaching the second given threshold to the total degree of the match test of all interpreter's genomes, then record is each The current successful match rate value of interpreter's genome, and terminate multiple matching result sampling described in epicycle, into next round institute Multiple matching result sampling is stated, until the total wheel number for executing the multiple matching result sampling reaches third given threshold, is obtained The quantity of each interpreter's genome is the successful match rate sample of third given threshold.
5. the method according to claim 1, wherein for interpreter's genome described in each, extraction it is described The number of successful match rate sample is no less than given threshold.
6. a kind of selecting device of interpreter's gene characterized by comprising
Initial gene chooses module, for choosing the different gene of multiple groups respectively, constituting multiple from alternative interpreter's list of genes Interpreter's genome;
First computing module obtains multiple for carrying out multiple matching result sampling for interpreter's genome described in each It is made into power sample, and is based on the multiple successful match rate sample, calculates the corresponding successful match rate of interpreter's genome Mean value and standard deviation;
Maximum genome is chosen module and is defined as choosing the corresponding interpreter's genome of the maximum in all mean values Maximum interpreter's genome, and the mean value of maximum interpreter's genome is defined as Largest Mean, by the maximum interpreter The standard deviation of genome is defined as maximum standard deviation;
Second computing module, for for each in all interpreter's genomes in addition to maximum interpreter's genome Interpreter's genome, based on the corresponding mean value of interpreter's genome and the standard deviation and the Largest Mean and The maximum standard deviation calculates the corresponding Z value of interpreter's genome;
Final gene chooses module, for based on every in addition to maximum interpreter's genome in all interpreter's genomes The corresponding Z value of one interpreter's genome, chooses the interpreter for meeting and imposing a condition from all interpreter's genomes Genome, and the gene in the gene and maximum interpreter's genome in the interpreter's genome for meeting and imposing a condition is closed And obtain the interpreter's gene finally chosen;
Wherein, the Z value indicates Z value in the verifying of large sample otherness.
7. a kind of electronic equipment characterized by comprising at least one processor, at least one processor, communication interface and total Line;
The memory, the processor and the communication interface complete mutual communication, the communication by the bus Interface is for the information transmission between the electronic equipment and interpreter's information equipment;
The computer program that can be run on the processor is stored in the memory, the processor executes the calculating When machine program, the method as described in any in claim 1 to 5 is realized.
8. a kind of non-transient computer readable storage medium, which is characterized in that the non-transient computer readable storage medium is deposited Computer instruction is stored up, the computer instruction makes the computer execute the method as described in any in claim 1 to 5.
CN201811096578.6A 2018-09-19 2018-09-19 Translator gene selection method and device and electronic equipment Active CN109448792B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201811096578.6A CN109448792B (en) 2018-09-19 2018-09-19 Translator gene selection method and device and electronic equipment

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201811096578.6A CN109448792B (en) 2018-09-19 2018-09-19 Translator gene selection method and device and electronic equipment

Publications (2)

Publication Number Publication Date
CN109448792A true CN109448792A (en) 2019-03-08
CN109448792B CN109448792B (en) 2021-11-05

Family

ID=65532883

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201811096578.6A Active CN109448792B (en) 2018-09-19 2018-09-19 Translator gene selection method and device and electronic equipment

Country Status (1)

Country Link
CN (1) CN109448792B (en)

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101853398A (en) * 2010-05-11 2010-10-06 浙江大学 Chinese paper cutting identification method based on space constraint characteristic selection and combination thereof
CN101937440A (en) * 2009-06-30 2011-01-05 华为技术有限公司 Feature selection method and device
CN102080129A (en) * 2010-12-01 2011-06-01 杭州师范大学 Multiple method-based differential gene screening control method for gene chip
CN103064970A (en) * 2012-12-31 2013-04-24 武汉传神信息技术有限公司 Search method for optimizing translators
CN106326458A (en) * 2016-06-02 2017-01-11 广西智度信息科技有限公司 Method for classifying city management cases based on text classification
CN107463799A (en) * 2017-08-23 2017-12-12 福建师范大学福清分校 Interaction fusion feature represents the DBP recognition methods with selective ensemble
CN107808100A (en) * 2017-10-25 2018-03-16 中国科学技术大学 For the steganalysis method of fc-specific test FC sample
CN107885730A (en) * 2017-09-25 2018-04-06 沈阳航空航天大学 Translation knowledge method for distinguishing validity under more interpreter's patterns

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101937440A (en) * 2009-06-30 2011-01-05 华为技术有限公司 Feature selection method and device
CN101853398A (en) * 2010-05-11 2010-10-06 浙江大学 Chinese paper cutting identification method based on space constraint characteristic selection and combination thereof
CN102080129A (en) * 2010-12-01 2011-06-01 杭州师范大学 Multiple method-based differential gene screening control method for gene chip
CN103064970A (en) * 2012-12-31 2013-04-24 武汉传神信息技术有限公司 Search method for optimizing translators
CN106326458A (en) * 2016-06-02 2017-01-11 广西智度信息科技有限公司 Method for classifying city management cases based on text classification
CN107463799A (en) * 2017-08-23 2017-12-12 福建师范大学福清分校 Interaction fusion feature represents the DBP recognition methods with selective ensemble
CN107885730A (en) * 2017-09-25 2018-04-06 沈阳航空航天大学 Translation knowledge method for distinguishing validity under more interpreter's patterns
CN107808100A (en) * 2017-10-25 2018-03-16 中国科学技术大学 For the steganalysis method of fc-specific test FC sample

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
K.Z.MAO: ""Identifying Critical Variables of Principal Components for Unsupervised Feature Selection"", 《IEEE TRANSACTIONS ON SYSTEMS,MAN,AND CYBERNETICS-PART B:CYBERNETICS》 *
亓呈明等: "《机器学习智能计算与高光谱遥感影像分类应用研究》", 31 May 2018 *
程涛: ""基于机器学习的蛋白质相互作用预测结果与样本重复性关系的研究"", 《中国优秀硕士学位论文全文数据库 基础科学及》 *
陈飞飞: ""基于特征表示的行为识别方法研究"", 《中国博士学位论文全文数据库 信息科技辑》 *

Also Published As

Publication number Publication date
CN109448792B (en) 2021-11-05

Similar Documents

Publication Publication Date Title
CN103577989B (en) A kind of information classification approach and information classifying system based on product identification
CN108509408A (en) A kind of sentence similarity judgment method
CN112667794A (en) Intelligent question-answer matching method and system based on twin network BERT model
CN108509411A (en) Semantic analysis and device
CN110427461A (en) Intelligent answer information processing method, electronic equipment and computer readable storage medium
CN111309887B (en) Method and system for training text key content extraction model
CN105868179B (en) A kind of intelligent answer method and device
CN108804677A (en) In conjunction with the deep learning question classification method and system of multi-layer attention mechanism
CN106296195A (en) A kind of Risk Identification Method and device
CN110008327A (en) Law answers generation method and device
CN107273406A (en) Dialog process method and device in task dialogue system
CN108304364A (en) keyword extracting method and device
CN108804526A (en) Interest determines that system, interest determine method and storage medium
CN109857846A (en) The matching process and device of user's question sentence and knowledge point
CN111694937A (en) Interviewing method and device based on artificial intelligence, computer equipment and storage medium
CN113761218A (en) Entity linking method, device, equipment and storage medium
CN110390107A (en) Hereafter relationship detection method, device and computer equipment based on artificial intelligence
CN115687925A (en) Fault type identification method and device for unbalanced sample
CN114490998B (en) Text information extraction method and device, electronic equipment and storage medium
CN109918681A (en) It is a kind of based on Chinese character-phonetic fusion problem semantic matching method
CN109189892A (en) A kind of recommended method and device based on article review
CN110688478A (en) Answer sorting method, device and storage medium
TW201820172A (en) System, method and non-transitory computer readable storage medium for conversation analysis
CN115130538A (en) Training method of text classification model, text processing method, equipment and medium
CN115048505A (en) Corpus screening method and device, electronic equipment and computer readable medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant