CN109299737A - Choosing method, device and the electronic equipment of interpreter's gene - Google Patents

Choosing method, device and the electronic equipment of interpreter's gene Download PDF

Info

Publication number
CN109299737A
CN109299737A CN201811095799.1A CN201811095799A CN109299737A CN 109299737 A CN109299737 A CN 109299737A CN 201811095799 A CN201811095799 A CN 201811095799A CN 109299737 A CN109299737 A CN 109299737A
Authority
CN
China
Prior art keywords
interpreter
genome
gene
value
successful match
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201811095799.1A
Other languages
Chinese (zh)
Other versions
CN109299737B (en
Inventor
张芃
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Language Network (wuhan) Information Technology Co Ltd
Original Assignee
Language Network (wuhan) Information Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Language Network (wuhan) Information Technology Co Ltd filed Critical Language Network (wuhan) Information Technology Co Ltd
Priority to CN201811095799.1A priority Critical patent/CN109299737B/en
Priority to PCT/CN2018/124951 priority patent/WO2020057003A1/en
Priority to PCT/CN2018/124891 priority patent/WO2020057001A1/en
Publication of CN109299737A publication Critical patent/CN109299737A/en
Application granted granted Critical
Publication of CN109299737B publication Critical patent/CN109299737B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/211Selection of the most significant subset of features
    • G06F18/2113Selection of the most significant subset of features by ranking or filtering the set of features, e.g. using a measure of variance or of feature cross-correlation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/22Matching criteria, e.g. proximity measures
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/06Resources, workflows, human or project management; Enterprise or organisation planning; Enterprise or organisation modelling
    • G06Q10/063Operations research, analysis or management
    • G06Q10/0631Resource planning, allocation, distributing or scheduling for enterprises or organisations
    • G06Q10/06311Scheduling, planning or task assignment for a person or group

Abstract

The embodiment of the present invention provides choosing method, device and the electronic equipment of a kind of interpreter's gene, this method comprises: choosing the different gene of multiple groups respectively from alternative interpreter's list of genes, constituting multiple interpreter's genomes;For each interpreter's genome, multiple matching result sampling is carried out, obtains multiple successful match rate samples, and calculates the mean value and standard deviation of the corresponding successful match rate of interpreter's genome accordingly;Based on the corresponding mean value of all interpreter's genomes standard deviation corresponding with each interpreter's genome, the corresponding Z value of interpreter's genome is calculated;Based on the corresponding Z value of each interpreter's genome, the interpreter's genome for meeting and imposing a condition is chosen, and is merged the gene in the interpreter's genome to impose a condition is met, the interpreter's gene finally chosen is obtained.The embodiment of the present invention can choose more effective interpreter's assortment of genes and match with contribution to be translated, to effectively improve translation efficiency and translation accuracy rate.

Description

Choosing method, device and the electronic equipment of interpreter's gene
Technical field
The present embodiments relate to technical field of data processing, more particularly, to a kind of interpreter's gene choosing method, Device and electronic equipment.
Background technique
Information age and networking have changed a lot translation mode.Platform is managed using translation flow, Talent's data is stored according to different objects, to match most suitable interpreter according to contribution to be translated.Different interpreters, institute The key message for including is not quite similar, then according to these key messages, can match most suitable translation contribution for interpreter, thus Effectively improve translation efficiency and translation accuracy.
Interpreter matches with the gene of contribution to be referred to contribution gene and interpreter's gene under set strategy through Matching Model, It is embodied as the process that contribution finds best interpreter.Selected is used to carry out the matched interpreter's gene of gene and other interpreter's genes It compares, it should can preferably embody the otherness of interpreter, could be so that contribution to be translated is matched to the interpreter being more suitable for.
Interpreter's gene is referred mainly to by carrying out analytical calculation, quantification treatment, accessed presence to interpreter's characteristic attribute It is combined in key message specific interpreter, that be different from other interpreters, unique.There are many sources of interpreter's gene, In the social epoch, all data of the every act and every move of interpreter can extract gene.
Interpreter's gene is present in all interpreters of management platform, and different interpreters have different interpreter's genes.Due to tool Body application difference, presently, there are interpreter/manuscript gene matching algorithm selection interpreter gene to be matched carry out matching meter When calculation, the corresponding assortment of genes is often rule of thumb selected.
But in interpreter's course of work, gene can with the promotion of ability, the increase of time, accumulation of knowledge and send out Raw corresponding variation.I.e. with the processing of task, examine and revise evaluation with QC, accumulation, the community activity of history corpus participation with And the activities such as test of interpreter's ability, interpreter's gene will be constantly updated.Therefore, above-mentioned interpreter's gene selects mode empirically It can have some limitations, cause the interpreter's gene selected that cannot embody the otherness between interpreter well.
Summary of the invention
In order to overcome the above problem or at least be partially solved the above problem, the embodiment of the present invention provides a kind of interpreter's base Choosing method, device and the electronic equipment of cause, with so that the interpreter's gene selected can preferably embody the difference between interpreter It is anisotropic.
In a first aspect, the embodiment of the present invention provides a kind of choosing method of interpreter's gene, comprising: arranged from alternative interpreter's gene In table, the different gene of multiple groups is chosen respectively, constitutes multiple interpreter's genomes;For interpreter's genome described in each, carry out Multiple matching result sampling, obtains multiple successful match rate samples, and be based on the multiple successful match rate sample, calculates this and translate The mean value and standard deviation of the corresponding successful match rate of member's genome;It is corresponding described equal based on all interpreter's genomes It is worth the standard deviation corresponding with interpreter's genome described in each, calculates the corresponding Z value of interpreter's genome;Based on each The corresponding Z value of interpreter's genome, chooses the interpreter's gene for meeting and imposing a condition from all interpreter's genomes Group, and the gene in the interpreter's genome for meeting and imposing a condition is merged, obtain the interpreter's gene finally chosen;Wherein, The Z value indicates Z value in the verifying of large sample otherness.
Second aspect, the embodiment of the present invention provide a kind of selecting device of interpreter's gene, comprising: initial gene chooses mould Block, for choosing the different gene of multiple groups respectively, constituting multiple interpreter's genomes from alternative interpreter's list of genes;First meter Module is calculated, for multiple matching result sampling being carried out, obtaining multiple successful match rate samples for interpreter's genome described in each This, and it is based on the multiple successful match rate sample, calculate the mean value and standard of the corresponding successful match rate of interpreter's genome Difference;Second computing module, for based on all corresponding mean values of interpreter's genome and each described interpreter The corresponding standard deviation of genome calculates the corresponding Z value of interpreter's genome;Final gene chooses module, for based on every The corresponding Z value of one interpreter's genome, chooses the interpreter for meeting and imposing a condition from all interpreter's genomes Genome, and the gene in the interpreter's genome for meeting and imposing a condition is merged, obtain the interpreter's gene finally chosen;Its In, the Z value indicates Z value in the verifying of large sample otherness.
The third aspect, the embodiment of the present invention provide a kind of electronic equipment, comprising: at least one processor, at least one Manage device, communication interface and bus;The memory, the processor and the communication interface are completed mutual by the bus Communication, the communication interface between the electronic equipment and interpreter's information equipment information transmission;In the memory It is stored with the computer program that can be run on the processor, when the processor executes the computer program, is realized such as The choosing method of interpreter's gene described in upper first aspect.
Fourth aspect, the embodiment of the present invention provide a kind of non-transient computer readable storage medium, the non-transient calculating Machine readable storage medium storing program for executing stores computer instruction, and the computer instruction executes the computer described in first aspect as above The choosing method of interpreter's gene.
Choosing method, device and the electronic equipment of interpreter's gene provided in an embodiment of the present invention, by being translated in advance from all Multiple groups interpreter genome is chosen in interpreter's gene pool of member, and by calculating Z value corresponding to these interpreter's genomes, to choose Z value meets the interpreter's genome to impose a condition, to choose as final as a result, enabling the interpreter's gene selected more preferable Embody interpreter between otherness.
Detailed description of the invention
In order to more clearly explain the embodiment of the invention or the technical proposal in the existing technology, to embodiment or will show below There is attached drawing needed in technical description to be briefly described, it should be apparent that, the accompanying drawings in the following description is this hair Bright some embodiments for those of ordinary skill in the art without creative efforts, can be with root Other attached drawings are obtained according to these attached drawings.
Fig. 1 is the flow diagram of the choosing method of interpreter's gene provided in an embodiment of the present invention;
Fig. 2 is the pass according to interpreter's feature and interpreter's gene in the choosing method of interpreter's gene provided in an embodiment of the present invention It is schematic diagram;
Fig. 3 is according to the flow diagram for calculating Z value in the choosing method of interpreter's gene provided in an embodiment of the present invention;
Fig. 4 is the structural schematic diagram of the selecting device of interpreter's gene provided in an embodiment of the present invention;
Fig. 5 is the entity structure schematic diagram of electronic equipment provided in an embodiment of the present invention.
Specific embodiment
In order to make the object, technical scheme and advantages of the embodiment of the invention clearer, below in conjunction with the embodiment of the present invention In attached drawing, technical scheme in the embodiment of the invention is clearly and completely described, it is clear that described embodiment is A part of the embodiment of the embodiment of the present invention, instead of all the embodiments.Based on the embodiment in the embodiment of the present invention, ability Domain those of ordinary skill every other embodiment obtained without making creative work, belongs to the present invention The range of embodiment protection.
There are many sources of interpreter's gene, and in the social epoch, all data of the every act and every move of interpreter can be extracted Gene out.Due to the difference of concrete application, presently, there are interpreter/manuscript gene matching algorithm in the to be matched of selection interpreter When gene carries out matching primitives, the corresponding assortment of genes is often rule of thumb selected.But conventional method has certain limitation Property, cause the interpreter's gene selected that cannot embody the otherness of interpreter well.
In view of the above-mentioned problems, the embodiment of the present invention from interpreter's gene pool of interpreter by choosing multiple groups interpreter gene in advance Group, and by calculating Z value corresponding to these interpreter's genomes, meet the interpreter's genome to impose a condition to choose Z value, to make It chooses for final as a result, the interpreter's gene selected is enabled preferably to embody the otherness between interpreter.Wherein, Z value table Show Z value in the verifying of large sample otherness.
As the one aspect of the embodiment of the present invention, the present embodiment provides a kind of choosing methods of interpreter's gene, with reference to figure 1, it is the flow diagram of the choosing method of interpreter's gene provided in an embodiment of the present invention, comprising:
S101 chooses the different gene of multiple groups respectively, constitutes multiple interpreter's genomes from alternative interpreter's list of genes.
It is to be understood that before the interpreter's gene for carrying out the present embodiment is chosen, it in advance can be according to all properties of interpreter Information establishes alternative interpreter's list of genes, may include in alternative interpreter's list of genes relevant to interpreter's particular community All genes.Specifically, alternative interpreter's list of genes may be considered a gene pool, with gene for singly in the gene pool Position storage has the gene relevant to interpreter's information extracted from all interpreters, i.e. interpreter's gene.Interpreter's gene refers mainly to pass through Analytical calculation, quantification treatment carried out to interpreter's characteristic attribute, it is accessed be present in specific interpreter, be different from other interpreters , the combination of unique key message.
According to alternative interpreter's list of genes in this step, multiple groups interpreter gene is chosen respectively, and translate respectively with each group Member's gene constitutes a genome, and as interpreter's genome, which is the interpreter's genome selected.It is understood that , when carrying out the selection of each group interpreter gene, can be translated from multiple in random selection table in alternative interpreter's list of genes Member's gene, then the interpreter's gene randomly selected using these may be constructed a genome, as interpreter's genome.
It is of course also possible to which predefined decimation rule, e.g., while extracting or successively extracting, interlacing is extracted or specified line number It extracts, according to different interpreter's information extractions of gene characterization, quantity of extraction, etc..Carrying out actual extraction process later When, the extraction for each group of interpreter's gene extracts phase from alternative interpreter's list of genes according to the decimation rule predetermined The multiple genes answered.
For example, randomly selecting 3-5 different genes from alternative interpreter's list of genes, as one group of gene, one is constituted A interpreter's genome.It then adopts in a like fashion, can choose or successively choose simultaneously multiple groups gene respectively, constitute multiple Interpreter's genome, the embodiment of the present invention to this with no restriction.
S102 carries out multiple matching result sampling for each interpreter's genome, obtains multiple successful match rate samples This, and multiple successful match rate samples are based on, calculate the mean value and standard deviation of the corresponding successful match rate of interpreter's genome.
It is to be understood that for the interpreter's genome selected for each group, it is thus necessary to determine that the matching of itself and contribution is imitated Fruit, so that selection is more suitable for the matched interpreter's gene of gene.Meanwhile in order to without loss of generality, for each group of interpreter's genome, Interpreter's genome can be inputted to given Matching Model, carry out multiple matching result sampling using given Matching Model, every time Sampling can obtain a successful match rate sample.
It is understood that carrying out successful match rate sample using Matching Model for each group of interpreter's genome When acquisition, the gene in this group of interpreter's genome is input in Matching Model, which can be according to the original text itself provided Part gene calculates the successful match rate score of gene in interpreter's genome and contribution gene automatically and exports, then matches mould The successful match rate score of type output can be used as a successful match rate sample.For same interpreter's genome, carry out more Secondary above-mentioned matching result sampling process, then available multiple successful match rate samples.
Later, it for each interpreter selected genome, is obtained according to above-mentioned multiple matching result sampling Multiple successful match rate samples calculate the comprehensive matching success rate of interpreter's genome, that is, calculate separately interpreter's genome pair The mean value and standard deviation for the successful match rate answered.It is understood that each successful match rate sample, actually primary The successful match rate score sampled with result.
For example, it is assumed that carrying out matching result sampling according to some interpreter's genome, n successful match rate sample difference is obtained For p1,p2,...pn.The mean value of the corresponding successful match rate of interpreter's genome is then calculated according to it are as follows:
In formula, E (p) indicates the mean value of the corresponding successful match rate of interpreter's genome, piIndicate i-th of interpreter's genome Successful match rate sample, n indicate the total number of the successful match rate sample for interpreter's genome acquisition.
On this basis, the standard deviation for calculating the corresponding successful match rate of interpreter's genome is as follows:
In formula, S indicates the standard deviation of the corresponding successful match rate of interpreter's genome, and E (p) indicates that interpreter's genome is corresponding The mean value of successful match rate, piIndicate that i-th of successful match rate sample of interpreter's genome, n indicate to be directed to interpreter's genome The total number of the successful match rate sample of acquisition.
Wherein, in one embodiment, multiple matching result sampling is being carried out, is obtaining the step of multiple successful match rate samples Before rapid, further includes: according to the demand of the gene matching primitives precision with contribution to be translated, be set for matching result sampling Total degree threshold value, i.e. given threshold.Then accordingly in actual samples, the number of acquisition successful match rate sample is total not less than this Frequency threshold value.For example, for each interpreter's genome, it is desirable that the number of the successful match rate sample of extraction is no less than 50, then The data 50 are preset total degree threshold value.
S103 is based on the corresponding mean value of all interpreter's genomes standard deviation corresponding with each interpreter's genome, Calculate the corresponding Z value of interpreter's genome;Wherein, Z value indicates Z value in the verifying of large sample otherness.
It is to be understood that each corresponding successful match of interpreter's genome selected is calculated according to above-mentioned steps On the basis of the mean value and standard deviation of rate, for interpreter's genome that each is selected, its Z value is calculated.Specifically, for Each interpreter's genome, the standard deviation of the successful match rate according to corresponding to it and all interpreter's genomes respectively correspond Mean value with success rate calculates its corresponding Z value.
It is understood that the concept of Z value therein is the verifying of large sample otherness, the i.e. concept of Z value in Z verifying.Z Inspection is the method for being generally used for large sample (i.e. sample size is greater than 30) mean difference and examining.It is with standard normal point The theory of cloth come infer difference occur probability, so that whether the difference for comparing two average significant.When known standard deviation, Whether the mean value for verifying one group of number is equal with a certain desired value.Translating of selecting is measured in the embodiment of the present invention using Z verifying The matching difference verifying of member's genome, therefore the calculating of Z value is carried out to interpreter's genome that each is selected.
S104 is based on the corresponding Z value of each interpreter's genome, chooses from all interpreter's genomes and meets setting condition Interpreter's genome, and gene in interpreter's genome that the satisfaction is imposed a condition merges, and obtains the interpreter's base finally chosen Cause.
It is to be understood that the Z value of each interpreter's genome can be calculated according to above-mentioned steps, can be sentenced according to the Z value Break otherness performance of each corresponding interpreter's genome when carrying out gene matching.Therefore, according to the corresponding Z of each interpreter's genome Value, can use preset setting condition, the otherness for judging whether the corresponding interpreter's genome of the Z value meets setting is wanted It asks.If conditions are not met, then it is rejected from each interpreter's genome selected, all interpreters that final residue is not removed Genome is satisfactory interpreter's genome.Gene in remaining all interpreter's genomes is taken out, and is removing this After duplicate factor in a little genes, one group of new gene is formed, i.e., as the interpreter's gene finally chosen.
For example, it is assumed that acquiring n successful match rate sample, these successful match rates in total for some interpreter's genome Sample meets normal distribution.Meanwhile it presetting and having selected the setting condition of interpreter's gene for the confidence level for the gene selected is not Lower than 95%, the Z value which corresponds to interpreter's genome is 1.96.Then, for each the interpreter's genome selected, Its corresponding Z value is compared with 1.96, if Z value is greater than 1.96, the corresponding interpreter's genome of the Z value is rejected, otherwise, Retain the corresponding interpreter's genome of the Z value.
Assuming that eliminating p according to above-mentioned treatment process from a interpreter's genome selected of all n and being unsatisfactory for setting Interpreter's genome of condition, remaining n-p interpreter genome are to meet to impose a condition.Then, in this n-p interpreter's genome In, may there are two or more than two interpreter's genomes in contain some interpreter's gene simultaneously.Therefore this n-p are translated Whole interpreter's genes in member's genome take out, and are put into a new gene pool, multiple for occurring in the gene pool Each interpreter's gene, rejects extra and only retains interpreter's gene.It is included in this final new gene pool Multiple non-repetitive interpreter's genes, using these genes as the interpreter's gene finally chosen.
The choosing method of interpreter's gene provided in an embodiment of the present invention, by advance from interpreter's gene pool of all interpreters Multiple groups interpreter genome is chosen, and by calculating Z value corresponding to these interpreter's genomes, meets setting condition to choose Z value Interpreter's genome, as between final choose as a result, the interpreter's gene selected is enabled preferably to embody interpreter Otherness.In addition, the interpreter chosen accordingly and contribution to be translated can be made to carry out more reasonable in gene matching application Match, to effectively improve translation efficiency and translation accuracy rate.
Wherein, in one embodiment, from alternative interpreter's list of genes, the step of the different gene of multiple groups is chosen respectively Before rapid, the method for the embodiment of the present invention further include:
Corresponding gene is extracted from the basic information of all interpreters, ability information, credit information and posterior infromation respectively, And it is correspondingly formed basic information gene, ability information gene, credit information gene and the posterior infromation gene of interpreter;
Based on basic information gene, ability information gene, credit information gene and posterior infromation gene, alternative interpreter is constituted List of genes.
It is to be understood that there are many sources of interpreter's gene, and in the social epoch, all data of the every act and every move of interpreter Gene can be extracted, by the sources of interpreter's gene, the present embodiment extracts interpreter's gene from the following aspects, Constitute alternative interpreter's list of genes:
Basic information, the personal relevant information of interpreter, such as name, age, location and contact information;
Ability information, the translation ability information that interpreter possesses, languages direction, industry field and the translation speed being such as good at Deng;
Credit information, the credit information that interpreter accumulates during undertaking translations such as hand over original text rate and midway in time Rate of sending back the manuscript etc.;
Posterior infromation, the correlation experience that interpreter accumulates during long campaigns translation, such as translate total number of word and Total amount etc..
Above- mentioned information based on interpreter extract the corresponding corresponding gene of interpreter respectively, and according to above-mentioned various aspects, formation pair Basic information gene, ability information gene, credit information gene and the posterior infromation gene answered.Later, above-mentioned various aspects are based on Gene, constitute alternative interpreter's list of genes.For example, it is corresponding standby that basic information can be constructed for the basic information of interpreter The person's of translating selectively list of genes is as shown in table 1, for according to a kind of alternative interpreter's list of genes of basic information of the embodiment of the present invention.
Table 1, a kind of alternative interpreter's list of genes of basic information according to an embodiment of the present invention
Then, when carrying out the selection of multiple interpreter's genomes according to table 1, multiple points in each data item can be randomly selected Not corresponding interpreter's gene, such as selection to " institute learns profession " corresponding gene " oil exploitation " and " abroad works and learns to pass through Go through " corresponding gene " having ", then interpreter's genome is constituted with the two.Using same treatment process, can also choose not Multiple and different interpreter's genomes.
If can choose likewise, decimation rule has been previously set to choose gene relevant to interpreter's qualification information The corresponding gene such as " IM ", " learning profession ", " date of birth " and " overseas work experience " in table 1 constitutes interpreter's genome.
The choosing method of interpreter's gene provided in an embodiment of the present invention, by from the basic information of interpreter, ability information, letter With four aspects of information and posterior infromation, the gene of interpreter is extracted respectively, and constitute alternative interpreter's list of genes accordingly, to carry out The selection and matching of more excellent interpreter's gene can more fully consider the specific information of interpreter's different aspect, for more reasonably into The matching of row gene provides reliable basis.
Wherein, optional according to above-described embodiment, respectively from all basic informations, ability information, the credit information of interpreter Further comprise with the step of extracting corresponding gene in posterior infromation:
Obtain interpreter all basic informations, ability information, credit information and posterior infromation, and respectively from basic information, Interpreter's feature is obtained in ability information, credit information and posterior infromation;
Based on interpreter's feature, interpreter's direct gene of interpreter is extracted.
It is to be understood that interpreter's gene is present in interpreter, different interpreters have different genes, there is general character but more important Be otherness to be extracted gene, can just be treated in this way with differentiation, match best interpreter.
But gene is not feature, simply can not explicitly recognize, extract so needing step.Gene and spy Sign is characterized in taking out a certain concept to characteristic common to object there are essential distinction.Include segment attribute in feature, and belongs to Most basic information --- the gene of object included in property.
Therefore the present embodiment is when carrying out the extraction of interpreter's gene, first according to the four of the interpreter of above-described embodiment aspects Information extracts corresponding characteristic information, as interpreter's feature.Later, according to different interpreter's features, interpreter is extracted most respectively Basic information constitutes interpreter's direct gene.For example, as shown in Fig. 2, for according to the choosing of interpreter's gene provided in an embodiment of the present invention Take the relation schematic diagram of interpreter's feature and interpreter's gene in method.
The choosing method of interpreter's gene provided in an embodiment of the present invention is further extracted by the extraction to interpreter's feature Interpreter's gene, what can be got is present in specific interpreter, be different from other interpreters, unique key message.
Wherein, according to the above embodiments optionally, multiple matching result sampling is carried out, multiple successful match rate samples are obtained This step of, further comprises:
Matching result sampling multiple for any wheel, executes following process flow:
The initial value of the successful match rate of all interpreter's genomes is initially set;
Interpreter's genome is randomly selected from all interpreter's genomes, and interpreter's genome of selection is matched Test, and based on to interpreter's genome this match test successful match rate result and history match success rate as a result, more The current successful match rate value of new interpreter's genome;
It repeats and randomly selects to the step of update, until the number to the match test of any interpreter's genome reaches First given threshold stops the match test to interpreter's genome, and records the current successful match rate of interpreter's genome Value;
To interpreter's genome other than the interpreter's genome for stopping match test, the step randomly selected to record is repeated Suddenly, until reaching the second given threshold to the total degree of the match test of all interpreter's genomes, then each interpreter's gene is recorded The current successful match rate value of group, and terminate the multiple matching result sampling of epicycle, into the multiple matching result sampling of next round, directly Reach third given threshold to the total wheel number for executing multiple matching result sampling, the quantity for obtaining each interpreter's genome is third The successful match rate sample of given threshold.
It is to be understood that according to the above embodiments, for interpreter's genome that each group selects, it is thus necessary to determine that The matching effect of itself and contribution, so that selection is more suitable for the matched interpreter's gene of gene.Meanwhile in order to without loss of generality, for Each group of interpreter's genome carries out multiple matching result sampling.And specifically carrying out each group of interpreter's genome selected When matching result samples, carried out using above-mentioned Matching Model.
Specifically, can use given Matching Model, the multiple matching result sampling of more wheels is carried out.Obtain it is multiple matching at When power sample, it can be assumed that have chosen m group interpreter's genome according to the above embodiments, then it can be to each interpreter's genome Successful match rate sampled, carrying out more wheels based on the above m genome, repeatedly (general no less than 30 times) matching is tested, often It is as follows to take turns match test process:
Step 1, initializing set, such as Initialize installation are carried out to the value of the successful match rate of each interpreter's genome It is 0.
Step 2, interpreter's genome is randomly choosed, successful match rate result is carried out in given Matching Model and calculates, Obtain the successful match rate result of this match test.Meanwhile in conjunction in epicycle multiple matching result sampling historical record it The successful match rate of preceding match test for several times is as a result, i.e. history match success rate is as a result, calculate the interpreter's genome chosen Current successful match rate value.
Step 3, repeatedly circulation executes above-mentioned steps 1 and 2, is from all interpreter's bases due to choosing interpreter's genome every time all Because randomly selecting in group, therefore each genome may be different by the number of carry out match test, then when to some interpreter's gene When the number of the match test of group reaches the first given threshold, that is, stop the epicycle match test to interpreter's genome, And when recording stopping test, the current successful match rate value of interpreter's genome.
Step 4, remaining interpreter's genome except interpreter's genome of the first given threshold is reached for removing, continues to hold The process flow of row above-mentioned steps 1-3 stops epicycle matching until the total degree of epicycle match test reaches the second given threshold Test.At this point for each interpreter's genome, there is a successful match rate value to be corresponding to it, as the multiple matching result of epicycle M successful match rate sample can be obtained then for m interpreter's genome by sampling obtained successful match rate sample.So, For all interpreter's genomes, the above-mentioned multiple matching result sampling of more wheels (such as reaching third given threshold) is carried out To obtain multiple successful match rate samples of each interpreter's genome, such as wheel number is set as n, then successful match rate sample number is n (n is typically no less than 50).
For example, it is assumed that having selected a1、a2And a3Totally three interpreter's genomes, and preset the first given threshold, second Given threshold and third given threshold are respectively 3,8 and 5.Then, in the multiple matching result sampling of each round:
First time selection is carried out first, from a1、a2And a3In randomly select one, such as choose and arrive a1, then to a1Progress With test, test result is successful match, then obtains a1Successful match rate value be 100%.
It is chosen followed by second, it is assumed that choose and arrive a2, match test is carried out to it, obtains test result as matching It is unsuccessful, then obtain a2Successful match rate value be 0%.
Next third time selection is carried out again, it is assumed that and choose and arrive a1, and match test result is to match unsuccessful, then root According to a1In total twice match test as a result, obtaining a1Current successful match rate value is 50%.
Next the 4th selection is carried out again, it is assumed that is chosen and is arrived a3, and match test result is successful match, then obtains a3 Successful match rate value be 100%.
Next the 5th selection is carried out again, it is assumed that and choose and arrive a1, and match test result is successful match, then basis To a1In total three times match test as a result, obtaining a1Current successful match rate value is 66.6%.At this point, to a1Matching examination It tests number and has had reached the first given threshold 3, then stop continuing to a1Carry out match test, and export its current matching at Interpreter's genome a in the multiple matching result sampling of performance number 66.6%, as epicycle1Successful match rate sample.
Next the 6th selection is carried out again, due to a13 match tests are had reached, then only in a2And a3Middle progress Match test is randomly selected and carries out, specific selection and match test process are similar with above-mentioned steps.In this way, until total matching The number of test, i.e., to a1、a2And a3The total degree of match test when reaching the second given threshold 8 times, terminate epicycle multiple It is sampled with result.At this point, having obtained a successful match rate sample all in accordance with above-mentioned match test for each interpreter's genome This.
So, to three interpreter's genome a1、a2And a3, repeat more wheels and carry out above-mentioned multiple matching result sampling, then often One wheel can obtain a1、a2And a3Corresponding one group of successful match rate sample.Until duplicate discussion reaches third given threshold 5, then available a1、a2And a3Respectively corresponding 5 successful match rate sample.
The choosing method of interpreter's gene provided in an embodiment of the present invention carries out each interpreter's genome using given Matching Model Multiple successful match rate calculate, and accordingly choose the higher interpreter's genome of successful match rate, calculated result reliability can be made It is higher.
Wherein, optional according to above-described embodiment, it is translated based on the corresponding mean value of all interpreter's genomes with each The corresponding standard deviation of member's genome, the step that is further processed for calculating the corresponding Z value of interpreter's genome refer to Fig. 3, according to The flow diagram of Z value is calculated in the choosing method of interpreter's gene provided in an embodiment of the present invention, comprising:
S301 is based on the corresponding mean value of all interpreter's genomes, calculates the successful match rate of all interpreter's genomes Unified mean value.
It is to be understood that can be calculated respectively for all interpreter's genomes selected according to above-described embodiment The mean value of the corresponding successful match rate of interpreter's genome.Then respectively corresponded first according to each interpreter's genome in the present embodiment Successful match rate mean value, calculate the mean value of successful match rate corresponding to all interpreter's genome entirety, i.e. successful match The unified mean value of rate.Specifically, can calculate according to the following formula:
In formula, μ indicates that integrally corresponding successful match rate unifies mean value to all interpreter's genomes, and m indicates the institute selected There are the group number of interpreter's genome, Ei(p) mean value of the corresponding successful match rate of i-th of interpreter's genome is indicated.
S302, it is corresponding based on each corresponding standard deviation of interpreter's genome and mean value and all interpreter's genomes Unified mean value, calculates the corresponding Z value of interpreter's genome.
It is to be understood that on the basis of all interpreter's genomes of above-mentioned steps calculating acquisition corresponding unified mean value, knot It closes above-described embodiment and the standard deviation and mean value of the corresponding successful match rate of each interpreter's genome is calculated, using given Z value calculation formula can correspond to each interpreter's genome Z value for calculating and selecting.
Wherein, in one embodiment, the step of calculating interpreter's genome corresponding Z value further comprises: using such as Lower formula calculates the corresponding Z value of each interpreter's genome:
In formula, ZiIndicate the corresponding Z value of i-th of interpreter's genome, n indicates the corresponding successful match of each interpreter's genome The number of rate sample, Ei(p) the corresponding mean value of i-th of interpreter's genome is indicated, μ indicates the corresponding unification of all interpreter's genomes Mean value, SiIndicate the corresponding standard deviation of i-th of interpreter's genome.
The choosing method of interpreter's gene provided in an embodiment of the present invention is distinguished using each the interpreter's genome selected Corresponding mean value successively calculates the unified mean value of all interpreter's genomes and the Z value of each interpreter's genome, can be more accurate Characterize the successful match rate situation of each interpreter's genome, so as to more accurately choose interpreter's gene come with contribution gene into Row matching, improves matching effect.
In addition, on the basis of the above embodiments, the interpreter for meeting and imposing a condition is being chosen from all interpreter's genomes After the step of genome, the method for the embodiment of the present invention can also include following processing step: if in all interpreter's genomes, The Z value of none interpreter's genome can satisfy the setting condition of above-described embodiment, then returns to step S101, from alternative interpreter Again the different interpreter's genome of multiple groups is chosen in list of genes, re-starts the calculating and selection process of above-described embodiment.
For example, the case where successful match rate sample for sampling meets normal distribution, to obtain 95% confidence level, I.e. preset setting condition is that the confidence level of interpreter's genome meets 95%, then the Z value calculated for interpreter's genome is answered No more than 1.96.And in practical application, it, may be due to being when choosing multiple groups interpreter genome from alternative interpreter's list of genes It the reasons such as randomly selects, causes when calculating Z value to the interpreter's genome selected, Z value is not able to satisfy above-mentioned standard, then needs Again other interpreter's genome is selected in alternative interpreter's list of genes, and is recalculated and chosen.
The choosing method of interpreter's gene provided in an embodiment of the present invention, by the judgement to calculated result and to selecting step Circulating repetition execute, can guarantee that the high quality gene met the requirements can be selected, it is to be translated for more accurately matching Contribution is of great significance.
As the other side of the embodiment of the present invention, the embodiment of the present invention provides a kind of interpreter according to the above embodiments The selecting device of gene, the device are used to realize the selection to final interpreter's gene in the above embodiments.Therefore, above-mentioned Description and definition in the choosing method of interpreter's gene of each embodiment can be used for each execution module in the embodiment of the present invention Understanding, specifically refer to above-described embodiment, do not repeating herein.
One embodiment of present aspect embodiment according to the present invention, the structure of the selecting device of interpreter's gene as shown in figure 4, For the structural schematic diagram of the selecting device of interpreter's gene provided in an embodiment of the present invention, which can be used for above-mentioned each method The selection of interpreter's gene in embodiment, the device include: that initial gene chooses module 401, the first computing module 402, second meter It calculates module 403 and final gene chooses module 404.
Wherein, initial gene is chosen module 401 and is used for from alternative interpreter's list of genes, chooses the different base of multiple groups respectively Cause constitutes multiple interpreter's genomes;First computing module 402 is used to carry out repeatedly matching knot for each interpreter's genome Fruit sampling obtains multiple successful match rate samples, and is based on multiple successful match rate samples, and it is corresponding to calculate interpreter's genome The mean value and standard deviation of successful match rate;Second computing module 403 is used to be based on the corresponding mean value of all interpreter's genomes Standard deviation corresponding with each interpreter's genome calculates the corresponding Z value of interpreter's genome;Final gene chooses module 404 For being based on the corresponding Z value of each interpreter's genome, the interpreter's base for meeting and imposing a condition is chosen from all interpreter's genomes Gene because of group, and in interpreter's genome that the satisfaction is imposed a condition merges, and obtains the interpreter's gene finally chosen;Wherein, The Z value indicates Z value in the verifying of large sample otherness.
Specifically, initial gene chooses module 301 can select respectively according to the alternative interpreter's list of genes pre-established Multiple groups interpreter's gene is taken, and a genome is constituted with each group of interpreter's gene respectively, as interpreter's genome, interpreter's gene Group is the interpreter's genome selected.For example, initial gene chooses module 301 can when carrying out the selection of each group interpreter gene With from multiple interpreter's genes in alternative interpreter's list of genes in random selection table, and the interpreter's gene randomly selected using these Constitute a genome, as interpreter's genome.
Later, for the interpreter's genome selected for each group, it is thus necessary to determine that the matching effect of itself and contribution, thus Selection is more suitable for the matched interpreter's gene of gene.Meanwhile in order to without loss of generality, be counted for each group of interpreter's genome, first Calculating module 302 can repeatedly be matched by the way that interpreter's genome is inputted given Matching Model using given Matching Model As a result it samples, sampling can obtain a successful match rate sample every time.It is understood that each successful match rate sample This, the successful match rate score that an actually matching result samples.
In addition, the first computing module 302 is according to above-mentioned multiple matching for each interpreter selected genome As a result the multiple successful match rate samples for sampling acquisition, calculate the comprehensive matching success rate of interpreter's genome, that is, calculate separately The mean value and standard deviation of the corresponding successful match rate of interpreter's genome.
Later, interpreter's genome that the second computing module 403 selects each calculates its Z value.Specifically, For each interpreter's genome, the standard deviation of the successful match rate according to corresponding to it and all interpreter's genomes are right respectively The mean value for answering successful match rate calculates its corresponding Z value.
Finally, final gene, which chooses module 404, may determine that each corresponding interpreter's genome according to the above-mentioned Z value being calculated Otherness performance when carrying out gene matching.Therefore, according to the corresponding Z value of each interpreter's genome, final gene chooses mould Block 404 can use preset setting condition, judge whether the corresponding interpreter's genome of the Z value meets the otherness of setting It is required that.If conditions are not met, then reject it from each interpreter's genome selected, what final residue was not removed all is translated Member's genome is satisfactory interpreter's genome.Final gene chooses module 404 again by remaining all interpreter's genomes In gene take out, and after removing the duplicate factor in these genes, form one group of new gene, i.e., as finally choosing Interpreter's gene.
Further, on the basis of the above embodiments, the device of the embodiment of the present invention further includes alternative interpreter's gene column Table constructs module, is used for: extracting phase from all basic informations of interpreter, ability information, credit information and posterior infromation respectively The gene answered, and it is correspondingly formed basic information gene, ability information gene, credit information gene and the posterior infromation base of interpreter Cause;Based on the basic information gene, ability information gene, credit information gene and posterior infromation gene, constitute described alternative Interpreter's list of genes.
Wherein optional, alternative interpreter's list of genes building module is specifically used for: obtaining all basic informations, the energy of interpreter Force information, credit information and posterior infromation, and obtained from basic information, ability information, credit information and posterior infromation respectively Interpreter's feature;Based on interpreter's feature, interpreter's direct gene of interpreter is extracted.
Wherein optional, the second computing module is specifically used for: being based on the corresponding mean value of all interpreter's genomes, calculates The unified mean value of the successful match rate of all interpreter's genomes;Based on each corresponding standard deviation of interpreter's genome and mean value, And the corresponding unified mean value of all interpreter's genomes, calculate the corresponding Z value of interpreter's genome.
Wherein optional, the second computing module is specifically used for: utilizing following formula, calculates the corresponding Z of each interpreter's genome Value:
In formula, ZiIndicate the corresponding Z value of i-th of interpreter's genome, n indicates the corresponding successful match of each interpreter's genome The number of rate sample, Ei(p) the corresponding mean value of i-th of interpreter's genome is indicated, μ indicates the corresponding unification of all interpreter's genomes Mean value, SiIndicate the corresponding standard deviation of i-th of interpreter's genome.
Wherein optional, the first computing module is specifically used for: any multiple matching result of wheel being sampled, following place is executed Reason process: the initial value of the successful match rate of all interpreter's genomes is initially set;From all interpreter's genomes with Machine chooses interpreter's genome, carries out match test to interpreter's genome of selection, and based on to interpreter's genome sheet The successful match rate result and history match success rate of secondary match test are as a result, update the current successful match of interpreter's genome Rate value;It repeats and randomly selects to the step of update, until reaching the to the number of the match test of any interpreter's genome One given threshold stops the match test to interpreter's genome, and records the current successful match rate value of interpreter's genome; To stop match test interpreter's genome other than interpreter's genome, repeat randomly select to record the step of, until Second given threshold is reached to the total degree of the match test of all interpreter's genomes, then it is current to record each interpreter's genome Successful match rate value, and terminate the multiple matching result sampling of epicycle, into the multiple matching result sampling of next round, until executing more Total wheel number of secondary matching result sampling reaches third given threshold, and the quantity for obtaining each interpreter's genome is third given threshold Successful match rate sample.
Wherein optional, the first computing module is specifically used for, for each interpreter's genome, the successful match rate of extraction The number of sample is no less than given threshold.
It is understood that can be by hardware processor (hardware processor) come real in the embodiment of the present invention Each relative program module in the device of existing the various embodiments described above.Also, the selecting device of each interpreter's gene of the embodiment of the present invention When for the selection of interpreter's gene in above-mentioned each method embodiment, the beneficial effect of generation and corresponding above-mentioned each method are real It is identical to apply example, above-mentioned each method embodiment can be referred to, details are not described herein again.
As the another aspect of the embodiment of the present invention, the present embodiment provides a kind of electronics according to the above embodiments and sets It is standby, it is the entity structure schematic diagram of electronic equipment provided in an embodiment of the present invention, comprising: at least one processor with reference to Fig. 5 501, at least one processor 502, communication interface 503 and bus 504.
Wherein, memory 501, processor 502 and communication interface 503 complete mutual communication by bus 504, communicate Interface 503 is for the information transmission between the electronic equipment and interpreter's information equipment;Being stored in memory 501 can be in processor The computer program run on 502 when processor 502 executes the computer program, realizes translating as described in the various embodiments described above The choosing method of member's gene.
It is to be understood that including at least memory 501, processor 502, communication interface 503 and bus in the electronic equipment 504, and memory 501, processor 502 and communication interface 503 form mutual communication connection by bus 504, and can be complete The program instruction of the choosing method of interpreter's gene is read from memory 501 at mutual communication, such as processor 502.Separately Outside, communication interface 503 can also realize the communication connection between the electronic equipment and interpreter's information equipment, and achievable mutual Information transmission, such as the selection to interpreter's gene is realized by communication interface 503.
When electronic equipment is run, processor 502 calls the program instruction in memory 501, real to execute above-mentioned each method Apply method provided by example, for example, from alternative interpreter's list of genes, choose the different gene of multiple groups respectively, constitute more A interpreter's genome;For each interpreter's genome, multiple matching result sampling is carried out, obtains multiple successful match rate samples This, and multiple successful match rate samples are based on, calculate the mean value and standard deviation of the corresponding successful match rate of interpreter's genome;Base In the corresponding mean value of all interpreter's genomes standard deviation corresponding with each interpreter's genome, interpreter's genome is calculated Corresponding Z value;Based on the corresponding Z value of each interpreter's genome, is chosen from all interpreter's genomes and meet setting condition Interpreter's genome, and the gene in interpreter's genome that the satisfaction is imposed a condition merges, and obtains the interpreter's gene finally chosen; Wherein, the Z value indicates Z value etc. in the verifying of large sample otherness.
Program instruction in above-mentioned memory 501 can be realized and as independent by way of SFU software functional unit Product when selling or using, can store in a computer readable storage medium.Alternatively, realizing that above-mentioned each method is implemented This can be accomplished by hardware associated with program instructions for all or part of the steps of example, and program above-mentioned can store to be calculated in one In machine read/write memory medium, when being executed, execution includes the steps that above-mentioned each method embodiment to the program;And storage above-mentioned Medium includes: USB flash disk, mobile hard disk, read-only memory (Read-Only Memory, ROM), random access memory (Random Access Memory, RAM), the various media that can store program code such as magnetic or disk.
The embodiment of the present invention provides a kind of non-transient computer readable storage medium also according to the various embodiments described above, this is non-temporarily State computer-readable recording medium storage computer instruction, the computer instruction execute computer as described in the various embodiments described above Interpreter's gene choosing method, for example, from alternative interpreter's list of genes, choose the different gene of multiple groups, structure respectively At multiple interpreter's genomes;For each interpreter's genome, multiple matching result sampling is carried out, obtains multiple successful match rates Sample, and multiple successful match rate samples are based on, calculate the mean value and standard deviation of the corresponding successful match rate of interpreter's genome; Based on the corresponding mean value of all interpreter's genomes standard deviation corresponding with each interpreter's genome, interpreter's gene is calculated The corresponding Z value of group;Based on the corresponding Z value of each interpreter's genome, is chosen from all interpreter's genomes and meet setting condition Interpreter's genome, and gene in interpreter's genome that the satisfaction is imposed a condition merges, and obtains the interpreter's base finally chosen Cause;Wherein, the Z value indicates Z value etc. in the verifying of large sample otherness.
Electronic equipment provided in an embodiment of the present invention and non-transient computer readable storage medium, by executing above-mentioned each reality The choosing method of interpreter's gene described in example is applied, chooses multiple groups interpreter genome from interpreter's gene pool of all interpreters in advance, And by calculate these interpreter's genomes corresponding to Z value, come choose Z value meet impose a condition interpreter's genome, using as Final chooses as a result, the interpreter's gene selected is enabled preferably to embody the otherness between interpreter.In addition, in gene With in application, the interpreter chosen accordingly can be made more reasonably to be matched with contribution to be translated, to effectively improve translation effect Rate and translation accuracy rate.
It is understood that the embodiment of device described above, electronic equipment and storage medium is only schematic , wherein unit may or may not be physically separated as illustrated by the separation member, it can both be located at one Place, or may be distributed on heterogeneous networks unit.Some or all of modules can be selected according to actual needs To achieve the purpose of the solution of this embodiment.Those of ordinary skill in the art are without paying creative labor To understand and implement.
By the description of embodiment of above, those skilled in the art is it will be clearly understood that each embodiment can borrow Help software that the mode of required general hardware platform is added to realize, naturally it is also possible to pass through hardware.Based on this understanding, above-mentioned Substantially the part that contributes to existing technology can be embodied in the form of software products technical solution in other words, the meter Calculation machine software product may be stored in a computer readable storage medium, such as USB flash disk, mobile hard disk, ROM, RAM, magnetic disk or light Disk etc., including some instructions, with so that a computer equipment (such as personal computer, server or network equipment etc.) Execute method described in certain parts of above-mentioned each method embodiment or embodiment of the method.
In addition, those skilled in the art are it should be understood that in the application documents of the embodiment of the present invention, term "include", "comprise" or any other variant thereof is intended to cover non-exclusive inclusion, so that including a series of elements Process, method, article or equipment not only include those elements, but also including other elements that are not explicitly listed, or Person is to further include for elements inherent to such a process, method, article, or device.In the absence of more restrictions, by The element that sentence "including a ..." limits, it is not excluded that in the process, method, article or apparatus that includes the element There is also other identical elements.
In the specification of the embodiment of the present invention, numerous specific details are set forth.It should be understood, however, that the present invention is implemented The embodiment of example can be practiced without these specific details.In some instances, it is not been shown in detail well known Methods, structures and technologies, so as not to obscure the understanding of this specification.Similarly, it should be understood that in order to simplify implementation of the present invention Example is open and helps to understand one or more of the various inventive aspects, above to the exemplary embodiment of the embodiment of the present invention Description in, each feature of the embodiment of the present invention is grouped together into single embodiment, figure or descriptions thereof sometimes In.
However, the disclosed method should not be interpreted as reflecting the following intention: i.e. the claimed invention is implemented Example requires features more more than feature expressly recited in each claim.More precisely, such as claims institute As reflection, inventive aspect is all features less than single embodiment disclosed above.Therefore, it then follows specific embodiment party Thus claims of formula are expressly incorporated in the specific embodiment, wherein each claim itself is real as the present invention Apply the separate embodiments of example.
Finally, it should be noted that above embodiments are only to illustrate the technical solution of the embodiment of the present invention, rather than it is limited System;Although the embodiment of the present invention is described in detail with reference to the foregoing embodiments, those skilled in the art it is understood that It is still possible to modify the technical solutions described in the foregoing embodiments, or part of technical characteristic is carried out etc. With replacement;And these are modified or replaceed, each embodiment skill of the embodiment of the present invention that it does not separate the essence of the corresponding technical solution The spirit and scope of art scheme.

Claims (9)

1. a kind of choosing method of interpreter's gene characterized by comprising
From alternative interpreter's list of genes, the different gene of multiple groups is chosen respectively, constitutes multiple interpreter's genomes;
For interpreter's genome described in each, multiple matching result sampling is carried out, obtains multiple successful match rate samples, and base In the multiple successful match rate sample, the mean value and standard deviation of the corresponding successful match rate of interpreter's genome are calculated;
It is corresponding with interpreter's genome described in each described based on the corresponding mean value of all interpreter's genomes Standard deviation calculates the corresponding Z value of interpreter's genome;
Based on the corresponding Z value of interpreter's genome described in each, is chosen from all interpreter's genomes and meet setting Interpreter's genome of condition, and the gene in the interpreter's genome for meeting and imposing a condition is merged, what acquisition was finally chosen Interpreter's gene;
Wherein, the Z value indicates Z value in the verifying of large sample otherness.
2. the method according to claim 1, wherein being chosen respectively from alternative interpreter's list of genes described Before the step of multiple groups different gene, further includes:
Corresponding gene is extracted from all basic informations of interpreter, ability information, credit information and posterior infromation respectively, and right Basic information gene, ability information gene, credit information gene and the posterior infromation gene of interpreter should be formed;
Based on the basic information gene, ability information gene, credit information gene and posterior infromation gene, constitute described alternative Interpreter's list of genes.
3. the method according to claim 1, wherein described corresponding based on all interpreter's genomes The mean value standard deviation corresponding with interpreter's genome described in each, calculates the step of the corresponding Z value of interpreter's genome Suddenly further comprise:
Based on the corresponding mean value of all interpreter's genomes, the successful match of all interpreter's genomes is calculated The unified mean value of rate;
Based on the corresponding standard deviation of interpreter's genome described in each and the mean value and all interpreter's genomes The corresponding unified mean value, calculates the corresponding Z value of interpreter's genome.
4. according to the method described in claim 3, it is characterized in that, the interpreter's genome corresponding Z value of calculating Step further comprises:
Using following formula, the corresponding Z value of each interpreter's genome is calculated:
In formula, ZiIndicate the corresponding Z value of i-th of interpreter's genome, n indicates the corresponding matching of each interpreter's genome The number of success rate sample, Ei(p) the corresponding mean value of i-th of interpreter's genome is indicated, μ indicates all interpreter's genes The corresponding unified mean value of group, SiIndicate the corresponding standard deviation of i-th of interpreter's genome.
5. obtaining multiple the method according to claim 1, wherein described carry out multiple matching result sampling The step of being made into power sample further comprises:
The multiple matching result sampling described for any wheel, executes following process flow:
The initial value of the successful match rate of all interpreter's genomes is initially set;
Interpreter's genome is randomly selected from all interpreter's genomes, and interpreter's genome of selection is carried out Match test, and based on the successful match rate result and history match success rate knot to this match test of interpreter's genome Fruit updates the current successful match rate value of interpreter's genome;
Repeat it is described randomly select to the step of the update, until to the match test of any interpreter's genome Number reaches the first given threshold, stops the match test to interpreter's genome, and records current of interpreter's genome With success ratio values;
To interpreter's genome other than the interpreter's genome for stopping match test, described randomly select to the record is repeated The step of, until reaching the second given threshold to the total degree of the match test of all interpreter's genomes, then record is each The current successful match rate value of interpreter's genome, and terminate multiple matching result sampling described in epicycle, into next round institute Multiple matching result sampling is stated, until the total wheel number for executing the multiple matching result sampling reaches third given threshold, is obtained The quantity of each interpreter's genome is the successful match rate sample of third given threshold.
6. the method according to claim 1, wherein for interpreter's genome described in each, extraction it is described The number of successful match rate sample is no less than given threshold.
7. a kind of selecting device of interpreter's gene characterized by comprising
Initial gene chooses module, for choosing the different gene of multiple groups respectively, constituting multiple from alternative interpreter's list of genes Interpreter's genome;
First computing module obtains multiple for carrying out multiple matching result sampling for interpreter's genome described in each It is made into power sample, and is based on the multiple successful match rate sample, calculates the corresponding successful match rate of interpreter's genome Mean value and standard deviation;
Second computing module, for based on all corresponding mean values of interpreter's genome and each described interpreter The corresponding standard deviation of genome calculates the corresponding Z value of interpreter's genome;
Final gene chooses module, for based on the corresponding Z value of each described interpreter's genome, from all interpreters The interpreter's genome for meeting and imposing a condition, and the gene in interpreter's genome that the satisfaction is imposed a condition are chosen in genome Merge, obtains the interpreter's gene finally chosen;
Wherein, the Z value indicates Z value in the verifying of large sample otherness.
8. a kind of electronic equipment characterized by comprising at least one processor, at least one processor, communication interface and total Line;
The memory, the processor and the communication interface complete mutual communication, the communication by the bus Interface is for the information transmission between the electronic equipment and interpreter's information equipment;
The computer program that can be run on the processor is stored in the memory, the processor executes the calculating When machine program, the method as described in any in claim 1 to 6 is realized.
9. a kind of non-transient computer readable storage medium, which is characterized in that the non-transient computer readable storage medium is deposited Computer instruction is stored up, the computer instruction makes the computer execute the method as described in any in claim 1 to 6.
CN201811095799.1A 2018-09-19 2018-09-19 Translator gene selection method and device and electronic equipment Active CN109299737B (en)

Priority Applications (3)

Application Number Priority Date Filing Date Title
CN201811095799.1A CN109299737B (en) 2018-09-19 2018-09-19 Translator gene selection method and device and electronic equipment
PCT/CN2018/124951 WO2020057003A1 (en) 2018-09-19 2018-12-28 Translator gene selection method and apparatus, and electronic device
PCT/CN2018/124891 WO2020057001A1 (en) 2018-09-19 2018-12-28 Machine translation engine recommendation method and apparatus

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201811095799.1A CN109299737B (en) 2018-09-19 2018-09-19 Translator gene selection method and device and electronic equipment

Publications (2)

Publication Number Publication Date
CN109299737A true CN109299737A (en) 2019-02-01
CN109299737B CN109299737B (en) 2021-10-26

Family

ID=65163510

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201811095799.1A Active CN109299737B (en) 2018-09-19 2018-09-19 Translator gene selection method and device and electronic equipment

Country Status (2)

Country Link
CN (1) CN109299737B (en)
WO (2) WO2020057003A1 (en)

Citations (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR100793990B1 (en) * 2006-09-18 2008-01-16 삼성전자주식회사 Method and system for early z test for tile-based 3d rendering
CN102103612A (en) * 2009-12-22 2011-06-22 北大方正集团有限公司 Information extraction method and device
CN103064970A (en) * 2012-12-31 2013-04-24 武汉传神信息技术有限公司 Search method for optimizing translators
CN103092827A (en) * 2012-12-31 2013-05-08 武汉传神信息技术有限公司 Method for multi-strategy interpreter manuscript automatic matching
CN103729349A (en) * 2013-12-23 2014-04-16 武汉传神信息技术有限公司 Analyzing method for affecting factors on translation quality
CN105138521A (en) * 2015-08-27 2015-12-09 武汉传神信息技术有限公司 General translator recommendation method for risk project in translation industry
CN105279147A (en) * 2015-09-29 2016-01-27 武汉传神信息技术有限公司 Translator document quick matching method
CN106844303A (en) * 2016-12-23 2017-06-13 语联网(武汉)信息技术有限公司 A kind of is to treat the method that manuscript of a translation part matches interpreter based on similarity mode algorithm
CN106844304A (en) * 2016-12-26 2017-06-13 语联网(武汉)信息技术有限公司 It is a kind of to be categorized as treating the method that manuscript of a translation part matches interpreter based on the manuscript of a translation
CN107016131A (en) * 2017-05-19 2017-08-04 北方工业大学 Machine learning algorithm based on enhanced clustering and application of algorithm
CN107357783A (en) * 2017-07-04 2017-11-17 桂林电子科技大学 A kind of English translation mass analysis method of translator of Chinese into English
CN108538284A (en) * 2017-03-06 2018-09-14 北京搜狗科技发展有限公司 Simultaneous interpretation result shows method and device, simultaneous interpreting method and device

Family Cites Families (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8775155B2 (en) * 2010-10-25 2014-07-08 Xerox Corporation Machine translation using overlapping biphrase alignments and sampling
US9619463B2 (en) * 2012-11-14 2017-04-11 International Business Machines Corporation Document decomposition into parts based upon translation complexity for translation assignment and execution
CN104537009B (en) * 2014-12-17 2017-09-29 武汉传神信息技术有限公司 Interpreter recommends method and device
US10067936B2 (en) * 2014-12-30 2018-09-04 Facebook, Inc. Machine translation output reranking
CN106776583A (en) * 2015-11-24 2017-05-31 株式会社Ntt都科摩 Machine translation evaluation method and apparatus and machine translation method and equipment
CN106021239B (en) * 2016-04-29 2018-10-26 北京创鑫旅程网络技术有限公司 A kind of translation quality real-time estimating method
CN107480147A (en) * 2017-08-15 2017-12-15 中译语通科技(北京)有限公司 A kind of method and system of comparative evaluation's machine translation system

Patent Citations (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR100793990B1 (en) * 2006-09-18 2008-01-16 삼성전자주식회사 Method and system for early z test for tile-based 3d rendering
CN102103612A (en) * 2009-12-22 2011-06-22 北大方正集团有限公司 Information extraction method and device
CN103064970A (en) * 2012-12-31 2013-04-24 武汉传神信息技术有限公司 Search method for optimizing translators
CN103092827A (en) * 2012-12-31 2013-05-08 武汉传神信息技术有限公司 Method for multi-strategy interpreter manuscript automatic matching
CN103729349A (en) * 2013-12-23 2014-04-16 武汉传神信息技术有限公司 Analyzing method for affecting factors on translation quality
CN105138521A (en) * 2015-08-27 2015-12-09 武汉传神信息技术有限公司 General translator recommendation method for risk project in translation industry
CN105279147A (en) * 2015-09-29 2016-01-27 武汉传神信息技术有限公司 Translator document quick matching method
CN106844303A (en) * 2016-12-23 2017-06-13 语联网(武汉)信息技术有限公司 A kind of is to treat the method that manuscript of a translation part matches interpreter based on similarity mode algorithm
CN106844304A (en) * 2016-12-26 2017-06-13 语联网(武汉)信息技术有限公司 It is a kind of to be categorized as treating the method that manuscript of a translation part matches interpreter based on the manuscript of a translation
CN108538284A (en) * 2017-03-06 2018-09-14 北京搜狗科技发展有限公司 Simultaneous interpretation result shows method and device, simultaneous interpreting method and device
CN107016131A (en) * 2017-05-19 2017-08-04 北方工业大学 Machine learning algorithm based on enhanced clustering and application of algorithm
CN107357783A (en) * 2017-07-04 2017-11-17 桂林电子科技大学 A kind of English translation mass analysis method of translator of Chinese into English

Also Published As

Publication number Publication date
WO2020057003A1 (en) 2020-03-26
CN109299737B (en) 2021-10-26
WO2020057001A1 (en) 2020-03-26

Similar Documents

Publication Publication Date Title
CN107329967B (en) Question answering system and method based on deep learning
CN110197672B (en) Voice call quality detection method, server and storage medium
CN103577989B (en) A kind of information classification approach and information classifying system based on product identification
CN110427461A (en) Intelligent answer information processing method, electronic equipment and computer readable storage medium
CN105868179B (en) A kind of intelligent answer method and device
CN110110062A (en) Machine intelligence answering method, device and electronic equipment
CN111309887B (en) Method and system for training text key content extraction model
CN106649742A (en) Database maintenance method and device
CN105912645B (en) A kind of intelligent answer method and device
CN107273406A (en) Dialog process method and device in task dialogue system
CN108804526A (en) Interest determines that system, interest determine method and storage medium
CN108108347B (en) Dialogue mode analysis system and method
CN109522395A (en) Automatic question-answering method and device
CN114490998B (en) Text information extraction method and device, electronic equipment and storage medium
CN109857846A (en) The matching process and device of user's question sentence and knowledge point
CN115687925A (en) Fault type identification method and device for unbalanced sample
CN109582970A (en) A kind of semantic measurement method, apparatus, equipment and readable storage medium storing program for executing
CN107766560A (en) The evaluation method and system of customer service flow
CN113868422A (en) Multi-label inspection work order problem traceability identification method and device
CN113850387A (en) Expert system knowledge base construction method, question and answer method, system, device and medium
CN110162769A (en) Text subject output method and device, storage medium and electronic device
Dihingia et al. Chatbot implementation in customer service industry through deep neural networks
CN107122378A (en) Object processing method and device
CN109299737A (en) Choosing method, device and the electronic equipment of interpreter's gene
CN109448792A (en) Choosing method, device and the electronic equipment of interpreter's gene

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant