CN114139560A - Translation system based on artificial intelligence - Google Patents

Translation system based on artificial intelligence Download PDF

Info

Publication number
CN114139560A
CN114139560A CN202111465059.4A CN202111465059A CN114139560A CN 114139560 A CN114139560 A CN 114139560A CN 202111465059 A CN202111465059 A CN 202111465059A CN 114139560 A CN114139560 A CN 114139560A
Authority
CN
China
Prior art keywords
target
translation
unit
constraint condition
revision
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202111465059.4A
Other languages
Chinese (zh)
Other versions
CN114139560B (en
Inventor
刘建伟
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shandong Shiyu Translation Co ltd
Original Assignee
Shandong Shiyu Translation Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shandong Shiyu Translation Co ltd filed Critical Shandong Shiyu Translation Co ltd
Priority to CN202111465059.4A priority Critical patent/CN114139560B/en
Publication of CN114139560A publication Critical patent/CN114139560A/en
Application granted granted Critical
Publication of CN114139560B publication Critical patent/CN114139560B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/40Processing or translation of natural language
    • G06F40/58Use of machine translation, e.g. for multi-lingual retrieval, for server-side translation for client devices or for real-time translation

Abstract

The invention relates to a translation system based on artificial intelligence, which comprises: the system comprises a neural network model, a storage module, a first machine translation model, an arrangement module, a second machine translation model, an output module, a user habit acquisition module, a recording module, a cache module and an updating module, wherein the first machine translation model is provided with a receiving unit, a combination unit and a pre-translation unit; the method comprises the steps that firstly, a plurality of pre-translation targets are obtained through a neural network model with artificial intelligence, the plurality of pre-translation targets comprise pre-translation targets corresponding to source sentences and pre-translation targets which are related to the source sentences and are expanded, the pre-translation targets are sorted according to habits selected by a user, and a plurality of translation targets selected based on the habits are provided for the user; meanwhile, the selection habits of the user are collected, and the user habits are trained again so as to deduce a translation target according with the selection habits of the user.

Description

Translation system based on artificial intelligence
Technical Field
The invention relates to the technical field of translation systems, in particular to a translation system based on artificial intelligence.
Background
Artificial intelligence research includes robotics, language identification, image identification, natural language processing, expert systems, and the like. In the field of language translation, compared with the traditional dictionary base, the artificial intelligence can analyze the tone and the semantics of the source sentences to be translated, and the translation result is more accurate.
With the research and progress of artificial intelligence technology, neural network machine translation is commonly applied as a new generation of translation technology. The neural network machine translation model has the capabilities of self-learning and continuous updating, so that the artificial intelligence and neural network machine translation model is mainly researched in the field of translation.
In the prior art, for example, publication number "CN 111738025A" discloses an artificial intelligence based translation method, which includes: acquiring a first language text to be translated, wherein the first language text comprises at least one first language word segmentation; obtaining a word vector corresponding to the at least one first language participle in the first language text through a first translation model, wherein the first translation model is obtained by training based on a plurality of pseudo parallel corpus pairs, each pseudo parallel corpus pair comprises a second language corpus and a first language corpus obtained by translating the second language corpus through a second translation model, the second translation model is obtained by training based on a second parallel corpus pair, each second parallel corpus pair comprises a target second language corpus and a plurality of reference first language corpora, and each reference first language corpus is a reference translation of the target second language corpus; predicting, by the first translation model, based on the word vector, at least two candidate second language participles corresponding to each of the first language participles, and a first probability that each of the first language participles is translated into a corresponding candidate second language participle; determining, by the first translation model, a target second language participle corresponding to each first language participle from the candidate second language participles based on the first probability; and fusing each target second language participle through the first translation model to obtain a second language text after the first language text is translated.
Generally speaking, the translation of the language is translation plus revision, and the route is a relatively mature route, in the method, at least two candidate second language participles corresponding to each of the first language participles are predicted, and a first probability that each of the first language participles is translated into a corresponding candidate second language participle is utilized; then, determining target second language participles corresponding to the first language participles from the candidate second language participles through the first translation model based on the first probability; and fusing each target second language participle through the first translation model. The fusion technology needs additional training, and therefore, the neural network resources are occupied.
Disclosure of Invention
In view of the above, the present invention provides an artificial intelligence based translation system to solve the above problems in the prior art.
In order to achieve the purpose, the invention provides the following technical scheme:
an artificial intelligence based translation system comprising:
the neural network model is used for training in the neural network model to obtain a constraint condition set based on the historical habit elements of the translation target selected by the user;
the storage module is used for storing the constraint condition set;
a first machine translation model having a receiving unit, a combining unit, and a pre-translation unit; the receiving unit receives a source sentence to be translated;
the combination unit is connected with the receiving unit and used for recombining the source sentences based on combination elements formed by keywords, semantics and moods of the source sentences to form a plurality of new combination targets;
the pre-translation unit is connected with the receiving unit and translates the new combined target and the source sentences to obtain a plurality of pre-translation targets;
the arrangement module is used for loading a constraint condition set, arranging and combining a plurality of pre-translation targets according to a first constraint condition, and inputting the objects into a second machine translation model according to the sequence of the arrangement and combination, wherein the second machine translation model revises the plurality of pre-translation targets according to the sequence of the arrangement and combination to obtain a plurality of target sentences;
the output module is used for outputting the target sentence to be presented to the user;
the user habit acquisition module is used for selecting a target sentence based on user operation and operation behaviors, marking a source sentence corresponding to the selected target sentence and one or more of a plurality of new combined targets by using a set tag set, and forming acquired data;
the recording module is used for tracking and recording the acquired data to form a tracking log; inputting the tracking log into a neural network model for training to form a constraint condition subunit;
the cache module is used for storing the constraint condition subunit;
the updating module is used for extracting at least one constraint condition subunit in the cache module based on the updating period set by the control module, comparing the constraint condition subunit with the constraint condition set, checking whether the constraint condition subunit is a newly added element, if so, updating the constraint condition set, and if not, judging the constraint condition set as a repeated element; the deletion is directly performed by the deletion module.
In the above, the neural network model is provided with a recognition unit,
the identification unit is used for acquiring the arrangement of an optimal set, the arrangement of the optimal set is realized by loading a tracking log, scanning the tracking log to acquire an element set contained in the tracking log, and counting the elements contained in the element set according to the same or similar elements;
after statistics, the comparison and arrangement are carried out one by one according to whether the constraint condition set has one same element, two same elements, … … and N same elements, and the set with the least same elements in the comparison result is the optimal set.
In the above, the historical habit elements include a combination of one or more of selection of a type of translation target, selection of clickable keywords or key elements, selection of related parts of speech, semantic selection, selection of associated sentences, selection of external associated word banks, browsing stay time and click viewing times.
In the above, the constraint condition set is formed by inputting the historical habit elements into the neural network model and training according to the frequency of the historical habit elements, and the weight value of each historical habit element is set according to the frequency of the historical habit elements.
In the above, the target sentence corresponds to the pre-translation target one to one or more than one.
In the above, the user habit acquisition module has at least one monitoring unit, the monitoring unit monitors user operation and operation behavior in the user habit acquisition module, and when it is monitored that the user operation and operation behavior is selection of a target sentence, a set tag set is used to label a source sentence corresponding to the selected target sentence, and one or more of a plurality of new combined targets, so as to form acquired data.
In the above, the arrangement module further includes a priority determination unit that determines the arrangement order of the pre-translation targets according to the weight value of each of the historical habit elements in the constraint condition set by loading the constraint condition set.
In the foregoing, the recording module includes at least one tracking unit, and the tracking unit tracks, as a target, a monitoring behavior of the monitoring unit on at least one of a user operation and an operation behavior, captures a monitoring behavior including a selection of a target statement, and records a tracking log.
In the above, the updating module has at least one comparing unit, when the comparing unit performs comparison between the constraint condition subunit and the constraint condition set, if a certain element contained in the constraint condition subunit has an appearance frequency greater than a set threshold in an updating period, and the certain element does not have the same or similar element characteristics in the constraint condition set and can cause a deviation that can cause habitual appearance of a user in selection of a target sentence, the certain element is set as a new element, and a set weight value is given to the new element, and the new element is updated in the constraint condition set; if a certain element contained in the constraint condition subunit has an appearance frequency less than half of a set threshold value in an updating period, directly utilizing a deleting module to delete the element; if a certain element contained in the constraint condition subunit has an occurrence frequency which is more than half of the set threshold value and less than or equal to the set threshold value in the updating period, the certain element is registered in the cache module, and when the updating module in the next period starts updating, the certain element is added into the next updating module for updating.
In the above, the second machine translation model has at least one revision unit therein, each revision unit having a first revision subunit, a second revision subunit, and a comparison unit;
the revision unit and a pre-translation target have a one-to-one or one-to-many correspondence relationship, and the revision unit extracts sentence segments corresponding to the pre-translation target according to the pre-translation target;
the first revision subunit divides the sentence segments into at least one sentence segment according to the set punctuation marks, revises the corresponding contents of the sentence segments one by one, and assembles the sentence segments after revising one by one to form a first revision target;
the second revision subunit directly revises the sentence segments to form a second revision target;
the comparison unit is used for inputting the first revision target and the second revision target into the comparison unit for comparison, carrying out grammar comparison, and directly outputting to obtain a target sentence if the first revision target and the second revision target are the same;
and if the first revision target and the second revision target are different, outputting the target sentence with no errors or least errors in grammar.
The method comprises the steps that firstly, a plurality of pre-translation targets are obtained through a neural network model with artificial intelligence, the plurality of pre-translation targets comprise pre-translation targets corresponding to source sentences and pre-translation targets which are related to the source sentences and are expanded, the pre-translation targets are sorted according to habits selected by a user, and a plurality of translation targets selected based on the habits are provided for the user; meanwhile, the selection habits of the user are collected, and the user habits are trained again so as to deduce a translation target according with the selection habits of the user.
Drawings
FIG. 1 is a schematic diagram of the framework of the present invention;
fig. 2 is a schematic diagram of a framework in an embodiment of the invention.
Detailed Description
The present invention is described in detail below with reference to the accompanying drawings, which refer to fig. 1 to 2.
The invention provides a communication switching method of an intelligent sound box, which comprises the following steps:
an artificial intelligence based translation system comprising:
the neural network model is used for training in the neural network model to obtain a constraint condition set based on the historical habit elements of the translation target selected by the user;
the historical habit elements comprise a combination formed by one or more of selection of the type of a translation target, selection of clickable keywords or key elements, selection of related part of speech, semantic selection, selection of associated sentences, selection of external associated word stock reference, browsing retention time and click viewing times;
the constraint condition set is formed by inputting historical habit elements into a neural network model according to the occurrence frequency of the historical habit elements and training the historical habit elements, and the weight value of each historical habit element is set according to the occurrence frequency of each historical habit element;
the storage module is used for storing the constraint condition set;
a first machine translation model having a receiving unit, a combining unit, and a pre-translation unit; the receiving unit receives a source sentence to be translated;
the combination unit is connected with the receiving unit and used for recombining the source sentences based on combination elements formed by keywords, semantics and moods of the source sentences to form a plurality of new combination targets;
the pre-translation unit is connected with the receiving unit and translates the new combined target and the source sentences to obtain a plurality of pre-translation targets, and the target sentences correspond to the pre-translation targets one by one or correspond to more pre-translation targets;
the arrangement module is used for loading a constraint condition set, arranging and combining a plurality of pre-translation targets according to a first constraint condition, and inputting the objects into a second machine translation model according to the sequence of the arrangement and combination, wherein the second machine translation model revises the plurality of pre-translation targets according to the sequence of the arrangement and combination to obtain a plurality of target sentences;
in the above, the arrangement module further includes a priority determination unit, where the priority determination unit determines the arrangement order of the pre-translation targets according to the weight value of each element of the historical habits in the constraint condition set by loading the constraint condition set;
in the above, the second machine translation model has at least one revision unit therein, each revision unit having a first revision subunit, a second revision subunit, and a comparison unit;
the revision unit and a pre-translation target have a one-to-one or one-to-many correspondence relationship, and the revision unit extracts sentence segments corresponding to the pre-translation target according to the pre-translation target;
the first revision subunit divides the sentence segments into at least one sentence segment according to the set punctuation marks, revises the corresponding contents of the sentence segments one by one, and assembles the sentence segments after revising one by one to form a first revision target;
the second revision subunit directly revises the sentence segments to form a second revision target;
the comparison unit is used for inputting the first revision target and the second revision target into the comparison unit for comparison, carrying out grammar comparison, and directly outputting to obtain a target sentence if the first revision target and the second revision target are the same;
and if the first revision target and the second revision target are different, outputting the target sentence with no errors or least errors in grammar.
The output module is used for outputting the target sentence to be presented to the user;
the user habit acquisition module is used for selecting a target sentence based on user operation and operation behaviors, marking a source sentence corresponding to the selected target sentence and one or more of a plurality of new combined targets by using a set tag set, and forming acquired data;
in the above, the user habit acquisition module has at least one monitoring unit, the monitoring unit monitors user operation and operation behavior in the user habit acquisition module, and when it is monitored that the user operation and operation behavior is selection of a target sentence, a set tag set is used to label a source sentence corresponding to the selected target sentence, and one or more of a plurality of new combined targets, so as to form acquired data.
The recording module is used for tracking and recording the acquired data to form a tracking log; inputting the tracking log into a neural network model for training to form a constraint condition subunit;
the recording module is provided with at least one tracking unit, and the tracking unit takes the monitoring behavior of the monitoring unit on at least one of the user operation and the operation behavior as a target to track, captures the selected monitoring behavior including the target statement and records a tracking log;
the cache module is used for storing the constraint condition subunit;
the updating module is used for extracting at least one constraint condition subunit in the cache module based on the updating period set by the control module, comparing the constraint condition subunit with the constraint condition set, checking whether the constraint condition subunit is a newly added element, if so, updating the constraint condition set, and if not, judging the constraint condition set as a repeated element; the deletion is directly performed by the deletion module.
The neural network model is provided with a recognition unit,
the identification unit is used for acquiring the arrangement of an optimal set, the arrangement of the optimal set is realized by loading a tracking log, scanning the tracking log to acquire an element set contained in the tracking log, and counting the elements contained in the element set according to the same or similar elements;
after statistics, the comparison and arrangement are carried out one by one according to whether the constraint condition set has one same element, two same elements, … … and N same elements, and the set with the least same elements in the comparison result is the optimal set.
In the above, the updating module has at least one comparing unit, when the comparing unit performs comparison between the constraint condition subunit and the constraint condition set, if a certain element contained in the constraint condition subunit has an appearance frequency greater than a set threshold in an updating period, and the certain element does not have the same or similar element characteristics in the constraint condition set and can cause a deviation that can cause habitual appearance of a user in selection of a target sentence, the certain element is set as a new element, and a set weight value is given to the new element, and the new element is updated in the constraint condition set; if a certain element contained in the constraint condition subunit has an appearance frequency less than half of a set threshold value in an updating period, directly utilizing a deleting module to delete the element; if a certain element contained in the constraint condition subunit has an occurrence frequency which is more than half of the set threshold value and less than or equal to the set threshold value in the updating period, the certain element is registered in the cache module, and when the updating module in the next period starts updating, the certain element is added into the next updating module for updating.
In the application, the neural network model is an artificial intelligence carrier and has the capability of self-training and self-learning, specifically, the monitoring unit monitors the user operation and operation behavior in the user habit acquisition module, and when the user operation and operation behavior are monitored to be the selection of a target statement, the tracking unit tracks the monitoring behavior of the monitoring unit on at least one of the user operation and operation behavior as a target, captures the monitoring behavior including the selection of the target statement, and records the tracking log; the neural network model is provided with an identification unit, the identification unit is used for acquiring the arrangement of an optimal set, the arrangement of the optimal set scans a tracking log by loading the tracking log so as to acquire an element set contained in the tracking log, and statistics is carried out according to the elements contained in the same or similar element sets; after statistics, the elements are compared one by one and arranged according to whether the elements have the same element, two same elements, … … and N, wherein the set with the least elements in the comparison result is the optimal set, and then the elements are arranged according to whether the elements are the optimal set. That is, if the same element as the constraint condition set is more, it is verified that the habit of the user described in the trace log is the same as the habit of the user currently described in the neural network model, and if the habit of the user described in the trace log is different from the habit of the user currently described in the neural network model, it is verified that the habit of the user has a deviation at a certain time, and at this time, it is necessary to record the deviation, and when the user uses the translation target again, a translation target that matches the updated habit of the user is recommended to the user.
The principle is as follows: when a user inputs a source sentence, the receiving unit receives the source sentence to be translated;
the combination unit is connected with the receiving unit and used for recombining the source sentences based on combination elements formed by keywords, semantics and moods of the source sentences to form a plurality of new combination targets;
the pre-translation unit is connected with the receiving unit and translates the new combined target and the source sentences to obtain a plurality of pre-translation targets, and the target sentences correspond to the pre-translation targets one by one or correspond to more pre-translation targets;
the arrangement module loads a constraint condition set and arranges and combines a plurality of pre-translation targets according to a first constraint condition, specifically, the magnitude of a weight value of each historical habit element in the constraint condition set determines the arrangement sequence of the pre-translation targets and inputs the arrangement sequence into a second machine translation model according to the sequence of the arrangement and combination, at least one revision unit is arranged in the second machine translation model, and each revision unit is provided with a first revision subunit, a second revision subunit and a comparison unit;
the revision unit and a pre-translation target have a one-to-one or one-to-many correspondence relationship, and the revision unit extracts sentence segments corresponding to the pre-translation target according to the pre-translation target;
the first revision subunit divides the sentence segments into at least one sentence segment according to the set punctuation marks, revises the corresponding contents of the sentence segments one by one, and assembles the sentence segments after revising one by one to form a first revision target;
the second revision subunit directly revises the sentence segments to form a second revision target;
the comparison unit is used for inputting the first revision target and the second revision target into the comparison unit for comparison, carrying out grammar comparison, and directly outputting to obtain a target sentence if the first revision target and the second revision target are the same;
and if the first revision target and the second revision target are different, outputting the target sentence with no errors or least errors in grammar.
In the invention, if the source sentence is a word, when a user inputs the source sentence, the receiving unit receives the source sentence to be translated;
the combination unit is connected with the receiving unit and expands the words with the same or similar semantics in the source sentences based on the semantics of the source sentences to form a plurality of new combination targets;
the pre-translation unit is connected with the receiving unit and translates the new combined target and the source sentences to obtain a plurality of pre-translation targets, and the target sentences correspond to the pre-translation targets one by one or correspond to more pre-translation targets;
the arrangement module loads a constraint condition set and arranges and combines a plurality of pre-translation targets according to a first constraint condition, specifically, the magnitude of a weight value of each historical habit element in the constraint condition set determines the arrangement sequence of the pre-translation targets and inputs the arrangement sequence into a second machine translation model according to the sequence of the arrangement and combination, at least one revision unit is arranged in the second machine translation model, and each revision unit is provided with a first revision subunit, a second revision subunit and a comparison unit;
the revision unit and a pre-translation target have a one-to-one or one-to-many correspondence relationship, and the revision unit extracts a target word corresponding to the pre-translation target according to the pre-translation target;
and the first revision subunit revises the target words one by one to form a first revision target and outputs the first revision target.
In the present invention, if the source sentence is a complete sentence, when the user inputs the source sentence, the receiving unit receives the source sentence to be translated;
the combination unit is connected with the receiving unit, and is used for recombining the source sentences based on combination elements formed by keywords, semantics and moods of the source sentences to form a plurality of new combination targets, wherein the new combination targets comprise the keywords, low-frequency vocabularies, similar vocabularies expanded based on the keywords and the low-frequency vocabularies and corresponding application examples.
The pre-translation unit is connected with the receiving unit and translates the new combined target and the source sentence to obtain a plurality of pre-translation targets, and the target sentences correspond to the pre-translation targets one by one or correspond to the pre-translation targets one by one;
the arrangement module loads a constraint condition set and arranges and combines a plurality of pre-translation targets according to a first constraint condition, specifically, the magnitude of a weight value of each historical habit element in the constraint condition set determines the arrangement sequence of the pre-translation targets and inputs the arrangement sequence into a second machine translation model according to the sequence of the arrangement and combination, at least one revision unit is arranged in the second machine translation model, and each revision unit is provided with a first revision subunit, a second revision subunit and a comparison unit;
the revision unit and a pre-translation target have a one-to-one or one-to-many correspondence relationship, and the revision unit extracts sentence segments corresponding to the pre-translation target according to the pre-translation target;
the first revising subunit is used for directly revising and directly outputting the keywords, the low-frequency vocabulary and similar vocabulary expanded based on the keywords and the low-frequency vocabulary; dividing a source sentence into at least one sub-sentence according to a set punctuation mark (semicolon), revising the corresponding contents of the sub-sentences one by one, and assembling the sub-sentences after revising one by one to form a first revising target;
the second revision subunit directly revises the source sentence to form a second revision target;
the comparison unit is used for inputting the first revision target and the second revision target into the comparison unit for comparison, carrying out grammar comparison, and directly outputting to obtain a target sentence if the first revision target and the second revision target are the same; and if the first revision target and the second revision target are different, outputting the target sentence with no errors or least errors in grammar.
In the invention, if the source sentence is a complete sentence segment, when a user inputs the source sentence segment, the receiving unit receives the source sentence segment to be translated;
the combination unit is connected with the receiving unit, and is used for recombining the source sentence based on combination elements formed by keywords, semantics and moods of the source sentence to form a plurality of new combination targets, wherein the new combination targets comprise keywords, low-frequency vocabularies, similar vocabularies expanded based on the keywords and the low-frequency vocabularies and corresponding application examples, and splitting the sentence segments based on the sentence segments to form sub-sentence segments according to whether the sentence segments are complete sentences or not;
the pre-translation unit is connected with the receiving unit and translates the new combined target and the source sentence to obtain a plurality of pre-translation targets, and the target sentences correspond to the pre-translation targets one by one or correspond to the pre-translation targets one by one;
the arrangement module loads a constraint condition set and arranges and combines a plurality of pre-translation targets according to a first constraint condition, specifically, the magnitude of a weight value of each historical habit element in the constraint condition set determines the arrangement sequence of the pre-translation targets and inputs the arrangement sequence into a second machine translation model according to the sequence of the arrangement and combination, at least one revision unit is arranged in the second machine translation model, and each revision unit is provided with a first revision subunit, a second revision subunit and a comparison unit;
the revision unit and a pre-translation target have a one-to-one or one-to-many correspondence relationship, and the revision unit extracts sentence segments corresponding to the pre-translation target according to the pre-translation target;
the first revising subunit is used for directly revising and directly outputting the keywords, the low-frequency vocabulary and similar vocabulary expanded based on the keywords and the low-frequency vocabulary; dividing a source sentence segment into at least one sub-sentence segment according to a set punctuation mark (a semicolon or a sentence number), dividing the sub-sentence segment into sub-sentences, revising the sub-sentence segments and the corresponding contents of the sub-sentences corresponding to the word sentence segments one by one, and assembling the sub-sentence segments and the corresponding contents after revising one by one to form a first revising target;
the second revision subunit directly revises the source sentence to form a second revision target;
the comparison unit is used for inputting the first revision target and the second revision target into the comparison unit for comparison, carrying out grammar comparison, and directly outputting to obtain a target sentence if the first revision target and the second revision target are the same; and if the first revision target and the second revision target are different, outputting the target sentence with no errors or least errors in grammar.
The method comprises the steps that firstly, a plurality of pre-translation targets are obtained through a neural network model with artificial intelligence, the plurality of pre-translation targets comprise pre-translation targets corresponding to source sentences and pre-translation targets which are related to the source sentences and are expanded, the pre-translation targets are sorted according to habits selected by a user, and a plurality of translation targets selected based on the habits are provided for the user; meanwhile, the selection habits of the user are collected, and the user habits are trained again so as to deduce a translation target according with the selection habits of the user.
The principles and embodiments of the present invention are explained herein using specific examples, which are presented only to assist in understanding the method and its core concepts of the present invention. The foregoing is only a preferred embodiment of the present invention, and it should be noted that there are objectively infinite specific structures due to the limited character expressions, and it will be apparent to those skilled in the art that a plurality of modifications, decorations or changes may be made without departing from the principle of the present invention, and the technical features described above may be combined in a suitable manner; such modifications, variations, combinations, or adaptations of the invention using its spirit and scope, as defined by the claims, may be directed to other uses and embodiments.

Claims (10)

1. Based on artificial intelligence translation system, its characterized in that includes:
the neural network model is used for training in the neural network model to obtain a constraint condition set based on the historical habit elements of the translation target selected by the user;
the storage module is used for storing the constraint condition set;
a first machine translation model having a receiving unit, a combining unit, and a pre-translation unit; the receiving unit receives a source sentence to be translated;
the combination unit is connected with the receiving unit and used for recombining the source sentences based on combination elements formed by keywords, semantics and moods of the source sentences to form a plurality of new combination targets;
the pre-translation unit is connected with the receiving unit and translates the new combined target and the source sentences to obtain a plurality of pre-translation targets;
the arrangement module is used for loading a constraint condition set, arranging and combining a plurality of pre-translation targets according to a first constraint condition, and inputting the objects into a second machine translation model according to the sequence of the arrangement and combination, wherein the second machine translation model revises the plurality of pre-translation targets according to the sequence of the arrangement and combination to obtain a plurality of target sentences;
the output module is used for outputting the target sentence to be presented to the user;
the user habit acquisition module is used for selecting a target sentence based on user operation and operation behaviors, marking a source sentence corresponding to the selected target sentence and one or more of a plurality of new combined targets by using a set tag set, and forming acquired data;
the recording module is used for tracking and recording the acquired data to form a tracking log; inputting the tracking log into a neural network model for training to form a constraint condition subunit;
the cache module is used for storing the constraint condition subunit;
the updating module is used for extracting at least one constraint condition subunit in the cache module based on the updating period set by the control module, comparing the constraint condition subunit with the constraint condition set, checking whether the constraint condition subunit is a newly added element, if so, updating the constraint condition set, and if not, judging the constraint condition set as a repeated element; the deletion is directly performed by the deletion module.
2. The artificial intelligence based translation system of claim 1, wherein a recognition unit is disposed in the neural network model,
the identification unit is used for acquiring the arrangement of an optimal set, the arrangement of the optimal set is realized by loading a tracking log, scanning the tracking log to acquire an element set contained in the tracking log, and counting the elements contained in the element set according to the same or similar elements;
after statistics, the comparison and arrangement are carried out one by one according to whether the constraint condition set has one same element, two same elements, … … and N same elements, and the set with the least same elements in the comparison result is the optimal set.
3. The artificial intelligence based translation system of claim 1 or 2, wherein the historical habit elements comprise a combination of one or more of selection of type of translation target, selection of clickable keywords or key elements, selection of related parts of speech, semantic selection, selection of associated sentences, selection of referring to external associated word stock, browsing dwell time and number of click views.
4. The artificial intelligence based translation system according to claim 1 or 2, wherein the constraint condition set is formed by training according to the frequency of the historical habit elements input into the neural network model, and the weight value of each historical habit element is set according to the frequency of the historical habit elements.
5. The artificial intelligence based translation system of claim 1, wherein the target sentences correspond one-to-one or one-to-many with pre-translation targets.
6. The artificial intelligence based translation system of claim 1, wherein the user habit acquisition module has at least one monitoring unit, the monitoring unit monitors user operations and operations in the user habit acquisition module, and when it is monitored that the user operations and operations are a selection of a target sentence, a set tag set is used to label one or more of a source sentence and a plurality of new combined targets corresponding to the selected target sentence, so as to form acquired data.
7. The artificial intelligence based translation system according to claim 1, wherein the ranking module further comprises a priority determining part, and the priority determining part determines the ranking order of the pre-translation targets according to the weight value of each element of the historical habits in the constraint set by loading the constraint set.
8. The artificial intelligence based translation system of claim 1, wherein the recording module has at least one tracking unit therein, the tracking unit tracks a monitoring behavior of the monitoring unit on at least one of a user operation and an operation behavior as a target, and captures a monitoring behavior including a selection of a target sentence, and records in a tracking log.
9. The artificial intelligence based translation system according to claim 1, wherein the updating module has at least one comparing unit, and when the comparing unit performs the comparison between the constraint condition subunit and the constraint condition set, if a certain element included in the constraint condition subunit has an appearance frequency greater than a set threshold in the updating period, and the certain element does not have the same or similar element characteristics in the constraint condition set and can cause a deviation that a user habitually appears in the selection of the target sentence, the certain element is set as a new element, and a set weight value is given to the new element, and the new element is updated in the constraint condition set; if a certain element contained in the constraint condition subunit has an appearance frequency less than half of a set threshold value in an updating period, directly utilizing a deleting module to delete the element; if a certain element contained in the constraint condition subunit has an occurrence frequency which is more than half of the set threshold value and less than or equal to the set threshold value in the updating period, the certain element is registered in the cache module, and when the updating module in the next period starts updating, the certain element is added into the next updating module for updating.
10. The artificial intelligence based translation system of claim 1, wherein the second machine translation model has at least one revision unit therein, each revision unit having a first revision subunit, a second revision subunit, and a collation unit;
the revision unit and a pre-translation target have a one-to-one or one-to-many correspondence relationship, and the revision unit extracts sentence segments corresponding to the pre-translation target according to the pre-translation target;
the first revision subunit divides the sentence segments into at least one sentence segment according to the set punctuation marks, revises the corresponding contents of the sentence segments one by one, and assembles the sentence segments after revising one by one to form a first revision target;
the second revision subunit directly revises the sentence segments to form a second revision target;
the comparison unit is used for inputting the first revision target and the second revision target into the comparison unit for comparison, carrying out grammar comparison, and directly outputting to obtain a target sentence if the first revision target and the second revision target are the same;
and if the first revision target and the second revision target are different, outputting the target sentence with no errors or least errors in grammar.
CN202111465059.4A 2021-12-03 2021-12-03 Translation system based on artificial intelligence Active CN114139560B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202111465059.4A CN114139560B (en) 2021-12-03 2021-12-03 Translation system based on artificial intelligence

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202111465059.4A CN114139560B (en) 2021-12-03 2021-12-03 Translation system based on artificial intelligence

Publications (2)

Publication Number Publication Date
CN114139560A true CN114139560A (en) 2022-03-04
CN114139560B CN114139560B (en) 2022-12-09

Family

ID=80387462

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202111465059.4A Active CN114139560B (en) 2021-12-03 2021-12-03 Translation system based on artificial intelligence

Country Status (1)

Country Link
CN (1) CN114139560B (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115828940A (en) * 2022-12-08 2023-03-21 吉林农业科技学院 Chinese-Korean translation system based on corpus processing technology
CN115965017A (en) * 2023-01-04 2023-04-14 北京三维天地科技股份有限公司 Multi-language input and analysis system and method based on development platform

Citations (25)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH06348751A (en) * 1992-11-09 1994-12-22 Ricoh Co Ltd Language translating device
KR20080052282A (en) * 2006-12-05 2008-06-11 한국전자통신연구원 Apparatus and method for unsupervised learning translation relationships among words and phrases in the statistical machine translation system
US20110099000A1 (en) * 2009-10-23 2011-04-28 Samsung Electronics Co., Ltd. Method and device for transliteration
CN102455786A (en) * 2010-10-25 2012-05-16 三星电子(中国)研发中心 System and method for optimizing Chinese sentence input method
CN102789451A (en) * 2011-05-16 2012-11-21 北京百度网讯科技有限公司 Individualized machine translation system, method and translation model training method
CN104679735A (en) * 2013-11-30 2015-06-03 赵会军 Pragmatic machine translation method
US20170046333A1 (en) * 2015-08-13 2017-02-16 Xerox Corporation Predicting translational preferences
CN106649288A (en) * 2016-12-12 2017-05-10 北京百度网讯科技有限公司 Translation method and device based on artificial intelligence
US20180075022A1 (en) * 2016-09-13 2018-03-15 Kabushiki Kaisha Toshiba Machine translation method and machine translation apparatus
CN107861954A (en) * 2017-11-06 2018-03-30 北京百度网讯科技有限公司 Information output method and device based on artificial intelligence
CN109359308A (en) * 2018-10-31 2019-02-19 腾讯科技(武汉)有限公司 Machine translation method, device and readable storage medium storing program for executing
US20190155877A1 (en) * 2017-11-17 2019-05-23 Adobe Inc. Generating a Targeted Summary of Textual Content Tuned to a Target Audience Vocabulary
CN109840331A (en) * 2019-01-31 2019-06-04 沈阳雅译网络技术有限公司 A kind of neural machine translation method based on user-oriented dictionary
CN110457719A (en) * 2019-10-08 2019-11-15 北京金山数字娱乐科技有限公司 A kind of method and device of translation model result reordering
CN111126082A (en) * 2019-12-03 2020-05-08 北京明略软件系统有限公司 Translation method and device
CN111401085A (en) * 2020-04-24 2020-07-10 南京莱科智能工程研究院有限公司 Mobile device machine translation system based on hybrid strategy
CN111507113A (en) * 2020-03-18 2020-08-07 北京捷通华声科技股份有限公司 Method and device for machine-assisted manual translation
US10747962B1 (en) * 2018-03-12 2020-08-18 Amazon Technologies, Inc. Artificial intelligence system using phrase tables to evaluate and improve neural network based machine translation
CN112329482A (en) * 2020-10-28 2021-02-05 北京嘀嘀无限科技发展有限公司 Machine translation method, device, electronic equipment and readable storage medium
US20210056270A1 (en) * 2019-08-20 2021-02-25 Samsung Electronics Co., Ltd. Electronic device and deep learning-based interactive messenger operation method
US20210224491A1 (en) * 2018-09-19 2021-07-22 Samsung Electronics Co., Ltd. Device and method for providing application translation information
CN113343717A (en) * 2021-06-15 2021-09-03 沈阳雅译网络技术有限公司 Neural machine translation method based on translation memory library
CN113408308A (en) * 2021-07-16 2021-09-17 苏州美卡网络科技有限公司 Language translation system and method based on artificial intelligence
US20210294987A1 (en) * 2020-03-20 2021-09-23 Google Llc Model localization for data analytics and business intelligence
CN113569585A (en) * 2021-02-25 2021-10-29 腾讯科技(深圳)有限公司 Translation method and device, storage medium and electronic equipment

Patent Citations (26)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH06348751A (en) * 1992-11-09 1994-12-22 Ricoh Co Ltd Language translating device
KR20080052282A (en) * 2006-12-05 2008-06-11 한국전자통신연구원 Apparatus and method for unsupervised learning translation relationships among words and phrases in the statistical machine translation system
US20110099000A1 (en) * 2009-10-23 2011-04-28 Samsung Electronics Co., Ltd. Method and device for transliteration
CN102455786A (en) * 2010-10-25 2012-05-16 三星电子(中国)研发中心 System and method for optimizing Chinese sentence input method
CN102789451A (en) * 2011-05-16 2012-11-21 北京百度网讯科技有限公司 Individualized machine translation system, method and translation model training method
CN104679735A (en) * 2013-11-30 2015-06-03 赵会军 Pragmatic machine translation method
US20170046333A1 (en) * 2015-08-13 2017-02-16 Xerox Corporation Predicting translational preferences
US20180075022A1 (en) * 2016-09-13 2018-03-15 Kabushiki Kaisha Toshiba Machine translation method and machine translation apparatus
CN106649288A (en) * 2016-12-12 2017-05-10 北京百度网讯科技有限公司 Translation method and device based on artificial intelligence
CN107861954A (en) * 2017-11-06 2018-03-30 北京百度网讯科技有限公司 Information output method and device based on artificial intelligence
US20190155877A1 (en) * 2017-11-17 2019-05-23 Adobe Inc. Generating a Targeted Summary of Textual Content Tuned to a Target Audience Vocabulary
US10747962B1 (en) * 2018-03-12 2020-08-18 Amazon Technologies, Inc. Artificial intelligence system using phrase tables to evaluate and improve neural network based machine translation
US20210224491A1 (en) * 2018-09-19 2021-07-22 Samsung Electronics Co., Ltd. Device and method for providing application translation information
CN109359308A (en) * 2018-10-31 2019-02-19 腾讯科技(武汉)有限公司 Machine translation method, device and readable storage medium storing program for executing
CN109840331A (en) * 2019-01-31 2019-06-04 沈阳雅译网络技术有限公司 A kind of neural machine translation method based on user-oriented dictionary
US20210056270A1 (en) * 2019-08-20 2021-02-25 Samsung Electronics Co., Ltd. Electronic device and deep learning-based interactive messenger operation method
US20210103704A1 (en) * 2019-10-08 2021-04-08 Beijing Kingsoft Digital Entertainment Co., Ltd. Method and Apparatus for Reordering Results of a Translation Model
CN110457719A (en) * 2019-10-08 2019-11-15 北京金山数字娱乐科技有限公司 A kind of method and device of translation model result reordering
CN111126082A (en) * 2019-12-03 2020-05-08 北京明略软件系统有限公司 Translation method and device
CN111507113A (en) * 2020-03-18 2020-08-07 北京捷通华声科技股份有限公司 Method and device for machine-assisted manual translation
US20210294987A1 (en) * 2020-03-20 2021-09-23 Google Llc Model localization for data analytics and business intelligence
CN111401085A (en) * 2020-04-24 2020-07-10 南京莱科智能工程研究院有限公司 Mobile device machine translation system based on hybrid strategy
CN112329482A (en) * 2020-10-28 2021-02-05 北京嘀嘀无限科技发展有限公司 Machine translation method, device, electronic equipment and readable storage medium
CN113569585A (en) * 2021-02-25 2021-10-29 腾讯科技(深圳)有限公司 Translation method and device, storage medium and electronic equipment
CN113343717A (en) * 2021-06-15 2021-09-03 沈阳雅译网络技术有限公司 Neural machine translation method based on translation memory library
CN113408308A (en) * 2021-07-16 2021-09-17 苏州美卡网络科技有限公司 Language translation system and method based on artificial intelligence

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115828940A (en) * 2022-12-08 2023-03-21 吉林农业科技学院 Chinese-Korean translation system based on corpus processing technology
CN115965017A (en) * 2023-01-04 2023-04-14 北京三维天地科技股份有限公司 Multi-language input and analysis system and method based on development platform
CN115965017B (en) * 2023-01-04 2023-11-10 北京三维天地科技股份有限公司 Multi-language input and analysis system and method based on development platform

Also Published As

Publication number Publication date
CN114139560B (en) 2022-12-09

Similar Documents

Publication Publication Date Title
KR102577514B1 (en) Method, apparatus for text generation, device and storage medium
CN110298033B (en) Keyword corpus labeling training extraction system
CN114139560B (en) Translation system based on artificial intelligence
CN110825845A (en) Hierarchical text classification method based on character and self-attention mechanism and Chinese text classification method
CN100483417C (en) Method for catching limit word information, optimizing output and input method system
CN110765759B (en) Intention recognition method and device
CN109460459B (en) Log learning-based dialogue system automatic optimization method
CN108538294B (en) Voice interaction method and device
CN101937430A (en) Method for extracting event sentence pattern from Chinese sentence
CN110442880B (en) Translation method, device and storage medium for machine translation
CN101082909A (en) Method and system for dividing Chinese sentences for recognizing deriving word
CN110717045A (en) Letter element automatic extraction method based on letter overview
CN112131876A (en) Method and system for determining standard problem based on similarity
US20220414463A1 (en) Automated troubleshooter
CN111709242A (en) Chinese punctuation mark adding method based on named entity recognition
CN111401058A (en) Attribute value extraction method and device based on named entity recognition tool
CN113157859A (en) Event detection method based on upper concept information
Watts et al. Unsupervised continuous-valued word features for phrase-break prediction without a part-of-speech tagger.
KR101941924B1 (en) Method for providing association model based intention nano analysis service using cognitive neural network
CN112380848A (en) Text generation method, device, equipment and storage medium
CN115115432B (en) Product information recommendation method and device based on artificial intelligence
CN116974554A (en) Code data processing method, apparatus, computer device and storage medium
Iosif et al. Speech understanding for spoken dialogue systems: From corpus harvesting to grammar rule induction
CN113468311B (en) Knowledge graph-based complex question and answer method, device and storage medium
CN114969347A (en) Defect duplication checking implementation method and device, terminal equipment and storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
CB02 Change of applicant information
CB02 Change of applicant information

Address after: Room 1501, Quanjing Yayuan business building, 33 liuchangshan Road, Shizhong District, Jinan City, Shandong Province

Applicant after: Shandong Shiyu Information Technology Co.,Ltd.

Address before: Room 1501, Quanjing Yayuan business building, 33 liuchangshan Road, Shizhong District, Jinan City, Shandong Province

Applicant before: Shandong Shiyu Translation Co.,Ltd.

GR01 Patent grant
GR01 Patent grant