CN109062913B - Internationalization resource intelligent acquisition method and storage medium - Google Patents


Info

Publication number
CN109062913B
Authority
CN
China
Prior art keywords
entries
entry
dictionary
translated
vocabulary
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201811080850.1A
Other languages
Chinese (zh)
Other versions
CN109062913A (en)
Inventor
刘德建
原玉飞
吴仁海
杨洋
杨文军
郭玉湖
黄梦飞
林浩
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Fujian Tianquan Educational Technology Ltd
Original Assignee
Fujian Tianquan Educational Technology Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Fujian Tianquan Educational Technology Ltd filed Critical Fujian Tianquan Educational Technology Ltd
Priority to CN201811080850.1A priority Critical patent/CN109062913B/en
Publication of CN109062913A publication Critical patent/CN109062913A/en
Application granted granted Critical
Publication of CN109062913B publication Critical patent/CN109062913B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/40Processing or translation of natural language
    • G06F40/58Use of machine translation, e.g. for multi-lingual retrieval, for server-side translation for client devices or for real-time translation
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D10/00Energy efficient computing, e.g. low power processors, power management or thermal management

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Machine Translation (AREA)

Abstract

The invention provides an internationalized resource intelligent acquisition method and a storage medium. The method comprises the following steps: presetting an entry template comprising a Chinese name, a current language identifier, and a translation set; extracting entries from a translated resource library according to the preset entry template to obtain dictionary entries; extracting entries from the item to be translated to obtain entries to be translated; filling the entries to be translated according to the dictionary entries; if the content of a filled entry to be translated is incomplete, submitting the filled entry to a translation end; and receiving the entries to be translated with complete content, and storing them as dictionary entries. The invention continuously improves the speed of responding to translation requirements, ensures translation correctness, and saves translation cost.

Description

Internationalization resource intelligent acquisition method and storage medium
Technical Field
The invention relates to the field of internationalized resources, in particular to an internationalized resource intelligent acquisition method and a storage medium.
Background
In the information era, political, business, and civil exchanges around the world are increasingly widespread, yet language has always been an obstacle to communication: people who cannot read each other's writing or understand each other's speech cannot communicate, and this problem is receiving worldwide attention. Online translation, manual translation, translation machines, and the like have emerged one after another and, to a certain extent, satisfy people's desire to communicate across languages.
Online translation and translation machines work in a similar way: a given sentence is translated in real time using a large database and an intelligent algorithm. In practice, however, sentences often cannot be translated truly accurately, and their original meaning may even be distorted. Such tools are especially unreliable for special terms and highly specialized vocabulary, which causes trouble for users.
Manual translation requires the intervention of a professional, who analyzes the original sentence using professional knowledge and translates it accurately into the target language in light of the actual context. However, the labor cost is high and the response is slow, which increases the economic burden on enterprises and organizations and adds risk.
Therefore, a scheme is needed that responds to translation requirements promptly, ensures translation accuracy, and saves translation cost.
Disclosure of Invention
The technical problem to be solved by the invention is to provide an internationalized resource intelligent acquisition method and a storage medium that improve translation response speed while ensuring translation accuracy.
In order to solve the technical problems, the invention adopts the following technical scheme:
an internationalized resource intelligent acquisition method comprises the following steps:
presetting an entry template comprising a Chinese name, a current language identifier, and a translation set;
extracting entries from a translated resource library according to the preset entry template to obtain dictionary entries;
extracting entries from the item to be translated to obtain entries to be translated;
filling the entries to be translated according to the dictionary entries;
if the content of a filled entry to be translated is incomplete, submitting the filled entry to a translation end;
and receiving the entries to be translated with complete content, and storing them as dictionary entries.
The other technical scheme provided by the invention is as follows:
a computer storage medium having stored thereon a computer program which, when executed by a processor, implements the steps of the above internationalized resource intelligent acquisition method.
The invention has the following beneficial effects: entries are extracted from existing internationalized resources according to the entry template to form a dictionary library; when a translation requirement is received, the dictionary library is searched first, so that existing internationalized resources are fully used and already-translated resources are obtained quickly; any untranslated resources are then submitted to the translation end for manual or intelligent translation, and the translated entries are stored as dictionary entries, so that the dictionary library is continuously completed. The response speed for translation requirements is thereby significantly improved, translation accuracy is ensured, and translation cost is saved.
Drawings
FIG. 1 is a schematic flow chart of an internationalized resource intelligent acquisition method of the invention;
FIG. 2 is a schematic flow chart of a first embodiment;
FIG. 3 is an interaction diagram of the first embodiment.
Detailed Description
In order to describe the technical contents, the achieved objects and effects of the present invention in detail, the following description will be made with reference to the embodiments in conjunction with the accompanying drawings.
The key concept of the invention is as follows: the dictionary library is searched first, so that existing internationalized resources are fully used and already-translated resources are obtained quickly; any untranslated resources are then submitted to the translation end for manual or intelligent translation.
Referring to FIG. 1, the present invention provides an internationalized resource intelligent acquisition method, which includes:
presetting an entry template comprising a Chinese name, a current language identifier, and a translation set;
extracting entries from a translated resource library according to the preset entry template to obtain dictionary entries;
extracting entries from the item to be translated to obtain entries to be translated;
filling the entries to be translated according to the dictionary entries;
if the content of a filled entry to be translated is incomplete, submitting the filled entry to a translation end;
and receiving the entries to be translated with complete content, and storing them as dictionary entries.
As can be seen from the above description, the beneficial effects of the invention are as follows: existing internationalized resources are fully used, and manual or intelligent translation is introduced as a supplement only when they cannot meet the requirement; meanwhile, the dictionary library is optimized as translation proceeds, so translation efficiency keeps improving. The response efficiency for translation requirements and the translation accuracy are both significantly improved, and translation cost is saved.
Further, the entry template also comprises an application field;
the method further comprises:
classifying the dictionary entries according to the application field to obtain a dictionary library.
As can be seen from the above description, classification further improves search and matching efficiency, and hence translation response speed, while also facilitating data maintenance of the dictionary library.
Further, extracting entries from the translated resource library according to the preset entry template to obtain dictionary entries specifically comprises:
constructing a dictionary library;
extracting entries from the translated resource library according to the preset entry template;
sequentially storing the extracted entries in the dictionary library, taking the storage position of each entry in the dictionary library as the entry key and the specific value of the entry as the entry value, and packaging them to obtain dictionary entries.
As can be seen from the above description, entries are extracted from existing internationalized resources and keyed by their storage positions, which supports fast traversal of the dictionary library later, so that translated resources are obtained quickly and translation efficiency is improved.
Further, the storage format of the dictionary library is JSON, properties, xml, js, or Strings;
if the JSON format is adopted, constructing the dictionary library specifically comprises: creating a JSON document;
sequentially storing the extracted entries in the dictionary library, taking the storage position of each entry in the dictionary library as the entry key and the specific value of the entry as the entry value, specifically comprises:
sequentially storing the extracted entries as node values in the nodes of the document;
recursively traversing each layer of nodes of the document, taking the traversal path as the entry key and the node value as the entry value.
As can be seen from the above description, multiple storage formats are supported, and a specialized implementation is provided for each, so that different requirements are better met.
Further, the method further comprises the following steps:
submitting the dictionary entries to a translation end for verification;
and updating the dictionary library according to the verified dictionary entries.
As can be seen from the above description, in a specific embodiment the entries extracted from the translated internationalized resources are verified before being stored in the dictionary library, which ensures the accuracy of the entry content and further improves the translation accuracy of the invention.
Further, the method further comprises the following steps:
updating the data of the dictionary entries stored in the dictionary library at a preset time interval, and clearing invalid dictionary entries.
As can be seen from the above description, an invalidation strategy is also provided, so that translation requirements are served from the latest and most reliable dictionary entries.
The other technical scheme provided by the invention is as follows:
a computer storage medium having stored thereon a computer program which, when executed by a processor, implements the steps of the above internationalized resource intelligent acquisition method.
As can be seen from the above description, those skilled in the art will appreciate that all or part of the above technical solution may be implemented by a computer program instructing related hardware; the program may be stored in a computer-readable storage medium and, when executed, may include the steps of the methods described above.
The storage medium may be a magnetic disk, an optical disk, a read-only memory (ROM), a random access memory (RAM), or the like.
Example 1
Referring to FIG. 2 and FIG. 3, this embodiment further refines the method described above and provides an internationalized resource intelligent acquisition method, which includes:
S1: presetting an entry template comprising attributes such as a Chinese name (name), a current language identifier (language), a translation set (i18n), os, type, and itemType. The name establishes the translation relation between different languages; the language identifies the current language; i18n counts the frequency of the different alternative translations; os and type together determine the application field of the current entry; itemType identifies the type of the entry.
Taking the "confirm" button on a web page as an example, the button should display the Chinese word for "confirm" in a Chinese language environment and may display "OK" in an English language environment; the entry attribute values are then name=confirm, language=en, i18n={OK}, os=browser, type=web, itemType=string.
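The entry template of S1 can be pictured as a small record type. The following is only a minimal sketch, assuming Python; the class name Entry, the methods record and best, and the extra candidate "Confirm" are hypothetical illustrations and are not taken from the patent. The i18n attribute is modelled as a frequency counter because the text says it counts how often each alternative translation occurs.

```python
from collections import Counter
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class Entry:
    """One dictionary entry following the preset entry template (hypothetical layout)."""
    name: str                      # Chinese name; links translations across languages
    language: str                  # identifier of the language of the translations
    i18n: Counter = field(default_factory=Counter)  # candidate translation -> frequency
    os: str = ""                   # together with `type`, determines the application field
    type: str = ""
    item_type: str = "string"      # corresponds to itemType in the text

    def record(self, translation: str) -> None:
        """Count one more occurrence of a candidate translation."""
        self.i18n[translation] += 1

    def best(self) -> Optional[str]:
        """Return the most frequent candidate, or None if nothing is recorded yet."""
        return self.i18n.most_common(1)[0][0] if self.i18n else None


# Values follow the "confirm" button example; "Confirm" is an invented alternative.
confirm = Entry(name="confirm", language="en", os="browser", type="web")
confirm.record("OK")
confirm.record("OK")
confirm.record("Confirm")
print(confirm.best())
```

Keeping frequencies rather than a single string lets the most common translation be chosen when several alternatives have been seen.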
S2: extracting entries from the translated resource library according to the preset entry template to obtain dictionary entries, and generating a dictionary library from the dictionary entries.
Specifically, the existing internationalized resources, namely the translated resource library, are collected first; entries are then extracted from them by comparison against the entry template and used as dictionary entries. Any resource in the translated resource library may be extracted as long as doing so does not affect the validity of the entry.
Specifically, the entry extraction process includes:
S21: constructing a dictionary library;
S22: extracting entries from the translated resource library according to the preset entry template;
S23: sequentially storing the extracted entries in the dictionary library, taking the storage position of each entry in the dictionary library as the entry key and the specific value of the entry as the entry value, and packaging them to obtain dictionary entries.
In a specific application, resource parsing is standardized through a uniform interface, and a specialized implementation can be provided for each kind of resource. The interface, Extracter, defines this standard: it receives a resource and outputs a list of entries.
Example of a resource file in JSON format: the file stores resources as JSON key-value pairs, and JSON supports nesting, for example { "single": "single", "first": { "second": "second" } }. A business may define any number of layers and any number of key-value pairs as needed; names of sibling nodes must not repeat.
Taking JsonExtracter as an example: the original file is parsed into a JSON object, and the nodes of each JSON layer are traversed recursively; the traversal path is taken as the entry key and the final node value as the entry value, the two are packaged into an entry, and the entry is handed to the outer layer for processing. PropertiesExtracter, xmlExtracter, jsExtracter, and StringsExtracter are similar and differ only in how the entry key is constructed. The unified process is: 1. construct the entry key; 2. construct the entry value.
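The recursive traversal described above can be sketched as follows. This is an illustrative sketch in Python, not the patent's implementation: the function names extract_json and extract_properties, the "." path separator, and the simplified properties syntax (one key=value per line, # for comments) are all assumptions. Both functions follow the same two-step shape, building the entry key and then the entry value, and differ only in how the key is obtained.

```python
import json
from typing import Any, Iterator, List, Tuple

Pair = Tuple[str, str]  # (entry key, entry value)


def extract_json(text: str, sep: str = ".") -> List[Pair]:
    """Parse a JSON resource and return one (path, value) pair per leaf node."""

    def walk(node: Any, path: List[str]) -> Iterator[Pair]:
        if isinstance(node, dict):
            # Descend one layer; the child name extends the traversal path.
            for child_name, child in node.items():
                yield from walk(child, path + [child_name])
        else:
            # Leaf: the joined traversal path is the key, the node value is the value.
            yield sep.join(path), str(node)

    return list(walk(json.loads(text), []))


def extract_properties(text: str) -> List[Pair]:
    """Same output shape for a simplified key=value file; the key is the property name."""
    pairs = []
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue  # skip blanks, comments, and malformed lines
        key, value = line.split("=", 1)
        pairs.append((key.strip(), value.strip()))
    return pairs


resource = '{"single": "single", "first": {"second": "second"}}'
print(extract_json(resource))
```

Because sibling node names cannot repeat, the joined path identifies each leaf uniquely within one file. Non-object values such as arrays are treated as leaves in this sketch.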
S3: classifying the dictionary entries in the dictionary library according to the application field in the entry template.
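One way to picture the classification in S3 is to group entries under their application field. The sketch below assumes Python and assumes that the application field is the pair (os, type), as S1 states that these two attributes determine it. The function name classify and the third sample entry (os=ios, type=app, translation "Done") are hypothetical.

```python
from collections import defaultdict
from typing import Dict, List, Tuple

EntryDict = Dict[str, str]


def classify(entries: List[EntryDict]) -> Dict[Tuple[str, str], List[EntryDict]]:
    """Group dictionary entries by application field, taken here as (os, type)."""
    library = defaultdict(list)
    for entry in entries:
        library[(entry["os"], entry["type"])].append(entry)
    return dict(library)


entries = [
    {"name": "confirm", "language": "en", "i18n": "OK", "os": "browser", "type": "web"},
    {"name": "cancel", "language": "en", "i18n": "Cancel", "os": "browser", "type": "web"},
    {"name": "confirm", "language": "en", "i18n": "Done", "os": "ios", "type": "app"},
]
library = classify(entries)
web_names = [e["name"] for e in library[("browser", "web")]]
print(web_names)
```

A lookup for a web page then only needs to scan the ("browser", "web") group, which is the matching-efficiency benefit the description attributes to classification.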
In one embodiment, to ensure the accuracy of the dictionary library resources, the method further includes:
S31: submitting the dictionary entries to a translation end for verification. The translation end may be any terminal with display and editing functions, on which professional translators or administrators verify and confirm the submitted dictionary entries, improving the accuracy of the resources; it may also be an intelligent translation terminal with a high-performance, high-accuracy translation function that verifies dictionary entries automatically.
S32: updating the dictionary library according to the verified dictionary entries. Preferably, only dictionary entries with discrepancies are updated.
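The preferred update rule in S32 can be sketched as a comparison between the stored entries and the verified ones. The Python below is a minimal sketch under stated assumptions: the dictionary library is modelled as a mapping from (name, language) to a single translation, the function name apply_review is hypothetical, and the misspelling "Cancle" is invented sample data.

```python
from typing import Dict, List, Tuple

Key = Tuple[str, str]  # (name, language)


def apply_review(library: Dict[Key, str], reviewed: Dict[Key, str]) -> List[Key]:
    """Write back only the entries whose verified value differs from the stored one."""
    changed = []
    for key, translation in reviewed.items():
        if library.get(key) != translation:
            library[key] = translation
            changed.append(key)
    return changed


library = {("confirm", "en"): "OK", ("cancel", "en"): "Cancle"}
reviewed = {("confirm", "en"): "OK", ("cancel", "en"): "Cancel"}
changed = apply_review(library, reviewed)
print(changed)
```

Returning the list of changed keys makes it easy to log or audit what the verification step actually altered.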
Preferably, this embodiment also provides an invalidation policy, which further improves the correctness of the dictionary library resources and keeps them up to date. Specifically:
S33: updating the data of the dictionary entries stored in the dictionary library at a preset time interval, and clearing invalid dictionary entries.
To ensure that each translation is based on the latest and most reliable entries, regular maintenance periodically reduces the statistics of certain entries and clears invalid entries, ensuring the correctness and validity of the dictionary entries.
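The maintenance pass in S33 might look like the sketch below. The patent does not define what makes an entry invalid, so this Python sketch assumes one possible rule: every pass lowers each translation's frequency by a fixed amount, and a translation whose frequency reaches zero is cleared. The function name maintain, the decay parameter, and the sample counts are hypothetical. Scheduling the pass at the preset interval is left to the caller, for example a timer or cron job.

```python
from typing import Dict


def maintain(library: Dict[str, Dict[str, int]], decay: int = 1) -> None:
    """One maintenance pass over name -> {translation: frequency}, modifying it in place."""
    for name in list(library):
        translations = library[name]
        for translation in list(translations):
            translations[translation] -= decay        # reduce the statistics
            if translations[translation] <= 0:
                del translations[translation]         # clear an invalid translation
        if not translations:
            del library[name]                         # clear an entry with nothing left


library = {
    "confirm": {"OK": 3, "Confirm": 1},
    "cancel": {"Cancel": 1},
}
maintain(library)
print(library)
```

Under this rule, translations that keep being used stay in the dictionary library, while ones that stop being used fade out over successive passes.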
S4: extracting entries from the item to be translated to obtain entries to be translated.
Specifically, when a user has a translation requirement, the entries to be translated are first screened out of the item to be translated.
S5: filling the entries to be translated according to the dictionary entries,
that is, intelligently filling in the translations for the corresponding language based on the dictionary entries in the dictionary library.
S6: if, after filling, every attribute of the entries to be translated has obtained its resource, directly outputting the translated target content and ending the flow.
S7: if the content of a filled entry to be translated is incomplete, that is, some attributes remain unfilled, submitting the filled entry to the translation end, where it is translated by a professional or by an intelligent translation terminal.
S8: receiving the entries to be translated with complete content, and storing them as dictionary entries.
After the entries to be translated are completed, the translated target content is output and the resources in the dictionary library are updated with the completed entries, forming a closed loop that fills the gaps in the dictionary library.
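Steps S5 to S8 above can be sketched as a single function. This is a hedged Python illustration, not the patented implementation: the dictionary library is modelled as a mapping from (name, language) to a translation, the translation end is any callable that receives the unfilled names and returns their translations, and acquire and translation_end are hypothetical names. The stand-in translation end simply capitalizes the name so that the example runs without a human translator.

```python
from typing import Callable, Dict, List, Optional, Tuple

Dictionary = Dict[Tuple[str, str], str]                   # (name, language) -> translation
Translator = Callable[[List[str], str], Dict[str, str]]   # the translation end


def acquire(names: List[str], language: str,
            dictionary: Dictionary, translate: Translator) -> Dict[str, str]:
    # S5: fill every entry that the dictionary library already covers.
    filled: Dict[str, Optional[str]] = {n: dictionary.get((n, language)) for n in names}
    # S6/S7: only entries that are still incomplete go to the translation end.
    missing = [n for n, t in filled.items() if t is None]
    if missing:
        completed = translate(missing, language)
        for n in missing:
            filled[n] = completed[n]
            dictionary[(n, language)] = completed[n]       # S8: store as dictionary entry
    return {n: t for n, t in filled.items() if t is not None}


requests: List[List[str]] = []


def translation_end(names: List[str], language: str) -> Dict[str, str]:
    """Stand-in for a human or intelligent translator; records what it was asked."""
    requests.append(list(names))
    return {n: n.capitalize() for n in names}


dictionary: Dictionary = {("confirm", "en"): "OK"}
first = acquire(["confirm", "cancel"], "en", dictionary, translation_end)
second = acquire(["confirm", "cancel"], "en", dictionary, translation_end)
print(first, requests)
```

The second call is answered entirely from the dictionary library, which is the closed-loop effect described above: each completed translation removes one future request to the translation end.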
Example 2
Corresponding to the first embodiment, this embodiment provides a specific application scenario:
a website has a Chinese-version internationalization file with the content { "key1": "confirm", "key2": "cancel" }, and an English-version internationalization file is to be generated.
The translated resource library contains one entry: name=confirm, language=en, i18n=OK.
The filling steps are as follows:
Entry extraction: extracting the entries yields two entries, named "confirm" and "cancel"; the current goal thus becomes translating the entries named "confirm" and "cancel".
Intelligent filling: comparison with the existing dictionary (the translated resource library) finds an existing entry whose name is "confirm" and whose language is en, which is filled directly into the target resource.
Manual translation: after the previous step, the entry "cancel" is still not internationalized and is translated by a professional; assume the professional gives the translation "Cancel". The English-version internationalized resource is now complete, with the content { "key1": "OK", "key2": "Cancel" }.
Entry import: the final English-version internationalized resource is imported into the dictionary library, which now holds two entries. Entry 1: name=confirm, language=en, i18n=OK. Entry 2: name=cancel, language=en, i18n=Cancel.
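The scenario above can be replayed at the resource-file level. The Python below is only a sketch under assumptions: localize is a hypothetical function name, the dictionary is a plain mapping from (name, language) to a translation, and the professional translator is replaced by a lookup that returns "Cancel" for "cancel". The function also reports which step supplied each key, to make the split between intelligent filling and manual translation visible.

```python
import json
from typing import Callable, Dict, Tuple

Dictionary = Dict[Tuple[str, str], str]


def localize(source_json: str, language: str, dictionary: Dictionary,
             ask_translator: Callable[[str, str], str]) -> Tuple[str, Dict[str, str]]:
    """Build the target-language resource file; return it with the origin of each key."""
    source = json.loads(source_json)                 # resource key -> entry name
    target: Dict[str, str] = {}
    origin: Dict[str, str] = {}
    for key, name in source.items():
        translation = dictionary.get((name, language))        # intelligent filling
        if translation is None:
            translation = ask_translator(name, language)      # manual translation
            dictionary[(name, language)] = translation        # entry import
            origin[key] = "translator"
        else:
            origin[key] = "dictionary"
        target[key] = translation
    return json.dumps(target), origin


dictionary: Dictionary = {("confirm", "en"): "OK"}
english, origin = localize(
    '{"key1": "confirm", "key2": "cancel"}', "en", dictionary,
    lambda name, language: {"cancel": "Cancel"}[name],  # stand-in for the professional
)
print(english)
```

After the run the dictionary holds both entries, matching the entry import step of the scenario.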
Example 3
Corresponding to the first and second embodiments, this embodiment provides a computer storage medium having stored thereon a computer program which, when executed by a processor, implements the steps of the internationalized resource intelligent acquisition method described in the first or second embodiment. The specific steps are not repeated here; refer to the descriptions of the first and second embodiments.
In summary, the internationalized resource intelligent acquisition method and storage medium provided by the invention continuously improve the speed of responding to translation requirements, ensure translation correctness, and save translation cost.
The foregoing description is only illustrative of the present invention and is not intended to limit its scope; all equivalent changes made using the specification and drawings of the present invention, or direct or indirect applications in related technical fields, are likewise included in the scope of protection of the present invention.

Claims (4)

1. The internationalization resource intelligent acquisition method is characterized by comprising the following steps:
the preset entry template comprises a Chinese name, a current language identification and a translation set;
extracting vocabulary entries from the translated resource library according to a preset vocabulary entry template to obtain dictionary vocabulary entries;
extracting an entry to be translated to obtain the entry to be translated;
filling the vocabulary entries to be translated according to the dictionary vocabulary entries;
if the content of the filled entry to be translated is incomplete, submitting the filled entry to be translated to a translation end;
receiving the vocabulary entries to be translated with complete contents, and storing the vocabulary entries as dictionary vocabulary entries;
according to a preset time interval, updating data of dictionary entries stored in a dictionary database, and simultaneously clearing invalid dictionary entries;
if the content of the entry to be translated after filling is complete, directly outputting translated target content;
extracting the vocabulary entry from the translated resource library according to a preset vocabulary entry template to obtain dictionary vocabulary entry, wherein the dictionary vocabulary entry is specifically:
constructing a dictionary library;
extracting vocabulary entries from the translated resource library according to a preset vocabulary entry template;
sequentially storing the extracted entries into a dictionary database, taking the storage positions of the entries in the dictionary database as entry keys, taking the specific values of the entries as entry values, and packaging to obtain dictionary entries;
the entry template further comprises an application field;
the method further comprises the steps of:
and classifying the dictionary entries according to the application field to obtain a dictionary library.
2. The internationalized resource intelligent acquisition method of claim 1, wherein the dictionary library is stored in JSON, properties, xml, js or Strings format;
if the dictionary database is in the JSON format, the dictionary database is constructed specifically as follows: creating a JSON document;
sequentially storing the extracted entries into a dictionary database, taking the storage positions of the entries in the dictionary database as entry keys, and taking the specific values of the entries as entry values, wherein the specific values are as follows:
sequentially storing the extracted entries as node values to all nodes of the document;
each layer node of the recursive document takes the traversing path as the key of the entry, and the node value is taken as the entry value.
3. The internationalized resource intelligent acquisition method of claim 1, further comprising:
submitting dictionary entries to a translation end for verification;
and updating the dictionary base according to the checked dictionary entries.
4. A computer storage medium having stored thereon a computer program, wherein the program, when processed by a processor, is capable of implementing the steps comprised in the internationalized resource intelligent acquisition method of any one of the preceding claims 1 to 3.
CN201811080850.1A 2018-09-17 2018-09-17 Internationalization resource intelligent acquisition method and storage medium Active CN109062913B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201811080850.1A CN109062913B (en) 2018-09-17 2018-09-17 Internationalization resource intelligent acquisition method and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201811080850.1A CN109062913B (en) 2018-09-17 2018-09-17 Internationalization resource intelligent acquisition method and storage medium

Publications (2)

Publication Number Publication Date
CN109062913A CN109062913A (en) 2018-12-21
CN109062913B true CN109062913B (en) 2023-07-11

Family

ID=64762707

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201811080850.1A Active CN109062913B (en) 2018-09-17 2018-09-17 Internationalization resource intelligent acquisition method and storage medium

Country Status (1)

Country Link
CN (1) CN109062913B (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111950300B (en) * 2020-07-22 2021-12-24 广州朗国电子科技有限公司 Translation management method and device for universal software entries, storage medium and server
CN115618887B (en) * 2022-10-17 2023-07-25 北京志凌海纳科技有限公司 Dynamic long text internationalization method, device, equipment and storage medium

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103049436B (en) * 2011-10-12 2015-11-25 北京百度网讯科技有限公司 Obtain method and device, the method and system of generation translation model, the method and system of mechanical translation of language material
CN106502998A (en) * 2016-10-31 2017-03-15 用友网络科技股份有限公司 The multi-lingual entry integrated management system of Product Interface and management method
CN106776587A (en) * 2016-12-19 2017-05-31 新译信息科技(深圳)有限公司 Data processing method and device
CN107748746A (en) * 2017-11-06 2018-03-02 四川长虹电器股份有限公司 A kind of method of automatic management entry

Also Published As

Publication number Publication date
CN109062913A (en) 2018-12-21


Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant