WO2022161307A1 - Text translation method and apparatus, and device and medium - Google Patents

Text translation method and apparatus, and device and medium Download PDF

Info

Publication number
WO2022161307A1
WO2022161307A1 PCT/CN2022/073454 CN2022073454W WO2022161307A1 WO 2022161307 A1 WO2022161307 A1 WO 2022161307A1 CN 2022073454 W CN2022073454 W CN 2022073454W WO 2022161307 A1 WO2022161307 A1 WO 2022161307A1
Authority
WO
WIPO (PCT)
Prior art keywords
text information
translated
target
session
text
Prior art date
Application number
PCT/CN2022/073454
Other languages
French (fr)
Chinese (zh)
Inventor
张凯
Original Assignee
维沃移动通信有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 维沃移动通信有限公司 filed Critical 维沃移动通信有限公司
Publication of WO2022161307A1 publication Critical patent/WO2022161307A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/40Processing or translation of natural language
    • G06F40/58Use of machine translation, e.g. for multi-lingual retrieval, for server-side translation for client devices or for real-time translation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/10Text processing
    • G06F40/194Calculation of difference between files
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/279Recognition of textual entities
    • G06F40/284Lexical analysis, e.g. tokenisation or collocates

Definitions

  • the present application belongs to the technical field of electronic devices, and in particular relates to a text translation method, device, device and medium.
  • NMT Neural Machine Translation
  • each word in a sentence is translated, and then the translation result corresponding to the sentence is displayed.
  • the purpose of the embodiments of the present application is to provide a text translation method, apparatus, device and medium, which can solve the problem of inaccurate sentence translation.
  • an embodiment of the present application provides a text translation method, including:
  • Translate the text information to be translated obtain the first translation text information corresponding to the text information to be translated, and replace the second translation text information corresponding to the first text information in the first translation text information with the first text information to obtain the target translation text information; or,
  • Translate the second text information except the first text information in the text information to be translated obtain the third translation text information corresponding to the second text information, combine the third translation text information and the first text information, and obtain the target translation text information .
  • an embodiment of the present application provides a text translation device, including:
  • the first acquisition module is used to acquire the text information to be translated
  • a first determining module configured to determine first text information in the text information to be translated according to a preset database corresponding to the session to which the text information to be translated belongs;
  • a translation module configured to translate the text information to be translated, obtain first translated text information corresponding to the text information to be translated, and replace the second translated text information corresponding to the first text information in the first translated text information with the first text information , obtain the target translation text information; or, translate the second text information except the first text information in the to-be-translated text information, obtain the third translation text information corresponding to the second text information, and combine the third translation text information and the third translation text information. a text information, and obtain the target translation text information.
  • an embodiment of the present application provides an electronic device, the electronic device includes a processor, a memory, and a program or instruction stored in the memory and executable on the processor.
  • the program or instruction is executed by the processor, the Steps of the text translation method of the first aspect.
  • embodiments of the present application provide a computer-readable storage medium, where programs or instructions are stored on the computer-readable storage medium, and when the programs or instructions are executed by a processor, the steps of the text translation method of the first aspect are implemented.
  • an embodiment of the present application provides a chip, where the chip includes a processor and a communication interface, the communication interface is coupled to the processor, and the processor is configured to run a program or instruction to implement the steps of the text translation method of the first aspect.
  • the first text information in the text information to be translated is determined according to the preset database corresponding to the session to which the text information to be translated belongs, and then the text information to be translated is translated, Obtain the first translated text information corresponding to the text information to be translated, and replace the second translated text information corresponding to the first text information in the first translated text information with the first text information to obtain the target translated text information; or, the text to be translated
  • the second text information other than the first text information in the information is translated to obtain third translated text information corresponding to the second text information, and the third translated text information and the first text information are combined to obtain the target translated text information.
  • the target translated text information includes the first text information in the text information to be translated and the translated text information corresponding to the second text information other than the first text information in the text information to be translated.
  • the target translation text information presents the effect that the first text information in the text information to be translated is not translated, so the accuracy of the translation of the text information can be improved .
  • FIG. 1 is a schematic flowchart of a text translation method provided by an embodiment of the present application.
  • FIG. 2 is a schematic diagram of a session interface provided by an embodiment of the present application.
  • FIG. 3 is a schematic diagram of displaying target translation text information provided by an embodiment of the present application.
  • FIG. 4 is a schematic structural diagram of a text translation device provided by an embodiment of the present application.
  • FIG. 5 is a schematic structural diagram of an electronic device provided by an embodiment of the present application.
  • FIG. 6 is a schematic diagram of a hardware structure of an electronic device implementing an embodiment of the present application.
  • FIG. 1 is a schematic flowchart of a text translation method provided by an embodiment of the present application. As shown in Figure 1, the text translation method can include:
  • S102 Determine first text information in the text information to be translated according to a preset database corresponding to the session to which the text information to be translated belongs;
  • S104 Translate the second text information except the first text information in the text information to be translated, obtain third translation text information corresponding to the second text information, and combine the third translation text information and the first text information to obtain the target translation text information.
  • the first text information in the text information to be translated is determined according to the preset database corresponding to the session to which the text information to be translated belongs, and then the text information to be translated is translated, Obtain the first translated text information corresponding to the text information to be translated, and replace the second translated text information corresponding to the first text information in the first translated text information with the first text information to obtain the target translated text information; or, the text to be translated
  • the second text information other than the first text information in the information is translated to obtain third translated text information corresponding to the second text information, and the third translated text information and the first text information are combined to obtain the target translated text information.
  • the target translated text information includes the first text information in the text information to be translated and the translated text information corresponding to the second text information other than the first text information in the text information to be translated.
  • the target translation text information presents the effect that the first text information in the text information to be translated is not translated, so the accuracy of the translation of the text information can be improved .
  • the session in S102 may be a group chat session in which multiple people chat.
  • FIG. 2 is a schematic diagram of a session interface provided by an embodiment of the present application.
  • the name of the session shown in Figure 2 is "discussion group about mask”
  • the session shown in Figure 2 includes: the text information "whole word mask is not support in chinese" input by user H, the text input by user A The information "Can you support Chinese full-character masks in the future?" and the text information entered by user G "Yes, Chinese mask support is still necessary”.
  • the text information to be translated is "whole word mask is not support in chinese" in FIG. 2 .
  • the preset database corresponding to the session "discussion group about mask” shown in Figure 2 it is determined that the first text information in the text information to be translated "whole word mask is not support in chinese" is "mask”.
  • the preset database corresponding to the conversation in S102 may be a database including text information in multiple languages. It can be understood that, at this time, text information of multiple languages is stored in a database.
  • the preset database corresponding to the session in S102 may include multiple databases, wherein each of the multiple databases corresponds to a language, and each of the multiple databases corresponds to a language.
  • the text information of the language corresponding to the database is stored.
  • the second text information other than "mask” in "whole word mask is not support in chinese” includes “whole”, “word”, “is”, “not", “support”, “in” and “chinese”.
  • FIG. 3 is a schematic diagram of displaying translated text information provided by an embodiment of the present application.
  • the preset database in S102 may include at least one piece of text information.
  • S102 may include: determining the text information in the text information to be translated that matches the text information in the preset database as the first text information.
  • the preset database in S102 may include at least one piece of text information and a weight corresponding to each text information.
  • S102 may include: determining the text information that matches the text information in the preset database in the text information to be translated, and whose weight is greater than the preset threshold, as the first text information.
  • the text information included in the preset database includes “mask”, “bert” and “trans”, wherein the corresponding weights of “mask”, “bert” and “trans” are 95, 10 and 2, respectively.
  • Set the threshold to 30.
  • the weight corresponding to each text information included in the preset database may be the number of times the text information appears in the session corresponding to the preset database.
  • the text translation method provided by the embodiments of the present application may further include: acquiring target text information other than the text information to be translated in the session to which the text information to be translated belongs; determining the target The second language to which the text information belongs; the third text information that does not correspond to the second language in the target text information is identified; a preset database is constructed according to the third text information.
  • a preset database corresponding to the session shown in FIG. 2 is first created, and then “mask” is added to the preset database corresponding to the session shown in FIG. 2 . , and assign the weight corresponding to "mask” to 1.
  • the preset database corresponding to the session shown in FIG. 2 is shown in Table 1.
  • the text translation method provided by the embodiments of the present application may further include: acquiring a target database, where the target database is the same as the text to be translated The database corresponding to the target session whose domain similarity of the session to which the information belongs is greater than the preset threshold; and the preset database is updated according to the target database.
  • the obtained target database corresponding to the target session whose domain similarity shown in FIG. 2 belongs to the session is greater than the preset threshold is shown in Table 2.
  • the above Table 1 is updated according to the above Table 2, and the updated preset database corresponding to the session shown in FIG. 2 is obtained, as shown in Table 3.
  • the text translation method provided by the embodiments of the present application may further include: according to the first vocabulary included in the session to which the text information to be translated belongs and the first vocabulary in the to-be-translated The number of occurrences in the session to which the text information belongs, and a first vocabulary distribution table corresponding to the session to which the text information to be translated belongs is generated; according to the second vocabulary included in the target session and the number of occurrences of the second vocabulary in the target session, the target session corresponding the second vocabulary distribution table; according to the first vocabulary distribution table and the second vocabulary distribution table, determine the similarity of the domain of the conversation to which the text information to be translated belongs and the target conversation.
  • each word included in all conversations in its history and the number of occurrences of each word in the conversation may be counted to generate a word distribution table.
  • the word distribution may be a vector characterization of the conversation, and each dimension in the vector is the number of occurrences of each different word in the conversation.
  • the words that appear in the dialogue of a conversation can reflect the domain of the conversation. For example, if a conversation includes more words related to the medical field, the more likely the conversation is in the medical field. Therefore, the lexical distribution can be used to characterize the domain features of group chats.
  • the first vocabulary distribution table is shown in Table 4
  • the second vocabulary table is shown in Table 5.
  • first word The number of occurrences whole 2 support 30 translate 5 us 3 word 2 mask 100 ... ...
  • the first vocabulary distribution table when determining the similarity between the fields of the conversation to which the text information to be translated belongs and the target conversation according to the first vocabulary distribution table and the second vocabulary distribution table, the first vocabulary distribution table may be calculated. Cosine similarity or Pearson correlation coefficient with the second vocabulary distribution table, etc.
  • a word distribution table corresponding to the conversation when generating the vocabulary distribution table corresponding to each conversation, numbers, prepositions, adverbs, conjunctions, pronouns, etc. in all conversations in the history of the conversation may be eliminated first. Based on the number of occurrences of each word included in the conversation after excluding the above numbers, prepositions, adverbs, conjunctions and pronouns, a word distribution table corresponding to the conversation is generated.
  • the number of occurrences of each word in all dialogues in the conversation history may be counted to form a word distribution table corresponding to the conversation. Then, delete numbers, prepositions, adverbs, conjunctions, and pronouns in the vocabulary distribution table.
  • the text translation method provided in the embodiments of the present application may further include: determining a session that has the same domain identification information as the session to which the text information to be translated belongs as a session with the same domain identification information.
  • a target session whose similarity in the domain of the session to which the text information to be translated belongs is greater than a preset threshold.
  • the realm identification information is used to identify the realm to which the session belongs.
  • the user may set realm identification information for the session.
  • the user when creating the session, the user can select one realm identification information from multiple realm identification information as the realm identification information of the session.
  • the text translation method provided by the embodiments of the present application may further include: receiving a first input of the text information to be translated.
  • S102 may include: in response to the first input, determining the first text information in the text information to be translated according to a preset database corresponding to the session to which the text information to be translated belongs.
  • the user clicks the translation button corresponding to the text information to be translated it means that the user performs the first input.
  • the text translation method provided by the embodiments of the present application may further include: determining a third language corresponding to the text information to be translated; The text information included in the conversation determines the fourth language corresponding to the conversation to which the text information to be translated belongs. Further, the translation model from the third language to the fourth language is used to translate the text information to be translated or the second text information.
  • the first vocabulary included in the session A and the number of occurrences of the first vocabulary in the session A are counted, and a corresponding first vocabulary distribution table of the session A is generated.
  • the second vocabulary included in the conversation B and the number of occurrences of the second vocabulary in the conversation B are counted, and the corresponding second vocabulary distribution table of the conversation B is generated.
  • the similarity of the domain of session A and other sessions is calculated, and the session with the similarity of the domain of session A greater than the preset threshold is used as the target session.
  • a session with a similarity in the domain of session A that is greater than a preset threshold is session C.
  • the preset database corresponding to session C is shown in Table 7.
  • the execution subject may be a text translation device, or a control module in the text translation device for executing the text translation method.
  • the text translation device provided by the embodiment of the present application is described by taking the text translation method performed by the text translation device as an example.
  • FIG. 4 is a schematic structural diagram of a text translation apparatus provided by an embodiment of the present application. As shown in FIG. 4, the text translation apparatus 400 may include:
  • the first obtaining module 401 is used to obtain text information to be translated
  • the first determining module 402 is configured to determine the first text information in the text information to be translated according to the preset database corresponding to the session to which the text information to be translated belongs;
  • the translation module 403 is configured to translate the text information to be translated, obtain the first translated text information corresponding to the text information to be translated, and replace the second translated text information corresponding to the first text information in the first translated text information with the first text information, obtain the target translation text information; or, translate the second text information except the first text information in the to-be-translated text information, obtain the third translation text information corresponding to the second text information, and combine the third translation text information and For the first text information, target translation text information is obtained.
  • the first text information in the text information to be translated is determined according to the preset database corresponding to the session to which the text information to be translated belongs, and then the text information to be translated is translated, Obtain the first translated text information corresponding to the text information to be translated, and replace the second translated text information corresponding to the first text information in the first translated text information with the first text information to obtain the target translated text information; or, the text to be translated
  • the second text information other than the first text information in the information is translated to obtain third translated text information corresponding to the second text information, and the third translated text information and the first text information are combined to obtain the target translated text information.
  • the target translated text information includes the first text information in the text information to be translated and the translated text information corresponding to the second text information other than the first text information in the text information to be translated.
  • the target translation text information presents the effect that the first text information in the text information to be translated is not translated, so the accuracy of the translation of the text information can be improved .
  • the text translation apparatus 400 provided by the embodiments of the present application may further include:
  • the second determining module is used to determine the second language to which the target text information belongs
  • the building module is used for building a preset database according to the third text information.
  • the text translation apparatus 400 provided by the embodiments of the present application may further include:
  • the updating module is used for updating the preset database according to the target database.
  • the text translation apparatus 400 provided by the embodiments of the present application may further include:
  • the first generation module is configured to generate a first vocabulary corresponding to the session to which the text information to be translated belongs according to the first vocabulary included in the session to which the text information to be translated belongs and the number of occurrences of the first vocabulary in the session to which the text information to be translated belongs distribution table;
  • the second generation module is used to generate the second vocabulary distribution table corresponding to the target session according to the second vocabulary included in the target session and the number of occurrences of the second vocabulary in the target session;
  • the third determining module is configured to determine, according to the first vocabulary distribution table and the second vocabulary distribution table, the similarity between the domain of the conversation to which the text information to be translated belongs and the target conversation.
  • the text translation apparatus in this embodiment of the present application may be an apparatus, or may be a component, an integrated circuit, or a chip in a terminal.
  • the apparatus may be a mobile electronic device or a non-mobile electronic device.
  • the mobile electronic device may be a mobile phone, a tablet computer, a notebook computer, a palmtop computer, an in-vehicle electronic device, a wearable device, an ultra-mobile personal computer (UMPC), a netbook, or a personal digital assistant (personal digital assistant).
  • UMPC ultra-mobile personal computer
  • netbook or a personal digital assistant
  • non-mobile electronic devices can be servers, network attached storage (Network Attached Storage, NAS), personal computer (personal computer, PC), television (television, TV), teller machine or self-service machine, etc., this application Examples are not specifically limited.
  • Network Attached Storage NAS
  • personal computer personal computer, PC
  • television television
  • teller machine or self-service machine etc.
  • the text translation apparatus in this embodiment of the present application may be an apparatus with an operating system.
  • the operating system may be an Android (Android) operating system, an ios operating system, or other possible operating systems, which are not specifically limited in the embodiments of the present application.
  • the text translation apparatus provided in the embodiments of the present application can implement each process in the text translation method embodiments shown in FIG. 1 to FIG. 3 , and to avoid repetition, details are not repeated here.
  • an embodiment of the present application further provides an electronic device 500, including a processor 501, a memory 502, and a program or instruction stored in the memory 502 and executable on the processor 501, the program Or when the instruction is executed by the processor 501, each process of the above text translation method embodiment can be implemented, and the same technical effect can be achieved. To avoid repetition, details are not described here.
  • the electronic devices in the embodiments of the present application include the aforementioned mobile electronic devices and non-mobile electronic devices.
  • FIG. 6 is a schematic diagram of a hardware structure of an electronic device implementing an embodiment of the present application.
  • the electronic device 600 includes but is not limited to: a radio frequency unit 601, a network module 602, an audio output unit 603, an input unit 604, a sensor 605, a display unit 606, a user input unit 607, an interface unit 608, a memory 609, and a processor 610, etc. part.
  • the electronic device 600 may also include a power source (such as a battery) for supplying power to various components, and the power source may be logically connected to the processor 610 through a power management system, so as to manage charging, discharging, and power management through the power management system. consumption management and other functions.
  • a power source such as a battery
  • the structure of the electronic device shown in FIG. 6 does not constitute a limitation on the electronic device, and the electronic device may include more or less components than those shown in the figure, or combine some components, or arrange different components, which will not be repeated here. .
  • the processor 610 is configured to: obtain text information to be translated; determine the first text information in the text information to be translated according to a preset database corresponding to the session to which the text information to be translated belongs; translate the text information to be translated to obtain the text information to be translated For the first translation text information corresponding to the text information, the second translation text information corresponding to the first text information in the first translation text information is replaced with the first text information to obtain the target translation text information; The second text information other than the first text information is translated to obtain third translated text information corresponding to the second text information, and the third translated text information and the first text information are combined to obtain the target translated text information.
  • the first text information in the text information to be translated is determined according to the preset database corresponding to the session to which the text information to be translated belongs, and then the text information to be translated is translated, Obtain the first translated text information corresponding to the text information to be translated, and replace the second translated text information corresponding to the first text information in the first translated text information with the first text information to obtain the target translated text information; or, the text to be translated
  • the second text information other than the first text information in the information is translated to obtain third translated text information corresponding to the second text information, and the third translated text information and the first text information are combined to obtain the target translated text information.
  • the target translated text information includes the first text information in the text information to be translated and the translated text information corresponding to the second text information other than the first text information in the text information to be translated.
  • the target translation text information presents the effect that the first text information in the text information to be translated is not translated, so the accuracy of the translation of the text information can be improved .
  • the preset database includes at least one text information and a weight corresponding to each text information; the processor 610 may be specifically configured to:
  • the text information to be translated matches the text information in the preset database, and the text information whose weight is greater than the preset threshold is determined as the first text information.
  • the processor 610 may also be used for:
  • a preset database is constructed.
  • the processor 610 may also be used for:
  • the target database is a database corresponding to a target session whose similarity with the domain to which the session to which the text information to be translated belongs belongs is greater than a preset threshold;
  • the preset database is updated.
  • the processor 610 may also be used for:
  • the similarity of the domain of the conversation to which the text information to be translated belongs and the target conversation are determined.
  • the input unit 604 may include a graphics processor (Graphics Processing Unit, GPU) 6041 and a microphone 6042. Such as camera) to obtain still pictures or video image data for processing.
  • the display unit 606 may include a display panel 6061, which may be configured in the form of a liquid crystal display, an organic light emitting diode, or the like.
  • the user input unit 607 includes a touch panel 6071 and other input devices 6072 .
  • the touch panel 6071 is also called a touch screen.
  • the touch panel 6071 may include two parts, a touch detection device and a touch controller.
  • Other input devices 6072 may include, but are not limited to, physical keyboards, function keys (such as volume control keys, switch keys, etc.), trackballs, mice, and joysticks, which are not described herein again.
  • Memory 609 may be used to store software programs as well as various data, including but not limited to application programs and operating systems.
  • the processor 610 may integrate an application processor and a modem processor, wherein the application processor mainly processes an operating system, a user interface, and an application program, and the like, and the modem processor mainly processes wireless communication. It can be understood that, the above-mentioned modulation and demodulation processor may not be integrated into the processor 610.
  • the embodiments of the present application further provide a computer-readable storage medium, where a program or an instruction is stored on the computer-readable storage medium.
  • a program or an instruction is stored on the computer-readable storage medium.
  • the processor is the processor in the electronic device in the above embodiment.
  • the processor may be, but is not limited to, a general-purpose processor, a special-purpose processor, an application-specific processor, or a field programmable logic circuit.
  • Examples of computer-readable storage media include non-transitory machine-readable storage media such as electronic circuits, semiconductor memory devices, read-only memory (ROM), flash memory, erasable ROM (EROM), floppy disks, CDs -ROM, CD-ROM, HDD, etc.
  • the embodiment of the present application also provides a chip, including a processor and a communication interface, the communication interface is coupled with the processor, and the processor is used for running a program or an instruction, implementing each process of the above text translation method embodiment, and can achieve the same technology The effect, in order to avoid repetition, is not repeated here.
  • the chip mentioned in the embodiments of the present application may also be referred to as a system-on-chip, a system-on-chip, a system-on-a-chip, or a system-on-a-chip, or the like.

Abstract

A text translation method and apparatus, and a device and a medium, which belong to the technical field of electronic devices. The text translation method comprises: acquiring text information to be translated (S101); determining, according to a preset database corresponding to a session to which the text information to be translated belongs, first text information in the text information to be translated (S102); and translating the text information to be translated, so as to obtain first translated text information corresponding to the text information to be translated, and replacing second translated text information, corresponding to the first text information, in the first translated text information with the first text information, so as to obtain target translated text information (S103); or translating second text information other than the first text information in the text information to be translated, so as to obtain third translated text information corresponding to the second text information, and combining the third translated text information and the first text information, so as to obtain target translated text information (S104).

Description

文本翻译方法、装置、设备及介质Text translation method, device, equipment and medium
相关申请的交叉引用CROSS-REFERENCE TO RELATED APPLICATIONS
本申请主张在2021年01月27日在中国提交的中国专利申请号202110112331.4的优先权,其全部内容通过引用包含于此。This application claims priority to Chinese Patent Application No. 202110112331.4 filed in China on Jan. 27, 2021, the entire contents of which are hereby incorporated by reference.
技术领域technical field
本申请属于电子设备技术领域,具体涉及一种文本翻译方法、装置、设备及介质。The present application belongs to the technical field of electronic devices, and in particular relates to a text translation method, device, device and medium.
背景技术Background technique
随着神经机器翻译(Neural Machine Translation,NMT)技术的不断完善和日益成熟,跨语言交流的门槛也随之降低,越来越多的人群在工作群聊、线上会议、直播等场景,使用母语并结合机器翻译与跨国友人进行实时的沟通交流。With the continuous improvement and maturity of Neural Machine Translation (NMT) technology, the threshold for cross-language communication has also been lowered, and more and more people are using Native language combined with machine translation for real-time communication with multinational friends.
相关技术中是将一句话中的每个词都进行翻译,然后显示该句话对应的翻译结果。In the related art, each word in a sentence is translated, and then the translation result corresponding to the sentence is displayed.
但是,在实现本申请过程中,发明人发现相关技术中至少存在如下问题:在某些情况下,对整句话进行整体翻译,则会造成该句话对应的翻译结果不准确,进而在显示该句话对应的翻译结果时,会造成用户误解,影响用户体验。However, in the process of realizing this application, the inventor found that there are at least the following problems in the related art: in some cases, if the whole sentence is translated as a whole, the translation result corresponding to the sentence will be inaccurate, and then the display When the translation result corresponding to this sentence is used, it will cause user misunderstanding and affect user experience.
发明内容SUMMARY OF THE INVENTION
本申请实施例的目的是提供一种文本翻译方法、装置、设备及介质,能够解决句子翻译不准确的问题。The purpose of the embodiments of the present application is to provide a text translation method, apparatus, device and medium, which can solve the problem of inaccurate sentence translation.
为了解决上述技术问题,本申请是这样实现的:In order to solve the above technical problems, this application is implemented as follows:
第一方面,本申请实施例提供了一种文本翻译方法,包括:In a first aspect, an embodiment of the present application provides a text translation method, including:
获取待翻译文本信息;Get the text information to be translated;
根据待翻译文本信息所属的会话对应的预设数据库,确定待翻译文本信息中的第一文本信息;determining the first text information in the text information to be translated according to a preset database corresponding to the session to which the text information to be translated belongs;
对待翻译文本信息进行翻译,得到待翻译文本信息对应的第一翻译文本信息,将第一翻译文本信息中与第一文本信息对应的第二翻译文本信息替换为第一文本信息,得到目标翻译文本信息;或,Translate the text information to be translated, obtain the first translation text information corresponding to the text information to be translated, and replace the second translation text information corresponding to the first text information in the first translation text information with the first text information to obtain the target translation text information; or,
对待翻译文本信息中除第一文本信息之外的第二文本信息进行翻译,得到第二文本信息对应的第三翻译文本信息,组合第三翻译文本信息和第一文本信息,得到目标翻译文本信息。Translate the second text information except the first text information in the text information to be translated, obtain the third translation text information corresponding to the second text information, combine the third translation text information and the first text information, and obtain the target translation text information .
第二方面,本申请实施例提供了一种文本翻译装置,包括:In a second aspect, an embodiment of the present application provides a text translation device, including:
第一获取模块,用于获取待翻译文本信息;The first acquisition module is used to acquire the text information to be translated;
第一确定模块,用于根据待翻译文本信息所属的会话对应的预设数据库,确定待翻译文本信息中的第一文本信息;a first determining module, configured to determine first text information in the text information to be translated according to a preset database corresponding to the session to which the text information to be translated belongs;
翻译模块,用于对待翻译文本信息进行翻译,得到待翻译文本信息对应的第一翻译文本信息,将第一翻译文本信息中与第一文本信息对应的第二翻译文本信息替换为第一文本信息,得到目标翻译文本信息;或,对待翻译文本信息中除第一文本信息之外的第二文本信息进行翻译,得到第二文本信息对应的第三翻译文本信息,组合第三翻译文本信息和第一文本信息,得到目标翻译文本信息。A translation module, configured to translate the text information to be translated, obtain first translated text information corresponding to the text information to be translated, and replace the second translated text information corresponding to the first text information in the first translated text information with the first text information , obtain the target translation text information; or, translate the second text information except the first text information in the to-be-translated text information, obtain the third translation text information corresponding to the second text information, and combine the third translation text information and the third translation text information. a text information, and obtain the target translation text information.
第三方面,本申请实施例提供了一种电子设备,该电子设备包括处理器、存储器及存储在存储器上并可在处理器上运行的程序或指令,程序或指令被处理器执行时实现如第一方面的文本翻译方法的步骤。In a third aspect, an embodiment of the present application provides an electronic device, the electronic device includes a processor, a memory, and a program or instruction stored in the memory and executable on the processor. When the program or instruction is executed by the processor, the Steps of the text translation method of the first aspect.
第四方面,本申请实施例提供了一种计算机可读存储介质,计算机可读存储介质上存储程序或指令,程序或指令被处理器执行时实现如第一方面的文本翻译方法的步骤。In a fourth aspect, embodiments of the present application provide a computer-readable storage medium, where programs or instructions are stored on the computer-readable storage medium, and when the programs or instructions are executed by a processor, the steps of the text translation method of the first aspect are implemented.
第五方面,本申请实施例提供了一种芯片,芯片包括处理器和通信接口,通信接口和处理器耦合,处理器用于运行程序或指令,实现如第一方面的文本翻译方法的步骤。In a fifth aspect, an embodiment of the present application provides a chip, where the chip includes a processor and a communication interface, the communication interface is coupled to the processor, and the processor is configured to run a program or instruction to implement the steps of the text translation method of the first aspect.
在本申请实施例中,在获取到待翻译文本信息后,根据待翻译文本信 息所属的会话对应的预设数据库,确定待翻译文本信息中的第一文本信息,进而对待翻译文本信息进行翻译,得到待翻译文本信息对应的第一翻译文本信息,将第一翻译文本信息中与第一文本信息对应的第二翻译文本信息替换为第一文本信息,得到目标翻译文本信息;或,对待翻译文本信息中除第一文本信息之外的第二文本信息进行翻译,得到第二文本信息对应的第三翻译文本信息,组合第三翻译文本信息和第一文本信息,得到目标翻译文本信息。也就是说,目标翻译文本信息包括待翻译文本信息中的第一文本信息以及待翻译文本信息中除第一文本信息之外的第二文本信息对应的翻译文本信息。在待翻译文本信息中的第一文本信息不需要翻译的情况下,目标翻译文本信息呈现了没有对待翻译文本信息中的第一文本信息进行翻译的效果,因此,能够提高文本信息翻译的准确性。In the embodiment of the present application, after acquiring the text information to be translated, the first text information in the text information to be translated is determined according to the preset database corresponding to the session to which the text information to be translated belongs, and then the text information to be translated is translated, Obtain the first translated text information corresponding to the text information to be translated, and replace the second translated text information corresponding to the first text information in the first translated text information with the first text information to obtain the target translated text information; or, the text to be translated The second text information other than the first text information in the information is translated to obtain third translated text information corresponding to the second text information, and the third translated text information and the first text information are combined to obtain the target translated text information. That is, the target translated text information includes the first text information in the text information to be translated and the translated text information corresponding to the second text information other than the first text information in the text information to be translated. In the case where the first text information in the text information to be translated does not need to be translated, the target translation text information presents the effect that the first text information in the text information to be translated is not translated, so the accuracy of the translation of the text information can be improved .
附图说明Description of drawings
为了更清楚地说明本申请实施例的技术方案,下面将对本申请实施例中所需要使用的附图作简单的介绍,对于本领域普通技术人员来讲,在不付出创造性劳动的前提下,还可以根据这些附图获得其他的附图。In order to illustrate the technical solutions of the embodiments of the present application more clearly, the following briefly introduces the accompanying drawings that need to be used in the embodiments of the present application. For those of ordinary skill in the art, without creative work, the Additional drawings can be obtained from these drawings.
图1是本申请实施例提供的文本翻译方法的流程示意图;1 is a schematic flowchart of a text translation method provided by an embodiment of the present application;
图2是本申请实施例提供的会话界面的示意图;2 is a schematic diagram of a session interface provided by an embodiment of the present application;
图3是本申请实施例提供的显示目标翻译文本信息的示意图;3 is a schematic diagram of displaying target translation text information provided by an embodiment of the present application;
图4是本申请实施例提供的文本翻译装置的结构示意图;4 is a schematic structural diagram of a text translation device provided by an embodiment of the present application;
图5是本申请实施例提供的电子设备的结构示意图;5 is a schematic structural diagram of an electronic device provided by an embodiment of the present application;
图6是实现本申请实施例的电子设备的硬件结构示意图。FIG. 6 is a schematic diagram of a hardware structure of an electronic device implementing an embodiment of the present application.
具体实施方式Detailed ways
下面将详细描述本申请的各个方面的特征和示例性实施例,为了使本申请的目的、技术方案及优点更加清楚明白,以下结合附图及具体实施例,对本申请进行进一步详细描述。应理解,此处所描述的具体实施例仅意在解释本申请,而不是限定本申请。对于本领域技术人员来说,本申请可以在不需要这些具体细节中的一些细节的情况下实施。下面对实施例的 描述仅仅是为了通过示出本申请的示例来提供对本申请更好的理解。The features and exemplary embodiments of various aspects of the present application will be described in detail below. In order to make the purpose, technical solutions and advantages of the present application more clear, the present application will be further described in detail below with reference to the accompanying drawings and specific embodiments. It should be understood that the specific embodiments described herein are only intended to explain the present application, but not to limit the present application. It will be apparent to those skilled in the art that the present application may be practiced without some of these specific details. The following description of the embodiments is merely to provide a better understanding of the present application by illustrating examples of the present application.
需要说明的是,在本文中,诸如第一和第二等之类的关系术语仅仅用来将一个实体或者操作与另一个实体或操作区分开来,而不一定要求或者暗示这些实体或操作之间存在任何这种实际的关系或者顺序。而且,术语“包括”、“包含”或者其任何其他变体意在涵盖非排他性的包含,从而使得包括一系列要素的过程、方法、物品或者设备不仅包括那些要素,而且还包括没有明确列出的其他要素,或者是还包括为这种过程、方法、物品或者设备所固有的要素。在没有更多限制的情况下,由语句“包括……”限定的要素,并不排除在包括所述要素的过程、方法、物品或者设备中还存在另外的相同要素。It should be noted that, in this document, relational terms such as first and second are only used to distinguish one entity or operation from another entity or operation, and do not necessarily require or imply any relationship between these entities or operations. any such actual relationship or sequence exists. Moreover, the terms "comprising", "comprising" or any other variation thereof are intended to encompass a non-exclusive inclusion such that a process, method, article or device that includes a list of elements includes not only those elements, but also includes not explicitly listed or other elements inherent to such a process, method, article or apparatus. Without further limitation, an element defined by the phrase "comprises" does not preclude the presence of additional identical elements in a process, method, article, or device that includes the element.
下面结合附图,通过具体的实施例及其应用场景对本申请实施例提供的文本翻译方法、装置、设备及介质进行详细地说明。The text translation method, apparatus, device, and medium provided by the embodiments of the present application will be described in detail below with reference to the accompanying drawings through specific embodiments and application scenarios thereof.
图1是本申请实施例提供的文本翻译方法的流程示意图。如图1所示,文本翻译方法可以包括:FIG. 1 is a schematic flowchart of a text translation method provided by an embodiment of the present application. As shown in Figure 1, the text translation method can include:
S101:获取待翻译文本信息;S101: Obtain the text information to be translated;
S102:根据待翻译文本信息所属的会话对应的预设数据库,确定待翻译文本信息中的第一文本信息;S102: Determine first text information in the text information to be translated according to a preset database corresponding to the session to which the text information to be translated belongs;
S103:对待翻译文本信息进行翻译,得到待翻译文本信息对应的第一翻译文本信息,将第一翻译文本信息中与第一文本信息对应的第二翻译文本信息替换为第一文本信息,得到目标翻译文本信息;S103: Translate the text information to be translated, obtain first translated text information corresponding to the text information to be translated, and replace the second translated text information corresponding to the first text information in the first translated text information with the first text information to obtain the target translate text messages;
S104:对待翻译文本信息中除第一文本信息之外的第二文本信息进行翻译,得到第二文本信息对应的第三翻译文本信息,组合第三翻译文本信息和第一文本信息,得到目标翻译文本信息。S104: Translate the second text information except the first text information in the text information to be translated, obtain third translation text information corresponding to the second text information, and combine the third translation text information and the first text information to obtain the target translation text information.
上述各步骤的具体实现方式将在下文中进行详细描述。The specific implementation of the above steps will be described in detail below.
在本申请实施例中,在获取到待翻译文本信息后,根据待翻译文本信息所属的会话对应的预设数据库,确定待翻译文本信息中的第一文本信息,进而对待翻译文本信息进行翻译,得到待翻译文本信息对应的第一翻译文本信息,将第一翻译文本信息中与第一文本信息对应的第二翻译文本信息替换为第一文本信息,得到目标翻译文本信息;或,对待翻译文本信 息中除第一文本信息之外的第二文本信息进行翻译,得到第二文本信息对应的第三翻译文本信息,组合第三翻译文本信息和第一文本信息,得到目标翻译文本信息。也就是说,目标翻译文本信息包括待翻译文本信息中的第一文本信息以及待翻译文本信息中除第一文本信息之外的第二文本信息对应的翻译文本信息。在待翻译文本信息中的第一文本信息不需要翻译的情况下,目标翻译文本信息呈现了没有对待翻译文本信息中的第一文本信息进行翻译的效果,因此,能够提高文本信息翻译的准确性。In the embodiment of the present application, after acquiring the text information to be translated, the first text information in the text information to be translated is determined according to the preset database corresponding to the session to which the text information to be translated belongs, and then the text information to be translated is translated, Obtain the first translated text information corresponding to the text information to be translated, and replace the second translated text information corresponding to the first text information in the first translated text information with the first text information to obtain the target translated text information; or, the text to be translated The second text information other than the first text information in the information is translated to obtain third translated text information corresponding to the second text information, and the third translated text information and the first text information are combined to obtain the target translated text information. That is, the target translated text information includes the first text information in the text information to be translated and the translated text information corresponding to the second text information other than the first text information in the text information to be translated. In the case where the first text information in the text information to be translated does not need to be translated, the target translation text information presents the effect that the first text information in the text information to be translated is not translated, so the accuracy of the translation of the text information can be improved .
在本申请实施例的一些可能实现中,S102中的会话可以为多人聊天的群聊会话。如图2所示,图2是本申请实施例提供的会话界面的示意图。其中,图2所示的会话的名称为“关于mask的讨论群”,图2所示的会话中包括:用户H输入的文本信息“whole word mask is not support in chinese”、用户A输入的文本信息“那未来能够支持中文全字mask吗”以及用户G输入的文本信息“是的,中文的mask支持还是很有必要的”。In some possible implementations of the embodiments of the present application, the session in S102 may be a group chat session in which multiple people chat. As shown in FIG. 2 , FIG. 2 is a schematic diagram of a session interface provided by an embodiment of the present application. Among them, the name of the session shown in Figure 2 is "discussion group about mask", and the session shown in Figure 2 includes: the text information "whole word mask is not support in chinese" input by user H, the text input by user A The information "Can you support Chinese full-character masks in the future?" and the text information entered by user G "Yes, Chinese mask support is still necessary".
示例性地,假设待翻译文本信息为图2中的“whole word mask is not support in chinese”。根据图2所示的会话“关于mask的讨论群”对应的预设数据库,确定出待翻译文本信息“whole word mask is not support in chinese”中的第一文本信息为“mask”。Exemplarily, it is assumed that the text information to be translated is "whole word mask is not support in chinese" in FIG. 2 . According to the preset database corresponding to the session "discussion group about mask" shown in Figure 2, it is determined that the first text information in the text information to be translated "whole word mask is not support in chinese" is "mask".
在本申请实施例的一些可能实现中,S102中的会话对应的预设数据库可以为包括多个语种的文本信息的一个数据库。可以理解的是,此时,多个语种的文本信息存储于一个数据库中。In some possible implementations of the embodiments of the present application, the preset database corresponding to the conversation in S102 may be a database including text information in multiple languages. It can be understood that, at this time, text information of multiple languages is stored in a database.
在本申请实施例的一些可能实现中,S102中的会话对应的预设数据库可以包括多个数据库,其中,多个数据库中的每一个数据库对应一种语种,多个数据库中的每一个数据库中存储该数据库对应的语种的文本信息。In some possible implementations of the embodiments of the present application, the preset database corresponding to the session in S102 may include multiple databases, wherein each of the multiple databases corresponds to a language, and each of the multiple databases corresponds to a language. The text information of the language corresponding to the database is stored.
在S103中,翻译“whole word mask is not support in chinese”,得到“whole word mask is not support in chinese”对应的第一翻译文本信息“不支持中文全字掩码”。其中,“不支持中文全字掩码”中与“mask”对应的第二翻译文本信息为“掩码”。将“不支持中文全字掩码”中的“掩 码”替换为“mask”,得到目标翻译文本信息“不支持中文全字mask”。In S103, translate "whole word mask is not support in chinese", and obtain the first translated text information "do not support Chinese full-word mask" corresponding to "whole word mask is not support in chinese". Wherein, the second translation text information corresponding to "mask" in "do not support Chinese full-character mask" is "mask". Replace "mask" in "do not support Chinese full-character mask" with "mask" to get the target translation text information "do not support Chinese full-character mask".
在S104中,“whole word mask is not support in chinese”中除“mask”之外的第二文本信息包括“whole”、“word”、“is”、“not”、“support”、“in”和“chinese”。对第二文本信息进行翻译,得到第二文本信息包括的“whole”、“word”、“is”、“not”、“support”、“in”和“chinese”分别对应的翻译文本信息:“全部的;完整的”、“字”、“是”、“不”、“支持”、“在”和“汉语;中文”。将“全部的;完整的”、“字”、“是”、“不”、“支持”、“在”和“汉语;中文”与“mask”进行组合,得到目标翻译文本信息“不支持中文全字mask”。In S104, the second text information other than "mask" in "whole word mask is not support in chinese" includes "whole", "word", "is", "not", "support", "in" and "chinese". Translate the second text information to obtain translated text information corresponding to "whole", "word", "is", "not", "support", "in" and "chinese" included in the second text information: " All; Complete", "Word", "Yes", "No", "Support", "Zai" and "Chinese; Chinese". Combine "full; complete", "character", "yes", "no", "support", "zai" and "Chinese; Chinese" with "mask" to get the target translation text information "Chinese is not supported Whole word mask".
显示目标翻译文本信息如图3所示,图3是本申请实施例提供的显示翻译文本信息的示意图。Displaying target translation text information is shown in FIG. 3 , which is a schematic diagram of displaying translated text information provided by an embodiment of the present application.
在本申请实施例的一些可能实现中,S102中的预设数据库中可以包括至少一个文本信息。S102可以包括:将待翻译文本信息中与预设数据库中的文本信息相匹配的文本信息,确定为第一文本信息。In some possible implementations of the embodiments of the present application, the preset database in S102 may include at least one piece of text information. S102 may include: determining the text information in the text information to be translated that matches the text information in the preset database as the first text information.
示例性地,假设预设数据库中包括的文本信息有“mask”、“bert”和“trans”。Exemplarily, it is assumed that the text information included in the preset database is "mask", "bert" and "trans".
对于上述待翻译文本信息“whole word mask is not support in chinese”,“whole word mask is not support in chinese”中的“mask”与预设数据库中的文本信息相匹配,则将“mask”确定为上述第一文本信息。For the above text information to be translated "whole word mask is not support in chinese", "mask" in "whole word mask is not support in chinese" matches the text information in the preset database, then "mask" is determined as The above-mentioned first text information.
本申请实施例的一些可能实现中,S102中的预设数据库中可以包括至少一个文本信息和每个文本信息对应的权重。S102可以包括:将待翻译文本信息中与预设数据库中的文本信息相匹配,且权重大于预设阈值的文本信息,确定为第一文本信息。In some possible implementations of the embodiments of the present application, the preset database in S102 may include at least one piece of text information and a weight corresponding to each text information. S102 may include: determining the text information that matches the text information in the preset database in the text information to be translated, and whose weight is greater than the preset threshold, as the first text information.
示例性地,假设预设数据库包括的文本信息有“mask”、“bert”和“trans”,其中,“mask”、“bert”和“trans”对应的权重分别为95、10和2,预设阈值为30。对于上述待翻译文本信息“whole word mask is  not support in chinese”,则将“whole word mask is not support in chinese”中的“mask”确定为上述第一文本信息。Exemplarily, it is assumed that the text information included in the preset database includes "mask", "bert" and "trans", wherein the corresponding weights of "mask", "bert" and "trans" are 95, 10 and 2, respectively. Set the threshold to 30. For the above-mentioned text information to be translated "whole word mask is not support in Chinese", "mask" in "whole word mask is not support in Chinese" is determined as the above-mentioned first text information.
在本申请实施例的一些可能实现中,预设数据库包括的每个文本信息对应的权重可以为该文本信息在该预设数据库对应的会话中出现的次数。In some possible implementations of the embodiments of the present application, the weight corresponding to each text information included in the preset database may be the number of times the text information appears in the session corresponding to the preset database.
在本申请实施例的一些可能实现中,在S102之前,本申请实施例提供的文本翻译方法还可以包括:获取待翻译文本信息所属的会话中除待翻译文本信息以外的目标文本信息;确定目标文本信息所属的第二语种;识别目标文本信息中与第二语种不对应的第三文本信息;根据第三文本信息,构建预设数据库。In some possible implementations of the embodiments of the present application, before S102, the text translation method provided by the embodiments of the present application may further include: acquiring target text information other than the text information to be translated in the session to which the text information to be translated belongs; determining the target The second language to which the text information belongs; the third text information that does not correspond to the second language in the target text information is identified; a preset database is constructed according to the third text information.
示例性地,以图2所示的会话中的文本信息“那未来能够支持中文全字mask吗”为例。“那未来能够支持中文全字mask吗”包括汉字和英语单词,且汉字的数量多于英语单词的数量,则确定“那未来能够支持中文全字mask吗”所属的语种为中文,识别出“那未来能够支持中文全字mask吗”与中文不对应的第三文本信息是“mask”。则根据“mask”,构建图2所示的会话对应的预设数据库。Illustratively, take the text information in the conversation shown in FIG. 2 as an example, "Can the Chinese full-character mask be supported in the future?" "Can it support Chinese full-character masks in the future" includes Chinese characters and English words, and the number of Chinese characters is more than the number of English words, then determine the language of "Can it support Chinese full-character masks in the future?" Can you support Chinese full-character mask in the future?" The third text information that does not correspond to Chinese is "mask". Then, according to the "mask", a preset database corresponding to the session shown in FIG. 2 is constructed.
具体地,当图2所示的会话不存在预设数据库时,先创建图2所示的会话对应的预设数据库,然后,将“mask”添加在图2所示的会话对应的预设数据库中,并将“mask”对应的权重赋值为1。Specifically, when there is no preset database for the session shown in FIG. 2 , a preset database corresponding to the session shown in FIG. 2 is first created, and then “mask” is added to the preset database corresponding to the session shown in FIG. 2 . , and assign the weight corresponding to "mask" to 1.
当图2所示的会话存在其对应的预设数据库时,将该预设数据库中“mask”对应的权重加1。When the session shown in FIG. 2 has its corresponding preset database, the weight corresponding to "mask" in the preset database is increased by 1.
示例性地,图2所示的会话对应的预设数据库如表1所示。Exemplarily, the preset database corresponding to the session shown in FIG. 2 is shown in Table 1.
表1Table 1
文本信息text message 权重Weights
maskmask 3434
bertbert 5050
在本申请实施例的一些可能实现中,在根据第三文本信息,构建预设数据库之后,本申请实施例提供的文本翻译方法还可以包括:获取目标数据库,其中,目标数据库为与待翻译文本信息所属的会话所属领域的相似度大于预设阈值的目标会话对应的数据库;根据目标数据库,更新预设数 据库。In some possible implementations of the embodiments of the present application, after the preset database is constructed according to the third text information, the text translation method provided by the embodiments of the present application may further include: acquiring a target database, where the target database is the same as the text to be translated The database corresponding to the target session whose domain similarity of the session to which the information belongs is greater than the preset threshold; and the preset database is updated according to the target database.
示例性地,获取到的与图2所示的会话所属领域的相似度大于预设阈值的目标会话对应的目标数据库如表2所示。Exemplarily, the obtained target database corresponding to the target session whose domain similarity shown in FIG. 2 belongs to the session is greater than the preset threshold is shown in Table 2.
表2Table 2
文本信息text message 权重Weights
maskmask 6060
transtrans 4040
根据上述表2对上述表1进行更新,得到更新后的图2所示的会话对应的预设数据库,如表3所示。The above Table 1 is updated according to the above Table 2, and the updated preset database corresponding to the session shown in FIG. 2 is obtained, as shown in Table 3.
表3table 3
文本信息text message 权重Weights
maskmask 9494
bertbert 5050
transtrans 4040
在本申请实施例的一些可能实现中,在获取目标数据库之前,本申请实施例提供的文本翻译方法还可以包括:根据待翻译文本信息所属的会话包括的第一词汇以及第一词汇在待翻译文本信息所属的会话中的出现次数,生成待翻译文本信息所属的会话对应的第一词汇分布表;根据目标会话包括的第二词汇以及第二词汇在目标会话中的出现次数,生成目标会话对应的第二词汇分布表;根据第一词汇分布表与第二词汇分布表,确定待翻译文本信息所属的会话与目标会话的领域的相似度。In some possible implementations of the embodiments of the present application, before acquiring the target database, the text translation method provided by the embodiments of the present application may further include: according to the first vocabulary included in the session to which the text information to be translated belongs and the first vocabulary in the to-be-translated The number of occurrences in the session to which the text information belongs, and a first vocabulary distribution table corresponding to the session to which the text information to be translated belongs is generated; according to the second vocabulary included in the target session and the number of occurrences of the second vocabulary in the target session, the target session corresponding the second vocabulary distribution table; according to the first vocabulary distribution table and the second vocabulary distribution table, determine the similarity of the domain of the conversation to which the text information to be translated belongs and the target conversation.
在本申请实施例的一些可能实现中,对于每个会话,可以统计其历史所有对话包括的每个词汇以及每个词汇在该会话中的出现次数,生成词汇分布表。In some possible implementations of the embodiments of the present application, for each conversation, each word included in all conversations in its history and the number of occurrences of each word in the conversation may be counted to generate a word distribution table.
在本申请实施例中,词汇分布可以是会话的向量特征化,向量中的各个维度为会话中各个不同词汇的出现次数。一个会话的对话出现的词汇能够反映该会话的领域。例如,某会话包括的医疗领域相关词汇较多,该会话是医疗领域的可能性越大。因此,词汇分布可以用于表征群聊的领域特 征。In this embodiment of the present application, the word distribution may be a vector characterization of the conversation, and each dimension in the vector is the number of occurrences of each different word in the conversation. The words that appear in the dialogue of a conversation can reflect the domain of the conversation. For example, if a conversation includes more words related to the medical field, the more likely the conversation is in the medical field. Therefore, the lexical distribution can be used to characterize the domain features of group chats.
示例性地,第一词汇分布表如表4所示,第二词汇表如表5所示。Exemplarily, the first vocabulary distribution table is shown in Table 4, and the second vocabulary table is shown in Table 5.
表4Table 4
第一词汇first word 出现次数The number of occurrences
wholewhole 22
支持support 3030
翻译translate 55
我们us 33
wordword 22
maskmask 100100
……... ……...
表5table 5
第二词汇second vocabulary 出现次数The number of occurrences
all 55
支持support 5656
supportsupport 5050
transtrans 3333
wordword 2525
maskmask 33
……... ……...
在本申请实施例的一些可能实现中,在根据第一词汇分布表与第二词汇分布表,确定待翻译文本信息所属的会话与目标会话的领域的相似度时,可以计算第一词汇分布表与第二词汇分布表的余弦相似度或皮尔森相关系数等。In some possible implementations of the embodiments of the present application, when determining the similarity between the fields of the conversation to which the text information to be translated belongs and the target conversation according to the first vocabulary distribution table and the second vocabulary distribution table, the first vocabulary distribution table may be calculated. Cosine similarity or Pearson correlation coefficient with the second vocabulary distribution table, etc.
在本申请实施例的一些可能实现中,在生成每个会话对应的词汇分布表时,可以先剔除该会话的历史所有对话中的数字、介词、副词、连词和代词等。基于剔除上述数字、介词、副词、连词和代词后的该会话包括的 每个词汇的出现次数,生成该会话对应的词汇分布表。In some possible implementations of the embodiments of the present application, when generating the vocabulary distribution table corresponding to each conversation, numbers, prepositions, adverbs, conjunctions, pronouns, etc. in all conversations in the history of the conversation may be eliminated first. Based on the number of occurrences of each word included in the conversation after excluding the above numbers, prepositions, adverbs, conjunctions and pronouns, a word distribution table corresponding to the conversation is generated.
在本申请实施例的一些可能实现中,对于每个会话,可以统计该会话历史所有对话中每个词汇的出现次数,形成该会话对应的词汇分布表。然后,将该词汇分布表中的数字、介词、副词、连词和代词等删除。In some possible implementations of the embodiments of the present application, for each conversation, the number of occurrences of each word in all dialogues in the conversation history may be counted to form a word distribution table corresponding to the conversation. Then, delete numbers, prepositions, adverbs, conjunctions, and pronouns in the vocabulary distribution table.
在本申请实施例的一些可能实现中,在获取目标数据库之前,本申请实施例提供的文本翻译方法还可以包括:将与待翻译文本信息所属的会话具有相同领域标识信息的会话,确定为与待翻译文本信息所属的会话所属领域的相似度大于预设阈值的目标会话。其中,领域标识信息用于标识会话所属的领域。In some possible implementations of the embodiments of the present application, before acquiring the target database, the text translation method provided in the embodiments of the present application may further include: determining a session that has the same domain identification information as the session to which the text information to be translated belongs as a session with the same domain identification information. A target session whose similarity in the domain of the session to which the text information to be translated belongs is greater than a preset threshold. The realm identification information is used to identify the realm to which the session belongs.
在本申请实施例的一些可能实现中,用户可以为会话设置领域标识信息。In some possible implementations of the embodiments of the present application, the user may set realm identification information for the session.
具体地,对于某个会话,用户在创建该会话时,可以从多个领域标识信息中选择一个领域标识信息,作为该会话的领域标识信息。Specifically, for a certain session, when creating the session, the user can select one realm identification information from multiple realm identification information as the realm identification information of the session.
在本申请实施例的一些可能实现中,在S102之前,本申请实施例提供的文本翻译方法还可以包括:接收对于待翻译文本信息的第一输入。相应地,S102可以包括:响应于第一输入,根据待翻译文本信息所属的会话对应的预设数据库,确定待翻译文本信息中的第一文本信息。In some possible implementations of the embodiments of the present application, before S102, the text translation method provided by the embodiments of the present application may further include: receiving a first input of the text information to be translated. Correspondingly, S102 may include: in response to the first input, determining the first text information in the text information to be translated according to a preset database corresponding to the session to which the text information to be translated belongs.
示例性,用户点击待翻译文本信息对应的翻译按钮,则表示用户执行第一输入。Exemplarily, if the user clicks the translation button corresponding to the text information to be translated, it means that the user performs the first input.
在本申请实施例的一些可能实现中,在对文本信息进行翻译之前,本申请实施例提供的文本翻译方法还可以包括:确定待翻译文本信息对应的第三语种;根据待翻译文本信息所属的会话包括的文本信息,确定待翻译文本信息所属的会话对应的第四语种。进而利用由第三语种到第四语种的翻译模型,翻译待翻译文本信息或第二文本信息。In some possible implementations of the embodiments of the present application, before the text information is translated, the text translation method provided by the embodiments of the present application may further include: determining a third language corresponding to the text information to be translated; The text information included in the conversation determines the fourth language corresponding to the conversation to which the text information to be translated belongs. Further, the translation model from the third language to the fourth language is used to translate the text information to be translated or the second text information.
示例性地,确定出待翻译文本信息“whole word mask is not support in chinese”对应的语种为英语。图2所示的会话对应的语种为汉语。则利用英语到汉语的“英译汉”翻译模型,翻译“whole word mask is not support in chinese”或翻译“whole word mask is not support in chinese”中除“mask”之外的文本信息。Exemplarily, it is determined that the language corresponding to the text information to be translated "whole word mask is not support in chinese" is English. The language corresponding to the conversation shown in Figure 2 is Chinese. Then use the English-to-Chinese "English to Chinese" translation model to translate "whole word mask is not support in chinese" or translate the text information except "mask" in "whole word mask is not support in chinese".
下面结合具体的实例对本申请实施例提供的文本翻译方法进行说明。The text translation method provided by the embodiments of the present application will be described below with reference to specific examples.
对于某一会话(比如会话A),当该会话A中有新对话语句时,对该语句的语种进行识别。假设该新对话语句为“那未来能够支持中文全字mask吗”,确定该语句的语种为汉语,并识别出该语句中与汉语不对应的词为“mask”,根据“mask”,更新该会话A对应的预设数据库。假设更新后的该会话A对应的预设数据库如表6所示。For a certain conversation (such as conversation A), when there is a new dialogue sentence in the conversation A, the language of the sentence is identified. Assuming that the new dialogue sentence is "Can the whole Chinese word mask be supported in the future?", determine the language of the sentence as Chinese, and identify the word that does not correspond to Chinese in the sentence as "mask". According to "mask", update the The preset database corresponding to session A. It is assumed that the updated preset database corresponding to the session A is shown in Table 6.
表6Table 6
文本信息text message 权重Weights
maskmask 3434
bertbert 5050
对于该会话A,统计该会话A包括的第一词汇以及第一词汇在该会话A中的出现次数,生成该会话A的对应的第一词汇分布表。For the session A, the first vocabulary included in the session A and the number of occurrences of the first vocabulary in the session A are counted, and a corresponding first vocabulary distribution table of the session A is generated.
对于其他会话中的任一会话(比如会话B),统计该会话B包括的第二词汇以及第二词汇在该会话B中的出现次数,生成该会话B的对应的第二词汇分布表。For any conversation in other conversations (such as conversation B), the second vocabulary included in the conversation B and the number of occurrences of the second vocabulary in the conversation B are counted, and the corresponding second vocabulary distribution table of the conversation B is generated.
根据第一词汇分布表和第二词汇分布表,计算会话A与其他会话的所属领域的相似度,将与会话A的所属领域的相似度大于预设阈值的会话,作为目标会话。According to the first vocabulary distribution table and the second vocabulary distribution table, the similarity of the domain of session A and other sessions is calculated, and the session with the similarity of the domain of session A greater than the preset threshold is used as the target session.
示例性地,与会话A的所属领域的相似度大于预设阈值的会话为会话C。会话C对应的预设数据库如表7所示。Exemplarily, a session with a similarity in the domain of session A that is greater than a preset threshold is session C. The preset database corresponding to session C is shown in Table 7.
表7Table 7
文本信息text message 权重Weights
maskmask 6060
transtrans 4040
基于上述表7对上述表6进行更新,得到更新后的会话A的预设数据库如表8所示。The above-mentioned Table 6 is updated based on the above-mentioned Table 7, and the updated preset database of the session A is obtained as shown in Table 8.
表8Table 8
文本信息text message 权重Weights
maskmask 9494
bertbert 5050
transtrans 4040
当用户选择对会话A中的“whole word mask is not support in chinese”进行翻译时,首先,根据表8确定出“whole word mask is not support in chinese”中的第一文本信息为“mask”,然后,确定出“whole word mask is not support in chinese”对应的语种为英语。该会话A对应的语种为汉语。When the user chooses to translate "whole word mask is not support in chinese" in session A, first, according to Table 8, it is determined that the first text information in "whole word mask is not support in chinese" is "mask", Then, it is determined that the language corresponding to "whole word mask is not support in chinese" is English. The language corresponding to this session A is Chinese.
利用英语到汉语的“英译汉”翻译模型,翻译“whole word mask is not support in chinese”,得到“whole word mask is not support in chinese”对应的翻译文本信息“不支持中文全字掩码”。Using the "English to Chinese" translation model from English to Chinese, translate "whole word mask is not support in chinese", and get the translated text information corresponding to "whole word mask is not support in chinese" "does not support Chinese whole word mask" .
将翻译文本信息“不支持中文全字掩码”中与“mask”对应的文本信息“掩码”替换为“mask”,得到目标翻译文本信息“不支持中文全字mask”。Replace the text information "mask" corresponding to "mask" in the translation text information "does not support Chinese full-character mask" with "mask" to obtain the target translation text information "does not support Chinese full-character mask".
需要说明的是,本申请实施例提供的文本翻译方法,执行主体可以为文本翻译装置,或者该文本翻译装置中的用于执行文本翻译方法的控制模块。本申请实施例中以文本翻译装置执行文本翻译方法为例,说明本申请实施例提供的文本翻译装置。It should be noted that, in the text translation method provided by the embodiments of the present application, the execution subject may be a text translation device, or a control module in the text translation device for executing the text translation method. In the embodiment of the present application, the text translation device provided by the embodiment of the present application is described by taking the text translation method performed by the text translation device as an example.
图4是本申请实施例提供的文本翻译装置的结构示意图。如图4所示,文本翻译装置400可以包括:FIG. 4 is a schematic structural diagram of a text translation apparatus provided by an embodiment of the present application. As shown in FIG. 4, the text translation apparatus 400 may include:
第一获取模块401,用于获取待翻译文本信息;The first obtaining module 401 is used to obtain text information to be translated;
第一确定模块402,用于根据待翻译文本信息所属的会话对应的预设数据库,确定待翻译文本信息中的第一文本信息;The first determining module 402 is configured to determine the first text information in the text information to be translated according to the preset database corresponding to the session to which the text information to be translated belongs;
翻译模块403,用于对待翻译文本信息进行翻译,得到待翻译文本信息对应的第一翻译文本信息,将第一翻译文本信息中与第一文本信息对应的第二翻译文本信息替换为第一文本信息,得到目标翻译文本信息;或,对待翻译文本信息中除第一文本信息之外的第二文本信息进行翻译,得到 第二文本信息对应的第三翻译文本信息,组合第三翻译文本信息和第一文本信息,得到目标翻译文本信息。The translation module 403 is configured to translate the text information to be translated, obtain the first translated text information corresponding to the text information to be translated, and replace the second translated text information corresponding to the first text information in the first translated text information with the first text information, obtain the target translation text information; or, translate the second text information except the first text information in the to-be-translated text information, obtain the third translation text information corresponding to the second text information, and combine the third translation text information and For the first text information, target translation text information is obtained.
在本申请实施例中,在获取到待翻译文本信息后,根据待翻译文本信息所属的会话对应的预设数据库,确定待翻译文本信息中的第一文本信息,进而对待翻译文本信息进行翻译,得到待翻译文本信息对应的第一翻译文本信息,将第一翻译文本信息中与第一文本信息对应的第二翻译文本信息替换为第一文本信息,得到目标翻译文本信息;或,对待翻译文本信息中除第一文本信息之外的第二文本信息进行翻译,得到第二文本信息对应的第三翻译文本信息,组合第三翻译文本信息和第一文本信息,得到目标翻译文本信息。也就是说,目标翻译文本信息包括待翻译文本信息中的第一文本信息以及待翻译文本信息中除第一文本信息之外的第二文本信息对应的翻译文本信息。在待翻译文本信息中的第一文本信息不需要翻译的情况下,目标翻译文本信息呈现了没有对待翻译文本信息中的第一文本信息进行翻译的效果,因此,能够提高文本信息翻译的准确性。In the embodiment of the present application, after acquiring the text information to be translated, the first text information in the text information to be translated is determined according to the preset database corresponding to the session to which the text information to be translated belongs, and then the text information to be translated is translated, Obtain the first translated text information corresponding to the text information to be translated, and replace the second translated text information corresponding to the first text information in the first translated text information with the first text information to obtain the target translated text information; or, the text to be translated The second text information other than the first text information in the information is translated to obtain third translated text information corresponding to the second text information, and the third translated text information and the first text information are combined to obtain the target translated text information. That is, the target translated text information includes the first text information in the text information to be translated and the translated text information corresponding to the second text information other than the first text information in the text information to be translated. In the case where the first text information in the text information to be translated does not need to be translated, the target translation text information presents the effect that the first text information in the text information to be translated is not translated, so the accuracy of the translation of the text information can be improved .
在本申请实施例的一些可能实现中,预设数据库包括至少一个文本信息和每个文本信息对应的权重;第一确定模块402具体可以用于:In some possible implementations of the embodiments of the present application, the preset database includes at least one text information and a weight corresponding to each text information; the first determination module 402 may be specifically used for:
将待翻译文本信息中与预设数据库中的文本信息相匹配,且权重大于预设阈值的文本信息,确定为第一文本信息。The text information to be translated matches the text information in the preset database, and the text information whose weight is greater than the preset threshold is determined as the first text information.
在本申请实施例的一些可能实现中,本申请实施例提供的文本翻译装置400还可以包括:In some possible implementations of the embodiments of the present application, the text translation apparatus 400 provided by the embodiments of the present application may further include:
第二获取模块,用于获取待翻译文本信息所属的会话中除待翻译文本信息以外的目标文本信息;a second acquisition module, configured to acquire target text information other than the text information to be translated in the session to which the text information to be translated belongs;
第二确定模块,用于确定目标文本信息所属的第二语种;The second determining module is used to determine the second language to which the target text information belongs;
识别模块,用于识别目标文本信息中与第二语种不对应的第三文本信息;an identification module for identifying third text information in the target text information that does not correspond to the second language;
构建模块,用于根据第三文本信息,构建预设数据库。The building module is used for building a preset database according to the third text information.
在本申请实施例的一些可能实现中,本申请实施例提供的文本翻译装置400还可以包括:In some possible implementations of the embodiments of the present application, the text translation apparatus 400 provided by the embodiments of the present application may further include:
第三获取模块,用于获取目标数据库,其中,目标数据库为与待翻译 文本信息所属的会话所属领域的相似度大于预设阈值的目标会话对应的数据库;The 3rd acquisition module is used to acquire the target database, wherein, the target database is the database corresponding to the target session whose similarity degree is greater than the preset threshold with the session to which the text information to be translated belongs;
更新模块,用于根据目标数据库,更新预设数据库。The updating module is used for updating the preset database according to the target database.
在本申请实施例的一些可能实现中,本申请实施例提供的文本翻译装置400还可以包括:In some possible implementations of the embodiments of the present application, the text translation apparatus 400 provided by the embodiments of the present application may further include:
第一生成模块,用于根据待翻译文本信息所属的会话包括的第一词汇以及第一词汇在待翻译文本信息所属的会话中的出现次数,生成待翻译文本信息所属的会话对应的第一词汇分布表;The first generation module is configured to generate a first vocabulary corresponding to the session to which the text information to be translated belongs according to the first vocabulary included in the session to which the text information to be translated belongs and the number of occurrences of the first vocabulary in the session to which the text information to be translated belongs distribution table;
第二生成模块,用于根据目标会话包括的第二词汇以及第二词汇在目标会话中的出现次数,生成目标会话对应的第二词汇分布表;The second generation module is used to generate the second vocabulary distribution table corresponding to the target session according to the second vocabulary included in the target session and the number of occurrences of the second vocabulary in the target session;
第三确定模块,用于根据第一词汇分布表与第二词汇分布表,确定待翻译文本信息所属的会话与目标会话的领域的相似度。The third determining module is configured to determine, according to the first vocabulary distribution table and the second vocabulary distribution table, the similarity between the domain of the conversation to which the text information to be translated belongs and the target conversation.
本申请实施例中的文本翻译装置可以是装置,也可以是终端中的部件、集成电路、或芯片。该装置可以是移动电子设备,也可以为非移动电子设备。示例性地,移动电子设备可以为手机、平板电脑、笔记本电脑、掌上电脑、车载电子设备、可穿戴设备、超级移动个人计算机(ultra-mobile personal computer,UMPC)、上网本或者个人数字助理(personal digital assistant,PDA)等,非移动电子设备可以为服务器、网络附属存储器(Network Attached Storage,NAS)、个人计算机(personal computer,PC)、电视机(television,TV)、柜员机或者自助机等,本申请实施例不作具体限定。The text translation apparatus in this embodiment of the present application may be an apparatus, or may be a component, an integrated circuit, or a chip in a terminal. The apparatus may be a mobile electronic device or a non-mobile electronic device. Illustratively, the mobile electronic device may be a mobile phone, a tablet computer, a notebook computer, a palmtop computer, an in-vehicle electronic device, a wearable device, an ultra-mobile personal computer (UMPC), a netbook, or a personal digital assistant (personal digital assistant). assistant, PDA), etc., non-mobile electronic devices can be servers, network attached storage (Network Attached Storage, NAS), personal computer (personal computer, PC), television (television, TV), teller machine or self-service machine, etc., this application Examples are not specifically limited.
本申请实施例中的文本翻译装置可以为具有操作系统的装置。该操作系统可以为安卓(Android)操作系统,可以为ios操作系统,还可以为其他可能的操作系统,本申请实施例不作具体限定。The text translation apparatus in this embodiment of the present application may be an apparatus with an operating system. The operating system may be an Android (Android) operating system, an ios operating system, or other possible operating systems, which are not specifically limited in the embodiments of the present application.
本申请实施例提供的文本翻译装置能够实现图1至图3的文本翻译方法实施例中的各个过程,为避免重复,这里不再赘述。The text translation apparatus provided in the embodiments of the present application can implement each process in the text translation method embodiments shown in FIG. 1 to FIG. 3 , and to avoid repetition, details are not repeated here.
可选的,如图5所示,本申请实施例还提供一种电子设备500,包括处理器501,存储器502,存储在存储器502上并可在处理器501上运行的程序或指令,该程序或指令被处理器501执行时实现上述文本翻译方法 实施例的各个过程,且能达到相同的技术效果,为避免重复,这里不再赘述。Optionally, as shown in FIG. 5 , an embodiment of the present application further provides an electronic device 500, including a processor 501, a memory 502, and a program or instruction stored in the memory 502 and executable on the processor 501, the program Or when the instruction is executed by the processor 501, each process of the above text translation method embodiment can be implemented, and the same technical effect can be achieved. To avoid repetition, details are not described here.
需要注意的是,本申请实施例中的电子设备包括上述的移动电子设备和非移动电子设备。It should be noted that the electronic devices in the embodiments of the present application include the aforementioned mobile electronic devices and non-mobile electronic devices.
图6是实现本申请实施例的电子设备的硬件结构示意图。该电子设备600包括但不限于:射频单元601、网络模块602、音频输出单元603、输入单元604、传感器605、显示单元606、用户输入单元607、接口单元608、存储器609、以及处理器610等部件。FIG. 6 is a schematic diagram of a hardware structure of an electronic device implementing an embodiment of the present application. The electronic device 600 includes but is not limited to: a radio frequency unit 601, a network module 602, an audio output unit 603, an input unit 604, a sensor 605, a display unit 606, a user input unit 607, an interface unit 608, a memory 609, and a processor 610, etc. part.
本领域技术人员可以理解,电子设备600还可以包括给各个部件供电的电源(比如电池),电源可以通过电源管理系统与处理器610逻辑相连,从而通过电源管理系统实现管理充电、放电、以及功耗管理等功能。图6中示出的电子设备结构并不构成对电子设备的限定,电子设备可以包括比图示更多或更少的部件,或者组合某些部件,或者不同的部件布置,在此不再赘述。Those skilled in the art can understand that the electronic device 600 may also include a power source (such as a battery) for supplying power to various components, and the power source may be logically connected to the processor 610 through a power management system, so as to manage charging, discharging, and power management through the power management system. consumption management and other functions. The structure of the electronic device shown in FIG. 6 does not constitute a limitation on the electronic device, and the electronic device may include more or less components than those shown in the figure, or combine some components, or arrange different components, which will not be repeated here. .
其中,处理器610用于:获取待翻译文本信息;根据待翻译文本信息所属的会话对应的预设数据库,确定待翻译文本信息中的第一文本信息;对待翻译文本信息进行翻译,得到待翻译文本信息对应的第一翻译文本信息,将第一翻译文本信息中与第一文本信息对应的第二翻译文本信息替换为第一文本信息,得到目标翻译文本信息;或,对待翻译文本信息中除第一文本信息之外的第二文本信息进行翻译,得到第二文本信息对应的第三翻译文本信息,组合第三翻译文本信息和第一文本信息,得到目标翻译文本信息。The processor 610 is configured to: obtain text information to be translated; determine the first text information in the text information to be translated according to a preset database corresponding to the session to which the text information to be translated belongs; translate the text information to be translated to obtain the text information to be translated For the first translation text information corresponding to the text information, the second translation text information corresponding to the first text information in the first translation text information is replaced with the first text information to obtain the target translation text information; The second text information other than the first text information is translated to obtain third translated text information corresponding to the second text information, and the third translated text information and the first text information are combined to obtain the target translated text information.
在本申请实施例中,在获取到待翻译文本信息后,根据待翻译文本信息所属的会话对应的预设数据库,确定待翻译文本信息中的第一文本信息,进而对待翻译文本信息进行翻译,得到待翻译文本信息对应的第一翻译文本信息,将第一翻译文本信息中与第一文本信息对应的第二翻译文本信息替换为第一文本信息,得到目标翻译文本信息;或,对待翻译文本信息中除第一文本信息之外的第二文本信息进行翻译,得到第二文本信息对应的第三翻译文本信息,组合第三翻译文本信息和第一文本信息,得到目 标翻译文本信息。也就是说,目标翻译文本信息包括待翻译文本信息中的第一文本信息以及待翻译文本信息中除第一文本信息之外的第二文本信息对应的翻译文本信息。在待翻译文本信息中的第一文本信息不需要翻译的情况下,目标翻译文本信息呈现了没有对待翻译文本信息中的第一文本信息进行翻译的效果,因此,能够提高文本信息翻译的准确性。In the embodiment of the present application, after acquiring the text information to be translated, the first text information in the text information to be translated is determined according to the preset database corresponding to the session to which the text information to be translated belongs, and then the text information to be translated is translated, Obtain the first translated text information corresponding to the text information to be translated, and replace the second translated text information corresponding to the first text information in the first translated text information with the first text information to obtain the target translated text information; or, the text to be translated The second text information other than the first text information in the information is translated to obtain third translated text information corresponding to the second text information, and the third translated text information and the first text information are combined to obtain the target translated text information. That is, the target translated text information includes the first text information in the text information to be translated and the translated text information corresponding to the second text information other than the first text information in the text information to be translated. In the case where the first text information in the text information to be translated does not need to be translated, the target translation text information presents the effect that the first text information in the text information to be translated is not translated, so the accuracy of the translation of the text information can be improved .
在本申请实施例的一些可能实现中,预设数据库包括至少一个文本信息和每个文本信息对应的权重;处理器610具体可以用于:In some possible implementations of the embodiments of the present application, the preset database includes at least one text information and a weight corresponding to each text information; the processor 610 may be specifically configured to:
将待翻译文本信息中与预设数据库中的文本信息相匹配,且权重大于预设阈值的文本信息,确定为第一文本信息。The text information to be translated matches the text information in the preset database, and the text information whose weight is greater than the preset threshold is determined as the first text information.
在本申请实施例的一些可能实现中,处理器610还可以用于:In some possible implementations of the embodiments of the present application, the processor 610 may also be used for:
获取待翻译文本信息所属的会话中除待翻译文本信息以外的目标文本信息;Obtain the target text information other than the to-be-translated text information in the session to which the to-be-translated text information belongs;
确定目标文本信息所属的第二语种;Determine the second language to which the target text information belongs;
识别目标文本信息中与第二语种不对应的第三文本信息;Identifying third text information that does not correspond to the second language in the target text information;
根据第三文本信息,构建预设数据库。According to the third text information, a preset database is constructed.
在本申请实施例的一些可能实现中,处理器610还可以用于:In some possible implementations of the embodiments of the present application, the processor 610 may also be used for:
获取目标数据库,其中,目标数据库为与待翻译文本信息所属的会话所属领域的相似度大于预设阈值的目标会话对应的数据库;Acquiring a target database, wherein the target database is a database corresponding to a target session whose similarity with the domain to which the session to which the text information to be translated belongs belongs is greater than a preset threshold;
根据目标数据库,更新预设数据库。According to the target database, the preset database is updated.
在本申请实施例的一些可能实现中,在本申请实施例的一些可能实现中,处理器610还可以用于:In some possible implementations of the embodiments of the present application, in some possible implementations of the embodiments of the present application, the processor 610 may also be used for:
根据待翻译文本信息所属的会话包括的第一词汇以及第一词汇在待翻译文本信息所属的会话中的出现次数,生成待翻译文本信息所属的会话对应的第一词汇分布表;generating a first vocabulary distribution table corresponding to the session to which the text information to be translated belongs according to the first vocabulary included in the session to which the text information to be translated belongs and the number of occurrences of the first vocabulary in the session to which the text information to be translated belongs;
根据目标会话包括的第二词汇以及第二词汇在目标会话中的出现次数,生成目标会话对应的第二词汇分布表;generating a second vocabulary distribution table corresponding to the target session according to the second vocabulary included in the target session and the number of occurrences of the second vocabulary in the target session;
根据第一词汇分布表与第二词汇分布表,确定待翻译文本信息所属的会话与目标会话的领域的相似度。According to the first vocabulary distribution table and the second vocabulary distribution table, the similarity of the domain of the conversation to which the text information to be translated belongs and the target conversation are determined.
应理解的是,本申请实施例中,输入单元604可以包括图形处理器 (Graphics Processing Unit,GPU)6041和麦克风6042,图形处理器6041对在视频捕获模式或图像捕获模式中由图像捕获装置(如摄像头)获得的静态图片或视频的图像数据进行处理。显示单元606可包括显示面板6061,可以采用液晶显示器、有机发光二极管等形式来配置显示面板6061。用户输入单元607包括触控面板6071以及其他输入设备6072。触控面板6071,也称为触摸屏。触控面板6071可包括触摸检测装置和触摸控制器两个部分。其他输入设备6072可以包括但不限于物理键盘、功能键(比如音量控制按键、开关按键等)、轨迹球、鼠标、操作杆,在此不再赘述。存储器609可用于存储软件程序以及各种数据,包括但不限于应用程序和操作系统。处理器610可集成应用处理器和调制解调处理器,其中,应用处理器主要处理操作系统、用户界面和应用程序等,调制解调处理器主要处理无线通信。可以理解的是,上述调制解调处理器也可以不集成到处理器610中。It should be understood that, in this embodiment of the present application, the input unit 604 may include a graphics processor (Graphics Processing Unit, GPU) 6041 and a microphone 6042. Such as camera) to obtain still pictures or video image data for processing. The display unit 606 may include a display panel 6061, which may be configured in the form of a liquid crystal display, an organic light emitting diode, or the like. The user input unit 607 includes a touch panel 6071 and other input devices 6072 . The touch panel 6071 is also called a touch screen. The touch panel 6071 may include two parts, a touch detection device and a touch controller. Other input devices 6072 may include, but are not limited to, physical keyboards, function keys (such as volume control keys, switch keys, etc.), trackballs, mice, and joysticks, which are not described herein again. Memory 609 may be used to store software programs as well as various data, including but not limited to application programs and operating systems. The processor 610 may integrate an application processor and a modem processor, wherein the application processor mainly processes an operating system, a user interface, and an application program, and the like, and the modem processor mainly processes wireless communication. It can be understood that, the above-mentioned modulation and demodulation processor may not be integrated into the processor 610.
本申请实施例还提供一种计算机可读存储介质,计算机可读存储介质上存储有程序或指令,该程序或指令被处理器执行时实现上述文本翻译方法实施例的各个过程,且能达到相同的技术效果,为避免重复,这里不再赘述。The embodiments of the present application further provide a computer-readable storage medium, where a program or an instruction is stored on the computer-readable storage medium. When the program or instruction is executed by a processor, each process of the above text translation method embodiment can be achieved, and the same can be achieved. In order to avoid repetition, the technical effect will not be repeated here.
其中,处理器为上述实施例中的电子设备中的处理器。处理器可以是但不限于是通用处理器、专用处理器、特殊应用处理器或者现场可编程逻辑电路。计算机可读存储介质的示例包括非暂态机器可读存储介质,如电子电路、半导体存储器设备、只读存储器(Read-Only Memory,ROM)、闪存、可擦除ROM(EROM)、软盘、CD-ROM、光盘、硬盘等。The processor is the processor in the electronic device in the above embodiment. The processor may be, but is not limited to, a general-purpose processor, a special-purpose processor, an application-specific processor, or a field programmable logic circuit. Examples of computer-readable storage media include non-transitory machine-readable storage media such as electronic circuits, semiconductor memory devices, read-only memory (ROM), flash memory, erasable ROM (EROM), floppy disks, CDs -ROM, CD-ROM, HDD, etc.
本申请实施例还提供了一种芯片,包括处理器和通信接口,通信接口和处理器耦合,处理器用于运行程序或指令,实现上述文本翻译方法实施例的各个过程,且能达到相同的技术效果,为避免重复,这里不再赘述。The embodiment of the present application also provides a chip, including a processor and a communication interface, the communication interface is coupled with the processor, and the processor is used for running a program or an instruction, implementing each process of the above text translation method embodiment, and can achieve the same technology The effect, in order to avoid repetition, is not repeated here.
应理解,本申请实施例提到的芯片还可以称为系统级芯片、系统芯片、芯片系统或片上系统芯片等。It should be understood that the chip mentioned in the embodiments of the present application may also be referred to as a system-on-chip, a system-on-chip, a system-on-a-chip, or a system-on-a-chip, or the like.
需要说明的是,在本文中,术语“包括”、“包含”或者其任何其他 变体意在涵盖非排他性的包含,从而使得包括一系列要素的过程、方法、物品或者装置不仅包括那些要素,而且还包括没有明确列出的其他要素,或者是还包括为这种过程、方法、物品或者装置所固有的要素。在没有更多限制的情况下,由语句“包括一个……”限定的要素,并不排除在包括该要素的过程、方法、物品或者装置中还存在另外的相同要素。此外,需要指出的是,本申请实施方式中的方法和装置的范围不限按示出或讨论的顺序来执行功能,还可包括根据所涉及的功能按基本同时的方式或按相反的顺序来执行功能,例如,可以按不同于所描述的次序来执行所描述的方法,并且还可以添加、省去、或组合各种步骤。另外,参照某些示例所描述的特征可在其他示例中被组合。It should be noted that, herein, the terms "comprising", "comprising" or any other variation thereof are intended to encompass non-exclusive inclusion, such that a process, method, article or device comprising a series of elements includes not only those elements, It also includes other elements not expressly listed or inherent to such a process, method, article or apparatus. Without further limitation, an element qualified by the phrase "comprising a..." does not preclude the presence of additional identical elements in a process, method, article or apparatus that includes the element. Furthermore, it should be noted that the scope of the methods and apparatus in the embodiments of the present application is not limited to performing the functions in the order shown or discussed, but may also include performing the functions in a substantially simultaneous manner or in the reverse order depending on the functions involved. To perform functions, for example, the described methods may be performed in an order different from that described, and various steps may also be added, omitted, or combined. Additionally, features described with reference to some examples may be combined in other examples.
通过以上的实施方式的描述,本领域的技术人员可以清楚地了解到上述实施例方法可借助软件加必需的通用硬件平台的方式来实现,当然也可以通过硬件,但很多情况下前者是更佳的实施方式。基于这样的理解,本申请的技术方案本质上或者说对相关技术做出贡献的部分可以以软件产品的形式体现出来,该计算机软件产品存储在一个存储介质(如ROM/RAM、磁碟、光盘)中,包括若干指令用以使得一台终端(可以是手机,计算机,服务器,空调器,或者网络设备等)执行本申请各个实施例所述的方法。From the description of the above embodiments, those skilled in the art can clearly understand that the method of the above embodiment can be implemented by means of software plus a necessary general hardware platform, and of course can also be implemented by hardware, but in many cases the former is better implementation. Based on this understanding, the technical solutions of the present application can be embodied in the form of software products in essence or the parts that make contributions to related technologies, and the computer software products are stored in a storage medium (such as ROM/RAM, magnetic disk, optical disk) ), including several instructions to enable a terminal (which may be a mobile phone, a computer, a server, an air conditioner, or a network device, etc.) to execute the methods described in the various embodiments of this application.
上面结合附图对本申请的实施例进行了描述,但是本申请并不局限于上述的具体实施方式,上述的具体实施方式仅仅是示意性的,而不是限制性的,本领域的普通技术人员在本申请的启示下,在不脱离本申请宗旨和权利要求所保护的范围情况下,还可做出很多形式,均属于本申请的保护之内。The embodiments of the present application have been described above in conjunction with the accompanying drawings, but the present application is not limited to the above-mentioned specific embodiments, which are merely illustrative rather than restrictive. Under the inspiration of this application, without departing from the scope of protection of the purpose of this application and the claims, many forms can be made, which all fall within the protection of this application.

Claims (15)

  1. 一种文本翻译方法,包括:A text translation method comprising:
    获取待翻译文本信息;Get the text information to be translated;
    根据所述待翻译文本信息所属的会话对应的预设数据库,确定所述待翻译文本信息中的第一文本信息;determining the first text information in the text information to be translated according to a preset database corresponding to the session to which the text information to be translated belongs;
    对所述待翻译文本信息进行翻译,得到所述待翻译文本信息对应的第一翻译文本信息,将所述第一翻译文本信息中与所述第一文本信息对应的第二翻译文本信息替换为所述第一文本信息,得到目标翻译文本信息;或,Translate the to-be-translated text information to obtain first translated text information corresponding to the to-be-translated text information, and replace the second translated text information corresponding to the first text information in the first translated text information with For the first text information, the target translation text information is obtained; or,
    对所述待翻译文本信息中除所述第一文本信息之外的第二文本信息进行翻译,得到所述第二文本信息对应的第三翻译文本信息,组合所述第三翻译文本信息和所述第一文本信息,得到所述目标翻译文本信息。Translate the second text information other than the first text information in the text information to be translated, obtain third translation text information corresponding to the second text information, and combine the third translated text information and the The first text information is obtained, and the target translation text information is obtained.
  2. 根据权利要求1所述的方法,其中,所述预设数据库包括至少一个文本信息和每个文本信息对应的权重;The method according to claim 1, wherein the preset database includes at least one text information and a weight corresponding to each text information;
    所述根据所述待翻译文本信息所属的会话对应的预设数据库,确定所述待翻译文本信息中的第一文本信息,包括:The determining of the first text information in the text information to be translated according to the preset database corresponding to the session to which the text information to be translated belongs includes:
    将所述待翻译文本信息中与所述预设数据库中的文本信息相匹配,且所述权重大于预设阈值的文本信息,确定为所述第一文本信息。The text information to be translated matches the text information in the preset database, and the text information whose weight is greater than the preset threshold is determined as the first text information.
  3. 根据权利要求1所述的方法,其中,在所述根据所述待翻译文本信息所属的会话对应的预设数据库,确定所述待翻译文本信息中的第一文本信息之前,所述方法还包括:The method according to claim 1, wherein before determining the first text information in the text information to be translated according to the preset database corresponding to the session to which the text information to be translated belongs, the method further comprises: :
    获取所述会话中除所述待翻译文本信息以外的目标文本信息;Acquiring target text information in the session other than the to-be-translated text information;
    确定所述目标文本信息所属的第二语种;determining the second language to which the target text information belongs;
    识别所述目标文本信息中与所述第二语种不对应的第三文本信息;Identifying third text information that does not correspond to the second language in the target text information;
    根据所述第三文本信息,构建所述预设数据库。The preset database is constructed according to the third text information.
  4. 根据权利要求3所述的方法,其中,在所述根据所述第三文本信息,构建所述预设数据库之后,所述方法还包括:The method according to claim 3, wherein after constructing the preset database according to the third text information, the method further comprises:
    获取目标数据库,其中,所述目标数据库为与所述会话所属领域的相似度大于预设阈值的目标会话对应的数据库;Acquiring a target database, wherein the target database is a database corresponding to a target session whose similarity in the field to which the session belongs is greater than a preset threshold;
    根据所述目标数据库,更新所述预设数据库。According to the target database, the preset database is updated.
  5. 根据权利要求4所述的方法,其中,在所述获取目标数据库之前,所述方法还包括:The method according to claim 4, wherein, before the acquiring the target database, the method further comprises:
    根据所述会话包括的第一词汇以及所述第一词汇在所述会话中的出现次数,生成所述会话对应的第一词汇分布表;generating a first vocabulary distribution table corresponding to the conversation according to the first vocabulary included in the conversation and the number of occurrences of the first vocabulary in the conversation;
    根据所述目标会话包括的第二词汇以及所述第二词汇在所述目标会话中的出现次数,生成所述目标会话对应的第二词汇分布表;generating a second vocabulary distribution table corresponding to the target session according to the second vocabulary included in the target session and the number of occurrences of the second vocabulary in the target session;
    根据所述第一词汇分布表与所述第二词汇分布表,确定所述会话与所述目标会话的领域的相似度。According to the first vocabulary distribution table and the second vocabulary distribution table, the similarity of the domain of the conversation and the target conversation is determined.
  6. 一种文本翻译装置,包括:A text translation device, comprising:
    第一获取模块,用于获取待翻译文本信息;The first acquisition module is used to acquire the text information to be translated;
    第一确定模块,用于根据所述待翻译文本信息所属的会话对应的预设数据库,确定所述待翻译文本信息中的第一文本信息;a first determining module, configured to determine first text information in the text information to be translated according to a preset database corresponding to the session to which the text information to be translated belongs;
    翻译模块,用于对所述待翻译文本信息进行翻译,得到所述待翻译文本信息对应的第一翻译文本信息,将所述第一翻译文本信息中与所述第一文本信息对应的第二翻译文本信息替换为所述第一文本信息,得到目标翻译文本信息;或,对所述待翻译文本信息中除所述第一文本信息之外的第二文本信息进行翻译,得到所述第二文本信息对应的第三翻译文本信息,组合所述第三翻译文本信息和所述第一文本信息,得到所述目标翻译文本信息。A translation module, configured to translate the text information to be translated, obtain first translation text information corresponding to the text information to be translated, and convert the second translation text information corresponding to the first text information in the first translated text information The translated text information is replaced with the first text information to obtain the target translated text information; or, the second text information other than the first text information in the to-be-translated text information is translated to obtain the second text information. For the third translated text information corresponding to the text information, the third translated text information and the first text information are combined to obtain the target translated text information.
  7. 根据权利要求6所述的装置,其中,所述预设数据库包括至少一个文本信息和每个文本信息对应的权重;The apparatus according to claim 6, wherein the preset database includes at least one text information and a weight corresponding to each text information;
    所述第一确定模块,具体用于:The first determining module is specifically used for:
    将所述待翻译文本信息中与所述预设数据库中的文本信息相匹配,且所述权重大于预设阈值的文本信息,确定为所述第一文本信息。The text information to be translated matches the text information in the preset database, and the text information whose weight is greater than the preset threshold is determined as the first text information.
  8. 根据权利要求6所述的装置,其中,所述装置还包括:The apparatus of claim 6, wherein the apparatus further comprises:
    第二获取模块,用于获取所述会话中除所述待翻译文本信息以外的目标文本信息;a second acquisition module, configured to acquire target text information other than the to-be-translated text information in the session;
    第二确定模块,用于确定所述目标文本信息所属的第二语种;a second determining module, configured to determine the second language to which the target text information belongs;
    识别模块,用于识别所述目标文本信息中与所述第二语种不对应的第三文本信息;an identification module, configured to identify third text information that does not correspond to the second language in the target text information;
    构建模块,用于根据所述第三文本信息,构建所述预设数据库。and a building module, configured to build the preset database according to the third text information.
  9. 根据权利要求8所述的装置,其中,所述装置还包括:The apparatus of claim 8, wherein the apparatus further comprises:
    第三获取模块,用于获取目标数据库,其中,所述目标数据库为与所述会话所属领域的相似度大于预设阈值的目标会话对应的数据库;A third obtaining module, configured to obtain a target database, wherein the target database is a database corresponding to a target session whose similarity in the field to which the session belongs is greater than a preset threshold;
    更新模块,用于根据所述目标数据库,更新所述预设数据库。An update module, configured to update the preset database according to the target database.
  10. 根据权利要求9所述的装置,其中,所述装置还包括:The apparatus of claim 9, wherein the apparatus further comprises:
    第一生成模块,用于根据所述会话包括的第一词汇以及所述第一词汇在所述会话中的出现次数,生成所述会话对应的第一词汇分布表;a first generating module, configured to generate a first vocabulary distribution table corresponding to the conversation according to the first vocabulary included in the conversation and the number of occurrences of the first vocabulary in the conversation;
    第二生成模块,用于根据所述目标会话包括的第二词汇以及所述第二词汇在所述目标会话中的出现次数,生成所述目标会话对应的第二词汇分布表;A second generating module, configured to generate a second vocabulary distribution table corresponding to the target session according to the second vocabulary included in the target session and the number of occurrences of the second vocabulary in the target session;
    第三确定模块,用于根据所述第一词汇分布表与所述第二词汇分布表,确定所述会话与所述目标会话的领域的相似度。The third determining module is configured to determine the similarity between the domain of the conversation and the target conversation according to the first vocabulary distribution table and the second vocabulary distribution table.
  11. 一种电子设备,包括:处理器、存储器及存储在所述存储器上并可在所述处理器上运行的程序或指令,所述程序或指令被所述处理器执行时实现如权利要求1至5任一项所述的文本翻译方法的步骤。An electronic device, comprising: a processor, a memory, and a program or instruction stored on the memory and executable on the processor, the program or instruction being executed by the processor to implement claims 1 to 5. The steps of any one of the text translation methods.
  12. 一种计算机可读存储介质,所述计算机可读存储介质上存储程序或指令,所述程序或指令被处理器执行时实现如权利要求1至5任一项所述的文本翻译方法的步骤。A computer-readable storage medium storing programs or instructions on the computer-readable storage medium, when the programs or instructions are executed by a processor, the steps of the text translation method according to any one of claims 1 to 5 are implemented.
  13. 一种电子设备,被配置为用于执行如权利要求1至5任一项所述的文本翻译方法的步骤。An electronic device configured to perform the steps of the text translation method of any one of claims 1 to 5.
  14. 一种计算机程序产品,所述计算机程序产品被至少一个处理器执行以实现如权利要求1至5任一项所述的文本翻译方法的步骤。A computer program product executed by at least one processor to implement the steps of the text translation method of any one of claims 1 to 5.
  15. 一种芯片,包括处理器和通信接口,所述通信接口和所述处理器耦合,所述处理器用于运行程序或指令,实现如权利要求1至5任一项所述的文本翻译方法的步骤。A chip, comprising a processor and a communication interface, wherein the communication interface is coupled with the processor, and the processor is used for running a program or an instruction to implement the steps of the text translation method according to any one of claims 1 to 5 .
PCT/CN2022/073454 2021-01-27 2022-01-24 Text translation method and apparatus, and device and medium WO2022161307A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202110112331.4A CN112766005A (en) 2021-01-27 2021-01-27 Text translation method, device, equipment and medium
CN202110112331.4 2021-01-27

Publications (1)

Publication Number Publication Date
WO2022161307A1 true WO2022161307A1 (en) 2022-08-04

Family

ID=75706180

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2022/073454 WO2022161307A1 (en) 2021-01-27 2022-01-24 Text translation method and apparatus, and device and medium

Country Status (2)

Country Link
CN (1) CN112766005A (en)
WO (1) WO2022161307A1 (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112766005A (en) * 2021-01-27 2021-05-07 维沃移动通信有限公司 Text translation method, device, equipment and medium

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1841364A (en) * 2005-03-28 2006-10-04 富士施乐株式会社 Document translation method and document translation device
US20070129932A1 (en) * 2005-12-01 2007-06-07 Yen-Fu Chen Chinese to english translation tool
CN104102629A (en) * 2013-04-02 2014-10-15 三星电子株式会社 Text data processing method and electronic device thereof
CN110543644A (en) * 2019-09-04 2019-12-06 语联网(武汉)信息技术有限公司 Machine translation method and device containing term translation and electronic equipment
CN111783482A (en) * 2020-07-06 2020-10-16 南京百家云科技有限公司 Text translation method and device, computer equipment and storage medium
CN112766005A (en) * 2021-01-27 2021-05-07 维沃移动通信有限公司 Text translation method, device, equipment and medium

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1841364A (en) * 2005-03-28 2006-10-04 富士施乐株式会社 Document translation method and document translation device
US20070129932A1 (en) * 2005-12-01 2007-06-07 Yen-Fu Chen Chinese to english translation tool
CN104102629A (en) * 2013-04-02 2014-10-15 三星电子株式会社 Text data processing method and electronic device thereof
CN110543644A (en) * 2019-09-04 2019-12-06 语联网(武汉)信息技术有限公司 Machine translation method and device containing term translation and electronic equipment
CN111783482A (en) * 2020-07-06 2020-10-16 南京百家云科技有限公司 Text translation method and device, computer equipment and storage medium
CN112766005A (en) * 2021-01-27 2021-05-07 维沃移动通信有限公司 Text translation method, device, equipment and medium

Also Published As

Publication number Publication date
CN112766005A (en) 2021-05-07

Similar Documents

Publication Publication Date Title
CN107810496B (en) User text analysis
US11409813B2 (en) Method and apparatus for mining general tag, server, and medium
CN110770694B (en) Obtaining response information from multiple corpora
US11538468B2 (en) Using semantic frames for intent classification
JP7301922B2 (en) Semantic retrieval method, device, electronic device, storage medium and computer program
CN110334347A (en) Information processing method, relevant device and storage medium based on natural language recognition
US20120297294A1 (en) Network search for writing assistance
US20090210214A1 (en) Universal Language Input
US20140040741A1 (en) Smart Auto-Completion
CN107621886B (en) Input recommendation method and device and electronic equipment
WO2022083750A1 (en) Text display method and apparatus and electronic device
US10915697B1 (en) Computer-implemented presentation of synonyms based on syntactic dependency
WO2022135474A1 (en) Information recommendation method and apparatus, and electronic device
US20190034080A1 (en) Automatic translations by a keyboard
US9557818B2 (en) Contextually-specific automatic separators
WO2022143524A1 (en) Information display method and apparatus, and electronic device
WO2022161307A1 (en) Text translation method and apparatus, and device and medium
WO2022166808A1 (en) Text restoration method and apparatus, and electronic device
WO2022105754A1 (en) Character input method and apparatus, and electronic device
CN113359999B (en) Candidate word updating method and device and electronic equipment
CN111553163A (en) Text relevance determining method and device, storage medium and electronic equipment
WO2022194207A1 (en) Input method and apparatus, electronic device and readable storage medium
WO2022257883A1 (en) Presentation method and presentation apparatus
WO2022135339A1 (en) Message content input method and apparatus, and electronic device
WO2022228294A1 (en) Method and apparatus for carrying out searching, electronic device, and readable storage medium

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 22745182

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 22745182

Country of ref document: EP

Kind code of ref document: A1