KR102115645B1

KR102115645B1 - Systems and methods for multi-user multi-lingual communications

Info

Publication number: KR102115645B1
Application number: KR1020177014779A
Authority: KR
Inventors: 가브리엘 레이던; 프랑수아 오르시니; 니킬 보짜; 아룬 네둔체치안; 바르트로미에즈 푸존
Original assignee: 엠지 아이피 홀딩스, 엘엘씨
Priority date: 2013-06-03
Filing date: 2014-06-03
Publication date: 2020-05-26
Also published as: CA2913984C; CA2913984A1; CA3037036A1; KR20160017035A; HK1216789A1; KR101743686B1; AU2019205995A1; WO2014197463A2; CN105408891A; CN110069789A; NZ719940A; AU2017221809A1; AU2014275087A1; JP6672344B2; SG11201509840YA; JP2018097885A; WO2014197463A3; KR20170062556A; EP3005151A2; CN105408891B

Abstract

여기에 설명된 다양한 실시예에는 다중 언어 통신을 가능하게 한다. 일부 실시예의 시스템 및 방법은, 예를 들어, 인터넷 기반 채팅, 이메일, 텍스트 기반 모바일 폰 통신, 온라인 포럼에 대한 포스팅, 온라인 소셜 미디어 서비스에 대한 포스팅 등을 포함하는 서로 다른 통신 모드를 통하여, 다중 언어 통신을 가능하게 할 수 있다. 어떤 실시예들은, 두 개 또는 그 이상의 언어들 간의 텍스트를 번역하는 통신 시스템 및 방법을 구현한다. 그 시스템 및 방법의 사용자들은, 부정확하고 또는 오류가 있는 번역들에 대한 정정들을 제출하도록 장려될 수 있고, 이러한 제출들에 대한 보상을 수신할 수 있다. 번역들의 정확도를 평가하기 위한 시스템 및 방법들이 설명된다. The various embodiments described herein enable multilingual communication. The systems and methods of some embodiments are multi-language, through different communication modes including, for example, Internet-based chat, email, text-based mobile phone communication, posting to online forums, posting to online social media services, and the like. Communication can be enabled. Some embodiments implement a communication system and method for translating text between two or more languages. Users of the system and method can be encouraged to submit corrections for inaccurate or erroneous translations, and receive compensation for these submissions. Systems and methods for evaluating the accuracy of translations are described.

Description

Multi-user multi-language communication system and method {SYSTEMS AND METHODS FOR MULTI-USER MULTI-LINGUAL COMMUNICATIONS}

본 출원은, 인용에 의해 그 전체 내용이 본 명세서에 합체되는, 2013.06.03.에 출원된 미국특허출원 제13/908,979의 우선권 및 이익을 주장한다.This application claims the priority and interests of United States Patent Application Serial No. 13 / 908,979, filed on June 3, 2013, the entire content of which is incorporated herein by reference.

본 발명은 일반적으로 언어 번역과 관련되는 것으로, 보다 구체적으로, 복수의 사용자와 복수의 언어를 포함하는 언어 번역에 관한 것이다.The present invention generally relates to language translation, and more particularly, to a language translation including a plurality of users and a plurality of languages.

기계에 의한 언어 번역(이하, “기계 번역”이라 함)이 등장하기 전에는, 두 언어 사이의 번역은, 양쪽 언어를 모두 교육받은 사람의 개입(intervention)이나 해석(interpretation)에 의하여만 가능하였다. 반면에, 전형적인 기계 번역기는, 일반적으로 문맥(context)과 문법(grammar)에 대한 통계적(statistical)/확률적(stochastic) 해석에 기반하여 동작하며, 주로 사람의 개입/해석은 필요하지 않았다.Before machine-language translation (hereinafter referred to as “machine translation”), translation between two languages was possible only by intervention or interpretation of persons educated in both languages. On the other hand, a typical machine translator generally operates based on statistical / stochastic interpretation of context and grammar, and mainly human intervention / interpretation is not required.

통상적인 기계 해석은, 특히 번역될 텍스트(text)가 최소한의 문맥을 가지는 경우에, 주로 오류가 발생이 쉽다. 최소한의 문맥을 가지는 텍스트는 간단한 문장 구조를 사용하는 대화에서 주로 발견된다. 추가적으로, 기계 번역은, 대화 텍스트에서도 일반적으로 발견되는, 약어(abbreviations), 두문자어(acronyms), 지소자(diminutives), 구어체 단어/구절(colloquial word/phrases), 고유명사(proper nouns) 및 보통명사(common noun)에 대하여, 주로 문제점을 가진다.Conventional machine interpretation is prone to errors, especially when the text to be translated has a minimal context. Text with minimal context is mainly found in conversations using simple sentence structures. Additionally, machine translation is commonly found in conversational text, abbreviations, acronyms, diminutives, colloquial words / phrases, proper nouns and common nouns. For (common noun), there are mainly problems.

여기서 설명되는 다양한 실시예들은, 2 또는 그 이상의 클라이언트 시스템의 사용자일 수 있는, 복수의 사용자들 사이의 다중 언어 통신과 관련된 시스템 및 방법을 제공한다. Various embodiments described herein provide systems and methods related to multilingual communication between multiple users, which may be users of two or more client systems.

실시예들에 의하여 가능하게 되는 통신의 모드들은, 인터넷 기반의 채팅(예를 들어, 애플®의 iMessage, Windows® Live Messenger 등), 이메일(예를 들어, 임베디드 포럼 메시징, Yahoo® 메일, RFC 5322 등), 텍스트 기반의 모바일 폰 통신(예를 들어, SMS 메시지나 MMS 메시지), 온라인 포럼에 대한 포스팅(예를 들어, 웹 기반의 취미 포럼에 대한 포스팅) 및 온라인 소셜 미디어 서비스(online social media service)에 대한 포스팅(예를 들어, Twitter®, Facebook® 등)을 포함할 수 있다. 예를 들어, 시스템 및 방법들은 다중 언어, 다중 사용자 채팅 시스템을 실행할 수 있다. Modes of communication enabled by the embodiments include Internet-based chat (eg, Apple® iMessage, Windows® Live Messenger, etc.), email (eg, embedded forum messaging, Yahoo® mail, RFC 5322). Etc.), text-based mobile phone communication (e.g. SMS messages or MMS messages), posting to online forums (e.g. posting to web-based hobby forums) and online social media services ) (For example, Twitter®, Facebook®, etc.). For example, the systems and methods can implement a multi-language, multi-user chat system.

일부 실시예에서, 제공되는 상기 방법은, 제 1 언어와 제 2 언어를 식별하는 단계; 제 1 채팅 클라이언트 시스템에서 상기 제 1 언어로 통신하는 제 1 사람으로부터, 상기 제 1 언어로 된 초기 메시지(initial message)를 수신하는 단계; 및 데이터 저장부로 상기 제 2 언어로 된 제 1 대응 메시지를 질의하는 것으로, 상기 제 1 대응 메시지는 상기 제 1 언어의 상기 초기 메시지에 기반하는 것인 단계를 포함할 수 있다. 상기 데이터 저장부가 상기 제 1 대응 메시지를 포함하면, 상기 방법은 제 2 채팅 클라이언트 시스템에서 상기 제 2 언어로 통신하는 제 2 사람에게로의 상기 대응 메시지의 전송을 지원(assist)할 수 있다. 상기 실시예에 의하면, 상기 초기 메시지는 텍스트, 이모티콘, ASCII 기반의 기술, 또는 네트워크를 통하여 전송된, 사람이 읽을 수 있는 메시지에 적용가능(suitable) 하거나 관습적인(customary) 다른 콘텐츠를 포함할 수 있다. 추가적으로, 상기 초기 메시지는 채팅 클라이언트 시스템 사이에서 통신되는 더 큰 메시지(예를 들어, 상기 초기 메시지는, 복수-문장 메시지에 포함된 하나의 문장)의 일 부분일 수 있다. In some embodiments, the provided method includes: identifying a first language and a second language; Receiving an initial message in the first language from a first person communicating in the first language in a first chat client system; And querying a first correspondence message in the second language with the data storage unit, wherein the first correspondence message is based on the initial message in the first language. When the data storage unit includes the first correspondence message, the method may assist in the transmission of the correspondence message from the second chat client system to the second person communicating in the second language. According to the above embodiment, the initial message may include text, emoticons, ASCII-based technology, or other content applicable or customary to a human-readable message transmitted through a network. have. Additionally, the initial message may be part of a larger message (eg, the initial message is one sentence included in a multi-sentence message) communicated between chat client systems.

상기 데이터 저장부가 상기 제 1 대응 메시지를 포함하지 않으면, 상기 방법은, 상기 초기 메시지의 적어도 일 부분을, 상기 제 1 언어로 된 변환된 메시지로 변환하기 위한 시도를 수행하기 위하여, 변환 엔진(transformation engine)을 활용할 수 있다. 상기 변환된 메시지를 이용하여, 상기 방법은, 상기 변환된 메시지에 기반하는, 상기 제 2 언어로 된 제 2 대응 메시지를, 상기 데이터 저장부로 질의할 수 있다.If the data storage unit does not include the first corresponding message, the method may perform an attempt to convert at least a portion of the initial message into a converted message in the first language. engine). Using the converted message, the method may query the data storage unit for a second corresponding message in the second language based on the converted message.

특정 실시예에서, 상기 시스템 또는 방법은, 상기 변환된 메시지에 기반하는 제 2 대응 메시지에 대해 상기 데이터 저장부에 질의하기 전에, 일련의 변환 동작을 사용하여 상기 초기 메시지를 변환하려고 시도할 수 있다. 대안으로, 일부 실시예에서는, 상기 시스템 또는 방법이 상기 변환 및 질의를 반복적으로 수행할 수 있으며, 여기서, 상기 초기 메시지는, 이용 가능한 변환 동작의 부분 집합(subset)을 이용하여 변환되고, 상기 데이터 저장부는 상기 결과로 초래된 변환된 메시지에 기반하는 제 2 대응메시지에 대하여 질의를 받으며, 만약 제 2 대응 메시지가 식별되지 않으면, 변환 및 질의의 다른 반복이 수행된다(예를 들어, 상기 결과로 초래된 변환된 메시지는, 다른 가능한 변환 동작의 부분 집합을 이용하여 더 변환될 수 있으며, 상기 데이터 저장부는, 상기 결과로 초래된 변환된 메시지에 기반하는, 제 2 대응 메시지를 질의 받을 수 있다). 이러한 일부 실시예에서, 각각의 반복마다 적용되는 변환 동작의 부분 집합은, 상기 초기 메시지에 적용되거나, 또는 최근에 결과로 초래된 변환된 메시지에 적용될 수 있다. In certain embodiments, the system or method may attempt to transform the initial message using a series of transform operations before querying the data store for a second corresponding message based on the transformed message. . Alternatively, in some embodiments, the system or method may iteratively perform the transformation and query, where the initial message is transformed using a subset of the transformation operations available, and the data The storage unit is queried for a second corresponding message based on the converted message resulting from the result, and if the second corresponding message is not identified, conversion and other repetition of the query are performed (for example, as the result) The resulting transformed message can be further transformed using a subset of other possible transform operations, and the data storage can be queried for a second corresponding message based on the resulting transformed message) . In some of these embodiments, a subset of transform operations applied to each iteration may be applied to the initial message, or to a recently resulting transformed message.

결국, 상기 방법은, 상기 초기 메시지 또는 상기 변환된 메시지를, 제 2 언어로 된, 대응 메시지로 번역하는 것을 지원할 수 있다. 일부 실시예에서는, 상기 초기 메시지에 대한 상기 제 1 대응 메시지가 데이터 저장부에 있지 않고, 상기 변환 엔진(transformation engine)이 상기 초기 메시지의 적어도 일 부분을 변환하지 않으면, 상기 초기 메시지는 대응 메시지로 번역될 수 있다. 추가적으로, 다양한 실시예에서, 상기 초기 메시지에 대한 상기 제 1 대응 메시지가 상기 데이터 저장부에 없고; 상기 변환 엔진이, 상기 초기 메시지의 적어도 일부분의 변환을 포함하는, 변환된 메시지를 야기하고; 및 상기 데이터 저장부가 상기 변환된 메시지에 대한 제 2 대응 메시지를 포함하지 않는 경우에, 상기 변환된 메시지는 상기 대응 메시지로 번역될 수 있다. Consequently, the method can support translating the initial message or the converted message into a corresponding message, in a second language. In some embodiments, if the first corresponding message to the initial message is not in the data storage and the transformation engine does not transform at least a portion of the initial message, the initial message is a corresponding message. Can be translated. Additionally, in various embodiments, the first corresponding message to the initial message is not in the data storage; The conversion engine causes a converted message, including conversion of at least a portion of the initial message; And when the data storage unit does not include a second corresponding message to the converted message, the converted message may be translated into the corresponding message.

상기 실시예에 의하면, 상기 초기 메시지의 일부를 변환하는 것은, 상기 초기 메시지에 포함된 채팅용어(chatspeak) 단어 또는 구절(예를 들어, 'lol', 'gr8')을 식별하고, 상기 채팅용어 단어 또는 구절을 채팅용어가 아닌 단어 또는 구절로 교체하며, 상기 초기 메시지의 일부에 대한 맞춤법 검사를 수행하거나, 또는 상기 초기 메시지의 일부의 약어(abbreviation)를 식별하고, 및 상기 약어를 상기 약어에 대응하는(예를 들어, 나타내는) 단어 또는 구절로 교체하는 것을 포함할 수 있다(예를 들어, 'CA'를 'California' 또는 'brb'를 'be right back'). According to the embodiment, converting a part of the initial message identifies a chatspeak word or phrase (eg, 'lol', 'gr8') included in the initial message, and the chat term Replace words or phrases with words or phrases other than chat terms, perform a spell check on parts of the initial message, or identify abbreviations of parts of the initial message, and assign the abbreviations to the abbreviations It may include substituting a corresponding (eg, representing) word or phrase (eg, 'CA' to 'California' or 'brb' to 'be right back').

추가적으로, 상기 초기 메시지의 일부를 변환하는 것은, 상기 초기 메시지의 일부의 두음자어(acronym)을 식별하고, 상기 두음자어를 상기 두음자어(예를 들어, 'USA')에 대응하는(예를 들어, 나타내는) 단어 또는 구절로 교체하거나, 또는 상기 초기 메시지의 일부의 구어체(colloquial)를 식별하고 상기 구어체를 상기 구어체에 대응하는 단어 또는 구절로 교체하는 것을 포함할 수 있다. 나아가, 상기 초기 메시지의 일부를 변환하는 것은, 상기 초기 메시지의 일부의 비속어(profane) 단어 또는 구절을 식별하고, 상기 비속어 단어 또는 구절을 비속어가 아닌 단어 또는 구절로 교체하거나(예를 들어, 상기 비속어 단어 또는 구절이 나타내는 것으로), 상기 비속어 단어 또는 구절을 상기 초기 메시지에서 삭제하는 것을 포함할 수 있다. Additionally, converting a portion of the initial message identifies an acronym of a portion of the initial message, and corresponds to the acronym (eg, 'USA') for the acronym (eg, 'USA'). , Indicating a word or phrase, or identifying a colloquial of a part of the initial message and replacing the colloquial language with a word or phrase corresponding to the colloquial language. Further, converting a portion of the initial message identifies a profanity word or phrase of a portion of the initial message, and replaces the profanity word or phrase with a non-profanity word or phrase (eg, the Slang words or phrases, and may include deleting the slang words or phrases from the initial message.

일부 실시예에 의하면, 상기 초기 메시지의 일부를 변환하는 것은, 상기 초기 메시지의 번역하지 않을 부분을 표시하는 것을 포함한다. 예를 들어, 상기 초기 메시지의 특정 부분에, 고유명사(proper noun), 보통명사(common noun), 지소사(diminutive), 약어(abbreviation) 또는 두음자어(acronym)가 포함되면, 상기 방법은 상기 특정 부분에, 이후의 동작에서 번역되지 않음을 표시할 수 있다. According to some embodiments, converting a portion of the initial message includes marking a portion of the initial message that is not to be translated. For example, if a specific part of the initial message includes a proper noun, a common noun, a diminutive, an abbreviation or an acronym, the method In the part, it can be indicated that it is not translated in a subsequent operation.

일 실시예에서는, 여기서 설명되는 다양한 동작들을 수행하도록 구현된 다양한 구성들을 포함하는 실시예를 제공한다. 유사하게, 컴퓨터 프로그램 제품을 제공하는 일 실시예들은, 컴퓨터 시스템이 여기서 기술된 다양한 동작들을 수행할 수 있도록 구현된 컴퓨터 명령어 코드를 포함할 수 있다. In one embodiment, an embodiment is provided that includes various configurations implemented to perform various operations described herein. Similarly, one embodiment of providing a computer program product may include computer instruction code implemented to enable a computer system to perform various operations described herein.

일 측면에서, 발명은 컴퓨터로 구현되는 방법과 관련된다. 그 방법은, 단어 또는 구절을 포함하는 메시지를 제 1 언어에서 제 2 언어로 번역하는데 실패한 번역 실패와 연관된 상기 단어 또는 구절을 데이터 저장부로부터 선택하는 단계; 상기 번역 실패에 대한 사용자 피드백을 간청할 사용자를 선택하는 단계; 사용자 피드백에 대한 대가로 상기 사용자에게 제공하기 위한 인센티브의 가치를 결정하는 단계; 상기 사용자의 컴퓨팅 디바이스로 피드백을 위한 요청을 전송하는 단계로서, 상기 요청은 인센티브를 포함하는, 전송하는 단계; 상기 컴퓨팅 디바이스로부터 사용자 피드백을 수신하는 단계로서, 그 사용자 피드백은 제 1 또는 제 2 언어로 된 각각의 단어 또는 구절을 포함하는, 수신하는 단계; 상기 사용자 피드백이 승인되는 것으로 결정하는 단계; 및 승인에 기초하여, 상기 인센티브의 가치에 따라 사용자의 계좌에 입금하는 단계를 포함한다. In one aspect, the invention relates to a computer-implemented method. The method comprises selecting a word or phrase from a data store associated with a translation failure that fails to translate a message containing the word or phrase from a first language to a second language; Selecting a user to solicit user feedback for the translation failure; Determining the value of an incentive to provide to the user in exchange for user feedback; Sending a request for feedback to the user's computing device, wherein the request includes an incentive; Receiving user feedback from the computing device, the user feedback comprising each word or phrase in a first or second language; Determining that the user feedback is approved; And based on the approval, depositing into the user's account according to the value of the incentive.

특정 실시예에서, 상기 실패는, 상기 메시지를 번역하는데 있어서 실제적인 실패에 기인한다. 상기 실패는, 사용자가 상기 메시지를 잠재적으로 부정확한 것으로 표시하는 것에 의해 식별되거나 또는 그것에 기인할 수 있다. 상기 사용자를 선택하는 단계는, 상기 사용자의 신뢰도 측정, 상기 사용자와 연관된 쿼터(quota), 상기 사용자의 계좌로의 이전의 입금, 상기 사용자의 선호도, 또는 상기 사용자의 언어 능력에 기초할 수 있다. 일부 실시예에서, 상기 단어 또는 구절은, 제 1 언어로 된 채팅용어를 포함한다. 상기 응답은, 제 2 언어로 된 채팅용어를 포함할 수 있다. 일부 구현에서, 상기 질의는, 텍스트 기반 값을 수신하도록 구성된 필드를 포함한다. In certain embodiments, the failure is due to an actual failure in translating the message. The failure can be identified or attributed to the user marking the message as potentially incorrect. The step of selecting the user may be based on the user's reliability measure, a quote associated with the user, a previous deposit into the user's account, the user's preference, or the user's language ability. In some embodiments, the word or phrase includes chat terms in a first language. The response may include chat terms in a second language. In some implementations, the query includes fields configured to receive text-based values.

특정 실시예에서, 상기 요청은, 상기 사용자가 상기 단어 또는 구절에 대한 정의를 선택할 수 있는 미리 선택된 정의들의 세트를 포함한다. 상기 미리 선택된 정의들의 세트는, 예를 들어, 다른 요청에 응답하여 다른 사용자에 의해 제공된 적어도 하나의 정의를 포함하되, 상기 다른 요청은 상기 다른 사용자로부터 상기 단어 또는 구절에 대해 이전의 사용자 피드백을 얻기 위해 이전에 생성된 것일 수 있다. 상기 다른 요청은, 상기 다른 사용자가 정의를 선택하였던 다른 미리 선택된 정의들의 세트를 포함할 수 있다. 상기 방법은, 가장 인기있는 응답을 결정하기 위해 상기 사용자 피드백을 평가하는 단계를 더 포함할 수 있다.In a particular embodiment, the request includes a set of preselected definitions from which the user can select a definition for the word or phrase. The set of pre-selected definitions includes, for example, at least one definition provided by another user in response to another request, wherein the other request obtains previous user feedback for the word or phrase from the other user. May have been previously created. The other request may include another set of preselected definitions from which the other user has selected the definition. The method may further include evaluating the user feedback to determine the most popular response.

다양한 실시예에서, 상기 방법은, 상기 사용자 피드백에 기초하여 상기 사용자의 능력을 결정하는 단계를 더 포함한다. 상기 방법은, 상기 사용자 피드백을 기초로, 제 1 언어에서 제 2 언어로의 상기 단어 또는 구절의 변환 또는 번역을 업데이트하는 단계를 더 포함할 수 있다. 일부 실시예에서, 상기 사용자 피드백이 승인되는 것으로 결정하는 단계는, 상기 사용자 피드백이 사기적이지 않은 것으로 결정하는 단계; 및/또는 상기 사용자 피드백이 정확한 것으로 결정하는 단계를 포함할 수 있다. 상기 사용자 피드백이 승인되는 것으로 결정하는 단계는, 상기 사용자 피드백과, 다른 요청에 응답하여 다른 사용자에 의해 제공된 적어도 하나의 이전의 사용자 피드백의 비교에 기초할 수 있고, 상기 다른 요청은 상기 다른 사용자로부터 상기 단어 또는 구절에 대한 피드백을 얻기 위해 이전에 생성된 것일 수 있다.In various embodiments, the method further comprises determining the user's capabilities based on the user feedback. The method may further include updating the translation or translation of the word or phrase from a first language to a second language based on the user feedback. In some embodiments, determining that the user feedback is approved includes: determining that the user feedback is not fraudulent; And / or determining that the user feedback is correct. The step of determining that the user feedback is approved may be based on a comparison of the user feedback with at least one previous user feedback provided by another user in response to another request, the other request being from the other user. It may have been previously generated to get feedback on the word or phrase.

특정 실시예에서, 상기 인센티브는, 인-게임 통화(in-game currency) 또는 인-게임 아이템(in-game item)을 포함한다(또는 이를 위한 제안이다). 상기 인센티브의 가치는, 예를 들어, 상기 단어 또는 구절의 중요성, 또는 상기 단어 또는 구절의 복잡성에 기초하여 결정될 수 있다. 일부 예에서, 상기 인센티브의 가치를 결정하는 단계는, (i) 상기 단어 또는 구절의 복잡성, (ii) 상기 단어 또는 구절의 중요성, (iii) 상기 사용자에 의해 이용된 응답 방법, (iv) 상기 단어 또는 구절의 유형, 및/또는 (v) 상기 번역 실패에 관여된 언어를 고려하는 단계를 포함한다. In certain embodiments, the incentive includes (or is an offer for) an in-game currency or an in-game item. The value of the incentive may be determined based on, for example, the importance of the word or phrase, or the complexity of the word or phrase. In some examples, determining the value of the incentive includes: (i) the complexity of the word or phrase, (ii) the importance of the word or phrase, (iii) the response method used by the user, (iv) the Considering the type of word or phrase, and / or (v) the language involved in the translation failure.

다른 측면에서, 발명은, 동작을 수행하도록 프로그램된 하나 또는 그 이상의 컴퓨터를 포함하는 시스템과 관련된다. 그 동작은, 단어 또는 구절을 포함하는 메시지를 제 1 언어에서 제 2 언어로 번역하는데 실패한 번역 실패와 연관된 상기 단어 또는 구절을 데이터 저장부로부터 선택하는 단계; 상기 번역 실패에 대한 사용자 피드백을 간청할 사용자를 선택하는 단계; 사용자 피드백에 대한 대가로 상기 사용자에게 제공하기 위한 인센티브의 가치를 결정하는 단계; 상기 사용자의 컴퓨팅 디바이스로 피드백을 위한 요청을 전송하는 단계로서, 상기 요청은 인센티브를 포함하는, 단계; 상기 컴퓨팅 디바이스로부터 상기 사용자 피드백을 수신하는 단계로서, 상기 사용자 피드백은 상기 제 1 또는 제 2 언어로 된 각각의 단어 또는 구절을 포함하는, 단계; 상기 사용자 피드백이 승인되는 것으로 결정하는 단계; 및 승인에 기초하여, 상기 사용자의 계좌에 입금하는 단계를 포함한다. In another aspect, the invention relates to a system comprising one or more computers programmed to perform an operation. The operation includes selecting a word or phrase from a data storage unit associated with a translation failure that fails to translate a message containing a word or phrase from a first language to a second language; Selecting a user to solicit user feedback for the translation failure; Determining the value of an incentive to provide to the user in exchange for user feedback; Sending a request for feedback to the user's computing device, the request comprising an incentive; Receiving the user feedback from the computing device, the user feedback comprising each word or phrase in the first or second language; Determining that the user feedback is approved; And based on approval, depositing into the user's account.

특정 실시예에서, 상기 요청은, 상기 사용자가 상기 단어 또는 구절에 대한 정의를 선택할 수 있는 미리 선택된 정의들의 세트를 포함한다. 상기 미리 선택된 정의들의 세트는, 예를 들어, 다른 요청에 응답하여 다른 사용자에 의해 제공된 적어도 하나의 정의를 포함하되, 상기 다른 요청은 상기 다른 사용자로부터 상기 단어 또는 구절에 대해 이전의 사용자 피드백을 얻기 위해 이전에 생성된 것일 수 있다. 상기 다른 요청은, 상기 다른 사용자가 정의를 선택하였던 다른 미리 선택된 정의들의 세트를 포함할 수 있다. 상기 동작은, 가장 인기있는 응답을 결정하기 위해 상기 사용자 피드백을 평가하는 단계를 더 포함할 수 있다.In a particular embodiment, the request includes a set of preselected definitions from which the user can select a definition for the word or phrase. The set of pre-selected definitions includes, for example, at least one definition provided by another user in response to another request, wherein the other request obtains previous user feedback for the word or phrase from the other user. May have been previously created. The other request may include another set of preselected definitions from which the other user has selected the definition. The operation may further include evaluating the user feedback to determine the most popular response.

다양한 실시예에서, 상기 동작은, 상기 사용자 피드백에 기초하여 상기 사용자의 능력을 결정하는 단계를 더 포함한다. 상기 동작은, 또한 상기 사용자 피드백을 기초로, 제 1 언어에서 제 2 언어로의 상기 단어 또는 구절의 변환 또는 번역을 업데이트하는 단계를 더 포함할 수 있다. 일부 실시예에서, 상기 사용자 피드백이 승인되는 것으로 결정하는 단계는, 상기 사용자 피드백이 사기적이지 않은 것으로 결정하는 단계; 및/또는 상기 사용자 피드백이 정확한 것으로 결정하는 단계를 포함할 수 있다. 상기 사용자 피드백이 승인되는 것으로 결정하는 단계는, 상기 사용자 피드백과, 다른 요청에 응답하여 다른 사용자에 의해 제공된 적어도 하나의 이전의 사용자 피드백의 비교에 기초할 수 있고, 상기 다른 요청은 상기 다른 사용자로부터 상기 단어 또는 구절에 대한 피드백을 얻기 위해 이전에 생성된 것일 수 있다.In various embodiments, the operation further comprises determining the capability of the user based on the user feedback. The operation may further include updating the translation or translation of the word or phrase from a first language to a second language based on the user feedback. In some embodiments, determining that the user feedback is approved includes: determining that the user feedback is not fraudulent; And / or determining that the user feedback is correct. The step of determining that the user feedback is approved may be based on a comparison of the user feedback with at least one previous user feedback provided by another user in response to another request, the other request being from the other user. It may have been previously generated to get feedback on the word or phrase.

다른 측면에서, 발명은, 인센티브화된 피드백을 통해 언어 번역을 개선하기 위한 하나 또는 그 이상의 저장 매체에 저장되는 컴퓨터 프로그램 제품에 관련된다. 상기 컴퓨터 프로그램 제품은, 데이터 처리 장치에 의해 실행되고 상기 데이터 처리 장치로 하여금 동작을 수행하도록 하며, 상기 동작은, 단어 또는 구절을 포함하는 메시지를 제 1 언어에서 제 2 언어로 번역하는데 실패한 번역 실패와 연관된 상기 단어 또는 구절을 데이터 저장부로부터 선택하는 단계; 상기 번역 실패에 대한 사용자 피드백을 간청할 사용자를 선택하는 단계; 사용자 피드백에 대한 대가로 상기 사용자에게 제공하기 위한 인센티브의 가치를 결정하는 단계; 상기 사용자의 컴퓨팅 디바이스로 피드백을 위한 요청을 전송하는 단계로서, 상기 요청은 인센티브를 포함하는, 단계; 상기 컴퓨팅 디바이스로부터 상기 사용자 피드백을 수신하는 단계로서, 상기 사용자 피드백은 상기 제 1 또는 제 2 언어로 된 각각의 단어 또는 구절을 포함하는, 단계; 상기 사용자 피드백이 승인되는 것으로 결정하는 단계; 및 승인에 기초하여, 상기 사용자의 계좌에 입금하는 단계를 포함한다. In another aspect, the invention relates to a computer program product stored on one or more storage media for improving language translation through incentivized feedback. The computer program product is executed by a data processing device and causes the data processing device to perform an operation, wherein the operation fails to translate a message containing a word or phrase from a first language to a second language. Selecting the word or phrase associated with the data storage unit; Selecting a user to solicit user feedback for the translation failure; Determining the value of an incentive to provide to the user in exchange for user feedback; Sending a request for feedback to the user's computing device, the request comprising an incentive; Receiving the user feedback from the computing device, the user feedback comprising each word or phrase in the first or second language; Determining that the user feedback is approved; And based on approval, depositing into the user's account.

특정 실시예에서, 상기 인센티브는, 인-게임 통화(in-game currency) 또는 인-게임 아이템(in-game item)을 포함한다(또는 이를 위한 제안이다). 상기 인센티브의 가치는, 예를 들어, 상기 단어 또는 구절의 중요성, 또는 상기 단어 또는 구절의 복잡성에 기초하여 결정될 수 있다. 일부 예에서, 상기 인센티브의 가치를 결정하는 단계는, (i) 상기 단어 또는 구절의 복잡성, (ii) 상기 단어 또는 구절의 중요성, (iii) 상기 사용자에 의해 이용된 응답 방법, (iv) 상기 단어 또는 구절의 유형, 및/또는 (v) 상기 번역 실패에 관여된 언어를 고려하는 단계를 포함한다.In certain embodiments, the incentive includes (or is an offer for) an in-game currency or an in-game item. The value of the incentive may be determined based on, for example, the importance of the word or phrase, or the complexity of the word or phrase. In some examples, determining the value of the incentive includes: (i) the complexity of the word or phrase, (ii) the importance of the word or phrase, (iii) the response method used by the user, (iv) the Considering the type of word or phrase, and / or (v) the language involved in the translation failure.

일 측면에서, 발명은, 데이터 처리 장치에 의해 구현되는 방법에 관련된다. 그 방법은, 텍스트 메시지 채팅 시스템을 복수의 사용자에게 제공하는 단계; 제 1 사용자로부터 제 1 언어로 된 원본 텍스트 메시지를 수신하는 단계; 상기 원본 텍스트 메시지의 제 2 언어로 된 초기 번역을 생성하는 단계; 상기 원본 텍스트 메시지 및 상기 초기 번역을 제 2 사용자에게 제공하는 단계; 상기 제 2 사용자로부터 상기 초기 번역에서의 에러를 제기하는 번역 정정을 수신하는 단계; 및 (a) 복수의 번역 정정으로부터 가장 정확한 번역 정정을 식별하는 단계로서, 상기 복수의 번역 정정은 상기 제 2 사용자로부터의 번역 정정을 포함하는, 식별하는 단계; 및 (b) 단어-기반 특징, 언어-기반 특징, 및/또는 단어 배치 특징을 이용하여 상기 제 2 사용자로부터의 번역 정정의 정확도를 평가하는 단계 중 적어도 하나를 포함한다. In one aspect, the invention relates to a method implemented by a data processing apparatus. The method includes providing a text message chat system to a plurality of users; Receiving an original text message in a first language from a first user; Generating an initial translation in the second language of the original text message; Providing the original text message and the initial translation to a second user; Receiving a translation correction that raises an error in the initial translation from the second user; And (a) identifying the most accurate translation correction from a plurality of translation corrections, wherein the plurality of translation corrections include translation corrections from the second user; And (b) evaluating the accuracy of translation correction from the second user using word-based features, language-based features, and / or word placement features.

특정 실시예에서, 상기 방법은, 상기 제 2 사용자가 번역 정정을 제출하도록 장려하기 위해 인센티브(예, 온라인 게임에서 사용하기 위한 가상 재화 및/또는 가상 통화)를 제공하는 단계를 포함한다. 상기 가장 정확한 번역 정정을 결정하는 단계는, 적어도 하나의 추가 사용자로부터 상기 초기 번역에서의 에러를 제기하기 위한 적어도 하나의 추가 번역 정정을 수신하는 단계로서, 상기 적어도 하나의 추가 번역 정정 및 상기 제 2 사용자로부터의 번역 정정은, 상기 복수의 번역 정정을 정의하는, 추가 번역 정정을 수신하는 단계; 상기 복수의 번역 정정의 정확도에 관한 피드백을 사용자들로부터 수신하는 단계; 및 그 피드백에 기초하여, 상기 복수의 번역 정정으로부터 가장 정확한 번역 정정을 식별하는 단계를 포함한다. In certain embodiments, the method includes providing incentives (eg, virtual goods and / or virtual currencies for use in online games) to encourage the second user to submit translation corrections. Determining the most accurate translation correction comprises: receiving at least one additional translation correction for raising an error in the initial translation from at least one additional user, wherein the at least one additional translation correction and the second The translation correction from the user comprises: receiving an additional translation correction defining the plurality of translation corrections; Receiving feedback from users about the accuracy of the plurality of translation corrections; And based on the feedback, identifying the most accurate translation correction from the plurality of translation corrections.

일부 구현에서, 상기 방법은, 또한 상기 가장 정확한 번역 정정을 제출한 사용자에게 보상(예를 들어, 온라인 게임에서 사용하기 위한 가상 재화 및/또는 가상 통화)을 제공하는 단계를 포함한다. 상기 방법은 또한 상기 가장 정확한 번역을 식별하는데 사용된 피드백을 제공한 사용자에게 보상(예를 들어, 온라인 게임에서 사용하기 위한 가상 재화 및/또는 가상 통화)을 제공하는 단계를 포함할 수 있다. 상기 단어-기반 특징은, 예를 들어, 단어 카운트(count), 문자 카운트(count), 이모지(emoji), 숫자, 및/또는 구두점을 포함할 수 있다. 상기 언어-기반 특징을 이용하는 것은, 상기 원본 텍스트 메시지 및 상기 제 2 사용자로부터의 상기 번역 정정에 있는 품사를 식별하는 것을 포함할 수 있다. In some implementations, the method also includes providing a reward (eg, virtual goods and / or virtual currencies for use in online games) to the user who submitted the most accurate translation correction. The method may also include providing a reward (eg, virtual goods and / or virtual currencies for use in online games) to a user who provided feedback used to identify the most accurate translation. The word-based features may include, for example, word counts, character counts, emojis, numbers, and / or punctuation marks. Using the language-based feature may include identifying the part of speech in the original text message and the translation correction from the second user.

일부 실시예에서, 상기 방법은 또한, 상기 원본 텍스트 메시지 및 상기 제 2 사용자로부터의 번역 정정의 각각에 있는 동사의 개수를 식별하는 단계; 및 상기 원본 텍스트 메시지에 있는 동사의 개수와, 상기 제 2 사용자로부터의 번역 정정에 있는 동사의 개수를 비교하는 단계를 더 포함한다. 상기 원본 텍스트 메시지 및/또는 상기 제 2 사용자로부터의 번역 정정에서의 품사의 부재(absence)는, 언어 검출 실패를 가리킬 수 있다. 상기 방법은 또한 상기 제 2 사용자로부터의 번역 정정이 상기 초기 번역과 동일한 경우 상기 제 2 사용자로부터의 번역 정정을 거절하는 단계를 더 포함할 수 있다. In some embodiments, the method also includes identifying the number of verbs in each of the original text message and the translation correction from the second user; And comparing the number of verbs in the original text message with the number of verbs in the translation correction from the second user. The absence of part-of-speech in the original text message and / or translation correction from the second user may indicate a language detection failure. The method may further include rejecting the translation correction from the second user when the translation correction from the second user is the same as the initial translation.

다른 측면에서, 발명은, 명령들이 저장되는 컴퓨터 판독 가능한 매체, 및 데이터 처리 장치를 포함하는 시스템과 관련된다. 상기 데이터 처리 장치는, 동작을 수행하기 위한 명령들을 실행하도록 구성된다. 그 동작은, 텍스트 메시지 채팅 시스템을 복수의 사용자에게 제공하는 단계; 제 1 사용자로부터 제 1 언어로 된 원본 텍스트 메시지를 수신하는 단계; 상기 원본 텍스트 메시지의 제 2 언어로 된 초기 번역을 생성하는 단계; 상기 원본 텍스트 메시지 및 상기 초기 번역을 제 2 사용자에게 제공하는 단계; 상기 제 2 사용자로부터 상기 초기 번역에서의 에러를 제기하는 번역 정정을 수신하는 단계; 및 (a) 복수의 번역 정정으로부터 가장 정확한 번역 정정을 식별하는 단계로서, 상기 복수의 번역 정정은 상기 제 2 사용자로부터의 번역 정정을 포함하는, 단계; 및 (b) 단어-기반 특징, 언어-기반 특징, 및/또는 단어 배치 특징을 이용하여 상기 제 2 사용자로부터의 번역 정정의 정확도를 평가하는 단계 중 적어도 하나를 포함한다. In another aspect, the invention relates to a system comprising a computer readable medium on which instructions are stored, and a data processing apparatus. The data processing apparatus is configured to execute instructions for performing an operation. The operation includes providing a text message chat system to a plurality of users; Receiving an original text message in a first language from a first user; Generating an initial translation in the second language of the original text message; Providing the original text message and the initial translation to a second user; Receiving a translation correction that raises an error in the initial translation from the second user; And (a) identifying the most accurate translation correction from a plurality of translation corrections, wherein the plurality of translation corrections include translation corrections from the second user; And (b) evaluating the accuracy of translation correction from the second user using word-based features, language-based features, and / or word placement features.

특정 실시예에서, 상기 동작은, 상기 제 2 사용자가 번역 정정을 제출하도록 장려하기 위해 인센티브(예, 온라인 게임에서 사용하기 위한 가상 재화 및/또는 가상 통화)를 제공하는 단계를 포함한다. 상기 가장 정확한 번역 정정을 결정하는 단계는, 적어도 하나의 추가 사용자로부터 상기 초기 번역에서의 에러를 제기하기 위한 적어도 하나의 추가 번역 정정을 수신하는 단계로서, 상기 적어도 하나의 추가 번역 정정 및 상기 제 2 사용자로부터의 번역 정정은, 상기 복수의 번역 정정을 정의하는, 추가 번역 정정을 수신하는 단계; 상기 복수의 번역 정정의 정확도에 관한 피드백을 사용자들로부터 수신하는 단계; 및 그 피드백에 기초하여, 상기 복수의 번역 정정으로부터 가장 정확한 번역 정정을 식별하는 단계를 포함한다. In certain embodiments, the operation includes providing incentives (eg, virtual goods and / or virtual currencies for use in online games) to encourage the second user to submit translation corrections. Determining the most accurate translation correction comprises: receiving at least one additional translation correction for raising an error in the initial translation from at least one additional user, wherein the at least one additional translation correction and the second The translation correction from the user comprises: receiving an additional translation correction defining the plurality of translation corrections; Receiving feedback from users about the accuracy of the plurality of translation corrections; And based on the feedback, identifying the most accurate translation correction from the plurality of translation corrections.

일부 구현에서, 상기 동작은 또한 상기 가장 정확한 번역 정정을 제출한 사용자에게 보상(예를 들어, 온라인 게임에서 사용하기 위한 가상 재화 및/또는 가상 통화)을 제공하는 단계를 포함한다. 상기 동작은 또한 상기 가장 정확한 번역을 식별하는데 사용된 피드백을 제공한 사용자에게 보상(예를 들어, 온라인 게임에서 사용하기 위한 가상 재화 및/또는 가상 통화)을 제공하는 단계를 포함할 수 있다. 상기 단어-기반 특징은, 예를 들어, 단어 카운트(count), 문자 카운트(count), 이모지(emoji), 숫자, 및/또는 구두점을 포함할 수 있다. 상기 언어-기반 특징을 이용하는 것은, 상기 원본 텍스트 메시지 및 상기 제 2 사용자로부터의 상기 번역 정정에 있는 품사를 식별하는 것을 포함할 수 있다. In some implementations, the operation also includes providing a reward (eg, virtual goods and / or virtual currencies for use in online games) to the user who submitted the most accurate translation correction. The operation may also include providing a reward (eg, virtual goods and / or virtual currencies for use in online games) to a user who provided feedback used to identify the most accurate translation. The word-based features may include, for example, word counts, character counts, emojis, numbers, and / or punctuation marks. Using the language-based feature may include identifying the part of speech in the original text message and the translation correction from the second user.

일부 실시예에서, 상기 동작은 또한, 상기 원본 텍스트 메시지 및 상기 제 2 사용자로부터의 번역 정정의 각각에 있는 동사의 개수를 식별하는 단계; 및 상기 원본 텍스트 메시지에 있는 동사의 개수와, 상기 제 2 사용자로부터의 번역 정정에 있는 동사의 개수를 비교하는 단계를 더 포함한다. 상기 원본 텍스트 메시지 및/또는 상기 제 2 사용자로부터의 번역 정정에서의 품사의 부재(absence)는, 언어 검출 실패를 가리킬 수 있다. 상기 동작은 또한 상기 제 2 사용자로부터의 번역 정정이 상기 초기 번역과 동일한 경우 상기 제 2 사용자로부터의 번역 정정을 거절하는 단계를 더 포함할 수 있다. In some embodiments, the operation also includes identifying the number of verbs in each of the original text message and the translation correction from the second user; And comparing the number of verbs in the original text message with the number of verbs in the translation correction from the second user. The absence of part-of-speech in the original text message and / or translation correction from the second user may indicate a language detection failure. The operation may further include rejecting the translation correction from the second user when the translation correction from the second user is the same as the initial translation.

다른 측면에서, 발명은, 데이터 처리 장치의 처리 모드를 제어하기 위한 하나 또는 그 이상의 저장 매체에 저장되는 컴퓨터 프로그램 제품과 관련된다. 상기 컴퓨터 프로그램 제품은, 데이터 처리 장치에 의해 실행 가능하고 상기 데이터 처리 장치로 하여금 동작을 수행하도록 하며, 상기 동작은, 텍스트 메시지 채팅 시스템을 복수의 사용자에게 제공하는 단계; 제 1 사용자로부터 제 1 언어로 된 원본 텍스트 메시지를 수신하는 단계; 상기 원본 텍스트 메시지의 제 2 언어로 된 초기 번역을 생성하는 단계; 상기 원본 텍스트 메시지 및 상기 초기 번역을 제 2 사용자에게 제공하는 단계; 상기 제 2 사용자로부터 상기 초기 번역에서의 에러를 제기하는 번역 정정을 수신하는 단계; 및 (a) 복수의 번역 정정으로부터 가장 정확한 번역 정정을 식별하는 단계로서, 상기 복수의 번역 정정은 상기 제 2 사용자로부터의 번역 정정을 포함하는, 단계; 및 (b) 단어-기반 특징, 언어-기반 특징, 및/또는 단어 배치 특징을 이용하여 상기 제 2 사용자로부터의 번역 정정의 정확도를 평가하는 단계 중 적어도 하나를 포함한다. In another aspect, the invention relates to a computer program product stored on one or more storage media for controlling a processing mode of a data processing device. The computer program product is executable by a data processing device and causes the data processing device to perform an operation, the operation comprising: providing a text message chat system to a plurality of users; Receiving an original text message in a first language from a first user; Generating an initial translation in the second language of the original text message; Providing the original text message and the initial translation to a second user; Receiving a translation correction that raises an error in the initial translation from the second user; And (a) identifying the most accurate translation correction from a plurality of translation corrections, wherein the plurality of translation corrections include translation corrections from the second user; And (b) evaluating the accuracy of translation correction from the second user using word-based features, language-based features, and / or word placement features.

일 측면에서, 발명은, 데이터 처리 장치에 의해 구현되는 방법과 관련된다. 그 방법은, 제 1 언어 및 제 2 언어를 식별하는 단계; 제 1 언어로 통신하는 제 1 채팅 클라이언트 시스템의 제 1 사람으로부터 제 1 언어로 된 채팅용어 가청 메시지를 수신하는 단계; 상기 채팅용어 가청 메시지를 제 1 언어로 된 채팅용어 텍스트 메시지로 변경하는 단계; 상기 채팅용어 텍스트 메시지를 제 1 언어로 된 분명한 용어 텍스트 메시지로 변환하는 단계; 상기 분명한 용어 텍스트 메시지를 제 2 언어로 된 대응하는 분명한 용어 텍스트 메시지로 번역하는 단계; 상기 대응하는 분명한 용어 텍스트 메시지를 제 2 언어로 된 대응하는 채팅용어 텍스트 메시지로 변환하는 단계; 상기 대응하는 채팅용어 텍스트 메시지를 제 2 언어로 된 대응하는 채팅용어 가청 메시지로 변경하는 단계; 및 제 2 언어로 통신하는 제 2 채팅 클라이언트 시스템의 제 2 사용자에게 상기 대응하는 채팅용어 가청 메시지를 전송하는 단계를 포함한다. In one aspect, the invention relates to a method implemented by a data processing apparatus. The method includes identifying a first language and a second language; Receiving a chat language audible message in a first language from a first person in a first chat client system communicating in a first language; Changing the chat term audible message to a chat term text message in a first language; Converting the chat term text message into a clear term text message in a first language; Translating the clear term text message into a corresponding clear term text message in a second language; Converting the corresponding clear term text message into a corresponding chat term text message in a second language; Changing the corresponding chat term text message to a corresponding chat term audible message in a second language; And transmitting the corresponding chat term audible message to a second user of a second chat client system communicating in a second language.

특정 실시예에서, 상기 채팅용어 가청 메시지를 제 1 언어로 된 채팅용어 텍스트 메시지로 변경하는 단계는, 상기 채팅용어 가청 메시지를 음성 인식 시스템으로 제공하는 단계를 포함한다. 상기 채팅용어 텍스트 메시지를 변환하는 단계는, 상기 채팅용어 텍스트 메시지에 있는 채팅용어 단어 또는 구절을 식별하는 단계; 및 상기 채팅용어 단어 또는 구절을 비채팅용어 단어 또는 구절로 교체하는 단계를 포함할 수 있다. 일부 예에서, 상기 대응하는 채팅용어 텍스트 메시지를 제 2 언어로 된 대응하는 채팅용어 가청 메시지로 변경하는 단계는, 상기 대응하는 채팅용어 텍스트 메시지를 텍스트-음성(text-to-speech) 시스템으로 제공하는 단계를 포함한다. In a specific embodiment, the step of changing the audible message of the chat term to a text message of the chat language in the first language includes providing the audible message of the chat term to the speech recognition system. The step of converting the chat term text message may include identifying a chat term word or phrase in the chat term text message; And replacing the chat term word or phrase with a non-chat term word or phrase. In some examples, the step of converting the corresponding chatspeak text message to a corresponding chatspeak audible message in a second language provides the corresponding chatspeak text message to a text-to-speech system It includes the steps.

특정 구현에서, 상기 음성 인식 시스템 및/또는 상기 텍스트-음성 시스템은, 제 1 및 제 2 언어 각각에 대한 복수의 악센트(accent) 및 방언(dialect)을 포함하는 데이터를 활용한다. 상기 데이터는, 제 1 및 제 2 언어 각각에 대한 채팅용어 및 분명한 용어 포맷을 포함할 수 있다. 다양한 실시예에서, 상기 방법은, 상기 대응하는 분명한 용어 텍스트 메시지, 상기 대응하는 채팅용어 텍스트 메시지, 및/또는 상기 대응하는 채팅용어 가청 메시지의 정확도에 관한 피드백을 사용자로부터 수신하는 단계를 포함한다. 상기 방법은 또한 상기 피드백의 제공에 대해 사용자에게 인센티브(예를 들어, 온라인 게임에서 사용하기 위한, 가상 재화 및/또는 가상 통화)를 제공하는 단계를 더 포함할 수 있다. 일부 예에서, 상기 방법은, 상기 음성 인식 시스템 및/또는 상기 텍스트-음성 시스템의 정확도를 개선하기 위해 상기 피드백을 처리하는 단계를 더 포함한다.In a particular implementation, the speech recognition system and / or the text-to-speech system utilizes data comprising a plurality of accents and dialects for each of the first and second languages. The data may include chat terms and clear term formats for each of the first and second languages. In various embodiments, the method includes receiving feedback from the user regarding the accuracy of the corresponding clear term text message, the corresponding chatspeak text message, and / or the corresponding chatspeak audible message. The method may further include providing an incentive (eg, virtual goods and / or virtual currencies, for use in online games) to the user for providing the feedback. In some examples, the method further comprises processing the feedback to improve the accuracy of the speech recognition system and / or the text-speech system.

다른 측면에서, 발명은, 명령들이 저장되는 컴퓨터 판독 가능한 매체, 및 데이터 처리 장치를 포함하는 시스템과 관련된다. 상기 데이터 처리 장치는 동작을 수행하기 위한 명령들을 실행하도록 구성된다. 그 동작은, 제 1 언어 및 제 2 언어를 식별하는 단계; 제 1 언어로 통신하는 제 1 채팅 클라이언트 시스템의 제 1 사람으로부터 제 1 언어로 된 채팅용어 가청 메시지를 수신하는 단계; 상기 채팅용어 가청 메시지를 제 1 언어로 된 채팅용어 텍스트 메시지로 변경하는 단계; 상기 채팅용어 텍스트 메시지를 제 1 언어로 된 분명한 용어 텍스트 메시지로 변환하는 단계; 상기 분명한 용어 텍스트 메시지를 제 2 언어로 된 대응하는 분명한 용어 텍스트 메시지로 번역하는 단계; 상기 대응하는 분명한 용어 텍스트 메시지를 제 2 언어로 된 대응하는 채팅용어 텍스트 메시지로 변환하는 단계; 상기 대응하는 채팅용어 텍스트 메시지를 제 2 언어로 된 대응하는 채팅용어 가청 메시지로 변경하는 단계; 및 제 2 언어로 통신하는 제 2 채팅 클라이언트 시스템의 제 2 사용자에게 상기 대응하는 채팅용어 가청 메시지를 전송하는 단계를 포함한다.In another aspect, the invention relates to a system comprising a computer readable medium on which instructions are stored, and a data processing apparatus. The data processing apparatus is configured to execute instructions for performing an operation. The operation includes: identifying a first language and a second language; Receiving a chat language audible message in a first language from a first person in a first chat client system communicating in a first language; Changing the chat term audible message to a chat term text message in a first language; Converting the chat term text message into a clear term text message in a first language; Translating the clear term text message into a corresponding clear term text message in a second language; Converting the corresponding clear term text message into a corresponding chat term text message in a second language; Changing the corresponding chat term text message to a corresponding chat term audible message in a second language; And transmitting the corresponding chat term audible message to a second user of a second chat client system communicating in a second language.

특정 구현에서, 상기 음성 인식 시스템 및/또는 상기 텍스트-음성 시스템은, 제 1 및 제 2 언어 각각에 대한 복수의 악센트(accent) 및 방언(dialect)을 포함하는 데이터를 활용한다. 상기 데이터는, 제 1 및 제 2 언어 각각에 대한 채팅용어 및 분명한 용어 포맷을 포함할 수 있다. 다양한 실시예에서, 상기 동작은, 상기 대응하는 분명한 용어 텍스트 메시지, 상기 대응하는 채팅용어 텍스트 메시지, 및/또는 상기 대응하는 채팅용어 가청 메시지의 정확도에 관한 피드백을 사용자로부터 수신하는 단계를 포함한다. 상기 동작은 또한 상기 피드백의 제공에 대해 사용자에게 인센티브(예를 들어, 온라인 게임에서 사용하기 위한, 가상 재화 및/또는 가상 통화)를 제공하는 단계를 더 포함할 수 있다. 일부 예에서, 상기 동작은, 상기 음성 인식 시스템 및/또는 상기 텍스트-음성 시스템의 정확도를 개선하기 위해 상기 피드백을 처리하는 단계를 더 포함한다.In a particular implementation, the speech recognition system and / or the text-to-speech system utilizes data comprising a plurality of accents and dialects for each of the first and second languages. The data may include chat terms and clear term formats for each of the first and second languages. In various embodiments, the operation includes receiving feedback from the user regarding the accuracy of the corresponding clear term text message, the corresponding chatspeak text message, and / or the corresponding chatspeak audible message. The operation may further include providing an incentive (eg, virtual goods and / or virtual currencies, for use in online games) to the user for providing the feedback. In some examples, the operation further includes processing the feedback to improve the accuracy of the speech recognition system and / or the text-speech system.

다른 측면에서, 발명은, 데이터 처리 장치의 처리 모드를 제어하기 위한 하나 또는 그 이상의 저장 매체에 저장되는 컴퓨터 프로그램 제품과 관련된다. 상기 컴퓨터 프로그램 제품은, 데이터 처리 장치에 의해 실행 가능하고 상기 데이터 처리 장치로 하여금 동작을 수행하도록 하며, 상기 동작은, 제 1 언어 및 제 2 언어를 식별하는 단계; 제 1 언어로 통신하는 제 1 채팅 클라이언트 시스템의 제 1 사람으로부터 제 1 언어로 된 채팅용어 가청 메시지를 수신하는 단계; 상기 채팅용어 가청 메시지를 제 1 언어로 된 채팅용어 텍스트 메시지로 변경하는 단계; 상기 채팅용어 텍스트 메시지를 제 1 언어로 된 분명한 용어 텍스트 메시지로 변환하는 단계; 상기 분명한 용어 텍스트 메시지를 제 2 언어로 된 대응하는 분명한 용어 텍스트 메시지로 번역하는 단계; 상기 대응하는 분명한 용어 텍스트 메시지를 제 2 언어로 된 대응하는 채팅용어 텍스트 메시지로 변환하는 단계; 상기 대응하는 채팅용어 텍스트 메시지를 제 2 언어로 된 대응하는 채팅용어 가청 메시지로 변경하는 단계; 및 제 2 언어로 통신하는 제 2 채팅 클라이언트 시스템의 제 2 사용자에게 상기 대응하는 채팅용어 가청 메시지를 전송하는 단계를 포함한다.In another aspect, the invention relates to a computer program product stored on one or more storage media for controlling a processing mode of a data processing device. The computer program product is executable by a data processing device and causes the data processing device to perform an operation, the operation comprising: identifying a first language and a second language; Receiving a chat language audible message in a first language from a first person in a first chat client system communicating in a first language; Changing the chat term audible message to a chat term text message in a first language; Converting the chat term text message into a clear term text message in a first language; Translating the clear term text message into a corresponding clear term text message in a second language; Converting the corresponding clear term text message into a corresponding chat term text message in a second language; Changing the corresponding chat term text message to a corresponding chat term audible message in a second language; And transmitting the corresponding chat term audible message to a second user of a second chat client system communicating in a second language.

일 측면에서, 발명은, 데이터 처리 장치에 의해 구현되는 방법과 관련된다. 그 방법은, 오래된 훈련 데이터(예를 들어, 다른 언어로의 정확한 번역이 알려진 하나 또는 그 이상의 오래된 텍스트 메시지를 포함) 및 새로운 훈련 데이터(예를 들어, 다른 언어로의 정확한 번역이 알려져 있지 않은 하나 또는 그 이상의 새로운 텍스트 메시지를 포함)를 선택하는 단계; 복수의 각각의 요청들을 다른 시간에 사용자의 클라이언트 디바이스(또는 다수의 사용자들의 다수의 클라이언트 디바이스로)로 전송하는 단계로서, 상기 요청들은 (i) 상기 사용자가 상기 오래된 훈련 데이터 및/또는 상기 새로운 훈련 데이터를 번역하도록 하는 각각의 요청 및 (ii) 번역에 대한 각각의 인센티브를 포함하는, 전송하는 단계; 특정 요청을 전송한 후에, 상기 특정 요청의 상기 오래된 훈련 데이터에 대한 번역을 상기 클라이언트 디바이스로부터 수신하는 단계; 그 수신된 번역과 상기 오래된 훈련 데이터에 대한 정확한 번역을 비교하는 단계; 상기 비교에 기초하여 상기 수신된 번역의 정확도를 결정하는 단계; 및 번역에 기초하여 상기 사용자에 대한 신뢰도 점수를 업데이트하는 단계를 포함한다. 상기 신뢰도 점수는 상기 사용자가 차후에 상기 다른 언어로의 텍스트 메시지의 정확한 번역을 제공할 가능성을 나타낸다.In one aspect, the invention relates to a method implemented by a data processing apparatus. The method includes old training data (e.g., one or more old text messages whose exact translation is known to another language) and new training data (e.g. one whose exact translation to another language is not known). Or more new text messages); Sending a plurality of each request to the user's client device (or multiple users' multiple client devices) at different times, the requests comprising (i) the user training the old training data and / or the new training; Transmitting, including each request to translate data and (ii) each incentive for translation; After sending a specific request, receiving a translation of the old training data of the specific request from the client device; Comparing the received translation to the correct translation for the old training data; Determining the accuracy of the received translation based on the comparison; And updating a confidence score for the user based on the translation. The reliability score indicates the likelihood that the user will later provide an accurate translation of the text message to the other language.

특정 실시예에서, 상기 사용자는, 온라인 게임의 참가자이다. 상기 각각의 인센티브는, 예를 들어, 온라인 게임을 위한 가상 재화 및/또는 가상 통화를 포함할 수 있다. 상기 사용자로부터 수신된 번역의 정확도를 결정하는 단계는, (i) 단어 에러율(WER)을 계산하는 단계 및/또는 (ii) BLEU(bilingual evaluation understudy)를 이용하는 단계를 포함할 수 있다. 일부 예에서, 상기 사용자에 대한 신뢰도 점수를 업데이트하는 단계는, 사용자 번역 정확도에서 표준으로부터 편차(deviation from a norm)을 식별하기 위해 아이템 응답 이론을 사용하는 단계를 포함한다. 상기 방법은 또한, 상기 신뢰도 점수가 임계 값 아래로 떨어질 경우 상기 사용자의 번역 특전을 취소하는 단계를 포함할 수 있다.In certain embodiments, the user is a participant in an online game. Each of the incentives may include, for example, virtual goods and / or virtual currencies for online games. Determining the accuracy of the translation received from the user may include (i) calculating a word error rate (WER) and / or (ii) using bilingual evaluation understudy (BLEU). In some examples, updating the confidence score for the user includes using item response theory to identify deviations from a norm in user translation accuracy. The method may also include canceling the translation privilege of the user when the reliability score falls below a threshold value.

다양한 구현에서, 상기 방법은, 상기 사용자의 번역이 정확한 것으로 결정될 경우 상기 각각의 인센티브를 상기 사용자에게 보상하는 단계를 포함한다. 상기 방법은, 상기 사용자와 제 2 사용자 간의 기존의 관계를 식별함으로써 상기 사용자와 상기 제 2 사용자 간의 충돌을 검출하는 단계를 포함할 수 있다. 일 예에서, 상기 기존의 관계를 식별하는 것은, 상기 사용자와 상기 제 2 사용자 중 적어도 하나의 소셜 네트워크를 분석하는 것을 포함한다. In various implementations, the method includes compensating the respective incentives to the user if the user's translation is determined to be correct. The method may include detecting a conflict between the user and the second user by identifying an existing relationship between the user and the second user. In one example, identifying the existing relationship includes analyzing a social network of at least one of the user and the second user.

다른 측면에서, 발명은, 명령들이 저장되는 컴퓨터 판독 가능한 매체, 및 데이터 처리 장치를 포함하는 시스템과 관련된다. 상기 데이터 처리 장치는, 동작을 수행하기 위한 명령들을 실행하도록 구성되고, 그 동작은, 오래된 훈련 데이터(예를 들어, 다른 언어로의 정확한 번역이 알려진 하나 또는 그 이상의 오래된 텍스트 메시지를 포함) 및 새로운 훈련 데이터(예를 들어, 다른 언어로의 정확한 번역이 알려져 있지 않은 하나 또는 그 이상의 새로운 텍스트 메시지를 포함)를 선택하는 단계; 복수의 각각의 요청들을 다른 시간에 사용자의 클라이언트 디바이스(또는 다수의 사용자들의 다수의 클라이언트 디바이스로)로 전송하는 단계로서, 상기 요청들은 (i) 상기 사용자가 상기 오래된 훈련 데이터 및/또는 상기 새로운 훈련 데이터를 번역하도록 하는 각각의 요청 및 (ii) 번역에 대한 각각의 인센티브를 포함하는, 전송하는 단계; 특정 요청을 전송한 후에, 상기 특정 요청의 상기 오래된 훈련 데이터에 대한 번역을 상기 클라이언트 디바이스로부터 수신하는 단계; 그 수신된 번역과 상기 오래된 훈련 데이터에 대한 정확한 번역을 비교하는 단계; 상기 비교에 기초하여 상기 수신된 번역의 정확도를 결정하는 단계; 및 번역에 기초하여 상기 사용자에 대한 신뢰도 점수를 업데이트하는 단계를 포함한다. 상기 신뢰도 점수는 상기 사용자가 차후에 상기 다른 언어로의 텍스트 메시지의 정확한 번역을 제공할 가능성을 나타낸다.In another aspect, the invention relates to a system comprising a computer readable medium on which instructions are stored, and a data processing apparatus. The data processing apparatus is configured to execute instructions for performing an operation, the operation comprising old training data (e.g., one or more old text messages known to the correct translation to another language) and new Selecting training data (eg, including one or more new text messages whose exact translation to another language is unknown); Sending a plurality of each request to the user's client device (or multiple users' multiple client devices) at different times, the requests comprising (i) the user training the old training data and / or the new training; Transmitting, including each request to translate data and (ii) each incentive for translation; After sending a specific request, receiving a translation of the old training data of the specific request from the client device; Comparing the received translation to the correct translation for the old training data; Determining the accuracy of the received translation based on the comparison; And updating a confidence score for the user based on the translation. The reliability score indicates the likelihood that the user will later provide an accurate translation of the text message to the other language.

특정 실시예에서, 상기 사용자는, 온라인 게임의 참가자이다. 상기 각각의 인센티브는, 예를 들어, 온라인 게임을 위한 가상 재화 및/또는 가상 통화를 포함할 수 있다. 상기 사용자로부터 수신된 번역의 정확도를 결정하는 단계는, (i) 단어 에러율(WER)을 계산하는 단계 및/또는 (ii) BLEU(bilingual evaluation understudy)를 이용하는 단계를 포함할 수 있다. 일부 예에서, 상기 사용자에 대한 신뢰도 점수를 업데이트하는 단계는, 사용자 번역 정확도에서 표준으로부터 편차(deviation from a norm)을 식별하기 위해 아이템 응답 이론을 사용하는 단계를 포함한다. 상기 동작은 또한, 상기 신뢰도 점수가 임계 값 아래로 떨어질 경우 상기 사용자의 번역 특전을 취소하는 단계를 포함할 수 있다.In certain embodiments, the user is a participant in an online game. Each of the incentives may include, for example, virtual goods and / or virtual currencies for online games. Determining the accuracy of the translation received from the user may include (i) calculating a word error rate (WER) and / or (ii) using bilingual evaluation understudy (BLEU). In some examples, updating the confidence score for the user includes using item response theory to identify deviations from a norm in user translation accuracy. The operation may also include canceling the translation privilege of the user when the reliability score falls below a threshold value.

다양한 구현에서, 상기 동작은, 상기 사용자의 번역이 정확한 것으로 결정될 경우 상기 각각의 인센티브를 상기 사용자에게 보상하는 단계를 포함한다. 상기 동작은 또한, 상기 사용자와 제 2 사용자 간의 기존의 관계를 식별함으로써 상기 사용자와 상기 제 2 사용자 간의 충돌을 검출하는 단계를 포함할 수 있다. 일 예에서, 상기 기존의 관계를 식별하는 것은, 상기 사용자와 상기 제 2 사용자 중 적어도 하나의 소셜 네트워크를 분석하는 것을 포함한다. In various implementations, the operation includes compensating the respective incentives to the user if the user's translation is determined to be correct. The operation may also include detecting a conflict between the user and the second user by identifying an existing relationship between the user and the second user. In one example, identifying the existing relationship includes analyzing a social network of at least one of the user and the second user.

다른 측면에서, 발명은, 데이터 처리 장치의 처리 모드를 제어하기 위한 하나 또는 그 이상의 저장 매체에 저장되는 컴퓨터 프로그램 제품과 관련된다. 상기 컴퓨터 프로그램 제품은, 데이터 처리 장치에 의해 실행 가능하고 상기 데이터 처리 장치로 하여금 동작을 수행하도록 하며, 상기 동작은, 오래된 훈련 데이터(예를 들어, 다른 언어로의 정확한 번역이 알려진 하나 또는 그 이상의 오래된 텍스트 메시지를 포함) 및 새로운 훈련 데이터(예를 들어, 다른 언어로의 정확한 번역이 알려져 있지 않은 하나 또는 그 이상의 새로운 텍스트 메시지를 포함)를 선택하는 단계; 복수의 각각의 요청들을 다른 시간에 사용자의 클라이언트 디바이스(또는 다수의 사용자들의 다수의 클라이언트 디바이스로)로 전송하는 단계로서, 상기 요청들은 (i) 상기 사용자가 상기 오래된 훈련 데이터 및/또는 상기 새로운 훈련 데이터를 번역하도록 하는 각각의 요청 및 (ii) 번역에 대한 각각의 인센티브를 포함하는, 전송하는 단계; 특정 요청을 전송한 후에, 상기 특정 요청의 상기 오래된 훈련 데이터에 대한 번역을 상기 클라이언트 디바이스로부터 수신하는 단계; 그 수신된 번역과 상기 오래된 훈련 데이터에 대한 정확한 번역을 비교하는 단계; 상기 비교에 기초하여 상기 수신된 번역의 정확도를 결정하는 단계; 및 번역에 기초하여 상기 사용자에 대한 신뢰도 점수를 업데이트하는 단계를 포함한다. 상기 신뢰도 점수는 상기 사용자가 차후에 상기 다른 언어로의 텍스트 메시지의 정확한 번역을 제공할 가능성을 나타낸다.In another aspect, the invention relates to a computer program product stored on one or more storage media for controlling a processing mode of a data processing device. The computer program product is executable by a data processing device and causes the data processing device to perform an action, the action being one or more of old training data (e.g., accurate translation into another language is known. Selecting old training messages (including old text messages) and new training data (eg, including one or more new text messages whose exact translation to another language is unknown); Sending a plurality of each request to the user's client device (or multiple users' multiple client devices) at different times, the requests comprising (i) the user training the old training data and / or the new training; Transmitting, including each request to translate data and (ii) each incentive for translation; After sending a specific request, receiving a translation of the old training data of the specific request from the client device; Comparing the received translation to the correct translation for the old training data; Determining the accuracy of the received translation based on the comparison; And updating a confidence score for the user based on the translation. The reliability score indicates the likelihood that the user will later provide an accurate translation of the text message to the other language.

일 측면에서, 발명은, 데이터 처리 장치에 의해 구현되는 방법과 관련된다. 그 방법은, 제 1 언어로 된 텍스트 메시지를 획득하는 단계로서, 상기 텍스트 메시지는 적어도 하나의 단어를 포함하고; 상기 텍스트 메시지를 기계 번역 시스템으로 제공하는 단계; 상기 기계 번역 시스템으로부터 상기 텍스트 메시지의 번역을 획득하는 단계; 상기 텍스트 메시지 및 상기 번역 모두, 상기 제 1 언어로 된 상기 적어도 하나의 단어를 포함하고 상기 적어도 하나의 단어가 정확하게 철자에 맞게 쓰여진 것을 결정하는 단계; 및 다음의, (a) 상기 적어도 하나의 단어가 이전 텍스트 메시지들에 출현한 빈도를 결정하는 단계; (b) 상기 적어도 하나의 단어의 이전 및 이후에 출현하는 이웃 단어들에 대한 베이지안 확률을 결정하는 단계; 및 (c) 유의어들을 포함하는 단어들의 클러스터를 식별하기 위해 k-means 클러스터링을 수행하는 단계 중 하나 또는 그 이상을 수행하는 단계를 포함한다. 상기 빈도가 임계 값을 초과할 경우, 상기 베이지안 확률이 임계 값을 초과할 경우, 및/또는 상기 클러스터가 상기 적어도 하나의 단어를 포함할 경우, 상기 방법은 상기 데이터 저장부에 있는 사전에 상기 적어도 하나의 단어를 추가하는 단계를 포함한다. In one aspect, the invention relates to a method implemented by a data processing apparatus. The method comprises: obtaining a text message in a first language, the text message comprising at least one word; Providing the text message to a machine translation system; Obtaining a translation of the text message from the machine translation system; Determining that both the text message and the translation include the at least one word in the first language and that the at least one word is spelled correctly; And next, (a) determining the frequency with which the at least one word appeared in previous text messages; (b) determining a Bayesian probability for neighboring words that appear before and after the at least one word; And (c) performing one or more of the steps of performing k-means clustering to identify a cluster of words containing synonyms. If the frequency exceeds a threshold value, the Bayesian probability exceeds a threshold value, and / or the cluster includes the at least one word, the method includes the at least the dictionary in the data storage unit. And adding a single word.

특정 실시예에서, 상기 적어도 하나의 단어는, 미등록 어휘(out of vocabulary) 단어를 포함하거나 그것일 수 있다. 상기 적어도 하나의 단어는, 새로운 채팅용어 단어를 포함하거나 그것일 수 있다. 상기 방법은, 상기 데이터 저장부에 있는 사전이, 상기 적어도 하나의 단어를 포함하는지 결정하는 단계를 포함할 수 있다. 상기 텍스트 메시지는, 온라인 게임의 플레이어의 클라이언트 디바이스로부터 수신될 수 있다. 다양한 예에서, 상기 사전은, 제 1 언어의 어휘로 된 단어들을 포함하거나 이들로 구성된다. In certain embodiments, the at least one word may include or be an out of vocabulary word. The at least one word may include or be a new chat term word. The method may include determining whether the dictionary in the data storage unit includes the at least one word. The text message can be received from the client device of the player of the online game. In various examples, the dictionary includes or consists of words in a vocabulary of the first language.

상기 베이지안 확률을 결정하는 것은, (i) 이전 텍스트 메시지들에서 상기 적어도 하나의 단어의 이전 사용을 검토하는 것 및 (ii) 상기 이전 텍스트 메시지들에서 상기 적어도 하나의 단어 이전 및 이후에 출현하는 단어들을, 만약에 있다면, 식별하는 것을 포함할 수 있다. 상기 베이지안 확률은, 상기 이웃 단어들이 상기 텍스트 메시지에서 상기 적어도 하나의 단어의 이전 및 이후에 출현할 가능성의 표시를 제공할 수 있다. Determining the Bayesian probability includes: (i) reviewing the previous use of the at least one word in previous text messages and (ii) a word appearing before and after the at least one word in the previous text messages. And, if any, identifying. The Bayesian probability can provide an indication of the likelihood that the neighboring words will appear before and after the at least one word in the text message.

다양한 구현에서, 상기 클러스터를 식별하는 것은, 이전 텍스트 메시지들을 검토하고 상기 텍스트 메시지에서 상기 적어도 하나의 단어와 유사한 문맥으로 사용된 단어들을 식별하는 것을 포함한다. 상기 방법은 또한 상기 텍스트 메시지에 있는 품사들을 결정하기 위해 상기 텍스트 메시지의 구문론 및 의미론을 분석하는 단계를 포함할 수 있다. In various implementations, identifying the cluster includes reviewing previous text messages and identifying words used in context similar to the at least one word in the text message. The method may also include analyzing the syntax and semantics of the text message to determine parts of speech in the text message.

다른 측면에서, 발명은, 명령들이 저장되는 컴퓨터 판독 가능한 매체, 및 데이터 처리 장치를 포함하는 시스템과 관련된다. 상기 데이터 처리 장치는, 동작을 수행하기 위한 명령들을 실행하도록 구성되고, 그 동작은, 제 1 언어로 된 텍스트 메시지를 획득하는 단계로서, 상기 텍스트 메시지는 적어도 하나의 단어를 포함하고; 상기 텍스트 메시지를 기계 번역 시스템으로 제공하는 단계; 상기 기계 번역 시스템으로부터 상기 텍스트 메시지의 번역을 획득하는 단계; 상기 텍스트 메시지 및 상기 번역 모두, 상기 제 1 언어로 된 상기 적어도 하나의 단어를 포함하고 상기 적어도 하나의 단어가 정확하게 철자에 맞게 쓰여진 것을 결정하는 단계; 및 다음의, (a) 상기 적어도 하나의 단어가 이전 텍스트 메시지들에 출현한 빈도를 결정하는 단계; (b) 상기 적어도 하나의 단어의 이전 및 이후에 출현하는 이웃 단어들에 대한 베이지안 확률을 결정하는 단계; 및 (c) 유의어들을 포함하는 단어들의 클러스터를 식별하기 위해 k-means 클러스터링을 수행하는 단계 중 하나 또는 그 이상을 수행하는 단계를 포함한다. 상기 빈도가 임계 값을 초과할 경우, 상기 베이지안 확률이 임계 값을 초과할 경우, 및/또는 상기 클러스터가 상기 적어도 하나의 단어를 포함할 경우, 상기 방법은 상기 데이터 저장부에 있는 사전에 상기 적어도 하나의 단어를 추가하는 단계를 포함한다. In another aspect, the invention relates to a system comprising a computer readable medium on which instructions are stored, and a data processing apparatus. The data processing apparatus is configured to execute instructions for performing an operation, the operation comprising: obtaining a text message in a first language, the text message comprising at least one word; Providing the text message to a machine translation system; Obtaining a translation of the text message from the machine translation system; Determining that both the text message and the translation include the at least one word in the first language and that the at least one word is spelled correctly; And next, (a) determining the frequency with which the at least one word appeared in previous text messages; (b) determining a Bayesian probability for neighboring words that appear before and after the at least one word; And (c) performing one or more of the steps of performing k-means clustering to identify a cluster of words containing synonyms. If the frequency exceeds a threshold value, the Bayesian probability exceeds a threshold value, and / or the cluster includes the at least one word, the method includes the at least the dictionary in the data storage unit. And adding a single word.

특정 실시예에서, 상기 적어도 하나의 단어는, 미등록 어휘(out of vocabulary) 단어를 포함하거나 그것일 수 있다. 상기 적어도 하나의 단어는, 새로운 채팅용어 단어를 포함하거나 그것일 수 있다. 상기 동작은, 상기 데이터 저장부에 있는 사전이, 상기 적어도 하나의 단어를 포함하는지 결정하는 단계를 포함할 수 있다. 상기 텍스트 메시지는, 온라인 게임의 플레이어의 클라이언트 디바이스로부터 수신될 수 있다. 다양한 예에서, 상기 사전은, 제 1 언어의 어휘로 된 단어들을 포함하거나 이들로 구성된다. In certain embodiments, the at least one word may include or be an out of vocabulary word. The at least one word may include or be a new chat term word. The operation may include determining whether the dictionary in the data storage unit includes the at least one word. The text message can be received from the client device of the player of the online game. In various examples, the dictionary includes or consists of words in a vocabulary of the first language.

다양한 구현에서, 상기 클러스터를 식별하는 것은, 이전 텍스트 메시지들을 검토하고 상기 텍스트 메시지에서 상기 적어도 하나의 단어와 유사한 문맥으로 사용된 단어들을 식별하는 것을 포함한다. 상기 동작은 또한 상기 텍스트 메시지에 있는 품사들을 결정하기 위해 상기 텍스트 메시지의 구문론 및 의미론을 분석하는 단계를 포함할 수 있다. In various implementations, identifying the cluster includes reviewing previous text messages and identifying words used in context similar to the at least one word in the text message. The operation may also include analyzing the syntax and semantics of the text message to determine parts of speech in the text message.

다른 측면에서, 발명은, 데이터 처리 장치의 처리 모드를 제어하기 위한 하나 또는 그 이상의 저장 매체에 저장되는 컴퓨터 프로그램 제품과 관련된다. 상기 컴퓨터 프로그램 제품은, 데이터 처리 장치에 의해 실행 가능하고 상기 데이터 처리 장치로 하여금 동작을 수행하도록 하며, 상기 동작은, 제 1 언어로 된 텍스트 메시지를 획득하는 단계로서, 상기 텍스트 메시지는 적어도 하나의 단어를 포함하고; 상기 텍스트 메시지를 기계 번역 시스템으로 제공하는 단계; 상기 기계 번역 시스템으로부터 상기 텍스트 메시지의 번역을 획득하는 단계; 상기 텍스트 메시지 및 상기 번역 모두, 상기 제 1 언어로 된 상기 적어도 하나의 단어를 포함하고 상기 적어도 하나의 단어가 정확하게 철자에 맞게 쓰여진 것을 결정하는 단계; 및 다음의, (a) 상기 적어도 하나의 단어가 이전 텍스트 메시지들에 출현한 빈도를 결정하는 단계; (b) 상기 적어도 하나의 단어의 이전 및 이후에 출현하는 이웃 단어들에 대한 베이지안 확률을 결정하는 단계; 및 (c) 유의어들을 포함하는 단어들의 클러스터를 식별하기 위해 k-means 클러스터링을 수행하는 단계 중 하나 또는 그 이상을 수행하는 단계를 포함한다. 상기 빈도가 임계 값을 초과할 경우, 상기 베이지안 확률이 임계 값을 초과할 경우, 및/또는 상기 클러스터가 상기 적어도 하나의 단어를 포함할 경우, 상기 방법은 상기 데이터 저장부에 있는 사전에 상기 적어도 하나의 단어를 추가하는 단계를 포함한다. In another aspect, the invention relates to a computer program product stored on one or more storage media for controlling a processing mode of a data processing device. The computer program product is executable by a data processing device and causes the data processing device to perform an operation, wherein the operation comprises obtaining a text message in a first language, wherein the text message is at least one Contains words; Providing the text message to a machine translation system; Obtaining a translation of the text message from the machine translation system; Determining that both the text message and the translation include the at least one word in the first language and that the at least one word is spelled correctly; And next, (a) determining the frequency with which the at least one word appeared in previous text messages; (b) determining a Bayesian probability for neighboring words that appear before and after the at least one word; And (c) performing one or more of the steps of performing k-means clustering to identify a cluster of words containing synonyms. If the frequency exceeds a threshold value, the Bayesian probability exceeds a threshold value, and / or the cluster includes the at least one word, the method includes the at least the dictionary in the data storage unit. And adding a single word.

일 측면에서, 발명은, 데이터 처리 장치에 의해 구현되는 방법과 관련된다. 그 방법은, (a) 다중 사용자 채팅 세션으로부터 텍스트 메시지들의 히스토리의 일부를 검토하기 위한 요청을 수신하는 단계로서, 상기 히스토리는 복수의 텍스트 메시지를 포함하고, 각 텍스트 메시지는 각각의 언어로 되어 있고 각각의 채팅 세션 참가자로부터 발신된, 단계; (b) 복수의 병렬 프로세스를 수행하는 단계로서, 각 병렬 프로세스는 (i) 상기 텍스트 메시지들의 히스토리의 일부로부터 다른 각각의 텍스트 메시지를 선택하고, 및 (ii) 그 선택된 텍스트 메시지를 타겟 언어로 번역하는 것을 포함하는 단계; (c) 상기 복수의 병렬 프로세스로부터의 상기 번역된 텍스트 메시지를 사용자의 클라이언트 디바이스로 제공하는 단계; (d) 상기 텍스트 메시지들의 히스토리의 다른 일부를 검토하기 위한 요청을 수신하는 단계; 및 (e) 상기 텍스트 메시지의 히스토리의 다른 일부에 대해 상기 단계 (b) 및 단계 (c)를 반복하는 단계를 포함한다.In one aspect, the invention relates to a method implemented by a data processing apparatus. The method comprises: (a) receiving a request to review a portion of the history of text messages from a multi-user chat session, the history comprising a plurality of text messages, each text message being in a respective language A step originated from each chat session participant; (b) performing a plurality of parallel processes, each parallel process selecting (i) each other text message from a portion of the history of the text messages, and (ii) translating the selected text message to a target language. Step comprising; (c) providing the translated text message from the plurality of parallel processes to a user's client device; (d) receiving a request to review another portion of the history of text messages; And (e) repeating steps (b) and (c) for another part of the history of the text message.

특정 실시예에서, 상기 다른 각각의 텍스트 메시지를 선택하는 것은, 상기 텍스트 메시지들의 히스토리의 일부에 대해 저장 디바이스로 질의하는 것을 포함한다. 상기 선택된 텍스트 메시지를 번역하는 것은, 상기 텍스트 메시지의 적어도 일 부분을 채팅용어에서 분명한 용어로 변환하는 것을 포함할 수 있다. 일부 구현에서, 상기 방법은, 상기 텍스트 메시지들의 히스토리의 검토를 중단하는 요청을 상기 사용자로부터 수신하는 단계를 포함한다. 상기 복수의 병렬 프로세스는, 각 채팅 세션 참가자에 대해 하나의 프로세스를 포함할 수 있다. 대안으로 또는 추가적으로, 상기 복수의 병렬 프로세스는, 상기 각 채팅 세션에서 사용되는 각 언어에 대해 하나의 프로세스를 포함할 수 있다. In certain embodiments, selecting each of the other text messages includes querying a storage device for a portion of the history of the text messages. Translating the selected text message may include converting at least a portion of the text message from chat terminology to a clear term. In some implementations, the method includes receiving a request from the user to stop reviewing the history of the text messages. The plurality of parallel processes may include one process for each chat session participant. Alternatively or additionally, the plurality of parallel processes may include one process for each language used in each chat session.

다른 측면에서, 발명은, 명령들이 저장되는 컴퓨터 판독 가능한 매체, 및 데이터 처리 장치를 포함하는 시스템과 관련된다. 상기 데이터 처리 장치는, 동작을 수행하기 위한 명령들을 실행하도록 구성되고, 상기 동작은, (a) 다중 사용자 채팅 세션으로부터 텍스트 메시지들의 히스토리의 일부를 검토하기 위한 요청을 수신하는 단계로서, 상기 히스토리는 복수의 텍스트 메시지를 포함하고, 각 텍스트 메시지는 각각의 언어로 되어 있고 각각의 채팅 세션 참가자로부터 발신된, 단계; (b) 복수의 병렬 프로세스를 수행하는 단계로서, 각 병렬 프로세스는 (i) 상기 텍스트 메시지들의 히스토리의 일부로부터 다른 각각의 텍스트 메시지를 선택하고, 및 (ii) 그 선택된 텍스트 메시지를 타겟 언어로 번역하는 것을 포함하는 단계; (c) 상기 복수의 병렬 프로세스로부터의 상기 번역된 텍스트 메시지를 사용자의 클라이언트 디바이스로 제공하는 단계; (d) 상기 텍스트 메시지들의 히스토리의 다른 일부를 검토하기 위한 요청을 수신하는 단계; 및 (e) 상기 텍스트 메시지의 히스토리의 다른 일부에 대해 상기 단계 (b) 및 단계 (c)를 반복하는 단계를 포함한다.In another aspect, the invention relates to a system comprising a computer readable medium on which instructions are stored, and a data processing apparatus. The data processing apparatus is configured to execute instructions for performing an operation, the operation comprising: (a) receiving a request to review a portion of the history of text messages from a multi-user chat session, the history A step comprising a plurality of text messages, each text message being in a respective language and originating from each chat session participant; (b) performing a plurality of parallel processes, each parallel process selecting (i) each other text message from a portion of the history of the text messages, and (ii) translating the selected text message to a target language. Step comprising; (c) providing the translated text message from the plurality of parallel processes to a user's client device; (d) receiving a request to review another portion of the history of text messages; And (e) repeating steps (b) and (c) for another part of the history of the text message.

특정 실시예에서, 상기 다른 각각의 텍스트 메시지를 선택하는 것은, 상기 텍스트 메시지들의 히스토리의 일부에 대해 저장 디바이스로 질의하는 것을 포함한다. 상기 선택된 텍스트 메시지를 번역하는 것은, 상기 텍스트 메시지의 적어도 일 부분을 채팅용어에서 분명한 용어로 변환하는 것을 포함할 수 있다. 일부 구현에서, 상기 동작은, 상기 텍스트 메시지들의 히스토리의 검토를 중단하는 요청을 상기 사용자로부터 수신하는 단계를 포함한다. 상기 복수의 병렬 프로세스는, 각 채팅 세션 참가자에 대해 하나의 프로세스를 포함할 수 있다. 대안으로 또는 추가적으로, 상기 복수의 병렬 프로세스는, 상기 각 채팅 세션에서 사용되는 각 언어에 대해 하나의 프로세스를 포함할 수 있다. In certain embodiments, selecting each of the other text messages includes querying a storage device for a portion of the history of the text messages. Translating the selected text message may include converting at least a portion of the text message from chat terminology to a clear term. In some implementations, the operation includes receiving a request from the user to stop reviewing the history of the text messages. The plurality of parallel processes may include one process for each chat session participant. Alternatively or additionally, the plurality of parallel processes may include one process for each language used in each chat session.

다른 측면에서, 발명은, 데이터 처리 장치의 처리 모드를 제어하기 위한 하나 또는 그 이상의 저장 매체에 저장되는 컴퓨터 프로그램 제품에 관련된다. 상기 컴퓨터 프로그램 제품은, 데이터 처리 장치에 의해 실행 가능하고 상기 데이터 처리 장치로 하여금 동작을 수행하도록 하며, 상기 동작은, (a) 다중 사용자 채팅 세션으로부터 텍스트 메시지들의 히스토리의 일부를 검토하기 위한 요청을 수신하는 단계로서, 상기 히스토리는 복수의 텍스트 메시지를 포함하고, 각 텍스트 메시지는 각각의 언어로 되어 있고 각각의 채팅 세션 참가자로부터 발신된, 단계; (b) 복수의 병렬 프로세스를 수행하는 단계로서, 각 병렬 프로세스는 (i) 상기 텍스트 메시지들의 히스토리의 일부로부터 다른 각각의 텍스트 메시지를 선택하고, 및 (ii) 그 선택된 텍스트 메시지를 타겟 언어로 번역하는 것을 포함하는 단계; (c) 상기 복수의 병렬 프로세스로부터의 상기 번역된 텍스트 메시지를 사용자의 클라이언트 디바이스로 제공하는 단계; (d) 상기 텍스트 메시지들의 히스토리의 다른 일부를 검토하기 위한 요청을 수신하는 단계; 및 (e) 상기 텍스트 메시지의 히스토리의 다른 일부에 대해 상기 단계 (b) 및 단계 (c)를 반복하는 단계를 포함한다.In another aspect, the invention relates to a computer program product stored on one or more storage media for controlling the processing mode of the data processing apparatus. The computer program product is executable by a data processing device and causes the data processing device to perform an operation, the operation comprising: (a) requesting to review a portion of the history of text messages from a multi-user chat session. Receiving, wherein the history includes a plurality of text messages, each text message being in a respective language and originating from each chat session participant; (b) performing a plurality of parallel processes, each parallel process selecting (i) each other text message from a portion of the history of the text messages, and (ii) translating the selected text message to a target language. Step comprising; (c) providing the translated text message from the plurality of parallel processes to a user's client device; (d) receiving a request to review another portion of the history of text messages; And (e) repeating steps (b) and (c) for another part of the history of the text message.

일 측면에서, 발명은, 데이터 처리 장치에 의해 구현되는 방법과 관련된다. 상기 방법은, 온라인 게임의 복수의 사용자에게 텍스트 메시지 채팅 시스템(예를 들어, 온라인 게임)을 제공하는 단계; 상기 텍스트 메시지 채팅 시스템의 제 1 사용자로부터 상기 텍스트 메시지 채팅 시스템의 제 2 사용자를 차단하기 위한 요청을 수신하는 단계; 및 상기 요청의 수신에 이어서, 상기 제 2 사용자로부터의 텍스트 메시지들이 상기 제 1 사용자에게 디스플레이되는 것을 차단하는 단계;를 포함한다. In one aspect, the invention relates to a method implemented by a data processing apparatus. The method includes providing a text message chat system (eg, an online game) to a plurality of users of the online game; Receiving a request to block a second user of the text message chat system from a first user of the text message chat system; And following receipt of the request, blocking text messages from the second user from being displayed to the first user.

특정 실시예에서, 상기 요청의 수신에 이어서, 상기 방법은, 상기 텍스트 메시지 채팅 시스템을 이용한 채팅 세션에 참여하게 하는 상기 제 2 사용자로부터 상기 제 1 사용자로의 미래의 초대들을 차단하는 단계를 포함한다. 상기 방법은, 상기 제 2 사용자의 차단을 해제하기 위한 제 2 요청을 상기 제 1 사용자로부터 수신하는 단계를 포함할 수 있다. 일부 예에서, 상기 제 2 요청의 수신에 이어서, 상기 방법은, 상기 제 2 사용자로부터의 텍스트 메시지가 상기 제 1 사용자에게 디스플레이되는 것을 허용하는 단계를 포함한다. 상기 제 2 요청의 수신에 이어서, 상기 방법은, 상기 텍스트 메시지 채팅 시스템을 이용한 채팅 세션에 참여하게 하는 상기 제 2 사용자로부터 상기 제 1 사용자로 전송되는 미래의 초대들을 허용하는 단계를 포함할 수 있다. In a particular embodiment, following receipt of the request, the method includes blocking future invitations from the second user to the first user to participate in a chat session using the text message chat system. . The method may include receiving a second request from the first user to release the blocking of the second user. In some examples, following receipt of the second request, the method includes allowing a text message from the second user to be displayed to the first user. Following receipt of the second request, the method may include allowing future invitations sent from the second user to the first user to participate in a chat session using the text message chat system. .

일부 실시예에서, 상기 복수의 사용자는, 온라인 게임의 동맹을 포함하거나 그것을 정의한다. 상기 방법은, 상기 텍스트 메시지 채팅 시스템의 텍스트 메시지의 적어도 일 부분을 제 1 언어에서 제 2 언어로 번역하는 단계를 포함할 수 있다. 상기 방법은, 또한 상기 텍스트 메시지의 적어도 일 부분을 채팅용어에서 분명한 용어로 변환하는 단계를 포함할 수 있다. 일부 실시예에서, 번역 및/또는 변환은, 병렬 프로세스들을 포함하거나 활용할 수 있다. 예를 들어, 상기 병렬 프로세스들은, 상기 채팅 시스템의 상기 복수의 사용자 각각에 대한 하나의 프로세스(또는 사용자들에 의해 사용되는 각 언어에 대한 하나의 프로세스)를 포함하거나 활용할 수 있다. In some embodiments, the plurality of users includes or defines an alliance of online games. The method may include translating at least a portion of the text message of the text message chat system from a first language to a second language. The method may also include converting at least a portion of the text message from chatspeak to a clear term. In some embodiments, translation and / or conversion may include or utilize parallel processes. For example, the parallel processes may include or utilize one process for each of the plurality of users of the chat system (or one process for each language used by users).

다른 측면에서, 발명은, 명령들이 저장되는 컴퓨터 판독 가능한 매체, 및 데이터 처리 장치를 포함하는 시스템과 관련된다. 상기 데이터 처리 장치는, 동작을 수행하기 위한 명령들을 실행하도록 구성되고, 상기 동작은, 온라인 게임의 복수의 사용자에게 텍스트 메시지 채팅 시스템(예를 들어, 온라인 게임)을 제공하는 단계; 상기 텍스트 메시지 채팅 시스템의 제 1 사용자로부터 상기 텍스트 메시지 채팅 시스템의 제 2 사용자를 차단하기 위한 요청을 수신하는 단계; 및 상기 요청의 수신에 이어서, 상기 제 2 사용자로부터의 텍스트 메시지들이 상기 제 1 사용자에게 디스플레이되는 것을 차단하는 단계;를 포함한다. In another aspect, the invention relates to a system comprising a computer readable medium on which instructions are stored, and a data processing apparatus. The data processing apparatus is configured to execute instructions for performing an operation, the operation comprising: providing a text message chat system (eg, an online game) to a plurality of users of the online game; Receiving a request to block a second user of the text message chat system from a first user of the text message chat system; And following receipt of the request, blocking text messages from the second user from being displayed to the first user.

특정 실시예에서, 상기 요청의 수신에 이어서, 상기 동작은, 상기 텍스트 메시지 채팅 시스템을 이용한 채팅 세션에 참여하게 하는 상기 제 2 사용자로부터 상기 제 1 사용자로의 미래의 초대들을 차단하는 단계를 포함한다. 상기 동작은, 상기 제 2 사용자의 차단을 해제하기 위한 제 2 요청을 상기 제 1 사용자로부터 수신하는 단계를 포함할 수 있다. 일부 예에서, 상기 제 2 요청의 수신에 이어서, 상기 동작은, 상기 제 2 사용자로부터의 텍스트 메시지가 상기 제 1 사용자에게 디스플레이되는 것을 허용하는 단계를 포함한다. 상기 제 2 요청의 수신에 이어서, 상기 동작은, 상기 텍스트 메시지 채팅 시스템을 이용한 채팅 세션에 참여하게 하는 상기 제 2 사용자로부터 상기 제 1 사용자로 전송되는 미래의 초대들을 허용하는 단계를 포함할 수 있다. In a particular embodiment, following receipt of the request, the operation includes blocking future invitations from the second user to the first user to participate in a chat session using the text message chat system. . The operation may include receiving a second request from the first user to release the blocking of the second user. In some examples, following receipt of the second request, the operation includes allowing a text message from the second user to be displayed to the first user. Following receipt of the second request, the operation may include allowing future invitations sent from the second user to the first user to participate in a chat session using the text message chat system. .

일부 실시예에서, 상기 복수의 사용자는, 온라인 게임의 동맹을 포함하거나 정의한다. 상기 동작은, 상기 텍스트 메시지 채팅 시스템의 텍스트 메시지의 적어도 일 부분을 제 1 언어에서 제 2 언어로 번역하는 단계를 포함할 수 있다. 상기 동작은 또한, 또한 상기 텍스트 메시지의 적어도 일 부분을 채팅용어에서 분명한 용어로 변환하는 단계를 포함할 수 있다. 일부 실시예에서, 번역 및/또는 변환은, 병렬 프로세스들을 포함하거나 활용할 수 있다. 예를 들어, 상기 병렬 프로세스들은, 상기 채팅 시스템의 상기 복수의 사용자 각각에 대한 하나의 프로세스(또는 사용자들에 의해 사용되는 각 언어에 대한 하나의 프로세스)를 포함하거나 활용할 수 있다. In some embodiments, the plurality of users includes or defines an alliance of online games. The operation may include translating at least a portion of the text message of the text message chat system from a first language to a second language. The operation may also include converting at least a portion of the text message from chatspeak to a clear term. In some embodiments, translation and / or conversion may include or utilize parallel processes. For example, the parallel processes may include or utilize one process for each of the plurality of users of the chat system (or one process for each language used by users).

다른 측면에서, 발명은, 데이터 처리 장치의 처리 모드를 제어하기 위한 하나 또는 그 이상의 저장 매체에 저장되는 컴퓨터 프로그램 제품과 관련된다. 상기 컴퓨터 프로그램 제품은, 데이터 처리 장치에 의해 실행 가능하고 상기 데이터 처리 장치로 하여금 동작을 수행하도록 하며, 상기 동작은, 온라인 게임의 복수의 사용자에게 텍스트 메시지 채팅 시스템(예를 들어, 온라인 게임)을 제공하는 단계; 상기 텍스트 메시지 채팅 시스템의 제 1 사용자로부터 상기 텍스트 메시지 채팅 시스템의 제 2 사용자를 차단하기 위한 요청을 수신하는 단계; 및 상기 요청의 수신에 이어서, 상기 제 2 사용자로부터의 텍스트 메시지들이 상기 제 1 사용자에게 디스플레이되는 것을 차단하는 단계;를 포함한다. In another aspect, the invention relates to a computer program product stored on one or more storage media for controlling a processing mode of a data processing device. The computer program product is executable by a data processing device and causes the data processing device to perform an operation, wherein the operation provides a text message chat system (eg, an online game) to a plurality of users of the online game. Providing steps; Receiving a request to block a second user of the text message chat system from a first user of the text message chat system; And following receipt of the request, blocking text messages from the second user from being displayed to the first user.

발명의 특정 측면에 관해 설명된 실시예들의 요소들은, 발명의 다른 측면의 다양한 실시예에서 사용될 수 있다. 예를 들어, 하나의 독립 청구항을 인용하는 종속 청구항들의 특징은, 다른 독립 청구항들 중 일부의 장치들 및/또는 방법들에서 사용될 수 있는 것을 생각할 수 있다.Elements of the embodiments described with respect to certain aspects of the invention may be used in various embodiments of other aspects of the invention. For example, it is contemplated that the features of the dependent claims, citing one independent claim, may be used in some devices and / or methods of the other independent claims.

다양한 실시예들의 다른 특징(features)과 측면(aspects)들은, 실시예들의 특징을 예시적으로 나타낸 첨부된 도면과 연계된 다음의 상세한 설명에 의하여 명백해질 것이다. Other features and aspects of the various embodiments will become apparent by the following detailed description in conjunction with the accompanying drawings, which illustrate the features of the embodiments.

본 발명에 따르면, 복수의 사용자들 사이의 다중 언어 통신에서 번역의 정확도를 높일 수 있다. According to the present invention, it is possible to increase the accuracy of translation in multi-language communication between a plurality of users.

다양한 실시예들은 다음의 도면과 함께 상세히 설명된다. 도면들은 오직 예시적으로 나타내기 위한 목적으로서, 어떤 실시예들을 묘사히기 위한 것은 아니다. 도면들은 실시예들의 폭(breadth), 범위(scope) 또는 적용가능성(applicability)을 제한하는 것으로 고려되어서는 아니 된다.
도 1은 다양한 실시예에 따른 다중 언어 통신 시스템을 활용하는 예시적인 환경을 나타내는 블록도이다.
도 2는 다양한 실시예에 따른 예시적인 통신 변환 및 번역 시스템을 나타내는 블록도이다.
도 3은 다양한 실시예에 따른 예시적인 변환 모듈을 나타내는 블록도이다.
도 4는 다양한 실시예에 따른 예시적인 채팅 클라이언트 시스템을 나타내는 블록도이다.
도 5는 다양한 실시예에 따른 다중 언어 통신의 예시적인 방법을 나타내는 순서도이다.
도 6은 다양한 실시예에 따른 통신을 변환하는 예시적인 방법을 나타내는 순서도이다.
도 7은 다양한 실시예에 따른 채팅 클라이언트 시스템 사이의 예시적인 다중 언어 채팅 세션을 나타내는 도면이다.
도 8은 다양한 실시예에 따른 예시적인 다중 언어 통신 방법의 동작을 나타내는 순서도이다.
도 9는 다양한 실시예에 따른 예시적인 다중 언어 통신 방법의 동작을 나타내는 순서도이다.
도 10은 다양한 실시예에 따른 예시적인 다중 언어 통신 방법의 동작을 나타내는 순서도이다.
도 11은 다양한 실시예에 따른 예시적인 다중 언어 통신 방법의 동작을 나타내는 순서도이다.
도 12는 다양한 실시예에 따른 예시적인 디지털 장치를 나타내는 블록도이다.
도 13은 다양한 실시예에 따른 예시적인 사용자 피드백 시스템을 나타내는 블록도이다.
도 14는 다양한 실시예에 따른 예시적인 사용자 피드백 시스템을 나타내는 블록도이다.
도 15는 다양한 실시예에 따른 예시적인 사용자 피드백 방법을 나타내는 순서도이다.
도 16은 다양한 실시예에 따른 예시적인 사용자 피드백 시스템의 데이터 흐름을 나타내는 블록도이다.
도 17은 다양한 실시예에 따른 단어에 대한 사용자 피드백을 수신하는 예를 나타내는 스크린샷이다.
도 18은 다양한 실시예에 따른 사용자 피드백을 스킵하는 예를 나타내는 스크린샷이다.
도 19는 다양한 실시예에 따른 구절에 대한 사용자 피드백을 수신하는 예를 나타내는 스크린샷이다.
도 20은 다양한 실시예에 따른 선택-폼 응답의 리스트를 통해 사용자 피드백을 수신하는 예를 나타내는 스크린샷이다.
도 21은 다양한 실시예에 따른 선택-폼 응답의 리스트를 생성하는 예를 나타내는 스크린샷이다.
도 22는 다양한 실시예에 따른 예시적인 인센티브 알림을 나타내는 스크린샷이다.
도 23은 다양한 실시예에 따른 클라이언트 채팅 시스템 사이에서 번역이 실패할 때의 예를 나타내는 스크린샷이다.
도 24 및 25는 다양한 실시예에 따른 사용자 피드백에 대해 이용 가능한 단어 또는 구절의 예시적인 리스트를 나타내는 스크린샷이다.
도 26은 다양한 실시예에 따른 단어를 정의하는 예를 나타내는 스크린샷이다.
도 27은 다양한 실시예에 따른 선택-폼 응답의 리스트의 예를 나타내는 스크린샷이다.
도 28은 다양한 실시예에 따른 제출된 응답에 대한 상태의 리스트의 예를 나타내는 스크린샷이다.
도 29는 다양한 실시예에 따른 인센티브 알림의 예를 나타내는 스크린샷이다.
도 30은 다양한 실시예에 따른 언어 말뭉치에서 판독할 수 없는 구절을 검출하는 방법에 대한 순서도이다.
도 31A는 다양한 실시예에 따른 사기 검출 모듈의 개략도이다.
도 31B는 다양한 실시예에 따른 인센티브화된 번역에서 사기를 검출하는 방법의 순서도이다.
도 32는 다양한 실시예에 따른 그룹 채팅 환경에서 음성-텍스트 변환 및 번역을 수행하는 시스템의 개략도이다.
도 33A는 다양한 실시예에 따른 채팅 히스토리를 변환 및/또는 번역하기 위한 채팅 히스토리 모듈의 개략도이다.
도 33B는 다양한 실시예에 따른 채팅 히스토리를 변환 및/또는 번역하기 위한 방법의 순서도이다.
도 34A는 다양한 실시예에 따른 채팅 세션의 하나 또는 그 이상의 사용자를 차단하기 위한 사용자 인터페이스의 스크린샷을 포함한다.
도 34B는 다양한 실시예에 따른 채팅 세션의 하나 또는 그 이상의 사용자를 차단 해제하기 위한 사용자 인터페이스의 스크린샷을 포함한다.
도 35는 다양한 실시예에 따른 채팅 세션의 하나 또는 그 이상의 사용자를 차단하기 위한 방법의 순서도를 포함한다.
도 36A 내지 36F는 다양한 실시예에 따른 원본 메시지의 언어 번역을 사용자가 정정하도록 허용하는 사용자 인터페이스의 스크린샷을 포함한다.
도 37A 및 37B는 다양한 실시예에 따른, 가능한 보상에 대한 대가로, 다른 사용자들에 의해 제출된 번역 정정을 사용자가 판정하는 것을 허용하는 사용자 인터페이스의 스크린샷을 포함한다.
도 38은 다양한 실시예에 따른 번역의 정확도를 평가하는 번역 평가 모듈의 개략도이다.
도 39는 다양한 실시예에 따른 번역의 정확도를 평가하는 방법의 흐름도이다.Various embodiments are described in detail with the following drawings. The drawings are for illustrative purposes only and are not intended to depict certain embodiments. The drawings should not be considered as limiting the breadth, scope or applicability of the embodiments.
1 is a block diagram illustrating an exemplary environment utilizing a multi-language communication system according to various embodiments.
2 is a block diagram illustrating an exemplary communication conversion and translation system in accordance with various embodiments.
3 is a block diagram illustrating an exemplary conversion module in accordance with various embodiments.
4 is a block diagram illustrating an exemplary chat client system according to various embodiments.
5 is a flow chart illustrating an exemplary method of multilingual communication in accordance with various embodiments.
6 is a flow chart illustrating an example method for transforming communication in accordance with various embodiments.
7 is a diagram illustrating an exemplary multi-language chat session between chat client systems in accordance with various embodiments.
8 is a flow chart illustrating the operation of an exemplary multi-language communication method according to various embodiments.
9 is a flow chart illustrating the operation of an exemplary multi-lingual communication method in accordance with various embodiments.
10 is a flowchart illustrating the operation of an exemplary multi-language communication method according to various embodiments.
11 is a flowchart illustrating operation of an exemplary multi-lingual communication method according to various embodiments.
12 is a block diagram illustrating an exemplary digital device according to various embodiments.
13 is a block diagram illustrating an exemplary user feedback system according to various embodiments.
14 is a block diagram illustrating an exemplary user feedback system according to various embodiments.
15 is a flowchart illustrating an exemplary user feedback method according to various embodiments.
16 is a block diagram illustrating data flow of an exemplary user feedback system according to various embodiments.
17 is a screenshot illustrating an example of receiving user feedback for a word according to various embodiments.
18 is a screenshot illustrating an example of skipping user feedback according to various embodiments.
19 is a screenshot illustrating an example of receiving user feedback for a phrase according to various embodiments.
20 is a screenshot illustrating an example of receiving user feedback through a list of select-form responses according to various embodiments.
21 is a screenshot illustrating an example of generating a list of select-form responses according to various embodiments.
22 is a screen shot showing an exemplary incentive notification according to various embodiments.
23 is a screen shot showing an example of when a translation fails between client chat systems according to various embodiments.
24 and 25 are screen shots showing an exemplary list of words or phrases available for user feedback according to various embodiments.
26 is a screenshot illustrating an example of defining a word according to various embodiments.
27 is a screenshot showing an example of a list of select-form responses according to various embodiments.
28 is a screen shot showing an example of a list of statuses for submitted responses according to various embodiments.
29 is a screen shot showing an example of an incentive notification according to various embodiments of the present disclosure.
30 is a flowchart of a method of detecting an unreadable phrase in a language corpus according to various embodiments.
31A is a schematic diagram of a fraud detection module according to various embodiments.
31B is a flow chart of a method for detecting fraud in an incentivized translation according to various embodiments.
32 is a schematic diagram of a system for performing speech-to-text conversion and translation in a group chat environment according to various embodiments.
33A is a schematic diagram of a chat history module for converting and / or translating chat history according to various embodiments.
33B is a flowchart of a method for converting and / or translating chat history according to various embodiments.
34A includes screenshots of a user interface for blocking one or more users of a chat session according to various embodiments.
34B includes screenshots of a user interface for unblocking one or more users of a chat session according to various embodiments.
35 includes a flowchart of a method for blocking one or more users of a chat session according to various embodiments.
36A-36F include screenshots of a user interface that allows a user to correct a language translation of an original message according to various embodiments.
37A and 37B include screenshots of a user interface allowing the user to determine translation corrections submitted by other users in exchange for possible rewards, in accordance with various embodiments.
38 is a schematic diagram of a translation evaluation module for evaluating the accuracy of translation according to various embodiments.
39 is a flowchart of a method for evaluating the accuracy of translation according to various embodiments.

여기서 기술되는 다양한 실시예들은, 다중 언어 통신의 구현에 관한 것이다. 일부 실시예에 의한 시스템 및 방법은, 예를 들어, 인터넷 기반의 채팅(예를 들어, 애플®의 iMessage, Windows® Live Messenger 등), 이메일(예를 들어, 임베디드 포럼 메시징, Yahoo® 메일, RFC 5322 등), 텍스트 기반의 모바일 폰 통신(예를 들어, SMS 메시지나 MMS 메시지), 온라인 포럼에 대한 포스팅(예를들어, 웹 기반의 취미 포럼에 대한 포스팅), 온라인 소셜 미디어 서비스(online social media service)에 대한 포스팅(예를 들어, Twitter®, Facebook® 등) 등을 포함하는 서로 다른 통신 모드들을 통하여, 다중 언어 통신을 가능하게 할 수 있다. 특정 실시예들은, 또한 과거에 일어난 통신 또는 대화의 트랜스크립트(transcripts)를 번역하는데 사용될 수 있다(예를 들어, 증언 녹취록(deposition transcripts)이나 채팅 히스토리(chat history)). 다양한 실시예들은 텍스트 내의 하나 또는 그 이상의 다음: 전문화된/분야에 한정된 용어(예를 들어, 채팅용어(chatspeak)), 약어(abbreiviations), 두음자어(acronyms), 고유명사(proper noun), 보통명사(common noun), 지소사(diminutives), 구어체(colloquial) 단어 또는 구절, 및 비속어(profane) 단어 또는 구절을, 다루거나/수용하면서, 2개 또는 그 이상의 언어(예를 들어, 발화된(spoken)) 사이의 텍스트를 번역하는 통신 시스템 및 방법을 구현할 수 있다. 예를 들어, 여기서 설명되는 일부 시스템 및 방법은, 서로 다른 외국의 언어로 채팅하는 사용자들을 가질 수 있는, 대규모 멀티플레이어 온라인(MMO, Massive-Multiplayer Online) 게임과 같은, 채팅 시스템을 연결하는데 활용될 수 있다. 특정 실시예들을 통하여, 2 또는 그 이상의 사용자들 사이의 채팅 대화(chat dialogue)는, 투명하게(transparently) 번역되고, 각각의 사용자에게, 그들의 각각의 모국어(native language) 또는 선택된 언어로 제공될 수 있다. 추가적으로, 다중 계층(multi-tiered)/다중 모듈(multi-module) 변환 프로세스의 사용을 통하여, 특정 실시예에서는, 통상적인 번역 시스템 단독으로 가능한 것에 비하여, 2 또는 그 이상의 사용자들 사이의 통신에 대한 빠른 번역을 가능하게 할 수 있다(예를 들어, 수 마이크로세컨드(microseconds) 사이의 번역).Various embodiments described herein relate to implementations of multilingual communication. Systems and methods according to some embodiments include, for example, Internet-based chat (eg, Apple® iMessage, Windows® Live Messenger, etc.), email (eg, embedded forum messaging, Yahoo® mail, RFC 5322, etc.), text-based mobile phone communication (e.g., SMS messages or MMS messages), posting to online forums (e.g., posting to web-based hobby forums), online social media services (online social media) It is possible to enable multi-language communication through different communication modes including posting to a service (eg, Twitter®, Facebook®, etc.). Certain embodiments may also be used to translate transcripts of communications or conversations that have occurred in the past (eg, deposition transcripts or chat history). Various embodiments include one or more of the following in the text: specialized / sector-specific terms (eg, chatspeak), abbreviations, acronyms, proper noun, usually Two or more languages (e.g. spoken), dealing with / accepting common nouns, diminutives, colloquial words or phrases, and profane words or phrases )) Can implement a communication system and method for translating text between. For example, some of the systems and methods described herein can be utilized to connect chat systems, such as Massive-Multiplayer Online (MMO) games, which can have users chatting in different foreign languages. Can. Through certain embodiments, a chat dialogue between two or more users is translated transparently and can be provided to each user in their respective native language or selected language. have. Additionally, through the use of a multi-tiered / multi-module conversion process, in certain embodiments, for communication between two or more users, compared to what is possible with a conventional translation system alone. Fast translations may be possible (eg, translations between several microseconds).

일부 실시예에 의하면, 시스템 또는 방법은, 영어(English)와 같은 제1 언어의 채팅용어(chatspeak)로부터, 불어(French)와 같은 제2 언어의 채팅용어로의 번역을 수행할 수 있다. 다른 예에서는, 시스템 또는 방법은, 제2 언어(예를 들어, 불어)로의 번역을 시도하기 전에, 제1 언어(예를 들어, 영어)의 채팅용어(chatspeak)로부터 제1 언어(예를 들어, 영어)의 공식적인 용어(formal speak)로의 변환을 수행할 수 있다. 어떤 실시예에서는, 인간 운영자에 의하여 수동으로 입력된 번역, 또는 번역 시스템에 의하여 이전에 수행된 것에 기반하는 번역(예를 들어, 실시예에 따라 수행된 역사적인 번역(historical translations))이 저장되는 데이터 저장부(예를 들어, 번역 캐시(cache))로 먼저 질의하여, 이러한 텍스트 번역을 달성할 수 있다. 실시예에서는, 정확한 텍스트 번역에 적합하도록, 텍스트의 하나 또는 그 이상의 부분에 대한 변환(예를 들어, 텍스트에 포함된 하나 또는 그 이상의 다음에 대한 처리(process): 채팅용어, 두음자어, 약어, 고유명사, 보통명사, 구어체 및 비속어)을 시도할 수 있다. 예를 들어, 특정 실시예에서는, 다른 언어들 사이에서의 (현재 또는 과거의) 관용적인(idiomatic) 언어 사용을 처리하기 위하여, 주어진 텍스트를 변환할 수 있다. 실시예들은, 텍스트의 일 부분에 대한 변환 후, 상기 데이터 저장부로 질의를 재시도할 수 있다. 이러한 번역 검색 재시도(translate lookup attempt)가 실패하면, 실시예에서는 상기 (이미 변환되었을 수 있는) 텍스트를 기계 번역 서비스(예를 들어, 제3자, Google®translate과 같은 클라우드 기반의 번역 시스템)를 이용하여 번역하는 것을 시도할 수 있다.According to some embodiments, the system or method may perform translation from chatspeak in a first language, such as English, to chat language in a second language, such as French. In other examples, the system or method may include a first language (eg, English) chatspeak from a first language (eg, English) before attempting to translate into a second language (eg, French). , English) to formal speak. In some embodiments, data in which translations manually entered by a human operator, or translations based on previously performed by a translation system (e.g., historical translations performed according to embodiments) are stored. This text translation can be achieved by first querying the storage (eg, translation cache). In embodiments, conversion to one or more parts of the text (e.g., processing of one or more of the following contained in the text: chat terms, acronyms, abbreviations, abbreviations, etc.), suitable for accurate text translation You can try proper nouns, common nouns, colloquial and profanity). For example, in certain embodiments, given text can be transformed to handle (current or past) idiomatic language use between different languages. Embodiments may retry the query to the data storage after conversion for a portion of text. If such a translation lookup attempt fails, in embodiments the text (which may have already been translated) is translated into machine translation services (e.g., third parties, cloud-based translation systems such as Google®translate). You can try to translate using.

실시예는, 결과적으로 생성되는 텍스트의 번역을 보다 정제하기 위하여, 공식적인 텍스트(formal text)의 번역된 부분을, 새로운 언어의 채팅용어로 변환(예를 들어, 불어의 공식적인 용어(formal speak)를 불어의 채팅용어(chatspeak)로 변환)하는 것을, 시도할 수 있다. 따라서, 특정 실시예는, 제1 언어(예를 들어, 영어)의 채팅용어와 제2 언어(예를 들어, 러시아어, 불어, 스페인어, 중국어, 힌두어 등)의 채팅용어 사이의 채팅 번역을 가능하게 한다.In order to further refine the translation of the resulting text, the embodiment converts the translated part of the formal text into a chat language of the new language (eg, formal speak in French). You can try converting into French chatspeak. Thus, certain embodiments enable chat translation between chat terms in a first language (eg, English) and chat terms in a second language (eg, Russian, French, Spanish, Chinese, Hindi, etc.). do.

일부 실시예들은, 기계 번역을 사용할 필요를 줄이거나 방지하는 것을 도울 수 있으며(따라서, 시간, 비용 및 기계 번역과 관련된 다른 간접 비용을 줄일 수 있음), 최소한의 문맥 또는 짧은 문장 구조를 가지는 텍스트의 정확한 번역을 가능하게 할 수 있다. 상기 기계 번역은 제3자 서비스 또는 보안 네트워크 연결(예를 들어, Secure-Socket Layer(SSL) 연결)에 의하여 구현되는 것으로, 특정 실시예에 의하여 피할 수 있는 비용 또는 간접 비용은 큰 의미가 있을 수 있다.Some embodiments can help reduce or avoid the need to use machine translation (and thus reduce time, cost, and other indirect costs associated with machine translation), and of text with minimal context or short sentence structure. It can enable accurate translation. The machine translation is implemented by a third-party service or a secure network connection (for example, a Secure-Socket Layer (SSL) connection), and a cost or indirect cost that can be avoided by a specific embodiment may have great meaning. have.

여기서 이해되는 바와 같이, “변환(transformation)”은 제1 텍스트 세그먼트(segment)를 제1 언어로 조작(manipulate)하여, 제1 언어의 제2 텍스트 세그먼트를 형성하는 것을 의미한다. 결과로 도출된 제2 텍스트 세그먼트도, 여기서 “변환된 텍스트(transformed text)”로 나타낼 수 있다. “번역(translation)”은 제1 언어의 텍스트 세그먼트를, 대응하는 제2 언어의 텍스트 세그먼트로 전환(convert)하는 것을 의미하는 것으로 이해될 수 있다.As understood herein, “transformation” means forming a second text segment of the first language by manipulating the first text segment in the first language. The resulting second text segment may also be referred to herein as “transformed text”. “Translation” can be understood to mean converting a text segment of a first language into a text segment of a corresponding second language.

또한, 여기서 이해되는 바와 같이, “변환된 번역(transformed translation)”은, 여기서 설명된 실시예에 따라, 이미 변환된(예를 들어, 제1 언어의 채팅용어 텍스트로부터, 제1 언어의 공식적인 텍스트로의 변환) 텍스트 세그먼트의 번역(제1 언어에서 제2 언어로)을 의미한다. “변환되지 않은 번역(untransformed translation)”은, 여기서 설명된 실시예에 따라 상기 텍스트 세그먼트가 변환되기 전에, 텍스트 세그먼트의 번역(제1 언어에서 제2 언어)을 의미하는 것으로 이해될 수 있다.Also, as understood herein, “transformed translation” means, according to the embodiments described herein, that has already been transformed (eg, from the chat language text of the first language, to the official text of the first language). Translation to) means the translation of the text segment (from the first language to the second language). “Untransformed translation” may be understood to mean the translation of the text segment (from the first language to the second language) before the text segment is converted according to the embodiments described herein.

다양한 실시예들은, 특정한 번역 어플리케이션에 잘 맞는 특정 전략과 함께, 상이한 변환/번역 전략들을 구현할 수 있다. 예를 들어, 특정한 채팅 시스템 어플리케이션에 있어서, 구현된 변환 전략은, 다음의 변환 관련 모듈들의 집합: 채팅용어 모듈, 두문자어 모듈, 고유명사 모듈, 보통명사 모듈, 구어체 모듈, 맞춤법 검사 모듈, 약어 모듈 및 비속어 모듈을, 나열된 순서에 따라, 적용하는 단계를 포함할 수 있다. 일반적으로, 상기 채용된 변환/번역 전략은, 어떠한 변환 동작(transform operation)이 수행되는지, 전체 번역 프로세스에서 언제 상기 변환 동작이 수행되는지(예를 들어, 기계번역 이전 또는 이후에 변환을 수행), 또는 어떠한 순서로 상기 변환 동작들이 수행되는지(예를 들어, 변환 동작의 우선권 또는 우선순위)를 결정한다. 상기 변환/번역 전략은, 또한 데이터 저장부 내에 어떠한 번역을 미리 채워둘 것인지(예를 들어, 번역은, 전체 프로세스의 속도를 높이기 위하여 번역 “캐시”에 저장될 수 있다), 전체 번역 프로세스에서 언제 번역 캐시를 활용할 것인지를 결정할 수 있다. 특정한 실시예에서, 상기 채용된 변환/번역 전략은, 실시예들이 사용되는 환경의 조건에 기반하여 동적으로 결정될 수 있다. 예를 들어, 채팅 시스템이 평소보다 더 많은 사용자의 부하를 경험하면, 상기 변환/번역 전략은 상기 채팅 시스템에 처리 부담을 줄이도록 전환할 수 있다(예를 들어, 데이터 저장부 보다 기계 번역에 더 의지).Various embodiments may implement different translation / translation strategies, along with a specific strategy that is well suited to a particular translation application. For example, for a particular chat system application, the conversion strategy implemented may include the following set of conversion related modules: chat term module, acronym module, proper noun module, common noun module, colloquial module, spell check module, abbreviation module, and And applying the profanity module, according to the listed order. In general, the adopted transformation / translation strategy is, which transform operation is performed, and when the transformation operation is performed in the entire translation process (for example, performing a transformation before or after machine translation), Or, in what order, the conversion operations are performed (eg, priority or priority of the conversion operation). The translation / translation strategy also pre-populates any translations in the data storage (eg, translations can be stored in the translation “cache” to speed up the entire process), when in the entire translation process. You can decide whether to utilize the translation cache. In a particular embodiment, the employed translation / translation strategy can be dynamically determined based on the conditions of the environment in which the embodiments are used. For example, if the chat system experiences a load of more users than usual, the conversion / translation strategy can be switched to reduce the processing burden on the chat system (eg, more on machine translation than data storage). will).

도 1은 다양한 실시예에 따른 다중 언어 시스템을 활용하는 예시적인 환경(100)을 나타내는 블록도이다. 도1에 도시한 바와 같이, 예시적인 환경(100)은 클라이언트(102-1 내지 102-N, 이하 통칭하여 “클라이언트들(102)” 또는 클라이언트(102)”라 함), 채팅 서버(108) 및 번역 서버(110)를 포함하며, 각각은 컴퓨터 네트워크(106)을 통하여 서로 통신 가능하게 결합될 수 있다. 일부 실시예에 의하면, 상기 컴퓨터 네트워크(106)는, 인터넷, WiFi 네트워크, WiMax 네트워크, 사설 네트워크, 공용 네트워크 등, 하나 또는 그 이상의 로컬 또는 광대역 통신 네트워크를 이용하여 실행 또는 구현될 수 있다. 실시예에 따라서는, 컴퓨터 네트워크(106)와의 일부 또는 전체 통신 연결은, 예시적 환경(100) 내에 도시된 다양한 엔터티 사이에서 교환되는 정보들을 보호하기 위해, 암호화(예를 들어, Secure Sockets Layer(SSL))를 활용할 수 있다.1 is a block diagram illustrating an example environment 100 utilizing a multi-language system according to various embodiments. As shown in FIG. 1, the exemplary environment 100 includes clients 102-1 to 102-N, hereinafter collectively referred to as "clients 102" or clients 102 ", and chat server 108. And a translation server 110, each of which can be communicatively coupled to each other via a computer network 106. According to some embodiments, the computer network 106 may be implemented or implemented using one or more local or broadband communication networks, such as the Internet, WiFi networks, WiMax networks, private networks, public networks, and the like. Depending on the embodiment, some or all communication connections with the computer network 106 may be encrypted (e.g., Secure Sockets Layer (for example, Secure Sockets Layer SSL)).

각각의 클라이언트(102), 채팅 서버(108) 및 번역 서버(110)는 하나 또는 그 이상의 디지털 장치를 이용하여 구현될 수 있으며, 여기서 디지털 장치는 이후 도12와 관련하여 후에 논의될 디지털 장치와 유사할 수 있다. 예를 들어, 클라이언트(102-1)는, 사용자 입력(예를 들어, 사용자 상호 작용(interaction)을 위해 구성된)을 수신할 수 있고, 하나 또는 그 이상의 다른 클라이언트들(예를 들어, 클라이언트 102-2부터 102-N 중 어느 것)과의 통신을 구현하는 클라이언트 사용자 인터페이스를 제공할 수 있으며, 컴퓨터 네트워크(106)를 통하여 채팅 서버(108)와 통신을 수행할 수 있는, 어떠한 형태의 컴퓨팅 장치일 수 있다. 이러한 컴퓨팅 장치는 모바일 폰(mobile phone), 태블릿 컴퓨팅 장치(tablet computing device), 랩탑(laptop), 데스크탑(desktop) 컴퓨터, PDA(Personal Digital Assistant), 휴대용 게임 장치(portable gaming unit), 유선 게임 장치(wired gaming unit), 씬 클라이언트(thin client), 셋탑박스(set-top box), 휴대용 멀티미디어 플레이어(Portable multi-media player) 또는 당업자에게 알려진, 어떠한 다른 형태의 네트워크에 접속 가능한 사용자 장치를 포함할 수 있다. 나아가, 하나 또는 그 이상의 채팅 서버(108) 및 번역 서버(110)는, 하나 또는 그 이상의 클라우드 기반의 서비스(예를 들어, System-as-a-Service(SaaS), Platform-as-a-Service(PaaS) 또는 Infrastructure-as-a-Service(IaaS)) 상에서 동작하거나, 이를 이용하여 구현되는, 하나 또는 그 이상의 서버를 포함할 수 있다.Each client 102, chat server 108 and translation server 110 may be implemented using one or more digital devices, where the digital devices are similar to the digital devices that will be discussed later in connection with FIG. can do. For example, the client 102-1 may receive user input (eg, configured for user interaction), and one or more other clients (eg, client 102- Any type of computing device capable of providing a client user interface to implement communication with any of 2 to 102-N) and performing communication with the chat server 108 through the computer network 106. You can. Such computing devices include mobile phones, tablet computing devices, laptops, desktop computers, personal digital assistants (PDAs), portable gaming units, and wired gaming devices. (wired gaming unit), thin client, set-top box, portable multimedia player (Portable multi-media player) or any other form of network, as known to those skilled in the art, may include user equipment. You can. Furthermore, the one or more chat server 108 and the translation server 110 may include one or more cloud-based services (eg, System-as-a-Service (SaaS), Platform-as-a-Service). (PaaS) or Infrastructure-as-a-Service (IaaS)), or may be implemented using this, and may include one or more servers.

클라이언트(102)들은, 클라이언트(102)들 사이의 채팅 세션을 제공하거나 달리 가능하게 하는 채팅 서버(108)와, 통신 가능하게 연결하도록 구성될 수 있다. 각각의 클라이언트들(102-1 내지 102-N)은, 각각의 클라이언트(102)의 사용자가, 상기 채팅 서버(108)를 통하여 채팅 세션에 접근할 수 있도록 하는, 채팅 클라이언트 시스템(104-1 내지 104-N)을 포함할 수 있다. 추가적으로, 실시예에 의하면, 각각의 채팅 클라이언트 시스템(104-1 내지 104-N, 이하 통칭하여 “채팅 클라이언트 시스템들(104)” 또는 “채팅 클라이언트 시스템(104)”이라 함)들은, 독립형(standalone) 채팅 어플리케이션, 비 채팅 어플리케이션(예를 들어, video game) 내에 내장되는 채팅 특징(chat feature), 또는 웹 브라우저를 통하여 클라이언트에서 접근 가능한 채팅 서비스를 통하여 구현될 수 있다. 일부 실시예들에서, 채팅 클라이언트 시스템(104)이 서로에 대하여 이기종(heterogeneous)이 아닐 수 있고, 그들 사이의 채팅 세션을 성립시키는 것이 가능하다는 것을, 당업자들은 인정할 수 있다. 상기 채팅 클라이언트 시스템(104)은, 사용자에 의하여 선택된(예를 들어, 사용자 설정 또는 선호도에 기초한) 언어(그리고 대응하는 문자세트)로, 그들 각각의 사용자로부터 채팅 입력(예를 들어, 채팅 메시지)을 수신할 수 있으며, 다른 사용자(예를 들어, 다른 채팅 클라이언트 시스템의 다른 사용자)에게 중계될, 채팅 입력을 채팅 서버(108)로 전송할 수 있다. 상기 채팅 클라이언트 시스템(104)은, 또한 채팅 서버(108)로부터(예를 들어, 다른 채팅 클라이언트 시스템의 다른 사용자로부터) 채팅 출력(예를 들어, 채팅 세션 대화)을 수신할 수 있으며, 수신된 채팅 출력을, 사용자에 의하여 선택된(예를 들어, 사용자 설정이나 선호도에 기초한) 언어(그리고 대응하는 문자세트)로 디스플레이할 수 있다.Clients 102 may be configured to communicatively connect with chat server 108 that provides or otherwise enables chat sessions between clients 102. Each of the clients 102-1 to 102-N, the chat client system 104-1 to allow the user of each client 102 to access the chat session through the chat server 108 104-N). Additionally, according to an embodiment, each of the chat client systems 104-1 to 104-N (hereinafter collectively referred to as “chat client systems 104” or “chat client systems 104”) is standalone. ) A chat application, a chat feature built into a non-chat application (eg, video game), or a chat service accessible from a client through a web browser. In some embodiments, those skilled in the art will appreciate that the chat client systems 104 may not be heterogeneous with respect to each other, and that it is possible to establish a chat session between them. The chat client system 104 may input chat (eg, chat messages) from their respective users in a language (and corresponding character set) selected by the user (eg, based on user settings or preferences). And receive chat input to the chat server 108 to be relayed to other users (eg, other users of different chat client systems). The chat client system 104 may also receive chat output (eg, chat session conversations) from the chat server 108 (eg, from other users of other chat client systems), and received chats The output can be displayed in a language (and corresponding character set) selected by the user (eg, based on user settings or preferences).

일부 실시예의 사용에 의하면, 상기 채팅 대화의 번역은, 그 대화가 상기 채팅 클라이언트 시스템(104) 사이를 통과하면서, 사용자들에게 명백(transparent)해질 수 있다. 따라서, 일부 실시예에 의하면, 주어진 채팅 클라이언트 시스템(104)에서 제공된 모든 채팅 대화는, 동일한 채팅 대화에 기여하는 다른 채팅 클라이언트 시스템(104)에서 사용자에 의하여 어떠한 언어가 있는지에 무관하게, 상기 주어진 채팅 클라이언트 시스템(104)의 사용자의(또는 사용자에 의해 선택된) 모국어일 수 있다. 예를 들어, 채팅 클라이언트 시스템(104-1)의 사용자와, 채팅 클라이언트 시스템(104-2)의 사용자가 동일한 채팅 대화를 형성하면(즉, 동일한 채팅 세션에 포함되면), 상기 채팅 클라이언트 시스템(104-1)의 사용자는, 상기 채팅 클라이언트 시스템(104-2)의 사용자가 러시아어로 채팅 대화를 입력하고 수신하는 것을 선택하는 동안, 영어로 채팅 대화를 입력하고 수신하는 것을 선택할 수 있다. 비록 채팅 클라이언트 시스템(104-1, 104-2)의 사용자는 동일한 채팅 내용(chat content)을 볼 수 있지만, 상기 채팅 대화는 그들 각각이 선택한 언어로 표시될 수 있다.According to the use of some embodiments, the translation of the chat conversation can be made transparent to the users as the conversation passes between the chat client systems 104. Thus, according to some embodiments, all chat conversations provided in a given chat client system 104, regardless of what language the user has in other chat client systems 104 contributing to the same chat conversation, are given the above chat It may be the native language of the user of the client system 104 (or selected by the user). For example, if the user of the chat client system 104-1 and the user of the chat client system 104-2 form the same chat conversation (ie, are included in the same chat session), the chat client system 104 The user of -1) can choose to input and receive chat conversations in English while the user of the chat client system 104-2 chooses to enter and receive chat conversations in Russian. Although users of the chat client systems 104-1 and 104-2 can view the same chat content, the chat conversations can be displayed in their respective languages.

도시한 바와 같이, 채팅 서버(108)는, 채팅 클라이언트 시스템(104) 사이의 채팅 세션을 형성 및/또는 실행하도록 구성하는 채팅 호스트 시스템(112)과, 여기서 기술되는 다양한 시스템 및 방법들에 따르는 변환 및/또는 번역 동작을 수행하도록 구성되는 통신 변환 및 번역(CTT: communication Transformation and Translation) 시스템(114)을 포함할 수 있다. 일부 실시예에서, 채팅 클라이언트 시스템(104)들은 상기 채팅 호스트 시스템(112)을 통하여 서로 간의 채팅 세션을 형성할 수 있으며, 상기 채팅 호스트 시스템(104)는, 채팅 클라이언트 시스템(140)들 사이의 채팅 대화의 투명한 번역을 촉진하는 CTT 시스템(114)의 특징을 활용할 수 있다. 일 실시예에서, 당업자는, 채팅 호스트 시스템(112) 및 CTT 시스템(114)이 별도의 서버들의 일 부분일 수 있으며, 상기 채팅 호스트 시스템(112)을 동작시키는 엔터티(entity)는 상기 CTT 시스템(114)을 동작시키는 엔터티와 상이할 수 있음을, 인정할 수 있다. 예를 들어, 상기 채팅 호스트 시스템(112)은, CTT 시스템(114)의 서비스를 활용하는 제3자의 채팅 호스트 시스템일 수 있다.As shown, the chat server 108 converts according to the chat host system 112 configured to establish and / or run chat sessions between the chat client systems 104 and various systems and methods described herein. And / or a communication transformation and translation (CTT) system 114 configured to perform a translation operation. In some embodiments, chat client systems 104 may establish a chat session with each other through the chat host system 112, where the chat host system 104 chats between the chat client systems 140. It is possible to take advantage of the features of the CTT system 114 to facilitate transparent translation of conversations. In one embodiment, those skilled in the art, the chat host system 112 and the CTT system 114 may be part of separate servers, the entity operating the chat host system 112 (entity) is the CTT system ( It can be appreciated that it may be different from the entity that operates 114). For example, the chat host system 112 may be a third-party chat host system that utilizes the services of the CTT system 114.

또한, 도시한 바와 같이, 번역 서버(110)는, 기계 텍스트 번역에 대한 요청을 수신하고, 서비스하도록 구성된 번역 모듈(116)을 포함할 수 있다. 일부 실시예에 의하면, 상기 CTT 시스템(114)는, 텍스트의 기계 번역을 수행하는 번역 모듈(116)의 동작/서비스를 활용할 수 있다. 상기 CTT 시스템(114)은, 상기 번역 모듈(116)에 의하여 제공되는 서비스에 대한 접근을 획득하기 위하여, 하나 또는 그 이상의 번역 API(Application Programming Interfaces)를 이용할 수 있다. 실시예에 따라서는, 상기 번역 모듈(116)(및 그것이 위치하는 서버(110))는, Google®과 같은, 유료 또는 무료의 기계 번역 서비스를 제공하는, 제3자에 의하여 동작할 수 있다. 비록 번역 모듈(116)이 CTT 시스템(114)과는 별도의 서버 상에서 동작하는 구성요소로 도시되지만, 당업자는, 일부 실시예에서, 상기 번역 모듈(116)이 CTT 시스템(114)과 동일한 서버 상에서 동작하거나, 및/또는 상기 CTT 시스템(114)의 집적된 구성요소일 수 있음을 인정할 수 있다.Further, as shown, the translation server 110 may include a translation module 116 configured to receive and service a request for machine text translation. According to some embodiments, the CTT system 114 may utilize an operation / service of the translation module 116 that performs machine translation of text. The CTT system 114 may use one or more translation application programming interfaces (APIs) to obtain access to services provided by the translation module 116. Depending on the embodiment, the translation module 116 (and the server 110 on which it is located) may be operated by a third party, providing a paid or free machine translation service, such as Google®. Although the translation module 116 is shown as a component that operates on a separate server from the CTT system 114, those skilled in the art may, in some embodiments, have the translation module 116 on the same server as the CTT system 114. It may be appreciated that it may operate and / or be an integrated component of the CTT system 114.

도 2는 다양한 실시예에 따른 예시적인 통신 변환 및 번역 시스템(114)을 나타내는 블록도이다. 도시한 바와 같이, 상기 CTT 시스템(114)는 통신 변환 및 번역(CTT) 제어 모듈(202), 통신 변환 및 번역(CTT) 통신 모듈(204), 언어 모듈(206), 변환모듈(208), 번역 데이터 저장부(210) 및 번역 어플리케이션 API(Application Programming Interface) 모듈(212)을 포함할 수 있다. 상기 CTT 제어 모듈(202)은, 여기서 설명되는 시스템 및 방법에 따라, CTT시스템(114)이 변환 또는 번역을 수행하도록, 상기 CTT 시스템(114) 내의 다양한 동작들의 수행을 제어 및/또는 지휘하도록 구성될 수 있다. 일부 실시예에서는, 상기 CTT 제어 모듈(202)이, 상기 CTT 통신 모듈(204), 상기 언어모듈(206), 상기 변환 모듈(208), 상기 번역 데이터 저장부(210) 및 번역 API 모듈(212)과 같은, 상기 CTT 시스템(114)의 다른 구성요소의 동작을 제어할 수 있다.2 is a block diagram illustrating an exemplary communication conversion and translation system 114 according to various embodiments. As shown, the CTT system 114 includes a communication conversion and translation (CTT) control module 202, a communication conversion and translation (CTT) communication module 204, a language module 206, a conversion module 208, A translation data storage unit 210 and a translation application application programming interface (API) module 212 may be included. The CTT control module 202 is configured to control and / or direct the performance of various operations within the CTT system 114 such that the CTT system 114 performs translation or translation, according to the systems and methods described herein. Can be. In some embodiments, the CTT control module 202, the CTT communication module 204, the language module 206, the conversion module 208, the translation data storage unit 210 and the translation API module 212 ), The operation of other components of the CTT system 114 can be controlled.

상기 CTT 통신 모듈(204)은, CTT 시스템(114)과, 채팅 서버(108) 및/또는 번역 서버(110)와 같은, CTT 시스템(114)에 대하여 외부인 시스템 및 구성요소 사이의 통신을 실행하도록 구성될 수 있다. 따라서, CTT 통신 모듈(204)을 통하여, 상기 CTT 시스템(114)는, CTT 시스템(114)에 의하여 변환되거나 번역될 채팅 대화(적어도 하나 또는 그 이상의 채팅 메시지를 포함하는)를 수신할 수 있으며, CTT 시스템(114)으로부터 결과로 도출되는 번역된 채팅 대화를 출력할 수 있다.The CTT communication module 204 is configured to perform communication between external systems and components with respect to the CTT system 114, such as the CTT system 114 and the chat server 108 and / or the translation server 110. Can be configured. Accordingly, through the CTT communication module 204, the CTT system 114 can receive a chat conversation (including at least one or more chat messages) to be converted or translated by the CTT system 114, The translated chat conversation resulting from the CTT system 114 may be output.

상기 언어 모듈(206)은, CTT 시스템(114)에 의하여 수신된 채팅 대화와 관련하여 사용된, 하나 또는 그 이상의 언어를 식별하도록 구성될 수 있다. 일부 실시예에서, 상기 언어 모듈(206)은 수신된 채팅 대화의 내용물(content)에 대한 분석을 통하여 상기 언어를 식별할 수 있으며, 및/또는 수신한 채팅 대화에 참여한 상기 각각의 채팅 클라이언트 시스템(예를 들어, 채팅 클라이언트 시스템 104)으로부터, 언어 선호도/설정 정보를 획득할 수 있다.The language module 206 may be configured to identify one or more languages used in connection with the chat conversation received by the CTT system 114. In some embodiments, the language module 206 may identify the language through analysis of the content of the received chat conversation, and / or each chat client system participating in the received chat conversation ( For example, from the chat client system 104, language preference / setting information may be obtained.

상기 변환 모듈(208)은, 여기서 설명되는 시스템 및 방법에 따라, CTT 시스템(114)으로부터 수신된, 채팅 대화(하나 또는 그 이상의 채팅 메시지를 포함하는)에 대한 변환 동작을 수행하도록 구성될 수 있다. 일부 실시예에 의하면, 상기 변환 모듈(208)에 의하여 수행되는 상기 변환 동작은, 채팅용어, 두음자어, 약어, 고유명사, 보통명사, 구어체 및 비속어에 관계되는 것을 제한없이 포함할 수 있다. 상기 변환 모듈(208)의 추가적인 세부사항은 도 3에서 논의된다.The conversion module 208 may be configured to perform a conversion operation on a chat conversation (including one or more chat messages) received from the CTT system 114, according to the systems and methods described herein. . According to some embodiments, the conversion operation performed by the conversion module 208 may include, without limitation, chat terms, acronyms, abbreviations, proper nouns, common nouns, colloquial and profanity. Additional details of the conversion module 208 are discussed in FIG. 3.

CTT 시스템(114)이, 여기서 설명되는 일부 시스템 및 방법에 따라, 변환된 번역(transformed translation)과 변환되지 않은 번역(untransformed translation)을 수행하면, 번역 데이터 저장부(210)는, 이전에 번역된 텍스트를 저장하고 이어서 CTT 시스템(114)에 제공하도록 구성될 수 있다. 여기서 설명한 바와 같이, 상기 번역 데이터 저장부(210)는, 상기 CTT 시스템(114)에 의하여 이전에 수행된 번역의 캐시로서 동작하거나, 및/또는 인간 운영자에 의하여(예를 들어, 번역 훈련 시스템을 통해) 수동으로 입력되어 저장되는 번역을 저장할 수 있다. 일부 실시예에서, 상기 번역 데이터 저장부(210)는, 특정 채팅 문맥과 관련하여 상기 CTT 시스템(114)의 동작 속도를 높일 수 있는, 번역들로 채워질 수 있다. 예를 들어, CTT 시스템(114)은, MMO 게임과 관련되는 채팅 시스템과 연결되어 활용될 수 있으며, 상기 번역 데이터 저장부(210)는, 특히, MMO 게임과 관련된 (변환된 및 변환되지 않은) 번역으로 채워질 수 있다(예를 들어, CTT 시스템(114)의 운영자에 의하여). 특정한 실시예에서, 변환 모듈(208)에 의해 사용되는, 텍스트를 변환하는 상기 다중 계층(multi-tiered)/다중 모듈(multi-module) 접근법은, 실제로는 매우 복잡하지만, 특히, MMO 게임의 채팅 텍스트의 처리에 매우 적합하다. When the CTT system 114 performs transformed translation and untransformed translation according to some systems and methods described herein, the translation data storage 210 may have previously been translated. It can be configured to store the text and then provide it to the CTT system 114. As described herein, the translation data storage 210 operates as a cache of translations previously performed by the CTT system 114 and / or by a human operator (eg, a translation training system). You can save translations that are manually entered and saved. In some embodiments, the translation data storage 210 may be filled with translations that can speed up the operation of the CTT system 114 in relation to a particular chat context. For example, the CTT system 114 may be utilized in connection with a chat system related to an MMO game, and the translation data storage unit 210 may, in particular, be associated with an MMO game (converted and unconverted). Can be filled with translations (eg, by the operator of the CTT system 114). In a particular embodiment, the multi-tiered / multi-module approach to converting text, used by the conversion module 208, is actually very complex, but in particular chat in MMO games Very suitable for text processing.

실시예에 따르면, 상기 데이터 저장부(210)는 변환되지 않은 번역(예를 들어, <공식적인 영어(English Formal)> 'you' -> <공식적인 불어(French Formal)> -> 'vous'), 변환된 번역(예를 들어, <영어 채팅용어(English Chatspeak)> 'u' -> <공식적인 불어> -> 'vous'), 또는 양쪽 모두를 저장할 수 있다. 일부 실시예에서, 상기 번역 데이터 저장부(210)는, 대응하는 채팅 메시지가 해쉬 값/태그를 이용하여 식별될 수 있도록 하는 번역을 저장할 수 있다. 예를 들어, 영어로 된 원본 메시지에 대한 스페인어 번역을 저장하기 위하여, 상기 스페인어 번역은 상기 영어 메시지의 해쉬 값에 기반하여 저장될 수 있으며, 그로 인하여, 상기 스페인어 번역은, 상기 영어 메시지의 해쉬 값를 이용하여 차후에 위치가 찾아지고 획득될 수 있다. 당업자는, 일부 실시예에서, 상기 번역 데이터 저장부(210)가, 2개의 특정한 언어 사이의 번역을 위하여, 개별적인 데이터 저장부을 포함할 수 있음을 인정할 수 있다. 따라서, 채팅 메시지가 영어와 불어 사이에서 변환되거나/번역될 때, 대응하는 데이터 영어-불어 데이터 저장부(English-French data store)가, 상기 번역 데이터 저장부(210)와 관련된 동작을 위하여, 활용될 수 있다. According to an embodiment, the data storage unit 210 is an untranslated translation (eg, <English Formal> 'you'-> <Official French (French Formal)-> 'vous'), You can save the translated translation (eg <English Chatspeak> 'u'-> <official French>-> 'vous'), or both. In some embodiments, the translation data storage unit 210 may store a translation such that a corresponding chat message can be identified using a hash value / tag. For example, in order to store the Spanish translation of the original message in English, the Spanish translation may be stored based on the hash value of the English message, whereby the Spanish translation may display the hash value of the English message. By using it, the location can be located and acquired later. Those skilled in the art may recognize that, in some embodiments, the translation data storage unit 210 may include separate data storage units for translation between two specific languages. Therefore, when a chat message is converted / translated between English and French, the corresponding data English-French data store is utilized for operations related to the translation data storage 210. Can be.

상기 번역 API 모듈(212)는, 상기 CTT 시스템(114)의 외부에서 제공되는 기계 번역 서비스로의 접속을, CTT 시스템(114)에 제공하도록 구성될 수 있다(예를 들어, 상기 번역 서버(110)의 번역 모듈(116)에 의하여). 여기서 설명된 바와 같이, 번역이 상기 번역 데이터 저장부(210)에서 위치가 찾아지지 않을 때, 상기 번역 API 모듈(212)이 CTT 시스템(114)에 의하여 활용될 수 있다. The translation API module 212 may be configured to provide a connection to a machine translation service provided outside the CTT system 114 to the CTT system 114 (for example, the translation server 110 ) (By the translation module 116). As described herein, when a translation is not found in the translation data storage 210, the translation API module 212 may be utilized by the CTT system 114.

도 3은 다양한 실시예에 따른 예시적인 변환 모듈(280)을 나타내는 블록도이다. 도시한 바와 같이, 변환 모듈(208)은 채팅용어 모듈(302), 두문자어 모듈(304), 고유명사 모듈(306), 보통명사모듈(307), 구어체 모듈(310), 맞춤법 검사 모듈(312), 약어 모듈(314) 및/또는 비속어 모듈(316)을 포함할 수 있다. 일부 실시예에 의하면, 동작 중에, 변환 모듈(208)은 채팅 메시지의 전체 또는 일부를 처리할 수 있다(예를 들어, 메시지를 토큰(token) 또는 논리적 부분(logical portion)으로 나누고, 이후 이들 토큰/부분을 처리함). 일부 실시예에서는, 변환 모듈(208)의 다양한 모듈이 병렬로 호출될 수 있다. 3 is a block diagram illustrating an example conversion module 280 according to various embodiments. As shown, the conversion module 208 is a chat term module 302, an acronym module 304, a proper noun module 306, a common noun module 307, a colloquial module 310, a spell check module 312 , Abbreviation module 314 and / or profanity module 316. According to some embodiments, during operation, the conversion module 208 may process all or part of the chat message (eg, divide the message into tokens or logical portions, and then these tokens) / Process part). In some embodiments, various modules of conversion module 208 may be called in parallel.

상기 채팅용어 모듈(302)는, 채팅 메시지 내에서, 채팅 특수용어(chat jargon) (즉, 채팅용어)와 관련되는, 하나 또는 그 이상의 단어 또는 구절을 식별할 수 있으며, 나아가, 식별된 단어 또는 구절에 대한, 교체 단어 또는 구절(예를 들어, 대응하는 공식적인 용어/즉, 비채팅용어)을 제안하도록 구성될 수 있다. 일부 실시예에서는, 채팅용어 모듈(302)이, (예를 들어, 번역 모듈(116)에 의한) 순차적인 기계 번역 과정에서 생략하거나 또는 이외에 무시하도록, 식별된 채팅용어 단어 또는 구절을 표시할 수 있다. 추가적으로, 일부 실시예에서, 식별된 채팅용어 단어 또는 구절은, 이후 인간 운영자(예를 들어, CTT 시스템(114)의 관리자)에 의한 사후 검토(review) 및 처분(disposition)을 위하여 표시될 수 있다. 채팅용어 단어 또는 구절, 및/또는 그의 대응하는 (공식적인) 단어 또는 구절을 식별하기 위하여, 일부 실시예는, 채팅용어 단어 또는 구절, 및/또는 채팅용어 단어 또는 구절과 그들의 대응하는 단어 또는 구절 사이의 매핑(mapping)를 포함하는, 데이터세트(dataset, 예를 들어 데이터 저장부에 저장된)를 활용할 수 있다. 상기 데이터세트는 훈련이나 학습 시스템을 통해 구축될 수 있고, 소유주가 있는 것일 수 있고(예를 들어, 상기 CTT 시스템(114)의 관리자에 의하여 수동으로 수집된 “내부용(in-house)”), 상업적으로 얻을 수 있거나, 또는 공개적으로 이용 가능한 인터넷 지식기반으로부터 도출되는 것일 수 있다. 예를 들어, 상기 채팅용어 모듈(302)는, 그것의 기능에 통계적인 기계 번역을 적용할 수 있다. 일부 실시예에서는, 상기 적용된 통계적인 기계 번역은, 병렬적인 텍스트를 이용하거나 및/또는 문맥적인 정보를 포함하는 변환으로부터 추출된 구절-레벨의 짝(pair)들을 이용하여 훈련될 수 있으며, 및/또는 그렇지 않으면 문법에 맞지 않는 문장에, 문법을 추가할 수 있다. 채팅용어 모듈(302)에 의한 결과는, 상기 채팅용어 모듈(302)에 의하여 무시할 것으로 표시된 채팅용어 단어 또는 구절, 제안된 교체(replacement), 또는, 상기 채팅용어 모듈(302)에 의하여 상기 메시지에 삽입되는(예를 들어, 식별된 채팅용어 단어 또는 구절을 대신하여) 채팅용어가 아닌(non-chatspeak) 단어 또는 구절을 포함할 수 있다. 실시예에 의하면, 채팅용어 모듈(302)에 의하여 결과로 도출된 상기 메시지는, 추가적인 처리를 위하여 (변환 모듈(208)에 있는) 다른 변환 모듈로 제공될 수 있으며, 또는 제안된 교체는, 상기 채팅용어 모듈(302)에 의하여 변환된 메시지가 상기 데이터 저장부(210)에 있는지 여부를 판별하기 위하여, 상기 CTT 제어 모듈(202)로 제공될 수 있다. The chat term module 302 may identify one or more words or phrases related to a chat jargon (ie, chat term) in a chat message, and furthermore, the identified word or It may be configured to suggest a replacement word or phrase for a phrase (eg, a corresponding official term / ie non-chat term). In some embodiments, the chat term module 302 may display the identified chat term words or phrases to be omitted or otherwise ignored in a sequential machine translation process (eg, by the translation module 116). have. Additionally, in some embodiments, the identified chatspeak words or phrases can then be displayed for review and disposition by human operators (eg, administrators of the CTT system 114). . To identify chat term words or phrases, and / or their corresponding (official) words or phrases, some embodiments may include chat term words or phrases, and / or between chat term words or phrases and their corresponding words or phrases It is possible to utilize a dataset (stored in a data storage, for example), including mapping of. The dataset may be built through a training or learning system, and may be owned by an owner (eg, “in-house” manually collected by the administrator of the CTT system 114). , Or commercially available, or derived from publicly available Internet knowledge bases. For example, the chat terminology module 302 can apply statistical machine translation to its function. In some embodiments, the applied statistical machine translation can be trained using parallel text and / or phrase-level pairs extracted from transformations containing contextual information, and / or Or else, you can add grammar to sentences that don't fit the grammar. The result by the chat term module 302 is the chat term word or phrase marked to be ignored by the chat term module 302, the proposed replacement (replacement), or the chat term module 302 to the message. It may include non-chatspeak words or phrases that are inserted (eg, in place of the identified chat term words or phrases). According to an embodiment, the message resulting from the chat term module 302 may be provided to another conversion module (in the conversion module 208) for further processing, or the proposed replacement may be The message converted by the chat term module 302 may be provided to the CTT control module 202 to determine whether the message is stored in the data storage unit 210.

상기 두음자어 모듈(304)은, 채팅 메시지에서 하나 또는 그 이상의 두음자어를 식별하도록 구성될 수 있으며, 나아가 상기 두음자어에 대응하는(예를 들어, 두음자어에 의하여 대표되는) 교체 단어 또는 구절을 제안하도록 구성될 수 있다. 일부 실시예에서, 두음자어 모듈(304)은, 생략하거나 아니면 이후의 기계 번역(예를 들어, 번역 모듈(116)에 의한) 동안에서 무시하도록, 그 식별된 두음자어를 표시할 수 있다. 추가적으로, 일부 실시예에서, 식별된 두음자어는, 이후 인간 운영자(예를 들어, CTT 시스템(114)의 관리자)에 의한, 사후 검토(review) 및 처분(disposition)을 위하여 표시될 수 있다. 두음자어 및 또는 그에 대응하는 단어 또는 구절을 식별하기 위하여, 일부 실시예에서, 두음자어, 및/또는 두음자어와 그들의 대응하는 단어 및 구절들 사이의 매핑(mapping)를 포함하는, 데이터세트(dataset, 예를 들어 데이터 저장부에 저장된)를 활용할 수 있다. 상기 데이터세트는 훈련이나 학습 시스템을 통해 구축될 수 있고, 소유주가 있는 것일 수 있고(예를 들어, 상기 CTT 시스템(114)의 관리자에 의하여 수동으로 수집된 “내부용(in-house)”), 상업적으로 얻을 수 있거나, 또는 공개적으로 이용 가능한 인터넷 지식기반으로부터 도출될 수 있다. 두음자어 모듈(304)에 의한 결과는, 상기 두음자어 모듈(304)에 의하여 무시할 것으로 표시된 두음자어, 제안된 교체(replacement), 또는, 상기 두음자어 모듈(302)에 의하여 상기 메시지에 삽입될(예를 들어, 식별된 두음자어를 대신하여) 단어 또는 구절을 포함할 수 있다. 실시예에 의하면, 두음자어 모듈(304)에 의하여 결과로 도출된 상기 메시지는, 추가적인 처리를 위하여 (변환 모듈(208)에 있는) 다른 변환 모듈로 제공될 수 있으며, 또는 상기 두음자어 모듈(304)에 의하여 변환된 메시지가 상기 데이터 저장부(210)에 있는지 여부를 판별하기 위하여, 상기 제안된 교체가 상기 CTT 제어 모듈(202)로 제공될 수 있다. The acronym module 304 may be configured to identify one or more acronyms in a chat message, and further may include replacement words or phrases corresponding to the acronyms (eg, represented by acronyms). It can be configured to suggest. In some embodiments, the acronym module 304 may indicate the identified acronyms to omit or ignore during subsequent machine translations (eg, by the translation module 116). Additionally, in some embodiments, the identified acronyms can then be displayed for review and disposition by a human operator (eg, an administrator of the CTT system 114). A dataset, including mapping between acronyms, and / or acronyms and their corresponding words and phrases, in some embodiments, to identify acronyms and / or corresponding words or phrases , For example, may be utilized. The dataset may be built through a training or learning system, and may be owned by an owner (eg, “in-house” manually collected by the administrator of the CTT system 114). , Commercially available, or derived from publicly available Internet knowledge bases. The result by the acronym module 304 may be inserted into the message by the acronym module 304, a suggested acronym, suggested replacement, or by the acronym module 302 ( For example, it may include words or phrases (instead of identified acronyms). According to an embodiment, the message resulting from the acronym module 304 may be provided to another transformation module (in the transformation module 208) for further processing, or the acronym module 304 ) To determine whether the message converted by the data storage unit 210, the proposed replacement may be provided to the CTT control module 202.

상기 고유명사 모듈(306)은, 채팅 메시지에서 하나 또는 그 이상의 고유명사를 식별하도록 구성될 수 있으며, 나아가 상기 고유명사에 대응하는(예를 들어, 고유명사로 대표되는) 교체 단어 또는 구절을 제안하도록 구성될 수 있다. 일부 실시예에서, 고유명사 모듈(306)은, 생략하거나 아니면 이후의 기계 번역(예를 들어, 번역 모듈(116)에 의한) 동안 무시하도록, 그 식별된 고유명사를 표시할 수 있다. 추가적으로, 일부 실시예에서, 식별된 고유명사는, 이후 인간 운영자(예를 들어, CTT 시스템(114)의 관리자)에 의한, 사후 검토(review) 및 처분(disposition)을 위하여 표시될 수 있다. 고유명사 및/또는 그에 대응하는 단어 또는 구절을 식별하기 위하여, 일부 실시예에서는, 고유명사(예를 들어, Disneyland®와 같이 잘 알려진 고유명사 또는 개인들의 흔한 이름), 및/또는 고유명사와 그들의 대응하는 단어 및 구절들 사이의 매핑(mapping)를 포함하는, 데이터세트(dataset, 예를 들어 데이터 저장부에 저장된)를 활용할 수 있다. 상기 데이터세트는 훈련이나 학습 시스템을 통해 구축될 수 있고, 소유주가 있는 것일 수 있고(예를 들어, 상기 CTT 시스템(114)의 관리자에 의하여 수동으로 수집된 “내부용(in-house)”), 상업적으로 얻을 수 있거나, 또는 공개적으로 이용 가능한 인터넷 지식기반으로부터 도출되는 것일 수 있다. 고유명사 모듈(306)에 의한 결과는, 상기 고유명사 모듈(306)에 의하여 무시할 것으로 표시된 고유명사, 제안된 교체, 또는, 상기 고유명사 모듈(302)에 의하여 상기 메시지에 삽입될(예를 들어, 식별된 고유명사를 대신하여) 단어 또는 구절을 포함할 수 있다. 실시예에 의하면, 고유명사 모듈(306)에 의하여 결과로 도출된 상기 메시지는, 추가적인 처리를 위하여 (변환 모듈(208)에 있는) 다른 변환 모듈로 제공될 수 있으며, 또는 상기 고유명사 모듈(306)에 의하여 변환된 메시지가 상기 데이터 저장부(210)에 있는지 여부를 판별하기 위하여, 상기 제안된 교체가 상기 CTT 제어 모듈(202)로 제공될 수 있다. The proper noun module 306 may be configured to identify one or more proper nouns in a chat message, and further propose a replacement word or phrase corresponding to the proper noun (eg, represented by a proper noun) It can be configured to. In some embodiments, the proper noun module 306 may indicate the identified proper noun to be omitted or ignored during subsequent machine translation (eg, by the translation module 116). Additionally, in some embodiments, the identified proper noun can then be displayed for review and disposition by a human operator (eg, an administrator of the CTT system 114). In order to identify proper nouns and / or corresponding words or phrases, in some embodiments, proper nouns (eg, well-known nouns or common names of individuals, such as Disneyland®), and / or nouns and their It is possible to utilize a dataset (stored in a data store, for example), including mapping between corresponding words and phrases. The dataset may be built through a training or learning system, and may be owned by an owner (eg, “in-house” manually collected by the administrator of the CTT system 114). , Or commercially available, or derived from publicly available Internet knowledge bases. The result by the proper noun module 306 will be inserted into the message by the proper noun module, proposed replacement, or the proper noun module 302, which is marked to be ignored by the proper noun module 306 (eg , On behalf of the identified noun). According to an embodiment, the message resulting from the proper noun module 306 may be provided to another transform module (in the transform module 208) for further processing, or the proper noun module 306 ) To determine whether the message converted by the data storage unit 210, the proposed replacement may be provided to the CTT control module 202.

상기 보통명사 모듈(308)은, 채팅 메시지에서 하나 또는 그 이상의 보통명사를 식별하도록 구성될 수 있으며, 나아가 상기 보통명사에 대응하는(예를 들어, 보통명사로 대표되는) 교체 단어 또는 구절을 제안하도록 구성될 수 있다. 일부 실시예에서, 보통명사 모듈(308)은, 생략하거나 아니면 이후의 기계 번역(예를 들어, 번역 모듈(116)에 의한) 동안 무시하도록, 식별된 보통명사를 표시할 수 있다. 추가적으로, 일부 실시예에서, 식별된 보통명사는, 이후 인간 운영자(예를 들어, CTT 시스템(114)의 관리자)에 의한, 사후 검토(review) 및 처분(disposition)을 위하여 표시될 수 있다. 보통명사 및/또는 그에 대응하는 단어 또는 구절을 식별하기 위하여, 일부 실시예에서는, 보통명사, 및/또는 보통명사와 그들의 대응하는 단어 및 구절들 사이의 매핑(mapping)를 포함하는, 데이터세트(dataset, 예를 들어 데이터 저장부에 저장된)를 활용할 수 있다. 상기 데이터세트는 훈련이나 학습 시스템을 통해 구축될 수 있으며, 소유주가 있는 것일 수 있고(예를 들어, 상기 CTT 시스템(114)의 관리자에 의하여 수동으로 수집된 “내부용(in-house)”), 상업적으로 얻을 수 있거나, 또는 공개적으로 이용 가능한 인터넷 지식기반으로부터 도출되는 것일 수 있다. 보통명사 모듈(308)에 의한 결과는, 상기 보통명사 모듈(308)에 의하여 무시할 것으로 표시된 보통명사, 제안된 교체 또는, 상기 보통명사 모듈(308)에 의하여 상기 메시지에 삽입될(예를 들어, 식별된 보통명사를 대신하여) 단어 또는 구절을 포함할 수 있다. 실시예에 의하면, 보통명사 모듈(308)에 의하여 결과로 도출된 상기 메시지는, 추가적인 처리를 위하여 (변환 모듈(208)에 있는) 다른 변환 모듈로 제공될 수 있으며, 또는 상기 보통명사 모듈(308)에 의하여 변환된 메시지가 상기 데이터 저장부(210)에 있는지 여부를 판별하기 위하여, 상기 제안된 교체가 상기 CTT 제어 모듈(202)로 제공될 수 있다. The common noun module 308 may be configured to identify one or more common nouns in a chat message, and further propose a replacement word or phrase corresponding to the common noun (eg, represented by a common noun) It can be configured to. In some embodiments, the common noun module 308 may indicate the identified common noun to be omitted or ignored during subsequent machine translation (eg, by the translation module 116). Additionally, in some embodiments, the identified common noun can then be displayed for review and disposition by a human operator (eg, the manager of the CTT system 114). In order to identify common nouns and / or their corresponding words or phrases, in some embodiments, a dataset, comprising common nouns and / or mapping between common nouns and their corresponding words and phrases dataset, for example). The dataset may be built through a training or learning system, and may be owned by the owner (eg, “in-house” manually collected by the administrator of the CTT system 114). , Or commercially available, or derived from publicly available Internet knowledge bases. The results by the common noun module 308 may be inserted into the message by the common noun module, proposed replacement, or by the common noun module 308, for example, indicated by the common noun module 308 to be ignored. Words or phrases (instead of identified common nouns). According to an embodiment, the message resulting from the common noun module 308 may be provided to another conversion module (in the conversion module 208) for further processing, or the common noun module 308 ) To determine whether the message converted by the data storage unit 210, the proposed replacement may be provided to the CTT control module 202.

상기 구어체 모듈(310)은, 채팅 메시지에서 하나 또는 그 이상의 구어체 단어 또는 구절을 식별하도록 구성될 수 있으며, 나아가 상기 식별된 단어 또는 구절에 대한 교체(예를 들어, 대응하는 공식적인/즉, 구어체가 아닌) 단어 또는 구절을 제안하도록 구성될 수 있다. 일부 실시예에서, 구어체 모듈(310)은, 생략하거나 아니면 이후의 기계 번역(예를 들어, 번역 모듈(116)에 의한) 동안 무시하도록, 식별된 구어체 단어 또는 구절을 표시할 수 있다. 추가적으로, 일부 실시예에서, 식별된 구어체 단어 또는 구절은, 이후 인간 운영자(예를 들어, CTT 시스템(114)의 관리자)에 의한, 사후 검토(review) 및 처분(disposition)을 위하여 표시될 수 있다. 구어체 단어 또는 구절 및/또는 그에 대응하는 (공식적인) 단어 또는 구절을 식별하기 위하여, 일부 실시예에서는, 구어체 단어 또는 구절, 및/또는, 구어체 단어 또는 구절과 그들의 대응하는 단어 및 구절들 사이의 매핑(mapping)를 포함하는, 데이터세트(dataset, 예를 들어 데이터 저장부에 저장된)를 활용할 수 있다. 상기 데이터세트는 훈련이나 학습 시스템을 통해 구축될 수 있고, 소유주가 있는 것일 수 있고(예를 들어, 상기 CTT 시스템(114)의 관리자에 의하여 수동으로 수집된 “내부용(in-house)”), 상업적으로 얻을 수 있거나, 또는 공개적으로 이용 가능한 인터넷 지식기반으로부터 도출되는 것일 수 있다. 구어체 모듈(310)에 의한 결과는, 상기 구어체 모듈(310)에 의하여 무시할 것으로 표시된 구어체 단어 또는 구절, 제안된 교체 또는, 상기 구어체 모듈(310)에 의하여 상기 메시지에 삽입될 (예를 들어, 식별된 구어체 단어 또는 구절을 대신하여) 비구어체 단어 또는 구절을 포함할 수 있다. 실시예에 의하면, 구어체 모듈(310)에 의하여 결과로 도출된 상기 메시지는, 추가적인 처리를 위하여 (변환 모듈(208)에 포함된) 다른 변환 모듈로 제공될 수 있으며, 또는 상기 구어체 모듈(310)에 의하여 변환된 메시지가 상기 데이터 저장부(210)에 있는지 여부를 판별하기 위하여, 상기 제안된 교체가 상기 CTT 제어 모듈(202)로 제공될 수 있다. The colloquial module 310 may be configured to identify one or more colloquial words or phrases in a chat message, and further replacement of the identified word or phrase (eg, corresponding official / ie colloquial language Not) can be configured to suggest words or phrases. In some embodiments, colloquial module 310 may display the identified colloquial word or phrase to omit or ignore during subsequent machine translation (eg, by translation module 116). Additionally, in some embodiments, the identified colloquial words or phrases can then be displayed for review and disposition by human operators (eg, administrators of the CTT system 114). . To identify colloquial words or phrases and / or corresponding (official) words or phrases, in some embodiments, colloquial words or phrases, and / or mapping between colloquial words or phrases and their corresponding words and phrases A dataset, including (mapping), may be utilized. The dataset may be built through a training or learning system, and may be owned by an owner (eg, “in-house” manually collected by the administrator of the CTT system 114). , Or commercially available, or derived from publicly available Internet knowledge bases. The result by the colloquial module 310 will be inserted into the message by the colloquial module 310, or a colloquial word or phrase marked as to be ignored by the colloquial module 310, or by the colloquial module 310 (eg, identification Non- colloquial words or phrases (in lieu of spoken words or phrases). According to an embodiment, the message derived as a result by the colloquial module 310 may be provided to another conversion module (included in the conversion module 208) for further processing, or the colloquial module 310 In order to determine whether the message converted by is in the data storage unit 210, the proposed replacement may be provided to the CTT control module 202.

상기 맞춤법 검사 모듈(312)은, 채팅 메시지에서 하나 또는 그 이상의 맞춤법이 틀린(misspelled) 단어 또는 구절을 식별하도록 구성될 수 있으며, 나아가 상기 식별된 단어 또는 구절에 대한 교체(예를 들어, 정정된) 단어 또는 구절을 제안하도록 구성될 수 있다. 예를 들어, 맞춤법 검사 모듈(312)은, 상기 단어 또는 구절을 상기 제안된 교체 단어 또는 구절로 자동으로 정정하도록 구성될 수 있다. 일부 실시예에 의하면, 맞춤법 검사 모듈(312)은, 생략하거나 아니면 이후의 기계 번역(예를 들어, 번역 모듈(116)에 의한) 동안 무시하도록, 식별된 맞춤법이 틀린 단어 또는 구절을 표시할 수 있다. 추가적으로, 일부 실시예에서, 상기 식별된 맞춤법이 틀린 단어 또는 구절은, 이후 인간 운영자(예를 들어, CTT 시스템(114)의 관리자)에 의한, 사후 검토(review) 및 처분(disposition)을 위하여 표시될 수 있다. 맞춤법이 틀린 단어 또는 구절 및/또는 그에 대응하는 (정정된) 단어 또는 구절을 식별하기 위하여, 일부 실시예에서는, 맞춤법이 틀린 단어 또는 구절, 및/또는 맞춤법이 틀린 단어 또는 구절과 그들의 대응하는 단어 및 구절들 사이의 매핑(mapping)를 포함하는, 데이터세트(dataset, 예를 들어 데이터 저장부에 저장된)를 활용할 수 있다. 상기 데이터세트는 훈련이나 학습 시스템을 통해 구축될 수 있고, 소유주가 있는 것일 수 있고(예를 들어, 상기 CTT 시스템(114)의 관리자에 의하여 수동으로 수집된 “내부용(in-house)”), 상업적으로 얻을 수 있거나, 또는 공개적으로 이용 가능한 인터넷 지식기반으로부터 도출되는 것일 수 있다. 맞춤법 검사 모듈(312)에 의한 결과는, 상기 맞춤법 검사 모듈(312)에 의하여 무시할 것으로 표시된 맞춤법이 틀린 단어 또는 구절, 제안된 교체, 또는, 맞춤법 검사 모듈(312)에 의하여 상기 메시지에 삽입될(예를 들어, 맞춤법이 틀린 단어 또는 구절을 대신하여) 정정된 단어 또는 구절을 포함할 수 있다. 실시예에 의하면, 맞춤법 검사 모듈(312)에 의하여 결과로 도출된 상기 메시지는, 추가적인 처리를 위하여 (변환 모듈(208)에 포함된) 다른 변환 모듈로 제공될 수 있으며, 또는 상기 맞춤법 검사 모듈(312)에 의하여 변환된 메시지가 상기 데이터 저장부(210)에 있는지 여부를 판별하기 위하여, 상기 제안된 교체가 상기 CTT 제어 모듈(202)로 제공될 수 있다. The spell check module 312 may be configured to identify one or more misspelled words or phrases in a chat message, and further replace (eg, corrected) the identified words or phrases ) Can be configured to suggest words or phrases. For example, the spell check module 312 may be configured to automatically correct the word or phrase with the suggested replacement word or phrase. According to some embodiments, the spell checking module 312 may display the misspelled word or phrase that is identified to be omitted or ignored during subsequent machine translation (eg, by the translation module 116). have. Additionally, in some embodiments, the identified misspelled words or phrases are then marked for review and disposition by a human operator (eg, an administrator of the CTT system 114). Can be. To identify misspelled words or phrases and / or corresponding (corrected) words or phrases, in some embodiments, misspelled words or phrases, and / or misspelled words or phrases and their corresponding words And a mapping between verses (dataset, for example, stored in a data storage). The dataset may be built through a training or learning system, and may be owned by an owner (eg, “in-house” manually collected by the administrator of the CTT system 114). , Or commercially available, or derived from publicly available Internet knowledge bases. The results by the spell check module 312 will be inserted into the message by the misspelled word or phrase, suggested replacement, or by the spell check module 312, which is marked to be ignored by the spell check module 312 ( For example, it may include a corrected word or phrase (in place of a misspelled word or phrase). According to an embodiment, the message derived as a result by the spell checking module 312 may be provided to another conversion module (included in the conversion module 208) for further processing, or the spell checking module ( In order to determine whether the message converted by 312) is in the data storage unit 210, the proposed replacement may be provided to the CTT control module 202.

상기 약어 모듈(314)은, 채팅 메시지에서 하나 또는 그 이상의 약어를 식별하도록 구성될 수 있으며, 나아가 상기 식별된 상기 약어에 대응하는(예를 들어, 약어로 대표되는) 교체 단어 또는 구절을 제안하도록 구성될 수 있다. 일부 실시예에 의하면, 약어 모듈(314)은, 생략하거나 아니면 이후의 기계 번역(예를 들어, 번역 모듈(116)에 의한) 동안 무시하도록, 식별된 약어를 표시할 수 있다. 추가적으로, 일부 실시예에서, 상기 식별된 약어는, 이후 인간 운영자(예를 들어, CTT 시스템(114)의 관리자)에 의한, 사후 검토(review) 및 처분(disposition)을 위하여 표시될 수 있다. 약어 및/또는 그에 대응하는 단어 또는 구절을 식별하기 위하여, 일부 실시예에서는, 약어 및/또는, 약어와 그들의 대응하는 단어 및 구절들 사이의 매핑(mapping)를 포함하는, 데이터세트(dataset, 예를 들어 데이터 저장부에 저장된)를 활용할 수 있다. 상기 데이터세트는 훈련이나 학습 시스템을 통해 구축될 수 있으며, 소유주가 있는 것일 수 있고(예를 들어, 상기 CTT 시스템(114)의 관리자에 의하여 수동으로 수집된 “내부용(in-house)”), 상업적으로 얻을 수 있거나, 또는 공개적으로 이용 가능한 인터넷 지식기반으로부터 도출되는 것일 수 있다. 약어 모듈(314)에 의한 결과는, 상기 약어 모듈(314)에 의하여 무시할 것으로 표시된 약어, 제안된 교체 또는, 약어 모듈(314)에 의하여 상기 메시지에 삽입될(예를 들어, 상기 식별된 약어를 대신하여) 단어 또는 구절을 포함할 수 있다. 실시예에 의하면, 약어 모듈(314)에 의하여 결과로 도출된 상기 메시지는, 추가적인 처리를 위하여 (변환 모듈(208)에 포함된) 다른 변환 모듈로 제공될 수 있으며, 또는 상기 약어 모듈(314)에 의하여 변환된 메시지가 상기 데이터 저장부(210)에 있는지 여부를 판별하기 위하여, 상기 제안된 교체가 상기 CTT 제어 모듈(202)로 제공될 수 있다. The abbreviation module 314 may be configured to identify one or more abbreviations in a chat message, and further to suggest replacement words or phrases corresponding to the identified abbreviations (eg, represented by abbreviations). Can be configured. According to some embodiments, abbreviation module 314 may display the identified abbreviations to be omitted or ignored during subsequent machine translations (eg, by translation module 116). Additionally, in some embodiments, the identified abbreviations may then be displayed for review and disposition by a human operator (eg, the manager of the CTT system 114). To identify an abbreviation and / or a corresponding word or phrase, in some embodiments, a dataset, including an abbreviation and / or mapping between the abbreviation and their corresponding words and phrases For example, it can utilize (stored in the data storage). The dataset may be built through a training or learning system, and may be owned by the owner (eg, “in-house” manually collected by the administrator of the CTT system 114). , Or commercially available, or derived from publicly available Internet knowledge bases. The result by the abbreviation module 314 may be inserted into the message by the abbreviation module 314, an abbreviation marked to be ignored by the abbreviation module 314, or the abbreviation module 314 (eg, the identified abbreviation). Instead) words or phrases. According to an embodiment, the message resulting from the abbreviation module 314 may be provided to another transformation module (included in the transformation module 208) for further processing, or the abbreviation module 314 In order to determine whether the message converted by is in the data storage unit 210, the proposed replacement may be provided to the CTT control module 202.

상기 비속어 모듈(316)은, 채팅 메시지에서 하나 또는 그 이상의 비속한 단어 또는 구절(이하, “비속어”라 함)를 식별하도록 구성될 수 있으며, 나아가 상기 식별된 비속어에 대응하는 교체 단어 또는 구절(예를 들어, 적절한 대체)을 제안하도록 구성될 수 있다(예를 들어, 부드러운 완곡어구). 일부 실시예에 의하면, 비속어 모듈(316)은, 생략하거나 아니면 이후의 기계 번역(예를 들어, 번역 모듈(116)에 의한) 동안 무시하도록, 식별된 비속어를 표시할 수 있다. 추가적으로, 일부 실시예에서, 상기 식별된 비속어는, 이후 인간 운영자(예를 들어, CTT 시스템(114)의 관리자)에 의한, 사후 검토(review) 및 처분(disposition)을 위하여 표시될 수 있다. 비속어 및/또는 그에 대응하는 단어 또는 구절을 식별하기 위하여, 일부 실시예에서는, 비속어 및/또는, 비속어와 그들의 대응하는 단어 및 구절들 사이의 매핑(mapping)를 포함하는, 데이터세트(dataset, 예를 들어 데이터 저장부에 저장된)를 활용할 수 있다. 상기 데이터세트는 훈련이나 학습 시스템을 통해 구축될 수 있고, 소유주가 있는 것일 수 있고(예를 들어, 상기 CTT 시스템(114)의 관리자에 의하여 수동으로 수집된 “내부용(in-house)”), 상업적으로 얻을 수 있거나, 또는 공개적으로 이용 가능한 인터넷 지식기반으로부터 도출되는 것일 수 있다. 비속어 모듈(316)에 의한 결과는, 상기 비속어 모듈(316)에 의하여 무시할 것으로 표시된 비속어, 제안된 교체 또는, 비속어 모듈(316)에 의하여 상기 메시지에 삽입될(예를 들어, 상기 식별된 비속어를 대신하여) 단어 또는 구절을 포함할 수 있다. 실시예에 의하면, 비속어 모듈(316)에 의하여 결과로 도출된 상기 메시지는, 추가적인 처리를 위하여 (변환 모듈(208)에 포함된) 다른 변환 모듈로 제공될 수 있으며, 또는 상기 비속어 모듈(316)에 의하여 변환된 메시지가 상기 데이터 저장부(210)에 있는지 여부를 판별하기 위하여, 상기 제안된 교체가 상기 CTT 제어 모듈(202)로 제공될 수 있다. The profanity module 316 may be configured to identify one or more profanity words or phrases (hereinafter referred to as “profanity words”) in a chat message, and further, a replacement word or phrase corresponding to the identified profanity ( For example, it may be configured to suggest appropriate substitutions (eg, soft euphemism). According to some embodiments, the profanity module 316 may indicate the profanity identified to be omitted or ignored during subsequent machine translation (eg, by the translation module 116). Additionally, in some embodiments, the identified profanity may then be displayed for review and disposition by a human operator (eg, an administrator of the CTT system 114). To identify profanity and / or corresponding words or phrases, in some embodiments, a dataset, e.g., including profanity and / or mapping between the profanity and their corresponding words and phrases For example, it can utilize (stored in the data storage). The dataset may be built through a training or learning system, and may be owned by an owner (eg, “in-house” manually collected by the administrator of the CTT system 114). , Or commercially available, or derived from publicly available Internet knowledge bases. The result by the profanity module 316 may be inserted into the message by the profanity module, slang proposed by the profanity module 316 to be ignored, or by the profanity module 316 (eg, the identified profanity Instead) words or phrases. According to an embodiment, the message resulting from the slang module 316 may be provided to another conversion module (included in the conversion module 208) for further processing, or the slang module 316 In order to determine whether the message converted by is in the data storage unit 210, the proposed replacement may be provided to the CTT control module 202.

일부 실시예에 의하면, 상기 변환 모듈(208)의 하나 또는 그 이상의 다양한 모듈들은, 상기 표시될 부분의 앞 및/또는 뒤에, 기 설정된 문자(character)를 삽입하여, 상기 채팅 메시지의 하나 또는 그 이상의 부분을 표시할 수 있다. 예를 들어, 채팅용어 모듈(302)이 상기 채팅 메시지의 일 부분에 포함된 단어 “LOL”를 표시할 때, 상기 채팅용어 모듈(302)는, 상기 표시된 부분이 번역 모듈(116)에 의하여 무시되어야 함을 표시하기 위하여, 기 설정된 문자(“_”)를 상기 단어의 앞 및/또는 뒤에 삽입할 수 있다(예를 들어, “_LOL_”) According to some embodiments, one or more of the various modules of the conversion module 208 may insert one or more of the chat message by inserting a predetermined character before and / or after the portion to be displayed. The part can be marked. For example, when the chat term module 302 displays the word “LOL” included in a part of the chat message, the chat term module 302 ignores the displayed part by the translation module 116 To indicate that it should be, a preset character (“_”) can be inserted before and / or after the word (eg, “_LOL_”).

일부 실시예에 의하면, 변환 모듈(208)은, 초기 메시지에 대하여 병렬적으로, 2 또는 그 이상의 변환 동작을 수행할 수 있으며, 그에 응답하여, 각각의 2 또는 그 이상의 변환은 개별적인 응답을 회신할 수 있고, 그로부터 변환 모듈(208)은 추가적인 처리(예를 들어, 동작(514)에서 사용되는)를 위하여 하나의 변환된 메시지를 선택할 수 있다. 상기 실시예에 의하면, 각각의 응답은, 표시된 텍스트 영역, 제안된 교체, 또는 상기 초기 메시지에 삽입된 단어 또는 구절을 포함할 수 있다. 그 후에, 상기 선택된 변환된 메시지는, 어떠한 전래에 따라, 어떤 변환된 메시지가 추가적인 처리를 위하여 선택될 것인가를 결정할 수 있는, 선택의 우선순위에 따를 수 있다. 일부 실시예에서, 상기 우선순위 선택은, 어느 변환 동작이, 번역 데이터 저장부(210)에서 차후 검색에 적절한 변환된 메시지를 생성하기 쉬운지, 또는 차후 기계 번역에 적절한 변환된 메시지를 생성하기 쉬운지에 따를 수 있다. 추가적으로, 일부 실시예에 의하면, 상기 선택의 우선순위는 어떠한 변환 동작이 가장 공식적인 변환된 메시지를 생성하는가에 따를 수 있다. 선택의 우선순위는, 실시예에 의하여 선택된 변환/번역 전략에 따라 결정될 수 있다. According to some embodiments, the conversion module 208 may perform two or more conversion operations in parallel on the initial message, and in response, each two or more conversions will return a separate response. From there, the conversion module 208 can select one converted message for further processing (eg, used in operation 514). According to the above embodiment, each response may include a displayed text area, a suggested replacement, or a word or phrase inserted in the initial message. Thereafter, the selected converted message may be subject to a priority of selection, which may determine which converted message will be selected for further processing, according to what source. In some embodiments, the priority selection, which conversion operation is easy to generate a converted message suitable for a subsequent search in the translation data storage unit 210, or is easy to generate a converted message suitable for a subsequent machine translation Can follow. Additionally, according to some embodiments, the priority of the selection may depend on which transform operation produces the most formal transformed message. The priority of selection may be determined according to the conversion / translation strategy selected by the embodiment.

표 1은, 다양한 실시예에 따라, 변환모듈이 채팅 메시지의 일 부분을 어떻게 처리하는지에 대한 예시를 제공하고 있다. 도시한 바와 같이, 상기 변환 모듈(208)은 토큰(token) 또는 프록시말 토큰(proximal token)에 기반하여 채팅 메시지를 처리할 수 있으며, 한번 변환이 수행되면 특정한 토큰에 대한 처리를 중단할 수 있다.Table 1 provides an example of how the conversion module processes a portion of the chat message, according to various embodiments. As illustrated, the conversion module 208 may process a chat message based on a token or a proximal token, and once conversion is performed, processing of a specific token may be stopped. .

채팅 메시지 처리의 예Chat Message Processing Example 채팅 메시지의 토큰
(Token(s) from a Chat Message)Token for chat messages
(Token (s) from a Chat Message) 변환 처리
(Transformation Processing)Conversion processing
(Transformation Processing) Token = 'USA'Token = 'USA' 채팅용어 모듈 ('USA') → 두음자어 모듈 ('USA') → 번역하지 않을 것을 표시Chatting term module ('USA') → Acronym module ('USA') → Marking not to translate Token = 'brb'Token = 'brb' 채팅용어 모듈 ('brb') → 두음자어 모듈 ('brb') → 고유명사 모듈 ('brb') → 보통명사 모듈 ('brb') → 구어체 모듈 ('brb') → 맞춤법 검사 모듈('brb') → 약어 모듈('brb') → 'be right back'으로 변환Chat Glossary Module ('brb') → Acronym Module ('brb') → Proper Noun Module ('brb') → Common Noun Module ('brb') → Spoken Language Module ('brb') → Spell Checking Module ('brb) ') → Abbreviation module (' brb ') → Convert to' be right back ' Token = '9'Token = '9' 채팅용어 모듈 ('9') → 'parents watching over shoulder'로 변환Chat Term Module ('9') → Converted to 'parents watching over shoulder' Token = '99'Token = '99' 채팅용어 모듈 ('99') → 'parents stopped watching over shoulder'로 변환Chat Term Module ('99') → Converted to 'parents stopped watching over shoulder' Proximal tokens = 'go gabe'Proximal tokens = 'go gabe' 채팅용어 모듈 ('go gabe') → 두음자어모듈 ('go gabe') → 고유명사 모듈 ('going') → 보통명사 모듈 ('go gabe') → 보통명사에 해당할 가능성이 있음을 표시Chat Glossary Module ('go gabe') → Acronym Module ('go gabe') → Proper Noun Module ('going') → Common Noun Module ('go gabe') → Indicates that there is a possibility of being a common noun String = 'Your going to attack him?'
Token#1 = 'Your'
Token#2 = 'going'
Token#3 = 'to'
Token#4 = 'attack'
Token#5 = 'him'String = 'Your going to attack him?'
Token # 1 = 'Your'
Token # 2 = 'going'
Token # 3 = 'to'
Token # 4 = 'attack'
Token # 5 = 'him' 맞춤법 검사 모듈('Your') → 프록시말 토큰 'going'을 기반으로 'you're'로 정정(즉, 맞춤법 검사를 위하여 프록시말 문맥(proximal context)을 사용함)

채팅용어모듈 ('going') → 두음자어모듈 ('going') → 고유명사 모듈 ('going') → 보통명사 모듈 ('going') → 구어체 모듈 ('going') → 맞춤법 검사 모듈('going') → 약어모듈 ('going') → 비속어 모듈('going') → 변환하지 않음.

채팅용어 모듈 ('to') → 두음자어 모듈('to') → 고유명사 모듈('to') → 보통명사 모듈 ('to') → 구어체 모듈 ('to') → 맞춤법 검사 모듈('to') → 약어 모듈 ('to') → 비속어 모듈('to') → 변환하지 않음.

채팅용어 모듈 ('attack') → 두음자어 모듈('attack') → 고유명사 모듈('attack') → 보통명사 모듈 ('attack') → 구어체 모듈 ('attack') → 맞춤법 검사 모듈('attack') → 약어 모듈 ('attack') → 비속어 모듈('attack') → 변환하지 않음.

채팅용어 모듈 ('him') → 두음자어 모듈('him') → 고유명사 모듈('him') → 보통명사 모듈 ('him') → 구어체 모듈 ('him') → 맞춤법 검사 모듈('him') → 약어 모듈 ('him') → 비속어 모듈('him') → 변환하지 않음.Spell check module ('Your') → Corrected to 'you're' based on the proxy word token 'going' (ie, use a proximal context for spell checking)

Chat Glossary Module ('going') → Acronym Module ('going') → Proper Noun Module ('going') → Common Noun Module ('going') → Colloquial Module ('going') → Spell Checking Module ('going) ') → Abbreviation module (' going ') → Slang module (' going ') → Do not convert.

Chat Glossary Module ('to') → Acronym Module ('to') → Proper Noun Module ('to') → Common Noun Module ('to') → Colloquial Module ('to') → Spell Checking Module ('to ') → Abbreviation module (' to ') → Profanity module (' to ') → Do not convert.

Chat Glossary Module ('attack') → Acronym Module ('attack') → Proper Noun Module ('attack') → Common Noun Module ('attack') → Colloquial Module ('attack') → Spell Checking Module ('attack) ') → Abbreviation module (' attack ') → Slang module (' attack ') → Do not convert.

Chat Glossary module ('him') → Acronym module ('him') → Proper noun module ('him') → Common noun module ('him') → Spoken language module ('him') → Spell check module ('him ') → Abbreviation module (' him ') → Profanity module (' him ') → Do not convert. String = 'Sup bro, sup yall?'
Token#1 = 'Sup'
Token#2 = 'bro'
Token#3 = 'sup'
Token#4 = 'yall'String = 'Sup bro, sup yall?'
Token # 1 = 'Sup'
Token # 2 = 'bro'
Token # 3 = 'sup'
Token # 4 = 'yall' 채팅용어 모듈 ('Sup') → "How is it going."으로 교체

채팅용어 모듈 ('bro') → 두음자어 모듈('bro') → 고유명사 모듈 ('bro') → 보통명사 모듈 ('bro') → 구어체 모듈 ('bro') → 맞춤법 검사 모듈('bro') → 약어모듈('bro') → "brother"로 교체

채팅용어 모듈 ('sup') → "how is it going."로 교체

채팅용어 모듈 ('yall') → "you all."로 교체Replace the chat term module ('Sup') with "How is it going."

Chat Term module ('bro') → Acronym module ('bro') → Proper noun module ('bro') → Common noun module ('bro') → Spoken language module ('bro') → Spell check module ('bro ') → Abbreviation module (' bro ') → Replaced with "brother"

Replace the chat term module ('sup') → "how is it going."

Chat Term Module ('yall') → Replaced with "you all."

도 4는 다양한 실시예에 따르는 예시적인 채팅 클라이언트 시스템(104)을 나타내는 블록도이다. 도시한 바와 같이, 채팅 클라이언트 시스템(104)은 채팅 클라이언트 제어부(402), 채팅 클라이언트 통신 모듈(404) 및 채팅 클라이언트 GUI(Graphic User Interface) 모듈(406)을 포함할 수 있다. 상기 채팅 클라이언트 제어 모듈(402)은, 채팅 클라이언트 시스템(104)이 채팅과 관련된 동작(예를 들어, 채팅 서버(108)와의 통신 채팅 대화)을 수행함에 따라, 채팅 클라이언트 시스템(104) 내에서의 다양한 동작의 수행을 제어 및/또는 지휘하도록 구성될 수 있다. 일부 실시예에서, 상기 채팅 클라이언트 제어 모듈(402)은, 예를 들어 채팅 클라이언트 통신 모듈(404) 및 채팅 클라이언트 GUI 모듈(406)과 같은, 상기 채팅 클라이언트 시스템(104)에 포함되는 다른 구성요소의 동작을 제어할 수 있다. 4 is a block diagram illustrating an exemplary chat client system 104 according to various embodiments. As illustrated, the chat client system 104 may include a chat client control unit 402, a chat client communication module 404, and a chat client graphical user interface (GUI) module 406. The chat client control module 402 is within the chat client system 104 as the chat client system 104 performs actions related to chat (eg, communication chat conversation with the chat server 108). It can be configured to control and / or direct the performance of various operations. In some embodiments, the chat client control module 402 includes other components of the chat client system 104, such as, for example, the chat client communication module 404 and the chat client GUI module 406. You can control the operation.

상기 채팅 클라이언트 통신 모듈(404)는, 채팅 클라이언트 시스템(104)과, 채팅 서버(108)와 같은 상기 채팅 클라이언트 시스템(104)에 대한 외부의 시스템 및 구성요소들(components) 사이의 통신을 구현하도록 구성될 수 있다. 따라서, 채팅 클라이언트 모듈(404)을 통하여, 상기 채팅 클라이언트 시스템(104)은 채팅 서버(108)로부터 상기 채팅 클라이언트 시스템(104)에(예를 들어, 채팅 클라이언트 GUI 모듈(406)을 통하여) 제공되는 채팅 대화를 수신할 수 있으며, 상기 채팅 클라이언트 시스템(104)의 사용자로부터 수신한 채팅 대화를 채팅 서버로 전송할 수 있다(예를 들어, 채팅 클라이언트 GUI 모듈(406)을 통하여).The chat client communication module 404 is configured to implement communication between a chat client system 104 and external systems and components for the chat client system 104, such as the chat server 108. Can be configured. Thus, through the chat client module 404, the chat client system 104 is provided from the chat server 108 to the chat client system 104 (eg, via the chat client GUI module 406). A chat conversation may be received, and a chat conversation received from a user of the chat client system 104 may be transmitted to a chat server (eg, via the chat client GUI module 406).

상기 채팅 클라이언트 GUI 모듈(406)은, 채팅 클라이언트 시스템(104)의 사용자에게, 다른 채팅 클라이언트 시스템과의 채팅 세션에 대한 그래픽적인 입력/출력 접속(access)를 제공하도록 구성될 수 있다. 따라서, 일부 실시예에서, 상기 채팅 클라이언트 GUI 모듈(406)은, 클라이언트(102)의 사용자에게, 상기 클라이언트(102)를 통하여 사용자의 상호작용(interaction)을 수신하도록 적용된, 클라이언트 GUI를 제공할 수 있다. 일부 실시예에 의하면, 상기 채팅 클라이언트 GUI 모듈(406)은 사용자에게, 그들의 선택 언어(예를 들어, 사용자의 선호/설정 언어에 따르는)로 된 채팅 대화(예를 들어, 채팅 서버(108)로부터 그들이 수신한 바와 같이)를 제공하도록 구성될 수 있다. 추가적으로, 상기 채팅 클라이언트 GUI 모듈(406)은, 그들이 선택한 언어(예를 들어, 사용자의 선호/설정 언어에 따르는)로 된 채팅 입력을, 상기 사용자로부터 수신하도록 구성될 수 있다. 여기서, 설명한 바와 같이, 상기 채팅 클라이언트 시스템(104)에서, 상기 채팅 대화를 표시하고 수신하는데 사용된 언어는, 다른 채팅 클라이언트 시스템에서 상기 채팅 대화를 표시하고 수신하는데 사용된 언어와는 상이할 수 있다. 상기 채팅 클라이언트 GUI 모듈(406)에 대한 더 많은 것은 도 7과 관련하여 설명된다. The chat client GUI module 406 may be configured to provide users of the chat client system 104 with graphical input / output access to chat sessions with other chat client systems. Thus, in some embodiments, the chat client GUI module 406 can provide a client GUI to a user of the client 102, adapted to receive user interaction through the client 102. have. According to some embodiments, the chat client GUI module 406 provides the user with chat conversations (eg, from the chat server 108) in their selected language (eg, according to the user's preferred / set language). Can be configured to provide). Additionally, the chat client GUI module 406 may be configured to receive chat input from the user in their chosen language (eg, according to the user's preferred / set language). Here, as described, in the chat client system 104, the language used to display and receive the chat conversation may be different from the language used to display and receive the chat conversation in another chat client system. . More on the chat client GUI module 406 is described with respect to FIG. 7.

도 5는 다양한 실시예에 따르는, 다중 언어 통신에 대한 예시적인 방법(500)을 나타내는 순서도이다. 아래에 설명하는 바와 같이, 일부 실시예에서, 상기 방법(500)에 의하여 표시되는 방법은, 채팅 클라이언트 시스템(104-1), 채팅 클라이언트 시스템(104-2), CTT 시스템(114)(예를 들어, 채팅 서버(108)) 및 번역 모듈(116)(예를 들어, 번역 서버(110))과 관련되어 동작을 수행할 수 있다.5 is a flow diagram illustrating an exemplary method 500 for multilingual communication, in accordance with various embodiments. As described below, in some embodiments, the method represented by the method 500 includes a chat client system 104-1, a chat client system 104-2, and a CTT system 114 (eg For example, an operation may be performed in connection with the chat server 108 and the translation module 116 (eg, the translation server 110).

방법(500)은 동작(502)에서 시작할 수 있으며, (CTT 시스템(114)의) 언어 모듈(204)은 제1 채팅 클라이언트 시스템(예를들어, 104-1)의 사용자에 의하여 사용되는 제1 언어와, 제2 채팅 클라이언트 시스템(예를들어, 104-2)의 사용자에 의하여 사용되는 제2 언어를 식별할 수 있다. 일부 실시예에 의하면, 상기 언어 모듈(204)는, 각각의 채팅 클라이언트 시스템(104)로부터 획득한 언어 선호도/설정에 따라, 제1 언어 및 제2 언어를 식별할 수 있다. The method 500 can begin at operation 502, and the language module 204 (of the CTT system 114) is the first used by the user of the first chat client system (eg, 104-1). The language and the second language used by the user of the second chat client system (eg, 104-2) can be identified. According to some embodiments, the language module 204 may identify the first language and the second language according to the language preference / setting obtained from each chat client system 104.

동작(504)에서, (CTT 시스템(114)의) 상기 CTT 통신 모듈(204)은 제1 언어로 된 초기 메시지를 수신할 수 있다. 일부 실시예에서, 상기 CTT 통신 모듈(204)은, 채팅 클라이언트 시스템(예를들어, 104-1)으로부터 초기 메시지를 수신한, 채팅 호스트 시스템(112)로부터 상기 초기 메시지를 수신할 수 있다.In operation 504, the CTT communication module 204 (of the CTT system 114) may receive an initial message in a first language. In some embodiments, the CTT communication module 204 may receive the initial message from the chat host system 112, which has received the initial message from a chat client system (eg, 104-1).

동작(506)에서, (CTT 시스템(114)의) 상기 CTT 제어 모듈(202)은, 상기 초기 메시지에 대응하는, 제2 언어로 된 대응 메시지에 대하여, 번역 데이터 저장부(210)로 질의할 수 있다. 동작(508)에서, 상기 CTT 제어 모듈(202)은, 대응 메시지가 상기 번역 데이터 저장부(210)에서 발견되는지 여부를, 판별할 수 있다. 만약, 하나 존재하면, 동작(510)에서 상기 CTT 통신 모듈(204)는 제2 채팅 클라이언트 시스템(예를들어, 채팅 클라이언트 시스템(104-2))으로 상기 대응 메시지를 전송하는 것을 지원할 수 있다. 일부 실시예에서, 상기 대응 메시지는, 상기 대응 메시지를 제2 채팅 클라이언트 시스템(예를들어, 104-2)으로 중계할 수 있는, 채팅 호스트 시스템(112)로 전송될 수 있다. 상기 방법(500)은 이후 종료될 수 있다. In operation 506, the CTT control module 202 (of the CTT system 114) queries the translation data storage 210 for a corresponding message in a second language, corresponding to the initial message. You can. In operation 508, the CTT control module 202 may determine whether a corresponding message is found in the translation data storage 210. If there is one, in operation 510 the CTT communication module 204 may support sending the corresponding message to a second chat client system (eg, chat client system 104-2). In some embodiments, the correspondence message may be sent to the chat host system 112, which may relay the correspondence message to a second chat client system (eg, 104-2). The method 500 can then end.

만약, 번역 데이터 저장부(210) 내에 대응메시지가 존재하지 않으면, 동작(512)에서, 상기 변환 모듈(208)은, 상기 초기 메시지의 적어도 일부를, 제1 언어로 된 변환된 메시지의 변환하는 시도를 할 수 있다. 여기서 설명된 바와 같이, 상기 변환 모듈(208)에 의하여 도출된 상기 메시지는, 변환되거나 또는 변화되지 않은 상태로 남을 수 있다(예를 들어, 변환 모듈(208)의 변환 동작이 초기 메시지에 적용되지 않을 때). 일부 실시예에서, 상기 변환 모듈(208)은, 상기 초기 메시지에 대하여 병렬적으로, 2 또는 그 이상의 변환 동작을 수행할 수 있으며, 이에 응답하여, 각각의 2 또는 그 이상의 변환 동작은 개별적인 응답을 회신할 수 있고, 변환 모듈(208)은, 그러면, 추가적인 처리(예를 들어, 동작(514)에서 사용되는)를 위하여, 하나의 변환된 메시지를 선택할 수 있다. 실시예에 따르면, 각각의 응답은 표시된 텍스트 부분, 제안된 교체, 또는, 상기 초기 메시지에 삽입될 단어 또는 구절을 포함할 수 있다. 그 후에, 상기 선택된 변환된 메시지는, 선택의 우선순위에 따를 수 있는데, 이때 선택의 우선순위는 어느 변환된 메시지가 추가적인 처리를 위해 그리고 어떤 전례에 따라 선택되는지를 결정할 수 있다. 일부 실시예에서, 상기 우선순위 선택은, 어느 변환 동작이, 번역 데이터 저장부(210)에서 차후 검색에 적절한 변환된 메시지를 생성하기 쉬운지, 또는 차후 기계 번역에 적절한 변환된 메시지를 생성하기 쉬운지에 따를 수 있다. 추가적으로, 일부 실시예에 의하면, 상기 선택의 우선순위는 어떠한 변환 동작이 가장 공식적인 변환된 메시지를 생성하는가에 따를 수 있다. 선택의 우선순위는, 실시예에 의하여 선택된 변환/번역 전략에 따라 결정될 수 있다.If a corresponding message does not exist in the translation data storage 210, in operation 512, the conversion module 208 converts at least a part of the initial message into a converted message in a first language. You can try. As described herein, the message derived by the conversion module 208 may remain converted or unchanged (eg, the conversion operation of the conversion module 208 is not applied to the initial message). When not). In some embodiments, the conversion module 208 may perform two or more conversion operations in parallel on the initial message, and in response, each two or more conversion operations may generate a separate response. Can reply, and the conversion module 208 can then select one converted message for further processing (eg, used in operation 514). According to an embodiment, each response may include a portion of the displayed text, a suggested replacement, or a word or phrase to be inserted into the initial message. Thereafter, the selected converted message may be in accordance with the priority of the selection, where the priority of the selection may determine which converted message is selected for further processing and according to which precedent. In some embodiments, the priority selection, which conversion operation is easy to generate a converted message suitable for a subsequent search in the translation data storage unit 210, or is easy to generate a converted message suitable for a subsequent machine translation Can follow. Additionally, according to some embodiments, the priority of the selection may depend on which transform operation produces the most formal transformed message. The priority of selection may be determined according to the conversion / translation strategy selected by the embodiment.

동작(514)에서, 변환 모듈(208)이 상기 메시지를 변환한 것으로 가정하면, (CTT 시스템(114)의) 상기 CTT 제어 모듈(202)은, 변환된 메시지에 대응하는, 제2 언어로 된 대응 메시지를, 번역 데이터 저장부(210)로 질의할 수 있다. 동작(516)에서, 상기 CTT 제어 모듈(202)는 상기 대응 메시지가 상기 번역데이터 저장부(210)에서 발견되는지 여부를 판별할 수 있다. 만약, 하나가 존재하면, 동작(518)에서, 상기 CTT 통신 모듈(204)은 제2 채팅 클라이언트 시스템(예를 들어, 채팅 클라이언트 시스템(104-2))으로 대응 메시지를 전송하는 것을 지원할 수 있다. 일부 실시예에서, 상기 대응 메시지는, 상기 제2 채팅 클라이언트 시스템(예를 들어, 104-2)으로 대응 메시지를 중계하는, 채팅 호스트 시스템(112)으로 전송될 수 있다. 방법(500)은 이후 종료될 수 있다.In operation 514, assuming that the conversion module 208 has converted the message, the CTT control module 202 (of the CTT system 114) is in a second language, corresponding to the converted message. The corresponding message may be queried by the translation data storage unit 210. In operation 516, the CTT control module 202 may determine whether the corresponding message is found in the translation data storage 210. If one exists, in operation 518, the CTT communication module 204 may support sending a corresponding message to a second chat client system (eg, chat client system 104-2). . In some embodiments, the correspondence message may be sent to a chat host system 112 that relays the correspondence message to the second chat client system (eg, 104-2). Method 500 may then end.

일부 실시예에서, 대응 메시지가 번역 데이터 저장부(210)에 여전히 존재하지 않으면, 동작(520)에서, 상기 CTT 제어 모듈(202)은, 아직 수행되지 않은 채팅 메시지 상에 수행할 수 있는 변환 모듈(208)의 추가적인 변환 동작이 있는지 여부를 판별할 수 있다. In some embodiments, if the corresponding message is still not present in the translation data storage 210, in operation 520, the CTT control module 202 is a conversion module that can be performed on a chat message that has not yet been performed. It may be determined whether there is an additional conversion operation of 208.

만약 추가적인 변환 동작이 존재하면, 방법(500)은 동작(512)로 돌아가서 추가적인 변환 동작(들)을 수행한다. 실시예에 의하면, 상기 추가적인 변환 동작(들)은, 변환 모듈(208)에 의하여, 초기 메시지 상에 이미 적용된 것들과 상이한 변환 동작의 적용을 포함할 수 있으며, 영어 채팅 메시지의 다른 부분에 동일한 변환 동작을 적용하거나, 또는 그들의 동일한 조합을 포함할 수 있다. 예를 들어 만약, 동작(512)의 제1 실행 도중, 상기 변환 모듈(208)이 (제1 변환된 메시지를 생성하기 위하여) 초기 메시지에 채팅용어-관련 동작을 적용하면, 동작(512)의 제2 실행 도중에는, 상기 변환 모듈(208)이, 제2 변환된 메시지에 대하여 약어-관련 동작을 적용할 수 있다. 그 다음, 동작(512)의 다음 실행으로, 상기 방법(500)은, 동작(514 및 516)을 계속할 수 있다. 이 동작(514, 516)은, 상기 CTT 제어 모듈(202)이, 가장 최근에 도출된 변환된 메시지에 대응하는, 제2 언어로 된 대응 메시지를 번역 데이터 저장부(210)에 다시 질의하고, 상기 CTT 제어 모듈(202)이 대응 메시지가 상기 번역 데이터 저장부(210)에서 발견되는지를 판별하는 것이다. 변환 및 질의 동작을 이러한 반복 방식으로 수행함으로써, 일부 실시예들은 가능한 모든 변환 동작을 수행하기 전에 대응 메시지를 찾는 것이 가능할 수 있다. 당업자들은, 특정한 실시예에서, 변환 모듈(208)에 의하여 제공되는 모든 가능한 변환 동작이 상기 채팅 메시지에 수행된 이후에 그 질의 동작(예를 들어, 동작(514))만이 오직 수행되면서, 상기 변환 및 질의 동작이 직렬적으로 수행될 수 있음을 인정할 수 있다.If there is an additional conversion operation, method 500 returns to operation 512 to perform additional conversion operation (s). According to an embodiment, the additional conversion operation (s) may include, by the conversion module 208, application of a different conversion operation to those already applied on the initial message, and the same conversion to other parts of the English chat message. Motions, or the same combination thereof. For example, if during the first execution of operation 512, the conversion module 208 applies a chat term-related operation to the initial message (to generate the first converted message), the operation 512 During the second execution, the conversion module 208 may apply the abbreviation-related operation to the second converted message. Then, with the next execution of operation 512, the method 500 can continue with operations 514 and 516. In operation 514 and 516, the CTT control module 202 queries the translation data storage 210 again for a corresponding message in a second language corresponding to the most recently derived converted message, The CTT control module 202 determines whether a corresponding message is found in the translation data storage 210. By performing transform and query operations in this iterative manner, some embodiments may be able to find a corresponding message before performing all possible transform operations. Those skilled in the art, in certain embodiments, only the query operation (eg, operation 514) is performed after all possible transformation operations provided by the transformation module 208 are performed on the chat message, while the transformation is performed. And it can be appreciated that the query operation can be performed serially.

만약 대응 메시지가 번역 데이터 저장부(210)에 존재하지 않고 (변환 모듈(208)의) 추가적인 변환 동작이 존재하지 않으면, 동작(522)에서 (번역 API 모듈(212)를 통하여), 상기 번역 모듈(116)은 초기 메시지 또는 변환된 메시지를 제2 언어로 된 대응 메시지로 번역하는 것을 지원할 수 있다. 그 후로, 동작(524)에서, 상기 CTT 통신 모듈(204)은 대응 메시지를, 제2 채팅 클라이언트 시스템(예를들어 채팅 클라이언트 시스템(104-2))으로 전송하는 것을 지원할 수 있다. 일부 실시예에 의하면, 상기 대응 메시지는, 상기 대응 메시지를 상기 제2 채팅 클라이어트 시스템(예를들어, 104-2)으로 중계할 수 있는, 채팅 호스트 시스템(112)으로 전송될 수 있다. 상기 방법(500)은 이후 종료될 수 있다. If a corresponding message does not exist in the translation data storage 210 and there is no additional conversion operation (of the conversion module 208), in operation 522 (via the translation API module 212), the translation module 116 may support translation of an initial message or a converted message into a corresponding message in a second language. Thereafter, in operation 524, the CTT communication module 204 may support sending a corresponding message to a second chat client system (eg, chat client system 104-2). According to some embodiments, the correspondence message may be transmitted to the chat host system 112, which may relay the correspondence message to the second chat client system (eg, 104-2). The method 500 can then end.

일부 실시예에서, 상기 변환 모듈(208)은, 대응 메시지가 채팅 호스트 시스템(112)으로 전송되기 전에, 제2 언어로 된 대응 메시지를 변환하기 위하여 활용될 수 있다. 여기서 설명된 바와 같이, 상기 대응 메시지는, 제2 채팅 클라이언트 시스템(104-2)의 사용자를 위하여 번역을 더 정제하기 위해서, 추가적인 변환 처리를 위하여, 제출될 수 있다. 예를 들어, 만약 초기 메시지가 제1 언어(예를 들어, 영어)로 된 채팅용어를 포함하면, 추가적인 변환 처리는, 가능한 범위까지, 제2 언어로 된 채팅용어를 더할 수 있다. In some embodiments, the conversion module 208 may be utilized to convert a correspondence message in a second language before the correspondence message is sent to the chat host system 112. As described herein, the corresponding message can be submitted for further conversion processing, to further refine the translation for the user of the second chat client system 104-2. For example, if the initial message includes chat terms in the first language (eg, English), further conversion processing may add chat terms in the second language to the extent possible.

비록 상기의 단계들의 방법은, 특정의 순서로, 설명되고 묘사되었으나, 당업자는, 수행된 단계들의 순서들은 실시예들 사이에서 변화할 수 있음을 인정할 것이다. 추가적으로, 당업자는, 방법(500)과 관련하여, 위에서 설명된 구성요소는 단지 상기 방법에서 사용될 수 있는 구성요소의 예시이고, 일부 실시예에서는 다른 구성요소도 일부 실시예로서 활용될 수 있음을 인정할 수 있다.Although the method of steps above has been described and described in a particular order, those skilled in the art will appreciate that the order of steps performed may vary between embodiments. Additionally, those skilled in the art will appreciate that in connection with method 500, the components described above are merely examples of components that may be used in the method, and in some embodiments, other components may also be utilized as some embodiments. You can.

도 6은, 다양한 실시예에 따른, 통신을 변환하는 예시적인 방법(600)을 나타내는 순서도이다. 후술하는 바와 같이, 일부 실시예에서, 방법(600)은 (예를 들어, CCT 시스템(114)의) 변환 모듈(209)과 관련되어 동작을 수행한다.6 is a flow diagram illustrating an example method 600 for transforming communications, in accordance with various embodiments. As described below, in some embodiments, method 600 performs operations in connection with transformation module 209 (eg, of CCT system 114).

방법은, 동작(602)에서 시작하는데, 변환 처리를 위해 초기 메시지가 변환 모듈(208)에 수신된다. 일부 실시예에 기초하여, 변환 모듈(208)은, 번역 데이터 저장부(210)에서 메시지를 식별하는데 실패한 뒤에 변환을 위해 초기 메시지를 수신할 수 있고, 초기 메시지가 제 3자 또는 소유주 번역 프로세스(proprietary translation process)(예를 들어, 클라우드 기반 서비스로 제공될 수 있는 번역 모듈(116))에 의해 기계 번역되기 전에 초기 메시지를 수신하는 것도 가능할 수 있다. 여기서 설명하는 바와 같이, 변환 모듈(208)은 텍스트 번역을 용이하기 하기 위해 또는 개선하기 위해, 다양한 실시예에 사용될 수 있다. 이때 텍스트는 최소의 문맥, 간단한 문장 구조, 전문적/분야에 한정된 용어(예를 들어, 인터넷 기반 채팅을 위한 채팅용어), 약어, 두문자어, 구어체어, 고유명사, 보통명사, 비속어 또는 이들의 조합을 포함할 수 있다. 번역 모듈(208)의 동작으로부터 이득을 볼 수 있는 텍스트 번역은, 대화(예를 들어, 트랜스크립트(transcript)), 오프라인 또는 온라인 인터넷 기반 채팅(예를 들어, 인스턴트 메시징) 그리고 모바일 폰 메시징(예를 들어, SMS 또는 MMS)으로부터 비롯된 텍스트의 번역을 포함할 수 있으나 여기에 제한되지 않는다.The method begins at operation 602, where an initial message is received at the conversion module 208 for conversion processing. Based on some embodiments, the conversion module 208 may receive an initial message for conversion after the translation data storage 210 fails to identify the message, and the initial message is a third-party or owner translation process ( It may also be possible to receive an initial message before being machine translated by a proprietary translation process (eg, translation module 116 that can be provided as a cloud-based service). As described herein, the conversion module 208 can be used in various embodiments to facilitate or improve text translation. In this case, the text should include minimal context, simple sentence structure, and terminology limited to professional / sector (eg, chat term for Internet-based chatting), abbreviations, acronyms, colloquial language, proper nouns, common nouns, profanity, or combinations thereof. It can contain. Text translations that can benefit from the operation of the translation module 208 include conversation (eg, transcript), offline or online Internet-based chat (eg, instant messaging) and mobile phone messaging (eg For example, it may include translation of text originating from SMS or MMS), but is not limited thereto.

동작(604)에서, 채팅용어 모듈(302)은, 초기 메시지에서 채팅 특수용어(chat jargon)(즉 채팅용어)와 관련된 하나 또는 그 이상의 단어 또는 구절을 식별할 수 있고, 그 식별된 단어 또는 구절에 대한 교체(예를 들어, 대응하는 공식적인/즉, 비채팅용어) 단어 또는 구절을 더 제안할 수 있다. 일부 실시예에서, 채팅용어 모듈(302)은 그 식별된 단어 또는 구절이 그 다음의 기계 번역(예를 들어, 번역 모듈(116)에 의해) 동안에 생략되도록 또는 무시되도록 표시할 수 있다. 추가적으로, 일부 실시예에서, 그 식별된 채팅용어 단어 또는 구절은, 인간 운영자(예를 들어, CTT 시스템(114)의 관리자)에 의한 사후 검토(review) 및 처분(disposition)를 위해 표시될 수 있다. 채팅용어 단어 또는 구절 및/또는 그것의 대응하는 (공식적인) 단어 또는 구절을 식별하기 위해, 일부 실시예는, 채팅용어 단어 또는 구절, 및/또는 채팅용어 단어 또는 구절과 그들의 대응하는 단어 또는 구절 사이의 매핑(mapping)를 포함하는, 데이터세트(dataset, 예를 들어 데이터 저장부에 저장된)를 활용할 수 있다. 데이터세트는 훈련 또는 학습 시스템을 통해 구축될 수 있고, 소유주가 있는 것일 수 있으며(예를 들어, CTT 시스템(114)의 관리자에 의해 수동으로 수집된 “내부용(in-house)”), 상업적으로 획득된 것일 수도 있고, 또는 공개적으로 이용 가능한 인터넷 지식으로부터 유래될 수도 있다. 동작(604)의 결과인 메시지(이하, '제 1 중간 메시지'라 함)는, 채팅용어 모듈(302)에 의해 표시되어 무시될 채팅용어 단어 또는 구절, 제안된 교체, 또는 채팅용어 모듈(302)에 의해 초기 메시지에 삽입된(예를 들어, 그 식별된 채팅용어 단어 또는 구절을 대신해서) 비채팅용어 단어 또는 구절을 포함할 수 있다. 예를 들어, 제 1 중간 메시지는 초기 메시지와 같을 수 있다(예를 들어, 채팅용어 모듈(302)에 의해 어떠한 교체도 수행되지 않을 때). 실시예에 따르면, 채팅용어 모듈(302)로부터 생성된 제 1 중간 메시지는, 추가적인 처리를 위해 다른 변환 모듈(변환 모듈(208)에 있는)로 제공될 수 있고, 또는 제안된 교체는, 채팅용어 모듈(302)에 의해 변환된 메시지가 데이터 저장부(302)에 있는지 여부를 결정하기 위해 CTT 제어모듈(202)로 제공될 수 있다. 동작(604)의 후에, 제 1 중간 메시지는 처리를 위해 변환 모듈(208)의 다음 동작(예를 들어 동작(606))에 제공될 수 있다.In operation 604, the chat term module 302 may identify one or more words or phrases associated with a chat jargon (ie, chat term) in the initial message, and the identified words or phrases It is possible to suggest more words or phrases for replacement (eg, corresponding formal / ie non-chat terms). In some embodiments, chat terminology module 302 may indicate that the identified word or phrase is to be omitted or ignored during the next machine translation (eg, by translation module 116). Additionally, in some embodiments, the identified chatspeak words or phrases may be displayed for review and disposition by a human operator (eg, manager of CTT system 114). . In order to identify chat term words or phrases and / or their corresponding (official) words or phrases, some embodiments may include chat term words or phrases and / or between chat term words or phrases and their corresponding words or phrases It is possible to utilize a dataset (stored in a data storage, for example), including mapping of. The dataset may be built through a training or learning system, and may be owned by the owner (eg, “in-house” manually collected by the manager of the CTT system 114), and commercially available. It may be obtained from, or may be derived from publicly available Internet knowledge. The message (hereinafter referred to as 'first intermediate message') resulting from the operation 604 is displayed by the chat term module 302 to be ignored or a chat term word or phrase, suggested replacement, or chat term module 302 ) Into the initial message (eg, instead of the identified chat term word or phrase). For example, the first intermediate message may be the same as the initial message (eg, when no replacement is performed by the chat term module 302). According to an embodiment, the first intermediate message generated from the chat term module 302 may be provided to another conversion module (in the conversion module 208) for further processing, or the proposed replacement may be a chat term. The message converted by the module 302 may be provided to the CTT control module 202 to determine whether it is in the data storage 302. After operation 604, the first intermediate message can be provided to the next operation (eg, operation 606) of the conversion module 208 for processing.

동작(606)에서, 두문자어 모듈(304)은, 채팅 메시지에서 하나 또는 그 이상의 두문자어를 식별할 수 있고, 그 두문자어에 대응하는(에 의해 대표되는) 교체 단어 또는 구절을 더 제안할 수 있다. 일부 실시예에서, 두문자어 모듈(304)은, 그 식별된 두문자어가 그 다음의 기계 번역(예를 들어, 번역 모듈(116)에 의해) 동안에 생략되거나 무시되도록 표시할 수 있다. 추가적으로, 일부 실시예에서, 그 식별된 두문자어는, 인간 운영자(예를 들어, CTT 시스템(114)의 관리자)에 의한 사후 검토(review) 및 처분(disposition)을 위해 표시될 수 있다. 두문자어 및/또는 그것의 대응하는 단어 또는 구절을 식별하기 위해, 일부 실시예에서는, 두음자어, 및/또는 두음자어와 그들의 대응하는 단어 및 구절들 사이의 매핑(mapping)를 포함하는, 데이터세트(dataset, 예를 들어 데이터 저장부에 저장된)를 활용할 수 있다. 데이터세트는 훈련 또는 학습 시스템을 통해 구축될 수 있고, 소유주가 있는 것일 수 (예를 들어, CTT 시스템(114)의 관리자에 의해 수동으로 수집된 '내부용'), 상업적으로 획득된 것일 수도 있고, 또는 공개적으로 이용 가능한 인터넷 지식으로부터 유래될 수도 있다. 동작(606)의 결과인 메시지(이하, '제 2 중간 메시지'라 함)는, 두문자어 모듈(304)에 의해 표시되어 무시될 두문자어, 제안된 교체, 또는 두문자어 모듈(304)에 의해 메시지에 삽입된(예를 들어, 그 식별된 두문자어를 대신해서) 단어 또는 구절을 포함할 수 있다. 일부 예에서, 제 2 중간 메시지는 제 1 중간 메시지와 같을 수 있다(예를 들어, 두문자어 모듈(304)에 의해 어떠한 교체도 수행되지 않을 때). 실시예에 따르면, 두문자어 모듈(304)로부터 생성된 제 2 중간 메시지는, 추가적인 처리를 위해 다른 변환 모듈(변환 모듈(208)에 있는)로 제공될 수 있고, 또는 제안된 교체는, 두문자어 모듈(304)에 의해 변환된 메시지가 데이터 저장부(210)에 있는지 여부를 결정하기 위해 CTT 제어 모듈(202)로 제공될 수 있다. 동작(606)의 후에, 제 2 중간 메시지는 처리를 위해 변환 모듈(208)의 다음 동작(예를 들어 동작(608))에 제공될 수 있다.In operation 606, the acronym module 304 can identify one or more acronyms in the chat message, and can further suggest replacement words or phrases (represented by) corresponding to the acronyms. In some embodiments, the acronym module 304 can indicate that the identified acronym is omitted or ignored during the next machine translation (eg, by the translation module 116). Additionally, in some embodiments, the identified acronyms can be displayed for review and disposition by a human operator (eg, an administrator of the CTT system 114). A dataset comprising, in some embodiments, acronyms, and / or mappings between acronyms and their corresponding words and phrases, to identify acronyms and / or their corresponding words or phrases ( dataset, for example). The dataset may be built through a training or learning system, and may be owned by the owner (eg, 'internally' collected manually by the manager of the CTT system 114), or may be commercially obtained. Or, it may come from publicly available Internet knowledge. The message (hereinafter referred to as 'second intermediate message') resulting from operation 606 is displayed by the acronym module 304 and is inserted into the message by the acronym to be ignored, the proposed replacement, or the acronym module 304 May include words or phrases (eg, in place of the identified acronyms). In some examples, the second intermediate message may be the same as the first intermediate message (eg, when no replacement is performed by the acronym module 304). According to an embodiment, the second intermediate message generated from the acronym module 304 may be provided to another conversion module (in the conversion module 208) for further processing, or the proposed replacement may include the acronym module ( 304) may be provided to the CTT control module 202 to determine whether the message converted by the data storage 210. After operation 606, a second intermediate message may be provided to the next operation (eg, operation 608) of the conversion module 208 for processing.

동작(608)에서, 고유명사 모듈(306)은, 채팅 메시지에서 하나 또는 그 이상의 고유명사를 식별할 수 있고, 또한 그 고유명사에 대응하는(예를 들어, 대표되는) 교체 단어 또는 구절을 더 제안할 수 있다. 일부 실시예에서, 고유명사 모듈(306)은, 그 식별된 고유명사가 그 다음의 기계 번역(예를 들어, 번역 모듈(116)에 의해) 동안에 생략되거나 무시되도록 표시할 수 있다. 추가적으로, 일부 실시예에서, 그 식별된 고유명사는, 인간 운영자(예를 들어, CTT 시스템(114)의 관리자) 에 의한 사후 검토 및 처분을 위해 표시될 수 있다. 고유명사 및/또는 그것의 대응하는 단어 또는 구절을 식별하기 위해, 일부 실시예에서는, 고유명사(예를 들어, Disneyland®와 같이 잘 알려진 고유명사 또는 개인들의 흔한 이름) 및/또는 고유명사와 그들의 대응하는 단어 및 구절들 사이의 매핑(mapping)를 포함하는, 데이터세트(dataset, 예를 들어 데이터 저장부에 저장된)를 활용할 수 있다. 데이터세트는 훈련 또는 학습 시스템을 통해 구축될 수 있고, 소유주가 있는 것일 수 있으며(예를 들어, CTT 시스템(114)의 관리자에 의해 수동으로 수집된 '내부용'), 상업적으로 획득된 것일 수도 있고, 또는 공개적으로 이용 가능한 인터넷 지식으로부터 유래될 수도 있다. 동작(608)의 결과인 메시지(이하, '제 3 중간 메시지'라 함)는, 고유명사 모듈(306)에 의해 표시되어 무시될 고유명사, 제안된 교체, 또는 고유명사 모듈(306)에 의해 메시지에 삽입된(예를 들어, 그 식별된 고유명사를 대신해서) 단어 또는 구절을 포함할 수 있다. 일부 예에서, 제 3 중간 메시지는 제 2 중간 메시지와 같을 수 있다(예를 들어, 고유명사 모듈(306)에 의해 어떠한 교체도 수행되지 않을 때). 실시예에 따르면, 고유명사 모듈(306)로부터 생성된 제 3 중간 메시지는, 추가적인 처리를 위해 다른 변환 모듈(변환 모듈(208)에 있는)로 제공될 수 있고, 또는 제안된 교체는, 고유명사 모듈(306)에 의해 변환된 메시지가 데이터 저장부(210)에 있는지 여부를 결정하기 위해 CTT 제어 모듈(202)로 제공될 수 있다. 동작(608)의 후에, 제 3 중간 메시지는 처리를 위해 변환 모듈(208)의 다음 동작(예를 들어 동작(610))에 제공될 수 있다.In operation 608, the proper noun module 306 can identify one or more proper nouns in the chat message, and further adds a replacement word or phrase corresponding to the proper noun (eg, represented). I can suggest. In some embodiments, the proper noun module 306 can indicate that the identified proper noun is omitted or ignored during the next machine translation (eg, by the translation module 116). Additionally, in some embodiments, the identified proper noun may be marked for post-mortem review and disposition by a human operator (eg, manager of the CTT system 114). To identify proper nouns and / or their corresponding words or phrases, in some embodiments, proper nouns (eg, common names of well-known nouns or individuals such as Disneyland®) and / or nouns and their It is possible to utilize a dataset (stored in a data store, for example), including mapping between corresponding words and phrases. The dataset can be built through a training or learning system, can be owned by the owner (eg, 'internally' collected manually by the administrator of the CTT system 114), or can be commercially obtained. Or it may come from publicly available Internet knowledge. The message resulting from operation 608 (hereinafter referred to as the 'third intermediate message') is displayed by the proper noun module 306 and is to be ignored by the proper noun, proposed replacement, or proper noun module 306. It may contain words or phrases embedded in the message (eg, in place of its identified proper noun). In some examples, the third intermediate message may be the same as the second intermediate message (eg, when no replacement is performed by the proper noun module 306). According to an embodiment, the third intermediate message generated from the proper noun module 306 may be provided to another transform module (in the transform module 208) for further processing, or the proposed replacement may be a proper noun. The message converted by the module 306 may be provided to the CTT control module 202 to determine whether it is in the data storage 210. After operation 608, a third intermediate message may be provided to the next operation (eg, operation 610) of the conversion module 208 for processing.

동작(610)에서, 보통명사 모듈(308)은, 채팅 메시지에서 하나 또는 그 이상의 보통명사를 식별할 수 있고, 또한 그 보통명사에 대응하는(예를 들어, 대표되는) 교체 단어 또는 구절을 더 제안할 수 있다. 일부 실시예에서, 보통명사 모듈(308)은, 그 식별된 보통명사가 그 다음의 기계 번역(예를 들어, 번역 모듈(116)에 의해) 동안에 생략되거나 무시되도록 표시할 수 있다. 추가적으로, 일부 실시예에서, 그 식별된 보통명사는, 인간 운영자(예를 들어, CTT 시스템(114)의 관리자)에 의한 사후 검토 및 처분을 위해 표시될 수 있다. 보통명사 및/또는 그것의 대응하는 단어 또는 구절을 식별하기 위해, 일부 실시예에서는, 보통명사, 및/또는 보통명사와 그들의 대응하는 단어 및 구절들 사이의 매핑(mapping)를 포함하는, 데이터세트(dataset, 예를 들어 데이터 저장부에 저장된)를 활용할 수 있다. 데이터세트는 훈련 또는 학습 시스템을 통해 구축될 수 있고, 소유주가 있을 수 있으며(예를 들어, CTT 시스템(114)의 관리자에 의해 수동으로 수집된 '내부용'), 상업적으로 획득된 것일 수도 있고, 또는 공개적으로 이용 가능한 인터넷 지식으로부터 유래될 수도 있다. 동작(610)의 결과인 메시지(이하, '제 4 중간 메시지'라 함)는, 보통명사 모듈(308)에 의해 표시되어 무시될 보통명사, 제안된 교체, 또는 보통명사 모듈(308)에 의해 메시지에 삽입된(예를 들어, 그 식별된 보통명사를 대신해서) 단어 또는 구절을 포함할 수 있다. 일부 예에서, 제 4 중간 메시지는 제 3 중간 메시지와 같을 수 있다(예를 들어, 보통명사 모듈(308)에 의해 어떠한 교체도 수행되지 않을 때). 실시예에 따르면, 보통명사 모듈(308)로부터 생성된 제 4 중간 메시지는, 추가적인 처리를 위해 다른 변환 모듈(변환 모듈(208)에 있는)로 제공될 수 있고, 또는 제안된 교체는, 보통명사 모듈(308)에 의해 변환된 메시지가 데이터 저장부(210)에 있는지 여부를 결정하기 위해 CTT 제어 모듈(202)로 제공될 수 있다. 동작(610)의 후에, 제 4 중간 메시지는 처리를 위해 변환 모듈(208)의 다음 동작(예를 들어 동작(612))에 제공될 수 있다. In operation 610, the common noun module 308 may identify one or more common nouns in the chat message, and further add a replacement word or phrase corresponding to the common noun (eg, represented). I can suggest. In some embodiments, common noun module 308 may indicate that the identified common noun is omitted or ignored during the next machine translation (eg, by translation module 116). Additionally, in some embodiments, the identified common noun can be marked for follow-up review and disposition by a human operator (eg, the manager of the CTT system 114). A dataset comprising a common noun, and / or a mapping between the common noun and their corresponding words and phrases, in some embodiments, to identify the common noun and / or its corresponding word or phrase (dataset, for example, stored in data storage) can be utilized. The dataset may be built through a training or learning system, may have an owner (eg, 'internally' collected manually by the administrator of the CTT system 114), or may be commercially obtained. Or, it may come from publicly available Internet knowledge. The message resulting from operation 610 (hereinafter referred to as the 'fourth intermediate message') is displayed by the common noun module 308 and is to be ignored by the common noun, the proposed replacement, or the common noun module 308. It may contain words or phrases embedded in a message (eg, in place of its identified common noun). In some examples, the fourth intermediate message may be the same as the third intermediate message (eg, when no replacement is performed by the common noun module 308). According to an embodiment, the fourth intermediate message generated from the common noun module 308 may be provided to another conversion module (in the conversion module 208) for further processing, or the proposed replacement may be a common noun. The message converted by the module 308 may be provided to the CTT control module 202 to determine whether it is in the data storage 210. After operation 610, a fourth intermediate message may be provided to the next operation (eg, operation 612) of the conversion module 208 for processing.

동작(612)에서, 구어체 모듈(310)은, 채팅 메시지에서 하나 또는 그 이상의 구어체 단어 또는 구절을 식별할 수 있고, 또한 그 식별된 단어 또는 구절의 교체(예를 들어, 대응하는 공식적인/즉, 비구어체) 단어 또는 구절을 더 제안할 수 있다. 일부 실시예에서, 구어체 모듈(310)은, 그 식별된 구어체 단어 또는 구절이 그 다음의 기계 번역(예를 들어, 번역 모듈(116)에 의해) 동안에 생략되거나 무시되도록 표시할 수 있다. 추가적으로, 일부 실시예에서, 그 식별된 구어체 단어 또는 구절은, 인간 운영자(예를 들어, CTT 시스템(114)의 관리자)에 의한 사후 검토 및 처분을 위해 표시될 수 있다. 구어체 단어 또는 구절 및/또는 그것의 대응하는 단어 또는 구절을 식별하기 위해, 일부 실시예에서는, 구어체 단어 또는 구절 및/또는, 구어체 단어 또는 구절과 그들의 대응하는 단어 및 구절들 사이의 매핑(mapping)를 포함하는, 데이터세트(dataset, 예를 들어 데이터 저장부에 저장된)를 활용할 수 있다. 데이터세트는 훈련 또는 학습 시스템을 통해 구축될 수 있고, 소유주가 있을 수 있으며(예를 들어, CTT 시스템(114)의 관리자에 의해 수동으로 수집된 '내부용'), 상업적으로 획득된 것일 수도 있고, 또는 공개적으로 이용 가능한 인터넷 지식으로부터 유래될 수도 있다. 동작(612)의 결과인 메시지(이하, '제 5 중간 메시지'라 함)는, 구어체 모듈(310)에 의해 표시되어 무시될 구어체 단어 또는 구절, 제안된 교체, 또는 구어체 모듈(310)에 의해 메시지에 삽입된(예를 들어, 그 식별된 구어체 단어 또는 구절을 대신해서) 비구어체 단어 또는 구절을 포함할 수 있다. 일부 예에서, 제 5 중간 메시지는 제 4 중간 메시지와 같을 수 있다(예를 들어, 구어체 모듈(310)에 의해 어떠한 교체도 수행되지 않을 때). 그 실시예에 따르면, 구어체 모듈(310)로부터 생성된 제 5 중간 메시지는, 추가적인 처리를 위해 다른 변환 모듈(변환 모듈(208)에 있는)로 제공될 수 있고, 또는 제안된 교체는, 구어체 모듈(310)에 의해 변환된 메시지가 데이터 저장부(210)에 있는지 여부를 결정하기 위해 CTT 제어 모듈(202)로 제공될 수 있다. 동작(612)의 후에, 제 5 중간 메시지는 처리를 위해 변환 모듈(208)의 다음 동작(예를 들어 동작(614))에 제공될 수 있다.In operation 612, the colloquial module 310 may identify one or more colloquial words or phrases in the chat message, and also replace the identified words or phrases (eg, corresponding official / ie, Non-verbal) more words or phrases can be suggested. In some embodiments, colloquial module 310 may indicate that the identified colloquial word or phrase is omitted or ignored during the next machine translation (eg, by translation module 116). Additionally, in some embodiments, the identified colloquial words or phrases can be marked for follow-up review and disposition by a human operator (eg, manager of the CTT system 114). To identify colloquial words or phrases and / or their corresponding words or phrases, in some embodiments, colloquial words or phrases and / or mapping between colloquial words or phrases and their corresponding words and phrases It may utilize a dataset (dataset, for example, stored in the data storage), including. The dataset may be built through a training or learning system, may have an owner (eg, 'internally' collected manually by the administrator of the CTT system 114), or may be commercially obtained. Or, it may come from publicly available Internet knowledge. The message (hereinafter referred to as 'Fifth Intermediate Message') as a result of the operation 612 is displayed by the colloquial module 310 to be ignored by the colloquial word or phrase, proposed replacement, or colloquial module 310. May include non- colloquial words or phrases embedded in the message (eg, in place of the identified colloquial words or phrases). In some examples, the fifth intermediate message may be the same as the fourth intermediate message (eg, when no replacement is performed by colloquial module 310). According to that embodiment, the fifth intermediate message generated from the colloquial module 310 may be provided to another conversion module (in the conversion module 208) for further processing, or the proposed replacement may be a colloquial module The message converted by 310 may be provided to the CTT control module 202 to determine whether it is in the data storage 210. After operation 612, a fifth intermediate message may be provided to the next operation (eg, operation 614) of the conversion module 208 for processing.

동작(614)에서, 맞춤법 검사 모듈(312)은, 채팅 메시지에서 하나 또는 그 이상의 철자가 틀린 단어 또는 구절을 식별할 수 있고, 또한 그 식별된 단어 또는 구절의 교체(예를 들어, 정정된) 단어 또는 구절을 더 제안할 수 있다. 예를 들어, 맞춤법 검사 모듈(312)은 단어 또는 구절을 자동으로 그 제안된 교체 단어 또는 문구로 정정할 수 있다. 일부 실시예에서, 맞춤법 검사 모듈(312)은, 그 식별된 맞춤법이 틀린 단어 또는 구절이 그 다음의 기계 번역(예를 들어, 번역 모듈(116)에 의해) 동안에 생략되거나 무시되도록 표시할 수 있다. 추가적으로, 일부 실시예에서, 그 식별된 맞춤법이 틀린 단어 또는 구절은, 인간 운영자(예를 들어, CTT 시스템(114)의 관리자)에 의한 사후 검토 및 처분을 위해 표시될 수 있다. 맞춤법이 틀린 단어 또는 구절 및/또는 그것의 대응하는 (정정된) 단어 또는 구절을 식별하기 위해, 일부 실시예에서는, 맞춤법이 틀린 단어 또는 구절 및/또는, 맞춤법이 틀린 단어 또는 구절과 그들의 대응하는 단어 및 구절들 사이의 매핑(mapping)를 포함하는, 데이터세트(dataset, 예를 들어 데이터 저장부에 저장된)를 활용할 수 있다. 데이터세트는 훈련 또는 학습 시스템을 통해 구축될 수 있고, 소유주가 있을 수 있으며(예를 들어, CTT 시스템(114)의 관리자에 의해 수동으로 수집된 '내부용'), 상업적으로 획득된 것일 수도 있고, 또는 공개적으로 이용 가능한 인터넷 지식으로부터 유래될 수도 있다. 동작(614)의 결과인 메시지(이하, '제 6 중간 메시지'라 함)는, 맞춤법 검사 모듈(312)에 의해 표시되어 무시될 맞춤법이 틀린 단어 또는 구절, 제안된 교체, 또는 맞춤법 검사 모듈(312)에 의해 메시지에 삽입된(예를 들어, 그 철자가 틀린 단어 또는 구절을 대신해서) 정정된 단어 또는 구절을 포함할 수 있다. 일부 예에서, 제 6 중간 메시지는 제 5 중간 메시지와 같을 수 있다(예를 들어, 맞춤법 검사 모듈(312)에 의해 어떠한 교체도 수행되지 않을 때). 그 실시예에 따르면, 맞춤법 검사 모듈(312)로부터 생성된 제 6 중간 메시지는, 추가적인 처리를 위해 다른 변환 모듈(변환 모듈(208)에 있는)로 제공될 수 있고, 또는 제안된 교체는, 맞춤법 검사 모듈(312)에 의해 변환된 메시지가 데이터 저장부(210)에 있는지 여부를 결정하기 위해 CTT 제어 모듈(202)로 제공될 수 있다. 동작(614)의 후에, 제 6 중간 메시지는 처리를 위해 변환 모듈(208)의 다음 동작(예를 들어 동작(616))에 제공될 수 있다.In operation 614, the spell check module 312 can identify one or more misspelled words or phrases in the chat message, and also replace (eg, corrected) the identified words or phrases You can suggest more words or phrases. For example, the spell check module 312 can automatically correct a word or phrase with the suggested replacement word or phrase. In some embodiments, the spell checking module 312 can indicate that the identified misspelled word or phrase is omitted or ignored during the next machine translation (eg, by the translation module 116). . Additionally, in some embodiments, the identified misspelled words or phrases may be marked for follow-up review and disposition by a human operator (eg, manager of CTT system 114). To identify misspelled words or phrases and / or their corresponding (corrected) words or phrases, in some embodiments, misspelled words or phrases and / or misspelled words or phrases and their corresponding A dataset (stored in a data store, for example), including mapping between words and phrases, can be utilized. The dataset may be built through a training or learning system, may have an owner (eg, 'internally' collected manually by the administrator of the CTT system 114), or may be commercially obtained. Or, it may come from publicly available Internet knowledge. The message (hereinafter referred to as the 'sixth intermediate message') resulting from the operation 614 is displayed by the spelling check module 312 and the misspelled word or phrase to be ignored, the proposed replacement, or the spell checking module ( 312) may include a corrected word or phrase inserted into the message (eg, in place of the misspelled word or phrase). In some examples, the sixth intermediate message may be the same as the fifth intermediate message (eg, when no replacement is performed by the spell checking module 312). According to that embodiment, the sixth intermediate message generated from the spell checking module 312 can be provided to another conversion module (in the conversion module 208) for further processing, or the proposed replacement is spelled The message converted by the inspection module 312 may be provided to the CTT control module 202 to determine whether or not it is in the data storage 210. After operation 614, the sixth intermediate message can be provided to the next operation (eg, operation 616) of the conversion module 208 for processing.

동작(616)에서, 약어 모듈(314)은, 채팅 메시지에서 하나 또는 그 이상의 약어를 식별할 수 있고, 또한 그 약어에 대응하는 (예를 들어, 대표되는) 교체 단어 또는 구절을 더 제안할 수 있다. 일부 실시예에서, 약어 모듈(314)은, 그 식별된 약어가 그 다음의 기계 번역(예를 들어, 번역 모듈(116)에 의해) 동안에 생략되거나 무시되도록 표시할 수 있다. 추가적으로, 일부 실시예에서, 그 식별된 약어는, 인간 운영자(예를 들어, CTT 시스템(114)의 관리자)에 의한 사후 검토 및 처분을 위해 표시될 수 있다. 약어 및/또는 그것의 대응하는 단어 또는 구절을 식별하기 위해, 일부 실시예에서는, 약어 및/또는, 약어와 그들의 대응하는 단어 및 구절들 사이의 매핑(mapping)를 포함하는, 데이터세트(dataset, 예를 들어 데이터 저장부에 저장된)를 활용할 수 있다. 데이터세트는 훈련 또는 학습 시스템을 통해 구축될 수 있고, 소유주가 있을 수 있으며(예를 들어, CTT 시스템(114)의 관리자에 의해 수동으로 수집된 '내부용'), 상업적으로 획득된 것일 수도 있고, 또는 공개적으로 이용 가능한 인터넷 지식으로부터 유래될 수도 있다. 동작(616)의 결과인 메시지(이하, '제 7 중간 메시지'라 함)는, 약어 모듈(314)에 의해 표시되어 무시될 약어, 제안된 교체, 또는 약어 모듈(314)에 의해 메시지에 삽입된(예를 들어, 그 식별된 약어를 대신해서) 단어 또는 구절을 포함할 수 있다. 일부 예에서, 제 7 중간 메시지는 제 6 중간 메시지와 같을 수 있다(예를 들어, 약어 모듈(314)에 의해 어떠한 교체도 수행되지 않을 때). 그 실시예에 따르면, 약어 모듈(314)로부터 생성된 제 7 중간 메시지는, 추가적인 처리를 위해 다른 변환 모듈(변환 모듈(208)에 있는)로 제공될 수 있고, 또는 제안된 교체는, 약어 모듈(314)에 의해 변환된 메시지가 데이터 저장부(210)에 있는지 여부를 결정하기 위해 CTT 제어모듈(202)로 제공될 수 있다. 동작(616)의 후에, 제 7 중간 메시지는 처리를 위해 변환 모듈(208)의 다음 동작(예를 들어 동작(618))에 제공될 수 있다.In operation 616, the abbreviation module 314 may identify one or more abbreviations in the chat message, and may further suggest alternate words or phrases (eg, representative) corresponding to the abbreviations. have. In some embodiments, abbreviation module 314 may indicate that the identified abbreviation is omitted or ignored during the next machine translation (eg, by translation module 116). Additionally, in some embodiments, the identified abbreviations may be marked for post-mortem review and disposition by a human operator (eg, manager of CTT system 114). To identify an abbreviation and / or its corresponding word or phrase, in some embodiments, a dataset, including an abbreviation and / or mapping between the abbreviation and their corresponding words and phrases, For example, it may utilize (stored in the data storage). The dataset may be built through a training or learning system, may have an owner (eg, 'internally' collected manually by the administrator of the CTT system 114), or may be commercially obtained. Or, it may come from publicly available Internet knowledge. The message resulting from operation 616 (hereinafter referred to as the 'seventh interim message') is displayed by the abbreviation module 314 and is inserted into the message by the abbreviation, proposed replacement, or abbreviation module 314 to be ignored. May include words or phrases (eg, in place of its identified abbreviations). In some examples, the seventh intermediate message may be the same as the sixth intermediate message (eg, when no replacement is performed by abbreviation module 314). According to that embodiment, the seventh intermediate message generated from the abbreviation module 314 may be provided to another conversion module (in the conversion module 208) for further processing, or the proposed replacement may be an abbreviation module. The message converted by 314 may be provided to the CTT control module 202 to determine whether it is in the data storage 210. After operation 616, the seventh intermediate message can be provided to the next operation (eg, operation 618) of the conversion module 208 for processing.

동작(618)에서, 비속어 모듈(316)은, 채팅 메시지에서 하나 또는 그 이상의 비속어 단어 또는 구절(이하, '비속어'라 함)을 식별할 수 있고, 또한 그 비속어에 대응하는 교체 단어 또는 구절(예를 들어, 적절한 대체어)을 더 제안할 수 있다(예를 들어, 부드러운 완곡어구). 일부 실시예에서, 비속어 모듈(316)은, 그 식별된 비속어가 그 다음의 기계 번역(예를 들어, 번역 모듈(116)에 의해) 동안에 생략되거나 무시되도록 표시할 수 있다. 추가적으로, 일부 실시예에서, 그 식별된 비속어는, 인간 운영자(예를 들어, CTT 시스템(114)의 관리자)에 의한 사후 검토 및 처분을 위해 표시될 수 있다. 비속어 및/또는 그것의 대응하는 단어 또는 구절을 식별하기 위해, 일부 실시예에서는, 비속어 및/또는, 비속어와 그들의 대응하는 단어 및 구절들 사이의 매핑(mapping)를 포함하는, 데이터세트(dataset, 예를 들어 데이터 저장부에 저장된)를 활용할 수 있다. 데이터세트는 훈련 또는 학습 시스템을 통해 구축될 수 있고, 소유주가 있을 수 있으며(예를 들어, CTT 시스템(114)의 관리자에 의해 수동으로 수집된 '내부용'), 상업적으로 획득된 것일 수도 있고, 또는 공개적으로 이용 가능한 인터넷 지식으로부터 유래될 수도 있다. 동작(618)의 결과인 메시지(이하, '제 8 중간 메시지'라 함)는, 비속어 모듈(316)에 의해 표시되어 무시될 비속어, 제안된 교체, 또는 비속어 모듈(316)에 의해 메시지에 삽입된(예를 들어, 그 식별된 비속어를 대신해서) 단어 또는 구절을 포함할 수 있다. 일부 예에서, 제 8 중간 메시지는 제 7 중간 메시지와 같을 수 있다(예를 들어, 비속어 모듈(316)에 의해 어떠한 교체도 수행되지 않을 때). 그 실시예에 따르면, 비속어 모듈(316)로부터 생성된 제 8 중간 메시지는, 추가적인 처리를 위해 다른 변환 모듈(변환 모듈(208)에 있는)로 제공될 수 있고, 또는 제안된 교체는, 비속어 모듈(316)에 의해 변환된 메시지가 데이터 저장부(210)에 있는지 여부를 결정하기 위해 CTT 제어 모듈(202)로 제공될 수 있다. 동작(618)의 후에, 제 8 중간 메시지는 처리를 위해 변환 모듈(208)의 다음 동작에 제공될 수 있다. 그리고 나서 방법(600)은 종료한다.In operation 618, the profanity module 316 may identify one or more profanity words or phrases (hereinafter, referred to as 'profanity') in a chat message, and may also substitute words or phrases corresponding to the profanity ( For example, a suitable alternative word may be further proposed (eg, a soft euphemism). In some embodiments, profanity module 316 may indicate that the identified profanity is omitted or ignored during the next machine translation (eg, by translation module 116). Additionally, in some embodiments, the identified profanity can be marked for follow-up review and disposition by a human operator (eg, the manager of the CTT system 114). To identify a slang and / or its corresponding word or phrase, in some embodiments, a dataset, including mapping between the slang and / or the slang and their corresponding words and phrases, For example, it may utilize (stored in the data storage). The dataset may be built through a training or learning system, may have an owner (eg, 'internally' collected manually by the administrator of the CTT system 114), or may be commercially obtained. Or, it may come from publicly available Internet knowledge. The message resulting from operation 618 (hereinafter referred to as the 'eighth intermediate message') is displayed by the profanity module 316 and inserted into the message by the profanity, suggested replacement, or profanity module 316 to be ignored. May include words or phrases (eg, in place of the identified profanity). In some examples, the eighth intermediate message may be the same as the seventh intermediate message (eg, when no replacement is performed by the slang module 316). According to that embodiment, the eighth intermediate message generated from the slang module 316 can be provided to another conversion module (in the conversion module 208) for further processing, or the proposed replacement is a slang module The message converted by 316 may be provided to the CTT control module 202 to determine whether it is in the data storage 210. After operation 618, the eighth intermediate message may be provided to the next operation of conversion module 208 for processing. The method 600 then ends.

일부 실시예에 따르면, 변환 모듈(208)로부터 궁극적으로 생성된 결과 메시지(예를 들어, 동작(618)의 결과로 생성된 제 8 중간 메시지)는, 나중에, 그 결과 메시지에 대한 번역으로서 역할을 할 수 있는 대응 메시지를 번역 데이터 저장부(210)로 질의하는데 사용될 수 있다. 일부 예에서, 변환 모듈(208)로부터 생성된 메시지(예를 들어, 나중에 번역 데이터 저장부(210)로 질의하는데 사용된 메시지)는, 어떤 변환도 초기 메시지에 적용되지 않았을 때(예를 들어, 초기 메시지가 어떤 변환도 적용되지 않고 동작(604-618)을 거침) 수신된(예를 들어, 동작(602)) 초기 메시지와 같을 수 있다는 것은, 당업자에게 자명하다. According to some embodiments, the result message ultimately generated from translation module 208 (eg, the eighth intermediate message generated as a result of operation 618), later, serves as a translation to the result message. It can be used to query the corresponding message to the translation data storage 210. In some examples, a message generated from the transformation module 208 (eg, a message used to query the translation data storage 210 later) is when no transformation is applied to the initial message (eg, It is apparent to those skilled in the art that the initial message may be the same as the initial message received (eg, operation 602) without any transformation applied and passed operations 604-618.

다양한 실시예들은, 설명된 것보다 더 많이 또는 더 적은 동작을 수행할 수 있고, 설명된 것과 다른 동작을 수행할 수도 있으며, 다른 순서로 동작을 수행할 수도 있음은 당업자에게 자명하다. 일반적으로, 수행되는 변환 동작의 종류와, 그 수행되는 순서는, 실시예에서 채용하는 변환 전략에 의존할 수 있다. 여기에서 언급하는 바와 같이, 다양한 실시예는 각각의 번역을 달성하는데 있어서 다른 변형/번역 전략을 구현할 수 있는데, 어떤 전략은 특정 번역 어플리케이션 또는 번역 문맥(context)에 적합하다. 채용되는 변환/번역 전략은, 어느 변환 동작이 수행되는지, 변환 동작이 언제 수행되는지, 또는 변환 동작이 수행되는 순서를 결정할 수 있다. 변환/번역 전략은 또한 번역이 번역 데이터 저장부로 이동되는 것 그리고 번역 데이터가 전체 변환/번역 프로세스에서 이용되는 때를 결정할 수 있다. It will be apparent to those skilled in the art that various embodiments may perform more or less operations than described, may perform other operations than those described, and may perform operations in a different order. In general, the type of conversion operation performed and the order in which it is performed can depend on the conversion strategy employed in the embodiment. As mentioned herein, various embodiments may implement different transformation / translation strategies in achieving each translation, which strategy is suitable for a particular translation application or translation context. The conversion / translation strategy employed may determine which conversion operation is performed, when the conversion operation is performed, or the order in which the conversion operation is performed. The translation / translation strategy can also determine when translations are transferred to the translation data store and when the translation data is used in the entire conversion / translation process.

일부 실시예에서, 방법(600)에서의 동작들의 결과인 중간 메시지는, 방법(600)에서의 다음 동작들의 결과인 메시지에 충격(impact) 및/또는 누적(cascade) 효과를 가할 수 있다. 추가적으로, 일부 실시예에서, 채팅 메시지가 방법(600)에 의해 처리될 때, 순서도(600)의 각 동작이, 방법이 끝나기 전에 채팅 메시지에 수행될 수 있다. 대안으로, 일부 실시예에서, 순서도(600)의 방법은, 도시된 동작의 오직 일부분이 수행되는 즉시 조기에 종료할 수 있다(예를 들어, 적어도 하나의 동작이 채팅 메시지의 변환을 야기한 이후). 일부 실시예에 따르면, 순서도(500)의 각 동작의 수행은, 최근의 결과인 변환된 메시지에 기초하여 원하는 언어의 대응하는 메시지를 번역 데이터 저장부(210)로 질의하는 것으로 이어질 수 있다. 만약 대응하는 메시지가 식별되면, 순서도(500)의 방법은 조기에 종료될 수 있다. In some embodiments, the intermediate message that is the result of the actions in method 600 may exert an impact and / or cascade effect on the message that is the result of the following actions in method 600. Additionally, in some embodiments, when the chat message is processed by method 600, each operation of flowchart 600 may be performed on the chat message before the method ends. Alternatively, in some embodiments, the method of flowchart 600 may terminate prematurely as soon as only a portion of the illustrated operation is performed (eg, after at least one operation has caused the conversion of a chat message). . According to some embodiments, performing each operation of the flowchart 500 may lead to querying the translation data storage 210 for a corresponding message in a desired language based on the converted message, which is a recent result. If a corresponding message is identified, the method of flowchart 500 may end prematurely.

다양한 실시예에서, 방법(600)은 동작(604-612)을 병렬로 수행할 수 있다. 예를 들어, CTT 제어 모듈(202)은, 초기 메시지를 두 개 또는 그 이상의 동작(604-612)으로 병렬로 제공할 수 있고, 그 두 개 또는 그 이상의 동작 각각으로부터 개별 응답을 수신할 수 있다. 각 응답은, 표시된 텍스트 부분, 제안된 교체, 또는 초기 메시지에 삽입된 단어 또는 구절을 포함할 수 있다. 이후, CTT 제어 모듈(202)은 다음의 처리(예를 들어, 번역 데이터 저장부(210) 질의 또는 번역 모듈(116)에 의한 번역)를 위해 그 수신된 응답 중 하나를, 가능한 선택 우선 순위(예를 들어, 추가적인 처리를 위해 그리고 어떤 선례에 따라 어느 변환된 메시지가 선택될 것인지를 결정할 수 있는)에 따라 선택한다. In various embodiments, method 600 may perform operations 604-612 in parallel. For example, the CTT control module 202 can provide the initial message in parallel in two or more actions 604-612, and receive separate responses from each of the two or more actions. . Each response may include a portion of the displayed text, a suggested replacement, or a word or phrase inserted in the initial message. Thereafter, the CTT control module 202 may select one of the received responses for the next processing (eg, translation data storage 210 query or translation by the translation module 116), a possible selection priority ( For example, it is selected for further processing and according to which precedent can determine which translated message is selected).

예를 들어, 방법(600) 동안에, CTT 제어 모듈(202)은, 초기 메시지를, 채팅용어 식별을 처리하는 동작(604), 보통명사 처리를 위한 동작(610), 약어 처리를 위한 동작(616)으로 제공할 수 있다. 응답으로, 동작(604)은 채팅용어가 변환된 초기 메시지를 회신할 수 있고, 동작(610)은 변경이 없는 초기 메시지를 회신할 수 있으며, 동작(616)은 약어가 변환된 초기 메시지를 회신할 수 있다. 다음으로, 선택 우선 순위에 기초하여, CTT 제어 모듈(202)은, 추가적인 처리를 위해 동작(616)으로부터 회신된 변환된 메시지를 선택할 수 있다. For example, during the method 600, the CTT control module 202 may process the initial message, an operation 604 for processing chat term identification, an operation 610 for common noun processing, an operation for abbreviation processing 616 ). In response, operation 604 may return the initial message with the chat term converted, operation 610 may return the initial message with no change, and operation 616 returns the initial message with the abbreviation converted. can do. Next, based on the selection priority, the CTT control module 202 can select the converted message returned from operation 616 for further processing.

일부 실시예에서, 시간 제한이 방법(600)의 여러 동작을 수행하는데 적용될 수 있다. 시간 제한은, 응답(Response)/결과(Result)가 그 시간 제한이 만료하기 전에 수신되지 않는다면, 방법(600)의 변환 동작의 수행이 중단되도록 한다. 이와 같이 될 경우, 다양한 실시예는, 어떤 변환 동작은 전체 변환/번역 프로세스를 불필요하게 방해하지 않는다는 것을 보장할 수 있다. In some embodiments, time limits may be applied to perform various operations of method 600. The time limit causes the conversion of method 600 to stop performing if the Response / Result is not received before the time limit expires. If so, various embodiments can ensure that any conversion operation does not unnecessarily interfere with the entire conversion / translation process.

비록 상술한 방법의 동작들이 특정 순서로 도시되고 설명되었지만, 동작들이 수행되는 순서는 실시예들 간에 다를 수 있음은 당업자에게 자명하다. 추가적으로, 순서도(600)의 방법과 관련하여 설명한 구성요소(component)는, 단지 방법에 사용될 수 있는 구성요소들의 예이고, 일부 실시예에서 다른 구성요소들이 또한 사용될 수 있음은 당업자에게 자명하다. Although the operations of the above-described method are shown and described in a specific order, it is apparent to those skilled in the art that the order in which the operations are performed may vary between embodiments. Additionally, it is apparent to those skilled in the art that the components described in connection with the method of flowchart 600 are only examples of components that may be used in the method, and other components may also be used in some embodiments.

도 7은, 다양한 실시예에 따라, 채팅 클라이언트 시스템(104)(예를 들어, 104-1 및 104-2) 사이의 예시적인 다중 언어 채팅 세션을 나타낸 도면(700)이다. 도시된 바와 같이, 채팅 클라이언트 시스템(104-1)은, 채팅 클라이언트 GUI 모듈(406-1)을 포함하고, 채팅 클라이언트 시스템(104-2)은, 채팅 클라이언트 GUI 모듈(406-2)을 포함한다. 여기에 설명하는 바와 같이, 각 채팅 클라이언트 GUI 모듈(406-1, 406-2)은, 각각 채팅 클라이언트 시스템(104-1, 104-2)의 사용자에게 그들 사이에 공유된 채팅 세션에 대한 그래픽 입력/출력 접근을 제공하도록 구성될 수 있다. 일부 실시예에서, 채팅 클라이언트 GUI 모듈(406-1, 406-2)은, 송수신된 채팅 대화와 관련하여 사용자 상호작용(interaction)을 수신하기 위해 맞춰진 클라이언트 GUI를, 각 사용자에게 제공할 수 있다. FIG. 7 is a diagram 700 illustrating an exemplary multi-language chat session between chat client systems 104 (eg, 104-1 and 104-2), in accordance with various embodiments. As shown, the chat client system 104-1 includes a chat client GUI module 406-1, and the chat client system 104-2 includes a chat client GUI module 406-2. . As described herein, each chat client GUI module 406-1, 406-2, graphically inputs a chat session shared between them to users of the chat client system 104-1, 104-2, respectively. / Can be configured to provide output access. In some embodiments, the chat client GUI modules 406-1 and 406-2 may provide each user with a client GUI tailored to receive user interactions in relation to the transmitted and received chat conversations.

채팅 대화(712)(도 7의 양방향 화살표로 표시됨)가 채팅 클라이언트 시스템(104-1, 104-2) 사이를 지나가듯이, 채팅 클라이언트 GUI 모듈(406-1, 406-2)은 각 채팅 클라이언트 시스템(104-1, 104-2)의 사용자에 의해 선택된 언어(암시적으로 또는 명시적으로)로 채팅 대화(712)를 제공할 수 있다. 도시된 바와 같이, 채팅 클라이언트 GUI 모듈(406-1)은, 출력 영역(708)에 제 1 언어(예, 영어)로 채팅 대화(712)를 제공하고 제 2 영역(710)에서 제 1 언어로 채팅 입력을 수신하도록 구성된 채팅 대화 박스(702)를 포함한다. 채팅 클라이언트 GUI 모듈(406-2)은, 출력 영역(720)에 제 2 언어(예, 불어)로 채팅 대화(712)를 제공하고 제 2 영역(722)에서 제 2 언어로 채팅 입력을 수신하도록 구성된 채팅 대화 박스(714)를 포함한다. 일부 실시예에서, 채팅 대화(712)가 대화 박스(702, 714)에 제공될 경우, 사용자가 채팅 대화(712)에 채팅 메시지를 입력하는 것과 관련된 사용자명(예, 사용자 온라인 식별자)의 표시를 포함할 수 있다. As the chat conversation 712 (indicated by the double arrow in FIG. 7) passes between the chat client systems 104-1, 104-2, the chat client GUI modules 406-1, 406-2 are each chat client system. A chat conversation 712 can be provided in a language (implicitly or explicitly) selected by the user of (104-1, 104-2). As shown, the chat client GUI module 406-1 provides a chat conversation 712 in a first language (eg, English) to the output area 708 and a first language in the second area 710. And a chat dialog box 702 configured to receive chat input. The chat client GUI module 406-2 provides a chat conversation 712 in a second language (eg, French) to the output area 720 and receives chat input in a second language in the second area 722. And a configured chat dialog box 714. In some embodiments, when a chat conversation 712 is provided in the dialog boxes 702 and 714, an indication of a user name (eg, user online identifier) associated with the user entering a chat message in the chat conversation 712 is provided. It can contain.

도 7의 실시예에서, 채팅 클라이언트 시스템(104-1)에서 선택된 언어는, 영어이고 채팅 클라이언트 시스템(104-2)에서 선택된 언어는 불어이다. 따라서, 채팅 메시지(704)(“LOL”) 및 채팅 메시지(706)(“Who u laughin at?”)는, 채팅 클라이언트 GUI 모듈(406-1)의 대화 박스(702)에서 영어로 표시된다. 반면, 이들의 각 대응하는 채팅 메시지(716)(“MDR”) 및 채팅 메시지(718)(“Qui te fair rire?”)는, 채팅 클라이언트 GUI 모듈(406-2)의 대화 박스(714)에서 불어로 표시된다. 채팅 메시지(704, 706, 716 및 718)의 번역은, 여기에 설명된 다양한 시스템들 및 방법들을 통해 달성될 수 있다. 채팅 메시지(704, 706, 716 및 718)와 유사한 메시지들의 번역과 관련된 더 많은 내용은, 도 8 내지 도 10을 참조하여 설명된다. In the embodiment of FIG. 7, the language selected in the chat client system 104-1 is English and the language selected in the chat client system 104-2 is French. Therefore, the chat message 704 (“LOL”) and the chat message 706 (“Who u laughin at?”) Are displayed in English in the dialog box 702 of the chat client GUI module 406-1. On the other hand, their respective chat messages 716 (“MDR”) and chat messages 718 (“Qui te fair rire?”) Are displayed in the dialog box 714 of the chat client GUI module 406-2. It is displayed in French. Translation of chat messages 704, 706, 716 and 718 can be accomplished through various systems and methods described herein. More details regarding the translation of messages similar to chat messages 704, 706, 716 and 718 are described with reference to FIGS. 8-10.

도 8은 다양한 실시예에 따른, 예시적인 다중 언어 통신 방법(800)의 동작을 설명하는 순서도이다. 아래에서 설명하는 바와 같이, 일부 실시예에서, 방법(800)은 채팅 클라이언트 시스템(104-1), 채팅 클라이언트 시스템(104-2) 및 CTT 시스템(114)(예, 채팅 서버(108))와 관련되어 동작들을 수행할 수 있다. 특히, 도 8은, 일부 실시예에 따라 텍스트 'LOL'을 포함하는 영어 채팅 메시지를 불어 채팅 메시지로 번역하는 것을 설명한다. 이러한 상황은, 제 1 채팅 클라이언트 시스템(104-1)의 사용자에 의해 사용되는 언어가 영어이고, 제 2 채팅 클라이언트 시스템(104-2)의 사용자에 의해 사용되는 언어가 불어일 때 발생할 수 있다. 일부 실시예에 따라, CTT 시스템(114)은 자동으로 채팅 클라이언트 시스템(104-1, 104-2)의 언어 선택/선호를 감지할 수 있다. 8 is a flow chart illustrating the operation of an exemplary multi-lingual communication method 800, in accordance with various embodiments. As described below, in some embodiments, method 800 includes chat client system 104-1, chat client system 104-2, and CTT system 114 (eg, chat server 108). Related actions can be performed. In particular, FIG. 8 describes translating an English chat message containing the text 'LOL' into a French chat message, according to some embodiments. This situation may occur when the language used by the user of the first chat client system 104-1 is English, and the language used by the user of the second chat client system 104-2 is French. According to some embodiments, the CTT system 114 may automatically detect language selection / preferration of the chat client systems 104-1 and 104-2.

도시된 바와 같이, 동작(802)에서, 제 1 채팅 클라이언트 시스템(104-1)은, 전송하려는 영어 메시지를 제 2 채팅 클라이언트 시스템(104-2)로 제공한다(예를 들어, 채팅 호스트 시스템(112)을 통해). 영어 메시지는, 번역 처리를 위해 CTT 시스템(114) 의 CTT 제어 모듈(202)로 라우팅될 수 있다. As shown, in operation 802, the first chat client system 104-1 provides an English message to be sent to the second chat client system 104-2 (eg, a chat host system ( 112). The English messages can be routed to the CTT control module 202 of the CTT system 114 for translation processing.

동작(804)에서, CTT 제어 모듈(202)은, 영어 채팅 메시지(“LOL”)에 대응하는 채팅 메시지를, 번역 데이터 저장부(210)로 질의하고, 그것은 불어로 미리 번역(pre-translated)된다. 응답으로, 동작(806)에서, 번역 데이터 저장부(210)는 영어 채팅 메시지(“LOL”)에 대응하는 불어 메시지(“MDR”)를 CTT 제어 모듈(202)로 회신한다. 다음으로, 동작(808)에서, CTT 제어 모듈(202)은, 대응하는 불어 채팅 메시지(“MDR”)를 제 2 채팅 클라이언트 시스템(104-2)으로 전송하는 것을 돕는다(예를 들어, CTT 시스템(114)은, 전송을 위해, 대응하는 불어 채팅 메시지를 채팅 호스트 시스템(112)으로 전달한다). In operation 804, the CTT control module 202 queries the translation data storage 210 for a chat message corresponding to the English chat message (“LOL”), which is pre-translated in French. do. In response, in operation 806, the translation data storage 210 returns a French message (“MDR”) corresponding to the English chat message (“LOL”) to the CTT control module 202. Next, in operation 808, the CTT control module 202 helps to send the corresponding French chat message (“MDR”) to the second chat client system 104-2 (eg, the CTT system). (114) sends a corresponding French chat message to the chat host system 112 for transmission).

도 9는 다양한 실시예에 따른, 예시적인 다중 언어(multi-lingual) 통신 방법(900)의 동작을 나타내는 순서도이다. 후술하는 일부 실시예들에서, 상기 순서도를 통해 설명된 상기 방법(900)은 채팅 클라이언트 시스템(104-1), 채팅 클라이언트 시스템(104-2), CTT 시스템(114)(예를 들면, 채팅 서버(108)) 및 번역 모듈(116)(예를 들면, 번역 서버(110))과 관련된 동작을 수행할 수 있다. 특히, 도 9는 일부 실시예들에 따라, 텍스트 "LOL"를 포함하는 영어 채팅 메시지를 불어 동의어 채팅 메시지로 번역하는 것을 설명한다. 도시된 도 8의 실시예와 다르게, 도 9는 변환 모듈(208)(예를 들어, CTT 시스템(114)) 및 번역 모듈(116)에 대한 사용을 도시한다.9 is a flow chart illustrating operation of an exemplary multi-lingual communication method 900, in accordance with various embodiments. In some embodiments described below, the method 900 described through the flowchart is a chat client system 104-1, a chat client system 104-2, a CTT system 114 (eg, a chat server (108) and translation module 116 (eg, translation server 110). In particular, FIG. 9 describes translating an English chat message containing the text “LOL” into a French synonym chat message, according to some embodiments. Unlike the illustrated embodiment of FIG. 8, FIG. 9 shows use for translation module 208 (eg, CTT system 114) and translation module 116.

도시된 바와 같이, 동작(902)에서, 제1 채팅 클라이언트 시스템(104-1)은 불어로 말하는 사용자를 갖는 제2 채팅 클라이언트 시스템(104-2)으로 (예를 들어, 채팅 호스트 시스템(112)을 통하여) 전송을 위한 영어 채팅 메시지를 제공할 수 있다. 상기 영어 채팅 메시지는 번역 처리를 위해 CTT 시스템(114)의 CTT 제어 모듈(202)로 라우팅된다.As shown, in operation 902, the first chat client system 104-1 is directed to a second chat client system 104-2 having a French-speaking user (eg, the chat host system 112). It is possible to provide an English chat message for transmission. The English chat message is routed to the CTT control module 202 of the CTT system 114 for translation processing.

동작(904)에서, 상기 CTT 제어 모듈(202)은 상기 영어 채팅 메시지("LOL")에 대응하는 불어 동의어 채팅 메시지를, 상기 번역 데이터 저장부(210)로 질의할 수 있다. 동작(906)에서, 번역 데이터 저장부(210)는 응답으로서, 상기 영어 채팅 메시지("LOL")와 대응되는 불어 채팅 메시지를 번역 데이터 저장부(210)가 보유하고 않음을 나타내는 질의 실패(query failure)를 CTT 제어 모듈(202)로 회신할 수 있다. 이 경우, 동작(908)에서, CTT 제어 모듈(202)은, 일부 실시예들에 따른 변환 처리를 진행하기 위해, 상기 영어 채팅 메시지를 변환 모듈(208)로 전달할 수 있다. 여기서 설명한 바와 같이, 상기 변환 모듈(208)은, 채팅 메시지를 추가적인 번역 처리를 위하여 보다 적합한 메시지로 변환하도록 구성되는, 다수의 변환 관련 모듈(932)을 포함할 수 있다.In operation 904, the CTT control module 202 may query the French synonym chat message corresponding to the English chat message ("LOL") to the translation data storage 210. In operation 906, the translation data storage unit 210, as a response, a query failure indicating that the translation data storage unit 210 does not hold the French chat message corresponding to the English chat message (“LOL”) (query failure) to the CTT control module 202. In this case, in operation 908, the CTT control module 202 may deliver the English chat message to the conversion module 208 in order to proceed with conversion processing according to some embodiments. As described herein, the conversion module 208 may include a number of conversion related modules 932 configured to convert chat messages into more suitable messages for further translation processing.

동작(910)에서, 변환 모듈(208)의 채팅용어 모듈(302)은 상기 영어 채팅 메시지("LOL")를 변환된 영어 채팅 메시지("laugh out loud")로 변환할 수 있고, 추가적인 처리를 위해 CTT 제어 모듈(202)로 상기 변환된 영어 채팅 메시지를 회신할 수 있다. 당업자는 일부 실시예들에서, 상기 변환된 영어 채팅 메시지가 CTT 제어 모듈(202)로 회신되기 전에, 상기 영어 채팅 메시지가 상기 변환 모듈(208)의 추가적인 모듈에 의해 처리될 수 있음을 이해할 것이다.In operation 910, the chat term module 302 of the conversion module 208 may convert the English chat message ("LOL") into a converted English chat message ("laugh out loud"), and further processing To this end, the converted English chat message may be returned to the CTT control module 202. Those skilled in the art will understand that in some embodiments, before the converted English chat message is returned to the CTT control module 202, the English chat message may be processed by an additional module of the conversion module 208.

동작(912)에서, CTT 제어 모듈(202)은 상기 변환된 영어 채팅 메시지("laugh out loud)에 대응하는 불어 동의어 채팅 메시지를, 번역 데이터 저장부(210)로 질의할 수 있다. 동작(914)에서, 번역 데이터 저장부(210)는 응답으로서, 상기 변환된 영어 채팅 메시지("laugh out loud")에 대응되는 불어 채팅 메시지를 상기 번역 데이터 저장부(210)가 보유하고 있지 않음을 나타내는 질의 실패를 CTT 제어 모듈(202)로 회신할 수 있다. 이 경우, 동작(916)에서, CTT 제어 모듈(202)은 일부 실시예들에 따른 기계 번역 처리를 위하여, 상기 변환된 영어 채팅 메시지를 번역 모듈(116)로 전송할 수 있다.In operation 912, the CTT control module 202 may query the French synonym chat message corresponding to the converted English chat message ("laugh out loud") to the translation data storage 210. Operation 914 ), The translation data storage 210 is a query indicating that the translation data storage 210 does not have a French chat message corresponding to the converted English chat message ("laugh out loud") as a response. The failure can be returned to the CTT control module 202. In this case, in operation 916, the CTT control module 202 translates the translated English chat message for machine translation processing according to some embodiments. Module 116.

동작(918)에서, 상기 번역 모듈(116)은 상기 변환된 영어 채팅 메시지에 대응되는 기계 번역된 불어 채팅 메시지("mort de rire")를 회신할 수 있다. 상기 기계 번역된 결과인 불어 채팅 메시지("mort de rire")는 영어 채팅용어 채팅 메시지("LOL")의 변환된 번역의 예시이다.In operation 918, the translation module 116 may reply to a machine translated French chat message (“mort de rire”) corresponding to the converted English chat message. The machine-translated French chat message ("mort de rire") is an example of a translated translation of the English chat-term chat message ("LOL").

동작(920)에서, CTT 제어 모듈(202)은 일부 실시예들에 따라, 상기 기계 번역된 불어 메시지가 추가적인 변환 처리되도록, 기계 번역된 불어 채팅 메시지("mort de rire")를 변환 모듈(208)로 제공할 수 있다. 여기에서 언급된 바와 같이, 기계 번역된 텍스트는 불어 번역이 더 정제되도록 추가 변환 처리를 위해 제공될 수 있다. 예를 들어, 원본 영어 채팅 메시지가 영어 채팅용어를 포함하는 경우, 추가 변형 처리는, 가능한 범위 내에서 불어 채팅용어를 추가할 수 있다. 따라서, 동작(922)에서, 변환 모듈(208)의 채팅용어 모듈(302)은 상기 기계 번역된 불어 채팅 메시지("mort de rire")를, 변환된 불어 채팅 메시지("MDR")로 변환할 수 있고, 추가 처리를 위해 상기 변환된 불어 채팅 메시지를 CTT 제어 모듈(202)로 회신할 수 있다.In operation 920, the CTT control module 202 converts the machine translated French chat message (“mort de rire”) to the machine translated French message (“mort de rire”), according to some embodiments. ). As mentioned herein, machine translated text can be provided for further conversion processing so that the French translation is further refined. For example, if the original English chat message includes English chat terms, the additional variant processing may add French chat terms to the extent possible. Thus, in operation 922, the chatspeak module 302 of the conversion module 208 converts the machine translated French chat message ("mort de rire") into a converted French chat message ("MDR"). It is possible to return the converted French chat message to the CTT control module 202 for further processing.

결국, 동작(924)에서, CTT 제어 모듈(202)은, 대응하는 불어 채팅 메시지("MDR")를 제2 채팅 클라이언트 시스템(104-2)으로 전송하는 것을 돕는다(예를 들어, CTT 시스템(114)은, 전송을 위해, 대응하는 불어 채팅 메시지를 채팅 호스트 시스템(112)으로 전달한다). 추가적으로, 동작(926)에서, CTT 제어 모듈(202)은 원본 영어 채팅 메시지("LOL")와 상기 번역된 불어 채팅 메시지("MDR") 간의 변환된 번역의 번역 매핑을 번역 데이터 저장부(210)에 저장할 수 있다. 일단 상기 매핑이 번역 데이터 저장부(210)에 저장되면, 상기 매핑은 예를 들어 도 8에 도시된 바와 같이, 향후의 번역 속도를 향상시키기 위해 번역 엔트리(translation entries)를 저장하는데 이용될 수 있다. 여기서 언급된 바와 같이, 번역 데이터 저장부(210)는 변환되지 않은 번역 및 변환된 번역의 매핑을 저장할 수 있다.Eventually, in operation 924, the CTT control module 202 helps to send the corresponding French chat message (“MDR”) to the second chat client system 104-2 (eg, the CTT system ( 114) sends the corresponding French chat message to the chat host system 112 for transmission). Additionally, in operation 926, the CTT control module 202 translates the translation mapping of the translated translation between the original English chat message ("LOL") and the translated French chat message ("MDR") translation data storage 210 ). Once the mapping is stored in translation data storage 210, the mapping can be used to store translation entries to improve future translation speed, for example, as shown in FIG. . As mentioned herein, the translation data storage 210 may store untranslated translations and mappings of the translated translations.

일부 실시예에서, CTT 제어 모듈(202)은 또한, 방법(900)의 동작 동안에 결정된, 동등한 (변환되고 변환되지 않은) 번역 매핑을 저장할 수도 있다. 일부 실시예에서, 번역 매핑은, 번역 데이터 저장부(210)에 원래 저장되지 않은 채팅 메시지(예를 들어, 동작(904)에서 도시된 채팅 메시지와 동작(912)에서 도시된 채팅 메시지)와, 번역 데이터 저장부(210)의 검색 후의 동작 동안에 결정된 대응하는 메시지 간의 매핑(예를 들어, 결과를 회신받지 못한 번역 데이터 저장부(210)로의 질의 및 상기 질의 후에 변환 모듈(208) 및/또는 번역 모듈(116)을 통해 결정된 대응하는 채팅 메시지 간의 매핑)일 수 있다.In some embodiments, CTT control module 202 may also store equivalent (translated and untranslated) translation mappings determined during operation of method 900. In some embodiments, the translation mapping includes chat messages that are not originally stored in the translation data storage 210 (eg, chat messages shown in operation 904 and chat messages shown in operation 912), Mapping between corresponding messages determined during an operation after the search of the translation data storage unit 210 (for example, a query to the translation data storage unit 210 that does not receive a result and a conversion module 208 and / or translation after the query) It may be a mapping between corresponding chat messages determined through module 116).

예를 들어, 도 9에 도시된 바와 같이, CTT 제어 모듈(202)은 동작(904)에서 원본 영어 채팅 메시지("LOL")를 번역 데이터 저장부(210)로 질의하고, 동작(912)에서, 변환된 영어 채팅 메시지("laugh out loud")를 번역 데이터 저장부(210)로 질의하며, CTT 제어 모듈(202)은 (동작(906) 및 동작(914) 각각에서) 두 질의에 대해, 번역 데이터 저장부(210)로부터 어떠한 결과도 수신하지 않는다. 그러나 동작(916)에서, CTT 제어 모듈(202)은 결국 기계 번역을 위해, 변환된 영어 메시지("laugh out loud")를 기계 번역 모듈(116)로 전달하고 이에 대한 응답으로 동작(918)에서 기계 번역된 불어 채팅 메시지("mort de rire")를 수신한다. 이에 따라, 동작(928)에서, CTT 제어 모듈(202)은 원본 영어 채팅 메시지("LOL")와 기계 번역된 불어 채팅 메시지("mort de rire") 간의 변환된 번역의 번역 매핑을 번역 데이터 저장부(210)에 저장할 수 있다. 마찬가지로, 동작(930)에서, CTT 제어 모듈(202)은 변환된 영어 채팅 메시지("laugh out loud")와 기계 번역된 불어 채팅 메시지("mort de rire") 간의 변환 번역의 번역 매핑을 번역 데이터 저장부(210)에 저장할 수 있다. 이러한 과정을 통해서, 다음 번의 방법(900)은 원본 영어 채팅 메시지("LOL") 또는 변환된 영어 채팅 메시지("laugh out loud")를 번역 데이터 저장부(210)로 질의하고, 번역 데이터 저장부(210)는 대응하는 변환된 번역을 제공할 것이다.For example, as shown in FIG. 9, the CTT control module 202 queries the translation data storage 210 for the original English chat message (“LOL”) in operation 904, and in operation 912 , Query the translated English chat message ("laugh out loud") to the translation data storage 210, and the CTT control module 202 (for each of the operations 906 and 914) for both queries, No results are received from the translation data storage 210. However, in operation 916, the CTT control module 202 eventually forwards the translated English message (“laugh out loud”) to the machine translation module 116 for machine translation and in response to operation 918 Receive a machine translated French chat message ("mort de rire"). Accordingly, in operation 928, the CTT control module 202 stores translation data of the translation mapping of the translated translation between the original English chat message ("LOL") and the machine translated French chat message ("mort de rire"). It can be stored in the unit 210. Similarly, in operation 930, the CTT control module 202 translates translation mappings of translation translations between translated English chat messages ("laugh out loud") and machine translated French chat messages ("mort de rire"). It may be stored in the storage unit 210. Through this process, the next method 900 queries the original English chat message ("LOL") or the converted English chat message ("laugh out loud") with the translation data storage unit 210, and the translation data storage unit 210 will provide the corresponding translated translation.

도10은 다양한 실시예에 따르는 예시적인 다중 언어 통신 방법의 동작을 나타내는 순서도이다. 후술하는 바와 같이, 일부 실시예들에서, 상기 방법(1000)은 채팅 클라이언트 시스템(104-1), 채팅 클라이언트 시스템(104-2), CTT 시스템(114)(예를 들면, 채팅 서버(108)) 및 번역 모듈(116)(예를 들면, 번역 서버(110))와 관련된 동작을 수행할 수 있다. 특히, 도 10은 일부 실시예들에 따라, 텍스트 "Who u laughin at?"로 구성된 영어 채팅 메시지를 불어 채팅 메시지로 번역하는 것을 설명한다. 10 is a flow chart illustrating operation of an exemplary multi-lingual communication method in accordance with various embodiments. As described below, in some embodiments, the method 1000 includes a chat client system 104-1, a chat client system 104-2, and a CTT system 114 (eg, chat server 108). ) And the translation module 116 (eg, the translation server 110). In particular, FIG. 10 describes translating an English chat message composed of the text “Who u laughin at?” Into a French chat message, according to some embodiments.

동작(1002)에서 도시된 바와 같이, 제1 채팅 클라이언트 시스템(104-1)은 제2 채팅 클라이언트 시스템(104-2)으로 (예를 들어, 채팅 호스트 시스템(112)을 통하여) 전송을 위해 영어 채팅 메시지를 전달할 수 있다. 상기 영어 채팅 메시지는 번역 처리를 위해 CTT 시스템(114)의 CTT 제어 모듈(202)로 라우팅된다.As shown in operation 1002, the first chat client system 104-1 is in English for transmission to the second chat client system 104-2 (eg, via the chat host system 112). You can deliver a chat message. The English chat message is routed to the CTT control module 202 of the CTT system 114 for translation processing.

동작(1004)에서, 상기 CTT 제어 모듈(202)은 상기 영어 채팅 메시지("Who u laughin at?")에 대응하는 불어 동의어 채팅 메시지를, 번역 데이터 저장부(210)로 질의할 수 있다. 동작(1006)에서, 번역 데이터 저장부(210)는 응답으로서, 상기 영어 채팅 메시지("Who u laughin at?")에 대응하는 불어 채팅 메시지를 상기 데이터 저장 장치(210)가 보유하고 않음을 나타내는 질의 실패(query failure)를 CTT 제어 모듈(202)로 회신할 수 있다. 이 경우, 동작(1008)에서, CTT 제어 모듈(202)은, 일부 실시예들에 따른 변환 처리를 진행하기 위해, 상기 영어 채팅 메시지를 변환 모듈(208)로 전달할 수 있다. 여기서 설명한 바와 같이, 상기 변환 모듈(208)은, 채팅 메시지를 추가적인 번역 처리에 더 적합한 메시지로 변환하도록 구성되는 다수의 변환 관련 모듈(1036)을 포함할 수 있다. In operation 1004, the CTT control module 202 may query the French synonym chat message corresponding to the English chat message ("Who u laughin at?") To the translation data storage 210. In operation 1006, the translation data storage unit 210, in response, indicates that the data storage device 210 does not hold a French chat message corresponding to the English chat message (“Who u laughin at?”). The query failure may be returned to the CTT control module 202. In this case, in operation 1008, the CTT control module 202 may deliver the English chat message to the conversion module 208 in order to proceed with conversion processing according to some embodiments. As described herein, the conversion module 208 may include a number of conversion related modules 1036 configured to convert chat messages into messages that are more suitable for further translation processing.

동작(1010)에서, 변환 모듈(208)의 채팅용어 모듈(302)은 상기 영어 채팅 메시지("Who u laughin at?")를 변환된 영어 채팅 메시지("Who you laughin at?")로 변환할 수 있고, 추가 처리를 위해 맞춤법 검사 모듈(312)과 같은 변환 모듈(208)의 추가적인 모듈로 상기 변환된 영어 채팅 메시지를 전달할 수 있다. In operation 1010, the chat term module 302 of the conversion module 208 converts the English chat message ("Who u laughin at?") Into a converted English chat message ("Who you laughin at?"). In addition, for further processing, the converted English chat message may be delivered to an additional module of the conversion module 208 such as the spell check module 312.

논의된 바와 같이, 채팅용어 모듈(302)을 포함하는 변환 모듈(208)의 다양한 모듈은, 채팅 메시지에서 하나 이상의 단어 또는 구절을 식별하고, 상기 식별된 단어 또는 구절에 대한 교체 단어 또는 구절을 추천하도록 구성될 수 있다. 따라서, 당업자들은, 일부 실시예에서, 변환 모듈(208)의 한 모듈에 의해 수행되고/제안된 변환은, 채팅 메시지의 단어에서 구절 또는 구절에서 구절로의 변환을 포함할 수 있음을 이해할 것이다. 예를 들어, 동작(1010)에서, 채팅 용어 모듈(302)은 대안으로 (단어 "laughin"을 "laughing"으로 교체하는 교체/제안 다음의) 변환 과정에서 구절 "who u"를 "who are you"로 교체/제안하여 영어 채팅 메시지("Who u laughin at?")를 변환된 영어 채팅 메시지("Who are you laughing at?")로 변환할 수 있다. 이러한 과정을 통해서, 채팅 용어 모듈(302)과 같은 변형 모듈(208)의 다양한 모듈들은, 각각의 변환에 있어서의 문법적인 개선사항을 제공할 것이고, 문법 개선을 실현하기 위하여 변환 모듈(208)에 대한 별도의 모듈에 대한 필요성을 제거할 수 있을 것이다. As discussed, various modules of conversion module 208, including chat term module 302, identify one or more words or phrases in a chat message, and recommend replacement words or phrases for the identified words or phrases It can be configured to. Thus, those skilled in the art will understand that, in some embodiments, the conversion performed / suggested by one module of the conversion module 208 may include a word-to-verse or phrase-to-verse conversion of a chat message. For example, in operation 1010, the chat term module 302 alternatively replaces the phrase “who u” in the conversion process (after the replacement / suggestion to replace the word “laughin” with “laughing”) “who are you” You can convert the English chat message ("Who u laughin at?") Into a converted English chat message ("Who are you laughing at?"). Through this process, various modules of the transformation module 208, such as the chat term module 302, will provide grammatical improvements in each transformation, and to the transformation module 208 to realize grammar improvement. The need for a separate module for this would be eliminated.

일부 실시예에서, 영어 채팅 메시지가 변환 모듈(208)의 추가적인 모듈로 전달되기 전에, 채팅용어 모듈(302)은 동작(1010)에서 CTT 제어 모듈(202)로 상기 변환된 영어 채팅 메시지를 전달할 수 있다. 결국, CTT 제어 모듈(202)은 (동작(1012)에서) 번역 데이터 저장부(210)로 상기 변환된 영어 채팅 메시지(“Who you laughin at?")에 대응하는 불어 동의어 채팅 메시지를 질의할 수 있다. 동작(1014)에서, 번역 데이터 저장부(210)는 응답으로서, 상기 변환된 영어 채팅 메시지("Who you laughin at?")에 대응하는 불어 채팅 메시지를 번역 데이터 저장부(210)가 보유하지 않음을 나타내는 질의 실패(query failure)를 CTT 제어 모듈(202)로 회신할 수 있다. In some embodiments, before the English chat message is delivered to an additional module of the conversion module 208, the chat term module 302 may deliver the converted English chat message from operation 1010 to the CTT control module 202. have. Eventually, the CTT control module 202 may query the French synonym chat message corresponding to the converted English chat message (“Who you laughin at?”) To the translation data storage 210 (in operation 1012). In operation 1014, the translation data storage 210 stores, in response, the French chat message corresponding to the converted English chat message (“Who you laughin at?”). The query failure indicating not being performed may be returned to the CTT control module 202.

동작(1016)에서, 맞춤법 검사 모듈(312)은, 동작(1018)에서, 변환된 영어 채팅 메시지("Who you laughin at?")에 대한 맞춤법 검사 처리를 수행할 수 있다. 상기 맞춤법 검사 처리를 수행하는 동안에, 맞춤법 검사 모듈(312)은 상기 변환된 영어 채팅 메시지를 정정된 영어 채팅 메시지("Who you laughing at?")로 정정할 수 있고, 상기 정정된 영어 채팅 메시지를 CTT 제어 모듈(202)로 회신할 수 있다. 당업자는 일부 실시예에서, 상기 변환된 영어 채팅 메시지가 CTT 제어 모듈(202)로 회신되기 전에, 정정된 영어 채팅 메시지가 변환 모듈(208)의 추가적인 모듈에 의해 처리될 수 있음을 이해할 것이다.In operation 1016, the spell check module 312 may, in operation 1018, perform spell checking processing on the converted English chat message (“Who you laughin at?”). While performing the spell check processing, the spell check module 312 may correct the converted English chat message into a corrected English chat message ("Who you laughing at?"), And the corrected English chat message. It is possible to reply to the CTT control module 202. Those skilled in the art will understand that in some embodiments, the corrected English chat message can be processed by an additional module of the conversion module 208 before the converted English chat message is returned to the CTT control module 202.

동작(1020)에서, CTT 제어 모듈(202)은 상기 정정된 영어 채팅 메시지(“Who you laughing at?")와 대응되는 불어 동의어 채팅 메시지를 번역 데이터 저장부(210)로 질의할 수 있다. 동작(1022)에서, 번역 데이터 저장부(210)는 응답으로서, 상기 정정된 영어 채팅 메시지("Who you laughing at?")와 대응되는 불어 채팅 메시지를 번역 데이터 저장부(210)가 보유하고 있지 않음을 나타내는 질의 실패를 CTT 제어 모듈(202)로 회신할 수 있다. 이 경우, 동작(1024)에서, CTT 제어 모듈(202)은 일부 실시예들에 따른 기계 번역 처리를 위하여, 상기 정정된 영어 채팅 메시지를 번역 모듈(116)로 전달할 수 있다.In operation 1020, the CTT control module 202 may query the translation data storage 210 for a French synonym chat message corresponding to the corrected English chat message (“Who you laughing at?”). At 1022, the translation data storage 210 does not hold the French chat message corresponding to the corrected English chat message ("Who you laughing at?") As a response. A query failure indicating C may be returned to the CTT control module 202. In this case, in operation 1024, the CTT control module 202 may revise the corrected English chat for machine translation processing according to some embodiments. The message can be delivered to the translation module 116.

여기서 언급된 바와 같이, 상기 기계 번역된 텍스트는 텍스트 번역을 더 정제하기 위한 추가 변환 처리를 위해 전달될 수 있다. 예를 들어, 원본 영어 채팅 메시지가 영어 채팅용어를 포함하는 경우, 추가 변환 처리는, 가능한 범위 내에서, 불어 채팅용어를 추가할 수 있다. 동작(1030)에서, 변환 모듈(208)은 (예를 들어, 변환 모듈(208)의 모듈이 상기 기계 번역된 불어 채팅 메시지에 어떠한 변경도 적용하지 않은 경우) 추가 처리를 위해, 변경되지 않은 기계 번역된 불어 채팅 메시지("Qui te fait rire?")를 CTT 제어 모듈(202)로 회신할 수 있다.As mentioned herein, the machine translated text can be delivered for further conversion processing to further refine the text translation. For example, if the original English chat message includes English chat terms, the additional conversion processing may add French chat terms to the extent possible. In operation 1030, the conversion module 208 is configured for further processing (e.g., if the module of the conversion module 208 has not applied any changes to the machine translated French chat message), the unchanged machine The translated French chat message (“Qui te fait rire?”) May be returned to the CTT control module 202.

동작(1032)에서, CTT 제어 모듈(202)은 기계 번역된 불어 채팅 메시지("Qui te fait rire?")를 제2 채팅 클라이언트 시스템(104-2)로 전송하는 것을 돕는다(예를 들어, CTT 시스템(114)은, 전송하고자 하는 상기 대응하는 불어 채팅 메시지를 채팅 호스트 시스템(112)으로 전달한다). 게다가, 동작(1034)에서, CTT 제어 모듈(202)은 원본 영어 채팅 메시지("Who u laughin at?")와 상기 번역된 불어 채팅 메시지("Qui te fait rire?") 간의 번역 매핑을 번역 데이터 저장부(210)에 저장할 수 있다. 여기서 설명한 바와 같이, 추가적인 동작들(도면에 도시되지 않음)에서, CTT 제어 모듈(202)은 또한, 이전에 번역 데이터 저장부(210)에 실패한 질의 및 이 질의(예를 들어, 도 9의 동작(928) 및 동작(930)과 유사한) 이후에 결정된 대응 메시지에 기초한, 동의어 번역 매핑을 번역 데이터 저장부(210)에 저장할 수 있다.In operation 1032, the CTT control module 202 assists in sending a machine translated French chat message (“Qui te fait rire?”) To the second chat client system 104-2 (eg, CTT). The system 114 delivers the corresponding French chat message to be transmitted to the chat host system 112). In addition, in operation 1034, the CTT control module 202 translates the translation mapping between the original English chat message ("Who u laughin at?") And the translated French chat message ("Qui te fait rire?"). It may be stored in the storage unit 210. As described herein, in additional operations (not shown in the figure), the CTT control module 202 may also query previously failed translation data storage 210 and this query (eg, the operation of FIG. 9). (Similar to 928 and operation 930), the synonym translation mapping based on the determined corresponding message may be stored in the translation data storage 210.

일부 실시예에 따르면, 변환 모듈(208)이 수행하는 변환 동작은 병렬적으로 소정의 변환 동작을 수행하는 것을 포함할 수 있고, 직렬적으로 소정의 변환 동작을 수행하는 것을 포함할 수 있다. 병렬적이고 직렬적으로 수행되는 변환 동작의 순서는 다양한 실시예들 사이에서 달라질 수 있다. 여기서 설명한 바와 같이, 변환 동작이 병렬적으로 수행되는 일부 실시예들은, 추가적인 변환을 위해 어떤 변환 메시지가 선택될지를 결정하기 위해 그리고 어떠한 선례에 따라, 선택의 우선순위를 채택할 수 있다. According to some embodiments, the conversion operation performed by the conversion module 208 may include performing a predetermined conversion operation in parallel, and may include performing a predetermined conversion operation in series. The order of conversion operations performed in parallel and serially may vary between various embodiments. As described herein, some embodiments in which the conversion operation is performed in parallel may adopt the priority of selection to determine which conversion message is selected for further conversion and according to some precedent.

도 11은 다양한 실시예에 따르는 예시적인 다중 언어 통신 방법의 동작을 나타내는 순서도이다. 후술하는 일부 실시예들에서, 상기 방법(1100)은 채팅 클라이언트 시스템(104-1), 채팅 클라이언트 시스템(104-2), (예를 들어, 채팅 서버(108)의) CTT 시스템(114) 및 (예를 들어, 번역 서버(110)의) 번역 모듈(116)과 관련된 동작을 수행할 수 있다. 특히, 도 11은 일부 실시예들에 따라, 텍스트 "Who u laughin at?"를 포함하는 영어 채팅 메시지를 병렬적인 변환 동작을 통해 불어 채팅 메시지로 번역하는 것을 설명한다. 11 is a flow chart illustrating operation of an exemplary multi-lingual communication method in accordance with various embodiments. In some embodiments described below, the method 1100 includes a chat client system 104-1, a chat client system 104-2, a CTT system 114 (eg, of the chat server 108), and An operation related to the translation module 116 (eg, of the translation server 110) may be performed. In particular, FIG. 11 describes translating an English chat message containing the text “Who u laughin at?” Into a French chat message through a parallel conversion operation, according to some embodiments.

동작(1102)에서 도시된 바와 같이, 제1 채팅 클라이언트 시스템(104-1)은 제2 채팅 클라이언트 시스템(104-2)으로 (예를 들어, 채팅 호스트 시스템(112)을 통하여) 전송하고자 하는 영어 채팅 메시지를 전달할 수 있다. 상기 영어 채팅 메시지는 번역 처리를 위해 CTT 시스템(114)의 CTT 제어 모듈(202)로 라우팅될 수 있다.As shown in operation 1102, the first chat client system 104-1 is an English language that is intended to be transmitted (eg, via the chat host system 112) to the second chat client system 104-2. You can deliver a chat message. The English chat message can be routed to the CTT control module 202 of the CTT system 114 for translation processing.

동작(1104)에서, 상기 CTT 제어 모듈(202)은 상기 영어 채팅 메시지("Who u laughin at?")에 대응하는 불어 동의어 채팅 메시지를, 상기 번역 데이터 저장부(210)로 질의할 수 있다. 동작(1106)에서, 번역 데이터 저장부(210)는 응답으로서, 상기 영어 채팅 메시지("Who u laughin at?")와 대응되는 불어 채팅 메시지를 번역 데이터 저장부(210)가 보유하고 않음을 나타내는 질의 실패(query failure)를 CTT 제어 모듈(202)로 회신할 수 있다. In operation 1104, the CTT control module 202 may query the French synonym chat message corresponding to the English chat message ("Who u laughin at?") To the translation data storage 210. In operation 1106, the translation data storage unit 210, in response, indicates that the translation data storage unit 210 does not have a French chat message corresponding to the English chat message (“Who u laughin at?”). The query failure may be returned to the CTT control module 202.

이 경우, CTT 제어 모듈(202)은 일부 실시예들에 따른 변환 처리를 진행하기 위해, 변환 모듈(208)로 상기 영어 채팅 메시지를 전달할 수 있다. 여기서 설명한 바와 같이, 상기 변환 모듈(208)은, 채팅 메시지를 추가적인 번역 처리에 더 적합한 메시지로 변환하도록 구성되는 다수의 변환 관련 모듈(1130)을 포함할 수 있다. 도 11에 도시된 바와 같이, 동작(1108) 동안에, CTT 제어 모듈(202)은 상기 영어 채팅 메시지를 병렬적으로, 변환 모듈(208)에 포함된 둘 이상의 변환 관련 모듈들(1130)로 전달할 수 있다. 게다가, 동작(1108) 동안에, CTT 제어 모듈(202)은 병렬적으로 상기 변환 관련 모듈들(1130)로부터 결과를 수신하고, 변환 결과에 기초하여 병렬로 번역 데이터 저장부(210)로 질의를 전달할 수 있다. In this case, the CTT control module 202 may deliver the English chat message to the conversion module 208 to perform conversion processing according to some embodiments. As described herein, the conversion module 208 may include a plurality of conversion related modules 1130 configured to convert chat messages into messages that are more suitable for further translation processing. 11, during operation 1108, the CTT control module 202 may deliver the English chat message in parallel to two or more conversion related modules 1130 included in the conversion module 208. have. In addition, during operation 1108, the CTT control module 202 receives results from the transformation related modules 1130 in parallel, and forwards queries to the translation data storage 210 in parallel based on the transformation results. You can.

이에 따라, 동작(1110a)에서 CTT 제어 모듈(202)은 변환 처리를 위해, 상기 영어 채팅 메시지("Who u laughin at?")를 채팅용어 모듈(302)로 전달할 수 있다. 병렬적으로, 동작(1110b)에서, CTT 제어 모듈(202)은 변환 처리를 위해 상기 영어 채팅 메시지("Who u laughin at?")를 맞춤법 검사 모듈(312)로 전달할 수 있다. 이어서, CTT 제어 모듈(202)은 동작(1112a)에서 제1 변환된 영어 채팅 메시지("Who you laughin at?")를 채팅용어 모듈(302)로부터 수신할 수 있고, 동시적으로 동작(1112b)에서 CTT 제어 모듈(202)은 맞춤법 검사 모듈(312)로부터 제2 변환 영어 채팅 메시지("Who u laughing at?")를 수신할 수 있다. 각각의 변환 처리 시간에 따라, 채팅용어 모듈(320), 맞춤법 검사 모듈(312) 및 다른 변환 관련 모듈(1130)은 서로에 대해, 직렬적 또는 병렬적으로 CTT 제어 모듈(202)로 응답할 수 있다.Accordingly, in operation 1110a, the CTT control module 202 may transmit the English chat message (“Who u laughin at?”) To the chat term module 302 for conversion processing. In parallel, in operation 1110b, the CTT control module 202 may deliver the English chat message ("Who u laughin at?") To the spell check module 312 for conversion processing. Subsequently, the CTT control module 202 may receive the first converted English chat message ("Who you laughin at?") From the chat term module 302 in operation 1112a, and simultaneously operate 1112b In CTT control module 202 may receive a second translation English chat message ("Who u laughing at?") From the spell check module 312. According to each conversion processing time, the chat term module 320, the spell check module 312, and other conversion related modules 1130 may respond to each other, serially or in parallel, with the CTT control module 202. have.

이어서, 동작(1114a)에서, CTT 제어 모듈(202)은 상기 제1 변환된 영어 채팅 메시지("Who you laughin at?")와 대응하는 불어 동의어 채팅 메시지를 번역 데이터 저장부(210)로 질의할 수 있다. 동작(1114b)에서, CTT 제어 모듈(202)은 상기 제2 변환된 영어 채팅 메시지("Who u laughing at?")와 대응하는 불어 동의 채팅 메시지를 번역 데이터 저장부(210)로 질의할 수 있다. 일부 실시예를 위해서, 동작(1114a 및 1114b) 동안에, CTT 제어 모듈(202)은 직렬적으로 또는 병렬적으로 번역 데이터 저장부(210)로 질의할 수 있다. 일부 실시예에서, 질의 타이밍은 변환 모듈(208)의 변환 관련 모듈들(1130) 각각이 응답을 회신한 시기에 의존될 수 있다. 도 11에 도시된 바와 같이, 번역 데이터 저장부(210)는 동작(1116a) 및 동작(1116b)에서 상기 질의에 대한 질의 실패(예컨대, <FAIL>)를 회신할 수 있다.Subsequently, in operation 1114a, the CTT control module 202 queries the translation data storage 210 for a French synonym chat message corresponding to the first translated English chat message ("Who you laughin at?"). Can be. In operation 1114b, the CTT control module 202 may query the translation data storage 210 for a French consent chat message corresponding to the second converted English chat message ("Who u laughing at?"). . For some embodiments, during operations 1114a and 1114b, the CTT control module 202 may query the translation data storage 210 serially or in parallel. In some embodiments, the query timing may depend on when each of the transformation related modules 1130 of the transformation module 208 returned a response. As shown in FIG. 11, the translation data storage unit 210 may return a query failure (eg, <FAIL>) for the query in operations 1116a and 1116b.

결국, CTT 제어 모듈(202)은 추가 처리를 위해, 병렬적 동작(1108)의 결과인 두 개 이상의 메시지에서 하나의 변환된 메시지를 선택할 수 있다. 변환 관련 모듈들(1130)의 어느 하나가 변환된 메시지를 생성한 경우, CTT 제어 모듈(202)은 추가 처리를 위해 그 특정 변환된 메시지를 선택할 수 있다. 여기서 언급한 바와 같이, CTT 제어 모듈(202)은 선택 우선순위에 기초하여, 변환된 메시지를 선택할 수 있고, 상기 선택 우선순위는 실시예들에 의해 선택된 변환/번역 전략에 따라 결정될 수 있다. 일부 실시예에서, 상기 선택의 우선순위는, 변환된 메시지가 가장 정식적인 내용(formal content)을 가지는지 여부, 변환된 메시지가 최상의 변환들을 갖는지 여부, 또는 변환된 메시지가 기계 번역에 적합한 변환된 메시지를 생성할 높은 가능성을 가지는 것으로 알려진 변환 관련 모듈로부터의 결과인지 여부에 기초할 수 있다. As a result, the CTT control module 202 can select one converted message from two or more messages resulting from the parallel operation 1108 for further processing. When any one of the conversion-related modules 1130 generates a converted message, the CTT control module 202 can select the specific converted message for further processing. As mentioned herein, the CTT control module 202 may select the converted message based on the selection priority, and the selection priority may be determined according to the conversion / translation strategy selected by the embodiments. In some embodiments, the priority of the selection is whether the converted message has the most formal content, whether the converted message has the best conversions, or whether the converted message is suitable for machine translation. It can be based on whether the result is from a conversion related module known to have a high probability of generating a message.

어느 한 변환된 메시지가 선택되면, 동작(1118)에서, CTT 제어 모듈(202)은 일부 실시예에 따른 기계 번역 처리를 위하여 상기 변환된 영어 채팅 메시지를 번역 모듈(116)로 전달할 수 있다. 예를 들어, 도 11에 도시된 바와 같이, CTT 제어 모듈(202)은 상기 번역 모듈(116)로 전달하기 위해, 채팅용어 모듈(302)에서 생성된 제1 변환된 영어 채팅 메시지("Who you laughin at?")를 선택할 수 있다. If any one converted message is selected, in operation 1118, the CTT control module 202 may deliver the converted English chat message to the translation module 116 for machine translation processing according to some embodiments. For example, as shown in FIG. 11, the CTT control module 202 generates a first translated English chat message ("Who you") generated by the chat term module 302 for delivery to the translation module 116. laughin at? ").

동작(1120)에서, 번역 모듈(116)은 (맞춤법이 틀린 단어 "laughin"을 포함하고 있음에도 불구하고) 상기 제1 변환된 영어 채팅 메시지와 대응되는 기계 번역된 불어 채팅 메시지("Qui te fait rire?")를 회신할 수 있다. 동작(1122)에서, CTT 제어 모듈(202)은 일부 실시예에 따라 기계 번역된 불어 채팅 메시지의 추가 변환 처리를 위해 상기 기계 번역된 불어 채팅 메시지("Qui te fait rire?")를 변환 모듈(208)로 전달할 수 있다.In operation 1120, the translation module 116 (even though it contains the misspelled word “laughin”) corresponds to the first translated English chat message and a machine translated French chat message (“Qui te fait rire”) ? "). In operation 1122, the CTT control module 202 converts the machine translated French chat message (“Qui te fait rire?”) For further translation processing of the machine translated French chat message in accordance with some embodiments. 208).

여기서 설명한 바와 같이, 상기 기계 번역된 텍스트는, 텍스트 번역을 더 정제하기 위해, 추가적인 변환 처리로 제공될 수 있다. 예를 들어, 원본 영어 채팅 메시지가 영어 채팅용어를 포함하는 경우, 추가 변형 처리는, 가능한 범위 내에서, 불어 채팅용어를 추가할 수 있다. 동작(1124)에서, 변환 모듈(208)은 (예를 들어, 변환 모듈(208)의 모듈이 상기 기계 번역된 불어 채팅 메시지에 어떠한 변경도 적용하지 않은 경우) 추가 처리를 위해, 변경되지 않은 기계 번역된 불어 채팅 메시지("Qui te fait rire?")를 CTT 제어 모듈(202)로 회신할 수 있다.As described herein, the machine-translated text can be provided as an additional conversion process to further refine the text translation. For example, if the original English chat message includes English chat terms, further modification processing may add French chat terms to the extent possible. In operation 1124, the translation module 208, for further processing (eg, if the module of the translation module 208 did not apply any changes to the machine translated French chat message), the unchanged machine The translated French chat message (“Qui te fait rire?”) May be returned to the CTT control module 202.

동작(1126)에서, CTT 제어 모듈(202)은 기계 번역된 불어 채팅 메시지("Qui te fait rire?")를 제2 채팅 클라이언트 시스템(104-2)로 전송하는 것을 돕는다(예를 들어, CTT 시스템(114)은, 전송하고자 하는 대응하는 불어 채팅 메시지를 채팅 호스트 시스템(112)으로 전달한다). 추가적으로, 동작(1128)에서, CTT 제어 모듈(202)은 원본 영어 채팅 메시지("Who u laughin at?")와 상기 번역된 불어 채팅 메시지("Qui te fait rire?") 간의 번역 매핑을 번역 데이터 저장부(210)에 저장할 수 있다. 여기서 설명한 바와 같이, 추가적인 동작들(도면에 도시되지 않음)에서, CTT 제어 모듈(202)은, 이전에 번역 데이터 저장부(210)로 실패한 질의 및 이 질의(예를 들어, 도 9의 동작(928 및 930)과 유사한) 이후에 결정된 대응 메시지에 기초하는, 동의어 번역 매핑도 번역 데이터 저장부(210)에 저장할 수 있다.In operation 1126, the CTT control module 202 helps to send a machine translated French chat message (“Qui te fait rire?”) To the second chat client system 104-2 (eg, CTT). The system 114 delivers the corresponding French chat message to be sent to the chat host system 112). Additionally, in operation 1128, the CTT control module 202 translates the translation mapping between the original English chat message ("Who u laughin at?") And the translated French chat message ("Qui te fait rire?"). It can be stored in the storage 210. As described herein, in additional operations (not shown in the figure), the CTT control module 202 may query previously failed with the translation data storage 210 and this query (e.g., the operation of FIG. 9 ( Similar to 928 and 930), synonym translation mapping based on the corresponding determined message afterwards may also be stored in the translation data storage 210.

일부 실시예에서, 변환 동작들은 하이브리드 직렬/병렬 방식으로 수행될 수 있고, 또는 일부 변환 동작들은 병렬적으로 수행되고 다른 변환 동작들은 직렬적으로 수행될 수 있다. 예를 들어, 도 11에 도시된 바와 같이, 영어 채팅 메시지("Who u laughin at?")는 동작(1110a) 및 동작(1110b)에서 병렬적으로, 채팅용어 모듈(302)과 맞춤법 검사 모듈(312)로 전달될 수 있다. 이어서, 결과로 나온 변환된 메시지들 중에서 어느 하나가 선택되면 (예를 들어, 선택 우선순위에 기초하여), 변환 모듈(208)의 다른 변환 관련 모듈들(1130)(예를 들어, 두문자어 모듈(304), 고유명사 모듈(306), 보통명사 모듈(308))은, 상기 선택된 변환 메시지를 병렬적으로 처리할 수 있다.In some embodiments, conversion operations may be performed in a hybrid serial / parallel fashion, or some conversion operations may be performed in parallel and other conversion operations may be performed in series. For example, as illustrated in FIG. 11, the English chat message (“Who u laughin at?”) Is performed in parallel in the operation 1110a and operation 1110b, the chat term module 302 and the spell check module ( 312). Subsequently, if any one of the resulting converted messages is selected (e.g., based on the selection priority), other conversion related modules 1130 of the conversion module 208 (e.g., acronym module ( 304), the proper noun module 306, and the common noun module 308 may process the selected conversion message in parallel.

도 12는 예시적인 디지털 장치(1200)의 블록도이다. 상기 디지털 장치(1200)는, 버스(bus, 1214)를 통해 통신 가능하게 연결되는, 프로세서(1202), 메모리 시스템(1204), 저장 시스템(1206), 통신 네트워크 인터페이스(1208), I/O 인터페이스(1210) 및 디스플레이 인터페이스(1212)를 포함한다. 상기 프로세서(1202)는 실행 가능한 명령어들(instructions)(예컨대, 프로그램들)을 실행하도록 구성된다. 일부 실시예에서, 상기 프로세서(1202)는 상기 실행 가능한 명령어들을 처리할 수 있는 임의의 프로세서 또는 회로를 포함한다. 12 is a block diagram of an exemplary digital device 1200. The digital device 1200 is communicatively connected via a bus 1214, a processor 1202, a memory system 1204, a storage system 1206, a communication network interface 1208, an I / O interface 1212 and a display interface 1212. The processor 1202 is configured to execute executable instructions (eg, programs). In some embodiments, the processor 1202 includes any processor or circuit capable of processing the executable instructions.

상기 메모리 시스템(1204)은 데이터를 저장하도록 구성되는 임의의 메모리이다. 상기 메모리 시스템(1204)의 예로는, RAM 또는 ROM과 같은 저장 기기이다. 상기 메모리 시스템(1204)은 램 캐시(ram cache)를 포함할 수 있다. 다양한 실시예에서, 데이터는 메모리 시스템(1204) 내에 저장된다. 메모리 시스템(1204) 내에 저장된 데이터는 삭제되거나 궁극적으로 상기 저장 시스템(1206)으로 전송될 수 있다.The memory system 1204 is any memory configured to store data. An example of the memory system 1204 is a storage device such as RAM or ROM. The memory system 1204 may include a ram cache. In various embodiments, data is stored in memory system 1204. Data stored in memory system 1204 may be deleted or ultimately transferred to storage system 1206.

상기 저장 시스템(1206)은 데이터를 검색하고 저장하도록 구성된 임의의 저장소이다. 상기 저장 시스템(1206)의 예로서, 플래시 드라이브(flash drive), 하드 드라이브(hard drive), 광 드라이브(optical drive) 및/또는 마그네틱 테이프(magnetic tape)이다. 일부 실시예에서, 상기 디지털 기기(1200)는 RAM으로 형성된 메모리 시스템(1204) 및 플래시 데이터로 형성되는 저장 시스템(1206)을 포함한다. 상기 메모리 시스템(1204) 및 저장 시스템(1206) 모두, 상기 프로세서(1202)를 포함하는 컴퓨터 프로세서에 의해 실행될 수 있는 명령어 또는 프로그램을 저장하는 컴퓨터 판독 가능 매체를 포함할 수 있다.The storage system 1206 is any storage configured to retrieve and store data. Examples of the storage system 1206 are flash drives, hard drives, optical drives and / or magnetic tapes. In some embodiments, the digital device 1200 includes a memory system 1204 formed of RAM and a storage system 1206 formed of flash data. Both the memory system 1204 and the storage system 1206 may include computer readable media storing instructions or programs executable by a computer processor including the processor 1202.

상기 통신 네트워크 인터페이스(1208)(com. network interface)는 링크(1216)를 경유하여 네트워크(예컨대, 상기 컴퓨터 네트워크(106))와 결합될 수 있다. 상기 통신 네트워크 인터페이스(1208)는 예를 들어 이더넷 접속, 시리얼 접속, 병렬 접속 또는 ATA 접속을 통한 통신을 지원할 수 있다. 또한, 상기 통신 네트워크 인터페이스(1208)는 무선 통신(예를 들어, 802.11 a/b/g/n, 와이맥스)를 지원할 수 있다. 상기 통신 네트워크 인터페이스(1208)가 많은 유선 또는 무선 규격을 지원할 수 있음은 당업자에게 명확할 것이다.The communication network interface 1208 (com. Network interface) may be coupled to a network (eg, the computer network 106) via a link 1216. The communication network interface 1208 may support communication through, for example, Ethernet connection, serial connection, parallel connection, or ATA connection. In addition, the communication network interface 1208 may support wireless communication (eg, 802.11 a / b / g / n, WiMAX). It will be apparent to those skilled in the art that the communication network interface 1208 can support many wired or wireless standards.

선택적인(optional) 입출력(I/O) 인터페이스(1210)는 사용자로부터 입력을 수신하고 데이터를 출력하는 임의의 기기이다. 상기 선택적인 디스플레이 인터페이스(1212)는 디스플레이에 그래픽과 데이터를 출력하도록 구성되는 임의의 기기이다. 일례로서, 상기 디스플레이 인터페이스(1212)는 그래픽 어댑터이다.The optional input / output (I / O) interface 1210 is any device that receives input from a user and outputs data. The optional display interface 1212 is any device configured to output graphics and data to the display. As an example, the display interface 1212 is a graphics adapter.

당업자는 상기 디지털 장치(1200)의 하드웨어 구성요소가 도 12에 도시된 것에 한정되지 않음을 이해할 것이다. 디지털 장치(1200)는 도시된 것보다, 더 많거나 더 적은 하드웨어 구성요소를 포함할 수 있다. 또한, 하드웨어 구성요소들은 여기에 설명된 다양한 실시예들의 범위 내에서 유지되고, 기능을 공유할 수 있다. 일례로서, 인코딩 및/또는 디코딩은 상기 프로세서(1202) 및/또는 CPU에 위치하는 보조 프로세서(예컨대, Nvidia®)에 의해 수행될 수 있다. Those skilled in the art will understand that the hardware components of the digital device 1200 are not limited to those shown in FIG. 12. The digital device 1200 may include more or fewer hardware components than shown. In addition, hardware components can be maintained and share functionality within the scope of the various embodiments described herein. As an example, encoding and / or decoding may be performed by the processor 1202 and / or a coprocessor located in the CPU (eg, Nvidia®).

상술한 기능들 및 구성요소들은, 컴퓨터 판독 매체와 같은 저장 매체에 저장된 명령어들로 구성될 수 있다. 상기 명령어들은 프로세서에 의해 검색되고 실행될 수 있다. 명령어들의 몇 가지 예로서, 소프트웨어, 프로그램 코드 및 펌웨어가 있다. 저장 매체의 몇 가지 예로서, 메모리 기기, 테이프, 디스크, 집적 회로 및 서버가 있다. 상기 명령어들은 일부 실시예에 따라 프로세서에 의해 실행될 때에 동작하여, 상기 프로세서로 해당 동작을 지시한다. 당업자들은 명령어들, 프로세서 및 저장 매체에 대해서 잘 알고 있다.The functions and components described above may be composed of instructions stored in a storage medium such as a computer readable medium. The instructions can be retrieved and executed by the processor. Some examples of instructions are software, program code and firmware. Some examples of storage media are memory devices, tapes, disks, integrated circuits, and servers. The instructions operate when executed by a processor, in accordance with some embodiments, to direct the operation to the processor. Those skilled in the art are familiar with instructions, processor and storage medium.

다양한 실시예가 예로서 여기에 설명된다. 본 발명의 벗어나지 않은 범위에서, 다양한 변형이 나타날 수 있고 다른 실시예들이 사용될 수 있음을 당업자에게 명백할 것이다. 예시적인 실시예에 따른 다양한 변형은 본 발명에 속한 것으로 이해되어야 한다.Various embodiments are described herein as examples. It will be apparent to those skilled in the art that various modifications can be made and other embodiments can be used without departing from the scope of the present invention. It should be understood that various modifications according to the exemplary embodiments belong to the present invention.

도 13은 다양한 실시예에 따른 예시적인 사용자 피드백 시스템을 나타낸 블록도이다. 설명을 목적으로, 도 13에 도시된 사용자 피드백을 위한 시스템은, 다양한 실시예에 따른 예시적인 통신 변환 및 번역(CCT) 시스템(1300)의 부분을 보여준다. 예를 들어, 예시적인 사용자 피드백 시스템은, MMO 게임과 함께 이용 가능한 게임 채팅 시스템(예를 들어, 인-게임(in-game) 채팅 시스템)을 포함하는 다양한 다중 언어 채팅 시스템에 채용되는 채팅 번역 시스템의 부분일 수 있다. 예시적인 사용자 피드백 시스템의 사용으로, MMO 게임의 다양한 플레이어들은, 피드백/승인된 피드백에 대한 인센티브인 인-게임 통화/신용거래/아이템에 대한 대가로, 표시된 단어 또는 구절에 대한 사용자 피드백을, 예를 들어 인-게임 채팅 시스템을 통해, 제공하기 위해 선택할 수 있다. 일부 실시예에 따라, 예시적인 사용자 피드백 시스템, 및 이와 관련된 구성요소의 일부 또는 전부는, 예시적인 통신 변환 및 번역 시스템(1300)으로부터 분리되거나 및/또는 독립될 수 있음은 당업자는 인정할 수 있다. 13 is a block diagram illustrating an exemplary user feedback system according to various embodiments. For purposes of explanation, the system for user feedback shown in FIG. 13 shows a portion of an exemplary communication conversion and translation (CCT) system 1300 according to various embodiments. For example, an exemplary user feedback system is a chat translation system employed in various multi-language chat systems, including a game chat system (eg, an in-game chat system) available with MMO games. It can be part of. With the use of an exemplary user feedback system, various players in an MMO game may receive user feedback for a displayed word or phrase, in exchange for an in-game currency / credit transaction / item that is an incentive for feedback / approved feedback. For example, through an in-game chat system, you can choose to provide. It will be appreciated by those skilled in the art that, in accordance with some embodiments, some or all of the exemplary user feedback system, and components related thereto, may be separated and / or independent from the exemplary communication translation and translation system 1300.

여기에 사용된 바와 같이, “피드백”은, 변환 또는 번역에 대한 피드백을 요청하는 질의에 가능한 대로 응답하는 “사용자 피드백” 또는 “사용자 피드백 응답”을 나타내는 것으로 이해되어야 한다. 또한 사용자 피드백은, 번역에 대한 사용자 피드백, 또는 변환에 대한 사용자 피드백을 포함할 수 있음을 이해해야 한다. 사용자 피드백은, 특정한 단어 또는 구절에 대한 정의를 포함할 수 있고, 그 정의는: (i) 그 특정 단어 또는 구절이, 제 1 언어로 된 그 특정 단어 또는 구절에서 (동일한) 제 1 언어로 된 대응하는 단어 또는 구절로 변환되는 것을 허용하고, (ii) 그 주어진 단어 또는 구절이, 제 1 언어로 된 그 특정 단어 또는 구절로부터 (다른) 제 2 언어로 된 대응하는 단어 또는 구절로 변환되도록 허용하며, 그렇게 함으로써 단어 또는 구절의 변환된 번역을 수행하고; 및/또는 (iii) 그 특정 단어 또는 구절이, 제 1 언어에서 제 2 언어로 번역되도록 허용하고, 그렇게 함으로써 단어 또는 구절의 변환되지 않은 번역을 수행한다. 여기에 설명하는 바와 같이, 사용자 피드백 응답은, 번역 실패와 관련하여 사용자 피드백을 요청하는 질의에 대한 응답으로 수신될 수 있다.As used herein, “feedback” should be understood to indicate “user feedback” or “user feedback response” that responds as much as possible to queries requesting feedback for translation or translation. It should also be understood that the user feedback may include user feedback for translation, or user feedback for translation. User feedback may include a definition for a particular word or phrase, which definition is: (i) the particular word or phrase in the first language (same) in that particular word or phrase in the first language. Allow conversion to the corresponding word or phrase, and (ii) allow the given word or phrase to be converted from that particular word or phrase in the first language to the corresponding word or phrase in the (other) second language Do so, thereby performing a translated translation of a word or phrase; And / or (iii) allow that particular word or phrase to be translated from a first language to a second language, thereby performing an untranslated translation of the word or phrase. As described herein, a user feedback response may be received in response to a query requesting user feedback in connection with a translation failure.

여기에 사용된 바와 같이, “번역의 실패” 또는 “번역 실패”는, 텍스트를 번역하는데 있어서의 실질적인 실패(예를 들어, 번역된 텍스트는, 원본 텍스트와 동일함), 또는 번역 실패를 잠재적으로 포함하는 것으로 표시된 텍스트 번역(예를 들어, 번역된 텍스트를 읽고 그 번역된 텍스트가 의심스럽거나 부정확한 번역을 포함하고 있다고 믿는 사용자에 의해 표시됨)을 포함하는 것으로 이해될 수 있다. As used herein, “failure to translate” or “failure to translate” is a substantial failure in translating the text (eg, translated text is the same as the original text), or potentially a translation failure. It can be understood to include text translations marked as including (eg, displayed by a user who reads translated text and believes that the translated text contains a suspicious or incorrect translation).

도 13에 도시된 바와 같이, CTT 시스템(1300)은, 통신 변환 및 번역(CTT) 제어 모듈(1302), 통신 변환 및 번역(CTT) 통신 모듈(1304), 언어 모듈(1306), 변환 모듈(1308), 번역 데이터 저장부(1310) 및 번역 API 모듈(1312)을 포함할 수 있다. 일부 실시예에서, CTT 제어 모듈(1302), CTT 통신 모듈(1304), 언어 모듈(1306), 변환 모듈(1308), 번역 데이터 저장부(1310) 및 번역 API 모듈(1312)은, 여기에 설명되는 바와 같이, CTT 시스템(114)의 구성요소와 유사할 수 있다.As shown in FIG. 13, the CTT system 1300 includes a communication conversion and translation (CTT) control module 1302, a communication conversion and translation (CTT) communication module 1304, a language module 1306, and a conversion module ( 1308), a translation data storage unit 1310 and a translation API module 1312. In some embodiments, CTT control module 1302, CTT communication module 1304, language module 1306, translation module 1308, translation data storage 1310 and translation API module 1312 are described herein. As may be, it may be similar to the components of the CTT system 114.

도 13에 도시된 바와 같이, CCT 시스템(1300)은 번역 실패 관리 모듈(1314), 사용자 피드백(UF) 질의 생성 모듈(1316), 사용자 피드백(UF) 질의 관중 선택 모듈(1318), 질의/응답 평가 모듈(1320), 질의 API 모듈(1322), 응답 평가 모듈(1324), 번역 데이터 저장부 업데이트 모듈(1326), 관중 능력 평가 모듈(1328) 및 인센티브 보상 모듈(1330)을 포함할 수 있다. As shown in FIG. 13, the CCT system 1300 includes a translation failure management module 1314, a user feedback (UF) query generation module 1316, a user feedback (UF) query crowd selection module 1318, and a query / response. It may include an evaluation module 1320, a query API module 1322, a response evaluation module 1324, a translation data storage update module 1326, an audience capability evaluation module 1328, and an incentive reward module 1330.

번역 실패 관리 모듈(1314)은, 번역 실패(예를 들어, 제 1 언어에서 제 2 언어로 텍스트를 번역하는데 실패)의 관리를 가능하게 하도록 구성될 수 있다. 일부 실시예에서, 번역 실패 관리 모듈(1314)은, 시스템(1300)이 채팅 세션 동안 제 1 언어를 사용하는 제 1 사용자로부터 제 2 언어를 사용하는 제 2 사용자로 전송된 채팅 메시지를 번역하는 시도를 할 때, CCT 시스템(1300)에 의해 경험될 수 있는 하나 또는 그 이상의 번역 실패를 처리하는데 적당할 수 있다. 여기에 설명되는 바와 같이, CCT 시스템(1300)에 의해 번역될 텍스트는, 채팅용어, 약어, 구어적 표현, 관용구 등을 포함할 수 있는 채팅 메시지일 수 있다. 채팅 메시지의 변환 또는 번역 동안, 채팅 메시지의 일부 또는 전부는, 채팅 메시지의 번역 실패를 야기하는 부분을 변환하고 및/또는 번역하는 CCT 시스템(1300)의 무능 때문에, 번역 실패로 귀결될 수 있음을 이해할 수 있다.The translation failure management module 1314 may be configured to enable management of translation failures (eg, failure to translate text from a first language to a second language). In some embodiments, the translation failure management module 1314 attempts to translate the chat message sent by the system 1300 from a first user who speaks a first language to a second user who speaks a second language during a chat session. , It may be suitable to handle one or more translation failures that may be experienced by the CCT system 1300. As described herein, text to be translated by the CCT system 1300 may be a chat message that may include chat terms, abbreviations, colloquial expressions, idioms, and the like. During the conversion or translation of a chat message, some or all of the chat message may result in a translation failure, due to the inability of the CCT system 1300 to transform and / or translate the portion that causes the chat message to fail to translate. I can understand.

번역 실패 관리 모듈(1314)에 의해 제공되는 특징은, 다음을 제한없이 포함할 수 있다: 텍스트 번역에 대한 실패가 발생할 때 자동으로 검출하는 것; 텍스트의 특정 단어 또는 구절이 번역되지 않을 때 자동으로 검출하는 것; 번역 과정 후에 실질적인 또는 잠재적인 번역 에러를 포함하는 “번역된” 텍스트의 일부 또는 모두를 사용자가 표시할 수 있게 하는 것; 어떤 번역 실패들이 다양한 관중 멤버들(예를 들어, 인-게임 채팅 시스템을 이용하는 플레이어들과 같은 채팅 멤버)로부터의 사용자 피드백을 위해 선택되는지를 관리하는 것; 번역 실패와 연관된 어떤 단어 또는 구절이 사용자 피드백을 위해 선택되는지를 관리하는 것(예를 들어, 단어가, 번역될 텍스트에서 얼마나 자주 나오는지/사용되는지에 기초하여); 및 번역 실패가 더 이상 사용자 피드백을 위한 선택 후보가 아닐 때를 관리하는 것(예를 들어, 번역 실패와 연관된 단어 또는 구절이 선택 풀에서 제거된다).Features provided by the translation failure management module 1314 may include, without limitation: automatically detecting when a failure to translate the text occurs; Automatically detecting certain words or phrases in the text when they are not translated; Allowing the user to display some or all of the “translated” text containing substantial or potential translation errors after the translation process; Managing which translation failures are selected for user feedback from various audience members (eg, chat members such as players using an in-game chat system); Managing which words or phrases associated with translation failures are selected for user feedback (eg, based on how often / when words are used / in the text to be translated); And managing when translation failure is no longer a selection candidate for user feedback (eg, words or phrases associated with translation failure are removed from the selection pool).

다양한 실시예에서, 번역 실패 관리 모듈(1314)은, 관중 멤버들의 사용자 피드백을 위한 단어 또는 구절의 종합적인 이용 가능성(availability)을 관리할 수 있다. 여기에 사용되는 바와 같이, “사용자 피드백 기회”는, 관중 멤버의 사용자 피드백을 위해 이용 가능한 단어 또는 구절을 포함할 수 있다. 번역 실패 관리 모듈(1314)은, 사용자 피드백을 위한 컴퓨팅 자원의 가용성, 사용자 피드백과 연관되어 보상된 인센티브, 과거에 수신된 피드백 응답의 품질 등과 같은 다양한 조건에 기초하여 하나 또는 그 이상의 사용자 피드백 기회의 이용 가능성을 저지하거나, 연기할 수 있다. 예를 들어, 승인된 사용자 피드백 응답에 대해 인센티브(예를 들어, 인-게임 통화 또는 인-게임 아이템)를 계속 보상하는 것이 인센티브와 관련된 경제(예를 들어, 인-게임 경제)에 부정적인 영향을 줄 수 있는지가 결정될 수 있다. 이에 따라, 일부 실시예는, 현재 인센티브 전략이 관련 경제(예를 들어, 인-게임 경제)에 유해하다는 것을 검출할 때, 인센티브를 보상하는 사용자 피드백 기회의 이용 가능성을 (일시적으로) 연기할 수 있고, 또는 사용자 피드백 기회에 대해 보상된 인센티브를 조정할 수 있다. In various embodiments, the translation failure management module 1314 may manage the overall availability of words or phrases for user feedback of audience members. As used herein, “user feedback opportunity” may include words or phrases available for user feedback of spectator members. The translation failure management module 1314 may generate one or more user feedback opportunities based on various conditions such as availability of computing resources for user feedback, incentives compensated in connection with user feedback, and quality of feedback responses received in the past. You can defer or postpone availability. For example, continuing to reward incentives (eg, in-game currency or in-game items) for an approved user feedback response will have a negative impact on the economy associated with incentives (eg, in-game economy). You can decide if you can give. Accordingly, some embodiments may (temporarily) postpone the availability of user feedback opportunities to reward incentives when detecting that current incentive strategies are detrimental to the relevant economy (eg, in-game economy). Or adjust the rewarded incentives for user feedback opportunities.

여기에 설명된 바와 같이, 사용자 피드백이 요청된 관중 멤버는, 그러한 피드백을 제공하는 것을 자원한 사람으로, 사용자 피드백 프로그램에서의 참가자 같은 사람이거나, 및/또는 그들의 제출된 피드백이 승인되었을 때(예를 들어, 정정되었을 때, 받아들여졌을 때, 또는 변환 또는 번역을 정의하는데 유용한)의 인센티브에 대한 대가로 그러한 피드백을 제공하는 것을 자원한 사람일 수 있다. 일부 실시예에서, 여기에 설명된 사용자 피드백 시스템은, 온라인 MMO 게임과 같은 게임 시스템의 부분일 수 있고, 그 게임에서 관중 멤버는, 게임 시스템에서 유용한 인센티브(예를 들어, 인-게임 통화 또는 인-게임 아이템)를 위해, 사용자 피드백 기회에 참여하는 것을 선택한 게임 플레이어들이다. 이러한 방법으로, 사용자 피드백 시스템은, 게임 플레잉 관중 멤버가 보상을 위해 "플레이되는" "게임"으로 여기는 게임 특징으로서 제공될 수 있고, 이로써 게임 환경에 고유한 경쟁을 확대할 수 있다. 관중 멤버들의 선택에 관한 더 많은 것은, UF 질의 관중 선택 모듈(1318)에 관해 설명된다. As described herein, an audience member for whom user feedback is requested is a person who volunteers to provide such feedback, is a person like a participant in the user feedback program, and / or when their submitted feedback has been approved (eg For example, it may be a person who volunteers to provide such feedback when corrected, accepted, or in exchange for an incentive (useful for defining a translation or translation). In some embodiments, the user feedback system described herein may be part of a gaming system, such as an online MMO game, in which the spectator members may find useful incentives (eg, in-game currency or in-game) in the gaming system. For game items), game players have chosen to participate in user feedback opportunities. In this way, the user feedback system can be provided as a game feature that the game-playing spectator member regards as a "game" "played" for rewards, thereby expanding competition inherent in the game environment. More on spectator member selection is described with respect to UF query spectator selection module 1318.

여기에 설명된 바와 같이, 단어 또는 구절은, 관심 단어 또는 구절이 번역을 방해하거나 통신된 텍스트의 부정확한 번역을 야기하는 것으로 믿는 통신 시스템 사용자에 의해 표시될 수 있다. 예를 들어, 온라인 게임과 연관된 다중 언어 다중 사용자 채팅 시스템에서, 게임 플레이어는, 번역 문제 또는 실패를 잠재적으로 갖는, 그들이 수신하였던 전체 채팅 메시지를 표시하거나, 번역 채팅 메시지의 특정 부분들(예를 들어, 단어 또는 구절)을 표시할 수 있다. 번역 실패와 연관된 단어들 또는 구절들은, 예를 들어, 전문화된/도메인 관련 특수용어, 약어들, 두문자어, 고유명사, 보통명사, 지소어, 구어체 단어 또는 구절, 및 비속한 단어 또는 구절을 포함할 수 있다. 추가적으로, 단어 또는 구절은, 단어 또는 구절이 번역될 수 없고 더 큰 구절 또는 문장의 번역을 방해할 수 있다는 것을 자동으로 검출하는 시스템 또는 방법에 의해 표시될 수 있다. As described herein, a word or phrase may be indicated by a communication system user who believes that the word or phrase of interest may interfere with translation or cause an incorrect translation of the communicated text. For example, in a multi-language multi-user chat system associated with an online game, game players display the entire chat message they have received, potentially having translation problems or failures, or certain parts of the translation chat message (eg , Words or phrases). Words or phrases associated with translation failures include, for example, specialized / domain related terminology, abbreviations, acronyms, proper nouns, common nouns, idioms, colloquial words or phrases, and profanity words or phrases You can. Additionally, a word or phrase can be displayed by a system or method that automatically detects that a word or phrase cannot be translated and can interfere with the translation of a larger phrase or sentence.

번역 실패 관리 모듈(1314)은, 사용자 피드백의 주제가 되는 단어 또는 구절을 선택할 책임을 질 수 있고, 여기서 선택된 단어 또는 구절은 제 1 언어에서 제 2 언어로 텍스트를 번역하는데 실제적인 또는 잠재적인 실패와 연관될 때 표시될 수 있다. 번역 실패 관리 모듈(1314)은, 다수의 팩터들에 기초하여 단어 또는 구절을 선택할 수 있고, 그 팩터들의 일부는, 번역에서 단어 또는 구절의 현재 중요성(예를 들어, 단어 또는 구절의 전반적인 사용법에 기초한 중요성), 단어 또는 구절의 복잡성(예를 들어, 단어 또는 구절의 어려움, 또는 단어 또는 구절이 쟁점이었던 기간), 사용자 피드백을 제공하기로 선택되거나/자원한 사용자의 능력(competency)(예를 들어, 사용자가 제 2 언어에 대한 언어 능력을 가짐), 및 사용자 피드백을 제공하기로 선택되거나/자원한 사용자의 선호도(preference)를 포함한다. 사용자 피드백을 위한 단어 또는 구절을 선택하는 다른 팩터들은, 다양한 실시예에 의해 활용될 수 있음은 당업자에게 인정될 수 있다. The translation failure management module 1314 may be responsible for selecting words or phrases that are the subject of user feedback, where the selected words or phrases may cause actual or potential failures to translate text from a first language to a second language. Can be displayed when associated. The translation failure management module 1314 can select a word or phrase based on a number of factors, some of which are based on the current importance of the word or phrase in translation (eg, the overall usage of the word or phrase) Based importance), the complexity of the word or phrase (e.g., the difficulty of the word or phrase, or the period during which the word or phrase was an issue), and / or the ability of the user to choose / volunteer to provide user feedback (e.g. For example, the user has a language ability for the second language), and / or a preference of the user who is / are selected to provide user feedback. It will be appreciated by those skilled in the art that other factors for selecting words or phrases for user feedback can be utilized by various embodiments.

다양한 실시예에서, 사용자 피드백을 제공하기로 선택된 사용자들은, 그러한 피드백을 제공하기로 자원한 사람일 수 있다. 예를 들어, 사용자는, 연관된 사용자 선호도를 설정하거나, 사용자 피드백을 요청하는 인-채팅 시스템(in-chat system) 광고를 선택하거나, 및/또는 선택이 가능한 하나 또는 그 이상의 사용자 피드백 기회를 나열하는 섹션을 통해 브라우징을 함으로써, 사용자 피드백을 제공하는 것을 선택할 수 있다. In various embodiments, users selected to provide user feedback may be people who volunteer to provide such feedback. For example, a user may set an associated user preference, select an in-chat system advertisement requesting user feedback, and / or list one or more user feedback opportunities available for selection. By browsing through sections, you can choose to provide user feedback.

여기에 설명된 바와 같이, 번역 실패 관리 모듈(1314)은, 실제적인 또는 잠재적인 번역 실패와 연관된 특정 단어 또는 구문이 더 이상 관중 멤버들의 사용자 피드백을 위한 선택 후보가 아닌 때를 관리할 수 있다. 다양한 조건이, 번역 실패 관리 모듈(1314)이 그러한 결정을 하도록 이끌 수 있다. 그 결정은, 예를 들어, 특정한 수의 피드백 응답이, 그 연관된 번역 실패와 관련하여 또는, 특정한 단어 또는 구절과 관련하여 수신될 때; 특정한 수의 일관된 피드백 응답이, 그 연관된 번역 실패와 관련하여 또는, 특정한 단어 또는 구절과 관련하여 수신될 때; 및 특정 피드백 응답이, 탐색되었던 사용자 피드백에 대해 유효한 것으로 승인되었을 때(예를 들어, 고유 피드백 응답의 세트로부터, 특정한 응답이, 정확한 응답으로 관리자에 의해 수동으로 승인될 때)를 포함한다. As described herein, the translation failure management module 1314 may manage when a particular word or phrase associated with actual or potential translation failure is no longer a selection candidate for user feedback from spectator members. Various conditions can lead the translation failure management module 1314 to make such a decision. The decision may be made, for example, when a specific number of feedback responses are received in connection with the associated translation failure or in connection with a particular word or phrase; When a certain number of consistent feedback responses are received in connection with the associated translation failure or in connection with a particular word or phrase; And when a specific feedback response has been approved as valid for the user feedback that has been searched (eg, from a set of unique feedback responses, a specific response is manually approved by the administrator as an accurate response).

UF 질의 생성 모듈(1316)은, 사용자 피드백을 위해 선택된 특정 단어 또는 구절에 대한 사용자 피드백을, 선택된 관중 멤버로부터 얻기 위한 질의를 생성하도록 구성될 수 있다. 여기에 설명된 바와 같이, 선택된 관중 멤버는, 제출된 사용자 피드백이, 정확한 것으로, 수락된 피드백 응답인 것으로, 또는 논쟁 중인 번역을 정제하기에 유용한 것으로 승인되었을 때의 인센티브에 대한 대가로서, 사용자 피드백을 제공하기로 자원한 사용자일 수 있다. 관중 멤버들의 선택에 관한 더 많은 것은, UF 질의 관중 선택 모듈(1318)과 관련하여 설명한다. The UF query generation module 1316 may be configured to generate a query for obtaining user feedback for a specific word or phrase selected for user feedback from a selected audience member. As described herein, selected spectator members, user feedback, in return for incentives when submitted user feedback has been approved as being accurate, accepted feedback response, or useful for purifying controversial translations It may be a user who volunteers to provide. More about spectator member selection is described in relation to UF query spectator selection module 1318.

일부 실시예에서, 생성된 질의는, 선택된 관중 멤버로부터 질의 응답을 수신하기 위한 하나 또는 그 이상의 방법을 포함한다. 예를 들어, 그 생성된 질의는, 응답 방법으로서, 미리 정의된 응답 리스트를 포함하고, 관중 멤버는 그 응답 리스트로부터 그 생성된 질의(또한 여기에서 “선택-폼 응답”으로서 참조된)에 대한 그들의 응답으로서 선택할 수 있다. 다른 예에서, 그 생성된 질의는, 응답 방법으로서, 그 생성된 질의에 대한 그들의 응답으로서 수신하도록 구성된 하나 또는 그 이상의 필드를 포함하고, 관중 멤버에 의해 필드로 텍스트 값이 입력된다(또한 여기에서 “프리-폼 응답”으로서 참조된다). 다른 응답 방법은, 그래픽 사용자 인터페이스(GUI), 텍스트 값, 또는 그것들의 일부 조합을 포함할 수 있다. In some embodiments, the generated query includes one or more methods for receiving query responses from selected audience members. For example, the generated query, as a response method, includes a predefined list of responses, and the audience member from the list of responses to the generated query (also referred to herein as “select-form response”). You can choose as their response. In another example, the generated query includes, as a response method, one or more fields configured to receive as their responses to the generated query, and text values are entered into the fields by an audience member (also here. Referred to as "pre-form response"). Other response methods may include a graphical user interface (GUI), text values, or some combination thereof.

그 생성된 질의에 포함되는 하나 또는 그 이상의 응답은, 예를 들어 다음을 포함하는 다수의 팩터에 따를 수 있다: 관중 멤버의 선호도; 사용자 피드백이 요청되는 단어 또는 구절의 중요성 및/또는 복잡성; 사용자 피드백이 요청되는 단어 또는 구절에 대해 이제까지 수신된 사용자 피드백 응답의 수; 사용자 피드백이 요청되는 단어 또는 구절에 대해 이제까지 수신된 일관된 피드백 응답의 수; 및 선택-폼 응답을 생성하기 위한 충분한 프리-폼 응답이 있는지 여부. 일부 실시예에서, 그 생성된 질의에 대해 응답하는 관중 멤버는, 그 생성된 질의에 대한 두 개 또는 그 이상의 응답 방법 중에서 선택할 수 있다. The one or more responses included in the generated query may be subject to a number of factors including, for example: audience member preferences; The importance and / or complexity of the word or phrase for which user feedback is requested; The number of user feedback responses received so far for the word or phrase for which user feedback is requested; The number of consistent feedback responses received so far for the word or phrase for which user feedback is requested; And whether there is enough pre-form response to generate a select-form response. In some embodiments, the spectator member responding to the generated query can select between two or more response methods to the generated query.

추가적으로, 사용자 피드백 기회는, 관중 멤버의 언어에 따라 제공될 수 있고, 예를 들어 다음을 포함하는 다수의 팩터에 따를 수 있다: 관중 멤버의 채팅 히스토리를 모니터링함으로써 관중 멤버가 다중 언어자인지 여부; 관중 멤버의 언어 능력이 특정 언어 신뢰 수준을 만족하거나 넘는지 여부; 및 관중 멤버가 사용자 피드백 기회에 참여하기 위해 사용하는 것으로 사용자 디바이스에 설정한 언어(예를 들어, 관중 멤버의 디바이스에 설정된 디폴트 언어). 예를 들어, 관중 멤버의 디바이스의 디폴트 디바이스 언어는 독일어이고, 그 관중 멤버가 사용자 피드백 기회에 참여하는 것을 선택하면, 생성되어 관중 멤버에 전송된 질의는, 독일어에 기초할 수 있다(예를 들어, 영어 단어 또는 구절을 독일어 단어 또는 구절로 정의하기 위한 질의 또는 그 반대의 질의). 일부 실시예에서, 그 생성된 질의는, 제출될 응답의 언어를 선택하는 옵션(예를 들어, 그 생성된 질의가 특정 언어를 특정하거나 요구하지 않을 때) 및/또는 두 개 이상의 응답을 제공하되 각 응답은 서로 다른 언어로 된 옵션을, 관중 멤버에게 제공할 수 있다. 일부 그러한 실시예에서, 관중 멤버는, 여기에 설명된, 관중 멤버가 다중 언어자인지 여부 그리고 관중 멤버의 사용자 디바이스에 설정된 언어와 같은 다양한 언어 팩터에 따른 언어 응답 옵션이 제공될 수 있다. Additionally, user feedback opportunities may be provided depending on the language of the audience member, and may be subject to a number of factors including, for example: whether the audience member is multilingual by monitoring the audience member's chat history; Whether the audience member's language ability meets or exceeds a certain level of language confidence; And the language set on the user device by the audience member to use to participate in the user feedback opportunity (eg, the default language set on the audience member's device). For example, the default device language of a spectator member's device is German, and if the spectator member chooses to participate in a user feedback opportunity, the query generated and sent to the spectator member may be based on German (eg , Query to define an English word or phrase as a German word or phrase, or vice versa). In some embodiments, the generated query provides an option to select the language of the response to be submitted (eg, when the generated query does not specify or require a specific language) and / or two or more responses. Each response can offer different language options to audience members. In some such embodiments, the audience member may be provided with language response options according to various language factors, such as whether the audience member is multilingual and the language set on the user device of the audience member, as described herein.

UF 질의 관중 선택 모듈(1318)은, 사용자 피드백을 요청할 관중 멤버를 선택하도록 구성될 수 있다. 일부 실시예에서, UF 질의 관중 선택 모듈(1318)은, 인센티브(사용자 피드백이 승인되거나/받아들여질 때 수신될 수 있는)의 대가로, 사용자 피드백을 제공하는 것을 자원한 관중 멤버의 풀에서 관중 멤버를 선택할 수 있다. 여기에 설명되는 바와 같이, 관중 멤버는, 번역에 참여하는 자원자일 수 있고, 그 번역 시간에, 그 관중 멤버는, UF 질의 관중 선택 모듈(1318)이 사용자 피드백 요청을 위해 선택하는 관중 멤버들의 풀에 포함될 수 있다. 일부 실시예에서, UF 질의 관중 선택 모듈(1318)이 관중 멤버를 선택할 때, 관중 멤버에게, 사용자 피드백을 획득하기 위해 UF 질의 생성 모듈(1316)에 의해 생성된 질의가 제공될 수 있다. UF 질의 생성 모듈(1316)에 의해 생성된 질의는, UF 질의 관중 선택 모듈(1318)에 의해 선택된 관중 멤버에게, 그 선택된 관중 멤버에 의해 참가가 가능한 사용자 피드백의 리스트의 부분으로서 제공될 수 있다. 일단 그 생성된 질의가 그 선택된 관중 멤버에게 제공되면, 그 선택된 관중 멤버는, 사용자 피드백 세션을 개시하고, 그 사용자 피드백 세션에서 상기 생성된 질의가 관중 멤버에게 제공되며(예를 들어, 그 생성된 질의와 연관된 하나 또는 그 이상의 응답 방법을 가지고), 그리고 그 선택된 관중 멤버는 하나 또는 그 이상의 응답을 제공할 수 있다(예를 들어, 그 생성된 질의에 포함된 하나 또는 그 이상의 응답 방법을 통해). The UF query audience selection module 1318 may be configured to select an audience member to request user feedback. In some embodiments, UF query spectator selection module 1318 is a spectator member in a pool of spectator members who volunteer to provide user feedback in exchange for incentives (which may be received when user feedback is approved / accepted). You can choose As described herein, a spectator member can be a volunteer who participates in a translation, and at that time, the spectator member is a pool of spectator members that the UF query audience selection module 1318 selects for user feedback requests. Can be included in In some embodiments, when the UF query spectator selection module 1318 selects a spectator member, a query generated by the UF query generation module 1316 may be provided to the spectator member to obtain user feedback. The query generated by the UF query generation module 1316 may be provided to an audience member selected by the UF query audience selection module 1318 as part of a list of user feedbacks that can be participated by the selected audience member. Once the generated query is provided to the selected audience member, the selected audience member initiates a user feedback session, and the generated query is provided to the audience member in the user feedback session (e.g., the generated audience member) Have one or more response methods associated with the query), and the selected audience member can provide one or more responses (eg, via one or more response methods included in the generated query) .

실시예에 따르면, 관중 멤버가 선택되기 전에 사용자 피드백을 위한 단어 또는 구절이 선택될 때, 관중 멤버가 그 선택된 단어 또는 구절에 기초하여(예를 들어, 그 선택된 단어 또는 구절이, 관중 멤버의 능력 또는 선호도에 매칭되는지 여부) 후보 관중 멤버의 집합에서 선택될 수 있다. 대안으로, 사용자 피드백을 위한 단어 또는 구절이 선택되기 전에 관중 멤버가 선택되면, 그 선택되는 단어 또는 구절은, 선택된 관중 멤버에 기초할 수 있다. 특정 관중 멤버의 능력은, 그 특정 관중 멤버에 의해 이전에 제공된(예를 들어, 이전에 생성되어 그 특정 관중 멤버로 제공된 질의와 관련하여) 하나 또는 그 이상의 피드백 응답에 기초하여 결정될 수 있고, 및/또는 관중 멤버의 하나 또는 그 이상의 언어 능력에 기초하여 결정될 수 있다. 관중 멤버는, 예를 들어, 그 관중 멤버가 특정한 수의 승인된 피드백 응답을 달성하였을 때, 번역 실패와 연관된 하나 또는 그 이상의 단어 또는 구절에 대한 사용자 피드백을 제공하는데 필요한 능력을 가졌는지 평가될 수 있다. 다른 예에서, 관중 멤버는, 적어도 이전에 제출된 특정한 언어에 대한 피드백 응답, 그 관중 멤버의 선호도, 및/또는 그 특정한 언어에서의 그들의 능력을 나타내는 관중 멤버와 관련된 정보에 기초하여 그 특정한 언어에서 능력을 가졌는지 평가될 수 있다. 또 다른 예에서, 이전에 제출된 피드백 응답이 부정확하고 또는 속이는 것(예를 들어, 터무니없거나 허구의 응답)으로 평가된 관중 멤버는, 낮은 능력을 가진 것으로 결정될 수 있다. According to an embodiment, when a word or phrase for user feedback is selected before an audience member is selected, the audience member is based on the selected word or phrase (eg, the selected word or phrase is the ability of the audience member) Or whether it matches the preference). Alternatively, if an audience member is selected before a word or phrase for user feedback is selected, the selected word or phrase may be based on the selected audience member. The ability of a particular spectator member may be determined based on one or more feedback responses previously provided by the particular spectator member (eg, with respect to a query previously created and provided to that particular spectator member), and And / or based on one or more language skills of the audience member. Audience members can be evaluated, for example, to see if they have the ability to provide user feedback on one or more words or phrases associated with a translation failure when they achieve a certain number of approved feedback responses. . In another example, a spectator member is in a particular language based on at least a feedback response to a specific language submitted previously, the spectator member's preferences, and / or information related to the spectator member in their particular language. You can be evaluated for your ability. In another example, a spectator member rated with a previously submitted feedback response being incorrect or deceiving (eg, an absurd or fictional response) may be determined to have a low ability.

일부 실시예에서, 특정 관중 멤버가 단어 또는 구절과 관련한 응답을 제공하면(및 응답은 가능한 승인된다), 그 특정 관중 멤버는, 동일한 단어 또는 구절에 대한 사용자 피드백을 제공하는데 다시 선택되지 않을 수 있다. 특정 실시예에서, 그 특정 관중이 이전에 동일한 단어 또는 구절에 대해 제출된 응답의 언어와 다른 언어로 응답을 제공할 때, 이러한 제한의 예외를 줄 수 있다.In some embodiments, if a particular audience member provides a response related to a word or phrase (and the response is approved as possible), that particular audience member may not be selected again to provide user feedback for the same word or phrase . In certain embodiments, an exception to this limitation may be given when the particular spectator provides a response in a language different from that of a response previously submitted for the same word or phrase.

사용자 피드백에 대한 보상으로 인센티브가 제공되면(예를 들어, 피드백 응답이 승인되자마자), 다양한 실시예는 시간 기반 쿼터(quota)(예를 들면, 피드백 응답의 제출에 대한 매시간당, 하루당, 월당, 년당 한도) 또는 인센티브-소득 쿼터(예를 들어, 인센티브 기반 사용자 피드백에 대해 주어진 인-게임 통화 또는 아이템의 제한)와 같은, 쿼터에 기초하여 관중 멤버를 선택할 수 있다. 이때 쿼터는 개인 관중 멤버, 관중 멤버 그룹, 또는 이들의 일부 조합에 연관된다. 일부 실시예에서, UF 질의 관중 선택 모듈(1318)은, 사용자 피드백에 대한 컴퓨팅 자원의 가용성, 사용자 피드백과 관련하여 보상된 인센티브, 과거에 수신된 피드백 응답의 품질 등과 같은 다양한 조건에 기초하여 하나 또는 그 이상의 관중 멤버에 대해 사용자 피드백 기회의 이용 가능성을 중단하거나 연기할 수 있다. When an incentive is provided as a reward for user feedback (e.g. as soon as the feedback response is approved), various embodiments are based on time-based quotas (e.g. hourly, daily, monthly, for submitting feedback responses) Audience members may be selected based on quotas, such as limits per year) or incentive-income quotas (eg, limits on in-game currency or items given for incentive-based user feedback). The quarter is then associated with an individual spectator member, spectator member group, or some combination thereof. In some embodiments, the UF query spectator selection module 1318 is based on various conditions such as availability of computing resources for user feedback, rewarded incentives in relation to user feedback, quality of feedback responses received in the past, or the like. The availability of user feedback opportunities may be suspended or postponed for further audience members.

질의/응답(QR) 평가 모듈(1320)은, 질의 또는 질의 응답이 사용자 피드백을 요청하는 단어 또는 구절에 기초하여 질의 또는 질의 응답의 가치 또는 중요성을 결정할 수 있도록 구성된다. 질의 또는 질의 응답의 가치를 결정할 때 다양한 실시예에 의해 고려되는 팩터들의 예는, 단어 또는 구절의 복잡성(예를 들어, 복잡성이 높을수록 가치는 더 높다), 변환/번역 프로세스에서의 단어 또는 구절의 중요성(예를 들어, 중요성이 높을수록 가치는 더 높다), 질의 또는 질의 응답에 의해 채용된 응답 방법(예를 들어, 선택-폼 응답 방법보다 프리-폼 응답 방법의 가치가 더 높다), 단어 또는 구절의 타입(예를 들어, 채팅용어, 약어 또는 구어체), 또는 하나 또는 그 이상의 관여된 언어(예를 들어, 영어 단어 또는 구절을 포함하고 불어 응답을 수신하는 질의는, 영어 단어 또는 구절을 포함하고 영어 응답을 수신하는 질의보다 더 가치가 높다)를 포함할 수 있다. 일부 실시예에서, 특정 사용자 피드백에 대해 보상된 인센티브는, 질의 또는 질의 응답에 연관된 가치에 기초할 수 있다. 일부 실시예에서, QR 평가 모듈(1320)은, 유익하고, 유용하며, 또는 정확한 사용자 피드백을 간청하는데 있어서의 이전 질의들 또는 질의 응답들의 효능(efficacy)에 기초하여 질의 또는 질의 응답에 대한 가치를 결정하도록 구성될 수 있다. 이에 따라, 일부 실시예에서, 가치는, 이전 질의들 또는 질의 응답들에 의해 달성되는 최근의 효능에 기초하여 동적으로 조정될 수 있다. 특정 실시예에서, QR 평가 모듈(1320)은, 관리자(예, CCT 시스템(1300)의), 또는 다른 권한이 있는 사용자가 수동으로 하나 또는 그 이상의 사용자 피드백에 연관된 질의 및/또는 질의 응답들에 대한 가치를 할당하거나 조정할 수 있도록 있다.The query / response (QR) evaluation module 1320 is configured such that the query or query response can determine the value or importance of the query or query response based on the word or phrase requesting user feedback. Examples of factors that are considered by various embodiments when determining the value of a query or question answer include word or phrase complexity (eg, the higher the complexity, the higher the value), the word or phrase in the translation / translation process. The importance of (for example, the higher the value, the higher the value), the response method employed by the query or query response (eg, the value of the pre-form response method is higher than the select-form response method), Types of words or phrases (eg, chat terms, abbreviations or colloquial), or queries involving one or more involved languages (eg, English words or phrases and receiving a French response), the English words or phrases And higher value than a query to receive an English response). In some embodiments, incentives rewarded for specific user feedback may be based on a value associated with a query or query response. In some embodiments, QR evaluation module 1320 values the value of a query or query response based on the efficacy of previous queries or query responses in soliciting beneficial, useful, or accurate user feedback. It can be configured to determine. Accordingly, in some embodiments, the value can be dynamically adjusted based on recent efficacy achieved by previous queries or query responses. In certain embodiments, the QR evaluation module 1320 may be configured by an administrator (eg, of the CCT system 1300), or other authorized user to manually respond to queries and / or query responses associated with one or more user feedback. It is possible to allocate or adjust the value of

질의 API 모듈(1322)는, 선택된 관중 멤버들에게 사용자 피드백 질의들을 전송하고 및/또는 선택된 관중 멤버로부터 질의 응답을 수신하는 것을 촉진하도록 구성될 수 있다. 일부 실시예에서, 질의 API 모듈(1322)은, 그 선택된 관중 멤버에게 가용한 사용자 피드백 기회들의 리스트를 제공할 수 있고, 그 선택된 관중 멤버에 의해 선택된 하나 또는 그 이상의 사용자 피드백 기회에 관련된 질의들을 제공하며, 그 선택된 관중 멤버에 의해 선택된 하나 또는 그 이상의 사용자 피드백 기회에 대한 응답을 수신하고, 및/또는 이제까지 승인을 위해 제출된 질의 응답의 현재 상태(예, 승인된, 거절된 또는 계류(pending)/대기(awaiting) 중인 승인의 상태)를 그 선택된 관중 멤버에게 제공할 수 있다. 일부 실시예에서, 질의 API 모듈(1322)은, CCT 시스템(1300)과 상호 작용하는 그 특정 관중 멤버에 의해 사용되는 채팅 클라이언트 시스템으로부터, 그 특정 관중 멤버와 연관된 하나 이상의 선호도를 획득할 수 있다. 여기에 설명하는 바와 같이, 관중 멤버에 연관된 선호도는, 관중 멤버에게 사용자 피드백이 요청되는 단어 또는 구절의 선택을 결정할 수 있고, 및/또는 관중 멤버가 특정 단어 또는 구절의 사용자 피드백에 대한 질의를 수신하도록 선택되었는지 여부를 결정할 수 있다. The query API module 1322 may be configured to facilitate sending user feedback queries to selected audience members and / or receiving query responses from the selected audience members. In some embodiments, query API module 1322 may provide a list of user feedback opportunities available to the selected audience member, and provide queries related to one or more user feedback opportunities selected by the selected audience member. And receive a response to one or more user feedback opportunities selected by the selected audience member, and / or the current status of the query response ever submitted for approval (eg, approved, rejected or pending) / Awaiting approval status) to the selected audience member. In some embodiments, query API module 1322 may obtain one or more preferences associated with the particular audience member from the chat client system used by that particular audience member interacting with CCT system 1300. As described herein, preferences associated with a spectator member can determine a selection of a word or phrase for which user feedback is requested from the spectator member, and / or the spectator member receives a query for user feedback of a specific word or phrase You can decide whether or not it is selected.

응답 평가 모듈(1324)은, 사용자 피드백을 얻기 위해 생성된 질의와 연관되어 제출된 질의 응답을 평가하도록 구성될 수 있다. 일부 실시예에서, 응답 평가 모듈(1324)은 하나 또는 그 이상의 단계에서 질의 응답을 평가할 수 있다.The response evaluation module 1324 may be configured to evaluate the submitted query response in association with the generated query to obtain user feedback. In some embodiments, the response evaluation module 1324 may evaluate query responses in one or more steps.

검증(validation) 단계(phase) 동안, 응답 평가 모듈(1324)은, 특정 생성된 질의에 대해 제출된, 그 특정 생성된 질의에 대한 승인 또는 거절로서, 하나 또는 그 이상의 고유 응답을 배치(disposition)할 수 있다. 일부 실시예에서, 사용자 피드백을 얻기 위해 생성된 질의와 관련하여 제출된 고유 응답은, 그 고유 응답이 그 생성된 질의와 연관된 단어 또는 구절에 대해 유효한 것으로 승인되거나, 그 생성된 질의와 연관된 단어 또는 구절에 대해 유효하지 않은 것으로 거절될 때까지, 계류 중인 승인으로 고려될 수 있다. 여기에 사용된 바와 같이, 특정 "고유 응답"은, 구문론에서는 정확히 일치하지 않지만 유사한(예를 들어, 다른 구두점 또는 띄어쓰기) 응답의 세트를 포함할 수 있고, 두 개 또는 그 이상의 응답 방법을 통해 수신될 수 있다. 고유 응답은 또한 여기서 "고유 질의 응답" 및 "고유 사용자 피드백 응답"으로 언급될 수 있다. 실시예에 따르면, 하나 이상의 고유 질의 응답은, 특정 생성된 질의에 대해 승인될 수 있다. 예를 들어, 하나 이상의 고유 질의 응답은, 찾아지고 있는 사용자 응답과 관련된 특정 단어 또는 구절을 정의할 때 승인될 수 있다. 응답은, 예를 들어, 응답(예, 가능한 오직 고유 응답)을 리뷰하고 배치하는 관리자 등에 의해 수동적으로 평가될 수 있다. 추가적으로, 또는 대안으로, 응답은, 응답의 카운트 및/또는 응답 카운트와 관련된 임계치에 기초하여, 제출된 응답을 승인 및/또는 거절하는 자동 프로세스에 의해 평가될 수 있다. 일부 실시예에서, 자동 검증(validation) 프로세스는, 특정 단어 또는 구절에 대한 최고의 제출된 응답을 필터링할 수 있고, 리뷰 및 배치를 위해 관리자 등에게 그 최고의 제출된 응답을 제공할 수 있다. 확인 단계(phase)는, 주기적으로(예, 스케줄에 기초하여) 또는 조건(예, 승인을 기다리는 제출된 응답의 수가 임계치를 만족하거나 넘을 때)에 기초하여 응답 평가 모듈(1324)에 의해 수행될 수 있다. During the validation phase, the response evaluation module 1324 disposes one or more unique responses, such as approval or rejection of the particular generated query, submitted for that particular generated query. can do. In some embodiments, a unique response submitted in relation to a query generated to obtain user feedback is either approved as valid for the word or phrase associated with the generated query, or a word associated with the generated query, or Until it is rejected as invalid for the passage, it may be considered pending approval. As used herein, a particular “unique response” may not be an exact match in syntax, but may include a set of similar (eg, different punctuation or spacing) responses, received via two or more response methods. Can be. Native responses may also be referred to herein as “unique query responses” and “unique user feedback responses”. According to an embodiment, one or more unique query responses may be approved for a particular generated query. For example, one or more unique query responses may be approved when defining a specific word or phrase related to the user response being sought. Responses can be passively evaluated, for example, by an administrator reviewing and deploying responses (eg, only unique responses possible). Additionally, or alternatively, the response can be evaluated by an automated process that approves and / or rejects the submitted response based on the response's count and / or threshold associated with the response count. In some embodiments, an automatic validation process can filter the best submitted responses for a particular word or phrase, and provide the best submitted responses to managers, etc. for review and placement. The verification phase may be performed by the response evaluation module 1324 periodically (eg, based on a schedule) or condition (eg, when the number of submitted responses awaiting approval satisfies or exceeds a threshold). You can.

일부 실시예에서, 복수의 고유 사용자 피드백 응답(예, 정의)이, 특정 단어 또는 구절에 대해 제공될 때, 응답 평가 모듈(1324)은, 그 복수의 고유 사용자 피드백 응답 중 최고로 평가된 고유 사용자 피드백 응답(예, 가장 인기 있는 응답)을 자동으로(및/또는 수동의 관리자 입력을 통해) 승인할 수 있으나, 그러나 그 최고로 평가된 고유 사용자 피드백 응답이 통계적으로 중요할 때만 오직 승인할 수 있다. 예를 들어, 제 1 고유 사용자 피드백 응답이, 특정 단어에 대해 26번 수신되었고, 제 2 고유 사용자 피드백 응답이, 동일한 그 특정 단어에 대해 24번 수신되었을 때, 제 1 고유 사용자 피드백은, 비록 그것이 그 특정 단어에 대해 최고로 평가된 고유 사용자 피드백 응답일지라도, 그 특정 단어에 대한 응답으로 승인되지 않을 수 있다. 이는 26이 24에 대해 통계적으로 중요하지 않기 때문이다. 제 1 고유 사용자 피드백은, 예를 들어 제 1 고유 사용자 피드백 응답이 최고로 평가된 고유 사용자 피드백 응답으로 남아있고 그 응답이 40에 달할 때까지 선택되지 않을 수 있다. In some embodiments, when a plurality of unique user feedback responses (eg, definitions) are provided for a specific word or phrase, the response evaluation module 1324 is the highest rated unique user feedback among the plurality of unique user feedback responses Responses (eg, the most popular responses) can be approved automatically (and / or via manual admin input), but only when the highest rated unique user feedback response is statistically important. For example, when a first unique user feedback response has been received 26 times for a particular word, and a second unique user feedback response has been received 24 times for that same particular word, the first unique user feedback response, although it is Even a unique user feedback response rated best for that particular word may not be accepted in response to that particular word. This is because 26 is not statistically significant for 24. The first unique user feedback may not be selected, for example, until the first unique user feedback response remains the highest rated unique user feedback response and the response reaches 40.

구절들이 동일하지 않고 사실상 유사할 수 있으며 동일한 의도를 전달할 수 있다는 것을 고려하면, 일부 실시예에서, 단어 에러율(WEB:Word Error Rate)은, 구절을 포함하는 고유 사용자 피드백 응답을 그룹핑하는데 사용될 수 있다. 두 구절에 대해, 단어 에러율은, 구절들 사이에 유사성을 전달하기 위해 단어의 교체, 삭제, 및 삽입을 측정할 수 있다. Considering that the verses are not the same and can be substantially similar and convey the same intent, in some embodiments, a Word Error Rate (WEB) can be used to group unique user feedback responses that contain the verse. . For both passages, the word error rate can measure the replacement, deletion, and insertion of words to convey similarity between passages.

다양한 실시예에서, 복수의 고유 사용자 피드백 응답(예를 들어, 정의)이 특정 구절에 대해 제공되면, 응답 평가 모듈(1324)은 자동으로(및/또는 수동의 관리자 입력을 통해) 최고로 평가된 고유 사용자 피드백 응답을 승인할 수 있다.In various embodiments, if multiple unique user feedback responses (eg, definitions) are provided for a particular phrase, the response evaluation module 1324 automatically (and / or through manual administrator input) is the best rated unique User feedback responses can be approved.

체크 단계(phase) 동안, 응답 평가 모듈(1324)은, 관중 멤버에 의해 제출된 응답이 승인, 거절 또는 검토 계류(예, 승인 계류(pending approval))로 배치되는지 여부를 결정할 수 있다. 일부 실시예에서, 데이터 저장부는, 특정 단어 또는 구절과 관련하여 관중 멤버에 의해 제출된 고유 응답이, 그 특정 단어 또는 구절에 대한 유효한 정의로서 승인 또는 거절되었는지에 관한 상태를 유지할 수 있다. 이에 따라, 체크 단계는, 특정 단어 또는 구절에 대해 이전에 제출된 고유 응답의 배치 상태를 유지하는 데이터 저장부를 가지고 컨설팅함으로써 특정 단어 또는 구절에 대해 제출된 응답의 배치를 결정할 수 있다; 그 제출된 응답은, 그 제출된 응답에 대응하는 고유 응답의 배치 상태를 공유한다. 실시예에 따르면, 제출된 응답에 대한 체트 단계는, 즉시 또는 응답이 제출된 후에 곧 수행될 수 있다. 제출된 응답이 체크 단계 동안 여전히 검토 계류 중인 것으로 결정되면, 체크 단계는, 그 동일한 또는 유사한 제출된 응답의 상태에 영향을 주는 검증(validation) 단계 다음의 이후 시간에 재수행될 수 있다. 그 제출된 응답의 상태는, 체크 단계 동안에 결정된 그 제출된 응답의 현재 배치에 따라 갱신될 수 있다. 여기에 설명하는 바와 같이, 특정 관중 멤버에 의해 제출된 하나 또는 그 이상의 응답의 현재 상태는, 그 응답들에 대한 현재 상태를 반영하는 리스트로서 제공될 수 있다. 응답 상태에 관한 더 많은 것은 도 28과 함께 후술한다. During the check phase, the response evaluation module 1324 may determine whether the response submitted by the spectator member is placed in an approval, rejection or review pending (eg, pending approval). In some embodiments, the data storage unit may maintain a state as to whether a unique response submitted by an audience member with respect to a particular word or phrase has been approved or rejected as a valid definition for that particular word or phrase. Accordingly, the checking step may determine the placement of the response submitted for the specific word or phrase by consulting with a data storage that maintains the placement of the unique response previously submitted for the specific word or phrase; The submitted response shares the deployment status of the unique response corresponding to the submitted response. According to an embodiment, the cheat step for a submitted response may be performed immediately or soon after the response has been submitted. If it is determined that the submitted response is still pending pending during the check step, the check step may be re-performed at a time following a validation step that affects the status of the same or similar submitted response. The status of the submitted response can be updated according to the current batch of the submitted response determined during the check phase. As described herein, the current status of one or more responses submitted by a particular audience member can be provided as a list reflecting the current status for those responses. More about the response status will be described later in conjunction with FIG. 28.

응답 평가 모듈(1324)은, 응답이 승인되는지 여부를 결정하는 것을 포함할 수 있다. 응답은, 다른 질의에 대한 응답으로 다른 사람에 의해 제공된 적어도 하나의 이전 응답에 기초하여 승인될 수 있고, 그 다른 질의는 그 다른 사람으로부터 단어 또는 구절에 대한 피드백을 얻기 위해 이전에 생성된다. 응답이 단어 또는 구절을 정확하게 정의하는 것으로 결정되면 그 응답은 승인될 수 있다.The response evaluation module 1324 can include determining whether the response is approved. The response may be approved based on at least one previous response provided by the other person in response to another query, the other query being previously generated to obtain feedback on the word or phrase from the other person. If the response is determined to accurately define a word or phrase, the response can be approved.

번역 데이터 저장부 업데이트 모듈(1326)은, 선택된 관중 멤버에 의해 사용자 피드백 질의에 대해 제출된 응답의 평가에 기초하여, 변환 또는 번역(예를 들어, 번역 데이터 저장부(210)에 저장된)을 업데이트하도록 구성될 수 있다. 예를 들어, 제 1 언어로 제출된 응답이, 응답 평가 동안, (동일한) 제 1 언어로 된 특정 단어에 대한 승인된 정의로 결정될 때, 제 1 언어로 된 그 특정 단어와 제 1 언어로 된 응답을 매핑하는 변환이, 추가되거나 업데이트될 수 있다. 다른 예에서, 제 2 언어로 제출된 응답이, 응답 평가 동안, 제 1 언어로 된 특정 단어에 대한 승인된 정의로 결정될 때, 제 1 언어로 된 그 특정 단어와 제 2 언어로 된 응답을 매핑하는 변환이, 추가되거나 업데이트될 수 있다. 변환 또는 번역의 업데이트는, 제출된 응답의 배치(disposition)가 승인으로 변경되는 체크 단계 동안 또는 그 체크 단계에 이어서, 번역 데이터 저장부 업데이트 모듈(1326)에 의해 수행될 수 있다. Translation data storage update module 1326 updates translations or translations (eg, stored in translation data storage 210) based on evaluation of responses submitted to user feedback queries by selected audience members. It can be configured to. For example, when a response submitted in the first language is determined during the response evaluation to an approved definition for a specific word in the (same) first language, that specific word in the first language and the first language Transforms that map responses can be added or updated. In another example, a response submitted in a second language maps the response in a second language to that specific word in a first language when a response is determined by an approved definition of a specific word in the first language during response evaluation. The conversion to be done can be added or updated. The translation or update of the translation may be performed by the translation data storage update module 1326 during or after the check step in which the disposition of the submitted response is changed to approval.

관중 능력 평가 모듈(1328)은, 정확하고 및/또는 유용한 사용자 피드백 응답을 제공하는 관중 멤버 능력에 연관된 신뢰 수준을 나타낼 수 있는 관중 멤버의 능력(competence)을 결정하도록 구성될 수 있다. 여기에 설명되는 바와 같이, 특정 관중 멤버의 능력은, 특정 관중 멤버에 의해 이전에 제공된 하나 또는 그 이상의 피드백 응답, 및/또는 관중 멤버의 하나 또는 그 이상의 언어 능력에 기초하여 결정될 수 있다. 예를 들어, 관중 멤버는, 그 관중 멤버가 특정 수의 승인된 피드백 응답을 달성하였을 때, 번역 실패와 연관된 하나 이상의 단어 또는 구절에 대한 사용자 피드백을 제공하는데 필요한 능력을 가지고 있는지 평가될 수 있다. 다른 예에서, 관중 멤버는, 특정 언어로 적어도 이전에 제출된 피드백 응답, 관중 멤버의 선호도, 및/또는 그 특정 언어의 유창함을 나타내는 관중 멤버에 관련된 정보에 기초하여 특정 언어의 능력을 갖고 있는지 평가될 수 있다. 또 다른 예에서, 부정확하거나 기만적인 것으로(예, 횡설수설 응답) 평가되었던 이전에 제출된 피드백 응답을 보유한 관중 멤버는, 낮은 능력을 가진 것으로 결정될 수 있다. The audience capacity assessment module 1328 may be configured to determine a crowd member's ability to indicate a confidence level associated with the audience member ability to provide an accurate and / or useful user feedback response. As described herein, the ability of a particular spectator member may be determined based on one or more feedback responses previously provided by a particular spectator member, and / or one or more language abilities of the spectator member. For example, an audience member can be evaluated to see if that audience member has the ability to provide user feedback for one or more words or phrases associated with a translation failure when a certain number of approved feedback responses have been achieved. In another example, an spectator member is assessed to have a proficiency in a particular language based at least on feedback responses previously submitted in a particular language, spectator member preferences, and / or information related to the spectator member indicating the fluency of that particular language. Can be. In another example, a spectator member with a previously submitted feedback response that has been evaluated as inaccurate or deceptive (eg, gibberish response) may be determined to have a low ability.

인센티브 보상 모듈(1330)은, 사용자 피드백을 위한 질의와 관련하여 관중 멤버에 의해 제출된 응답의 평가에 기초하여 관중 멤버에게 인센티브를 보상하도록 구성될 수 있다. 여기에 설명되는 바와 같이, 제출된 응답이 승인되면, 관중 멤버는, 인센티브를 보상받을 수 있다. 보상된 인텐티브의 양 또는 유형은, 다음을 포함하는 다수의 팩터들에 기초하여 결정될 수 있으나, 여기에 제한되지 않는다. 그 팩터는, QR 평가 모듈(1320)에 의해 할당된 질의 또는 질의 응답의 가치, 질의에 응답하여 관중 멤버에 의해 사용되는 응답 방법, 이미 (예, 특정 단어 또는 구절과 관련하여 또는 인센티브 기반 사용자 피드백을 통해 그 관중 멤버에게 또는 모든 관중 멤버들에게) 보상된 인센티브의 양, 질의의 언어 또는 제공된 응답의 언어, 및 응답이 제출된 단어 또는 구절의 유형(예, 채팅용어, 약어 또는 특정 도메인 단어 또는 구절)을 포함한다. 보상된 인센티브는, 실제 세계의 통화 또는, 관련된 가상 경제 밖에서는 가치(예, 실제 세계 경제에서의 통화 가치)가 있을 수도 있고 없을 수도 있는 인-게임 통화 또는 인-게임 아이템과 같은 가상 통화를 포함할 수 있다. 일부 실시예에서, 인센티브는, 연관된 통화 가치를 가질 수 있는 실제 세계 재화 또는 서비스 또는 가상 재화 또는 서비스를 포함할 수 있다. 인센티브의 다른 형태가 다른 실시예에서 보상될 수 있음은 당업자에게 인지될 수 있다. The incentive reward module 1330 may be configured to reward an incentive member to an audience based on an evaluation of a response submitted by the audience member in relation to a query for user feedback. As described herein, if the submitted response is approved, the spectator member can be rewarded with incentives. The amount or type of rewarded intent can be determined based on a number of factors including, but not limited to: The factor is the value of the query or query response assigned by the QR evaluation module 1320, the response method used by the spectator member in response to the query, already (e.g., related to a specific word or phrase or incentive based user feedback) The amount of incentives rewarded to that spectator member or to all spectator members via, the language of the query or the language of the response provided, and the type of word or phrase in which the response was submitted (eg, chat terms, abbreviations or specific domain words or Verse). Rewarded incentives include real world currencies, or virtual currencies such as in-game currencies or in-game items that may or may not have value outside the associated virtual economy (e.g., currency values in the real world economy). can do. In some embodiments, incentives may include real world goods or services or virtual goods or services that may have an associated currency value. It will be appreciated by those skilled in the art that other forms of incentives may be compensated in other embodiments.

일부 실시예에서, 인센티브 보상 모듈(1330), 관중 멤버들이 제출한 하나 또는 그 이상의 응답이 승인될 때, 및/또는 제출된 응답이 승인된 관중 멤버에게 인센티브가 보상될 때, 그 관중 멤버에게 알릴 책임이 있을 수 있다. 다양한 실시예에서, 인센티브 보상 모듈(1330)은, 알림 메시지(예, 팝업 메시지와 같은 인-채팅 메시지) 및/또는 제출된 응답에 대한 상태들의 리스트에 대한 업데이트를 통해 인센티브 보상을 관중 멤버에게 알릴 수 있다. In some embodiments, the incentive reward module 1330 notifies the spectator member when one or more responses submitted by the spectator members are approved, and / or when the submitted response is rewarded by an approved spectator member. You may be responsible. In various embodiments, the incentive reward module 1330 notifies spectator members of the incentive rewards through an update to the list of statuses for notification messages (eg, in-chat messages such as pop-up messages) and / or submitted responses. You can.

다양한 실시예에서 사용자 피드백을 위한 시스템은, 도 13에 도시된 것들보다 더 많은 구성요소 또는 더 적은 구성요소를 포함할 수 있고, 도 13에 도시된 각 구성요소는 각 구성요소에 대해 설명된 것보다 더 많이 또는 더 적은 동작을 수행할 수 있음은 당업자에게 자명한다. In various embodiments, a system for user feedback may include more or fewer components than those illustrated in FIG. 13, and each component illustrated in FIG. 13 is described for each component It will be apparent to those skilled in the art that more or fewer operations can be performed.

도 14는, 다양한 실시예에 따른 예시적인 사용자 피드백 클라이언트 시스템을 나타낸 블럭도이다. 설명을 목적으로, 도 14에서 사용자 피드백 클라이언트 시스템은, 다양한 실시예에 따른 채팅 클라이언트 시스템(1400)의 부분으로 도시된다. 예를 들어, 예시적인 사용자 피그백 클라이언트 시스템은, MMO 게임과 함께 이용할 수 있는 게임 채팅 클라이언트 시스템(예, 인-게임 채팅 클라이언트 시스템)의 부분일 수 있고, MMO 게임의 다양한 플레이어들은, 가능하면, 피드백에 대한 인센티브로서 인-게임 통화/크레딧/아이템을 보상받기 위해, 식별된 단어 또는 구절들에 대한 사용자 피드백을 제공하는 것을 선택할 수 있다. 일부 실시예에서, 예시적인 사용자 피드백 시스템, 이와 관련된 일부 구성요소 또는 전부는, 예시적인 통신 변환 및 번역 시스템(1300)으로부터 분리될 수 있음은 당업자에게 인정될 수 있다. 14 is a block diagram illustrating an exemplary user feedback client system according to various embodiments. For illustrative purposes, the user feedback client system in FIG. 14 is shown as part of a chat client system 1400 according to various embodiments. For example, an exemplary user pigback client system may be part of a game chat client system (eg, an in-game chat client system) available with MMO games, and various players of the MMO game, if possible, To reward in-game currency / credits / items as an incentive for feedback, one can choose to provide user feedback for the identified word or phrase. It may be appreciated by those skilled in the art that in some embodiments, the exemplary user feedback system, some or all of the components associated therewith, can be separated from the exemplary communication conversion and translation system 1300.

도 14에 도시된 바와 같이, 채팅 클라이언트 시스템(1400)은, 채팅 클라이언트 제어부(1402), 채팅 클라이언트 통신 모듈(1404) 및 채팅 클라이언트 GUI 모듈(1406)을 포함할 수 있다. 일부 실시예에서, 채팅 클라이언트 제어부(1402), 채팅 클라이언트 통신 모듈(1404) 및 채팅 클라이언트 GUI 모듈(1406)은, 여기에 설명하는 바와 같이, 채팅 클라이언트 시스템(104)의 구성요소와 유사할 수 있다.As illustrated in FIG. 14, the chat client system 1400 may include a chat client control unit 1402, a chat client communication module 1404, and a chat client GUI module 1406. In some embodiments, chat client control unit 1402, chat client communication module 1404, and chat client GUI module 1406 may be similar to components of chat client system 104, as described herein. .

도 14에 도시된 바와 같이, 채팅 클라이언트 시스템(1400)은, 변환/사용자 피드백(UF) 질의 선호도 모듈(1408) 및 변환/사용자 피드백(UF) 질의 그래픽 유저 인터페이스(GUI) 모듈(1410)을 포함할 수 있다. 일부 실시예에서, UF 질의 선호도 모듈(1408) 및/또는 UF 질의 GUI 모듈(1410)은, CTT 시스템(1300)과 관련하여 사용자 피드백 상호동작을 촉진한다. 채팅 클라이언트 시스템(1400)의 문맥에서, 채팅 클라이언트 시스템(1400)의 채팅 사용자는, 다양한 실시예의 사용자 피드백 시스템(예, CTT 시스템(1300))과 관련한 관중 멤버일 수 있다.As shown in FIG. 14, the chat client system 1400 includes a conversion / user feedback (UF) query preference module 1408 and a conversion / user feedback (UF) query graphical user interface (GUI) module 1410. can do. In some embodiments, UF query preference module 1408 and / or UF query GUI module 1410 facilitate user feedback interaction with respect to CTT system 1300. In the context of chat client system 1400, a chat user of chat client system 1400 may be an audience member associated with a user feedback system (eg, CTT system 1300) of various embodiments.

UF 질의 선호도 모듈(1408)은, 일부 실시예의 사용자 피드백 시스템(예, CTT 시스템(1300))과 관련하여 제공된 사용자 피드백 특징(feature)들에 관한 선호도를 채팅 사용자가 프리뷰하거나 재정의하거나 및/또는 조정하는 것을 관리하고 가능하게 하도록 구성될 수 있다. UF 질의 선호도 모듈(1408)에 의해 관리되는 선호도의 예는, 예를 들어, 사용자 피드백과 관련된 언어 선호도(예, 사용자 피드백이 요청된 단어 또는 구절의 언어 및/또는 찾아지는 사용자 피드백의 언어), 사용자 피드백 질의들에 대해 선호된 응답 방법(예, 프리-폼 응답보다 선택-폼 응답), 또는 선호된 단어 또는 구절 유형(예, 약어, 채팅용어, 물리 관련 또는 관용어) 등을 포함한다. The UF query preference module 1408 allows chat users to preview, redefine, and / or adjust preferences regarding user feedback features provided in connection with a user feedback system (eg, CTT system 1300) of some embodiments. It can be configured to manage and enable what to do. Examples of preferences managed by the UF query preferences module 1408 include, for example, language preferences associated with user feedback (eg, the language of the word or phrase in which the user feedback is requested and / or the language of the user feedback found). Preferred response methods for user feedback queries (e.g., select-form responses rather than pre-form responses), or preferred word or phrase types (e.g., abbreviations, chat terms, physics related or idioms), and the like.

여기에 사용되는 바와 같이, 선택-폼 응답은, 미리 결정되고 적어도 두 개 이상의 선택-폼 응답으로부터 선택할 수 있는 응답이다. 그 실시예에 따르면, 선택-폼 응답의 리스트는, 관중 멤버들이, 적용할 수 있을 때 두 개 이상의 응답을 선택할 수 있게 할 수 있다. 프리-폼 응답은, 관중 멤버에 의해 필드에 입력된 텍스트 기반 값(예, 문자 값, 또는 스트링 값)을 포함하는 응답이다.As used herein, a select-form response is a response that is predetermined and selectable from at least two or more select-form responses. According to the embodiment, the list of select-form responses may allow audience members to select two or more responses when applicable. The pre-form response is a response that includes text-based values (eg, character values, or string values) entered in the field by an audience member.

UF 질의 GIU 모듈(1410)은, 사용자 피드백을 위해 생성되어 채팅 사용자에게 제공되는 질의의 표시, 그 질의와 연관된 하나 이상의 응답 방법의 표시, 및/또는 그 표시된 응답 방법을 통해 채팅 사용자로부터 응답을 수신하는 것을 그래픽적으로 가능하게 한다. UF 질의 GUI 모듈(1410)은, 또한 UF 질의 선호도 모듈(1408)을 통한 선호도의 관리를 가능하게 할 수 있다. 그래픽 사용자 인터페이스에 관한 더 많은 것은, 도 17-23 및 24-31과 관련하여 후술한다. The UF query GIU module 1410 receives a response from a chat user via an indication of a query generated for user feedback and provided to the chat user, an indication of one or more response methods associated with the query, and / or the displayed response method It makes it possible to do it graphically. The UF query GUI module 1410 may also enable management of preferences through the UF query preference module 1408. More on the graphical user interface is described below in relation to FIGS. 17-23 and 24-31.

다양한 실시예에서, 사용자 피드백을 위한 채팅 시스템은, 도 14에 도시된 것보다 더 많은 구성요소 또는 더 적은 구성요소를 포함할 수 있고, 도 14에 도시된 각 구성요소는, 각 구성요소에 대해 설명된 동작보다 더 많은 동작 또는 더 적은 동작을 수행할 수 있음은 당업자에게 자명하다. In various embodiments, the chat system for user feedback may include more or fewer components than those illustrated in FIG. 14, and each component illustrated in FIG. 14 is for each component It is apparent to those skilled in the art that it is possible to perform more or less actions than the described actions.

도 15는 다양한 실시예에 따른 사용자 피드백을 위한 예시적인 방법을 설명하는 흐름도이다. 단계(1502)에서, 번역 실패 관리 모듈(1314)은, 예를 들어 제 1 언어에서 제 2 언어로 텍스트의 변환 또는 번역에서의 잠재적인 실패를 식별할 수 있다. 단계(1504)에서, 번역 실패 관리 모듈(1314)은, 또한 그 식별된 잠재적인 실패로부터 사용자 피드백을 위한 단어 또는 구절을 선택한다. 단계(1506)에서, UF 질의 관중 선택 모듈(1318)은, 사용자 피드백을 요청하기 위해 관중 멤버를 선택할 수 있다. 단계(1508)에서, UF 질의 생성 모듈(1316)은, 그 선택된 관중 멤버로부터 사용자 피드백을 얻기 위한 질의를 생성한다. 단계(1510)에서, 응답 평가 모듈(1324)은, 그 생성된 질의에 대한 응답을 수신할 수 있다. 질의 API 모듈(1322)은, 그 선택된 관중 멤버로 그 생성된 질의를 제공하고, 그 생성된 질의에 대한 응답을 수신할 책임이 있을 수 있다. 단계(1512)에서, 응답 평가 모듈(1324)은, 그 수신된 응답을 평가할 수 있다. 단계(1514)에서, 관중 능력 평가 모듈(1328)은, 단계(1510)에서 제공된 응답 및/또는 단계(1512)에서 수행된 그 수신된 응답의 평가에 기초하여, 그 선택된 관중 멤버의 능력을 평가할 수 있다. 단계(1516)에서, 인센티브 보상 모듈(1330)은, 그 선택된 관중 멤버에게 응답 평가에 기초하여 인센티브를 보상할 수 있다. 여기에 설명된 바와 같이, 응답을 평가하고 응답이 승인되는 것을 결정하면, 인센티브 보상 모듈(1330)은, QR 평가 모듈(1320)에 의해 결정된 질의 및/또는 질의 응답의 가치에 따라, 관중 멤버에게 보상을 보상할 수 있다. 단계(1518)에서, 번역 데이터 저장부 업데이트 모듈(1326)은, 응답 평가에 기초하여 변환 또는 번역을 업데이트할 수 있다. 여기에 설명된 바와 같이, 응답을 평가하고 응답이 승인되는 것을 결정하면, 번역 데이터 저장부 업데이트 모듈(1326)은, 질의의 단어 또는 구절 및 그 제출된 질의 응답에 대응하는 번역 또는 변환을 업데이트할 수 있다. 15 is a flowchart illustrating an example method for user feedback according to various embodiments. At step 1502, translation failure management module 1314 may identify potential failures in the translation or translation of text, for example, from a first language to a second language. At step 1504, translation failure management module 1314 also selects a word or phrase for user feedback from the identified potential failure. In step 1506, the UF query spectator selection module 1318 may select spectator members to request user feedback. In step 1508, the UF query generation module 1316 generates a query to obtain user feedback from the selected audience member. In step 1510, the response evaluation module 1324 may receive a response to the generated query. The query API module 1322 may be responsible for providing the generated query to the selected audience member and receiving a response to the generated query. In step 1512, the response evaluation module 1324 can evaluate the received response. In step 1514, the spectator ability evaluation module 1328 evaluates the ability of the selected spectator member based on the evaluation provided in step 1510 and / or the received response performed in step 1512. You can. In step 1516, the incentive reward module 1330 may reward the selected audience member based on the response evaluation. As described herein, upon evaluating the response and determining that the response is approved, the incentive reward module 1330, to the audience member, depending on the value of the query and / or query response determined by the QR evaluation module 1320 Rewards can be rewarded. In step 1518, the translation data storage update module 1326 may update the translation or translation based on the response evaluation. As described herein, upon evaluating the response and determining that the response is approved, the translation data store update module 1326 updates the word or phrase of the query and the translation or translation corresponding to the submitted query response. You can.

도 16은, 다양한 실시예에 따른 사용자 피드백을 위한 예시적인 데이터 흐름(1600)을 나타낸 블록도이다 .도시된 바와 같이, 데이터 흐름(1600)은, 채팅 클라이언트 시스템(1400), 번역 실패 관리 모듈(1314), UF 질의 생성 모듈(1316) , UF 질의 관중 선택 모듈(1318), 응답 평가 모듈(1324), 번역 데이터 저장부 업데이트 모듈(1326), 및 인센티브 보상 모듈(1330)이 관련된다. 데이터 흐름(1600)은, 미정의(undefined) 단어/구절 데이터 저장부(1602), 기록된 응답 데이터 저장부(1604), 피드백 관중 데이터 저장부(1606), 응답 승인 데이터 저장부(1608), 채팅 데이터 저장부(1610), 및 사전 데이터 저장부(1612)가 관련된다. 16 is a block diagram illustrating an exemplary data flow 1600 for user feedback according to various embodiments. As shown, the data flow 1600 includes a chat client system 1400, a translation failure management module ( 1314), a UF query generation module 1316, a UF query crowd selection module 1318, a response evaluation module 1324, a translation data storage update module 1326, and an incentive reward module 1330. The data flow 1600 includes an undefined word / phrase data storage unit 1602, a recorded response data storage unit 1604, a feedback spectator data storage unit 1606, a response approval data storage unit 1608, The chat data storage unit 1610 and the dictionary data storage unit 1612 are related.

미정의 단어/구절 데이터 저장부(1602)는, 번역 실패에 연관되고 사용자 피드백이 찾아지고 있는 단어 또는 구절을 포함할 수 있다. 미정의 단어/구절 데이터 저장부(1602)는, 단어 또는 구절과 함께, 단어 또는 구절이 사용되는 샘플 문장(예, 단어 또는 구절 문맥), 단어 또는 구절이 얼마나 중요한지(예, 단어 중요성)를 나타내는 신뢰도 측정, 단어 또는 구절에 대한 소스 언어, 찾아지는 사용자 피드백에 대한 타겟 언어 등을 포함한다. 일부 실시예에서, 미정의 단어/구절 데이터 저장부(1602)에서 단어 또는 구절의 단어 중요성은, 초기에는 모든 단어들이 동등하나, 단어 또는 구절이 충돌하고 문제가 있고 및/또는 번역할 수 없는 때 증가할 수 있다. The undefined word / phrase data storage unit 1602 may include words or phrases related to translation failures and user feedback is being sought. The undefined word / phrase data storage 1602 indicates a sample sentence (eg, word or phrase context) in which the word or phrase is used, along with the word or phrase, and how important the word or phrase is (eg, word importance). Reliability measures, source language for words or phrases, target language for user feedback found. In some embodiments, the word importance of a word or phrase in the undefined word / phrase data storage 1602 is when all words are equal initially, but the words or phrases conflict, have problems and / or cannot be translated Can increase.

기록된 응답 데이터 저장부(1604)는, 관중 멤버로부터 수신되고, 미정의 단어/구절 데이터 저장부(1602)에 포함된 단어 또는 구절에 대해 기록된, 사용자 피드백 응답을 포함할 수 있다. 일부 실시예에서, 사용자 피드백 응답은, 단어 또는 구절에 관한 사용자 피드백을 얻기 위해 생성된 질의에 대해 수신된 응답을 포함한다. 미정의 단어/구절 데이터 저장부(1602)는, 기록된 사용자 피드백 응답과 함께, 사용자 피드백 응답을 제출하는 관중 멤버에 대한 식별자, 사용자 피드백 응답이 수신되고 및/또는 기록될 때의 시간스탬프, 기록된 사용자 피드백 응답이 승인되는지 여부의 표시, 기록된 사용자 피드백 응답이 승인될 때의 타임스탬프 등을 포함한다. The recorded response data storage unit 1604 may include a user feedback response received from an audience member and recorded for a word or phrase included in the undefined word / phrase data storage unit 1602. In some embodiments, the user feedback response includes a response received for a query generated to obtain user feedback regarding a word or phrase. The undefined word / phrase data storage unit 1602 records, along with the recorded user feedback response, an identifier for an audience member submitting the user feedback response, a timestamp when the user feedback response is received and / or recorded It includes an indication of whether the user feedback response is approved, a timestamp when the recorded user feedback response is approved, and the like.

피드백 관중 데이터 저장부(1606)는, 미정의 단어/구절 데이터 저장부(1602)에 포함된 단어 또는 구절에 대한 사용자 피드백에 참여하는 것으로 선택된 관중 멤버에 대한 식별자의 세트를 포함할 수 있다. 피드백 관중 데이터 저장부(1606)ㄴ는, 관중 멤버에 대한 각 식별자와 함께, 사용자 피드백을 제공하는 관중 멤버의 일관성(consistency), 능력(competency), 및/또는 신뢰도(confidence)를 반영하는 신뢰도 점수를 포함할 수 있다.The feedback spectator data storage unit 1606 may include a set of identifiers for spectator members selected to participate in user feedback for words or phrases included in the undefined word / phrase data storage unit 1602. The feedback spectator data storage unit 1606 b, along with each identifier for the spectator member, scores a confidence level reflecting the consistency, competence, and / or confidence of the spectator member providing user feedback. It may include.

응답 승인 데이터 저장부(1608)는, 미정의 단어/구절 데이터 저장부(1602)에 포함된 단어 또는 구절과 관련되어 수신된 각 고유 사용자 피드백 응답을 포함할 수 있다. 응답 승인 데이터 저장부(1608)는, 각 고유 사용자 피드백 응답과 함께, 고유 사용자 피드백 응답이 승인된 응답(예, 정확한 응답)인지 여부의 표시, 거절된 응답(예, 부정확한 응답), 검토 계류 중 응답(예, 검토가 필요한 응답), 또는 일부 다른 상태를 갖는 응답을 포함할 수 있다. 일부 실시예에서, 응답 승인 데이터 저장부(1608)는, 관중 멤버로부터 수신되고 그 기록된 응답 데이터 저장부(1604)에 기록된 사용자 피드백 응답이 승인될 때를 결정하는데 채용될 수 있다. The response approval data storage unit 1608 may include each unique user feedback response received in association with a word or phrase included in the undefined word / phrase data storage unit 1602. Response acknowledgment data storage 1608, along with each unique user feedback response, indicates whether the unique user feedback response is an approved response (eg, an accurate response), rejected response (eg, incorrect response), pending review It can include either a medium response (eg, a response that needs to be reviewed), or a response with some other status. In some embodiments, the response approval data storage 1608 may be employed to determine when a user feedback response received from an audience member and recorded in the recorded response data storage 1604 is approved.

일부 실시예에 따라, 번역 실패 관리 모듈(1314)은, 채팅 데이터 저장부(1610)에 의해 제공될 수 있는 채팅 로그들을 검토하고, 실제적인 또는 잠재적인 번역 실패와 연관된 하나 또는 그 이상의 단어 또는 구절을 식별하도록 구성될 수 있다. 다양한 실시예에서, 번역 실패 관리 모듈(1314)은, 표준 사전(예, 옥스포드 사전) 및/또는, 여기에 설명된 실시예가 파싱할 수 있고, 인식할 수 있으며, 및/또는 처리할 수 있는 단어 또는 구절(예, 채팅용어 단어 또는 구절)의 사전을 포함할 수 있는 사전 데이터 저장부(1612)에 정의된, 이러한 단어들 또는 구절들을 배제하도록 구성될 수 있다. 번역 실패 관리 모듈(1314)에 의해 식별된 단어 또는 구절은, 미정의 단어/구절 데이터 저장부(1602)에 추가될 수 있고, 이로써 이러한 추가된 단어 또는 구절이, 선택된 관중 멤버로부터의 사용자 피드백을 위해 선택되도록 할 수 있다.According to some embodiments, the translation failure management module 1314 reviews chat logs that may be provided by the chat data storage 1610, and one or more words or phrases associated with actual or potential translation failures. It can be configured to identify. In various embodiments, translation failure management module 1314 may include standard dictionaries (eg, Oxford dictionaries) and / or words that the embodiments described herein can parse, recognize, and / or process. Or it may be configured to exclude such words or phrases, as defined in the dictionary data storage 1612, which may include a dictionary of phrases (eg, chat term words or phrases). Words or phrases identified by the translation failure management module 1314 may be added to the undefined word / phrase data storage 1602, whereby these added words or phrases receive user feedback from selected audience members. Can be selected.

번역 실패 관리 모듈(1314)은, 사용자 피드백을 위해, 미정의 단어/구절 데이터 저장부(1602)로부터, 하나 또는 그 이상의 단어 또는 구절을 선택하도록 구성될 수 있다. 일부 실시예에서, 번역 실패 관리 모듈(1314)은, 미정의 단어/구절 데이터 저장부(1602)에서 가장 높은 중요성을 갖는 것으로 지명된 단어 또는 구절의 세트(예, 미정의 단어/구절 데이터 저장부(1602)에 있는 탑 10의 중요한 단어 또는 구절로부터 선택된)로부터 선택할 수 있다. 일부 실시예에서, 번역 실패 관리 모듈(1314)은, 관중 멤버에게 응답을 위해 선택할 수 있는 두 개 또는 그 이상의 사용자 피드백의 세트가 제공되도록 두 개 또는 그 이상의 단어 또는 구절을 선택할 수 있다(예를 들어, 관중 멤버가 가장 신뢰하는 응답이라고 느끼는 이러한 사용자 피드백들을 선택할 수 있도록 하는). 미정의 단어/구절 데이터 저장부(1602)로부터 번역 실패 관리 모듈(1314)에 의한 선택 프로세스는, 단어 또는 구절의 중요성, 미정의 단어/구절 데이터 저장부(1602)에서의 단어 또는 구절의 오래됨, UF 질의 관중 선택 모듈(1318)에 의한 그 선택된 관중 멤버의 선호도, UF 질의 관중 선택 모듈(1318)에 의해 선택된 관중 멤버가, 선택될 단어 또는 구절에 대해 이미 응답하였는지 여부(예, 선택될 단어 또는 구절에 대해 그 기록된 응답 데이터 저장부(1604)를 체크하는 것에 기초하여 결정) 등에 기초하여, 랜덤일 수 있다. The translation failure management module 1314 may be configured to select one or more words or phrases from the undefined word / phrase data storage unit 1602 for user feedback. In some embodiments, the translation failure management module 1314 sets a word or phrase designated as having the highest importance in the undefined word / phrase data storage unit 1602 (eg, undefined word / phrase data storage unit) (Selected from the top 10 important words or phrases in 1602). In some embodiments, the translation failure management module 1314 may select two or more words or phrases so that the audience member is provided with a set of two or more user feedbacks selectable for response (eg For example, allowing audience members to select these user feedbacks they feel is the most trusted response). The selection process by the translation failure management module 1314 from the undefined word / phrase data storage unit 1602 includes the importance of the word or phrase, the word or phrase in the undefined word / phrase data storage unit 1602, The preference of the selected audience member by the UF query audience selection module 1318, whether the audience member selected by the UF query audience selection module 1318 has already responded to the word or phrase to be selected (eg, a word to be selected or Based on checking the recorded response data storage 1604 for a verse) and the like.

UF 질의 관중 선택 모듈(1318)은, 사용자 피드백이 찾아질 수 있는, 피드백 관중 데이터 저장부(1606)로부터, 하나 또는 그 이상의 관중 멤버를 선택하도록 구성될 수 있다. 여기에 설명되는 바와 같이, 사용자 피드백은, 가능하면 채팅 데이터 저장부(1610)로부터, 번역 실패 관리 모듈(1314)에 의해 선택된 단어 또는 구절에 대해 찾아질 수 있다. 피드백 관중 데이터 저장부(1606)로부터의 관중 멤버의 선택은, 관중 멤버에 연관된 능력(competency) 레벨에 의존할 수 있다. The UF query spectator selection module 1318 may be configured to select one or more spectator members from the feedback spectator data storage 1606 where user feedback can be found. As described herein, user feedback may be searched for the word or phrase selected by the translation failure management module 1314, possibly from the chat data storage 1610. The choice of spectator members from the feedback spectator data storage 1606 may depend on the level of competence associated with the spectator members.

UF 질의 생성 모듈(1316)은, 사용자 피드백을 위해, 미정의 단어/구절 데이터 저장부(1602)로부터, 번역 실패 관리 모듈(1314)에 의해 선택된 단어 또는 구절에 대한 하나 또는 그 이상의 질의를 생성하도록 구성될 수 있다. 보여진 바와 같이, 번역 실패 관리 모듈(1314)은, UF 질의 생성 모듈(1316)로, 하나 또는 그 이상의 질의가 생성될 그 선택된 단어 또는 구절을 제공할 수 있다. 여기에 설명된 바와 같이, UF 질의 생성 모듈(1316)은, 예를 들어 UF 질의 관중 선택 모듈(1318)에 의해 선택된 관중 멤버들의 선호도 및 번역 실패 관리 모듈(1314)에 의해 사용자 피드백을 위해 선택된 단어 또는 구절을 포함하는 질의를 생성할 때, 많은 다른 팩터를 고려할 수 있다. 결국, UF 질의 생성 모듈(1316)은, 번역 실패 관리 모듈(1314)에 의해 선택되고 제공된 각 단어에 대해 다른 질의를 생성할 수 있는 UF 질의 생성 모듈(1316)에 의해 생성된 하나 또는 그 이상의 질의를, 채팅 클라이언트 시스템(1400)으로 제공할 수 있다. The UF query generation module 1316 generates one or more queries for words or phrases selected by the translation failure management module 1314 from the undefined word / phrase data storage unit 1602 for user feedback. Can be configured. As shown, the translation failure management module 1314 may provide the selected word or phrase in which one or more queries will be generated, to the UF query generation module 1316. As described herein, the UF query generation module 1316 is a word selected for user feedback by the preference and translation failure management module 1314 of the spectator members selected, for example, by the UF query audience selection module 1318. Or, when creating a query that includes phrases, there are many different factors to consider. As a result, the UF query generation module 1316 may generate one or more queries generated by the UF query generation module 1316 which may generate different queries for each word selected and provided by the translation failure management module 1314. , Can be provided to the chat client system 1400.

결국, UF 질의 생성 모듈(1316)에 의해 생성된 하나 또는 그 이상의 질의는, 채팅 클라이언트 시스템(1400)으로 제공될 수 있고, 채팅 클라이언트 시스템(1400)은 차례로 그 제공된 질의를 채팅 클라이언트 시스템(1400)에 있는 사용자의 선택을 위해 제공한다. 실시예에 따르면, UF 질의 생성 모듈(1316)은, 그 생성된 질의를 채팅 클라이언트 시스템으로 제공할 수 있고 또는, 대안으로, 다른 구성요소가 그 생성된 질의를 채팅 클라이언트 시스템으로 제공하는데 책임질 수 있다. 일단 그 생성된 질의가 채팅 클라이언트 시스템(1400)에서의 선택을 위해 제공되면, 채팅 클라이언트 시스템(1400)의 사용자는, 하나 또는 그 이상의 그 제공된 질의에 응답하는 것을 선택할 수 있고, 채팅 클라이언트 시스템(1400)에 의해 제공된 이 응답들은, 기록된 응답 데이터 저장부(1604)에 추가(예, 기록)될 수 있다.As a result, one or more queries generated by the UF query generation module 1316 may be provided to the chat client system 1400, and the chat client system 1400 in turn queries the provided queries to the chat client system 1400. Provides for user's choice in. According to an embodiment, the UF query generation module 1316 may provide the generated query to the chat client system, or alternatively, other components may be responsible for providing the generated query to the chat client system. . Once the generated query is provided for selection in the chat client system 1400, a user of the chat client system 1400 can choose to respond to one or more of the provided queries, and the chat client system 1400 These responses provided by) may be added (eg, recorded) to the recorded response data storage 1604.

응답이, 기록된 응답 데이터 저장부(1604)에 추가될 때, 일부 실시예는 그 추가된 응답이 응답 평가 모듈(1324)에 의해 평가될 수 있는지 체크한다. 여기에 설명된 바와 같이, 응답 평가 모듈(1324)은, 그 응답을 체크함으로써 응답을 평가할 수 있고, 응답의 상태를 배치할 수 있다. When a response is added to the recorded response data storage 1604, some embodiments check whether the added response can be evaluated by the response evaluation module 1324. As described herein, the response evaluation module 1324 can evaluate the response by checking the response, and can arrange the status of the response.

도 16에 도시된 바와 같이, 응답 평가 모듈(1324)은, 사용자 피드백 응답의 평가 동안, 사용자 피드백 응답의 상태 체크를 수행하도록 구성될 수 있는, 응답 체크 모듈(1614)을 포함한다. 응답 체크 모듈(1614)은, 기록된 응답 데이터 저장부(1604)로부터 사용자 피드백 응답을 검색하고, 응답 승인 데이터 저장부(1608)에서 그 검색된 사용자 피드백에 대응하는 고유 응답의 상태를 체크함으로써 채팅 클라이언트 시스템(1400)으로부터의 사용자 피드백 응답의 상태를 체크할 수 있다. 이와 같이 하는 동안, 응답 체크 모듈(1614)은, 특정 사용자 피드백 응답이 승인되는지 또는 거절되는지를 결정할 수 있다. 기록된 응답 데이터 저장부(1604)에서의 그 검색된 사용자 피드백 응답의 승인 상태는, 응답 체크 모듈(1614)에 의해 수행된 최근 상태 체크에 따라 업데이트될 수 있다. 응답 체크 모듈(1614)이, 그 검색된 사용자 피드백 응답이 승인되었는지를 결정하는데 있어서, 기록된 응답 데이터 저장부(1604)에 있는 그 검색된 사용자 피드백 응답의 승인 상태는, 그 승인을 반영하도록 그리고 그 승인 상태가 업데이트되었던 시기에 대한 타임스탬프를 포함하도록 업데이트될 수 있다. 결국, 그 검색된 사용자 피드백 응답에 대해, 기록된 응답 데이터 저장부(1604)에 반영된 승인은, 인센티브 보상 모듈(1330)에서 그 승인된 사용자 피드백 응답을 제출하였던 관중 멤버에게 인센티브를 보상하는 것으로 결론날 수 있다. As shown in FIG. 16, the response evaluation module 1324 includes a response check module 1614, which may be configured to perform a status check of the user feedback response during evaluation of the user feedback response. The response check module 1614 retrieves a user feedback response from the recorded response data storage unit 1604, and checks the status of the unique response corresponding to the retrieved user feedback in the response approval data storage unit 1608, thereby chatting client The status of the user feedback response from system 1400 may be checked. While doing so, the response check module 1614 can determine whether a particular user feedback response is approved or rejected. The approval status of the retrieved user feedback response in the recorded response data storage 1604 may be updated according to the latest status check performed by the response check module 1614. When the response check module 1614 determines whether the retrieved user feedback response has been approved, the approval status of the retrieved user feedback response in the recorded response data storage 1604 reflects the approval and the approval It can be updated to include a timestamp for when the status was updated. Eventually, for the retrieved user feedback response, the approval reflected in the recorded response data storage 1604 concludes that the incentive reward module 1330 compensates the incentive to the audience member who submitted the approved user feedback response. Can be.

검색된 번역 응답의 상태가 여전히 검토 계류 중이면, 응답 체크 모듈(1614)은, 나중에(예, 미리 결정된 스케줄에 따라) 그 검색된 사용자 피드백 응답의 상태를 다시 체크할 수 있다. 그 검색된 번역 응답의 상태가 거절이면, 기록된 응답 데이터 저장부(1604)에 있는 그 검색된 사용자 피드백 응답의 승인 상태는 그 거절을 반영하도록 업데이트될 수 있다.If the status of the retrieved translation response is still pending review, the response check module 1614 may check again the status of the retrieved user feedback response later (eg, according to a predetermined schedule). If the status of the retrieved translation response is rejected, the approval status of the retrieved user feedback response in the recorded response data storage 1604 can be updated to reflect the rejection.

그 검색된 사용자 피드백 응답에 대응하는 고유 응답이 발견되지 않으면, 그 검색된 사용자 피드백 응답은, 응답 승인 데이터 저장부(1608)에, 사용자 피드백 응답이 제공되었던(예, 채팅 클라이언트 시스템(1400)의 사용자에 의해) 단어 또는 구절에 대한 고유 응답으로서 추가될 수 있다. 추가적으로, 검색된 사용자 피드백 응답이, 고유 응답으로서 응답 승인 데이터 저장부(1608)에 추가되면, 그 고유 응답은 승인 계류 중인 초기 상태를 가질 수 있고, 이는 고유 응답의 상태가 수동으로 또는 자동적으로 배치되는(예, 평가 응답 모듈(1324)을 통해) 시간까지 유지될 것이다.If a unique response corresponding to the retrieved user feedback response is not found, the retrieved user feedback response is sent to the response approval data storage unit 1608, to the user of the chat client system 1400, for example. Can be added as a unique response to a word or phrase. Additionally, if the retrieved user feedback response is added to the response acknowledgment data storage 1608 as a unique response, the unique response may have an initial status pending approval, in which the status of the native response is manually or automatically deployed. (Eg, via evaluation response module 1324).

도 16에 도시된 바와 같이, 응답 평가 모듈(1324)은 또한, 고유 사용자 응답의 상태를 승인, 거절, 또는 승인 계류 중으로 배치하도록 구성될 수 있는, 응답 검증기(1616)를 포함한다. 여기에 설명된 바와 같이, 사용자 피드백을 얻기 위해 생성된 질의와 관련하여 제출된 고유 사용자 응답은, 고유 사용자 피드백 응답이, 그 생성된 질의와 연관된 단어 또는 구절에 대해 유효한 것으로 승인되거나, 그 생성된 질의와 연관된 단어 또는 구절에 대해 유효하지 않은 것으로 거절될 때까지, 승인 계류 중인 것으로 고려될 수 있다. As shown in FIG. 16, the response evaluation module 1324 also includes a response verifier 1616, which can be configured to place the status of the unique user response into approval, rejection, or pending approval. As described herein, a unique user response submitted in connection with a query generated to obtain user feedback is either approved, or generated, that the unique user feedback response is valid for the word or phrase associated with the generated query. It can be considered pending approval until it is rejected as invalid for the word or phrase associated with the query.

일부 실시예에서, 응답 평가 모듈(1324)은, 특정 단어 또는 구절이 추가적인 사용자 피드백이 더 이상 필요하지 않은 때를 결정하도록 구성될 수 있다. 응답 평가 모듈(1324)은, 얼마나 많은 고유 사용자 피드백 응답이 특정 단어 또는 구절에 대해 승인되었는지, 그리고 고유 및 승인된 사용자 피드백 응답의 수가 특정 임계치와 같거나 초과하는지와 같은 예의 팩터들에 기초하여 그러한 결정을 할 수 있다. 특정 단어 또는 구절이 더 오래 추가적인 사용자 피드백을 필요로 하는 것으로 결정될 때, 응답 평가 모듈(1324)은, 미정의 단어/구절 데이터 저장부(1602)로부터 특정 단어 또는 구절을 제거하도록 구성될 수 있고, 이로써 사용자 피드백을 위한 향후 선택(예, 번역 실패 관리 모듈(1314)에 의해)으로부터 그 단어 또는 구절을 제거한다. In some embodiments, response evaluation module 1324 may be configured to determine when a particular word or phrase no longer requires additional user feedback. The response evaluation module 1324 is based on example factors such as how many unique user feedback responses are approved for a specific word or phrase, and the number of unique and approved user feedback responses equals or exceeds a certain threshold. You can make a decision. When it is determined that a specific word or phrase requires additional user feedback for a longer time, the response evaluation module 1324 may be configured to remove the specific word or phrase from the undefined word / phrase data storage 1602, This removes the word or phrase from future selection for user feedback (eg, by translation failure management module 1314).

여기에 설명된 바와 같이, 인센티브 보상 모듈(1330)은, 관중 멤버가 제공한(예, 채팅 클라이언트 시스템(1400)을 통해 관중 멤버에 의해) 사용자 피드백 응답이, 사용자 피드백 응답이 제공되었던 단어 또는 구절에 대한 유효한 응답으로 승인되면, 그 관중 멤버에게 인센티브를 보상할 수 있다. 인센티브 보상 모듈(1330)은, 기록된 응답 데이터 저장부(1604)에서, 최근에 승인된(예, 그 승인 상태는 승인을 반영하도록 최근에 업데이트되었음) 및/또는 인센티브 보상 모듈(1330)이 그 기록된 응답 데이터 저장부(1604)에서 승인된 상태를 갖는 하나 또는 그 이상의 사용자 피드백 응답을 식별하려고 시도했던 마지막 이후에 승인된, 하나 또는 그 이상의 사용자 피드백 응답을 식별할 수 있다. 인센티브 보상 모듈(1330)은, 기록된 응답 데이터 저장부(1604)에 있는 사용자 피드백 응답에 대해 포함되어 있는 승인 타임스탬프에 기초하여, 특정 사용자 피드백 응답이 마지막에 승인되었던 시간을 결정할 수 있다. 일부 실시예에서, 기록된 응답 데이터 저장부(1604)에 있는 번역된 피드백 응답에 대해 인센티브가 보상되면, 번역된 피드백 응답은, 기록된 응답 데이터 저장부(1604)로부터 제거될 수 있다. 대안으로, 기록된 응답 데이터 저장부(1604)에 있는 번역된 피드백 응답에 대해 인센티브가 보상되면, 번역된 피드백 응답은, 예를 들어, 인센티브가 보상되었던 시기, 보상된 인센티브의 양, 보상된 인센티브의 유형, 관중 멤버가 보상을 통지받은 시기, 및/또는 관중 멤버가 인센티브 보상을 통지받은 방법 등을 표시하도록, 그 기록된 응답 데이터 저장부(1604)에서 업데이트될 수 있다. As described herein, the incentive reward module 1330 may be a word or phrase in which a user feedback response provided by an audience member (eg, by an audience member via the chat client system 1400), was provided with a user feedback response. If approved as a valid response to, you can reward the spectator members with incentives. The incentive reward module 1330 may include, in the recorded response data storage unit 1604, recently approved (eg, the approval status has been recently updated to reflect the approval) and / or the incentive reward module 1330. The recorded response data storage 1604 may identify one or more user feedback responses that have been approved since the last attempt to identify one or more user feedback responses with an approved status. The incentive reward module 1330 may determine a time at which the specific user feedback response was last approved based on the approval timestamp included for the user feedback response in the recorded response data storage 1604. In some embodiments, if the incentive is compensated for the translated feedback response in the recorded response data storage 1604, the translated feedback response may be removed from the recorded response data storage 1604. Alternatively, if the incentive is compensated for the translated feedback response in the recorded response data storage 1604, the translated feedback response may be, for example, when the incentive was compensated, the amount of the incentive compensated, the compensated incentive It can be updated in the recorded response data storage 1604 to indicate the type of, when the audience member was notified of the reward, and / or how the audience member was notified of the incentive reward, and the like.

도 17은, 다양한 실시예에 따른 단어에 대한 사용자 피드백을 수신하는 예시적인 스크린샷을 나타낸다. 특히, 도 17은, 사용자 피드백 프로세스를 촉진하기 위해 관중 멤버에게 제공될 수 있는(예, 채팅 클라이언트 시스템(1400)을 통해) GUI의 예를 나타내는 스크린샷(1702, 1704 및 1706)이다. 스크린샷(1702)은, 하나 또는 그 이상의 관중 멤버에게 번역 실패와 연관된 단어 또는 구절에 대한 사용자 피드백 참여를 간청하는 배너(1708)의 예를 나타낸다. 관중 멤버는, 사용자 피드백 세션의 시작으로 이끌 수 있고 및/또는 관중 멤버를 그 관중 멤버가 참여하는 것을 선택할 수 있는 이용 가능한 사용자 피드백 기회들의 리스트로 이끌 수 있는, 배너(1708)를 선택함으로써 사용자 피드백에 참여하는 것을 선택할 수 있다. 여기에 설명된 바와 같이, 사용자 피드백 기회는, 관중 멤버가 실제적인 또는 잠재적인 번역 실패와 연관된 단어 또는 구절에 대해 정의를 제공할 수 있도록 허용할 수 있다. 일부 실시예에 따르면, 관중 멤버는, 단어 또는 구절에 연관된 이용 가능한 사용자 피드백 기회들 중 하나를 선택할 수 있고, 그러면 그렇게 할 수 있도록 유도될 때, 그 연관된 단어 또는 구절에 대한 정의를 제공한다. 17 shows example screenshots for receiving user feedback for words in accordance with various embodiments. In particular, FIG. 17 is a screenshot 1702, 1704 and 1706 showing an example of a GUI that can be provided to an audience member (eg, via the chat client system 1400) to facilitate the user feedback process. Screenshot 1702 shows an example of a banner 1708 encouraging one or more audience members to participate in user feedback on a word or phrase associated with a translation failure. Audience members can direct user feedback by selecting banner 1708, which can lead to the start of a user feedback session and / or lead a spectator member to a list of available user feedback opportunities from which the spectator member can choose to participate. You can choose to participate. As described herein, user feedback opportunities can allow audience members to provide definitions for words or phrases associated with actual or potential translation failures. According to some embodiments, the spectator member may select one of the available user feedback opportunities associated with the word or phrase, and when prompted to do so, provide a definition of the associated word or phrase.

스크린샷(1704)은, 다양한 단어들(예, “Skrilla”, “Booty”, “Cray”, “Hecka” 및 “Freshness”)에 대한 이용 가능한 사용자 피드백 기회들의 리스트(1710)를 나타낸다. 스크린샷(1706)은, 단어 “Skrilla”에 대한 사용자 피드백을 얻기 위해 관중 멤버에게 제시된 질의(1712)의 예를 제공한다. 도시된 바와 같이, 질의(1712)는, 단어 “Skrilla”가 사용되는 예시적인 문맥(1714)을 제공하고, 또한 질의(1712)에 대한 프리-폼 응답을 수신하도록 구성된 필드(1716)를 제공한다. 관중 멤버가 스크린샷(1704)의 리스트(1710)로부터 단어 “Skrilla”에 대한 사용자 피드백을 선택할 때 해당 관중 멤버는 스크린샷(1706)으로 안내된다.Screenshot 1704 shows a list 1710 of available user feedback opportunities for various words (eg, “Skrilla”, “Booty”, “Cray”, “Hecka” and “Freshness”). Screenshot 1706 provides an example of a query 1712 presented to an audience member to get user feedback on the word “Skrilla”. As shown, query 1712 provides an example context 1714 in which the word “Skrilla” is used, and also provides a field 1716 configured to receive a pre-form response to query 1712. . When the spectator member selects user feedback for the word “Skrilla” from the list 1710 of the screenshot 1704, the spectator member is directed to the screenshot 1706.

도 18은, 다양한 실시예에 따른 사용자 피드백을 스킵하는 예시적인 스크린샷을 나타낸다. 특히 도 18은, 사용자 피드백 프로세스를 가능하게 하기 위해 관중 멤버에게 제시될 수 있는(예, 채팅 클라이언트 시스템(1400)을 통해) GUI의 예를 나타내는 스크린샷(1802, 1804)이다. 스크린샷(1802)은 관중 멤버에 의해 선택이 가능한 사용자 피드백 기회들의 리스트(1806)를 나타낸다. 도시된 바와 같이, 리스트(1806)는 관중 멤버에게 리스트된 하나 또는 그 이상의 사용자 피드백 기회들을 스킵하는 옵션을 제공한다. 18 shows example screenshots for skipping user feedback according to various embodiments. In particular, FIG. 18 is a screenshot 1802, 1804 showing an example of a GUI that may be presented to an audience member (eg, via the chat client system 1400) to enable a user feedback process. Screenshot 1802 shows a list 1806 of user feedback opportunities that are selectable by an audience member. As shown, list 1806 provides spectator members an option to skip one or more of the user feedback opportunities listed.

스크린샷(1804)은 다양한 단어에 대한 사용자 피드백을 얻기 위해 관중 멤버에게 제시된 질의(1808)의 예를 나타낸다. 도시된 바와 같이, 질의(1808)는 관중 멤버에게 질의(1808)에 대한 응답을 제공하는 프로세스를 스킵할 수 있는 옵션을 제공한다. 특정 실시예는, 특정 사용자 피드백 기회들 및/또는 다양한 사용자 피드백 질의들을 스킵할 수 있는 옵션을 관중 멤버에게 제공함으로써 다양한 사용자 피드백 질의들에 대한 부정확한 및/또는 허구의 응답을 피할 수 있게 한다.Screenshot 1804 shows an example of a query 1808 presented to an audience member to get user feedback for various words. As shown, query 1808 provides an audience member with the option to skip the process of providing a response to query 1808. Certain embodiments enable avoiding inaccurate and / or fictional responses to various user feedback queries by providing an audience member with the option to skip specific user feedback opportunities and / or various user feedback queries.

도 19는 다양한 실시예에 따른 구절에 대한 사용자 피드백을 수신하는 예시적인 스크린샷을 나타낸다. 특히 도 19는, 사용자 피드백 프로세스를 촉진하기 위해 관중 멤버에게 제시될 수 있는(예, 채팅 클라이언트 시스템(1400)을 통해) GUI의 예를 나타내는 스크린샷(1902, 1904)이다. 스크린샷(1902)은 관중 멤버에 의해 선택이 가능한 사용자 피드백 기회들의 리스트(1906)를 나타낸다. 도시된 바와 같이, 리스트(1906)는 다양한 단어 및 구절(예, “Skrilla” 및 “Pardon my french”)에 대한 이용 가능한 사용자 피드백 기회를 나타낸다. 19 shows example screenshots for receiving user feedback for a phrase in accordance with various embodiments. In particular, FIG. 19 is a screenshot 1902, 1904 showing an example of a GUI that can be presented to an audience member (eg, via the chat client system 1400) to facilitate the user feedback process. Screenshot 1902 shows a list of user feedback opportunities 1906 selectable by an audience member. As shown, list 1906 represents available user feedback opportunities for various words and phrases (eg, “Skrilla” and “Pardon my french”).

스크린샷(1904)은, 구절 “Pardon my french”에 대한 사용자 피드백을 얻기 위해 관중 멤버에게 제시된 질의(1908)의 예를 나타낸다. 도시된 바와 같이, 질의(1908)는 구절 “Pardon my french”가 사용된 예시적인 문맥(1910)을 제공하고, 또한 질의(1910)에 대해 프리-폼 응답을 수신하도록 구성된 필드(1912)를 제공한다. 스크린샷(1902)의 리스트(1906)로부터 구절 “Pardon my french”에 대한 사용자 피드백이 청중 멤버에 의해 선택될 때, 그 관중 멤버는 스크린샷(1904)으로 안내될 수 있다. Screenshot 1904 shows an example of a query 1908 presented to an audience member to get user feedback for the phrase “Pardon my french”. As shown, query 1908 provides an example context 1910 in which the phrase “Pardon my french” is used, and also provides a field 1912 configured to receive a pre-form response to query 1910 do. When user feedback for the phrase “Pardon my french” from the list 1906 of the screenshot 1902 is selected by the audience member, the audience member can be directed to the screenshot 1904.

도 20은 다양한 실시예에 따른 선택-폼 응답의 리스트를 통해 사용자 피드백을 수신하는 예시적인 스크린샷을 나타낸다. 특히, 도 20은, 사용자 피드백 프로세스를 촉진하기 위해 관중 멤버에게 제시될 수 있는(예, 채팅 클라이언트 시스템(1400)을 통해) GUI의 예를 나타내는 스크린샷(2002, 2004, 2006)이다. 스크린샷(2002)은 번역 실패와 연관된 단어 또는 구절에 대한 사용자 피드백에 하나 또는 그 이상의 관중 멤버들이 참여하도록 요청하는 배너(2008)의 예를 나타낸다. 관중 멤버는, 배너(2008)를 선택함으로써 사용자 피드백에 참여하는 것을 선택할 수 있고, 그 배너(2008)는 사용자 피드백 세션의 시작으로 안내할 수 있고 및/또는 관중 멤버가 참여를 선택할 수 있는 이용 가능한 사용자 피드백 기회들의 리스트로 관중 멤버를 안내할 수 있다.20 shows an example screenshot for receiving user feedback through a list of select-form responses according to various embodiments. In particular, FIG. 20 is a screenshot (2002, 2004, 2006) showing an example of a GUI that can be presented to an audience member (eg, via the chat client system 1400) to facilitate the user feedback process. Screenshot 2002 shows an example of a banner 2008 that requests one or more spectator members to participate in user feedback for a word or phrase associated with a translation failure. Spectator members can choose to participate in user feedback by selecting banner 2008, which banner 2008 can guide to the start of a user feedback session and / or the audience member is available to choose to participate A list of user feedback opportunities can direct the audience member.

스크린샷(2004)은, 단어 “Skrilla”에 대한 사용자 피드백을 얻기 위해 관중 멤버에게 제시된 질의(2010)의 예를 제공한다. 질의(2010)는, 관중 멤버가 선택할 수 있는 가능한 응답들을 나열한 선택-폼 응답(2012)을 포함한다. 스크린샷(2006)은, 단어 “Skrilla”에 대해 정확한 응답 “money”가 선택될 때, 보상된 인센티브(예, 5 골드 코인)를 알림(2014 및 2016)에 의해 제시한다. The screenshot 2004 provides an example of a query 2010 presented to an audience member to get user feedback on the word “Skrilla”. The query 2010 includes a select-form response 2012 listing possible responses that can be selected by an audience member. Screenshot 2006 presents rewarded incentives (eg, 5 gold coins) by notifications (2014 and 2016) when the correct response “money” is selected for the word “Skrilla”.

도 21은 다양한 실시예에 따른 선택의 리스트를 생성하는 예시적인 스크린샷을 나타낸다. 도 21에서, 스크린샷(2102)은, 단어 “Skrilla”에 대한 사용자 피드백을 얻기 위해 관중 멤버에게 제시된 질의(2106)의 예를 제공한다. 도시된 바와 같이, 질의(2106)는 단어 “Skrilla”가 사용된 예시적인 문맥(2108)을 제공하고, 또한 질의(2106)에 대한 프리-폼 응답을 수신하도록 구성된 필드(2110)를 제공한다. 21 shows example screenshots for generating a list of selections in accordance with various embodiments. In FIG. 21, screenshot 2102 provides an example of a query 2106 presented to an audience member to obtain user feedback for the word “Skrilla”. As shown, query 2106 provides an exemplary context 2108 in which the word “Skrilla” is used, and also provides a field 2110 configured to receive a pre-form response to query 2106.

일부 실시예에 따라, 특정 단어 또는 구절에 대한 사용자 피드백을 얻기 위해 사용된 선택-폼 응답 방법은, 그 특정 단어 또는 구절에 대해 수집된 프리-폼 응답으로부터 선택된 미리 정의된 응답의 리스트를 포함할 수 있다. 이에 따라, 다양한 관중 멤버들은 필드(2110)(예, “a lot of momey”, “Cash”, “Money” 및 “Really Rich”) 를 통해 단어 “Skrilla”에 대해 프리-폼 응답을 제공하고, 수집된 응답은, 스크린샷(2104)에 도시된 바와 같이, 선택-폼 응답(2112)의 리스트를 생성하는데 유용할 수 있다. According to some embodiments, a select-form response method used to obtain user feedback for a particular word or phrase includes a list of predefined responses selected from pre-form responses collected for that particular word or phrase. Can be. Accordingly, various spectator members provide a pre-form response to the word “Skrilla” through the field 2110 (eg, “a lot of momey”, “Cash”, “Money” and “Really Rich”), The collected responses can be useful for generating a list of select-form responses 2112, as shown in screenshot 2104.

도 22는 다양한 실시예에 따른 예시적인 인센티브 알림을 나타낸다. 도 22에서, 스크린샷(2200)은, 단어 “Skrilla”에 대한 관중 멤버의 응답인 “momey”의 승인을 그 관중 멤버에게 알리고, 승인된 응답에 대해 보상된 인센티브(예, XXXX 골드)를 그 관중 멤버에게 알리는 알림의 예를 나타낸다. 스크린샷(2202)은, 단어 “Skrilla”에 대한 관중 멤버의 응답인 “money”의 거절을 그 관중 멤버에게 알리는 알림의 예를 나타낸다. 스크린샷(2204)은, 관중 멤버의 응답의 승인을 관중 멤버에게 알리는 푸시 알림의 예를 나타낸다. 22 illustrates exemplary incentive notifications in accordance with various embodiments. In FIG. 22, the screenshot 2200 informs the spectator member of the approval of the spectator member's response to the word “Skrilla”, “momey”, and rewards the incentive (eg, XXXX Gold) rewarded for the approved response. Here is an example of a notification to an audience member. The screenshot 2202 shows an example of a notification informing the audience member of the rejection of “money”, which is the response of the audience member to the word “Skrilla”. Screenshot 2204 shows an example of a push notification informing an audience member of the approval of an audience member's response.

도 23은, 다양한 실시예에 따른, 번역이 채팅 클라이언트 시스템 사이에서 실패하였을 때의 예를 나타내는 스크린샷을 도시한다. 도 23에서, 스크린샷(2300)은, 제 1 채팅 클라이언트 시스템의 예시적인 인터페이스를 나타내고, 스크린샷(2302)은, 제 2 채팅 클라이언트 시스템의 예시적인 인터페이스를 나타낸다. 이중 화살표(2304)는, 제 1, 2 채팅 클라이언트 시스템 사이의 채팅 통신을 나타낸다. 도시된 바와 같이, 채팅 사용자 “Aramis”가 제 1 채팅 클라이언트 시스템의 인터페이스에 영어로 채팅 통신을 입력하면, 그 입력된 채팅 통신은 불어로 번역되고 채팅 사용자 “tapir”의 제 2 채팅 클라이언트 시스템의 인터페이스에 표시된다. 마찬가지로, 채팅 사용자 “tapir”가 제 2 채팅 클라이언트 시스템의 인터페이스에 불어로 채팅 통신을 입력하면, 그 입력된 채팅 통신은 영어로 번역되고 채팅 사용자 “Aramis”의 제 1 채팅 클라이언트 시스템의 인터페이스에 표시된다. 23 shows a screenshot showing an example of when translation fails between chat client systems, according to various embodiments. In FIG. 23, screenshot 2300 represents an exemplary interface of the first chat client system, and screenshot 2302 represents an exemplary interface of the second chat client system. The double arrow 2304 indicates chat communication between the first and second chat client systems. As shown, when the chat user “Aramis” enters chat communication in English into the interface of the first chat client system, the entered chat communication is translated into French and the interface of the second chat client system of the chat user “tapir” Is displayed on. Similarly, when the chat user “tapir” enters chat communication in French into the interface of the second chat client system, the entered chat communication is translated into English and displayed on the interface of the first chat client system of the chat user “Aramis”. .

도 23에 도시된 바와 같이, 채팅 사용자 “Aramis”에 의해 제 1 채팅 클라이언트 시스템의 인터페이스에 입력된 채팅 통신(2306)(즉, “Tru dat bro?”)은, 채팅 사용자 “tapir”의 제 2 채팅 클라이언트 시스템의 인터페이스로 전송될 때 번역에 실패한다. 채팅 사용자 “tapir”에게 제시된 채팅 통신(2308)(즉, “Tru dat bro?”)은, 채팅 사용자 “Aramis”에 의해 입력된 원본 채팅 통신을 제시하고 채팅 사용자 “tapir”에게 채팅 통신은 채팅 사용자 “Aramis”에 의해 입력된 원본 채팅 메시지라는 것을 표시함으로써, 번역 실패를 반영한다. As shown in FIG. 23, the chat communication 2306 (ie, “Tru dat bro?”) Input to the interface of the first chat client system by the chat user “Aramis” is the second of the chat user “tapir”. Translation fails when sent to the interface of the chat client system. The chat communication 2308 presented to the chat user “tapir” (ie “Tru dat bro?”) Presents the original chat communication entered by the chat user “Aramis” and the chat communication to the chat user “tapir” is the chat user By indicating that it is the original chat message entered by "Aramis", it reflects the translation failure.

도 23에 설명된 번역 실패는, 일부 실시예에 따라 사용자 피드백으로부터 이득을 볼 수 있는 것이다. 일부 실시예에 따라, 도 23에 설명된 번역 실패는, 번역 실패 관리 모듈(1314)에 의해 확인될 수 있고, 원본 채팅 통신(2306)(즉, “Tru dat bro?”)로부터의 하나 또는 그 이상의 단어는, 참여하는 관중 멤버로부터의 사용자 피드백을 위한 미래의 선택을 위해, 미정의된 단어/구절 데이터 저장부(1602)에 추가될 수 있다. 예를 들어, 각 단어 “Tru”, “dat”, “bro”는, 각각에 대한 미래의 사용자 피드백을 위해 미정의된 단어/구절 데이터 저장부(1602)에 추가될 수 있다.The translation failure described in FIG. 23 may benefit from user feedback in accordance with some embodiments. According to some embodiments, the translation failure described in FIG. 23 may be identified by the translation failure management module 1314, one or more from the original chat communication 2306 (ie, “Tru dat bro?”) The above words may be added to the undefined word / phrase data storage unit 1602 for future selection for user feedback from participating audience members. For example, each word “Tru”, “dat”, and “bro” may be added to the undefined word / phrase data storage unit 1602 for future user feedback on each.

도 24 및 도 25는, 다양한 실시예에 따른, 사용자 피드백에 이용 가능한 단어 또는 구절의 예시적인 리스트를 설명하는 스크린샷을 나타낸다. 도 24에서, 스크린샷(2400)은, 관중 멤버에 의해 선택이 가능한 단어 및 구절에 대해(“Tru”를 포함하는), 관중 멤버가 이용 가능한, 사용자 피드백 기회들의 리스트(2402)를 제시한다. 도 25에서, 스크린샷(2500)은, 관중 멤버에 의해 선택이 가능한 구절들에 대해, 관중 멤버 선택에 이용 가능한, 사용자 피드백 기회들의 다른 리스트(2502)를 제시한다. 도 24 및 도 25에서, 스크린샷(2400 및 2500)은, 게임 플레이어들이 특정 단어 또는 구절에 대한 사용자 피드백을 제공할 수 있는 인-게임 채팅 시스템의 부분일 수 있고, 사용자 피드백이 승인되자마자, 인-게임 크레딧(예, 인-게임 골드)이 주어질 수 있다. 24 and 25 illustrate screenshots illustrating an exemplary list of words or phrases available for user feedback, according to various embodiments. In FIG. 24, screenshot 2400 presents a list 2402 of user feedback opportunities available to the audience member, for words and phrases (including “Tru”) selectable by the audience member. In FIG. 25, screenshot 2500 presents another list 2502 of user feedback opportunities, available for audience member selection, for verses selectable by the audience member. 24 and 25, screenshots 2400 and 2500 can be part of an in-game chat system that allows game players to provide user feedback for a particular word or phrase, and as soon as the user feedback is approved, In-game credits (eg in-game gold) can be given.

도 26은, 다양한 실시예에 따른, 단어를 정의하는 예를 설명하는 스크릿샷을 나타낸다. 도 26에서, 스크린샷(2600)은, 단어 “Tru”가 사용되는 예시적인 문맥(2604)을 포함하고, 또한 질의(2602)에 대해 프리-폼 응답을 수신하도록 구성된 필드(2606)를 제공하는 그 질의(2602)를 제시한다. 26 is a screen shot illustrating an example of defining a word according to various embodiments. In FIG. 26, screenshot 2600 includes an example context 2604 in which the word “Tru” is used, and also provides a field 2606 configured to receive a pre-form response to query 2602. The query 2602 is presented.

도 27은, 다양한 실시예에 따른 선택-폼 응답의 예시적인 리스트를 설명하는 스크린샷을 나타낸다. 도 27에서, 스크린샷(2700)은, 관중 멤버가 단어 “nemor”를 정의하기 위해 선택할 수 있는 응답들의 리스트(2702)를 제시한다.27 shows a screenshot illustrating an exemplary list of select-form responses in accordance with various embodiments. In FIG. 27, screenshot 2700 presents a list 2702 of responses the audience member can select to define the word “nemor”.

도 28은, 다양한 실시예에 따라 제출된 응답에 대한 상태의 예시적인 리스트를 설명하는 스크린샷을 나타낸다. 도 28에 도시된 바와 같이, 제출된 응답 상태의 리스트(2802)는, 제 1 응답(2804)에 대한 계류(pending) 중인 상태, 그리고 제 2, 3 응답(2806, 2808)에 대한 승인된(approved) 상태들을 포함한다. 일부 실시예에서, 리스트(2802)는, 예를 들어, 특정 응답이 승인된 이유, 또는 아직 검토 중인 것으로 계류된 이유를 포함하는 응답 상태에 대한 추가적인 정보를 제공할 수 있다.28 shows a screenshot illustrating an exemplary list of statuses for responses submitted in accordance with various embodiments. As shown in FIG. 28, the list 2802 of the submitted response status is pending status for the first response 2804, and the approved status for the second and third responses 2806 and 2808 ( approved) states. In some embodiments, the list 2802 can provide additional information about the status of the response, including, for example, why the particular response was approved, or why it was pending pending review.

도 29는, 다양한 실시예에 따른 예시적인 인센티브 알림을 설명하는 스크린샷을 나타낸다. 특히, 도 29는, 일 실시예에 따른 사용자 피드백 프로세스 동안에 구절 “U still thr”를 “You still there?”로 정확하게 정의한 멤버에게 예시적인 알림(2902)을 제시하는 스크린샷(2900)을 제공한다. 그 알림은, 제공된 사용자 피드백에 대한 인센티브로서, 그는 인-게임 통화에서 가치가 있거나 유용한 10개의 골드를 보상받을 것을 알린다. 도시된 바와 같이, 알림은 또는 사용자 피드백의 요약(즉, 질문인 단어 또는 구절, 및 제공된 사용자 피드백 응답)을 제공한다. 29 shows screenshots illustrating exemplary incentive notifications in accordance with various embodiments. In particular, FIG. 29 provides a screenshot 2900 presenting an exemplary notification 2902 to a member who correctly defined the phrase “U still thr” as “You still there?” During the user feedback process according to one embodiment. . The notification, as an incentive for the user feedback provided, informs him that he will be rewarded with 10 valuable or useful gold in an in-game currency. As shown, the notification also provides a summary of user feedback (ie, the word or phrase being a question, and the user feedback response provided).

특정 실시예에서, 사용자로부터 얻게 되는 피드백이 플레이어 일관성을 위해 체크하고 균형을 유지하는 기계 학습 시스템에 의해 자동화되는, 데이터 선택을 위한 학습 시스템이 제공된다. 그 시스템은, 플레이어로부터 수신되는 병렬 문장을, 가끔 통계적 기계 번역(SMT) 시스템을 다시 훈련하는데 사용될 수 있는 병렬 말뭉치(parallel corpora)에 추가한다. In certain embodiments, a learning system for data selection is provided in which feedback from the user is automated by a machine learning system that checks and balances for player consistency. The system adds parallel sentences received from the player to a parallel corpora that can sometimes be used to retrain a statistical machine translation (SMT) system.

채팅 변환 시스템은, 채팅용어를 분명한 용어(plain speak)로 변환하는 시스템일 수 있고, 또는 그 시스템을 포함할 수 있다. 예를 들어, 채팅 변환은, “U r da king”(채팅용어 메시지)를 “You are the king”(분명한 용어 메시지)로 변환할 수 있다. 특정 실시예에서, “분명한 용어”(plain speak)는, 채팅용어가 지배적인 전자 채팅 환경의 밖에서 일반적으로 보통의 개인에 의해 말해지고 및/또는 쓰여지는 보통의 언어를 나타낸다. 분명한 용어는 채팅용어보다 더 문법에 맞는 경향이 있다. The chat conversion system may be a system that converts chat terms into plain speak, or may include the system. For example, the chat conversion may convert “U r da king” (chat term message) to “You are the king” (obvious term message). In certain embodiments, “plain speak” refers to an ordinary language spoken and / or written by an ordinary person outside the electronic chat environment in which the chat term is dominant. Clear terms tend to be more grammatical than chat terms.

학습 시스템은 또한, 한 언어를 다른 언어로 번역하는 언어 번역 시스템을 이용하거나 포함할 수 있다. 예를 들어, 언어 번역은, “How are you doing kind sir” (영어 메시지) 를 “¿Como te va amable senor” (스페인어 메시지)로 번역할 수 있다. The learning system may also use or include a language translation system that translates one language into another. For example, a language translation can translate “How are you doing kind sir” (English message) to “¿Como te va amable senor” (Spanish message).

일부 실시예에서, “병렬 말뭉치”는, 하나의 텍스트의 n 라인이 제 2 번역된 텍스트의 n 라인에 대응하도록, 각 언어로 평행하게 배열된 두 개의 텍스트를 의미하는 것으로 이해될 수 있다. 병렬 말뭉치는 또한, 그러한 문맥에서 “훈련 말뭉치”(training corpora)로 참조될 수 있다. In some embodiments, “parallel corpus” can be understood to mean two texts arranged in parallel in each language such that n lines of one text correspond to n lines of the second translated text. Parallel corpus can also be referred to in this context as “training corpora”.

다양한 실시예에서, “기계 학습”은, 입력 데이터의 패턴으로부터 학습할 수 있고, 발견, 변환 또는 행동 예측을 위한 메커니즘을 개발할 수 있는, 지도 학습, 준지도 학습 또는 자율 학습으로 참조되는 것으로 이해될 수 있다.In various embodiments, “machine learning” will be understood as being referred to as supervised learning, semi-supervised learning or autonomous learning, which can learn from patterns of input data and develop mechanisms for discovery, transformation or behavior prediction. Can be.

일반적으로, 채팅 변환 시스템 및 언어 번역 시스템의 구축은, 통계적 학습을 위한 적당한 양의 구문론 법칙 또는 많은 양의 병렬 말뭉치를 요구한다. 여기에 설명된 시스템 및 방법은, 병렬 말뭉치를 사용하는 믿을 수 있는 통계적 채팅 변환 및 언어 번역 시스템을 일반적으로 이용한다. 그러나, 특정 상황에서, 이러한 초기 훈련 데이터는, 컨텐츠 및 범위에서 제한이 있을 수 있다. 예를 들어, 새로운 채팅 단어는, 매일 채팅방에 생성되고 추가될 수 있다. 정확하고 믿을 수 있는 변환 및 번역 시스템을 유지하기 위해, 이러한 새로운 채팅 단어는, 채팅 변환 훈련 말뭉치에 증가되어야 한다. In general, the construction of a chat conversion system and a language translation system requires an appropriate amount of syntactic laws or a large amount of parallel corpus for statistical learning. The systems and methods described herein generally use a reliable statistical chat transformation and language translation system using parallel corpus. However, in certain situations, such initial training data may be limited in content and scope. For example, new chat words can be created and added to the chat room every day. In order to maintain an accurate and reliable conversion and translation system, these new chat words must be increased in the chat conversion training corpus.

다양한 실시예에서, 미등록 어휘(Out of Vocabulary)(OOV)인 단어(예, 특정 사전(lexicon)에 제시되어 있지 않은 단어들)를 식별하는 시스템 및 방법이 제공된다. 도 30을 참조하면, 일부 실시예에서 OOV 단어들을 검출하고 처리하기 위한 방법(3000)이 제공된다. 단계(3002)에서, OOV 단어들은 CTT 시스템(114) 또는 CCT 시스템(1300) 및/또는 그들의 하나 또는 그 이상의 모듈과 같은 번역기 시스템을 통해 단어들을 전송함으로써 초기에 검출된다. 번역기 시스템으로부터의 출력이, 특정 단어에 대한 입력과 동일할 때, 번역기 시스템은 변환성의 부족을 표시하고, 단어가 OOV일 수 있음을 제안한다. 단어가 잠재적인 OOV 단어임을 더 평가하기 위해, 시스템 및 방법은, 단어가 철자가 틀린 단어가 아니라 새로운 단어인지 결정할 수 있고(단계 3004), 두 가지 경우 모두 OOV로서 나타날 수 있다. 이에 따라, 맞춤법 체크기로 수정될 수 있는 단어는, OOV 단어라기 보다는 철자가 틀린 단어로 고려될 수 있다. In various embodiments, systems and methods are provided for identifying words that are out of vocabulary (OOV) (eg, words that are not presented in a particular lexicon). Referring to FIG. 30, in some embodiments, a method 3000 for detecting and processing OOV words is provided. In step 3002, OOV words are initially detected by sending words through a translator system such as CTT system 114 or CCT system 1300 and / or one or more modules thereof. When the output from the translator system is the same as the input for a particular word, the translator system indicates a lack of translatability and suggests that the word can be OOV. To further evaluate that a word is a potential OOV word, the system and method can determine if the word is a new word rather than a misspelled word (step 3004), and in both cases may appear as an OOV. Accordingly, a word that can be corrected with a spell checker may be considered as a misspelled word rather than an OOV word.

추가적으로, 채팅에서 자주 나타나는 OOV 단어는, OOV 단어가 되는 경향이 더 높다(예, 새로운 채팅용어 단어). 예를 들어, 이전 텍스트 메시지에서 사용자에 의해 단어가 사용되었을 때, 그러한 이전 사용은, 그 단어가 OOV 단어일 수 있다는 것을 제안한다. 일부 실시예에서, 기계 학습 및 언어 처리 방법의 총체는, 단어가 OOV 단어인지 여부를 검출하는데 병렬적으로 사용된다(단계 3006).Additionally, OOV words that appear frequently in chat tend to be more OOV words (eg, new chat term words). For example, when a word was used by a user in a previous text message, such previous use suggests that the word may be an OOV word. In some embodiments, a total of machine learning and language processing methods are used in parallel to detect whether a word is an OOV word (step 3006).

추가적으로 또는 대안으로, 베이지안 확률(Bayesian probability)이, OOV가 철자가 틀린 단어라기 보다는 새로운 단어일 때의 통계적 확률을 제공하기 위해 계산될 수 있다(단계 3008). 진정한 채팅용어 단어는, 보통으로 사용된 특정 단어의 이전 및 이후에 사용되는 경향이 있다. 이에 비해 철자 오류는, 인접한 단어와 더 적은 일관된 분포를 가질 것이다. 이전 및 이후의 베이지안 확률을 계산하는 것은, 사전에 추가될 수 있는 유용한 OOV 단어와 사전(lexicon)에 추가되어서는 안 되는 철자 오류를 구분하는데 도움을 줄 수 있다. 예를 들어, 구절 “Wassup, how’s it going.”를 고려할 때, “Wassup”은 표준 사전(lexicon)에 있지 않을 때 OOV 단어로 고려된다. 그러나, “Wassup”은, 거의 항상 “How’s it going” 단어가 뒤따르거나, 문장의 시작에서 주로 사용된다. 이러한 패턴 또는 일관된 행동은, 베이지안 확률에 의해 포착된다. 시스템은, 철자가 틀린 단어를 포함하지만 채팅 단어는 포함하지 않는 텍스트에 대해 훈련된다. Additionally or alternatively, the Bayesian probability can be calculated to provide a statistical probability when the OOV is a new word rather than a misspelled word (step 3008). True chat terminology tends to be used before and after certain commonly used words. In comparison, spelling errors will have less consistent distribution with adjacent words. Calculating Bayesian probabilities before and after can help distinguish between useful OOV words that can be added to a dictionary and spelling errors that should not be added to a lexicon. For example, when considering the phrase “Wassup, how ’s it going.”, “Wassup” is considered an OOV word when it is not in the standard lexicon. However, “Wassup” is almost always followed by the word “How ’s it going”, or is usually used at the beginning of a sentence. This pattern or consistent behavior is captured by Bayesian probability. The system is trained on text that contains misspelled words but not chat words.

대안으로, 또는 추가적으로, k-means 클러스터링(clustering)과 같은 기계 학습 방법이, 새로운 단어, 철자가 틀린 단어, 또는 정크(junk)와 같은 서로 다른 종류의 OOV 단어들을 구분하는데 사용될 수 있다. k-means 클러스터링은, 단어들의 등급들 사이의 잠재적인 유사도를 끌어내는 경향이 있다. 유사 주제에 속하는 단어들은, 같이 무리지어지는 경향이 있고, 그들 사이의 잠재적인 유의어(synonymous)의 관계를 나타낸다. “Wassup, how’s it going”의 예를 다시 고려하자. k-means 알고리즘을 사용하여 문장들의 그룹을 클러스터링하면, “Hi,” “What’s up,” “Hello,” “Hi!,” 등의 인사 단어의 클러스터(cluster)는, 그 클러스터 내에서, 그들 사이에 포함되는 “Wassup”을 드러낸다. 반대로, 철자 오류는, 클러스트의 주변부에 배치되고, 또는 임의의 정의된 클러스터에 전혀 배치되지 않는다. 이러한 잠재적인 관계는, 오류로부터 유용한 OOV 단어를 구분하는데 도움을 준다. 문장의 구문론 및 의미론은, 문장이 어느 종류의 OOV 단어(예, 동사, 명사, 또는 형용사)를 포함하는지를 결정하기 위해 분석될 수 있다.Alternatively, or additionally, machine learning methods such as k-means clustering can be used to distinguish new words, misspelled words, or different kinds of OOV words such as junk. k-means clustering tends to elicit potential similarities between classes of words. Words belonging to similar subjects tend to be grouped together, indicating a potential synonymous relationship between them. Consider again the example of “Wassup, how ’s it going”. When a group of sentences is clustered using the k-means algorithm, a cluster of greeting words such as “Hi,” “What's up,” “Hello,” “Hi !,” etc., within the cluster, between them Reveals "Wassup" included in Conversely, spelling errors are placed at the periphery of the cluster, or not at all in any defined cluster. This potential relationship helps to distinguish useful OOV words from errors. The syntax and semantics of a sentence can be analyzed to determine what kind of OOV word (eg, verb, noun, or adjective) the sentence contains.

시스템 및 방법이 새로운 채팅 단어 또는 다른 OOV 단어를 검출할 때, 새로운 채팅 단어는, 새로운 채팅 단어의 채팅 변환된 버전 또는 언어 번역된 버전을 정의하기 위해 인간 번역가에게 제시될 수 있다. 새로운 채팅 단어의 변환된 또는 번역된 버전은, 번역 사전에 추가될 수 있고, 여기에 설명된 시스템 및 방법에 의해 사용될 수 있다.When the system and method detect a new chat word or other OOV word, the new chat word can be presented to a human translator to define a chat translated version or a language translated version of the new chat word. Translated or translated versions of new chat words can be added to the translation dictionary and used by the systems and methods described herein.

여기에 설명한 바와 같이, 인센티브가 언어들 간의 채팅의 수동 번역에 대해 제공될 때, 사용자들이 인센티브(예, 인-게임 통화)의 이득을 얻고자 시스템을 악의적으로 조종할 가능성이 있다. 여기에 설명된 시스템 및 방법은, 일반적으로 악의적인 제출을 검출할 수 있는 인간-번역가 능력에 관대하다. As described herein, when incentives are provided for manual translation of chats between languages, there is a possibility that users will manipulate the system maliciously to benefit from incentives (eg, in-game currency). The systems and methods described herein are generally tolerant of human-translator's ability to detect malicious submissions.

시스템의 사용자가 번역가로서 행동할 때, 사용자는 하나 또는 그 이상의 단어 또는 문장을 특정된 타겟 언어로 번역한다. 그러나, 사용자가 시스템에서 내기를 하기 위해 또는 시스템의 목적을 이행하지 않고 인센티브를 얻을 목적으로, 거짓, 불완전한, 또는 부도덕한 번역을 제공할 때, 사기를 칠 수 있다. When a user of the system acts as a translator, the user translates one or more words or sentences into a specified target language. However, fraud can be fraudulent when a user provides a false, incomplete, or immoral translation for the purpose of betting on the system or for obtaining incentives without fulfilling the purpose of the system.

도 31A를 참조하면, 특정 실시예에서, 여기에 설명된 시스템 및 방법은, 사기 검출 모듈(3100)을 이용한다. 사기 검출 모듈(3100)은, 사용자들에게 새로운 및 오래된 훈련 데이터(예, 병렬 말뭉치)를 제공함으로써 인센티브화된 번역에서의 사기를 검출할 수 있다. 오래된 훈련 데이터는 정확한 정답이 알려져 있는 번역에 대응한다. 반면, 새로운 훈련 데이터는 정확한 정답이 알려져 있지 않은 번역에 대응한다. 오래된 데이터에 대비한 새로운 데이터의 비율(퍼센트)은, 시간이 흐르면서 사용자에게 변할 수 있다. 예를 들어, 오래된 데이터는 초기에 더 많이 제공될 수 있고 그리고 나서 점진적으로 퍼센트에서 감소할 수 있다. Referring to Figure 31A, in certain embodiments, the systems and methods described herein utilize fraud detection module 3100. The fraud detection module 3100 can detect fraud in incentivized translations by providing users with new and old training data (eg, parallel corpus). Old training data corresponds to a translation for which the correct answer is known. On the other hand, the new training data corresponds to a translation for which an exact correct answer is unknown. The percentage (%) of new data versus old data can change over time. For example, older data may be provided more initially and then gradually reduced in percentage.

일부 실시예에서, 사기 검출은, 사용자로부터 수신된 오래된 데이터 번역들의 정확도를 체크함으로써 수행될 수 있다. 신뢰도 점수는 이러한 정확도에 기초하여 각 사용자에게 배정된다. 번역 정확도에서의 큰 또는 갑작스런 변화 또는 지속적인 낮은 정확도는, 사용자의 사기 또는 낮은 번역 능력을 나타낸다. 번역가의 능력에서의 신뢰도를 형성한 후에도, 오래된 데이터는, 주기적인 사기 체크를 위해 시간의 적어도 10-20%는 랜덤하게 도입되는 것이 바람직하다.In some embodiments, fraud detection may be performed by checking the accuracy of old data translations received from the user. Reliability scores are assigned to each user based on this accuracy. A large or sudden change in translation accuracy or constant low accuracy indicates a user's fraud or low translation ability. Even after establishing confidence in the translator's ability, it is desirable that old data is randomly introduced at least 10-20% of the time for periodic fraud checks.

이러한 기본 구조를 사용하여, 사기 검출 모듈(3100)은, 지도(supervised) 사기 검출 모듈(3102) 및/또는 자율(unsupervised) 사기 검출 모듈(3104)을 포함하거나 및/또는 이용할 수 있다. 지도 사기 검출 모듈(3102)을 통해, 보고 툴은, 예를 들어, 각 사용자로부터의 출력을 사용자 인터페이스에 제시할 수 있다. 그리고, 그 사용자 인터페이스는 다음의 필드를 갖는다: 제시된 입력 문장, 얻어진 번역, 현재 알려진 진실한 번역, 사용자의 현재 신뢰도 점수, 및 시간에 따른 번역기의 신뢰도 점수의 변동을 보여주는 그래프. 번역이 리뷰될 때, 인간 감독자는, 번역을 받아들이거나 거절할 수 있고, 이에 따라 사용자의 신뢰도 점수를 조정할 수 있다. 감독자는, 누적된 보고가 사기적인 행동을 보여주면, 사용자를 제거할 수 있다(즉, 사용자의 번역 특전을 취소). 사용자의 제거 또는 사용자의 번역 특전의 취소는, 번역 특전 모듈(3106)을 이용하여 수행될 수 있다.Using this basic structure, the fraud detection module 3100 may include and / or use a supervised fraud detection module 3102 and / or an unsupervised fraud detection module 3104. Through the map fraud detection module 3102, the reporting tool may, for example, present output from each user to the user interface. Then, the user interface has the following fields: a graph showing the proposed input sentence, the obtained translation, the currently known true translation, the user's current confidence score, and the change in the translator's confidence score over time. When the translation is reviewed, the human supervisor can accept or reject the translation and adjust the user's confidence score accordingly. The supervisor can remove the user if the accumulated report shows fraudulent behavior (ie, cancel the user's translation privilege). The removal of the user or cancellation of the translation privilege of the user may be performed using the translation privilege module 3106.

대안으로, 또는 추가적으로, 사기 검출 모듈(3100)은, 자율 사기 검출 모듈(3104)을 이용할 수 있다. 자율 사기 검출 모듈(3104)을 통해, 번역의 정확도는, 단어 에러율(WEB) 및 BLEU(기계 번역을 우수한 퀄리티의 참조 번역과 비교하는 기계 번역 정확도 메트릭)와 같은 다양한 메트릭스를 이용하여 계산될 수 있다. 사용자 번역 능력에서의 신뢰도는 변화 또는 변동(예, 상승 또는 하강)이 체크될 수 있다. 어느 한 번역가에 제시된 유사한 문장은, 시스템을 이용하는 다른 독립된 번역가에 제시될 수 있다. 인터넷 번역가 신뢰도 또한 계산될 수 있다. 예를 들어, 랜덤 샘플링, 소셜 네트워킹 분석(예, 두 번역가가 사회적으로 연결되지 않거나 또는 기존 존재하는 관계를 갖지 않는 것을 확인하는 것)을 통해, 및 채팅 세션 및/또는 온라인 게임에서 사용자 간의 반복적인 상호작용을 검출함으로써, 번역가 사이에 공모를 방지할 수 있다. 온라인(예, 온라인 게임 또는 채팅 세션)에서 함께 정기적으로 상호작용하는 두 사용자는, 공모에 가담하기 더 쉽다. 일부 실시예에서, 아이템 응답 이론(예, 심리언어학 및 검정론에서 사용되는 이론)이, 번역가 능력을 이용하여 번역가 신뢰도의 측정을 증가시키기 위해 사용된다. 사기 검출은, 인센티브 메커니즘을 갖는 번역 증강 시스템에서 자율 사기 검출을 하기 위해 아이템 응답 이론을 이용하여 수행될 수 있다. 아이템 응답 이론은, 일관성을 측정하기 위해, 번역가 정확도가 피어들에 관하여 그리고 스스로에 관하여 기간에 따라 측정될 수 있는 방법을 지시한다. 표준으로부터 편차들(deviations from the norm)는, 이 방법으로 식별될 수 있다. 인트라-번역가 능력은 또한, 설정된 기간 이후에 번역가에 다시 동일한 문장을 제시함으로써 계산될 수 있다. 신뢰도(reliability) 및 번역가 신뢰도(translator confidence)에서의 다양한 임계치가 설정될 수 있고, 만약 번역가의 신뢰도가 그러한 임계치 아래로 떨어지면, 번역가는, 번역 특권 모듈(3106)을 이용하여 제거되고 시스템으로부터 차단될 수 있다(예, 사용자의 번역 특권이 취소될 수 있다). 일부 실시예에서, 높은 신뢰도 시스템으로부터의 번역은, 번역 쌍 사전(lexicon)에 추가된다. Alternatively, or additionally, the fraud detection module 3100 can use the autonomous fraud detection module 3104. Through the autonomous fraud detection module 3104, the accuracy of translation can be calculated using various metrics such as word error rate (WEB) and BLEU (machine translation accuracy metric that compares machine translation to a reference reference of good quality). . Reliability in user translation ability can be checked for changes or fluctuations (eg, rising or falling). Similar sentences presented to one translator can be presented to another independent translator using the system. Internet translator reliability can also be calculated. For example, through random sampling, social networking analysis (e.g., verifying that two translators are not socially connected or having no existing relationship), and repetitive between users in chat sessions and / or online games. By detecting interactions, conspiracy among translators can be prevented. Two users who regularly interact together online (eg online games or chat sessions) are more likely to join the competition. In some embodiments, item response theory (eg, the theory used in psycholinguistics and black theory) is used to increase the measure of translator confidence using translator capabilities. Fraud detection can be performed using item response theory to autonomous fraud detection in a translation enhancement system with an incentive mechanism. The item response theory dictates how translator accuracy can be measured over time with respect to peers and with respect to themselves to measure consistency. Deviations from the norm can be identified in this way. Intra-translator capability can also be calculated by presenting the same sentence back to the translator after a set period of time. Various thresholds in reliability and translator confidence can be set, and if the reliability of the translator falls below that threshold, the translator is removed using the translation privilege module 3106 and blocked from the system. (Eg, the user's translation privileges may be revoked). In some embodiments, translations from high reliability systems are added to a translation pair dictionary (lexicon).

도 31B는, 본 발명의 특정 실시예에 따른, 인센티브화된 번역에서의 사기를 검출하는 방법(3110)의 흐름도이다. 방법은, 오래된 훈련 데이터 및 새로운 훈련 데이터의 혼합을 선택하는 단계(단계 3112)를 포함한다. 오래된 훈련 데이터는, 다른 언어로의 정확한 번역이 알려진 하나 또는 그 이상의 오래된 텍스트 메시지를 포함한다. 새로운 훈련 데이터는, 다른 언어로의 정확한 번역이 알려져 있지 않은 하나 또는 그 이상의 새로운 텍스트 메시지를 포함한다. 복수의 각 요청이, 사용자의 클라이언트 디바이스로 다른 시간에 전송된다(단계 3114). 요청은, (i) 사용자에게 오래된 훈련 데이터 및/또는 새로운 훈련 데이터의 번역을 요청 및 (ii) 번역에 대한 인센티브를 포함한다. 특정 요청을 전송한 후에, 그 특정 요청의 오래된 훈련 데이터에 대한 번역이 사용자 디바이스로부터 수신된다(단계 3116). 그 수신된 번역은, 오래된 훈련 데이터에 대한 정확한 번역과 비교된다(단계 3118). 수신된 번역의 정확도는, 비교에 기초하여 결정된다(단계 3120). 다음으로, 신뢰도 점수가, 번역에 기초하여 사용자에 대해 업데이트된다(단계 3122). 신뢰도 점수는, 사용자가 텍스트 메시지의 정확한 번역을 이후의 시간에 다른 언어로 제공할 가능성을 나타낸다. 31B is a flow diagram of a method 3110 for detecting fraud in an incentivized translation, in accordance with certain embodiments of the present invention. The method includes selecting a mixture of old training data and new training data (step 3112). The old training data includes one or more old text messages whose exact translation to another language is known. The new training data includes one or more new text messages whose exact translation to another language is not known. Each of the plurality of requests is sent to the user's client device at different times (step 3114). The request includes (i) requesting the user to translate old training data and / or new training data and (ii) incentives for translation. After sending the specific request, a translation for the old training data of the specific request is received from the user device (step 3116). The received translation is compared to the correct translation for old training data (step 3118). The accuracy of the received translation is determined based on the comparison (step 3120). Next, the confidence score is updated for the user based on the translation (step 3122). The reliability score indicates the likelihood that the user will provide an accurate translation of the text message in another language at a later time.

다양한 실시예에서, 여기에 설명된 시스템 및 방법은, 그룹 음성 채팅 시스템의 사용자에 대해 어느 한 언어의 가청 음성을 다른 언어로 번역하기 위해 음성 번역 또는 음성 인식을 이용한다. 시스템 및 방법은, 채팅용어에 대해 구현될 수 있다. 음성-텍스트(speech-to-text) 변경 시스템은 사용자 채팅용어를 텍스트로 변경하고, 그리고 나서 이 텍스트는 분명한 언어로 변환되며(즉, 비채팅용어) 그리고 외국 언어로 번역되는 채팅용어에 대해 구현될 수 있다. 그리고 나서 최종 변환이, 외국 언어 텍스트-음성 시스템을 통해 엔드 사용자에게 출력되는 외국 채팅 음성을 생산하는데 실행된다. 시스템 및 방법은, 바람직하게 매우 빠른 디코더로 음성 인식 기술 및 통계적 기계 번역 기술을 이용한다. In various embodiments, the systems and methods described herein utilize speech translation or speech recognition to translate audible speech in one language to another language for users of a group speech chat system. The system and method can be implemented for chat terms. The speech-to-text change system converts user chat terms into text, and then the text is translated into a clear language (ie, non-chat term) and implemented for chat terms that are translated into foreign languages. Can be. The final conversion is then performed to produce a foreign chat voice that is output to the end user through a foreign language text-to-speech system. The system and method preferably utilize speech recognition technology and statistical machine translation technology as very fast decoders.

도 32는, 다른 언어를 말하는 사람들의 그룹(3202)이 채팅용어를 이용하여 구두(verbally)로 상호작용하는 것을 허용하는 그룹 채팅 시스템(3200)의 블록도이다. 여기에 설명된 바와 같이, 시스템(3200)는 그룹 채팅 시스템(3200)에 참여하는 사람에 의해 말해지는 언어를 인식할 수 있다. 제 1 사용자(3204)가 제 2 사용자(3206)에게 들을 수 있는 채팅용어 메시지를 전송하기를 희망할 때, 제 1 사용자(3204)는 사용자 입력 디바이스(예, 채팅 클라이언트 시스템에서의 마이크로폰)로 제 1 언어로 가청(audible) 채팅용어 메시지(3208)를 입력한다. 음성 인식 모듈(3210)은, 가청 채팅용어 메시지를 제 1 언어의 채팅용어 텍스트 메시지(3212)로 변경한다. 변환 모듈(3214)이 그 채팅용어 메시지(3212)를 제 1 언어로 된 분명한 용어(예, 비채팅용어) 텍스트 메시지(3216)로 변환하는데 사용된다. 다음으로, 번역 모듈(3218)이 그 분명한 용어 텍스트 메시지(3216)를 제 2 언어(예, 불어)로 된 대응하는 분명한 용어 텍스트 메시지(3220)로 번역하는데 사용된다. 그리고 나서 변환 모듈(3222)이 그 대응하는 분명한 용어 텍스트 메시지(3220)를 제 2 언어로 된 대응하는 채팅용어 텍스트 메시지(3224)로 변환하는데 사용된다. 당업자는, 변환 모듈(3222)은 변환 모듈(3214)과 동일하거나 그 일부를 형성할 수 있음을 인식할 것이다. 그리고 나서, 텍스트-음성 모듈(3226)이 그 대응하는 채팅용어 텍스트 메시지(3224)를 제 2 언어로 된 대응하는 채팅용어 가청 메시지(3228)로 변경하는데 사용된다. 마지막으로, 그 대응하는 채팅용어 가청 메시지(3228)는, 출력 디바이스(예, 제 2 채팅 클라이언트 시스템의 스피커)를 이용하여 제 2 사용자(3206)에게 전달된다. FIG. 32 is a block diagram of a group chat system 3200 that allows groups 3202 of people speaking different languages to verbally interact using chat terms. As described herein, system 3200 may recognize a language spoken by a person participating in group chat system 3200. When the first user 3204 wishes to send an audible chatspeak message to the second user 3206, the first user 3204 may be sent to a user input device (eg, a microphone in a chat client system). Enter audible chat term message 3208 in one language. The speech recognition module 3210 changes the audible chat term message to the first term chat term text message 3212. A conversion module 3214 is used to convert the chatspeak message 3212 to a clear text (eg, non-chatspeak) text message 3216 in the first language. Next, a translation module 3218 is used to translate the clear term text message 3216 into the corresponding clear term text message 3220 in a second language (eg, French). The conversion module 3222 is then used to convert the corresponding clear term text message 3220 to the corresponding chatspeak text message 3224 in a second language. Those skilled in the art will recognize that the conversion module 3222 may be the same as or form part of the conversion module 3214. The text-to-speech module 3226 is then used to change the corresponding chatspeak text message 3224 to the corresponding chatspeak audible message 3228 in the second language. Finally, the corresponding chat term audible message 3228 is delivered to the second user 3206 using an output device (eg, the speaker of the second chat client system).

다양한 실시예에서, 음성 인식 모듈(3210)은, 가청 채팅용어 텍스트 메시지(3208)를 채팅용어 텍스트 메시지(3212)로 변경하는데 은닉 마코프 모델(hidden Markov model), DTW(Dynamic Time Warping) 기반 음성 인식, 및/또는 신경망(neural network)을 활용할 수 있다. 마찬가지로, 텍스트-음성 모듈(3226)은, 그 대응하는 채팅용어 메시지를 대응하는 채팅용어 가청 메시지로 변경하는데 음성 합성을 이용할 수 있다. 음성 합성은, 당업자에 의해 이해될 수 있는, 연결형 합성(concatenative synthesis)(예, 단위 선택 합성(unit selection synthesis), 다이폰 합성(diphone synthesis), 및/또는 도메인-특화 합성(domain-specific synthesis), 포만트 합성(formant synthesis), 조음 합성(articulatory synthesis), HMM 기반 합성(HMM-based synthesis), 및/또는 사인파 합성(sinewave synthesis)을 활용하거나 포함할 수 있다. In various embodiments, the speech recognition module 3210 changes the audible chat term text message 3208 into a chat term text message 3212 to use a hidden Markov model, DTW (Dynamic Time Warping) based speech recognition. , And / or neural networks. Similarly, the text-to-speech module 3226 can use speech synthesis to change the corresponding chatspeak message to the corresponding chatspeak audible message. Speech synthesis is concatenative synthesis (e.g., unit selection synthesis, diphone synthesis, and / or domain-specific synthesis) that can be understood by those skilled in the art. ), Formant synthesis, articulatory synthesis, HMM-based synthesis, and / or sinewave synthesis.

음성 처리 시스템과 같은 것을 생성하는 중요한 측면은, 처리되는 언어에 대한 다중 악센트 및 방언으로부터 음성 샘플을 수집하는 것을 수반한다. 음성 데이터의 본질은, 시스템과 그 시스템이 어드레스하는 도메인의 관련성을 유지하기 위해, 채팅용어 및 각 언어의 분명한 용어 포맷을 포함한다. 여기에 설명된 인센티브화된 피드백 메커니즘은 이러한 음성 샘플을 바꾸는데 사용될 수 있고, 그 음성 샘플은 음성 인식 모듈(3210) 및/또는 텍스트-음성 모듈(3226)을 훈련하는데 차례차례 사용될 수 있다. 도메인 적응 기술(domain adaptation techniques)은 데이터가 희박한 경향이 있는 채팅용어 음성 샘플의 케이스에서 필요할 수 있다. 예를 들어, 게임 도메인(예, 온라인 게임)에서 수집된 음성 데이터는, 풍부하게 이용 가능한 분명한 음성 용어 데이터로 교체될 수 있다. 도메인 적응은, 바람직하게, 특정 언어로 된 채팅-분명한 용어로부터 사소한 음성 변동을 지배하는 룰(예를 들어, 특정 언어로, 채팅용어를 분명한 용어로 변경 또는 분명한 용어를 채팅용어로 변경하는 것을 지배하는 룰)을 식별하는 것을 포함한다. 동등한 채팅용어에서 음성 샘플을 갖지 않는 분명한 용어 문장은, 그 도메인 레벨 룰을 이용하여 채팅용어로 변경될 수 있다. 사용자 피드백 루프는, 음향 모델 도메인을 특수하고 더 정확하게 만드는 레벨로 음향 모델 파라미터(예를 들어, 음성 인식 모듈(3210) 및/또는 텍스트-음성 모듈(3226))를 조율하는데 사용될 수 있다. 예를 들어, 음성 인식 모듈(3210)이 특정 엑센트에 어려움을 지속적으로 갖는 경우, 추가적인 다양한 단어의 가청 샘플이 그 엑센트로 시스템에 제공될 수 있다(예를 들어, 사용자들에 의해). 이는 음성 인식 모듈(3210)이 그 엑센트로 발음된 단어들을 더 잘 인식하는 방법을 배울 수 있도록 도울 것이다. An important aspect of creating something like a speech processing system involves collecting speech samples from multiple accents and dialects for the language being processed. The essence of voice data includes chat terms and clear terminology formats for each language, in order to maintain the relevance of the system and the domain it addresses. The incentivized feedback mechanism described herein can be used to alter this speech sample, which in turn can be used to train the speech recognition module 3210 and / or text-speech module 3226. Domain adaptation techniques may be necessary in cases of chatspeak speech samples where data tends to be sparse. For example, voice data collected in a game domain (eg, online game) can be replaced with clear voice term data that is abundantly available. Domain adaptation preferably governs rules that govern minor voice fluctuations from chat-obvious terms in a particular language (e.g., in certain languages, changing chat terms to clear terms or clear terms to chat terms). Rules). Clear term sentences that do not have speech samples in equivalent chat terms can be changed to chat terms using the domain level rules. The user feedback loop can be used to tune the acoustic model parameters (eg, speech recognition module 3210 and / or text-to-speech module 3226) to a level that makes the acoustic model domain special and more accurate. For example, if the speech recognition module 3210 continues to have difficulty with a particular accent, audible samples of additional various words may be provided to the system with the accent (eg, by users). This will help the speech recognition module 3210 learn how to better recognize words pronounced with the accent.

언급된 바와 같이, 여기에 설명된 시스템 및 방법의 실시예는, 그룹 채팅 환경으로부터의 텍스트 또는 채팅 메시지를 다른 언어로 번역하는데 사용된다. 이러한 번역된 채팅을 보관하는 것은, 다른 언어로 된 많은 수의 텍스트들이 저장소에 유지되도록 한다. As mentioned, embodiments of the systems and methods described herein are used to translate text or chat messages from a group chat environment into other languages. Archiving these translated chats allows a large number of texts in different languages to be kept in the repository.

도 33A를 참조하면, 특정 실시예에서, 사용자에 의한 채팅 히스토리의 검토를 촉진하고 저장 요구사항을 줄이기 위해, 채팅 히스토리 모듈(3300)이, 그 채팅 히스토리가 사용자에 의해 브라우징될 때, 실시간으로 채팅 히스토리들을 번역하는데 사용된다. 채팅 히스토리 모듈(3300)은, 다양한 사용자로부터의 채팅 히스토리를 저장하기 위한 채팅 저장 모듈(3302)(예, 레지스터 또는 다른 저장 디바이스)을 포함한다. 채팅 히스토리 모듈(3300)은 또한, 텍스트 메시지가 다른 언어로 번역되기 전에 및/또는 후에 텍스트 메시지를 변환하는 채팅 히스토리 변환 모듈(3304) 을 포함한다. 예를 들어, 채팅 히스토리 변환 모듈(3304)은, 채팅용어로부터 공식적인 용어 또는 분명한 용어로의 채팅 히스토리 텍스트 메시지의 실시간 변환을 수행한다. 일부 실시예에서, 채팅 히스토리 변환 모듈(3304)은, 변환 모듈(208)과 동일하거나 또는 유사하다. 채팅 히스토리 모듈(3300)은 또한, 채팅 히스토리 텍스트 메시지(예, 공식적인 용어 또는 분명한 용어로 된)를 다른 언어(예, 불어에서 영어로)로의 실시간 번역을 수행하는데 사용될 수 있는 채팅 히스토리 번역 모듈(3306)을 포함한다. 채팅 히스토리 번역 모듈(3306)은, 언어 모듈(206) 및/또는 번역 데이터 저장부(210)와 같은 여기에 설명된 다른 모듈 또는 구성요소일 수 있고 포함할 수 있다. Referring to FIG. 33A, in certain embodiments, to facilitate review of the chat history by the user and to reduce storage requirements, the chat history module 3300 chats in real time when the chat history is browsed by the user. Used to translate histories. The chat history module 3300 includes a chat storage module 3302 (eg, a register or other storage device) for storing chat history from various users. The chat history module 3300 also includes a chat history conversion module 3304 that converts text messages before and / or after the text messages are translated into other languages. For example, the chat history conversion module 3304 performs real-time conversion of chat history text messages from chat terms to formal or explicit terms. In some embodiments, chat history transformation module 3304 is the same or similar to transformation module 208. Chat history module 3300 can also be used to perform real-time translation of chat history text messages (e.g., in official or clear terms) to other languages (e.g., French to English), chat history translation module 3306 ). Chat history translation module 3306 can be and include other modules or components described herein, such as language module 206 and / or translation data storage 210.

사용자가 채팅 히스토리의 검토를 완료하는 경우, 채팅 히스토리 모듈(3300)에 의해 생성된 변환된 및/또는 번역된 텍스트는, 메모리에서 삭제되거나 제거될 수 있다. 이는 시스템 및 방법을 위한 저장 요구사항을 줄인다. 사용자가 나중에 채팅 히스토리를 검토하길 원하는 경우, 채팅 히스토리 모듈(3300)은, 요구될 때, 채팅 히스토리에서 텍스트를 변환하고 번역하는데 다시 사용될 수 있다. When the user completes reviewing the chat history, the converted and / or translated text generated by the chat history module 3300 may be deleted or removed from the memory. This reduces storage requirements for systems and methods. If the user wants to review the chat history later, the chat history module 3300 can be used again to convert and translate text in the chat history, if required.

특정 실시예에서, 채팅 히스토리 모듈(3300)은, 실시간으로 사용자에 대한 채팅 히스토리를 번역한다. 채팅 히스토리 모듈(3300)은, 채팅 세션의 텍스트 메시지의 히스토리를 검토하기 위한 요청을 사용자로부터 수신한다. 채팅 히스토리 모듈(3300)은, 채팅 히스토리 저장 모듈(3302)로부터, 복수의 언어로 된 텍스트 메시지들을 포함하는 텍스트 메시지의 히스토리를 수신한다. 그리고 나서, 채팅 히스토리 변환 모듈(3304) 및 채팅 히스토리 번역 모듈(3306)이, 요구될 때 채팅 히스토리의 초기 부분을 사용자에 의해 사용되는 언어로 변환 및/또는 번역하는데 사용된다. 그 채팅 히스토리의 번역된 제 1 부분을 본 후, 사용자는 채팅 히스토리의 다른 부분을 보는 것을 원할 수 있다. 그리고 나서, 채팅 히스토리 모듈(3300)은, 텍스트 메시지의 히스토리의 다른 부분을 보기 위한 요청을 사용자로부터 수신할 수 있다. 그리고 나서, 채팅 히스토리 변환 모듈(3304) 및 채팅 히스토리 번역 모듈(3306)이, 요구될 때 채팅 히스토리의 다른 부분을 사용자에 의해 사용되는 언어로 변환 및/또는 번역을 하는데 사용된다. 채팅 히스토리 모듈(3300)은 바람직하게 사용자가 채팅 히스토리를 스크롤할 때, 실시간으로 변환 및/또는 번역을 수행한다. In certain embodiments, the chat history module 3300 translates the chat history for the user in real time. The chat history module 3300 receives a request from a user to review the history of a text message in a chat session. The chat history module 3300 receives a history of text messages including text messages in a plurality of languages from the chat history storage module 3302. The chat history conversion module 3304 and chat history translation module 3306 are then used to convert and / or translate the initial portion of the chat history to the language used by the user when required. After viewing the translated first portion of the chat history, the user may want to view another portion of the chat history. Then, the chat history module 3300 may receive a request from the user to view another part of the history of the text message. The chat history conversion module 3304 and chat history translation module 3306 are then used to convert and / or translate other parts of the chat history into the language used by the user when required. The chat history module 3300 preferably converts and / or translates in real time when the user scrolls the chat history.

특정 예에서, 채팅 히스토리의 스크롤링은, 스케일(scale) 및 데이터 저장의 문제를 나타낸다. 채팅 히스토리의 무한 스트롤의 제공은, 다중 데이터베이스 및 다중 사용자를 스패닝(spanning)하는 데이터의 빠른 실시간 접근의 문제점을 나타낸다. 이는 채팅방에 있는 모든 사용자로부터 역사적(historical) 메시지를 가져오는 다중 프로세스를 병렬로 파생(spawning)함으로써 발생할 수 있다. 이러한 메시지에 대한 번역 및 연관된 채팅 변환은, 메시지들을 데이터 저장부로부터 가져올 때 병렬로 수행된다. 엔드 사용자에 의해 실현된, 그 결과 출력은, 데이터베이스로부터의 데이터 검색(lookup)이 이미 행해졌던, 채팅의 한 스크린으로부터 다음 스크린으로 끊김없는 트랜지션의 결과 출력이다. 이는, 여기에 설명된 시스템 및 방법이 데이터 저장부에 어떤 제한을 갖지 않고, 병렬 계산(computation)이 보다 일찍 파생된 프로세스들 사이에서 재활용될 때, 무한한 수의 스크린에 대해 시작할 수 있다. In a particular example, the scrolling of the chat history represents a problem of scale and data storage. The provision of infinite strokes in the chat history represents the problem of fast real-time access to data spanning multiple databases and multiple users. This can happen by spawning multiple processes in parallel that fetch historical messages from all users in the chat room. Translation of these messages and associated chat transformations are performed in parallel when messages are retrieved from the data store. The result output, realized by the end user, is the result output of a seamless transition from one screen of the chat to the next, where data lookup from the database has already been done. This can start for an infinite number of screens when the systems and methods described herein do not have any limitations on data storage, and parallel computation is recycled between derived processes earlier.

도 33B는, 본 발명의 특정 실시예에 따른, 실시간으로 채팅 히스토리를 번역하는 방법(3310)의 순서도이다. 방법(3310)은, 채팅 세션으로부터 텍스트 메시지의 히스토리를 검토하기 위한 요청을 어느 한 사람으로부터 수신하는 과정(단계 3312)을 포함한다. 히스토리는 바람직하게 복수의 언어로 된 그리고 복수의 사용자로부터의 텍스트 메시지들을 포함한다. 적어도 두 개의 병렬 프로세스가 수행된다(단계 3314). 각 병렬 프로세스는, (i) 채팅 세션의 각 사용자에 의해 생성된 텍스트 메시지(예, 그 텍스트 메시지는 텍스트 메시지들의 히스토리의 적어도 일 부분을 형성함)를 수신하거나 선택하는 과정, 및 (ii) 텍스트 메시지를 타겟 언어로 번역하는 과정을 포함한다. 복수의 병렬 프로세스로부터 번역된 텍스트 메시지는, 그 사람의 클라이언트 디바이스로 제공된다(단계 3316). 텍스트 메시지의 히스토리의 다른 부분을 검토하기 위한 요청이 그 사람으로부터 수신된다(단계 3318). 단계(3314 및 3316)은, 텍스트 메시지의 히스토리의 다른 부분에 대해 반복된다. 33B is a flow diagram of a method 3310 for translating chat history in real time, in accordance with certain embodiments of the present invention. The method 3310 includes receiving a request from one person to review the history of the text message from the chat session (step 3312). The history preferably includes text messages in multiple languages and from multiple users. At least two parallel processes are performed (step 3314). Each parallel process includes: (i) receiving or selecting a text message generated by each user of a chat session (eg, the text message forms at least a portion of the history of text messages), and (ii) text. And translating the message into the target language. Text messages translated from multiple parallel processes are provided to the person's client device (step 3316). A request is received from the person to review other parts of the history of the text message (step 3318). Steps 3314 and 3316 are repeated for other parts of the history of the text message.

일부 예에서, 여기에 설명된 시스템 및 방법의 사용자들은, 그룹 채팅 또는 게임 환경에서 특정 다른 사용자들과의 상호작용을 피하기를 원할 수 있다. 이전의 채팅 시스템에서, 채팅 사용자의 금지 및 침묵은 통상적으로 채팅 서버의 관리자 또는 조정자에 의해 처리된다. 그러나 여기에 설명된 시스템 및 방법의 실시예들은, 누가 사용자 채팅 메시지 및/또는 채팅 컨택 초대를 전송할 수 있는지를 사용자가 직접 제어하는 것을 허용한다. 예를 들어, 사용자 A는 사용자 B를 차단하도록 허용될 수 있고, 이로써 사용자 A는 더 이상 어느 채팅방에서 사용자 B의 통신을 보지 않을 수 있고, 및/또는 사용자 A는 더 이상 사용자 B로부터 개인적 채팅 컨택(즉, 단독 채팅) 초대를 수신하지 않을 수 있다. In some examples, users of the systems and methods described herein may want to avoid interaction with certain other users in a group chat or gaming environment. In previous chat systems, the ban and silence of chat users is typically handled by the chat server's administrator or coordinator. However, embodiments of the systems and methods described herein allow the user to directly control who can send user chat messages and / or chat contact invitations. For example, user A may be allowed to block user B, such that user A may no longer see user B's communication in any chat room, and / or user A no longer has a personal chat contact from user B (I.e. exclusive chat) You may not receive an invitation.

다양한 실시예에서, 동맹은 게임에서 게임 플레이를 강화하기 위한 유닛으로서 함께 그룹화할 수 있는 플레이어의 그룹이다. 각 동맹은, 바람직하게, 동맹의 멤버들이 서로 대화를 하고 텍스트 메시지를 전송할 수 있는 단독의 채팅방을 갖는다. 이는 때때로 동맹 채팅방으로부터 특정 사용자를 차단할 필요를 나타낸다. In various embodiments, an alliance is a group of players that can be grouped together as a unit to enhance gameplay in the game. Each alliance preferably has a single chat room where members of the alliance can communicate with each other and send text messages. This sometimes indicates the need to block certain users from allied chat rooms.

도 34A는, 특정 실시예에 따른, 게임 시스템의 제 1 사용자가 게임 시스템의 제 2 사용자로부터의 통신을 차단할 수 있는 사용자 인터페이스(3400)의 스크린샷을 포함한다. 도시된 바와 같이, 제 1 사용자는, 설정창(3404)을 오픈하는 관리 설정 아이콘(3402)를 선택한다(예를 들어, 터치 스크린을 탭핑함으로써). 그리고 나서, 제 1 사용자는 설정창(3404)에서 “동맹으로부터 차단(block from alliance)” 버튼(3406)을 선택한다. 제 2 사용자로부터의 향후의 통신은 차단될 것이라는 것을 알리기 위한 메시지창(3408)이 제 1 사용자에게 나타난다. 제 2 사용자는 제 1 사용자에 의해 차단된 다른 사용자들의 리스트에 추가될 수 있다. 제 1 사용자는 그 리스트에서 사용자를 추가하거나 제거하기 위한 이 리스트의 편집에 관한 옵션을 가질 수 있다. 예를 들어, 도 34B를 참조하면, 다음에 사용자가 관리 설정 아이콘(3402)을 선택하면, 설정창(3404)은 동맹으로부터 차단 해제(unblock from alliance) 버튼(3410)을 포함한다. 제 1 사용자가 통맹으로부터 차단 해제 버튼(3410)을 선택할 때, 제 2 사용자로부터의 향후의 통신은 차단되지 않을 수 있고, 메시지창(3412)은 그러한 통신이 차단되지 않았다는 것을 제 1 사용자에게 알리기 위해 나타난다. 34A includes a screenshot of a user interface 3400 that allows a first user of a gaming system to block communication from a second user of the gaming system, according to certain embodiments. As shown, the first user selects the management settings icon 3402 that opens the settings window 3404 (eg, by tapping the touch screen). Then, the first user selects the "block from alliance" button 3406 in the setting window 3404. A message window 3408 appears to inform the first user that future communication from the second user will be blocked. The second user can be added to the list of other users blocked by the first user. The first user can have the option of editing this list to add or remove users from the list. For example, referring to FIG. 34B, the next time the user selects the management settings icon 3402, the settings window 3404 includes an unblock from alliance button 3410. When the first user selects the unblock button 3410 from the federation, future communication from the second user may not be blocked, and a message window 3412 may inform the first user that such communication is not blocked appear.

일부 실시예에서, 시스템의 복잡성은, 차단 및 차단 해제가 실행되는 스케일에 의해 증가하거나 감소된다. 병렬 계산은, 통상적인 시스템에서 나타나는 시간 지연의 불이익 없이, 실시간으로 차단 및 차단 해제를 실행하는 유연성을 제공할 수 있다. 예를 들어, 병렬 처리는, 텍스트 메시지 채팅 시스템에서 텍스트 메시지를 번역하고 및/또는 변환하는데 사용될 수 있다. 분리된 병렬 처리는, 채팅 세션의 각 사용자에게 배치되거나 및/또는 그 채팅 세션에서 사용되는 각 언어에 배치될 수 있다. 이러한 병렬 처리는, 사용자의 차단 및 차단 해제의 태스크를 단순화할 수 있다. 예를 들어, 구분된 병렬 처리는, 사용자들이 차단되거나 또는 차단 해제될 때 각각 채팅 시스템으로부터 제거되거나 추가될 수 있다. In some embodiments, the complexity of the system is increased or decreased by the scale at which blocking and unblocking is performed. Parallel computation can provide the flexibility to perform blocking and unblocking in real time without the penalty of time delays seen in conventional systems. For example, parallel processing can be used to translate and / or translate text messages in a text message chat system. Separate parallel processing can be deployed to each user of the chat session and / or to each language used in the chat session. Such parallel processing can simplify the task of blocking and unblocking the user. For example, separate parallel processing can be added or removed from the chat system, respectively, when users are blocked or unblocked.

도 35는, 채팅 세션에서 사용자를 차단하는 방법(3500)의 순서도이고, 방법(3500)은, 온라인 게임의 복수의 사용자에게 텍스트 메시지 채팅 시스템을 제공하는 단계(단계 3502)를 포함한다. 텍스트 메시지 채팅 시스템의 제 1 사용자로부터 텍스트 메시지 채팅 시스템의 제 2 사용자의 차단을 위한 요청이 수신된다(단계 3504). 그 요청의 수신에 이어서, 제 2 사용자로부터의 텍스트 메시지가 제 1 사용자에게 표시되지 않도록 차단한다(단계 3506). 일부 예에서, 채팅 세션의 텍스트 메시지는, 여기에 설명된 시스템 및 방법을 이용하여 번역되고 및/또는 변환된다. 병렬 처리가, 텍스트 메시지의 번역 및/또는 변환을 수행하는데 사용될 수 있다. 예를 들어, 구분된 병렬 처리는, 채팅 세션의 각 특정 사용자의 텍스트 메시지의 번역 및/또는 변환을 처리하기 위해 배치될 수 있고, 및/또는 채팅 세션에 관여된 각 언어에 배치될 수 있다. 35 is a flow chart of a method 3500 for blocking a user in a chat session, and the method 3500 includes providing a text message chat system to a plurality of users of an online game (step 3502). A request for blocking the second user of the text message chat system is received from the first user of the text message chat system (step 3504). Following receipt of the request, a text message from the second user is blocked from being displayed to the first user (step 3506). In some examples, text messages in a chat session are translated and / or translated using the systems and methods described herein. Parallel processing can be used to perform translation and / or conversion of text messages. For example, separate parallel processing may be arranged to handle translation and / or translation of text messages of each particular user in the chat session, and / or may be placed in each language involved in the chat session.

자동화된 번역 서비스가 항상 정확하지는 않고, 가끔 특정 오류를 정정하는데 인간의 간섭이 이득을 줄 수 있다. 일부 구현에서, 여기에 설명된 번역 시스템 및 방법은, 사용자가 번역 오류를 확인하고 이러한 오류를 바로잡기 위한 정정을 제안하도록 할 수 있다. 예를 들어, 이중 언어 사용자 또는 외국어 사용자(예, 온라인 게임의 불어 플레이어)는, 채팅창을 볼 수 있고 올바르지 않은 번역(예, 불어로 또는 불어로부터)을 볼 수 있다. 사용자는 그 오류가 있는 번역에 대한 정정 제안을 제출할 수 있고, 사용자는 정정 제출에 대한 보상(예, 인-게임 통화 또는 가상 아이템)을 받을 수 있다.Automated translation services are not always accurate, and sometimes human intervention can benefit from correcting certain errors. In some implementations, the translation system and method described herein can allow a user to identify translation errors and suggest corrections to correct those errors. For example, bilingual users or foreign language users (eg, French players in online games) can view the chat window and see incorrect translations (eg, in French or from French). The user can submit a correction proposal for the erroneous translation, and the user can receive a reward for the correction submission (eg, in-game currency or virtual item).

일부 구현에서, 원본 텍스트 메시지 및 대응하는 번역은, 단일 스크린에 표시되고, 그 단일 스크린은 언어에서 경험이 있는 사용자가 즉시 번역에 대한 피드백을 제공할 기회를 제공한다. 예를 들어, 사용자가 번역 오류를 인지하고 정정된 번역을 제출하는 옵션을 선택한다. 그리고 나서 사용자는 정정된 번역을 입력하고 제출하며 그 정정된 번역이 승인된다면 그리고 승인될 때 보상을 수신할 수 있다. 정정된 번역을 제출하자마자, 사용자는 원본 메시지에 대한 추가적인 정정된 번역을 제출하는 것이 차단된다. 이로써 사용자는 단일의 오류가 있는 번역으로부터 다중 보상을 받을 수 없다. In some implementations, the original text message and corresponding translation are displayed on a single screen, which provides an opportunity for experienced users in the language to immediately provide feedback on the translation. For example, the user recognizes a translation error and selects an option to submit a corrected translation. The user can then enter and submit the corrected translation and receive compensation if and when the corrected translation is approved. Upon submitting the corrected translation, the user is blocked from submitting additional corrected translations of the original message. This prevents users from getting multiple rewards from a single erroneous translation.

일부 예에서, 시스템 및 방법은, 원본 메시지가 사용자에 의해 올바르게 입력되지 않기 때문에, 원본 메시지를 번역할 수 없다. 예를 들어, 도 36A는, “Eres el peor!”를 인용하는 원본 스페인어 메시지(3602)를 보여준다. 이 메시지의 자동화된 영어 번역(3604)은 도 36B에 도시되어 있고 “You are the best!”를 인용한다. 도 36C 및 도 36D를 참조하면, 사용자는, 원본 메시지가 제대로 된 스페인어로 입력되지 않았고 올바르지 않게 번역되었음을 인지할 수 있다. 이러한 오류에 대처하기 위해, 사용자는 “정확한 번역 버튼(correct translation button)”(3606)을 선택하고, 그 버튼은 사용자가 원본 메시지에 대한 정정을 입력할 수 있는 정정창(3608)이 오픈되도록 한다. 도 36E 및 36F를 참조하면, 이 경우, 사용자는 그 정정창(3608)에 “Ustedes son los mejores!”를 입력하고 제출 버튼(3610)을 선택한다. 그 제출이 처리될 것임을 사용자에게 알리는 확인창(3612)이 나타난다. 확인창(3612)을 닫으면 사용자는 원본 채팅 페이지로 되돌아간다.In some examples, the system and method cannot translate the original message because the original message is not entered correctly by the user. For example, FIG. 36A shows an original Spanish message 3602 quoting “Eres el peor!”. The automated English translation 3604 of this message is shown in Figure 36B and quotes "You are the best!". 36C and 36D, the user can recognize that the original message was not entered in the correct Spanish and was translated incorrectly. To cope with this error, the user selects the "correct translation button" 3606, which causes the user to enter a correction window 3608 to enter a correction to the original message. 36E and 36F, in this case, the user enters “Ustedes son los mejores!” In the correction window 3608 and selects a submit button 3610. A confirmation window 3612 appears informing the user that the submission will be processed. When the confirmation window 3612 is closed, the user returns to the original chat page.

도 37A 및 도 37B를 참조하면, 사용자가 다른 사용자들로부터 수신된 번역 정정을 검토할 수 있도록 하는 사용자 인터페이스(3700)가 제공된다. 번역 정정을 검토하는 사용자들은, 그들의 노력에 대해 보상을 받을 수 있고, 그들이 수신한 보상의 특정 유형을 선택할 수 있다(예, 온라인 게임을 위한 가상 재화 또는 통화). 일반적으로, 번역 정정이 사용자에 의해 제출된 후에, 다른 사용자는, 그 정정이 원본 번역 및 다른 사용자들에 의해 제출되었던 다른 번역 정정에 비해 더 잘 되었는지를 결정할 수 있다. 사용자의 번역이 최상의 번역인 것으로 판단될 때, 그 사용자는 보상을 수신할 수 있고, 그 사용자의 번역은 번역 사전(번역 데이터 저장부(210))에 추가될 수 있다. 다양한 번역을 판단하는데 참여한 사용자들 또한 보상을 수신할 수 있다. 그러나 이러한 보상은 모든 심판자에 의해 최상으로 선택된 번역 정정을 선택한 사용자들에게 오직 주어질 수 있다. 37A and 37B, a user interface 3700 is provided that allows a user to review translation corrections received from other users. Users who review translation corrections can be rewarded for their efforts, and can select a specific type of reward they have received (eg, virtual goods or currency for online gaming). In general, after a translation correction has been submitted by a user, another user can determine whether the correction was better than the original translation and other translation corrections that were submitted by other users. When it is determined that the user's translation is the best translation, the user can receive the reward, and the user's translation can be added to the translation dictionary (translation data storage 210). Users who participate in judging various translations can also receive rewards. However, these rewards can only be given to users who have selected the translation correction best selected by all judges.

일반적으로, 사용자들이 제안된 번역 정정을 제출하고 다른 사용자의 제출을 판단하도록 함으로써, 시스템 및 방법은 사용자들이 자유롭게 기꺼이 제공하는 사용자 피드백을 이용한다. 이러한 처리에서 수집된 데이터는, 일단 승인되면, 번역 캐쉬 엔트리를 정정하는데 사용될 수 있고, 이로써 여기에 설명된 시스템 및 방법의 모든 번역 능력을 향상시킨다. 이는 원본 메시지가 번역을 위해 다시 제출될 때, 정확한 번역이 향후에 보여지는 것을 보장할 수 있다. In general, by allowing users to submit proposed translation corrections and judge other users' submissions, the system and method utilize user feedback that users are free to provide. The data collected in this process, once approved, can be used to correct the translation cache entry, thereby improving all translation capabilities of the systems and methods described herein. This can ensure that when the original message is resubmitted for translation, the correct translation will be shown in the future.

통상적인 구현에서, 번역 정정을 제출하고 및/또는 판단할 수 있는 두 유형의 사용자들이 있다: 하나의 언어를 사용하는 사용자들, 및 두 개의 언어를 사용하는 사용자들. 두 개의 언어를 사용하는 사용자들은, 일반적으로 원본 언어 문장을 이해할 수 있고, 다른 언어로 더 정확한 번역을 제공할 수 있다. 반대로, 하나의 언어를 사용하는 사용자는, 원본 언어 구절을 이해할 수 없으나, 그렇더라도 번역(사용자의 디바이스 언어로 제공된)을 검토할 수 있고 보상을 교환으로 정정을 제출할 수 있다. 두 유형의 사용자들로부터 얻어진 번역들은, 내용에서 다른 경향이 있다. 이중 언어 사용자는 일반적으로 더 정확한 번역을 제공한다. 시스템 및 방법은, 사용자가 단일 언어를 사용하는 사용자인지 이중 언어 사용자인지 결정하고 또는 검출할 수 있는 것이 바람직하고, 사용자의 피드백은 그 결정에 따라 평가될 수 있다. 예를 들어, 사용자들은 그들이 단일 언어를 사용하는 사용자인지 또는 이중 언어 사용자인지를 시스템 및 방법에 확인시킬 수 있다. In a typical implementation, there are two types of users who can submit and / or determine translation corrections: users who speak one language, and users who speak two languages. Users who speak two languages can generally understand the original language sentences and provide more accurate translations in other languages. Conversely, a user who speaks one language is unable to understand the original language verse, but can still review translations (provided in the user's device language) and submit corrections in exchange for rewards. Translations obtained from both types of users tend to differ in content. Bilingual users generally provide more accurate translations. It is preferred that the system and method be able to determine or detect whether the user is a user who speaks a single language or a bilingual user, and the user's feedback can be evaluated accordingly. For example, users can confirm to the system and method whether they are monolingual or bilingual users.

특정 상황에서, 사용자들의 대부분은, 단일 언어 사용자이고 동일한 언어(영어)를 말한다. 하나의 언어를 말하는 사람이 많이 공급되면, 그 언어에 대한 번역 제공을 제출하는 사용자들이 많아지고, 일반적으로 그 언어로 또는 그 언어로부터의 번역 정정에 대한 수요는 줄어든다. 다른 언어에 대한 번역 정정의 공급을 자극하기 위해, 사용자는, 번역 정정에 대한 수요에 따라 보상받을 수 있다. 예를 들어, 다수의 사용자가 하나의 언어를 말하고 그 언어로 주어진 번역 정정이 부족하지 않을 때, 그러한 사용자들은 번역 정정의 제출에 대해 더 적은 보상(예, 명목상 총액의 75%)을 수신할 수 있다. 동시에, 다른 언어로 말하는 소수의 사용자들은, 그 다른 언어로 된 번역 정정들에 대한 더 큰 수요 때문에, 더 큰 보상(예, 명목상 총액의 125%)을 수신할 수 있다. In certain situations, most of the users are monolingual and speak the same language (English). When a large number of people speaking a language are supplied, the number of users submitting a translation offer for that language is generally reduced, and the demand for correction of translations into or from that language is generally reduced. In order to stimulate the supply of translation corrections to other languages, the user can be compensated according to the demand for translation corrections. For example, when multiple users speak a language and there is not a shortage of translation corrections given in that language, those users may receive less compensation (eg, 75% of nominal total) for the submission of translation corrections. have. At the same time, a small number of users speaking in different languages can receive greater rewards (eg, 125% of the nominal total) due to the greater demand for translation corrections in those different languages.

사용자가 특정 기간(예, 하루)에 걸쳐 정정할 수 있는 번역의 수는 제한되거나 제한되지 않을 수 있다. 예를 들어, 정정을 제출한 것에 대한 보상이 주어지지 않을 때, 번역 정정의 수에는 제한이 없을 수 있다. 반면에, 사용자가 그러한 제출에 대해 보상을 받을 때, 사용자는 그 기간 동안 제한된 수의 번역 정정을 제출하는 것으로 허용될 수 있다. 이러한 제한은, 이중 언어를 사용하는 사용자들, 또는 많은 수의 번역 정정을 제출하는 경향이 있는 사용자들이, 과도한 보상을 받는 것을 방지하고, 이로써 근본적인 게임(예, 멀티 플레이어 온라인 게임)에서 불공평한 이득을 얻는 것을 방지할 수 있다. The number of translations that a user can correct over a certain period of time (eg, one day) may or may not be limited. For example, there may be no limit to the number of translation corrections when no compensation is given for submitting corrections. On the other hand, when the user is compensated for such submission, the user may be allowed to submit a limited number of translation corrections during that period. This limitation prevents users who speak bilingual, or those who tend to submit a large number of translation corrections, from being overly rewarded, thereby unfairly benefiting the underlying game (eg multiplayer online games). It can be prevented from getting.

특정 예에서, 번역 제출의 정확성을 결정하기 어렵고 자동적으로 보상을 발생하기에 어려운 부정확한 번역에 대한 피드백은, 오직 작은 수의 사용자로부터 수신될 수 있다. 예를 들어, 채팅은 연속적인 스트림으로 발생하고, 많은 사용자가 다른 사용자들과의 채팅에 더 집중하고 및/또는 근본적인 게임에서의 플레이에 집중하며, 번역 정정을 제출하는 것에는 덜 집중할 수 있다. 사용자들은 또한 그들의 창에서 그들이 보는 것에 기초하여 채팅을 선택하고, 많지 않은 사용자들이 동일한 채팅을 선택할 수 있다. 이에 따라, 많은 번역 정정이 수신되었을 때, 제안된 정정을, 보상의 교환으로, 정확한 번역에 대한 의견 일치를 얻기 위한 노력으로, 다른 사용자들이 판단하도록 할 수 있다. In certain instances, feedback on incorrect translations that are difficult to determine the accuracy of translation submission and difficult to automatically generate rewards can be received from only a small number of users. For example, chats occur in a continuous stream, and many users may focus more on chatting with other users and / or focusing on playing in the underlying game, and less on submitting translation corrections. Users can also choose a chat based on what they see in their window, and not many users can choose the same chat. Accordingly, when many translation corrections are received, the proposed correction can be made to be judged by other users in exchange for compensation, in an effort to obtain a consensus on the correct translation.

번역 정정을 제출하는 것에 대한 보상은, 래플(raffle) 시스템에 따라 사용자들에게 주어질 수 있다. 그러한 시스템에서, 보상은 모든 제출에 대해 주어지지 않고, 보상을 얻을 가능성이 있는 더 많은 정정을 제출한 사용자에게 랜덤하게 주어질 수 있다. 이러한 접근은, 특정 플레이어들이, 근본적인 게임에서의 그들의 능력 또는 노력에 비해, 메시지의 번역에서의 그들의 능력 및/또는 욕망 때문에, 다른 사용자에 비해 불공평한 이득을 얻을 수 있을 가능성을 줄인다. Rewards for submitting translation corrections can be given to users according to a raffle system. In such a system, rewards are not given for all submissions, but can be given randomly to users who submit more corrections that are likely to get rewards. This approach reduces the likelihood that certain players will gain an unfair advantage over other users due to their ability and / or desire in the translation of the message, compared to their ability or effort in the underlying game.

사용자가 잘못된 번역을 정정할 수 있도록 하는 것 이외에, 사용자들은 또한 잘못 검출된 언어, 필터링되지 않은 비속어, 및 개체명(named entity) 검출에 관한 피드백을 제출할 수 있다. 예를 들어, 원본 메시지 및 번역된 메시지를 볼 때, 사용자는 자동화된 번역 시스템이 원본 언어를 적절하지 않게 검출한 것을 인식할 수 있다. 그리고 나서 사용자는 가능한 보상에 대한 교환으로, 이 언어 검출 실수에 관해 시스템에 알릴 수 있다. 마찬가지로, 사용자는 메시지에 나타난 비속어에 관해 시스템에 알릴 수 있고, 이로써 시스템은 향후 메시지로부터 그러한 비속어를 필터링하고 또는 삭제할 수 있도록 한다. 사용자들은 또한 회사, 브랜드, 상표 등과 같이 메시지에 나타나는 개체명에 관해 시스템에 알릴 수 있다. 이는 개체명이 메시지에 나타날 때 시스템 및 방법이 이를 인식할 수 있도록 하고 그러한 개체들이 적절하게 이름이 붙여지고 및/또는 식별되도록 한다. In addition to allowing users to correct incorrect translations, users can also submit feedback regarding incorrectly detected languages, unfiltered profanity, and named entity detection. For example, when viewing the original message and the translated message, the user can recognize that the automated translation system has incorrectly detected the original language. The user can then inform the system of this language detection mistake in exchange for a possible reward. Likewise, the user can inform the system about profanity appearing in the message, thereby allowing the system to filter or delete such profanity from future messages. Users can also inform the system about the entity name appearing in the message, such as company, brand, trademark, and so on. This allows the system and method to recognize when the entity name appears in the message and allows those entities to be properly named and / or identified.

일반적으로, 사용자에 의해 제출된 번역 정정은, 사용자가 정확한 정정에 대해서만 오직 보상받도록 하기 위해 주의 깊은 평가를 필요로 한다. 이는 시스템의 전체적인 정확도를 향상시키고 사용자가 사기적인 정정을 제출하여 부정행위를 하는 것을 방지할 것이다. 일부 구현에서, 번역 정정의 정확도는, 사용자가 시스템을 부당하게 이용하는 것을 방지하기 위해, 단어 기반 특징, 언어 기반 특징, 및 다른 특징(예, 단어 배치 매칭 특징(word alignment match feature))을 이용하여 자동으로 평가된다. 품사(POS:a part of speech) 기반 언어 모델이, 문법 정정을 위해 문장을 체크하는데 사용될 수 있다. 추가적으로, 일부 사용자는 문법적으로 정확하지만 원본 메시지와 관계가 없는 번역 정정을 제출할 수 있다. 그러한 경우에서, 단어 배치 매칭 분석 특징이, 유용할 수 있고, 사용자 제출들을 승인하고 및/또는 거절하는데 주기적인 프로세스로서 동작할 수 있다. 기계 학습 접근(machine learning approach)은 여기에 설명된 번역 시스템 및 방법에서 드문 사용자 피드백을 인증하는데 사용될 수 있다. In general, translation corrections submitted by the user require careful evaluation to ensure that the user is only compensated for the correct correction. This will improve the overall accuracy of the system and prevent users from cheating by submitting fraudulent corrections. In some implementations, the accuracy of translation correction uses word-based features, language-based features, and other features (eg, word alignment match features) to prevent users from unfairly using the system. It is evaluated automatically. A language model based on a part of speech (POS) can be used to check sentences for grammar correction. Additionally, some users may submit translation corrections that are grammatically correct but independent of the original message. In such a case, the word placement matching analysis feature can be useful and can act as a periodic process to approve and / or reject user submissions. The machine learning approach can be used to authenticate rare user feedback in the translation systems and methods described herein.

표2는 본 발명의 특정 실시예에 따른, 사용자에 의해 제출된 제안된 번역 정정의 예를 나타낸다. 이 예에서, 소스 언어에서의 원본 메시지는, “aaa bbb ccc”이고 타겟 언어에서의 정확한 번역은 “xxx yyy zzz”이다. “보여진 번역(Shown Translation)”이라는 열은, 여기에 설명된 자동화된 시스템에 의해 제안된 초기 번역의 예를 포함한다.Table 2 shows examples of proposed translation corrections submitted by users according to certain embodiments of the present invention. In this example, the original message in the source language is “aaa bbb ccc” and the correct translation in the target language is “xxx yyy zzz”. The column “Shown Translation” contains an example of the initial translation proposed by the automated system described herein.

사용자 정정들 및 바람직한 결과들의 예Examples of user corrections and desirable results 원본 메시지The original message 보여진 번역Translation shown 사용자 정정User correction 설명Explanation 상태state aaa bbb cccaaa bbb ccc

.;$@#!

.; $ @ #! xxx yyy zzz User corrects incorrect translation Approved xxx yyy xxx yyy zzz Improves the quality of existing machine translations that are partially accurate by the user Approved aaa xxx uuu xxx 'yyy' zzz ' Users do not know the target language, but simply use an online translation service Approved xxx yyy zzz xxx zzz yyy The user unreasonably uses the system for compensation by rearranging the words a little in the translation shown. Rejected xxx yyy zzz xxx yyy zzz Copy and paste the same translation for cheating Rejected xxx yyy zzz

$ @% ^ & / dse reyyfwf If copying is rejected, type a random message / junk for cheating Rejected xxx yyy zzz sss ddd fff The user submits a grammatically correct message in the target language, but the translation does not matter (copy and paste the translation of the previous message) Rejected xxx yyy zzz XXX yyY zzz User attempts to submit a true translation, but his correction is worse than the translation shown (low quality) Rejected

표2에 나타난 바와 같이, 사용자가 정확하고 개선된 번역을 제출할 때, 사용자 제출은 승인되어야 하고, 사용자는 적절한 보상을 수신할 수 있다. 그러나, 사용자가 질 낮은 또는 사기적인 번역(예, 랜덤 메시지)을 제출할 때, 사용자 제출은 거부되어야 하고, 어떠한 보상도 사용자에게 주어져서는 안 된다. 시스템 및 방법은 바람직하게 표2의 “상태” 열에 보여진 것과 같은 예를 승인하거나 또는 거절한다. As shown in Table 2, when the user submits an accurate and improved translation, the user submission must be approved, and the user can receive appropriate rewards. However, when a user submits a poor or fraudulent translation (eg, a random message), the user submission should be rejected and no reward should be given to the user. The system and method preferably approve or reject the example as shown in the “Status” column of Table 2.

특정 실시예에서, 원본 메시지의 번역은, 번역이 원본 메시지에 대해 적절한지 여부에 따라 분류된다. 분류는, 이항 분류 태스크에서 번역 및 원본 메시지로부터 특징이 추출될 때 처리할 수 있다. 분류 기술은, 사용자에 의해 제출된 번역 정정이 정확하다는 것을 보장하기 위해 사용될 수 있다. 예를 들어, 일부 예에서, 여기에 설명된 다수 기반 검증(majority-based validation)은, 부정확한 번역마다 수신될 수 있는 응답의 수가 적기 때문에(예, 하나에서 셋) 적합하지 않다. 분류 기술은 또한 캐쉬된 번역 데이터에 나타나는 해쉬 충돌(hash collision)을 식별하고 및/또는 처리하는데 사용될 수 있다. 예를 들어, 데이터 테이블의 번역 엔트리의 약 10% 이상이 해쉬 충돌 때문에 오염될 수 있다. In certain embodiments, the translation of the original message is classified according to whether the translation is appropriate for the original message. Classification can be handled when features are extracted from translation and original messages in a binomial classification task. The classification technique can be used to ensure that the translation correction submitted by the user is correct. For example, in some examples, the majority-based validation described herein is not suitable because the number of responses that can be received per incorrect translation is small (eg, one to three). Classification techniques can also be used to identify and / or process hash collisions that appear in cached translation data. For example, over 10% of translation entries in a data table can be contaminated due to hash collisions.

도 38을 참조하면, 다양한 구현에서, 번역의 정확도는, 단어 기반 특징 모듈(3802), 언어 기반 특징 모듈(3804), 및 단어 배치 모듈(3806)을 포함하는 번역 정확도 모듈(3800)을 이용하여 평가된다. 단어 기반 특징 모듈(3802)은, 단어 카운트(count), 문자 카운트(count), 이모지(emoji), 숫자, 및/또는 구두점과 같은, 단어 기반 특징을 평가하는데 사용된다. 예를 들어, 번역이 정확할 때, 원본 메시지의 단어들의 개수와 번역에서의 단어들의 개수는 일반적으로 동일하다. 이에 따라, 두 메시지에서 단어의 개수가 임계 양(예, 두 배)보다 더 많아 차이가 나면, 번역은 부정확하거나 또는 부정확할 가능성이 있는 것으로 여겨질 수 있다. 일 예에서, 메시지 중 하나(예, 번역)에서의 단어의 개수가 다른 메시지(예, 원본 메시지)의 단어의 개수의 1/2(또는 1/2 보다 더 적음)이면, 단어 기반 특징 모듈(3802)은, 번역이 부정확하거나 또는 부정확할 가능성이 있는 것으로 결론을 내릴 수 있다.Referring to FIG. 38, in various implementations, the accuracy of translation uses a translation accuracy module 3800 that includes a word-based feature module 3802, a language-based feature module 3804, and a word placement module 3806. Is evaluated. The word based feature module 3802 is used to evaluate word based features, such as word count, character count, emoji, numbers, and / or punctuation. For example, when the translation is correct, the number of words in the original message and the number of words in the translation are generally the same. Accordingly, if the number of words in the two messages differs by more than a threshold amount (eg, double), the translation may be considered to be inaccurate or possibly inaccurate. In one example, if the number of words in one of the messages (eg, translation) is 1/2 (or less than 1/2) of the number of words in another message (eg, the original message), then the word-based feature module ( 3802) may conclude that the translation may be inaccurate or inaccurate.

번역의 정확도를 평가하는데 사용될 수 있는 다른 단어 기반 특징은, 원본 메시지 및 번역에 있는 문자(character)(예, 글자(letter)들 및 숫자(number)들)의 수이다. 일반적으로, 원본 메시지의 문자의 수가, 번역에서의 문자의 수와 동일할 때, 번역은 정확한 것으로 볼 수 있다. 임계 양은, 두 메시지에서의 문자 카운트가 지나치게 다를 때, 결정하기 위해 사용될 수 있다. 예를 들어, 번역이 원본 메시지의 문자의 3/2 보다 더 많은 문자를 가지면, 단어 기반 특징 모듈(3802)은, 번역이 부정확하거나 부정확할 가능성이 있는 것으로 결론을 내릴 수 있다.Another word-based feature that can be used to evaluate the accuracy of the translation is the number of characters (eg, letters and numbers) in the original message and translation. In general, when the number of characters in the original message is equal to the number of characters in the translation, the translation can be regarded as correct. The threshold amount can be used to determine when the character counts in the two messages are too different. For example, if the translation has more than 3/2 of the characters in the original message, the word-based feature module 3802 can conclude that the translation is likely to be incorrect or inaccurate.

번역의 정확도를 평가하는데 사용될 수 있는 다른 단어 기반 특징은, 이모지(emoji)의 카운트 및 순서(예, 일본 전자 메시지에서 사용되는 표의문자 및 스마일리(smiley))로, 이는 일반적으로 원본 메시지와 번역 사이에서 변하지 않고 유지된다. 이모지는 특정 문장에서 이모지를 검출하는데 사용될 수 있는 특정 유니코드 텍스트 범위 내에 위치하는 경향이 있다. 규칙적인 표현은, 이러한 유니코드 범위를 이용하여 두 메시지로부터 이모지를 식별하거나 추출하는데 사용될 수 있다. 예를 들어, 입력이 연속하여 3개의 이모지를 포함하고 출력은 단지 하나의 이모지를 포함하면, 이는 입력과 출력 사이에 차이를 나타낸다. 이모지의 카운트 및/또는 순서가 두 메시지 사이에서 다르면, 단어 기반 특징 모듈(3802)은, 번역이 부정확하거나, 부정확할 가능성이 있는 것으로 결론 낼 수 있다.Another word-based feature that can be used to evaluate the accuracy of translations is the count and order of emoji (e.g., ideograms and smileys used in Japanese electronic messages), which are usually translated from the original message. It remains unchanged between. Emoji tends to be located within a specific range of Unicode text that can be used to detect emoji in a particular sentence. Regular expressions can be used to identify or extract emoji from two messages using this Unicode range. For example, if the input contains three emoji in succession and the output contains only one emoji, this represents a difference between the input and output. If the count and / or order of the emoji is different between the two messages, the word-based feature module 3802 can conclude that the translation is likely to be incorrect or inaccurate.

번역 정확도를 평가하는데 사용될 수 있는 추가적인 단어 기반 특징은, 두 메시지에서의 숫자들 및 구두점들의 카운트이다. 예를 들어, 숫자들 및 구두점들이, 원본 메시지와 번역에서 식별될 수 있고 추출될 수 있고, 최장 공통 부분 수열(LCS :Longest Common Susequence)의 길이가 정렬 후에 그들 사이에서 발견될 수 있다. 이 길이는, 두 메시지의 길이의 최대값으로 나누어지고, 이 단어 기반 특징에 대한 실제 숫자화된 값을 제공한다. 일반적으로 실제 숫자화된 값은, 오버랩되는 두 메시지에서의 숫자 및 구두점의 퍼센트의 표시를 제공한다. 실험 결과는, 이 특정 특징에 대해, 이진 값보다 실제 숫자화된 값을 이용하여 더 좋은 결과가 얻어지는 것을 보여준다. 예를 들어, 영어로 된 “I am going to meet you at 4:30 Cya!!”의 입력 문장은, “Je vais vous recontrer a 4:30 Au revoir!!”의 동등한 출력을 가질 수 있다. 구두점 및 숫자를 추출하면, 영어 및 불어 버전 모두에 대해 “4:30!!”를 얻을 수 있다. 이 경우 LCS는 6(문자에 의해)이고, 영어 및 불어 버전 중에서 길이의 최대는, 36(문자에 의해)이다. 이 단어 기반 특징에 대해 결과로 생긴 실제 숫자화된 값은, 6/36=0.167이다.An additional word-based feature that can be used to evaluate translation accuracy is the count of numbers and punctuation marks in both messages. For example, numbers and punctuation marks can be identified and extracted from the original message and translation, and the length of the Longest Common Susequence (LCS) can be found between them after sorting. This length is divided by the maximum value of the lengths of the two messages, providing the actual numeric value for this word-based feature. In general, the actual numeric value provides an indication of the percentage of the number and punctuation in the two overlapping messages. Experimental results show that for this particular feature, better results are obtained using actual numeric values than binary values. For example, an input sentence of “I am going to meet you at 4:30 Cya !!” in English may have an equivalent output of “Je vais vous recontrer a 4:30 Au revoir !!”. By extracting punctuation and numbers, you can get “4:30 !!” for both English and French versions. In this case, the LCS is 6 (by letter), and the maximum length of the English and French versions is 36 (by letter). The actual numeric value resulting from this word-based feature is 6/36 = 0.167.

단어 기반 특징에 대한 의존은, 단독으로는 번역 정확도를 평가하는데 불충분할 수 있다. 예를 들어, 사용자는, 원본 메시지에 나타난 동일한 개수의 단어 및 문자를 갖는 사기적인 정정을 생산하기 위해 원본 메시지의 각 단어를 더미 단어(예, “xxx”)로 교체하는 번역 정정을 제출함으로써 적어도 일부의 단어 기반 특징을 기만할 수 있다. Reliance on word-based features alone may be insufficient to assess translation accuracy. For example, a user may submit at least a translation correction by replacing each word in the original message with a dummy word (eg, “xxx”) to produce a fraudulent correction with the same number of words and characters appearing in the original message. Some word-based features can be deceived.

이러한 문제를 피하기 위해, 번역 정확도 모듈(3800)은, 단어 기반 특징에 더하여 또는 대신하여, 언어 기반 특징을 평가하기 위한 언어 기반 특징 모듈(3804)을 이용할 수 있다. 예를 들어, 일 실시예에서, 원본 메시지와 번역에 나타나는 단어들은, 두 메시지에서 품사(POS)(예, 동사, 명사, 형용사 등)를 식별하기 위해 태그될 수 있다(예를 들어, 오픈 소스 POS tagger를 이용). 메시지들의 각 단어는, 각 언어별 다른 태그 세트를 이용하여, 다른 수의 태그들로, 품사에 따라 태그될 수 있다. 예를 들어, 샘플 문장 “The device is easy to use”는, 문장에서 각 단어의 품사를 나타내는 POS 태거에 의해 “The_DT device_NP is_VBZ easy_JJ to_TO use_VB”로 태그될 수 있다. 이 경우, 태그는, 한정사(DT:Determiner), 명사구(NP:Noun phrase), 단수의 현재형 시제(VBZ:Singular present tense verb), 형용사(JJ:Adjective), TO(TO), 단순 동사(VB:Simple verb)이다. 이 제안의 주된 관심의 태그들은, 전형적으로 형용사 및 부사가 뒤따르는 동사(verb)이다. To avoid this problem, the translation accuracy module 3800 can use a language based feature module 3804 to evaluate language based features in addition to or instead of word based features. For example, in one embodiment, the words appearing in the original message and the translation may be tagged to identify a part of speech (POS) (eg, verb, noun, adjective, etc.) in both messages (eg, open source. POS tagger). Each word in the messages can be tagged according to part of speech, with a different number of tags, using a different set of tags for each language. For example, the sample sentence “The device is easy to use” may be tagged as “The_DT device_NP is_VBZ easy_JJ to_TO use_VB” by a POS tagger indicating the part of speech of each word in the sentence. In this case, the tags are: delimiter (DT), noun phrase (NP), singular present tense verb (VBZ), adjective (JJ), TO (TO), simple verb (VB) : Simple verb). The tags of primary interest in this proposal are verbs, typically followed by adjectives and adverbs.

특정 예에서, 원본 메시지 및 번역은, 분리되어 태그되고(예, POS 태거를 이용), 각 메시지의 결과 태그는, 각 메시지에서의 동사, 형용사, 부사 등의 개수를 식별하기 위해 카운트된다. 각 언어에 사용된 다른 유형의 동사(법조동사(modal verb), 부정형 동사(infinite verb), 과거 시제 동사(past tense verb), 미래 시제 동사(future tense verb) 등) 때문에, 간소화된 동사 태그 VB는, 각 언어에서 모든 유형의 동사들에 대한 맵을 이용하여 얻어질 수 있다. 예를 들어, 영어 동사 품사 태그는, 다음과 같이 단일 동사 태그 VB로 매핑될 수 있다; 'VBD'(동사(Verb), 과거 시제(past tense))=>'VB', 'VBG'(동사(Verb), 동명사(Gerund))=>'VB', 'VBN'(동사(Verb), 과거 분사(past participle))=>'VB', 'VBP'(동사(Verb), 비 3인칭 단수 현재(non third person singular present))=>'VB', 및 'VBZ'(동사(Verb), 3인칭 단수 현재(third person singular present)) =>'VB'. 태그된 메시지의 POS 태그는, 간소화된 POS 태그 세트로 교체될 수 있다. In a particular example, the original message and translation are tagged separately (eg, using POS taggers), and the resulting tag of each message is counted to identify the number of verbs, adjectives, adverbs, etc. in each message. Simplified verb tags VB due to different types of verbs used in each language (modal verb, infinite verb, past tense verb, future tense verb, etc.) Can be obtained by using maps for all types of verbs in each language. For example, an English verb part-of-speech tag may be mapped to a single verb tag VB as follows; 'VBD' (verb, past tense) => 'VB', 'VBG' (verb, gerund) => 'VB', 'VBN' (verb) , Past participle) => 'VB', 'VBP' (Verb, non third person singular present) => 'VB', and 'VBZ' (verb ), Third person singular present => 'VB'. The POS tag of the tagged message can be replaced with a simplified set of POS tags.

POS 태그를 간소화한 후, 동사 태그 VB의 개수는, 원본 메시지 및 번역에서 카운트될 수 있다. 이상적으로, 각 메시지에서 동사의 개수는, 동일해야 하지만, 일부 예외가 있다. 예를 들어, 영어 “was sleeping”는, 불어 “dormais”로 번역된다. 영어 POS 태거는, 두 개의 다른 동사인 “was” 및 “sleeping”을 태그할 수 있다. 반면, 불어 POS 태거는, 단일 동사인 “dormais”를 태그할 수 있다. “is”, “was”, 및 “can”과 같은 동사는, 영어에서 조동사로서 알려져 있다. 다른 언어들은, 이러한 조동사와 동등한 동사를 갖지 않을 수 있고, 대신 단일 동사를 교체로서 이용할 수 있다. 언어 중에서 동사 사용에서의 이러한 차이를 설명하기 위해, 시스템 및 방법은, 원본 메시지와 번역 사이의 동사의 개수에서의 차이에 대한 임계값(예, 2 또는 3)을 사용할 수 있다. 예를 들어, 두 메시지에서의 동사의 개수 사이의 차이가, 2보다 클 때, 언어 기반 특징 모듈(3804)은, 번역이 부정확하거나 부정확할 가능성이 있다고 고려할 수 있다. 이 임계값 2는, 합리적인 결과를 생산하기 위해 시행착오를 통해 발견되었다. 다른 품사(예, 형용사 및 부사)는, 하나 또는 그 이상의 추가적인 임계값을 이용하여, 두 메시지 사이에서 카운트되고 비교될 수 있다.After simplifying the POS tag, the number of verb tags VB can be counted in the original message and translation. Ideally, the number of verbs in each message should be the same, with some exceptions. For example, “was sleeping” in English is translated as “dormais” in French. The English POS tagger can tag two different verbs, “was” and “sleeping”. On the other hand, the French POS tagger can tag a single verb “dormais”. Verbs such as “is”, “was”, and “can” are known as auxiliary verbs in English. Other languages may not have verbs equivalent to these auxiliary verbs, but instead use a single verb as a replacement. To account for these differences in verb use among languages, systems and methods can use thresholds (eg, 2 or 3) for differences in the number of verbs between the original message and the translation. For example, when the difference between the number of verbs in the two messages is greater than 2, the language-based feature module 3804 may consider that the translation may or may not be accurate. This threshold 2 was found through trial and error to produce reasonable results. Other parts of speech (eg, adjectives and adverbs) can be counted and compared between two messages using one or more additional thresholds.

그러나, 일부 예에서, 사용자는, 번역에 대한 정정으로서 단순히 현재 존재하는 번역을 복사하여 제출함으로써 이 번역 정확도 체크를 기만할 수 있다. 이러한 경우에, 제출은, 유효한 정정으로 분류될 수 있으나, 사용자는 제출에 대한 보상을 받지 못할 수 있다. 일부 경우에서, 사용자는 또한 유효한 정정을 생산하여 제출하기 위해 현존하는 번역에서 일부 단어의 격(case)을 단순히 변경할 수 있다. 이에 따라, 일부 실시예에서, 시스템 및 방법은, 현존하는 번역 및 사용자 제출이 동일한지 아닌지를 결정한다. 만약 현존하는 번역 및 사용자 제출이 동일(예, 격(case) 및 대문자 사용(capitalization))하면, 사용자에게 어떠한 보상도 주어지지 않을 수 있다. However, in some examples, the user can deceive this translation accuracy check by simply copying and submitting the existing translation as a correction to the translation. In this case, the submission may be classified as a valid correction, but the user may not be compensated for the submission. In some cases, the user can also simply change the case of some words in the existing translation to produce and submit a valid correction. Accordingly, in some embodiments, the systems and methods determine whether existing translations and user submissions are the same. If the existing translations and user submissions are the same (eg, case and capitalization), no compensation may be given to the user.

특정 실시예에서, POS 태그 체크는, 자동화된 번역 시스템이 원본 메시지의 언어를 정확하게 식별하는데 실패할 때의 경우를 식별하는데 사용된다. 예를 들어, 원본 메시지의 언어는, 사용자의 번역 정정이 단어 카운트 체크를 패스하지만 POS 태그 체크에서 실패할 때, 부정확하게 검출될 수 있다. 부정확한 언어 검출은 또한, 동사의 개수가 0과 같거나, 모든 태그가 하나의 메시지에서 명사이고 다른 메시지에서는 아닐 때일 가능성이 있다. 예를 들어, 원본 스페인어 메시지는 “Pizt te enviA³ tan pronto regrese una marcha.”를 인용한다. 그러나, 언어가 영어인 것으로 검출되면, 영어 POS 태거는, 메시지를 태그할 수 없을 가능성이 있을 수 있고, 디폴트로, 모든 단어를 명사로 태크할 수 있다. 영어 POS 태거의 출력은, 예를 들어, [{“tag”: “NP”, “word”: “Pizt”, “stem”: “<unknown>“}, {“tag”: “NN”, “word”: “te”, “stem”: “te”}, {“tag”: “NN”, “word”: “envi＼u00c3＼u00b3”, “stem”: “<unknown>“}, {“tag”: “NN”, “word”: “tan”, “stem”: “tan”}, {“tag”: “RB”, “word”: “pronto”, “stem”: “pronto”}, {“tag”: “JJ”, “word”: “regrese”, “stem”: “<unknown>“}, {“tag”: “NN”, “word”: “una”, “stem”: “<unknown>“}, {“tag”: “NN”, “word”: “marcha”, “stem”: “<unknown>“}]일 수 있다. 그에 비해, 동일한 원본 메시지의 스페인어 태거의 출력은, [{“tag”: “NP”, “word”: “Pizt”, “stem”: “<unknown>“}, {“tag”: “PPX”, “word”: “te”, “stem”: “t＼u00fa”}, {“tag”: “VLfin”, “word”: “envi＼u00c3＼u00b3”, “stem”: “<unknown>“}, {“tag”: “ADV”, “word”: “tan”, “stem”: “tan”}, {“tag”: “ADV”, “word”: “pronto”, “stem”: “pronto”}, {“tag”: “VLfin”, “word”: “regrese”, “stem”: “regresar”}, {“tag”: “ART”, “word”: “una”, “stem”: “un”}, {“tag”: “NC”, “word”: “marcha”, “stem”: “marcha”}]일 수 있다. 태그 “NN”, “RB”, 및 “PPX”는 각각 명사(단수 또는 불가산), 부사, 및 소유 대명사를 나타낸다. In certain embodiments, POS tag checks are used to identify cases when the automated translation system fails to accurately identify the language of the original message. For example, the language of the original message may be incorrectly detected when the user's translation correction passes the word count check but fails the POS tag check. Inaccurate language detection is also likely when the number of verbs is equal to zero, or when all tags are nouns in one message and not in the other. For example, the original Spanish message cites “Pizt te enviA³ tan pronto regrese una marcha.” However, if the language is detected to be English, the English POS tagger may be unable to tag the message, and by default, all words can be tagged as nouns. The output of the English POS tagger, for example, [{“tag”: “NP”, “word”: “Pizt”, “stem”: “<unknown>“}, {“tag”: “NN”, “ word ”:“ te ”,“ stem ”:“ te ”}, {“ tag ”:“ NN ”,“ word ”:“ envi＼u00c3＼u00b3 ”,“ stem ”:“ <unknown> “}, {“ tag ”:“ NN ”,“ word ”:“ tan ”,“ stem ”:“ tan ”}, {“ tag ”:“ RB ”,“ word ”:“ pronto ”,“ stem ”:“ pronto ”}, {“Tag”: “JJ”, “word”: “regrese”, “stem”: “<unknown>“}, {“tag”: “NN”, “word”: “una”, “stem”: “ <unknown> "}, {" tag ":" NN "," word ":" marcha "," stem ":" <unknown> "}]. In comparison, the output of the Spanish tagger of the same original message is [{“tag”: “NP”, “word”: “Pizt”, “stem”: “<unknown>“}, {“tag”: “PPX” , “Word”: “te”, “stem”: “t＼u00fa”}, {“tag”: “VLfin”, “word”: “envi＼u00c3＼u00b3”, “stem”: “<unknown>“ }, {“Tag”: “ADV”, “word”: “tan”, “stem”: “tan”}, {“tag”: “ADV”, “word”: “pronto”, “stem”: “ pronto ”}, {“ tag ”:“ VLfin ”,“ word ”:“ regrese ”,“ stem ”:“ regresar ”}, {“ tag ”:“ ART ”,“ word ”:“ una ”,“ stem ” : “Un”}, {“tag”: “NC”, “word”: “marcha”, “stem”: “marcha”}]. The tags “NN”, “RB”, and “PPX” indicate nouns (singular or non-numerical), adverbs, and possessive pronouns, respectively.

이에 따라, 특정 예에서, 원본 메시지 및 번역의 품사가, 원본 메시지에서 언어가 제대로 식별되었는지를 결정하기 위해 비교된다. 일반적으로, 언어 검출 실패는, 메시지 중 하나(예, 번역)가 동사가 0이 아닌 개수를 갖는 것으로 태그되고, 다른 메시지(예, 원본 메시지)는 동사가 없는 것으로 태그될 때, 발생될 가능성이 있다. 언어 검출 실패는 또한, 하나의 메시지에서 모든 단어가 명사로 태그되고 반면 다른 메시지는 여러 유형의 POS 태그(예, 명사, 동사 및 형용사)를 가질 때일 가능성이 있다. Accordingly, in a particular example, the parts of speech of the original message and the translation are compared to determine whether the language was properly identified in the original message. Generally, a language detection failure is likely to occur when one of the messages (eg, a translation) is tagged as having a non-zero number of verbs, and the other message (eg, the original message) is tagged as having no verbs. have. Language detection failure is also likely when all the words in one message are tagged with nouns, while the other message has several types of POS tags (eg nouns, verbs and adjectives).

다양한 실시예에서, 번역 정확도는, 원본 메시지 및 번역에서 고유 명사를 식별하고 검토함으로써 평가된다. 일반적으로, 번역이 정확할 때, 고유 명사(예, 사람 및 도시들의 이름)는, 번역 및 원본 메시지에서 동일하다. 따라서 번역되지 않은 상태로 있는 두 메시지의 비교 및 공통 단어의 필터링은, 진정한 번역을 식별하는 특징으로서 유용할 수 있다. 일부 예에서, 이러한 번역되지 않은 고유 명사의 존재는, 번역 정확도를 개선하는데 도움을 줄 수 있으나, 번역되지 않은 고유 명사의 부재는, 번역 정확도에 관한 어떠한 정보도 주지 않을 수 있다. 원본 메시지에서 고유 명사가 식별되지만 번역에는 없다면, 번역의 정확도는 부정확한 것으로 또는 부정확할 가능성이 있는 것으로 고려될 수 있다. 이러한 고유 명사 특징에 대해 회신된 실제 가치화된 점수에 페널티가 추가될 수 있고, 이는 잘못된 번역을 식별하고 번역 정확도를 개선하는데 도움을 준다. 예를 들어, 고유 명사가 두 메시지 사이에서 다를 때, 번역에 대한 정확도 점수는 페널티에 의해 감소될 수 있다. In various embodiments, translation accuracy is assessed by identifying and reviewing proper nouns in the original message and translation. Generally, when the translation is correct, the proper noun (eg, the names of people and cities) is the same in the translation and the original message. Thus, comparison of two messages that are left untranslated and filtering of common words can be useful as a feature to identify a true translation. In some examples, the presence of such untranslated proper nouns can help improve translation accuracy, but the absence of untranslated proper nouns may not give any information about the translation accuracy. If the proper noun is identified in the original message but is not present in the translation, the accuracy of the translation may be considered to be inaccurate or inaccurate. Penalties can be added to the actual valued scores returned for these intrinsic noun features, which helps to identify incorrect translations and improve translation accuracy. For example, when the proper noun is different between the two messages, the accuracy score for the translation can be reduced by a penalty.

대안으로 또는 추가적으로, 번역 정확도는, 원본 메시지 및 번역에서 문법을 분석하고 비교함으로써 평가될 수 있다. 다중 언어로 작업하는 것은, 문장의 문법을 이해하기 위해 모든 언어에 대해 트리(tree)를 분석(parse)하는 것을 어렵게 할 수 있다. 메시지는 또한 종종 채팅 언어로 작성되고, 이는 모국어(native language)의 분명한 또는 공식적인 화법과 비교되는 다른 문법으로 이어진다. Alternatively or additionally, translation accuracy can be assessed by analyzing and comparing grammar in the original message and translation. Working in multiple languages can make it difficult to parse a tree for every language to understand the grammar of the sentence. Messages are also often written in a chat language, which leads to other grammars that are compared to the explicit or formal speech of the native language.

이에 따라, 채팅 언어의 문법 중에서 패턴을 인식하기 위해, 문장은, POS 태그에 대한 N-그램(gram) 언어 모델을 세우기 위해 POS 태그로 태그될 수 있고, 이로써 근본적인 문법적 구조의 근사치를 제공한다. n-그램은, n-연속적인 단어들의 수집으로 정의될 수 있다. 이 n-그램의 모델은, 전형적으로 특정 언어에 대한 것일 수 있고, 및/또는 특정 언어에서 n 연속적인 단어들 나타내는데 사용될 수 있다. 특정 구현에서, 단어 기반 n-그램의 방법은, 음성 기반 n-그램의 부분으로 확장된다. 즉, 문장을 파싱하는 피상적인 방법이, POS 태거로 문장의 단어가 태그되는데 사용될 수 있다. 일 접근에서, BLUE 점수는, 실제 텍스트보다는 POS 태그에 대해 계산된다. Accordingly, in order to recognize patterns among grammars of chat languages, sentences can be tagged with POS tags to establish an N-gram language model for POS tags, thereby providing an approximation of the underlying grammatical structure. An n-gram can be defined as a collection of n-sequential words. This n-gram model can typically be for a particular language, and / or can be used to represent n consecutive words in a particular language. In certain implementations, the method of word-based n-grams is extended to parts of speech-based n-grams. That is, a superficial method of parsing a sentence can be used to tag the word of the sentence with a POS tagger. In one approach, BLUE scores are calculated for POS tags rather than actual text.

트라이그램(3-gram) 언어 모델은, 각 언어에 대한 POS 태그된 문장에 대해 만들어질 수 있다. 예를 들어, 문장 “The device is easy to use”는, “The_DT device_NP is_VBZ easy_JJ to_TO use_VB”의 POS 태그된 출력을 갖는다. 이 문장에서의 단어 기반 트라이그램은, {The, device, is}, {device, is, easy}, {is, easy, to}, {easy, to, use}이다. 대응하는 POS 기반 트라이그램은 {DT, NP, VBZ}, {NP, VBZ, JJ}, {VBZ, JJ, TO}, {JJ, TO, VB}일 수 있다.A trigram language model can be created for POS tagged sentences for each language. For example, the sentence “The device is easy to use” has the POS tagged output of “The_DT device_NP is_VBZ easy_JJ to_TO use_VB”. The word-based trigrams in this sentence are {The, device, is}, {device, is, easy}, {is, easy, to}, {easy, to, use}. The corresponding POS-based trigram may be {DT, NP, VBZ}, {NP, VBZ, JJ}, {VBZ, JJ, TO}, {JJ, TO, VB}.

더 높은 확률을 갖는 트라이그램이, 문법의 부분적 구조를 추론하는데 사용될 수 있다. 예를 들어, POS 태그에 대해 만들어진 트라이그램 언어 모델은, 각 트라이그램과 연관된 확률을 가질 수 있다. 확률은, 비교된 텍스트 말뭉치에서의 모든 트라이그램에 대한 동일 텍스트에서 특정 트라이그램이 발생한 시기의 개수의 비율로서 계산될 수 있다. 문법적 트라이그램은, 종종 반복되는 경향이 있고, 이런 이유로 더 높은 확률(또한 언어 모델 점수로 알려진)을 가질 것이다. 이런 이유로, 메시지가 이 언어 모델에 대해 더 높은 점수를 수신할 때, 메시지는 문법적으로 정확할 가능성이 더 크다. 해당 점수는, 사용자가 보상을 받기 위해 스팸 메시지를 입력할 때의 사례를 검출하는데 유용할 수 있다. 그 점수는 또한 언어 검출이 실패했을 때를 결정하는데 유용할 수 있다. 예를 들어, 구분된 모델들이 각 언어에 대해 사용될 수 있기 때문에, 잘못된 언어로 된 문장의 점수는, 더 낮을 수 있다. 점수는 또한 복수의 번역이 좋을 때를 검출하는데 유용할 수 있다. 구분된 모델들은, 이 목적을 위해 인간 및 기계 번역을 위해 훈련될 수 있다.Trigrams with higher probability can be used to infer the partial structure of the grammar. For example, a trigram language model built for POS tags may have a probability associated with each trigram. Probability can be calculated as the ratio of the number of times a particular trigram occurred in the same text for all trigrams in the compared text corpus. Grammatic trigrams tend to repeat often, and for this reason they will have a higher probability (also known as language model scores). For this reason, when a message receives a higher score for this language model, the message is more likely to be grammatically correct. This score can be useful for detecting cases when a user enters a spam message to receive a reward. The score can also be useful in determining when language detection has failed. For example, the scores of sentences in the wrong language may be lower, since distinct models can be used for each language. Scores can also be useful for detecting when multiple translations are good. Separated models can be trained for human and machine translation for this purpose.

특정 실시예에서, 언어 모델은, 정확한 것으로 입증된 번역을 이용하여 훈련된다. 트라이그램 모델은 POS 태그에 대해 만들어질 수 있다.In certain embodiments, language models are trained using translations that have been proven to be accurate. Trigram models can be built for POS tags.

임의의 크기의 n-그램이 갖는 내재된 문제는, 모든 가능한 가설의 부족이다. 이러한 경우에, n-1-그램 및 n-2-그램이 식별되는 경우에 백오프 방법이 뒤따른다. 예를 들어, {DT, NP, JJ}와 같은, 알려지지 않은 POS 트라이그램이 발견되면, 모델은, 바이그램(bigram) {DT, NP} 및 {NP, JJ}가 문법성을 나타내는 언어 모델 점수를 갖는지를 보기 위해 소급적으로 체크할 수 있다. 그 바이그램이 확률을 갖지 않으면(또는 확률이 매우 낮으면), 다른 백오프가 {DT}, {NT}, {JJ}에 대한 유니그램 언어 모델 점수를 체크하기 위해 행해질 수 있다. 모든 세 개의 모델은 또한, 트라이그램 모델에 더 많은 가중치를 두고, 바이그램 모델에는 보다 적은 가중치를 두며, 그리고 유니그램 모델에는 훨씬 더 적은 가중치를 두는 선형 보간법에 의해 그룹핑될 수 있다. The inherent problem with n-grams of any size is the lack of all possible hypotheses. In this case, a back-off method follows when n-1-gram and n-2-gram are identified. For example, if an unknown POS trigram is found, such as {DT, NP, JJ}, the model will score the language model scores in which the bigrams {DT, NP} and {NP, JJ} represent grammar. You can check retrospectively to see if you have it. If the bigram has no probability (or if the probability is very low), another backoff can be done to check the unigram language model score for {DT}, {NT}, {JJ}. All three models can also be grouped by linear interpolation with more weight on the trigram model, less weight on the bigram model, and much less weight on the unigram model.

채팅 영어에 대해 POS 태그가 가장 자주 발생하는 트라이그램은, 다음과 같다: <IN> <VBD> <PP> = 1.0; <FW> <NN> <IN> = 1.0; <DT> <WP> <VBP> = 1.0; 및 <RB> </S1> </S2> = 1.0. 1.0의 점수는, 특정 트라이그램 시퀀스가 그것이 발생할 때 항상 문법적이라는 100% 확률을 나타낸다. 일반적으로, 채팅 언어는, “Wh”로 시작하고 동사가 뒤따르는 더 많은 대명사를 사용하고, 이는 <DT> <WP> <VBP>로 캡쳐된다. 사람들은 또한, “You are cool” 또는 “This is awesome”에서와 같이, 채팅 언어를 부사 또는 형용사로 끝내는 경향이 있고, 이는 <RB> </S1> </S2>로 캡쳐되고, S1 및 S2는 문장 태그들의 끝이다. 따라서 이러한 트라이그램들은, 각 언어에서 근본적인 문법의 적어도 일부 구조를 인식하는데 사용될 수 있다. 언어 모델은 관심이 있는 모든 언어에 대해 만들어질 수 있고, 그 모델들은, JSON 포맷으로 저장될 수 있으며, 어느 때고 즉시 로딩될 수 있다.The trigrams where POS tags occur most often for chat English are: <IN> <VBD> <PP> = 1.0; <FW> <NN> <IN> = 1.0; <DT> <WP> <VBP> = 1.0; And <RB> </ S1> </ S2> = 1.0. A score of 1.0 indicates a 100% probability that a particular trigram sequence is always grammatical when it occurs. Generally, the chat language uses more pronouns that start with “Wh” and are followed by verbs, which are captured as <DT> <WP> <VBP>. People also tend to end the chat language as an adverb or adjective, as in “You are cool” or “This is awesome”, which is captured as <RB> </ S1> </ S2> and S1 and S2 Is the end of the sentence tags. Thus, these trigrams can be used to recognize at least some structure of the underlying grammar in each language. Language models can be created for any language of interest, and the models can be stored in JSON format and loaded immediately at any time.

일부 구현에서, 메시지가 POS 태그된 후에, 시스템 및 방법은, 현존하는 트라이그램 언어 모델에 매칭되는 메시지에서 트라이그램의 개수를 카운트할 수 있다. 모델에서 각 트라이그램은, 그것과 연관된 확률 점수를 가질 수 있고, 메시지에서의 각 트라이그램에 대해, 트라이그램의 대응하는 점수는, 그 모델에서 식별된다. 일부 예에서, 점수가 임계값(예, 0.05)보다 크면, 트라이그램은 매칭되는 것으로 카운트된다. 그렇지 않으면, 트라이그램은 매칭되는 것으로 카운트되지 않는다. 시스템 및 방법은, 문장에서 전체 트라이그램 개수에 대한 매칭되는 트라이그램의 개수를 계산할 수 있고, 이 비율은 번역 정확도를 평가하기 위한 실제 가치화된 특징으로서 사용될 수 있다. 예를 들어, 문법적 텍스트에서 자주 발생할 수 있는 {easy, to, use}의 트라이그램은, 약 0.68의 높은 확률 점수(언어 모델 점수)를 가질 수 있다. 그에 반하여, {easy, use, to}의 비문법적 트라이그램은, 약 0.04의 더 적은 발생 확률을 가질 수 있다. 0.05의 정의된 임계치와 비교될 때, 비문법적 트라이그램은, 걸러질 수 있고, 임계치를 넘는 트라이그램의 개수는, 텍스트의 문법성에 대한 양적인 가치를 제공할 수 있다.In some implementations, after the message is POS tagged, the system and method can count the number of trigrams in the message matching the existing trigram language model. Each trigram in the model can have a probability score associated with it, and for each trigram in the message, the corresponding score of the trigram is identified in the model. In some examples, if the score is greater than a threshold (eg, 0.05), the trigram is counted as a match. Otherwise, the trigram is not counted as a match. The system and method can calculate the number of matching trigrams for the total number of trigrams in a sentence, and this ratio can be used as an actual valued feature to evaluate translation accuracy. For example, a trigram of {easy, to, use} that can occur frequently in grammatical text may have a high probability score (language model score) of about 0.68. In contrast, {easy, use, to} inscriptional trigrams may have a lower probability of occurrence of about 0.04. When compared to a defined threshold of 0.05, non-grammatical trigrams can be filtered out, and the number of trigrams above the threshold can provide a quantitative value for the grammar of the text.

POS 태그된 메시지를 얻은 후에, 문장의 확률은, 트라이그램 언어 모델을 이용하여 계산될 수 있다. 일부 실시예에서, 문장에서의 각 트라이그램의 로그 확률이 결정되고, 로그 확률의 합이 계산된다. 그리고 나서, 이 합은, 문장에 대한 점수를 얻기 위해, 문장에서의 단어들의 개수로 나누어진다. 그 점수는, 번역 정확도를 평가하기 위한 실제 가치화된 특징으로서 사용될 수 있다. 트라이그램의 로그 확률은, 바람직하게 트라이그램, 바이그램 및 유니그램 확률들의 선형 보간법으로 계산된다. 예를 들어, “The device is easy to use” 문장에서, POS 태그된 출력은 “The_DT device_NP is_VBZ easy_JJ to_TO use_VB”이다. 그 문장에 대한 POS 기반 트라이그램은, {DT, NP, VBZ}, {NP, VBZ, JJ}, {VBZ, JJ, TO}, 및 {JJ, TO, VB}이다. 이 트라이그램의 각각은, 특정 언어 말뭉치에서 발생 확률을 갖는다. 각 트라이그램에 대한 확률들이 0.12, 0.44, 0.32, 및 0.89라고 가정하면, 그 문장의 문법성에 대한 결합된 점수는, 로그 확률로 계산될 수 있다. 이 경우에서, 로그 확률은, log(0.12) + log(0.44) + log(0.32) + log(0.89)로 계산되어 -1.82와 같다. 상대 스케일로, 로그 확률은, 문장의 문법성의 수치화된 표시를 제공한다. After obtaining a POS tagged message, the probability of a sentence can be calculated using a trigram language model. In some embodiments, the log probability of each trigram in the sentence is determined, and the sum of the log probabilities is calculated. Then, this sum is divided by the number of words in the sentence to get a score for the sentence. The score can be used as a real valued feature to evaluate translation accuracy. The logarithmic probability of the trigram is preferably calculated by linear interpolation of trigram, bigram and unigram probabilities. For example, in the sentence “The device is easy to use”, the POS tagged output is “The_DT device_NP is_VBZ easy_JJ to_TO use_VB”. The POS-based trigrams for the sentence are {DT, NP, VBZ}, {NP, VBZ, JJ}, {VBZ, JJ, TO}, and {JJ, TO, VB}. Each of these trigrams has a probability of occurrence in a particular language corpus. Assuming that the probabilities for each trigram are 0.12, 0.44, 0.32, and 0.89, the combined score for the grammar of the sentence can be calculated as a logarithmic probability. In this case, the log probability is equal to -1.82, calculated as log (0.12) + log (0.44) + log (0.32) + log (0.89). On a relative scale, the logarithmic probability provides a numerical representation of the grammar of a sentence.

상술한 단어 기반 및 언어 기반 특징에 추가하여 또는 대신하여, 번역 정확도 모듈(3800)은, 원본 메시지와 번역 사이에서 단어의 배치를 평하기 위한 단어 배치 모듈(3806)을 사용할 수 있다. 잠재적인 단어 배치 쌍의 리스트를 생성하기 위해, 번역 정확도 체크 알고리즘은, 데이터베이스에서 단독으로 단어 기반 특징으로 실행되고, 병렬 말뭉치는, 확률값 > 0.90을 갖는 번역 쌍으로 추출된다. 이는 병렬 말뭉치를 만들기 위해 오직 품질이 좋은 번역 메시지 쌍이 필터링되는 것을 나타낸다. 100K 문장들이, 영어 및 불어 쌍 그리고 영어 및 스페인어 쌍을 위해 수집된다. 이 100K 문장들에 대한 병렬 말뭉치는, 단어 배치를 추출하기 위해 통계적 기계 번역 툴킷(즉, GIZA++)에 공급된다. 그 툴킷은, 대략 25-30K 단어 배치 쌍 및 이 쌍들에 대한 연관된 확률 점수를 추출한다. In addition to or instead of the word-based and language-based features described above, the translation accuracy module 3800 can use the word placement module 3806 to evaluate the placement of words between the original message and the translation. To generate a list of potential word placement pairs, the translation accuracy check algorithm is run with word-based features alone in the database, and the parallel corpus is extracted as translation pairs with probability> 0.90. This indicates that only good quality translation message pairs are filtered to create a parallel corpus. 100K sentences are collected for English and French pairs and English and Spanish pairs. The parallel corpus for these 100K sentences is supplied to a statistical machine translation toolkit (ie GIZA ++) to extract word placement. The toolkit extracts approximately 25-30K word placement pairs and associated probability scores for these pairs.

일반적으로 단어당 다중 단어 배치(alignment)가 있다는 것을 고려하면, 특정 임계치보다 더 큰 확률 점수를 갖는 배치를 선택하는 것이 이로울 수 있다. 그러나, 임계치에도 불구하고, 단어당 하나 이상의 배치가 얻어질 수 있고, 그것의 대부분은 전형적으로 철자 오류 또는 다른 시제(예, 과거 시제 또는 미래 시제)로 된 동일한 단어에 기인한다. 영어 및 불어 간의 샘플 단어 배치들이 표3에 도시된다. 구분된 단어 배치는, 두 언어의 두 가능한 순서들에 대해 얻어질 수 있다(예를 들어, 영어 대 불어 및 불어 대 영어). 따라서, 두 단어 배치 파일은, 언어 쌍마다 추출될 수 있다. 소스 언어(즉, 원본 메시지의 언어) 대 타겟 언어(즉, 번역의 언어)에 대한 단어 배치는, 소스 배치로서 참조될 수 있고, 반면 타겟 언어 대 소스 언어에 대한 단어 배치는, 타겟 배치로서 참조될 수 있다. Considering that there are generally multiple word alignments per word, it may be beneficial to select a placement with a probability score greater than a certain threshold. However, despite the threshold, more than one batch per word can be obtained, most of which are typically due to spelling errors or the same word in different tenses (eg past tense or future tense). Sample word assignments between English and French are shown in Table 3. Separate word placement can be obtained for two possible sequences of two languages (eg English vs. French and French vs. English). Thus, two word batch files can be extracted for each language pair. The word placement for the source language (i.e., the language of the original message) versus the target language (i.e., the language of translation) can be referred to as source placement, while the word placement for the target language vs. source language is referred to as target placement Can be.

영어와 불어 간의 단어 배치의 예Example of word placement between English and French 영어English 불어French TroopsTroops troupes troupetroupes troupe AttacksAttacks attaques attaqueattaques attaques SmokeSmoke fumee fume fumez fumeefumee fume fumez fumee

소스 배치 및 타겟 배치는, 두 개의 구분된 파일로 로딩될 수 있다. 소스 배치에 또한 존재하는 원본 메시지에서의 각 단어에 대해, 시스템 및 방법은, 적어도 하나의 대응하는 단어가, 번역에 존재하는지 그리고 또한 타겟 배치에 존재하는지 결정한다. 그리고 나서, 동일한 프로세스가 번역 문장에 적용된다. 마지막으로, 단어 배치 파일에서 발견되는 단어들의 퍼센트는, 번역 정확도를 평가하기 위한 특징으로서 회신된다.Source batch and target batch can be loaded into two separate files. For each word in the original message that is also present in the source batch, the system and method determine whether at least one corresponding word is present in the translation and also in the target batch. Then, the same process is applied to the translated sentences. Finally, the percentage of words found in the word batch file is returned as a feature to evaluate translation accuracy.

특정 구현에서, 단어 배치는, 영어를 언어의 하나로서 포함하는 언어 쌍에 대해 추출된다. 예를 들어, 언어 쌍은, 스페인어, 불어, 폴란드어, 포르투갈어, 네덜란드어, 독일어, 덴마크어, 스웨덴어, 터키어, 이탈리아어 및 러시아어 중 하나와 결합되는 영어일 수 있고, 총 11 개의 언어 쌍 및 11 x 2 = 22 개의 단어 배치 파일이다. 영어를 포함하지 않는 언어 쌍(예, 불어를 러시아어로 번역)에 대해, 영어는 배치를 추출하기 위한 중간 언어로서 사용될 수 있다. 예를 들어, 불어에서 러시아어로의 번역이 검증될 때, 불어 대 영어는, 원본 메시지에서 단어 배치를 추출하는데 사용될 수 있고, 러시아어 대 영어는, 번역에서 단어 배치를 추출하는데 사용될 수 있다. 이 두 세트의 교차지점은, 두 메시지에서의 단어 배치 중의 매칭되는 것의 퍼센트를 제공한다. 이 방법은, 모든 언어 쌍으로 확장될 수 있다. In certain implementations, word placement is extracted for language pairs that include English as one of the languages. For example, the language pair can be English combined with one of Spanish, French, Polish, Portuguese, Dutch, German, Danish, Swedish, Turkish, Italian, and Russian, with a total of 11 language pairs and 11 x 2 = 22 Word batch file. For language pairs that do not contain English (eg, French to Russian translation), English can be used as an intermediate language to extract batches. For example, when a French-to-Russian translation is verified, French-to-English can be used to extract word placement from the original message, and Russian-to-English can be used to extract word placement from the translation. The intersection of these two sets gives the percentage of matches in word placement in both messages. This method can be extended to any language pair.

다양한 실시예에서, 사용자 신뢰도는, 사용자의 번역 제출이 신뢰할 수 있는지 여부의 표시를 제공하기 위해 계산된다. 여기에 설명된 다른 접근법 이외에, 사용자는 번역 정정들의 사용자의 히스토리에 기초하여 거의 신뢰될 수 있다. 정정된 번역의 개수 및 보상된 번역의 개수는, 다양한 사용자에 대해 데이터 테이블에 저장될 수 있다. 이는 플레이어에 의해 행해진 정정의 퍼센트를 계산할 수 있게 하고, 번역 정정이, 특정 사용자의 신뢰도 임계치에 기초하여, 적어도 어느 정도는, 승인될 수 있다. 이는 번역을 정정하는 진정한 플레이어로부터 스패머를 쉽게 구분할 수 있게 한다. In various embodiments, user confidence is calculated to provide an indication of whether the user's translation submission is trustworthy. In addition to the other approaches described herein, the user can be almost trusted based on the user's history of translation corrections. The number of corrected translations and the number of compensated translations can be stored in a data table for various users. This allows the percentage of corrections made by the player to be calculated, and translation corrections can be approved, at least to a certain extent, based on a particular user's confidence threshold. This makes it easy to distinguish spammers from true players who correct translations.

도 39는, 특정 실시예에 따른, 번역 오류를 정정하는 방법(3900)의 순서도이다. 방법(3900)은, 온라인 게임의 복수의 사용자에게 텍스트 메시지 채팅 시스템을 제공하는 단계(단계 3902)를 포함한다. 원본 텍스트 메시지는, 온라인 게임의 제 1사용자로부터 제 1 언어로 수신된다(단계 3904). 초기 번역이 원본 텍스트 메시지의 제 2 언어로 생성된다 (단계 3906). 원본 텍스트 메시지 및 초기 번역은, 온라인 게임의 제 2 사용자에게 제공된다(단계 3908). 예를 들어, 제 2 사용자는, 클라이언트 디바이스의 디스플레이에서, 두 개의 번역을 함께 또는 구분하여 볼 수 있다. 초기 번역에서의 오류를 제기하기 위한 번역 정정이 제 2 사용자로부터 수신된다(단계 3910). 방법(3900)은 또한 복수의 번역 정정(제 2 사용자로부터의 번역 정정을 포함)으로부터 가장 정확한 번역 정정을 식별하는 단계(단계 3912)를 포함할 수 있다. 대안으로 또는 추가적으로, 방법(3900)은, 단어 기반 특징, 언어 기반 특징 및 단어 배치 특징 중 적어도 하나를 이용하여, 제 2 사용자로부터의 번역 정정의 정확도를 평가하는 단계(단계 3914)를 포함한다. 39 is a flow diagram of a method 3900 for correcting translation errors, according to certain embodiments. Method 3900 includes providing a text message chat system to a plurality of users of the online game (step 3902). The original text message is received in the first language from the first user of the online game (step 3904). An initial translation is created in the second language of the original text message (step 3906). The original text message and initial translation are provided to the second user of the online game (step 3908). For example, the second user, on the display of the client device, can view the two translations together or separately. Translation correction to raise an error in the initial translation is received from the second user (step 3910). The method 3900 can also include identifying the most accurate translation correction (step 3912) from a plurality of translation corrections (including translation corrections from a second user). Alternatively or additionally, the method 3900 includes evaluating the accuracy of translation correction from a second user (step 3914) using at least one of word-based features, language-based features, and word placement features.

상술한 단어 기반 및 언어 기반 특징(예, 4 개의 단어 기반 특징 및 4 개의 언어 기반 특징)은, 선형 회귀 모델을 이용하여 맞춰질 수 있다. 훈련 후에, 그 모델은 바람직하게 각 번역 쌍에 대한 실제 가치화된 수를 회신하고, 임계값은, 각 번역 쌍이 좋은지 또는 나쁜지를 분류하는데 사용될 수 있다. 예를 들어, 특징들이, x1, x2, x3, ... x8의 수치 값을 회신하면, 회귀 방정식은, y = a1*x1 + a2*x2 + . . . + a8*x8이 될 수 있고, 이때 a1, a2, . . . a8은, 선형 회귀 방정식을 모델링함으로써 얻어지는 계수이고, y는 출력 값이다. 임계값 y에 대한 바람직한 값은, ROC 곡선을 이용하여, 정밀도(precision) 및 재현율(recall)을 갖는 실험 후에 0.65인 것으로 확인되었다.The above-described word-based and language-based features (eg, four word-based features and four language-based features) can be fitted using a linear regression model. After training, the model preferably returns the actual valued number for each translation pair, and a threshold can be used to classify whether each translation pair is good or bad. For example, if the features return a numeric value of x1, x2, x3, ... x8, then the regression equation is y = a1 * x1 + a2 * x2 +. . . + a8 * x8, where a1, a2,. . . a8 is a coefficient obtained by modeling a linear regression equation, and y is an output value. The preferred value for the threshold y was found to be 0.65 after the experiment with precision and recall using the ROC curve.

상술한 단어 배치 기반 특징을 추가하고, 단어 기반 및 언어 기반 특징들에 대한 선형 회귀를 재실행한 후에, 바람직한 임계치는, 0.76으로 변한다. 단어 배치 기반 특징의 추가는 또한, AUC 값을 0.853에서 0.976으로 증가시킨다. After adding the word placement-based features described above and re-running linear regression for word-based and language-based features, the preferred threshold is changed to 0.76. The addition of word placement based features also increases the AUC value from 0.853 to 0.976.

표4는, 단어 기반 특징, 언어 기반 특징 및 단어 배치 특징의 각각에 대한 13k 문장을 훈련하여 얻어지는 회귀 계수를 나타낸다. 표의 결과는, 이모지 특징 및 문자 카운트는, 정규화 후에, 작은 회귀 계수를 갖는 것을 나타내고, 이는 이러한 특징들이 번역의 정확도(accuracy) 평가에 작은 기여를 하는 것으로 확인되었음을 의미한다. Table 4 shows the regression coefficients obtained by training 13k sentences for each of the word-based features, language-based features, and word placement features. The results in the table indicate that the emoji feature and character count, after normalization, have a small regression coefficient, which means that these features have been found to make a small contribution to the accuracy of the translation.

번역 정확도 특징들에 대한 회귀의 계수Coefficient of regression for translation accuracy features 특징(Featur)Features (Featur) 계수(Coefficient)Coefficient 숫자(Numbers)Numbers 5.39E-015.39E-01 이모들(Emojis)Emojis 6.99E-206.99E-20 문자 카운트(Character Counts)Character Counts 1.76E-171.76E-17 단어 카운트(Word Counts)Word Counts 2.07E-012.07E-01 카운트<VB>Count <VB> 9.55E-029.55E-02 트라이그램 매칭(Trigram Match)Trigram Match 1.67E-021.67E-02 트라이그램 프로브(Trigram Probs)Trigram Probs -2.24E-02-2.24E-02 번역되지 않은 명사들Untranslated nouns 4.18E-024.18E-02 단어 배치 매칭(Word Alignment Match)Word Alignment Match 4.48E-014.48E-01

회귀 모델은, 불어 대 영어 및 스페인어 대 영어 쌍의 13k 문장에 대한 10-fold 교차 검증(cross validation)에 의해 평가될 수 있다. 데이터에 대한 금본위 라벨(gold standard label)은, 일부 인간 감독이 있는 BING 번역을 이용하여 계산된다. 표5는, 정밀도(precision), 재현율(recall), 정확도(accuracy), 및 평가에서 퍼센트로 나타낸 F-측정 값(measure value)을 나타낸다. 정밀도(precision)는, 우리 방법에 의해 진실(true)인 것으로 마킹된 모든 번역 쌍에 대한 TP(True Positive) 번역 쌍의 비율을 나타낸다. 재현율(recall)은, 진실 쌍(true pair)들로 알려진 모든 번역 쌍들에 대한 TR(True Positive) 번역 쌍들의 비율이다. 정확도(accuracy)는, 전체 테스트 세트 크기에 대한, TP(True Positive) 및 TN(True Negative)로 마킹된 결과의 합의 비율이다. F-측정(measure)은, 정밀도(precision) 및 재현율(recall)의 조화 평균(harmonic mean)이다. 이러한 메트릭스는, 시스템 성능(performance) 및 신뢰도(reliability)를 측정하기 위해 분류 태스크에서 사용될 수 있다. 13k 문장은, 훈련 세트로서 사용될 수 있고, 400 손으로 주석이 달린(hand-annotated) 문장은, 테스트를 위해 사용된다. 테스트 세트는 작기 때문에, 테스트를 위한 개수는, 13k 문장들에 대한 개수보다 작다. The regression model can be evaluated by 10-fold cross validation for 13k sentences of French vs. English and Spanish vs. English pairs. The gold standard label for the data is calculated using some human supervised BING translation. Table 5 shows the precision, recall, accuracy, and F-measure values expressed as percentages in the evaluation. Precision represents the ratio of True Positive (TP) translation pairs to all Translation Pairs marked as true by our method. Recall is the ratio of True Positive (TR) translation pairs to all Translation pairs known as true pairs. Accuracy is the ratio of the sum of the results marked as True Positive (TP) and True Negative (TN) to the total test set size. F-measure is the harmonic mean of precision and recall. These metrics can be used in classification tasks to measure system performance and reliability. 13k sentences can be used as a training set, and 400 hand-annotated sentences are used for testing. Since the test set is small, the number for testing is smaller than the number for 13k sentences.

13,000 문장으로 번역 정확도(accuracy) 연구 결과Translation accuracy study results in 13,000 sentences 측정(Measure)Measurement 13k 문장들
(10 fold cv)13k sentences
(10 fold cv) 400 문장들400 sentences 13k 문장들
(단어 기반 특징들)13k sentences
(Word-based features) 정밀도(Precision)Precision 9090 7777 88.288.2 재현율(Recall)Recall 79.379.3 7272 79.379.3 F-측정(Measure)F-Measure 84.3184.31 74.4174.41 83.5683.56 정확도(Accuracy)Accuracy 97.397.3 9696 9797

표6은, 13k 문장 데이터 세트에 대한 10-fold 교차 검증의 결과를 나타낸다. 이때, W는 단어 기반 특징의 사용, L은 언어 기반 특징의 사용, 그리고 A는 단어 배치 특징의 사용을 의미한다. 결과는, 단어 기반 특징은, 정밀도(precision) 향상에 도움을 주고, 단어 배치 특징은, 재현율(recall) 향상에 상당히 도움을 준다. 단어 기반 특징은, 정밀도(precision) 및 재현율(recall) 모두에 대해 작은 신장만 준다. 일반적으로 재현율(recall)은, 나쁜 결정이 전체 데이터 세트에서 얼마나 정확하게 검출되는지를 나타낸다. 표의 결과는, 단어 배치 기반 특징의 추가는 재현율(recall)을 향상시키는 것을 보여준다. 정밀도(precision)는, 시스템에 의해 정확한 것으로 마킹된 전체 번역에서 좋은 번역이 얼마나 정확하게 예측되는지를 나타낸다. 표의 결과는, 단어 기반 특징이 정밀도(precision)를 향상시키는 것을 보여준다. Table 6 shows the results of the 10-fold cross validation for the 13k sentence data set. At this time, W means the use of word-based features, L means the use of language-based features, and A means the use of word-based features. The result is that word-based features help improve precision, and word-position features help significantly improve recall. The word-based feature gives only a small elongation for both precision and recall. Recall, in general, indicates how accurately bad decisions are detected in the entire data set. The results in the table show that adding word placement based features improves recall. Precision indicates how accurately a good translation is predicted in the entire translation marked as correct by the system. The results in the table show that word-based features improve precision.

13,000 문장 데이터 세트에 대한 10-fold 교차 검증의 결과Results of 10-fold cross validation on 13,000 sentence data sets 방법Way 정밀도(Precision)Precision 재현율(Recall)Recall F-측정(Measure)F-Measure 정확도(Accuracy)Accuracy WW 88.288.2 79.379.3 83.5683.56 9797 LL 51.951.9 26.626.6 35.1735.17 92.592.5 AA 5555 96.796.7 70.1170.11 93.993.9 WLWL 9090 79.379.3 84.5684.56 97.397.3 WAWA 80.580.5 96.696.6 87.8187.81 97.797.7 LALA 57.857.8 96.696.6 72.3272.32 94.294.2 WLAWLA 80.780.7 96.896.8 88.0188.01 97.797.7

표7은, 특정 실시예에 따른, 다른 기계 알고리즘으로 다양한 단어 기반, 언어 기반, 및 단어 배치 특징을 조정한 결과를 나타낸다. 지금까지의 결과는, 다른 특징들을 함께 묶기 위한 선형 회귀 기술을 이용한 것을 설명하였다. 기계 학습 알고리즘은, 개별 부분보다 더 좋은 앙상블 결과를 생산하기 위한 변수(문맥에서의 특징들)를 함께 묶기 위해 사용될 수 있도록 존재한다. 선형 회귀는, 변수들을 결합하기 위한 1차원적 방법을 제시한다. 변수들을 결합하기 위한 2차원 및 다차원 방법이 기계 학습 알고리즘 문헌에 존재한다. 이러한 알고리즘은, 좋은 번역 쌍들을 예측하는 과제에서 사용되는 특징들을 결합하는 더 최적의 방법을 찾기 위해 채용된다. Table 7 shows the results of adjusting various word-based, language-based, and word-positioning features with different machine algorithms, according to certain embodiments. The results so far have demonstrated the use of a linear regression technique to group different features together. Machine learning algorithms exist so that they can be used to group together variables (features in context) to produce better ensemble results than individual parts. Linear regression presents a one-dimensional method for combining variables. Two-dimensional and multidimensional methods for combining variables exist in the machine learning algorithm literature. This algorithm is employed to find a more optimal way to combine the features used in the task of predicting good translation pairs.

표7의 결과는, 다양한 기계 학습 알고리즘으로 특징들을 결합함으로써 얻어진다. 사용된 데이터 세트는, 13k 문장을 포함하고, 파라미터들은 gridSearch 알고리즘을 이용하여 조정된다. 표에 리스트된 방법으로부터, 그래디언트 부스팅 분류기(gradient boosting classifier) 및 랜덤 포레스트(random forest) 방법은, 앙상블 기반 방법이고, 이러한 방법이 더 좋은 결과를 제공하는 이유를 설명한다. 그래디언트 부스팅 머신(GBM : Gradient Boosting Machines) 및 랜덤 포레스트는, 비록 GBM은 훈련을 위해 더 오랜 시간을 소요하지만, 매우 좋은 결과를 제공한다. 그러나, 그 모델은 오직 한번 훈련되는 것을 필요로 하기 때문에, 훈련 시간은 대체로 상관이 없다. The results in Table 7 are obtained by combining features with various machine learning algorithms. The data set used contains 13k sentences, and the parameters are adjusted using the gridSearch algorithm. From the methods listed in the table, the gradient boosting classifier and the random forest method are ensemble-based methods and explain why these methods provide better results. Gradient Boosting Machines (GBMs) and random forests, although GBMs take longer to train, provide very good results. However, since the model only needs to be trained once, the training time is largely irrelevant.

기계 알고리즘들을 갖는 번역 정확도 특징들을 추정(fitting)한 결과Fitting results of translation accuracy features with machine algorithms ML 알고리즘ML algorithm 파라미터
(Params)parameter
(Params) 정밀도
(Precision)Precision
(Precision) 재현율
(Recall)Reproducibility
(Recall) 정확도
(Accuracy)accuracy
(Accuracy) 선형 회귀
(Linear Regression)Linear regression
(Linear Regression) 최소 제곱
(Least Squares)Least squares
(Least Squares) 80.780.7 96.896.8 97.897.8 퍼셉트론
(Perceptron)Perceptron
(Perceptron) 98.898.8 74.374.3 97.897.8 리즈 분류기
(Ridge Classifier)Leeds sorter
(Ridge Classifier) 96.196.1 71.671.6 97.397.3 가우시안 네이브 베이즈(Gaussian Naive Bayes)Gaussian Naive Bayes 80.780.7 90.390.3 97.497.4 결정 트리
(Decision Tree)Crystal tree
(Decision Tree) 91.391.3 90.990.9 98.298.2 로지스틱 회귀
(logistic Regression)Logistic regression
(logistic Regression) L1 norm, C= 100, tol = 0.001L1 norm, C = 100, tol = 0.001 92.192.1 91.491.4 98.498.4 SVMSVM C= 1000, gamma = 0.001C = 1000, gamma = 0.001 91.391.3 92.892.8 98.598.5 그래디언트 부스팅 머신(Gradient Boosting Machines)Gradient Boosting Machines n_estimators=100 n_estimators = 100 92.492.4 9393 98.898.8 랜덤 포레스트
(Random Forests)Random forest
(Random Forests) n_estimators=10 n_estimators = 10 93.393.3 92.892.8 98.898.8

최종 번역 정확도 체크 알고리즘은, 3045 영어-불어 문장 쌍의 지속된 테스트 세트에 대해 테스트된다. 결과들이 표8에 보여진다. WLAplain 특징들은, 단어 배치가, 분명한 언어 데이터베이스로부터 추출된다는 사실 때문에, 좋지 않게 수행한다. 메시지들은, 일련의 변환 이후에 수정되므로, 결과에서의 급락이 발견될 수 있다. 단어 배치는, 채팅용어 데이터베이스로부터 추출되고, 더 적은 문장들이 특징들에서 0 점수를 피하기 위해 일부 평활화(smoothing)가 완료된다. WLA는, 채팅용어 데이터베이스에 대한 결과를 보여준다. WLA 특징들을 갖는 나쁜 정밀도(precision)는, 많은 정확한 번역이 거절되는 것을 나타낸다. 그 특징들은, 랜섬 포레스트가 결과를 과하게 추정(overfit)하기 때문에, 선형 회귀로 추정된다. 일반적으로, 이러한 결과는, 알고리즘에 사용되는 최종 세트를 선택하는 기초로 사용된다. 더 높은 정확도(accuracy)가 일반적으로 바람직하지만, 더 높은 정밀도(precision)가 사용자에 의해 입력된 더 진실한 번역 쌍을 보상하고, 높은 재현율(recall)이 미분류된 더 적은 틀린 엔트리들을 정확한 것으로 보장한다. WLA 특징 세트는, 표의 결과에 따라, 바람직하다. 0.75의 임계치는, 더 높은 재현율(recall)에 대해 선택될 수 있고(예를 들어, 마킹된 최소한의 가능한 악의적인 엔트리들을 정확한 것으로 얻기 위해), 0.68의 임계치는, 더 높은 정밀도(precision)를 바랄 때의 경우에 선택될 수 있다.The final translation accuracy check algorithm is tested against a sustained test set of 3045 English-French sentence pairs. Results are shown in Table 8. WLAplain features perform poorly due to the fact that word placement is extracted from a clear language database. Messages are modified after a series of transformations, so a drop in results can be found. The word placement is extracted from the chat term database, and fewer sentences are done with some smoothing to avoid zero scores in the features. WLA shows the results for the chat term database. Bad precision with WLA features indicates that many exact translations are rejected. The features are estimated with linear regression, as the ransom forest overfits the results. Generally, these results are used as the basis for selecting the final set used in the algorithm. Higher accuracy is generally desirable, but higher precision compensates for more true pairs of translations entered by the user, and high recall ensures fewer unsorted and incorrect entries are correct. The WLA feature set is preferred, according to the results in the table. A threshold of 0.75 can be selected for a higher recall (e.g., to get the least possible malicious entries marked as correct), and a threshold of 0.68, hopes for higher precision. It can be selected in some cases.

번역 정확도 체킹 알고리즘의 결과들Results of the translation accuracy checking algorithm 특징Characteristic 임계
(Threshold)Critical
(Threshold) 정밀도
(precision)Precision
(precision) 재현율
(recall)Reproducibility
(recall) F-측정F-measure 정확도
(Accuracy)accuracy
(Accuracy) ROC 아래 영역Area under ROC WLWL 0.750.75 8585 5959 69.6569.65 9696 0.7230.723 WLAplainWLAplain 0.750.75 1919 7979 30.6330.63 8080 0.6310.631 WLAWLA 0.750.75 6565 9393 76.5176.51 9595 0.8830.883 WLAWLA 0.680.68 7676 8585 80.2480.24 9696 0.8530.853

본 발명이 특정 바람직한 실시예에 관해 특별히 보여지고 설명되는 동안, 첨부된 청구항들에 의해 정의된 발명의 사상 및 범주를 벗어나지 않는 범위 내에서 세부 사항이 만들어질 수 있고, 다양한 변경이 가능함은 당업자에게 이해될 수 있다.While the present invention has been specifically shown and described with respect to certain preferred embodiments, details may be made within the scope and spirit of the invention as defined by the appended claims, and various modifications may be made to those skilled in the art. Can be understood.

102 : 클라이언트
104 : 채팅 클라이언트 시스템
106 : 컴퓨터 네트워크
108 : 채팅 서버
112 : 채팅 호스트 시스템
114 : 통신 변환 및 번역(CTT) 시스템
110 : 번역 서버
116 : 번역 모듈102: client
104: chat client system
106: computer network
108: chat server
112: chat host system
114: Communication Conversion and Translation (CTT) System
110: translation server
116: translation module

Claims

A method executed by one or more computer processors, the method comprising:
Obtaining an original text message in a first language written by the first user;
Obtaining a translated text message in a second language of the original text message;
Comparing the plurality of features associated with the original text message with each of the plurality of features associated with the translated text message, wherein the plurality of features include (i) word count, character count, emoji, Comprising at least one word-based feature selected from the group consisting of numbers and punctuation marks, and (ii) at least one language-based feature comprising parts of speech selected from the group consisting of verbs, nouns, adverbs and adjectives; And
Calculating a metric based on the comparison of the features, and determining the accuracy of the translated text message based on the calculated metric.

According to claim 1,
The translated text message is written by a second user.

According to claim 1,
The translated text message includes a correction of a previous translated text message of the original text message.

According to claim 1,
Comparing the plurality of features,
And determining a difference in the at least one word-based feature between the original text message and the translated text message.

According to claim 1,
Comparing the plurality of features,
And determining a difference in the number of parts of speech between the original text message and the translated text message.

According to claim 1,
The step of calculating the metric,
And determining the placement of at least one word in the original text message and at least one respective word in the translated text message.

According to claim 1,
The metric is,
A method further based on the grammar of the translated text message.

According to claim 1,
The step of calculating the metric,
Generating a part-of-speech n-gram representation of the translated text message; And
And calculating a probability of an n-gram representation for the second language.

According to claim 1,
The step of calculating the metric,
And fitting the features into a linear regression model.

According to claim 1,
The step of calculating the metric,
And fitting the features with one or more machine learning algorithms.

In the system,
A non-transitory computer-readable medium on which instructions are stored; And
At least one processor configured to execute the instructions,
The at least one processor,
Obtain an original text message in a first language written by the first user;
Obtain a translated text message in a second language of the original text message;
Compare the plurality of features associated with the original text message with each of the plurality of features associated with the translated text message, wherein the plurality of features include: (i) word count, character count, emoji, number and At least one word-based feature selected from the group consisting of punctuation marks, and (ii) at least one language-based feature comprising parts of speech selected from the group consisting of verbs, nouns, adverbs and adjectives; And
A system for performing an operation that calculates a metric based on a comparison of the features and determines the accuracy of the translated text message based on the calculated metric.

The method of claim 11,
The translated text message is written by a second user.

The method of claim 11,
And the translated text message comprises correction of a previous translated text message of the original text message.

The method of claim 11,
Comparing the plurality of features,
And determining a difference in the at least one word-based feature between the original text message and the translated text message.

The method of claim 11,
Comparing the plurality of features,
And determining a difference in the number of parts of speech between the original text message and the translated text message.

The method of claim 11,
The operation of calculating the metric,
And determining the placement of at least one word in the original text message and at least one respective word in the translated text message.

The method of claim 11,
The metric is,
A system further based on the grammar of the translated text message.

The method of claim 11,
The operation of calculating the metric,
Generate a part-of-speech n-gram representation of the translated text message; And
And calculating a probability of an n-gram representation for the second language.

The method of claim 11,
The operation of calculating the metric,
A system comprising fitting the features into a linear regression model.

The method of claim 11,
The operation of calculating the metric,
A system comprising fitting the features with one or more machine learning algorithms.

A non-transitory computer-readable recording medium comprising executable instructions, comprising:
The executable instructions are executable by one or more processors and cause the one or more processors to perform operations,
The operation is,
Obtaining an original text message in a first language written by the first user;
Obtaining a translated text message in a second language of the original text message;
Comparing the plurality of features associated with the original text message with each of the plurality of features associated with the translated text message, wherein the plurality of features include (i) word count, character count, emoji, Comprising at least one word-based feature selected from the group consisting of numbers and punctuation marks, and (ii) at least one language-based feature comprising parts of speech selected from the group consisting of verbs, nouns, adverbs and adjectives; And
And calculating a metric based on the comparison of the features and determining the accuracy of the translated text message based on the calculated metric.