WO2010125560A1 - Système idn multilingue - Google Patents

Système idn multilingue Download PDF

Info

Publication number
WO2010125560A1
WO2010125560A1 PCT/IL2010/000325 IL2010000325W WO2010125560A1 WO 2010125560 A1 WO2010125560 A1 WO 2010125560A1 IL 2010000325 W IL2010000325 W IL 2010000325W WO 2010125560 A1 WO2010125560 A1 WO 2010125560A1
Authority
WO
WIPO (PCT)
Prior art keywords
characters
string
list
local language
local
Prior art date
Application number
PCT/IL2010/000325
Other languages
English (en)
Inventor
Netanel Raisch
Original Assignee
Netanel Raisch
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Netanel Raisch filed Critical Netanel Raisch
Publication of WO2010125560A1 publication Critical patent/WO2010125560A1/fr

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/955Retrieval from the web using information identifiers, e.g. uniform resource locators [URL]
    • G06F16/9566URL specific, e.g. using aliases, detecting broken or misspelled links

Definitions

  • the present invention relates to Domain Name Services used to resolve network domain names into corresponding network addresses. More particularly, the invention relates to an alternative or modified Domain Name Service that accepts domain names provided in many different encoding formats, not just ASCII.
  • the Internet today relies entirely on the Domain Name System to resolve human- readable names into numeric IP addresses and vice versa.
  • DNS Domain Name System
  • DNS Domain Name System
  • e-mail addresses, Web addresses, and other Internet addressing formats adopt the American Standard Code for Information Interchange (ASCII) as the global standard to guarantee interoperation. Till the last few years no provision was made to allow for Web addresses to be in a non- ASCII native language. The implication is that any user of the Internet has to have some basic knowledge of ASCII characters.
  • IDNs Internationalized Domain Names
  • TLD Top Level Domain
  • IDNs are domain names represented by local language characters.
  • Such domain names can contain characters with diacritical marks as required by many European languages, or characters from non-Latin scripts (for example, Hebrew or Chinese).
  • IDNs made the domain name label as it is displayed and viewed by the end user different from that transmitted in the DNS.
  • IDNs Internet users are now able to type domain names, at least partially, in their local language.
  • an Internet user can type the localized characters in front of the "dot". This change in practice allows for an explosion in the number of potential internet addresses as well as allowing for non- English speakers to have at least partial use of a local alphabet in a URL.
  • GTTLD Global TLD
  • ccTLD Country Code TLD
  • iTLD Initial TLD
  • the invention relates to a method for converting unconventional URL addresses written at least in part in a local language into a standard IDN form, comprising: a. Providing a table containing a list of characters in said local language and their corresponding list of characters in a standard TLD form, wherein each item in each of said lists is a string of characters; b. Providing to a web browser a string of characters in said local language, wherein said string of characters represents a URL address; c. Searching in said table parts of said string of characters among said list of characters in said local language; and d. Upon finding an identical string in said table, replacing said identical string in said web browser with the corresponding characters in said standard TLD form.
  • the method further comprises adding to the web browser a local URL address box for entering the string of characters in the local language.
  • Each string of characters in the list of characters in the standard TLD form can be selected, for instance, from the group consisting of iTLDs, TLDs, gTLDs or ccTLDs.
  • the web browser is associated with a computer.
  • the web browser is associated with a cellular phone.
  • the cellular phone is an iPhone.
  • the cellular phone employs the Android® operating system.
  • the web browser is associated with a handheld electronic device.
  • the method further includes an algorithm to implement aspects of the method.
  • the method further comprises allowing modifying the list of characters in the local language by a user.
  • the method further comprises allowing modifying the list of characters in the standard TLD form by a user.
  • the list of characters in the local language and their corresponding list of characters in the standard TLD form can be updated by any suitable means, e.g., via the Internet or an update file.
  • the local language is a plurality of languages or dialects.
  • the invention also encompasses a system for converting unconventional URL address written in a local language into a standard IDN form, comprising: a. Means for generating a table containing a list of characters in said local language and their corresponding list of characters in said standard TLD form, wherein each item in each of said lists is a string of characters; b. Input means for providing to a web browser a string of characters in said local language, wherein said string of characters represents a URL address; c. Means for searching in said table at least part of said string of characters among said list of characters in said local language; and, d. Means for replacing said identical string in said web browser with the corresponding characters in said standard TLD form upon finding an identical string in said table.
  • system further comprises means for adding to the web browser a local URL address box for entering the string of characters in the local language.
  • Fig. 1 schematically illustrates a web browser provided with a URL address typed in a local language form
  • Fig. 2 schematically illustrates a web browser provided with a URL address in multi-language form
  • Fig. 3 is a conversion table from local language to common Latin form, according to an embodiment of the invention.
  • the present invention relates to a method and device for locally translating or mapping a string of characters typed in a local language or dialect in a web browser into common TLD names via that web browser.
  • a translation table is provided in which lists of TLD names (which are a string of characters in a standard ASCII form) are kept together with lists of string of characters in a local language form (i.e., non English language).
  • Fig. 3 schematically illustrates an example of such a translation table, generally indicated by numeral 4.
  • each string of characters in the list of characters in the standard ASCII form of translation table 4 can be any type of TLD, such as iTLDs, TLDs, gTLDs, ccTLDs, etc.
  • user interface software embedded within the web browser examines a presumed TLD name and, if it was typed in the local character set, the software attempts to translate it into the standard ASCII form using the corresponding values in the translation table.
  • the system converts unconventional URL addresses written in a local language form into a standard IDN form using the translation table.
  • the translation table contains a list of characters in the local language and their corresponding list of characters in the standard TLD form, wherein each item in each of the lists is a string of characters.
  • An unconventional URL may generally refer to a URL written all or in part in a non-English language.
  • Fig. 1 schematically illustrates a web browser 1 provided with a string of characters which represents a URL address typed in a local language form (surrounded by circle 2).
  • the string is provided to web browser 1 via any suitable input means, such as keyboard.
  • the software embedded within web browser 1 searches in the translation table (such as table 4 of Fig. 3) whether part of the provided string of characters (which represents the TLD form in the local language) can be found among the list of characters in the translation table 4.
  • the following URL address was provided to the web browser (in Hebrew letters):
  • each of the strings (9) and (11) is replaced in the web browser 1 with its corresponding string in the standard TLD form, such as string "www.” (10) and string “.com” (12) correspondingly.
  • web browser 1 contains a URL address in an acceptable IDN format, as shown in Fig. 2 (surrounded by circle 3).
  • the system of the present invention identifies the IDN form as a local language text between the first "dot" and the last "dot" of the URL address according to the common URL address rules. Whether it was typed as a character which represent a real dot (i.e., ".”), or it was typed as local language character (e.g.,
  • the method of the present invention avoids the need to switch between two languages while typing a URL address in an IDN format. Instead of typing: www.n2X.com, using the method of the present invention the user type the entire URL address in a single local language, and no switch between languages is required while typing.
  • the web browser is provided with an additional local URL address box dedicated for entering the string of characters in the local language, such as a toolbar (not shown) or other suitable form.
  • the lists of characters in the translation table 4, either the local or the standard TLD one can be updated or modified either manually or via a file by a user or via the Internet.
  • IDNA is entirely a client-side protocol. It works by providing labels to the DNS in a special format (so-called "ACE"). When labels in that format are encountered, they are transformed, by the client, back into internationalized (normally Unicode) characters.
  • ACE special format
  • TLDs may want to maintain a translation table that contains a list of TLDs and that maps between them and locally-desirable names, such as "mp.” (15) and its corresponding TLD form “.com” (16), “.wwz?" (7) and its corresponding TLD form “www.” (8), "xpn ⁇ ” (13) and its corresponding TLD form “.net” (14), etc.
  • names or locally-standard abbreviations) by which the relevant countries are known locally (whether in ASCII characters or others).
  • the software embedded within browser 1 further contains means to allow a user to input one or more particular TLD names either in the local language or in standard form without special tagging or problems, as it known to a person skilled in the art.
  • TLD names either in the local language or in standard form without special tagging or problems, as it known to a person skilled in the art.
  • a local language may also be a plurality of unique languages.
  • TLD labels When DNS names are received by these client programs, the TLD labels is mapped to local form before the IDNA is applied to the rest of the name. When names are received from users, local TLD names are mapped to the global ones before being passed into IDNA or for other DNS processing.
  • Each function represented by a TLD can be represented in the local language and character set as needed; and, for countries with many languages, or users living, working, or visiting countries where their language was not dominant, "local” can be defined in terms of the needs or wishes of each particular user.
  • "Local” generally refers to languages other than English, especially those that use characters other than the Latin characters associated with the English alphabet.
  • a browser may be programmed to handle characters from a plurality of local languages.
  • the tables for mapping between local names and TLD names can be locally determined, and differ from one locale to another, as long as users understand that international interchange of names required using the standard forms. That understanding can be assisted by software.

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Document Processing Apparatus (AREA)
  • Machine Translation (AREA)

Abstract

La présente invention concerne un système permettant de faciliter significativement l'usage des noms de domaine internationalisés (International Domain Name / IDN). Le système effectue automatiquement la transition entre des chaînes de caractères dans une langue locale et une chaîne de caractères de domaine de premier niveau (Top Level Domain / TLD) correspondante de type ".com". Le système permet d'éviter d'avoir à effectuer une transition entre langues lors de la saisie d'un IDN et s'avère idéal pour les navigateurs web pour ordinateur ou pour appareil portatif utilisés dans des pays dont les langues parlées localement ne sont pas l'anglais.
PCT/IL2010/000325 2009-04-27 2010-04-22 Système idn multilingue WO2010125560A1 (fr)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
IL198426A IL198426A0 (en) 2009-04-27 2009-04-27 Multi-languages idn system
IL198426 2009-04-27

Publications (1)

Publication Number Publication Date
WO2010125560A1 true WO2010125560A1 (fr) 2010-11-04

Family

ID=42113741

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/IL2010/000325 WO2010125560A1 (fr) 2009-04-27 2010-04-22 Système idn multilingue

Country Status (2)

Country Link
IL (1) IL198426A0 (fr)
WO (1) WO2010125560A1 (fr)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10140282B2 (en) 2014-04-01 2018-11-27 Verisign, Inc. Input string matching for domain names

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20010047429A1 (en) * 1999-02-26 2001-11-29 I-Dns.Net International, Inc. Multi-language domain name service
US20030115040A1 (en) * 2001-02-09 2003-06-19 Yue Xing International (multiple language/non-english) domain name and email user account ID services system
US20040044791A1 (en) * 2001-05-22 2004-03-04 Pouzzner Daniel G. Internationalized domain name system with iterative conversion
US6829653B1 (en) * 1999-03-18 2004-12-07 Idn Technologies Llc Method and system for internationalizing domain names
US7149964B1 (en) * 2000-02-09 2006-12-12 Microsoft Corporation Creation and delivery of customized content
US20080160956A1 (en) * 2006-12-22 2008-07-03 Jackson Jonathan B Method and system for delivering and/or displaying targeted content to a mobile handheld device

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20010047429A1 (en) * 1999-02-26 2001-11-29 I-Dns.Net International, Inc. Multi-language domain name service
US6829653B1 (en) * 1999-03-18 2004-12-07 Idn Technologies Llc Method and system for internationalizing domain names
US20060031579A1 (en) * 1999-03-18 2006-02-09 Tout Walid R Method and system for internationalizing domain names
US7149964B1 (en) * 2000-02-09 2006-12-12 Microsoft Corporation Creation and delivery of customized content
US20030115040A1 (en) * 2001-02-09 2003-06-19 Yue Xing International (multiple language/non-english) domain name and email user account ID services system
US20040044791A1 (en) * 2001-05-22 2004-03-04 Pouzzner Daniel G. Internationalized domain name system with iterative conversion
US20080160956A1 (en) * 2006-12-22 2008-07-03 Jackson Jonathan B Method and system for delivering and/or displaying targeted content to a mobile handheld device

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10140282B2 (en) 2014-04-01 2018-11-27 Verisign, Inc. Input string matching for domain names

Also Published As

Publication number Publication date
IL198426A0 (en) 2010-02-17

Similar Documents

Publication Publication Date Title
CA2319750C (fr) Adressage sur le world wide web
KR100444757B1 (ko) 다언어 도메인 네임 서비스
US20100274553A1 (en) Multi-Languages IDN System
RU2251729C2 (ru) Способ и устройство для регистрации и использования имен доменов на родном языке
KR100706702B1 (ko) 도메인네임서버를 이용한 한글 인터넷컨텐츠주소 서비스 방법 및 시스템
JP2001249884A (ja) インターネットユーザが所望の言語でドメイン名を使用するためのシステム
WO2010125560A1 (fr) Système idn multilingue
Klensin et al. Review and recommendations for internationalized domain names (idns)
KR20010075446A (ko) 대안의 인터넷 리소스 식별자 및 주소를 위한 방법 및시스템
KR100516302B1 (ko) 인터넷 주소의 오타 처리 방법 및 시스템
He Internet multilinguality: Challenges, dimensions, and recommendations
KR20060088200A (ko) 키워드를 이용한 인터넷 웹사이트 접속방법
JPH11259477A (ja) 文書処理システムおよび記録媒体
KR20010093355A (ko) 영어와는 다른 언어의 글자들을 이용하여 도메인 이름을표현하기 위한 방법
KR20020057449A (ko) 이메일 형태의 전화번호 도메인과 다국어 도메인
TW466404B (en) Chinese domain name service method
Klensin et al. RFC 4690: Review and Recommendations for Internationalized Domain Names (IDNs)
Lo et al. Character-image search engine
Wielansky Internationalized domain names
KR20200076184A (ko) 인터넷 문장의 표현법 개발
Bhuiyan et al. Multilingualism: Does It Legitimize the ICANN Model?
IL207660A (en) Www addressing
JP2005141445A (ja) 特許情報検索システム
KR20010057440A (ko) 인터넷 도메인 이름 처리 방법
KR20110095755A (ko) 다국어 인터넷주소 처리 시스템 및 방법

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 10769406

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 10769406

Country of ref document: EP

Kind code of ref document: A1