WO2003077158A1 - Link information output device - Google Patents

Link information output device

Info

Publication number
WO2003077158A1
Authority
WO
WIPO (PCT)
Prior art keywords
url
information
domain
word
search
Prior art date
Application number
PCT/JP2002/002241
Other languages
English (en)
Japanese (ja)
Inventor
Kohji Nitori
Original Assignee
Levi Co.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Levi Co. filed Critical Levi Co.
Priority to AU2002236284A priority Critical patent/AU2002236284A1/en
Priority to PCT/JP2002/002241 priority patent/WO2003077158A1/fr
Publication of WO2003077158A1 publication Critical patent/WO2003077158A1/fr

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9538Presentation of query results
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/951Indexing; Web crawling techniques

Definitions

  • The present invention relates to a URL search device used on the Internet and the like.
  • Background art
  • A search engine is used to search for homepages on the Internet.
  • When a search term is input to a search engine, it outputs information for linking to a URL, such as that of a homepage containing the search term.
  • Although the present invention is a search device for homepages and the like, it does not search for words contained in a homepage.
  • Disclosure of the invention
  • The present invention has been made in view of the fact that URLs can now be written in languages such as Japanese, and it outputs only URLs whose domain names include the search word. In this way, search results with less noise can be obtained. This is because a domain name created in a language such as Japanese is generally expected to use words that indicate the information contained in the domain.
  • FIG. 1 is a conceptual diagram for explaining the first embodiment.
  • FIG. 2 is a functional block diagram of an example of the domain link information output device according to the first embodiment.
  • FIG. 3 is a diagram illustrating a flow of a process according to the first embodiment.
  • FIG. 4 is a diagram for explaining the concept of the second embodiment.
  • FIG. 5 is a diagram for explaining languages that use ideographic characters.
  • FIG. 6 is a diagram illustrating a flow of a process according to the second embodiment.
  • FIG. 7 is a diagram illustrating an example of a functional block of the domain link information output device according to the third and fourth embodiments.
  • FIG. 8 is a diagram showing a processing flow of the third and fourth embodiments.
  • FIG. 9 is a diagram illustrating a search word generated for “clothing” by the search word generation unit according to the fourth embodiment.
  • FIG. 10 is a diagram illustrating an example of functional blocks of a URL acquisition device according to the fifth embodiment.
  • FIG. 11 is a diagram depicting a processing flow of the fifth embodiment.
  • The first embodiment is a domain link information output device that, given a word in a language that uses ideographic characters, searches for domain names that include the word.
  • An “ideographic character” is a character that itself represents a meaning.
  • A typical ideographic character is a kanji.
  • FIG. 1 is a diagram showing the concept of the domain link information output device of this embodiment. As shown in this figure, when an ideographic character is input, this domain link information output device outputs information for linking to a URL having a domain name that includes the ideographic character.
  • FIG. 2 is a diagram showing an example of a functional block of this embodiment.
  • The domain link information output device of this embodiment has a domain URL information storage unit, a word input unit, a URL information acquisition unit, and a URL link information output unit.
  • The “domain URL information storage unit” stores domain names in association with URL information, which is information indicating a URL. For example, “www.Ginza.Clothing.jp” is associated as URL information that includes the domain name “Ginza”. Similarly, “www.Ginza.Clothing.jp” is also associated as URL information that includes the domain name “Clothing”.
  • The “word input unit” inputs words in a language that uses ideographic characters.
  • For example, the word “clothing”, written in Japanese kanji, is input; Japanese is a language that uses ideographic characters.
  • The characters input to the word input unit are not limited to ideographic characters and may be any characters of a language that uses ideographic characters. This is because such languages also use phonograms together with ideographs. For example, Japanese uses hiragana and katakana as phonograms in addition to ideographic kanji. Therefore, not only kanji words but also hiragana and katakana words may be input to the word input unit.
  • Non-kanji characters can be entered in the word input unit because the newly approved Japanese domain names are not limited to kanji and can use any word written in Japanese, and because using such words as domain names allows the contents of a domain to be expressed simply and accurately.
  • The “URL information acquisition unit” acquires, from the domain URL information storage unit, URL information associated with a domain name that includes the word input to the word input unit.
  • In this example, the word input to the word input unit is “clothing”, and URLs whose domain names contain that word are selected from the domain URL information storage unit.
  • The selected result is “www.Ginza.Clothing.jp”, as shown in the figure.
  • The “URL link information output unit” outputs link information, which is information for linking to the URL, based on the information acquired by the URL information acquisition unit. For example, if the URL is “www.Ginza.Clothing.jp”, the link information is “http://www.Ginza.Clothing.jp” or the like. By entering this information into the address field of a browser or the like, the user can reach the domain “www.Ginza.Clothing.jp”.
  • The output may be electronic or printed on paper or the like. Electronic output may be converted to hypertext using HTML.
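As a rough illustration of the URL link information output unit described above, the following Python sketch builds a link string from stored URL information and optionally renders it as an HTML anchor. The function names `link_info` and `as_hypertext` and the sample URL `www.銀座.洋服.jp` are hypothetical and chosen only for this example; the source does not specify an implementation.

```python
from html import escape


def link_info(url_info: str, scheme: str = "http") -> str:
    """Build link information (e.g. 'http://www.example.jp') from URL information."""
    return f"{scheme}://{url_info}"


def as_hypertext(url_info: str) -> str:
    """Render the link as an HTML anchor, escaping the attribute value and the text."""
    href = link_info(url_info)
    return f'<a href="{escape(href, quote=True)}">{escape(url_info)}</a>'


# Hypothetical sample URL information with Japanese domain-name labels.
print(link_info("www.銀座.洋服.jp"))
print(as_hypertext("www.銀座.洋服.jp"))
```

Escaping is included because URL information taken from a store should not be trusted to be free of HTML metacharacters; that precaution is an addition of this sketch, not something the source discusses.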
  • As described above, in the present embodiment, now that Japanese domain names are available, words in a language that uses ideographic characters, such as Japanese, can be used as search terms. At the same time, because the search terms are matched only against domain names, the noise in search results that conventional search engines could not eliminate is greatly reduced. This is because the domain name is the most straightforward description of the domain.
  • The processing flow of the first embodiment is as shown in FIG. 3. That is, the processing of the first embodiment is a domain link information output method having: a domain URL information accumulation step of accumulating domain names in association with URL information indicating URLs; a word input step of inputting a word in a language that uses ideographic characters; a URL information acquisition step of acquiring, from the information accumulated in the domain URL information accumulation step, URL information associated with a domain name that includes the word input in the word input step; and a URL link information output step of outputting link information, which is information for linking to the URL, based on the information acquired in the URL information acquisition step.
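The four steps of the first embodiment can be sketched in Python as follows. This is a minimal sketch under stated assumptions, not the patented implementation: the class name `DomainLinkInfoOutput`, its method names, the dictionary-based store keyed by domain-name label, and the sample URLs are all hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class DomainLinkInfoOutput:
    # Domain URL information store: domain-name label -> URL information strings.
    store: dict[str, set[str]] = field(default_factory=dict)

    def accumulate(self, url_info: str) -> None:
        """Accumulation step: associate each dot-separated label with the URL."""
        for label in url_info.split("."):
            self.store.setdefault(label, set()).add(url_info)

    def acquire(self, word: str) -> list[str]:
        """Acquisition step: URL information whose domain-name label contains the word."""
        hits: set[str] = set()
        for label, urls in self.store.items():
            if word in label:
                hits |= urls
        return sorted(hits)

    def output(self, word: str) -> list[str]:
        """Output step: link information for every acquired URL."""
        return [f"http://{url}" for url in self.acquire(word)]


device = DomainLinkInfoOutput()
device.accumulate("www.銀座.洋服.jp")   # hypothetical sample data
device.accumulate("www.example.jp")     # hypothetical sample data
print(device.output("洋服"))            # word input step: the word "洋服"
```

Only the domain name is consulted; page content never enters the store, which is the point of the noise reduction the text describes. The sketch also indexes generic labels such as `www` and `jp`, which a real system would probably exclude.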
  • Embodiment 2 is characterized in that the word input to the word input unit is a word in any one of Japanese, Chinese, and Korean.
  • FIG. 4 is a diagram for explaining the concept of the second embodiment. For example, when the word “boutique” is input, domain link information, such as a URL that uses a domain name including the word “boutique”, is output.
  • The domain link information output device of the second embodiment has the same configuration as the domain link information output device of the first embodiment. That is, it is the same as that shown in FIG. 2 and includes a domain URL information storage unit, a word input unit, a URL information acquisition unit, and a URL link information output unit.
  • The distinguishing feature lies in the word input unit.
  • Japanese, Chinese, and Korean are distinctive among the world's languages in that they use ideographic characters.
  • FIG. 5 summarizes the characters used in these languages. As shown in this figure, Japanese uses kanji, hiragana, and katakana. Korean uses kanji (Hanja) and Hangul. Chinese uses kanji, of which there are two types: simplified and traditional. In this embodiment, a word written in any of these characters is input to the word input unit, and link information for URLs whose domain names contain the word is output.
  • FIG. 6 shows a processing flow of the second embodiment.
  • The processing of this embodiment is a domain link information output method comprising: a domain URL information storage step of storing domain names and URL information, which is information indicating a URL, in association with each other; a word input step of inputting a word in any one of Japanese, Chinese, and Korean; a URL information acquisition step of acquiring, from the information stored in the domain URL information storage step, URL information associated with a domain name that includes the word input in the word input step; and a URL link information output step of outputting link information based on the information acquired in the URL information acquisition step.
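One way a word input unit might check that its input is written in the scripts discussed for the second embodiment is to classify each character by Unicode block. The Python sketch below is an assumption of this edit: the source does not describe any validation, and the function names `script_of`, `scripts_in`, and `is_cjk_word` are hypothetical.

```python
def script_of(ch: str) -> str:
    """Classify one character by Unicode block (basic blocks only)."""
    cp = ord(ch)
    if 0x3040 <= cp <= 0x309F:
        return "hiragana"
    if 0x30A0 <= cp <= 0x30FF:
        return "katakana"
    if 0xAC00 <= cp <= 0xD7A3:
        return "hangul"
    if 0x4E00 <= cp <= 0x9FFF:
        return "han"  # kanji / hanzi / hanja share this block
    return "other"


def scripts_in(word: str) -> set[str]:
    """Return the set of scripts that appear in the word."""
    return {script_of(ch) for ch in word}


def is_cjk_word(word: str) -> bool:
    """True if the word is non-empty and uses only the scripts above."""
    return bool(word) and "other" not in scripts_in(word)


print(is_cjk_word("洋服"), is_cjk_word("ブティック"), is_cjk_word("clothing"))
```

A code-point check identifies the script, not the language: kanji, hanzi, and hanja share the same block, and simplified and traditional forms are not separated either, so telling Japanese from Chinese input would need additional information.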
  • In Embodiments 3 and 4, instead of searching for domain names using only the word input to the word input unit, search words are generated based on the input word.
  • These embodiments are domain link information output devices that search for domain names that include the generated search words.
  • Any one or a combination of two or more of synonyms, derivatives, and antonyms of the input word may be used as a search word.
  • With a conventional search engine, only link information for homepages that include the input search word as-is is output as a search result. Here, “include” means that the word appears anywhere on the homepage, so it was not easy to find the desired information.
  • The present invention can greatly reduce such noise because only the words included in the domain name are used as the search range. However, only the domain name is searched. Therefore, unless the exact word is entered, the domain URL information may not be obtained even if the domain name uses a word with the same meaning.
  • In particular, languages that use ideographic characters, such as Japanese, Chinese, and Korean, sometimes have multiple words with the same or almost the same meaning. In such a case, it is difficult to enter all the synonyms in the word input unit.
  • In addition, the desired information may actually be at a URL whose domain name includes an antonym of the input word.
  • Furthermore, the same word can be written in different scripts with the same meaning, so search omissions may occur merely because of differences in notation, such as whether the word is written in kanji or hiragana.
  • Therefore, search words are generated based on the input word, and domain names that include those search words are searched for.
  • FIG. 7 shows an example of the functional blocks of the third and fourth embodiments. As shown in this figure, these embodiments include a domain URL information storage unit, a word input unit, a search word generation unit, a URL information acquisition unit, and a URL link information output unit. Except for the search word generation unit, this embodiment is common to Embodiments 1 and 2.
  • The “search word generation unit” generates search words based on a word in any one of Japanese, Chinese, and Korean input to the word input unit. For example, when the word input to the word input unit is “clothing”, the search word generation unit generates “boutique” in addition to “clothing”, because “boutique” is closely related to the input word “clothing”. This association may be obtained by using a database in which a plurality of words are associated with each other. The association database may be built from, for example, past user search trends, dictionaries, and the like.
  • In the fourth embodiment, synonyms, derivatives, and antonyms of the input word are used as search words. A “synonym” is a word having a meaning similar to a particular word, a “derivative” is a word derived from a particular word, and an “antonym” is a word that has the opposite meaning to a particular word.
  • FIG. 9 shows the search words generated for “clothing” by the search word generation unit. For the word “Clothing” entered in the word input unit, “Clothing”, “Clothes”, “Clothes”, “Kimono”, “Costumes”, ..., “Boutiques”, and so on are generated.
  • All of the search words generated by the search word generation unit may be used for acquiring URL information, or only some of them may be used.
  • For example, a list of the words generated by the search word generation unit may be output, and the user may make a selection from it before domain link information is output. The selection consists of choosing the words that are needed or removing the words that are not.
  • FIG. 8 shows a processing flow of the third and fourth embodiments.
  • That is, the processing is a domain link information output method comprising: a domain URL information storage step of storing domain names and URL information, which is information indicating a URL, in association with each other; a word input step of inputting a word in any one of Japanese, Chinese, and Korean; a search word generation step of generating search words based on the word input in the word input step; a URL information acquisition step of acquiring, from the information stored in the domain URL information storage step, URL information associated with a domain name that includes a search word generated in the search word generation step; and a URL link information output step of outputting link information, which is information for linking to the URL.
  • The search words generated in the search word generation step may be any one or a combination of two or more of a synonym, a derivative, and an antonym of the word input in the word input step.
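The search word generation and selection described for the third and fourth embodiments can be sketched in Python as below. The association table `ASSOCIATIONS`, its entries, the function names, and the sample URLs are hypothetical illustration data; the source says only that a database associating words with each other may be used.

```python
# Hypothetical association database: word -> kind of relation -> related words.
ASSOCIATIONS = {
    "洋服": {
        "synonym": ["衣服"],
        "derivative": ["洋服店"],
        "antonym": ["和服"],
    },
}


def generate_search_words(word, kinds=("synonym", "derivative", "antonym"),
                          exclude=(), table=ASSOCIATIONS):
    """Return the input word plus related words of the requested kinds."""
    words = [word]
    for kind in kinds:
        for related in table.get(word, {}).get(kind, []):
            if related not in words and related not in exclude:
                words.append(related)
    return words


def acquire_urls(search_words, urls):
    """Keep URLs where some dot-separated label contains some search word."""
    return [
        url for url in urls
        if any(w in label for label in url.split(".") for w in search_words)
    ]


urls = ["www.銀座.衣服.jp", "www.京都.和服.jp", "www.example.jp"]  # sample data
words = generate_search_words("洋服", exclude={"和服"})  # user removed one word
print(words, acquire_urls(words, urls))
```

The `kinds` and `exclude` parameters stand in for the two options the text mentions: using only some categories of generated words, and letting the user remove unwanted words from the generated list before the domain-name search runs.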
  • Embodiment 5 is a URL acquisition device using a domain name server. For example, it is used for storing information in the domain URL information storage unit of the domain link information output device described in the first to fourth embodiments.
  • The “domain name server” holds a domain name and the IP address of that domain name in association with each other. Therefore, candidate domain names are generated using appropriate words, and it is checked whether an IP address exists for each candidate domain name. If so, the IP address is obtained in association with the domain name and used.
  • FIG. 10 is a diagram illustrating an example of a functional block according to a fifth embodiment.
  • The URL acquisition device of the present embodiment includes a word storage unit, a search domain name generation unit, and a DNS URL acquisition unit.
  • The “word storage unit” stores words in any one of Japanese, Chinese, and Korean.
  • The “search domain name generation unit” uses the words stored in the word storage unit to generate a search domain name written in the domain name format. For example, the word “boutique” stored in the word storage unit is used to generate the domain name “boutique.jp”.
  • The “DNS URL acquisition unit” sends the search domain name generated by the search domain name generation unit to the DNS server and obtains IP address information, which is information indicating the IP address corresponding to the domain name.
  • FIG. 11 shows a process flow of the fifth embodiment.
  • The processing of the fifth embodiment comprises: a word accumulation step of accumulating words in any one of Japanese, Chinese, and Korean; a search domain name generation step of generating, using a word accumulated in the word accumulation step, a search domain name written in the domain name format; and a DNS IP address acquisition step of transmitting the search domain name generated in the search domain name generation step to a DNS server (domain name server) and acquiring IP address information, which is information indicating the corresponding IP address.
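The fifth embodiment's flow (accumulate words, generate candidate domain names, ask a DNS server) might look roughly like the Python sketch below. The function names, the `suffixes` parameter, and the injectable `resolver` argument are assumptions made for illustration. `socket.gethostbyname` is a real standard-library call, but using it as the lookup mechanism is this sketch's choice, not something the source states.

```python
import socket
from typing import Callable, Iterable, Optional


def resolve_with_dns(name: str) -> Optional[str]:
    """Ask the system resolver for an IPv4 address; None if the name does not resolve."""
    try:
        return socket.gethostbyname(name)
    except (OSError, UnicodeError):
        return None


def generate_search_domain_names(words: Iterable[str],
                                 suffixes: Iterable[str] = ("jp",)) -> list[str]:
    """Search domain name generation step: 'word' -> 'word.jp', and so on."""
    suffixes = list(suffixes)
    return [f"{word}.{suffix}" for word in words for suffix in suffixes]


def acquire_ip_addresses(words: Iterable[str],
                         resolver: Callable[[str], Optional[str]] = resolve_with_dns
                         ) -> dict[str, str]:
    """DNS IP address acquisition step: keep only candidates that resolve."""
    found = {}
    for name in generate_search_domain_names(words):
        ip = resolver(name)
        if ip is not None:
            found[name] = ip
    return found


# Offline demonstration with a fake resolver (192.0.2.0/24 is a documentation range).
fake_dns = {"ブティック.jp": "192.0.2.10"}
print(acquire_ip_addresses(["ブティック", "洋服"], resolver=fake_dns.get))
```

Passing the resolver as a parameter keeps the candidate-generation logic testable without network access. The resulting name-to-address pairs are what could then be fed into the domain URL information storage unit of the earlier embodiments.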
  • The present invention can be used as a search engine on the Internet that searches only domain names using search words when searching for a homepage or for the content of information disclosed on a homepage.
  • According to the domain link information output device of this invention, a domain name can be searched directly using, as a search word, a word in a language that uses ideographic characters such as kanji. This provides the excellent effect that the homepage providing the target information can be accessed efficiently.

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Information Transfer Between Computers (AREA)

Abstract

Usually, a website search engine returns a large number of websites as a search result, and it is difficult to narrow down the information. According to the invention, only words of a language such as Japanese, Chinese, or Korean that uses kanji, which are ideograms, and URLs having domain names that use these words are used in the search result.
PCT/JP2002/002241 2002-03-11 2002-03-11 Link information output device WO2003077158A1 (fr)

Priority Applications (2)

Application Number Priority Date Filing Date Title
AU2002236284A AU2002236284A1 (en) 2002-03-11 2002-03-11 Link information output device
PCT/JP2002/002241 WO2003077158A1 (fr) 2002-03-11 2002-03-11 Link information output device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/JP2002/002241 WO2003077158A1 (fr) 2002-03-11 2002-03-11 Link information output device

Publications (1)

Publication Number Publication Date
WO2003077158A1 (fr) 2003-09-18

Family

ID=27799911

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/JP2002/002241 WO2003077158A1 (fr) 2002-03-11 2002-03-11 Link information output device

Country Status (2)

Country Link
AU (1) AU2002236284A1 (fr)
WO (1) WO2003077158A1 (fr)

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH11250086A (ja) * 1998-03-03 1999-09-17 Hitachi Ltd Search support system
JP2000200281A (ja) * 1999-01-05 2000-07-18 Matsushita Electric Ind Co Ltd Information retrieval device, information retrieval method, and recording medium recording an information retrieval program

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
TATSUYA HAGIWARA ET AL.: "Net kensaku super technique", PC STYLE 21, vol. 1, no. 1, 18 December 1999 (1999-12-18), pages 8, 9, 27-37, XP002968473 *
TSUNORU NAKAJIMA ET AL.: "Suguni Yakudatsu 30 no Keyword", 1 March 2001, NIKKEI BUSINESS PUBLICATIONS, INC., article "1 Nichi de Manaberu 21 Seiki no IT Shuho", pages: 195, XP002968472 *

Also Published As

Publication number Publication date
AU2002236284A1 (en) 2003-09-22

Similar Documents

Publication Publication Date Title
US7010568B1 (en) Search engine request method, product, and apparatus
USRE43690E1 (en) Search engine request method, product, and apparatus
KR100815215B1 (ko) Apparatus and method for integrated website search
US20050235031A1 (en) Hyperlink generation and enhanced spell check method, product, apparatus, and user interface system
JP2019074843A (ja) Information providing device, information providing method, and program
JP2010534378A (ja) Automatically expanded language search
US7783643B2 (en) Direct navigation for information retrieval
WO2011017929A1 (fr) Method and apparatus for quickly locating useful information by means of a mobile phone browser
US20070283262A1 (en) Recursive browser system and uses therefor
JP6653169B2 (ja) Keyword extraction device, content generation system, keyword extraction method, and program
JP2019153271A5 (fr)
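JP2010198350A (ja) Search device and search method
JP2007128367A (ja) Information retrieval know-how management system
WO2003077158A1 (fr) Link information output device
JP3898016B2 (ja) Information retrieval device, information retrieval method, and information retrieval program
JP2009086979A (ja) Organization name extraction device, method, and program
KR20080025273A (ko) Keyword-based Internet access method based on a domain name server (DNS) and system therefor
JP2010079863A (ja) Method and program for providing a refined search service
JP5002631B2 (ja) Word information collection device, word information collection method, and word information collection program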
Urbansky et al. Entity extraction from the web with webknox
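RU2002111687A (ru) Method and system for performing services for registering domain names using national languages in non-English-speaking countries
JP2002215675A (ja) Information retrieval system
JP6655981B2 (ja) Keyword extraction device, keyword extraction method, and program
JP2006350864A (ja) Message generation support method, device, program, and storage medium storing the program
JP2008210022A (ja) System and method for automatically generating conversion templates, conversion template synthesis server, and program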

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A1

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ PH PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG US UZ VN YU ZA ZW

AL Designated countries for regional patents

Kind code of ref document: A1

Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
122 Ep: pct application non-entry in european phase
NENP Non-entry into the national phase

Ref country code: JP