DE60304331D1 - Retrieving matching documents by queries in any national language

Info

Publication number
DE60304331D1
DE60304331D1 DE60304331T DE60304331T DE60304331D1 DE 60304331 D1 DE60304331 D1 DE 60304331D1 DE 60304331 T DE60304331 T DE 60304331T DE 60304331 T DE60304331 T DE 60304331T DE 60304331 D1 DE60304331 D1 DE 60304331D1
Authority
DE
Germany
Prior art keywords
documents
keywords
languages
search
document
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Lifetime
Application number
DE60304331T
Other languages
English (en)
Other versions
DE60304331T2 (de)
Inventor
Gregory T Brown
Yurdaer Nezihi Doganata
Youssef Drissi
Tong-Haing Fin
Ju Kim
Lev Kozakov
Rodriguez Juan Leon
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
International Business Machines Corp
Original Assignee
International Business Machines Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
2002-02-01
Filing date
2003-01-24
Publication date
2006-05-18
Priority claimed from US10/066,346 (US6952691B2)
Application filed by International Business Machines Corp
Application granted
Publication of DE60304331D1
Publication of DE60304331T2
Anticipated expiration
Expired - Lifetime (current legal status)

Classifications

    • G PHYSICS
      • G06 COMPUTING; CALCULATING OR COUNTING
        • G06F ELECTRIC DIGITAL DATA PROCESSING
          • G06F40/00 Handling natural language data
            • G06F40/40 Processing or translation of natural language
              • G06F40/58 Use of machine translation, e.g. for multi-lingual retrieval, for server-side translation for client devices or for real-time translation
          • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
            • G06F16/30 Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
              • G06F16/33 Querying
                • G06F16/3331 Query processing
                  • G06F16/3332 Query translation
                    • G06F16/3337 Translation of the query language, e.g. Chinese to English
              • G06F16/31 Indexing; Data structures therefor; Storage structures
                • G06F16/316 Indexing structures
                  • G06F16/319 Inverted lists
    • Y GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
      • Y10 TECHNICAL SUBJECTS COVERED BY FORMER USPC
        • Y10S TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
          • Y10S707/00 Data processing: database and file management or data structures
            • Y10S707/99931 Database or file accessing
              • Y10S707/99933 Query processing, i.e. searching
                • Y10S707/99934 Query formulation, input preparation, or translation
            • Y10S707/99941 Database schema or data structure
              • Y10S707/99942 Manipulating data structure, e.g. compression, compaction, compilation
              • Y10S707/99943 Generating database or data structure, e.g. via user interface
              • Y10S707/99944 Object-oriented database structure
                • Y10S707/99945 Object-oriented database structure processing

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Physics & Mathematics (AREA)
  • Databases & Information Systems (AREA)
  • Data Mining & Analysis (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • General Health & Medical Sciences (AREA)
  • Software Systems (AREA)
  • Machine Translation (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
DE60304331T 2002-02-01 2003-01-24 Retrieving matching documents by queries in any national language Expired - Lifetime DE60304331T2 (de)

Applications Claiming Priority (5)

Application Number Priority Date Filing Date Title
US66346 2002-02-01
US10/066,346 US6952691B2 (en) 2002-02-01 2002-02-01 Method and system for searching a multi-lingual database
US10/180,195 US7260570B2 (en) 2002-02-01 2002-06-26 Retrieving matching documents by queries in any national language
US180195 2002-06-26
PCT/EP2003/000761 WO2003065248A2 (en) 2002-02-01 2003-01-24 Retrieving matching documents by queries in any national language

Publications (2)

Publication Number Publication Date
DE60304331D1 (de) 2006-05-18
DE60304331T2 (de) 2006-11-09

Family

ID=27667790

Family Applications (1)

Application Number Title Priority Date Filing Date
DE60304331T Expired - Lifetime DE60304331T2 (de) 2002-02-01 2003-01-24 Retrieving matching documents by queries in any national language

Country Status (9)

Country Link
US (1) US7260570B2 (de)
EP (1) EP1485830B1 (de)
JP (1) JP4634715B2 (de)
KR (1) KR100572797B1 (de)
CN (1) CN100375090C (de)
AT (1) ATE322045T1 (de)
CA (1) CA2474814A1 (de)
DE (1) DE60304331T2 (de)
WO (1) WO2003065248A2 (de)

Families Citing this family (70)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6952691B2 (en) * 2002-02-01 2005-10-04 International Business Machines Corporation Method and system for searching a multi-lingual database
US7039625B2 (en) * 2002-11-22 2006-05-02 International Business Machines Corporation International information search and delivery system providing search results personalized to a particular natural language
CN1997992A (zh) * 2003-03-26 2007-07-11 维克托·西 Online intelligent multilingual comparison-shop agents for wireless networks
US7483877B2 (en) * 2003-04-11 2009-01-27 International Business Machines Corporation Dynamic comparison of search systems in a controlled environment
JP2004355069A (ja) 2003-05-27 2004-12-16 Sony Corp Information processing apparatus and method, program, and recording medium
US7854009B2 (en) * 2003-06-12 2010-12-14 International Business Machines Corporation Method of securing access to IP LANs
US20050065774A1 (en) * 2003-09-20 2005-03-24 International Business Machines Corporation Method of self enhancement of search results through analysis of system logs
US8014997B2 (en) * 2003-09-20 2011-09-06 International Business Machines Corporation Method of search content enhancement
US20050138007A1 (en) * 2003-12-22 2005-06-23 International Business Machines Corporation Document enhancement method
US7716211B2 (en) * 2004-02-10 2010-05-11 Microsoft Corporation System and method for facilitating full text searching utilizing inverted keyword indices
DE202004005008U1 (de) * 2004-03-30 2004-06-24 E.I. Du Pont De Nemours And Company, Wilmington Textile fabric for protective clothing
US7594277B2 (en) * 2004-06-30 2009-09-22 Microsoft Corporation Method and system for detecting when an outgoing communication contains certain content
US8473475B2 (en) 2004-09-15 2013-06-25 Samsung Electronics Co., Ltd. Information storage medium for storing metadata supporting multiple languages, and systems and methods of processing metadata
US20060212441A1 (en) * 2004-10-25 2006-09-21 Yuanhua Tang Full text query and search systems and methods of use
US20080077570A1 (en) * 2004-10-25 2008-03-27 Infovell, Inc. Full Text Query and Search Systems and Method of Use
US20070022134A1 (en) * 2005-07-22 2007-01-25 Microsoft Corporation Cross-language related keyword suggestion
US7672831B2 (en) * 2005-10-24 2010-03-02 Invention Machine Corporation System and method for cross-language knowledge searching
KR100643801B1 (ko) * 2005-10-26 2006-11-10 엔에이치엔(주) System and method for providing autocomplete suggestions that link multiple languages
US8762358B2 (en) * 2006-04-19 2014-06-24 Google Inc. Query language determination using query terms and interface language
US8442965B2 (en) 2006-04-19 2013-05-14 Google Inc. Query language identification
US8255376B2 (en) * 2006-04-19 2012-08-28 Google Inc. Augmenting queries with synonyms from synonyms map
US7835903B2 (en) * 2006-04-19 2010-11-16 Google Inc. Simplifying query terms with transliteration
US8380488B1 (en) 2006-04-19 2013-02-19 Google Inc. Identifying a property of a document
US20070271231A1 (en) * 2006-05-22 2007-11-22 Jimmy Jong-Yuan Lin Search method on the Internet
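CN100416570C (zh) * 2006-09-22 2008-09-03 浙江大学 Chinese natural-language question answering method based on a question-and-answer library
WO2008086889A1 (de) * 2007-01-16 2008-07-24 Netbreeze Gmbh Transcription device for automated transcription and transphrasing, and corresponding method
KR100893629B1 (ko) * 2007-02-12 2009-04-20 주식회사 이지씨앤씨 System and method for assigning identification codes to phrases of electronic textbook content, system and method for searching data of electronic textbook content, and system and method for managing points relating to the use and provision of electronic textbook content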
US8051061B2 (en) 2007-07-20 2011-11-01 Microsoft Corporation Cross-lingual query suggestion
US7917488B2 (en) * 2008-03-03 2011-03-29 Microsoft Corporation Cross-lingual search re-ranking
US8065739B1 (en) * 2008-03-28 2011-11-22 Symantec Corporation Detecting policy violations in information content containing data in a character-based language
US8171041B2 (en) * 2008-05-15 2012-05-01 Enpulz, L.L.C. Support for international search terms
US20110295857A1 (en) * 2008-06-20 2011-12-01 Ai Ti Aw System and method for aligning and indexing multilingual documents
US8782061B2 (en) * 2008-06-24 2014-07-15 Microsoft Corporation Scalable lookup-driven entity extraction from indexed document collections
US8135580B1 (en) 2008-08-20 2012-03-13 Amazon Technologies, Inc. Multi-language relevance-based indexing and search
JP5751537B2 (ja) * 2008-09-17 2015-07-22 有限会社新英プラナーズ Internationally compatible Japanese input system
US20100145923A1 (en) * 2008-12-04 2010-06-10 Microsoft Corporation Relaxed filter set
WO2010105214A2 (en) 2009-03-13 2010-09-16 Invention Machine Corporation Question-answering system and method based on semantic labeling of text documents and user questions
US8577910B1 (en) 2009-05-15 2013-11-05 Google Inc. Selecting relevant languages for query translation
US8572109B1 (en) 2009-05-15 2013-10-29 Google Inc. Query translation quality confidence
US8577909B1 (en) * 2009-05-15 2013-11-05 Google Inc. Query translation using bilingual search refinements
US8538957B1 (en) 2009-06-03 2013-09-17 Google Inc. Validating translations using visual similarity between visual media search results
CN102053991B (zh) * 2009-10-30 2014-07-02 国际商业机器公司 Method and system for multilingual document retrieval
WO2011061556A1 (en) * 2009-11-20 2011-05-26 Kim Mo Intelligent search system
US8773706B2 (en) * 2010-03-29 2014-07-08 Konica Minolta Laboratory U.S.A., Inc. Apparatus, systems, and methods for dynamic language customization
CN101944108A (zh) * 2010-09-07 2011-01-12 深圳市彩讯科技有限公司 Index file and method for creating an index file
US8639701B1 (en) * 2010-11-23 2014-01-28 Google Inc. Language selection for information retrieval
US8527518B2 (en) * 2010-12-16 2013-09-03 Sap Ag Inverted indexes with multiple language support
US8498972B2 (en) * 2010-12-16 2013-07-30 Sap Ag String and sub-string searching using inverted indexes
CN103493046B (zh) * 2011-04-28 2018-02-23 微软技术许可有限责任公司 Alternative market search result toggle
AU2012360732B2 (en) * 2011-12-29 2018-02-01 P2S Media Group Oy Method and apparatus for providing metadata search codes to multimedia
US20130332450A1 (en) * 2012-06-11 2013-12-12 International Business Machines Corporation System and Method for Automatically Detecting and Interactively Displaying Information About Entities, Activities, and Events from Multiple-Modality Natural Language Sources
CN103488648B (zh) * 2012-06-13 2018-03-20 阿里巴巴集团控股有限公司 Multilingual mixed retrieval method and system
CN104281583B (zh) * 2013-07-02 2018-01-12 索意互动(北京)信息技术有限公司 Information retrieval method and device
CN104731828B (zh) 2013-12-24 2017-12-05 华为技术有限公司 Cross-domain document similarity calculation method and device
CN103699675B (zh) * 2013-12-30 2017-07-04 语联网(武汉)信息技术有限公司 Method for graded indexing of translators
US9524293B2 (en) * 2014-08-15 2016-12-20 Google Inc. Techniques for automatically swapping languages and/or content for machine translation
US9977810B2 (en) 2014-08-21 2018-05-22 Dropbox, Inc. Multi-user search system with methodology for personal searching
US9384226B1 (en) 2015-01-30 2016-07-05 Dropbox, Inc. Personal content item searching system and method
US9183303B1 (en) 2015-01-30 2015-11-10 Dropbox, Inc. Personal content item searching system and method
TWI712899B (zh) * 2015-07-28 2020-12-11 香港商阿里巴巴集團服務有限公司 Information query method and device
US9606990B2 (en) 2015-08-04 2017-03-28 International Business Machines Corporation Cognitive system with ingestion of natural language documents with embedded code
KR101656357B1 (ko) 2015-11-04 2016-09-09 국방과학연구소 Method for constructing an engineering database using data tables
CN105404688A (zh) * 2015-12-11 2016-03-16 北京奇虎科技有限公司 Search method and search device
US10824795B2 (en) 2016-06-21 2020-11-03 Fernando J. Pinho Indoor positioning and recording system
WO2017223133A1 (en) * 2016-06-21 2017-12-28 Pinho Fernando J Indoor positioning and recording system
US10691734B2 (en) * 2017-11-21 2020-06-23 International Business Machines Corporation Searching multilingual documents based on document structure extraction
CN108345694B (zh) * 2018-03-19 2021-09-03 华北电力大学(保定) Literature retrieval method and system based on a subject database
US10482185B1 (en) * 2019-02-27 2019-11-19 Capital One Services, Llc Methods and arrangements to adjust communications
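CN110347904A (zh) * 2019-05-28 2019-10-18 成都美美臣科技有限公司 Language search processing method for a multilingual e-commerce website
CN112380410A (zh) * 2020-11-10 2021-02-19 北京字节跳动网络技术有限公司 Information processing method, apparatus, and electronic device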

Family Cites Families (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH01181123A (ja) * 1988-01-14 1989-07-19 Hitachi Ltd Information retrieval device
US6278967B1 (en) * 1992-08-31 2001-08-21 Logovista Corporation Automated system for generating natural language translations that are domain-specific, grammar rule-based, and/or based on part-of-speech analysis
JP2737662B2 (ja) * 1994-08-29 1998-04-08 日本電気株式会社 Foreign-language keyword document retrieval processing device
US5799307A (en) * 1995-10-06 1998-08-25 Callware Technologies, Inc. Rapid storage and recall of computer storable messages by utilizing the file structure of a computer's native operating system for message database organization
US6055528A (en) * 1997-07-25 2000-04-25 Claritech Corporation Method for cross-linguistic document retrieval
US5991713A (en) * 1997-11-26 1999-11-23 International Business Machines Corp. Efficient method for compressing, storing, searching and transmitting natural language text
JP3181548B2 (ja) * 1998-02-03 2001-07-03 富士通株式会社 Information retrieval device and information retrieval method
JP3601653B2 (ja) * 1998-03-18 2004-12-15 富士通株式会社 Information retrieval device and method
GB2338089A (en) * 1998-06-02 1999-12-08 Sharp Kk Indexing method
US6275789B1 (en) * 1998-12-18 2001-08-14 Leo Moser Method and apparatus for performing full bidirectional translation between a source language and a linked alternative language
US6336117B1 (en) * 1999-04-30 2002-01-01 International Business Machines Corporation Content-indexing search system and method providing search results consistent with content filtering and blocking policies implemented in a blocking engine
CN1176432C (zh) 1999-07-28 2004-11-17 国际商业机器公司 Method and system for providing a native-language query service
US7027974B1 (en) * 2000-10-27 2006-04-11 Science Applications International Corporation Ontology-based parser for natural language processing
EP1454263A4 (de) * 2001-11-21 2008-02-13 Contecs Dd Llc Data dictionary for digital rights management

Also Published As

Publication number Publication date
CA2474814A1 (en) 2003-08-07
KR100572797B1 (ko) 2006-04-24
US7260570B2 (en) 2007-08-21
WO2003065248A3 (en) 2004-03-11
CN100375090C (zh) 2008-03-12
DE60304331T2 (de) 2006-11-09
EP1485830B1 (de) 2006-03-29
JP4634715B2 (ja) 2011-02-16
JP2005516306A (ja) 2005-06-02
CN1620661A (zh) 2005-05-25
US20030149687A1 (en) 2003-08-07
ATE322045T1 (de) 2006-04-15
WO2003065248A2 (en) 2003-08-07
KR20040077918A (ko) 2004-09-07
EP1485830A2 (de) 2004-12-15

Similar Documents

Publication Publication Date Title
DE60304331D1 (de) Retrieving matching documents by queries in any national language
US8346536B2 (en) System and method for multi-lingual information retrieval
WO2005124599A3 (en) Content search in complex language, such as japanese
WO2008031062A3 (en) System and method for building and retrieving a full text index
CN105045852A (zh) Full-text search engine system for teaching resources
KR20080017685A (ko) Document ranking method and computer-readable recording medium storing a program capable of performing the same
US8452722B2 (en) Method and system for searching multiple data sources
Golub Subject access in Swedish discovery services
Mann et al. Enhanced search with wildcards and morphological inflections in the Google Books Ngram Viewer
Evert et al. Identifying Morphosyntactic Preferences in Collocations.
SADEQI et al. Evaluation of the usability of admission and medical record information system: A heuristic evaluation
Shashirekha et al. Dictionary based Amharic-Arabic cross language information retrieval
McIlwaine et al. The new ecumenism: Exploration of a DDC/UDC view of religion
US20120150862A1 (en) System and method for augmenting an index entry with related words in a document and searching an index for related keywords
Giacomini Dictionaries for translation
Ababneh et al. Enhanced Arabic Information Retrieval by Using Arabic Slang
KR970049752A (ko) Korean natural-language query information retrieval method using verb information
Flemmings et al. BBK-UFRGS@ CLEF2009: Query Expansion of Geographic Place Names.
Naji et al. On cross-script information retrieval
Danis Major place harmony in ABC and the (reduced) role of representation: evidence from Ngbaka
Ru et al. TREC 2005 Enterprise Track Experiments at BUPT.
Leveling Exploring term selection for geographic blind feedback
CZ17986U1 (cs) Information system for searching language-independent information
Koehler et al. A Question Answering System for German. Experiments with Morphological Linguistic Resources.
Fortson New school calvinism and the presbyterian creed

Legal Events

Date Code Title Description
8364 No opposition during term of opposition
8320 Willingness to grant licences declared (paragraph 23)