JP4264118B2 - Method for structuring information from different information sources on a network - Google Patents
Method for structuring information from different information sources on a network
- Publication number
- JP4264118B2 (application JP2008007850A)
- Authority
- JP
- Japan
- Prior art keywords
- information
- computer
- semi
- processing unit
- relational database
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Lifetime
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/80—Information retrieval; Database structures therefor; File system structures therefor of semi-structured data, e.g. markup language structured data such as SGML, XML or HTML
- G06F16/84—Mapping; Conversion
- G06F16/86—Mapping to a database
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10—TECHNICAL SUBJECTS COVERED BY FORMER USPC
- Y10S—TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10S707/00—Data processing: database and file management or data structures
- Y10S707/99931—Database or file accessing
- Y10S707/99933—Query processing, i.e. searching
- Y10S707/99934—Query formulation, input preparation, or translation
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10—TECHNICAL SUBJECTS COVERED BY FORMER USPC
- Y10S—TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10S707/00—Data processing: database and file management or data structures
- Y10S707/99941—Database schema or data structure
- Y10S707/99944—Object-oriented database structure
- Y10S707/99945—Object-oriented database structure processing
Landscapes
- Engineering & Computer Science (AREA)
- Databases & Information Systems (AREA)
- Theoretical Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Information Transfer Between Computers (AREA)
- Computer And Data Communications (AREA)
Description
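Semi-structured information: information that has no precise structure as a whole, but in which each element carries a meaning based on its location or surrounding context within the semi-structured information. The format of semi-structured information can be expressed by regular expressions, typically nested regular expressions.
The following detailed description describes the present invention with reference to a preferred embodiment that operates on the World Wide Web. Specifically, it gives examples that illustrate concrete applications of the invention on the Web. However, the invention is not limited to any particular information source, nor to the examples described herein. The following description of the embodiments is therefore illustrative only and not limiting.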
"<h2>"$category"</h2>"$listings #LOOKAHEAD ("<h2>"|"</body>")
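This regular expression means that the variable $category, which corresponds to the attribute "category", is found between the HTML tags <h2> and </h2>. For example, the third line of the HTML page in FIG. 6 shows "Condos" between these two tags.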
"<p>" $text #LOOKAHEAD ("<p>" | "<hr>")
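This regular expression indicates that the variable $listings contains the variable $text (corresponding to the attribute "text") between a <p> tag and the next <p> or <hr> tag.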
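The two nested expressions above can be approximated with ordinary regular expressions. The following is a minimal sketch in Python, not the patent's implementation: the sample page, the names `PAGE`, `CATEGORY_RE`, `TEXT_RE`, and `parse_page` are hypothetical, and `#LOOKAHEAD` is assumed to behave like a standard lookahead assertion.

```python
import re

# Hypothetical sample page, loosely modeled on the listings page described in the text.
PAGE = (
    "<html><body>"
    "<h2>Condos</h2>"
    "<p><b>Palo Alto</b> $1200 2BR 1BA"
    "<p><b>Menlo Park</b> $950 1BR 1BA"
    "<hr>"
    "<h2>Houses</h2>"
    "<p><b>Mountain View</b> $2100 3BR 2BA"
    "<hr>"
    "</body></html>"
)

# "<h2>" $category "</h2>" $listings #LOOKAHEAD ("<h2>" | "</body>")
CATEGORY_RE = re.compile(
    r"<h2>(?P<category>.*?)</h2>(?P<listings>.*?)(?=<h2>|</body>)", re.S
)

# "<p>" $text #LOOKAHEAD ("<p>" | "<hr>")
TEXT_RE = re.compile(r"<p>(?P<text>.*?)(?=<p>|<hr>)", re.S)


def parse_page(page):
    """Return a list of (category, [text, ...]) pairs, one per <h2> section."""
    result = []
    for section in CATEGORY_RE.finditer(page):
        # The inner expression is applied only to the $listings captured by the outer one.
        texts = [m.group("text").strip() for m in TEXT_RE.finditer(section.group("listings"))]
        result.append((section.group("category"), texts))
    return result


if __name__ == "__main__":
    for category, texts in parse_page(PAGE):
        print(category, texts)
```

The nesting is expressed by applying the inner pattern to the text captured by the outer pattern, so each listing stays associated with its category.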
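The variable $text is of type "TEXT" and is represented by the following regular expressions.
"<b>"$city"</b>"
"$"$rent
$bedrooms"BR"
$bathrooms"BA"
The regular expressions above indicate that the variable $text may contain all of the variables $city, $rent, $bedrooms, and $bathrooms, or none of them. These variables correspond to the attributes "city", "rent", "bedrooms", and "bathrooms", respectively. As the regular expressions show, the variable $city appears between <b> and </b>, the variable $rent follows a dollar sign, and the variables $bedrooms and $bathrooms precede the words "BR" and "BA", respectively. Although this example shows only a single HTML page as the data source, a data source can in general be an information repository, possibly with links between different pieces of information.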
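The four attribute patterns can be illustrated as independent, optional matches over one $text value. This is a hedged sketch only: `FIELD_PATTERNS` and `parse_text` are hypothetical names, and the assumption that rent, bedrooms, and bathrooms are plain integers is not stated in the source.

```python
import re

# One pattern per attribute; any of them may be absent from a given $text.
FIELD_PATTERNS = {
    "city": re.compile(r"<b>(.*?)</b>"),      # "<b>" $city "</b>"
    "rent": re.compile(r"\$(\d+)"),           # "$" $rent
    "bedrooms": re.compile(r"(\d+)\s*BR"),    # $bedrooms "BR"
    "bathrooms": re.compile(r"(\d+)\s*BA"),   # $bathrooms "BA"
}


def parse_text(text):
    """Map one $text value to a record; attributes that do not match are None."""
    record = {}
    for name, pattern in FIELD_PATTERNS.items():
        match = pattern.search(text)
        record[name] = match.group(1) if match else None
    return record


if __name__ == "__main__":
    print(parse_text("<b>Palo Alto</b> $1200 2BR 1BA"))
    print(parse_text("Call for details"))
```

Each key of the returned record plays the role of one attribute ("city", "rent", "bedrooms", "bathrooms"), so a record with missing values corresponds to a $text that contains only some of the variables, or none.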
3 Monitor
5 Screen
7 Housing
9 Keyboard
15 CD-ROM drive
108 Central processing unit
112 Removable disk
116 Fixed disk
118 Network interface
120 Speaker
202 Computer
208 Internet
Claims (1)
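- A method of structuring information from different information sources on a network using a computer, the method comprising:
a) a step in which a processing unit of the computer, executing software, uses lexical analysis to examine semi-structured information containing text information, in an information source on a network connected to the computer, for repeating patterns, and identifies sought patterns in the text information;
b) a step in which the processing unit of the computer, executing software, catalogs the identified patterns by name and position into a nested structure, without requiring prior information about the sought patterns;
c) a step in which the processing unit of the computer, executing software, examines the patterns in the nested structure and identifies attributes corresponding to fields of a relational schema of a relational database;
d) a step in which the processing unit of the computer, executing software, examines the patterns in the nested structure to identify patterns that can be decomposed, decomposes them, and catalogs them into the nested structure;
e) a step in which the processing unit of the computer, executing software, examines the patterns in the nested structure to identify links to other semi-structured information, inspects them, and catalogs them into the nested structure;
f) a step in which the processing unit of the computer repeats steps (c) through (e) until all relevant nested information has been cataloged, thereby obtaining a definition of the semi-structured information for use by a dedicated program translator; and
g) a step in which the processing unit of the computer, executing software, supplies the definition as input to the program translator in order to build a parser for the relational database.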
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US08/724,943 US5826258A (en) | 1996-10-02 | 1996-10-02 | Method and apparatus for structuring the querying and interpretation of semistructured information |
Related Parent Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP9269792A Division JPH10222539A (ja) | 1996-10-02 | 1997-10-02 | Method and apparatus for structuring the querying and interpretation of semistructured information |
Related Child Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2008328321A Division JP2009080849A (ja) | 1996-10-02 | 2008-12-24 | Method for responding to a single high-level structured user query |
Publications (2)
Publication Number | Publication Date |
---|---|
JP2008123547A JP2008123547A (ja) | 2008-05-29 |
JP4264118B2 JP4264118B2 (ja) | 2009-05-13 |
Family
ID=24912520
Family Applications (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP9269792A Pending JPH10222539A (ja) | 1996-10-02 | 1997-10-02 | Method and apparatus for structuring the querying and interpretation of semistructured information |
JP2008007850A Expired - Lifetime JP4264118B2 (ja) | 1996-10-02 | 2008-01-17 | Method for structuring information from different information sources on a network |
JP2008328321A Pending JP2009080849A (ja) | 1996-10-02 | 2008-12-24 | Method for responding to a single high-level structured user query |
Family Applications Before (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP9269792A Pending JPH10222539A (ja) | 1996-10-02 | 1997-10-02 | Method and apparatus for structuring the querying and interpretation of semistructured information |
Family Applications After (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2008328321A Pending JP2009080849A (ja) | 1996-10-02 | 2008-12-24 | Method for responding to a single high-level structured user query |
Country Status (2)
Country | Link |
---|---|
US (1) | US5826258A (ja) |
JP (3) | JPH10222539A (ja) |
Families Citing this family (130)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6003046A (en) * | 1996-04-15 | 1999-12-14 | Sun Microsystems, Inc. | Automatic development and display of context information in structured documents on the world wide web |
US5913214A (en) * | 1996-05-30 | 1999-06-15 | Massachusetts Inst Technology | Data extraction from world wide web pages |
US5905862A (en) * | 1996-09-04 | 1999-05-18 | Intel Corporation | Automatic web site registration with multiple search engines |
US6085186A (en) * | 1996-09-20 | 2000-07-04 | Netbot, Inc. | Method and system using information written in a wrapper description language to execute query on a network |
US6801926B1 (en) * | 1996-11-05 | 2004-10-05 | Peoplesoft, Inc. | Platform-independent programmable batch processing engine |
US6085190A (en) * | 1996-11-15 | 2000-07-04 | Digital Vision Laboratories Corporation | Apparatus and method for retrieval of information from various structured information |
US6038566A (en) * | 1996-12-04 | 2000-03-14 | Tsai; Daniel E. | Method and apparatus for navigation of relational databases on distributed networks |
JPH10171814A (ja) * | 1996-12-12 | 1998-06-26 | Digital Vision Lab:Kk | Information retrieval apparatus |
WO1998048360A1 (en) * | 1997-04-22 | 1998-10-29 | Greg Hetherington | Method and apparatus for processing free-format data |
US7020670B1 (en) * | 1997-04-23 | 2006-03-28 | Xerox Corporation | Document constraint descriptors obtained from user signals indicating attribute-value relations |
US7779020B2 (en) * | 2002-03-01 | 2010-08-17 | International Business Machines Corporation | Small-footprint applicative query interpreter method, system and program product |
US6442576B1 (en) * | 1997-08-06 | 2002-08-27 | Adobe Systems Incorporated | Searching for documents with multiple element types |
US6014670A (en) * | 1997-11-07 | 2000-01-11 | Informatica Corporation | Apparatus and method for performing data transformations in data warehousing |
US6339775B1 (en) * | 1997-11-07 | 2002-01-15 | Informatica Corporation | Apparatus and method for performing data transformations in data warehousing |
US6571243B2 (en) | 1997-11-21 | 2003-05-27 | Amazon.Com, Inc. | Method and apparatus for creating extractors, field information objects and inheritance hierarchies in a framework for retrieving semistructured information |
US6304870B1 (en) * | 1997-12-02 | 2001-10-16 | The Board Of Regents Of The University Of Washington, Office Of Technology Transfer | Method and apparatus of automatically generating a procedure for extracting information from textual information sources |
US5963949A (en) * | 1997-12-22 | 1999-10-05 | Amazon.Com, Inc. | Method for data gathering around forms and search barriers |
JPH11261627A (ja) * | 1998-03-09 | 1999-09-24 | Fujitsu Ltd | E-mail delivery apparatus |
US6424980B1 (en) * | 1998-06-10 | 2002-07-23 | Nippon Telegraph And Telephone Corporation | Integrated retrieval scheme for retrieving semi-structured documents |
US6925595B1 (en) * | 1998-08-05 | 2005-08-02 | Spyglass, Inc. | Method and system for content conversion of hypertext data using data mining |
GB2343763B (en) * | 1998-09-04 | 2003-05-21 | Shell Services Internat Ltd | Data processing system |
US6298334B1 (en) * | 1998-10-15 | 2001-10-02 | Netdecide Corporation | Object-based numeric-analysis engine |
US6163794A (en) | 1998-10-23 | 2000-12-19 | General Magic | Network system extensible by users |
JP2000200288A (ja) * | 1999-01-07 | 2000-07-18 | Nec Corp | Search result analysis method |
US6782505B1 (en) * | 1999-04-19 | 2004-08-24 | Daniel P. Miranker | Method and system for generating structured data from semi-structured data sources |
US6356906B1 (en) * | 1999-07-26 | 2002-03-12 | Microsoft Corporation | Standard database queries within standard request-response protocols |
US6792576B1 (en) * | 1999-07-26 | 2004-09-14 | Xerox Corporation | System and method of automatic wrapper grammar generation |
US6613099B2 (en) * | 1999-08-20 | 2003-09-02 | Apple Computer, Inc. | Process and system for providing a table view of a form layout for a database |
US6851089B1 (en) * | 1999-10-25 | 2005-02-01 | Amazon.Com, Inc. | Software application and associated methods for generating a software layer for structuring semistructured information |
US6721780B1 (en) * | 1999-11-09 | 2004-04-13 | Fireclick, Inc. | Predictive pre-download of network objects |
US6721727B2 (en) | 1999-12-02 | 2004-04-13 | International Business Machines Corporation | XML documents stored as column data |
US6658400B2 (en) * | 1999-12-04 | 2003-12-02 | William S. Perell | Data certification and verification system having a multiple-user-controlled data interface |
US7152062B1 (en) | 2000-11-21 | 2006-12-19 | Actuate Corporation | Technique for encapsulating a query definition |
US7124144B2 (en) * | 2000-03-02 | 2006-10-17 | Actuate Corporation | Method and apparatus for storing semi-structured data in a structured manner |
US6581062B1 (en) | 2000-03-02 | 2003-06-17 | Nimble Technology, Inc. | Method and apparatus for storing semi-structured data in a structured manner |
US7707159B2 (en) * | 2000-03-02 | 2010-04-27 | Actuate Corporation | Method and apparatus for storing semi-structured data in a structured manner |
US6449620B1 (en) * | 2000-03-02 | 2002-09-10 | Nimble Technology, Inc. | Method and apparatus for generating information pages using semi-structured data stored in a structured manner |
AU2001252909A1 (en) * | 2000-03-21 | 2001-10-03 | Quack.Com | System and method for non-programming development of rules used in the transformation of web-based information |
US6954895B1 (en) * | 2000-03-22 | 2005-10-11 | Autodesk, Inc. | Method and apparatus for using and storing objects |
US7404141B1 (en) | 2000-03-31 | 2008-07-22 | Oracle International Corporation | System for creating and maintaining a website |
US6745238B1 (en) * | 2000-03-31 | 2004-06-01 | Oracle International Corporation | Self service system for web site publishing |
US7702995B2 (en) * | 2000-04-24 | 2010-04-20 | TVWorks, LLC. | Method and system for transforming content for execution on multiple platforms |
US8296792B2 (en) | 2000-04-24 | 2012-10-23 | Tvworks, Llc | Method and system to provide interactivity using an interactive channel bug |
US9788058B2 (en) * | 2000-04-24 | 2017-10-10 | Comcast Cable Communications Management, Llc | Method and system for automatic insertion of interactive TV triggers into a broadcast data stream |
US8936101B2 (en) | 2008-07-17 | 2015-01-20 | Halliburton Energy Services, Inc. | Interventionless set packer and setting method for same |
US6604107B1 (en) * | 2000-04-24 | 2003-08-05 | Ebay Inc. | Generic attribute database system for storing items of different categories having shared attributes |
US6778983B1 (en) * | 2000-04-28 | 2004-08-17 | International Business Machines Corporation | Apparatus and method for accessing HTML files using an SQL query |
US6704728B1 (en) | 2000-05-02 | 2004-03-09 | Iphase.Com, Inc. | Accessing information from a collection of data |
US8478732B1 (en) | 2000-05-02 | 2013-07-02 | International Business Machines Corporation | Database aliasing in information access system |
US6516308B1 (en) | 2000-05-10 | 2003-02-04 | At&T Corp. | Method and apparatus for extracting data from data sources on a network |
KR100391391B1 (ko) * | 2000-06-14 | 2003-07-12 | (주) 제이.에스.씨.앤.아이 | Information extraction agent system for preventing copyright infringement and information providing method thereof |
US7117215B1 (en) | 2001-06-07 | 2006-10-03 | Informatica Corporation | Method and apparatus for transporting data for data warehousing applications that incorporates analytic data interface |
US7613790B2 (en) | 2000-07-13 | 2009-11-03 | Biap Systems, Inc. | Apparatus for and method of executing customized interactive computing services in a broadband network environment |
US7313588B1 (en) * | 2000-07-13 | 2007-12-25 | Biap Systems, Inc. | Locally executing software agent for retrieving remote content and method for creation and use of the agent |
JP4536880B2 (ja) * | 2000-07-18 | 2010-09-01 | キヤノン株式会社 | Information processing system and control method therefor, information processing apparatus and method, and storage medium |
WO2002027551A2 (en) | 2000-08-01 | 2002-04-04 | Nimble Technology, Inc. | Nested conditional relations (ncr) model and algebra |
US7308646B1 (en) * | 2000-08-30 | 2007-12-11 | International Business Machines Corporation | Integrating diverse data sources using a mark-up language |
CA2360906C (en) * | 2000-11-16 | 2010-01-05 | General Electric Company | Method for mapping information from a network-based data source |
US7917888B2 (en) * | 2001-01-22 | 2011-03-29 | Symbol Technologies, Inc. | System and method for building multi-modal and multi-channel applications |
US7269784B1 (en) | 2001-01-22 | 2007-09-11 | Kasriel Stephane | Server-originated differential caching |
US20020129342A1 (en) * | 2001-03-07 | 2002-09-12 | David Kil | Data mining apparatus and method with user interface based ground-truth tool and user algorithms |
US20020161757A1 (en) * | 2001-03-16 | 2002-10-31 | Jeffrey Mock | Simultaneous searching across multiple data sets |
US6915482B2 (en) * | 2001-03-28 | 2005-07-05 | Cyber Watcher As | Method and arrangement for web information monitoring |
WO2002086781A1 (fr) * | 2001-04-17 | 2002-10-31 | Nishimori, Koji | Method and system for automating a living environment |
WO2002097667A2 (en) * | 2001-05-31 | 2002-12-05 | Lixto Software Gmbh | Visual and interactive wrapper generation, automated information extraction from web pages, and translation into xml |
US7162643B1 (en) | 2001-06-15 | 2007-01-09 | Informatica Corporation | Method and system for providing transfer of analytic application data over a network |
US7185063B1 (en) | 2001-06-22 | 2007-02-27 | Digital River, Inc. | Content delivery network using differential caching |
US7720842B2 (en) | 2001-07-16 | 2010-05-18 | Informatica Corporation | Value-chained queries in analytic applications |
US7092997B1 (en) | 2001-08-06 | 2006-08-15 | Digital River, Inc. | Template identification with differential caching |
US7188214B1 (en) | 2001-08-07 | 2007-03-06 | Digital River, Inc. | Efficient compression using differential caching |
DE10144332A1 (de) * | 2001-09-10 | 2003-04-03 | Siemens Ag | Method for transmitting a process value, and control system |
US7752266B2 (en) | 2001-10-11 | 2010-07-06 | Ebay Inc. | System and method to facilitate translation of communications between entities over a network |
US8387074B2 (en) * | 2001-11-09 | 2013-02-26 | International Business Machines Corporation | Enterprise directory service |
US20030101212A1 (en) * | 2001-11-26 | 2003-05-29 | Borg Michael J. | Method for automatically completing an electronic form |
US7296051B1 (en) | 2002-02-19 | 2007-11-13 | Digital River, Inc. | Predictive predownload of templates with delta encoding |
US7487261B1 (en) | 2002-02-22 | 2009-02-03 | Digital River, Inc. | Delta caching service |
US6820077B2 (en) | 2002-02-22 | 2004-11-16 | Informatica Corporation | Method and system for navigating a large amount of data |
US8078505B2 (en) | 2002-06-10 | 2011-12-13 | Ebay Inc. | Method and system for automatically updating a seller application utilized in a network-based transaction facility |
US7020667B2 (en) * | 2002-07-18 | 2006-03-28 | International Business Machines Corporation | System and method for data retrieval and collection in a structured format |
US7035841B2 (en) * | 2002-07-18 | 2006-04-25 | Xerox Corporation | Method for automatic wrapper repair |
US7487234B2 (en) * | 2002-09-17 | 2009-02-03 | International Business Machines Corporation | Context conflict resolution and automatic context source maintenance |
US8452631B2 (en) | 2002-09-17 | 2013-05-28 | International Business Machines Corporation | Keeping working hours and calendar entries up-to date |
JP2004158050A (ja) * | 2002-11-01 | 2004-06-03 | Renesas Technology Corp | Semiconductor memory device |
US7146356B2 (en) * | 2003-03-21 | 2006-12-05 | International Business Machines Corporation | Real-time aggregation of unstructured data into structured data for SQL processing by a relational database engine |
US8458164B2 (en) * | 2003-07-15 | 2013-06-04 | International Business Machines Corporation | Query model tool and method for visually grouping and ungrouping predicates |
US20050015368A1 (en) * | 2003-07-15 | 2005-01-20 | International Business Machines Corporation | Query modelling tool having a dynamically adaptive interface |
US20050015361A1 (en) | 2003-07-15 | 2005-01-20 | International Business Machines Corporation | Model content provider with reusable components for supporting a plurality of GUI API's |
US7421458B1 (en) | 2003-10-16 | 2008-09-02 | Informatica Corporation | Querying, versioning, and dynamic deployment of database objects |
US7254590B2 (en) | 2003-12-03 | 2007-08-07 | Informatica Corporation | Set-oriented real-time data processing based on transaction boundaries |
US20050165789A1 (en) * | 2003-12-22 | 2005-07-28 | Minton Steven N. | Client-centric information extraction system for an information network |
US9189568B2 (en) | 2004-04-23 | 2015-11-17 | Ebay Inc. | Method and system to display and search in a language independent manner |
US7421322B1 (en) | 2004-04-30 | 2008-09-02 | Carfax, Inc. | System and method for automatic identification of vehicle identification number |
US7769752B1 (en) | 2004-04-30 | 2010-08-03 | Network Appliance, Inc. | Method and system for updating display of a hierarchy of categories for a document repository |
US7519621B2 (en) * | 2004-05-04 | 2009-04-14 | Pagebites, Inc. | Extracting information from Web pages |
WO2005109011A2 (en) * | 2004-05-05 | 2005-11-17 | Fluor Technologies Corporation | Integrated acceptance testing |
US8244689B2 (en) | 2006-02-17 | 2012-08-14 | Google Inc. | Attribute entropy as a signal in object normalization |
US7769579B2 (en) | 2005-05-31 | 2010-08-03 | Google Inc. | Learning facts from semi-structured text |
WO2006076579A2 (en) * | 2005-01-14 | 2006-07-20 | Cosmix Corporation | Web operation language |
US8682913B1 (en) | 2005-03-31 | 2014-03-25 | Google Inc. | Corroborating facts extracted from multiple sources |
US7587387B2 (en) | 2005-03-31 | 2009-09-08 | Google Inc. | User interface for facts query engine with snippets from information sources that include query terms and answer terms |
US9208229B2 (en) | 2005-03-31 | 2015-12-08 | Google Inc. | Anchor text summarization for corroboration |
US20060265357A1 (en) * | 2005-04-26 | 2006-11-23 | Potts Matthew P | Method of efficiently parsing a file for a plurality of strings |
US7831545B1 (en) | 2005-05-31 | 2010-11-09 | Google Inc. | Identifying the unifying subject of a set of facts |
US8996470B1 (en) | 2005-05-31 | 2015-03-31 | Google Inc. | System for ensuring the internal consistency of a fact repository |
US8260785B2 (en) | 2006-02-17 | 2012-09-04 | Google Inc. | Automatic object reference identification and linking in a browseable fact repository |
US7991797B2 (en) | 2006-02-17 | 2011-08-02 | Google Inc. | ID persistence through normalization |
US8700568B2 (en) | 2006-02-17 | 2014-04-15 | Google Inc. | Entity normalization via name normalization |
US8639782B2 (en) | 2006-08-23 | 2014-01-28 | Ebay, Inc. | Method and system for sharing metadata between interfaces |
US8099415B2 (en) * | 2006-09-08 | 2012-01-17 | Simply Hired, Inc. | Method and apparatus for assessing similarity between online job listings |
US8122026B1 (en) | 2006-10-20 | 2012-02-21 | Google Inc. | Finding and disambiguating references to entities on web pages |
US7752484B2 (en) * | 2006-10-24 | 2010-07-06 | Sap Ag | On-demand wrappers of application data with session failover recovery |
US20080195646A1 (en) * | 2007-02-12 | 2008-08-14 | Microsoft Corporation | Self-describing web data storage model |
US8347202B1 (en) | 2007-03-14 | 2013-01-01 | Google Inc. | Determining geographic locations for place names in a fact repository |
US8239350B1 (en) | 2007-05-08 | 2012-08-07 | Google Inc. | Date ambiguity resolution |
US7966291B1 (en) | 2007-06-26 | 2011-06-21 | Google Inc. | Fact-based object merging |
US7970766B1 (en) | 2007-07-23 | 2011-06-28 | Google Inc. | Entity type assignment |
US8738643B1 (en) | 2007-08-02 | 2014-05-27 | Google Inc. | Learning synonymous object names from anchor texts |
US7840548B2 (en) * | 2007-12-27 | 2010-11-23 | Yahoo! Inc. | System and method for adding identity to web rank |
US7933916B2 (en) * | 2008-02-28 | 2011-04-26 | Microsoft Corporation | Querying nonSQL data stores with a SQL-style language |
US9646308B1 (en) | 2008-06-06 | 2017-05-09 | Carfax, Inc. | Tool for selling and purchasing vehicle history reports |
US9003474B1 (en) | 2008-08-22 | 2015-04-07 | Taser International, Inc. | Systems and methods for managing disclosure of protectable information |
US9280528B2 (en) * | 2010-10-04 | 2016-03-08 | Yahoo! Inc. | Method and system for processing and learning rules for extracting information from incoming web pages |
US8935719B2 (en) | 2011-08-25 | 2015-01-13 | Comcast Cable Communications, Llc | Application triggering |
US10579634B2 (en) * | 2012-08-30 | 2020-03-03 | Citus Data Bilgi Islemleri Ticaret A.S. | Apparatus and method for operating a distributed database with foreign tables |
WO2014109009A1 (ja) * | 2013-01-09 | 2014-07-17 | 株式会社日立製作所 | Database management method, management computer, and storage medium |
EP2755148A1 (en) * | 2013-01-15 | 2014-07-16 | Fujitsu Limited | Data storage system, and program and method for execution in a data storage system |
US9414114B2 (en) | 2013-03-13 | 2016-08-09 | Comcast Cable Holdings, Llc | Selective interactivity |
US11076205B2 (en) | 2014-03-07 | 2021-07-27 | Comcast Cable Communications, Llc | Retrieving supplemental content |
CN110175157B (zh) * | 2019-04-24 | 2023-10-03 | 平安科技(深圳)有限公司 | Query method and query device for column-store files |
US20220004944A1 (en) | 2020-07-06 | 2022-01-06 | Grokit Data, Inc. | Automation system and method |
Family Cites Families (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4631673A (en) * | 1985-01-22 | 1986-12-23 | International Business Machines Corporation | Method for refreshing multicolumn tables in a relational data base using minimal information |
US4918588A (en) * | 1986-12-31 | 1990-04-17 | Wang Laboratories, Inc. | Office automation system with integrated image management |
US4918593A (en) * | 1987-01-08 | 1990-04-17 | Wang Laboratories, Inc. | Relational database system |
SE466029B (sv) * | 1989-03-06 | 1991-12-02 | Ibm Svenska Ab | Apparatus and method for analysis of natural language in a computer-based information processing system |
US5307484A (en) * | 1991-03-06 | 1994-04-26 | Chrysler Corporation | Relational data base repository system for managing functional and physical data structures of nodes and links of multiple computer networks |
US5748954A (en) * | 1995-06-05 | 1998-05-05 | Carnegie Mellon University | Method for searching a queued and ranked constructed catalog of files stored on a network |
US5710918A (en) * | 1995-06-07 | 1998-01-20 | International Business Machines Corporation | Method for distributed task fulfillment of web browser requests |
US5737592A (en) * | 1995-06-19 | 1998-04-07 | International Business Machines Corporation | Accessing a relational database over the Internet using macro language files |
US5706507A (en) * | 1995-07-05 | 1998-01-06 | International Business Machines Corporation | System and method for controlling access to data located on a content server |
US5649186A (en) * | 1995-08-07 | 1997-07-15 | Silicon Graphics Incorporated | System and method for a computer-based dynamic information clipping service |
- 1996-10-02: US application US08/724,943, patent US5826258A, not active (Expired - Lifetime)
- 1997-10-02: JP application JP9269792A, patent JPH10222539A, active (Pending)
- 2008-01-17: JP application JP2008007850A, patent JP4264118B2, not active (Expired - Lifetime)
- 2008-12-24: JP application JP2008328321A, patent JP2009080849A, active (Pending)
Also Published As
Publication number | Publication date |
---|---|
JPH10222539A (ja) | 1998-08-21 |
JP2009080849A (ja) | 2009-04-16 |
JP2008123547A (ja) | 2008-05-29 |
US5826258A (en) | 1998-10-20 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
JP4264118B2 (ja) | Method for structuring information from different information sources on a network | |
US7581170B2 (en) | Visual and interactive wrapper generation, automated information extraction from Web pages, and translation into XML | |
Arocena et al. | WebOQL: Restructuring documents, databases, and webs | |
US5913214A (en) | Data extraction from world wide web pages | |
CA2522686C (en) | Progressive relaxation of search criteria | |
US8122048B2 (en) | Context sensitive term expansion with dynamic term expansion | |
US6449620B1 (en) | Method and apparatus for generating information pages using semi-structured data stored in a structured manner | |
US7747617B1 (en) | Searching documents using a dimensional database | |
US6766330B1 (en) | Universal output constructor for XML queries | |
US6094649A (en) | Keyword searches of structured databases | |
US8086623B2 (en) | Context-sensitive term expansion with multiple levels of expansion | |
Stuckenschmidt et al. | Exploring large document repositories with RDF technology: The DOPE project | |
US20080114803A1 (en) | Database System With Path Based Query Engine | |
US20060015843A1 (en) | Semantic system for integrating software components | |
Binding et al. | KOS at your service: Programmatic access to knowledge organisation systems | |
Sadeh et al. | Library portals: toward the semantic Web | |
EP1774432A2 (en) | Patent mapping | |
JPWO2003060764A1 (ja) | 情報検索システム | |
Yu et al. | Metadata management system: design and implementation | |
Mahalingam et al. | Ontology tools for semantic reconciliation in distributed heterogeneous information environments | |
Sfakakis et al. | Eliminating query failures in a work‐centric library meta‐search environment | |
Ghiselli et al. | A unified access to extract knowledge from heterogeneous web archives | |
Bi et al. | Facilitating Integration of Distributed Statistical Databases Using Metadata and XML | |
Baldonado et al. | An extensible constructor tool for the rapid, interactive design of query synthesizers | |
Marin-Castro et al. | VR-Tree: A novel tree-based approach for modeling Web Query Interfaces |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
2008-06-24 | A131 | Notification of reasons for refusal | Free format text: JAPANESE INTERMEDIATE CODE: A131 |
2008-09-24 | A601 | Written request for extension of time | Free format text: JAPANESE INTERMEDIATE CODE: A601 |
2008-09-29 | A602 | Written permission of extension of time | Free format text: JAPANESE INTERMEDIATE CODE: A602 |
2008-10-24 | A601 | Written request for extension of time | Free format text: JAPANESE INTERMEDIATE CODE: A601 |
2008-10-29 | A602 | Written permission of extension of time | Free format text: JAPANESE INTERMEDIATE CODE: A602 |
2008-11-25 | A601 | Written request for extension of time | Free format text: JAPANESE INTERMEDIATE CODE: A601 |
2008-11-28 | A602 | Written permission of extension of time | Free format text: JAPANESE INTERMEDIATE CODE: A602 |
2008-12-24 | A521 | Request for written amendment filed | Free format text: JAPANESE INTERMEDIATE CODE: A523 |
 | TRDD | Decision of grant or rejection written | |
2009-02-03 | A01 | Written decision to grant a patent or to grant a registration (utility model) | Free format text: JAPANESE INTERMEDIATE CODE: A01 |
 | A01 | Written decision to grant a patent or to grant a registration (utility model) | Free format text: JAPANESE INTERMEDIATE CODE: A01 |
2009-02-13 | A61 | First payment of annual fees (during grant procedure) | Free format text: JAPANESE INTERMEDIATE CODE: A61 |
 | FPAY | Renewal fee payment (event date is renewal date of database) | Free format text: PAYMENT UNTIL: 20120220 Year of fee payment: 3 |
 | R150 | Certificate of patent or registration of utility model | Free format text: JAPANESE INTERMEDIATE CODE: R150 |
 | FPAY | Renewal fee payment (event date is renewal date of database) | Free format text: PAYMENT UNTIL: 20120220 Year of fee payment: 3 |
 | FPAY | Renewal fee payment (event date is renewal date of database) | Free format text: PAYMENT UNTIL: 20130220 Year of fee payment: 4 |
 | FPAY | Renewal fee payment (event date is renewal date of database) | Free format text: PAYMENT UNTIL: 20130220 Year of fee payment: 4 |
 | FPAY | Renewal fee payment (event date is renewal date of database) | Free format text: PAYMENT UNTIL: 20130220 Year of fee payment: 4 |
 | FPAY | Renewal fee payment (event date is renewal date of database) | Free format text: PAYMENT UNTIL: 20130220 Year of fee payment: 4 |
 | FPAY | Renewal fee payment (event date is renewal date of database) | Free format text: PAYMENT UNTIL: 20130220 Year of fee payment: 4 |
 | FPAY | Renewal fee payment (event date is renewal date of database) | Free format text: PAYMENT UNTIL: 20140220 Year of fee payment: 5 |
 | R250 | Receipt of annual fees | Free format text: JAPANESE INTERMEDIATE CODE: R250 |
 | R250 | Receipt of annual fees | Free format text: JAPANESE INTERMEDIATE CODE: R250 |
 | R250 | Receipt of annual fees | Free format text: JAPANESE INTERMEDIATE CODE: R250 |
 | R250 | Receipt of annual fees | Free format text: JAPANESE INTERMEDIATE CODE: R250 |
 | EXPY | Cancellation because of completion of term | |