JP2008305352A5 - Full-text search system - Google Patents

Publication number
JP2008305352A5
Authority
JP
Japan
Prior art keywords
index
group
file server
client
full
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
JP2007154467A
Other languages
Japanese (ja)
Other versions
JP4422742B2 (en)
JP2008305352A (en)
Filing date
Publication date
Application filed
Priority to JP2007154467A (JP4422742B2)
Priority claimed from JP2007154467A (JP4422742B2)
Priority to PCT/JP2008/059128 (WO2008152884A1)
Publication of JP2008305352A
Publication of JP2008305352A5
Application granted
Publication of JP4422742B2
Active
Anticipated expiration

Claims (3)

A full-text search system comprising:
a file server group having a database in which electronic documents to be searched are stored;
a client PC group for inputting new creation, deletion, and partial modification of the electronic documents, and for inputting an index of the documents to be searched;
an indexer PC group that examines the electronic documents of the file server group, creates an index of the wording contained in those documents, and has an index database for storing the created index;
a search PC for storing the index created by the indexer PC group; and
a communication path over which the file server group, the client PC group, the indexer PC group, and the search PC are connected to one another via a switching hub,
the system extracting from the search PC, based on a search target index input from any client PC, the index entries corresponding to that search target index, and outputting them to that client PC,
wherein the system further comprises a capture PC that extracts, from the information with which the client PC group accesses the file server group, update candidate information relating to new creation, deletion, and partial modification of the electronic documents, and stores it;
the indexer PC group scans the entire file server group only when the index is first created;
updates to the index accompanying new creation, partial modification, and deletion are performed by examining the file server group based on the update candidate information stored in the capture PC, extracting update execution information for which a new creation, deletion, or partial modification was actually carried out, and updating the index based on that update execution information; and
the update candidate information is acquired from the communication path.
The full-text search system according to claim 1, wherein the capture PC comprises: a packet discrimination module that acquires, via a tap installed on the communication path, the information with which the client PC group accesses the file server group, and extracts the update candidate information;
a packet log for storing the update candidate information; and
a control file for the discrimination module.
The full-text search system according to claim 1, wherein the index includes the creation of access rights to the electronic documents to be searched, and update information such as new creation, deletion, and partial modification of those access rights.
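
The update flow in claims 1 and 2 can be illustrated with a minimal Python sketch. It is not the patented implementation. Every name in it (`Packet`, `FileServer`, `CapturePC`, `Indexer`, the operation names in `UPDATE_OPS`) is hypothetical, the "packets" are plain in-memory objects rather than captured network traffic, and a comparison of document text stands in for whatever check the real indexer performs against the file servers. The sketch shows the three stages only: one full scan at index creation, filtering of observed accesses into update candidates, and verification of each candidate against the file server before the index is touched.

```python
from dataclasses import dataclass

# Hypothetical operation names treated as possible document changes.
# In the claimed system this role belongs to the discrimination module's control file.
UPDATE_OPS = {"create", "write", "delete"}


@dataclass(frozen=True)
class Packet:
    op: str    # operation seen on the client -> file server path
    path: str  # document the operation refers to


class FileServer:
    """Stands in for the file server group: maps path -> document text."""
    def __init__(self, docs=None):
        self.docs = dict(docs or {})


class CapturePC:
    """Keeps a packet log of update candidates (paths that may have changed)."""
    def __init__(self, update_ops=UPDATE_OPS):
        self.update_ops = set(update_ops)
        self.packet_log = []

    def observe(self, packet):
        # Packet discrimination: keep only operations that could change a document.
        if packet.op in self.update_ops and packet.path not in self.packet_log:
            self.packet_log.append(packet.path)

    def drain(self):
        candidates, self.packet_log = self.packet_log, []
        return candidates


class Indexer:
    def __init__(self, server):
        self.server = server
        self.index = {}    # term -> set of paths
        self.indexed = {}  # path -> text as of the last indexing

    def _add(self, path, text):
        for term in set(text.lower().split()):
            self.index.setdefault(term, set()).add(path)
        self.indexed[path] = text

    def _remove(self, path):
        old = self.indexed.pop(path, None)
        if old is None:
            return
        for term in set(old.lower().split()):
            postings = self.index.get(term)
            if postings:
                postings.discard(path)
                if not postings:
                    del self.index[term]

    def full_scan(self):
        """Whole-server scan, intended to run only when the index is first built."""
        self.index.clear()
        self.indexed.clear()
        for path, text in self.server.docs.items():
            self._add(path, text)

    def apply_candidates(self, candidates):
        """Check each candidate against the server; re-index only real changes."""
        confirmed = []
        for path in candidates:
            current = self.server.docs.get(path)
            if current == self.indexed.get(path):
                continue  # candidate did not turn out to be an actual change
            self._remove(path)
            if current is not None:
                self._add(path, current)
            confirmed.append(path)
        return confirmed

    def search(self, term):
        return sorted(self.index.get(term.lower(), set()))


server = FileServer({"a.txt": "full text search", "b.txt": "file server"})
capture = CapturePC()
indexer = Indexer(server)
indexer.full_scan()

server.docs["c.txt"] = "packet capture"
capture.observe(Packet("create", "c.txt"))
capture.observe(Packet("read", "a.txt"))    # not an update operation, ignored
capture.observe(Packet("write", "b.txt"))   # candidate, but the content is unchanged
del server.docs["a.txt"]
capture.observe(Packet("delete", "a.txt"))

confirmed = indexer.apply_candidates(capture.drain())
```

In this run the write to `b.txt` is logged as a candidate but dropped during verification, which mirrors the distinction the claims draw between update candidate information and update execution information. The access-right information of claim 3 is not modeled here.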
JP2007154467A 2007-06-11 2007-06-11 Full-text search system Active JP4422742B2 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
JP2007154467A JP4422742B2 (en) 2007-06-11 2007-06-11 Full-text search system
PCT/JP2008/059128 WO2008152884A1 (en) 2007-06-11 2008-05-19 Full-text search system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP2007154467A JP4422742B2 (en) 2007-06-11 2007-06-11 Full-text search system

Publications (3)

Publication Number Publication Date
JP2008305352A (en) 2008-12-18
JP2008305352A5 (en) 2009-09-03
JP4422742B2 (en) 2010-02-24

Family

ID=40129496

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2007154467A Active JP4422742B2 (en) 2007-06-11 2007-06-11 Full-text search system

Country Status (2)

Country Link
JP (1) JP4422742B2 (en)
WO (1) WO2008152884A1 (en)

Families Citing this family (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2013073557A (en) * 2011-09-29 2013-04-22 Hitachi Solutions Ltd Information search system, search server and program
JP5759881B2 (en) * 2011-12-08 2015-08-05 株式会社日立ソリューションズ Information processing system
JP2013178685A (en) * 2012-02-29 2013-09-09 Nec Corp Data processing system with asynchronous backup function, front system, backup method and program therefor
JP2013196544A (en) * 2012-03-22 2013-09-30 Nec Corp Document management system, document management method, and program therefor
JP5887236B2 (en) * 2012-09-24 2016-03-16 株式会社日立ソリューションズ Business document processing apparatus, business document processing method, and business document processing program
US10223431B2 (en) 2013-01-31 2019-03-05 Facebook, Inc. Data stream splitting for low-latency data access

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2001344245A (en) * 2000-03-29 2001-12-14 Fujitsu Ltd Information processor
JP2002182961A (en) * 2000-12-13 2002-06-28 Nec Corp Synchronization system for database and method of the synchronization
