JP2008305352A5 - - Google Patents
Download PDFInfo
- Publication number
- JP2008305352A5 JP2008305352A5 JP2007154467A JP2007154467A JP2008305352A5 JP 2008305352 A5 JP2008305352 A5 JP 2008305352A5 JP 2007154467 A JP2007154467 A JP 2007154467A JP 2007154467 A JP2007154467 A JP 2007154467A JP 2008305352 A5 JP2008305352 A5 JP 2008305352A5
- Authority
- JP
- Japan
- Prior art keywords
- index
- group
- file server
- client
- full
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Claims (3)
前記電子文書の新規作成,削除,部分的な変更を入力するとともに、検索対象文書のインデクスを入力するクライアントPC群と、
前記ファイルサーバ群の前記電子文書を照査して、当該電子文書に含まれている文言の索引用インデクスを作成するとともに、作成したインデクスを格納するインデクスデータベースを有するインデクサ用PC群と、
前記インデクサ用PC群で作成されたインデクスを格納する検索用PCとを有し、
前記ファイルサーバ群,クライアントPC群,インデクサ用PC群,検索用PCの間のそれぞれがスイッチイングハブを介して接続される通信路とを備え、
任意のクライアントPCから入力された検索対象インデクスに基づいて、前記検索用PCから前記検索対象インデクスに該当するインデクスを抽出して、当該クライアントPCに出力する全文検索システムにおいて、
前記クライアントPC群から前記ファイルサーバ群へアクセスされる情報のうちから、前記前記電子文書の新規作成,削除,部分的な変更に関連する更新候補情報を抽出して、格納するキャプチャ用PCを備え、
前記インデクサ用PC群は、前記インデクスを最初に作成する際にのみ、前記ファイルサーバ群を全体走査し、
前記インデクスの新規作成,部分的な変更および削除に伴う更新は、前記キャプチャPCに格納されている前記更新候補情報に基づいて、前記ファイルサーバ群を照査して、新規作成,削除,部分的な変更が実際に行われた更新実行情報を抽出し、当該更新実行情報に基づいて前記インデクスの更新を行う全文検索システムであって、
前記更新候補情報は、前記通信路から取得することを特徴とする全文検索システム。 A file server group having a database in which electronic documents to be searched are stored;
A group of client PCs for inputting new creation, deletion, and partial change of the electronic document, and for inputting an index of the search target document;
Checking the electronic documents of the file server group to create an index for the wording contained in the electronic document, and an indexer PC group having an index database for storing the created index;
A search PC for storing an index created by the indexer PC group,
Each of the file server group, the client PC group, the indexer PC group, and the search PC includes a communication path connected via a switching hub,
In a full-text search system that extracts an index corresponding to the search target index from the search PC based on a search target index input from an arbitrary client PC, and outputs the index to the client PC.
A capture PC is provided for extracting and storing update candidate information related to new creation, deletion, and partial change of the electronic document from information accessed from the client PC group to the file server group. ,
The indexer PC group scans the entire file server group only when the index is first created,
Updates associated with new creation, partial change and deletion of the index are performed by checking the file server group based on the update candidate information stored in the capture PC, and creating, deleting, or partially A full-text search system that extracts update execution information in which a change has actually been performed and updates the index based on the update execution information ,
The update candidate information is obtained from the communication path, and is a full-text search system.
前記更新候補情報を格納するパケトログと、A packet log for storing the update candidate information;
前記判別モジュールの制御用ファイルとを有することを特徴とする請求項1記載の全文検索システム。The full-text search system according to claim 1, further comprising a control file for the discrimination module.
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2007154467A JP4422742B2 (en) | 2007-06-11 | 2007-06-11 | Full-text search system |
PCT/JP2008/059128 WO2008152884A1 (en) | 2007-06-11 | 2008-05-19 | Full-text search system |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2007154467A JP4422742B2 (en) | 2007-06-11 | 2007-06-11 | Full-text search system |
Publications (3)
Publication Number | Publication Date |
---|---|
JP2008305352A JP2008305352A (en) | 2008-12-18 |
JP2008305352A5 true JP2008305352A5 (en) | 2009-09-03 |
JP4422742B2 JP4422742B2 (en) | 2010-02-24 |
Family
ID=40129496
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
JP2007154467A Active JP4422742B2 (en) | 2007-06-11 | 2007-06-11 | Full-text search system |
Country Status (2)
Country | Link |
---|---|
JP (1) | JP4422742B2 (en) |
WO (1) | WO2008152884A1 (en) |
Families Citing this family (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2013073557A (en) * | 2011-09-29 | 2013-04-22 | Hitachi Solutions Ltd | Information search system, search server and program |
JP5759881B2 (en) * | 2011-12-08 | 2015-08-05 | 株式会社日立ソリューションズ | Information processing system |
JP2013178685A (en) * | 2012-02-29 | 2013-09-09 | Nec Corp | Data processing system with asynchronous backup function, front system, backup method and program therefor |
JP2013196544A (en) * | 2012-03-22 | 2013-09-30 | Nec Corp | Document management system, document management method, and program therefor |
JP5887236B2 (en) * | 2012-09-24 | 2016-03-16 | 株式会社日立ソリューションズ | Business document processing apparatus, business document processing method, and business document processing program |
US10223431B2 (en) | 2013-01-31 | 2019-03-05 | Facebook, Inc. | Data stream splitting for low-latency data access |
Family Cites Families (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2001344245A (en) * | 2000-03-29 | 2001-12-14 | Fujitsu Ltd | Information processor |
JP2002182961A (en) * | 2000-12-13 | 2002-06-28 | Nec Corp | Synchronization system for database and method of the synchronization |
-
2007
- 2007-06-11 JP JP2007154467A patent/JP4422742B2/en active Active
-
2008
- 2008-05-19 WO PCT/JP2008/059128 patent/WO2008152884A1/en active Application Filing
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Cambazoglu et al. | Scalability challenges in web search engines | |
Rao et al. | An enhanced blacklist method to detect phishing websites | |
US8781815B1 (en) | Non-standard and standard clause detection | |
JP2008305352A5 (en) | ||
JP2008537264A5 (en) | ||
US10783153B2 (en) | Efficient internet protocol prefix match support on No-SQL and/or non-relational databases | |
WO2008152884A1 (en) | Full-text search system | |
US10216787B2 (en) | Method, apparatus, and computer-readable medium for contextual data mining using a relational data set | |
Hysing | Governing towards sustainability: Environmental governance and policy change in Swedish forestry and transport | |
WO2011163567A2 (en) | Methods and systems for filtering search results | |
De Wilde | Improving retrieval of historical content with entity linking | |
Susuri et al. | Machine learning based detection of vandalism in wikipedia across languages | |
CN104376067A (en) | Index file inputting method and retrieval method based on index file | |
JP2019020795A (en) | Document management device, document management system, and program | |
CN101853307A (en) | Note establishing method, corresponding network searching system and method thereof | |
Pourzaferani et al. | Repairing broken RDF links in the web of data | |
JP6707410B2 (en) | Document search device, document search method, and computer program | |
WO2005114409A3 (en) | System and method for managing a path environment variable | |
Spirin et al. | Unsupervised approach to generate informative structured snippets for job search engines | |
Rocha et al. | LODifying personal content sharing | |
CN103530418A (en) | Information searching and publishing method and information searching and publishing system | |
Suchomel et al. | Source retrieval for plagiarism detection | |
Chen et al. | When peculiarity makes a difference: object characterisation in heterogeneous information networks | |
Suchomel et al. | Approaches for candidate document retrieval | |
Medvar et al. | Bayesian Analysis of E3 Ubiquitin Ligase/AQP2 Interactions in the Renal Collecting Duct |