CN107330077B - Retrieval method of digital archive - Google Patents
Retrieval method of digital archive
- Publication number
- CN107330077B (application CN201710529086.0A)
- Authority
- CN
- China
- Prior art keywords
- retrieval
- retrieved
- archive
- acquiring
- fuzzy
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/24—Querying
- G06F16/245—Query processing
- G06F16/2458—Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
- G06F16/2468—Fuzzy queries
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/24—Querying
- G06F16/242—Query formulation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/24—Querying
- G06F16/242—Query formulation
- G06F16/243—Natural language query formulation
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Mathematical Physics (AREA)
- Computational Linguistics (AREA)
- Data Mining & Analysis (AREA)
- Databases & Information Systems (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Fuzzy Systems (AREA)
- Software Systems (AREA)
- Artificial Intelligence (AREA)
- Automation & Control Theory (AREA)
- Probability & Statistics with Applications (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
The invention relates to a retrieval method for digital archives, which comprises full-text retrieval, catalog retrieval, fuzzy retrieval and card retrieval.
Description
Technical Field
The invention relates to the technical field of digital archives, in particular to a retrieval method of digital archives.
Background
Current digital archive systems generally support only a single retrieval mode, for example fuzzy retrieval alone. Because different users have different retrieval requirements, a single retrieval mode cannot satisfy all of them in terms of retrieval efficiency and retrieval accuracy.
Disclosure of Invention
The invention provides a retrieval method for digital archives that aims to overcome the low retrieval efficiency and low retrieval accuracy that result from the single retrieval mode used in prior-art archive management.
To achieve this purpose, the technical scheme is as follows:
A retrieval method for digital archives comprises full-text retrieval, catalog retrieval, fuzzy retrieval and card retrieval.
Compared with the prior art, the invention has the following beneficial effects:
The retrieval method provides four retrieval modes, can meet the retrieval requirements of users in different roles, and locates archive files quickly, accurately and completely.
Drawings
Fig. 1 is a diagram illustrating the specific execution process of full-text retrieval.
Fig. 2 is a diagram illustrating the specific execution process of catalog retrieval.
Fig. 3 is a diagram illustrating the specific execution process of fuzzy retrieval.
Fig. 4 is a diagram illustrating the specific execution process of card retrieval.
Detailed Description
The drawings are for illustrative purposes only and are not to be construed as limiting the patent.
The invention is further illustrated below with reference to the figures and examples.
Example 1
The retrieval method provided by the invention comprises four retrieval modes, namely full-text retrieval, catalog retrieval, fuzzy retrieval and card retrieval. The retrieval results of all four modes are displayed as web page snapshots.
As shown in Fig. 1, the specific execution process of the full-text retrieval is as follows:
S11, acquiring file type metadata information of the files to be retrieved according to the archive and the retrieval level;
S12, acquiring related synonyms according to the retrieval keywords;
S13, calling a retrieval interface of the full-text retrieval service, acquiring the full-text retrieval contents according to the file type metadata information, the keywords and the synonym information, and finally displaying the retrieval result on a page.
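Steps S11 to S13 can be sketched in Python as follows. This is only an illustration under stated assumptions: the patent does not name the full-text retrieval service or its interface, so `full_text_search` is a hypothetical in-memory stand-in, and the lookup tables `FILE_TYPE_METADATA` and `SYNONYMS` with their contents are invented for the example.

```python
from dataclasses import dataclass

# Hypothetical lookup tables; the patent does not specify their contents.
FILE_TYPE_METADATA = {
    ("personnel_archive", "file"): ["pdf", "docx"],
    ("personnel_archive", "volume"): ["pdf"],
}
SYNONYMS = {"contract": ["agreement"], "salary": ["wage", "pay"]}


@dataclass
class Document:
    doc_id: str
    file_type: str
    text: str


def get_file_type_metadata(archive, level):
    """S11: file type metadata for the given archive and retrieval level."""
    return FILE_TYPE_METADATA.get((archive, level), [])


def expand_with_synonyms(keywords):
    """S12: the keywords plus their related synonyms, without duplicates."""
    terms = []
    for keyword in keywords:
        for term in [keyword, *SYNONYMS.get(keyword, [])]:
            if term not in terms:
                terms.append(term)
    return terms


def full_text_search(docs, file_types, terms):
    """S13 stand-in for the retrieval interface of a full-text service."""
    lowered = [term.lower() for term in terms]
    return [
        doc.doc_id
        for doc in docs
        if doc.file_type in file_types
        and any(term in doc.text.lower() for term in lowered)
    ]


def full_text_retrieval(docs, archive, level, keywords):
    """Run S11, S12 and S13 in order and return the matching document ids."""
    file_types = get_file_type_metadata(archive, level)
    terms = expand_with_synonyms(keywords)
    return full_text_search(docs, file_types, terms)
```

In this sketch a document matches when its file type is allowed for the archive and level and its text contains the keyword or any synonym; a real full-text service would typically use an inverted index rather than substring scanning.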
As shown in Fig. 2, the specific execution process of the catalog retrieval is as follows:
S21, acquiring, according to the file catalog to be retrieved, the metadata of the retrievable fields displayed by that catalog;
S22, forming individual conditions from the catalog field metadata, relational operators and retrieval keywords, and joining these conditions with logical AND operators to form the retrieval condition;
S23, querying the related contents in the file catalog to be retrieved, acquiring the retrieval result, and displaying the retrieval result on a page.
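One way to picture steps S21 to S23 is as the assembly of a parameterized SQL WHERE clause. The sketch below assumes, purely for illustration, that the catalog is a relational table named `catalog`; the field set `CATALOG_FIELDS`, the operator whitelist and the function names are hypothetical and do not come from the patent.

```python
import sqlite3

# Hypothetical catalog field metadata (S21); a real system would load this
# from the configuration of the file catalog being searched.
CATALOG_FIELDS = {"title", "year", "department"}
ALLOWED_OPERATORS = {"=", "<>", "<", "<=", ">", ">=", "LIKE"}


def build_and_condition(conditions):
    """S22: join (field, operator, keyword) triples with logical AND."""
    clauses, params = [], []
    for field, op, keyword in conditions:
        # Field names and operators are checked against whitelists because
        # they are interpolated into the SQL text; keywords are bound as
        # parameters instead.
        if field not in CATALOG_FIELDS or op not in ALLOWED_OPERATORS:
            raise ValueError(f"unsupported condition: {field} {op}")
        clauses.append(f"{field} {op} ?")
        params.append(keyword)
    return " AND ".join(clauses), params


def catalog_retrieval(conn, conditions):
    """S23: query the catalog table with the assembled condition."""
    where, params = build_and_condition(conditions)
    sql = "SELECT id FROM catalog"
    if where:
        sql += " WHERE " + where
    sql += " ORDER BY id"
    return [row[0] for row in conn.execute(sql, params)]
```

Because every condition is joined with AND, each additional field the user fills in narrows the result set, which suits users who already know the catalog structure.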
As shown in Fig. 3, the specific execution process of the fuzzy retrieval is as follows:
S31, acquiring the field metadata of the archive to be retrieved according to the archive to be retrieved and the retrieval level;
S32, acquiring related synonyms according to the retrieval keywords and assembling a fuzzy retrieval condition, in which each field is tested against the keyword or a synonym with a "contains" relational operator, and the conditions on the individual fields are connected by logical OR;
S33, querying the related contents in the archive to be retrieved according to the fuzzy retrieval condition, acquiring the retrieval result, and displaying the retrieval result on a page.
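The fuzzy retrieval condition of S32 can be sketched as one "contains" test per combination of field and term, all joined with OR. The example below is a minimal illustration that assumes a relational table named `archive` and uses SQL `LIKE` as the "contains" operator; `ARCHIVE_FIELDS`, `SYNONYMS` and the function names are hypothetical.

```python
import sqlite3

# Hypothetical field metadata for one archive and retrieval level (S31).
ARCHIVE_FIELDS = {"title", "summary"}
# Hypothetical synonym table; the patent does not say where synonyms come from.
SYNONYMS = {"contract": ["agreement"]}


def expand_with_synonyms(keyword):
    """S32, first half: the keyword followed by its related synonyms."""
    return [keyword, *SYNONYMS.get(keyword, [])]


def build_fuzzy_condition(fields, keyword):
    """S32, second half: 'contains' tests on every field, joined with OR."""
    unknown = [field for field in fields if field not in ARCHIVE_FIELDS]
    if not fields or unknown:
        raise ValueError(f"unsupported fields: {unknown or fields}")
    terms = expand_with_synonyms(keyword)
    # Note: '%' or '_' inside a keyword acts as a wildcard in this sketch.
    clauses = [f"{field} LIKE ?" for field in fields for _ in terms]
    params = [f"%{term}%" for _ in fields for term in terms]
    return " OR ".join(clauses), params


def fuzzy_retrieval(conn, fields, keyword):
    """S33: query the archive table with the fuzzy condition."""
    where, params = build_fuzzy_condition(fields, keyword)
    sql = f"SELECT id FROM archive WHERE {where} ORDER BY id"
    return [row[0] for row in conn.execute(sql, params)]
```

Joining the tests with OR means a record is returned if any field contains the keyword or any synonym, so this mode favours recall over precision.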
As shown in Fig. 4, the specific execution process of the card retrieval is as follows:
S41, acquiring the field metadata displayed on the card according to the archive to be retrieved and the retrieval level;
S42, forming card retrieval conditions from the field metadata, the relational operators set on the card and the retrieval keywords, and joining these conditions with logical AND operators to form the retrieval condition;
S43, querying the related contents in the archive to be retrieved according to the retrieval condition, acquiring the retrieval result, and displaying the retrieval result on a page.
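Card retrieval differs from catalog retrieval mainly in that the relational operator for each field is set on the card rather than chosen per query. A minimal in-memory sketch of S41 to S43 follows; the card layout `CARD_LAYOUTS`, the operator names and the rule that blank card fields are skipped are all assumptions made for the example, not details given in the patent.

```python
import operator
from dataclasses import dataclass

# Relational operators a card field may be configured with (hypothetical set).
OPERATORS = {
    "eq": operator.eq,
    "ge": operator.ge,
    "le": operator.le,
    "contains": lambda value, keyword: keyword in value,
}


@dataclass(frozen=True)
class CardField:
    name: str
    op: str


# Hypothetical card layout for one archive and retrieval level (S41).
CARD_LAYOUTS = {
    ("contracts", "file"): [
        CardField("title", "contains"),
        CardField("year", "ge"),
        CardField("department", "eq"),
    ],
}


def build_card_condition(archive, level, inputs):
    """S42: one (field, operator, keyword) condition per filled-in field."""
    layout = CARD_LAYOUTS[(archive, level)]
    # Assumption: fields the user leaves blank contribute no condition.
    return [
        (field.name, OPERATORS[field.op], inputs[field.name])
        for field in layout
        if field.name in inputs
    ]


def card_retrieval(records, archive, level, inputs):
    """S43: keep the records that satisfy every condition (logical AND)."""
    conditions = build_card_condition(archive, level, inputs)
    return [
        record["id"]
        for record in records
        if all(op(record[name], keyword) for name, op, keyword in conditions)
    ]
```

With this layout, entering "contract" in the title field and 2015 in the year field returns only records whose title contains "contract" and whose year is 2015 or later.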
It should be understood that the above-described embodiments are merely examples given to illustrate the present invention clearly and are not intended to limit its embodiments. Other variations and modifications will be apparent to persons skilled in the art in light of the above description. It is neither necessary nor possible to enumerate all embodiments here. Any modification, equivalent replacement or improvement made within the spirit and principle of the present invention shall fall within the protection scope of the claims of the present invention.
Claims (2)
1. A retrieval method for digital archive files, characterized in that the method comprises full-text retrieval, catalog retrieval, fuzzy retrieval and card retrieval;
the specific execution process of the full-text retrieval is as follows:
S11, acquiring file type metadata information of the files to be retrieved according to the archive and the retrieval level;
S12, acquiring related synonyms according to the retrieval keywords;
S13, calling a retrieval interface of the full-text retrieval service, acquiring the full-text retrieval contents according to the file type metadata information, the keywords and the synonym information, and finally displaying the retrieval result on a page;
the specific execution process of the catalog retrieval is as follows:
S21, acquiring, according to the file catalog to be retrieved, the metadata of the retrievable fields displayed by that catalog;
S22, forming individual conditions from the catalog field metadata, relational operators and retrieval keywords, and joining these conditions with logical AND operators to form the retrieval condition;
S23, querying the related contents in the file catalog to be retrieved, acquiring the retrieval result, and displaying the retrieval result on a page;
the specific execution process of the fuzzy retrieval is as follows:
S31, acquiring the field metadata of the archive to be retrieved according to the archive to be retrieved and the retrieval level;
S32, acquiring related synonyms according to the retrieval keywords, and assembling a fuzzy retrieval condition;
S33, querying the related contents in the archive to be retrieved according to the fuzzy retrieval condition, acquiring the retrieval result, and displaying the retrieval result on a page;
the specific process of assembling the fuzzy retrieval condition in step S32 is as follows: each field is tested against the keyword or a synonym with a "contains" relational operator, and the conditions on the individual fields are connected by logical OR;
the specific execution process of the card retrieval is as follows:
S41, acquiring the field metadata displayed on the card according to the archive to be retrieved and the retrieval level;
S42, forming card retrieval conditions from the field metadata, the relational operators set on the card and the retrieval keywords, and joining these conditions with logical AND operators to form the retrieval condition;
S43, querying the related contents in the archive to be retrieved according to the retrieval condition, acquiring the retrieval result, and displaying the retrieval result on a page.
2. The retrieval method for digital archive files according to claim 1, characterized in that the retrieval results of the full-text retrieval, the catalog retrieval, the fuzzy retrieval and the card retrieval are displayed as web page snapshots.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710529086.0A CN107330077B (en) | 2017-07-01 | 2017-07-01 | Retrieval method of digital archive |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710529086.0A CN107330077B (en) | 2017-07-01 | 2017-07-01 | Retrieval method of digital archive |
Publications (2)
Publication Number | Publication Date |
---|---|
CN107330077A (en) | 2017-11-07 |
CN107330077B (en) | 2020-07-14 |
Family
ID=60199688
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201710529086.0A Active CN107330077B (en) | 2017-07-01 | 2017-07-01 | Retrieval method of digital archive |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN107330077B (en) |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102637268A (en) * | 2012-03-30 | 2012-08-15 | 云南远信科技有限公司 | Digital archive system |
CN102955822A (en) * | 2011-08-31 | 2013-03-06 | 河南新创元信息网络有限公司 | Classification-type secretarial document management system and method |
CN104462306A (en) * | 2014-11-28 | 2015-03-25 | 青岛大学 | Automatic archive compiling and researching device |
CN106855890A (en) * | 2017-01-09 | 2017-06-16 | 广州巨杉软件开发有限公司 | A kind of method for realizing the final consistency full-text search of high-performance data storehouse |
Family Cites Families (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP4951331B2 (en) * | 2006-12-26 | 2012-06-13 | 株式会社日立製作所 | Storage system |
CN103164388B (en) * | 2011-12-09 | 2016-07-06 | 北大方正集团有限公司 | In a kind of layout files structured message obtain method and device |
Non-Patent Citations (2)
Title |
---|
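Geng Shuwei (耿树伟); "Development Trends of Enterprise Archive Informatization in the Digital Age: Taking Enterprises in Hebei Province as an Example" ("数字时代企业档案信息化的发展趋势--以河北省企业为例"); Proceedings of the 2014 Cross-Strait Archives and Microfilm Academic Exchange Conference (《2014年海峡两岸档案暨缩微学术交流会论文集》); 2016-05-26; Section 5.2 of the paper * |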
Also Published As
Publication number | Publication date |
---|---|
CN107330077A (en) | 2017-11-07 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US11516289B2 (en) | Method and system for displaying similar email messages based on message contents | |
KR101002451B1 (en) | Computer searching with associations | |
US7930306B2 (en) | System and method for near and exact de-duplication of documents | |
US9009201B2 (en) | Extended database search | |
US8239348B1 (en) | Method and apparatus for automatically archiving data items from backup storage | |
US8682859B2 (en) | Transferring records between tables using a change transaction log | |
US8782101B1 (en) | Transferring data across different database platforms | |
US20080033903A1 (en) | Methods and apparatuses for using location information | |
US20130198233A1 (en) | Integrated repository of structured and unstructured data | |
CN103631937B (en) | Build method, the apparatus and system of row storage index | |
CN107291949B (en) | Information searching method and device | |
CN102339315A (en) | Index updating method and system of advertisement data | |
CN107870915B (en) | Indication of search results | |
US8984044B2 (en) | Managing bookmarks within a collaborative application | |
CN105488198A (en) | Recommendation method of application program and terminal | |
CN107330077B (en) | Retrieval method of digital archive | |
CN107291951B (en) | Data processing method, device, storage medium and processor | |
CN106528590B (en) | Query method and device | |
CN103309899A (en) | Data protection system file index structure and updating and positioning methods | |
US10417439B2 (en) | Post-hoc management of datasets | |
Shobe et al. | On mapping releases to commits in open source systems | |
US11275786B2 (en) | Implementing enhanced DevOps process for cognitive search solutions | |
CN110659344B (en) | Block method based full text search method | |
CN112380811A (en) | Method, device and equipment for generating word text material hierarchical directory | |
CN105512339A (en) | File searcher and searching method |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||