CN107330077B - Retrieval method of digital archive - Google Patents

Retrieval method of digital archive Download PDF

Info

Publication number
CN107330077B
CN107330077B CN201710529086.0A CN201710529086A CN107330077B CN 107330077 B CN107330077 B CN 107330077B CN 201710529086 A CN201710529086 A CN 201710529086A CN 107330077 B CN107330077 B CN 107330077B
Authority
CN
China
Prior art keywords
retrieval
retrieved
archive
acquiring
fuzzy
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201710529086.0A
Other languages
Chinese (zh)
Other versions
CN107330077A (en
Inventor
王建永
刘文彬
郭威
廖丹
吴广财
周珑
李颖
冷裕波
黄杰韬
方宽
黄慧欣
林俊
郑杰生
杨春
陈非
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Information Center of Guangdong Power Grid Co Ltd
Original Assignee
Information Center of Guangdong Power Grid Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Information Center of Guangdong Power Grid Co Ltd filed Critical Information Center of Guangdong Power Grid Co Ltd
Priority to CN201710529086.0A priority Critical patent/CN107330077B/en
Publication of CN107330077A publication Critical patent/CN107330077A/en
Application granted granted Critical
Publication of CN107330077B publication Critical patent/CN107330077B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2458Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
    • G06F16/2468Fuzzy queries
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/242Query formulation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/242Query formulation
    • G06F16/243Natural language query formulation

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Mathematical Physics (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Fuzzy Systems (AREA)
  • Software Systems (AREA)
  • Artificial Intelligence (AREA)
  • Automation & Control Theory (AREA)
  • Probability & Statistics with Applications (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention relates to a retrieval method of digital archive archives, which comprises full-text retrieval, catalogue retrieval, fuzzy retrieval and card retrieval.

Description

Retrieval method of digital archive
Technical Field
The invention relates to the technical field of digital archives, in particular to a retrieval method of digital archives.
Background
When searching, the current digital archives generally search only through a single searching mode, for example, only through a fuzzy searching mode. However, since different users have different retrieval requirements, providing only one retrieval method cannot meet the requirements of users for retrieval efficiency and retrieval accuracy due to different retrieval requirements.
Disclosure of Invention
The invention provides a retrieval method of digital archives, aiming at solving the technical defects of low retrieval efficiency and low retrieval accuracy caused by the single retrieval mode adopted by the archive management in the prior art during retrieval.
In order to realize the purpose, the technical scheme is as follows:
a retrieval method of digital archive archives comprises full-text retrieval, catalog retrieval, fuzzy retrieval and card retrieval.
Compared with the prior art, the invention has the beneficial effects that:
the retrieval method provided by the invention provides four retrieval modes for retrieval, can meet the retrieval requirements of users with different roles, and realizes 'quick, accurate and full' positioning of archive files.
Drawings
Fig. 1 is a diagram illustrating a specific implementation process of full-text search.
Fig. 2 is a diagram illustrating a specific implementation process of directory retrieval.
Fig. 3 is a diagram of a specific implementation process of fuzzy search.
Fig. 4 is a diagram illustrating a specific implementation process of card retrieval.
Detailed Description
The drawings are for illustrative purposes only and are not to be construed as limiting the patent;
the invention is further illustrated below with reference to the figures and examples.
Example 1
The retrieval method provided by the invention comprises four retrieval modes, namely full-text retrieval, catalog retrieval, fuzzy retrieval and card retrieval, wherein the retrieval results of the full-text retrieval, the catalog retrieval, the fuzzy retrieval and the card retrieval are displayed in a web snapshot mode.
As shown in fig. 1, the specific execution process of the full-text search is as follows:
s11, acquiring file type metadata information of a file to be retrieved according to the archive and the retrieval level;
s12, acquiring related synonyms according to the search keywords;
and S13, calling a retrieval interface of the full-text retrieval service, acquiring full-text retrieval contents according to the file type metadata information, the keywords and the synonym information, and finally displaying a retrieval result on a page.
In a specific implementation process, as shown in fig. 2, the specific implementation process of the directory retrieval is as follows:
s21, according to the file directory to be retrieved, obtaining metadata of the field to be retrieved displayed by the file directory;
s22, according to conditions formed by file directory field metadata to be retrieved, relational operators and retrieval keywords, splicing according to logical AND operators to form retrieval conditions;
s23, related contents are inquired in the file directory to be searched, a search result is obtained, and the search result is displayed on a page.
In a specific implementation process, as shown in fig. 3, the specific implementation process of the fuzzy search is as follows:
s31, according to the archive to be retrieved and the retrieval level, field metadata of the archive to be retrieved is obtained;
s32, acquiring related synonyms according to the search keywords, splicing to form a fuzzy search condition, using an 'inclusion' relation operator for the field metadata and the keywords or the synonyms, and using logical 'OR' operation connection between the field metadata;
and S33, inquiring related contents in the archive to be searched according to the fuzzy search condition, acquiring a search result, and displaying the search result on a page.
In a specific implementation process, as shown in fig. 4, the specific implementation process of the card retrieval is as follows:
s41, acquiring field metadata displayed on the card according to the archive to be retrieved and the retrieval level;
s42, splicing retrieval conditions according to logical AND operators and card retrieval conditions formed by field metadata, relational operators and retrieval keywords arranged on the cards;
s43, inquiring related content in the archive to be searched according to the searching conditions, obtaining a searching result, and displaying the result on a page.
It should be understood that the above-described embodiments of the present invention are merely examples for clearly illustrating the present invention, and are not intended to limit the embodiments of the present invention. Other variations and modifications will be apparent to persons skilled in the art in light of the above description. And are neither required nor exhaustive of all embodiments. Any modification, equivalent replacement, and improvement made within the spirit and principle of the present invention should be included in the protection scope of the claims of the present invention.

Claims (2)

1. A retrieval method of digital archive files is characterized in that: the method comprises full-text retrieval, catalog retrieval, fuzzy retrieval and card retrieval;
the specific execution process of the full-text retrieval is as follows:
s11, acquiring file type metadata information of a file to be retrieved according to the archive and the retrieval level;
s12, acquiring related synonyms according to the search keywords;
s13, calling a retrieval interface of full-text retrieval service, acquiring full-text retrieval contents according to the file type metadata information, the keywords and the synonym information, and finally displaying a retrieval result on a page;
the specific implementation process of the directory retrieval is as follows:
s21, according to the file directory to be retrieved, obtaining metadata of the field to be retrieved displayed by the file directory;
s22, according to conditions formed by file directory field metadata to be retrieved, relational operators and retrieval keywords, splicing according to logical AND operators to form retrieval conditions;
s23, inquiring related contents in a file directory to be retrieved, acquiring a retrieval result and displaying the retrieval result on a page;
the specific implementation process of the fuzzy retrieval is as follows:
s31, according to the archive to be retrieved and the retrieval level, field metadata of the archive to be retrieved is obtained;
s32, acquiring related synonyms according to the search keywords, and splicing to form fuzzy search conditions;
s33, inquiring related contents in an archive library to be retrieved according to the fuzzy retrieval conditions, acquiring a retrieval result, and displaying the retrieval result on a page;
the specific process of forming the fuzzy retrieval condition by splicing in the step S32 is as follows: the field metadata and the keywords or synonyms use a 'containing' relational operator, and the field metadata are connected by using a logical 'OR' operation;
the specific implementation process of the card retrieval is as follows:
s41, acquiring field metadata displayed on the card according to the archive to be retrieved and the retrieval level;
s42, splicing retrieval conditions according to logical AND operators and card retrieval conditions formed by field metadata, relational operators and retrieval keywords arranged on the cards;
s43, inquiring related content in the archive to be searched according to the searching conditions, obtaining a searching result, and displaying the result on a page.
2. The method for retrieving a digital archive file of claim 1, characterized in that: and the retrieval results of the full-text retrieval, the catalogue retrieval, the fuzzy retrieval and the card retrieval are displayed in a web page snapshot mode.
CN201710529086.0A 2017-07-01 2017-07-01 Retrieval method of digital archive Active CN107330077B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710529086.0A CN107330077B (en) 2017-07-01 2017-07-01 Retrieval method of digital archive

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201710529086.0A CN107330077B (en) 2017-07-01 2017-07-01 Retrieval method of digital archive

Publications (2)

Publication Number Publication Date
CN107330077A CN107330077A (en) 2017-11-07
CN107330077B true CN107330077B (en) 2020-07-14

Family

ID=60199688

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710529086.0A Active CN107330077B (en) 2017-07-01 2017-07-01 Retrieval method of digital archive

Country Status (1)

Country Link
CN (1) CN107330077B (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102637268A (en) * 2012-03-30 2012-08-15 云南远信科技有限公司 Digital archive system
CN102955822A (en) * 2011-08-31 2013-03-06 河南新创元信息网络有限公司 Classification-type secretarial document management system and method
CN104462306A (en) * 2014-11-28 2015-03-25 青岛大学 Automatic archive compiling and researching device
CN106855890A (en) * 2017-01-09 2017-06-16 广州巨杉软件开发有限公司 A kind of method for realizing the final consistency full-text search of high-performance data storehouse

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP4951331B2 (en) * 2006-12-26 2012-06-13 株式会社日立製作所 Storage system
CN103164388B (en) * 2011-12-09 2016-07-06 北大方正集团有限公司 In a kind of layout files structured message obtain method and device

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102955822A (en) * 2011-08-31 2013-03-06 河南新创元信息网络有限公司 Classification-type secretarial document management system and method
CN102637268A (en) * 2012-03-30 2012-08-15 云南远信科技有限公司 Digital archive system
CN104462306A (en) * 2014-11-28 2015-03-25 青岛大学 Automatic archive compiling and researching device
CN106855890A (en) * 2017-01-09 2017-06-16 广州巨杉软件开发有限公司 A kind of method for realizing the final consistency full-text search of high-performance data storehouse

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
"数字时代企业档案信息化的发展趋势--以河北省企业为例";耿树伟;《2014年海峡两岸档案暨缩微学术交流会论文集》;20160526;论文第5.2节 *
耿树伟."数字时代企业档案信息化的发展趋势--以河北省企业为例".《2014年海峡两岸档案暨缩微学术交流会论文集》.2016, *

Also Published As

Publication number Publication date
CN107330077A (en) 2017-11-07

Similar Documents

Publication Publication Date Title
US11516289B2 (en) Method and system for displaying similar email messages based on message contents
KR101002451B1 (en) Computer searching with associations
US7930306B2 (en) System and method for near and exact de-duplication of documents
US9009201B2 (en) Extended database search
US8239348B1 (en) Method and apparatus for automatically archiving data items from backup storage
US8682859B2 (en) Transferring records between tables using a change transaction log
US8782101B1 (en) Transferring data across different database platforms
US20080033903A1 (en) Methods and apparatuses for using location information
US20130198233A1 (en) Integrated repository of structured and unstructured data
CN103631937B (en) Build method, the apparatus and system of row storage index
CN107291949B (en) Information searching method and device
CN102339315A (en) Index updating method and system of advertisement data
CN107870915B (en) Indication of search results
US8984044B2 (en) Managing bookmarks within a collaborative application
CN105488198A (en) Recommendation method of application program and terminal
CN107330077B (en) Retrieval method of digital archive
CN107291951B (en) Data processing method, device, storage medium and processor
CN106528590B (en) Query method and device
CN103309899A (en) Data protection system file index structure and updating and positioning methods
US10417439B2 (en) Post-hoc management of datasets
Shobe et al. On mapping releases to commits in open source systems
US11275786B2 (en) Implementing enhanced DevOps process for cognitive search solutions
CN110659344B (en) Block method based full text search method
CN112380811A (en) Method, device and equipment for generating word text material hierarchical directory
CN105512339A (en) File searcher and searching method

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant