CN103530384B - Internet information resource quick searching method - Google Patents


Publication number
CN103530384B
Authority
CN
China
Prior art keywords
file
index
information resource
searching
internet information
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201310492732.2A
Other languages
Chinese (zh)
Other versions
CN103530384A (en)
Inventor
史高升
朱涛
赵西法
韩祥松
荣丽萍
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Zhenghe Technology Co ltd
Original Assignee
JINAN ZHENGHE TECHNOLOGY Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by JINAN ZHENGHE TECHNOLOGY Co Ltd
Priority to CN201310492732.2A
Publication of CN103530384A
Application granted
Publication of CN103530384B
Legal status: Active
Anticipated expiration

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90 Details of database functions independent of the retrieved data types
    • G06F16/95 Retrieval from the web
    • G06F16/951 Indexing; Web crawling techniques

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses a method for fast retrieval of Internet information resources. The method comprises the following steps: defining an index file format and creating a document class; converting the target files in a database into binary files and generating a numbered index list; creating a field description class that stores the attributes of a single field; performing word segmentation on document content; adding each indexed document to its corresponding document object; packaging the keywords searched by the user; creating an object in which search results are saved for the user; processing the keywords; creating a data conversion class that converts the search results into the format required for web display; and applying a display template to convert the search results into the format required for page presentation. The method achieves block-wise retrieval, which greatly increases retrieval speed. When new data is generated, a small-file index parallel to the original file index can be built for it, so that the new data can be searched quickly.

Description

Method for fast retrieval of Internet information resources
Technical field
The present invention relates to the field of Internet information search, and specifically to a method for fast retrieval of Internet information resources.
Background
With the continuous development of Internet technology, Internet applications, led by search engines, have penetrated ever more widely into every area of China's economic, cultural and political life. Network information resources are enormous in volume and are updated rapidly, so retrieving the information people need from this mass of Internet information quickly and accurately is a direction in which Internet enterprises seek to improve. At present, information is searched for in databases. Each database is very large, so search times are long. Moreover, after the data is updated, the updated data is mixed with the original data in a new database, which makes searching even more cumbersome.
Summary of the invention
To overcome the above shortcomings of the prior art, the object of the present invention is to provide a method for fast retrieval of Internet information resources that implements block indexing and increases indexing speed.
To solve the above problem, the present invention adopts the following technical solution: a method for fast retrieval of Internet information resources, comprising the following steps:
S1, define an index file format and create a document class for storing the records of the file index and the index range;
S2, convert the target files in the database into binary files, generate an index list, and number the index list, wherein the target files include web pages, files and database records;
S3, create a field description class for storing the attributes of a single field;
S4, before a document is indexed, first perform word segmentation on the document content;
S5, add each indexed document to its corresponding document object;
S6, package the keywords searched by the user into a form that is readily recognized in the binary files being searched;
S7, create an object in which search results are saved for the user;
S8, parse the search keywords entered by the user and process them according to a keyword exclusion dictionary;
S9, create a data conversion class that converts the search results into the format required for web display;
S10, apply a display template to convert the search results into the format required for page presentation.
When a new document data record is added, a small-file index is created in step S1; the small-file index is parallel to the original file index.
The index file format defined in step S1 is based on 8-bit bytes or 16-bit bytes.
The beneficial effects of the invention are as follows: by creating the document class, data is classified more finely than in earlier databases, which achieves block-wise retrieval and greatly increases retrieval speed. In addition, when new data is produced, a small-file index can be created for it alongside the original index file, which speeds up indexing of the new data.
Brief description of the drawings
The present invention is further described below with reference to the drawings and embodiments:
Fig. 1 is a flow diagram of the present invention;
Fig. 2 is a system structure diagram of the present invention.
Detailed description of the embodiments
As shown in Fig. 1, one embodiment of the present invention includes the following steps:
S1, define an index file format and create a document class for storing the records of the file index and the index range;
S2, convert the target files in the database into binary files, generate an index list, and number the index list, wherein the target files include web pages, files and database records;
S3, create a field description class for storing the attributes of a single field;
S4, before a document is indexed, first perform word segmentation on the document content;
S5, add each indexed document to its corresponding document object, to facilitate subsequent search operations;
S6, package the keywords searched by the user into a form that is readily recognized in the binary files being searched;
S7, create an object in which search results are saved for the user;
S8, parse the search keywords entered by the user and process them according to a keyword exclusion dictionary, removing auxiliary words that indicate mood, such as " " and " ";
S9, create a data conversion class that converts the search results into the format required for web display;
S10, apply a display template to convert the search results into the format required for page presentation.
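The indexing and query steps above (S1 to S8) can be sketched roughly in Python as follows. Every class and function name here is hypothetical, the whitespace tokenizer is only a stand-in for real word segmentation, and the small `EXCLUDED` set stands in for the keyword exclusion dictionary; none of these details come from the source.

```python
from dataclasses import dataclass, field


@dataclass
class FieldDescription:
    """S3: attributes of a single field (names are illustrative)."""
    name: str
    value: str
    indexed: bool = True


@dataclass
class Document:
    """S1/S5: a numbered document object holding its fields."""
    number: int
    fields: list


@dataclass
class SearchResults:
    """S7: object that keeps the results of one query for the user."""
    keywords: list
    doc_numbers: list = field(default_factory=list)


# S8: stand-in for the keyword exclusion dictionary (assumed contents).
EXCLUDED = {"the", "a", "of"}


def segment(text):
    """S4 stand-in: lowercase whitespace split, not real word segmentation."""
    return text.lower().split()


class Index:
    def __init__(self):
        self.documents = {}  # document number -> Document
        self.postings = {}   # term -> set of document numbers (inverted index)

    def add(self, fields):
        number = len(self.documents) + 1  # S2: sequential numbering
        doc = Document(number, fields)
        self.documents[number] = doc
        for f in fields:
            if f.indexed:
                for term in segment(f.value):
                    self.postings.setdefault(term, set()).add(number)
        return doc

    def search(self, query):
        # S6/S8: normalize the query and drop excluded words.
        keywords = [t for t in segment(query) if t not in EXCLUDED]
        results = SearchResults(keywords)
        if keywords:
            # A document matches only if it contains every remaining keyword.
            hits = set.intersection(*(self.postings.get(k, set()) for k in keywords))
            results.doc_numbers = sorted(hits)
        return results
```

For example, after adding two documents titled "Fast retrieval of Internet resources" and "Index of the web", the query "the index" is reduced to the single keyword "index" and matches only the second document.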
As shown in Fig. 2, the user performs retrieval through the index page. The directory in the file index is numbered, and the search-range records are numbered as well. When new data is added, a small-file index is generated for the new data, parallel to the original file index, which makes retrieval faster. After the target files (including web pages, files, database records and so on) have been searched, the data conversion class applies the algorithm and mapping to convert the search results into the format required for web display, and the results are then returned and displayed.
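The conversion of search results into a web display format through a template might look like the following Python sketch. The `ResultConverter` class, the `ROW_TEMPLATE` markup and the `url`/`title` keys are all assumptions chosen for illustration; the source does not describe the actual template or result structure.

```python
from html import escape
from string import Template

# Hypothetical display template for one search hit.
ROW_TEMPLATE = Template('<li><a href="$url">$title</a></li>')


class ResultConverter:
    """Illustrative data conversion class: raw hits in, HTML fragment out."""

    def __init__(self, template=ROW_TEMPLATE):
        self.template = template

    def to_html(self, hits):
        # Escape every value so that result text cannot break the page markup.
        rows = [
            self.template.substitute(
                url=escape(hit["url"], quote=True),
                title=escape(hit["title"]),
            )
            for hit in hits
        ]
        return "<ul>" + "".join(rows) + "</ul>"
```

Keeping the template separate from the converter means the page layout can change without touching the search code, which is one plausible reading of why the method lists conversion and templating as separate steps.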
The present invention has the following features:
1. The index file format is independent of the application platform. The invention defines an index file format based on 8-bit bytes; other widths such as 16 bits can of course also be used, so that applications on compatible systems or on different platforms can share the index files that have been built.
2. Block-wise retrieval is implemented on top of the inverted index of a traditional file search engine: a small-file index can be built for new files to increase indexing speed, and the indexes are then merged for optimization.
3. A well-designed object-oriented system architecture lowers the learning difficulty of extending the search system and makes it convenient to add new functions.
4. A text analysis interface independent of language and file format is provided; to support an additional language or file format, the user only needs to implement this interface.
5. Functions such as fuzzy query and grouped query are implemented.
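Feature 2 can be illustrated with a minimal Python sketch in which each index is a plain dictionary from term to a set of document numbers. The functions `search_all` and `merge` are hypothetical names, and the dictionary representation is an assumption; the source does not describe how the indexes are stored or merged.

```python
def search_all(indexes, term):
    """Query the original index and any small-file indexes side by side."""
    hits = set()
    for postings in indexes:
        hits |= postings.get(term, set())
    return sorted(hits)


def merge(main, small):
    """Fold a small-file index into a copy of the main index (optimization step)."""
    merged = {term: set(docs) for term, docs in main.items()}
    for term, docs in small.items():
        merged.setdefault(term, set()).update(docs)
    return merged


# The original index covers documents 1 and 2; document 3 arrived later
# and was indexed separately instead of rebuilding the large index.
main_index = {"index": {1, 2}, "web": {2}}
small_index = {"index": {3}, "search": {3}}
```

New data becomes searchable as soon as its small index exists, because `search_all` consults both structures. Merging can then be deferred to a quiet period, after which a single index answers the same queries.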
The above is the preferred embodiment of the present invention. Those skilled in the art can make various improvements and modifications without departing from the principles of the invention, and such improvements and modifications are also regarded as falling within the scope of protection of the present invention.

Claims (3)

1. A method for fast retrieval of Internet information resources, characterized in that it comprises the following steps:
S1, define an index file format and create a document class for storing the records of the file index and the index range;
S2, convert the target files in the database into binary files, generate an index list, and number the index list, wherein the target files include web pages, files and database records;
S3, create a field description class for storing the attributes of a single field;
S4, before a document is indexed, first perform word segmentation on the document content;
S5, add each indexed document to its corresponding document object;
S6, package the keywords searched by the user into a form that is readily recognized in the binary files being searched;
S7, create an object in which search results are saved for the user;
S8, parse the search keywords entered by the user and process them according to a keyword exclusion dictionary;
S9, create a data conversion class that converts the search results into the format required for web display;
S10, apply a display template to convert the search results into the format required for page presentation.
2. The method for fast retrieval of Internet information resources according to claim 1, characterized in that, when a new data record file is added, a small-file index is created in step S1, the small-file index being parallel to the original file index.
3. The method for fast retrieval of Internet information resources according to claim 1 or 2, characterized in that the index file format defined in step S1 is based on 8-bit bytes or 16-bit bytes.
CN201310492732.2A 2013-10-21 2013-10-21 Internet information resource quick searching method Active CN103530384B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201310492732.2A CN103530384B (en) 2013-10-21 2013-10-21 Internet information resource quick searching method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201310492732.2A CN103530384B (en) 2013-10-21 2013-10-21 Internet information resource quick searching method

Publications (2)

Publication Number Publication Date
CN103530384A (en) 2014-01-22
CN103530384B (en) 2017-01-25

Family

ID=49932393

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201310492732.2A Active CN103530384B (en) 2013-10-21 2013-10-21 Internet information resource quick searching method

Country Status (1)

Country Link
CN (1) CN103530384B (en)

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6910029B1 (en) * 2000-02-22 2005-06-21 International Business Machines Corporation System for weighted indexing of hierarchical documents
CN103268445A (en) * 2012-12-27 2013-08-28 武汉安天信息技术有限责任公司 Android malicious code detection method based on OpCode and system thereof

Also Published As

Publication number Publication date
CN103530384A (en) 2014-01-22

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C41 Transfer of patent application or patent right or utility model
TA01 Transfer of patent application right

Effective date of registration: 20161230

Address after: Xinluo Avenue high tech Zone of Ji'nan City, Shandong province 250101 silver bearing No. 2008 building A, 803-02

Applicant after: ZHENGHE TECHNOLOGY Co.,Ltd.

Address before: Xinluo Avenue high tech Zone of Ji'nan City, Shandong province 250101 silver bearing No. 2008 building A-8-3

Applicant before: JINAN ZHENGHE TECHNOLOGY Co.,Ltd.

C14 Grant of patent or utility model
GR01 Patent grant
PE01 Entry into force of the registration of the contract for pledge of patent right

Denomination of invention: Internet information resource method for quickly retrieving

Effective date of registration: 20171110

Granted publication date: 20170125

Pledgee: Qilu bank Limited by Share Ltd. Ji'nan high tech branch

Pledgor: ZHENGHE TECHNOLOGY Co.,Ltd.

Registration number: 2017370000170

PC01 Cancellation of the registration of the contract for pledge of patent right

Date of cancellation: 20220129

Granted publication date: 20170125

Pledgee: Qilu bank Limited by Share Ltd. Ji'nan high tech branch

Pledgor: ZHENGHE TECHNOLOGY Co.,Ltd.

Registration number: 2017370000170

PE01 Entry into force of the registration of the contract for pledge of patent right

Denomination of invention: Fast retrieval method of Internet Information Resources

Effective date of registration: 20220214

Granted publication date: 20170125

Pledgee: Ji'nan rural commercial bank Limited by Share Ltd. high tech branch

Pledgor: ZHENGHE TECHNOLOGY Co.,Ltd.

Registration number: Y2022980001521

PC01 Cancellation of the registration of the contract for pledge of patent right

Date of cancellation: 20221212

Granted publication date: 20170125

Pledgee: Ji'nan rural commercial bank Limited by Share Ltd. high tech branch

Pledgor: ZHENGHE TECHNOLOGY Co.,Ltd.

Registration number: Y2022980001521

PE01 Entry into force of the registration of the contract for pledge of patent right

Denomination of invention: Fast retrieval method of internet information resources

Effective date of registration: 20230203

Granted publication date: 20170125

Pledgee: Ji'nan rural commercial bank Limited by Share Ltd. high tech branch

Pledgor: ZHENGHE TECHNOLOGY Co.,Ltd.

Registration number: Y2023980031993