CN103530384A - Internet information resource quick searching method - Google Patents

Internet information resource quick searching method Download PDF

Info

Publication number
CN103530384A
CN103530384A CN201310492732.2A CN201310492732A CN103530384A CN 103530384 A CN103530384 A CN 103530384A CN 201310492732 A CN201310492732 A CN 201310492732A CN 103530384 A CN103530384 A CN 103530384A
Authority
CN
China
Prior art keywords
file
index
information resource
internet information
searching
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201310492732.2A
Other languages
Chinese (zh)
Other versions
CN103530384B (en
Inventor
史高升
朱涛
赵西法
韩祥松
荣丽萍
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Zhenghe Technology Co ltd
Original Assignee
JINAN ZHENGHE TECHNOLOGY Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by JINAN ZHENGHE TECHNOLOGY Co Ltd filed Critical JINAN ZHENGHE TECHNOLOGY Co Ltd
Priority to CN201310492732.2A priority Critical patent/CN103530384B/en
Publication of CN103530384A publication Critical patent/CN103530384A/en
Application granted granted Critical
Publication of CN103530384B publication Critical patent/CN103530384B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/951Indexing; Web crawling techniques

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses an Internet information resource quick searching method. The Internet information resource quick searching method includes the following steps: defining a set of index file formats, setting up a file class, enabling an object file in a database to be converted into a binary file, generating an index list with a serial number, setting up a field description class to be used for storing various attributes of a single field, carrying out word segmentation processing on file contents, enabling each index file to be added into a file object corresponding to the index file, enabling keywords searched by a user to be packaged, setting up an object where the user saves searching results, processing the keywords, setting up a data conversion class, enabling the searching results to be converted to be in the format required by webpage display, and enabling the searching results to be converted to be in the format required by page display. According to the Internet information resource quick searching method, partitioning searching is achieved, the searching speed is greatly quickened, when new data are generated, a small-file index parallel to an original-file index can be set up for the new data, and searching is rapidly achieved.

Description

Internet information resource method for quickly retrieving
Technical field
The present invention relates to internet information searching field, is a kind of internet information resource method for quickly retrieving specifically.
Background technology
Along with the development of Internet technology, with the internet, applications headed by search engine, in the increasingly extensive every field that is penetrated into China's economic, culture, political life.Network information resource has enormous amount, upgrades feature rapidly, and how in the internet information of magnanimity, retrieving fast and accurately the needed information resources of people is the improved directions of Internet enterprises.At present, search information is to search for from database, and each database is very large, so search time is long, and after Data Update, in the data of renewal and original data mixing to new database, searches for like this trouble more.
Summary of the invention
The shortcoming existing in order to overcome above-mentioned prior art, the object of the present invention is to provide a kind of block index of realizing, and promotes the internet information resource method for quickly retrieving of index speed.
In order to address the above problem, the present invention by the following technical solutions: a kind of internet information resource method for quickly retrieving, comprises the following steps:
S1, defines a set of index file form, sets up document class, for the record of storing documents index and index range;
S2, is converted to binary file by the file destination in database, generating indexes catalogue, and be numbered before index list, wherein, described file destination comprises webpage, file and data-base recording;
S3, sets up field description class, for depositing the various attributes of single field;
S4, before document is indexed, first carries out word segmentation processing to document content;
S5, joins each index file in the document object corresponding with it;
S6, the key word that user is searched is packaged into the form that place binary file is easy to identification;
S7, sets up the object that a user preserves Search Results, for user, preserves lookup result;
S8, resolves the search key that user inputs, and according to the exclusive dictionary of key word, key word is processed;
S9, sets up data and transforms class, and Search Results is converted into the needed form of web displaying;
S10, by applying mechanically indicating template, changes into the page Search Results and shows required form.
When increasing new document data record, in S1 step, set up a small documents index, described small documents index and original file index are coordination.
The index file form defining in S1 step is octet or 16 bit bytes.
The invention has the beneficial effects as follows: it is by setting up document class, database classification is than before thinner, realized block research, greatly promoted retrieval rate, and, when having new data to produce, new data can be set up to a small documents index, it and original index file are arranged side by side, have so just accelerated the index speed to new data.
Accompanying drawing explanation
Below in conjunction with drawings and Examples, the present invention is described further:
Fig. 1 is FB(flow block) of the present invention;
Fig. 2 is system architecture schematic diagram of the present invention.
Embodiment
As shown in Figure 1, one embodiment of the present invention comprises step:
S1, defines a set of index file form, sets up document class, for the record of storing documents index and index range;
S2, is converted to binary file by the file destination in database, generating indexes catalogue, and be numbered before index list, wherein, described file destination comprises webpage, file and data-base recording;
S3, sets up field description class, for depositing the various attributes of single field;
S4, before document is indexed, first carries out word segmentation processing to document content;
S5, joins each index file in the document object corresponding with it, facilitates follow-up search operation;
S6, the key word that user is searched is packaged into the form that place binary file is easy to identification;
S7, sets up the object that a user preserves Search Results, for user, preserves lookup result;
S8, resolves the search key that user inputs, and according to the exclusive dictionary of key word, key word is processed, and removes some as auxiliary words that indicates mood such as " ", " ";
S9, sets up data and transforms class, and Search Results is converted into the needed form of web displaying;
S10, by applying mechanically indicating template, changes into the page Search Results and shows required form.
As shown in Figure 2, user retrieves by index page, and the catalogue in file index is numbered, hunting zone record is also numbered, and when increasing new data, generates the small documents index of new data, it and original file index are arranged side by side, make retrieving more quick; After file destination (comprising webpage, file, data-base recording etc.) is retrieved, by data, transform class, carry out algorithm and corresponding relation, will search after result is converted into the needed form of web displaying and return results, show.
The present invention has feature:
1, index file form is independent of application platform.It is basic index file form that the present invention has defined a set of octet, can certainly make other figure place bytes such as 16 grades, makes the application of compatible system or different platform can share the index file of foundation.
2, on the basis of the inverted index of traditional file search engine, realize block research, can promote index speed for new file set up small documents index, then by the merging of index, reached optimization object.
3, outstanding OO system architecture, reduces the learning difficulty of this search extension, the convenient telecommunication function that expands.
4, designed text analyzing interface independent and language and file layout, language and the file layout of user's expansion, only need to realize this interface.
5, realized fuzzy query, the functions such as minute group polling.
The above is the preferred embodiment of the present invention, for those skilled in the art, under the premise without departing from the principles of the invention, can also make some improvements and modifications, and these improvements and modifications are also regarded as protection scope of the present invention.

Claims (3)

1. an internet information resource method for quickly retrieving, is characterized in that, comprises the following steps:
S1, defines a set of index file form, sets up document class, for the record of storing documents index and index range;
S2, is converted to binary file by the file destination in database, generating indexes catalogue, and be numbered before index list, wherein, described file destination comprises webpage, file and data-base recording;
S3, sets up field description class, for depositing the various attributes of single field;
S4, before document is indexed, first carries out word segmentation processing to document content;
S5, joins each index file in the document object corresponding with it;
S6, the key word that user is searched is packaged into the form that place binary file is easy to identification;
S7, sets up the object that a user preserves Search Results, for user, preserves lookup result;
S8, resolves the search key that user inputs, and according to the exclusive dictionary of key word, key word is processed;
S9, sets up data and transforms class, and Search Results is converted into the needed form of web displaying;
S10, by applying mechanically indicating template, changes into the page Search Results and shows required form.
2. internet information resource method for quickly retrieving according to claim 1, is characterized in that, when increasing new document data record, sets up a small documents index in S1 step, and described small documents index and original file index are coordination.
3. internet information resource method for quickly retrieving according to claim 1 and 2, is characterized in that, the index file form defining in S1 step is octet or 16 bit bytes.
CN201310492732.2A 2013-10-21 2013-10-21 Internet information resource quick searching method Active CN103530384B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201310492732.2A CN103530384B (en) 2013-10-21 2013-10-21 Internet information resource quick searching method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201310492732.2A CN103530384B (en) 2013-10-21 2013-10-21 Internet information resource quick searching method

Publications (2)

Publication Number Publication Date
CN103530384A true CN103530384A (en) 2014-01-22
CN103530384B CN103530384B (en) 2017-01-25

Family

ID=49932393

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201310492732.2A Active CN103530384B (en) 2013-10-21 2013-10-21 Internet information resource quick searching method

Country Status (1)

Country Link
CN (1) CN103530384B (en)

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6910029B1 (en) * 2000-02-22 2005-06-21 International Business Machines Corporation System for weighted indexing of hierarchical documents
CN103268445A (en) * 2012-12-27 2013-08-28 武汉安天信息技术有限责任公司 Android malicious code detection method based on OpCode and system thereof

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6910029B1 (en) * 2000-02-22 2005-06-21 International Business Machines Corporation System for weighted indexing of hierarchical documents
CN103268445A (en) * 2012-12-27 2013-08-28 武汉安天信息技术有限责任公司 Android malicious code detection method based on OpCode and system thereof

Also Published As

Publication number Publication date
CN103530384B (en) 2017-01-25

Similar Documents

Publication Publication Date Title
CN111753099B (en) Method and system for enhancing relevance of archive entity based on knowledge graph
CN108932294B (en) Resume data processing method, device, equipment and storage medium based on index
CN110941612B (en) Autonomous data lake construction system and method based on associated data
US20110295775A1 (en) Associating media with metadata of near-duplicates
CN102279894A (en) Method for searching, integrating and providing comment information based on semantics and searching system
CN103823838A (en) Method for inputting and comparing multi-format documents
CN109657072B (en) Intelligent search WEB system and method applied to government aid decision
WO2020155749A1 (en) Method and apparatus for constructing personal knowledge graph, computer device, and storage medium
CN112115232A (en) Data error correction method and device and server
KR101095866B1 (en) Triple indexing and searching scheme for efficient information retrieval
CN114218472A (en) Intelligent search system based on knowledge graph
CN116204660A (en) Multi-source heterogeneous data driven domain knowledge graph construction system method
CN113239111A (en) Network public opinion visual analysis method and system based on knowledge graph
KR20120047622A (en) System and method for managing digital contents
CN116523041A (en) Knowledge graph construction method, retrieval method and system for equipment field and electronic equipment
CN111401047A (en) Method and device for generating dispute focus of legal document and computer equipment
CN103530384A (en) Internet information resource quick searching method
CN113407803A (en) Method for acquiring internet data in one step
Singh et al. User specific context construction for personalized multimedia retrieval
CN112214615A (en) Policy document processing method and device based on knowledge graph and storage medium
CN115203428B (en) Knowledge graph construction method and device, electronic equipment and storage medium
CN111309773A (en) Vehicle information query method, device and system and storage medium
CN111241313A (en) Retrieval method and device supporting image input
Lang et al. The next-generation search engine: Challenges and key technologies
US11853356B1 (en) System and method for generating hierarchical mind map and index table

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C41 Transfer of patent application or patent right or utility model
TA01 Transfer of patent application right

Effective date of registration: 20161230

Address after: Xinluo Avenue high tech Zone of Ji'nan City, Shandong province 250101 silver bearing No. 2008 building A, 803-02

Applicant after: ZHENGHE TECHNOLOGY Co.,Ltd.

Address before: Xinluo Avenue high tech Zone of Ji'nan City, Shandong province 250101 silver bearing No. 2008 building A-8-3

Applicant before: JINAN ZHENGHE TECHNOLOGY Co.,Ltd.

C14 Grant of patent or utility model
GR01 Patent grant
PE01 Entry into force of the registration of the contract for pledge of patent right
PE01 Entry into force of the registration of the contract for pledge of patent right

Denomination of invention: Internet information resource method for quickly retrieving

Effective date of registration: 20171110

Granted publication date: 20170125

Pledgee: Qilu bank Limited by Share Ltd. Ji'nan high tech branch

Pledgor: ZHENGHE TECHNOLOGY Co.,Ltd.

Registration number: 2017370000170

PC01 Cancellation of the registration of the contract for pledge of patent right

Date of cancellation: 20220129

Granted publication date: 20170125

Pledgee: Qilu bank Limited by Share Ltd. Ji'nan high tech branch

Pledgor: ZHENGHE TECHNOLOGY Co.,Ltd.

Registration number: 2017370000170

PC01 Cancellation of the registration of the contract for pledge of patent right
PE01 Entry into force of the registration of the contract for pledge of patent right

Denomination of invention: Fast retrieval method of Internet Information Resources

Effective date of registration: 20220214

Granted publication date: 20170125

Pledgee: Ji'nan rural commercial bank Limited by Share Ltd. high tech branch

Pledgor: ZHENGHE TECHNOLOGY Co.,Ltd.

Registration number: Y2022980001521

PE01 Entry into force of the registration of the contract for pledge of patent right
PC01 Cancellation of the registration of the contract for pledge of patent right

Date of cancellation: 20221212

Granted publication date: 20170125

Pledgee: Ji'nan rural commercial bank Limited by Share Ltd. high tech branch

Pledgor: ZHENGHE TECHNOLOGY Co.,Ltd.

Registration number: Y2022980001521

PC01 Cancellation of the registration of the contract for pledge of patent right
PE01 Entry into force of the registration of the contract for pledge of patent right

Denomination of invention: Fast retrieval method of internet information resources

Effective date of registration: 20230203

Granted publication date: 20170125

Pledgee: Ji'nan rural commercial bank Limited by Share Ltd. high tech branch

Pledgor: ZHENGHE TECHNOLOGY Co.,Ltd.

Registration number: Y2023980031993

PE01 Entry into force of the registration of the contract for pledge of patent right