CN111382341A - Scientific and technological information resource retrieval and query system and method based on big data - Google Patents

Scientific and technological information resource retrieval and query system and method based on big data Download PDF

Info

Publication number
CN111382341A
CN111382341A CN202010210056.5A CN202010210056A CN111382341A CN 111382341 A CN111382341 A CN 111382341A CN 202010210056 A CN202010210056 A CN 202010210056A CN 111382341 A CN111382341 A CN 111382341A
Authority
CN
China
Prior art keywords
information
retrieval
data
query
scientific
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202010210056.5A
Other languages
Chinese (zh)
Other versions
CN111382341B (en
Inventor
刘合安
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Hongfujin Precision Industry Shenzhen Co Ltd
Original Assignee
Hongfujin Precision Industry Shenzhen Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Hongfujin Precision Industry Shenzhen Co Ltd filed Critical Hongfujin Precision Industry Shenzhen Co Ltd
Priority to CN202010210056.5A priority Critical patent/CN111382341B/en
Publication of CN111382341A publication Critical patent/CN111382341A/en
Application granted granted Critical
Publication of CN111382341B publication Critical patent/CN111382341B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9532Query formulation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9538Presentation of query results
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/25Fusion techniques
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D10/00Energy efficient computing, e.g. low power processors, power management or thermal management

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Evolutionary Computation (AREA)
  • Evolutionary Biology (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Artificial Intelligence (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Mathematical Physics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention belongs to the technical field of internet resource retrieval and query, and discloses a scientific and technological information resource retrieval and query system and method based on big data, which are used for carrying out user identity verification based on an identity verification program; analyzing and counting the user access behaviors, and intercepting the abnormal behaviors of the user; inputting user retrieval query information; extracting query keywords; acquiring required scientific and technological information data from the Internet; carrying out data classification and fusion; storing the processed scientific and technological information data; carrying out retrieval query on key information resources; generating the retrieval query report; the mobile terminal receives scientific and technological information data and remotely controls the scientific and technological information resource retrieval and query system, comprehensive and accurate query of scientific and technological information retrieval can be achieved on the premise that data safety is guaranteed, user behaviors can be analyzed, and related information can be accurately recommended based on the user retrieval behaviors.

Description

Scientific and technological information resource retrieval and query system and method based on big data
Technical Field
The invention belongs to the technical field of internet resource retrieval and query, and particularly relates to a scientific and technological information resource retrieval and query system and method based on big data.
Background
At present, with the development of infrastructure for information resource sharing and the gradual formation of a system framework of digital information resources, an information transmission system which is open, interconnected, convenient and fast to operate at a high speed enables the overall development, communication and utilization depth and breadth of document resources and the propagation speed to be revolutionarily changed; the method provides good technical and resource guarantee for the vast public, particularly scientific and technical personnel to share the resources of the scientific and technical information. However, the existing network information resources are complex, which limits the comprehensiveness and accuracy of information to different degrees, and further makes it difficult to obtain valuable information; meanwhile, the conventional scientific and technological information resource retrieval and query system cannot meet the requirement of rapid and accurate positioning of scientific and technological information resources required by user query, and cannot meet the requirement of professional, effective and accurate pushing of users. Therefore, a new scientific and technological information resource retrieval and query system based on big data is needed.
Through the above analysis, the problems and defects of the prior art are as follows: the existing network information resources are complicated, the comprehensiveness and accuracy of information are limited to different degrees, and valuable information is difficult to obtain; meanwhile, the conventional scientific and technological information resource retrieval and query system cannot meet the requirement of rapid and accurate positioning of scientific and technological information resources required by user query, and cannot meet the requirement of professional, effective and accurate pushing of users.
Disclosure of Invention
Aiming at the problems in the prior art, the invention provides a scientific and technological information resource retrieval and query system and method based on big data.
The invention is realized in such a way that a scientific and technological information resource retrieval and query method based on big data comprises the following steps:
firstly, realizing the input of user search query information by using an input program through a text input dialog box; building a basic hot word bank related to the input query information;
secondly, performing Chinese word segmentation processing on the query information input by the user and outputting word segmentation results; declaring a new array arrs _ a, traversing the word segmentation result, and adding a word segmentation in the word segmentation result to the array arrs _ a if a word segmentation in the traversal process is matched with a hot word in the basic hot word bank;
thirdly, sequencing the array arrs _ a according to the word length and the word position of the word; traversing the sorted array arrs _ a, sequentially performing null-replace operation on the query information aiming at each participle in the array arrs _ a, and taking the obtained final word as a query information keyword;
fourthly, acquiring required scientific and technological information data from the internet by a data acquisition program through Hash calculation according to the retrieval query keyword obtained in the third step; calculating basic probability assignment of an object to be fused provided by any homogeneous data based on the fusion frame by utilizing a preset basic probability distribution function for the acquired related scientific and technical information data;
fifthly, performing orthogonality and operation on basic probability assignments provided by homogeneous data acquired by any data acquisition system in multiple periods to obtain time dimension probability assignments of the object to be fused in the data acquisition system;
sixthly, performing orthogonality and operation on the time dimension probability assignments of the object to be fused in the plurality of data acquisition systems to obtain the time-space dimension probability assignments of the object to be fused in the plurality of data acquisition systems and the plurality of periods;
seventhly, determining a data fusion result of the object to be fused in the data fusion frame by utilizing the spatiotemporal dimension probability assignment to obtain related scientific and technological information data; storing the processed scientific and technological information data through a temporary memory;
eighthly, searching and inquiring key information resources from the temporary storage by a searching and inquiring program;
the searching and querying of the key information resource from the temporary storage by the searching and querying program comprises the following steps:
1) the mobile terminal receives an information triggering instruction of the text input dialog box and acquires retrieval information required by a user according to a retrieval query keyword input by the user;
2) the mobile terminal generates an information calling request according to the information triggering instruction and sends the information calling request to the main control computer; the information invoking request comprises: retrieving item information;
3) the main control computer analyzes the information calling request and acquires the corresponding calling information list data according to the retrieval item information;
4) and the main control computer sends the calling information list data to the terminal, and the terminal searches and queries key information resources from the temporary storage according to the calling information list data.
Ninthly, generating the retrieval query report according to the retrieval query result by a retrieval report generating program; receiving scientific and technological information data through a mobile terminal, and remotely controlling a scientific and technological information resource retrieval and query system;
step ten, storing keywords, scientific and technological information data and retrieval query reports of user retrieval query through a cloud server; and displaying the keywords of the user search query, the scientific and technical information data and the real-time data of the search query report through the display.
Further, the first step is preceded by:
performing user authentication based on the authentication program; analyzing and counting the access behaviors of the user, and intercepting the abnormal behaviors of the user.
Further, the third step is followed by:
and counting the search terms of the current user, the use frequency of the search terms, the search time, the search source, the browser type and other related information.
Further, in the fourth step, the method for acquiring the required scientific and technological information data from the internet by the data acquisition program by using hash calculation according to the search query keyword includes:
(I) performing hash calculation on data information carried in a data acquisition request sent by a client to obtain a hash calculation result;
(II) determining a target virtual node on a hash ring containing at least two virtual nodes based on the hash calculation result;
(III) if the target virtual node cannot respond to the data acquisition request, sending the data acquisition request to another virtual node on the hash ring, which is located at the downstream of the target virtual node, so that the data acquisition request is processed by the other virtual node located at the downstream of the target virtual node.
Further, in the fourth step, the homogeneous data is image data of the same region, the object to be fused is a point to be fused in the region, and the fusion frame is a set including the target element and the background element.
Further, in the eighth step, when the calling information list data includes retrieval information required by the user, the terminal acquires and outputs corresponding retrieval result information from the server according to the received first access operation instruction and the retrieval result data source address information in sequence;
and when the calling information list data does not comprise the retrieval information required by the user, the terminal receives the voice information and analyzes the voice information to generate the retrieval information.
Further, in the eighth step, the receiving, by the mobile terminal, the call information, and analyzing the call information to generate the retrieval information specifically include:
the mobile terminal stores preset standard keywords and keyword expansion rules;
the mobile terminal receives calling information and identifies the calling information to obtain a voice keyword;
the mobile terminal matches the calling information keywords with the standard keywords to obtain retrieval keywords;
and the mobile terminal expands the search keywords according to a preset keyword expansion rule to generate the search information.
Another objective of the present invention is to provide a scientific and technological information resource retrieval and query system based on big data, which applies the scientific and technological information resource retrieval and query method based on big data, and the scientific and technological information resource retrieval and query system based on big data includes:
the user identity authentication module is connected with the main control module and is used for carrying out user identity authentication based on an identity authentication program;
the user access behavior analysis module is connected with the main control module and is used for analyzing and counting user access behaviors and intercepting abnormal user behaviors;
the keyword input module is connected with the main control module and used for realizing the input of user retrieval query information by utilizing an input program through a text input dialog box;
the query preprocessing module is connected with the main control module and used for segmenting input query information and extracting keywords;
the retrieval statistic module is used for counting the retrieval words of the current user, the using frequency, the retrieval time, the retrieval source, the browser type and other related information;
the data acquisition module is connected with the main control module and used for acquiring required scientific and technological information data from the Internet according to the retrieval query keyword by utilizing Hash calculation through a data acquisition program;
the data processing module is connected with the main control module and is used for classifying and fusing the acquired scientific and technological information data through a data processing program;
the data temporary storage module is connected with the main control module and used for storing the processed scientific and technological information data through a temporary memory;
the main control module is connected with the user identity verification module, the user access behavior analysis module, the keyword input module, the query preprocessing module, the retrieval statistics module, the data acquisition module, the data processing module, the data temporary storage module, the information recommendation module, the information retrieval query module, the retrieval report generation module, the information terminal module, the information storage module and the display module and is used for controlling the normal operation of each module of the scientific and technological information resource retrieval query system through the main control computer;
the information recommendation module is connected with the main control module and used for recommending corresponding information resources based on the user retrieval behavior statistical result;
the information retrieval query module is connected with the main control module and is used for retrieving and querying key information resources from the temporary storage through a retrieval query program;
the retrieval report generation module is connected with the main control module and used for generating a retrieval query report according to a retrieval query result through a retrieval report generation program;
the information terminal module is connected with the main control module and used for receiving scientific and technological information data through the mobile terminal and remotely controlling the scientific and technological information resource retrieval and query system;
the information storage module is connected with the main control module and used for storing keywords, scientific and technological information data and retrieval and query reports of user retrieval and query through the cloud server;
and the display module is connected with the main control module and is used for displaying keywords and scientific and technological information data of user retrieval and query and real-time data of a retrieval and query report through a display.
Another object of the present invention is to provide a computer program product stored on a computer-readable medium, which includes a computer-readable program for providing a user input interface to implement the method for searching and querying scientific and technical information resources based on big data when the computer program product is executed on an electronic device.
Another object of the present invention is to provide a computer-readable storage medium, which stores instructions for causing a computer to execute the method for searching and querying scientific and technical information resources based on big data when the instructions are executed on the computer.
By combining all the technical schemes, the invention has the advantages and positive effects that: the method and the system can realize comprehensive and accurate query of scientific and technological information retrieval on the premise of ensuring data safety, and can analyze user behaviors and accurately recommend related information based on the user retrieval behaviors.
Compared with the traditional retrieval method, the retrieval method has diversity and selectivity, is beneficial to improving the retrieval efficiency of the user and simultaneously enables the operation to be simpler and more convenient. According to the invention, the data processing module fuses the data of the same data acquisition system in different periods in the time dimension, and then fuses the data of different data acquisition systems in the space dimension for final fusion, so that the accuracy of data fusion and the strong robustness of an algorithm are ensured.
Drawings
Fig. 1 is a flowchart of a scientific and technical information resource retrieval and query method based on big data according to an embodiment of the present invention.
FIG. 2 is a schematic structural diagram of a scientific and technological information resource retrieval and query system based on big data according to an embodiment of the present invention;
in the figure: 1. a user identity authentication module; 2. a user access behavior analysis module; 3. a keyword input module; 4. a query preprocessing module; 5. a retrieval statistic module; 6. a data acquisition module; 7. a data processing module; 8. a data temporary storage module; 9, a main control module; 10. an information recommendation module; 11. an information retrieval query module; 12. a retrieval report generation module; 13. an information terminal module; 14. an information storage module; 15. and a display module.
Fig. 3 is a flowchart of a method for acquiring, by a data acquisition program according to an embodiment of the present invention, required scientific and technical information data from the internet according to a search query keyword by using hash calculation.
Fig. 4 is a flowchart of a method for classifying and fusing acquired scientific and technical information data through a data processing program according to an embodiment of the present invention.
Fig. 5 is a flowchart of a method for performing a search query of a key information resource from a temporary storage by a search query program according to an embodiment of the present invention.
Detailed Description
In order to make the objects, technical solutions and advantages of the present invention more apparent, the present invention is further described in detail with reference to the following embodiments. It should be understood that the specific embodiments described herein are merely illustrative of the invention and are not intended to limit the invention.
In view of the problems in the prior art, the present invention provides a scientific and technological information resource retrieval and query system and method based on big data, and the following describes the present invention in detail with reference to the accompanying drawings.
As shown in fig. 1, the scientific and technological information resource retrieval and query method based on big data provided by the embodiment of the present invention includes the following steps:
s101, user identity authentication is carried out based on an identity authentication program; analyzing and counting the user access behaviors, and intercepting the abnormal behaviors of the user; and the input program is used for realizing the input of the user retrieval query information through the text input dialog box.
S102, segmenting input query information and extracting keywords; and counting the search terms of the current user, the use frequency of the search terms, the search time, the search source, the browser type and other related information.
S103, acquiring required scientific and technological information data from the Internet by a data acquisition program through Hash calculation according to the retrieval query keyword; classifying and fusing the acquired scientific and technological information data through a data processing program; and storing the processed scientific and technological information data through a temporary memory.
And S104, searching and inquiring the key information resources from the temporary storage by the searching and inquiring program.
S105, generating the retrieval query report according to the retrieval query result through a retrieval report generating program; and receiving scientific and technological information data through the mobile terminal, and remotely controlling the scientific and technological information resource retrieval and query system.
S106, storing keywords, scientific and technological information data and retrieval query reports of user retrieval query through a cloud server; and displaying the keywords of the user search query, the scientific and technical information data and the real-time data of the search query report through the display.
As shown in fig. 2, the scientific and technical information resource retrieval and query system based on big data provided by the embodiment of the present invention includes: the system comprises a user identity authentication module 1, a user access behavior analysis module 2, a keyword input module 3, a query preprocessing module 4, a retrieval statistics module 5, a data acquisition module 6, a data processing module 7, a data temporary storage module 8, a main control module 9, an information recommendation module 10, an information retrieval query module 11, a retrieval report generation module 12, an information terminal module 13, an information storage module 14 and a display module 15.
And the user identity authentication module 1 is connected with the main control module and is used for carrying out user identity authentication based on an identity authentication program.
And the user access behavior analysis module 2 is connected with the main control module and is used for analyzing and counting the user access behaviors and intercepting the abnormal behaviors of the user.
And the keyword input module 3 is connected with the main control module and is used for realizing the input of the user retrieval query information by utilizing an input program through a text input dialog box.
And the query preprocessing module 4 is connected with the main control module and is used for segmenting input query information and extracting keywords.
And the retrieval statistic module 5 is used for counting the retrieval words of the current user, the use frequency, the retrieval time, the retrieval source, the browser type and other related information.
And the data acquisition module 6 is connected with the main control module and used for acquiring the required scientific and technological information data from the Internet by utilizing a data acquisition program through Hash calculation according to the retrieval query keyword.
And the data processing module 7 is connected with the main control module and is used for classifying and fusing the acquired scientific and technological information data through a data processing program.
And the data temporary storage module 8 is connected with the main control module and is used for storing the processed scientific and technological information data through a temporary memory.
The main control module 9 is connected with the user identity authentication module 1, the user access behavior analysis module 2, the keyword input module 3, the query preprocessing module 4, the retrieval statistics module 5, the data acquisition module 6, the data processing module 7, the data temporary storage module 8, the information recommendation module 10, the information retrieval query module 11, the retrieval report generation module 12, the information terminal module 13, the information storage module 14 and the display module 15, and is used for controlling the normal operation of each module of the scientific and technological information resource retrieval query system through the main control computer.
And the information recommendation module 10 is connected with the main control module and is used for recommending corresponding information resources based on the user retrieval behavior statistical result.
And the information retrieval query module 11 is connected with the main control module and is used for performing retrieval query on the key information resources from the temporary storage through a retrieval query program.
And the retrieval report generation module 12 is connected with the main control module and is used for generating the retrieval query report according to the retrieval query result through a retrieval report generation program.
And the information terminal module 13 is connected with the main control module and used for receiving scientific and technological information data through the mobile terminal and remotely controlling the scientific and technological information resource retrieval and query system.
And the information storage module 14 is connected with the main control module and is used for storing keywords, scientific and technological information data and retrieval and query reports of user retrieval and query through the cloud server.
And the display module 15 is connected with the main control module and is used for displaying keywords and scientific and technical information data of user retrieval and query and real-time data of retrieval and query reports through the display.
The invention is further described with reference to specific examples.
As shown in fig. 3, a method for acquiring, by a data acquisition program, required scientific and technical information data from the internet by using hash calculation according to a search query keyword according to an embodiment of the present invention includes:
s201, performing hash calculation on data information carried in a data acquisition request sent by a client to obtain a hash calculation result.
S202, based on the hash calculation result, determining a target virtual node on a hash ring containing at least two virtual nodes.
S203, if the target virtual node cannot respond to the data obtaining request, sending the data obtaining request to another virtual node on the hash ring downstream of the target virtual node, so that the another virtual node downstream of the target virtual node processes the data obtaining request.
As shown in fig. 4, as a preferred embodiment, the method for classifying and fusing acquired scientific and technical information data through a data processing program according to the embodiment of the present invention includes:
s301, calculating the basic probability assignment of the object to be fused provided by any homogeneous data based on the fusion framework by using a preset basic probability distribution function.
S302, performing orthogonality and operation on basic probability assignments provided by homogeneous data acquired by any data acquisition system in multiple periods to obtain time dimension probability assignments of the object to be fused in the data acquisition system.
And S303, performing orthogonality and operation on the time dimension probability assignments of the object to be fused in the plurality of data acquisition systems to obtain the time-space dimension probability assignments of the object to be fused in the plurality of data acquisition systems and the plurality of periods.
S304, determining a data fusion result of the object to be fused in the data fusion frame by utilizing the spatiotemporal dimension probability assignment.
The homogeneous data provided by the embodiment of the invention is image data of the same region, the object to be fused is a point to be fused in the region, and the fusion frame is a set comprising a target element and a background element.
As shown in fig. 5, as a preferred embodiment, the method for performing a search query of a key information resource from a temporary storage by using a search query program according to an embodiment of the present invention includes:
s401, the mobile terminal receives an information triggering instruction of the text input dialog box and acquires retrieval information required by the user according to the retrieval query keyword input by the user.
S402, the mobile terminal generates an information calling request according to the information triggering instruction and sends the information calling request to a main control computer; the information invoking request comprises: item information is retrieved.
And S403, the main control computer analyzes the information calling request and acquires the corresponding calling information list data according to the retrieval item information.
S404, the main control computer sends the calling information list data to the terminal, and the terminal searches and inquires key information resources from the temporary storage according to the calling information list data.
When the calling information list data comprises the retrieval information required by the user, the terminal acquires and outputs corresponding retrieval result information from the server according to the received first access operation instruction and the retrieval result data source address information in sequence;
and when the calling information list data does not comprise the retrieval information required by the user, the terminal receives the voice information and analyzes the voice information to generate the retrieval information.
The mobile terminal provided by the embodiment of the invention receives the calling information and analyzes the calling information to generate the retrieval information, which specifically comprises the following steps:
the mobile terminal stores preset standard keywords and keyword expansion rules.
The mobile terminal receives calling information and identifies the calling information to obtain a voice keyword;
and the mobile terminal matches the calling information keyword with the standard keyword to obtain a retrieval keyword.
And the mobile terminal expands the search keywords according to a preset keyword expansion rule to generate the search information.
As a preferred embodiment, the method for performing word segmentation on input query information and extracting keywords provided in the embodiment of the present invention includes:
first, a basic hot word stock related to query information is built.
Secondly, Chinese word segmentation processing is carried out on the query information input by the user, and word segmentation results are output.
And then, a new array arrs _ a is stated, the word segmentation result is traversed, and if a word in the word segmentation result is matched with the hot word in the basic hot word bank in the traversing process, the word is added into the array arrs _ a.
Then, the array arrs _ a is sorted according to the word length and the word position of the word.
And finally, traversing the sorted array arrs _ a, sequentially performing null-replace operation on the query information aiming at each participle in the array arrs _ a, and taking the obtained final word as a query information keyword.
In the above embodiments, the implementation may be wholly or partially realized by software, hardware, firmware, or any combination thereof. When used in whole or in part, can be implemented in a computer program product that includes one or more computer instructions. When loaded or executed on a computer, cause the flow or functions according to embodiments of the invention to occur, in whole or in part. The computer may be a general purpose computer, a special purpose computer, a network of computers, or other programmable device. The computer instructions may be stored in a computer readable storage medium or transmitted from one computer readable storage medium to another, for example, the computer instructions may be transmitted from one website site, computer, server, or data center to another website site, computer, server, or data center via wire (e.g., coaxial cable, fiber optic, Digital Subscriber Line (DSL), or wireless (e.g., infrared, wireless, microwave, etc.)). The computer-readable storage medium can be any available medium that can be accessed by a computer or a data storage device, such as a server, a data center, etc., that includes one or more of the available media. The usable medium may be a magnetic medium (e.g., floppy Disk, hard Disk, magnetic tape), an optical medium (e.g., DVD), or a semiconductor medium (e.g., Solid State Disk (SSD)), among others.
The above description is only for the purpose of illustrating the present invention and the appended claims are not to be construed as limiting the scope of the invention, which is intended to cover all modifications, equivalents and improvements that are within the spirit and scope of the invention as defined by the appended claims.

Claims (10)

1. A scientific and technological information resource retrieval and query method based on big data is characterized by comprising the following steps:
firstly, realizing the input of user search query information by using an input program through a text input dialog box; building a basic hot word bank related to the input query information;
secondly, performing Chinese word segmentation processing on the input query information and outputting word segmentation results; declaring a new array arrs _ a, traversing the word segmentation result, and adding a word segmentation in the word segmentation result to the array arrs _ a if a word segmentation in the traversal process is matched with a hot word in the basic hot word bank;
thirdly, sequencing the array arrs _ a according to the word length and the word position of the word; traversing the sorted array arrs _ a, sequentially performing null-replace operation on the query information aiming at each participle in the array arrs _ a, and taking the obtained final word as a query information keyword;
fourthly, acquiring required scientific and technological information data from the internet by a data acquisition program through Hash calculation according to the retrieval query keyword obtained in the third step; calculating basic probability assignment of an object to be fused provided by any homogeneous data based on the fusion frame by utilizing a preset basic probability distribution function for the acquired related scientific and technical information data;
fifthly, performing orthogonality and operation on basic probability assignments provided by homogeneous data acquired by any data acquisition system in multiple periods to obtain time dimension probability assignments of the object to be fused in the data acquisition system;
sixthly, performing orthogonality and operation on the time dimension probability assignments of the object to be fused in the plurality of data acquisition systems to obtain the time-space dimension probability assignments of the object to be fused in the plurality of data acquisition systems and the plurality of periods;
seventhly, determining a data fusion result of the object to be fused in the data fusion frame by utilizing the spatiotemporal dimension probability assignment to obtain related scientific and technological information data; storing the processed scientific and technological information data through a temporary memory;
eighthly, searching and inquiring key information resources from the temporary storage by a searching and inquiring program; the searching and querying of the key information resource from the temporary storage by the searching and querying program comprises the following steps:
1) the mobile terminal receives an information triggering instruction of the text input dialog box and acquires retrieval information required by a user according to a retrieval query keyword input by the user;
2) the mobile terminal generates an information calling request according to the information triggering instruction and sends the information calling request to the main control computer; the information invoking request comprises: retrieving item information;
3) the main control computer analyzes the information calling request and acquires the corresponding calling information list data according to the retrieval item information;
4) the main control machine sends the calling information list data to the terminal, and the terminal searches and queries key information resources from the temporary storage according to the calling information list data;
ninth, generating the retrieval query report according to the retrieval query result in the eighth step by a retrieval report generating program; receiving scientific and technological information data through a mobile terminal, and remotely controlling a scientific and technological information resource retrieval and query system;
step ten, storing keywords, scientific and technological information data and a retrieval query report of retrieval query through a cloud server; and displaying the keywords of the retrieval query, the scientific and technical information data and the real-time data of the retrieval query report through a display.
2. A scientific and technological information resource retrieval query method based on big data as claimed in claim 1, characterized in that, before the first step, it also needs to perform:
performing identity verification based on the identity verification program; and analyzing and counting the access behaviors, and intercepting the abnormal behaviors.
3. A scientific and technological information resource retrieval query method based on big data as claimed in claim 1, characterized in that said third step is followed by:
and counting the current search terms and the use frequency, the search time, the search source, the browser type and other relevant information thereof.
4. The method for searching and querying scientific and technological information resources based on big data according to claim 1, wherein in the fourth step, the method for obtaining the required scientific and technological information data from the internet by the data obtaining program by using hash calculation according to the search query keyword comprises:
(I) performing hash calculation on data information carried in a data acquisition request sent by a client to obtain a hash calculation result;
(II) determining a target virtual node on a hash ring containing at least two virtual nodes based on the hash calculation result;
(III) if the target virtual node cannot respond to the data acquisition request, sending the data acquisition request to another virtual node on the hash ring, which is located at the downstream of the target virtual node, so that the data acquisition request is processed by the other virtual node located at the downstream of the target virtual node.
5. A scientific and technological information resource retrieval query method based on big data as claimed in claim 1, characterized in that in the fourth step, the homogeneous data is image data of the same region, the object to be fused is the point to be fused in the region, and the fusion frame is a set including the target element and the background element.
6. A scientific and technological information resource retrieval query method based on big data as claimed in claim 1, characterized in that in the eighth step, when the calling information list data includes the required retrieval information, the terminal obtains and outputs the corresponding retrieval result information from the server in turn according to the retrieval result data source address information according to the received first access operation instruction;
and when the calling information list data does not comprise the required retrieval information, the terminal receives voice information and analyzes the voice information to generate retrieval information.
7. The scientific and technological information resource retrieval and query method based on big data as claimed in claim 1, wherein in the eighth step, the mobile terminal receives the call information and analyzes the call information to generate the retrieval information specifically as follows:
the mobile terminal stores preset standard keywords and keyword expansion rules;
the mobile terminal receives calling information and identifies the calling information to obtain a voice keyword;
the mobile terminal matches the calling information keywords with the standard keywords to obtain retrieval keywords;
and the mobile terminal expands the search keywords according to a preset keyword expansion rule to generate the search information.
8. A scientific and technological information resource retrieval and query system based on big data applying the scientific and technological information resource retrieval and query method based on big data as claimed in any one of claims 1 to 7, characterized in that the scientific and technological information resource retrieval and query system based on big data comprises:
the user identity authentication module is connected with the main control module and is used for performing identity authentication based on an identity authentication program;
the user access behavior analysis module is connected with the main control module and is used for analyzing and counting access behaviors and intercepting abnormal behaviors of the user;
the keyword input module is connected with the main control module and used for realizing the input of retrieval query information by utilizing an input program through a text input dialog box;
the query preprocessing module is connected with the main control module and used for segmenting input query information and extracting keywords;
the retrieval statistic module is used for counting the current retrieval words, the using frequency, the retrieval time, the retrieval sources, the browser types and other related information;
the data acquisition module is connected with the main control module and used for acquiring required scientific and technological information data from the Internet according to the retrieval query keyword by utilizing Hash calculation through a data acquisition program;
the data processing module is connected with the main control module and is used for classifying and fusing the acquired scientific and technological information data through a data processing program;
the data temporary storage module is connected with the main control module and used for storing the processed scientific and technological information data through a temporary memory;
the main control module is connected with the user identity verification module, the user access behavior analysis module, the keyword input module, the query preprocessing module, the retrieval statistics module, the data acquisition module, the data processing module, the data temporary storage module, the information recommendation module, the information retrieval query module, the retrieval report generation module, the information terminal module, the information storage module and the display module and is used for controlling the normal operation of each module of the scientific and technological information resource retrieval query system through the main control computer;
the information recommendation module is connected with the main control module and used for recommending corresponding information resources based on the retrieval behavior statistical result;
the information retrieval query module is connected with the main control module and is used for retrieving and querying key information resources from the temporary storage through a retrieval query program;
the retrieval report generation module is connected with the main control module and used for generating a retrieval query report according to a retrieval query result through a retrieval report generation program;
the information terminal module is connected with the main control module and used for receiving scientific and technological information data through the mobile terminal and remotely controlling the scientific and technological information resource retrieval and query system;
the information storage module is connected with the main control module and used for storing keywords, scientific and technological information data and retrieval and query reports of user retrieval and query through the cloud server;
and the display module is connected with the main control module and used for displaying the keywords of the retrieval query, the scientific and technological information data and the real-time data of the retrieval query report through the display.
9. A computer program product stored on a computer readable medium, comprising a computer readable program, which when executed on an electronic device, provides a user input interface to implement the big data based technology information resource retrieval query method according to any one of claims 1 to 7.
10. A computer-readable storage medium storing instructions which, when executed on a computer, cause the computer to execute the method for searching and querying scientific and technical information resource based on big data according to any one of claims 1 to 7.
CN202010210056.5A 2020-03-23 2020-03-23 Scientific and technological information resource retrieval and query system and method based on big data Active CN111382341B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010210056.5A CN111382341B (en) 2020-03-23 2020-03-23 Scientific and technological information resource retrieval and query system and method based on big data

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010210056.5A CN111382341B (en) 2020-03-23 2020-03-23 Scientific and technological information resource retrieval and query system and method based on big data

Publications (2)

Publication Number Publication Date
CN111382341A true CN111382341A (en) 2020-07-07
CN111382341B CN111382341B (en) 2022-08-26

Family

ID=71217346

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010210056.5A Active CN111382341B (en) 2020-03-23 2020-03-23 Scientific and technological information resource retrieval and query system and method based on big data

Country Status (1)

Country Link
CN (1) CN111382341B (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113949750A (en) * 2020-12-09 2022-01-18 中国信息通信研究院 Handle identifier analysis caching method, query method and handle identifier analysis system
CN114760120A (en) * 2022-03-31 2022-07-15 苏州市强旭科技有限公司 Safety monitoring system for computer data
CN117076783A (en) * 2023-10-16 2023-11-17 广东省科技基础条件平台中心 Scientific and technological information recommendation method, device, medium and equipment based on data analysis
CN117633255A (en) * 2024-01-25 2024-03-01 中国标准化研究院 Scientific and technological resource identification analysis method and system based on active identification

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102270234A (en) * 2011-08-01 2011-12-07 北京航空航天大学 Image search method and search engine
CN104503991A (en) * 2014-12-03 2015-04-08 百度在线网络技术(北京)有限公司 Information searching method and device
CN109829104A (en) * 2019-01-14 2019-05-31 华中师范大学 Pseudo-linear filter model information search method and system based on semantic similarity
US10402191B1 (en) * 2018-07-17 2019-09-03 Morgan Stanley Services Group Inc. Fault resistant 24×7 topology for business process management ecosystem

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102270234A (en) * 2011-08-01 2011-12-07 北京航空航天大学 Image search method and search engine
CN104503991A (en) * 2014-12-03 2015-04-08 百度在线网络技术(北京)有限公司 Information searching method and device
US10402191B1 (en) * 2018-07-17 2019-09-03 Morgan Stanley Services Group Inc. Fault resistant 24×7 topology for business process management ecosystem
CN109829104A (en) * 2019-01-14 2019-05-31 华中师范大学 Pseudo-linear filter model information search method and system based on semantic similarity

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113949750A (en) * 2020-12-09 2022-01-18 中国信息通信研究院 Handle identifier analysis caching method, query method and handle identifier analysis system
CN113949750B (en) * 2020-12-09 2023-09-19 中国信息通信研究院 Handle identification analysis caching method, query method and handle identification analysis system
CN114760120A (en) * 2022-03-31 2022-07-15 苏州市强旭科技有限公司 Safety monitoring system for computer data
CN117076783A (en) * 2023-10-16 2023-11-17 广东省科技基础条件平台中心 Scientific and technological information recommendation method, device, medium and equipment based on data analysis
CN117076783B (en) * 2023-10-16 2023-12-26 广东省科技基础条件平台中心 Scientific and technological information recommendation method, device, medium and equipment based on data analysis
CN117633255A (en) * 2024-01-25 2024-03-01 中国标准化研究院 Scientific and technological resource identification analysis method and system based on active identification
CN117633255B (en) * 2024-01-25 2024-04-05 中国标准化研究院 Scientific and technological resource identification analysis method and system based on active identification

Also Published As

Publication number Publication date
CN111382341B (en) 2022-08-26

Similar Documents

Publication Publication Date Title
CN111382341B (en) Scientific and technological information resource retrieval and query system and method based on big data
US11681944B2 (en) System and method to generate a labeled dataset for training an entity detection system
JP2021119463A (en) Method for generating knowledge graph, method for mining relation, device, apparatus, and medium
KR20200019824A (en) Entity relationship data generating method, apparatus, equipment and storage medium
JP6734946B2 (en) Method and apparatus for generating information
CN111400504B (en) Method and device for identifying enterprise key people
CN111552799B (en) Information processing method, information processing device, electronic equipment and storage medium
CN110019211A (en) The methods, devices and systems of association index
CN110765295A (en) Graph database-based query method and device, computer equipment and storage medium
CN111552797B (en) Name prediction model training method and device, electronic equipment and storage medium
CN111324804B (en) Search keyword recommendation model generation method, keyword recommendation method and device
CN111314063A (en) Big data information management method, system and device based on Internet of things
CN110321252B (en) Skill service resource scheduling method and device
CN110737820B (en) Method and apparatus for generating event information
CN115204889A (en) Text processing method and device, computer equipment and storage medium
CN108959294B (en) Method and device for accessing search engine
CN110515979B (en) Data query method, device, equipment and storage medium
CN114547257B (en) Class matching method and device, computer equipment and storage medium
US20230252980A1 (en) Multi-channel conversation processing
JP2023015275A (en) Observation information processing method, apparatus, electronic device, storage medium, and computer program
CN113780827A (en) Article screening method and device, electronic equipment and computer readable medium
CN110321435B (en) Data source dividing method, device, equipment and storage medium
CN113297436A (en) User policy distribution method and device based on relational graph network and electronic equipment
CN111695031A (en) Label-based searching method, device, server and storage medium
CN112632962A (en) Method and device for realizing natural language understanding in human-computer interaction system

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant