CN117076783B - Scientific and technological information recommendation method, device, medium and equipment based on data analysis - Google Patents

Scientific and technological information recommendation method, device, medium and equipment based on data analysis Download PDF

Info

Publication number
CN117076783B
CN117076783B CN202311329773.XA CN202311329773A CN117076783B CN 117076783 B CN117076783 B CN 117076783B CN 202311329773 A CN202311329773 A CN 202311329773A CN 117076783 B CN117076783 B CN 117076783B
Authority
CN
China
Prior art keywords
data
information
keyword
technological
user
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202311329773.XA
Other languages
Chinese (zh)
Other versions
CN117076783A (en
Inventor
张熙
蔡宇铮
李洁儒
石慧芳
许上云
李正权
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Guangdong Science & Technology Infrastructure Center
Original Assignee
Guangdong Science & Technology Infrastructure Center
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Guangdong Science & Technology Infrastructure Center filed Critical Guangdong Science & Technology Infrastructure Center
Priority to CN202311329773.XA priority Critical patent/CN117076783B/en
Publication of CN117076783A publication Critical patent/CN117076783A/en
Application granted granted Critical
Publication of CN117076783B publication Critical patent/CN117076783B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9535Search customisation based on user profiles and personalisation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/335Filtering based on additional data, e.g. user or group profiles
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/10Pre-processing; Data cleansing
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/22Matching criteria, e.g. proximity measures
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • G06F18/241Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches

Abstract

According to the technical information recommending method, device, medium and equipment based on data analysis, the technical information related text information is monitored and collected, the technical information content is divided into a plurality of region grades according to the keyword indexes of the technical information content, and the different region grades correspond to different grade authorities respectively; acquiring index values corresponding to keyword indexes of each region level, wherein the index values comprise keyword searching times and searching numbers of different users; analyzing and sorting according to the sorting value of each region level, and respectively storing the technological information content of each region level according to the sorting; sequencing the technological information content matched with the keyword field input by the user according to the matching degree sequence; and selecting technological information content with the area level lower than the user authority of the user for output. The method and the device can solve the problem that the existing data resource recommendation platform does not consider the influence of different user behaviors and technological information heat, and the resource allocation is solidified.

Description

Scientific and technological information recommendation method, device, medium and equipment based on data analysis
Technical Field
The present invention relates to the field of data processing, and in particular, to a method, an apparatus, a medium, and a device for recommending scientific and technological information based on data analysis.
Background
The current society is an information society age and a big data time science and technology age, and along with the continuous development and progress of information technologies such as the Internet, the Internet of things, cloud computing, artificial intelligence and the like and computer industries, the data processing becomes a problem to be solved urgently; therefore, in the context of big data, how to efficiently retrieve useful information from a database has become a focus of attention of enterprises and scientific research institutions, and the key technology involved in the work is a data analysis and data mining technology; in summary, the need for data processing presents both opportunities for data mining techniques and a series of challenges.
With the deep development of the mobile internet, a foundation is laid for the wide application of data resource recommendation. There is also a need for using data resource recommendation systems in power marketing to provide users with more service information of interest or to facilitate users to quickly search for relevant information of interest themselves.
Because the resource heat can intuitively embody the influence range and importance of data, in most data resource recommendation systems, the resource heat is taken as a data resource recommendation standard, however, the existing resource heat analysis statistics usually determine the resource heat by counting the access times, and the influence of different user behaviors and technological information heat is not considered, so that the resource allocation is solidified, the intrusion detection efficiency and the reliability are low, and the behaviors attempting to destroy the confidentiality, the integrity and the availability of the information resource are difficult to accurately identify.
Disclosure of Invention
In order to solve the technical problems, the invention provides a scientific and technological information recommending method, a device, a medium and equipment based on data analysis, which can solve the problem that the conventional data resource recommending platform does not consider the influence of different user behaviors and the heat of the scientific and technological information, so that resources are allocated and solidified.
The embodiment of the invention provides a scientific and technological information recommendation method based on data analysis, which comprises the following steps:
monitoring and collecting related text information of the technical information to obtain keyword indexes of the content of the technical information;
dividing the technical information content into a plurality of region grades according to the keyword index of the technical information content, wherein different region grades respectively correspond to different grade authorities;
acquiring index values corresponding to keyword indexes of each region level, wherein the index values comprise keyword searching times and searching numbers of different users;
inputting index values corresponding to the keyword indexes of each region level into a pre-established technological innovation analysis model for calculation to obtain a sequencing value of each region level;
analyzing and sorting according to the sorting value of each region level, and respectively storing the technological information content of each region level according to the sorting;
matching keyword fields input by a user with stored technical information contents, and sequencing the matched technical information contents according to the matching degree sequence of the keyword fields;
and selecting the technical information content with the area level lower than the user authority of the user from the ordered technical information content for outputting.
Preferably, after monitoring and collecting the text information related to the technical information, the method further comprises:
and carrying out data preprocessing on the technical information, eliminating data noise and data irrelevant to the technical subject, and carrying out real-time updating on the processed text data information.
As an improvement of the above solution, the process of performing data preprocessing on the technical information, eliminating data noise and data irrelevant to the technical theme, and performing real-time update on the processed text data information specifically includes:
acquiring field names and values of text information related to input technical information;
generating a list, and traversing data items of text information related to the input technical information;
the first data item is put into the list, and the remaining data items are compared with the values of the data items in the list: if the value of the field in a certain data item is the same as the value of the data item in the list, judging the data item as repeated data; if the values of the fields in a certain data item are different from the values of the data items in the list, judging the data item as non-repeated data, and storing the non-repeated data into the list;
and after traversing, finally taking the data in the list as the technological information content.
As a preferred solution, after selecting, from the ordered technological information contents, a technological information content output with a region level lower than the user authority of the user, the method further includes:
and displaying the related data information of the technological information to the user in a data visualization mode.
Further, the displaying the related data information of the technical information to the user through the data visualization method specifically includes:
acquiring technological information contents of a plurality of regional levels, carrying out data abstraction on index values corresponding to keyword indexes of each regional level, and establishing an information polyhedron data model aiming at an information side face and the user access to realize visualization of the information side face and the user access;
adopting a DataV technology for the polyhedral data model, and visualizing complex association relationship data in the scientific and technological information content;
the information polyhedron data model comprises a user access model, an information side model and a keyword popularity model.
As an improvement of the above scheme, the user access model includes a user search category classification, a user permission association table set and a user access time dimension set;
the information side model comprises a data item set, a data item association table set, a data item time dimension set, a data item region dimension set and a data item category classification, wherein the data item set comprises a data item and a data item keyword set;
the keyword popularity model comprises a keyword set and keyword class classification.
Preferably, the technological innovation analysis model is that
Wherein,ordering value for regional class, +.>Is the coefficient of the nth index, +.>Is the weight of the nth index,,/>wherein->An n-th index which is an index value of the keyword, < + >>Is the average of the nth index of the keyword index values.
The embodiment of the invention also provides a scientific and technological information recommending device based on data analysis, which comprises:
the data acquisition module is used for monitoring and acquiring the related text information of the technical information and acquiring the keyword index of the content of the technical information;
the data analysis module is used for dividing the scientific and technological information content into a plurality of region grades according to the keyword index of the scientific and technological information content, and different region grades respectively correspond to different grade authorities;
the index value acquisition module is used for acquiring index values corresponding to the keyword indexes of the regional grades, wherein the index values comprise the keyword searching times and the searching quantity of different users;
the ranking value calculation module is used for inputting index values corresponding to the keyword indexes of the region grades into a pre-established technological innovation analysis model for calculation to obtain ranking values of the region grades;
the sequencing module is used for analyzing and sequencing according to the sequencing value of each region level, and storing the technological information content of each region level according to the sequencing;
the matching module is used for matching the keyword field input by the user with the stored technical information content and sequencing the matched technical information content according to the matching degree sequence of the keyword field;
and the output module is used for selecting the technical information content with the area level lower than the user authority of the user from the ordered technical information content to output.
Preferably, the apparatus further comprises a data processing module for:
after the related text information of the technical information is monitored and collected, the technical information is subjected to data preprocessing, data noise and data irrelevant to the technical subject are eliminated, and the processed text data information is updated in real time.
Further, the data processing module is specifically configured to:
acquiring field names and values of text information related to input technical information;
generating a list, and traversing data items of text information related to the input technical information;
the first data item is put into the list, and the remaining data items are compared with the values of the data items in the list: if the value of the field in a certain data item is the same as the value of the data item in the list, judging the data item as repeated data; if the values of the fields in a certain data item are different from the values of the data items in the list, judging the data item as non-repeated data, and storing the non-repeated data into the list;
and after traversing, finally taking the data in the list as the technological information content.
Preferably, the apparatus further comprises a visualization module for:
and after the technological information content with the area level lower than the user authority of the user is selected from the ordered technological information contents and output, the relevant technological information data information is displayed to the user in a data visualization mode.
Further, the visualization module is specifically configured to:
acquiring technological information contents of a plurality of regional levels, carrying out data abstraction on index values corresponding to keyword indexes of each regional level, and establishing an information polyhedron data model aiming at an information side face and the user access to realize visualization of the information side face and the user access;
adopting a DataV technology for the polyhedral data model, and visualizing complex association relationship data in the scientific and technological information content;
the information polyhedron data model comprises a user access model, an information side model and a keyword popularity model.
Further, the user access model comprises a user search category classification, a user permission association table set and a user access time dimension set;
the information side model comprises a data item set, a data item association table set, a data item time dimension set, a data item region dimension set and a data item category classification, wherein the data item set comprises a data item and a data item keyword set;
the keyword popularity model comprises a keyword set and keyword class classification.
Preferably, the technological innovation analysis model is that
Wherein,ordering value for regional class, +.>Is the coefficient of the nth index, +.>Is the weight of the nth index,,/>wherein->An n-th index which is an index value of the keyword, < + >>Is the average of the nth index of the keyword index values.
The embodiment of the invention also provides a computer readable storage medium, which comprises a stored computer program, wherein the computer program controls equipment where the computer readable storage medium is located to execute the scientific and technological information recommendation method based on data analysis according to any one of the above embodiments when running.
The embodiment of the invention also provides a terminal device, which comprises a processor, a memory and a computer program stored in the memory and configured to be executed by the processor, wherein the processor realizes the scientific and technological information recommendation method based on data analysis according to any one of the above embodiments when executing the computer program.
According to the technical information recommending method, device, medium and equipment based on data analysis, the keyword index of the content of the technical information is obtained by monitoring and collecting the related text information of the technical information; dividing the technical information content into a plurality of region grades according to the keyword index of the technical information content, wherein different region grades respectively correspond to different grade authorities; acquiring index values corresponding to keyword indexes of each region level, wherein the index values comprise keyword searching times and searching numbers of different users; inputting index values corresponding to the keyword indexes of each region level into a pre-established technological innovation analysis model for calculation to obtain a sequencing value of each region level; analyzing and sorting according to the sorting value of each region level, and respectively storing the technological information content of each region level according to the sorting; matching keyword fields input by a user with stored technical information contents, and sequencing the matched technical information contents according to the matching degree sequence of the keyword fields; and selecting the technical information content with the area level lower than the user authority of the user from the ordered technical information content for outputting. The method and the device can solve the problem that the existing data resource recommendation platform does not consider the influence of different user behaviors and technological information heat, and the resource allocation is solidified.
Drawings
Fig. 1 is a schematic flow chart of a technological information recommendation method based on data analysis according to an embodiment of the present invention;
FIG. 2 is a schematic flow chart of data preprocessing according to an embodiment of the present invention;
FIG. 3 is a flow chart of data visualization provided by an embodiment of the present invention;
fig. 4 is a schematic structural diagram of a scientific and technological information recommendation device based on data analysis according to an embodiment of the present invention;
fig. 5 is a schematic structural diagram of a terminal device according to an embodiment of the present invention.
Detailed Description
The following description of the embodiments of the present invention will be made clearly and completely with reference to the accompanying drawings, in which it is apparent that the embodiments described are only some embodiments of the present invention, but not all embodiments. All other embodiments, which can be made by those skilled in the art based on the embodiments of the invention without making any inventive effort, are intended to be within the scope of the invention.
The embodiment of the invention provides a scientific and technological information recommending method based on data analysis, and referring to fig. 1, the method is a flow chart of the scientific and technological information recommending method based on data analysis, and the method comprises the following steps of S1-S7:
s1, monitoring and collecting related text information of technical information to obtain keyword indexes of the content of the technical information;
s2, dividing the scientific and technological information content into a plurality of regional grades according to the keyword index of the scientific and technological information content, wherein different regional grades respectively correspond to different grade authorities;
s3, obtaining index values corresponding to keyword indexes of all the regional grades, wherein the index values comprise keyword searching times and searching numbers of different users;
s4, inputting index values corresponding to the keyword indexes of the region grades into a pre-established technological innovation analysis model for calculation to obtain a sorting value of the region grades;
s5, analyzing and sorting according to the sorting value of each area level, and storing the technological information content of each area level according to the sorting;
s6, matching the keyword field input by the user with the stored technical information content, and sequencing the matched technical information content according to the matching degree sequence of the keyword field;
s7, selecting the technical information content with the area level lower than the user authority of the user from the ordered technical information content to output.
When the embodiment is implemented, firstly, the related characters of the technical information are required to be collected, namely, the related characters of the technical information are monitored and collected, the collection mode can be the technical information added by a user, and the technical information is classified in multiple areas to prevent the condition of out-of-order access; and storing the text information related to the technical information.
Then carrying out technological innovation analysis on the processed technological information content according to the established technological innovation analysis model to obtain keyword indexes of the technological information content;
dividing the technical information into a plurality of region grades according to the keyword indexes of the content of the technical information, wherein the region grades respectively correspond to the multi-level authorities and are used for limiting the access authorities of different users.
The user authority is divided into three levels of authority which respectively correspond to a primary user, a middle-level user and a high-level user, wherein the primary user authority is only used for searching the technical information, the primary user does not have middle-level and high-level access authority, namely, the technical information in the primary area level is allowed to be accessed, the middle-level user can add, delete and modify the technical information on the basis of the technical information search, the technical information uploaded by the user is only processed in the deleting and modifying mode, the middle-level user does not have high-level access authority, namely, the technical information in the primary area level and the middle-level area level is allowed to be accessed, the high-level user can select to call the technical information on the basis of adding new technical information, and a plurality of related technical information contents can be simultaneously displayed, namely, the technical information in the primary area level, the middle-level area level and the high-level area level is allowed to be accessed.
It should be noted that, in still another embodiment provided by the present invention, the user rights are divided into four-level rights, where the four-level rights respectively correspond to a primary user, a middle-level user, a high-level user and a special-level user, the primary user rights only refer to searching for technical information, and the primary user has no middle-level and high-level access rights, that is, allows accessing technical information in a primary area level, the middle-level user can add, delete and modify the technical information based on searching for the technical information, and the deleting and modifying the technical information only refer to processing the technical information uploaded by the user, the middle-level user has no high-level access rights, that is, allows accessing the technical information in the primary and middle-level area levels, and the high-level user can select to call the technical information based on adding new technical information, can simultaneously display a plurality of related pieces of technical information content, that allows accessing the technical information in the primary, middle-level and high-level area levels, that allows modifying and deleting the technical information published by any user, and can govern management of other users, such as a special-level account number of users can not read the technical information.
Acquiring index values corresponding to the keyword indexes of each region level, wherein the index values comprise the search times of the keywords and the search quantity of different users, inputting the index values corresponding to the keyword indexes of each region level into a technological innovation analysis model, analyzing and sorting the index values according to the index values of each region level, wherein the sorting mode can be one of positive sequence and reverse sequence, and storing corresponding technological information according to the index value sorting of each region;
matching keywords input by a user with stored related words of the technical information, outputting the technical information with high matching degree preferentially according to the keyword field, screening the technical information to be output, and selecting the technical information with the area level lower than the current user authority level for output;
before the user inputs the keywords through the input box, the user further performs recognition analysis on the keywords input by the user through data monitoring to judge whether the input keywords are correct or not, and if the input keywords do not correspond, the display fails.
According to the method and the device, the index values corresponding to the keyword indexes of the regional grades are obtained, wherein the index values comprise the search times of the keywords and the search quantity of different users, the index values corresponding to the keyword indexes of the regional grades are input into the technological innovation analysis model, analysis and sorting are carried out according to the index values of the regional grades, and the heat of the technological information under the different regional grades is effectively reflected. Dividing the scientific and technological information into a plurality of regional grades according to the keyword indexes of the content of the scientific and technological information, wherein the regional grades respectively correspond to the multilevel authorities of users, matching the keywords input by the users with related words of the scientific and technological information stored by a data processing host, preferentially outputting the scientific and technological information with high matching degree according to the keyword fields, screening the scientific and technological information to be output, and selecting the scientific and technological information with the regional grade lower than the authority grade of the current user to output, so that the resource allocation is more flexible, and the safety of the scientific and technological resources is effectively improved. The technical problem that the resource allocation solidification is caused by the fact that the prior scientific and technological service platform does not consider the influences of different user behaviors and the heat of scientific and technological information can be solved.
In still another embodiment of the present invention, after the monitoring and collecting the text information related to the technical information, the method further includes:
and carrying out data preprocessing on the technical information, eliminating data noise and data irrelevant to the technical subject, and carrying out real-time updating on the processed text data information.
When the embodiment is implemented, after the related text information of the technical information is monitored and collected, the related text information of the technical information is stored, the collected technical information is subjected to data preprocessing, data noise and data irrelevant to the technical subject are eliminated, and the processed text data information is updated in real time.
The acquired technical information is subjected to data preprocessing to eliminate data noise and data irrelevant to the technical subject, so that the quality of a data set is improved, and the requirement of data analysis at the present stage is met.
In another embodiment of the present invention, the process of performing data preprocessing on the technical information, eliminating data noise and data irrelevant to the technical theme, and performing real-time update on the processed text data information specifically includes:
acquiring field names and values of text information related to input technical information;
generating a list, and traversing data items of text information related to the input technical information;
the first data item is put into the list, and the remaining data items are compared with the values of the data items in the list: if the value of the field in a certain data item is the same as the value of the data item in the list, judging the data item as repeated data; if the values of the fields in a certain data item are different from the values of the data items in the list, judging the data item as non-repeated data, and storing the non-repeated data into the list;
and after traversing, finally taking the data in the list as the technological information content.
In the implementation of this embodiment, referring to fig. 2, a schematic flow chart of data preprocessing provided in the embodiment of the present invention is shown; the data preprocessing steps are as follows:
acquiring field names and values of text information related to input technical information;
then generating a list and traversing data items of the text information related to the input technical information;
putting the first data item into a list, and comparing the rest data items with the values of the data items in the list respectively:
if the value of the field in the data item is the same as the value of the data item in the list, judging that the data is repeated, and not storing the repeated data in the list;
if the values of the fields in a certain data item are different from the values of the data items in the list, judging the data item as non-repeated data, and storing the non-repeated data into the list;
and after traversing, finally, storing and updating the data in the list as technological information content.
Data noise and data irrelevant to a science and technology theme are eliminated by carrying out data preprocessing on the acquired science and technology information, and the data preprocessing is carried out: for example, missing data are filled, noise data are eliminated, and the like, mainly through analyzing the generation reason and the existence form of the data, the data are cleaned by utilizing the existing data acquisition means and methods, and the data are converted into data meeting the data quality requirement or application requirement, so that the quality of a data set is improved, the requirement of data analysis at the present stage is met, and the processed text data information is updated in real time.
In yet another embodiment provided by the present invention, after the step S7, the method further includes:
and displaying the related data information of the technological information to the user in a data visualization mode.
When the embodiment is implemented, the related data information of the technical information is displayed to the user in a data visualization mode, so that the user is helped to carry out consultation and communication with the platform, the result or the intermediate result of the index value corresponding to the keyword index of the technical information is displayed or explained to the user, and the user can be helped to browse the content of the data object.
In another embodiment of the present invention, the displaying, by means of data visualization, the technological information related data information to the user specifically includes:
acquiring technological information contents of a plurality of regional levels, carrying out data abstraction on index values corresponding to keyword indexes of each regional level, and establishing an information polyhedron data model aiming at an information side face and the user access to realize visualization of the information side face and the user access;
adopting a DataV technology for the polyhedral data model, and visualizing complex association relationship data in the scientific and technological information content;
the information polyhedron data model comprises a user access model, an information side model and a keyword popularity model.
In the implementation of this embodiment, referring to fig. 3, a schematic flow chart of data visualization provided in the embodiment of the present invention is shown; the data visualization steps are as follows:
firstly, acquiring technological information of a plurality of regional levels, carrying out data abstraction on index values corresponding to keyword indexes of each regional level, establishing an information polyhedron data model aiming at an information side face and user access, realizing visualization of the information side face and the user access, and then combining a DataV technology to realize visualization of complex association relation data in technological innovation data, wherein the DataV technology makes a data visualization page which can be displayed on a display screen.
The information polyhedron data model comprises a user access model, an information side model and a keyword popularity model.
And displaying the related data information of the technical information to the user in a data visualization mode.
In yet another embodiment provided by the present invention, the user access model includes a user search category classification, a set of user permission association tables, and a set of user access time dimensions;
the information side model comprises a data item set, a data item association table set, a data item time dimension set, a data item region dimension set and a data item category classification, wherein the data item set comprises a data item and a data item keyword set;
the keyword popularity model comprises a keyword set and keyword class classification.
When the embodiment is implemented, the user access model comprises user search category classification, a user permission association table set and a user access time dimension set;
the information side model comprises a data item set, a data item association table set, a data item time dimension set, a data item region dimension set and a data item category classification, wherein the data item set comprises data items and a data item keyword set;
the keyword popularity model includes a set of keywords and keyword category classification.
The visualization of complex association relationship data in the scientific and technological information content can be realized through an information polyhedron data model formed by the user access model, the information side model and the keyword popularity model.
In a further embodiment of the present invention, the technological innovation analysis model is
Wherein,ordering value for regional class, +.>Is the coefficient of the nth index, +.>Is the weight of the nth index,,/>wherein->An n-th index which is an index value of the keyword, < + >>Is the average of the nth index of the keyword index values.
In the implementation of this embodiment, the technological innovation analysis model is as follows,/>For regional-level rowsSequence value->Is the coefficient of the nth index, +.>Is the weight of the nth index, +.>,/>,/>An n-th index which is an index value of the keyword, < + >>Is the average of the nth index of the keyword index values.
Analyzing and sorting the region grades through a technological innovation analysis model, and respectively storing technological information contents of the region grades according to the sorting; thereby enabling regional grading of the scientific and technological information contents. And the scientific and technological information with the regional level lower than the current user authority level is selected for output, so that the resource allocation is more flexible, and the safety of the scientific and technological resource is effectively improved.
Referring to fig. 4, a schematic structural diagram of a scientific and technological information recommendation device based on data analysis according to an embodiment of the present invention is provided;
the data acquisition module is used for monitoring and acquiring the related text information of the technical information and acquiring the keyword index of the content of the technical information;
the data analysis module is used for dividing the scientific and technological information content into a plurality of region grades according to the keyword index of the scientific and technological information content, and different region grades respectively correspond to different grade authorities;
the index value acquisition module is used for acquiring index values corresponding to the keyword indexes of the regional grades, wherein the index values comprise the keyword searching times and the searching quantity of different users;
the ranking value calculation module is used for inputting index values corresponding to the keyword indexes of the region grades into a pre-established technological innovation analysis model for calculation to obtain ranking values of the region grades;
the sequencing module is used for analyzing and sequencing according to the sequencing value of each region level, and storing the technological information content of each region level according to the sequencing;
the matching module is used for matching the keyword field input by the user with the stored technical information content and sequencing the matched technical information content according to the matching degree sequence of the keyword field;
and the output module is used for selecting the technical information content with the area level lower than the user authority of the user from the ordered technical information content to output.
It should be noted that, the technical information recommendation device based on data analysis provided in the embodiment of the present invention can execute the technical information recommendation method based on data analysis described in any embodiment of the foregoing embodiments, and specific functions of the technical information recommendation device based on data analysis are not described herein.
Referring to fig. 5, a schematic structural diagram of a terminal device according to an embodiment of the present invention is provided. The terminal device of this embodiment includes: a processor, a memory, and a computer program stored in the memory and executable on the processor, such as a scientific and technological information recommendation program based on data analysis. The steps in the above embodiments of the method for recommending technological information based on data analysis are implemented when the processor executes the computer program, for example, steps S1 to S7 shown in fig. 1. Alternatively, the processor may implement the functions of the modules in the above-described device embodiments when executing the computer program.
The computer program may be divided into one or more modules/units, which are stored in the memory and executed by the processor to accomplish the present invention, for example. The one or more modules/units may be a series of computer program instruction segments capable of performing the specified functions, which instruction segments are used for describing the execution of the computer program in the terminal device. For example, the computer program may be divided into modules, and specific functions of each module are not described herein.
The terminal equipment can be computing equipment such as a desktop computer, a notebook computer, a palm computer, a cloud server and the like. The terminal device may include, but is not limited to, a processor, a memory. It will be appreciated by those skilled in the art that the schematic diagram is merely an example of a terminal device and does not constitute a limitation of the terminal device, and may include more or less components than illustrated, or may combine certain components, or different components, e.g., the terminal device may further include an input-output device, a network access device, a bus, etc.
The processor may be a central processing unit (Central Processing Unit, CPU), other general purpose processors, digital signal processors (Digital Signal Processor, DSP), application specific integrated circuits (Application Specific Integrated Circuit, ASIC), field programmable gate arrays (Field-Programmable Gate Array, FPGA) or other programmable logic devices, discrete gate or transistor logic devices, discrete hardware components, or the like. The general purpose processor may be a microprocessor or the processor may be any conventional processor or the like, which is a control center of the terminal device, and which connects various parts of the entire terminal device using various interfaces and lines.
The memory may be used to store the computer program and/or module, and the processor may implement various functions of the terminal device by running or executing the computer program and/or module stored in the memory and invoking data stored in the memory. The memory may mainly include a storage program area and a storage data area, wherein the storage program area may store an operating system, an application program (such as a sound playing function, an image playing function, etc.) required for at least one function, and the like; the storage data area may store data (such as audio data, phonebook, etc.) created according to the use of the handset, etc. In addition, the memory may include high-speed random access memory, and may also include non-volatile memory, such as a hard disk, memory, plug-in hard disk, smart Media Card (SMC), secure Digital (SD) Card, flash Card (Flash Card), at least one disk storage device, flash memory device, or other volatile solid-state storage device.
Wherein the terminal device integrated modules/units may be stored in a computer readable storage medium if implemented in the form of software functional units and sold or used as stand alone products. Based on such understanding, the present invention may implement all or part of the flow of the method of the above embodiment, or may be implemented by a computer program to instruct related hardware, where the computer program may be stored in a computer readable storage medium, and when the computer program is executed by a processor, the computer program may implement the steps of each of the method embodiments described above. Wherein the computer program comprises computer program code, which may be in the form of code, object code, executable files, or in some intermediate form, etc. The computer readable medium may include: any entity or device capable of carrying the computer program code, a recording medium, a U disk, a removable hard disk, a magnetic disk, an optical disk, a computer Memory, a Read-Only Memory (ROM), a random access Memory (RAM, random Access Memory), an electrical carrier signal, a telecommunications signal, a software distribution medium, and so forth.
It should be noted that the above-described apparatus embodiments are merely illustrative, and the units described as separate units may or may not be physically separate, and units shown as units may or may not be physical units, may be located in one place, or may be distributed over a plurality of network units. Some or all of the modules may be selected according to actual needs to achieve the purpose of the solution of this embodiment. In addition, in the drawings of the embodiment of the device provided by the invention, the connection relation between the modules represents that the modules have communication connection, and can be specifically implemented as one or more communication buses or signal lines. Those of ordinary skill in the art will understand and implement the present invention without undue burden.
While the foregoing is directed to the preferred embodiments of the present invention, it will be appreciated by those skilled in the art that changes and modifications may be made without departing from the principles of the invention, such changes and modifications are also intended to be within the scope of the invention.

Claims (7)

1. A technological information recommendation method based on data analysis, the method comprising:
monitoring and collecting related text information of the technical information to obtain keyword indexes of the content of the technical information;
dividing the technical information content into a plurality of region grades according to the keyword index of the technical information content, wherein different region grades respectively correspond to different grade authorities;
acquiring index values corresponding to keyword indexes of each region level, wherein the index values comprise keyword searching times and searching numbers of different users;
inputting index values corresponding to the keyword indexes of each region level into a pre-established technological innovation analysis model for calculation to obtain a sequencing value of each region level;
analyzing and sorting according to the sorting value of each region level, and respectively storing the technological information content of each region level according to the sorting;
matching keyword fields input by a user with stored technical information contents, and sequencing the matched technical information contents according to the matching degree sequence of the keyword fields;
selecting technological information content with the area level lower than the user authority of the user from the ordered technological information content, and outputting the technological information content;
after selecting the technological information content with the area level lower than the user authority of the user from the ordered technological information contents and outputting the technological information content, the method further comprises the following steps:
displaying related data information of the technological information to the user in a data visualization mode;
the method for displaying the related data information of the technical information to the user through the data visualization specifically comprises the following steps:
acquiring technological information contents of a plurality of regional levels, carrying out data abstraction on index values corresponding to keyword indexes of each regional level, and establishing an information polyhedron data model aiming at an information side face and the user access to realize visualization of the information side face and the user access;
adopting a DataV technology for the polyhedral data model, and visualizing complex association relationship data in the scientific and technological information content;
the information polyhedron data model comprises a user access model, an information side model and a keyword popularity model;
the technological innovation analysis model is that
Wherein,ordering value for regional class, +.>Is the coefficient of the nth index, +.>Is the weight of the nth index,,/>wherein->An n-th index which is an index value of the keyword, < + >>Is the average of the nth index of the keyword index values.
2. The method for recommending technological information based on data analysis according to claim 1, wherein after monitoring and collecting the relevant text information of the technological information, the method further comprises:
and carrying out data preprocessing on the technical information, eliminating data noise and data irrelevant to the technical subject, and carrying out real-time updating on the processed text data information.
3. The method for recommending technological information based on data analysis according to claim 2, wherein the process of performing data preprocessing on the technological information, eliminating data noise and data irrelevant to technological topics, and performing real-time update on the processed text data information specifically comprises:
acquiring field names and values of text information related to input technical information;
generating a list, and traversing data items of text information related to the input technical information;
the first data item is put into the list, and the remaining data items are compared with the values of the data items in the list: if the value of the field in a certain data item is the same as the value of the data item in the list, judging the data item as repeated data; if the values of the fields in a certain data item are different from the values of the data items in the list, judging the data item as non-repeated data, and storing the non-repeated data into the list;
and after traversing, finally taking the data in the list as the technological information content.
4. The data analysis-based technological information recommendation method according to claim 1, wherein the user access model includes a user search category classification, a user authority association table set, and a user access time dimension set;
the information side model comprises a data item set, a data item association table set, a data item time dimension set, a data item region dimension set and a data item category classification, wherein the data item set comprises a data item and a data item keyword set;
the keyword popularity model comprises a keyword set and keyword class classification.
5. A scientific and technological information recommending device based on data analysis, characterized in that the device comprises:
the data acquisition module is used for monitoring and acquiring the related text information of the technical information and acquiring the keyword index of the content of the technical information;
the data analysis module is used for dividing the scientific and technological information content into a plurality of region grades according to the keyword index of the scientific and technological information content, and different region grades respectively correspond to different grade authorities;
the index value acquisition module is used for acquiring index values corresponding to the keyword indexes of the regional grades, wherein the index values comprise the keyword searching times and the searching quantity of different users;
the ranking value calculation module is used for inputting index values corresponding to the keyword indexes of the region grades into a pre-established technological innovation analysis model for calculation to obtain ranking values of the region grades;
the sequencing module is used for analyzing and sequencing according to the sequencing value of each region level, and storing the technological information content of each region level according to the sequencing;
the matching module is used for matching the keyword field input by the user with the stored technical information content and sequencing the matched technical information content according to the matching degree sequence of the keyword field;
the output module is used for selecting the scientific information content with the area level lower than the user authority of the user from the sequenced scientific information contents to output;
the apparatus further comprises a visualization module for:
after selecting the technical information content with the area level lower than the user authority of the user from the ordered technical information content, displaying the technical information related data information to the user in a data visualization mode;
the visualization module is specifically configured to:
acquiring technological information contents of a plurality of regional levels, carrying out data abstraction on index values corresponding to keyword indexes of each regional level, and establishing an information polyhedron data model aiming at an information side face and the user access to realize visualization of the information side face and the user access;
adopting a DataV technology for the polyhedral data model, and visualizing complex association relationship data in the scientific and technological information content;
the information polyhedron data model comprises a user access model, an information side model and a keyword popularity model;
the technological innovation analysis model is that
Wherein,ordering value for regional class, +.>Is the coefficient of the nth index, +.>Is the weight of the nth index,,/>wherein->An n-th index which is an index value of the keyword, < + >>The nth index being a keyword index valueAverage number.
6. A computer readable storage medium, characterized in that the computer readable storage medium comprises a stored computer program, wherein the computer program when run controls a device in which the computer readable storage medium is located to perform the technological information recommendation method based on data analysis according to any one of claims 1 to 4.
7. A terminal device comprising a processor, a memory and a computer program stored in the memory and configured to be executed by the processor, the processor implementing the data analysis-based technological information recommendation method according to any one of claims 1 to 4 when the computer program is executed.
CN202311329773.XA 2023-10-16 2023-10-16 Scientific and technological information recommendation method, device, medium and equipment based on data analysis Active CN117076783B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202311329773.XA CN117076783B (en) 2023-10-16 2023-10-16 Scientific and technological information recommendation method, device, medium and equipment based on data analysis

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202311329773.XA CN117076783B (en) 2023-10-16 2023-10-16 Scientific and technological information recommendation method, device, medium and equipment based on data analysis

Publications (2)

Publication Number Publication Date
CN117076783A CN117076783A (en) 2023-11-17
CN117076783B true CN117076783B (en) 2023-12-26

Family

ID=88702866

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202311329773.XA Active CN117076783B (en) 2023-10-16 2023-10-16 Scientific and technological information recommendation method, device, medium and equipment based on data analysis

Country Status (1)

Country Link
CN (1) CN117076783B (en)

Citations (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106169032A (en) * 2016-09-30 2016-11-30 广东省科技基础条件平台中心 A kind of computational methods of industry cluster Innovation Index
JP2018005305A (en) * 2016-06-27 2018-01-11 Necパーソナルコンピュータ株式会社 Information processing system, information processing device, and program
CN109446247A (en) * 2018-09-12 2019-03-08 石家庄铁道大学 The analysis of scientific and technical innovation class data visualization and methods of exhibiting
CN109559206A (en) * 2018-12-27 2019-04-02 深圳市中电数通智慧安全科技股份有限公司 A kind of regional enterprises Credit Evaluation System method, apparatus and terminal device
CN111382341A (en) * 2020-03-23 2020-07-07 湖南城市学院 Scientific and technological information resource retrieval and query system and method based on big data
CN112989164A (en) * 2021-03-26 2021-06-18 北京金堤征信服务有限公司 Search result processing method and device and electronic equipment
WO2021121106A1 (en) * 2019-12-20 2021-06-24 深圳前海微众银行股份有限公司 Federated learning-based personalized recommendation method, apparatus and device, and medium
CN113643070A (en) * 2021-08-20 2021-11-12 林秀珍 Intelligent information pushing method and system based on big data
CN114841662A (en) * 2022-04-19 2022-08-02 南方电网大数据服务有限公司 Infrastructure construction project management and control method and device, computer equipment and storage medium
WO2023020167A1 (en) * 2021-08-16 2023-02-23 北京字节跳动网络技术有限公司 Information display method and apparatus, computer device, and storage medium
CN116503026A (en) * 2023-06-26 2023-07-28 广东省科技基础条件平台中心 Operation and maintenance risk assessment method, system and storage medium for science and technology items

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103714088A (en) * 2012-10-09 2014-04-09 深圳市世纪光速信息技术有限公司 Method for acquiring search terms, server and method and system for recommending search terms
US10437898B2 (en) * 2015-05-04 2019-10-08 Dac Group (Holdings) Limited Systems and methods for targeted content presentation based on search query analysis

Patent Citations (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2018005305A (en) * 2016-06-27 2018-01-11 Necパーソナルコンピュータ株式会社 Information processing system, information processing device, and program
CN106169032A (en) * 2016-09-30 2016-11-30 广东省科技基础条件平台中心 A kind of computational methods of industry cluster Innovation Index
CN109446247A (en) * 2018-09-12 2019-03-08 石家庄铁道大学 The analysis of scientific and technical innovation class data visualization and methods of exhibiting
CN109559206A (en) * 2018-12-27 2019-04-02 深圳市中电数通智慧安全科技股份有限公司 A kind of regional enterprises Credit Evaluation System method, apparatus and terminal device
WO2021121106A1 (en) * 2019-12-20 2021-06-24 深圳前海微众银行股份有限公司 Federated learning-based personalized recommendation method, apparatus and device, and medium
CN111382341A (en) * 2020-03-23 2020-07-07 湖南城市学院 Scientific and technological information resource retrieval and query system and method based on big data
CN112989164A (en) * 2021-03-26 2021-06-18 北京金堤征信服务有限公司 Search result processing method and device and electronic equipment
WO2023020167A1 (en) * 2021-08-16 2023-02-23 北京字节跳动网络技术有限公司 Information display method and apparatus, computer device, and storage medium
CN113643070A (en) * 2021-08-20 2021-11-12 林秀珍 Intelligent information pushing method and system based on big data
CN114841662A (en) * 2022-04-19 2022-08-02 南方电网大数据服务有限公司 Infrastructure construction project management and control method and device, computer equipment and storage medium
CN116503026A (en) * 2023-06-26 2023-07-28 广东省科技基础条件平台中心 Operation and maintenance risk assessment method, system and storage medium for science and technology items

Also Published As

Publication number Publication date
CN117076783A (en) 2023-11-17

Similar Documents

Publication Publication Date Title
CN111753198A (en) Information recommendation method and device, electronic equipment and readable storage medium
CN103778548A (en) Goods information and keyword matching method, and goods information releasing method and device
CN109325121B (en) Method and device for determining keywords of text
US11514124B2 (en) Personalizing a search query using social media
WO2011087904A1 (en) Matching of advertising sources and keyword sets in online commerce platforms
TWI705411B (en) Method and device for identifying users with social business characteristics
CN111259220B (en) Data acquisition method and system based on big data
CN113836131A (en) Big data cleaning method and device, computer equipment and storage medium
CN113435859A (en) Letter processing method and device, electronic equipment and computer readable medium
CN105786810B (en) The method for building up and device of classification mapping relations
CN113221535B (en) Information processing method, device, computer equipment and storage medium
Rai et al. Using open source intelligence as a tool for reliable web searching
CN114429265A (en) Enterprise portrait service construction method, device and equipment based on grid technology
CN117076783B (en) Scientific and technological information recommendation method, device, medium and equipment based on data analysis
CN116485019A (en) Data processing method and device
CN114780712B (en) News thematic generation method and device based on quality evaluation
CN111026940A (en) Network public opinion and risk information monitoring system and electronic equipment for power grid electromagnetic environment
US11074486B2 (en) Query analysis using deep neural net classification
JPH08305724A (en) Device for managing design supporting information document
CN112328752B (en) Course recommendation method and device based on search content, computer equipment and medium
CN114003787A (en) Data visualization method based on artificial intelligence and related equipment
CN111784069B (en) User preference prediction method, device, equipment and storage medium
CN117056392A (en) Big data retrieval service system and method based on dynamic hypergraph technology
CN115618034A (en) Mapping application of machine learning model to answer queries according to semantic specifications
US11157532B2 (en) Hierarchical target centric pattern generation

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant