CN117407429A - Park information retrieval method, device, computer equipment and storage medium - Google Patents

Park information retrieval method, device, computer equipment and storage medium Download PDF

Info

Publication number
CN117407429A
CN117407429A CN202311604225.3A CN202311604225A CN117407429A CN 117407429 A CN117407429 A CN 117407429A CN 202311604225 A CN202311604225 A CN 202311604225A CN 117407429 A CN117407429 A CN 117407429A
Authority
CN
China
Prior art keywords
information
park
campus
enterprise
address
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202311604225.3A
Other languages
Chinese (zh)
Inventor
周立运
请求不公布姓名
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Rubik's Cube Medical Technology Suzhou Co ltd
Original Assignee
Rubik's Cube Medical Technology Suzhou Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Rubik's Cube Medical Technology Suzhou Co ltd filed Critical Rubik's Cube Medical Technology Suzhou Co ltd
Priority to CN202311604225.3A priority Critical patent/CN117407429A/en
Publication of CN117407429A publication Critical patent/CN117407429A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2455Query execution
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/242Query formulation
    • G06F16/243Natural language query formulation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2458Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
    • G06F16/2465Query processing support for facilitating data mining operations in structured databases
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02PCLIMATE CHANGE MITIGATION TECHNOLOGIES IN THE PRODUCTION OR PROCESSING OF GOODS
    • Y02P90/00Enabling technologies with a potential contribution to greenhouse gas [GHG] emissions mitigation
    • Y02P90/30Computing systems specially adapted for manufacturing

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Computational Linguistics (AREA)
  • Mathematical Physics (AREA)
  • Artificial Intelligence (AREA)
  • Fuzzy Systems (AREA)
  • Probability & Statistics with Applications (AREA)
  • Software Systems (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention provides a park information retrieval method, a device, computer equipment and a storage medium, wherein the method comprises the following steps: receiving a park information retrieval request carrying a park information retrieval word; responding to a park information retrieval request, and screening out park information matched with a park information retrieval word from a pre-constructed park information database to serve as target park information; the park information database is constructed by mining the relevance between the park and the enterprise; and sending the target park information to the client for the client to display. By adopting the method and the device, the efficiency of acquiring the park information can be remarkably improved, and the retrieval requirement of users on the park information can be met.

Description

Park information retrieval method, device, computer equipment and storage medium
Technical Field
The embodiment of the invention relates to the technical field of computers, in particular to a park information retrieval method, a device, computer equipment and a storage medium.
Background
An industrial park refers to an economic aggregation area where enterprises and institutions of a certain scale and number are intensively built, organized and managed in a specific area to promote industrial development, resource aggregation and innovation cooperation. Knowledge of industrial park information is an indispensable reference basis for investors, entrepreneurs and government decision makers.
In particular, when investors or entrepreneurs consider investment or entrepreneurs, knowing information about basic conditions, policy support, industry set-ups, etc. of an industrial park is a very important reference factor, because they need to obtain relevant information about development planning, benefit analysis, market prospects, etc. of the park to evaluate risk and industry development potential. For enterprises operating in industrial parks, knowing information such as conditions and market competitiveness of other enterprises in the same industry is a basis for pushing cooperation and sharing resources, and the enterprises can search for cooperation opportunities by acquiring contact information, project information, research and development achievements and the like of the enterprises in the parks, so as to jointly develop technical innovation and market expansion. Furthermore, government agencies are often required to know the status of park development, operational effectiveness, and park business needs to develop more accurate and effective support policies and management measures. The information such as economic data, industry development scale and technical level of enterprises in the park is mastered, and a reference basis for scientific decision can be provided for government departments.
However, at present, the industrial park information is generally obtained by means of in-situ access, media report or official information formula platform inquiry, so that the method has certain limitations and time delay, and the information obtaining efficiency is low.
Disclosure of Invention
The invention aims to provide a park information retrieval method, a device, computer equipment and a storage medium, which are used for constructing a park information database by combining an artificial intelligence technology to collect and clean detailed information of an industrial park, so that comprehensive and systematic data support is provided for related institutions, information collection time and cost are saved, and information acquisition efficiency is improved.
In a first aspect, the present invention provides a method for retrieving campus information, including:
receiving a park information retrieval request carrying a park information retrieval word;
responding to a park information retrieval request, and screening out park information matched with a park information retrieval word from a pre-constructed park information database to serve as target park information; the park information database is constructed by mining the relevance between the park and the enterprise;
and sending the target park information to the client for the client to display.
In some embodiments of the present invention, before selecting the campus information matching with the campus information search word in the pre-constructed campus information database as the target campus information in response to the campus information search request, the method further includes: acquiring information business information; analyzing information business information to clean each industrial park to obtain park list information; carrying out enterprise association on each industrial park in the park list information one by one to construct a key value pair between the park and the enterprise so as to obtain park information; and constructing and obtaining a park information database based on the park information.
In some embodiments of the present invention, the information business information includes information, and analyzing the information business information to clean each industrial park to obtain park list information, including: data cleaning is carried out on the information so as to extract each industrial park in the information; acquiring park attribute information of each industrial park as park list information; among other things, campus attribute information includes, but is not limited to: at least one of a park name, a superior park name, park province information, park city information, number of resident enterprises, and park address information.
In some embodiments of the present invention, the information business information further includes business information, the business information is business information of a preset enterprise object, and the step of obtaining the campus list information further includes: aiming at the business information, cleaning the enterprise address information of a preset enterprise object; classifying and counting the enterprise address information to take the enterprise address information with the statistical quantity larger than or equal to a preset threshold value as target enterprise address information; and taking the target enterprise address information as park address information to update the park list information.
In some embodiments of the present invention, enterprise association is performed on each industrial park in the park list information one by one to construct a key value pair between the park and the enterprise, so as to obtain park information, including: extracting park address information of each industrial park in the park list information; performing word segmentation processing on the park address information to obtain park address word segmentation; and carrying out association matching on the park address word and prestored enterprise information to construct a key value pair between the park and the enterprise, so as to obtain park information.
In some embodiments of the invention, the campus information retrieval method further comprises: acquiring a park accurate address of each industrial park in park list information; wherein, the accurate address of garden includes at least one of the following: park name, park level address, park peripheral address; acquiring enterprise business information; and screening out target enterprises matched with the accurate addresses of the parks according to the enterprise business information so as to update key value pairs between the parks and the enterprises and obtain the park information.
In some embodiments of the present invention, building a campus information database based on the campus information includes: acquiring resource integration information aiming at enterprises corresponding to and associated with each industrial park in park information; wherein the resource integration information includes, but is not limited to: at least one of a resource engagement time, a resource engagement event, and a resource engagement amount; and marking each industrial park and related enterprises in the park information based on the resource integration information to obtain a park information database.
In a second aspect, the present invention provides a campus information retrieval device, comprising:
the request receiving module is used for receiving a park information retrieval request carrying a park information retrieval word;
The request response module is used for responding to the park information retrieval request, and screening out park information matched with the park information retrieval words from a pre-constructed park information database to serve as target park information; the park information database is constructed by mining the relevance between the park and the enterprise;
and the information sending module is used for sending the target park information to the client for the client to display.
In a third aspect, the present invention also provides a computer device comprising:
one or more processors;
a memory; and one or more applications, wherein the one or more applications are stored in the memory and configured to be executed by the processor to implement the campus information retrieval method described above.
In a fourth aspect, the invention also provides a computer readable storage medium having stored thereon a computer program for loading by a processor to perform steps in a campus information retrieval method.
In a fifth aspect, embodiments of the present invention provide a computer program product or computer program comprising computer instructions stored in a computer readable storage medium. The processor of the computer device reads the computer instructions from the computer-readable storage medium, and the processor executes the computer instructions, so that the computer device performs the method provided in the first aspect.
According to the park information retrieval method, device, computer equipment and storage medium, the server firstly builds the park information database by excavating the relevance between the park and the enterprise, so that the comprehensiveness and accuracy of recorded park information can be ensured, then the target park information matched with the park information retrieval words can be screened out from the pre-built park information database by receiving and responding to the park information retrieval request carrying the park information retrieval words, and then the target park information is sent to the client for the client to display, so that the time and cost for searching information for a user are saved, and the park information acquisition efficiency is improved.
Drawings
In order to more clearly illustrate the technical solutions of the embodiments of the present invention, the drawings that are needed in the description of the embodiments will be briefly described below, it being obvious that the drawings in the following description are only some embodiments of the present invention, and that other drawings may be obtained according to these drawings without inventive effort for a person skilled in the art.
Fig. 1 is a schematic view of a scenario of a campus information retrieval method according to an embodiment of the present invention;
FIG. 2 is a flow chart of a method for retrieving campus information according to an embodiment of the invention;
FIG. 3 is a schematic diagram of a campus information retrieval device according to an embodiment of the present invention;
fig. 4 is a schematic structural diagram of a computer device in an embodiment of the present invention.
Detailed Description
The following description of the embodiments of the present invention will be made clearly and fully with reference to the accompanying drawings, in which it is evident that the embodiments described are only some, but not all, of the embodiments of the present application. All other embodiments, which can be made by those skilled in the art based on the embodiments herein without making any inventive effort, are intended to be within the scope of the present application.
It should be noted that in the description of the present application, the terms "first," "second," and the like are used for descriptive purposes only and are not to be construed as indicating or implying a relative importance or implying a number of technical features which is being indicated. Thus, a feature defining "a first" or "a second" may explicitly or implicitly include one or more of the described features. In the description of the present application, the meaning of "a plurality" is two or more, unless explicitly defined otherwise.
Meanwhile, the campus information retrieval method provided by the embodiment of the invention can be applied to a campus information retrieval system shown in fig. 1. Wherein the campus information retrieval system includes a client 102 and a server 104. The client 102 may be a device that includes both receive and transmit hardware, i.e., a device having receive and transmit hardware capable of performing bi-directional communications over a bi-directional communication link. Such a device may include: a cellular or other communication device having a single-line display or a multi-line display or a cellular or other communication device without a multi-line display. The client 102 may be a desktop terminal or a mobile terminal, and the client 102 may be one of a mobile phone, a tablet computer, and a notebook computer. The server 104 may be a stand-alone server, or may be a server network or a server cluster of servers, including but not limited to a computer, a network host, a single network server, a set of multiple network servers, or a cloud server of multiple servers. Wherein the Cloud server is composed of a large number of computers or web servers based on Cloud Computing (Cloud Computing). In addition, the client 102 and the server 104 establish a communication connection through a network, and the network may specifically be any one of a wide area network, a local area network, and a metropolitan area network.
In addition, it will be appreciated by those skilled in the art that the application environment shown in fig. 1 is only one application scenario applicable to the present application and is not limited to the application scenario of the present application, and other application environments may include more or fewer devices than those shown in fig. 1. For example, only 1 server is shown in fig. 1. It will be appreciated that the campus information retrieval system may also include one or more other devices, particularly without limitation. Additionally, the campus information retrieval system may also include a memory for storing data, such as storing the campus information.
Of course, the schematic view of the scenario of the campus information retrieval system shown in fig. 1 is only an example, and the campus information retrieval system and the scenario described in the embodiments of the present invention are for more clearly describing the technical solutions of the embodiments of the present invention, and do not constitute a limitation on the technical solutions provided by the embodiments of the present invention, and those skilled in the art can know that, with the evolution of the campus information retrieval system and the appearance of new service scenarios, the technical solutions provided by the embodiments of the present invention are equally applicable to similar technical problems.
With the explosive development and economic transformation and upgrading of industrial parks, construction of industrial park databases has become imperative. The importance and the value of the method are not only embodied in providing accurate basis for government decision, but also in promoting interconnection and intercommunication of enterprises, optimizing resource allocation and promoting industry collaborative development. However, at present, a service platform for recording detailed information of industrial parks produced in various places throughout the country is rarely available on the market, and most of the service platforms, the operators of the industrial parks themselves, build official websites to record and provide information such as basic profiles, development plans, policies and regulations of the industrial parks. Although government related departments and industry related associations and businesses may issue related reports and announcements to provide information such as campus dynamics and policy updates, the information still has certain limitations, and problems such as outdated information and errors are prone to occur, so that it is inevitable that large labor cost and time cost are required for an information acquirer.
Based on the above, from the viewpoint of improving the efficiency of acquiring the campus information and the accuracy and comprehensiveness of the information, the embodiment of the invention provides that a campus information database containing the detailed information of all industrial parks nationwide can be constructed, so that the comprehensive, accurate and real-time collection, integration and display of the information of all industrial parks and resident enterprises are realized, and accurate and efficient information services are provided for industry decision makers, investors and other interested parties. The technical scheme of the present invention will be described in detail with reference to the accompanying drawings.
Referring to fig. 2, a flow chart of a method for retrieving campus information according to an embodiment of the present invention is mainly illustrated by the method being applied to the server 104 in fig. 1, and the method includes steps S201 to S203, specifically as follows:
s201, receiving a park information retrieval request carrying a park information retrieval word.
Since the term is generally a related vocabulary for summarizing the content to be searched, the term for the campus information may be a related vocabulary for summarizing the corresponding campus information of an industrial campus to be searched, and may be used in the information searching platform for the user to search for the specified campus information. Park information retrieval words include, but are not limited to: park names (e.g., zhang Jianggao scientific and technological parks, chinese medicine City), park provinces (e.g., shanghai, beijing), park cities (e.g., nanjing, suzhou), resident enterprises (e.g., 1-20, 21-50), etc.
In particular implementations, the user may send a campus information retrieval request to the server 104 via the client 102, or may send a campus information retrieval request to the server 104 via other devices. The "other device" mentioned herein may be a device that has no communication connection with the client 102, or may be a device that has a communication connection with the client 102, which is not limited by the embodiment of the present invention.
Illustratively, after the server 104 renders the search page of the campus information search platform to the client 102 for display, the user can then determine at least one campus information search term through the campus information search page displayed on the display screen of the client 102, and then the client 102 generates a campus information search request for one or more campus information search terms submitted by the user, and sends the request to the server 104, and instructs the server 104 to respond to the request, and obtain the target campus information corresponding to a certain industrial campus required by the user. The determining manner of the campus information search term mentioned in this embodiment includes, but is not limited to: clicking, double clicking or long pressing the preset candidate search term currently displayed on the display screen, or inputting a non-preset search term.
S202, in response to a park information retrieval request, the park information matched with the park information retrieval word is screened out from a pre-built park information database to serve as target park information; the park information database is constructed by mining the relevance between the park and the enterprise.
Where a campus may refer to a defined geographic area, typically with specific industrial and functional locations, providing business, production, and innovative infrastructure and service support for a range of businesses. For example, the campus in embodiments of the present invention may be an industrial campus, a scientific campus, an economic development area, and so on. The business may be an individual business, partner business, finite liability company, stock finite company, etc.
In particular implementations, after receiving the campus information search request, the server 104 may respond to the request to screen out the campus information matching the campus information search word from the campus information database as the target campus information required by the user. Here, the target campus information may include at least one of campus attribute information (including, but not limited to, at least one of a campus name, a superior campus name, a campus province information, a campus city information, a number of resident enterprises), and enterprise attribute information (including, but not limited to, at least one of enterprise business information, enterprise know-how information (e.g., patent information, trademark information, etc.), enterprise investment information (e.g., a historical cumulative financing size, a current annual financing size, a latest financing course, etc.), news information, etc., of each enterprise resident in the corresponding campus.
For example, after the server 104 analyzes the request, the industrial park whose park address is in "Changzhou city" can be screened from the park information database, and if only the biomedical park is recorded in the park information database, the screening result is output as follows: changzhou life health industry park, west Taihu medical industry hatchery park, first industry park-Changzhou.
In one embodiment, before step S202, further includes: acquiring information business information; analyzing information business information to clean each industrial park to obtain park list information; carrying out enterprise association on each industrial park in the park list information one by one to construct a key value pair between the park and the enterprise so as to obtain park information; and constructing and obtaining a park information database based on the park information.
The information and business information may be a generic term of information and business information after being combined, and the processing steps of the information and business information will be described in detail below. Key-value pairs are a data structure in computer programming that consists of two parts: a key and a value associated therewith; a key (key) is an identifier or index that is used to uniquely identify and access a value (value). A key is typically a string, integer, or other hashed data type; the value (value) is data associated with the key. The key-value pair establishes a mapping relation between keys and values, and corresponding values can be quickly accessed and acquired through the keys.
In particular, considering that the data update mechanism of the current data presentation platform may be imperfect, such as some of the latest enterprise information, campus changes, etc. are not reflected in real time, in this embodiment, the server 104 may first periodically obtain information business information known to the public in a certain range, such as collecting various data sources that may include campus-related information in a nationwide range, in a compliance manner. The collected information is then preprocessed, including noise data removal, standardized text formats, and the like. Finally, executing an entity identification step to clean out the industrial park names and the different names thereof meeting preset conditions from the information industry and commerce information to form park list information. Or directly displaying the park list information in the information business information, for example, a certain official website is provided with the park list information in an excel format, and the server 104 directly reads the document content after downloading to obtain the park list information.
Further, after the server 104 analyzes the information business information to obtain the campus list information, enterprise association can be performed on each industrial campus included in the campus list information one by one, that is, it is determined which enterprises in each campus reside in particular, so as to construct a key value pair (i.e., enterprise list information) between each campus and each enterprise, thereby collecting the detail information of each campus and storing the detail information in a database, named as a campus information database, for searching and querying by a user.
It should be noted that there are a plurality of preset conditions described above, and each preset condition is set for each information type of information industry and commerce, and specifically will be described in the following embodiments. Meanwhile, the industrial park described above may be scaled down according to the application requirements, for example, a biomedical park will be explained as an example. In addition, the user in the embodiment of the invention can be an investment institution, a park manager or an enterprise manager.
Specifically, the embodiment of the invention provides the created park information database, which can be used for an investment institution to analyze the policy bonus, the land edge advantage, the return on investment and the like of each park, help the investment institution save investigation time, shorten project investigation period and improve the park information retrieval efficiency; secondly, the intelligent park is used for analyzing the potential and advantages of the intelligent park, monitoring the development condition of the resident enterprises and the like, so that better quotation and financing are achieved, the intelligent park is operated more efficiently, and the intelligent park is managed accurately; thirdly, the method can be used for an enterprise to select a park area to be resided, analyzing enterprise patterns, talent density and the like of each park, predicting park potential, providing data guidance for enterprise development and helping the enterprise to save operation cost.
In summary, the embodiment of the invention provides the established campus information database, which not only has perfect industrial chains and realizes the chain development of mutual matching, separate work and writing and mutual promotion in the campus, but also can play an advantageous role in industry, strengthen the clustering effect and realize the gathering development with high concentration of industrial racetracks.
In addition, through reliable database information, not only can help the user clearly know the state of each garden and inside enterprise, help the enterprise to make more intelligent decision in the aspect of planning show, regional layout etc., avoid because of obtaining the misjudgement that information is incomplete or inaccurate produces, but also can help enterprise and investor find matched garden and enterprise fast, reduce market contact cost, improve decision-making efficiency. Moreover, through disclosing transparent information, resources can be guided to flow to a high-quality industrial park and a powerful resident enterprise, so that a market mechanism plays a larger role in resource allocation, and authorities can also utilize the database to conduct policy research and formulation, such as industry planning, policy support and the like, so that the technical problems of low information acquisition efficiency, lack of comprehensiveness, reliability, timeliness and the like are solved in a subjective way, other commercial effects are brought objectively while corresponding technical effects are brought, development potential of the power-assisted mining industry is brought, and industrial upgrading is promoted.
In one embodiment, the information business information includes information, and analyzing the information business information to clean each industrial park to obtain park list information includes: data cleaning is carried out on the information so as to extract each industrial park in the information; acquiring park attribute information of each industrial park as park list information; among other things, campus attribute information includes, but is not limited to: at least one of a park name, a superior park name, park province information, park city information, number of resident enterprises, and park address information.
The information can be news information which is acquired from various big information websites through a web crawler at intervals. The campus keyword may be a term related to the campus, such as "campus", "industrial garden", "scientific garden", etc.
In a specific implementation, the campus data cleaning for information may be to extract each industrial campus in the information based on a preset campus keyword, and bind the names of each industrial campus and its different names (such as the names of the parks are referred to as "the development area of the high and new technology of the combined fertilizer" and the different names of the parks are referred to as "the high and new area of the combined fertilizer"). Here, the association relationship between each campus name and its synonym may be determined by context semantic analysis or similarity analysis.
In particular, the server 104 may extract keywords and phrases related to the campus by analyzing the identified context information of the campus name and synonym in the original text, and then create feature vectors for the campus based on the extracted keywords and phrases. Here, a conventional text feature extraction method such as TF-IDF (word frequency-inverse document frequency) or a word embedding model based on deep learning may be used. And then, comparing the similarity between different texts by using a similarity calculation method, such as cosine similarity, and finding out the names and the synonyms of the parks with similar feature vectors. And finally, establishing the association relationship between the park name and the different names according to the similarity calculation result.
Further, after the campus names and the different names of the industrial parks are obtained through analysis, at least one of the campus names, the superior campus names, the campus province information, the campus city information, the number of resident enterprises and the campus address information can be obtained based on the campus names and the different names of the industrial parks, the keyword extraction technology can be adopted as the campus attribute information in the obtaining mode, then the campus names are used as keys, and meanwhile the different names and the campus attribute information except for the campus names are used as values, so that the campus list information is correspondingly generated.
In one embodiment, the information business information further includes business information, the business information is business information of a preset enterprise object, and the step of obtaining the campus list information further includes: aiming at the business information, cleaning the enterprise address information of a preset enterprise object; classifying and counting the enterprise address information to take the enterprise address information with the statistical quantity larger than or equal to a preset threshold value as target enterprise address information; and taking the target enterprise address information as park address information to update the park list information.
The business information may be business information collected for a preset enterprise object. Here, the preset enterprise object may be set according to actual business requirements, for example, the preset enterprise object may be an innovative medicine enterprise.
In a specific implementation, the server 104 is to wash out the campus list information from the massive data, and can set and wash the keyword for the information, and also can wash the address for the business information of the preset enterprise object, that is, by analyzing the business information, the business address information of all the preset enterprise objects is classified and counted, and then the business address information with the statistical number greater than or equal to the preset threshold value is used as the target business address information, so that the target business address information is directly determined to be the campus address information, and the information is newly added to the campus list information to improve the comprehensiveness of the information.
For example, according to the current business requirement, the preset enterprise object includes: the innovation medicine enterprise A, the innovation medicine enterprise B, the innovation medicine enterprise C, the innovation medicine enterprise D and the innovation medicine enterprise E can be further cleaned to obtain the registration addresses of the five innovation medicine enterprises by acquiring the industrial and commercial information of the five innovation medicine enterprises, at the moment, classification statistics are carried out to obtain the registration addresses of the innovation medicine enterprise A, the innovation medicine enterprise D and the innovation medicine enterprise E which are xx province xx city area xx street number 218, the registration address of the innovation medicine enterprise B is yy province yy area yy street number 100, the registration address of the innovation medicine enterprise C is zz area zz street number 316 of zz province zz city, and the preset threshold is 3, so that the 'xx province xx city xx street number 218' is a high probability biological medicine industry garden, and the target enterprise address information is not recorded in the garden list information, namely the target enterprise address information is utilized to obtain the attribute information of the updated area enterprise list.
In one embodiment, enterprise association is performed on each industrial park in the park list information one by one to construct a key value pair between the park and the enterprise to obtain park information, including: extracting park address information of each industrial park in the park list information; performing word segmentation processing on the park address information to obtain park address word segmentation; and carrying out association matching on the park address word and prestored enterprise information to construct a key value pair between the park and the enterprise, so as to obtain park information.
In this embodiment, the innovative drug enterprise is still taken as an example for explanation. The pre-stored enterprise information is enterprise registration address information obtained by analyzing and mining based on the industrial and commercial information in the early stage. Here, the enterprise information pre-stored by the server 104 may be enterprise information of a specific domain for different domains.
In a specific implementation, after each campus is obtained by cleaning the server 104 based on the steps, and the campus list information is obtained, the parks list information can be further utilized to determine the enterprise where each industrial park is resided in, so that the association binding between the parks and the enterprise is realized, and the data management is convenient.
Specifically, there are two ways to construct a key value pair between a campus and an enterprise, namely, automatically matching the enterprise, and actively monitoring a newly added enterprise. Here, the first way will be described in detail, that is, word segmentation processing is performed on the campus address information first, where word segmentation processing may be performed using a deep learning model, such as a round robin (Recurrent Neural Networks, RNN), long Short Term Memory (LSTM), and transform-like big data driven model, which can automatically learn the relationship between words and contexts to perform word segmentation prediction and segmentation. Then, the association matching is carried out by utilizing the park address word segmentation and the prestored enterprise information so as to endow the park label with the innovative medicine enterprise existing at the current moment, and the key value pair structure between the park and the enterprise is realized and used as park information.
For example, a piece of campus information consists of the following fields: park name, park level address, business name, province, city, business address.
In one embodiment, the campus information retrieval method further comprises: acquiring a park accurate address of each industrial park in park list information; wherein, the accurate address of garden includes at least one of the following: park name, park level address, park peripheral address; acquiring enterprise business information; and screening out target enterprises matched with the accurate addresses of the parks according to the enterprise business information so as to update key value pairs between the parks and the enterprises and obtain the park information.
In a specific implementation, as the number of enterprises is continuously increasing, in order to ensure the information comprehensiveness of the campus information database, the server 104 is required to update data periodically or irregularly, so as to add the newly registered enterprises meeting the recording conditions of the campus information database to the database, and the data update mode is active monitoring.
Specifically, for the mode of actively monitoring the newly added enterprise, the embodiment of the invention provides that the enterprise business information can be obtained to be used as a data source to be screened, then the accurate address of each industrial park existing in the park list information is used as a keyword, the keyword extraction is carried out on the enterprise business information, and the main extraction object is the registered address and/or the enterprise name in the enterprise business information, so that the update of the park information is realized.
In addition, the enterprise can be associated with the park by analyzing the registration mechanism in the enterprise business information, because the business bureau is correspondingly arranged in the part of the park, so that the enterprise which is newly added but does not pass through the park and is associated with the park by the accurate address of the park can be analyzed by the registration mechanism, and the enterprise is associated and bound with the park pointed by the registration mechanism, so that the integrity of the information is ensured.
In one embodiment, building a campus information database based on the campus information includes: acquiring resource integration information aiming at enterprises corresponding to and associated with each industrial park in park information; wherein the resource integration information includes, but is not limited to: at least one of a resource engagement time, a resource engagement event, and a resource engagement amount; and marking each industrial park and related enterprises in the park information based on the resource integration information to obtain a park information database.
The resource integration time, the resource integration event and the resource integration amount are all specific to enterprises. The resource integration time may refer to a specific point in time or period of time when the virtual resource integration activity occurs. The resource integration event may refer to a virtual resource integration activity, and an embodiment of the activity form is not specifically limited herein. The resource engagement amount may refer to a total value of resources involved in the virtual resource engagement activity.
In a specific implementation, in order to enable a user to fully acquire park detail information, the embodiment of the invention provides that resource integration information of an enterprise can be acquired, so that park potential can be predicted more accurately, and further expansion of park attributes is realized. Therefore, the resource integration time, the resource integration event and/or the resource integration amount can be obtained through integrating enterprise business information and information, then the resource integration information of each enterprise associated with a park is summarized according to the association relation between the park and the enterprise, the resource integration information of the park can be obtained, and finally the resource integration scale, the number of the resource integration events and the like can be used as search fields for a user to carry out deep search. In addition, by integrating the information, a label can be given to an enterprise, for example, an enterprise of a ranking topN obtained by screening a certain field is given with a preset label, so that a user can better position key information, and the investigation cost and the investigation period of the user are saved.
And S203, the target park information is sent to the client side for the client side to display.
In a specific implementation, after the server 104 queries and obtains the target campus information required by the user, the target campus information can be sent to the client 102, so that the client 102 displays the target campus information to the user through the display screen thereof, thereby meeting the requirement of searching the campus information of the user and reducing the time and cost for searching the information for the user.
It will be appreciated that the presentation of the target campus information on the client 102 includes, but is not limited to: list, grid view, summary card, map view, chart or graphic, etc. Wherein, the list is a simple list which arranges the search results according to the sequence number or importance, and each item can comprise information such as title, abstract, link and the like. The grid view shows the search result in a grid form, and each item has a box, including a title, a picture, a brief description, and the like. The abstract card displays each search result as a small card, and comprises information such as a title, an abstract, a link and the like; cards are generally arranged in a vertical or horizontal orientation, but may also be arranged in an angled or stacked orientation, and embodiments of the invention are not limited. The map view may present the results in a map manner; for example, the target campus information may be presented based on the divided areas on the map. The chart or graphic may graphically, or visually present the target flow information.
According to the park information retrieval method in the embodiment, the server firstly builds the park information database by excavating the relevance between the park and the enterprise, so that the comprehensiveness and the accuracy of recorded park information can be ensured, then, the target park information matched with the park information retrieval words can be screened out from the pre-built park information database by receiving and responding to the park information retrieval request carrying the park information retrieval words, and then, the target park information is sent to the client for display by the client, the time and the cost for searching information are saved for a user, and the park information acquisition efficiency is improved.
It should be understood that, although the steps in the flowchart of fig. 2 are shown in sequence as indicated by the arrows, the steps are not necessarily performed in sequence as indicated by the arrows. The steps are not strictly limited to the order of execution unless explicitly recited herein, and the steps may be executed in other orders. Moreover, at least some of the steps in fig. 2 may include multiple sub-steps or stages that are not necessarily performed at the same time, but may be performed at different times, nor do the order in which the sub-steps or stages are performed necessarily performed in sequence, but may be performed alternately or alternately with at least a portion of the sub-steps or stages of other steps or other steps.
In order to better implement the campus information retrieval method provided in the embodiment of the present invention, on the basis of the campus information retrieval method provided in the embodiment of the present invention, the embodiment of the present invention further provides a campus information retrieval device, as shown in fig. 3, where the campus information retrieval device 300 includes:
a request receiving module 310, configured to receive a campus information search request carrying a campus information search term;
A request response module 320, configured to respond to the campus information search request, and screen the campus information matched with the campus information search word from the pre-constructed campus information database as target campus information; the park information database is constructed by mining the relevance between the park and the enterprise;
the information sending module 330 is configured to send the target campus information to the client for the client to display.
In one embodiment, the campus information retrieval device 300 further includes a database construction module for obtaining information about the information industry and commerce; analyzing information business information to clean each industrial park to obtain park list information; carrying out enterprise association on each industrial park in the park list information one by one to construct a key value pair between the park and the enterprise so as to obtain park information; and constructing and obtaining a park information database based on the park information.
In one embodiment, the information business information comprises information, and the database construction module is further used for performing data cleaning on the information to extract each industrial park in the information; acquiring park attribute information of each industrial park as park list information; among other things, campus attribute information includes, but is not limited to: at least one of a park name, a superior park name, park province information, park city information, number of resident enterprises, and park address information.
In one embodiment, the information business information further includes business information, wherein the business information is business information of a preset business object, and the database construction module is further configured to clean business address information of the preset business object for the business information; classifying and counting the enterprise address information to take the enterprise address information with the statistical quantity larger than or equal to a preset threshold value as target enterprise address information; and taking the target enterprise address information as park address information to update the park list information.
In one embodiment, the database construction module is further configured to extract campus address information of each industrial campus in the campus list information; performing word segmentation processing on the park address information to obtain park address word segmentation; and carrying out association matching on the park address word and prestored enterprise information to construct a key value pair between the park and the enterprise, so as to obtain park information.
In one embodiment, the database construction module is further configured to obtain a campus precision address of each industrial campus in the campus list information; wherein, the accurate address of garden includes at least one of the following: park name, park level address, park peripheral address; acquiring enterprise business information; and screening out target enterprises matched with the accurate addresses of the parks according to the enterprise business information so as to update key value pairs between the parks and the enterprises and obtain the park information.
In one embodiment, the database construction module is further configured to obtain resource integration information for enterprises associated with each industrial park in the park information; wherein the resource integration information includes, but is not limited to: at least one of a resource engagement time, a resource engagement event, and a resource engagement amount; and marking each industrial park and related enterprises in the park information based on the resource integration information to obtain a park information database.
In the above embodiment, the server firstly builds the park information database by mining the relevance between the park and the enterprise, so that the comprehensiveness and accuracy of the recorded park information can be ensured, then the target park information matched with the park information retrieval words can be screened out from the pre-built park information database by receiving and responding to the park information retrieval request carrying the park information retrieval words, and then the target park information is sent to the client for the client to display, so that the time and cost for searching the information for the user are saved, and the park information acquisition efficiency is improved.
It should be noted that, the specific limitation of the campus information retrieval device may be referred to the limitation of the method for retrieving the campus information hereinabove, and will not be described herein. The various modules in the campus information retrieval device described above may be implemented in whole or in part by software, hardware, and combinations thereof. The above modules may be embedded in hardware or independent of a processor in the electronic device, or may be stored in software in a memory in the electronic device, so that the processor may call and execute operations corresponding to the above modules.
In some embodiments of the present application, the campus information retrieval apparatus 300 may be implemented in the form of a computer program that is executable on a computer device such as that shown in fig. 4. The memory of the computer device may store various program modules constituting the campus information retrieval apparatus 300, such as the request receiving module 310, the request responding module 320, and the information transmitting module 330 shown in fig. 3; the computer program of each program module causes the processor to execute the steps in the campus information retrieval method of each embodiment of the present application described in the present specification. For example, the computer apparatus shown in fig. 4 may perform step S201 through the request receiving module 310 in the campus information retrieval apparatus 300 shown in fig. 3. The computer device may perform step S202 through the request response module 320. The computer device may perform step S203 through the information transmission module 330. The computer device includes a processor, a memory, and a network interface coupled by a system bus. Wherein the processor of the computer device is configured to provide computing and control capabilities. The memory of the computer device includes a non-volatile storage medium and an internal memory. The non-volatile storage medium stores an operating system and a computer program. The internal memory provides an environment for the operation of the operating system and computer programs in the non-volatile storage media. The network interface of the computer device is used for communicating with an external computer device through a network connection. The computer program is executed by a processor to implement a campus information retrieval method.
Those skilled in the art will appreciate that the structures shown in FIG. 4 are block diagrams only and do not constitute a limitation of the computer device on which the present aspects apply, and that a particular computer device may include more or less components than those shown, or may combine some of the components, or have a different arrangement of components.
In some embodiments of the present application, a computer device is provided that includes one or more processors; a memory; and one or more applications, wherein the one or more applications are stored in the memory and configured to be executed by the processor to perform the steps of the campus information retrieval method described above. The steps of the campus information retrieval method herein may be the steps of the campus information retrieval method of the above-described respective embodiments.
In some embodiments of the present application, a computer readable storage medium is provided, storing a computer program, the computer program being loaded by a processor, such that the processor performs the steps of the above-described campus information retrieval method. The steps of the campus information retrieval method herein may be the steps of the campus information retrieval method of the above-described respective embodiments.
Those of ordinary skill in the art will appreciate that implementing all or part of the above-described methods may be accomplished by way of a computer program stored on a non-transitory computer readable storage medium, which when executed, may comprise the steps of the embodiments of the methods described above. Any reference to memory, storage, database, or other medium used in the various embodiments provided herein can include at least one of non-volatile and volatile memory. The nonvolatile Memory may include Read-Only Memory (ROM), magnetic tape, floppy disk, flash Memory, optical Memory, or the like. Volatile memory can include random access memory (Random Access Memory, RAM) or external cache memory. By way of illustration, and not limitation, RAM can take many forms, such as static random access memory (Static Random Access Memory, SRAM) or dynamic random access memory (Dynamic Random Access Memory, DRAM), among others.
The technical features of the above embodiments may be arbitrarily combined, and all possible combinations of the technical features in the above embodiments are not described for brevity of description, however, as long as there is no contradiction between the combinations of the technical features, they should be considered as the scope of the description.
The above describes in detail a method, apparatus, computer device and storage medium for retrieving campus information provided by the embodiments of the present invention, and specific examples are applied to illustrate the principles and embodiments of the present invention, where the above descriptions of the embodiments are only used to help understand the method and core idea of the present invention; meanwhile, as those skilled in the art will have variations in the specific embodiments and application scope in light of the ideas of the present invention, the present description should not be construed as limiting the present invention.

Claims (10)

1. A method of campus information retrieval, comprising:
receiving a park information retrieval request carrying a park information retrieval word;
responding to the park information retrieval request, and screening out park information matched with the park information retrieval words from a pre-constructed park information database to serve as target park information; the park information database is constructed by mining the relevance between a park and an enterprise;
and sending the target park information to a client for display by the client.
2. The method of claim 1, further comprising, prior to said responding to said campus information retrieval request, screening a pre-built database of campus information for a target campus information that matches said campus information retrieval word:
Acquiring information business information;
analyzing the information business information to clean each industrial park to obtain park list information;
carrying out enterprise association on each industrial park in the park list information one by one to construct a key value pair between the park and the enterprise so as to obtain the park information;
and constructing and obtaining the park information database based on the park information.
3. The method of claim 2, wherein the information business information includes information, and wherein the analyzing the information business information to clean each industrial park to obtain park list information includes:
data cleaning is carried out on the information so as to extract each industrial park in the information;
acquiring park attribute information of each industrial park as the park list information;
wherein the campus attribute information includes, but is not limited to: at least one of a park name, a superior park name, park province information, park city information, number of resident enterprises, and park address information.
4. The method of claim 3, wherein the information business information further comprises business information, the business information being business information of a predetermined business object, the step of obtaining the campus list information further comprising:
Cleaning enterprise address information of the preset enterprise object aiming at the business information;
classifying and counting the enterprise address information to take the enterprise address information with the statistical quantity larger than or equal to a preset threshold value as target enterprise address information;
and taking the target enterprise address information as the park address information to update the park list information.
5. The method of any one of claims 2-4, wherein the enterprise association of each industrial campus in the campus list information to construct a pair of keys between a campus and an enterprise, to obtain the campus information, comprises:
extracting park address information of each industrial park in the park list information;
performing word segmentation processing on the park address information to obtain park address word segmentation;
and carrying out association matching on the park address word and prestored enterprise information to construct a key value pair between the park and the enterprise, so as to obtain the park information.
6. The method of claim 5, wherein the method further comprises:
acquiring a park accurate address of each industrial park in the park list information; wherein, the accurate address of garden includes at least one of: park name, park level address, park peripheral address;
Acquiring enterprise business information;
and screening out target enterprises matched with the accurate campus addresses according to the enterprise business information so as to update key value pairs between the parks and the enterprises and obtain the park information.
7. The method of any one of claims 2-4, wherein constructing the campus information database based on the campus information comprises:
acquiring resource integration information aiming at enterprises corresponding to and associated with each industrial park in the park information; wherein the resource integration information includes, but is not limited to: at least one of a resource engagement time, a resource engagement event, and a resource engagement amount;
and marking each industrial park and the related enterprises in the park information based on the resource integration information to obtain the park information database.
8. A campus information retrieval device, comprising:
the request receiving module is used for receiving a park information retrieval request carrying a park information retrieval word;
the request response module is used for responding to the park information retrieval request, and screening out park information matched with the park information retrieval words from a pre-constructed park information database to serve as target park information; the park information database is constructed by mining the relevance between a park and an enterprise;
And the information sending module is used for sending the target park information to a client for the client to display.
9. A computer device, the computer device comprising:
one or more processors;
a memory; and one or more applications, wherein the one or more applications are stored in the memory and configured to be executed by the processor to implement the campus information retrieval method of any one of claims 1 to 7.
10. A computer readable storage medium having stored thereon a computer program, the computer program being loaded by a processor to perform the steps of the campus information retrieval method of any one of claims 1 to 7.
CN202311604225.3A 2023-11-28 2023-11-28 Park information retrieval method, device, computer equipment and storage medium Pending CN117407429A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202311604225.3A CN117407429A (en) 2023-11-28 2023-11-28 Park information retrieval method, device, computer equipment and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202311604225.3A CN117407429A (en) 2023-11-28 2023-11-28 Park information retrieval method, device, computer equipment and storage medium

Publications (1)

Publication Number Publication Date
CN117407429A true CN117407429A (en) 2024-01-16

Family

ID=89494586

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202311604225.3A Pending CN117407429A (en) 2023-11-28 2023-11-28 Park information retrieval method, device, computer equipment and storage medium

Country Status (1)

Country Link
CN (1) CN117407429A (en)

Similar Documents

Publication Publication Date Title
Stieglitz et al. Social media analytics–Challenges in topic discovery, data collection, and data preparation
Kalampokis et al. Open government data: A stage model
US8793285B2 (en) Multidimensional tags
Littman et al. API-based social media collecting as a form of web archiving
Fejzer et al. Profile based recommendation of code reviewers
WO2011094341A2 (en) System and method for social networking
US20150161555A1 (en) Scheduling tasks to operators
KR102319438B1 (en) System for Providing Tourism information based on Bigdata and Driving method of the Same
CN112989156A (en) Big data based policy and enterprise matching method and system
CN113254630B (en) Domain knowledge map recommendation method for global comprehensive observation results
CN112927082A (en) Credit risk prediction method, apparatus, device, medium, and program product
Jiang et al. Application intelligent search and recommendation system based on speech recognition technology
CN110929134A (en) Investment and financing data management method and device, computer equipment and storage medium
CN111696656B (en) Doctor evaluation method and device of Internet medical platform
CN111859969A (en) Data analysis method and device, electronic equipment and storage medium
US8799314B2 (en) System and method for managing information map
US10628421B2 (en) Managing a single database management system
KR20210065773A (en) Big data based emotional information analysis and evaluation system and Driving method of the Same
Cai et al. Research on multi-source POI data fusion based on ontology and clustering algorithms
Peng et al. Research trends in social media/big data with the emphasis on data collection and data management: A bibliometric analysis
Tavra et al. Unpacking the role of volunteered geographic information in disaster management: focus on data quality
Rahal et al. The rating dilemma of academic management journals: Attuning the perceptions of peer rating
CN117033654A (en) Science and technology event map construction method for science and technology mist identification
CN111930891A (en) Retrieval text expansion method based on knowledge graph and related device
Bakaev et al. Prospects and challenges in online data mining: experiences of three-year labour market monitoring project

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination