CN112990845A - Intelligent acquisition method for mapping market project - Google Patents

Intelligent acquisition method for mapping market project Download PDF

Info

Publication number
CN112990845A
CN112990845A CN202110006800.4A CN202110006800A CN112990845A CN 112990845 A CN112990845 A CN 112990845A CN 202110006800 A CN202110006800 A CN 202110006800A CN 112990845 A CN112990845 A CN 112990845A
Authority
CN
China
Prior art keywords
bid
winning
item
data set
project
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202110006800.4A
Other languages
Chinese (zh)
Inventor
许瑞栋
徐荣煜
潘宸
严瑞
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Information Center Of Jiangsu Bureau Of Surveying Mapping And Geographic Information
Original Assignee
Information Center Of Jiangsu Bureau Of Surveying Mapping And Geographic Information
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Information Center Of Jiangsu Bureau Of Surveying Mapping And Geographic Information filed Critical Information Center Of Jiangsu Bureau Of Surveying Mapping And Geographic Information
Priority to CN202110006800.4A priority Critical patent/CN112990845A/en
Publication of CN112990845A publication Critical patent/CN112990845A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/10Office automation; Time management
    • G06Q10/103Workflow collaboration or project management
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/10Text processing
    • G06F40/12Use of codes for handling textual entities
    • G06F40/126Character encoding
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/237Lexical tools
    • G06F40/242Dictionaries
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/30Semantic analysis
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce
    • G06Q30/06Buying, selling or leasing transactions
    • G06Q30/08Auctions

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • General Physics & Mathematics (AREA)
  • Physics & Mathematics (AREA)
  • Artificial Intelligence (AREA)
  • General Engineering & Computer Science (AREA)
  • Strategic Management (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Health & Medical Sciences (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • Human Resources & Organizations (AREA)
  • Data Mining & Analysis (AREA)
  • Entrepreneurship & Innovation (AREA)
  • General Business, Economics & Management (AREA)
  • Accounting & Taxation (AREA)
  • Finance (AREA)
  • Economics (AREA)
  • Marketing (AREA)
  • Evolutionary Biology (AREA)
  • Evolutionary Computation (AREA)
  • Development Economics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Operations Research (AREA)
  • Quality & Reliability (AREA)
  • Tourism & Hospitality (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The application discloses an intelligent acquisition method for mapping market projects. The intelligent acquisition method of the mapping market project comprises the following steps: receiving a data set of winning bid items; extracting a first characteristic attribute from each bid-winning item in the data set, wherein the first characteristic attribute at least comprises bid-winning contents, units, amount and time attributes; and screening out a target item set from the data set according to the first characteristic attribute through the trained item screening model. The technical problems that time and labor are wasted and data timeliness cannot be guaranteed due to the fact that the bidding project is manually screened out in the surveying and mapping are solved.

Description

Intelligent acquisition method for mapping market project
Technical Field
The application relates to the field of surveying and mapping project data processing, in particular to an intelligent acquisition method for surveying and mapping market projects.
Background
The deep development of economic globalization and social informatization promotes the high-speed development of mapping equipment and mapping information technology, attracts a large number of technicians and enterprises to join in the mapping geographic information industry, and the rapid increase of the industry scale brings prosperous vitality for the development of the mapping geographic information industry and brings brand-new challenges for the management of the mapping geographic information department. Meanwhile, with the rapid development of modern technologies such as big data, artificial intelligence, cloud computing and the like, in recent years, people can obtain data from multiple parties in real time and safely, and how to effectively utilize big data resources and perform efficient supervision is a key problem to be solved by administrative departments in various regions urgently.
The surveying and mapping geographic information project is used as an important hand grip for surveying and mapping activity management, and has important significance for surveying and mapping qualification audit or surveying and mapping result quality supervision and inspection. Only by accurately mastering the dynamics of a surveying and mapping market, the cooperative supervision pattern of market autonomy, government supervision and social supervision which are mutually supported can be effectively formed, so that fairness, efficiency and vitality are managed, the competitive strength and market efficiency of market main bodies are promoted to be improved, and the sustainable and healthy development of the economy and the society is promoted.
The bidding and bidding are taken as an important ring for mapping the market behavior, the basic conditions of market bidding and government purchasing projects can be fully reflected, and therefore the bidding and bidding information can be mastered in time, so that the project source management and control effect can be effectively achieved. Because the bidding project is complicated, the supervision can be carried out more pertinently only by effectively screening the information and removing the projects which are irrelevant to mapping, have smaller scale, appear the condition of the drift and the like. However, by 12 months in 2020, statistics shows that nearly 200 ten thousand pieces of bidding project information are disclosed only in Jiangsu province in the current year, wherein nearly 10000 pieces of bidding project information related to mapping are time-consuming and labor-consuming if only manual screening is relied on, and timeliness of data cannot be guaranteed.
Aiming at the problems that time and labor are wasted and data timeliness cannot be guaranteed due to manual screening of a mapping bidding project in the related technology, an effective solution is not provided at present.
Disclosure of Invention
The application mainly aims to provide an intelligent acquisition method for surveying and mapping market projects, and aims to solve the problems that time and labor are wasted when a bidding project is manually screened out, and data timeliness cannot be guaranteed.
In order to achieve the above object, according to one aspect of the present application, an intelligent acquisition method for mapping market items is provided.
The mapping market project intelligent acquisition method comprises the following steps: receiving a data set of winning bid items; extracting a first characteristic attribute from each bid-winning item in the data set, wherein the first characteristic attribute at least comprises bid-winning contents, units, amount and time attributes; and screening out a target item set from the data set according to the first characteristic attribute through the trained item screening model.
Further, receiving the data set of winning bid items comprises: and acquiring a data set of the bid-winning project from the third-party bid-bidding information publicizing platform by adopting a data interface.
Further, screening a target item set from the data set according to the first characteristic attribute through the trained item screening model includes: inputting the characteristic attributes into the project screening model; the item screening model judges whether the corresponding first bid-winning item is effective or not according to the first characteristic attribute; if the first winning bid item is valid, the first winning bid item is screened to the target item set.
Further, after the trained item screening model screens out the target item set from the data set according to the first characteristic attribute, the method further includes: and pushing the target project set to a supervision end through a short message or a data interface.
Further, the training of the project screening model comprises: receiving a sample data set of the bid winning item; extracting a second characteristic attribute from each bid-winning item in the sample data set, wherein the second characteristic attribute at least comprises bid-winning contents, units, money and time attributes; text truncation, filling and encoding operations are performed on the second characteristic attribute to obtain a vectorized mapping dictionary file; training, testing and verifying operations are performed according to the sample data set to obtain a weight and a deviation; and obtaining a project screening model based on the mapping dictionary file, the weight and the deviation.
In order to achieve the above object, according to another aspect of the present application, an intelligent acquisition device for mapping market items is provided.
The intelligent acquisition device for mapping market projects comprises: the receiving module is used for receiving a data set of the bid winning item; the extraction module is used for extracting a first characteristic attribute from each bid-winning item in the data set, wherein the first characteristic attribute at least comprises bid-winning content, units, amount and time attributes; and the screening module is used for screening out a target item set from the data set according to the first characteristic attribute through the trained item screening model.
Further, after the trained item screening model screens out the target item set from the data set according to the first characteristic attribute, the method further includes: and pushing the target project set to a supervision end through a short message or a data interface.
Further, the training of the project screening model comprises: receiving a sample data set of the bid winning item; extracting a second characteristic attribute from each bid-winning item in the sample data set, wherein the second characteristic attribute at least comprises bid-winning contents, units, money and time attributes; text truncation, filling and encoding operations are performed on the second characteristic attribute to obtain a vectorized mapping dictionary file; training, testing and verifying operations are performed according to the sample data set to obtain a weight and a deviation; and obtaining a project screening model based on the mapping dictionary file, the weight and the deviation.
In order to achieve the above object, according to another aspect of the present application, there is provided a storage medium.
The storage medium is used for storing the mapping market project intelligent acquisition method.
To achieve the above object, according to another aspect of the present application, there is provided a server.
A server according to the present application, comprising: a memory and a processor for executing the mapping market project intelligent acquisition method stored in the memory.
In the embodiment of the application, a characteristic attribute extraction and model prediction mode is adopted, and a data set of a bid-winning project is received; extracting a first characteristic attribute from each bid-winning item in the data set, wherein the first characteristic attribute at least comprises bid-winning contents, units, amount and time attributes; screening out a target item set from the data set according to the first characteristic attribute through the trained item screening model; the purpose that the characteristic attribute is extracted and matched with the model prediction target item set to replace manual screening is achieved, time is saved, labor cost is reduced, the technical effect of data timeliness can be guaranteed, and the technical problems that time and labor are wasted and data timeliness cannot be guaranteed due to the fact that the bidding items are manually screened in surveying and mapping are solved.
Drawings
The accompanying drawings, which are incorporated in and constitute a part of this application, serve to provide a further understanding of the application and to enable other features, objects, and advantages of the application to be more apparent. The drawings and their description illustrate the embodiments of the invention and do not limit it. In the drawings:
FIG. 1 is a schematic flow chart diagram of an intelligent acquisition method for mapping market projects according to an embodiment of the application;
FIG. 2 is a schematic structural diagram of an intelligent acquisition device for mapping market items according to an embodiment of the present application;
FIG. 3 is a schematic structural diagram of a storage medium according to an embodiment of the present application;
fig. 4 is a schematic structural diagram of a server according to an embodiment of the present application.
Detailed Description
In order to make the technical solutions better understood by those skilled in the art, the technical solutions in the embodiments of the present application will be clearly and completely described below with reference to the drawings in the embodiments of the present application, and it is obvious that the described embodiments are only partial embodiments of the present application, but not all embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present application.
It should be noted that the terms "first," "second," and the like in the description and claims of this application and in the drawings described above are used for distinguishing between similar elements and not necessarily for describing a particular sequential or chronological order. It should be understood that the data so used may be interchanged under appropriate circumstances such that embodiments of the application described herein may be used. Furthermore, the terms "comprises," "comprising," and "having," and any variations thereof, are intended to cover a non-exclusive inclusion, such that a process, method, system, article, or apparatus that comprises a list of steps or elements is not necessarily limited to those steps or elements expressly listed, but may include other steps or elements not expressly listed or inherent to such process, method, article, or apparatus.
In this application, the terms "upper", "lower", "left", "right", "front", "rear", "top", "bottom", "inner", "outer", "middle", "vertical", "horizontal", "lateral", "longitudinal", and the like indicate orientations or positional relationships based on the orientations or positional relationships shown in the drawings. These terms are used primarily to better describe the invention and its embodiments and are not intended to limit the indicated devices, elements or components to a particular orientation or to be constructed and operated in a particular orientation.
Moreover, some of the above terms may be used to indicate other meanings besides the orientation or positional relationship, for example, the term "on" may also be used to indicate some kind of attachment or connection relationship in some cases. The specific meanings of these terms in the present invention can be understood by those skilled in the art as appropriate.
Furthermore, the terms "mounted," "disposed," "provided," "connected," and "sleeved" are to be construed broadly. For example, it may be a fixed connection, a removable connection, or a unitary construction; can be a mechanical connection, or an electrical connection; may be directly connected, or indirectly connected through intervening media, or may be in internal communication between two devices, elements or components. The specific meanings of the above terms in the present invention can be understood by those of ordinary skill in the art according to specific situations.
It should be noted that the embodiments and features of the embodiments in the present application may be combined with each other without conflict. The present application will be described in detail below with reference to the embodiments with reference to the attached drawings.
According to an embodiment of the present invention, there is provided an intelligent acquisition method for mapping market projects, as shown in fig. 1, the method includes the following steps S101 to S103:
step S101, receiving a data set of a bid-winning project;
according to the embodiment of the present invention, preferably, the receiving the data set of the winning bid item includes:
and acquiring a data set of the bid-winning project from the third-party bid-bidding information publicizing platform by adopting a data interface.
In this embodiment, the server may access the third-party bid and bid information publishing platform through the data interface to establish communication, and thus, the server may actively or passively obtain a data set formed by all bid-winning items on the platform. Preferably, the server periodically obtains the data set formed by all winning bid items in the time period.
Step S102, extracting a first characteristic attribute from each bid-winning item in a data set, wherein the first characteristic attribute at least comprises bid-winning contents, units, money and time attributes;
in the embodiment, each bid-winning item contains corresponding information, the content in the bid-winning item can be identified by combining a character identification technology and semantic identification, the content of each bid-winning item is determined according to an identification result, and the identified bid-winning content, unit, amount and time attributes are used as first characteristic attributes corresponding to a certain bid-winning item; and input parameter guarantee is provided for the screening of the bid-winning items.
And S103, screening a target item set from the data set according to the first characteristic attribute through the trained item screening model.
According to the embodiment of the present invention, preferably, the screening, by the trained item screening model, the target item set from the data set according to the first feature attribute includes:
inputting the characteristic attributes into the project screening model;
the item screening model judges whether the corresponding first bid-winning item is effective or not according to the first characteristic attribute;
if the first winning bid item is valid, the first winning bid item is screened to the target item set.
In this embodiment, the trained item screening model, in combination with the first characteristic attribute, has an effect of determining whether a certain winning bid item (i.e., a first target item) meets a standard; specifically, the item screening model can judge whether the bid-winning content is a flow mark or not by combining the bid-winning content attribute in the first characteristic attribute, so that the flow mark items can be screened out; the item screening model can judge whether the bid-winning units are related to surveying and mapping by combining the attributes of the bid-winning units in the first characteristic attributes, so that the initially screened bid-winning items can be further screened; the item screening model can judge whether the bid amount is too small by combining the bid amount attribute in the first characteristic attribute, so that the small-scale bid items can be screened; the item screening model can judge whether the winning bid time is overdue by combining the winning bid time attribute in the first characteristic attribute, so that winning bid items with the overdue period can be screened out. And screening each bid-winning item to a target item set through all the judgment.
Thus, multi-dimensional screening of the bid-winning items is realized, the left bid-winning items in the target item set are more accurate and more meet the requirements, and the method can be directly applied to supervision; the purpose that the characteristic attribute extraction is matched with the model to predict the target item set to replace manual screening is achieved, so that the technical effects of saving time, reducing labor cost and guaranteeing data timeliness are achieved.
From the above description, it can be seen that the present invention achieves the following technical effects:
in the embodiment of the application, a characteristic attribute extraction and model prediction mode is adopted, and a data set of a bid-winning project is received; extracting a first characteristic attribute from each bid-winning item in the data set, wherein the first characteristic attribute at least comprises bid-winning contents, units, amount and time attributes; screening out a target item set from the data set according to the first characteristic attribute through the trained item screening model; the purpose that the characteristic attribute is extracted and matched with the model prediction target item set to replace manual screening is achieved, time is saved, labor cost is reduced, the technical effect of data timeliness can be guaranteed, and the technical problems that time and labor are wasted and data timeliness cannot be guaranteed due to the fact that the bidding items are manually screened in surveying and mapping are solved.
According to the embodiment of the present invention, after the trained item screening model screens out the target item set from the data set according to the first feature attribute, the method further includes:
and pushing the target project set to a supervision end through a short message or a data interface.
After all the bid-winning items are judged, the target item set is pushed to a mobile phone or a computer of a supervisor through a short message or a data interface, so that the supervisor can carry out efficient supervision.
According to the embodiment of the present invention, preferably, the training of the item screening model includes:
receiving a sample data set of the bid winning item;
in another preferred embodiment, the training of the project screening model comprises:
and preprocessing the sample data set, and extracting attributes such as bid content, bid unit, bid amount and the like of each item text instance.
And (4) according to a rough set theory, performing attribute reduction by using the conditional information entropy, and calculating and screening a decision attribute (D).
And (4) performing text training on the sample data set with the reduced indexes by using a fastText frame as a text classification frame to obtain project screening model parameters.
In this embodiment, the server may access the third-party bid and bid information publishing platform through the data interface to establish communication, and thus, the server may actively or passively obtain a sample data set formed by all bid-winning items on the platform. Preferably, the server passively acquires a sample data set formed by massive bid-winning items.
Extracting a second characteristic attribute from each bid-winning item in the sample data set, wherein the second characteristic attribute at least comprises bid-winning contents, units, money and time attributes;
and preprocessing the sample data set, and extracting attributes such as bid content, bid-winning units, bid-winning amount, bid-winning time and the like of each item text instance. Training is performed based on these attributes.
Text truncation, filling and encoding operations are performed on the second characteristic attribute to obtain a vectorized mapping dictionary file;
specifically, text truncation and filling are performed on sample data, preprocessed data are read in, each record is truncated into a data (D) part and a label (L) part to form a data set D and a label set L, and the length of data required to be input in model training is fixed for the data set D. Therefore, the data with the length exceeding the specified limit is directly cut into a fixed length; and filling null values for data with insufficient length. After the steps of label extraction, text truncation and filling, the preprocessed data are converted into a data set D with a fixed length of a label set L.
And coding the words in all the texts, and storing the corresponding mapping relation as a dictionary file. The dictionary sets an upper data limit and words outside the dictionary will be assigned null values. After this process, the fixed length text is converted into a numerical vector for subsequent calculation.
For the dictionary file formed in the mapping process of establishing word encoding-text vectorization, file storage is needed to facilitate subsequent calling.
Training, testing and verifying operations are performed according to the sample data set to obtain a weight and a deviation; and obtaining a project screening model based on the mapping dictionary file, the weight and the deviation.
Specifically, the data set is divided into a training set, a test set and a verification set according to a certain proportion. The training set is used for training network parameters, the testing set is used for testing training results, and the verification set is used for determining auxiliary parameters. After this process, the data set is divided into three parts for subsequent process invocation, and iterative training of the model begins.
Training the sample, calculating the deviation between the predicted value and the actual value, adjusting the weight of the network, saving the weight and the bias corresponding to the current network after the model iterates enough training turns according to the process, and exporting the trained model and a mapping dictionary for text vectorization for item screening and calling.
In a preferred embodiment, the input decision table DT ═ (U, C ═ D), where U is the domain of discourse, and C and D are the conditional and decision attribute sets, respectively. A relative reduction R of the decision table is output. The main steps are 1) solving the information entropy H (D) of the decision attribute D. 2) And calculating the mutual information I (C, D) of all the attribute sets C to the decision attribute set D in the decision table. 3) And solving the core attribute core. First, initialize kernel
Figure RE-GDA0002885920320000091
For a ∈ C, f (x, D) ≠ f (y, D), and f (x, C-a) ≠ f (y, C-a), and a is the core attribute. 4) Let R be core, calculate the mutual information I (R, D) of the core attribute R pair decision attribute set D. 5) For a belongs to C-R, the mutual information quantity I (a, D) of the decision attribute set D is calculated, and the attribute which enables the mutual information quantity to be maximum, namely the attribute is the important attribute, is found out. Has R ═ R ^ a. 6) The mutual information amount I (R, D) of the pair of decision attribute sets D by the attribute set R at this time is calculated. When the information amount of the attribute set R is equal to that of all the attribute sets C, the algorithm is terminated; otherwise, go to step 5).
It should be noted that the steps illustrated in the flowcharts of the figures may be performed in a computer system such as a set of computer-executable instructions and that, although a logical order is illustrated in the flowcharts, in some cases, the steps illustrated or described may be performed in an order different than presented herein.
According to an embodiment of the present invention, there is also provided an apparatus for implementing the above intelligent acquisition method for mapping market items, as shown in fig. 2, the apparatus includes:
a receiving module 10, configured to receive a data set of bid-winning items;
according to the embodiment of the present invention, preferably, the receiving the data set of the winning bid item includes:
and acquiring a data set of the bid-winning project from the third-party bid-bidding information publicizing platform by adopting a data interface.
In this embodiment, the server may access the third-party bid and bid information publishing platform through the data interface to establish communication, and thus, the server may actively or passively obtain a data set formed by all bid-winning items on the platform. Preferably, the server periodically obtains the data set formed by all winning bid items in the time period.
An extracting module 20, configured to extract a first characteristic attribute from each bid-winning item in the data set, where the first characteristic attribute at least includes bid-winning content, unit, amount, and time attribute;
in the embodiment, each bid-winning item contains corresponding information, the content in the bid-winning item can be identified by combining a character identification technology and semantic identification, the content of each bid-winning item is determined according to an identification result, and the identified bid-winning content, unit, amount and time attributes are used as first characteristic attributes corresponding to a certain bid-winning item; and input parameter guarantee is provided for the screening of the bid-winning items.
And the screening module 30 is configured to screen out a target item set from the data set according to the first characteristic attribute through the trained item screening model.
According to the embodiment of the present invention, preferably, the screening, by the trained item screening model, the target item set from the data set according to the first feature attribute includes:
inputting the characteristic attributes into the project screening model;
the item screening model judges whether the corresponding first bid-winning item is effective or not according to the first characteristic attribute;
if the first winning bid item is valid, the first winning bid item is screened to the target item set.
In this embodiment, the trained item screening model, in combination with the first characteristic attribute, has an effect of determining whether a certain winning bid item (i.e., a first target item) meets a standard; specifically, the item screening model can judge whether the bid-winning content is a flow mark or not by combining the bid-winning content attribute in the first characteristic attribute, so that the flow mark items can be screened out; the item screening model can judge whether the bid-winning units are related to surveying and mapping by combining the attributes of the bid-winning units in the first characteristic attributes, so that the initially screened bid-winning items can be further screened; the item screening model can judge whether the bid amount is too small by combining the bid amount attribute in the first characteristic attribute, so that the small-scale bid items can be screened; the item screening model can judge whether the winning bid time is overdue by combining the winning bid time attribute in the first characteristic attribute, so that winning bid items with the overdue period can be screened out. And screening each bid-winning item to a target item set through all the judgment.
Thus, multi-dimensional screening of the bid-winning items is realized, the left bid-winning items in the target item set are more accurate and more meet the requirements, and the method can be directly applied to supervision; the purpose that the characteristic attribute extraction is matched with the model to predict the target item set to replace manual screening is achieved, so that the technical effects of saving time, reducing labor cost and guaranteeing data timeliness are achieved.
From the above description, it can be seen that the present invention achieves the following technical effects:
in the embodiment of the application, a characteristic attribute extraction and model prediction mode is adopted, and a data set of a bid-winning project is received; extracting a first characteristic attribute from each bid-winning item in the data set, wherein the first characteristic attribute at least comprises bid-winning contents, units, amount and time attributes; screening out a target item set from the data set according to the first characteristic attribute through the trained item screening model; the purpose that the characteristic attribute is extracted and matched with the model prediction target item set to replace manual screening is achieved, time is saved, labor cost is reduced, the technical effect of data timeliness can be guaranteed, and the technical problems that time and labor are wasted and data timeliness cannot be guaranteed due to the fact that the bidding items are manually screened in surveying and mapping are solved.
According to the embodiment of the present invention, after the trained item screening model screens out the target item set from the data set according to the first feature attribute, the method further includes:
and pushing the target project set to a supervision end through a short message or a data interface.
After all the bid-winning items are judged, the target item set is pushed to a mobile phone or a computer of a supervisor through a short message or a data interface, so that the supervisor can carry out efficient supervision.
According to the embodiment of the present invention, preferably, the training of the item screening model includes:
receiving a sample data set of the bid winning item;
in this embodiment, the server may access the third-party bid and bid information publishing platform through the data interface to establish communication, and thus, the server may actively or passively obtain a sample data set formed by all bid-winning items on the platform. Preferably, the server passively acquires a sample data set formed by massive bid-winning items.
Extracting a second characteristic attribute from each bid-winning item in the sample data set, wherein the second characteristic attribute at least comprises bid-winning contents, units, money and time attributes;
and preprocessing the sample data set, and extracting attributes such as bid content, bid-winning units, bid-winning amount, bid-winning time and the like of each item text instance. Training is performed based on these attributes.
Text truncation, filling and encoding operations are performed on the second characteristic attribute to obtain a vectorized mapping dictionary file;
specifically, text truncation and filling are performed on sample data, preprocessed data are read in, each record is truncated into a data (D) part and a label (L) part to form a data set D and a label set L, and the length of data required to be input in model training is fixed for the data set D. Therefore, the data with the length exceeding the specified limit is directly cut into a fixed length; and filling null values for data with insufficient length. After the steps of label extraction, text truncation and filling, the preprocessed data are converted into a data set D with a fixed length of a label set L
And coding the words in all the texts, and storing the corresponding mapping relation as a dictionary file. The dictionary sets an upper data limit and words outside the dictionary will be assigned null values. After this process, the fixed length text is converted into a numerical vector for subsequent calculation.
For the dictionary file formed in the mapping process of establishing word encoding-text vectorization, file storage is needed to facilitate subsequent calling.
Training, testing and verifying operations are performed according to the sample data set to obtain a weight and a deviation; and obtaining a project screening model based on the mapping dictionary file, the weight and the deviation.
Specifically, the data set is divided into a training set, a test set and a verification set according to a certain proportion. The training set is used for training network parameters, the testing set is used for testing training results, and the verification set is used for determining auxiliary parameters. After this process, the data set is divided into three parts for subsequent process invocation, and iterative training of the model begins.
Training the sample, calculating the deviation between the predicted value and the actual value, adjusting the weight of the network, saving the weight and the bias corresponding to the current network after the model iterates enough training turns according to the process, and exporting the trained model and a mapping dictionary for text vectorization for item screening and calling.
According to an embodiment of the present invention, as shown in fig. 3, there is further provided a storage medium for storing the above intelligent acquisition method for mapping market items; the same technical effect of the intelligent acquisition method of the mapping market project can be achieved.
According to an embodiment of the present invention, as shown in fig. 4, there is also provided a server; the server includes: a processor 200 and a memory 100, wherein the processor 200 is used for executing the mapping market project intelligent acquisition method stored in the memory 100; the same technical effect of the intelligent acquisition method of the mapping market project can be achieved.
It will be apparent to those skilled in the art that the modules or steps of the present invention described above may be implemented by a general purpose computing device, they may be centralized on a single computing device or distributed across a network of multiple computing devices, and they may alternatively be implemented by program code executable by a computing device, such that they may be stored in a storage device and executed by a computing device, or fabricated separately as individual integrated circuit modules, or fabricated as a single integrated circuit module from multiple modules or steps. Thus, the present invention is not limited to any specific combination of hardware and software.
The above description is only a preferred embodiment of the present application and is not intended to limit the present application, and various modifications and changes may be made by those skilled in the art. Any modification, equivalent replacement, improvement and the like made within the spirit and principle of the present application shall be included in the protection scope of the present application.

Claims (10)

1. An intelligent acquisition method for mapping market projects is characterized by comprising the following steps:
receiving a data set of winning bid items;
extracting a first characteristic attribute from each bid-winning item in the data set, wherein the first characteristic attribute at least comprises bid-winning contents, units, amount and time attributes;
and screening out a target item set from the data set according to the first characteristic attribute through the trained item screening model.
2. The intelligent mapping market project acquisition method of claim 1, wherein receiving a dataset of winning bid projects comprises:
and acquiring a data set of the bid-winning project from the third-party bid-bidding information publicizing platform by adopting a data interface.
3. The intelligent mapping market project acquisition method of claim 1, wherein screening out a set of target projects from a data set according to the first feature attribute through the trained project screening model comprises:
inputting the characteristic attributes into the project screening model;
the item screening model judges whether the corresponding first bid-winning item is effective or not according to the first characteristic attribute;
if the first winning bid item is valid, the first winning bid item is screened to the target item set.
4. The intelligent mapping market project acquisition method according to claim 1, wherein after the trained project screening model screens out the target project set from the data set according to the first characteristic attribute, the method further comprises:
and pushing the target project set to a supervision end through a short message or a data interface.
5. The intelligent acquisition method for mapping market projects according to claim 1, wherein the training of the project screening model comprises:
receiving a sample data set of the bid winning item;
extracting a second characteristic attribute from each bid-winning item in the sample data set, wherein the second characteristic attribute at least comprises bid-winning contents, units, money and time attributes;
text truncation, filling and encoding operations are performed on the second characteristic attribute to obtain a vectorized mapping dictionary file;
training, testing and verifying operations are performed according to the sample data set to obtain a weight and a deviation;
and obtaining a project screening model based on the mapping dictionary file, the weight and the deviation.
6. An intelligent acquisition device for mapping market projects, comprising:
the receiving module is used for receiving a data set of the bid winning item;
the extraction module is used for extracting a first characteristic attribute from each bid-winning item in the data set, wherein the first characteristic attribute at least comprises bid-winning content, units, amount and time attributes;
and the screening module is used for screening out a target item set from the data set according to the first characteristic attribute through the trained item screening model.
7. The intelligent mapping market project acquisition device of claim 1, further comprising, after the trained project screening model screens out the target project set from the data set according to the first characteristic attribute:
and pushing the target project set to a supervision end through a short message or a data interface.
8. The intelligent acquisition device for mapping market items according to claim 1, wherein the training of the item screening model comprises:
receiving a sample data set of the bid winning item;
extracting a second characteristic attribute from each bid-winning item in the sample data set, wherein the second characteristic attribute at least comprises bid-winning contents, units, money and time attributes;
text truncation, filling and encoding operations are performed on the second characteristic attribute to obtain a vectorized mapping dictionary file;
training, testing and verifying operations are performed according to the sample data set to obtain a weight and a deviation;
and obtaining a project screening model based on the mapping dictionary file, the weight and the deviation.
9. A storage medium for storing the mapping market project intelligent acquisition method of any one of claims 1 to 5.
10. A server, comprising: memory and processor, characterized in that the processor is configured to execute the mapping market project intelligent acquisition method of any one of claims 1 to 5 stored in the memory.
CN202110006800.4A 2021-01-04 2021-01-04 Intelligent acquisition method for mapping market project Pending CN112990845A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110006800.4A CN112990845A (en) 2021-01-04 2021-01-04 Intelligent acquisition method for mapping market project

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110006800.4A CN112990845A (en) 2021-01-04 2021-01-04 Intelligent acquisition method for mapping market project

Publications (1)

Publication Number Publication Date
CN112990845A true CN112990845A (en) 2021-06-18

Family

ID=76345242

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110006800.4A Pending CN112990845A (en) 2021-01-04 2021-01-04 Intelligent acquisition method for mapping market project

Country Status (1)

Country Link
CN (1) CN112990845A (en)

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108388595A (en) * 2018-01-31 2018-08-10 合肥和钧正策信息技术有限公司 Enterprise bidding trade-off evaluation system based on big data analysis
CN109408825A (en) * 2018-11-06 2019-03-01 杭州费尔斯通科技有限公司 A kind of acceptance of the bid data extraction method based on name Entity recognition
CN109753660A (en) * 2019-01-07 2019-05-14 福州大学 A kind of acceptance of the bid webpage name entity abstracting method based on LSTM
CN110009242A (en) * 2019-04-12 2019-07-12 网易(杭州)网络有限公司 Tender Evaluation Method neural network based and device, storage medium
CN111047268A (en) * 2018-10-11 2020-04-21 上海汽车集团股份有限公司 Bidding method and device
CN111275374A (en) * 2019-12-09 2020-06-12 佛山欧神诺云商科技有限公司 Optimal carrier screening method, device and medium based on logistics bidding data
CN111507760A (en) * 2019-03-21 2020-08-07 广州翔实工程咨询有限公司 Method and system for screening reasonableness of bidding documents
CN111985752A (en) * 2019-05-23 2020-11-24 北京京供民科技开发有限公司 Bid information evaluation method and device

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108388595A (en) * 2018-01-31 2018-08-10 合肥和钧正策信息技术有限公司 Enterprise bidding trade-off evaluation system based on big data analysis
CN111047268A (en) * 2018-10-11 2020-04-21 上海汽车集团股份有限公司 Bidding method and device
CN109408825A (en) * 2018-11-06 2019-03-01 杭州费尔斯通科技有限公司 A kind of acceptance of the bid data extraction method based on name Entity recognition
CN109753660A (en) * 2019-01-07 2019-05-14 福州大学 A kind of acceptance of the bid webpage name entity abstracting method based on LSTM
CN111507760A (en) * 2019-03-21 2020-08-07 广州翔实工程咨询有限公司 Method and system for screening reasonableness of bidding documents
CN110009242A (en) * 2019-04-12 2019-07-12 网易(杭州)网络有限公司 Tender Evaluation Method neural network based and device, storage medium
CN111985752A (en) * 2019-05-23 2020-11-24 北京京供民科技开发有限公司 Bid information evaluation method and device
CN111275374A (en) * 2019-12-09 2020-06-12 佛山欧神诺云商科技有限公司 Optimal carrier screening method, device and medium based on logistics bidding data

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
MUNER, WFS (MUNER, WAN FAREZ SAINI): "A New Model for Multicriteria Risk-Based Tender Evaluation", ADVANCED SCIENCE LETTERS *
卢睿: "基于粒子群算法的铁路工程投标项目择优决策研究", 铁道学报 *
徐怡、余浩、刘刚、倪治伟: "基于粗糙集的影响大学生心理健康的研究", 《计算机技术与发展》, pages 1 *

Similar Documents

Publication Publication Date Title
Tsui et al. Knowledge-based extraction of intellectual capital-related information from unstructured data
Athawale et al. Decision making for facility location selection using PROMETHEE II method
CN110569322A (en) Address information analysis method, device and system and data acquisition method
CN111144723A (en) Method and system for recommending people's job matching and storage medium
CN112633962B (en) Service recommendation method and device, computer equipment and storage medium
CN110598070B (en) Application type identification method and device, server and storage medium
CN103123633A (en) Generation method of evaluation parameters and information searching method based on evaluation parameters
CN108898476A (en) A kind of loan customer credit-graded approach and device
CN111105209A (en) Job resume matching method and device suitable for post matching recommendation system
CN111352962A (en) Client portrait construction method and device
CN110532301B (en) Audit method, system and readable storage medium
CN105225135A (en) Potentiality customer recognition method and device
CN110766438A (en) Method for analyzing user behaviors of power grid users through artificial intelligence
CN115423578B (en) Bid bidding method and system based on micro-service containerized cloud platform
CN114638498A (en) ESG evaluation method, ESG evaluation system, electronic equipment and storage equipment
Ha et al. Job forecasting based on the patent information: a word embedding-based approach
CN114218291A (en) Portrait generation method, apparatus, device and storage medium based on target object
CN111882224A (en) Method and device for classifying consumption scenes
CN116522131A (en) Object representation method, device, electronic equipment and computer readable storage medium
CN117522256A (en) Purchasing screening method, system, storage medium and terminal based on supplier portrait
CN112990845A (en) Intelligent acquisition method for mapping market project
CN115687788A (en) Intelligent business opportunity recommendation method and system
CN115099680A (en) Risk management method, device, equipment and storage medium
CN114092057A (en) Project model construction method and device, terminal equipment and storage medium
CN113190666A (en) Industrial intellectual property analysis method, system, equipment and storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination