CN112015850A - Method and system for updating POI electronic map data based on data mining and POI vertical industry data characteristics - Google Patents

Method and system for updating POI electronic map data based on data mining and POI vertical industry data characteristics

Info

Publication number
CN112015850A
Authority
CN
China
Prior art keywords
poi
data
information
library
characteristic information
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202011189393.7A
Other languages
Chinese (zh)
Other versions
CN112015850B (en)
Inventor
白峻铭
郭晟
邹利平
尹芬
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Leador Spatial Information Technology Co ltd
Original Assignee
Leador Spatial Information Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Leador Spatial Information Technology Co ltd
Priority to CN202011189393.7A
Publication of CN112015850A
Application granted
Publication of CN112015850B
Legal status: Active
Anticipated expiration

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20 Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/29 Geographical information databases
    • G06F16/21 Design, administration or maintenance of databases
    • G06F16/23 Updating

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Remote Sensing (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention provides a method and a system for updating POI electronic map data based on data mining and POI vertical industry data characteristics. The method comprises the following steps. Selection of POI vertical industry data characteristic information: selecting the POI vertical industries and their data characteristic information. Collection and management of POI vertical industry data characteristic information: including POI data characteristic label identification, classification management, sample checking, import and export. POI data characteristic information extraction: automatically extracting the industry characteristic information and POI data information of vertical industry POI data. POI inspection: verifying and modifying the POI results and storing the verified and modified data in the corresponding result database. POI result management: including import and export of the extraction results, classification management and result viewing. The invention can greatly reduce the workload of manual POI collection and manage the sample characteristic information and the POI results efficiently and in a standardized manner.

Description

Method and system for updating POI electronic map data based on data mining and POI vertical industry data characteristics
Technical Field
The invention relates to the technical field of surveying and mapping, and in particular to a method and a system for updating POI electronic map data based on data mining and POI vertical industry data characteristics.
Background
In recent years, as the vertical sectors of the Internet industry have become ever more finely subdivided, big data mining technology has given the vertical industries greater data acquisition and data analysis capabilities. At the same time, the refined operation of the vertical industries has rapidly given all groups of society a richer and more specialized range of choices. POI data is an indispensable component of location services in the surveying and mapping field: it supports location retrieval, geographic information display, route planning calculation and characteristic information queries.
The inventor of the present application finds that the method of the prior art has at least the following technical problems in the process of implementing the present invention:
in the existing surveying and mapping field, POI data information and geocoding information are collected on site by mobile measurement technology. Because the POI vertical industries are numerous in type and large in data volume, and because their diverse industry characteristics require a great deal of manual production work, this approach suffers from a long data collection cycle, high cost, and POI vertical industry data content that is neither refined nor rich.
Disclosure of Invention
The invention provides a method and a system for updating POI electronic map data based on data mining and POI vertical industry data characteristics, which are used for solving or at least partially solving the technical problem of low efficiency in updating the POI electronic map data in the existing method.
In order to solve the above technical problem, a first aspect of the present invention provides a method for updating electronic map data of a POI based on data mining and vertical industry data features of the POI, including:
S1: performing geocoding on the POI vertical industry classification data, checking the geocoded data and repairing any problem data found in it, and storing the checked and repaired POI vertical industry classification data in a POI sample library;
S2: reading the POI vertical industry classification data from the POI sample library and identifying its characteristic information, wherein the characteristic information is industry-related information, and storing the identified data in the POI sample library;
S3: reading data from the POI sample library and matching it against target objects in a POI result library according to the POI classifications of the different industries; for vertical industry POI data whose matching result is a newly added POI, storing the data information it contains together with its source information in the POI result library; and storing POI vertical industry data that needs to be verified and modified in a POI operation library; wherein matching against target objects in the POI result library according to the POI classifications of the different industries comprises:
if both the POI attribute information and the characteristic information of the data in the POI sample library are consistent with those of the data in the POI result library, the data in the POI sample library is original POI vertical industry data, and the POI result library is not updated;
if the POI attribute information is consistent but the characteristic information is inconsistent, the data in the POI sample library is data that needs to be verified and modified;
if both the POI attribute information and the characteristic information are inconsistent, the data in the POI sample library is newly added POI vertical industry data, and the newly added POI vertical industry data is updated into the POI result library;
if the POI attribute information is inconsistent but the characteristic information is consistent, the data in the POI sample library is data that needs to be verified and modified;
the data information contained in the POI data comprises POI attribute information and characteristic information, wherein the POI attribute information comprises a POI name, a geocode position and a type;
S4: reading data from the POI operation library, browsing the POI vertical industry data, and checking the POI name, geocoded position, type and characteristic information contained in the POI vertical industry data of the extracted target object; if they are consistent, making no modification; if they are inconsistent, modifying them to be consistent and then submitting.
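The matching rule in S3 can be sketched as a small decision function. This is only an illustration under stated assumptions: the four cases are read as the combinations of "attributes consistent or not" and "characteristic information consistent or not", and the names `Poi`, `MatchResult` and `classify_match` are hypothetical, not taken from the patent.

```python
from dataclasses import dataclass, field
from enum import Enum
from typing import FrozenSet, Tuple


class MatchResult(Enum):
    ORIGINAL = "original"          # already in the result library; no update
    NEW = "new"                    # newly added POI; write to the result library
    NEEDS_REVIEW = "needs_review"  # send to the operation library for checking


@dataclass(frozen=True)
class Poi:
    name: str
    position: Tuple[float, float]  # geocoded (longitude, latitude)
    poi_type: str
    # Hypothetical representation of the industry characteristic information.
    features: FrozenSet[str] = field(default_factory=frozenset)

    def attributes(self):
        """POI attribute information: name, geocoded position and type."""
        return (self.name, self.position, self.poi_type)


def classify_match(sample: Poi, existing: Poi) -> MatchResult:
    """Compare a sample-library record with a result-library record."""
    attrs_same = sample.attributes() == existing.attributes()
    feats_same = sample.features == existing.features
    if attrs_same and feats_same:
        return MatchResult.ORIGINAL
    if not attrs_same and not feats_same:
        return MatchResult.NEW
    # Exactly one of the two comparisons disagrees (assumed reading).
    return MatchResult.NEEDS_REVIEW
```

Records classified as `NEW` would go to the POI result library with their source information, and `NEEDS_REVIEW` records to the POI operation library for the check in S4.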
In one embodiment, after storing the identified data in the POI sample repository in step S2, the method further comprises:
managing the POI sample library, with support for querying, browsing, importing and exporting.
In one embodiment, S1 includes:
matching geocodes to industry data from different sources, and finely classifying and integrating the industry data sources, wherein the industry data sources can be extended along the POI information classification, and the POI data includes data from the catering, hotel, tourism and pharmaceutical industries as well as POI data publicly released by various groups and industry fields;
and checking the credibility of the geocoded POI vertical industry classification data, namely whether it comes from an officially released channel source, and repairing problem data.
In one embodiment, the identifying the feature information of the POI vertical industry classification data in step S2 includes:
if the data is catering industry data, the characteristic information comprises dish information, per-capita consumption information, evaluation information, picture information uploaded by a user and service scores, and the characteristic information contained in the catering industry data is identified;
if the data is hotel industry data, the characteristic information comprises guest room information, check-in information, evaluation information, picture information uploaded by a user and service scores, and the characteristic information contained in the hotel industry data is identified;
if the data is tourism industry data, the characteristic information comprises scenic spot pictures, entrance ticket prices, evaluation information and service scores, and the characteristic information contained in the tourism industry data is identified;
if the data is the pharmaceutical industry data, the characteristic information comprises recent order data volume, evaluation information and physician resource introduction information, and the characteristic information contained in the pharmaceutical industry data is identified.
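The per-industry characteristic information listed above can be held in a simple lookup table. The sketch below assumes raw records arrive as dictionaries with an `industry` key; the field names and the function `identify_features` are hypothetical labels chosen for illustration.

```python
# Characteristic information fields per vertical industry, as listed in the text.
# The key names are hypothetical labels, not taken from the patent.
INDUSTRY_FEATURES = {
    "catering": ["dish_info", "per_capita_consumption", "reviews",
                 "user_photos", "service_score"],
    "hotel": ["room_info", "check_in_info", "reviews",
              "user_photos", "service_score"],
    "tourism": ["scenic_photos", "ticket_price", "reviews", "service_score"],
    "pharmaceutical": ["recent_order_volume", "reviews", "physician_intro"],
}


def identify_features(record: dict) -> dict:
    """Return the industry-specific characteristic fields present in a record."""
    wanted = INDUSTRY_FEATURES.get(record.get("industry"), [])
    return {key: record[key] for key in wanted if key in record}
```

Keeping the field lists in data rather than in branching code makes it straightforward to add another industry later.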
In one embodiment, the method further comprises: data that needs to be verified and modified is verified and modified.
In one embodiment, the method further includes step S5:
and managing data in the POI result library.
In one embodiment, managing data in a POI outcome repository includes:
querying, browsing, compiling statistics on, importing and exporting data in the POI result library, wherein the queries include: queries by characteristic information identification, queries by data mining channel source, and queries on modification process information.
Based on the same inventive concept, the second aspect of the present invention provides an updating system of POI electronic map data based on data mining and POI vertical industry data features, comprising:
the POI data classification selection sub-module is used for carrying out geographic coding processing on the POI vertical industry classification data, checking the POI vertical industry classification data after the coding processing and repairing problem data in the POI vertical industry classification data, wherein the checked and repaired POI vertical industry classification data are stored in a POI sample library;
the POI characteristic information identification selection submodule is used for reading POI vertical industry classification data in the POI sample library and identifying the characteristic information of the POI vertical industry classification data, wherein the characteristic information is information related to industries, and the identified data is stored in the POI sample library;
the POI prior information comparison submodule is used for reading data in the POI sample library, matching a target object in the POI result library according to the classification of POIs in different industries, storing data information and source information contained in vertical industry POI data of which the target object matching source is a newly-added POI into the POI result library together, and storing the POI vertical industry data which needs to be verified and modified into the POI operation library, wherein the matching of the target object in the POI result library is carried out according to the classification of the POIs in different industries, and comprises the following steps:
if both the POI attribute information and the characteristic information of the data in the POI sample library are consistent with those of the data in the POI result library, the data in the POI sample library is original POI vertical industry data, and the POI result library is not updated;
if the POI attribute information is consistent but the characteristic information is inconsistent, the data in the POI sample library is data that needs to be verified and modified;
if both the POI attribute information and the characteristic information are inconsistent, the data in the POI sample library is newly added POI vertical industry data, and the newly added POI vertical industry data is updated into the POI result library;
if the POI attribute information is inconsistent but the characteristic information is consistent, the data in the POI sample library is data that needs to be verified and modified;
the data information contained in the POI data comprises POI attribute information and characteristic information, wherein the POI attribute information comprises a POI name, a geocode position and a type;
and the POI checking sub-module is used for reading data in the POI operation library, browsing the POI vertical industry data, matching the POI name, the geocode position, the type and the characteristic information contained in the POI vertical industry data of the extracted target object, if the POI name, the geocode position, the type and the characteristic information are consistent, not modifying the POI name, the geocode position, the type and the characteristic information, if the POI name, the geocode position, the type and the characteristic information are inconsistent, modifying the POI vertical industry data to be consistent.
In one embodiment, the system further comprises a POI result management submodule for managing data in the POI result library.
One or more technical solutions in the embodiments of the present application have at least one or more of the following technical effects:
the invention discloses a method for updating POI electronic map data based on data mining and POI vertical industry data characteristics, which takes the POI vertical industry classification data stored in the POI sample library as prior information and, by comparing and matching against this prior information, stores in the POI result library the data information and source information contained in vertical industry POI data identified as newly added POIs, thereby greatly reducing labor cost and operation time and improving the efficiency of updating POI electronic map data.
The embodiment also provides a system for updating POI electronic map data based on data mining and POI vertical industry data characteristics, which manages POI sample data, operation process data and result data in a database form, avoids the problems of low efficiency and redundancy of a file data management form, and can also avoid the problem of file loss caused in a data circulation process; the database management mode can be used for conveniently inquiring and displaying data, the safety of the database can be ensured by using authority management, and the subsequent use of POI data results is more flexible and easy to operate; historical version control can be performed by using database management, required tag information can be customized along with feature information of the POI vertical industry, and the accuracy of comparison and matching of the POI vertical industry data is improved.
Drawings
In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly introduced below, and it is obvious that the drawings in the following description are some embodiments of the present invention, and for those skilled in the art, other drawings can be obtained according to these drawings without creative efforts.
Fig. 1 is a flowchart of a method for updating POI electronic map data based on data mining and POI vertical industry data features according to an embodiment of the present invention;
Fig. 2 is a schematic structural diagram of a system for updating POI electronic map data based on data mining and POI vertical industry data features according to an embodiment of the present invention.
Detailed Description
The invention provides a method and a system for updating POI electronic map data based on data mining and POI vertical industry data characteristics, which are used for solving the technical problem of low updating efficiency of the POI electronic map data in the existing method, thereby achieving the purpose of improving the manufacturing and updating efficiency of the POI electronic map data.
In order to achieve the above object, the present invention is generally conceived as follows:
a method for updating POI electronic map data based on data mining technology combined with POI vertical industry data characteristics, comprising the following steps:
(step one) POI vertical industry data characteristic information selection: selecting the POI vertical industries and their data characteristic information;
(step two) POI vertical industry data characteristic information collection and management: including POI data characteristic label identification, classification management, sample checking, import and export;
(step three) POI data characteristic information extraction: automatically extracting the industry characteristic information and POI data information of vertical industry POI data;
(step four) POI inspection: verifying and modifying the POI results, and storing the modified data in the corresponding database;
(step five) POI result management: including import and export of the extraction results, classification management, result viewing and the like.
The invention provides a method and a system for updating POI electronic map data based on a data mining technology and by combining with POI vertical industry data characteristics. The accuracy of vertical industry POI information and the richness of vertical industry POI content expression are utilized, the extracted POI is converted through a geocoding conversion technology, the workload of manual POI collection can be greatly reduced, and sample characteristic information and POI results are efficiently and normatively managed.
In order to make the objects, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are some, but not all, embodiments of the present invention. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
Example one
The embodiment provides a method for updating electronic map data of a POI based on data mining and vertical industry data features of the POI, please refer to fig. 1, and the method includes:
S1: performing geocoding on the POI vertical industry classification data, checking the geocoded data and repairing any problem data found in it, and storing the checked and repaired POI vertical industry classification data in a POI sample library;
S2: reading the POI vertical industry classification data from the POI sample library and identifying its characteristic information, wherein the characteristic information is industry-related information, and storing the identified data in the POI sample library;
S3: reading data from the POI sample library and matching it against target objects in a POI result library according to the POI classifications of the different industries; for vertical industry POI data whose matching result is a newly added POI, storing the data information it contains together with its source information in the POI result library; and storing POI vertical industry data that needs to be verified and modified in a POI operation library; wherein matching against target objects in the POI result library according to the POI classifications of the different industries comprises:
if both the POI attribute information and the characteristic information of the data in the POI sample library are consistent with those of the data in the POI result library, the data in the POI sample library is original POI vertical industry data, and the POI result library is not updated;
if the POI attribute information is consistent but the characteristic information is inconsistent, the data in the POI sample library is data that needs to be verified and modified;
if both the POI attribute information and the characteristic information are inconsistent, the data in the POI sample library is newly added POI vertical industry data, and the newly added POI vertical industry data is updated into the POI result library;
if the POI attribute information is inconsistent but the characteristic information is consistent, the data in the POI sample library is data that needs to be verified and modified;
the data information contained in the POI data comprises POI attribute information and characteristic information, wherein the POI attribute information comprises a POI name, a geocode position and a type;
S4: reading data from the POI operation library, browsing the POI vertical industry data, and checking the POI name, geocoded position, type and characteristic identification information contained in the POI vertical industry data of the extracted target object; if they are consistent, making no modification; if they are inconsistent, modifying them to be consistent and then submitting.
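The check in S4 amounts to a field-by-field comparison followed by a correction where values differ. The sketch below is a minimal illustration, assuming records are dictionaries and that a verified reference record is available, for example one confirmed by an operator. The field names and both function names are hypothetical.

```python
# Fields checked in S4; the key names are hypothetical.
CHECKED_FIELDS = ("name", "position", "poi_type", "features")


def diff_fields(candidate: dict, reference: dict) -> list:
    """List the checked fields whose values differ between the two records."""
    return [f for f in CHECKED_FIELDS if candidate.get(f) != reference.get(f)]


def verify_and_submit(candidate: dict, reference: dict) -> dict:
    """Return the record to submit: unchanged if consistent, otherwise corrected."""
    mismatched = diff_fields(candidate, reference)
    if not mismatched:
        return candidate  # consistent: no modification
    corrected = dict(candidate)  # copy so the original record is left intact
    for f in mismatched:
        corrected[f] = reference.get(f)
    return corrected
```

Returning a corrected copy rather than mutating the input keeps the pre-modification record available, which suits the later queries on modification process information.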
Specifically, S1 obtains POI big data by data mining, classifies the POIs and their characteristic information by vertical industry, and stores the classified data in the POI sample library. S2 identifies the characteristic information of the POIs obtained by data mining and repairs it where needed. S3 uses a POI data information comparison method to match the data read from the POI sample library against the data in the POI result library; newly added data is stored in the POI result library, thereby updating it, and data that needs to be verified and modified is stored in the POI operation library. Finally, the POI vertical industry data of the extracted target object is matched against the previously obtained characteristic information to yield the corresponding POI result data, namely the POI electronic map data.
In one embodiment, after storing the identified data in the POI sample repository in step S2, the method further comprises:
managing the POI sample library, with support for querying, browsing, importing and exporting.
In one embodiment, S1 includes:
matching geocodes to industry data from different sources, and finely classifying and integrating the industry data sources, wherein the industry data sources can be extended along the POI information classification, and the POI data includes data from the catering, hotel, tourism and pharmaceutical industries as well as POI data publicly released by various groups and industry fields;
and checking the credibility of the geocoded POI vertical industry classification data, namely whether it comes from an officially released channel source, and repairing problem data.
Specifically, typical catering industry data sources include Dianping, Meituan and Ele.me; typical hotel industry data sources include Ctrip, Fliggy, eLong and Tuniu; typical tourism industry data sources include Mafengwo, Qunar, Daily Tour and VariFlight; and typical pharmaceutical industry data sources include DiDi errand services, Dingdang Kuaiyao and the hospital appointment platforms of various regions. POI data publicly released by groups and industry fields includes, for example, the POI data and location data published by China Mobile and those published by PetroChina and Sinopec.
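The credibility check in S1 can be sketched as a filter on the channel each record came from. The channel labels, the `position` key and the function name below are assumptions made for illustration; the patent does not specify how an official channel is recognized.

```python
# Hypothetical labels for channels treated as officially released sources.
OFFICIAL_CHANNELS = {"official_site", "official_app"}


def split_by_credibility(records):
    """Separate credible, geocoded records from those that need repair."""
    trusted, problem = [], []
    for rec in records:
        has_geocode = rec.get("position") is not None
        if rec.get("channel") in OFFICIAL_CHANNELS and has_geocode:
            trusted.append(rec)
        else:
            problem.append(rec)  # to be repaired before entering the sample library
    return trusted, problem
```

Only the `trusted` list would be written to the POI sample library directly; the `problem` list would go through repair first.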
When identifying POI characteristic information, a visual interface and editing tools for POI characteristic information are provided for labeling characteristic information samples in the POI vertical industry data, and the samples are managed through the POI sample library. Queries on POI characteristic information identification support several classified query modes, including by identification type, industry source, data source time and POI information attribute.
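The classified query modes for the sample library can be sketched as one filter function with optional criteria. The `Sample` fields and `query_samples` are hypothetical names, and the in-memory list stands in for the POI sample library.

```python
from dataclasses import dataclass
from datetime import date


@dataclass
class Sample:
    name: str
    label_type: str      # identification type
    industry: str        # industry source
    collected_on: date   # data source time
    poi_type: str        # one POI information attribute


def query_samples(samples, label_type=None, industry=None, since=None, poi_type=None):
    """Return the samples that satisfy every criterion that was supplied."""
    def keep(s):
        return ((label_type is None or s.label_type == label_type)
                and (industry is None or s.industry == industry)
                and (since is None or s.collected_on >= since)
                and (poi_type is None or s.poi_type == poi_type))
    return [s for s in samples if keep(s)]
```

Criteria left as `None` are ignored, so the same function serves single-mode and combined queries.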
In one embodiment, the identifying the feature information of the POI vertical industry classification data in step S2 includes:
if the data is catering industry data, the characteristic information comprises dish information, per-capita consumption information, evaluation information, picture information uploaded by a user and service scores, and the characteristic information contained in the catering industry data is identified;
if the data is hotel industry data, the characteristic information comprises guest room information, check-in information, evaluation information, picture information uploaded by a user and service scores, and the characteristic information contained in the hotel industry data is identified;
if the data is tourism industry data, the characteristic information comprises scenic spot pictures, entrance ticket prices, evaluation information and service scores, and the characteristic information contained in the tourism industry data is identified;
if the data is the pharmaceutical industry data, the characteristic information comprises recent order data volume, evaluation information and physician resource introduction information, and the characteristic information contained in the pharmaceutical industry data is identified.
The characteristic information samples of each industry are richer, more detailed and more authoritative in content than electronic map POIs, so they serve as the main basis for distinguishing electronic map POIs, and the characteristics differ from industry to industry. The above lists the general basis for identifying characteristic information, which is the main principle for comparing and matching prior information.
In one embodiment, the method further comprises: data that needs to be verified and modified is verified and modified.
In one embodiment, the method further includes step S5:
and managing data in the POI result library.
Managing data in the POI result library includes classifying and managing the final results of POI vertical industry data extraction, viewing the final results, and the like.
In one embodiment, managing data in a POI outcome repository includes:
querying, browsing, compiling statistics on, importing and exporting data in the POI result library, wherein the queries include: queries by characteristic information identification, queries by data mining channel source, and queries on modification process information.
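Since the result library is managed as a database, the queries and statistics above map naturally onto SQL. The sketch below uses Python's standard `sqlite3` module with an in-memory database. The table name, column names and sample rows are all hypothetical and serve only to show the shape of the queries.

```python
import sqlite3

# In-memory stand-in for the POI result library; the schema is an assumption.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE poi_result ("
    "name TEXT, industry TEXT, feature_label TEXT, "
    "source_channel TEXT, modified_by TEXT)"
)
rows = [
    ("Cafe A", "catering", "service_score", "channel_1", None),
    ("Hotel B", "hotel", "room_info", "channel_2", "operator_1"),
    ("Cafe C", "catering", "dish_info", "channel_1", "operator_2"),
]
conn.executemany("INSERT INTO poi_result VALUES (?, ?, ?, ?, ?)", rows)


def query_by_source(channel):
    """Query by data mining channel source."""
    cur = conn.execute(
        "SELECT name FROM poi_result WHERE source_channel = ? ORDER BY name",
        (channel,),
    )
    return [r[0] for r in cur.fetchall()]


def count_by_industry():
    """Information statistics: number of result records per industry."""
    cur = conn.execute(
        "SELECT industry, COUNT(*) FROM poi_result GROUP BY industry ORDER BY industry"
    )
    return dict(cur.fetchall())


def modified_records():
    """Query modification process information: records changed during checking."""
    cur = conn.execute(
        "SELECT name, modified_by FROM poi_result "
        "WHERE modified_by IS NOT NULL ORDER BY name"
    )
    return cur.fetchall()
```

A query by characteristic information identification would follow the same pattern with a `WHERE feature_label = ?` clause.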
Example two
Based on the same inventive concept, the present embodiment provides a system for updating electronic map data of a POI based on data mining and vertical industry data features of the POI, please refer to fig. 2, the system includes:
the POI data classification selection sub-module is used for carrying out geographic coding processing on the POI vertical industry classification data, checking the POI vertical industry classification data after the coding processing and repairing problem data in the POI vertical industry classification data, wherein the checked and repaired POI vertical industry classification data are stored in a POI sample library;
the POI characteristic information identification selection submodule is used for reading POI vertical industry classification data in the POI sample library and identifying the characteristic information of the POI vertical industry classification data, wherein the characteristic information is information related to industries, and the identified data is stored in the POI sample library;
the POI prior information comparison submodule is used for reading data in the POI sample library, matching a target object in the POI result library according to the classification of POIs in different industries, storing data information and source information contained in vertical industry POI data of which the target object matching source is a newly-added POI into the POI result library together, and storing the POI vertical industry data which needs to be verified and modified into the POI operation library, wherein the matching of the target object in the POI result library is carried out according to the classification of the POIs in different industries, and comprises the following steps:
if the data in the POI sample library is consistent with the POI attribute information and the characteristic information of the data in the POI result library, indicating that the data in the POI sample library is original POI vertical industry data, and not updating the POI result library;
if the data of the POI sample library is consistent with the POI attribute information but inconsistent with the characteristic information of the data in the POI result library, the data in the POI sample library is the data needing to be verified and modified;
if the data of the POI sample library is inconsistent with the POI attribute information and the characteristic information of the data in the POI result library, indicating that the data in the POI sample library is newly added POI vertical industry data, and updating the newly added POI vertical industry data into the POI result library;
if the data of the POI sample library is inconsistent with the POI attribute information but consistent with the characteristic information of the data in the POI result library, the data in the POI sample library is the data needing to be verified and modified;
the data information contained in the POI data comprises POI attribute information and characteristic information, wherein the POI attribute information comprises a POI name, a geocode position and a type;
and the POI checking sub-module is used for reading data in the POI operation library, browsing the POI vertical industry data, matching the POI name, the geocode position, the type and the characteristic identification information contained in the POI vertical industry data of the extracted target object, if the POI name, the geocode position, the type and the characteristic identification information are consistent, not modifying them, and if they are inconsistent, modifying the POI vertical industry data to be consistent.
In one embodiment, the system further comprises a POI result management submodule for managing data in the POI result library.
Specifically, the POI data classification selection sub-module is mainly used for automated, program-driven geocoding of data that conforms to the POI vertical industry classification, and for manual checking of data that does not conform to the POI classification selection standard.
The POI characteristic information identification selection submodule is used for identifying and repairing the characteristic information of the POI acquired by the data mining technology.
The POI prior information comparison submodule is used for matching and comparing data in the POI sample library against the original POIs in the POI result library, and mainly covers the following situations:
if the data in the POI sample library is consistent with the POI attribute information and the characteristic information in the POI result library, the data is regarded as original POI vertical industry data, and the POI result library is not updated;
if the data of the POI sample library is consistent with the POI attribute information in the POI result library but the characteristic information is inconsistent, the data needs to be transferred to the POI operation library for manual verification and repair, and is transferred to the POI result library after it is checked to be qualified;
if the data of the POI sample library is inconsistent with the POI attribute information and the characteristic information in the POI result library, the data is regarded as newly added POI vertical industry data, and the newly added POI vertical industry data is transferred to the POI result library;
and if the data of the POI sample library is inconsistent with the POI attribute information in the POI result library and the characteristic information is consistent, the data needs to be transferred to the POI operation library to be verified and repaired manually, and the data is transferred to the POI result library after being checked to be qualified.
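The four situations above amount to a small decision table over two comparisons. The sketch below is illustrative only: it assumes that both mixed cases (only one of the two comparisons consistent) are routed to manual verification, it assumes simple equality as the consistency test, and the names `Action`, `route`, `compare` and the record keys are hypothetical.

```python
from enum import Enum


class Action(Enum):
    KEEP_EXISTING = "original data; result library not updated"
    ADD_TO_RESULT = "newly added data; written to the result library"
    SEND_TO_OPERATION = "manual verification in the operation library"


def route(attributes_match: bool, features_match: bool) -> Action:
    """Map the two comparison outcomes to one of the three destinations."""
    if attributes_match and features_match:
        return Action.KEEP_EXISTING
    if not attributes_match and not features_match:
        return Action.ADD_TO_RESULT
    # Exactly one of the two comparisons is consistent.
    return Action.SEND_TO_OPERATION


# Hypothetical keys for the POI attribute information (name, geocode, type).
ATTRIBUTE_KEYS = ("name", "geocode", "type")


def compare(sample: dict, existing: dict) -> Action:
    """Compare a sample-library record with a result-library candidate."""
    attributes_match = all(
        sample.get(key) == existing.get(key) for key in ATTRIBUTE_KEYS
    )
    features_match = sample.get("features") == existing.get("features")
    return route(attributes_match, features_match)
```

How a candidate record is selected from the POI result library, and how tolerant the consistency test is (for example for nearby coordinates or name variants), is left open here.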
Data that requires verification and repair is passed to the POI checking sub-module. The POI checking sub-module is used for human-computer interactive checking and modification of the POI data information and the POI characteristic information, and provides a warehousing function.
And the POI result management submodule is used for management, query, statistics, import, export and the like of result data.
The following describes in detail the update system of electronic map data of POI based on data mining and vertical industry data features of POI by using specific examples.
As shown in FIG. 2, the POI vertical industry data feature update operation system is used for task state flow management among data processing sub-modules.
After a POI vertical industry data classification task is newly established on the basis of POI big data acquired by an external source data mining technology, a POI data classification selection sub-module is started;
The POI data classification selection sub-module first checks whether the POI big data classification provided by the big data mining technology is accurate, whether the feature information labels of the POI vertical industry (such as consumption labels, scoring labels and the like) are all provided, and whether the data capture source (which industry subclass it belongs to) is clearly recorded. It discards or deletes POI data classifications and feature information that are ambiguous, and verifies and modifies POI data classifications that are partly inaccurate. The processed POI vertical industry data is then synchronized to the POI sample library, and the task state 'data classification processing completed' is sent to the POI vertical industry data characteristic updating operation system.
The POI data characteristic information selection sub-module is then started. It checks whether the characteristic information labels of the POI vertical industry data meet the admission requirements of the industry characteristic labels, discards or deletes the labels that do not, and verifies and corrects inaccurately worded POI labels. The processed POI vertical industry data is synchronized to the POI sample library, and the task state 'information identification processing completed' is sent to the POI vertical industry data characteristic updating operation system.
Once all the POI vertical industry data has passed classification confirmation and feature information confirmation in the POI data classification selection sub-module and the POI data feature information selection sub-module, the POI vertical industry data feature updating operation system starts the POI prior information comparison sub-module. This sub-module reads the POI sample library and matches each target object against the POIs in the result database (POI result library). POI vertical industry data whose matching result is a complete match is added to the result database as new data, while POI vertical industry data whose matching result is a partial match is transferred to the POI operation library. The task state 'comparison information processing completed' is then sent to the POI vertical industry data feature updating operation system.
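The checks performed by the POI data classification selection sub-module can be sketched as a screening function. This is a minimal sketch under stated assumptions: the record keys, the `KNOWN_CATEGORIES` set, and the choice to discard records with missing labels or sources (rather than send them for review) are hypothetical and are not prescribed by the text.

```python
# Hypothetical category names for the vertical industries named in the text.
KNOWN_CATEGORIES = {"catering", "hotel", "tourism", "pharmaceutical"}


def screen(record: dict, required_tags: list) -> str:
    """Return 'discard', 'review' or 'accept' for one mined POI record."""
    category = record.get("category")
    tags = record.get("tags", {})
    if not category or not record.get("source"):
        # Ambiguous classification or unrecorded capture source.
        return "discard"
    if any(tag not in tags for tag in required_tags):
        # Assumption: incomplete feature labels are treated as ambiguous.
        return "discard"
    if category not in KNOWN_CATEGORIES:
        # A classification is present but may be inaccurate: verify manually.
        return "review"
    return "accept"
```

Only records that come back as 'accept', or that pass manual review, would be synchronized to the POI sample library.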
The POI checking sub-module checks the data processed by the other sub-modules. After the POI prior information comparison sub-module has finished, the POI vertical industry data feature updating operation system starts the POI checking sub-module. The system reads the POI vertical industry data in the POI operation library and displays its geographic position and data attributes; the feature information corresponding to the data can be displayed step by step, and the target is verified against the data. Data information or feature label information that needs to be modified can be manually edited, modified and saved. After the POI checking sub-module has finished, the data is transferred uniformly to the result database through the POI vertical industry data feature updating operation system.
The POI result management sub-module is started through the POI vertical industry data characteristic updating operation system, so that the POI vertical industry data can be queried, browsed, counted, imported and exported; wherein the query comprises: characteristic information identification queries, data mining channel source queries, modification process information queries, and the like. The browsing operation provides a visual display of the query result; the importing operation imports external POI data into the POI sample library; the export operation exports the queried content into a variety of selectable formats.
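The task state flow among the sub-modules described above can be sketched as a fixed sequence of stages, each reporting a completion state to the operation system. The sketch is illustrative: the first three state strings follow the text, while the fourth state string, the stage names and the `OperationSystem` class are assumptions.

```python
# (stage name, completion state reported to the operation system)
STAGES = [
    ("classification", "data classification processing completed"),
    ("feature_identification", "information identification processing completed"),
    ("prior_comparison", "comparison information processing completed"),
    ("checking", "checking processing completed"),  # assumed state name
]


class OperationSystem:
    """Runs the sub-modules in order and records each reported task state."""

    def __init__(self):
        self.states = []

    def run(self, handlers: dict, task):
        # Each handler takes the task data and returns the processed data.
        for stage, done_state in STAGES:
            task = handlers[stage](task)
            self.states.append(done_state)
        return task
```

A caller would supply one handler per stage, for example a function that screens classifications for `classification` and one that routes records between libraries for `prior_comparison`.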
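The query and export operations can be illustrated with a small in-memory sketch. The record layout, the `query` and `export_json` helpers and the choice of JSON as one export format are assumptions made for illustration; the text does not name specific formats or a storage engine.

```python
import json


def query(records, feature_tag=None, source=None):
    """Filter result records by characteristic label and/or mining source."""
    hits = []
    for record in records:
        if feature_tag is not None and feature_tag not in record.get("features", {}):
            continue
        if source is not None and record.get("source") != source:
            continue
        hits.append(record)
    return hits


def export_json(records):
    """Serialize query results; ensure_ascii=False keeps non-ASCII POI names."""
    return json.dumps(records, ensure_ascii=False, indent=2)
```

For example, `export_json(query(records, feature_tag="ticket_price"))` would export every record that carries a ticket price label.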
The production operation process for updating the operation system based on the POI vertical industry data characteristics is as follows:
First, the POI big data acquired by the data mining technology is classified by vertical industry and its feature information is identified; the POI classification and the POI feature information are audited, and the processed data is stored in the POI sample library.
Second, the data in the POI sample library is compared and matched against the prior information; POI data that completely conforms to the matching rules is stored in the POI result database, and data that only partly conforms to the matching rules is stored in the POI operation library.
After these two processes are completed, only the data in the POI operation library needs to be verified and modified, and the data can be transferred to the POI result database after manual confirmation.
Functions and effects of the embodiments
According to the method and the system for updating POI electronic map data based on the data mining technology and the POI vertical industry data characteristics, the POI vertical industry data characteristic updating operation system processes data according to the process steps of the sub-modules. This unifies POI data production, realizes data synchronization and information flow among the databases, and effectively achieves large-scale, standardized management of POI data production.
The operation system of the embodiment also manages POI sample data, operation process data and result data in database form. This avoids the low efficiency and redundancy of file-based data management, as well as the loss of files during data circulation. Database management makes it convenient to query and display data, permission management ensures the security of the database, and the subsequent use of the POI data results is more flexible and easier to operate. Database management also allows historical version control, the required label information can be customized according to the feature information of the POI vertical industry, and the accuracy of comparison and matching of the POI vertical industry data is improved.
According to the method for updating POI electronic map data based on the data mining technology and combined with the POI vertical industry data characteristics, the prior information comparison and matching method writes data that completely meets the conditions directly into the database, which greatly reduces labor cost and operation time. Only data that partly meets the conditions is verified and its extraction result modified, so manual identification is simple and the probability of operation errors is reduced.
After the operation system of the embodiment is applied, standardized management of the operation process is realized in terms of flow management; in terms of data management, the security of data is ensured and the error rate of manual operation is reduced; in terms of productivity, the operation time is shortened, the amount of manual work is reduced, and labor cost is saved.
As will be appreciated by one skilled in the art, embodiments of the present invention may be provided as a method, system, or computer program product. Accordingly, the present invention may take the form of an entirely hardware embodiment, an entirely software embodiment or an embodiment combining software and hardware aspects. Furthermore, the present invention may take the form of a computer program product embodied on one or more computer-usable storage media (including, but not limited to, disk storage, CD-ROM, optical storage, and the like) having computer-usable program code embodied therein.
The present invention is described with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems), and computer program products according to embodiments of the invention. It will be understood that each flow and/or block of the flow diagrams and/or block diagrams, and combinations of flows and/or blocks in the flow diagrams and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to a processor of a general purpose computer, special purpose computer, embedded processor, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, create means for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
While preferred embodiments of the present invention have been described, additional variations and modifications in those embodiments may occur to those skilled in the art once they learn of the basic inventive concepts. Therefore, it is intended that the appended claims be interpreted as including preferred embodiments and all such alterations and modifications as fall within the scope of the invention.

Claims (9)

1. A method for updating POI electronic map data based on data mining and POI vertical industry data features is characterized by comprising the following steps:
s1: carrying out geographic coding processing on the POI vertical industry classification data, checking the coded POI vertical industry classification data and repairing problem data in the coded POI vertical industry classification data, wherein the checked and repaired POI vertical industry classification data are stored in a POI sample library;
s2: reading POI vertical industry classification data in a POI sample library, and identifying characteristic information of the POI vertical industry classification data, wherein the characteristic information is information related to industry, and the identified data is stored in the POI sample library;
s3: reading data in a POI sample library, matching a target object in a POI result library according to the classification of POIs in different industries, storing data information and source information contained in vertical industry POI data of which the target object matching source is a newly-added POI into the POI result library together, and storing the POI vertical industry data which needs to be verified and modified into a POI operation library, wherein the matching of the target object in the POI result library according to the classification of the POIs in different industries comprises the following steps:
if the data in the POI sample library is consistent with the POI attribute information and the characteristic information of the data in the POI result library, indicating that the data in the POI sample library is original POI vertical industry data, and not updating the POI result library;
if the data of the POI sample library is consistent with the POI attribute information but inconsistent with the characteristic information of the data in the POI result library, the data in the POI sample library is the data needing to be verified and modified;
if the data of the POI sample library is inconsistent with the POI attribute information and the characteristic information of the data in the POI result library, indicating that the data in the POI sample library is newly added POI vertical industry data, and updating the newly added POI vertical industry data into the POI result library;
if the data of the POI sample library is inconsistent with the POI attribute information but consistent with the characteristic information of the data in the POI result library, the data in the POI sample library is the data needing to be verified and modified;
the data information contained in the POI data comprises POI attribute information and characteristic information, wherein the POI attribute information comprises a POI name, a geocode position and a type;
s4: reading data in the POI operation library, browsing the POI vertical industry data, matching the POI name, the geocode position, the type and the characteristic information contained in the POI vertical industry data of the extracted target object, if the POI name, the geocode position, the type and the characteristic information are consistent, not modifying the POI name, the geocode position, the type and the characteristic information, and if the POI name, the geocode position, the type and the characteristic information are inconsistent, modifying the POI vertical industry data to be consistent, and then submitting the.
2. The method of claim 1, wherein after storing the identified data in the POI sample repository at step S2, the method further comprises:
and the POI sample library is managed, and inquiry, browsing, importing and exporting are supported.
3. The method of claim 1, wherein S1 includes:
according to the industry data of different sources, geographic codes are matched, and the industry data sources are finely classified and integrated, wherein the industry data sources can be expanded to POI information classification, and the POI data comprises POI data which are externally disclosed in the catering industry, the hotel industry, the travel industry, the medicine industry, all groups and all industry fields;
and checking the credibility of the POI vertical industry classification data after the coding processing, namely whether the data comes from an officially published channel source, and repairing the problem data.
4. The method of claim 1, wherein the step of identifying characteristic information of the POI vertical industry classification data in S2 comprises:
if the data is catering industry data, the characteristic information comprises dish information, per-capita consumption information, evaluation information, picture information uploaded by a user and service scores, and the characteristic information contained in the catering industry data is identified;
if the data is hotel industry data, the characteristic information comprises guest room information, check-in information, evaluation information, picture information uploaded by a user and service scores, and the characteristic information contained in the hotel industry data is identified;
if the data is tourism industry data, the characteristic information comprises scenic spot pictures, entrance ticket prices, evaluation information and service scores, and the characteristic information contained in the tourism industry data is identified;
if the data is the pharmaceutical industry data, the characteristic information comprises recent order data volume, evaluation information and physician resource introduction information, and the characteristic information contained in the pharmaceutical industry data is identified.
5. The method of claim 1, wherein the method further comprises: data that needs to be verified and modified is verified and modified.
6. The method according to claim 1, characterized in that the method further comprises step S5:
and managing data in the POI result library.
7. The method of claim 6, wherein managing data in the POI result library comprises:
querying, browsing, compiling statistics on, importing and exporting data in the POI result library, wherein the query comprises: characteristic information identification queries, data mining channel source queries, and modification process information queries.
8. A POI electronic map data updating system based on data mining and POI vertical industry data characteristics is characterized by comprising:
the POI data classification selection sub-module is used for carrying out geographic coding processing on the POI vertical industry classification data, checking the POI vertical industry classification data after the coding processing and repairing problem data in the POI vertical industry classification data, wherein the checked and repaired POI vertical industry classification data are stored in a POI sample library;
the POI characteristic information identification selection submodule is used for reading POI vertical industry classification data in the POI sample library and identifying the characteristic information of the POI vertical industry classification data, wherein the characteristic information is information related to industries, and the identified data is stored in the POI sample library;
the POI prior information comparison submodule is used for reading data in the POI sample library, matching a target object in the POI result library according to the classification of POIs in different industries, storing data information and source information contained in vertical industry POI data of which the target object matching source is a newly-added POI into the POI result library together, and storing the POI vertical industry data which needs to be verified and modified into the POI operation library, wherein the matching of the target object in the POI result library is carried out according to the classification of the POIs in different industries, and comprises the following steps:
if the data in the POI sample library is consistent with the POI attribute information and the characteristic information of the data in the POI result library, indicating that the data in the POI sample library is original POI vertical industry data, and not updating the POI result library;
if the data of the POI sample library is consistent with the POI attribute information but inconsistent with the characteristic information of the data in the POI result library, the data in the POI sample library is the data needing to be verified and modified;
if the data of the POI sample library is inconsistent with the POI attribute information and the characteristic information of the data in the POI result library, indicating that the data in the POI sample library is newly added POI vertical industry data, and updating the newly added POI vertical industry data into the POI result library;
if the data of the POI sample library is inconsistent with the POI attribute information but consistent with the characteristic information of the data in the POI result library, the data in the POI sample library is the data needing to be verified and modified;
the data information contained in the POI data comprises POI attribute information and characteristic information, wherein the POI attribute information comprises a POI name, a geocode position and a type;
and the POI checking sub-module is used for reading data in the POI operation library, browsing the POI vertical industry data, matching the POI name, the geocode position, the type and the characteristic information contained in the POI vertical industry data of the extracted target object, if the POI name, the geocode position, the type and the characteristic information are consistent, not modifying the POI name, the geocode position, the type and the characteristic information, if the POI name, the geocode position, the type and the characteristic information are inconsistent, modifying the POI vertical industry data to be consistent.
9. The system of claim 8, further comprising a POI result management sub-module for managing data in the POI result library.
CN202011189393.7A 2020-10-30 2020-10-30 Method and system for updating POI electronic map data based on data mining and POI vertical industry data characteristics Active CN112015850B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202011189393.7A CN112015850B (en) 2020-10-30 2020-10-30 Method and system for updating POI electronic map data based on data mining and POI vertical industry data characteristics

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202011189393.7A CN112015850B (en) 2020-10-30 2020-10-30 Method and system for updating POI electronic map data based on data mining and POI vertical industry data characteristics

Publications (2)

Publication Number Publication Date
CN112015850A true CN112015850A (en) 2020-12-01
CN112015850B CN112015850B (en) 2021-02-19

Family

ID=73527721

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202011189393.7A Active CN112015850B (en) 2020-10-30 2020-10-30 Method and system for updating POI electronic map data based on data mining and POI vertical industry data characteristics

Country Status (1)

Country Link
CN (1) CN112015850B (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113627535A (en) * 2021-08-12 2021-11-09 福建中信网安信息科技有限公司 Data grading classification system and method based on data security and privacy protection

Also Published As

Publication number Publication date
CN112015850B (en) 2021-02-19

Similar Documents

Publication Publication Date Title
US9311334B2 (en) Geospatial database integration using business models
CN101324439B (en) Navigation apparatus for searching interest point and method for searching interest point
CN113434623A (en) Fusion method based on multi-source heterogeneous space planning data
CN112883042A (en) Data updating and displaying method and device, electronic equipment and storage medium
CN111008253A (en) Data model generation method, data warehouse generation device and electronic equipment
CN112000773A (en) Data association relation mining method based on search engine technology and application
CN112633822B (en) Asset management method based on digital twin technology, storage medium and mobile terminal
CN110728422A (en) Building information model, method, device and settlement system for construction project
WO2023241519A1 (en) Bim component creation method and apparatus, and digital design resource library application method and apparatus
CN111143478A (en) Two-three-dimensional file association method based on lightweight model and engineering object bit number
CN110688434B (en) Method, device, equipment and medium for processing interest points
CN111563103A (en) Method and system for detecting data blood margin
CN111553556A (en) Business data analysis method and device, computer equipment and storage medium
CN112015850B (en) Method and system for updating POI electronic map data based on data mining and POI vertical industry data characteristics
CN111414410A (en) Data processing method, device, equipment and storage medium
CN114637740A (en) Novel map platform construction method based on knowledge representation and knowledge extraction
CN116303641B (en) Laboratory report management method supporting multi-data source visual configuration
CN113626558A (en) Intelligent recommendation-based field standardization method and system
CN103309888A (en) Method and device for verifying data of electronic map
CN116010439A (en) Visual Chinese SQL system and query construction method
CN115114297A (en) Data lightweight storage and search method and device, electronic equipment and storage medium
CN114969115A (en) Data management method and system based on standardized metadata system
CN112052309A (en) Text data retrieval method, related equipment and readable storage medium
CN115146604B (en) Method, device, equipment and storage medium for generating interface technical document
CN112925856B (en) Entity relationship analysis method, entity relationship analysis device, entity relationship analysis equipment and computer storage medium

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant