CN111950243B - FME-based standard topographic map outer frame information extraction method and system - Google Patents

FME-based standard topographic map outer frame information extraction method and system Download PDF

Info

Publication number
CN111950243B
CN111950243B CN202010834185.1A CN202010834185A CN111950243B CN 111950243 B CN111950243 B CN 111950243B CN 202010834185 A CN202010834185 A CN 202010834185A CN 111950243 B CN111950243 B CN 111950243B
Authority
CN
China
Prior art keywords
data
topographic map
standard
information
data information
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202010834185.1A
Other languages
Chinese (zh)
Other versions
CN111950243A (en
Inventor
项超超
郭思逸
李美丽
郦优青
林金双
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Taizhou Geographic Information Surveying And Mapping Center Co ltd
Original Assignee
Taizhou Geographic Information Surveying And Mapping Center Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Taizhou Geographic Information Surveying And Mapping Center Co ltd filed Critical Taizhou Geographic Information Surveying And Mapping Center Co ltd
Priority to CN202010834185.1A priority Critical patent/CN111950243B/en
Publication of CN111950243A publication Critical patent/CN111950243A/en
Application granted granted Critical
Publication of CN111950243B publication Critical patent/CN111950243B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/10Text processing
    • G06F40/166Editing, e.g. inserting or deleting
    • G06F40/177Editing, e.g. inserting or deleting of tables; using ruled lines
    • G06F40/18Editing, e.g. inserting or deleting of tables; using ruled lines of spreadsheets
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/22Indexing; Data structures therefor; Storage structures
    • G06F16/2282Tablespace storage structures; Management thereof
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/29Geographical information databases
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T11/002D [Two Dimensional] image generation
    • G06T11/60Editing figures and text; Combining figures or text

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Databases & Information Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • General Health & Medical Sciences (AREA)
  • Computational Linguistics (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Artificial Intelligence (AREA)
  • Software Systems (AREA)
  • Health & Medical Sciences (AREA)
  • Remote Sensing (AREA)
  • Processing Or Creating Images (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Instructional Devices (AREA)

Abstract

The application relates to a standard topographic map outer frame information extraction method and system based on FME, wherein the coordinate point of one corner of the standard topographic map is configured into the file name in the standard topographic map; obtaining a standard topographic map, importing the standard topographic map into an x-y coordinate system, and extracting coordinate points in the naming of a standard topographic map file to determine a coordinate range occupied by the standard topographic map in the x-y coordinate system; determining an out-of-frame area on a standard topographic map, and extracting data information; filling data information into a pre-established data table; a data table is exported. The data information in the standard topographic map is extracted and packaged into the data form through the FME software, so that personnel are not required to manually input the data information in the standard topographic map, and the method is convenient; and through the standard detection of the data information in the standard topographic text region and the abnormal detection of the data information in the data table, the accuracy of the data table is further ensured.

Description

FME-based standard topographic map outer frame information extraction method and system
Technical Field
The application relates to the technical field of geographic mapping, in particular to a standard topographic map outline information extraction method and system based on FME.
Background
At present, after a person draws a standard topographic map, information on the standard topographic map is required to be stored in a database, wherein the database comprises data such as a map name, a map number, an inspector and the like, and the existing information storing operation is that the person manually identifies the information on the standard topographic map and then records the information on a data table.
For the related technology, the inventor considers that the efficiency of manual warehousing of the information of the standard topographic map by personnel is low, and the probability of generating abnormality of the manual data information is high.
Disclosure of Invention
In order to improve the extraction efficiency of information on a standard topographic map and reduce errors, the application provides an FME-based standard topographic map outer frame information extraction method.
The standard topographic map outer frame information extraction method based on the FME adopts the following technical scheme:
a standard topographic map outer frame information extraction method based on FME comprises the following steps;
exporting a standard topographic map with standard graphic dimensions through CASS software, and configuring the coordinate point of one corner of the standard topographic map into the file name in the standard topographic map;
obtaining a standard topographic map, importing the standard topographic map into a pre-established x-y coordinate system, and extracting coordinate points in file naming of the standard topographic map to determine a coordinate range occupied by the standard topographic map in the x-y coordinate system;
determining an in-frame area and an out-frame area on the standard topographic map based on the acquired standard topographic map coordinate range, and extracting data information in a plurality of data extraction areas pre-established in the out-frame area;
correspondingly filling the data information into the position of the data information in the pre-established data table according to the data extraction area;
a data table is exported.
Through adopting above-mentioned technical scheme, personnel only need with the leading-in FME of standard topography, fix a position standard topography through FME internal module, confirm the position that data information is located on the standard topography, write into the data form with data information, derive the data form at last, need not personnel's manual work and enter data form information, information extraction efficiency to the standard topography is high, the rate of accuracy is high.
Preferably, determining an in-frame area and an out-frame area on the standard topographic map based on the acquired coordinate range of the standard topographic map, and extracting data information in a plurality of data extraction areas pre-established in the out-frame area, wherein the method further comprises the following steps;
and eliminating information in the intersection area of the area in the picture frame and the standard topographic map.
Through adopting above-mentioned technical scheme, this operation is based on the data information that this application draws, all is the data information that marks outside the picture frame, and is irrelevant with standard topography picture content, and the existence of standard topography picture content can occupy software operation space, and this action can reduce the load of software operation, makes its processing flow more smooth and easy, improves its processing efficiency.
Preferably, after determining an in-frame area and an out-of-frame area on the standard topography based on the obtained standard topography coordinate range, and extracting data information in a plurality of data extraction areas pre-established in the out-of-frame area, and correspondingly filling the data information into the position of the data information in the pre-established data table according to the data extraction areas, the method comprises the following steps;
according to the position in the standard topographic map Yu Yu file folder, acquiring the adjacent standard topographic map below the standard topographic map, extracting data information and filling the data information into a data table;
the above operation is repeated until all the standard topographic map in the pre-built folder is processed.
By adopting the technical scheme, the method sequentially fills the data information in the plurality of standard topographic maps in the pre-built folder into the table, and sequentially orders and writes the plurality of standard topographic maps into the data table according to the order in the pre-built folder, so that personnel can conveniently and quickly find out the corresponding standard topographic map according to the position of the information in the data table.
Preferably, in the position in the file folder built according to the standard topographic map Yu Yu, acquiring the adjacent standard topographic map below the position, repeating the above operation until the standard topographic map in the pre-built file folder is completely processed, and further comprising the following steps;
and detecting the number of words of the data information in the data table, comparing the detected number of words with a preset reference value in the data table, deriving the data table if the number of words of the data information meets the requirement, and screening abnormal data information if the number of words of the detected data information does not meet the requirement, namely, the abnormal data information occurs.
By adopting the technical scheme, the data information is conveniently screened, and the personnel can conveniently analyze and process the data information to find out the nonstandard drawing, so that the personnel can conveniently modify the data information, and meanwhile, the information accuracy of the derived data table is ensured.
Preferably, after screening out abnormal data information, the method further comprises the following steps; the personnel judges the data information again, if the data information is correct, a data table is derived, and if the data information is wrong, an error mark is written into the data table.
By adopting the technical scheme, the screened data information is abnormal information, the total amount of the screened data information is less, the workload of personnel is low, the data information is further identified by manual screening and is not immediately identified as error information, and the situation that some error judgment occurs because the special information does not accord with the screening specification is avoided by manual re-judgment.
Preferably, the method further comprises the following steps after writing the error mark;
and determining the picture name according to the position of the data table where the error mark is located, finding out the corresponding standard topographic map according to the picture name, and deriving the standard topographic map.
By adopting the technical scheme, the standard topographic map is automatically derived according to the error mark, so that personnel do not need to manually search the standard topographic map, and the method is more convenient.
Preferably, after determining the graph name according to the position of the error mark, finding out the corresponding standard topographic map according to the graph name and deriving the standard topographic map, the method further comprises the following steps;
and (3) delivering the derived standard topographic map to personnel for modification, after the personnel modification is finished, re-importing the standard topographic map into a pre-built folder and covering the original standard topographic map, re-extracting modified data information and covering the data information on the data table, and exporting the data table after word number detection and personnel judgment are finished.
By adopting the technical scheme, the personnel modified standard topographic map can directly cover the wrong data information through FME import, and the personnel is not required to manually modify the wrong data information, so that the method is more convenient.
Preferably, in determining an in-frame area and an out-frame area on the standard topography based on the obtained standard topography coordinate range, and extracting data information in a plurality of data extraction areas pre-established in the out-frame area, the method further comprises the following steps;
color number detection is carried out on the data extraction area and the data detection area pre-established around the data extraction area, and the detection result is compared with an internal preset value to judge whether the detection is passed or not;
if the detection is passed, extracting data information in the data extraction area;
if the detection is not passed, the error mark is directly stored in the FME to replace the data information to be extracted.
By adopting the technical scheme, the color detection is used for detecting the divided area, if the color number of the data extraction area is 1, the data extraction area is blank, the standard topographic map is not in accordance with the regulation, if the color number of the data detection area beside the data extraction area is 2, the text super-area or information misplaced of the standard topographic map is indicated, the modification is needed, the non-standard area of the data information can be detected in the pre-detection step, the non-standard topographic map is marked, the non-standard topographic map is led out, the convenience is realized, the screening force of the non-standard topographic map is improved again, and the accuracy of the information of the data table is improved.
On the other hand, the application provides a standard topographic map outer frame information extraction system based on FME, adopts following technical scheme:
an FME-based standard topography outline information extraction system, comprising;
a configuration module for exporting a standard topographic map with standard graphic dimensions through CASS software and configuring the coordinate point of one corner of the standard topographic map to the file name in the standard topographic map;
the importing module is used for acquiring a standard topographic map, importing the standard topographic map into a pre-established x-y coordinate system, and extracting coordinate points in file naming of the standard topographic map to determine a coordinate range occupied by the standard topographic map in the x-y coordinate system;
an information identification extraction module for determining an in-frame area and an out-of-frame area on the standard topography based on the obtained standard topography coordinate range, and extracting data information in a plurality of data extraction areas pre-established in the out-of-frame area;
the information writing module is used for correspondingly filling the data information into the position of the data information in the pre-established data table according to the data extraction area;
and an export module to export the data table.
By adopting the technical scheme, the system can be used for executing the data information extraction method outside the standard topographic map frame based on the FME without manually inputting the data form information by personnel, so that the extraction efficiency and the accuracy of the information of the standard topographic map are improved.
In summary, the present application includes at least one of the following beneficial technical effects:
the data information in the standard topographic map is extracted and packaged into the data form through the FME software, so that personnel are not required to manually input the data information in the standard topographic map, and the method is convenient;
and through the standard detection of the data information in the standard topographic map data extraction area and the abnormal and error detection of the data information in the data table, the accuracy of the data table is further ensured.
Drawings
Fig. 1 is a flowchart of a first part of a standard topographic map outline border information extraction method based on FME according to an embodiment of the present application.
Fig. 2 is a second partial flowchart of a standard topographic map outline information extraction method based on FME according to an embodiment of the present application.
Fig. 3 is a flowchart of a standard topography outline border information extraction system based on FME according to an embodiment of the present application.
Reference numerals illustrate: 1. a configuration module; 2. an import module; 3. an information identification and extraction module; 4. an information writing module; 5. a derivation module; 6. a data extraction region specification judging module; 7. an abnormal data detection judging module; 71. an exception handling module; 8. and an error processing module.
Detailed Description
The present application is described in further detail below in conjunction with figures 1-3.
One of the purposes of the embodiment of the application is to disclose a standard topographic map outer frame information extraction method based on FME. Referring to fig. 1 and 2, a standard topographic map outer frame information extraction method based on FME includes the steps of;
step S1: exporting a standard topographic map with standard graphic dimensions through CASS software, and configuring the coordinate point of one corner of the standard topographic map into the file name in the standard topographic map;
specifically, firstly, a standard topographic map is derived and drawn through CASS software, coordinate points in the drawn standard topographic map are obtained, coordinates of the points are coordinate points of the lower left corner of the standard topographic map according to naming standards in the geographical mapping industry, the standard topographic map is derived, and the coordinate points of the lower left corner of the standard topographic map are configured to the map name of the standard topographic map. And then, the personnel classifies the derived standard topographic map according to the measurement drawing area, puts the standard topographic map belonging to the same area into the same pre-built folder, and sorts the standard topographic map according to the size of the map name coordinate.
Step S2: obtaining a standard topographic map, importing the standard topographic map into a pre-established x-y coordinate system, and extracting coordinate points in file naming of the standard topographic map to determine a coordinate range occupied by the standard topographic map in the x-y coordinate system;
specifically, standard topographic maps in the ordered pre-built folders are sequentially imported into an FME, an x-y coordinate system matched with the coordinate system in the standard topographic map is pre-built in the FME, the FME places the standard topographic map in the x-y coordinate system, coordinate points are extracted from the names of the standard topographic map, and the positions of the points are located in the x-y coordinate system according to the extracted coordinate points.
Step S3: determining an in-frame area and an out-frame area on the standard topographic map based on the acquired standard topographic map coordinate range, and extracting data information in a plurality of data extraction areas pre-established in the out-frame area;
specifically, according to the standard topographic map size specification, the sizes of the standard topographic maps in the x-y coordinate system in the pre-built folder are the same, and the size and shape of each standard topographic map needing to be derived information and the occupied area of each standard topographic map in the x-y coordinate system are the same, so that a rectangular map frame surrounding the standard topographic map can be generated in the FME software x-y coordinate system according to addition and subtraction of coordinates by taking the lower left corner coordinate of the standard topographic map as a reference, and the standard topographic map is divided into an in-frame area and an out-of-frame area. After the frame is generated, the standard topographic map of the part intersected with the frame, namely the region in the frame, namely the standard topographic map part positioned in the frame, is removed by using a deleting program. The operation is based on the required extracted data information, and the data information is marked in a preset data extraction area of an area outside the frame, is irrelevant to the content in the standard topographic map frame, and the existence of the standard topographic map content occupies the software running space, so that the load of software running can be reduced, and the processing flow is smoother.
After removing the region in the frame, dividing the standard topographic map of the region outside the frame, carrying out step-by-step identification on a plurality of data extraction regions to extract data information, storing the data information into an internal storage space of the FME according to a step-by-step sequence, wherein the region dividing method is to divide the region outside the frame by carrying out addition and subtraction of an abscissa through the acquired coordinate point of the name of the map, and because the coordinate distance between various data information of the standard topographic map and the coordinate point is fixed, the FME can independently divide various data extraction regions in an x-y coordinate system through simple addition and subtraction of the abscissa, then carrying out color number detection on the data extraction regions and the data detection regions preset around the data extraction regions, wherein the data extraction regions are positioned in the data detection regions, and the data detection regions are overlapped with one data extraction region at most.
Comparing the number of colors extracted from the data extraction area with a preset value of 1, and if the number of colors is equal to 1, detecting that the number of colors does not pass the detection, otherwise, detecting that the number of colors does not pass the detection; comparing the number of colors extracted from the data detection area with a preset value of 1, if the number is larger than 1, the detection is not passed, otherwise, the detection is passed. According to the common sense that the character color of the data information is required to be distinguished from the drawing background, the data extraction area is blank if the color number of the data extraction area where the data information is located is 1, the standard topography is not in accordance with the regulation, and the standard topography data information super-area or the data information is filled by mistake if the color number of the data detection area beside the data extraction area is more than 1, the character content of part of the data information is displayed. After detecting one of the two conditions in a certain data extraction area, the error marked symbol 'F' replaces data information and is stored in an internal storage space of the FME.
Step S4: correspondingly filling the data information into the position of the data information in the pre-established data table according to the data extraction area;
specifically, the data information in each data extraction area is written into a pre-established mdb data table in the FME according to the step sequence, the public parameters of the data table correspond to the data information in each data extraction area, the information comprises the contents of a picture name, a picture number, a product ownership unit, a measurement date, coordinate system information, a measurer and the like, the data information in the internal storage space of the FME is called out according to the original step sequence, the information is sequentially filled into the row cells of the data table, after the data information is completely input, the information is completely called out, the extraction of the data information of the standard topographic map is started in the sequence of the standard topographic map and the operation, the data information of the standard topographic map is filled into the next row of data table, the operation is repeated until the standard topographic map in the pre-built folder is completely identified, the standard topographic map information on the data table corresponds to the standard topographic map sequence in the pre-built folder, the staff can quickly determine the position of the pre-built standard topographic map in the data table according to the value corresponding to the standard topographic map information on the data table, and the operation is convenient.
Step S41: detecting the number of words of the data information in the data table, comparing the detected number of words with a preset reference value in the data table, deriving the data table if the number of words of the data information meets the requirement, and screening abnormal data information if the number of words of the detected data information does not meet the requirement, namely, abnormal data information is screened;
specifically, the number of words of the data information in each data table cell is classified and detected, each column of cells is classified, the number of fonts of the cells is detected, the detected number of words is compared with two preset reference values, for example, for the data information such as a measurer, the number of words is usually two to three, so that the preset first reference value is 2, the preset second reference value is 3, when the number of words of the detected data information is more than 3 or less than 2, the data information does not meet the requirements, the data information is judged to be abnormal information, an error mark symbol "F" is detected as abnormal information, other information such as a production unit and the like is detected by similar operation, the data information with the detected abnormality is screened out in different colors from the line mark of the corresponding data table, and the screening can be performed in the data table by adopting red.
Step S5: processing abnormal data information and exporting a data table;
specifically, after screening abnormal data information, a person manually judges to further determine the correctness of the abnormal data information, and if the data information is not error information, a data table is derived; if the data information is error information, personnel adds error marks to the data table, the error marks are marked as symbols 'F', and then the row with the error marks where the error information is located is imported into the FME, and the FME determines the picture name (coordinate point) and the position in the pre-built folder according to the position of the error marks in the data table and exports the picture name (coordinate point) and the position in the pre-built folder, so that the personnel can conveniently check and correct the picture. The doping of error data is reduced by a software program and manual double confirmation mode, so that the extracted data is more accurate, and meanwhile, the abnormality caused by the data is prevented, and the error data is used as the error data for modification operation. If no anomaly is detected, a data table is derived, after which the derived error standard topography is modified by personnel. After personnel modification, re-importing the standard topographic map and covering the original standard topographic map, re-extracting the modified data information and covering the data information on the data table, and exporting the data table after word number detection and personnel judgment.
The implementation principle of the FME-based standard topographic map outer frame information extraction method is as follows: the information is processed manually in a software processing mode, so that the extraction efficiency of the information on the standard topographic map is improved, the data errors caused by personnel operation reasons are reduced, and the method is convenient.
The embodiment of the application also discloses a standard topographic map outer frame information extraction system based on the FME.
Referring to fig. 3, a standard topography outline border information extraction system based on FME includes a configuration module 1, an import module 2, an information identification extraction module 3, an information writing module 4, a data extraction area specification judgment module 6, an export module 5, an abnormal data detection judgment module 7, an abnormal processing module 71, and an error processing module 8.
Configuration module 1: the standard topography is derived and drawn by CASS software, and the left lower corner coordinate point of the standard topography is configured into the file name of the derived standard topography. And then, personnel classify the standard topographic map according to the measurement area, put the standard topographic map belonging to the same area into the same pre-built folder, and sort the standard topographic map according to the name coordinates of the standard topographic map.
The importing module 2: the method comprises the steps of obtaining standard topographic maps, sequentially importing the standard topographic maps in the ordered pre-built folders into an FME, pre-building a pre-built x-y coordinate system matched with a coordinate system in the standard topographic maps in the FME, placing the standard topographic maps into the x-y coordinate system by the FME, extracting coordinate points from the names of the standard topographic maps, positioning the positions of the points in the x-y coordinate system according to the extracted coordinate points, and obtaining the coordinates of the points which are the lower left corner coordinates of the standard topographic maps according to naming standards in the geographic mapping industry.
Information identification extraction module 3: determining an in-frame area and an out-frame area on the standard topographic map based on the acquired standard topographic map coordinate range, and extracting data information in a plurality of data extraction areas pre-established in the out-frame area;
specifically, according to the standard topographic map size specification, the standard topographic maps in the pre-built folder are identical in size and shape in the x-y coordinate system, and the size and the shape of each standard topographic map needing to be derived information and the occupied area in the x-y coordinate system are identical, so that a rectangular frame surrounding the standard topographic map can be generated in the software x-y coordinate system according to addition and subtraction of coordinates by taking the lower left corner coordinate of the standard topographic map as a reference, and the standard topographic map is divided into an in-frame area and an out-frame area. After the frame is generated, the standard topographic map of the part intersected with the frame, namely the region in the frame, namely the standard topographic map part positioned in the frame, is removed by using a deleting program. The operation is based on the required extracted data information, and the data information is marked in a preset data extraction area of an area outside the picture frame, is irrelevant to the content in the standard topographic map picture frame, and the existence of the standard topographic map content occupies the software running space, so that the load of software running can be reduced, and the processing flow is smoother.
After removing the region in the frame, carrying out region division on the standard topographic map of the region outside the frame, dividing a plurality of data extraction regions, carrying out step-by-step identification to extract data information, storing the data information into an internal storage space of the FME according to a step-by-step sequence, wherein the region division method is to divide the region outside the frame by carrying out addition and subtraction of an abscissa through the acquired coordinate point of the name of the map.
Data extraction area specification judging module 6: the method is used for detecting the color number of the data extraction area and a data detection area preset around the data extraction area, the data detection area is slightly larger than the data extraction area, the data extraction area is positioned in the data detection area, the data detection area is overlapped with one data extraction area at most, if the color number of the data extraction area where the data information is positioned is 1, the data extraction area is blank, the standard topography is not in accordance with the regulation, and if the color number of the data detection area beside the data extraction area is 2, the standard topography data information super-area or the data information is filled in error. After detecting one of the two conditions in a certain data extraction area, the error marked symbol 'F' replaces data information and is stored in an internal storage space of the FME.
Information writing module 4: correspondingly filling the data information into the position of the data information in the pre-established data table according to the data extraction area;
specifically, the data information in each data extraction area is written into a pre-established mdb data table in the FME according to the step sequence, the public parameters of the data table correspond to the data information in each data extraction area, the information comprises the contents of a picture name, a picture number, a product ownership unit, a measurement date, coordinate system information, a measurer and the like, the data information in the internal storage space of the FME is called out according to the original step sequence, the information is sequentially filled into the row cells of the data table, after the data information is completely input, the information is completely called out, the extraction of the data information of the standard topographic map is started in the sequence of the standard topographic map and the operation, the data information of the standard topographic map is filled into the next row of data table, the operation is repeated until the standard topographic map in the pre-built folder is completely identified, the standard topographic map information on the data table corresponds to the standard topographic map sequence in the pre-built folder, the staff can quickly determine the position of the pre-built standard topographic map in the data table according to the value corresponding to the standard topographic map information on the data table, and the operation is convenient.
Abnormal data detection judgment module 7: the method comprises the steps of detecting the number of words of data information in a data table, comparing the detected number of words with a preset reference value in the data table, deriving the data table if the number of words of the data information meets the requirement, screening abnormal data information if the number of words of the detected data information does not meet the requirement, namely abnormal data information occurs, classifying and detecting the number of words of the data information in each data table cell, classifying each column of cells into one type, detecting the number of the fonts of the cells, comparing the detected number of words with the preset two reference values, for example, for the data information such as a measurer, the number of words of the data information is usually two to three, so that the preset first reference value is 2, the second reference value is 3, when the number of words of the detected data information is larger than 3 or smaller than 2, judging that the data information does not meet the requirement, and detecting other information such as abnormal information such as error mark symbol 'F', and detecting other information such as production units in a similar operation.
The exception handling module 71: the abnormal data detection and judgment module 7 detects abnormal data information and the corresponding line of the data table are screened out, the specific screening mode can be that the abnormal data information is screened out by marking different colors on the line of the abnormal data information in the data table, so that people can find the abnormal data information conveniently, after the abnormal data information is screened out, the people manually judge the abnormal data information to further determine the correctness of the abnormal data information, and if the data information is not error information, the data table is derived; if the data information is error information, personnel adds error marks to the data information in the data table, and the error marks are marked as 'F'.
Error handling module 8: after the exception handling module 71 runs, the fault flags on the data table are detected, and the FME facilitates the personnel to check and correct it by determining its name (coordinate point) and its location in the pre-built folder based on the location of the fault flags in the data table and deriving it. The doping of error data is reduced by a software program and manual double confirmation mode, so that the extracted data is more accurate, and meanwhile, the abnormality caused by the data is prevented, and the error data is used as the error data for modification operation. If no abnormality is detected, the data form is exported, then the exported error standard topographic map is modified by personnel, after the personnel modification is finished, the standard topographic map is imported again and covered with the original standard topographic map, and the modified data information is extracted again and the data information on the data form is covered.
Export module 5: when the error handling module 8 does not detect an error flag, the export module 5 exports the data table.
The foregoing are all preferred embodiments of the present application, and are not intended to limit the scope of the present application in any way, therefore: all equivalent changes in structure, shape and principle of this application should be covered in the protection scope of this application.

Claims (4)

1. The FME-based standard topographic map outer frame information extraction method is characterized by comprising the following steps of;
exporting a standard topographic map with standard graphic dimensions through CASS software, and configuring the coordinate point of one corner of the standard topographic map into the file name in the standard topographic map;
obtaining a standard topographic map, importing the standard topographic map into a pre-established x-y coordinate system, and extracting coordinate points in file naming of the standard topographic map to determine a coordinate range occupied by the standard topographic map in the x-y coordinate system;
determining an in-frame area and an out-frame area on the standard topographic map based on the acquired standard topographic map coordinate range, and extracting data information in a plurality of data extraction areas pre-established in the out-frame area;
color number detection is carried out on the data extraction area and the data detection area pre-established around the data extraction area, and the detection result is compared with an internal preset value to judge whether the detection is passed or not; the data detection area is slightly larger than the data extraction area, the data extraction area is positioned in the data detection area, the data detection area is overlapped with one data extraction area at most, if the color number of the data extraction area where the data information is positioned is 1, the data extraction area is blank, the standard topography is not in accordance with the regulation, and if the color number of the data detection area beside the data extraction area is larger than 1, the data information super area of the standard topography or the data information is filled by mistake;
if the detection is passed, extracting data information in the data extraction area;
if the detection is not passed, directly storing the error mark to replace the data information to be extracted into the FME;
correspondingly filling the data information into the position of the data information in the pre-established data table according to the data extraction area;
according to the position in the standard topographic map Yu Yu file folder, acquiring the adjacent standard topographic map below the standard topographic map, extracting data information and filling the data information into a data table;
repeating the operation of establishing the position in the folder according to the standard topographic map Yu Yu, acquiring the adjacent standard topographic map below the position, extracting data information and filling the data form until the whole processing of the standard topographic map in the pre-established folder is completed;
detecting the number of words of the data information in the data table, and comparing the detected number of words with a preset reference value in the data table;
if the number of the data information words meets the requirement, a data table is exported;
screening abnormal data information if the word number of the detected data information does not meet the requirement and is abnormal; judging the data information again by personnel;
if the data information is wrong, writing an error mark into the data information; determining the picture name according to the position of the data table where the error mark is located, finding out the corresponding standard topographic map according to the picture name and deriving the standard topographic map;
and if the data information is correct, the data table is exported.
2. The FME-based standard topographic map outer frame information extraction method of claim 1, wherein an in-frame area and an out-of-frame area on the standard topographic map are determined based on the acquired standard topographic map coordinate range, and data information in a plurality of data extraction areas pre-established in the out-of-frame area is extracted, further comprising the steps of;
and eliminating information in the intersection area of the area in the picture frame and the standard topographic map.
3. The FME-based standard topographic map outer frame information extraction method according to claim 1, further comprising the steps of, after determining its map name according to the location of the error marker, finding out and deriving a corresponding standard topographic map according to the map name;
and (3) delivering the derived standard topographic map to personnel for modification, after the personnel modification is finished, re-importing the standard topographic map into a pre-built folder and covering the original standard topographic map, re-extracting modified data information and covering the data information on the data table, and exporting the data table after word number detection and personnel judgment are finished.
4. An FME-based standard topographic map outer frame information extraction system is characterized by comprising;
a configuration module (1) for exporting a standard topographic map with standard graphic dimensions through CASS software and configuring the coordinate point of one corner of the standard topographic map to the file name in the standard topographic map;
the importing module (2) is used for acquiring a standard topographic map, importing the standard topographic map into a pre-established x-y coordinate system, and extracting coordinate points in file naming of the standard topographic map to determine a coordinate range occupied by the standard topographic map in the x-y coordinate system;
an information identification extraction module (3) for determining an in-frame region and an out-of-frame region on the standard topography based on the acquired standard topography coordinate range, and extracting data information in a plurality of data extraction regions previously established in the out-of-frame region;
detecting the color number of a data extraction area and a data detection area preset around the data extraction area, wherein the data detection area is slightly larger than the data extraction area, the data extraction area is positioned in the data detection area, the data detection area is overlapped with one data extraction area at most, if the color number of the data extraction area where the data information is positioned is 1, the data extraction area is blank, the standard topography is not in accordance with the regulation, and if the color number of the data detection area beside the data extraction area is larger than 1, the data information is in a super-region or the data information is filled in error in the standard topography; after detecting one of the two conditions of a certain data extraction area, storing the error mark instead of the data information into a data extraction area specification judging module (6) of an internal storage space of the FME;
an information writing module (4) for correspondingly filling the data information into the position of the data information in the pre-established data table according to the data extraction area;
the abnormal data detection judging module (7) is used for detecting the number of words of the data information in the data table, comparing the detected number of words with a preset reference value in the data table, deriving the data table if the number of words of the data information meets the requirement, and screening abnormal data information if the number of words of the detected data information does not meet the requirement, namely, the abnormal data information appears;
an abnormality processing module (71) for screening out the data information detected as abnormality by the abnormality data detection judging module (7) and the row of the data table corresponding to the data information;
an error processing module (8) which runs after the abnormality processing module (71) and detects the error mark on the data table, and the FME determines the picture name and the position in the pre-built folder according to the position of the error mark in the data table and derives the picture name and the position in the pre-built folder, so that personnel can conveniently check and correct the picture name and the position;
and an export module (5) for exporting the data table when the error processing module (8) does not detect the error flag.
CN202010834185.1A 2020-08-19 2020-08-19 FME-based standard topographic map outer frame information extraction method and system Active CN111950243B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010834185.1A CN111950243B (en) 2020-08-19 2020-08-19 FME-based standard topographic map outer frame information extraction method and system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010834185.1A CN111950243B (en) 2020-08-19 2020-08-19 FME-based standard topographic map outer frame information extraction method and system

Publications (2)

Publication Number Publication Date
CN111950243A CN111950243A (en) 2020-11-17
CN111950243B true CN111950243B (en) 2024-02-02

Family

ID=73342111

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010834185.1A Active CN111950243B (en) 2020-08-19 2020-08-19 FME-based standard topographic map outer frame information extraction method and system

Country Status (1)

Country Link
CN (1) CN111950243B (en)

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111090714A (en) * 2019-12-24 2020-05-01 广州市城市建设工程监理公司 Data storage method and device based on CASS topographic map

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10331156B2 (en) * 2015-03-03 2019-06-25 Leidos, Inc. System and method for big data geographic information system discovery

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111090714A (en) * 2019-12-24 2020-05-01 广州市城市建设工程监理公司 Data storage method and device based on CASS topographic map

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
利用FME实现地形高程数据快速入库的研究与应用;陈继山;湖南城市学院学报(自然科学版);第20卷(第01期);第66-69页 *
数字临海基础地理信息数据库建设;余群凤;中国优秀硕士学位论文全文数据库 基础科学辑(第11期);A008-21 *

Also Published As

Publication number Publication date
CN111950243A (en) 2020-11-17

Similar Documents

Publication Publication Date Title
WO2021147252A1 (en) Ocr-based table format recovery method and apparatus, electronic device, and storage medium
CN107862303B (en) Information identifying method, electronic device and the readable storage medium storing program for executing of form class diagram picture
JP2004139484A (en) Form processing device, program for implementing it, and program for creating form format
CN111931774A (en) Method and system for warehousing medicine data
US10783325B1 (en) Visual data mapping
CN115063802A (en) PSENet-based circular seal identification method, device and medium
CN108763575B (en) Image control point automatic selection method based on image control point database
CN115240213A (en) Form image recognition method and device, electronic equipment and storage medium
CN107315817A (en) Electronic drawing text matching technique, device, storage medium and computer equipment
CN111950243B (en) FME-based standard topographic map outer frame information extraction method and system
CN116682130A (en) Method, device and equipment for extracting icon information and readable storage medium
CN104408403A (en) Arbitration method and apparatus for inconsistent phenomenon of two pieces of entry information
JP3394694B2 (en) Format information registration method and OCR system
CN112486989A (en) Multi-source data granulation fusion and index classification and layering processing method
CN112084103A (en) Interface test method, device, equipment and medium
CN116311299A (en) Method, device and system for identifying structured data of table
CN103700084A (en) Chemical molecular structure chart partition method based on area size and curvature
JP4521466B2 (en) Form processing device
CN113191351B (en) Reading identification method and device of digital electric meter and model training method and device
US11748341B2 (en) Method, apparatus, and system for form auto-registration using virtual table generation and association
CN115600570A (en) Power grid equipment early warning limit value table reconstruction method, entry method and device
KR20180119501A (en) Method and computer program for extracting table record from spread sheet
CN114443624B (en) Basic geographic information data quality inspection method
CN114417788A (en) Drawing analysis method and device, storage medium and electronic equipment
CN111752663A (en) Method and device for repairing layout compatibility problem in Web application

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
TA01 Transfer of patent application right

Effective date of registration: 20231219

Address after: 318000 East of Room 215, 2nd Floor, No. 465 Shifu Avenue, Baiyun Street, Jiaojiang District, Taizhou City, Zhejiang Province

Applicant after: Taizhou geographic information surveying and Mapping Center Co.,Ltd.

Address before: 318001 Construction Building, No. 465 Shifu Avenue, Taizhou City, Zhejiang Province

Applicant before: TAIZHOU GEOGRAPHY INFORMATION PLOTTING CENTER

TA01 Transfer of patent application right
GR01 Patent grant
GR01 Patent grant