WO2023216675A1 - Data processing method and apparatus - Google Patents

Data processing method and apparatus Download PDF

Info

Publication number
WO2023216675A1
WO2023216675A1 PCT/CN2023/077321 CN2023077321W WO2023216675A1 WO 2023216675 A1 WO2023216675 A1 WO 2023216675A1 CN 2023077321 W CN2023077321 W CN 2023077321W WO 2023216675 A1 WO2023216675 A1 WO 2023216675A1
Authority
WO
WIPO (PCT)
Prior art keywords
grid
business
grids
spatial
data
Prior art date
Application number
PCT/CN2023/077321
Other languages
French (fr)
Chinese (zh)
Inventor
孙士玺
牟晓敏
王怡刚
Original Assignee
北京沃东天骏信息技术有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 北京沃东天骏信息技术有限公司 filed Critical 北京沃东天骏信息技术有限公司
Publication of WO2023216675A1 publication Critical patent/WO2023216675A1/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/22Indexing; Data structures therefor; Storage structures
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/29Geographical information databases
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/951Indexing; Web crawling techniques
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9537Spatial or temporal dependent retrieval, e.g. spatiotemporal queries

Definitions

  • the present disclosure relates to the technical field of the Internet of Things, and in particular, to a data processing method and device.
  • Existing spatial data calculation methods include longitude and latitude calculation, spatial judgment and classification using strings as the main keys, GeoHash (an address encoding), etc.
  • GeoHash an address encoding
  • embodiments of the present disclosure provide a data processing method and device.
  • a data processing method including:
  • Grid the business map; wherein, the business map is composed of spatial location information, each grid includes one spatial location information, and each spatial location information includes one or more spatial data;
  • Receive the input address code identification, business keywords and business scope perform grid positioning based on the input address code identification, center on the positioned grid, find other grids that match the business scope, and return the other grids that match the business key Word spatial data and displayed.
  • gridding a business map includes:
  • the preset value is the number of intervals between vertically adjacent grids
  • the order of magnitude of the total number of transverse grids or the total number of longitudinal grids is determined, and the sum of the order of magnitude and the first preset value is used as the number of intervals between vertically adjacent grids.
  • the address encoding identification as a grid it further includes:
  • taking the positioned grid as the center, searching for other grids that match the business scope, returning and displaying spatial data in other grids that match the business keywords includes:
  • one or more spatial data that match the business keywords are filtered out.
  • a data processing apparatus including:
  • the delimitation module is used to delineate the grid of the business map; wherein the business map is composed of spatial location information, each grid includes one spatial location information, and each spatial location information includes one or more spatial data;
  • the calculation module is used to calculate, for each grid, the product of the vertical arrangement number of the grid and the preset value, and use the sum of the product and the horizontal arrangement number of the grid as the address code identification of the grid;
  • the processing module is used to receive the input address code identification, business keywords and business scope, perform grid positioning based on the input address code identification, take the positioned grid as the center, find other grids that match the business scope, and return other networks
  • the spatial data that matches the business keywords in the grid are displayed.
  • a data processing electronic device is provided.
  • the electronic device of the embodiment of the present disclosure includes: one or more processors; a storage device configured to store one or more programs. When the one or more programs are executed by the one or more processors, the One or more processors implement any of the above-mentioned data processing methods.
  • a computer-readable medium on which a computer program is stored, and when the program is executed by a processor, any one of the above-mentioned data processing methods is implemented.
  • Figure 1 is a main flow diagram of a data processing method according to an embodiment of the present disclosure
  • Figure 2 is a schematic diagram of dividing a business map grid according to an embodiment of the present disclosure
  • Figure 3 is a schematic diagram of the implementation flow of the present disclosure
  • Figure 4 is a schematic diagram of the main modules of a data processing device according to an embodiment of the present disclosure.
  • Figure 5 is an exemplary system architecture diagram in which embodiments of the present disclosure may be applied.
  • FIG. 6 is a schematic structural diagram of a computer system suitable for implementing a mobile device or server according to an embodiment of the present disclosure.
  • Method 1 Use longitude and latitude information to directly calculate the distance value between two points through spatial geometric relationships, and then determine the spatial relationship between the two points.
  • Method 2 Use string as the primary key for spatial judgment and classification. For example, data association is performed through province names such as Beijing and Henan, and then spatial attributes are expanded.
  • GeoHash is currently a relatively mainstream technology for implementing location services. It is usually used in combination with method one to improve computing efficiency.
  • Method 1 generally uses two-dimensional space for spatial description. It is difficult to discern the spatial patterns of data, and high-precision longitude and latitude information is not used to protect user privacy. The space marked by longitude and latitude cannot directly reflect the spatial distance in meters, and the calculation efficiency is low.
  • Method 2 has high requirements for the standardization of primary key values. For example, to associate two pieces of data using districts and counties, the district and county names must be unified and standardized to ensure that the association is correct. However, the accuracy is still limited and cannot meet the accuracy needs of micro-scenario applications. .
  • Method 3 has a boundary problem.
  • Geohash cannot intuitively judge the distance relationship between the two literally. It needs to be mapped into longitude and latitude, and the spatial distance is calculated based on the longitude and latitude.
  • Geohash An address encoding that encodes two-dimensional longitude and latitude into a one-dimensional string, representing a rectangular area. Geohash is often used to search for nearby locations. Nearby locations are first filtered out based on Geohash, and then nearby locations are calculated based on distance.
  • One-dimensional A one-dimensional space consisting only of points within a line. It has only length, no width and height, and can only extend infinitely to both sides. Compared with the XY (latitude and longitude) spatial identification method, it is suitable for efficient size comparison to distinguish spatial positions.
  • Spatial data refers to data used to represent the location, shape, size and distribution characteristics of spatial entities. It can be used to describe targets from the real world and has characteristics such as positioning, qualitative, time and spatial relationships. Spatial data is a kind of data that uses basic spatial data structures such as points, lines, surfaces, and entities to represent the natural world on which people rely for survival.
  • FIG. 1 shown is a main flow chart of a data processing method provided by an embodiment of the present disclosure, which includes the following steps:
  • S101 Grid the business map; the business map consists of spatial location information Information composition, each grid includes a spatial location information, and each spatial location information includes one or more spatial data;
  • S102 For each grid, calculate the product of the vertical arrangement number of the grid and the preset value, and use the sum of the product and the horizontal arrangement number of the grid as the address code identification of the grid;
  • S103 Receive the input address code identification, business keywords and business scope, perform grid positioning based on the input address code identification, center on the positioned grid, search for other grids that match the business scope, and return the other grids that match the business scope.
  • the spatial data of business keywords are displayed.
  • the statistical unit size is first defined according to business needs. Different businesses require different data accuracy. Therefore, the corresponding relationship between the business parameters and the statistical unit size can be established in advance. . In actual operation, business maps can be defined according to different business requirements, such as a map of a certain urban area.
  • the user needs to divide a certain business map, he needs to send a grid delineation request for the map through the device. For example, click the "Grid" button in the computer interface.
  • the request carries business parameters, which can be queried and used with the business.
  • the size of the statistical unit corresponding to the parameter enables grid delineation of the business map, such as grid delineation of 1km*1km (the size of the statistical unit can be adjusted according to actual business needs, generally not less than 100m*100m).
  • the business map is divided into k grids horizontally and n grids vertically. This solution sets that each grid only covers one spatial location information. Since the spatial data belong to different spatial locations, some spatial location information may cover multiple spatial data, and some may only cover one spatial data.
  • the number M of intervals between longitudinal adjacent grids is defined to facilitate subsequent calculations.
  • the order of magnitude is 4 (in scientific notation, a number is recorded in the form of a*10 ⁇ b, b is the order of magnitude)
  • add the order of magnitude 4 to 1 that is, the first preset value, only an example, Actual adjustable
  • a new order of magnitude 5 is obtained, so the number of intervals between longitudinal grids is 1*10 ⁇ 5, which represents the difference in geo_id of vertically adjacent grids.
  • the same method is used to define M using the total number of longitudinal grids. .
  • Spatial data is data based on spatial location.
  • spatial data in a certain spatial location includes:
  • Meteorological data longitude 117.34 latitude 34.532 temperature 32 degrees
  • the data of temperature, park and number of people are of little value. However, combined with the spatial location, it can be known that the location is a park with high temperature and 5 people active. If you add the time dimension and more types of spatial data, combined with machine learning, you can generate very large data value. The spatial association of different spatial data is often costly.
  • This solution adds geo_id as the spatial primary key to various spatial data based on the above method. Assuming that the geo_id of the grid is 89442, the spatial primary key of the meteorological data is the meteorological data of 89442, or other data identified by 89442. This plan does not limit this.
  • specific data such as signaling data needs further processing after generating the geo_id, such as removing the latitude and longitude information (retaining the original ID for data backtracking), so as not to expose high-precision latitude and longitude data and improve data security.
  • the signaling data longitude 117.34, latitude 34.532, Zhang San, 20211205, means that Zhang San appeared at this longitude and latitude on December 5, 2021.
  • the geo_id After generating the geo_id, it changes to: 2958146 geo_id, 1 person, 20211205, which means 2958146. There is one person in the grid who can distribute this data for business applications.
  • map data is: 2958146geo_id, park. Therefore, the above signaling data combined with the map data shows that there is a person in the park.
  • the location of the park needs to be decrypted. Decryption uses the original ID to collide with the original longitude and latitude information, and the original information can be managed by a specific department. Based on the above, more data security issues can be filtered without affecting data usage.
  • step S103 in business construction and mining, spatial distance calculation tools are constructed based on geo_id, such as nearby point tools, kernel density tools, etc., to achieve efficient application of spatial calculation methods in distributed systems.
  • abs is a function, taking the absolute value.
  • substr is a C++ language function. Its main function is to copy a substring, starting from a specified position and having a specified length.
  • LENGTH is the length.
  • POWER(number,power) where the parameter number represents the base and the parameter power represents the exponent. From this calculation, the horizontal distance between two points is 6677km, the vertical distance is 71km, the Manhattan distance is 6784km, and the Euclidean distance is 6677.38km. Based on the preliminary judgment of the horizontal distance and vertical distance, the Euclidean distance and Manhattan distance can be quickly calculated, and the overall calculation amount is All extremely small.
  • the above four distances are just examples. In fact, there can be other formulas, and they should be calculated as needed when used.
  • geo_id itself is not in meter system, but through any two geo_ids, the distance between the two (meter system) can be quickly calculated.
  • the conversion of geo_id itself requires the spatial dimension table of the business map.
  • the business scope usually specifies a certain distance inside/outside in the horizontal direction, vertical direction, surrounding, oblique direction, etc.
  • Different distance calculation formulas are used for different business scopes. , such as the above horizontal distance, Vertical distance, Manhattan distance and Euclidean distance. Position the grid based on the input geo_id, with the grid as the center. Regardless of the above distance calculation formula, you need to enter the geo_id of the grid, the geo_id of other grids located around the grid and corresponding to the business scope, The number M of intervals between vertical adjacent grids is used to calculate the distance between this grid and other grids, and then one or more grids that fit the business scope are screened out. Find other grids that match the business keywords and business scope, return the spatial data in other grids and display them.
  • the business scope is within 100m. Based on this business scope, the spatial data that is greater than 100m away from the geo_id is filtered out, and then the business keyword "convenience store” is used. ", filter out non-convenience store spatial data within 100m of the geo_id.
  • the user wants to calculate whether there are kindergartens, primary schools, hospitals, etc. within a range of 1000m around a certain geo_id.
  • the business keywords are "kindergarten”, “primary school”, “hospital”, and the business scope is within a range of 1000m. The calculation can also be completed quickly. After determining the distance between the surrounding grid and the grid, filtering is performed based on business keywords to achieve spatial data information association based on geo_id.
  • the business map includes spatial data, specifically spatial location information, and each spatial location information includes at least one spatial data.
  • multivariate spatial data analysis and mining can be carried out, such as querying specific spatial data within a certain business scope near a certain grid.
  • the method provided by the above embodiment adds geo_id as the spatial primary key to various types of spatial data.
  • only the geo_id can be input to calculate the spatial distance more efficiently and improve the efficiency of spatial distance calculation.
  • the method provided by the embodiments of the present disclosure has at least the following beneficial effects:
  • Geo_id is a string of numbers, so that the spatial position can be compared intuitively. Compared with the two-dimensional identification method of longitude and latitude, it speeds up the calculation of spatial distance. High-precision data mining analysis, such as adjacent point extraction, multivariate data spatial correlation and other spatial computing requirements, can be further developed on this basis.
  • FIG. 4 a schematic diagram of the main modules of a data processing device 400 provided by an embodiment of the present disclosure is shown, including:
  • the delineation module 401 is used to delimit the business map into grids; wherein the business map is composed of spatial location information, each grid includes one spatial location information, and each spatial location information includes one or more spatial data;
  • the calculation module 402 is used to calculate, for each grid, the product of the vertical arrangement number of the grid and the preset value, and use the sum of the product and the horizontal arrangement number of the grid as the address code identification of the grid;
  • the processing module 403 is used to receive the input address code identification, business keywords and business scope, perform grid positioning based on the input address code identification, center on the positioned grid, search for other grids that match the business scope, and return other grids.
  • the spatial data that matches the business keywords in the grid are displayed.
  • the demarcation module 401 is used for:
  • the preset value is the interval number of vertically adjacent grids
  • the calculation module 402 is also used to:
  • the order of magnitude of the total number of transverse grids or the total number of longitudinal grids is determined, and the sum of the order of magnitude and the first preset value is used as the number of intervals between vertically adjacent grids.
  • the implementation device of the present disclosure also includes a screening module for:
  • the processing module 403 is used for:
  • one or more spatial data that match the business keywords are filtered out.
  • Figure 5 shows an exemplary system architecture 500 to which embodiments of the present disclosure may be applied, including terminal devices 501, 502, 503, a network 504 and a server 505 (examples only).
  • Terminal devices 501, 502, and 503 can be various electronic devices with display screens and support web browsing, and are installed with various communication client applications. Users can use terminal devices 501, 502, and 503 to interact with the server 505 through the network 504 to Receive or send messages, etc.
  • Network 504 is used to provide a medium for communication links between terminal devices 501, 502, 503 and server 505.
  • Network 504 may include various connection types, such as wired, wireless communication links, or fiber optic cables, among others.
  • the server 505 may be a server that provides various services. It should be noted that the methods provided by the embodiments of the present disclosure are generally executed by the server 505. Accordingly, the device is generally provided in the server 505.
  • FIG. 6 a schematic structural diagram of a computer system 600 suitable for implementing a terminal device according to an embodiment of the present disclosure is shown.
  • the terminal device shown in FIG. 6 is only an example and should not impose any restrictions on the functions and scope of use of the embodiments of the present disclosure.
  • computer system 600 includes a central processing unit (CPU) 601 that can operate according to a program stored in a read-only memory (ROM) 602 or loaded from a storage portion 608 into a random access memory (RAM) 603 And perform various appropriate actions and processing.
  • CPU central processing unit
  • RAM random access memory
  • various programs and data required for the operation of the system 600 are also stored.
  • CPU 601, ROM 602 and RAM 603 are connected to each other through bus 604.
  • An input/output (I/O) interface 605 is also connected to bus 604.
  • the following components are connected to the I/O interface 605: an input section 606 including a keyboard, a mouse, etc.; an output section 607 including a cathode ray tube (CRT), a liquid crystal display (LCD), etc., speakers, etc.; and a storage section 608 including a hard disk, etc. ; and a communication section 609 including a network interface card such as a LAN card, a modem, etc.
  • the communication section 609 performs communication processing via a network such as the Internet.
  • Driver 610 is also connected to I/O interface 605 as needed.
  • Removable media 611 such as magnetic disks, optical disks, magneto-optical disks, semiconductor memories, etc., are installed on the drive 610 as needed, so that a computer program read therefrom is installed into the storage portion 608 as needed.
  • embodiments of the present disclosure include a computer program product including a computer program carried on a computer-readable medium, the computer program A program contains program code for executing the method shown in the flowchart.
  • the computer program may be downloaded and installed from the network via communication portion 609, and/or installed from removable media 611.
  • CPU central processing unit
  • the computer-readable medium shown in the present disclosure may be a computer-readable signal medium or a computer-readable storage medium, or any combination of the above two.
  • the computer-readable storage medium may be, for example, but is not limited to, an electrical, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus or device, or any combination thereof. More specific examples of computer readable storage media may include, but are not limited to: an electrical connection having one or more wires, a portable computer disk, a hard drive, random access memory (RAM), read only memory (ROM), removable Programmd read-only memory (EPROM or flash memory), fiber optics, portable compact disk read-only memory (CD-ROM), optical storage device, magnetic storage device, or any suitable combination of the above.
  • a computer-readable storage medium may be any tangible medium that contains or stores a program for use by or in connection with an instruction execution system, apparatus, or device.
  • a computer-readable signal medium may include a data signal propagated in baseband or as part of a carrier wave, carrying computer-readable program code therein. Such propagated data signals may take many forms, including but not limited to electromagnetic signals, optical signals, or any suitable combination of the above.
  • a computer-readable signal medium may also be any computer-readable medium other than a computer-readable storage medium that can send, propagate, or transmit a program for use by or in connection with an instruction execution system, apparatus, or device .
  • Program code embodied on a computer-readable medium may be transmitted using any suitable medium, including but not limited to: wireless, wire, optical cable, RF, etc., or any suitable combination of the foregoing.
  • each block in the flowchart or block diagrams may represent a module, segment, or portion of code that contains one or more logic functions that implement the specified executable instructions.
  • Implementations may allow the functions noted in the blocks to occur out of the order noted in the figures. For example, two blocks shown one after another may actually execute substantially in parallel, or they may sometimes execute in the reverse order, depending on the functionality involved.
  • each block in the block diagram or flowchart illustration, and combinations of blocks in the block diagram or flowchart illustration can be implemented by special purpose hardware-based systems that perform the specified functions or operations, or may be implemented by special purpose hardware-based systems that perform the specified functions or operations. Achieved by a combination of specialized hardware and computer instructions.
  • the modules involved in the embodiments of the present disclosure can be implemented in software or hardware.
  • the described module can also be provided in a processor.
  • a processor includes a demarcation module, a calculation module, and a processing module.
  • the names of these modules do not constitute a limitation on the module itself under certain circumstances.
  • the processing module can also be described as a "spatial data processing module".
  • the present disclosure also provides a computer-readable medium.
  • the computer-readable medium may be included in the device described in the above embodiments; it may also exist separately without being assembled into the device.
  • the above computer-readable medium carries one or more programs. When the above one or more programs are executed by a device, the device includes:
  • Grid the business map; wherein, the business map is composed of spatial location information, each grid includes one spatial location information, and each spatial location information includes one or more spatial data;
  • Receive the input address code identification, business keywords and business scope perform grid positioning based on the input address code identification, center on the positioned grid, find other grids that match the business scope, and return the other grids that match the business key Word spatial data and displayed.
  • the grid is described by one-dimensional digital geo_id, which simplifies the spatial position description rules, enables intuitive comparison of spatial positions, greatly improves the calculation efficiency of spatial position distance, and specific spatial data can no longer retain high-precision latitude and longitude coordinates, At the same time, it ensures that data is not lost and strengthens the security of spatial data.

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Software Systems (AREA)
  • Remote Sensing (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The present application relates to the technical field of the Internet of Things. Disclosed are a data processing method and apparatus. A specific embodiment of the method comprises: performing gridding on a service map; for each grid cell, calculating the product of an arrangement number of the grid cells in the longitudinal direction and a preset numerical value, and taking the sum of the product and an arrangement number of the grid cells in the transverse direction as an address coding identifier of the grid cells; and receiving an input address code identifier, a service keyword and a service range, performing grid cell positioning on the basis of the input address code identifier, taking a positioned grid acell s a center to search for other grid cells that are in line with the service range, and returning spatial data, which is in line with the service keyword, in the other grid cells and displaying same.

Description

数据处理方法和装置Data processing methods and devices
相关申请的交叉引用Cross-references to related applications
本申请要求享有2022年5月7日提交的发明名称为“一种数据处理方法和装置”的中国专利申请No.202210490874.4的优先权,在此全文引用上述中国专利申请公开的内容以作为本申请的一部分或全部。This application claims priority to Chinese Patent Application No. 202210490874.4, which is titled "A Data Processing Method and Device" and was submitted on May 7, 2022. The disclosure of the above-mentioned Chinese patent application is cited in its entirety as this application. part or all of.
技术领域Technical field
本公开涉及物联网技术领域,尤其涉及一种数据处理方法和装置。The present disclosure relates to the technical field of the Internet of Things, and in particular, to a data processing method and device.
背景技术Background technique
物联网时代,空间大数据已成为极为重要的数据资源,高效、简易的空间数据入库规则将极大提升了数据存储、计算、分析、挖掘等的效率,因此建立空间数据的高效计算方法是十分有意义的。In the Internet of Things era, spatial big data has become an extremely important data resource. Efficient and simple spatial data warehousing rules will greatly improve the efficiency of data storage, calculation, analysis, mining, etc. Therefore, establishing an efficient computing method for spatial data is Very meaningful.
现有空间数据计算方法如经纬度计算、以字符串为主键进行空间判断归类、GeoHash(一种地址编码)等,但在实现本公开的过程中,发明人发现现有技术至少存在如下问题:虽均可以反映空间信息,但对两个及以上的相对空间位置关系计算效率低,甚至无法直观表达,以此需实现增强相对空间位置计算的便捷性。Existing spatial data calculation methods include longitude and latitude calculation, spatial judgment and classification using strings as the main keys, GeoHash (an address encoding), etc. However, in the process of realizing the present disclosure, the inventor found that the existing technology has at least the following problems: Although they can reflect spatial information, the calculation efficiency of two or more relative spatial position relationships is low and cannot even be expressed intuitively. Therefore, it is necessary to enhance the convenience of relative spatial position calculation.
发明内容Contents of the invention
有鉴于此,本公开实施例提供一种数据处理方法和装置。In view of this, embodiments of the present disclosure provide a data processing method and device.
根据本公开实施例的一个方面,提供了一种数据处理方法,包括:According to an aspect of an embodiment of the present disclosure, a data processing method is provided, including:
对业务地图进行网格划定;其中,业务地图由空间位置信息构成,每个网格包括一个空间位置信息,每个空间位置信息包括一个或多个空间数据;Grid the business map; wherein, the business map is composed of spatial location information, each grid includes one spatial location information, and each spatial location information includes one or more spatial data;
对于每个网格,计算网格在纵向的排列数与预设数值的乘积,将 所述乘积与网格在横向的排列数之和,作为网格的地址编码标识;For each grid, calculate the product of the vertical arrangement number of the grid and the preset value, and The sum of the product and the horizontal arrangement number of the grid is used as the address code identifier of the grid;
接收输入的地址编码标识、业务关键词和业务范围,基于输入的地址编码标识进行网格定位,以定位的网格为中心,查找符合业务范围的其他网格,返回其他网格中符合业务关键词的空间数据并显示。Receive the input address code identification, business keywords and business scope, perform grid positioning based on the input address code identification, center on the positioned grid, find other grids that match the business scope, and return the other grids that match the business key Word spatial data and displayed.
根据本公开的一个或多个实施例,所述对业务地图进行网格划定,包括:According to one or more embodiments of the present disclosure, gridding a business map includes:
接收对业务地图的网格划定请求;其中,所述网格划定请求中包括业务参数;Receive a grid delimitation request for a service map; wherein the grid delimitation request includes service parameters;
查询与所述业务参数对应的预设统计单元大小,以使用所述预设统计单元大小对业务地图进行网格划定。Query the preset statistical unit size corresponding to the business parameter, so as to use the preset statistical unit size to grid delineate the business map.
根据本公开的一个或多个实施例,所述预设数值为纵向相邻网格的间隔数;According to one or more embodiments of the present disclosure, the preset value is the number of intervals between vertically adjacent grids;
在所述计算网格在纵向的排列数与预设数值的乘积之前,还包括:Before calculating the product of the vertical arrangement number of the grid and the preset value, it also includes:
确定横向网格总数量或纵向网格总数量的数量级,将所述数量级与第一预设数值之和,作为纵向相邻网格的间隔数。The order of magnitude of the total number of transverse grids or the total number of longitudinal grids is determined, and the sum of the order of magnitude and the first preset value is used as the number of intervals between vertically adjacent grids.
根据本公开的一个或多个实施例,在所述作为网格的地址编码标识之后,还包括:According to one or more embodiments of the present disclosure, after the address encoding identification as a grid, it further includes:
筛选标注为特定数据的空间数据,去除所述特定数据中的经纬度信息。Filter the spatial data marked as specific data and remove the latitude and longitude information in the specific data.
根据本公开的一个或多个实施例,所述以定位的网格为中心,查找符合业务范围的其他网格,返回其他网格中符合业务关键词的空间数据并显示,包括:According to one or more embodiments of the present disclosure, taking the positioned grid as the center, searching for other grids that match the business scope, returning and displaying spatial data in other grids that match the business keywords includes:
确定与所述业务范围对应的距离计算公式;Determine the distance calculation formula corresponding to the business scope;
将输入的地址编码标识、位于所述输入的地址编码标识周围的其他地址编码标识、所述预设数值,输入所述距离计算公式中,以计算其他网格与所述定位的网格的距离,进而筛选出符合所述业务范围的 一个或多个网格;Enter the input address code identifier, other address code identifiers located around the input address code identifier, and the preset value into the distance calculation formula to calculate the distance between other grids and the positioned grid. , and then filter out those that fit the stated business scope one or more grids;
从所述一个或多个网格的空间数据中,筛选出符合所述业务关键词的一个或多个空间数据。From the spatial data of the one or more grids, one or more spatial data that match the business keywords are filtered out.
根据本公开实施例的另一方面,提供了一种数据处理装置,包括:According to another aspect of the embodiments of the present disclosure, a data processing apparatus is provided, including:
划定模块,用于对业务地图进行网格划定;其中,业务地图由空间位置信息构成,每个网格包括一个空间位置信息,每个空间位置信息包括一个或多个空间数据;The delimitation module is used to delineate the grid of the business map; wherein the business map is composed of spatial location information, each grid includes one spatial location information, and each spatial location information includes one or more spatial data;
计算模块,用于对于每个网格,计算网格在纵向的排列数与预设数值的乘积,将所述乘积与网格在横向的排列数之和,作为网格的地址编码标识;The calculation module is used to calculate, for each grid, the product of the vertical arrangement number of the grid and the preset value, and use the sum of the product and the horizontal arrangement number of the grid as the address code identification of the grid;
处理模块,用于接收输入的地址编码标识、业务关键词和业务范围,基于输入的地址编码标识进行网格定位,以定位的网格为中心,查找符合业务范围的其他网格,返回其他网格中符合业务关键词的空间数据并显示。The processing module is used to receive the input address code identification, business keywords and business scope, perform grid positioning based on the input address code identification, take the positioned grid as the center, find other grids that match the business scope, and return other networks The spatial data that matches the business keywords in the grid are displayed.
根据本公开实施例的再一方面,提供了一种数据处理电子设备。According to yet another aspect of embodiments of the present disclosure, a data processing electronic device is provided.
本公开实施例的电子设备包括:一个或多个处理器;存储装置,用于存储一个或多个程序,当所述一个或多个程序被所述一个或多个处理器执行,使得所述一个或多个处理器实现上述任一所述的数据处理方法。The electronic device of the embodiment of the present disclosure includes: one or more processors; a storage device configured to store one or more programs. When the one or more programs are executed by the one or more processors, the One or more processors implement any of the above-mentioned data processing methods.
根据本公开实施例的再一方面,提供了一种计算机可读介质,其上存储有计算机程序,所述程序被处理器执行时实现上述任一所述的数据处理方法。According to yet another aspect of the embodiments of the present disclosure, there is provided a computer-readable medium on which a computer program is stored, and when the program is executed by a processor, any one of the above-mentioned data processing methods is implemented.
上述的非惯用的可选方式所具有的进一步效果将在下文中结合具体实施方式加以说明。 Further effects of the above-mentioned non-conventional optional methods will be described below in conjunction with specific implementations.
附图说明Description of the drawings
附图用于更好地理解本公开,不构成对本公开的不当限定。其中:The accompanying drawings are used for a better understanding of the present disclosure and do not constitute an improper limitation of the present disclosure. in:
图1是根据本公开实施例的一种数据处理方法的主要流程示意图;Figure 1 is a main flow diagram of a data processing method according to an embodiment of the present disclosure;
图2是根据本公开实施例的划分业务地图网格的示意图;Figure 2 is a schematic diagram of dividing a business map grid according to an embodiment of the present disclosure;
图3是本公开实施流程示意图;Figure 3 is a schematic diagram of the implementation flow of the present disclosure;
图4是根据本公开实施例的一种数据处理装置的主要模块示意图;Figure 4 is a schematic diagram of the main modules of a data processing device according to an embodiment of the present disclosure;
图5是本公开实施例可以应用于其中的示例性系统架构图;Figure 5 is an exemplary system architecture diagram in which embodiments of the present disclosure may be applied;
图6是适于用来实现本公开实施例的移动设备或服务器的计算机系统的结构示意图。FIG. 6 is a schematic structural diagram of a computer system suitable for implementing a mobile device or server according to an embodiment of the present disclosure.
具体实施方式Detailed ways
以下结合附图对本公开的示范性实施例做出说明,其中包括本公开实施例的各种细节以助于理解,应当将它们认为仅仅是示范性的。因此,本领域普通技术人员应当认识到,可以对这里描述的实施例做出各种改变和修改,而不会背离本公开的范围和精神。同样,为了清楚和简明,以下的描述中省略了对公知功能和结构的描述。Exemplary embodiments of the present disclosure are described below with reference to the accompanying drawings, in which various details of the embodiments of the present disclosure are included to facilitate understanding and should be considered to be exemplary only. Accordingly, those of ordinary skill in the art will recognize that various changes and modifications can be made to the embodiments described herein without departing from the scope and spirit of the disclosure. Also, descriptions of well-known functions and constructions are omitted from the following description for clarity and conciseness.
需要指出的是,在不冲突的情况下,本公开中的实施例以及实施例中的特征可以互相组合。本申请技术方案中对数据的获取、存储、使用、处理等均符合国家法律法规的相关规定。It should be noted that the embodiments and features in the embodiments of the present disclosure can be combined with each other without conflict. The acquisition, storage, use and processing of data in the technical solution of this application all comply with the relevant provisions of national laws and regulations.
此处对现有技术及缺点进行详细描述,目前较为常用的空间数据计算方法分别为以下三种:The existing technologies and shortcomings are described in detail here. Currently, the more commonly used spatial data calculation methods are the following three:
方式一:利用经纬度信息,通过空间几何关系直接计算得出两点之间的距离值,进而确定两者之间的空间关系。Method 1: Use longitude and latitude information to directly calculate the distance value between two points through spatial geometric relationships, and then determine the spatial relationship between the two points.
方式二:以字符串为主键进行空间判断、归类。例如,通过省份名称北京、河南等进行数据关联,进而进行空间属性的扩充。Method 2: Use string as the primary key for spatial judgment and classification. For example, data association is performed through province names such as Beijing and Henan, and then spatial attributes are expanded.
方式三:GeoHash是目前比较主流的实现位置服务的技术,通常结合方式一的使用,用于提升计算效率。 Method three: GeoHash is currently a relatively mainstream technology for implementing location services. It is usually used in combination with method one to improve computing efficiency.
在实现本公开的过程中,发明人发现现有技术至少存在如下问题:In the process of realizing the present disclosure, the inventor found that the existing technology has at least the following problems:
方式一普遍使用二维空间进行空间描述,数据空间规律难洞察,且高精度经纬度信息不利用进行用户隐私保护。经纬标注的空间无法直接反映米进制的空间距离,且计算效率低。Method 1 generally uses two-dimensional space for spatial description. It is difficult to discern the spatial patterns of data, and high-precision longitude and latitude information is not used to protect user privacy. The space marked by longitude and latitude cannot directly reflect the spatial distance in meters, and the calculation efficiency is low.
方式二对主键值的标准化要求高,例如对两份数据利用区县进行关联,需在保证区县名称进行统一标准化之后才能确保关联无误,但仍精度有限,无法满足微观场景应用的精度需要。Method 2 has high requirements for the standardization of primary key values. For example, to associate two pieces of data using districts and counties, the district and county names must be unified and standardized to ensure that the association is correct. However, the accuracy is still limited and cannot meet the accuracy needs of micro-scenario applications. .
方式三存在边界问题,Geohash前面的值相同的越多,两个位置越近,反之两个位置越近,前面的值并不一定相同的越多。比如有两个特别近的地点[116.3967,45.0009]、[116.3967,44.9999],其Geohash值分别为y84b08j2和wxfzbxvr,虽然这两个点相距很近,但是由于刚好在分界点45两侧,导致Geohash值大相径庭。且Geohash从字面上无法直观判断出两者之间的距离关系,需要映射成经纬度,依据经纬度进行空间距离的计算。Method 3 has a boundary problem. The more the previous values of the Geohash are the same, the closer the two positions are. On the contrary, the closer the two positions are, the more the previous values are not necessarily the same. For example, there are two particularly close locations [116.3967,45.0009] and [116.3967,44.9999], whose Geohash values are y84b08j2 and wxfzbxvr respectively. Although these two points are very close to each other, they are just on both sides of the dividing point 45, resulting in Geohash The values are quite different. Moreover, Geohash cannot intuitively judge the distance relationship between the two literally. It needs to be mapped into longitude and latitude, and the spatial distance is calculated based on the longitude and latitude.
对于本方案涉及的词语,做解释如下:The explanations for the words involved in this plan are as follows:
Geohash:一种地址编码,将二维的经纬度编码成一维的字符串,表示一个矩形区域。Geohash常用于附近地点搜索,先根据Geohash筛选出附近的地点,然后根据距离计算得出附近地点。Geohash: An address encoding that encodes two-dimensional longitude and latitude into a one-dimensional string, representing a rectangular area. Geohash is often used to search for nearby locations. Nearby locations are first filtered out based on Geohash, and then nearby locations are calculated based on distance.
一维:一维只由一条线内的点所组成的空间,它只有长度,没有宽度和高度,只能向两边无限延展。相较于XY(经纬度)的空间标识方法,其适合高效进行大小比较,从而分辨空间位置。One-dimensional: A one-dimensional space consisting only of points within a line. It has only length, no width and height, and can only extend infinitely to both sides. Compared with the XY (latitude and longitude) spatial identification method, it is suitable for efficient size comparison to distinguish spatial positions.
空间数据:指用来表示空间实体的位置、形状、大小及其分布特征诸多方面信息的数据,可以用来描述来自现实世界的目标,具有定位、定性、时间和空间关系等特性。空间数据是一种用点、线、面以及实体等基本空间数据结构,来表示人们赖以生存的自然世界的数据。Spatial data: refers to data used to represent the location, shape, size and distribution characteristics of spatial entities. It can be used to describe targets from the real world and has characteristics such as positioning, qualitative, time and spatial relationships. Spatial data is a kind of data that uses basic spatial data structures such as points, lines, surfaces, and entities to represent the natural world on which people rely for survival.
参见图1,示出的是本公开实施例提供的一种数据处理方法的主要流程图,包括如下步骤:Referring to Figure 1, shown is a main flow chart of a data processing method provided by an embodiment of the present disclosure, which includes the following steps:
S101:对业务地图进行网格划定;其中,业务地图由空间位置信 息构成,每个网格包括一个空间位置信息,每个空间位置信息包括一个或多个空间数据;S101: Grid the business map; the business map consists of spatial location information Information composition, each grid includes a spatial location information, and each spatial location information includes one or more spatial data;
S102:对于每个网格,计算网格在纵向的排列数与预设数值的乘积,将所述乘积与网格在横向的排列数之和,作为网格的地址编码标识;S102: For each grid, calculate the product of the vertical arrangement number of the grid and the preset value, and use the sum of the product and the horizontal arrangement number of the grid as the address code identification of the grid;
S103:接收输入的地址编码标识、业务关键词和业务范围,基于输入的地址编码标识进行网格定位,以定位的网格为中心,查找符合业务范围的其他网格,返回其他网格中符合业务关键词的空间数据并显示。S103: Receive the input address code identification, business keywords and business scope, perform grid positioning based on the input address code identification, center on the positioned grid, search for other grids that match the business scope, and return the other grids that match the business scope. The spatial data of business keywords are displayed.
上述实施方式中,对于步骤S101,在分布式文件系统中,首先根据业务需要定义统计单元大小,业务不同所需要的数据精度也不一致,因而可以预先建立业务参数-统计单元大小之间的对应关系。实际操作中,可以按照不同的业务需求定义业务地图,如某一市区的地图。In the above embodiment, for step S101, in the distributed file system, the statistical unit size is first defined according to business needs. Different businesses require different data accuracy. Therefore, the corresponding relationship between the business parameters and the statistical unit size can be established in advance. . In actual operation, business maps can be defined according to different business requirements, such as a map of a certain urban area.
用户若需划分某一业务地图,需通过设备发送对该地图的网格划定请求,如点击计算机界面中的“划分网格”按钮,请求中携带有业务参数,通过查询并使用与该业务参数对应的统计单元大小,实现对该业务地图的网格划定操作,如进行1km*1km的网格划定(可根据实际业务需求调整统计单元大小,一般不小于100m*100m)。If the user needs to divide a certain business map, he needs to send a grid delineation request for the map through the device. For example, click the "Grid" button in the computer interface. The request carries business parameters, which can be queried and used with the business. The size of the statistical unit corresponding to the parameter enables grid delineation of the business map, such as grid delineation of 1km*1km (the size of the statistical unit can be adjusted according to actual business needs, generally not less than 100m*100m).
当业务地图为不规则图形时,可能有些网格下不存在实体数据对应,这些网格后续也不会出现计算需求。参见图2所示,将业务地图横向划分为k个网格,纵向划分为n个网格。本方案设定每个网格下仅覆盖一个空间位置信息,由于空间数据分属不同的空间位置,因而有些空间位置信息下可能涵盖多个空间数据,有些可能仅涵盖一个空间数据。When the business map is an irregular shape, there may be no entity data corresponding to some grids, and these grids will not require subsequent calculations. Referring to Figure 2, the business map is divided into k grids horizontally and n grids vertically. This solution sets that each grid only covers one spatial location information. Since the spatial data belong to different spatial locations, some spatial location information may cover multiple spatial data, and some may only cover one spatial data.
对于步骤S102,基于横向网格总数量或纵向网格总数量的数量级,定义纵向相邻网格的间隔数M,便于后续计算。假设横向网格总数量 k=18944,数量级为4(科学计数法中,把一个数记为a*10^b的形式,b即为数量级),将数量级4加上1(即第一预设数值,仅为示例,实际可调),得到新的数量级5,因而纵向网格的间隔数为1*10^5,表示纵向相邻网格的geo_id的差值,对于使用纵向网格总数量定义M的方式同理。For step S102, based on the order of magnitude of the total number of horizontal grids or the total number of vertical grids, the number M of intervals between longitudinal adjacent grids is defined to facilitate subsequent calculations. Assume that the total number of horizontal grids k=18944, the order of magnitude is 4 (in scientific notation, a number is recorded in the form of a*10^b, b is the order of magnitude), add the order of magnitude 4 to 1 (that is, the first preset value, only an example, Actual adjustable), a new order of magnitude 5 is obtained, so the number of intervals between longitudinal grids is 1*10^5, which represents the difference in geo_id of vertically adjacent grids. The same method is used to define M using the total number of longitudinal grids. .
具体参见图2所示,计算每个网格geo_id=nM+k,此处n表示网格在纵向的排列数、M表示纵向相邻网格的差值、k表示网格在横向的排列数。因此后续根据geo_id计算网格所处行数时,可以采用向下取整方式:INT(geo_id/M),相应的geo_id–M*INT(geo_id/M)的差值,即为网格所处列数。Refer to Figure 2 for details. Calculate each grid geo_id=nM+k, where n represents the number of grids arranged in the longitudinal direction, M represents the difference between adjacent grids in the longitudinal direction, and k represents the number of grids arranged in the transverse direction. . Therefore, when subsequently calculating the number of rows of the grid based on geo_id, the downward rounding method can be used: INT (geo_id/M). The difference between the corresponding geo_id–M*INT (geo_id/M) is the location of the grid. Number of columns.
以上述M=100000为例,假设某一网格的geo_id为89442,计算INT(89442/100000)=0,即表示该网格位于第0行、第89442列,假设另一网格的geo_id为569443,计算INT(569443/100000)=5,即表示该网格位于第52行、第69443列。相应的,该网格相邻的其他网格的geo_id也可以快速计算得出,如左侧网格的geo_id为(569443-1),右侧网格的geo_id为(569443+1),上方网格的geo_id为(569443–100000),下方网格的geo_id为(569443+100000)。Take the above M=100000 as an example. Assume that the geo_id of a certain grid is 89442. Calculate INT (89442/100000) = 0, which means that the grid is located at row 0 and column 89442. Assume that the geo_id of another grid is 569443, calculate INT (569443/100000) = 5, which means that the grid is located at row 52 and column 69443. Correspondingly, the geo_id of other grids adjacent to this grid can also be quickly calculated. For example, the geo_id of the left grid is (569443-1), the geo_id of the right grid is (569443+1), and the geo_id of the upper grid is (569443+1). The geo_id of the grid is (569443–100000), and the geo_id of the grid below is (569443+100000).
实际操作中,虽可以将M值定义为k值,但本方案优选上述方式。假若M=k=18944,对于上述geo_id为569443的网格,INT(569443/18944)=30,后续人工计算上方网格的geo_id为(569443-18944),下方网格的geo_id为(569443+189443),对人工计算不友好,不便于大多数情况对空间关系的人脑直接判断决策。且考虑M值并不影响原业务地图的划分,因此优选上述方式。In actual operation, although the M value can be defined as the k value, the above method is preferred in this solution. If M=k=18944, for the above-mentioned grid with geo_id 569443, INT (569443/18944)=30, subsequent manual calculation of the geo_id of the upper grid is (569443-18944), and the geo_id of the lower grid is (569443+189443 ), which is not friendly to manual calculations and is inconvenient for the human brain to directly judge and make decisions on spatial relationships in most cases. And considering that the M value does not affect the division of the original business map, the above method is preferred.
空间数据是基于空间位置的数据,例如某一空间位置中的空间数据包括:Spatial data is data based on spatial location. For example, spatial data in a certain spatial location includes:
气象数据,经度117.34纬度34.532温度32度 Meteorological data, longitude 117.34 latitude 34.532 temperature 32 degrees
地图数据,经度117.34纬度34.532公园Map data, longitude 117.34 latitude 34.532 park
信令数据,经度117.34纬度34.532 5个人Signaling data, longitude 117.34 latitude 34.532 5 people
若脱离空间位置,温度、公园和人数的数据价值不大,但是结合空间位置,即可得知该位置是公园,温度高,活动着5个人。如果再增加时间维度、更多种类的空间数据,结合机器学习,可以产生非常大的数据价值。而不同空间数据的空间关联,往往成本较高,本方案实现将各类空间数据基于上述方式增加geo_id作为空间主键。假设该网格的geo_id为89442,则气象数据的空间主键为89442的气象数据,或其他由89442标识的数据组成,本方案对此不做限制。If separated from the spatial location, the data of temperature, park and number of people are of little value. However, combined with the spatial location, it can be known that the location is a park with high temperature and 5 people active. If you add the time dimension and more types of spatial data, combined with machine learning, you can generate very large data value. The spatial association of different spatial data is often costly. This solution adds geo_id as the spatial primary key to various spatial data based on the above method. Assuming that the geo_id of the grid is 89442, the spatial primary key of the meteorological data is the meteorological data of 89442, or other data identified by 89442. This plan does not limit this.
另外,如信令数据等特定数据,在生成geo_id后需要进一步处理,如去除经纬度信息(保留原id,用于数据回溯),以不暴露高精度经纬度数据,实现数据安全的提升。例如信令数据:经度117.34、纬度34.532、张三、20211205,表示张三于2021年12月5日在这个经纬度出现过,在生成geo_id后则更改为:2958146geo_id、1人、20211205,表示2958146这个网格内有1个人,可将此数据下发进行业务应用。In addition, specific data such as signaling data needs further processing after generating the geo_id, such as removing the latitude and longitude information (retaining the original ID for data backtracking), so as not to expose high-precision latitude and longitude data and improve data security. For example, the signaling data: longitude 117.34, latitude 34.532, Zhang San, 20211205, means that Zhang San appeared at this longitude and latitude on December 5, 2021. After generating the geo_id, it changes to: 2958146 geo_id, 1 person, 20211205, which means 2958146. There is one person in the grid who can distribute this data for business applications.
假设地图数据:2958146geo_id、公园,因而上述信令数据结合地图数据可知公园内有1个人,至于该公园所处位置是需要解密的。解密则通过原id对撞原始经纬度信息,原始信息可由特定部门管理。基于以上可以过滤较多的数据安全问题,同时不影响数据使用。Assume that the map data is: 2958146geo_id, park. Therefore, the above signaling data combined with the map data shows that there is a person in the park. The location of the park needs to be decrypted. Decryption uses the original ID to collide with the original longitude and latitude information, and the original information can be managed by a specific department. Based on the above, more data security issues can be filtered without affecting data usage.
对于步骤S103,在业务建设与挖掘中,基于geo_id建设空间距离的计算工具,如临近点工具,核密度工具等,实现空间计算方法在分布式系统中的高效应用。For step S103, in business construction and mining, spatial distance calculation tools are constructed based on geo_id, such as nearby point tools, kernel density tools, etc., to achieve efficient application of spatial calculation methods in distributed systems.
计算任意两个geo_id的空间距离,示意代码如下:Calculate the spatial distance between any two geo_ids. The code is as follows:
--id1 2958146--id1 2958146
--id2 10064823 --id2 10064823
--M=100000--M=100000
--每个id以1km长度为度量单位--Each ID is measured in 1km length
SELECTSELECT
abs(int(SUBSTR(a.id1,-LENGTH(100000)+1)-SUBSTR(a.id2,-LENGTH(100000)+1)))AS水平距离,abs(int(SUBSTR(a.id1,-LENGTH(100000)+1)-SUBSTR(a.id2,-LENGTH(100000)+1)))AS horizontal distance,
abs(int((a.id2-a.id1)/100000))AS垂直距离,abs(int((a.id2-a.id1)/100000))AS vertical distance,
abs(int(SUBSTR(a.id1,-LENGTH(100000)+1)-SUBSTR(a.id2,-LENGTH(100000)+1)))+abs(int((a.id2-a.id1)/100000))AS曼哈顿距离,abs(int(SUBSTR(a.id1,-LENGTH(100000)+1)-SUBSTR(a.id2,-LENGTH(100000)+1)))+abs(int((a.id2-a.id1)/ 100000))AS Manhattan distance,
power(power(abs(int(SUBSTR(a.id1,-LENGTH(100000)+1)-SUBSTR(a.id2,-LENGTH(100000)+1))),2)+power(abs(int((a.id2-a.id1)/100000)),2),0.5)AS欧式距离power(power(abs(int(SUBSTR(a.id1,-LENGTH(100000)+1)-SUBSTR(a.id2,-LENGTH(100000)+1))),2)+power(abs(int(( a.id2-a.id1)/100000)),2),0.5)AS Euclidean distance
FROMFROM
(SELECT 2958146 AS id1,10064823 AS id2)a(SELECT 2958146 AS id1,10064823 AS id2)a
其中,abs是一个函数,取绝对值。substr是C++语言函数,主要功能是复制子字符串,要求从指定位置开始,并具有指定的长度。LENGTH为长度。POWER(number,power),其中参数number表示底数,参数power表示指数。由此计算可得,两点水平距离6677km,垂直距离71km,曼哈顿距离6784km,欧式距离6677.38km,基于横向距离和纵向距离的初步判断,更可以快速计算出欧氏距离与曼哈顿距离,整体计算量均极小。上述四个距离只是示例,实际还可以有其他公式,具体使用时根据需要进行计算。Among them, abs is a function, taking the absolute value. substr is a C++ language function. Its main function is to copy a substring, starting from a specified position and having a specified length. LENGTH is the length. POWER(number,power), where the parameter number represents the base and the parameter power represents the exponent. From this calculation, the horizontal distance between two points is 6677km, the vertical distance is 71km, the Manhattan distance is 6784km, and the Euclidean distance is 6677.38km. Based on the preliminary judgment of the horizontal distance and vertical distance, the Euclidean distance and Manhattan distance can be quickly calculated, and the overall calculation amount is All extremely small. The above four distances are just examples. In fact, there can be other formulas, and they should be calculated as needed when used.
需要说明的是,底层逻辑是基于CGCS2000投影坐标系(米进制)进行的网格划分,geo_id本身不是米进制,但通过任意两个geo_id,可以快速计算两者距离(米进制),geo_id本身转换需要业务地图的空间维度表。It should be noted that the underlying logic is grid division based on the CGCS2000 projected coordinate system (meter system). geo_id itself is not in meter system, but through any two geo_ids, the distance between the two (meter system) can be quickly calculated. The conversion of geo_id itself requires the spatial dimension table of the business map.
后续用户应用时,只需输入geo_id、业务关键词和业务范围,业务范围通常指定水平方向、垂直方向、周围、斜角方向一定距离内/外等,对于不同的业务范围使用不同的距离计算公式,如上述水平距离、 垂直距离、曼哈顿距离和欧式距离。基于输入的geo_id定位网格,以该网格为中心,无论上述哪种距离计算公式,均需要输入该网格的geo_id、位于该网格周围且与该业务范围对应的其他网格的geo_id、纵向相邻网格的间隔数M,计算出该网格与其他网格的距离,进而筛选出符合该业务范围的一个或多个网格。查找符合业务关键词和业务范围的其他网格,返回其他网格中的空间数据并显示。When subsequent users apply, they only need to enter geo_id, business keywords and business scope. The business scope usually specifies a certain distance inside/outside in the horizontal direction, vertical direction, surrounding, oblique direction, etc. Different distance calculation formulas are used for different business scopes. , such as the above horizontal distance, Vertical distance, Manhattan distance and Euclidean distance. Position the grid based on the input geo_id, with the grid as the center. Regardless of the above distance calculation formula, you need to enter the geo_id of the grid, the geo_id of other grids located around the grid and corresponding to the business scope, The number M of intervals between vertical adjacent grids is used to calculate the distance between this grid and other grids, and then one or more grids that fit the business scope are screened out. Find other grids that match the business keywords and business scope, return the spatial data in other grids and display them.
例如,用户想计算某一geo_id周围100m范围内的便利店数量,则业务范围为周围100m距离内,基于该业务范围过滤掉与该geo_id距离大于100m的空间数据,再使用业务关键词“便利店”,过滤掉与该geo_id距离在100m范围内的非便利店空间数据。For example, if the user wants to calculate the number of convenience stores within 100m around a certain geo_id, the business scope is within 100m. Based on this business scope, the spatial data that is greater than 100m away from the geo_id is filtered out, and then the business keyword "convenience store" is used. ", filter out non-convenience store spatial data within 100m of the geo_id.
例如,用户想计算某一geo_id周围1000m范围内是否有幼儿园、小学、医院等,业务关键词为“幼儿园”、“小学”、“医院”,业务范围为周围1000m范围内,也可以快速计算完周边网格与该网格的距离后,依据业务关键词进行筛选,以此实现基于geo_id的空间数据信息关联。For example, the user wants to calculate whether there are kindergartens, primary schools, hospitals, etc. within a range of 1000m around a certain geo_id. The business keywords are "kindergarten", "primary school", "hospital", and the business scope is within a range of 1000m. The calculation can also be completed quickly. After determining the distance between the surrounding grid and the grid, filtering is performed based on business keywords to achieve spatial data information association based on geo_id.
参见图3所示,业务地图包括空间数据,具体地包括空间位置信息,每个空间位置信息包括至少一个空间数据。对业务地图进行网格划分并分别计算每个网格的geo_id,以此得到该业务地图的空间维度表,此为数据保标准化处理阶段。在后续数据生产阶段,可以进行多元空间数据的分析和挖掘,例如查询某一网格附近一定业务范围内的特定空间数据。As shown in FIG. 3 , the business map includes spatial data, specifically spatial location information, and each spatial location information includes at least one spatial data. Grid the business map and calculate the geo_id of each grid separately to obtain the spatial dimension table of the business map. This is the data preservation and standardization processing stage. In the subsequent data production stage, multivariate spatial data analysis and mining can be carried out, such as querying specific spatial data within a certain business scope near a certain grid.
上述实施例所提供的方法,将各类空间数据增加geo_id作为空间主键,后续应用时只需输入geo_id即可可更高效的计算空间距离,提高空间距离计算效率。The method provided by the above embodiment adds geo_id as the spatial primary key to various types of spatial data. In subsequent applications, only the geo_id can be input to calculate the spatial distance more efficiently and improve the efficiency of spatial distance calculation.
本公开实施例所提供的方法,相对于现有技术,至少存在如下有益效果: Compared with the existing technology, the method provided by the embodiments of the present disclosure has at least the following beneficial effects:
1、简化空间位置描述规则,增强空间规律辨识度,提高空间距离计算效率。设置geo_id描述空间位置,geo_id是一串数字,从而使空间位置可直观比较,相较于经纬度的二维标识方法,加快了空间距离的计算速度。高精度数据挖掘分析,如邻近点提取、多元数据空间关联等空间计算需求,可在此基础上进一步展开。1. Simplify spatial location description rules, enhance spatial regularity recognition, and improve spatial distance calculation efficiency. Set geo_id to describe the spatial position. Geo_id is a string of numbers, so that the spatial position can be compared intuitively. Compared with the two-dimensional identification method of longitude and latitude, it speeds up the calculation of spatial distance. High-precision data mining analysis, such as adjacent point extraction, multivariate data spatial correlation and other spatial computing requirements, can be further developed on this basis.
2、加强空间数据安全性。基于geo_id标识,对于特定数据不保留高精度的经纬度坐标,提高了空间数据的安全性。同时也保持了数据精度不丢失,通过geo_id可反向关联原始经纬度信息。2. Strengthen spatial data security. Based on the geo_id identifier, high-precision longitude and latitude coordinates are not retained for specific data, which improves the security of spatial data. At the same time, the data accuracy is not lost, and the original longitude and latitude information can be reversely associated through geo_id.
参见图4,示出了本公开实施例提供的一种数据处理装置400的主要模块示意图,包括:Referring to Figure 4, a schematic diagram of the main modules of a data processing device 400 provided by an embodiment of the present disclosure is shown, including:
划定模块401,用于对业务地图进行网格划定;其中,业务地图由空间位置信息构成,每个网格包括一个空间位置信息,每个空间位置信息包括一个或多个空间数据;The delineation module 401 is used to delimit the business map into grids; wherein the business map is composed of spatial location information, each grid includes one spatial location information, and each spatial location information includes one or more spatial data;
计算模块402,用于对于每个网格,计算网格在纵向的排列数与预设数值的乘积,将所述乘积与网格在横向的排列数之和,作为网格的地址编码标识;The calculation module 402 is used to calculate, for each grid, the product of the vertical arrangement number of the grid and the preset value, and use the sum of the product and the horizontal arrangement number of the grid as the address code identification of the grid;
处理模块403,用于接收输入的地址编码标识、业务关键词和业务范围,基于输入的地址编码标识进行网格定位,以定位的网格为中心,查找符合业务范围的其他网格,返回其他网格中符合业务关键词的空间数据并显示。The processing module 403 is used to receive the input address code identification, business keywords and business scope, perform grid positioning based on the input address code identification, center on the positioned grid, search for other grids that match the business scope, and return other grids. The spatial data that matches the business keywords in the grid are displayed.
本公开实施装置中,所述划定模块401,用于:In the implementation device of this disclosure, the demarcation module 401 is used for:
接收对业务地图的网格划定请求;其中,所述网格划定请求中包括业务参数;Receive a grid delimitation request for a service map; wherein the grid delimitation request includes service parameters;
查询与所述业务参数对应的预设统计单元大小,以使用所述预设统计单元大小对业务地图进行网格划定。Query the preset statistical unit size corresponding to the business parameter, so as to use the preset statistical unit size to grid delineate the business map.
本公开实施装置中,所述预设数值为纵向相邻网格的间隔数;In the implementation device of the present disclosure, the preset value is the interval number of vertically adjacent grids;
所述计算模块402,还用于: The calculation module 402 is also used to:
确定横向网格总数量或纵向网格总数量的数量级,将所述数量级与第一预设数值之和,作为纵向相邻网格的间隔数。The order of magnitude of the total number of transverse grids or the total number of longitudinal grids is determined, and the sum of the order of magnitude and the first preset value is used as the number of intervals between vertically adjacent grids.
本公开实施装置还包括筛选模块,用于:The implementation device of the present disclosure also includes a screening module for:
筛选标注为特定数据的空间数据,去除所述特定数据中的经纬度信息。Filter the spatial data marked as specific data and remove the latitude and longitude information in the specific data.
本公开实施装置中,所述处理模块403,用于:In the implementation device of this disclosure, the processing module 403 is used for:
确定与所述业务范围对应的距离计算公式;Determine the distance calculation formula corresponding to the business scope;
将输入的地址编码标识、位于所述输入的地址编码标识周围的其他地址编码标识、所述预设数值,输入所述距离计算公式中,以计算其他网格与所述定位的网格的距离,进而筛选出符合所述业务范围的一个或多个网格;Enter the input address code identifier, other address code identifiers located around the input address code identifier, and the preset value into the distance calculation formula to calculate the distance between other grids and the positioned grid. , and then filter out one or more grids that meet the business scope;
从所述一个或多个网格的空间数据中,筛选出符合所述业务关键词的一个或多个空间数据。From the spatial data of the one or more grids, one or more spatial data that match the business keywords are filtered out.
另外,在本公开实施例中所述装置的具体实施内容,在上面所述方法中已经详细说明了,故在此重复内容不再说明。In addition, the specific implementation content of the device in the embodiment of the present disclosure has been described in detail in the above method, so the repeated content will not be described here.
图5示出了可以应用本公开实施例的示例性系统架构500,包括终端设备501、502、503,网络504和服务器505(仅仅是示例)。Figure 5 shows an exemplary system architecture 500 to which embodiments of the present disclosure may be applied, including terminal devices 501, 502, 503, a network 504 and a server 505 (examples only).
终端设备501、502、503可以是具有显示屏并且支持网页浏览的各种电子设备,安装有各种通讯客户端应用,用户可以使用终端设备501、502、503通过网络504与服务器505交互,以接收或发送消息等。Terminal devices 501, 502, and 503 can be various electronic devices with display screens and support web browsing, and are installed with various communication client applications. Users can use terminal devices 501, 502, and 503 to interact with the server 505 through the network 504 to Receive or send messages, etc.
网络504用以在终端设备501、502、503和服务器505之间提供通信链路的介质。网络504可以包括各种连接类型,例如有线、无线通信链路或者光纤电缆等等。 Network 504 is used to provide a medium for communication links between terminal devices 501, 502, 503 and server 505. Network 504 may include various connection types, such as wired, wireless communication links, or fiber optic cables, among others.
服务器505可以是提供各种服务的服务器,需要说明的是,本公开实施例所提供的方法一般由服务器505执行,相应地,装置一般设置于服务器505中。The server 505 may be a server that provides various services. It should be noted that the methods provided by the embodiments of the present disclosure are generally executed by the server 505. Accordingly, the device is generally provided in the server 505.
应该理解,图5中的终端设备、网络和服务器的数目仅仅是示意性的。根据实现需要,可以具有任意数目的终端设备、网络和服务器。It should be understood that the number of terminal devices, networks and servers in Figure 5 is only illustrative. Depending on implementation needs, there can be any number of end devices, networks, and servers.
下面参考图6,其示出了适于用来实现本公开实施例的终端设备的计算机系统600的结构示意图。图6示出的终端设备仅仅是一个示例,不应对本公开实施例的功能和使用范围带来任何限制。Referring now to FIG. 6 , a schematic structural diagram of a computer system 600 suitable for implementing a terminal device according to an embodiment of the present disclosure is shown. The terminal device shown in FIG. 6 is only an example and should not impose any restrictions on the functions and scope of use of the embodiments of the present disclosure.
如图6所示,计算机系统600包括中央处理单元(CPU)601,其可以根据存储在只读存储器(ROM)602中的程序或者从存储部分608加载到随机访问存储器(RAM)603中的程序而执行各种适当的动作和处理。在RAM 603中,还存储有系统600操作所需的各种程序和数据。CPU 601、ROM 602以及RAM 603通过总线604彼此相连。输入/输出(I/O)接口605也连接至总线604。As shown in FIG. 6 , computer system 600 includes a central processing unit (CPU) 601 that can operate according to a program stored in a read-only memory (ROM) 602 or loaded from a storage portion 608 into a random access memory (RAM) 603 And perform various appropriate actions and processing. In the RAM 603, various programs and data required for the operation of the system 600 are also stored. CPU 601, ROM 602 and RAM 603 are connected to each other through bus 604. An input/output (I/O) interface 605 is also connected to bus 604.
以下部件连接至I/O接口605:包括键盘、鼠标等的输入部分606;包括诸如阴极射线管(CRT)、液晶显示器(LCD)等以及扬声器等的输出部分607;包括硬盘等的存储部分608;以及包括诸如LAN卡、调制解调器等的网络接口卡的通信部分609。通信部分609经由诸如因特网的网络执行通信处理。驱动器610也根据需要连接至I/O接口605。可拆卸介质611,诸如磁盘、光盘、磁光盘、半导体存储器等等,根据需要安装在驱动器610上,以便于从其上读出的计算机程序根据需要被安装入存储部分608。The following components are connected to the I/O interface 605: an input section 606 including a keyboard, a mouse, etc.; an output section 607 including a cathode ray tube (CRT), a liquid crystal display (LCD), etc., speakers, etc.; and a storage section 608 including a hard disk, etc. ; and a communication section 609 including a network interface card such as a LAN card, a modem, etc. The communication section 609 performs communication processing via a network such as the Internet. Driver 610 is also connected to I/O interface 605 as needed. Removable media 611, such as magnetic disks, optical disks, magneto-optical disks, semiconductor memories, etc., are installed on the drive 610 as needed, so that a computer program read therefrom is installed into the storage portion 608 as needed.
特别地,根据本公开的实施例,上文参考流程图描述的过程可以被实现为计算机软件程序。例如,本公开的实施例包括一种计算机程序产品,其包括承载在计算机可读介质上的计算机程序,该计算机程 序包含用于执行流程图所示的方法的程序代码。在这样的实施例中,该计算机程序可以通过通信部分609从网络上被下载和安装,和/或从可拆卸介质611被安装。在该计算机程序被中央处理单元(CPU)601执行时,执行本公开的系统中限定的上述功能。In particular, according to embodiments of the present disclosure, the processes described above with reference to the flowcharts may be implemented as computer software programs. For example, embodiments of the present disclosure include a computer program product including a computer program carried on a computer-readable medium, the computer program A program contains program code for executing the method shown in the flowchart. In such embodiments, the computer program may be downloaded and installed from the network via communication portion 609, and/or installed from removable media 611. When the computer program is executed by the central processing unit (CPU) 601, the above-described functions defined in the system of the present disclosure are performed.
需要说明的是,本公开所示的计算机可读介质可以是计算机可读信号介质或者计算机可读存储介质或者是上述两者的任意组合。计算机可读存储介质例如可以是——但不限于——电、磁、光、电磁、红外线、或半导体的系统、装置或器件,或者任意以上的组合。计算机可读存储介质的更具体的例子可以包括但不限于:具有一个或多个导线的电连接、便携式计算机磁盘、硬盘、随机访问存储器(RAM)、只读存储器(ROM)、可擦式可编程只读存储器(EPROM或闪存)、光纤、便携式紧凑磁盘只读存储器(CD-ROM)、光存储器件、磁存储器件、或者上述的任意合适的组合。在本公开中,计算机可读存储介质可以是任何包含或存储程序的有形介质,该程序可以被指令执行系统、装置或者器件使用或者与其结合使用。而在本公开中,计算机可读的信号介质可以包括在基带中或者作为载波一部分传播的数据信号,其中承载了计算机可读的程序代码。这种传播的数据信号可以采用多种形式,包括但不限于电磁信号、光信号或上述的任意合适的组合。计算机可读的信号介质还可以是计算机可读存储介质以外的任何计算机可读介质,该计算机可读介质可以发送、传播或者传输用于由指令执行系统、装置或者器件使用或者与其结合使用的程序。计算机可读介质上包含的程序代码可以用任何适当的介质传输,包括但不限于:无线、电线、光缆、RF等等,或者上述的任意合适的组合。It should be noted that the computer-readable medium shown in the present disclosure may be a computer-readable signal medium or a computer-readable storage medium, or any combination of the above two. The computer-readable storage medium may be, for example, but is not limited to, an electrical, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus or device, or any combination thereof. More specific examples of computer readable storage media may include, but are not limited to: an electrical connection having one or more wires, a portable computer disk, a hard drive, random access memory (RAM), read only memory (ROM), removable Programmed read-only memory (EPROM or flash memory), fiber optics, portable compact disk read-only memory (CD-ROM), optical storage device, magnetic storage device, or any suitable combination of the above. In this disclosure, a computer-readable storage medium may be any tangible medium that contains or stores a program for use by or in connection with an instruction execution system, apparatus, or device. In the present disclosure, a computer-readable signal medium may include a data signal propagated in baseband or as part of a carrier wave, carrying computer-readable program code therein. Such propagated data signals may take many forms, including but not limited to electromagnetic signals, optical signals, or any suitable combination of the above. A computer-readable signal medium may also be any computer-readable medium other than a computer-readable storage medium that can send, propagate, or transmit a program for use by or in connection with an instruction execution system, apparatus, or device . Program code embodied on a computer-readable medium may be transmitted using any suitable medium, including but not limited to: wireless, wire, optical cable, RF, etc., or any suitable combination of the foregoing.
附图中的流程图和框图,图示了按照本公开各种实施例的系统、方法和计算机程序产品的可能实现的体系架构、功能和操作。在这点上,流程图或框图中的每个方框可以代表一个模块、程序段、或代码的一部分,上述模块、程序段、或代码的一部分包含一个或多个用于实现规定的逻辑功能的可执行指令。也应当注意,在有些作为替换的 实现中,方框中所标注的功能也可以以不同于附图中所标注的顺序发生。例如,两个接连地表示的方框实际上可以基本并行地执行,它们有时也可以按相反的顺序执行,这依所涉及的功能而定。也要注意的是,框图或流程图中的每个方框、以及框图或流程图中的方框的组合,可以用执行规定的功能或操作的专用的基于硬件的系统来实现,或者可以用专用硬件与计算机指令的组合来实现。The flowcharts and block diagrams in the figures illustrate the architecture, functionality, and operations of possible implementations of systems, methods, and computer program products according to various embodiments of the present disclosure. In this regard, each block in the flowchart or block diagrams may represent a module, segment, or portion of code that contains one or more logic functions that implement the specified executable instructions. It should also be noted that in some cases as replacements Implementations may allow the functions noted in the blocks to occur out of the order noted in the figures. For example, two blocks shown one after another may actually execute substantially in parallel, or they may sometimes execute in the reverse order, depending on the functionality involved. It will also be noted that each block in the block diagram or flowchart illustration, and combinations of blocks in the block diagram or flowchart illustration, can be implemented by special purpose hardware-based systems that perform the specified functions or operations, or may be implemented by special purpose hardware-based systems that perform the specified functions or operations. Achieved by a combination of specialized hardware and computer instructions.
描述于本公开实施例中所涉及到的模块可以通过软件的方式实现,也可以通过硬件的方式来实现。所描述的模块也可以设置在处理器中,例如,可以描述为:一种处理器包括划定模块、计算模块、处理模块。其中,这些模块的名称在某种情况下并不构成对该模块本身的限定,例如,处理模块还可以被描述为“空间数据处理模块”。The modules involved in the embodiments of the present disclosure can be implemented in software or hardware. The described module can also be provided in a processor. For example, it can be described as follows: a processor includes a demarcation module, a calculation module, and a processing module. The names of these modules do not constitute a limitation on the module itself under certain circumstances. For example, the processing module can also be described as a "spatial data processing module".
作为另一方面,本公开还提供了一种计算机可读介质,该计算机可读介质可以是上述实施例中描述的设备中所包含的;也可以是单独存在,而未装配入该设备中。上述计算机可读介质承载有一个或者多个程序,当上述一个或者多个程序被一个该设备执行时,使得该设备包括:As another aspect, the present disclosure also provides a computer-readable medium. The computer-readable medium may be included in the device described in the above embodiments; it may also exist separately without being assembled into the device. The above computer-readable medium carries one or more programs. When the above one or more programs are executed by a device, the device includes:
对业务地图进行网格划定;其中,业务地图由空间位置信息构成,每个网格包括一个空间位置信息,每个空间位置信息包括一个或多个空间数据;Grid the business map; wherein, the business map is composed of spatial location information, each grid includes one spatial location information, and each spatial location information includes one or more spatial data;
对于每个网格,计算网格在纵向的排列数与预设数值的乘积,将所述乘积与网格在横向的排列数之和,作为网格的地址编码标识;For each grid, calculate the product of the vertical arrangement number of the grid and the preset value, and use the sum of the product and the horizontal arrangement number of the grid as the address code identification of the grid;
接收输入的地址编码标识、业务关键词和业务范围,基于输入的地址编码标识进行网格定位,以定位的网格为中心,查找符合业务范围的其他网格,返回其他网格中符合业务关键词的空间数据并显示。Receive the input address code identification, business keywords and business scope, perform grid positioning based on the input address code identification, center on the positioned grid, find other grids that match the business scope, and return the other grids that match the business key Word spatial data and displayed.
根据本公开实施例的技术方案,通过一维数字geo_id描述网格,简化空间位置描述规则,使得空间位置可直观比较,大幅提升空间位置距离的计算效率,特定空间数据可以不再保留高精度的经纬度坐标, 同时保证了数据不丢失,加强空间数据的安全性。According to the technical solution of the embodiment of the present disclosure, the grid is described by one-dimensional digital geo_id, which simplifies the spatial position description rules, enables intuitive comparison of spatial positions, greatly improves the calculation efficiency of spatial position distance, and specific spatial data can no longer retain high-precision latitude and longitude coordinates, At the same time, it ensures that data is not lost and strengthens the security of spatial data.
上述具体实施方式,并不构成对本公开保护范围的限制。本领域技术人员应该明白的是,取决于设计要求和其他因素,可以发生各种各样的修改、组合、子组合和替代。任何在本公开的精神和原则之内所作的修改、等同替换和改进等,均应包含在本公开保护范围之内。 The above-mentioned specific embodiments do not constitute a limitation on the scope of the present disclosure. It will be understood by those skilled in the art that various modifications, combinations, sub-combinations and substitutions may occur depending on design requirements and other factors. Any modifications, equivalent substitutions, improvements, etc. made within the spirit and principles of this disclosure shall be included in the protection scope of this disclosure.

Claims (10)

  1. 一种数据处理方法,包括:A data processing method including:
    对业务地图进行网格划定;其中,业务地图由空间位置信息构成,每个网格包括一个空间位置信息,每个空间位置信息包括一个或多个空间数据;Grid the business map; wherein, the business map is composed of spatial location information, each grid includes one spatial location information, and each spatial location information includes one or more spatial data;
    对于每个网格,计算网格在纵向的排列数与预设数值的乘积,将所述乘积与网格在横向的排列数之和,作为网格的地址编码标识;For each grid, calculate the product of the vertical arrangement number of the grid and the preset value, and use the sum of the product and the horizontal arrangement number of the grid as the address code identification of the grid;
    接收输入的地址编码标识、业务关键词和业务范围,基于输入的地址编码标识进行网格定位,以定位的网格为中心,查找符合业务范围的其他网格,返回其他网格中符合业务关键词的空间数据并显示。Receive the input address code identification, business keywords and business scope, perform grid positioning based on the input address code identification, center on the positioned grid, find other grids that match the business scope, and return the other grids that match the business key Word spatial data and displayed.
  2. 根据权利要求1所述的方法,其中,所述对业务地图进行网格划定,包括:The method according to claim 1, wherein the grid delineation of the business map includes:
    接收对业务地图的网格划定请求;其中,所述网格划定请求中包括业务参数;Receive a grid delimitation request for a service map; wherein the grid delimitation request includes service parameters;
    查询与所述业务参数对应的预设统计单元大小,以使用所述预设统计单元大小对业务地图进行网格划定。Query the preset statistical unit size corresponding to the business parameter, so as to use the preset statistical unit size to grid delineate the business map.
  3. 根据权利要求1或2所述的方法,其中,所述预设数值为纵向相邻网格的间隔数;The method according to claim 1 or 2, wherein the preset value is the interval number of vertically adjacent grids;
    在所述计算网格在纵向的排列数与预设数值的乘积之前,还包括:Before calculating the product of the vertical arrangement number of the grid and the preset value, it also includes:
    确定横向网格总数量或纵向网格总数量的数量级,将所述数量级与第一预设数值之和,作为纵向相邻网格的间隔数。The order of magnitude of the total number of transverse grids or the total number of longitudinal grids is determined, and the sum of the order of magnitude and the first preset value is used as the number of intervals between vertically adjacent grids.
  4. 根据权利要求1所述的方法,其中,在所述作为网格的地址编码标识之后,还包括:The method according to claim 1, wherein after the address encoding identification as a grid, it further includes:
    筛选标注为特定数据的空间数据,去除所述特定数据中的经纬度信息。 Filter the spatial data marked as specific data and remove the latitude and longitude information in the specific data.
  5. 根据权利要求1所述的方法,其中,所述以定位的网格为中心,查找符合业务范围的其他网格,返回其他网格中符合业务关键词的空间数据并显示,包括:The method according to claim 1, wherein, taking the positioned grid as the center, searching for other grids that conform to the business scope, returning and displaying the spatial data in other grids that conform to the business keywords, includes:
    确定与所述业务范围对应的距离计算公式;Determine the distance calculation formula corresponding to the business scope;
    将输入的地址编码标识、位于所述输入的地址编码标识周围的其他地址编码标识、所述预设数值,输入所述距离计算公式中,以计算其他网格与所述定位的网格的距离,进而筛选出符合所述业务范围的一个或多个网格;Enter the input address code identifier, other address code identifiers located around the input address code identifier, and the preset value into the distance calculation formula to calculate the distance between other grids and the positioned grid. , and then filter out one or more grids that meet the business scope;
    从所述一个或多个网格的空间数据中,筛选出符合所述业务关键词的一个或多个空间数据。From the spatial data of the one or more grids, one or more spatial data that match the business keywords are filtered out.
  6. 一种数据处理装置,包括:A data processing device including:
    划定模块,用于对业务地图进行网格划定;其中,业务地图由空间位置信息构成,每个网格包括一个空间位置信息,每个空间位置信息包括一个或多个空间数据;The delimitation module is used to delineate the grid of the business map; wherein the business map is composed of spatial location information, each grid includes one spatial location information, and each spatial location information includes one or more spatial data;
    计算模块,用于对于每个网格,计算网格在纵向的排列数与预设数值的乘积,将所述乘积与网格在横向的排列数之和,作为网格的地址编码标识;The calculation module is used to calculate, for each grid, the product of the vertical arrangement number of the grid and the preset value, and use the sum of the product and the horizontal arrangement number of the grid as the address code identification of the grid;
    处理模块,用于接收输入的地址编码标识、业务关键词和业务范围,基于输入的地址编码标识进行网格定位,以定位的网格为中心,查找符合业务范围的其他网格,返回其他网格中符合业务关键词的空间数据并显示。The processing module is used to receive the input address code identification, business keywords and business scope, perform grid positioning based on the input address code identification, take the positioned grid as the center, find other grids that match the business scope, and return other networks The spatial data that matches the business keywords in the grid are displayed.
  7. 根据权利要求6所述的装置,其中,所述预设数值为纵向相邻网格的间隔数;The device according to claim 6, wherein the preset value is the interval number of longitudinally adjacent grids;
    所述计算模块,还用于:The computing module is also used for:
    确定横向网格总数量或纵向网格总数量的数量级,将所述数量级与第一预设数值之和,作为纵向相邻网格的间隔数。The order of magnitude of the total number of transverse grids or the total number of longitudinal grids is determined, and the sum of the order of magnitude and the first preset value is used as the number of intervals between vertically adjacent grids.
  8. 根据权利要求6所述的装置,其中,所述处理模块,用于: The device according to claim 6, wherein the processing module is used for:
    确定与所述业务范围对应的距离计算公式;Determine the distance calculation formula corresponding to the business scope;
    将输入的地址编码标识、位于所述输入的地址编码标识周围的其他地址编码标识、所述预设数值,输入所述距离计算公式中,以计算其他网格与所述定位的网格的距离,进而筛选出符合所述业务范围的一个或多个网格;Enter the input address code identifier, other address code identifiers located around the input address code identifier, and the preset value into the distance calculation formula to calculate the distance between other grids and the positioned grid. , and then filter out one or more grids that meet the business scope;
    从所述一个或多个网格的空间数据中,筛选出符合所述业务关键词的一个或多个空间数据。From the spatial data of the one or more grids, one or more spatial data that match the business keywords are filtered out.
  9. 一种电子设备,包括:An electronic device including:
    一个或多个处理器;one or more processors;
    存储装置,用于存储一个或多个程序,a storage device for storing one or more programs,
    当所述一个或多个程序被所述一个或多个处理器执行,使得所述一个或多个处理器实现如权利要求1-5中任一所述的方法。When the one or more programs are executed by the one or more processors, the one or more processors are caused to implement the method as described in any one of claims 1-5.
  10. 一种计算机可读介质,其上存储有计算机程序,所述程序被处理器执行时实现如权利要求1-5中任一所述的方法。 A computer-readable medium having a computer program stored thereon, which implements the method according to any one of claims 1-5 when executed by a processor.
PCT/CN2023/077321 2022-05-07 2023-02-21 Data processing method and apparatus WO2023216675A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202210490874.4 2022-05-07
CN202210490874.4A CN117056581A (en) 2022-05-07 2022-05-07 Data processing method and device

Publications (1)

Publication Number Publication Date
WO2023216675A1 true WO2023216675A1 (en) 2023-11-16

Family

ID=88655990

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2023/077321 WO2023216675A1 (en) 2022-05-07 2023-02-21 Data processing method and apparatus

Country Status (2)

Country Link
CN (1) CN117056581A (en)
WO (1) WO2023216675A1 (en)

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102651005A (en) * 2011-02-24 2012-08-29 北京千橡网景科技发展有限公司 Interest point searching method and equipment
US8538679B1 (en) * 2011-04-08 2013-09-17 Oberweis Dairy, Inc. Enhanced geocoding
CN104166660A (en) * 2013-05-20 2014-11-26 鸿富锦精密工业(深圳)有限公司 Search system and method based on range selection
CN105509743A (en) * 2015-11-24 2016-04-20 上海汽车集团股份有限公司 A positioning processing method, a business platform and a network system
CN106156138A (en) * 2015-04-10 2016-11-23 高德软件有限公司 The foundation of the spatial index of point of interest, Perimeter method and apparatus
CN110765331A (en) * 2019-07-08 2020-02-07 中国人民解放军战略支援部队信息工程大学 Retrieval method and system of spatio-temporal data

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102651005A (en) * 2011-02-24 2012-08-29 北京千橡网景科技发展有限公司 Interest point searching method and equipment
US8538679B1 (en) * 2011-04-08 2013-09-17 Oberweis Dairy, Inc. Enhanced geocoding
CN104166660A (en) * 2013-05-20 2014-11-26 鸿富锦精密工业(深圳)有限公司 Search system and method based on range selection
CN106156138A (en) * 2015-04-10 2016-11-23 高德软件有限公司 The foundation of the spatial index of point of interest, Perimeter method and apparatus
CN105509743A (en) * 2015-11-24 2016-04-20 上海汽车集团股份有限公司 A positioning processing method, a business platform and a network system
CN110765331A (en) * 2019-07-08 2020-02-07 中国人民解放军战略支援部队信息工程大学 Retrieval method and system of spatio-temporal data

Also Published As

Publication number Publication date
CN117056581A (en) 2023-11-14

Similar Documents

Publication Publication Date Title
Lv et al. BIM big data storage in WebVRGIS
CN110543571A (en) knowledge graph construction method and device for water conservancy informatization
WO2019072092A1 (en) Ip address positioning method and apparatus, electronic device, and storage medium
CN111046237A (en) User behavior data processing method and device, electronic equipment and readable medium
CN104794221A (en) Multi-dimensional data analyzing system based on service objects
CN104794150A (en) Cloud storage model and management method based on space knowledge cloud environment
CN109002440A (en) Method, apparatus and system for big data multidimensional analysis
Lee et al. Lightweight indexing and querying services for big spatial data
CN108268614A (en) A kind of distribution management method of forest reserves spatial data
Chen et al. High performance integrated spatial big data analytics
CN105404974A (en) Data capitalization method and apparatus and management platform
WO2015070806A1 (en) Audio file management method, device and storage medium
CN110502595A (en) A method of picking up longitude and latitude information from planning map
Nusrat et al. Visualizing cartograms: Goals and task taxonomy
CN109213940B (en) Method, storage medium, equipment and system for realizing user position calculation under big data
WO2023216675A1 (en) Data processing method and apparatus
CN109996275A (en) A kind of method and apparatus obtaining network signal coverage information
CN110110153A (en) A kind of method and apparatus of node searching
Yuan et al. Towards the next-generation GIS: A geometric algebra approach
CN110162521A (en) A kind of payment system transaction data processing method and system
CN110019252A (en) The method, apparatus and electronic equipment of information processing
CN111737490B (en) Knowledge graph ontology model generation method and device based on banking channel
Rai et al. Top-k community similarity search over large road-network graphs
CN111309988B (en) Character string retrieval method and device based on coding and electronic equipment
Xiong et al. HiGIS: An open framework for high performance geographic information system

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 23802451

Country of ref document: EP

Kind code of ref document: A1