CN111046122A - Method for determining attributes of geographic area and related equipment - Google Patents

Method for determining attributes of geographic area and related equipment

Info

Publication number
CN111046122A
Authority
CN
China
Prior art keywords
area
geographic
geographic area
geographical area
degree
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201911243596.7A
Other languages
Chinese (zh)
Inventor
陈哲诗
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shenzhen Jizhi Digital Technology Co Ltd
Original Assignee
Shenzhen Jizhi Digital Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shenzhen Jizhi Digital Technology Co Ltd filed Critical Shenzhen Jizhi Digital Technology Co Ltd
Priority to CN201911243596.7A priority Critical patent/CN111046122A/en
Publication of CN111046122A publication Critical patent/CN111046122A/en
Pending legal-status Critical Current

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/29Geographical information databases
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9537Spatial or temporal dependent retrieval, e.g. spatiotemporal queries
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/23Clustering techniques

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Evolutionary Computation (AREA)
  • Evolutionary Biology (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Artificial Intelligence (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Remote Sensing (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The embodiments of the present application disclose a method and related equipment for determining attributes of geographic areas. Interaction data corresponding to a predetermined pattern between a first geographic area and a second geographic area within a range to be determined is acquired, the interaction data comprising outflow data flowing from the first geographic area to the second geographic area and inflow data flowing from the second geographic area to the first geographic area. The geographic areas within the range to be determined are then clustered according to the interaction data between the first geographic area and the second geographic area. Finally, a geographic area combination that conforms to the predetermined pattern is determined according to the clustering result, the geographic area combination being one attribute determined for the geographic areas. Because the interaction data between the first geographic area and the second geographic area is objective, determining the geographic area combination from this data improves the accuracy of the geographic area combination as an attribute of the geographic areas.

Description

Method for determining attributes of geographic area and related equipment
Technical Field
The present application relates to the field of data processing, and in particular, to a method for determining attributes of a geographic area and a related device.
Background
Some methods of determining attributes of a geographic area rely on subjective features of the area. For example, when geographic areas with similar features are grouped into a combination, the grouping is based on subjective features such as the existing administrative division, so the resulting geographic area combination, which is one attribute of the geographic areas, is not accurate enough.
Therefore, how to determine the attributes of the geographic area more objectively and accurately is a problem which needs to be solved urgently at present.
Disclosure of Invention
In order to solve the technical problem, the application provides an attribute determining method of a geographic area and related equipment, which can improve the accuracy of determining the attribute of the geographic area.
The embodiments of the present application disclose the following technical solutions:
In one aspect, an embodiment of the present application provides a method for determining an attribute of a geographic area, where the method includes:
acquiring interactive data corresponding to a preset mode between a first geographical area and a second geographical area within a range to be determined, wherein the interactive data comprises outflow data of the first geographical area flowing out to the second geographical area and inflow data of the second geographical area flowing into the first geographical area;
clustering the geographic areas within the range to be determined according to the interactive data between the first geographic area and the second geographic area;
determining, according to the clustering result, a geographical area combination that conforms to the predetermined pattern, the geographical area combination belonging to an attribute determined for the geographical area.
In another aspect, an embodiment of the present application provides an apparatus for determining an attribute of a geographic area, where the apparatus includes:
an acquisition unit, configured to acquire interactive data corresponding to a predetermined pattern between a first geographical area and a second geographical area within a range to be determined, wherein the interactive data comprises outflow data flowing from the first geographical area to the second geographical area and inflow data flowing from the second geographical area to the first geographical area;
the clustering unit is used for clustering the geographic areas in the range to be determined according to the interactive data between the first geographic area and the second geographic area;
a determining unit, configured to determine, according to the clustering result, a geographical area combination that conforms to the predetermined pattern, the geographical area combination belonging to an attribute determined for the geographical area.
In another aspect, an embodiment of the present application provides an attribute determining apparatus for a geographic area, where the apparatus includes a processor and a memory:
the memory is used for storing program codes and transmitting the program codes to the processor;
the processor is configured to execute the method for determining the attribute of the geographic area according to instructions in the program code.
In another aspect, an embodiment of the present application provides a computer-readable storage medium for storing a program code, where the program code is used to execute the method for determining the attribute of the geographic area.
According to the above technical solution, interaction data corresponding to a predetermined pattern between a first geographic area and a second geographic area within a range to be determined is acquired, the interaction data comprising outflow data flowing from the first geographic area to the second geographic area and inflow data flowing from the second geographic area to the first geographic area. The geographic areas within the range to be determined are then clustered according to the interaction data between the first geographic area and the second geographic area. Finally, a geographic area combination that conforms to the predetermined pattern is determined according to the clustering result, the geographic area combination being one attribute determined for the geographic areas. Because the interaction data between the first geographic area and the second geographic area is objective, determining the geographic area combination from this data improves the accuracy of the geographic area combination as an attribute of the geographic areas.
Drawings
To illustrate the embodiments of the present application or the technical solutions in the prior art more clearly, the drawings needed in the description of the embodiments or the prior art are briefly introduced below. The drawings described below show only some embodiments of the present application; those skilled in the art can obtain other drawings from them without creative effort.
Fig. 1 is a flowchart of a method for determining an attribute of a geographic area according to an embodiment of the present application;
Fig. 2 is a flowchart of a method for determining a rank attribute according to an embodiment of the present application;
Fig. 3 is a flowchart of a method for determining centrality according to an embodiment of the present application;
Fig. 4 is a flowchart of a method for determining a contact degree according to an embodiment of the present application;
Fig. 5 is a flowchart of a method for determining outflow and inflow according to an embodiment of the present application;
Fig. 6 is a schematic diagram of determining city attributes according to an embodiment of the present application;
Fig. 7 is a structural diagram of an apparatus for determining an attribute of a geographic area according to an embodiment of the present application.
Detailed Description
Embodiments of the present application are described below with reference to the accompanying drawings.
The embodiments of the present application provide a method for determining an attribute of a geographic area, in which the attribute is determined from objective interaction data between a first geographic area and a second geographic area, thereby improving the accuracy of the determined attribute.
The method for determining the attribute of the geographic area provided by the embodiment of the application can be executed by data processing equipment, such as a terminal and a server.
Next, a method for determining an attribute of the geographic area is described, referring to fig. 1, which shows a flowchart of a method for determining an attribute of a geographic area according to an embodiment of the present application, and as shown in fig. 1, the method includes:
S101: Acquire interaction data corresponding to a predetermined pattern between a first geographic area and a second geographic area within the range to be determined.
In the embodiment of the present application, the range to be determined may include one or more geographic areas. Events such as population movement may occur between the geographic areas and generate objective interaction data between them. The interaction data between two geographic areas may be data generated by events occurring between those two areas. The first geographic area and the second geographic area may be any two geographic areas within the range to be determined. For a first geographic area and a second geographic area within the range to be determined, the interaction data between them may include the outflow data flowing from the first geographic area to the second geographic area and the inflow data flowing from the second geographic area to the first geographic area.
The interaction data may be interaction data corresponding to a predetermined pattern, and the predetermined pattern may be the pattern of the geographic area combination that needs to be determined. In one possible implementation, the predetermined pattern may include one or more of a population aggregation pattern, a network data strong interaction pattern, or a regional economic clustering pattern.
The population aggregation pattern may be a way of determining geographic area combinations with respect to the degree of population aggregation. The interaction data corresponding to this pattern may be the population outflow data from the first geographic area to the second geographic area and the population inflow data from the second geographic area to the first geographic area.
The network data strong interaction pattern may be a way of determining geographic area combinations with respect to the degree of network data interaction, where strong interaction means that the degree of network data interaction between geographic areas is higher than a preset interaction threshold. The interaction data corresponding to this pattern may be data representing the amount of network data flowing from the first geographic area to the second geographic area and data representing the amount of network data flowing from the second geographic area to the first geographic area.
The regional economic clustering pattern may be a way of determining geographic area combinations with respect to the economy. The interaction data corresponding to this pattern may be the economic outflow data from the first geographic area to the second geographic area and the economic inflow data from the second geographic area to the first geographic area.
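The relationship between outflow and inflow data described above can be sketched as a small container of directed flow records. This is only an illustration: the class name `InteractionData`, the pattern label, the city names, and the amounts are all hypothetical and do not come from the application.

```python
from collections import defaultdict


class InteractionData:
    """Directed flow amounts between geographic areas for one predetermined pattern.

    Hypothetical helper for illustration only; not part of the application.
    """

    def __init__(self, pattern):
        self.pattern = pattern              # e.g. "population_aggregation"
        self._flows = defaultdict(float)    # (origin, destination) -> amount

    def add_flow(self, origin, destination, amount):
        # Accumulate the amount flowing from origin to destination.
        self._flows[(origin, destination)] += amount

    def outflow(self, first, second):
        """Amount flowing out of `first` into `second`."""
        return self._flows.get((first, second), 0.0)

    def inflow(self, first, second):
        """Amount flowing into `first` from `second`."""
        return self._flows.get((second, first), 0.0)


data = InteractionData("population_aggregation")
data.add_flow("city_a", "city_b", 1200)   # made-up population counts
data.add_flow("city_b", "city_a", 800)
```

With this layout, the inflow of one area from another is simply the outflow recorded in the opposite direction, so each pair of areas needs only its two directed amounts.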
S102: Cluster the geographic areas within the range to be determined according to the interaction data between the first geographic area and the second geographic area.
In a specific implementation, the geographic areas within the range to be determined may be clustered according to the interaction data between the first geographic area and the second geographic area by means of, for example, a graph clustering algorithm.
S103: Determine, according to the clustering result, the geographic area combination that conforms to the predetermined pattern.
The geographic area combination may be one attribute determined for the geographic areas.
For the method of S101-S103, in a specific implementation, assume that the range to be determined is a province, the geographic areas are cities, and the predetermined pattern is the population aggregation pattern. The floating population data between the cities in the province can then be obtained from map data as the interaction data, and a population flow graph model is established. A clustering algorithm such as the Louvain graph clustering algorithm is then applied, and each city combination in the clustering result is recorded as a city group, that is, a geographic area combination that conforms to the predetermined pattern.
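The Louvain algorithm searches for a partition of the graph that maximizes modularity. The sketch below does not implement Louvain itself; it only computes the modularity of a given partition of a population flow graph, which shows why cities with strong mutual flows end up in the same city group. Summing outflow and inflow into one undirected edge weight is an assumption made here, and the city names and flow values are made up.

```python
def symmetric_weights(flows):
    """Combine both flow directions into one undirected weight per city pair (assumed)."""
    weights = {}
    for (src, dst), amount in flows.items():
        if src == dst:
            continue  # ignore self-flows
        key = frozenset((src, dst))
        weights[key] = weights.get(key, 0.0) + amount
    return weights


def modularity(weights, communities):
    """Newman modularity Q of a partition of a weighted undirected graph."""
    strength = {}
    for pair, w in weights.items():
        for node in pair:
            strength[node] = strength.get(node, 0.0) + w
    two_m = sum(strength.values())  # twice the total edge weight
    q = 0.0
    for community in communities:
        members = list(community)
        for i in members:
            for j in members:
                a_ij = weights.get(frozenset((i, j)), 0.0) if i != j else 0.0
                q += a_ij - strength.get(i, 0.0) * strength.get(j, 0.0) / two_m
    return q / two_m


# Hypothetical directed population flows between four cities.
flows = {("A", "B"): 6, ("B", "A"): 4, ("C", "D"): 7, ("D", "C"): 3, ("B", "C"): 1}
weights = symmetric_weights(flows)
q_grouped_by_flow = modularity(weights, [{"A", "B"}, {"C", "D"}])
q_mismatched = modularity(weights, [{"A", "C"}, {"B", "D"}])
```

The partition that keeps the heavily connected pairs together scores a higher modularity than the mismatched one, which is the kind of partition a modularity-maximizing method would favor.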
According to the above technical solution, interaction data corresponding to a predetermined pattern between a first geographic area and a second geographic area within a range to be determined is acquired, the interaction data comprising outflow data flowing from the first geographic area to the second geographic area and inflow data flowing from the second geographic area to the first geographic area. The geographic areas within the range to be determined are then clustered according to the interaction data between the first geographic area and the second geographic area. Finally, a geographic area combination that conforms to the predetermined pattern is determined according to the clustering result, the geographic area combination being one attribute determined for the geographic areas. Because the interaction data between the first geographic area and the second geographic area is objective, determining the geographic area combination from this data improves the accuracy of the geographic area combination as an attribute of the geographic areas.
At present, cities are generally graded using statistical indexes such as fiscal revenue and GDP. Because the statistical rules are set manually, subjective factors have a large influence and the grading is not accurate enough. Referring to Fig. 2, which shows a flowchart of a method for determining a rank attribute according to an embodiment of the present application, the method may further include:
S201: Acquire the fixed data of each geographic area within the range to be determined, and determine the rank attribute corresponding to each geographic area according to its fixed data.
In the embodiment of the present application, the fixed data of a geographic area may be data about the geographic area itself; for example, the fixed data may be the resident population data of the geographic area. By acquiring the fixed data of each geographic area within the range to be determined, the rank attribute corresponding to each geographic area can be determined.
In a specific implementation, for example, the geographic areas within the range to be determined may be sorted by their fixed data, and the rank attribute of each geographic area may be determined from its position in the sorted order. As another example, the resident population data (fixed data) of the cities (geographic areas) may be clustered using, for example, a hierarchical clustering algorithm to determine the city rank attributes. Specifically, the city rank attribute may be divided into 4 classes by cutting the clustering hierarchy at a suitable position.
Because the geographic areas are graded using objectively existing fixed data, the accuracy of the grading is improved.
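The hierarchical clustering step can be sketched for one-dimensional resident population data as follows. This is a minimal illustration assuming single-linkage agglomerative clustering, stopping when 4 clusters remain; the linkage choice, the function name `rank_levels`, and the population figures are assumptions, not details given in the application.

```python
def rank_levels(fixed_data, num_levels=4):
    """Assign rank levels (1 = largest values) by single-linkage clustering in 1-D."""
    # Start with one cluster per area, ordered by ascending fixed data.
    ordered = sorted(fixed_data.items(), key=lambda kv: kv[1])
    clusters = [[name] for name, _ in ordered]
    while len(clusters) > num_levels:
        # In one dimension the closest pair of clusters is always adjacent:
        # the gap runs from the largest value on the left to the smallest on the right.
        gaps = [
            fixed_data[clusters[i + 1][0]] - fixed_data[clusters[i][-1]]
            for i in range(len(clusters) - 1)
        ]
        i = gaps.index(min(gaps))
        clusters[i:i + 2] = [clusters[i] + clusters[i + 1]]  # merge the closest pair
    levels = {}
    for level, cluster in enumerate(reversed(clusters), start=1):
        for name in cluster:
            levels[name] = level
    return levels


# Hypothetical resident populations (in thousands).
population = {"c1": 100, "c2": 110, "c3": 500, "c4": 520,
              "c5": 2000, "c6": 2100, "c7": 9000}
levels = rank_levels(population)
```

Stopping the merging at 4 clusters plays the role of cutting the hierarchy at a suitable position; a different cut would give a different number of rank classes.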
In order to more accurately characterize the geographic region, in one possible implementation, the attributes determined for the geographic region further include a centrality of the geographic region. The centrality of the geographic area may be used to identify how important the geographic area is in the combination of geographic areas to which it belongs. Referring to fig. 3, which shows a flowchart of a centrality determining method provided in an embodiment of the present application, as shown in fig. 3, the method further includes:
S301: Determine an initial importance matrix according to the fixed data of each geographic area in the target geographic area combination.
The target geographic area combination may be any one of the geographic area combinations determined in S103.
In the embodiment of the present application, a random walk algorithm may be applied to calculate the centrality of a geographic area within the geographic area combination to which it belongs. To this end, the initial importance matrix may be determined from the fixed data of each geographic area in the target geographic area combination.
In a specific implementation, a subgraph may be determined by the graph clustering algorithm for each geographic area combination within the range to be determined. For a geographic area combination (city group) containing $n$ geographic areas (cities), the fixed data (resident population data) of the geographic areas (cities) may be $population_1, population_2, \ldots, population_n$, from which the initial importance matrix is determined as $v_0 = [population_1, population_2, \ldots, population_n]^T$.
S302: Determine a transition matrix corresponding to the target geographic area combination.
In the embodiment of the present application, the transition matrix used for the random walk over the target geographic area combination may be determined.
In a specific implementation, let $X_0, X_1, X_2, \ldots, X_n$ be a Markov random process, where $X_i$ represents the $i$-th state. Then $P(X_{n+1} = x_{n+1} \mid X_0 = x_0, X_1 = x_1, \ldots, X_n = x_n) = P(X_{n+1} = x_{n+1} \mid X_n = x_n)$. Further, the transition matrix may be $T = \alpha P + (1 - \alpha) I$, where $\alpha$ is the preset transition probability and $I$ is the unit matrix whose elements are all 1.
S303: Perform a first preset number of iterations on the initial importance matrix using the transition matrix to determine a target importance matrix.
In the embodiment of the present application, the number of iterations applied to the initial importance matrix of the target geographic area combination may be preset as the first preset number $m$. The first preset number should be chosen so that the target importance matrix converges as far as possible.
Then, the transition matrix $T$ may be applied to the initial importance matrix $v_0$ for the first preset number of iterations to determine the target importance matrix.
In a specific implementation, the initial importance matrix $v_0$ may be iterated by the following formula: $v_m = T^T v_{m-1} = (T^T)^2 v_{m-2} = \cdots = (T^T)^{m-1} v_1 = (T^T)^m v_0$. When $m = 1$, the target importance matrix after one iteration is $v_1 = T^T v_0$.
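The iteration of S301 to S303 can be sketched in plain Python as below. Several points are assumptions made for this sketch rather than statements from the application: $P$ is taken to be the row-normalized flow matrix of the city group, the matrix $I$ is taken to be the all-ones matrix scaled by $1/n$ so that every row of $T$ sums to 1, and the flow values, populations, $\alpha = 0.85$ and $m = 200$ are made-up inputs.

```python
def transition_matrix(flows, alpha):
    """Build T = alpha * P + (1 - alpha) * J / n, with J the all-ones matrix.

    Assumptions: P is the row-normalized flow matrix, and the 1/n scaling of J
    is added here so that each row of T sums to 1.
    """
    n = len(flows)
    T = []
    for row in flows:
        total = sum(row)
        # A row without any flow is treated as uniform (assumed).
        p_row = [x / total for x in row] if total > 0 else [1.0 / n] * n
        T.append([alpha * p + (1.0 - alpha) / n for p in p_row])
    return T


def iterate_importance(T, v0, m):
    """Apply v <- T^T v a total of m times, starting from v0."""
    n = len(v0)
    v = list(v0)
    for _ in range(m):
        # Component j of T^T v is sum_i T[i][j] * v[i].
        v = [sum(T[i][j] * v[i] for i in range(n)) for j in range(n)]
    return v


# Hypothetical flows between 3 cities and their resident populations.
flows = [[0, 8, 2], [6, 0, 4], [5, 5, 0]]
v0 = [300.0, 200.0, 100.0]
T = transition_matrix(flows, alpha=0.85)
v_target = iterate_importance(T, v0, m=200)
```

Because each row of this $T$ sums to 1, the iteration keeps the total of the importance values constant, so the result redistributes the initial population mass across the cities rather than growing without bound.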
S304: Determine the centrality of a first target geographic area with respect to the target geographic area combination according to the target importance matrix and the fixed data of the geographic areas in the target geographic area combination.
The first target geographic area may be any one geographic area in the target geographic area combination.
In the embodiment of the present application, for each geographic area in the target geographic area combination, the ratio of the fixed data of that geographic area to the total fixed data of the geographic areas in the range to be determined may be determined as the corresponding weight. A weight matrix may then be formed from the weights of the geographic areas, and a centrality matrix may be determined from the target importance matrix and the weight matrix, where each element of the centrality matrix represents the centrality of the corresponding geographic area with respect to the target geographic area combination.
To illustrate S304, assume that the target geographic area combination is a city group containing 3 cities (geographic areas), city 1, city 2 and city 3, whose resident population data are $a$, $b$ and $c$ respectively. The ratio of the resident population of each city to the total resident population $(a + b + c)$ of all cities in the city group is calculated, giving the weights of the 3 cities as $a/(a + b + c)$, $b/(a + b + c)$ and $c/(a + b + c)$, and the weight matrix is determined as $[a/(a + b + c), b/(a + b + c), c/(a + b + c)]^T$. Finally, the target importance matrix is multiplied by the weight matrix to obtain the centrality matrix. Each element of the centrality matrix represents the centrality of the corresponding city with respect to the city group.
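The weighting step of S304 can be sketched as follows. The text does not say how the two column matrices are multiplied; this sketch assumes an element-wise product, so that each city keeps one centrality value. The function name `centrality` and the numbers are hypothetical.

```python
def centrality(target_importance, fixed_data):
    """Weight each area's target importance by its share of the group's fixed data.

    Assumption: the product of the importance and weight matrices is element-wise.
    """
    total = sum(fixed_data)
    weights = [x / total for x in fixed_data]   # a/(a+b+c), b/(a+b+c), ...
    return [v * w for v, w in zip(target_importance, weights)]


# Hypothetical target importance values and resident populations a, b, c.
scores = centrality([10.0, 20.0, 30.0], [1.0, 1.0, 2.0])
```

Under this reading, a city with both a high target importance and a large share of the group's resident population receives the highest centrality.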
Because the centrality is calculated from the objective fixed data of the geographic areas in the target geographic area combination, and overly simple measures such as the "first degree" are avoided, the accuracy of centrality as a geographic area attribute is improved.
In a possible implementation, the attributes may further include the contact degree of geographic areas; the contact degree between two geographic areas identifies how closely the two areas are connected. Referring to Fig. 4, which shows a flowchart of a method for determining a contact degree according to an embodiment of the present application, the method may further include:
S401: Determine an initial contact degree vector of a second target geographic area according to the interaction data between the second target geographic area and the other geographic areas.
The second target geographic area may be any geographic area within the range to be determined.
In the embodiment of the present application, a random walk with restart algorithm may be applied to determine the contact degree between the second target geographic area and the other geographic areas. Random walk with restart is an improvement on the random walk algorithm. Starting from a node in the graph, each step faces two choices: move to a randomly selected adjacent node, or return to the starting node. The algorithm includes a restart probability $a$, the probability of returning to the starting node at each step, and accordingly a probability $1 - a$ of moving to an adjacent node.
In addition, the initial contact degree vector of the second target geographic area may be determined according to the interaction data between the second target geographic area and the other geographic areas. For example, assume that the interaction data between the second target geographic area and the other geographic areas is $population_1', population_2', \ldots, population_n'$, where the interaction data between the second target geographic area and one geographic area may include the outflow data from the second target geographic area to that geographic area and the inflow data from that geographic area to the second target geographic area. If there is no interaction data between the second target geographic area and a geographic area, the corresponding entry may be set to 0. The initial contact degree vector is then determined as $[population_1', population_2', \ldots, population_n']^T$.
S402: Perform a second preset number of iterations on the initial contact degree vector according to a preset matrix to determine the target contact degree vector between the second target geographic area and the other geographic areas.
In the embodiment of the present application, the number of iterations applied to the initial contact degree vector may be preset as the second preset number $x$. The second preset number should be chosen so that the target contact degree vector converges as far as possible.
In addition, a preset matrix $W$ may be determined according to the interaction data between the first geographic area and the second geographic area within the range to be determined, that is, according to the interaction data between every two geographic areas within the range to be determined.
In a specific implementation, determining the preset matrix $W$ may proceed as follows. The range to be determined contains $c$ geographic areas. For a first geographic area $z_j$ ($j = 1, \ldots, c$), the interaction data with itself is set to $population_{(j,j)} = 0$, and the interaction data with the $c - 1$ second geographic areas $z_k$ ($k = 1, \ldots, c$, $k \neq j$) is $population_{(j,1)}', \ldots, population_{(j,k)}', \ldots, population_{(j,c)}'$. The preset matrix is then $W = (z_{(j,k)})$, $j = 1, \ldots, c$, $k = 1, \ldots, c$.
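Building the preset matrix from pairwise interaction data can be sketched as below. The sketch assumes that the interaction data between two areas is the sum of the flows in both directions, which makes $W$ symmetric; that choice, the function name `build_preset_matrix`, and the sample flows are assumptions for illustration.

```python
def build_preset_matrix(areas, flows):
    """Return a c x c matrix of pairwise interaction data with a zero diagonal.

    Assumption: the interaction between two areas is outflow plus inflow.
    """
    index = {name: i for i, name in enumerate(areas)}
    c = len(areas)
    W = [[0.0] * c for _ in range(c)]
    for (src, dst), amount in flows.items():
        if src == dst:
            continue  # interaction of an area with itself stays 0
        j, k = index[src], index[dst]
        W[j][k] += amount
        W[k][j] += amount
    return W


# Hypothetical directed flows; pairs without data keep the default 0.
areas = ["a", "b", "c"]
flows = {("a", "b"): 5, ("b", "a"): 3, ("a", "a"): 9}
W = build_preset_matrix(areas, flows)
initial_contact_a = W[areas.index("a")]   # one row serves as an initial contact vector
```

Under this layout, row $j$ of the matrix lists the interaction data of area $z_j$ with every other area, which matches the form of the initial contact degree vector described for S401.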
The initial contact degree vector may then be iterated the second preset number of times based on the preset matrix $W$, for example by an iteration formula [formula image not reproduced in the source text]. Here $i$ denotes the current iteration number. When $i - 1 = 0$, [symbol image not reproduced]; when $i = n$, [symbol image not reproduced], and [symbol image not reproduced] is the target contact degree vector.
After smoothing over the second preset number of iterations, the resulting probability distribution can be regarded as the distribution influenced by the starting node. The random walk with restart algorithm can capture the multi-faceted relationship between two nodes as well as the overall structural information of the graph.
Each element of the resulting target contact degree vector can be used as the contact degree between the second target geographic area and the corresponding one of the other geographic areas.
Interaction data such as the floating population data of a geographic area is simulated by Random Walk with Restart (RWR). Because interaction data is strongly correlated with its origin, RWR can simulate the flow result more accurately. At the same time, a contact degree matrix over all geographic areas can be obtained, which adds edges between geographic areas that have no direct contact degree information. In summary, RWR can capture the global structural information of the graph.
Because the contact degree between geographic areas is determined from objective interaction data between them, the accuracy of the determined contact degree is ensured.
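A random walk with restart over the preset matrix can be sketched as follows. The update used here is the common textbook form $r_i = (1 - a)\,\tilde{W} r_{i-1} + a\,e$, with $\tilde{W}$ the column-normalized matrix and $e$ the indicator vector of the starting area; it is not taken from the application, whose own formula is not reproduced in the text. Normalizing the starting row into the initial vector, the restart probability 0.15, and the sample matrix are likewise assumptions.

```python
def random_walk_with_restart(W, start, restart_prob, iterations):
    """Textbook RWR: r <- (1 - a) * Wn @ r + a * e (assumed form, see lead-in)."""
    c = len(W)
    # Column-normalize W so that each column with any weight sums to 1.
    col_sums = [sum(W[j][k] for j in range(c)) for k in range(c)]
    Wn = [[W[j][k] / col_sums[k] if col_sums[k] else 0.0 for k in range(c)]
          for j in range(c)]
    e = [1.0 if j == start else 0.0 for j in range(c)]   # restart at the start area
    row_total = sum(W[start])
    # Initial vector: the start area's interaction data, normalized (assumed).
    r = [x / row_total for x in W[start]] if row_total else list(e)
    for _ in range(iterations):
        r = [(1.0 - restart_prob) * sum(Wn[j][k] * r[k] for k in range(c))
             + restart_prob * e[j]
             for j in range(c)]
    return r


# Hypothetical symmetric interaction matrix for 3 areas.
W = [[0.0, 8.0, 2.0], [8.0, 0.0, 4.0], [2.0, 4.0, 0.0]]
contact = random_walk_with_restart(W, start=0, restart_prob=0.15, iterations=200)
```

Each entry of `contact` can be read as a contact degree between the starting area and the corresponding area, including areas that have no direct interaction data with the starting area but are reachable through others.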
In one possible implementation, the attributes may further include the cohesion degree of a geographic area, which identifies the degree of flow between the geographic area and the other geographic areas in the geographic area combination to which it belongs. Thus, the method may further include: determining the cohesion degree of a geographic area in the target geographic area combination according to the interaction data between any two geographic areas within the range to be determined and the interaction data between any two geographic areas in the target geographic area combination.
The target geographical area combination can be any one geographical area combination. In determining the cohesion degree of any geographic area a in the target geographic area combination, the calculation can be performed by the following formula:
[Formula image not reproduced in the source text]
Here, the intra-group inflow data refers to the interaction data flowing into geographic area A from the other geographic areas in the target geographic area combination, and the intra-group outflow data refers to the interaction data flowing from geographic area A to the other geographic areas in the target geographic area combination.
Therefore, the cohesion degree of the geographic area in the target geographic area combination is determined through the objective data, and the accuracy of determining the cohesion degree is improved.
In a possible implementation, the attributes may further include an outflow region combination and/or an inflow region combination of the geographic region. The combination of the outflow areas of the geographic areas may be a combination of the geographic areas to which the geographic areas flow, and the combination of the inflow areas of the geographic areas may be a combination of the geographic areas to which the geographic areas flow. The method further comprises the following steps: and if the grade attribute corresponding to the geographic area is determined according to the fixed data of the geographic area, determining the outflow area combination and/or inflow area combination of the first geographic area according to the grade attribute corresponding to the first geographic area and the degree of contact with the second geographic area.
For the first geographic area and a first set consisting of all geographic areas to which the first geographic area flows, if the degree of contact between the first geographic area and a given geographic area in the first set accounts for a high proportion (the outflow proportion) of the sum of the degrees of contact between the first geographic area and all geographic areas in the first set, that geographic area can be added to the outflow area combination of the first geographic area. Likewise, for the first geographic area and a second set consisting of all geographic areas that flow to the first geographic area, if the degree of contact between the first geographic area and a given geographic area in the second set accounts for a high proportion (the inflow proportion) of the sum of the degrees of contact between the first geographic area and all geographic areas in the second set, that geographic area can be added to the inflow area combination of the first geographic area.
In a specific implementation, a suitable outflow proportion threshold and inflow proportion threshold may be determined based on the rank attribute of the first geographic area. The outflow proportion threshold may be used to determine, from the first set of the first geographic area, the geographic areas that join the outflow area combination of the first geographic area. The inflow proportion threshold may be used to determine, from the second set of the first geographic area, the geographic areas that join the inflow area combination of the first geographic area.
When the outflow proportion threshold of the first geographic area is applied to determine, from the first set, the geographic areas that can be added to the outflow area combination of the first geographic area, all geographic areas in the first set can be sorted by their degree of contact with the first geographic area, and the first n geographic areas satisfying the following condition are determined: letting m1 be the sum of the degrees of contact between the first n geographic areas and the first geographic area, and m2 be the sum of the degrees of contact between all geographic areas in the first set and the first geographic area, the ratio of m1 to m2 is less than the outflow proportion threshold, and n is the maximum value for which this holds. The first n geographic areas may then be determined as the outflow area combination of the first geographic area.
When the inflow proportion threshold of the first geographic area is applied to determine, from the second set, the geographic areas that can be added to the inflow area combination of the first geographic area, all geographic areas in the second set can be sorted by their degree of contact with the first geographic area, and the first n' geographic areas satisfying the following condition are determined: letting m1' be the sum of the degrees of contact between the first n' geographic areas and the first geographic area, and m2' be the sum of the degrees of contact between all geographic areas in the second set and the first geographic area, the ratio of m1' to m2' is less than the inflow proportion threshold, and n' is the maximum value for which this holds. The first n' geographic areas may then be determined as the inflow area combination of the first geographic area.
As for setting the outflow proportion threshold and the inflow proportion threshold, for example, both may be set to 0.8 for a city whose rank attribute is first-tier, to 0.7 for a second-tier city, to 0.6 for a third-tier city, and to 0.4 for other cities.
Thus, the combination of outflow areas and/or the combination of inflow areas of the first geographical area may be determined based on the rank attributes corresponding to the first geographical area and the interaction data with the second geographical area.
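The selection of the first n geographic areas can be sketched as a cumulative-share cut-off. The code below is a minimal illustration, not the applicant's implementation: the function name, the dictionary layout, and the rank labels are hypothetical, and sorting from strongest to weakest contact is an assumption, since the text only says the areas are sorted by degree of contact. The threshold values are the example values given above.

```python
# Example thresholds by rank attribute of the first geographic area (from the text).
PROPORTION_THRESHOLDS = {"first-tier": 0.8, "second-tier": 0.7, "third-tier": 0.6}
DEFAULT_THRESHOLD = 0.4  # "other cities"


def select_area_combination(contact, rank):
    """Return the largest prefix of areas whose cumulative share m1/m2 stays below the threshold.

    contact: dict mapping each area in the first (or second) set to its degree
             of contact with the first geographic area
    rank:    rank attribute of the first geographic area
    """
    threshold = PROPORTION_THRESHOLDS.get(rank, DEFAULT_THRESHOLD)
    total = sum(contact.values())  # m2
    if total <= 0:
        return []
    # Assumption: strongest contact first.
    ordered = sorted(contact, key=contact.get, reverse=True)
    selected, running = [], 0.0
    for area in ordered:
        running += contact[area]  # candidate m1 if this area were included
        if running / total >= threshold:
            break  # including this area would reach the threshold, so n is maximal
        selected.append(area)
    return selected


contact = {"B": 0.5, "C": 0.25, "D": 0.125, "E": 0.125}
print(select_area_combination(contact, "first-tier"))   # ['B', 'C']
print(select_area_combination(contact, "second-tier"))  # ['B']
```

The same function serves for the inflow area combination when `contact` holds the degrees of contact for the second set. Because the condition is a strict "less than", a single dominant area whose share already reaches the threshold yields an empty combination.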
In one possible implementation, the attributes may also include an outflow degree or an inflow degree of the geographic area. The outflow degree can be used to identify the degree of interaction between the geographic area and the geographic area to which it flows. The inflow degree can be used to identify the degree of interaction between the geographic area and the geographic area that flows into it. Fig. 5 shows a flowchart of a method for determining the outflow degree and the inflow degree provided in an embodiment of the present application. As shown in fig. 5, the method further includes:
S501: if the rank attribute corresponding to each geographic area is determined according to the fixed data of the geographic area, determine the target situation of the first geographic area with respect to the second geographic area according to the rank attributes of the first geographic area and the second geographic area. If the target situation of the first geographic area with respect to the second geographic area is an outflow situation, execute S502; if it is an inflow situation, execute S503.
The target situation comprises an outflow situation or an inflow situation. If the rank attribute of the first geographic area is not higher than that of the second geographic area, the population and the like in the first geographic area generally flow out of the first geographic area to the second geographic area, and the first geographic area is in an outflow situation; if the rank attribute of the first geographic area is higher than that of the second geographic area, the population and the like in the second geographic area generally flow out of the second geographic area to the first geographic area, and the first geographic area is in an inflow situation.
Therefore, according to the rank attributes of the first geographic area and the second geographic area, if the rank attribute of the first geographic area is higher than that of the second geographic area, it may be determined that the first geographic area is in an inflow situation with respect to the second geographic area, and S503 is then performed; if the rank attribute of the first geographic area is lower than that of the second geographic area, it may be determined that the first geographic area is in an outflow situation with respect to the second geographic area, and S502 is then performed.
S502: determine the outflow degree of the first geographic area with respect to the second geographic area based on the preset degree thresholds corresponding to the outflow situation and according to the degree of association between the first geographic area and the second geographic area.
S503: determine the inflow degree of the first geographic area with respect to the second geographic area based on the preset degree thresholds corresponding to the inflow situation and according to the degree of association between the first geographic area and the second geographic area.
In the embodiment of the present application, corresponding degree thresholds may be determined for the outflow situation and the inflow situation, respectively, to embody the outflow degree and the inflow degree.
In a specific implementation, four thresholds, namely 0.01, 0.05, 0.1 and 0.2, may be set for the outflow situation. For the degree of association between two geographic areas: if the value is greater than or equal to 0.2, the outflow degree may be represented as super-strong dependence; if the value is greater than or equal to 0.1 and less than 0.2, the outflow degree may be represented as strong dependence; if the value is greater than or equal to 0.05 and less than 0.1, the outflow degree may be represented as medium dependence; and if the value is greater than or equal to 0.01 and less than 0.05, the outflow degree may be represented as weak dependence.
Correspondingly, four thresholds, namely 0.01, 0.05, 0.1 and 0.2, may be set for the inflow situation. For the degree of association between two geographic areas: if the value is greater than or equal to 0.2, the inflow degree may be represented as super-strong attraction; if the value is greater than or equal to 0.1 and less than 0.2, the inflow degree may be represented as strong attraction; if the value is greater than or equal to 0.05 and less than 0.1, the inflow degree may be represented as medium attraction; and if the value is greater than or equal to 0.01 and less than 0.05, the inflow degree may be represented as weak attraction.
Therefore, the outflow degree of the first geographic area to the second geographic area can be determined based on the preset degree threshold corresponding to the outflow situation and according to the degree of association between the first geographic area and the second geographic area.
In addition, the inflow degree of the first geographic area to the second geographic area may be determined based on a preset degree threshold corresponding to the inflow situation and according to the degree of association between the first geographic area and the second geographic area.
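Steps S501 to S503 amount to a rank comparison followed by a threshold lookup. The sketch below illustrates this under stated assumptions: ranks are encoded as integers where a larger value means a higher rank (a hypothetical encoding), equal ranks are treated as an outflow situation following the "not higher" wording of S501's explanation, and the function names and label strings are illustrative rather than taken from the source.

```python
# Lower bounds from the example thresholds, checked from strongest to weakest.
DEGREE_THRESHOLDS = [(0.2, "super-strong"), (0.1, "strong"), (0.05, "medium"), (0.01, "weak")]


def target_situation(rank_first, rank_second):
    """S501: compare rank attributes (assumed: larger integer = higher rank)."""
    return "inflow" if rank_first > rank_second else "outflow"


def degree_label(association, situation):
    """S502/S503: map a degree of association to an outflow or inflow degree label."""
    suffix = "attraction" if situation == "inflow" else "dependence"
    for lower_bound, strength in DEGREE_THRESHOLDS:
        if association >= lower_bound:
            return f"{strength} {suffix}"
    return None  # below 0.01 the text assigns no label


situation = target_situation(rank_first=1, rank_second=3)
print(situation, "->", degree_label(0.12, situation))  # outflow -> strong dependence
```

Using one shared threshold table for both situations reflects the example values, which are identical for outflow and inflow; separate tables would be needed if the two sets of preset thresholds differed.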
By performing the method, a main influence range of a geographical area within the range to be determined may be determined.
Next, the attribute determination of the geographic area provided by the embodiment of the present application will be described with reference to an actual application scenario.
Fig. 6 shows a schematic diagram of determining city attributes provided in an embodiment of the present application. As shown in fig. 6, resident population data (fixed data) of cities and floating population data (interaction data) between cities may be input. Then, city groups (namely geographic area combinations) can be determined by applying a graph clustering algorithm to the floating population data; in addition, city rank attributes can be determined by applying a hierarchical clustering algorithm to the resident population data. Next, the degree of cohesion of a city with respect to the city group to which it belongs may be determined from the floating population data.
Next, the centrality of a city with respect to the city group to which it belongs may be determined from the resident population data according to a centrality algorithm (the method of S301 to S304 described above); the degree of contact between cities may be determined from the floating population data according to a contact degree algorithm (the method of S401 to S402 described above). Further, the output circle (outflow area combination) and the gravitational circle (inflow area combination) of a city, as well as the attachment degree (outflow degree) and the attraction degree (inflow degree) of the city, may be determined according to the degree of contact and the rank attribute of the city.
Based on the foregoing method for determining an attribute of a geographic area, an embodiment of the present application further provides an apparatus for determining an attribute of a geographic area. Fig. 7 shows a structural diagram of the apparatus provided in an embodiment of the present application. As shown in fig. 7, the apparatus includes:
an obtaining unit 701, configured to obtain interaction data corresponding to a predetermined pattern between a first geographic area and a second geographic area within a range to be determined, where the interaction data includes outflow data that flows from the first geographic area to the second geographic area and inflow data that flows from the second geographic area to the first geographic area;
a clustering unit 702, configured to cluster the geographic areas within the range to be determined according to the interaction data between the first geographic area and the second geographic area;
a determining unit 703 is configured to determine, according to the clustering result, a geographical area combination that conforms to the predetermined pattern, where the geographical area combination belongs to one attribute determined for the geographical area.
In one possible implementation, the predetermined pattern includes one or a combination of a population aggregation pattern, a network data strong interaction pattern, and a regional economic clustering pattern.
In a possible implementation manner, the attributes further include a rank attribute of a geographic area, where the rank attribute is used to identify a rank of the geographic area within the range to be determined, and the determining unit 703 is further specifically configured to:
acquiring fixed data in each geographic area within the range to be determined, and determining the rank attribute corresponding to the geographic area according to the fixed data in the geographic area.
In a possible implementation manner, the attribute further includes a centrality of the geographic area, where the centrality is used to identify an importance degree of the geographic area in the geographic area combination to which the geographic area belongs, and the determining unit 703 is further specifically configured to:
determining an initial importance matrix according to fixed data in each geographic area in a target geographic area combination, wherein the target geographic area combination is any one geographic area combination;
determining a transfer matrix corresponding to the target geographical area combination;
performing iterative computation on the initial importance matrix for a first preset number of times by applying the transfer matrix, to determine a target importance matrix;
and determining the centrality of a first target geographical area for the target geographical area combination according to the target importance matrix and fixed data in the geographical areas in the target geographical area combination, wherein the first target geographical area is any one geographical area in the target geographical area combination.
In a possible implementation manner, the attribute further includes a contact degree of the geographic areas, where the contact degree is used to identify a degree of association between the geographic areas, and the determining unit 703 is further specifically configured to:
determining an initial contact degree vector of a second target geographical area according to interaction data between the second target geographical area and other geographical areas, wherein the second target geographical area is any geographical area within the range to be determined;
performing iterative computation on the initial contact degree vector for a second preset number of times according to a preset matrix, to determine a target contact degree vector of the second target geographic area with respect to the other geographic areas, wherein one element in the target contact degree vector is the degree of contact between the second target geographic area and the corresponding geographic area among the other geographic areas, and the preset matrix is determined according to the interaction data between the first geographic area and the second geographic area within the range to be determined.
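The iteration of an initial contact degree vector with a preset matrix can be sketched as repeated vector-matrix multiplication. This is only an illustration under loudly stated assumptions: the passage here does not give the update rule or how the preset matrix is built, so the sketch assumes the update v ← v·P and a preset matrix P obtained by row-normalizing a hypothetical interaction table. All names and numbers are invented for the example.

```python
def normalize(row):
    """Scale a row so that it sums to 1 (left unchanged if it sums to 0)."""
    total = sum(row)
    return [x / total for x in row] if total else list(row)


def iterate_contact_vector(initial, preset_matrix, times):
    """Apply the assumed update v <- v . P a preset number of times."""
    v = list(initial)
    n = len(v)
    for _ in range(times):
        v = [sum(v[i] * preset_matrix[i][j] for i in range(n)) for j in range(n)]
    return v


# Hypothetical interaction counts between three areas (row = origin, column = destination).
interaction = [
    [0, 30, 10],
    [20, 0, 20],
    [10, 30, 0],
]
preset_matrix = [normalize(row) for row in interaction]  # assumed construction of P
initial_vector = normalize(interaction[0])               # initial vector for area 0
target_vector = iterate_contact_vector(initial_vector, preset_matrix, times=2)
print(target_vector)
```

Each element of `target_vector` would then be read as the degree of contact between area 0 and the corresponding area. With a row-normalized matrix the elements keep summing to 1, which makes them directly comparable as proportions.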
In a possible implementation manner, the attribute further includes a cohesion degree of the geographic area, where the cohesion degree is used to identify a degree of flow between the geographic area and a geographic area in the geographic area combination, and the determining unit 703 is further specifically configured to:
determining the degree of cohesion of the geographic area in the target geographic area combination according to the interaction data between every two geographic areas within the range to be determined and the interaction data between every two geographic areas in the target geographic area combination, wherein the target geographic area combination is any one geographic area combination.
In a possible implementation manner, the attributes further include an outflow area combination and/or an inflow area combination of a geographic area, where the outflow area combination is a combination of geographic areas to which the geographic area flows, and the inflow area combination is a combination of geographic areas that flow to the geographic area, and the determining unit 703 is further specifically configured to:
if the rank attribute corresponding to each geographic area is determined according to the fixed data of the geographic area, determining the outflow area combination and/or inflow area combination of the first geographic area according to the rank attribute corresponding to the first geographic area and its degree of contact with the second geographic area.
In a possible implementation manner, the attributes further include an outflow degree or an inflow degree of a geographic area, where the outflow degree is used to identify the degree of interaction between the geographic area and the geographic area to which it flows, and the inflow degree is used to identify the degree of interaction between the geographic area and the geographic area that flows into it, and the determining unit 703 is further specifically configured to:
if the rank attribute corresponding to each geographic area is determined according to the fixed data of the geographic area, determining a target situation of the first geographic area with respect to the second geographic area according to the rank attributes of the first geographic area and the second geographic area, wherein the target situation comprises an outflow situation or an inflow situation;
if the target situation of the first geographical area for the second geographical area is an outflow situation, determining the outflow degree of the first geographical area for the second geographical area based on a preset degree threshold corresponding to the outflow situation and according to the degree of association between the first geographical area and the second geographical area;
if the target situation of the first geographical area for the second geographical area is an inflow situation, determining the inflow degree of the first geographical area for the second geographical area based on a preset degree threshold corresponding to the inflow situation and according to the degree of association between the first geographical area and the second geographical area.
According to the above technical solution, interaction data corresponding to a predetermined pattern between a first geographic area and a second geographic area within a range to be determined is obtained, where the interaction data includes outflow data flowing from the first geographic area to the second geographic area and inflow data flowing from the second geographic area to the first geographic area. The geographic areas within the range to be determined are then clustered according to the interaction data between the first geographic area and the second geographic area. Finally, a geographic area combination that conforms to the predetermined pattern is determined according to the clustering result, the geographic area combination being one attribute determined for the geographic area. Because the interaction data between the first geographic area and the second geographic area is objective data, determining the geographic area combination based on it improves the accuracy of this geographic area attribute.
An embodiment of the present application further provides an attribute determining device for a geographic area, where the device includes a processor and a memory:
the memory is used for storing program codes and transmitting the program codes to the processor;
the processor is configured to execute the method for determining the attribute of the geographic area according to instructions in the program code.
An embodiment of the present application further provides a computer-readable storage medium, which is used for storing a program code, where the program code is used for executing the method for determining the attribute of the geographic area.
As can be seen from the above description of the embodiments, those skilled in the art can clearly understand that all or part of the steps in the above embodiment methods can be implemented by software plus a necessary general hardware platform. Based on such understanding, the technical solution of the present application may be essentially or partially implemented in the form of a software product, which may be stored in a storage medium, such as a ROM/RAM, a magnetic disk, an optical disk, etc., and includes several instructions for enabling a computer device (which may be a personal computer, a server, or a network communication device such as a media gateway, etc.) to execute the method according to the embodiments or some parts of the embodiments of the present application.
It should be noted that, in the present specification, the embodiments are described in a progressive manner, each embodiment focuses on its differences from the other embodiments, and the same or similar parts among the embodiments may be referred to each other. The apparatus disclosed in the embodiments corresponds to the method disclosed in the embodiments, so it is described only briefly; for relevant details, refer to the description of the method.
It is further noted that, herein, relational terms such as first and second, and the like may be used solely to distinguish one entity or action from another entity or action without necessarily requiring or implying any actual such relationship or order between such entities or actions. Also, the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, an element defined by the phrase "comprising an … …" does not exclude the presence of other identical elements in a process, method, article, or apparatus that comprises the element.
The previous description of the disclosed embodiments is provided to enable any person skilled in the art to make or use the present application. Various modifications to these embodiments will be readily apparent to those skilled in the art, and the generic principles defined herein may be applied to other embodiments without departing from the spirit or scope of the application. Thus, the present application is not intended to be limited to the embodiments shown herein but is to be accorded the widest scope consistent with the principles and novel features disclosed herein.

Claims (11)

1. A method for determining attributes of a geographical area, the method comprising:
acquiring interaction data corresponding to a predetermined pattern between a first geographic area and a second geographic area within a range to be determined, wherein the interaction data comprises outflow data flowing from the first geographic area to the second geographic area and inflow data flowing from the second geographic area to the first geographic area;
clustering the geographic areas within the range to be determined according to the interaction data between the first geographic area and the second geographic area;
determining, according to the clustering result, a geographic area combination that conforms to the predetermined pattern, the geographic area combination being one attribute determined for the geographic area.
2. The method of claim 1, wherein the predetermined pattern comprises one or a combination of a population aggregation pattern, a network data strong interaction pattern, and a regional economic clustering pattern.
3. The method of claim 1, wherein the attributes further comprise a rank attribute for a geographic region, the rank attribute identifying a rank at which the geographic region is within the range to be determined, the method further comprising:
acquiring fixed data in each geographic area within the range to be determined, and determining the rank attribute corresponding to the geographic area according to the fixed data in the geographic area.
4. The method of claim 3, wherein the attributes further include a centrality of a geographic area, the centrality identifying a degree of importance of the geographic area in the geographic area combination to which it belongs, the method further comprising:
determining an initial importance matrix according to fixed data in each geographic area in a target geographic area combination, wherein the target geographic area combination is any one geographic area combination;
determining a transfer matrix corresponding to the target geographical area combination;
performing iterative computation on the initial importance matrix for a first preset number of times by applying the transfer matrix, to determine a target importance matrix;
and determining the centrality of a first target geographical area for the target geographical area combination according to the target importance matrix and fixed data in the geographical areas in the target geographical area combination, wherein the first target geographical area is any one geographical area in the target geographical area combination.
5. The method of any one of claims 1-4, wherein the attributes further include a contact degree of geographic areas, the contact degree identifying a degree of association between geographic areas, the method further comprising:
determining an initial contact degree vector of a second target geographical area according to interaction data between the second target geographical area and other geographical areas, wherein the second target geographical area is any geographical area within the range to be determined;
performing iterative computation on the initial contact degree vector for a second preset number of times according to a preset matrix, to determine a target contact degree vector of the second target geographic area with respect to the other geographic areas, wherein one element in the target contact degree vector is the contact degree between the second target geographic area and the corresponding geographic area among the other geographic areas, and the preset matrix is determined according to the interaction data between the first geographic area and the second geographic area within the range to be determined.
6. The method of any one of claims 1-4, wherein the attributes further include a degree of cohesion of the geographic area, the degree of cohesion identifying a degree of flow between the geographic area and a geographic area in the combination of geographic areas to which the geographic area belongs, the method further comprising:
and determining the cohesion degree of the geographic area in the target geographic area combination according to the interaction data between every two geographic areas in the range to be determined and the interaction data between every two geographic areas in the target geographic area combination, wherein the target geographic area combination is any one geographic area combination.
7. The method of claim 5, wherein the attributes further comprise an outflow area combination and/or an inflow area combination of a geographic area, wherein the outflow area combination is a combination of geographic areas to which the geographic area flows, and the inflow area combination is a combination of geographic areas that flow to the geographic area, and wherein the method further comprises:
if the rank attribute corresponding to each geographic area is determined according to the fixed data of the geographic area, determining the outflow area combination and/or inflow area combination of the first geographic area according to the rank attribute corresponding to the first geographic area and its contact degree with the second geographic area.
8. The method of claim 5, wherein the attributes further comprise an outflow degree or an inflow degree of a geographic area, wherein the outflow degree is used for identifying a degree of interaction between the geographic area and the geographic area to which it flows, and the inflow degree is used for identifying a degree of interaction between the geographic area and the geographic area that flows into it, and wherein the method further comprises:
if the rank attribute corresponding to each geographic area is determined according to the fixed data of the geographic area, determining a target situation of the first geographic area with respect to the second geographic area according to the rank attributes of the first geographic area and the second geographic area, wherein the target situation comprises an outflow situation or an inflow situation;
if the target situation of the first geographical area for the second geographical area is an outflow situation, determining the outflow degree of the first geographical area for the second geographical area based on a preset degree threshold corresponding to the outflow situation and according to the degree of association between the first geographical area and the second geographical area;
if the target situation of the first geographical area for the second geographical area is an inflow situation, determining the inflow degree of the first geographical area for the second geographical area based on a preset degree threshold corresponding to the inflow situation and according to the degree of association between the first geographical area and the second geographical area.
9. An apparatus for determining attributes of a geographic area, the apparatus comprising:
an acquisition unit, configured to acquire interaction data corresponding to a predetermined pattern between a first geographic area and a second geographic area within a range to be determined, wherein the interaction data comprises outflow data flowing from the first geographic area to the second geographic area and inflow data flowing from the second geographic area to the first geographic area;
a clustering unit, configured to cluster the geographic areas within the range to be determined according to the interaction data between the first geographic area and the second geographic area;
a determining unit, configured to determine, according to the clustering result, a geographical area combination that conforms to the predetermined pattern, the geographical area combination belonging to an attribute determined for the geographical area.
10. An attribute determination device for a geographic area, the device comprising a processor and a memory:
the memory is used for storing program codes and transmitting the program codes to the processor;
the processor is configured to perform the method for determining attributes of a geographic area of any one of claims 1-8 according to instructions in the program code.
11. A computer-readable storage medium, characterized in that the computer-readable storage medium is configured to store a program code for performing the method for determining attributes of a geographic area of any one of claims 1-8.
CN201911243596.7A 2019-12-06 2019-12-06 Method for determining attributes of geographic area and related equipment Pending CN111046122A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201911243596.7A CN111046122A (en) 2019-12-06 2019-12-06 Method for determining attributes of geographic area and related equipment

Publications (1)

Publication Number Publication Date
CN111046122A true CN111046122A (en) 2020-04-21

Family

ID=70234943

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201911243596.7A Pending CN111046122A (en) 2019-12-06 2019-12-06 Method for determining attributes of geographic area and related equipment

Country Status (1)

Country Link
CN (1) CN111046122A (en)

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20170064015A1 (en) * 2015-09-02 2017-03-02 Facebook, Inc. Geographic-Area Cluster
CN108921214A (en) * 2018-06-28 2018-11-30 中国联合网络通信有限公司深圳市分公司 Acquisition methods, device and the computer readable storage medium of City attribution
CN110503485A (en) * 2019-08-27 2019-11-26 京东城市(北京)数字科技有限公司 Geographic area classification method and device, electronic equipment, storage medium

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
赵渺希 et al.: "《基于网络关联的长三角区域腹地划分》" (Hinterland Division of the Yangtze River Delta Region Based on Network Linkages) *

Similar Documents

Publication Publication Date Title
CN108234198B (en) Base station flow prediction method and equipment
CN112084422B (en) Account data intelligent processing method and device
JP6685541B2 (en) Method and apparatus for optimizing user credit score
WO2022217839A1 (en) Air quality prediction method based on deep spatiotemporal similarity
CN103647800A (en) Method and system of recommending application resources
CN105869022B (en) Application popularity prediction method and device
WO2018153292A1 (en) Method for generating social circle, and apparatus and storage medium
CN112559745B (en) Method and related device for determining hot event
WO2021208535A1 (en) Recommendation method and device based on automatic feature grouping
CN110784883B (en) Base station construction evaluation method, device, equipment and storage medium
CN110322295B (en) Relationship strength determination method and system, server and computer readable medium
CN110298687B (en) Regional attraction assessment method and device
CN104750716A (en) Social network site friend recommendation method and social network site friend recommendation device
Alswailim et al. A reputation system to evaluate participants for participatory sensing
CN111061624A (en) Policy execution effect determination method and device, electronic equipment and storage medium
US20190197633A1 (en) Social media account filtering method and apparatus
CN112328865A (en) Information processing and recommending method, device, equipment and storage medium
CN109451334A (en) User, which draws a portrait, generates processing method, device and electronic equipment
CN107784511A (en) A kind of customer loss Forecasting Methodology and device
CN112016836A (en) Method and device for determining similarity between objects
CN111046122A (en) Method for determining attributes of geographic area and related equipment
CN108271183B (en) Neighbor cell optimization method and device based on original measurement report
CN114511125A (en) Space division method, device, equipment and medium
CN108337644B (en) Data processing method and device
CN112488743A (en) Satisfaction degree prediction method, network equipment and storage medium

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination