CN117177177B - Method and system for demographics of small-area occupancy based on signaling data - Google Patents
Method and system for demographics of small-area occupancy based on signaling data Download PDFInfo
- Publication number
- CN117177177B CN117177177B CN202311452812.5A CN202311452812A CN117177177B CN 117177177 B CN117177177 B CN 117177177B CN 202311452812 A CN202311452812 A CN 202311452812A CN 117177177 B CN117177177 B CN 117177177B
- Authority
- CN
- China
- Prior art keywords
- user
- preset
- boundary
- base station
- information
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 230000011664 signaling Effects 0.000 title claims abstract description 51
- 238000000034 method Methods 0.000 title claims abstract description 36
- 230000006399 behavior Effects 0.000 claims description 36
- 230000004048 modification Effects 0.000 abstract description 3
- 238000012986 modification Methods 0.000 abstract description 3
- 230000003068 static effect Effects 0.000 abstract description 2
- 238000010586 diagram Methods 0.000 description 2
- 230000003287 optical effect Effects 0.000 description 2
- 230000001960 triggered effect Effects 0.000 description 2
- 230000006870 function Effects 0.000 description 1
Classifications
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02D—CLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
- Y02D30/00—Reducing energy consumption in communication networks
- Y02D30/70—Reducing energy consumption in communication networks in wireless communication networks
Landscapes
- Mobile Radio Communication Systems (AREA)
- Telephonic Communication Services (AREA)
Abstract
The invention discloses a small-area occupancy demographics method and a system based on signaling data, wherein the method comprises the steps of acquiring occupancy list information; based on a preset base station, extracting users belonging to the preset base station in residence or workplace in a staff table as a preselected user group; acquiring three base station information with highest residence time of users in a preselected user group in a preset time period, and extracting longitude and latitude information of corresponding three base stations; obtaining the actual statistics of the position of the user according to the longitude and latitude of the corresponding three base stations and the weight values of the corresponding base stations; judging whether the user actually counts the position of the job to which the user belongs to be within the boundary of the preset area, if so, the corresponding user is the job population of the preset area; if not, deleting the corresponding user information. The invention bypasses the existing method that the actual position of the user is replaced by the position of the base station, simulates the longitude and latitude coordinate point of the actual position of the user according to the jump condition of the base station during the static period of the user, and is convenient for the modification of the statistical caliber, and the like.
Description
Technical Field
The invention relates to the technical field of data processing, in particular to a small-area occupancy demographics method and system based on signaling data.
Background
When the prior art performs occupancy population statistics, the occupancy population and the attribution of the workplace can be judged only according to the base station or the position of the base station, and the method cannot perform attribution statistics on small area ranges such as buildings, cells and the like.
Accordingly, there is a need for improvement in the art.
Disclosure of Invention
In view of the above problems, an object of the present invention is to provide a method and a system for small-area occupancy demographics based on signaling data, which can more conveniently and accurately count occupancy demographics in a small area, and separate occupancy positions of the occupancy demographics from base station positions, so as to facilitate subsequent statistics.
The first aspect of the present invention provides a method for small-area occupancy demographics based on signaling data, comprising:
acquiring staff information;
based on a preset base station, extracting users belonging to the preset base station in residence or workplace in a staff table as a preselected user group;
acquiring three base station information with highest residence time of users in a preselected user group in a preset time period, and extracting longitude and latitude information of corresponding three base stations;
obtaining the actual statistics of the position of the user according to the longitude and latitude of the corresponding three base stations and the weight values of the corresponding base stations;
judging whether the user actually counts the position of the job to which the user belongs to be within the boundary of the preset area, if so, the corresponding user is the job population of the preset area; if not, deleting the corresponding user information.
In this scheme, the step of acquiring the staff information specifically includes:
acquiring mobile phone signaling data information of a user;
obtaining behavior track information of a corresponding user according to the mobile phone signaling data information of the user;
obtaining a base station with the longest residence time of the corresponding user according to the obtained behavior track information of the corresponding user;
and setting the base station with the longest residence time of the corresponding user as the home base station, and sending the home base station to a preset table for storage to obtain the job-holding table.
In this scheme, the step of obtaining behavior track information of the corresponding user according to the mobile phone signaling data information of the user specifically includes:
obtaining a base station which the corresponding user passes through and time point information of the base station according to the mobile phone signaling data information of the user;
and connecting the positions of the users according to the time sequence of the users passing through the base stations by taking the base stations corresponding to the users as the positions of the users, so as to obtain the behavior track information of the users.
In this scheme, the formula for actually counting the position of the corresponding position of the user is obtained according to the longitude and latitude of the corresponding three base stations and the weight values of the corresponding base stations, specifically:
setting the position of the position to which the user actually statistics belongsThe formula is;
Wherein the method comprises the steps ofRespectively representing longitude and latitude of corresponding three base stations, < ->Respectively represent weight values corresponding to three base stations, and +.>。
In this scheme, the step of acquiring the weight value of the corresponding base station specifically includes:
according to the information of three base stations with highest residence time of users in the preselected user group in a preset time period, obtaining residence time values of the users in the corresponding three base stations, and setting the residence time values as respectivelyThe method comprises the steps of carrying out a first treatment on the surface of the Then-> , ,/> 。
In this scheme, if not, after deleting the corresponding user information, specifically including:
extracting the deleted user, and setting the user as a marked user;
acquiring the longitude and latitude of the position of the mark user to which the mark user actually belongs;
according to the boundary of the preset area and the longitude and latitude of the position of the job to which the marking user actually belongs, obtaining the shortest distance value from the marking user to the boundary of the preset area;
judging whether the shortest distance value from the marking user to the boundary of the preset area is smaller than a preset distance threshold value, if so, acquiring the position of the job to which the actual statistics of the marking user in a preset first time range belongs;
judging whether the actual statistics of the position of the mark user in the preset first time range is within the boundary of the preset area, if so, obtaining the probability that the actual statistics of the position of the mark user is within the boundary of the preset area;
judging whether the probability that the marking user actually counts the position of the corresponding job to be in the boundary of the preset area is larger than a preset probability threshold value, if so, setting the corresponding marking user as the job population of the preset area; if not, the corresponding marked user is not the occupancy population of the preset area.
In this scheme, if yes, the probability that the marking user actually counts the position of the affiliated job to be within the boundary of the preset area is obtained, which specifically includes:
when the actual statistics of the position of the mark user in the preset first time range is within the boundary of the preset area, recording that the actual statistics of the position of the mark user is once within the boundary of the preset area;
when the actual statistical position of the mark user in the preset first time range is not within the boundary of the preset area, recording that the actual statistical position of the mark user is not within the boundary of the preset area once;
acquiring the number of times that the marking user actually counts the position of the job to be in the boundary of the preset area and the number of times that the marking user is not in the boundary of the preset area;
the number of times that the marking user actually counts the position of the belonged position within the boundary of the preset area and the number of times that the marking user is not within the boundary of the preset area are accumulated, and the total number of times is obtained;
dividing the number of times that the marking user actually counts the position of the belonged position within the boundary of the preset area by the total number of times to obtain the probability that the marking user actually counts the position of the belonged position within the boundary of the preset area.
The second aspect of the present invention provides a signaling data-based small-area occupancy demographics system, comprising a memory and a processor, wherein the memory stores a signaling data-based small-area occupancy demographics method program, and the processor executes the signaling data-based small-area occupancy demographics method program to implement the following steps:
acquiring staff information;
based on a preset base station, extracting users belonging to the preset base station in residence or workplace in a staff table as a preselected user group;
acquiring three base station information with highest residence time of users in a preselected user group in a preset time period, and extracting longitude and latitude information of corresponding three base stations;
obtaining the actual statistics of the position of the user according to the longitude and latitude of the corresponding three base stations and the weight values of the corresponding base stations;
judging whether the user actually counts the position of the job to which the user belongs to be within the boundary of the preset area, if so, the corresponding user is the job population of the preset area; if not, deleting the corresponding user information.
In this scheme, the step of acquiring the staff information specifically includes:
acquiring mobile phone signaling data information of a user;
obtaining behavior track information of a corresponding user according to the mobile phone signaling data information of the user;
obtaining a base station with the longest residence time of the corresponding user according to the obtained behavior track information of the corresponding user;
and setting the base station with the longest residence time of the corresponding user as the home base station, and sending the home base station to a preset table for storage to obtain the job-holding table.
In this scheme, the step of obtaining behavior track information of the corresponding user according to the mobile phone signaling data information of the user specifically includes:
obtaining a base station which the corresponding user passes through and time point information of the base station according to the mobile phone signaling data information of the user;
and connecting the positions of the users according to the time sequence of the users passing through the base stations by taking the base stations corresponding to the users as the positions of the users, so as to obtain the behavior track information of the users.
In this scheme, the formula for actually counting the position of the corresponding position of the user is obtained according to the longitude and latitude of the corresponding three base stations and the weight values of the corresponding base stations, specifically:
setting the position of the position to which the user actually statistics belongsThe formula is ;
Wherein the method comprises the steps ofRespectively representing longitude and latitude of corresponding three base stations, < ->Respectively represent weight values corresponding to three base stations, and +.>。
In this scheme, the step of acquiring the weight value of the corresponding base station specifically includes:
according to the information of three base stations with highest residence time of users in the preselected user group in a preset time period, obtaining residence time values of the users in the corresponding three base stations, and setting the residence time values as respectivelyThe method comprises the steps of carrying out a first treatment on the surface of the Then->, ,/>。
In this scheme, if not, after deleting the corresponding user information, specifically including:
extracting the deleted user, and setting the user as a marked user;
acquiring the longitude and latitude of the position of the mark user to which the mark user actually belongs;
according to the boundary of the preset area and the longitude and latitude of the position of the job to which the marking user actually belongs, obtaining the shortest distance value from the marking user to the boundary of the preset area;
judging whether the shortest distance value from the marking user to the boundary of the preset area is smaller than a preset distance threshold value, if so, acquiring the position of the job to which the actual statistics of the marking user in a preset first time range belongs;
judging whether the actual statistics of the position of the mark user in the preset first time range is within the boundary of the preset area, if so, obtaining the probability that the actual statistics of the position of the mark user is within the boundary of the preset area;
judging whether the probability that the marking user actually counts the position of the corresponding job to be in the boundary of the preset area is larger than a preset probability threshold value, if so, setting the corresponding marking user as the job population of the preset area; if not, the corresponding marked user is not the occupancy population of the preset area.
In this scheme, if yes, the probability that the marking user actually counts the position of the affiliated job to be within the boundary of the preset area is obtained, which specifically includes:
when the actual statistics of the position of the mark user in the preset first time range is within the boundary of the preset area, recording that the actual statistics of the position of the mark user is once within the boundary of the preset area;
when the actual statistical position of the mark user in the preset first time range is not within the boundary of the preset area, recording that the actual statistical position of the mark user is not within the boundary of the preset area once;
acquiring the number of times that the marking user actually counts the position of the job to be in the boundary of the preset area and the number of times that the marking user is not in the boundary of the preset area;
the number of times that the marking user actually counts the position of the belonged position within the boundary of the preset area and the number of times that the marking user is not within the boundary of the preset area are accumulated, and the total number of times is obtained;
dividing the number of times that the marking user actually counts the position of the belonged position within the boundary of the preset area by the total number of times to obtain the probability that the marking user actually counts the position of the belonged position within the boundary of the preset area.
The invention discloses a small-area occupancy demographics method and a system based on signaling data, which simulate longitude and latitude coordinate points of the actual position of a user according to the jump condition of a base station during the rest period of the user by bypassing the existing method and system for replacing the actual position of the user by the position of the base station, thereby flexibly judging the small area of the user, such as buildings, cells and the like, and facilitating the modification of the statistical caliber and the like.
Drawings
FIG. 1 illustrates a flow chart of a small area occupancy demographic method based on signaling data in accordance with the present invention;
fig. 2 shows a block diagram of a small area occupancy demographic system based on signaling data in accordance with the present invention.
Detailed Description
In order that the above-recited objects, features and advantages of the present invention will be more clearly understood, a more particular description of the invention will be rendered by reference to the appended drawings and appended detailed description. It should be noted that, in the case of no conflict, the embodiments of the present application and the features in the embodiments may be combined with each other.
In the following description, numerous specific details are set forth in order to provide a thorough understanding of the present invention, however, the present invention may be practiced in other ways than those described herein, and therefore the scope of the present invention is not limited to the specific embodiments disclosed below.
Fig. 1 shows a flow chart of a small area occupancy demographics method based on signaling data of the present invention.
As shown in fig. 1, the small-area occupancy demographics method based on signaling data of the present invention includes:
s101, acquiring staff information;
s102, based on a preset base station, extracting users belonging to the preset base station in a living or working area in a staff table as a preselected user group;
s103, acquiring three pieces of base station information with highest residence time of users in a preselected user group in a preset time period, and extracting longitude and latitude information of the corresponding three base stations;
s104, obtaining the actual statistics of the position of the job of the user according to the longitude and latitude of the corresponding three base stations and the weight values of the corresponding base stations;
s105, judging whether the user actually counts the position of the corresponding job in the boundary of the preset area, if so, the corresponding user is the job population of the preset area; if not, deleting the corresponding user information.
According to the embodiment of the invention, the staff table stores user information and information that the user living or working place belongs to the base station, the preset base station is a boundary of a preset area and a base station in a buffer area range, the buffer area ranges set in different scenes are different, wherein the buffer area ranges set in areas with more population and denser base stations are smaller, the buffer area ranges set in areas with more remote positions and less base stations are larger, for example, the boundary of the preset area is outwards expanded by 20 meters to be set as the buffer area range corresponding to the preset area, and the buffer area ranges are set by a person skilled in the art according to actual requirements.
According to an embodiment of the present invention, the step of obtaining staff information specifically includes:
acquiring mobile phone signaling data information of a user;
obtaining behavior track information of a corresponding user according to the mobile phone signaling data information of the user;
obtaining a base station with the longest residence time of the corresponding user according to the obtained behavior track information of the corresponding user;
and setting the base station with the longest residence time of the corresponding user as the home base station, and sending the home base station to a preset table for storage to obtain the job-holding table.
It should be noted that the behavior track information of the corresponding user includes the duration of residence of the corresponding user on the behavior track and the base station to which the corresponding user belongs, the duration of residence of the user on the behavior track is arranged in order from small to large, the base station with the longest residence time and on the behavior track is extracted, the base station with the longest residence time of the user is set as the home base station, and the preset table stores the user information.
According to the embodiment of the invention, the step of obtaining the behavior track information of the corresponding user according to the mobile phone signaling data information of the user specifically comprises the following steps:
obtaining a base station which the corresponding user passes through and time point information of the base station according to the mobile phone signaling data information of the user;
and connecting the positions of the users according to the time sequence of the users passing through the base stations by taking the base stations corresponding to the users as the positions of the users, so as to obtain the behavior track information of the users.
It should be noted that, the behavior track of the user is formed by drawing a connection line of the base station according to the time sequence of the user, and the behavior track information of the user includes the information of the base station of the user and the time length of the corresponding user in the base station of the user.
According to the embodiment of the invention, the formula for actually counting the position of the corresponding job by the user is obtained according to the longitude and latitude of the corresponding three base stations and the weight values of the corresponding base stations, specifically:
setting the position of the position to which the user actually statistics belongsThe formula is;
Wherein the method comprises the steps ofRespectively representing longitude and latitude of corresponding three base stations, < ->Respectively represent weight values corresponding to three base stations, and +.>。
It should be noted that, according to the information of three base stations with longest residence time of the user in the pre-selected user group in the preset time period, the corresponding user actual statistics of the position of the corresponding user is obtained through calculation; weighting the base station position where the user resides according to the weight value of the base station, whereinRepresenting the longitude and latitude of the base station with the number 1; />Representing the longitude and latitude of the base station with the number of 2; />Representing the longitude and latitude of the base station with the number of 2; wherein->Representing longitude, & gt of base station>Representing the latitude of the base station; wherein->A weight value representing number 1; />A weight value representing the base station of number 2; />The weight value of the base station of number 3 is indicated.
According to the embodiment of the invention, the step of acquiring the weight value of the corresponding base station specifically comprises the following steps:
according to the information of three base stations with highest residence time of users in the preselected user group in a preset time period, obtaining residence time values of the users in the corresponding three base stations, and setting the residence time values as respectivelyThe method comprises the steps of carrying out a first treatment on the surface of the Then-> ,,/>。
The longer the user resides in the base station location, the higher the base station location weight value.
According to an embodiment of the present invention, if not, deleting the corresponding user information includes:
extracting the deleted user, and setting the user as a marked user;
acquiring the longitude and latitude of the position of the mark user to which the mark user actually belongs;
according to the boundary of the preset area and the longitude and latitude of the position of the job to which the marking user actually belongs, obtaining the shortest distance value from the marking user to the boundary of the preset area;
judging whether the shortest distance value from the marking user to the boundary of the preset area is smaller than a preset distance threshold value, if so, acquiring the position of the job to which the actual statistics of the marking user in a preset first time range belongs;
judging whether the actual statistics of the position of the mark user in the preset first time range is within the boundary of the preset area, if so, obtaining the probability that the actual statistics of the position of the mark user is within the boundary of the preset area;
judging whether the probability that the marking user actually counts the position of the corresponding job to be in the boundary of the preset area is larger than a preset probability threshold value, if so, setting the corresponding marking user as the job population of the preset area; if not, the corresponding marked user is not the occupancy population of the preset area.
It should be noted that, in order to prevent deviation of some user signaling data, after user information in the staff table is deleted, the deleted user is marked and set as a marked user, if the preset first time range is set to 15 days, the staff position of the marked user in the actual statistics of 15 days is recorded, according to whether the staff position of the marked user in the actual statistics of the preset first time range is judged in the boundary of the preset area, the probability that the staff position of the marked user in the actual statistics is in the boundary of the preset area is determined, if the corresponding probability is greater than the preset probability threshold, the staff population of the corresponding marked user as the preset area is indicated, and the preset distance threshold and the preset probability threshold are set by a person skilled in the art.
According to the embodiment of the present invention, if yes, the probability that the marking user actually counts the position of the job to which the marking user belongs is within the boundary of the preset area is obtained, which specifically includes:
when the actual statistics of the position of the mark user in the preset first time range is within the boundary of the preset area, recording that the actual statistics of the position of the mark user is once within the boundary of the preset area;
when the actual statistical position of the mark user in the preset first time range is not within the boundary of the preset area, recording that the actual statistical position of the mark user is not within the boundary of the preset area once;
acquiring the number of times that the marking user actually counts the position of the job to be in the boundary of the preset area and the number of times that the marking user is not in the boundary of the preset area;
the number of times that the marking user actually counts the position of the belonged position within the boundary of the preset area and the number of times that the marking user is not within the boundary of the preset area are accumulated, and the total number of times is obtained;
dividing the number of times that the marking user actually counts the position of the belonged position within the boundary of the preset area by the total number of times to obtain the probability that the marking user actually counts the position of the belonged position within the boundary of the preset area.
It should be noted that, the probability that the marking user actually counts the position of the home position within the boundary of the preset area is equal to the number of times that the marking user actually counts the position of the home position within the boundary of the preset area divided by the total number of times.
According to an embodiment of the present invention, further comprising:
acquiring an area value in a preset area and information of the number of the living population;
dividing the number of the occupancy population by the area value in the preset area to obtain population density of the corresponding preset area;
judging whether the population density of the preset area is larger than a preset population density threshold value, if so, triggering population prompt information;
and sending the population prompt information to a preset management end for display.
It should be noted that, when the population density in the preset area is greater than the preset population density threshold, the population in the corresponding preset area is too compact, so that the population prompt information is triggered, and the preset proportion threshold is set by a person skilled in the art.
According to an embodiment of the present invention, further comprising:
extracting the residence time values of the users in the preselected user group in the corresponding three base stations;
obtaining a maximum time value according to the time values of the users in the preselected user group in the corresponding three base stations;
judging whether the maximum time value is larger than a preset time threshold value, if so, the users in the corresponding preselected user group are reasonable users; if not, deleting the users in the corresponding preselected user group.
It should be noted that, the time value of the user in the pre-selected user group corresponding to the three base stations is the sum of the residence times of the user in the pre-selected user group corresponding to the three base stations, for example, the preset time threshold is 8 hours, when the sum of the residence times of the user in the pre-selected user group corresponding to the three base stations exceeds 8 hours, the user in the pre-selected user group is a reasonable user, otherwise, the user in the pre-selected user group is deleted.
Fig. 2 shows a block diagram of a small area occupancy demographic system based on signaling data in accordance with the present invention.
As shown in fig. 2, a second aspect of the present invention provides a signaling data based small area occupancy demographics system 2, comprising a memory 21 and a processor 22, wherein the memory stores a signaling data based small area occupancy demographics method program, which when executed by the processor, implements the steps of:
acquiring staff information;
based on a preset base station, extracting users belonging to the preset base station in residence or workplace in a staff table as a preselected user group;
acquiring three base station information with highest residence time of users in a preselected user group in a preset time period, and extracting longitude and latitude information of corresponding three base stations;
obtaining the actual statistics of the position of the user according to the longitude and latitude of the corresponding three base stations and the weight values of the corresponding base stations;
judging whether the user actually counts the position of the job to which the user belongs to be within the boundary of the preset area, if so, the corresponding user is the job population of the preset area; if not, deleting the corresponding user information.
According to the embodiment of the invention, the staff table stores user information and information that the user living or working place belongs to the base station, the preset base station is a boundary of a preset area and a base station in a buffer area range, the buffer area ranges set in different scenes are different, wherein the buffer area ranges set in areas with more population and denser base stations are smaller, the buffer area ranges set in areas with more remote positions and less base stations are larger, for example, the boundary of the preset area is outwards expanded by 20 meters to be set as the buffer area range corresponding to the preset area, and the buffer area ranges are set by a person skilled in the art according to actual requirements.
According to an embodiment of the present invention, the step of obtaining staff information specifically includes:
acquiring mobile phone signaling data information of a user;
obtaining behavior track information of a corresponding user according to the mobile phone signaling data information of the user;
obtaining a base station with the longest residence time of the corresponding user according to the obtained behavior track information of the corresponding user;
and setting the base station with the longest residence time of the corresponding user as the home base station, and sending the home base station to a preset table for storage to obtain the job-holding table.
It should be noted that the behavior track information of the corresponding user includes the duration of residence of the corresponding user on the behavior track and the base station to which the corresponding user belongs, the duration of residence of the user on the behavior track is arranged in order from small to large, the base station with the longest residence time and on the behavior track is extracted, the base station with the longest residence time of the user is set as the home base station, and the preset table stores the user information.
According to the embodiment of the invention, the step of obtaining the behavior track information of the corresponding user according to the mobile phone signaling data information of the user specifically comprises the following steps:
obtaining a base station which the corresponding user passes through and time point information of the base station according to the mobile phone signaling data information of the user;
and connecting the positions of the users according to the time sequence of the users passing through the base stations by taking the base stations corresponding to the users as the positions of the users, so as to obtain the behavior track information of the users.
It should be noted that, the behavior track of the user is formed by drawing a connection line of the base station according to the time sequence of the user, and the behavior track information of the user includes the information of the base station of the user and the time length of the corresponding user in the base station of the user.
According to the embodiment of the invention, the formula for actually counting the position of the corresponding job by the user is obtained according to the longitude and latitude of the corresponding three base stations and the weight values of the corresponding base stations, specifically:
setting the position of the position to which the user actually statistics belongsThe formula is;
Wherein the method comprises the steps ofRespectively representing longitude and latitude of corresponding three base stations, < ->Respectively represent weight values corresponding to three base stations, and +.>。
It should be noted that, according to the information of three base stations with longest residence time of the user in the pre-selected user group in the preset time period, the corresponding user actual statistics of the position of the corresponding user is obtained through calculation; weighting the base station position where the user resides according to the weight value of the base station, whereinRepresenting the longitude and latitude of the base station with the number 1; />Representing the longitude and latitude of the base station with the number of 2; />Representing the longitude and latitude of the base station with the number of 2; wherein->Representing longitude, & gt of base station>Representing the latitude of the base station; wherein->A weight value representing number 1; />A weight value representing the base station of number 2; />The weight value of the base station of number 3 is indicated.
According to the embodiment of the invention, the step of acquiring the weight value of the corresponding base station specifically comprises the following steps:
according to the information of three base stations with highest residence time of users in the preselected user group in a preset time period, obtaining residence time values of the users in the corresponding three base stations, and setting the residence time values as respectivelyThe method comprises the steps of carrying out a first treatment on the surface of the Then-> ,,/>。
The longer the user resides in the base station location, the higher the base station location weight value.
According to an embodiment of the present invention, if not, deleting the corresponding user information includes:
extracting the deleted user, and setting the user as a marked user;
acquiring the longitude and latitude of the position of the mark user to which the mark user actually belongs;
according to the boundary of the preset area and the longitude and latitude of the position of the job to which the marking user actually belongs, obtaining the shortest distance value from the marking user to the boundary of the preset area;
judging whether the shortest distance value from the marking user to the boundary of the preset area is smaller than a preset distance threshold value, if so, acquiring the position of the job to which the actual statistics of the marking user in a preset first time range belongs;
judging whether the actual statistics of the position of the mark user in the preset first time range is within the boundary of the preset area, if so, obtaining the probability that the actual statistics of the position of the mark user is within the boundary of the preset area;
judging whether the probability that the marking user actually counts the position of the corresponding job to be in the boundary of the preset area is larger than a preset probability threshold value, if so, setting the corresponding marking user as the job population of the preset area; if not, the corresponding marked user is not the occupancy population of the preset area.
It should be noted that, in order to prevent deviation of some user signaling data, after user information in the staff table is deleted, the deleted user is marked and set as a marked user, if the preset first time range is set to 15 days, the staff position of the marked user in the actual statistics of 15 days is recorded, according to whether the staff position of the marked user in the actual statistics of the preset first time range is judged in the boundary of the preset area, the probability that the staff position of the marked user in the actual statistics is in the boundary of the preset area is determined, if the corresponding probability is greater than the preset probability threshold, the staff population of the corresponding marked user as the preset area is indicated, and the preset distance threshold and the preset probability threshold are set by a person skilled in the art.
According to the embodiment of the present invention, if yes, the probability that the marking user actually counts the position of the job to which the marking user belongs is within the boundary of the preset area is obtained, which specifically includes:
when the actual statistics of the position of the mark user in the preset first time range is within the boundary of the preset area, recording that the actual statistics of the position of the mark user is once within the boundary of the preset area;
when the actual statistical position of the mark user in the preset first time range is not within the boundary of the preset area, recording that the actual statistical position of the mark user is not within the boundary of the preset area once;
acquiring the number of times that the marking user actually counts the position of the job to be in the boundary of the preset area and the number of times that the marking user is not in the boundary of the preset area;
the number of times that the marking user actually counts the position of the belonged position within the boundary of the preset area and the number of times that the marking user is not within the boundary of the preset area are accumulated, and the total number of times is obtained;
dividing the number of times that the marking user actually counts the position of the belonged position within the boundary of the preset area by the total number of times to obtain the probability that the marking user actually counts the position of the belonged position within the boundary of the preset area.
It should be noted that, the probability that the marking user actually counts the position of the home position within the boundary of the preset area is equal to the number of times that the marking user actually counts the position of the home position within the boundary of the preset area divided by the total number of times.
According to an embodiment of the present invention, further comprising:
acquiring an area value in a preset area and information of the number of the living population;
dividing the number of the occupancy population by the area value in the preset area to obtain population density of the corresponding preset area;
judging whether the population density of the preset area is larger than a preset population density threshold value, if so, triggering population prompt information;
and sending the population prompt information to a preset management end for display.
It should be noted that, when the population density in the preset area is greater than the preset population density threshold, the population in the corresponding preset area is too compact, so that the population prompt information is triggered, and the preset proportion threshold is set by a person skilled in the art.
According to an embodiment of the present invention, further comprising:
extracting the residence time values of the users in the preselected user group in the corresponding three base stations;
obtaining a maximum time value according to the time values of the users in the preselected user group in the corresponding three base stations;
judging whether the maximum time value is larger than a preset time threshold value, if so, the users in the corresponding preselected user group are reasonable users; if not, deleting the users in the corresponding preselected user group.
It should be noted that, the time value of the user in the pre-selected user group corresponding to the three base stations is the sum of the residence times of the user in the pre-selected user group corresponding to the three base stations, for example, the preset time threshold is 8 hours, when the sum of the residence times of the user in the pre-selected user group corresponding to the three base stations exceeds 8 hours, the user in the pre-selected user group is a reasonable user, otherwise, the user in the pre-selected user group is deleted.
The invention discloses a small-area occupancy demographics method and a system based on signaling data, wherein the method comprises the steps of acquiring occupancy list information; based on a preset base station, extracting users belonging to the preset base station in residence or workplace in a staff table as a preselected user group; acquiring three base station information with highest residence time of users in a preselected user group in a preset time period, and extracting longitude and latitude information of corresponding three base stations; obtaining the actual statistics of the position of the user according to the longitude and latitude of the corresponding three base stations and the weight values of the corresponding base stations; judging whether the user actually counts the position of the job to which the user belongs to be within the boundary of the preset area, if so, the corresponding user is the job population of the preset area; if not, deleting the corresponding user information. The invention simulates the longitude and latitude coordinate point of the actual position of the user according to the jump condition of the base station during the static period of the user by bypassing the existing base station position to replace the actual position of the user, thereby flexibly judging the small area of the user, such as buildings, cells and the like, and being convenient for the modification of the statistical caliber and the like.
In the several embodiments provided in this application, it should be understood that the disclosed apparatus and method may be implemented in other ways. The above described device embodiments are only illustrative, e.g. the division of the units is only one logical function division, and there may be other divisions in practice, such as: multiple units or components may be combined or may be integrated into another system, or some features may be omitted, or not performed. In addition, the various components shown or discussed may be coupled or directly coupled or communicatively coupled to each other via some interface, whether indirectly coupled or communicatively coupled to devices or units, whether electrically, mechanically, or otherwise.
The units described above as separate components may or may not be physically separate, and components shown as units may or may not be physical units; can be located in one place or distributed to a plurality of network units; some or all of the units may be selected according to actual needs to achieve the purpose of the solution of this embodiment.
In addition, each functional unit in each embodiment of the present invention may be integrated in one processing unit, or each unit may be separately used as one unit, or two or more units may be integrated in one unit; the integrated units may be implemented in hardware or in hardware plus software functional units.
Those of ordinary skill in the art will appreciate that: all or part of the steps for implementing the above method embodiments may be implemented by hardware related to program instructions, and the foregoing program may be stored in a computer readable storage medium, where the program, when executed, performs steps including the above method embodiments; and the aforementioned storage medium includes: a mobile storage device, a Read-Only Memory (ROM), a random access Memory (RAM, random Access Memory), a magnetic disk or an optical disk, or the like, which can store program codes.
Alternatively, the above-described integrated units of the present invention may be stored in a computer-readable storage medium if implemented in the form of software functional modules and sold or used as separate products. Based on such understanding, the technical solutions of the embodiments of the present invention may be embodied in essence or a part contributing to the prior art in the form of a software product stored in a storage medium, including several instructions for causing a computer device (which may be a personal computer, a server, or a network device, etc.) to execute all or part of the methods described in the embodiments of the present invention. And the aforementioned storage medium includes: a removable storage device, ROM, RAM, magnetic or optical disk, or other medium capable of storing program code.
Claims (7)
1. A method of small area occupancy demographics based on signaling data, comprising:
acquiring staff information;
based on a preset base station, extracting users belonging to the preset base station in residence or workplace in a staff table as a preselected user group;
acquiring three base station information with highest residence time of users in a preselected user group in a preset time period, and extracting longitude and latitude information of corresponding three base stations;
obtaining the actual statistics of the position of the user according to the longitude and latitude of the corresponding three base stations and the weight values of the corresponding base stations;
judging whether the user actually counts the position of the job to which the user belongs to be within the boundary of the preset area, if so, the corresponding user is the job population of the preset area; if not, deleting the corresponding user information;
the formula for actually counting the position of the corresponding job by the user is obtained according to the longitude and latitude of the corresponding three base stations and the weight values of the corresponding base stations, and specifically comprises the following steps:
setting the position of the user to which the actual statistics belongsThe formula is;
Wherein the method comprises the steps ofRespectively representing the longitude and latitude of the corresponding three base stations,respectively represent weight values corresponding to three base stations, and +.>;
The step of obtaining the weight value of the corresponding base station specifically comprises the following steps:
according to the information of three base stations with highest residence time of users in the preselected user group in a preset time period, obtaining residence time values of the users in the corresponding three base stations, and setting the residence time values as respectivelyThe method comprises the steps of carrying out a first treatment on the surface of the Then->,/>,;
If not, deleting the corresponding user information, which specifically includes:
extracting the deleted user, and setting the user as a marked user;
acquiring the longitude and latitude of the position of the mark user to which the mark user actually belongs;
according to the boundary of the preset area and the longitude and latitude of the position of the job to which the marking user actually belongs, obtaining the shortest distance value from the marking user to the boundary of the preset area;
judging whether the shortest distance value from the marking user to the boundary of the preset area is smaller than a preset distance threshold value, if so, acquiring the position of the job to which the actual statistics of the marking user in a preset first time range belongs;
judging whether the actual statistics of the position of the mark user in the preset first time range is within the boundary of the preset area, if so, obtaining the probability that the actual statistics of the position of the mark user is within the boundary of the preset area;
judging whether the probability that the marking user actually counts the position of the corresponding job to be in the boundary of the preset area is larger than a preset probability threshold value, if so, setting the corresponding marking user as the job population of the preset area; if not, the corresponding marked user is not the occupancy population of the preset area.
2. A method of small area occupancy demographics based on signaling data as claimed in claim 1, wherein said step of obtaining occupancy table information comprises:
acquiring mobile phone signaling data information of a user;
obtaining behavior track information of a corresponding user according to the mobile phone signaling data information of the user;
obtaining a base station with the longest residence time of the corresponding user according to the obtained behavior track information of the corresponding user;
and setting the base station with the longest residence time of the corresponding user as the home base station, and sending the home base station to a preset table for storage to obtain the job-holding table.
3. The method for small-area occupancy demographics based on signaling data according to claim 2, wherein the step of obtaining the behavior trace information of the corresponding user according to the signaling data information of the mobile phone of the user specifically comprises:
obtaining a base station which the corresponding user passes through and time point information of the base station according to the mobile phone signaling data information of the user;
and connecting the positions of the users according to the time sequence of the users passing through the base stations by taking the base stations corresponding to the users as the positions of the users, so as to obtain the behavior track information of the users.
4. The method of claim 1, wherein if so, obtaining a probability that the marking user actually counts the occupancy location within the boundary of the preset area comprises:
when the actual statistics of the position of the mark user in the preset first time range is within the boundary of the preset area, recording that the actual statistics of the position of the mark user is once within the boundary of the preset area;
when the actual statistical position of the mark user in the preset first time range is not within the boundary of the preset area, recording that the actual statistical position of the mark user is not within the boundary of the preset area once;
acquiring the number of times that the marking user actually counts the position of the job to be in the boundary of the preset area and the number of times that the marking user is not in the boundary of the preset area;
the number of times that the marking user actually counts the position of the belonged position within the boundary of the preset area and the number of times that the marking user is not within the boundary of the preset area are accumulated, and the total number of times is obtained;
dividing the number of times that the marking user actually counts the position of the belonged position within the boundary of the preset area by the total number of times to obtain the probability that the marking user actually counts the position of the belonged position within the boundary of the preset area.
5. A signaling data based small area occupancy demographics system comprising a memory and a processor, wherein the memory stores a signaling data based small area occupancy demographics method program, which when executed by the processor performs the steps of:
acquiring staff information;
based on a preset base station, extracting users belonging to the preset base station in residence or workplace in a staff table as a preselected user group;
acquiring three base station information with highest residence time of users in a preselected user group in a preset time period, and extracting longitude and latitude information of corresponding three base stations;
obtaining the actual statistics of the position of the user according to the longitude and latitude of the corresponding three base stations and the weight values of the corresponding base stations;
judging whether the user actually counts the position of the job to which the user belongs to be within the boundary of the preset area, if so, the corresponding user is the job population of the preset area; if not, deleting the corresponding user information;
the formula for actually counting the position of the corresponding job by the user is obtained according to the longitude and latitude of the corresponding three base stations and the weight values of the corresponding base stations, and specifically comprises the following steps:
setting the position of the user to which the actual statistics belongsThe formula is;
Wherein the method comprises the steps ofRespectively representing the longitude and latitude of the corresponding three base stations,respectively represent weight values corresponding to three base stations, and +.>;
The step of obtaining the weight value of the corresponding base station specifically comprises the following steps:
according to the information of three base stations with highest residence time of users in the preselected user group in a preset time period, obtaining residence time values of the users in the corresponding three base stations, and setting the residence time values as respectivelyThe method comprises the steps of carrying out a first treatment on the surface of the Then->,/>,;
If not, deleting the corresponding user information, which specifically includes:
extracting the deleted user, and setting the user as a marked user;
acquiring the longitude and latitude of the position of the mark user to which the mark user actually belongs;
according to the boundary of the preset area and the longitude and latitude of the position of the job to which the marking user actually belongs, obtaining the shortest distance value from the marking user to the boundary of the preset area;
judging whether the shortest distance value from the marking user to the boundary of the preset area is smaller than a preset distance threshold value, if so, acquiring the position of the job to which the actual statistics of the marking user in a preset first time range belongs;
judging whether the actual statistics of the position of the mark user in the preset first time range is within the boundary of the preset area, if so, obtaining the probability that the actual statistics of the position of the mark user is within the boundary of the preset area;
judging whether the probability that the marking user actually counts the position of the corresponding job to be in the boundary of the preset area is larger than a preset probability threshold value, if so, setting the corresponding marking user as the job population of the preset area; if not, the corresponding marked user is not the occupancy population of the preset area.
6. A small area occupancy demographic system based on signaling data in accordance with claim 5 wherein said step of obtaining occupancy table information comprises:
acquiring mobile phone signaling data information of a user;
obtaining behavior track information of a corresponding user according to the mobile phone signaling data information of the user;
obtaining a base station with the longest residence time of the corresponding user according to the obtained behavior track information of the corresponding user;
and setting the base station with the longest residence time of the corresponding user as the home base station, and sending the home base station to a preset table for storage to obtain the job-holding table.
7. The small area occupancy demographics system based on signaling data of claim 6, wherein the step of obtaining behavior trace information of the corresponding user according to the signaling data information of the mobile phone of the user specifically comprises:
obtaining a base station which the corresponding user passes through and time point information of the base station according to the mobile phone signaling data information of the user;
and connecting the positions of the users according to the time sequence of the users passing through the base stations by taking the base stations corresponding to the users as the positions of the users, so as to obtain the behavior track information of the users.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202311452812.5A CN117177177B (en) | 2023-11-03 | 2023-11-03 | Method and system for demographics of small-area occupancy based on signaling data |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202311452812.5A CN117177177B (en) | 2023-11-03 | 2023-11-03 | Method and system for demographics of small-area occupancy based on signaling data |
Publications (2)
Publication Number | Publication Date |
---|---|
CN117177177A CN117177177A (en) | 2023-12-05 |
CN117177177B true CN117177177B (en) | 2024-02-27 |
Family
ID=88943587
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202311452812.5A Active CN117177177B (en) | 2023-11-03 | 2023-11-03 | Method and system for demographics of small-area occupancy based on signaling data |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN117177177B (en) |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2011021606A1 (en) * | 2009-08-17 | 2011-02-24 | 株式会社エヌ・ティ・ティ・ドコモ | Population fluidity information generation system and population fluidity information generation method |
KR20140056461A (en) * | 2012-10-25 | 2014-05-12 | 에스케이텔레콤 주식회사 | Supporting method for forecasting population density and apparatus supporting the same |
CN111615054A (en) * | 2020-05-25 | 2020-09-01 | 和智信(山东)大数据科技有限公司 | Population analysis method and device |
CN115665677A (en) * | 2022-10-14 | 2023-01-31 | 深圳市规划国土发展研究中心 | Method and system for acquiring regional population based on mobile phone signaling data |
CN115866547A (en) * | 2023-03-01 | 2023-03-28 | 北京融信数联科技有限公司 | Fixed area tourist counting method, system and storage medium based on signaling data |
-
2023
- 2023-11-03 CN CN202311452812.5A patent/CN117177177B/en active Active
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2011021606A1 (en) * | 2009-08-17 | 2011-02-24 | 株式会社エヌ・ティ・ティ・ドコモ | Population fluidity information generation system and population fluidity information generation method |
KR20140056461A (en) * | 2012-10-25 | 2014-05-12 | 에스케이텔레콤 주식회사 | Supporting method for forecasting population density and apparatus supporting the same |
CN111615054A (en) * | 2020-05-25 | 2020-09-01 | 和智信(山东)大数据科技有限公司 | Population analysis method and device |
CN115665677A (en) * | 2022-10-14 | 2023-01-31 | 深圳市规划国土发展研究中心 | Method and system for acquiring regional population based on mobile phone signaling data |
CN115866547A (en) * | 2023-03-01 | 2023-03-28 | 北京融信数联科技有限公司 | Fixed area tourist counting method, system and storage medium based on signaling data |
Also Published As
Publication number | Publication date |
---|---|
CN117177177A (en) | 2023-12-05 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN110401779B (en) | Method and device for identifying telephone number and computer readable storage medium | |
CN105045911B (en) | Label generating method and equipment for user to mark | |
CN111148018B (en) | Method and device for identifying and positioning regional value based on communication data | |
CN111626754B (en) | Card-keeping user identification method and device | |
CN104702804A (en) | Method and device for marking number | |
CN108764951A (en) | User's similarity preparation method and device, equipment, storage medium | |
CN114024737B (en) | Method, apparatus and computer readable storage medium for determining live room volume | |
CN108985048A (en) | Simulator recognition methods and relevant apparatus | |
CN117177177B (en) | Method and system for demographics of small-area occupancy based on signaling data | |
CN108600961A (en) | Preparation method and device, equipment, the storage medium of user's similarity | |
CN108924840B (en) | Blacklist management method and device and terminal | |
CN111669710B (en) | Demographic deduplication method | |
CN111368858A (en) | User satisfaction evaluation method and device | |
CN116156416A (en) | Method and device for extracting liveplace based on signaling data | |
CN109413459B (en) | User recommendation method and related equipment in live broadcast platform | |
CN115967906A (en) | User resident position identification method, terminal, electronic device and storage medium | |
CN106708418A (en) | Message storage method and device in game environment | |
CN110866241A (en) | Evaluation model generation and equipment association method, device and storage medium | |
CN116957520B (en) | Big data-based loss of business rate monitoring method, system and storage medium | |
CN117216391B (en) | News recommendation method, system and equipment | |
CN116561508B (en) | Outlier detection method, system and medium for population data based on big data | |
CN115150749B (en) | High-risk roaming user positioning method, equipment, device and storage medium | |
CN117473428A (en) | Service promotion method, device, equipment and storage medium | |
CN111143333B (en) | Labeling data processing method, device, equipment and computer readable storage medium | |
CN107770129A (en) | Method and apparatus for detecting user behavior |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |