CN115409434B - Regional demographic method, system and storage medium based on signaling big data - Google Patents

Regional demographic method, system and storage medium based on signaling big data Download PDF

Info

Publication number
CN115409434B
CN115409434B CN202211359377.7A CN202211359377A CN115409434B CN 115409434 B CN115409434 B CN 115409434B CN 202211359377 A CN202211359377 A CN 202211359377A CN 115409434 B CN115409434 B CN 115409434B
Authority
CN
China
Prior art keywords
data
sector
signaling
base station
identification
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202211359377.7A
Other languages
Chinese (zh)
Other versions
CN115409434A (en
Inventor
成立立
于笑博
张广志
李铭哲
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beiling Rongxin Datalnfo Science and Technology Ltd
Original Assignee
Beiling Rongxin Datalnfo Science and Technology Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beiling Rongxin Datalnfo Science and Technology Ltd filed Critical Beiling Rongxin Datalnfo Science and Technology Ltd
Priority to CN202211359377.7A priority Critical patent/CN115409434B/en
Publication of CN115409434A publication Critical patent/CN115409434A/en
Application granted granted Critical
Publication of CN115409434B publication Critical patent/CN115409434B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/06Resources, workflows, human or project management; Enterprise or organisation planning; Enterprise or organisation modelling
    • G06Q10/063Operations research, analysis or management
    • G06Q10/0639Performance analysis of employees; Performance analysis of enterprise or organisation operations
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/21Design, administration or maintenance of databases
    • G06F16/215Improving data quality; Data cleansing, e.g. de-duplication, removing invalid entries or correcting typographical errors
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/242Query formulation
    • G06F16/2433Query languages
    • G06F16/244Grouping and aggregation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2455Query execution
    • G06F16/24553Query execution of query operations
    • G06F16/24554Unary operations; Data partitioning operations
    • G06F16/24556Aggregation; Duplicate elimination
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2457Query processing with adaptation to user needs
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2458Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
    • G06F16/2462Approximate or statistical queries
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/10Services
    • G06Q50/26Government or public services
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D30/00Reducing energy consumption in communication networks
    • Y02D30/70Reducing energy consumption in communication networks in wireless communication networks

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Business, Economics & Management (AREA)
  • General Physics & Mathematics (AREA)
  • Databases & Information Systems (AREA)
  • Data Mining & Analysis (AREA)
  • General Engineering & Computer Science (AREA)
  • Human Resources & Organizations (AREA)
  • Computational Linguistics (AREA)
  • Development Economics (AREA)
  • Strategic Management (AREA)
  • Economics (AREA)
  • Tourism & Hospitality (AREA)
  • Educational Administration (AREA)
  • General Business, Economics & Management (AREA)
  • Mathematical Physics (AREA)
  • Probability & Statistics with Applications (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Marketing (AREA)
  • Quality & Reliability (AREA)
  • Fuzzy Systems (AREA)
  • General Health & Medical Sciences (AREA)
  • Software Systems (AREA)
  • Health & Medical Sciences (AREA)
  • Primary Health Care (AREA)
  • Game Theory and Decision Science (AREA)
  • Operations Research (AREA)
  • Mobile Radio Communication Systems (AREA)

Abstract

The embodiment of the application provides a regional demographic method, a regional demographic system and a storage medium based on signaling big data. The method comprises the following steps: acquiring a signaling data information set of terminal users in an area within a preset time period, extracting and cleaning to obtain a base station sector dynamic track portrait, extracting base station sector coverage identification data, combining with signaling response time data, processing to obtain sector intersection density data, performing model processing according to the sector intersection density data of a second identification user, and aggregating to obtain a grid density distribution diagram in the area to convert to obtain population density distribution conditions; therefore, the sector intersection density data obtained by cleaning the signaling data of the terminal user are screened out based on the signaling big data, the grid density distribution data obtained by model processing are subjected to aggregation conversion to obtain population density distribution data, the user sector grid track parking density recognition is carried out according to the signaling data to obtain population density distribution conditions, and accurate statistics on the population resident density distribution in the region is realized.

Description

Region demographic method, system and storage medium based on signaling big data
Technical Field
The present application relates to the field of big data and signaling data technology, and in particular, to a method, system and storage medium for regional demographic based on signaling big data.
Background
The collection of signaling data is a continuous process, correspondingly, the storage of the signaling data is also a continuous process, each operator records the signaling data according to the time acquired by the signaling and the place where the service occurs, the data used for identifying the place where the service occurs in the signaling data is a base station sector connected when the signaling occurs, the base station sector has certain changes in the network optimization and service adjustment processes, which causes inconsistency of corresponding position information of sectors in different time periods, in addition, the signaling data is used for carrying out demographic statistics or related analysis, and the processing needs to be carried out according to certain spatial latitudes, such as communities, villages, towns, key areas and the like.
The existing urban area usually adopts information registration to acquire aiming at the identification and statistics of the distribution situation of the floating population, but the method is easy to generate technical loopholes of missing or statistical deviation, the statistical result lacks accuracy and rapidness, the identification and statistics of the user terminal information lacks accuracy and rapidness, the resident information of the floating population is difficult to accurately master, the effective identification and statistics of the distribution situation of the area population cannot be carried out, and the dynamic statistics technology of the distribution situation of the population is lacked.
In view of the above problems, an effective technical solution is urgently needed.
Disclosure of Invention
The embodiment of the application aims to provide a regional population statistical method, a system and a storage medium based on signaling big data, which can perform a technology for acquiring a regional population resident distribution situation according to signaling data information of a terminal user, and improve the accuracy of population distribution situation statistics.
The embodiment of the application also provides a regional demographic method based on the signaling big data, which comprises the following steps:
acquiring a signaling data information set of terminal users in an area within a preset time period;
according to the signaling data information set, data extraction is carried out to obtain signaling response time data and signaling response address data, and data cleaning is carried out to obtain a first identification user;
acquiring base station sector dynamic information corresponding to a signaling response address of the first identification user in the preset time period and generating a base station sector dynamic track portrait of the first identification user;
extracting base station sector coverage identification data according to the base station sector dynamic track portrait and combining corresponding signaling response time data of the identification data to process to obtain sector intersection density data;
comparing the threshold value according to the sector intersection density data and a preset resident intersection density threshold value, screening corresponding identification users meeting the threshold value comparison requirement, and marking the identification users as second identification users;
Inputting the sector intersection density data of the second identification user into a preset sector grid dense distribution model for processing to obtain grid density distribution data;
and aggregating according to the grid density distribution data to obtain an area grid density distribution map, and performing density value conversion according to the area grid density distribution map to obtain an area resident population density distribution condition.
Optionally, in the area demographic method based on signaling big data according to the embodiment of the present application, the acquiring a signaling data information set of an end user in an area within a preset time period, performing data extraction according to the signaling data information set to acquire signaling response time data and signaling response address data, and performing data cleaning to acquire a first identified user includes:
acquiring a signaling data information set of a terminal user in an area within a preset time period, wherein the signaling data information set comprises signaling time information, signaling positioning information, signaling interaction information and signaling service identification information;
extracting data according to the signaling data information set to obtain signaling response time data and signaling response address data of signaling response;
carrying out time point, time point positioning and duration data cleaning according to the signaling response time data and the signaling response address data to screen out signaling data information which does not meet the requirement;
And marking the terminal user corresponding to the cleaned and retained second signaling data information set as a first identification user.
Optionally, in the signaling big data-based regional demographic method according to the embodiment of the present application, the obtaining base station sector dynamic information corresponding to a signaling response address of the first identified user in the preset time period and generating a base station sector dynamic trajectory representation of the first identified user includes:
extracting base station response identification data according to the signaling response address data of the first identification user in the preset time period;
inquiring in a base station identification database according to the base station response identification data to obtain base station sectors corresponding to the identification in the area and extracting dynamic information of the base station sectors;
the base station sector dynamic information comprises base station sector response data, base station sector coverage data and base station dynamic interaction data;
and generating a base station sector dynamic track portrait of the first identification user according to the base station sector responsiveness data, the base station sector coverage data and the base station dynamic interaction data.
Optionally, in the area demographic method based on signaling big data according to the embodiment of the present application, the extracting, according to the base station sector dynamic trajectory representation, base station sector coverage identification data and processing with corresponding signaling response time data of the identification data to obtain sector intersection density data includes:
Extracting base station sector coverage identification data according to the base station sector dynamic track image, wherein the base station sector coverage identification data comprises base station arrangement identification data, sector intersection degree data, sector track identification data and sector residence data;
calculating according to the base station arrangement identification data, the sector intersection degree data, the sector track identification data and the sector residence data and combining with a signaling response time period node in a preset time period to obtain sector intersection density data in the preset time period;
the sector intersection density data calculation formula is as follows:
Figure 349638DEST_PATH_IMAGE001
where P is the sector intersection density data,
Figure 713624DEST_PATH_IMAGE002
sector resident data of the node for the ith signaling response time period,
Figure 136515DEST_PATH_IMAGE003
sector-diversity-level data for the ith signaling response time period node,
Figure 320371DEST_PATH_IMAGE004
identifying data for the sector trace of the ith signaling response time period node,
Figure 752490DEST_PATH_IMAGE005
arranging identification data for a base station, wherein n is the number of signaling response time period nodes in a preset time period, i is the ith signaling response time period node in the n time period nodes,
Figure 938358DEST_PATH_IMAGE006
is the track density coefficient.
Optionally, in the signaling big data-based regional demographic method according to the embodiment of the present application, the performing threshold comparison according to the sector intersection density data and a preset resident intersection density threshold to screen a corresponding identified user meeting a threshold comparison requirement, and marking the identified user as a second identified user includes:
Comparing a threshold value according to the sector intersection density data of the first identification user and a preset resident intersection density threshold value;
and identifying the first identification user which meets the threshold comparison result, and marking the first identification user as a second identification user.
Optionally, in the area demographic method based on the signaling big data according to the embodiment of the present application, the inputting the sector intersection density data of the second identified user into a preset sector grid dense distribution model for processing to obtain grid density distribution data includes:
acquiring a trained sector grid dense model;
and inputting the sector intersection density data of the second identification user in combination with the sector intersection degree data and the sector track identification data into a trained sector grid dense distribution model for processing to obtain grid density distribution data.
Optionally, in the area demographic method based on the large signaling data according to the embodiment of the present application, the aggregating according to the grid density distribution data to obtain an area grid density distribution map, and performing density value conversion according to the area grid density distribution map to obtain an area resident population density distribution condition includes:
aggregating according to the grid density distribution data to obtain grid density distribution data in the area;
Constructing a grid density distribution map according to the grid density distribution data;
performing population density value conversion on the grid density distribution map according to a preset grid density comparison value to obtain regional resident population density layout data;
and taking the area resident population density distribution data as the statistical area resident population density distribution situation in the preset time period.
In a second aspect, embodiments of the present application provide a region demographic system based on signaling big data, the system including: a memory and a processor, wherein the memory includes a program of a region demographic method based on signaling big data, and the program of the region demographic method based on signaling big data realizes the following steps when being executed by the processor:
acquiring a signaling data information set of terminal users in an area within a preset time period;
according to the signaling data information set, data extraction is carried out to obtain signaling response time data and signaling response address data, and data cleaning is carried out to obtain a first identification user;
acquiring base station sector dynamic information corresponding to a signaling response address of the first identification user in the preset time period and generating a base station sector dynamic track portrait of the first identification user;
Extracting base station sector coverage identification data according to the base station sector dynamic track portrait and combining corresponding signaling response time data of the identification data to process to obtain sector intersection density data;
comparing the sector intersection density data with a preset resident intersection density threshold value according to the threshold value, screening corresponding identification users meeting the threshold value comparison requirement, and marking the identification users as second identification users;
inputting the sector intersection density data of the second identification user into a preset sector grid dense distribution model for processing to obtain grid density distribution data;
and aggregating according to the grid density distribution data to obtain an area grid density distribution map, and performing density value conversion according to the area grid density distribution map to obtain an area resident population density distribution condition.
Optionally, in the area demographic system based on signaling big data according to the embodiment of the present application, the obtaining a signaling data information set of an end user in an area within a preset time period, performing data extraction according to the signaling data information set to obtain signaling response time data and signaling response address data, and performing data cleaning to obtain a first identified user includes:
Acquiring a signaling data information set of a terminal user in an area within a preset time period, wherein the signaling data information set comprises signaling time information, signaling positioning information, signaling interaction information and signaling service identification information;
extracting data according to the signaling data information set to obtain signaling response time data and signaling response address data of signaling response;
carrying out time point, time point positioning and duration data cleaning according to the signaling response time data and the signaling response address data to screen out signaling data information which does not meet the requirement;
and marking the terminal user corresponding to the cleaned and retained second signaling data information set as a first identification user.
In a third aspect, the present application further provides a computer-readable storage medium, where the computer-readable storage medium includes a signaling big data based regional demographic method program, and when the signaling big data based regional demographic method program is executed by a processor, the method implements the steps of the signaling big data based regional demographic method as described in any one of the above.
As can be seen from the above, the area demographic method, system and storage medium based on signaling big data provided in the embodiment of the present application obtain a base station sector dynamic trajectory representation by obtaining a signaling data information set of a terminal user in an area within a preset time period and cleaning the signaling data set, extract base station sector coverage identification data and process the signaling response time data to obtain sector intersection density data, perform model processing according to the sector intersection density data of a second identified user and aggregate the data to obtain a grid density distribution map in the area, and obtain a population density distribution situation by conversion; therefore, the sector intersection density data obtained by cleaning the signaling data of the terminal user are screened out based on the signaling big data, the grid density distribution data obtained by model processing are subjected to aggregation conversion to obtain population density distribution data, the user sector grid track parking density recognition is carried out according to the signaling data to obtain population density distribution conditions, and accurate statistics on the population resident density distribution in the region is realized.
Additional features and advantages of the present application will be set forth in the description which follows, and in part will be obvious from the description, or may be learned by the practice of the embodiments of the present application. The objectives and other advantages of the application may be realized and attained by the structure particularly pointed out in the written description and claims hereof as well as the appended drawings.
Drawings
In order to more clearly illustrate the technical solutions of the embodiments of the present application, the drawings that are required to be used in the embodiments of the present application will be briefly described below, it should be understood that the following drawings only illustrate some embodiments of the present application and therefore should not be considered as limiting the scope, and that those skilled in the art can also obtain other related drawings based on the drawings without inventive efforts.
Fig. 1 is a flowchart of a regional demographic method based on signaling big data according to an embodiment of the present application;
fig. 2 is a flowchart of screening out a first identified user according to a regional demographic method based on signaling big data provided in an embodiment of the present application;
FIG. 3 is a flow chart of generating a dynamic trajectory representation of a base station sector according to a signaling big data based regional demographic method provided in an embodiment of the present application;
Fig. 4 is a schematic structural diagram of a regional demographic system based on signaling big data according to an embodiment of the present application.
Detailed Description
The technical solutions in the embodiments of the present application will be clearly and completely described below with reference to the drawings in the embodiments of the present application, and it is obvious that the described embodiments are only a part of the embodiments of the present application, and not all of the embodiments. The components of the embodiments of the present application, generally described and illustrated in the figures herein, can be arranged and designed in a wide variety of different configurations. Thus, the following detailed description of the embodiments of the present application, presented in the accompanying drawings, is not intended to limit the scope of the claimed application, but is merely representative of selected embodiments of the application. All other embodiments, which can be derived by a person skilled in the art from the embodiments of the present application without making any creative effort, shall fall within the protection scope of the present application.
It should be noted that like reference numerals and letters refer to like items in the following figures, and thus, once an item is defined in one figure, it need not be further defined and explained in subsequent figures. Meanwhile, in the description of the present application, the terms "first", "second", and the like are used only for distinguishing the description, and are not to be construed as indicating or implying relative importance.
Referring to fig. 1, fig. 1 is a flow chart of a signaling big data based regional demographic method in some embodiments of the present application. The regional demographic method based on the signaling big data is used in terminal equipment, such as a computer, a mobile phone terminal and the like. The regional demographic method based on the signaling big data comprises the following steps:
s101, acquiring a signaling data information set of terminal users in an area within a preset time period;
s102, extracting data according to the signaling data information set to obtain signaling response time data and signaling response address data, and performing data cleaning to obtain a first identification user;
s103, acquiring base station sector dynamic information corresponding to a signaling response address of the first identification user in the preset time period and generating a base station sector dynamic track portrait of the first identification user;
s104, extracting base station sector coverage identification data according to the base station sector dynamic track portrait and combining corresponding signaling response time data processing of the identification data to obtain sector intersection density data;
s105, comparing the threshold value according to the sector intersection density data and a preset resident intersection density threshold value, screening corresponding identification users meeting the threshold value comparison requirement, and marking the identification users as second identification users;
S106, inputting the sector intersection density data of the second identification user into a preset sector grid dense distribution model for processing to obtain grid density distribution data;
and S107, aggregating according to the grid density distribution data to obtain an area grid density distribution map, and performing density value conversion according to the area grid density distribution map to obtain an area resident population density distribution condition.
It should be noted that, in order to obtain the resident distribution condition and distribution density of the terminal users in a certain preset time period in the area, the signaling data information set of the terminal users in the area in the preset time period is obtained to extract the data to obtain the signaling response time data and the signaling response address data, and the data is cleaned to obtain the first identification user, the signaling response time in the area which is not satisfied and the signaling data corresponding to the signaling response address distribution are screened out to obtain the first identification user, the base station sector dynamic track portrait generated according to the base station sector dynamic information corresponding to the signaling response address in the preset time period describes the marked base station sector of the user signaling response path to obtain the track information of the base station sector on the path, and the base station sector coverage identification data is extracted to be combined with the corresponding signaling response time data to obtain the sector intersection density data, the sector intersection density data reflects the residence time of a user in a sector overlapping area in an area, the number of nodes of the residence time and the residence frequency of each residence point in a time period, then the threshold comparison is carried out according to the sector intersection density data to screen a mark which meets the threshold comparison requirement as a second mark user, namely the residence time and the frequency of the user in the sector intersection area are insufficient, the user which frequently moves or does not move in a certain time period such as express delivery, area service, work access, temporary stop and the like is eliminated to obtain the resident user of the area, then the sector intersection density data of the second mark user is input into a preset sector grid dense distribution model to be processed to obtain grid density distribution data, namely the sector intersection information is rasterized, namely, the grid with the adaptive size is obtained according to the effective area of the sector or the intersection area, the intelligent screening statistical technology is convenient for performing data mapping and visual description on the sector parking condition through the grid density, then aggregating the grid density distribution data of the second identification users in the area to obtain an area grid density distribution map, wherein the distribution map is a data grid density distribution map and reflects the parking residence condition of the second identification users in the area, and then performing density value conversion according to the area grid density distribution map to obtain an area resident population density distribution condition, namely, the resident parking distribution condition of the second identification users in the area is presented through data conversion of preset density values, if a certain subarea A has more resident parking users and has high frequency, the density of the subarea A is displayed as large as deep as chromaticity, and the corresponding density value is large, and if a subarea B has less resident parking users, the density of the subarea A is displayed as small as shallow as chromaticity and the corresponding density value is small, namely, the intelligent screening statistical technology for obtaining the grid density distribution data aggregation and conversion of the area resident population density distribution condition through rasterization processing of the base station sector intersection density data of the signaling users is realized.
Referring to fig. 2, fig. 2 is a flowchart illustrating a method for filtering out first identified users based on regional demographics of signaling big data according to some embodiments of the present disclosure. According to the embodiment of the invention, the obtaining of the signaling data information set of the terminal user in the area within the preset time period, the data extraction according to the signaling data information set to obtain the signaling response time data and the signaling response address data, and the data cleaning to obtain the first identification user specifically comprises:
s201, acquiring a signaling data information set of a terminal user in an area within a preset time period, wherein the signaling data information set comprises signaling time information, signaling positioning information, signaling interaction information and signaling service identification information;
s202, extracting data according to the signaling data information set to obtain signaling response time data and signaling response address data of signaling response;
s203, performing time point, time point positioning and duration data cleaning according to the signaling response time data and the signaling response address data to screen out signaling data information which does not meet the requirement;
and S204, marking the terminal user corresponding to the cleaned and retained second signaling data information set as a first identification user.
It should be noted that, data extraction is performed according to a signaling data information set of an end user in an area within a preset time period to obtain signaling response time data and signaling response address data, that is, address location corresponding to signaling response time, and signaling interaction information and signaling service identifiers such as signaling information of a communication provider, a registration place, and the like are extracted to obtain signaling response time and data corresponding to a response address, data cleaning and screening are performed on the data according to signaling response time nodes, response time point locations, and signaling time lengths, and signaling data information that does not meet preset requirements are screened out, if the signaling response time does not meet the requirements, the number of times of the signaling response time nodes is insufficient, and signaling response location distribution, location points, associated location addresses, and other signaling data information that do not meet the requirements are cleaned, a user corresponding to a remaining signaling data information set is marked as a first identified user, so as to perform preliminary screening on users who are effectively resident in the area according to preset conditions of the signaling data information.
Referring to fig. 3, fig. 3 is a flow chart of generating a dynamic trajectory representation of a base station sector based on big data signaling processing according to some embodiments of the present application. According to the embodiment of the present invention, the obtaining of the base station sector dynamic information corresponding to the signaling response address of the first identified user in the preset time period and generating the base station sector dynamic trajectory sketch of the first identified user specifically includes:
s301, extracting base station response identification data according to the signaling response address data of the first identification user in the preset time period;
s302, inquiring in a base station identification database according to the base station response identification data to obtain base station sectors corresponding to identification in an area and extracting dynamic information of the base station sectors;
s303, the dynamic information of the base station sector comprises base station sector response data, base station sector coverage data and base station dynamic interaction data;
and S304, generating a base station sector dynamic track portrait of the first identification user according to the base station sector responsiveness data, the base station sector coverage data and the base station dynamic interaction data.
It should be noted that, in order to obtain the dynamic trajectory situation of the first identified user within the preset time period, the signaling response address data of the first identified user within the preset time period is used to extract the base station response identification data, that is, the identification data of the signaling corresponding to the response base station, the base station sector corresponding to the identification in the area is obtained by querying in the base station identification database according to the base station response identification data, and the base station sector dynamic information is extracted, that is, the base station sector is located in the base station identification database through the location identification of the base station and the dynamic interaction data between the signaling and the identified base station sector is obtained, the base station sector dynamic information includes base station sector response data, base station sector coverage data, and base station dynamic interaction data, each data reflects the interaction situation of the signaling response base station sector, the interaction frequency, the response intensity and the interaction duration of each base station sector or the intersection coverage area of the base station sector, and the parameter values corresponding to the interaction duration, the frequency, the coverage area, the repeatability, the base station sector dynamic trajectory of the base station which can reflect the interaction situation and the activity situation of the signaling corresponding to the first identified user signaling in the base station sector, and the dynamic trajectory of the signaling between the sectors and the user in the switching situation of the signaling trajectory.
According to the embodiment of the present invention, the extracting of the base station sector coverage identification data according to the base station sector dynamic trajectory sketch and the processing of the corresponding signaling response time data of the identification data to obtain the sector intersection density data specifically include:
extracting base station sector coverage identification data according to the base station sector dynamic track portrait, wherein the base station sector coverage identification data comprises base station arrangement identification data, sector intersection degree data, sector track identification data and sector residence data;
calculating according to the base station arrangement identification data, the sector intersection degree data, the sector track identification data and the sector residence data in combination with a signaling response time period node in a preset time period to obtain sector intersection density data in the preset time period;
the sector intersection density data calculation formula is as follows:
Figure 797730DEST_PATH_IMAGE001
where P is the sector intersection density data,
Figure 203303DEST_PATH_IMAGE007
sector resident data of the node for the ith signaling response time period,
Figure 439113DEST_PATH_IMAGE008
sector-diversity-level data for the ith signaling response time period node,
Figure 13576DEST_PATH_IMAGE009
identifying data for the sector trace of the ith signaling response time period node,
Figure 778269DEST_PATH_IMAGE010
arranging identification data for a base station, wherein n is a signaling response in a preset time periodAccording to the number of the time period nodes, i is the ith signaling response time period node in the n time period nodes,
Figure 671139DEST_PATH_IMAGE011
Is the track density coefficient (
Figure 445060DEST_PATH_IMAGE012
Obtained by querying the base station identification database according to the sector track identification data).
It should be noted that, the base station sector coverage identification data extracted according to the base station sector dynamic trajectory representation includes base station arrangement identification data, sector intersection degree data, sector trajectory identification data and sector residence data, the data reflects the base station path identification arrangement, the intersection area between sectors, the sector trajectory identification, the response residence time length and residence frequency of the sector and sector intersection area, and then the sector intersection density data in the preset time period is obtained by calculating the number in combination with the signaling response time period node in the preset time period, that is, the sector intersection density data of the user is obtained by performing the programmed calculation of the time node data accumulation on the base station sector intersection area distribution, the sector trajectory path, the sector response time length and the sector residence frequency corresponding to each signaling response time period node in the preset time period, and the density data reflects the activity intensity of the user in the sector or sector intersection area and the distribution density of the signaling response sector or sector intersection area in the preset time period, and the residence condition of each point of the user in the area and the residence point can be measured by the sector density.
According to the embodiment of the present invention, the step of performing threshold comparison according to the sector intersection density data and the preset resident intersection density threshold to screen the corresponding identified user meeting the threshold comparison requirement, and marking the identified user as a second identified user specifically includes:
comparing a threshold value according to the sector intersection density data of the first identification user and a preset resident intersection density threshold value;
and identifying the first identification user which meets the threshold comparison result, and marking the first identification user as a second identification user.
It should be noted that, a threshold value comparison is performed according to the obtained sector intersection density data and a preset resident intersection density threshold value, users who do not meet the activity intensity of the sector or sector intersection area and the distribution density of the sector or sector intersection area of the signaling response within a preset time period are removed, and users who do not meet the requirements of parking and route within the area are removed, wherein the resident intersection density threshold value can be set according to the area attribute information, for example, the threshold value is set to 85%.
According to the embodiment of the present invention, the sector intersection density data of the second identified user is input into a preset sector grid dense distribution model for processing to obtain grid density distribution data, specifically:
Acquiring a trained sector grid dense model;
and inputting the sector intersection density data of the second identification user in combination with the sector intersection degree data and the sector track identification data into a trained sector grid dense distribution model for processing to obtain grid density distribution data.
It should be noted that, the sector intersection density data of the second identification user is input to a trained sector grid density distribution model in combination with the sector intersection density data and the sector track identification data to obtain grid density distribution data, that is, model rasterization is performed according to the activity intensity of the sector or the sector intersection region in a preset time period and the distribution density of the sector or the sector intersection region responded by the signaling to obtain a density grid which is adaptive and reflects the activity intensity and the activity distribution density, rasterization is performed according to the activity intensity and the density condition of the signaling in the sector to obtain an adaptive density grid and grid density distribution data, that is, the grid density distribution is performed on the activity intensity and the activity distribution condition of the sector and the sector intersection region of the base station, wherein the sector grid density distribution model is obtained by performing training processing on a large number of historical data samples of sector intersection density data, sector track identification data and grid density distribution data, the more accurate the processing result obtained by the model is obtained, the sector grid density distribution model is output to a preset grid density distribution model according to the historical data of the sector intersection density data, the sector intersection density data and the sector track identification data are input to the preset threshold value, and the learning threshold value is obtained by the learning threshold value is output to the sector grid density learning threshold value.
According to the embodiment of the present invention, the aggregating is performed according to the grid density distribution data to obtain the grid density distribution map in the area, and the density value conversion is performed according to the grid density distribution map in the area to obtain the density distribution situation of the resident population in the area, specifically:
aggregating according to the grid density distribution data to obtain grid density distribution data in the region;
constructing a grid density distribution map according to the grid density distribution data;
performing population density value conversion on the grid density distribution map according to a preset grid density comparison value to obtain regional resident population density layout data;
and taking the area resident population density distribution data as the statistical area resident population density distribution situation in the preset time period.
The method includes the steps of obtaining grid density distribution data of second identification users in an area by aggregating the obtained grid density distribution data of the second identification users in the area, enabling activity intensity and activity density of the second identification users at each point and each sub-area in the area to be reflected in a centralized mode, constructing an area grid density distribution map according to the data, enabling the distribution map to be a digitalized grid density distribution map, reflecting parking, activity intensity and activity density of the second identification users in the area, conducting density value conversion according to the area grid density distribution map to obtain an area resident population density distribution situation, namely displaying the activity distribution and the parking distribution situation of the second identification users in the area through data conversion of preset density values, and reflecting the area resident population density distribution situation.
As shown in fig. 4, the present invention also discloses a region demographic system based on signaling big data, which includes a memory 41 and a processor 42, wherein the memory includes a region demographic method program based on signaling big data, and when the processor executes the region demographic method program based on signaling big data, the following steps are implemented:
acquiring a signaling data information set of a terminal user in an area within a preset time period;
according to the signaling data information set, data extraction is carried out to obtain signaling response time data and signaling response address data, and data cleaning is carried out to obtain a first identification user;
acquiring base station sector dynamic information corresponding to a signaling response address of the first identification user in the preset time period and generating a base station sector dynamic track portrait of the first identification user;
extracting base station sector coverage identification data according to the base station sector dynamic track portrait and combining corresponding signaling response time data of the identification data to process to obtain sector intersection density data;
comparing the sector intersection density data with a preset resident intersection density threshold value according to the threshold value, screening corresponding identification users meeting the threshold value comparison requirement, and marking the identification users as second identification users;
Inputting the sector intersection density data of the second identification user into a preset sector grid dense distribution model for processing to obtain grid density distribution data;
and aggregating according to the grid density distribution data to obtain an area grid density distribution map, and performing density value conversion according to the area grid density distribution map to obtain an area resident population density distribution condition.
It should be noted that, in order to obtain the resident distribution condition and distribution density of the terminal users in a certain preset time period in the area, the signaling data information set of the terminal users in the area in the preset time period is obtained to extract the data to obtain the signaling response time data and the signaling response address data, and the data is cleaned to obtain the first identification user, the signaling response time in the area which is not satisfied and the signaling data corresponding to the signaling response address distribution are screened out to obtain the first identification user, the base station sector dynamic track portrait generated according to the base station sector dynamic information corresponding to the signaling response address in the preset time period describes the marked base station sector of the user signaling response path to obtain the track information of the base station sector on the path, and the base station sector coverage identification data is extracted to be combined with the corresponding signaling response time data to obtain the sector intersection density data, the sector intersection density data reflects the residence time of a user in a sector overlapping area in an area, the number of nodes of the residence time and the residence frequency of each residence point in a time period, then the threshold comparison is carried out according to the sector intersection density data to screen a mark which meets the threshold comparison requirement as a second mark user, namely the residence time and the frequency of the user in the sector intersection area are insufficient, the user which frequently moves or does not move in a certain time period such as express delivery, area service, work access, temporary stop and the like is eliminated to obtain the resident user of the area, then the sector intersection density data of the second mark user is input into a preset sector grid dense distribution model to be processed to obtain grid density distribution data, namely the sector intersection information is rasterized, namely, the grid with the adaptive size is obtained according to the effective area of the sector or the intersection area, the intelligent screening statistical technology is convenient for performing data mapping and visual description on the sector parking condition through the grid density, then aggregating the grid density distribution data of the second identification users in the area to obtain an area grid density distribution map, wherein the distribution map is a data grid density distribution map and reflects the parking residence condition of the second identification users in the area, and then performing density value conversion according to the area grid density distribution map to obtain an area resident population density distribution condition, namely, the resident parking distribution condition of the second identification users in the area is presented through data conversion of preset density values, if a certain subarea A has more resident parking users and has high frequency, the density of the subarea A is displayed as large as deep as chromaticity, and the corresponding density value is large, and if a subarea B has less resident parking users, the density of the subarea A is displayed as small as shallow as chromaticity and the corresponding density value is small, namely, the intelligent screening statistical technology for obtaining the grid density distribution data aggregation and conversion of the area resident population density distribution condition through rasterization processing of the base station sector intersection density data of the signaling users is realized.
According to the embodiment of the invention, the obtaining of the signaling data information set of the terminal user in the area within the preset time period, the data extraction according to the signaling data information set to obtain the signaling response time data and the signaling response address data, and the data cleaning to obtain the first identification user specifically comprises:
acquiring a signaling data information set of a terminal user in an area within a preset time period, wherein the signaling data information set comprises signaling time information, signaling positioning information, signaling interaction information and signaling service identification information;
performing data extraction according to the signaling data information set to obtain signaling response time data and signaling response address data of signaling response;
carrying out time point, time point positioning and duration data cleaning according to the signaling response time data and the signaling response address data to screen out signaling data information which does not meet the requirement;
and marking the terminal user corresponding to the cleaned and retained second signaling data information set as a first identification user.
It should be noted that, data extraction is performed according to a signaling data information set of an end user in an area within a preset time period to obtain signaling response time data and signaling response address data, that is, address location corresponding to signaling response time, and signaling interaction information and signaling service identifiers such as signaling information of a communication provider, a registration place, and the like are extracted to obtain signaling response time and data corresponding to a response address, data cleaning and screening are performed on the data according to signaling response time nodes, response time point locations, and signaling time lengths, and signaling data information that does not meet preset requirements are screened out, if the signaling response time does not meet the requirements, the number of times of the signaling response time nodes is insufficient, and signaling response location distribution, location points, associated location addresses, and other signaling data information that do not meet the requirements are cleaned, a user corresponding to a remaining signaling data information set is marked as a first identified user, so as to perform preliminary screening on users who are effectively resident in the area according to preset conditions of the signaling data information.
According to the embodiment of the present invention, the obtaining of the base station sector dynamic information corresponding to the signaling response address of the first identified user in the preset time period and generating the base station sector dynamic track portrait of the first identified user specifically include:
extracting base station response identification data according to the signaling response address data of the first identification user in the preset time period;
inquiring in a base station identification database according to the base station response identification data to obtain base station sectors corresponding to the identification in the area and extracting dynamic information of the base station sectors;
the base station sector dynamic information comprises base station sector response data, base station sector coverage data and base station dynamic interaction data;
and generating a base station sector dynamic track portrait of the first identification user according to the base station sector responsiveness data, the base station sector coverage data and the base station dynamic interaction data.
It should be noted that, in order to obtain the dynamic trajectory situation of the first identified user within the preset time period, the signaling response address data of the first identified user within the preset time period is used to extract the base station response identification data, that is, the identification data of the signaling corresponding to the response base station, the base station sector corresponding to the identification in the area is obtained by querying in the base station identification database according to the base station response identification data, and the base station sector dynamic information is extracted, that is, the base station sector is located in the base station identification database through the location identification of the base station and the dynamic interaction data between the signaling and the identified base station sector is obtained, the base station sector dynamic information includes base station sector response data, base station sector coverage data, and base station dynamic interaction data, each data reflects the interaction situation of the signaling response base station sector, the interaction frequency, the response intensity and the interaction duration of each base station sector or the intersection coverage area of the base station sector, and the parameter values corresponding to the interaction duration, the frequency, the coverage area, the repeatability, the base station sector dynamic trajectory of the base station which can reflect the interaction situation and the activity situation of the signaling corresponding to the first identified user signaling in the base station sector, and the dynamic trajectory of the signaling between the sectors and the user in the switching situation of the signaling trajectory.
According to the embodiment of the present invention, the extracting of the base station sector coverage identification data according to the base station sector dynamic trajectory sketch and the processing of the corresponding signaling response time data of the identification data to obtain the sector intersection density data specifically include:
extracting base station sector coverage identification data according to the base station sector dynamic track portrait, wherein the base station sector coverage identification data comprises base station arrangement identification data, sector intersection degree data, sector track identification data and sector residence data;
calculating according to the base station arrangement identification data, the sector intersection degree data, the sector track identification data and the sector residence data in combination with a signaling response time period node in a preset time period to obtain sector intersection density data in the preset time period;
the sector intersection density data calculation formula is as follows:
Figure 372565DEST_PATH_IMAGE001
where P is the sector intersection density data,
Figure 72274DEST_PATH_IMAGE013
sector resident data of the node for the ith signaling response time period,
Figure 452440DEST_PATH_IMAGE014
sector-diversity-level data for the ith signaling response time period node,
Figure 764472DEST_PATH_IMAGE015
identifying data for the sector trace of the ith signaling response time period node,
Figure 812063DEST_PATH_IMAGE016
arranging identification data for a base station, wherein n is the number of signaling response time period nodes in a preset time period, i is the ith signaling response time period node in the n time period nodes,
Figure 420024DEST_PATH_IMAGE017
Is the track density coefficient (
Figure 553065DEST_PATH_IMAGE018
Obtained by querying the base station identification database according to the sector track identification data).
It should be noted that, the base station sector coverage identification data extracted according to the base station sector dynamic trajectory representation includes base station arrangement identification data, sector intersection degree data, sector trajectory identification data and sector residence data, the data reflects the base station path identification arrangement, the intersection area between sectors, the sector trajectory identification, the response residence time length and residence frequency of the sector and sector intersection area, and then the sector intersection density data in the preset time period is obtained by calculating the number in combination with the signaling response time period node in the preset time period, that is, the sector intersection density data of the user is obtained by performing the programmed calculation of the time node data accumulation on the base station sector intersection area distribution, the sector trajectory path, the sector response time length and the sector residence frequency corresponding to each signaling response time period node in the preset time period, and the density data reflects the activity intensity of the user in the sector or sector intersection area and the distribution density of the signaling response sector or sector intersection area in the preset time period, and the residence condition of each point of the user in the area and the residence point can be measured by the sector density.
According to the embodiment of the present invention, the step of performing threshold comparison according to the sector intersection density data and a preset resident intersection density threshold to screen a corresponding identified user meeting a threshold comparison requirement, and marking the identified user as a second identified user specifically includes:
comparing a threshold value according to the sector intersection density data of the first identification user and a preset resident intersection density threshold value;
and identifying the first identification user which meets the threshold comparison result, and marking the first identification user as a second identification user.
It should be noted that, a threshold value comparison is performed according to the obtained sector intersection density data and a preset resident intersection density threshold value, users who do not meet the activity intensity of the sector or sector intersection area and the distribution density of the sector or sector intersection area of the signaling response within a preset time period are removed, and users who do not meet the requirements of parking and route within the area are removed, wherein the resident intersection density threshold value can be set according to the area attribute information, for example, the threshold value is set to 85%.
According to the embodiment of the present invention, the sector intersection density data according to the second identified user is input into a preset sector grid dense model for processing to obtain grid density distribution data, and the specific steps are as follows:
Acquiring a trained sector grid dense model;
and inputting the sector intersection density data of the second identification user in combination with the sector intersection degree data and the sector track identification data into a trained sector grid dense distribution model for processing to obtain grid density distribution data.
It should be noted that, the sector intersection density data of the second identification user is input to a trained sector grid density distribution model in combination with the sector intersection density data and the sector track identification data to obtain grid density distribution data, that is, model rasterization is performed according to the activity intensity of the sector or the sector intersection region in a preset time period and the distribution density of the sector or the sector intersection region responded by the signaling to obtain a density grid which is adaptive and reflects the activity intensity and the activity distribution density, rasterization is performed according to the activity intensity and the density condition of the signaling in the sector to obtain an adaptive density grid and grid density distribution data, that is, the grid density distribution is performed on the activity intensity and the activity distribution condition of the sector and the sector intersection region of the base station, wherein the sector grid density distribution model is obtained by performing training processing on a large number of historical data samples of sector intersection density data, sector track identification data and grid density distribution data, the more accurate the processing result obtained by the model is obtained, the sector grid density distribution model is output to a preset grid density distribution model according to the historical data of the sector intersection density data, the sector intersection density data and the sector track identification data are input to the preset threshold value, and the learning threshold value is obtained by the learning threshold value is output to the sector grid density learning threshold value.
According to the embodiment of the present invention, the aggregating is performed according to the grid density distribution data to obtain the grid density distribution map in the area, and the density value conversion is performed according to the grid density distribution map in the area to obtain the density distribution situation of the resident population in the area, specifically:
aggregating according to the grid density distribution data to obtain grid density distribution data in the area;
constructing a grid density distribution map according to the grid density distribution data;
performing population density value conversion on the grid density distribution map according to a preset grid density comparison value to obtain regional resident population density layout data;
and taking the area resident population density distribution data as the statistical area resident population density distribution situation in the preset time period.
The method includes the steps of obtaining grid density distribution data of second identification users in an area by aggregating the obtained grid density distribution data of the second identification users in the area, enabling activity intensity and activity density of the second identification users at each point and each sub-area in the area to be reflected in a centralized mode, constructing an area grid density distribution map according to the data, enabling the distribution map to be a digitalized grid density distribution map, reflecting parking, activity intensity and activity density of the second identification users in the area, conducting density value conversion according to the area grid density distribution map to obtain an area resident population density distribution situation, namely displaying the activity distribution and the parking distribution situation of the second identification users in the area through data conversion of preset density values, and reflecting the area resident population density distribution situation.
A third aspect of the present invention provides a readable storage medium, which includes a signaling big data based regional demographic method program, and when the signaling big data based regional demographic method program is executed by a processor, the method implements the steps of the signaling big data based regional demographic method as described in any one of the above.
The invention discloses a region population statistical method, a system and a storage medium based on signaling big data.A base station sector dynamic track portrait is obtained by obtaining a signaling data information set of terminal users in a region within a preset time period, and then extracting base station sector coverage identification data and combining with signaling response time data to process so as to obtain sector intersection density data, then model processing is carried out according to the sector intersection density data of a second identification user, and aggregation is carried out so as to obtain a grid density distribution map in the region to convert so as to obtain population density distribution conditions; therefore, the sector intersection density data obtained by cleaning the signaling data of the terminal user are screened out based on the signaling big data, the grid density distribution data obtained by model processing are subjected to aggregation conversion to obtain population density distribution data, the user sector grid track parking density recognition is carried out according to the signaling data to obtain population density distribution conditions, and accurate statistics on the population resident density distribution in the region is realized.
In the several embodiments provided in the present application, it should be understood that the disclosed apparatus and method may be implemented in other ways. The above-described device embodiments are merely illustrative, for example, the division of the unit is only a logical functional division, and there may be other division ways in actual implementation, such as: multiple units or components may be combined, or may be integrated into another system, or some features may be omitted, or not implemented. In addition, the coupling, direct coupling or communication connection between the components shown or discussed may be through some interfaces, and the indirect coupling or communication connection between the devices or units may be electrical, mechanical or other forms.
The units described as separate parts may or may not be physically separate, and parts displayed as units may or may not be physical units; can be located in one place or distributed on a plurality of network units; some or all of the units can be selected according to actual needs to achieve the purpose of the solution of the embodiment.
In addition, all the functional units in the embodiments of the present invention may be integrated into one processing unit, or each unit may be separately regarded as one unit, or two or more units may be integrated into one unit; the integrated unit may be implemented in the form of hardware, or in the form of hardware plus a software functional unit.
Those of ordinary skill in the art will understand that: all or part of the steps for realizing the method embodiments can be completed by hardware related to program instructions, the program can be stored in a readable storage medium, and the program executes the steps comprising the method embodiments when executed; and the aforementioned storage medium includes: a mobile storage device, a Read-Only Memory (ROM), a Random Access Memory (RAM), a magnetic disk or an optical disk, and various media capable of storing program codes.
Alternatively, the integrated unit of the present invention may be stored in a readable storage medium if it is implemented in the form of a software functional module and sold or used as a separate product. Based on such understanding, the technical solutions of the embodiments of the present invention may be essentially implemented or a part contributing to the prior art may be embodied in the form of a software product stored in a storage medium, and including several instructions for causing a computer device (which may be a personal computer, a server, or a network device) to execute all or part of the methods described in the embodiments of the present invention. And the aforementioned storage medium includes: a removable storage device, a ROM, a RAM, a magnetic or optical disk, or various other media that can store program code.

Claims (9)

1. A regional demographic method based on signaling big data, comprising the steps of:
acquiring a signaling data information set of terminal users in an area within a preset time period;
according to the signaling data information set, data extraction is carried out to obtain signaling response time data and signaling response address data, and data cleaning is carried out to obtain a first identification user;
acquiring base station sector dynamic information corresponding to a signaling response address of the first identification user in the preset time period and generating a base station sector dynamic track portrait of the first identification user;
extracting base station sector coverage identification data according to the base station sector dynamic track portrait and combining corresponding signaling response time data of the identification data to process to obtain sector intersection density data;
comparing the sector intersection density data with a preset resident intersection density threshold value according to the threshold value, screening corresponding identification users meeting the threshold value comparison requirement, and marking the identification users as second identification users;
inputting the sector intersection density data of the second identification user into a preset sector grid dense distribution model for processing to obtain grid density distribution data;
aggregating according to the grid density distribution data to obtain an area grid density distribution map, and performing density value conversion according to the area grid density distribution map to obtain an area resident population density distribution condition;
The method for extracting the base station sector coverage identification data according to the base station sector dynamic track portrait and processing the base station sector coverage identification data by combining the corresponding signaling response time data of the identification data to obtain the sector intersection density data comprises the following steps:
extracting base station sector coverage identification data according to the base station sector dynamic track portrait, wherein the base station sector coverage identification data comprises base station arrangement identification data, sector intersection degree data, sector track identification data and sector residence data;
calculating according to the base station arrangement identification data, the sector intersection degree data, the sector track identification data and the sector residence data in combination with a signaling response time period node in a preset time period to obtain sector intersection density data in the preset time period;
the calculation formula of the sector intersection density data is as follows:
Figure 247773DEST_PATH_IMAGE001
where P is the sector intersection density data,
Figure 708841DEST_PATH_IMAGE002
sector resident data of the node for the ith signaling response time period,
Figure 76369DEST_PATH_IMAGE003
sector-diversity-level data for the ith signaling response time period node,
Figure 255677DEST_PATH_IMAGE004
identifying data for the sector trace of the ith signaling response time period node,
Figure 232598DEST_PATH_IMAGE005
arranging identification data for a base station, wherein n is the number of signaling response time period nodes in a preset time period, i is the ith signaling response time period node in the n time period nodes,
Figure 282594DEST_PATH_IMAGE006
Is the track density coefficient.
2. The method for regional demographic statistics based on signaling big data as claimed in claim 1, wherein the obtaining of the signaling data information set of the end user in the region within the preset time period, performing data extraction according to the signaling data information set to obtain the signaling response time data and the signaling response address data, and performing data cleaning to obtain the first identified user comprises:
acquiring a signaling data information set of a terminal user in an area within a preset time period, wherein the signaling data information set comprises signaling time information, signaling positioning information, signaling interaction information and signaling service identification information;
extracting data according to the signaling data information set to obtain signaling response time data and signaling response address data of signaling response;
carrying out time point, time point positioning and duration data cleaning according to the signaling response time data and the signaling response address data to screen out signaling data information which does not meet the requirement;
and marking the terminal user corresponding to the cleaned and retained second signaling data information set as a first identification user.
3. The signaling big data based regional demographic method as claimed in claim 2, wherein the obtaining base station sector dynamic information corresponding to the signaling response address of the first identified user in the preset time period and generating a base station sector dynamic trajectory representation of the first identified user comprises:
Extracting base station response identification data according to the signaling response address data of the first identification user in the preset time period;
inquiring in a base station identification database according to the base station response identification data to obtain base station sectors corresponding to the identification in the area and extracting dynamic information of the base station sectors;
the base station sector dynamic information comprises base station sector response data, base station sector coverage data and base station dynamic interaction data;
and generating a base station sector dynamic track portrait of the first identification user according to the base station sector responsiveness data, the base station sector coverage data and the base station dynamic interaction data.
4. The signaling big data-based regional demographic method as claimed in claim 1, wherein the selecting, according to the threshold comparison between the sector intersection density data and a preset resident intersection density threshold, a corresponding identified user meeting a threshold comparison requirement as a second identified user comprises:
comparing a threshold value according to the sector intersection density data of the first identification user and a preset resident intersection density threshold value;
and identifying the first identification user which meets the threshold comparison result, and marking the first identification user as a second identification user.
5. The signaling big data based regional demographic method as recited in claim 4, wherein the inputting sector intersection density data of the second identified user into a preset sector grid dense distribution model for processing to obtain grid density distribution data comprises:
acquiring a trained sector grid dense model;
and inputting the sector intersection density data of the second identification user in combination with the sector intersection degree data and the sector track identification data into a trained sector grid dense distribution model for processing to obtain grid density distribution data.
6. The regional demographic method as claimed in claim 5, wherein the aggregating according to the grid density distribution data to obtain an intra-regional grid density distribution map and performing density value conversion according to the intra-regional grid density distribution map to obtain a regional resident population density distribution condition comprises:
aggregating according to the grid density distribution data to obtain grid density distribution data in the area;
constructing a grid density distribution map according to the grid density distribution data;
performing population density value conversion on the grid density distribution map according to a preset grid density comparison value to obtain regional resident population density distribution data;
And taking the area resident population density distribution data as the statistical area resident population density distribution situation in the preset time period.
7. A regional demographic system based on signaling big data, the system comprising: a memory and a processor, wherein the memory includes a program of a region demographic method based on signaling big data, and the program of the region demographic method based on signaling big data realizes the following steps when being executed by the processor:
acquiring a signaling data information set of terminal users in an area within a preset time period;
performing data extraction according to the signaling data information set to obtain signaling response time data and signaling response address data, and performing data cleaning to obtain a first identification user;
acquiring base station sector dynamic information corresponding to a signaling response address of the first identification user in the preset time period and generating a base station sector dynamic track portrait of the first identification user;
extracting base station sector coverage identification data according to the base station sector dynamic track portrait and combining corresponding signaling response time data of the identification data to process to obtain sector intersection density data;
comparing the sector intersection density data with a preset resident intersection density threshold value according to the threshold value, screening corresponding identification users meeting the threshold value comparison requirement, and marking the identification users as second identification users;
Inputting the sector intersection density data of the second identification user into a preset sector grid dense distribution model for processing to obtain grid density distribution data;
aggregating according to the grid density distribution data to obtain an area grid density distribution map, and performing density value conversion according to the area grid density distribution map to obtain an area resident population density distribution condition;
the method for extracting the base station sector coverage identification data according to the base station sector dynamic track portrait and processing the base station sector coverage identification data by combining the corresponding signaling response time data of the identification data to obtain the sector intersection density data comprises the following steps:
extracting base station sector coverage identification data according to the base station sector dynamic track portrait, wherein the base station sector coverage identification data comprises base station arrangement identification data, sector intersection degree data, sector track identification data and sector residence data;
calculating according to the base station arrangement identification data, the sector intersection degree data, the sector track identification data and the sector residence data in combination with a signaling response time period node in a preset time period to obtain sector intersection density data in the preset time period;
the sector intersection density data calculation formula is as follows:
Figure 24285DEST_PATH_IMAGE007
where P is the sector intersection density data,
Figure 392687DEST_PATH_IMAGE002
Sector resident data of the node for the ith signaling response time period,
Figure 674764DEST_PATH_IMAGE003
sector-diversity-level data for the ith signaling response time period node,
Figure 579266DEST_PATH_IMAGE004
identifying data for the sector trace of the ith signaling response time period node,
Figure 787131DEST_PATH_IMAGE005
arranging identification data for a base station, wherein n is the number of signaling response time period nodes in a preset time period, i is the ith signaling response time period node in the n time period nodes,
Figure 878715DEST_PATH_IMAGE006
is the track density coefficient.
8. The system of claim 7, wherein the acquiring signaling data information sets of end users in a region within a preset time period, performing data extraction according to the signaling data information sets to acquire signaling response time data and signaling response address data, and performing data cleaning to acquire a first identified user comprises:
acquiring a signaling data information set of a terminal user in an area within a preset time period, wherein the signaling data information set comprises signaling time information, signaling positioning information, signaling interaction information and signaling service identification information;
performing data extraction according to the signaling data information set to obtain signaling response time data and signaling response address data of signaling response;
carrying out time point, time point positioning and duration data cleaning according to the signaling response time data and the signaling response address data to screen out signaling data information which does not meet the requirement;
And marking the terminal user corresponding to the cleaned and retained second signaling data information set as a first identification user.
9. A computer-readable storage medium, characterized in that the computer-readable storage medium comprises a signaling big data based region demographic method program, which when executed by a processor implements the steps of the signaling big data based region demographic method according to any one of claims 1 to 6.
CN202211359377.7A 2022-11-02 2022-11-02 Regional demographic method, system and storage medium based on signaling big data Active CN115409434B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202211359377.7A CN115409434B (en) 2022-11-02 2022-11-02 Regional demographic method, system and storage medium based on signaling big data

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202211359377.7A CN115409434B (en) 2022-11-02 2022-11-02 Regional demographic method, system and storage medium based on signaling big data

Publications (2)

Publication Number Publication Date
CN115409434A CN115409434A (en) 2022-11-29
CN115409434B true CN115409434B (en) 2023-03-24

Family

ID=84169260

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202211359377.7A Active CN115409434B (en) 2022-11-02 2022-11-02 Regional demographic method, system and storage medium based on signaling big data

Country Status (1)

Country Link
CN (1) CN115409434B (en)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116017333A (en) * 2022-12-14 2023-04-25 北京市大数据中心 Population identification method, system and storage medium based on big data signaling processing
CN115866547B (en) * 2023-03-01 2023-06-30 北京融信数联科技有限公司 Fixed-area tourist statistics method, system and storage medium based on signaling data
CN116485143B (en) * 2023-04-27 2024-06-07 华北水利水电大学 Space planning processing method based on population density big data

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111615054A (en) * 2020-05-25 2020-09-01 和智信(山东)大数据科技有限公司 Population analysis method and device
CN113993120A (en) * 2021-12-30 2022-01-28 中国移动紫金(江苏)创新研究院有限公司 Demographic method and system based on wireless user signaling

Family Cites Families (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2011052473A1 (en) * 2009-10-30 2011-05-05 株式会社エヌ・ティ・ティ・ドコモ Management server, population information calculation management server, zero population distribution area management method, and population information calculation method
US9773328B2 (en) * 2014-08-18 2017-09-26 Wells Fargo Bank, N.A. Density gradient analysis tool
CN105682025B (en) * 2016-01-05 2019-01-04 重庆邮电大学 User based on mobile signaling protocol data resident ground recognition methods
CN108322891B (en) * 2018-01-11 2020-08-28 江苏欣网视讯软件技术有限公司 Traffic area congestion identification method based on user mobile phone signaling
CN110505576A (en) * 2018-05-19 2019-11-26 北京融信数联科技有限公司 A kind of geo-demographics' method based on base station coverage sector gridding
CN110245981B (en) * 2019-05-31 2021-10-01 南京瑞栖智能交通技术产业研究院有限公司 Crowd type identification method based on mobile phone signaling data
CN114741612B (en) * 2022-06-13 2022-09-02 北京融信数联科技有限公司 Consumption habit classification method, system and storage medium based on big data

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111615054A (en) * 2020-05-25 2020-09-01 和智信(山东)大数据科技有限公司 Population analysis method and device
CN113993120A (en) * 2021-12-30 2022-01-28 中国移动紫金(江苏)创新研究院有限公司 Demographic method and system based on wireless user signaling

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
基于移动通信大数据的城市人口空间分布统计;周天绮;《计算机与现代化》;20180515(第05期);全文 *

Also Published As

Publication number Publication date
CN115409434A (en) 2022-11-29

Similar Documents

Publication Publication Date Title
CN115409434B (en) Regional demographic method, system and storage medium based on signaling big data
CN107948928B (en) Empirical expert determination and problem routing system and method
CN110309437B (en) Information pushing method and device
CN104484790A (en) Address match method and device of logistics business
CN109151890A (en) A kind of mobile terminal locating method and device
CN110851646B (en) Working efficiency statistical method for intelligent park
CN114741612B (en) Consumption habit classification method, system and storage medium based on big data
CN113613174A (en) Method, device and storage medium for identifying occupational sites based on mobile phone signaling data
CN113392338A (en) Hotspot area positioning method, device, equipment and storage medium
CN109660936A (en) A kind of face and mobile device identification number mapping method, system and readable storage medium storing program for executing
CN110968712B (en) BIM family model acquisition method, device and system
CN110991751B (en) User life mode prediction method and device, electronic equipment and storage medium
CN113722617A (en) Method and device for identifying actual office address of enterprise and electronic equipment
CN111898918A (en) Equipment use tracking management method, device, equipment and storage medium
CN116017333A (en) Population identification method, system and storage medium based on big data signaling processing
CN116761185A (en) Method, system and medium for predicting daily active users based on signaling
CN113158084A (en) Method and device for processing movement track data, computer equipment and storage medium
CN113065016A (en) Offline store information processing method, device, equipment and system
CN111092764A (en) Real-time dynamic intimacy relationship analysis method and system
CN110059234A (en) Water utilities anomalous event method for detecting and device, computer installation and storage medium
CN110428627B (en) Bus trip potential area identification method and system
CN110858955B (en) Crowd classification method and crowd classification device
CN112800035A (en) GIS (geographic information System) -based power grid data communication sharing system
CN114385899A (en) User group accurate identification system and method based on big data analysis
CN107992556B (en) Site management method and device, electronic equipment and storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant