CN112541013A - Mobile signaling big data-based due graduate slot hopping frequency analysis method - Google Patents
Mobile signaling big data-based due graduate slot hopping frequency analysis method Download PDFInfo
- Publication number
- CN112541013A CN112541013A CN202010003495.9A CN202010003495A CN112541013A CN 112541013 A CN112541013 A CN 112541013A CN 202010003495 A CN202010003495 A CN 202010003495A CN 112541013 A CN112541013 A CN 112541013A
- Authority
- CN
- China
- Prior art keywords
- time
- graduate
- base station
- residence
- working
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/24—Querying
- G06F16/245—Query processing
- G06F16/2458—Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
- G06F16/2462—Approximate or statistical queries
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/24—Querying
- G06F16/245—Query processing
- G06F16/2458—Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
- G06F16/2465—Query processing support for facilitating data mining operations in structured databases
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q10/00—Administration; Management
- G06Q10/10—Office automation; Time management
- G06Q10/105—Human resources
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F2216/00—Indexing scheme relating to additional aspects of information retrieval not explicitly covered by G06F16/00 and subgroups
- G06F2216/03—Data mining
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Business, Economics & Management (AREA)
- Human Resources & Organizations (AREA)
- Data Mining & Analysis (AREA)
- Databases & Information Systems (AREA)
- Probability & Statistics with Applications (AREA)
- General Physics & Mathematics (AREA)
- Strategic Management (AREA)
- Computational Linguistics (AREA)
- General Engineering & Computer Science (AREA)
- Entrepreneurship & Innovation (AREA)
- Software Systems (AREA)
- Mathematical Physics (AREA)
- Fuzzy Systems (AREA)
- Economics (AREA)
- Marketing (AREA)
- Operations Research (AREA)
- Tourism & Hospitality (AREA)
- General Business, Economics & Management (AREA)
- Quality & Reliability (AREA)
- Mobile Radio Communication Systems (AREA)
Abstract
The invention provides a graduate groove-skipping frequency analysis method based on mobile signaling big data, which is characterized in that by means of mobile signaling containing abundant information, a design caliber and an algorithm are used for dynamically and effectively mining and analyzing the living working condition after graduate leaving school, the data source is reliable, the reliability of an analysis result is high, and reliable basis can be provided for making decisions by related management departments.
Description
Technical Field
The invention belongs to the technical field of mobile big data mining and application, and particularly relates to an due graduate slot hopping frequency analysis method based on mobile signaling big data.
Background
College graduates are one of social employment groups with extreme particularity, the current situation of jumping of research graduates not only provides a theoretical framework for exploring how to 'call and reserve' university graduates from the management angle of enterprise employees, but also has important guiding significance for how to orderly realize the life value of the university graduates, but at present, the research on the graduates is limited to before leaving school, the dynamic information after leaving school is not fully mastered, the information after leaving school is mostly dependent on the traditional questionnaire survey technology, the reality of data is difficult to guarantee, and the obtained analysis conclusion is not accurate enough.
Disclosure of Invention
The invention aims to provide a frequency analysis method for the due graduate groove skipping based on mobile signaling big data, which utilizes the mobile big data to dig out mobile track information of a mobile phone user so as to analyze the working and living conditions of a graduate group after leaving school.
The technical scheme of the invention is as follows:
a should end graduate jump frequency analysis method based on mobile signaling big data is characterized in that:
(1) data acquisition: acquiring the position and the access time information of the base station sector where each IMSI identification number is located by utilizing signaling data of a telecom operator, and cleaning the data;
(2) data preprocessing: carrying out interpolation compensation on missing signaling of an in-out base station;
(3) and (3) judging the residence: reading the IMSI at 21:00 to the next day 7:00, according to the track data obtained in the data acquisition step, establishing a statistical table of the information of the base stations accessed in the residence time period corresponding to the IMSI, and further counting the stay time of each base station. And determining the geographic position corresponding to the base station with the longest residence time as the residence, wherein the residence is called a daily residence. And in a natural month, the position where the daily residence is accumulated most is determined as the monthly residence.
(4) The working area judges: reading the IMSI at 7: 00-19: 00, according to the track data obtained in the data acquisition step, establishing a statistical table of the information of the base stations accessed in all the working time periods corresponding to the IMSI, and further counting the stay time of each base station. And judging the geographic position corresponding to the base station with the longest stay time as the work place of the base station, wherein the work place is called a daily work place. And in a natural month, the position where the daily workplace is accumulated and stays most is judged as the monthly workplace.
(5) Identification of the due graduates: and identifying graduates according to the age of the mobile user and the change situation of the monthly residence. The changes in places of residence of graduates are characterised by the presence of: the graduate should stay in the school dormitory between february and june of the research year, but after 7 months of graduations, the school and school will clear the dormitory of the graduate, the place of residence of the graduate will not be in the school scope, and the specific identification method for the graduate according to the characteristics is as follows: the first step is to screen IMSI sets which accord with the graduate age range; secondly, screening IMSIs in the IMSI set, wherein the residence places of the IMSIs in any one month from February to June are within the range of a designated school; and thirdly, IMSI of which the residential areas from September to November are not in the school range is screened from the results of the first two steps, and the IMSI set screened in the third step is the due graduate of the year under study.
(6) Judging the working time length of the first part of the graduate work: and analyzing the change condition of the monthly workplace of the graduates according to the workplace judging method, wherein the time interval of the change of the monthly workplace is the working time length of the graduates in one work. Beginning in July of the study year, the time from the time of collection to the place of graduate to the time of change of place is the working time of the first work of graduate.
The invention implements dynamic and effective mining analysis on the living working condition after graduation leaving school by means of mobile signaling containing rich information and design caliber and algorithm, has reliable data source and high reliability of analysis result, and can provide reliable basis for making decisions by related management departments.
Detailed Description
The specific implementation process of the invention is as follows:
step 1: data acquisition: and acquiring the position and the access time information of the base station sector where each IMSI identification number is located by utilizing signaling data of a telecom operator, and cleaning the data.
Step 2: data preprocessing: carrying out interpolation compensation on missing signaling entering and exiting a base station, and if a user only enters a certain base station time and does not leave the base station time or only leaves the certain base station time and does not enter the base station time in a statistical time period, carrying out interpolation on missing data, wherein interpolation time points are the starting time and the ending time of the statistical time period;
for example, a T user enters sector X at 21:00:00 on day 1 of 6 month, leaves sector X at 7:00:00 on day 2 of 6 month, and the T user enters sector Y at 22:00:00 on day 2 of 6 month, and leaves at 7:00: 00:00 on day 3 of 6 month, and when the information of the T user on day 2 of 6 month is collected, the time point of entering sector X and the time point of leaving sector Y are missing, so that it is necessary to interpolate the time point of entering sector X at 00:00: 00:00 on day 2 of 6 month, and the time point of leaving sector Y at 23:59:59 on day 2 of 6 month.
And step 3: and (3) judging the residence: reading the IMSI at 21:00 to the next day 7:00, according to the track data obtained in the data acquisition step, establishing a statistical table of the information of the base stations accessed in the residence time period corresponding to the IMSI, and further counting the stay time of each base station. And determining the geographic position corresponding to the base station with the longest residence time as the residence, wherein the residence is called a daily residence. And in a natural month, the position where the daily residence is accumulated most is determined as the monthly residence.
And 4, step 4: the working area judges: reading the IMSI at 7: 00-19: 00, according to the track data obtained in the data acquisition step, establishing a statistical table of the information of the base stations accessed in all the working time periods corresponding to the IMSI, and further counting the stay time of each base station. And judging the geographic position corresponding to the base station with the longest stay time as the work place of the base station, wherein the work place is called a daily work place. And in a natural month, the position where the daily workplace is accumulated and stays most is judged as the monthly workplace.
And 5: identification of the due graduates: according to the age of the mobile user and the change situation of the monthly residence, the current graduate is identified.
The changes in places of residence of graduates are characterised by the presence of: the graduates should remain in school dormitories between february and june of the research year, but after 7 months of graduations, the school students will clear their dormitories of the graduates, the places of residence of the graduates will not be within the school, and the graduates are identified according to this feature:
the first step is to screen IMSI sets which accord with the graduate age range; secondly, screening IMSIs in the IMSI set, wherein the residence places of the IMSIs in any one month from February to June are within the range of a designated school; and thirdly, IMSI of which the residential areas from September to November are not in the school range is screened from the results of the first two steps, and the IMSI set screened in the third step is the due graduate of the year under study.
One specific identification method is as follows:
firstly, screening IMSI sets which are in line with the graduation age range between 21 and 30 years, wherein the age range basically covers all graduates from the graduation of the subject to the doctor graduation, and the IMSI sets can be subdivided for distinguishing graduates with different scholars, such as 21 to 24 years of the graduates of the subject, and 25 to 30 years of the graduates of the research students and the graduates of the school students;
secondly, screening the IMSI in the IMSI set, wherein the residence place of the IMSI in any month of April or May is within the range of a designated school;
and finally, IMSI of which the residence areas in September and October are not within the school range is screened in the results of the first two steps, and the IMSI set screened in the three steps is the due graduate of the year.
Step 6: judging the working time length of the first part of the graduate work: and analyzing the change condition of the monthly workplace of the graduates according to the workplace judging method, wherein the time interval of the change of the monthly workplace is the working time length of the graduates in one work. Starting from the seventh month of the research year, the time from the first collection to the first change of the working place of the graduates is the working time of the first work of the graduates, and the time interval is the working time of the first work of the graduates, so that whether the working frequency of the graduate group is frequent or not can be judged according to the screened average working time of the graduates.
By the method, not only the working time of the first part of the graduate can be calculated, but also the working time of each part of the graduate can be calculated, and the working time of the individual graduate is not recommended to be researched when the method is applied, because if the graduate goes on a business trip or has relatively long holiday time (such as more than 3 days), the working time is influenced by the caliber in the judgment method of the monthly working place or the working place is changed, so that the judgment result is influenced, and therefore the average working time of the first part of the graduate which should be ended is recommended to be calculated from the group of the graduates. In addition, according to the method, graduate groups can be subdivided according to the gender and the school calendar and used for researching whether the frequency of jumping the groove is related to the school calendar and the gender.
Claims (6)
1. A graduate slot hopping frequency analysis method based on mobile signaling big data is characterized in that:
(1) data acquisition: acquiring the position and the access time information of the base station sector where each IMSI identification number is located by utilizing signaling data of a telecom operator, and cleaning the data;
(2) data preprocessing: carrying out interpolation compensation on missing signaling of an in-out base station;
(3) and (3) judging the residence: reading the IMSI at 21:00 to the next day 7:00, according to the track data obtained in the data acquisition step, establishing a statistical table of the information of the base stations visited in the residence time period corresponding to the IMSI, further counting the residence time of each base station, and determining the geographic position corresponding to the base station with the longest residence time as the residence place, wherein the residence place is called a daily residence place; in a natural month, the position where the daily residence is accumulated and stays most is judged as the monthly residence;
(4) the working area judges: reading the IMSI at 7: 00-19: 00, according to the track data obtained in the data acquisition step, establishing a statistical table of the information of the base stations accessed in all the working time periods corresponding to the IMSI, further counting the stay time of each base station, and determining the geographical position corresponding to the base station with the longest stay time as the working place of the base station, wherein the working place is called a daily working place; in a natural month, the position where the daily workplace is accumulated and stays most is judged as the monthly workplace;
(5) identification of the due graduates: the first step is to screen IMSI sets which accord with the graduate age range; secondly, screening IMSIs in the IMSI set, wherein the residence places of the IMSIs in any one month from February to June are within the range of a designated school; thirdly, IMSI of which the residential areas from September to November are not in the school range is screened from the results of the first two steps, and the IMSI set screened in the third step is the due graduate of the year under study;
(6) judging the working time length of the first part of the graduate work: according to the working place judging method, the change situation of the graduate monthly working place is analyzed, starting from the seventh month of the research year, from the time of first collection to the working place of the graduate to the time of first change of the working place, and the time interval is the working time of the first work of the graduate.
2. The method of claim 1 for frequency analysis of due graduate slot hopping based on big data of mobile signaling, characterized in that: the interpolation compensation in the step (3) means that if the user only enters a certain base station time and does not leave the base station time or only leaves the certain base station time and does not enter the base station time in the statistical time period, missing data is interpolated, and the interpolation time points are the starting time and the ending time of the statistical time period.
3. The method of claim 1 for frequency analysis of due graduate slot hopping based on big data of mobile signaling, characterized in that: the graduate age range in the first screening of step (5) is between 21 and 30 years of age.
4. The method of claim 1 for frequency analysis of due graduate slot hopping based on big data of mobile signaling, characterized in that: the time selection in the second step of screening in the step (5) can be two months from February to June, and the purpose of doing so is to reduce the calculated amount as much as possible on the basis of ensuring the accuracy.
5. The method of claim 1 for frequency analysis of due graduate slot hopping based on big data of mobile signaling, characterized in that: the time selection in the third step of the step (5) can be at least two months from September to November, and the selection time is continuous in time, so that the calculation amount is reduced as much as possible on the basis of ensuring the accuracy.
6. The method of claim 1 for frequency analysis of due graduate slot hopping based on big data of mobile signaling, characterized in that: the specific implementation method of the time interval of the monthly workplace change in the step (6) is that the working places of all graduates are identified from July, if the working place of the next month is not changed, the working time is added by one month in an accumulated mode, if the working place is changed, the work is considered to be changed, and the accumulated month number before the work is changed is the working time of one work.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010003495.9A CN112541013B (en) | 2020-01-02 | 2020-01-02 | Mobile signaling big data-based due graduate slot hopping frequency analysis method |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010003495.9A CN112541013B (en) | 2020-01-02 | 2020-01-02 | Mobile signaling big data-based due graduate slot hopping frequency analysis method |
Publications (2)
Publication Number | Publication Date |
---|---|
CN112541013A true CN112541013A (en) | 2021-03-23 |
CN112541013B CN112541013B (en) | 2021-12-28 |
Family
ID=75013343
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202010003495.9A Active CN112541013B (en) | 2020-01-02 | 2020-01-02 | Mobile signaling big data-based due graduate slot hopping frequency analysis method |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN112541013B (en) |
Citations (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103942674A (en) * | 2014-04-11 | 2014-07-23 | 北京工业大学 | User-oriented method for extracting career information of fresh university graduates |
CN104348635A (en) * | 2013-07-24 | 2015-02-11 | 中国移动通信集团福建有限公司 | Mobile user behavior analysis method and apparatus |
CN105657666A (en) * | 2016-03-31 | 2016-06-08 | 东南大学 | Commercial employee group residence recognition method based on mobile phone positioning data |
CN106503843A (en) * | 2016-10-20 | 2017-03-15 | 上海萃图数字科技有限公司 | A kind of regular public traffic line network optimization and method of adjustment based on mobile phone signaling data |
CN107040894A (en) * | 2017-04-21 | 2017-08-11 | 杭州市综合交通研究中心 | A kind of resident trip OD acquisition methods based on mobile phone signaling data |
US20180129960A1 (en) * | 2016-11-10 | 2018-05-10 | Facebook, Inc. | Contact information confidence |
CN109063116A (en) * | 2018-07-27 | 2018-12-21 | 考拉征信服务有限公司 | Data identification method, device, electronic equipment and computer readable storage medium |
US20190174449A1 (en) * | 2018-02-09 | 2019-06-06 | Intel Corporation | Technologies to authorize user equipment use of local area data network features and control the size of local area data network information in access and mobility management function |
WO2019108133A1 (en) * | 2017-11-30 | 2019-06-06 | X0Pa Ai Pte Ltd | Talent management platform |
CN110418287A (en) * | 2019-07-12 | 2019-11-05 | 重庆市交通规划研究院 | Migrate recognition methods to inhabitants live based on mobile phone signaling |
-
2020
- 2020-01-02 CN CN202010003495.9A patent/CN112541013B/en active Active
Patent Citations (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104348635A (en) * | 2013-07-24 | 2015-02-11 | 中国移动通信集团福建有限公司 | Mobile user behavior analysis method and apparatus |
CN103942674A (en) * | 2014-04-11 | 2014-07-23 | 北京工业大学 | User-oriented method for extracting career information of fresh university graduates |
CN105657666A (en) * | 2016-03-31 | 2016-06-08 | 东南大学 | Commercial employee group residence recognition method based on mobile phone positioning data |
CN106503843A (en) * | 2016-10-20 | 2017-03-15 | 上海萃图数字科技有限公司 | A kind of regular public traffic line network optimization and method of adjustment based on mobile phone signaling data |
US20180129960A1 (en) * | 2016-11-10 | 2018-05-10 | Facebook, Inc. | Contact information confidence |
CN107040894A (en) * | 2017-04-21 | 2017-08-11 | 杭州市综合交通研究中心 | A kind of resident trip OD acquisition methods based on mobile phone signaling data |
WO2019108133A1 (en) * | 2017-11-30 | 2019-06-06 | X0Pa Ai Pte Ltd | Talent management platform |
US20190174449A1 (en) * | 2018-02-09 | 2019-06-06 | Intel Corporation | Technologies to authorize user equipment use of local area data network features and control the size of local area data network information in access and mobility management function |
CN109063116A (en) * | 2018-07-27 | 2018-12-21 | 考拉征信服务有限公司 | Data identification method, device, electronic equipment and computer readable storage medium |
CN110418287A (en) * | 2019-07-12 | 2019-11-05 | 重庆市交通规划研究院 | Migrate recognition methods to inhabitants live based on mobile phone signaling |
Non-Patent Citations (1)
Title |
---|
李诗雅: "广西本科毕业生就业质量评价研究", 《中国优秀硕士学位论文全文数据库 社会科学II辑》 * |
Also Published As
Publication number | Publication date |
---|---|
CN112541013B (en) | 2021-12-28 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Becker et al. | A tale of one city: Using cellular network data for urban planning | |
CN110324787B (en) | Method for acquiring occupational sites of mobile phone signaling data | |
CN111582948B (en) | Individual behavior analysis method based on mobile phone signaling data and POI (Point of interest) | |
CN109583640A (en) | A kind of Urban Traffic passenger flow attribute recognition approach based on multi-source location data | |
CN105513351A (en) | Traffic travel characteristic data extraction method based on big data | |
Pan et al. | Cellular-based data-extracting method for trip distribution | |
CN106464706A (en) | Method and system for identifying significant locations through data obtainable from telecommunication network | |
CN107194525A (en) | A kind of down town appraisal procedure based on mobile phone signaling | |
CN106547894A (en) | The system and method that location tags are lived in duty is excavated based on mobile communication signaling big data | |
CN106339716A (en) | Mobile trajectory similarity matching method based on weighted Euclidean distance | |
CN112001829B (en) | Population distribution judging method based on mobile phone signaling data | |
CN107977673A (en) | A kind of economically active population's recognition methods based on big data | |
Demissie et al. | Inferring origin-destination flows using mobile phone data: A case study of Senegal | |
CN109842886A (en) | Communication network subtracts frequency logout method and device | |
CN112512032B (en) | Mobile phone signaling data-based external trip crowd identification method | |
CN109670540A (en) | It is resident number variation tendency Forecasting Approach for Short-term in Passenger Transport Hub region based on kNN algorithm | |
CN110245773A (en) | A kind of method that multi-source fact space-time predictor extracted and be included in interpretation of scheme application | |
CN111417075B (en) | User workplace identification method based on mobile communication big data | |
CN112541013B (en) | Mobile signaling big data-based due graduate slot hopping frequency analysis method | |
CN112738729A (en) | Method and system for distinguishing visiting hometown visitor by mobile phone signaling data | |
CN114564552A (en) | Territorial space planning method based on GIS | |
CN110958558A (en) | Mobile big data-based mobile phone user space-time trajectory depicting method | |
CN112561759B (en) | Graduate forward dynamic monitoring method based on mobile signaling big data | |
CN102638821A (en) | Method for mining move modes of users on basis of mobile phone base station data | |
Charles-Edwards | The estimation of temporary populations in Australia |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |