CN112541013A - Mobile signaling big data-based due graduate slot hopping frequency analysis method - Google Patents

Mobile signaling big data-based due graduate slot hopping frequency analysis method Download PDF

Info

Publication number
CN112541013A
CN112541013A CN202010003495.9A CN202010003495A CN112541013A CN 112541013 A CN112541013 A CN 112541013A CN 202010003495 A CN202010003495 A CN 202010003495A CN 112541013 A CN112541013 A CN 112541013A
Authority
CN
China
Prior art keywords
time
graduate
base station
residence
working
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202010003495.9A
Other languages
Chinese (zh)
Other versions
CN112541013B (en
Inventor
贺炎俊
秦星星
刘增礼
于海薇
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beiling Rongxin Datalnfo Science and Technology Ltd
Original Assignee
Beiling Rongxin Datalnfo Science and Technology Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beiling Rongxin Datalnfo Science and Technology Ltd filed Critical Beiling Rongxin Datalnfo Science and Technology Ltd
Priority to CN202010003495.9A priority Critical patent/CN112541013B/en
Publication of CN112541013A publication Critical patent/CN112541013A/en
Application granted granted Critical
Publication of CN112541013B publication Critical patent/CN112541013B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2458Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
    • G06F16/2462Approximate or statistical queries
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2458Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
    • G06F16/2465Query processing support for facilitating data mining operations in structured databases
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/10Office automation; Time management
    • G06Q10/105Human resources
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F2216/00Indexing scheme relating to additional aspects of information retrieval not explicitly covered by G06F16/00 and subgroups
    • G06F2216/03Data mining

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • Human Resources & Organizations (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Probability & Statistics with Applications (AREA)
  • General Physics & Mathematics (AREA)
  • Strategic Management (AREA)
  • Computational Linguistics (AREA)
  • General Engineering & Computer Science (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Software Systems (AREA)
  • Mathematical Physics (AREA)
  • Fuzzy Systems (AREA)
  • Economics (AREA)
  • Marketing (AREA)
  • Operations Research (AREA)
  • Tourism & Hospitality (AREA)
  • General Business, Economics & Management (AREA)
  • Quality & Reliability (AREA)
  • Mobile Radio Communication Systems (AREA)

Abstract

The invention provides a graduate groove-skipping frequency analysis method based on mobile signaling big data, which is characterized in that by means of mobile signaling containing abundant information, a design caliber and an algorithm are used for dynamically and effectively mining and analyzing the living working condition after graduate leaving school, the data source is reliable, the reliability of an analysis result is high, and reliable basis can be provided for making decisions by related management departments.

Description

Mobile signaling big data-based due graduate slot hopping frequency analysis method
Technical Field
The invention belongs to the technical field of mobile big data mining and application, and particularly relates to an due graduate slot hopping frequency analysis method based on mobile signaling big data.
Background
College graduates are one of social employment groups with extreme particularity, the current situation of jumping of research graduates not only provides a theoretical framework for exploring how to 'call and reserve' university graduates from the management angle of enterprise employees, but also has important guiding significance for how to orderly realize the life value of the university graduates, but at present, the research on the graduates is limited to before leaving school, the dynamic information after leaving school is not fully mastered, the information after leaving school is mostly dependent on the traditional questionnaire survey technology, the reality of data is difficult to guarantee, and the obtained analysis conclusion is not accurate enough.
Disclosure of Invention
The invention aims to provide a frequency analysis method for the due graduate groove skipping based on mobile signaling big data, which utilizes the mobile big data to dig out mobile track information of a mobile phone user so as to analyze the working and living conditions of a graduate group after leaving school.
The technical scheme of the invention is as follows:
a should end graduate jump frequency analysis method based on mobile signaling big data is characterized in that:
(1) data acquisition: acquiring the position and the access time information of the base station sector where each IMSI identification number is located by utilizing signaling data of a telecom operator, and cleaning the data;
(2) data preprocessing: carrying out interpolation compensation on missing signaling of an in-out base station;
(3) and (3) judging the residence: reading the IMSI at 21:00 to the next day 7:00, according to the track data obtained in the data acquisition step, establishing a statistical table of the information of the base stations accessed in the residence time period corresponding to the IMSI, and further counting the stay time of each base station. And determining the geographic position corresponding to the base station with the longest residence time as the residence, wherein the residence is called a daily residence. And in a natural month, the position where the daily residence is accumulated most is determined as the monthly residence.
(4) The working area judges: reading the IMSI at 7: 00-19: 00, according to the track data obtained in the data acquisition step, establishing a statistical table of the information of the base stations accessed in all the working time periods corresponding to the IMSI, and further counting the stay time of each base station. And judging the geographic position corresponding to the base station with the longest stay time as the work place of the base station, wherein the work place is called a daily work place. And in a natural month, the position where the daily workplace is accumulated and stays most is judged as the monthly workplace.
(5) Identification of the due graduates: and identifying graduates according to the age of the mobile user and the change situation of the monthly residence. The changes in places of residence of graduates are characterised by the presence of: the graduate should stay in the school dormitory between february and june of the research year, but after 7 months of graduations, the school and school will clear the dormitory of the graduate, the place of residence of the graduate will not be in the school scope, and the specific identification method for the graduate according to the characteristics is as follows: the first step is to screen IMSI sets which accord with the graduate age range; secondly, screening IMSIs in the IMSI set, wherein the residence places of the IMSIs in any one month from February to June are within the range of a designated school; and thirdly, IMSI of which the residential areas from September to November are not in the school range is screened from the results of the first two steps, and the IMSI set screened in the third step is the due graduate of the year under study.
(6) Judging the working time length of the first part of the graduate work: and analyzing the change condition of the monthly workplace of the graduates according to the workplace judging method, wherein the time interval of the change of the monthly workplace is the working time length of the graduates in one work. Beginning in July of the study year, the time from the time of collection to the place of graduate to the time of change of place is the working time of the first work of graduate.
The invention implements dynamic and effective mining analysis on the living working condition after graduation leaving school by means of mobile signaling containing rich information and design caliber and algorithm, has reliable data source and high reliability of analysis result, and can provide reliable basis for making decisions by related management departments.
Detailed Description
The specific implementation process of the invention is as follows:
step 1: data acquisition: and acquiring the position and the access time information of the base station sector where each IMSI identification number is located by utilizing signaling data of a telecom operator, and cleaning the data.
Step 2: data preprocessing: carrying out interpolation compensation on missing signaling entering and exiting a base station, and if a user only enters a certain base station time and does not leave the base station time or only leaves the certain base station time and does not enter the base station time in a statistical time period, carrying out interpolation on missing data, wherein interpolation time points are the starting time and the ending time of the statistical time period;
for example, a T user enters sector X at 21:00:00 on day 1 of 6 month, leaves sector X at 7:00:00 on day 2 of 6 month, and the T user enters sector Y at 22:00:00 on day 2 of 6 month, and leaves at 7:00: 00:00 on day 3 of 6 month, and when the information of the T user on day 2 of 6 month is collected, the time point of entering sector X and the time point of leaving sector Y are missing, so that it is necessary to interpolate the time point of entering sector X at 00:00: 00:00 on day 2 of 6 month, and the time point of leaving sector Y at 23:59:59 on day 2 of 6 month.
And step 3: and (3) judging the residence: reading the IMSI at 21:00 to the next day 7:00, according to the track data obtained in the data acquisition step, establishing a statistical table of the information of the base stations accessed in the residence time period corresponding to the IMSI, and further counting the stay time of each base station. And determining the geographic position corresponding to the base station with the longest residence time as the residence, wherein the residence is called a daily residence. And in a natural month, the position where the daily residence is accumulated most is determined as the monthly residence.
And 4, step 4: the working area judges: reading the IMSI at 7: 00-19: 00, according to the track data obtained in the data acquisition step, establishing a statistical table of the information of the base stations accessed in all the working time periods corresponding to the IMSI, and further counting the stay time of each base station. And judging the geographic position corresponding to the base station with the longest stay time as the work place of the base station, wherein the work place is called a daily work place. And in a natural month, the position where the daily workplace is accumulated and stays most is judged as the monthly workplace.
And 5: identification of the due graduates: according to the age of the mobile user and the change situation of the monthly residence, the current graduate is identified.
The changes in places of residence of graduates are characterised by the presence of: the graduates should remain in school dormitories between february and june of the research year, but after 7 months of graduations, the school students will clear their dormitories of the graduates, the places of residence of the graduates will not be within the school, and the graduates are identified according to this feature:
the first step is to screen IMSI sets which accord with the graduate age range; secondly, screening IMSIs in the IMSI set, wherein the residence places of the IMSIs in any one month from February to June are within the range of a designated school; and thirdly, IMSI of which the residential areas from September to November are not in the school range is screened from the results of the first two steps, and the IMSI set screened in the third step is the due graduate of the year under study.
One specific identification method is as follows:
firstly, screening IMSI sets which are in line with the graduation age range between 21 and 30 years, wherein the age range basically covers all graduates from the graduation of the subject to the doctor graduation, and the IMSI sets can be subdivided for distinguishing graduates with different scholars, such as 21 to 24 years of the graduates of the subject, and 25 to 30 years of the graduates of the research students and the graduates of the school students;
secondly, screening the IMSI in the IMSI set, wherein the residence place of the IMSI in any month of April or May is within the range of a designated school;
and finally, IMSI of which the residence areas in September and October are not within the school range is screened in the results of the first two steps, and the IMSI set screened in the three steps is the due graduate of the year.
Step 6: judging the working time length of the first part of the graduate work: and analyzing the change condition of the monthly workplace of the graduates according to the workplace judging method, wherein the time interval of the change of the monthly workplace is the working time length of the graduates in one work. Starting from the seventh month of the research year, the time from the first collection to the first change of the working place of the graduates is the working time of the first work of the graduates, and the time interval is the working time of the first work of the graduates, so that whether the working frequency of the graduate group is frequent or not can be judged according to the screened average working time of the graduates.
By the method, not only the working time of the first part of the graduate can be calculated, but also the working time of each part of the graduate can be calculated, and the working time of the individual graduate is not recommended to be researched when the method is applied, because if the graduate goes on a business trip or has relatively long holiday time (such as more than 3 days), the working time is influenced by the caliber in the judgment method of the monthly working place or the working place is changed, so that the judgment result is influenced, and therefore the average working time of the first part of the graduate which should be ended is recommended to be calculated from the group of the graduates. In addition, according to the method, graduate groups can be subdivided according to the gender and the school calendar and used for researching whether the frequency of jumping the groove is related to the school calendar and the gender.

Claims (6)

1. A graduate slot hopping frequency analysis method based on mobile signaling big data is characterized in that:
(1) data acquisition: acquiring the position and the access time information of the base station sector where each IMSI identification number is located by utilizing signaling data of a telecom operator, and cleaning the data;
(2) data preprocessing: carrying out interpolation compensation on missing signaling of an in-out base station;
(3) and (3) judging the residence: reading the IMSI at 21:00 to the next day 7:00, according to the track data obtained in the data acquisition step, establishing a statistical table of the information of the base stations visited in the residence time period corresponding to the IMSI, further counting the residence time of each base station, and determining the geographic position corresponding to the base station with the longest residence time as the residence place, wherein the residence place is called a daily residence place; in a natural month, the position where the daily residence is accumulated and stays most is judged as the monthly residence;
(4) the working area judges: reading the IMSI at 7: 00-19: 00, according to the track data obtained in the data acquisition step, establishing a statistical table of the information of the base stations accessed in all the working time periods corresponding to the IMSI, further counting the stay time of each base station, and determining the geographical position corresponding to the base station with the longest stay time as the working place of the base station, wherein the working place is called a daily working place; in a natural month, the position where the daily workplace is accumulated and stays most is judged as the monthly workplace;
(5) identification of the due graduates: the first step is to screen IMSI sets which accord with the graduate age range; secondly, screening IMSIs in the IMSI set, wherein the residence places of the IMSIs in any one month from February to June are within the range of a designated school; thirdly, IMSI of which the residential areas from September to November are not in the school range is screened from the results of the first two steps, and the IMSI set screened in the third step is the due graduate of the year under study;
(6) judging the working time length of the first part of the graduate work: according to the working place judging method, the change situation of the graduate monthly working place is analyzed, starting from the seventh month of the research year, from the time of first collection to the working place of the graduate to the time of first change of the working place, and the time interval is the working time of the first work of the graduate.
2. The method of claim 1 for frequency analysis of due graduate slot hopping based on big data of mobile signaling, characterized in that: the interpolation compensation in the step (3) means that if the user only enters a certain base station time and does not leave the base station time or only leaves the certain base station time and does not enter the base station time in the statistical time period, missing data is interpolated, and the interpolation time points are the starting time and the ending time of the statistical time period.
3. The method of claim 1 for frequency analysis of due graduate slot hopping based on big data of mobile signaling, characterized in that: the graduate age range in the first screening of step (5) is between 21 and 30 years of age.
4. The method of claim 1 for frequency analysis of due graduate slot hopping based on big data of mobile signaling, characterized in that: the time selection in the second step of screening in the step (5) can be two months from February to June, and the purpose of doing so is to reduce the calculated amount as much as possible on the basis of ensuring the accuracy.
5. The method of claim 1 for frequency analysis of due graduate slot hopping based on big data of mobile signaling, characterized in that: the time selection in the third step of the step (5) can be at least two months from September to November, and the selection time is continuous in time, so that the calculation amount is reduced as much as possible on the basis of ensuring the accuracy.
6. The method of claim 1 for frequency analysis of due graduate slot hopping based on big data of mobile signaling, characterized in that: the specific implementation method of the time interval of the monthly workplace change in the step (6) is that the working places of all graduates are identified from July, if the working place of the next month is not changed, the working time is added by one month in an accumulated mode, if the working place is changed, the work is considered to be changed, and the accumulated month number before the work is changed is the working time of one work.
CN202010003495.9A 2020-01-02 2020-01-02 Mobile signaling big data-based due graduate slot hopping frequency analysis method Active CN112541013B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010003495.9A CN112541013B (en) 2020-01-02 2020-01-02 Mobile signaling big data-based due graduate slot hopping frequency analysis method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010003495.9A CN112541013B (en) 2020-01-02 2020-01-02 Mobile signaling big data-based due graduate slot hopping frequency analysis method

Publications (2)

Publication Number Publication Date
CN112541013A true CN112541013A (en) 2021-03-23
CN112541013B CN112541013B (en) 2021-12-28

Family

ID=75013343

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010003495.9A Active CN112541013B (en) 2020-01-02 2020-01-02 Mobile signaling big data-based due graduate slot hopping frequency analysis method

Country Status (1)

Country Link
CN (1) CN112541013B (en)

Citations (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103942674A (en) * 2014-04-11 2014-07-23 北京工业大学 User-oriented method for extracting career information of fresh university graduates
CN104348635A (en) * 2013-07-24 2015-02-11 中国移动通信集团福建有限公司 Mobile user behavior analysis method and apparatus
CN105657666A (en) * 2016-03-31 2016-06-08 东南大学 Commercial employee group residence recognition method based on mobile phone positioning data
CN106503843A (en) * 2016-10-20 2017-03-15 上海萃图数字科技有限公司 A kind of regular public traffic line network optimization and method of adjustment based on mobile phone signaling data
CN107040894A (en) * 2017-04-21 2017-08-11 杭州市综合交通研究中心 A kind of resident trip OD acquisition methods based on mobile phone signaling data
US20180129960A1 (en) * 2016-11-10 2018-05-10 Facebook, Inc. Contact information confidence
CN109063116A (en) * 2018-07-27 2018-12-21 考拉征信服务有限公司 Data identification method, device, electronic equipment and computer readable storage medium
US20190174449A1 (en) * 2018-02-09 2019-06-06 Intel Corporation Technologies to authorize user equipment use of local area data network features and control the size of local area data network information in access and mobility management function
WO2019108133A1 (en) * 2017-11-30 2019-06-06 X0Pa Ai Pte Ltd Talent management platform
CN110418287A (en) * 2019-07-12 2019-11-05 重庆市交通规划研究院 Migrate recognition methods to inhabitants live based on mobile phone signaling

Patent Citations (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104348635A (en) * 2013-07-24 2015-02-11 中国移动通信集团福建有限公司 Mobile user behavior analysis method and apparatus
CN103942674A (en) * 2014-04-11 2014-07-23 北京工业大学 User-oriented method for extracting career information of fresh university graduates
CN105657666A (en) * 2016-03-31 2016-06-08 东南大学 Commercial employee group residence recognition method based on mobile phone positioning data
CN106503843A (en) * 2016-10-20 2017-03-15 上海萃图数字科技有限公司 A kind of regular public traffic line network optimization and method of adjustment based on mobile phone signaling data
US20180129960A1 (en) * 2016-11-10 2018-05-10 Facebook, Inc. Contact information confidence
CN107040894A (en) * 2017-04-21 2017-08-11 杭州市综合交通研究中心 A kind of resident trip OD acquisition methods based on mobile phone signaling data
WO2019108133A1 (en) * 2017-11-30 2019-06-06 X0Pa Ai Pte Ltd Talent management platform
US20190174449A1 (en) * 2018-02-09 2019-06-06 Intel Corporation Technologies to authorize user equipment use of local area data network features and control the size of local area data network information in access and mobility management function
CN109063116A (en) * 2018-07-27 2018-12-21 考拉征信服务有限公司 Data identification method, device, electronic equipment and computer readable storage medium
CN110418287A (en) * 2019-07-12 2019-11-05 重庆市交通规划研究院 Migrate recognition methods to inhabitants live based on mobile phone signaling

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
李诗雅: "广西本科毕业生就业质量评价研究", 《中国优秀硕士学位论文全文数据库 社会科学II辑》 *

Also Published As

Publication number Publication date
CN112541013B (en) 2021-12-28

Similar Documents

Publication Publication Date Title
Becker et al. A tale of one city: Using cellular network data for urban planning
CN110324787B (en) Method for acquiring occupational sites of mobile phone signaling data
CN111582948B (en) Individual behavior analysis method based on mobile phone signaling data and POI (Point of interest)
CN109583640A (en) A kind of Urban Traffic passenger flow attribute recognition approach based on multi-source location data
CN105513351A (en) Traffic travel characteristic data extraction method based on big data
Pan et al. Cellular-based data-extracting method for trip distribution
CN106464706A (en) Method and system for identifying significant locations through data obtainable from telecommunication network
CN107194525A (en) A kind of down town appraisal procedure based on mobile phone signaling
CN106547894A (en) The system and method that location tags are lived in duty is excavated based on mobile communication signaling big data
CN106339716A (en) Mobile trajectory similarity matching method based on weighted Euclidean distance
CN112001829B (en) Population distribution judging method based on mobile phone signaling data
CN107977673A (en) A kind of economically active population's recognition methods based on big data
Demissie et al. Inferring origin-destination flows using mobile phone data: A case study of Senegal
CN109842886A (en) Communication network subtracts frequency logout method and device
CN112512032B (en) Mobile phone signaling data-based external trip crowd identification method
CN109670540A (en) It is resident number variation tendency Forecasting Approach for Short-term in Passenger Transport Hub region based on kNN algorithm
CN110245773A (en) A kind of method that multi-source fact space-time predictor extracted and be included in interpretation of scheme application
CN111417075B (en) User workplace identification method based on mobile communication big data
CN112541013B (en) Mobile signaling big data-based due graduate slot hopping frequency analysis method
CN112738729A (en) Method and system for distinguishing visiting hometown visitor by mobile phone signaling data
CN114564552A (en) Territorial space planning method based on GIS
CN110958558A (en) Mobile big data-based mobile phone user space-time trajectory depicting method
CN112561759B (en) Graduate forward dynamic monitoring method based on mobile signaling big data
CN102638821A (en) Method for mining move modes of users on basis of mobile phone base station data
Charles-Edwards The estimation of temporary populations in Australia

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant