CN108391235B - Data analysis method and device - Google Patents

Info

Publication number
CN108391235B
Authority
CN
China
Prior art keywords
state
user
record
communication record
communication
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201810150323.7A
Other languages
Chinese (zh)
Other versions
CN108391235A (en)
Inventor
韩玉辉
冉萌
包宸曦
曹振
周诗雨
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
China United Network Communications Group Co Ltd
Original Assignee
China United Network Communications Group Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by China United Network Communications Group Co Ltd
Priority to CN201810150323.7A
Publication of CN108391235A
Application granted
Publication of CN108391235B

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04W WIRELESS COMMUNICATION NETWORKS
    • H04W4/00 Services specially adapted for wireless communication networks; Facilities therefor
    • H04W4/02 Services making use of location information
    • H04W4/029 Location-based management or tracking services
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06Q INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00 Systems or methods specially adapted for specific business sectors, e.g. utilities or tourism
    • G06Q50/10 Services
    • G06Q50/26 Government or public services

Abstract

The invention provides a data analysis method and device, relates to the field of communication, and aims to analyze a user's operation information at a fine granularity based on mobile communication data, thereby providing rich information for building geographic profiles. The method comprises the following steps: acquiring mobile communication data, wherein the mobile communication data corresponds to one user, and each communication record comprises a user identifier, a communication record timestamp, and position information; acquiring a corresponding geographic position according to the position information in each communication record; and determining one operation record of the user according to the communication record timestamps in the (n+1)th and nth communication records and the geographic positions respectively corresponding to the (n+1)th and nth communication records. The operation record comprises a user state, a departure point, an arrival point, a state duration, a trip number, and a sub-trip number, wherein the user state is either a resident state or a travel state.

Description

Data analysis method and device
Technical Field
The present invention relates to the field of communications, and in particular, to a data analysis method and apparatus.
Background
With the development of internet technology and communication technology, especially mobile communication and software technology, user analysis systems driven by mobile communication data have become feasible. In current applications of data analysis based on mobile communication data, demand is growing for support of decisions in city planning and traffic planning, and related technologies and analytical research are growing accordingly. However, existing patented technologies focus on analyzing user features and pay little attention to feature analysis of geographic region profiles. In research on user activity tracks and traffic modes, existing ways of recording user activity chains cannot present user behavior patterns at a fine granularity, and therefore cannot support more accurate analysis of user activity tracks.
Disclosure of Invention
The application provides a data analysis method and device that can analyze a user's operation information at a fine granularity based on mobile communication data, provide rich information for building geographic profiles, and thereby provide guidance for functional planning, traffic planning, and the like of a geographic area.
To achieve this purpose, the following technical solutions are adopted:
In a first aspect, the present application provides a data analysis method, which may include:
acquiring mobile communication data, wherein the mobile communication data corresponds to one user and comprises at least two communication records; each communication record comprises a user identifier, a communication record timestamp, and position information; and the communication records are arranged in chronological order of their communication record timestamps; acquiring a corresponding geographic position according to the position information in each communication record; and determining one operation record of the user according to the communication record timestamps in the (n+1)th and nth communication records and the geographic positions respectively corresponding to the (n+1)th and nth communication records; wherein the time interval between the (n+1)th communication record and the nth communication record is a recording time period; the operation record comprises a user state, a departure point, an arrival point, a state duration, a trip number, and a sub-trip number; and the user state is either a resident state or a travel state. If the user state is the resident state, the departure point is the departure point of the user in the current resident state, the arrival point is the arrival point of the user in the current recording time period, the state duration is the time for which the user has remained in the resident state, the trip number is the trip number of the user's previous travel state, and the sub-trip number is the sub-trip number of the last sub-trip of that previous travel state. If the user state is the travel state, the departure point is the departure point of the user in the current recording time period, the arrival point is the arrival point of the user in the current recording time period, the state duration is the time for which the user remains in the travel state in the current recording time period, the trip number is the trip number of the user's current travel state, and the sub-trip number is the sub-trip number of the user in the current recording time period.
In a second aspect, the present application provides a data analysis apparatus comprising an acquisition module and a determination module. The acquisition module is configured to acquire mobile communication data, wherein the mobile communication data corresponds to one user and comprises at least two communication records; each communication record comprises a user identifier, a communication record timestamp, and position information; and the communication records are arranged in chronological order of their communication record timestamps. The acquisition module is further configured to acquire a corresponding geographic position according to the position information in each communication record. The determination module is configured to determine one operation record of the user according to the communication record timestamps in the (n+1)th and nth communication records and the geographic positions respectively corresponding to the (n+1)th and nth communication records; wherein the time interval between the (n+1)th communication record and the nth communication record is a recording time period; the operation record comprises a user state, a departure point, an arrival point, a state duration, a trip number, and a sub-trip number; and the user state is either a resident state or a travel state. If the user state is the resident state, the departure point is the departure point of the user in the current resident state, the arrival point is the arrival point of the user in the current recording time period, the state duration is the time for which the user has remained in the resident state, the trip number is the trip number of the user's previous travel state, and the sub-trip number is the sub-trip number of the last sub-trip of that previous travel state. If the user state is the travel state, the departure point is the departure point of the user in the current recording time period, the arrival point is the arrival point of the user in the current recording time period, the state duration is the time for which the user remains in the travel state in the current recording time period, the trip number is the trip number of the user's current travel state, and the sub-trip number is the sub-trip number of the user in the current recording time period.
In a third aspect, the present application provides a data analysis apparatus, comprising: a processor, a transceiver, and a memory. Wherein the memory is configured to store computer executable instructions, and when the apparatus is running, the processor executes the computer executable instructions stored in the memory, so as to cause the apparatus to perform the data analysis method according to any one of the first aspect and various alternative implementations thereof.
In a fourth aspect, the present application provides a computer-readable storage medium having one or more programs stored therein, the one or more programs including computer-executable instructions, which when executed by a processor of the data analysis apparatus, cause the data analysis apparatus to perform the data analysis method of any one of the first aspect and its various alternative implementations.
Compared with prior-art approaches that analyze a user's operation information from mobile communication data, the data analysis method and device provided by the embodiments of the present invention yield more detailed operation information, can present the user's behavior patterns at a fine granularity, and provide richer information for city planning and traffic planning.
Drawings
FIG. 1 is a first schematic diagram of a data analysis method according to an embodiment of the present invention;
FIG. 2 is a second schematic diagram of a data analysis method according to an embodiment of the present invention;
FIG. 3 is a first schematic structural diagram of a data analysis apparatus according to an embodiment of the present invention;
FIG. 4 is a second schematic structural diagram of a data analysis apparatus according to an embodiment of the present invention.
Detailed Description
The data analysis method and apparatus provided by the embodiments of the present invention are described in detail below with reference to the accompanying drawings.
The term "and/or" herein merely describes an association between associated objects and indicates that three relationships may exist. For example, "A and/or B" may mean: A exists alone, A and B exist simultaneously, or B exists alone.
The terms "first" and "second" and the like in the description of the present invention and the drawings are used for distinguishing different objects or for distinguishing different processes for the same object, and are not used for describing a specific order of the objects.
Furthermore, the terms "comprising" and "having" and any variations thereof as referred to in the description of the invention are intended to cover non-exclusive inclusions. For example, a process, method, system, article, or apparatus that comprises a list of steps or elements is not limited to only those steps or elements but may alternatively include other steps or elements not expressly listed or inherent to such process, method, article, or apparatus.
It should be noted that, in the embodiments of the present invention, words such as "exemplary" or "for example" are used to indicate an example, illustration, or explanation. Any embodiment or design described as "exemplary" or "for example" in the embodiments of the present invention should not be construed as preferred or advantageous over other embodiments or designs. Rather, use of the word "exemplary" or "for example" is intended to present related concepts in a concrete fashion.
In the description of the present invention, the meaning of "a plurality" means two or more unless otherwise specified.
The data analysis method provided in the embodiments of the present invention may be applied to analyze communication data in a mobile communication network. The mobile communication network may be a fifth generation (5th generation, 5G) mobile communication network, a fourth generation (4th generation, 4G) mobile communication network (e.g., an Evolved Packet System (EPS)), or another actual mobile communication network; the present invention is not limited thereto.
An embodiment of the present invention provides a data analysis method applied to a data analysis device, and as shown in fig. 1, the method may include S101 to S103:
S101, acquiring mobile communication data.
Optionally, the data analysis device obtains mobile communication data from the mobile communication network; the mobile communication data may include control signaling and user plane event data. For example, the control signaling may be control signaling of the A interface and the Gb interface in a second generation (2nd generation, 2G) mobile communication network, control signaling of the Iu-CS interface and the Iu-PS interface in a third generation (3rd generation, 3G) mobile communication network, or control signaling of the S1-MME interface in a 4G network; the user plane event data may be user plane event data in a 2G, 3G, or 4G network. The embodiments of the present invention do not limit this.
Optionally, the control signaling includes a user identifier, a cell identifier, a communication recording timestamp, and the like; the user plane event data comprises user identification, user position longitude and latitude, communication recording time stamp and the like.
Optionally, the acquired mobile communication data is integrated according to the user identifier and the communication recording timestamp. Specifically, according to the user identification, the control signaling and the user plane event data of the same user are integrated into a group; combining the control signaling and the user plane event data with the same communication record timestamp in each group into a communication record; then, the communication records in each group are arranged in order of time from first to last according to the communication record time stamp. The integrated mobile communication data are divided into different groups according to different users, each group comprises at least two communication records, and the communication records are arranged according to the sequence of time from first to last; each communication record comprises a user identifier, a communication record timestamp and position information, wherein the position information comprises a cell identifier and user position longitude and latitude.
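The integration step described above can be sketched in Python as follows. This is only an illustrative sketch under stated assumptions, not the implementation of the embodiment: the function name `integrate` and the dictionary keys (`user_id`, `timestamp`, `cell_id`, `lon`, `lat`) are hypothetical, and rows are assumed to be already parsed from the control signaling and the user plane event data.

```python
from collections import defaultdict


def integrate(signaling, user_plane):
    """Group rows by user, merge rows sharing a timestamp, and sort by time.

    `signaling` rows are assumed to carry user_id, timestamp, cell_id;
    `user_plane` rows are assumed to carry user_id, timestamp, lon, lat.
    """
    groups = defaultdict(dict)  # user_id -> {timestamp: merged communication record}

    def record_for(row):
        # Fetch (or create) the communication record for this user and timestamp.
        return groups[row["user_id"]].setdefault(
            row["timestamp"],
            {"user_id": row["user_id"], "timestamp": row["timestamp"]},
        )

    for row in signaling:
        record_for(row)["cell_id"] = row["cell_id"]
    for row in user_plane:
        rec = record_for(row)
        rec["lon"], rec["lat"] = row["lon"], row["lat"]

    # One chronologically ordered list of communication records per user.
    return {uid: [by_ts[t] for t in sorted(by_ts)] for uid, by_ts in groups.items()}
```

Each value of the returned dictionary corresponds to one group of communication records for one user, ordered from earliest to latest.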
S102, acquiring a corresponding geographic position according to the position information in each communication record.
Specifically, the position information in a communication record is mapped to an actual geographic position. For example, Thiessen polygons may be generated from the position information in the communication records using a tool such as the geographic information system software ArcGIS, and a corresponding geographic map layer may be obtained, thereby yielding the correspondence between the position information in the communication records and actual geographic positions.
Optionally, the obtained information of the corresponding geographic location may be added to each communication record.
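As a rough illustration of what a Thiessen-polygon layer provides, note that a point lies in the Thiessen polygon of whichever generating site is nearest to it, so the polygon lookup can be approximated by a nearest-site search. The sketch below assumes planar coordinates and a hypothetical dictionary of site positions; it does not use ArcGIS and is not the procedure of the embodiment.

```python
import math


def nearest_site(point, sites):
    """Return the id of the site whose Thiessen polygon contains `point`.

    `sites` maps a hypothetical site id to planar (x, y) coordinates.
    The nearest generating site defines the containing Thiessen polygon.
    """
    return min(sites, key=lambda site_id: math.dist(point, sites[site_id]))
```

For example, with sites at (0, 0) and (10, 0), the point (2, 1) falls in the polygon of the first site.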
S103, determining the operation information of the user.
Optionally, the operation information of one user may be determined according to a group of communication records corresponding to each user.
Specifically, the operation information of each user includes at least one operation record, where each operation record includes a user state, a departure point, an arrival point, a state duration, a trip number, and a sub-trip number, and the user state includes a resident state or a travel state. Further, the operation track of each user can be obtained according to the operation information of each user.
Optionally, a geographical location range and a certain time range may be preset to screen the communication records of the user. And determining the communication records according with the preset geographical position range and the preset time range according to the geographical position corresponding to each communication record and the communication record time stamp in each communication record. According to the screened communication records, the operation track of a user in a preset time range and a preset geographic position range can be analyzed. Further, the user travel characteristics in the designated area and time period can be obtained by analyzing the travel tracks of all users in the predetermined time range and the predetermined geographic position range.
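The screening step can be sketched as a simple filter. The sketch assumes, purely for illustration, that the preset geographic range is a longitude/latitude rectangle and that each record carries hypothetical `lon`, `lat`, and `timestamp` keys; the embodiment does not prescribe the shape of the range.

```python
def screen_records(records, lon_range, lat_range, time_range):
    """Keep records whose position and timestamp fall inside the preset ranges."""
    (lon_min, lon_max), (lat_min, lat_max) = lon_range, lat_range
    t_min, t_max = time_range
    return [
        r for r in records
        if lon_min <= r["lon"] <= lon_max
        and lat_min <= r["lat"] <= lat_max
        and t_min <= r["timestamp"] <= t_max
    ]
```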
Optionally, as shown in fig. 2, the method for determining the operation information of one user according to a group of communication records corresponding to each user may include S1031 to S1035:
S1031, obtaining the nth communication record corresponding to one user.
Specifically, the nth communication record corresponding to one user is obtained; the communication record timestamp in the nth communication record is denoted $T_n$, and the geographic position corresponding to the nth communication record is denoted $R_n$.
For example, the 1st communication record of the user may be obtained first in chronological order, that is, n = 1.
Specifically, when n = 1, the operation information of the user contains one operation record, whose user state, departure point, arrival point, state duration, trip number, and sub-trip number take initial values. Illustratively, the user state is set to the resident state, the departure point and the arrival point are both the geographic position corresponding to the 1st communication record, the state duration is 0, and the trip number and the sub-trip number are both 0.
S1032, acquiring the (n + 1) th communication record corresponding to one user.
Specifically, the (n+1)th communication record corresponding to the user is obtained; the communication record timestamp in the (n+1)th communication record is denoted $T_{n+1}$, and the geographic position corresponding to the (n+1)th communication record is denoted $R_{n+1}$.
Illustratively, the 2nd communication record of the user is obtained in chronological order, in which case n = 1.
Specifically, the time interval between the (n+1)th communication record and the nth communication record is a recording time period. Illustratively, the duration of the nth recording time period is $T_{n+1} - T_n$.
S1033, determining the current user state according to the geographical positions respectively corresponding to the n +1 th communication record and the nth communication record; if the distance between the geographic position corresponding to the (n + 1) th communication record and the geographic position corresponding to the nth communication record is smaller than or equal to a first preset value, the current user state is a resident state, and S1034 is executed; if the distance between the geographic position corresponding to the (n + 1) th communication record and the geographic position corresponding to the nth communication record is greater than the first preset value, the current user state is a traveling state, and S1035 is executed.
Specifically, it is determined whether the distance between $R_{n+1}$ and $R_n$ is greater than the first preset value. If the distance between $R_{n+1}$ and $R_n$ is greater than the first preset value, the current user state is the travel state; if the distance between $R_{n+1}$ and $R_n$ is less than or equal to the first preset value, the current user state is the resident state.
The specific value of the first preset value can be set according to actual conditions, and when the moving distance of the user is greater than the first preset value, it is indicated that the cell where the user resides is changed. For example, according to actual device deployment, if distances between different cells in the target area range are all greater than or equal to 1 km, the first preset value may be set to 1 km.
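The decision in S1033 can be sketched as below. The embodiment does not say how the distance between two geographic positions is computed; the haversine great-circle distance used here, the (latitude, longitude) tuple layout, and the function names are assumptions made for illustration. The default threshold of 1 km follows the example above.

```python
import math

EARTH_RADIUS_KM = 6371.0  # mean Earth radius, an approximation


def haversine_km(a, b):
    """Great-circle distance in km between two (lat, lon) points in degrees."""
    lat1, lon1, lat2, lon2 = map(math.radians, (a[0], a[1], b[0], b[1]))
    h = (math.sin((lat2 - lat1) / 2) ** 2
         + math.cos(lat1) * math.cos(lat2) * math.sin((lon2 - lon1) / 2) ** 2)
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(h))


def classify_state(r_n, r_next, first_preset_km=1.0):
    """Return 'resident' if the move is within the first preset value, else 'travel'."""
    return "resident" if haversine_km(r_n, r_next) <= first_preset_km else "travel"
```

A move of 0.005 degrees of latitude (roughly half a kilometre) is classified as resident, whereas a move of 0.1 degrees (roughly 11 km) is classified as travel.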
S1034, determining the current operation record corresponding to the resident state according to the previous operation record, and then executing S1031.
Optionally, the current user state is a resident state, and whether the current resident state is newly resident may be determined according to the previous operation record.
Specifically, if the user state of the previous operation record is the travel state, the current resident state is a new resident state, and an operation record is added to the operation information of the user. In the added operation record, the user state is the resident state, the departure point is the geographic position $R_n$ corresponding to the nth communication record, the arrival point is the geographic position $R_{n+1}$ corresponding to the (n+1)th communication record, the state duration is $T_{n+1} - T_n$, the trip number is the trip number of the previous operation record, and the sub-trip number is the sub-trip number of the previous operation record. In the operation record corresponding to the new resident state, the departure point and the arrival point record where the user started and ended within the recording time period, that is, they reflect the change in the user's geographic position over the duration of the new resident state; the state duration represents the duration of the new resident state; and because a resident state does not correspond to a trip, the trip number and the sub-trip number are unchanged from the previous operation record and therefore record the trip number and sub-trip number of the preceding travel state.
If the user state of the previous operation record is the resident state, the current resident state is a continuation of that resident state, and the operation record corresponding to the previous resident state is updated. In the update, the user state and the departure point are unchanged, the arrival point is updated to the geographic position $R_{n+1}$ corresponding to the (n+1)th communication record, the state duration becomes the state duration of the previous operation record plus $T_{n+1} - T_n$, and the trip number and the sub-trip number are unchanged. In the updated operation record, the departure point is unchanged and the arrival point is the user's latest position, so the record reflects the change in the user's geographic position over the duration of the whole resident state; the state duration represents the duration of the whole resident state; and the trip number and the sub-trip number are those of the preceding travel state.
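The two branches of S1034 can be sketched as a single update function. The record layout (a dictionary with `state`, `departure`, `arrival`, `duration`, `trip`, `sub_trip`) and the function name are hypothetical; the sketch assumes the list of operation records is non-empty, which holds once the initial record for n = 1 has been created.

```python
def apply_resident(records, r_n, r_next, dt):
    """Apply S1034 for one recording time period of length dt (T_{n+1} - T_n)."""
    last = records[-1]
    if last["state"] == "travel":
        # New resident state: add a record that inherits the trip numbering.
        records.append({
            "state": "resident", "departure": r_n, "arrival": r_next,
            "duration": dt, "trip": last["trip"], "sub_trip": last["sub_trip"],
        })
    else:
        # Continuation: extend the existing resident record in place.
        last["arrival"] = r_next
        last["duration"] += dt
    return records
```

Because a continued resident state only updates the arrival point and the accumulated duration, several consecutive recording time periods collapse into one operation record.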
S1035, determining a current operation record corresponding to the travel state according to the previous operation record, and then executing S1031.
Optionally, the current user state is a travel state, and whether the current travel state corresponds to an independent trip or a sub-trip of a previous trip may be determined according to the user state and the state duration of the previous operation record.
Specifically, if the user state of the previous operation record is the resident state and its state duration is greater than the second preset value, the current travel state corresponds to an independent trip, and an operation record corresponding to the independent trip is added to the operation information of the user. In the added operation record, the user state is the travel state, the departure point is the geographic position $R_n$ corresponding to the nth communication record, the arrival point is the geographic position $R_{n+1}$ corresponding to the (n+1)th communication record, the state duration is $T_{n+1} - T_n$, the trip number is the trip number of the operation record corresponding to the previous independent trip plus 1, and the sub-trip number is a preset initial value. The second preset value can be set according to the actual application and represents the longest stay that still counts as a brief pause within a trip. For example, the second preset value may be set to 2 minutes. In the operation record corresponding to the independent trip, the departure point and the arrival point record where the user started and ended within the recording time period, that is, they reflect the change in the user's geographic position during the independent trip; the state duration represents the duration of the independent trip; the trip number increases with each independent trip of the user; and because a newly started independent trip has no earlier sub-trips, the sub-trip number is the preset initial value. Illustratively, the preset initial value is 0.
If the user state of the previous operation record is the resident state and its state duration is less than or equal to the second preset value, or the user state of the previous operation record is the travel state, the current travel state corresponds to a sub-trip of the previous independent trip, and an operation record corresponding to the sub-trip is added. In the added operation record, the user state is the travel state, the departure point is the geographic position $R_n$ corresponding to the nth communication record, the arrival point is the geographic position $R_{n+1}$ corresponding to the (n+1)th communication record, the state duration is $T_{n+1} - T_n$, the trip number is the trip number of the previous operation record, and the sub-trip number is the sub-trip number of the previous operation record plus 1. In the operation record corresponding to the sub-trip, the departure point and the arrival point record where the user started and ended within the recording time period, that is, they reflect the change in the user's geographic position during the sub-trip; the state duration represents the duration of the sub-trip; the trip number records the number of the independent trip that contains the sub-trip; and the sub-trip number identifies the sub-trip within that independent trip.
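The rule in S1035 for choosing between a new independent trip and a further sub-trip can be sketched as follows. The record layout and function name are hypothetical; the default second preset value of 120 seconds follows the 2-minute example above, and the sketch relies on a resident record carrying the trip number of the preceding travel state, so that adding 1 to it yields the next independent trip number.

```python
def apply_travel(records, r_n, r_next, dt, second_preset=120):
    """Apply S1035 for one recording time period of length dt (T_{n+1} - T_n)."""
    last = records[-1]
    if last["state"] == "resident" and last["duration"] > second_preset:
        # Long enough stay: start a new independent trip at sub-trip 0.
        trip, sub_trip = last["trip"] + 1, 0
    else:
        # Brief pause or ongoing travel: next sub-trip of the same trip.
        trip, sub_trip = last["trip"], last["sub_trip"] + 1
    records.append({
        "state": "travel", "departure": r_n, "arrival": r_next,
        "duration": dt, "trip": trip, "sub_trip": sub_trip,
    })
    return records
```

Unlike the resident case, every travelling recording time period produces its own operation record, which is what preserves the per-segment detail of a trip.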
Further, S1031 to S1035 are executed in a loop until all operation records have been obtained from the group of communication records corresponding to the user; together, these operation records constitute the operation information of the user. For example, a group of communication records of a user and the resulting operation information are shown in Table 1. It should be noted that Table 1 is provided only to aid understanding of the correspondence between a group of communication records of a user and the operation information of the user, and does not limit the operation information of the user.
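The loop over S1031 to S1035 can be sketched end to end as below. This is a minimal sketch under several assumptions that are not part of the embodiment: positions are planar (x, y) tuples compared with Euclidean distance, timestamps are plain numbers, and the record layout and function name are hypothetical. The sample data in the usage note is invented for illustration and is not the data of Table 1.

```python
import math


def build_operation_records(comm, first_preset, second_preset):
    """comm: chronologically sorted list of (timestamp, (x, y)); at least one entry."""
    _, r0 = comm[0]
    # Initial record for n = 1: resident, zero duration, trip and sub-trip 0.
    records = [dict(state="resident", departure=r0, arrival=r0,
                    duration=0, trip=0, sub_trip=0)]
    for (t_n, r_n), (t_next, r_next) in zip(comm, comm[1:]):
        dt = t_next - t_n
        last = records[-1]
        if math.dist(r_n, r_next) <= first_preset:       # S1033 -> S1034
            if last["state"] == "resident":              # continuation
                last["arrival"] = r_next
                last["duration"] += dt
                continue
            state, trip, sub_trip = "resident", last["trip"], last["sub_trip"]
        elif last["state"] == "resident" and last["duration"] > second_preset:
            state, trip, sub_trip = "travel", last["trip"] + 1, 0   # independent trip
        else:
            state, trip, sub_trip = "travel", last["trip"], last["sub_trip"] + 1
        records.append(dict(state=state, departure=r_n, arrival=r_next,
                            duration=dt, trip=trip, sub_trip=sub_trip))
    return records
```

With a first preset value of 1 distance unit and a second preset value of 5 seconds, the invented input `[(0, (0, 0)), (12, (0, 0.5)), (19, (0, 5)), (53, (0, 10)), (55, (0, 10.2)), (100, (0, 20))]` yields five operation records: a 12 s resident state, two sub-trips of trip 1, a 2 s pause that inherits the numbering of trip 1, and a third sub-trip of trip 1.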
TABLE 1
[Table 1 appears as an image (BDA0001579739590000081) in the original publication and is not reproduced here.]
As shown in Table 1, the user state in the operation information is either a resident state or a travel state. If the user state is the resident state, the departure point is the departure point of the user in the current resident state, the arrival point is the arrival point of the user in the current recording time period, the state duration is the time for which the user has remained in the resident state, the trip number is the trip number of the user's previous travel state, and the sub-trip number is the sub-trip number of the last sub-trip of that previous travel state. If the user state is the travel state, the departure point is the departure point of the user in the current recording time period, the arrival point is the arrival point of the user in the current recording time period, the state duration is the time for which the user remains in the travel state in the current recording time period, the trip number is the trip number of the user's current travel state, and the sub-trip number is the sub-trip number of the user in the current recording time period.
Specifically, in the operation record corresponding to the first communication record, the user state, the departure point, the arrival point, the state duration, the trip number, and the sub-trip number all take initial values. The operation information contains three resident states in total. In the first resident state, the position changes from position 1 to position 2 and the duration is 12 s; in the second resident state, the position changes from position 5 to position 6 and the duration is 2 s; in the third resident state, the position changes from position 7 to position 9 and the duration is 55 s. It can be seen that communication records 7 to 9 together determine the third resident state, which lasts for two recording time periods.
The operation information contains two travel states in total: the first travel state is trip 1, which includes 4 sub-trips, and the second travel state is trip 2. From the operation records, the position change and duration of each sub-trip can be obtained, as can the position change and duration of each travel state as a whole. For example, the position in sub-trip 3 of trip 1 changes from position 6 to position 7, and the duration of sub-trip 3 is 252 s; from the information of sub-trip 0, sub-trip 1, sub-trip 2, and sub-trip 3 of trip 1, it follows that the position change of trip 1 is from position 2 to position 7, and the duration of trip 1 is 7 s + 34 s + 20 s + 252 s = 313 s.
It should be noted that between sub-trip 2 and sub-trip 3 of trip 1 there is a resident state that lasts only 2 s, which is a brief pause within the travel state. According to the determination rule "if the user state of the previous operation record is the resident state and its state duration is less than or equal to the second preset value, the current travel state corresponds to a sub-trip of the previous trip", the travel following this resident state is sub-trip 3 of trip 1 rather than an independent trip 2.
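The aggregation of sub-trips into a whole trip can be sketched as follows. The durations (7 s, 34 s, 20 s, 252 s), the overall change from position 2 to position 7, and the endpoints of sub-trip 3 come from the description above; the intermediate positions of sub-trips 0 to 2, the record layout, and the function name are assumptions for illustration only.

```python
def summarize_trip(records, trip):
    """Combine the travel records of one trip into a departure, arrival and duration."""
    subs = [r for r in records if r["state"] == "travel" and r["trip"] == trip]
    return {
        "departure": subs[0]["departure"],
        "arrival": subs[-1]["arrival"],
        "duration": sum(r["duration"] for r in subs),
    }


# Sub-trips of trip 1; positions 3 and 4 are assumed, not taken from Table 1.
trip_1 = [
    {"state": "travel", "trip": 1, "sub_trip": 0, "departure": 2, "arrival": 3, "duration": 7},
    {"state": "travel", "trip": 1, "sub_trip": 1, "departure": 3, "arrival": 4, "duration": 34},
    {"state": "travel", "trip": 1, "sub_trip": 2, "departure": 4, "arrival": 5, "duration": 20},
    {"state": "resident", "trip": 1, "sub_trip": 2, "departure": 5, "arrival": 6, "duration": 2},
    {"state": "travel", "trip": 1, "sub_trip": 3, "departure": 6, "arrival": 7, "duration": 252},
]
summary = summarize_trip(trip_1, 1)
```

Only travel records are summed, so the 2 s pause between sub-trip 2 and sub-trip 3 does not count toward the 313 s trip duration, consistent with the arithmetic above.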
Further, after the operation information of at least one user is determined according to S101-S103, the user's movement track can be analyzed according to that operation information. For example, according to the duration of the resident state, resident states can be divided into long stays, short stays, and ultra-short stays, which reflect the user's purpose in reaching a given address. A long stay can reflect information such as where the user lives or works; a short stay can reflect that the user made a temporary trip to an address; and an ultra-short stay can reflect that the user encountered an incidental event at an address while traveling. As another example, the departure point and arrival point of the user's travel state reflect the relationship between two positions in a trip. First, the source and destination addresses of user trips can be clustered to reflect the travel intensity between geographic areas. Second, the departure point, arrival point, and state duration of a trip reflect the time the user spent traveling, the travel speed, and the main mode of transport chosen, and hence the convenience of transport between geographic areas and the mix of transport modes. The sub-trips mainly reflect details within a trip: on the one hand, they reflect the sequence of positions the user passed through, and fitting this sequence to the road network yields the route and transport mode chosen in each sub-trip; on the other hand, they reflect the travel time and travel speed of each sub-trip. Third, the trip and sub-trip information of the travel state reflects the number of sub-trips in a trip, the travel characteristics of each segment, and the characteristics of the intermediate stays, and thus the rich information contained in the user's travel.
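The classification of resident states by duration, and the derivation of a travel speed from a trip's distance and state duration, might look like the sketch below. The text does not give any thresholds, so `short_threshold_s` and `long_threshold_s` are purely illustrative assumptions, as are the function names.

```python
def classify_stay(duration_s: float,
                  short_threshold_s: float = 300.0,
                  long_threshold_s: float = 7200.0) -> str:
    """Label a resident state as 'long', 'short', or 'ultra-short'.

    Both thresholds are hypothetical; the source only names the categories.
    """
    if duration_s < 0:
        raise ValueError("duration must be non-negative")
    if duration_s >= long_threshold_s:
        return "long"         # e.g. home or workplace
    if duration_s >= short_threshold_s:
        return "short"        # e.g. a temporary destination
    return "ultra-short"      # e.g. an incidental event on the way


def average_speed_mps(distance_m: float, duration_s: float) -> float:
    """Average travel speed of a trip or sub-trip in metres per second."""
    if duration_s <= 0:
        raise ValueError("duration must be positive")
    return distance_m / duration_s
```

With these assumed thresholds, the 2 s and 55 s stays from the earlier example would both be labelled ultra-short; in practice the thresholds would need to be tuned to the sampling interval of the communication records.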
The rich operation information acquired for the user provides rich content for constructing a geographic profile. When the geographic profile is sufficiently clear, it can further guide functional planning of the geographic area (such as reserving residences, workplaces, temporary destinations, and destinations for incidental events) and traffic planning (such as road network resource allocation, route planning, and supporting facilities for subway, bus, walking, cycling, taxi, self-driving, parking lots, and ride-hailing).
Compared with the prior art, in which the operation information of the user is analyzed according to the mobile communication data, the data analysis method provided by the embodiment of the invention analyzes the user's operation information in more detail, presents the user's behavior patterns at a finer granularity, and provides richer information for city planning and traffic planning.
The above description mainly introduces the solution provided by the embodiment of the present invention from the perspective of a data analysis device. It is to be understood that, in order to realize the above functions, the data analysis device includes hardware structures and/or software modules corresponding to the respective functions. Those skilled in the art will readily appreciate that, in combination with the exemplary data analysis devices and algorithm steps described in the embodiments disclosed herein, the present invention can be implemented in hardware or in a combination of hardware and computer software. Whether a function is performed by hardware or by computer software driving hardware depends upon the particular application and the design constraints imposed on the solution. Skilled artisans may implement the described functionality in varying ways for each particular application, but such implementation decisions should not be interpreted as causing a departure from the scope of the present invention.
In the embodiment of the present invention, the data analysis device may be divided into the functional modules or the functional units according to the above method examples, for example, each functional module or functional unit may be divided corresponding to each function, or two or more functions may be integrated into one processing module. The integrated module may be implemented in a form of hardware, or may be implemented in a form of a software functional module or a functional unit. The division of the modules or units in the embodiments of the present invention is schematic, and is only a logic function division, and there may be another division manner in actual implementation.
In an embodiment of the present invention, a data analysis apparatus is provided, and fig. 3 illustrates a schematic diagram of a possible structure of the data analysis apparatus according to the foregoing embodiment, in a case where each functional module is divided according to each function. The data analysis apparatus includes an acquisition module 301 and a determination module 302.
The acquisition module 301 is configured to acquire mobile communication data, where the mobile communication data corresponds to a user and includes at least two communication records; each communication record includes a user identifier, a communication record timestamp, and position information; the communication records are arranged in chronological order according to their communication record timestamps.
The acquisition module 301 is further configured to obtain a corresponding geographic location according to the position information in each communication record.
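A minimal sketch of the two acquisition steps, under stated assumptions: the `CommRecord` type, the `CELL_LOCATIONS` lookup table, and the rule of preferring a reported latitude/longitude over a cell lookup are all hypothetical, since the text does not specify how position information is resolved to a geographic location.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional, Tuple

Position = Tuple[float, float]  # (latitude, longitude) in degrees


@dataclass(frozen=True)
class CommRecord:
    user_id: str
    timestamp: float                # seconds since some epoch (assumed unit)
    cell_id: Optional[str] = None   # cell identification, if present
    lat: Optional[float] = None     # user latitude, if present
    lon: Optional[float] = None     # user longitude, if present


# Hypothetical cell-to-location table; a real one would come from the operator.
CELL_LOCATIONS: Dict[str, Position] = {
    "cell-A": (39.9042, 116.4074),
    "cell-B": (39.9152, 116.4039),
}


def sort_by_timestamp(records: List[CommRecord]) -> List[CommRecord]:
    """Arrange the communication records from earliest to latest."""
    return sorted(records, key=lambda r: r.timestamp)


def resolve_position(rec: CommRecord, cells: Dict[str, Position]) -> Position:
    """Map a record's position information to a geographic location."""
    if rec.lat is not None and rec.lon is not None:
        return (rec.lat, rec.lon)       # assumed preference: explicit coordinates
    if rec.cell_id is not None and rec.cell_id in cells:
        return cells[rec.cell_id]       # fall back to the cell's location
    raise LookupError(f"cannot resolve a position for record at {rec.timestamp}")
```

Keeping resolution separate from sorting means the later state-determination step can work on plain `(latitude, longitude)` pairs regardless of which kind of position information a record carried.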
The determining module 302 is configured to determine one operation record of the user according to the communication record timestamps in the (n + 1)th communication record and the nth communication record and the geographic positions corresponding to the (n + 1)th communication record and the nth communication record, respectively; wherein the time interval between the (n + 1)th communication record and the nth communication record is a recording time period; the operation record comprises a user state, a departure point, an arrival point, a state duration, a trip number, and a sub-trip number; wherein the user state comprises a resident state or a travel state; if the user state is the resident state, the departure point is the departure point of the user in the current resident state, the arrival point is the arrival point of the user in the current recording time period, the state duration is the time for which the user has remained in the resident state, the trip number is the trip number of the user in the previous travel state, and the sub-trip number is the sub-trip number of the last sub-trip of the previous travel state; if the user state is the travel state, the departure point is the departure point of the user in the current recording time period, the arrival point is the arrival point of the user in the current recording time period, the state duration is the time for which the user remains in the travel state in the current recording time period, the trip number is the trip number of the user in the current travel state, and the sub-trip number is the sub-trip number of the user in the current recording time period.
Optionally, the determining module 302 determines one operation record of the user according to the communication record timestamp in the (n + 1) th communication record and the nth communication record and the geographic position corresponding to the (n + 1) th communication record and the nth communication record respectively, and specifically includes: determining the current user state according to the geographical positions respectively corresponding to the n +1 th communication record and the nth communication record; and determining the current operation record according to the current user state and the last operation record.
Optionally, the determining module 302 determines the current user state according to the geographic positions respectively corresponding to the n +1 th communication record and the nth communication record, and specifically includes: the determining module 302 determines whether the distance between the geographic position corresponding to the (n + 1) th communication record and the geographic position corresponding to the nth communication record is greater than a first preset value; if the determining module 302 determines that the distance between the geographic position corresponding to the (n + 1) th communication record and the geographic position corresponding to the nth communication record is greater than a first preset value, determining that the current user state is a traveling state; if the determining module 302 determines that the distance between the geographic position corresponding to the (n + 1) th communication record and the geographic position corresponding to the nth communication record is less than or equal to a first preset value, it is determined that the current user state is the resident state.
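The distance test above can be sketched as follows. The text does not say how the distance between two geographic positions is computed, so the haversine great-circle distance used here is an assumption, as are the function names and the 200 m default for the first preset value.

```python
import math
from typing import Tuple

Position = Tuple[float, float]  # (latitude, longitude) in degrees

EARTH_RADIUS_M = 6_371_000.0  # mean Earth radius


def haversine_m(a: Position, b: Position) -> float:
    """Great-circle distance in metres between two (lat, lon) positions."""
    lat1, lon1 = math.radians(a[0]), math.radians(a[1])
    lat2, lon2 = math.radians(b[0]), math.radians(b[1])
    dlat, dlon = lat2 - lat1, lon2 - lon1
    h = (math.sin(dlat / 2) ** 2
         + math.cos(lat1) * math.cos(lat2) * math.sin(dlon / 2) ** 2)
    # Clamp to guard against rounding slightly above 1.0.
    return 2 * EARTH_RADIUS_M * math.asin(math.sqrt(min(1.0, h)))


def user_state(pos_n: Position, pos_n1: Position,
               first_preset_m: float = 200.0) -> str:
    """Return 'travel' if the two positions are farther apart than the
    first preset value (hypothetical default), otherwise 'resident'."""
    return "travel" if haversine_m(pos_n, pos_n1) > first_preset_m else "resident"
```

The comparison is strict, matching the rule in the text: a distance exactly equal to the first preset value counts as the resident state.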
Optionally, the determining module 302 determines the current operation record according to the current user state and the previous operation record, which specifically includes: if the determining module 302 determines that the current user state is the resident state and the user state of the previous operation record is the travel state, adding an operation record corresponding to the resident state; if the determining module 302 determines that the current user state is the resident state and the user state of the previous operation record is the resident state, updating the operation record corresponding to the previous resident state; if the determining module 302 determines that the current user state is the travel state, determining the current operation record according to the user state and the state duration of the previous operation record, as follows: if the determining module 302 determines that the user state of the previous operation record is the resident state and the state duration is greater than the second preset value, adding an operation record corresponding to an independent trip; if the determining module 302 determines that the user state of the previous operation record is the resident state and the state duration is less than or equal to the second preset value, or that the user state of the previous operation record is the travel state, adding an operation record corresponding to a sub-trip.
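The record-update rules above can be read as a small state machine. The sketch below is one possible rendering, not the patented implementation: the `OperationRecord` type and `update_records` function are hypothetical, the initial trip and sub-trip numbers of 0 are assumed, the handling of an empty record list is assumed, and the 10 s second preset value is chosen only so that the 2 s stay from the earlier example counts as a short stay.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class OperationRecord:
    state: str        # "resident" or "travel"
    departure: int    # position label
    arrival: int      # position label
    duration_s: float
    trip_no: int
    sub_trip_no: int


def update_records(records: List[OperationRecord], state: str,
                   departure: int, arrival: int, period_s: float,
                   second_preset_s: float = 10.0) -> List[OperationRecord]:
    """Apply one recording time period to the list of operation records."""
    prev = records[-1] if records else None
    if state == "resident":
        if prev is not None and prev.state == "resident":
            # Ongoing stay: keep its departure point, extend arrival and duration.
            prev.arrival = arrival
            prev.duration_s += period_s
        else:
            # New stay: inherit the numbers of the previous travel state
            # (assumed initial values 0/0 when there is no previous record).
            trip_no = prev.trip_no if prev else 0
            sub_no = prev.sub_trip_no if prev else 0
            records.append(OperationRecord("resident", departure, arrival,
                                           period_s, trip_no, sub_no))
    elif state == "travel":
        long_stay = (prev is not None and prev.state == "resident"
                     and prev.duration_s > second_preset_s)
        if prev is None or long_stay:
            # Independent trip: next trip number, sub-trip numbering restarts.
            trip_no = (prev.trip_no if prev else 0) + 1
            records.append(OperationRecord("travel", departure, arrival,
                                           period_s, trip_no, 0))
        else:
            # Short stay or continued travel: next sub-trip of the same trip.
            records.append(OperationRecord("travel", departure, arrival,
                                           period_s, prev.trip_no,
                                           prev.sub_trip_no + 1))
    else:
        raise ValueError(f"unknown state: {state}")
    return records


# Replays the example from the description; intermediate positions, the
# 30 s + 25 s split of the 55 s stay, and the final period are assumed.
periods = [
    ("resident", 1, 2, 12), ("travel", 2, 3, 7), ("travel", 3, 4, 34),
    ("travel", 4, 5, 20), ("resident", 5, 6, 2), ("travel", 6, 7, 252),
    ("resident", 7, 8, 30), ("resident", 8, 9, 25), ("travel", 9, 10, 40),
]
records: List[OperationRecord] = []
for st, dep, arr, dur in periods:
    update_records(records, st, dep, arr, dur)
```

Replaying these periods produces four sub-trips numbered 0 to 3 for trip 1, a single 55 s resident record spanning positions 7 to 9, and a new trip 2 after it, which is consistent with the example in the description.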
Optionally, the determining module 302 is further configured to determine, after the acquisition module 301 obtains the corresponding geographic position according to the position information in each communication record, the communication records that fall within a preset geographic position range and a preset time range, according to the geographic position corresponding to each communication record and the communication record timestamp in each communication record.
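This pre-filtering step might be sketched as below. The text does not define the shape of the preset geographic position range, so a latitude/longitude bounding box is assumed here; the dictionary keys `position` and `timestamp` and the function names are likewise hypothetical.

```python
from typing import Dict, List, Tuple

BBox = Tuple[float, float, float, float]  # (min_lat, min_lon, max_lat, max_lon)
TimeRange = Tuple[float, float]           # (start, end), inclusive


def in_scope(position: Tuple[float, float], timestamp: float,
             bbox: BBox, time_range: TimeRange) -> bool:
    """True if the position lies inside the box and the timestamp in the range."""
    lat, lon = position
    min_lat, min_lon, max_lat, max_lon = bbox
    start, end = time_range
    return (min_lat <= lat <= max_lat
            and min_lon <= lon <= max_lon
            and start <= timestamp <= end)


def filter_records(records: List[Dict], bbox: BBox,
                   time_range: TimeRange) -> List[Dict]:
    """Keep only the records that meet both preset ranges, preserving order."""
    return [r for r in records
            if in_scope(r["position"], r["timestamp"], bbox, time_range)]
```

Because the filter preserves order, records that were already sorted by timestamp remain sorted for the subsequent pairwise comparison of the nth and (n + 1)th records.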
Optionally, the location information in the communication record includes: cell identification and user location latitude and longitude.
Compared with the prior art, in which the operation information of the user is analyzed according to the mobile communication data, the data analysis device provided by the embodiment of the invention analyzes the user's operation information in more detail, presents the user's behavior patterns at a finer granularity, and provides richer information for city planning and traffic planning.
Fig. 4 shows a schematic diagram of a possible configuration of the data analysis device according to the exemplary embodiment described above, in the case of an integrated unit. The data analysis device includes: a processing unit 402 and a communication unit 403. The processing unit 402 is used for controlling and managing the actions of the data analysis apparatus, for example, performing the steps performed by the above-described acquisition module 301, determination module 302, and/or other processes for performing the techniques described herein. The communication unit 403 is used to support communication of the data analysis apparatus with other network entities. The data analysis device may further comprise a memory unit 401 and a bus 404, the memory unit 401 being adapted to store program codes and data of the data analysis device.
The processing unit 402 may be, for example, a processor or a controller in the data analysis device. The processor or controller may be a central processing unit, a general-purpose processor, a digital signal processor (DSP), an application-specific integrated circuit, a field-programmable gate array or other programmable logic device, a transistor logic device, a hardware component, or any combination thereof, and may implement or execute the various exemplary logical blocks, modules, and circuits described in connection with the present disclosure. The processor may also be a combination that implements computing functions, for example a combination of one or more microprocessors, or a combination of a DSP and a microprocessor.
The communication unit 403 may be a transceiver, a transceiver circuit, a communication interface, or the like in the data analysis apparatus.
The storage unit 401 may be a memory or the like in the data analysis apparatus, and the memory may include a volatile memory such as a random access memory; the memory may also include non-volatile memory, such as read-only memory, flash memory, a hard disk, or a solid state disk; the memory may also comprise a combination of memories of the kind described above.
The bus 404 may be an Extended Industry Standard Architecture (EISA) bus or the like. The bus 404 may be divided into an address bus, a data bus, a control bus, and the like. For ease of illustration, only one thick line is shown in FIG. 4, but this does not indicate only one bus or one type of bus.
Embodiments of the present invention further provide a computer-readable storage medium, where one or more programs are stored in the computer-readable storage medium, where the one or more programs include instructions, and when the processor of the data analysis apparatus executes the instructions, the data analysis apparatus executes each step performed by the data analysis apparatus in the method flow shown in the foregoing method embodiments.
The steps of a method or algorithm described in connection with the disclosure herein may be embodied in hardware or in software instructions executed by a processor. The software instructions may consist of corresponding software modules that may be stored in RAM, flash memory, ROM, Erasable Programmable Read Only Memory (EPROM), Electrically Erasable Programmable Read Only Memory (EEPROM), registers, a hard disk, a removable hard disk, a compact disc read only memory (CD-ROM), or any other form of storage medium known in the art. An exemplary storage medium is coupled to the processor such that the processor can read information from, and write information to, the storage medium. Of course, the storage medium may also be integral to the processor. The processor and the storage medium may reside in an ASIC.
Through the above description of the embodiments, it is clear to those skilled in the art that, for convenience and simplicity of description, the foregoing division of the functional modules is merely used as an example, and in practical applications, the above function distribution may be completed by different functional modules according to needs, that is, the internal structure of the device may be divided into different functional modules to complete all or part of the above described functions. For the specific working processes of the system, the apparatus and the unit described above, reference may be made to the corresponding processes in the foregoing method embodiments, and details are not described here again.
In the several embodiments provided in the present application, it should be understood that the disclosed system, apparatus and method may be implemented in other manners. For example, the above-described apparatus embodiments are merely illustrative, and for example, the division of the modules or units is only one logical division, and there may be other divisions when actually implemented, for example, a plurality of units or components may be combined or may be integrated into another system, or some features may be omitted, or not executed. In addition, the shown or discussed mutual coupling or direct coupling or communication connection may be an indirect coupling or communication connection through some interfaces, devices or units, and may be in an electrical, mechanical or other form.
The units described as separate parts may or may not be physically separate, and parts displayed as units may or may not be physical units, may be located in one place, or may be distributed on a plurality of network units. Some or all of the units can be selected according to actual needs to achieve the purpose of the solution of the embodiment.
In addition, functional units in the embodiments of the present invention may be integrated into one processing unit, or each unit may exist alone physically, or two or more units are integrated into one unit. The integrated unit can be realized in a form of hardware, and can also be realized in a form of a software functional unit.
The integrated unit, if implemented in the form of a software functional unit and sold or used as a stand-alone product, may be stored in a computer readable storage medium. Based on such understanding, the technical solution of the present invention may be embodied in the form of a software product, which is stored in a storage medium and includes instructions for causing a computer device (which may be a personal computer, a server, or a network device) or a processor to execute all or part of the steps of the method according to the embodiments of the present invention. And the aforementioned storage medium includes: flash memory, removable hard drive, read only memory, random access memory, magnetic or optical disk, and the like.
The above description is only for the specific embodiments of the present invention, but the scope of the present invention is not limited thereto, and any person skilled in the art can easily conceive of the changes or substitutions within the technical scope of the present invention, and all the changes or substitutions should be covered within the scope of the present invention. Therefore, the protection scope of the present invention shall be subject to the protection scope of the claims.

Claims (7)

1. A method of data analysis, comprising:
acquiring mobile communication data, wherein the mobile communication data corresponds to a user and comprises at least two communication records; each communication record comprises a user identifier, a communication record timestamp and position information; the communication records are arranged according to the communication record time stamps in the sequence from first to last;
acquiring a corresponding geographic position according to the position information in each communication record;
determining an operation record of the user according to the communication record timestamps in the (n + 1)th communication record and the nth communication record and the geographic positions corresponding to the (n + 1)th communication record and the nth communication record respectively; wherein the time interval between the (n + 1)th communication record and the nth communication record is a recording time period; the operation record comprises a user state, a departure point, an arrival point, a state duration, a trip number and a sub-trip number;
wherein the user state comprises a resident state or a travel state; if the user state is the resident state, the departure point is the departure point of the user in the current resident state, the arrival point is the arrival point of the user in the current recording time period, the state duration is the time for which the user has remained in the resident state, the trip number is the trip number of the user in the previous travel state, and the sub-trip number is the sub-trip number of the last sub-trip of the previous travel state; if the user state is the travel state, the departure point is the departure point of the user in the current recording time period, the arrival point is the arrival point of the user in the current recording time period, the state duration is the time for which the user remains in the travel state in the current recording time period, the trip number is the trip number of the user in the current travel state, and the sub-trip number is the sub-trip number of the user in the current recording time period;
wherein determining an operation record of the user according to the communication record timestamps in the (n + 1)th communication record and the nth communication record and the geographic positions corresponding to the (n + 1)th communication record and the nth communication record respectively comprises: determining the current user state according to the geographic positions respectively corresponding to the (n + 1)th communication record and the nth communication record; and determining a current operation record according to the current user state and the previous operation record;
the determining the current user state according to the geographical positions respectively corresponding to the n +1 th communication record and the nth communication record includes: if the distance between the geographic position corresponding to the (n + 1) th communication record and the geographic position corresponding to the nth communication record is greater than a first preset value, the current user state is a traveling state; if the distance between the geographic position corresponding to the (n + 1) th communication record and the geographic position corresponding to the nth communication record is smaller than or equal to a first preset value, the current user state is a resident state;
wherein determining the current operation record according to the current user state and the previous operation record comprises: if the current user state is the resident state and the user state of the previous operation record is the travel state, adding an operation record corresponding to the resident state; if the current user state is the resident state and the user state of the previous operation record is the resident state, updating the operation record corresponding to the previous resident state;
if the current user state is the travel state, determining the current operation record according to the user state and the state duration of the previous operation record: if the user state of the previous operation record is the resident state and the state duration is greater than a second preset value, adding an operation record corresponding to an independent trip; and if the user state of the previous operation record is the resident state and the state duration is less than or equal to the second preset value, or the user state of the previous operation record is the travel state, adding an operation record corresponding to a sub-trip.
2. The method of claim 1, wherein after obtaining the corresponding geographic location according to the location information in each of the communication records, the method further comprises:
and determining the communication records according with a preset geographical position range and a preset time range according to the geographical position corresponding to each communication record and the communication record time stamp in each communication record.
3. The method of claim 1, wherein the location information in the communication record comprises: cell identification and user location latitude and longitude.
4. A data analysis apparatus, comprising:
an acquisition module, configured to acquire mobile communication data, wherein the mobile communication data corresponds to a user and comprises at least two communication records; each communication record comprises a user identifier, a communication record timestamp and position information; the communication records are arranged in chronological order according to the communication record timestamps;
the acquisition module is further used for acquiring a corresponding geographic position according to the position information in each communication record;
a determining module, configured to determine one operation record of the user according to the communication record timestamps in the (n + 1)th communication record and the nth communication record and the geographic positions corresponding to the (n + 1)th communication record and the nth communication record respectively; wherein the time interval between the (n + 1)th communication record and the nth communication record is a recording time period; the operation record comprises a user state, a departure point, an arrival point, a state duration, a trip number and a sub-trip number;
wherein the user state comprises a resident state or a travel state; if the user state is the resident state, the departure point is the departure point of the user in the current resident state, the arrival point is the arrival point of the user in the current recording time period, the state duration is the time for which the user has remained in the resident state, the trip number is the trip number of the user in the previous travel state, and the sub-trip number is the sub-trip number of the last sub-trip of the previous travel state; if the user state is the travel state, the departure point is the departure point of the user in the current recording time period, the arrival point is the arrival point of the user in the current recording time period, the state duration is the time for which the user remains in the travel state in the current recording time period, the trip number is the trip number of the user in the current travel state, and the sub-trip number is the sub-trip number of the user in the current recording time period;
the determining module determines an operation record of the user according to the communication record timestamps in the (n + 1)th communication record and the nth communication record and the geographic positions corresponding to the (n + 1)th communication record and the nth communication record respectively, which specifically includes: determining the current user state according to the geographic positions respectively corresponding to the (n + 1)th communication record and the nth communication record; and determining a current operation record according to the current user state and the previous operation record;
the determining module determines the current user state according to the geographical positions respectively corresponding to the (n + 1) th communication record and the nth communication record, and specifically includes: the determining module determines whether the distance between the geographic position corresponding to the (n + 1) th communication record and the geographic position corresponding to the nth communication record is greater than a first preset value; if the determining module determines that the distance between the geographic position corresponding to the (n + 1) th communication record and the geographic position corresponding to the nth communication record is greater than a first preset value, determining that the current user state is a traveling state; if the determining module determines that the distance between the geographic position corresponding to the (n + 1) th communication record and the geographic position corresponding to the nth communication record is smaller than or equal to a first preset value, determining that the current user state is a resident state;
the determining module determines a current operation record according to the current user state and a previous operation record, and specifically includes: if the determining module determines that the current user state is the resident state and the user state of the last operation record is the traveling state, adding an operation record corresponding to the resident state; if the determining module determines that the current user state is the resident state and the user state of the last operation record is the resident state, updating the operation record corresponding to the last resident state;
if the determining module determines that the current user state is the travel state, determining the current operation record according to the user state and the state duration of the previous operation record: if the determining module determines that the user state of the previous operation record is the resident state and the state duration is greater than a second preset value, adding an operation record corresponding to an independent trip; and if the determining module determines that the user state of the previous operation record is the resident state and the state duration is less than or equal to the second preset value, or that the user state of the previous operation record is the travel state, adding an operation record corresponding to a sub-trip.
5. The apparatus of claim 4,
the determining module is further configured to determine, after the acquisition module obtains the corresponding geographic position according to the position information in each communication record, the communication records that fall within a preset geographic position range and a preset time range, according to the geographic position corresponding to each communication record and the communication record timestamp in each communication record.
6. The apparatus of claim 4, wherein the location information in the communication record comprises: cell identification and user location latitude and longitude.
7. A computer-readable storage medium comprising computer instructions which, when run on a data analysis apparatus, cause the data analysis apparatus to perform a data analysis method as claimed in any one of claims 1 to 3.
CN201810150323.7A 2018-02-13 2018-02-13 Data analysis method and device Active CN108391235B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810150323.7A CN108391235B (en) 2018-02-13 2018-02-13 Data analysis method and device

Publications (2)

Publication Number Publication Date
CN108391235A CN108391235A (en) 2018-08-10
CN108391235B true CN108391235B (en) 2020-12-22

Family

ID=63069692

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810150323.7A Active CN108391235B (en) 2018-02-13 2018-02-13 Data analysis method and device

Country Status (1)

Country Link
CN (1) CN108391235B (en)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109948826A (en) * 2019-01-03 2019-06-28 阿里巴巴集团控股有限公司 Method, client, server and the electronic equipment of trip characteristics analysis
CN111476409B (en) * 2020-03-30 2023-07-18 海南太美航空股份有限公司 Prediction method, system and equipment for opening new airlines
CN112417273B (en) * 2020-11-17 2022-04-19 平安科技(深圳)有限公司 Region portrait image generation method, region portrait image generation device, computer equipment and storage medium

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101827180A (en) * 2010-01-27 2010-09-08 深圳市同洲电子股份有限公司 Ticket based method, device and system for determining activity range of mobile terminal
CN102467771A (en) * 2010-10-29 2012-05-23 国际商业机器公司 System and method for recognizing incidence relation of smart card and mobile telephone
CN103516855A (en) * 2012-06-20 2014-01-15 鸿富锦精密工业(深圳)有限公司 Electronic device with telephone number dynamic sorting function
CN104717743A (en) * 2013-12-16 2015-06-17 蓝燕君 Mobile terminal tracking method and system based on signaling analysis
CN107121690A (en) * 2017-03-31 2017-09-01 景致惠通工程咨询(武汉)有限公司 A kind of dwell point recognition methods and device based on parameter of doing more physical exercises

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20100191446A1 (en) * 2009-01-28 2010-07-29 Gm Global Technology Operations, Inc. Systems and methods for predictive engine re-starting and predictive neutral/idle operation of a vehicle

Also Published As

Publication number Publication date
CN108391235A (en) 2018-08-10

Similar Documents

Publication Publication Date Title
JP2019537757A (en) System and method for displaying vehicle movement on a map
US20190147368A1 (en) Parking availability predictor
CN108391235B (en) Data analysis method and device
RU2014109432A (en) METHOD FOR READING VEHICLE REGISTRATION NUMBERS
CN104318477A (en) Vehicle insurance evaluation method based on big data
US10699575B2 (en) Parking availability monitor for a non-demarcated parking zone
CN107615345A (en) Auxiliary device for moving, mobile auxiliary server and mobile accessory system
Schreieck et al. Improving urban transportation: an open platform for digital mobility services
CN111680102A (en) Positioning data processing method based on artificial intelligence and related equipment
CN111739294B (en) Road condition information collection method, device, equipment and storage medium
Pramanik Carpooling solutions using machine learning tools
CN112509317B (en) Bus real-time arrival prediction method, device and equipment based on machine learning algorithm
Visan et al. Towards intelligent public transport systems in Smart Cities; Collaborative decisions to be made
Noulas et al. Developing and deploying a taxi price comparison mobile app in the wild: Insights and challenges
CN111858806A (en) Passenger travel track detection method, device, equipment and storage medium
CN112069635A (en) Battery replacement cabinet deployment method, device, medium and electronic equipment
US20190017832A1 (en) Method and apparatus for providing recorded, anonymized routes
Tafidis et al. Interregional European Cooperation platform to promote sustainable transport through ICT: an overview of best practices
Rehrl et al. Collecting floating car data with smartphones: results from a field trial in Austria
CN109446437B (en) Information mining method, device, server and storage medium
JP6070123B2 (en) Position information concealment method, position information concealment device, and position information concealment program
CN104121917A (en) Method and device for automatically discovering new bridge
CN110648008A (en) Road condition prediction method and device
CN107270919B (en) Bus route grading method and device and bus route navigation method and device
Dash et al. CDR-To-MoVis: Developing a mobility visualization system from CDR data

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant