US20140031004A1 - Method, computer programs and a use for automatic identification and classification of land uses - Google Patents

Method, computer programs and a use for automatic identification and classification of land uses Download PDF

Info

Publication number
US20140031004A1
US20140031004A1 US13/556,629 US201213556629A US2014031004A1 US 20140031004 A1 US20140031004 A1 US 20140031004A1 US 201213556629 A US201213556629 A US 201213556629A US 2014031004 A1 US2014031004 A1 US 2014031004A1
Authority
US
United States
Prior art keywords
land
activity
base stations
geographical region
call records
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
US13/556,629
Other versions
US8639213B1 (en
Inventor
Enrique Frías Martínez
Vanessa Frías Martínez
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Telefonica SA
Original Assignee
Telefonica SA
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Telefonica SA filed Critical Telefonica SA
Priority to US13/556,629 priority Critical patent/US8639213B1/en
Assigned to TELEFONICA S.A. reassignment TELEFONICA S.A. ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: MARTINEZ, ENRIQUE FRIAS, MARTINEZ, VANESSA FRIAS
Application granted granted Critical
Publication of US8639213B1 publication Critical patent/US8639213B1/en
Publication of US20140031004A1 publication Critical patent/US20140031004A1/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Systems or methods specially adapted for specific business sectors, e.g. utilities or tourism
    • G06Q50/02Agriculture; Fishing; Mining
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L12/00Data switching networks
    • H04L12/02Details
    • H04L12/14Charging, metering or billing arrangements for data wireline or wireless communications
    • H04L12/1453Methods or systems for payment or settlement of the charges for data transmission involving significant interaction with the data transmission network
    • H04L12/1482Methods or systems for payment or settlement of the charges for data transmission involving significant interaction with the data transmission network involving use of telephony infrastructure for billing for the transport of data, e.g. call detail record [CDR] or intelligent network infrastructure

Definitions

  • the present invention generally relates in a first aspect to a method for the identification and classification of land uses, and more particularly to a method for the automatic identification and classification of land uses using the information provided from cell phone networks.
  • a second aspect of the present invention relates to computer programs comprising computer program code means adapted to perform an approximation of each coverage region, and to perform a comparison.
  • a third aspect of the invention relates to a use of information from a plurality of call records during a given time period to automatically identify and classify land uses of a geographical region R by measuring a number of interactions received by each one of a plurality of base stations giving coverage to said geographical region R during said given period of time.
  • the concept of land use refers to the type of activities that take place in a specific geographic area, such as residential, industrial, etc.
  • base station in the current description, it has to be understood a base station providing communications under any standards, sometimes referred to as BTS.
  • BTS any standards
  • the term encompasses a radio base station, or the so-called node B or eNB and other development standards.
  • the base station is preferably part of a cellular tower, but other embodiments are also possible.
  • CDRs Call Detail Records
  • the main limitations of the current approaches are the cost in time and money due to the need of training individuals for collecting data and/or the need to prepared, send and collect questionnaires. Also there is the fact that individuals are increasingly opposed to provide information they may consider personal. As a result these studies are typically done every two to four years which highly limits the study of the evolution of land uses in a city.
  • the present invention proposes to solve these limitations because the use of pervasive infrastructures drastically reduces the cost and eliminates the need of collection data on-site. As a result, studies can be done as frequently as needed.
  • the present invention provides, in a first aspect, to a method for automatic identification and classification of land uses, for resources allocation or tourism characterization comprising computing means running in a computer device receiving as inputs, a geographical region R, a plurality of base stations giving coverage to said geographical region R and a plurality of call records generated by individuals using said plurality of base stations
  • the method comprises performing automatically said identification and classification of land uses by making use of information extracted during a given time period directly from said plurality of call records.
  • the method of the invention comprises characterize the activity of each one of said geographical region R and assign to each region R a set of labels in order to identify the land use activity characterized.
  • the method of the present invention comprises the use of the land uses identification and characterization for urban planning applications, resources allocation and/or tourism characterization.
  • Each coverage region of each one of said plurality of base stations is approximated by a 2-dimensional non-overlapping polygon by using a Voronoi tessellation.
  • a second aspect of the present invention relates to a computer program comprising computer program code means adapted to perform said approximation of each coverage region of claim 11 when the program is run on a computer, and to a computer program comprising computer program code means adapted to perform said comparison of claim 12 when the program is run on a computer.
  • a third aspect of the invention relates to a use of information from a plurality of call records during a given time period to automatically identify and classify land uses of a geographical region R by measuring a number of interactions received by each one of a plurality of base stations giving coverage to said geographical region R during said given period of time.
  • FIG. 1 shows an example of a typical representation of the aggregate weekday-weekend activity of a generic BTS tower. Each 24 hour period has typically two peaks corresponding to a morning and an afternoon peak. The normalized value of those peaks will be used as an indication of the land use.
  • FIG. 2 shows a typical representation of the activity signature that characterizes an industrial park/office land use, according to an embodiment of the present invention.
  • FIG. 3 shows a typical representation of the activity signature that characterizes a commercial land use, according to an embodiment of the present invention.
  • FIG. 4 shows a typical representation of the activity signature that characterizes a night activity land use, according to an embodiment of the present invention.
  • FIG. 5 shows a typical representation of the activity signature that characterizes a weekend leisure land use, according to an embodiment of the present invention.
  • FIG. 6 shows a typical representation of the activity signature that characterizes a residential land use, according to an embodiment of the present invention.
  • the present invention proposes, in order to automatically identify land use behaviors, a technique that makes use of the information extracted from cell phone networks.
  • Cell phone networks are built using base transceiver station (BTS) towers that are in charge of communicating cell phones with the network.
  • BTS base transceiver station
  • BTS base transceiver station
  • CDR databases are populated whenever a mobile phone makes/receives a call or uses a service (e.g. SMS, MMS).
  • a service e.g. SMS, MMS.
  • the set of fields typically contained in a CDR include: (a) originating encrypted phone number; (b) destination encrypted phone number; (c) identifier of the BTS that handled the originating phone number (if available); (d) identifier of the BTS that handled the destination phone number (if available); (e) date and time of the call; and (f) duration of the call.
  • a CDR database generated from the BTS towers that give coverage to city it can be characterized the use that citizens make of specific urban areas.
  • the city is initially divided into the coverage areas defined by the Voronoi tessellation.
  • Each area is then characterized by the activity associated to its corresponding BTS tower which is measured as the number of interactions (voice, SMS and MMS) per time unit. This measure will be the signature of the BTS tower.
  • a rule-based knowledge based assigns labels that describe the uses of a BTS and by extension of its area of coverage.
  • the key element that defines the behaviors identified is the amount of time covered by the CDR database used. In general is recommended to use a CDR database of 30 days. Shorter databases will not have enough information to characterize land uses, while longer periods of time will mix stationary behaviors that will produce fuzzier land uses.
  • the present invention seek to assign a land use label (residential, commercial, nigh activity, weekend activity, office/industrial park or combined) to each V i of R using the information contained in a 30-day Call Detail Record database collected by BTS.
  • the method of the present invention has two parts: (1) Characterization of the Activity of Each Geographical region, and (2) Labeling of the activity of each geographical region.
  • Step 1 Construct the Activity Matrix A i for each bts i .
  • a i is a two-dimensional matrix where each element A i ( ⁇ , ⁇ ) contains the activity of bts i during a 5-minute time interval ⁇ on a given day ⁇ , where:
  • Step 2 Aggregate & Concatenate Information. Human dynamics are well differentiated between week days and weekend days [7], and those differences will translate into different BTS levels of activity. In order to preserve that information the present invention opts to build each bts i signature X i as the concatenation of the aggregated activity of the BTS during weekdays (Y i , Monday to Friday) and weekends (Z i , Saturday and Sunday), producing a final signature of 576 elements.
  • the weekday-weekend aggregation is computed as (++ indicates concatenation):
  • Step 3 Normalization. Once the signature X i has been obtained it is normalized so the area under the curve has a value of 1. Formally being T i the normalized vector of X i :
  • T i ⁇ ( ⁇ ) X i ⁇ ( ⁇ ) ⁇ t ⁇ X i ⁇ ( t )
  • FIG. 1 presents a typical representation of activity signature of a BTS. Both the weekday activity and the weekend activity are typically characterized by two peaks, one between 10 am and 2 pm another between 4 pm and 8 pm. The relation between these peaks defines the actual land use. At what time is that maximum value of activity highly depends on cultural factors and the time during the year where the CDR data used to generate the signatures represents.
  • Step 4 Using T i , identify the maximum values of activity in the range 10 am-2 pm and 4 pm-8 pm, for both weekdays and weekends. Formally:
  • Step 1 Rule for Assigning INDUSTRIAL/OFFICE Land Uses:
  • the rules captures the idea that INDUSTRIAL/OFFICE geographic areas have mainly activity during weekdays and the activity of weekends is non-relevant compared to weekday activity.
  • the rule specifies that the maximum level of activity during weekdays is in the 11 am-2 pm time period and that the maximum peaks during weekends represent less than 15% of the activity during weekdays.
  • FIG. 2 graphically presents a typical representation of an INDUSTRIAL/OFFICE area.
  • Step 2 Rule for Assigning COMMERCIAL Land Uses:
  • FIG. 3 graphically presents a typical representation of a COMMERCIAL area.
  • Step 3 Rule for Assigning NIGHLIFE Land Uses:
  • FIG. 4 graphically presents a typical representation of a NIGHLIFE activity area (in this case in verifies the second cart of the OR).
  • Step 4 Rule for Assigning WEEKEND LEISURE Land Uses:
  • FIG. 5 graphically presents a typical representation of a WEEKEND LEISURE activity area.
  • Step 5 Rule for Assigning RESIDENTIAL Land Uses:
  • RESIDENTIAL behavior is characterized by the fact that there is more activity during weekends than during weekdays, and that during weekdays the activity is higher in the afternoon than in the morning (WDM ⁇ WDA) representing the idea that people use their cell phone more when day are not in working hours to contact their social network.
  • FIG. 5 graphically presents a typical representation of a RESIDENTIAL activity area.
  • Step 6 Rule for Assigning COMBINED Land Uses:
  • Step 7 assigns a COMBINED land use if none of the previous rules have assigned a label. This case can be typical when an area has more than one land uses, and as a result the signature obtained is a combination of the land uses involved. In general, when applying the method to an urban CDR dataset, close to 40% of the areas will be classified as COMBINED because in dense urban areas it is typical than more than one use occurs in the same geographical area.
  • the invention here presented solves the limitations that previous approaches have when identifying land uses, mainly:
  • the invention is relevant for a variety of urban planning applications, like urban zoning validation, resources allocation and tourism characterization.
  • urban zoning is defined as the designation of permitted uses of land based on mapped zones which separate one set of land uses from another (for example residential areas from industrial areas).
  • One of the main problems of zoning is to actually evaluate to which extent the areas are being used as required or planned, because the collection of data has to be done on site.
  • the present invention approach allows comparing the planned used of a city with the actual use that citizens give to the different areas of the city without the need of on-site data collection.
  • nightlife areas one type of land uses that causes more disturbances.
  • the problem is that the identification of nightlife areas changes over the year (nightlife areas move from winter to summer) and those new areas are continuously appearing and old ones disappearing. With the present invention it can be easily identified these areas to allocate resources and adapt to changes.
  • tourism is one of the main industries.
  • the study of tourists is key for any city hall to cater to their needs and preferences. Questions such as where do tourists stay or where do they shop are very relevant for a city.
  • the invention proposed can characterize how tourists use the city (the different land uses they give) by just considering a CDR database containing tourist information (which can be identified by the fact that they will be roaming in the network.).

Abstract

A method, computer programs and a use for automatic identification and classification of land uses. The method including a computing mechanism running in a computer device receiving as inputs, a geographical region R, a plurality of base stations giving coverage to the geographical region R and a call records generated by individuals using the base stations, and performing automatically the identification and classification of land uses by making use of information extracted during a given time period from the call records. The programs include code adapted to perform the approximation of each coverage region when the program is run on a computer, and to perform a comparison when the program is run on a computer.

Description

    FIELD OF THE ART
  • The present invention generally relates in a first aspect to a method for the identification and classification of land uses, and more particularly to a method for the automatic identification and classification of land uses using the information provided from cell phone networks.
  • A second aspect of the present invention relates to computer programs comprising computer program code means adapted to perform an approximation of each coverage region, and to perform a comparison.
  • A third aspect of the invention relates to a use of information from a plurality of call records during a given time period to automatically identify and classify land uses of a geographical region R by measuring a number of interactions received by each one of a plurality of base stations giving coverage to said geographical region R during said given period of time.
  • The concept of land use refers to the type of activities that take place in a specific geographic area, such as residential, industrial, etc.
  • By base station in the current description, it has to be understood a base station providing communications under any standards, sometimes referred to as BTS. The term encompasses a radio base station, or the so-called node B or eNB and other development standards. The base station is preferably part of a cellular tower, but other embodiments are also possible.
  • Call records are sometimes referred to Call Detail Records (CDRs).
  • PRIOR STATE OF THE ART
  • With the increasing capabilities of mobile devices, individuals leave behind footprints of their interaction with the urban environment. As a result, new research areas focus on improving the quality of life in an urban environment by understanding the city dynamics through the data provided by ubiquitous technologies. One of these areas is the automatic identification of land uses using information collected from pervasive infrastructures (such as cell phone networks).
  • Current approaches for the identification of land uses imply the use of questionnaires or the training of individuals that collect information directly on site.
  • Some authors have already used cell phone traces to implement urban analysis studies. Among others, prior state of the art studies uses for example, aggregated cell-phone data to analyze urban planning in Milan, identified behavioral patterns from the information captured by phones carrying logging software, and/or used bluetooth to characterize pedestrian flow data. Previous work on the use of the cell phone data for land use analysis is scarce, although some studies has been presented to solve related questions. For example, one monitors the dynamics of Rome and obtains clusters of geographical areas measuring cell phone towers activity using Erlangs. Another study, analyses four different geographical spots at different times in Bangkok. Related to this present invention, another prior study used eigendecomposition to study the time structure, finding correlations between the number of Erlangs and the commercial activity of the area.
  • Some patents already focus their attention in the automatic identification of specific land uses mainly using satellite images. For example US 2006/0294062 uses images to determine the percentage of land available for development, US 2007/0162372 for the prediction of economic land use and U.S. Pat. No. 7,873,524 monitors the use of land to raise alarms if risk situations arise. These inventions always focus in a specific land use and do not consider the variety of uses defined in the present invention.
  • Problems with Existing Solutions
  • The main limitations of the current approaches are the cost in time and money due to the need of training individuals for collecting data and/or the need to prepared, send and collect questionnaires. Also there is the fact that individuals are increasingly opposed to provide information they may consider personal. As a result these studies are typically done every two to four years which highly limits the study of the evolution of land uses in a city. The present invention proposes to solve these limitations because the use of pervasive infrastructures drastically reduces the cost and eliminates the need of collection data on-site. As a result, studies can be done as frequently as needed.
  • DESCRIPTION OF THE INVENTION
  • It is necessary to offer an alternative to the state of the art which covers the gaps found therein, particularly related to the lack of proposals which really allows the identification and classification of land uses in urban areas using the information provided by cell phone records.
  • To that end, the present invention provides, in a first aspect, to a method for automatic identification and classification of land uses, for resources allocation or tourism characterization comprising computing means running in a computer device receiving as inputs, a geographical region R, a plurality of base stations giving coverage to said geographical region R and a plurality of call records generated by individuals using said plurality of base stations
  • On contrary to the known proposals, the method comprises performing automatically said identification and classification of land uses by making use of information extracted during a given time period directly from said plurality of call records.
  • On a preferred embodiment, the method of the invention comprises characterize the activity of each one of said geographical region R and assign to each region R a set of labels in order to identify the land use activity characterized.
  • In another preferred embodiment, the method of the present invention comprises the use of the land uses identification and characterization for urban planning applications, resources allocation and/or tourism characterization.
  • Each coverage region of each one of said plurality of base stations is approximated by a 2-dimensional non-overlapping polygon by using a Voronoi tessellation.
  • Other embodiments of the method of the invention are described according to appended claims, and in a subsequent section related to the detailed description of several embodiments.
  • A second aspect of the present invention relates to a computer program comprising computer program code means adapted to perform said approximation of each coverage region of claim 11 when the program is run on a computer, and to a computer program comprising computer program code means adapted to perform said comparison of claim 12 when the program is run on a computer.
  • A third aspect of the invention relates to a use of information from a plurality of call records during a given time period to automatically identify and classify land uses of a geographical region R by measuring a number of interactions received by each one of a plurality of base stations giving coverage to said geographical region R during said given period of time.
  • BRIEF DESCRIPTION OF THE DRAWINGS
  • The previous and other advantages and features will be more fully understood from the following detailed description of embodiments, with reference to the attached drawings, which must be considered in an illustrative and non-limiting manner, in which:
  • FIG. 1 shows an example of a typical representation of the aggregate weekday-weekend activity of a generic BTS tower. Each 24 hour period has typically two peaks corresponding to a morning and an afternoon peak. The normalized value of those peaks will be used as an indication of the land use.
  • FIG. 2 shows a typical representation of the activity signature that characterizes an industrial park/office land use, according to an embodiment of the present invention.
  • FIG. 3 shows a typical representation of the activity signature that characterizes a commercial land use, according to an embodiment of the present invention.
  • FIG. 4 shows a typical representation of the activity signature that characterizes a night activity land use, according to an embodiment of the present invention.
  • FIG. 5 shows a typical representation of the activity signature that characterizes a weekend leisure land use, according to an embodiment of the present invention.
  • FIG. 6 shows a typical representation of the activity signature that characterizes a residential land use, according to an embodiment of the present invention.
  • DETAILED DESCRIPTION OF SEVERAL EMBODIMENTS
  • The present invention proposes, in order to automatically identify land use behaviors, a technique that makes use of the information extracted from cell phone networks. Cell phone networks are built using base transceiver station (BTS) towers that are in charge of communicating cell phones with the network. A given geographical region will be serviced by a set of BTSs BTS={bts1 . . . btsN}, each one characterized by its geographical coordinates (latitude, longitude). For simplicity, it is assumed that the area of coverage of each BTS, the cell, can be approximated by a 2-dimensional non-overlapping polygon, and approximate it using Voronoi tessellation.
  • Call Detail Record (CDR) databases are populated whenever a mobile phone makes/receives a call or uses a service (e.g. SMS, MMS). Hence, there is an entry for each interaction with the network, with its associated timestamp and the BTS that handled it, which gives an indication of the geographical location of the mobile phone at a given moment in time. It can be noted that no information about the position of a user within a cell is known. The set of fields typically contained in a CDR include: (a) originating encrypted phone number; (b) destination encrypted phone number; (c) identifier of the BTS that handled the originating phone number (if available); (d) identifier of the BTS that handled the destination phone number (if available); (e) date and time of the call; and (f) duration of the call.
  • Using the information contained in a CDR database generated from the BTS towers that give coverage to city, it can be characterized the use that citizens make of specific urban areas. In order to do so, the city is initially divided into the coverage areas defined by the Voronoi tessellation. Each area is then characterized by the activity associated to its corresponding BTS tower which is measured as the number of interactions (voice, SMS and MMS) per time unit. This measure will be the signature of the BTS tower. Once all the signatures have been computed, a rule-based knowledge based assigns labels that describe the uses of a BTS and by extension of its area of coverage.
  • Although the invention presented can be applied to both rural and urban environments, it is better to apply it in urban environment where the high density of towers allows identifying well defined behaviors.
  • The key element that defines the behaviors identified is the amount of time covered by the CDR database used. In general is recommended to use a CDR database of 30 days. Shorter databases will not have enough information to characterize land uses, while longer periods of time will mix stationary behaviors that will produce fuzzier land uses.
  • Given an initial set of BTS=(bts1;bts2; . . . ;btsN) that gives coverage to an urban region R characterized by its Voronoi tessellation R=(V1, V2, . . . , VN), the present invention seek to assign a land use label (residential, commercial, nigh activity, weekend activity, office/industrial park or combined) to each Vi of R using the information contained in a 30-day Call Detail Record database collected by BTS.
  • The method of the present invention has two parts: (1) Characterization of the Activity of Each Geographical region, and (2) Labeling of the activity of each geographical region.
  • A. Characterization of the Activity of Each Geographical Region:
  • For each btsi i=1 . . . n, a signature that describes the number of interactions handled every 5 minutes is generated as follows:
  • Step 1: Construct the Activity Matrix Ai for each btsi. Ai is a two-dimensional matrix where each element Ai(δ,Γ) contains the activity of btsi during a 5-minute time interval Γ on a given day δ, where:

  • δε{1, . . . , NumberOfDays}

  • Γε{1, . . . , 288}
  • With NumberOfDays, the number of days collected by the CDR database (typically 30 days) and 288 indicating the total number of measurements per each 24 hour period. Although other r intervals are possible, higher resolutions did not add any extra information (while increasing the complexity), and lower resolutions affected greatly the results due to the linear interpolation effect.
  • Step 2: Aggregate & Concatenate Information. Human dynamics are well differentiated between week days and weekend days [7], and those differences will translate into different BTS levels of activity. In order to preserve that information the present invention opts to build each btsisignature Xi as the concatenation of the aggregated activity of the BTS during weekdays (Yi, Monday to Friday) and weekends (Zi, Saturday and Sunday), producing a final signature of 576 elements. The weekday-weekend aggregation is computed as (++ indicates concatenation):
  • Y i ( Γ ) = 1 δ weekday δ weekday A i ( δ , Γ ) Z i ( Γ ) = 1 δ weekend δ weekend A i ( δ , Γ ) X i = Y i ++ Z i
  • Step 3: Normalization. Once the signature Xi has been obtained it is normalized so the area under the curve has a value of 1. Formally being Ti the normalized vector of Xi:
  • T i ( Γ ) = X i ( Γ ) t X i ( t )
  • By extension the signature Ti also characterizes the corresponding geographical area of the Voronoi tessellation Vi. FIG. 1 presents a typical representation of activity signature of a BTS. Both the weekday activity and the weekend activity are typically characterized by two peaks, one between 10 am and 2 pm another between 4 pm and 8 pm. The relation between these peaks defines the actual land use. At what time is that maximum value of activity highly depends on cultural factors and the time during the year where the CDR data used to generate the signatures represents.
  • Step 4: Using Ti, identify the maximum values of activity in the range 10 am-2 pm and 4 pm-8 pm, for both weekdays and weekends. Formally:
      • WDMi: represents the maximum level of activity during weekdays in the 10 am-2 pm time period.
      • WDAi: represents the maximum level of activity during weekdays in the 4 pm-8 pm time period.
      • WEMi: represents the maximum level of activity during weekends in the 11 am-2 pm time period.
      • WEAi: represents the maximum level of activity during weekends in the 4 pm-8 pm time period.
  • Two more values are obtained for each signature activity:
      • MWDi: represents the maximum level of activity during weekdays outside the 10 am-2 pm and 4 pm-8 pm time periods.
      • MWEi: represents the maximum level of activity during weekends outside the 10 am-2 pm and 4 pm-8 pm time periods.
    B. Activity Labeling of Each Geographical Region:
  • Given the set of labels USE={RESIDENTIAL, COMERCIAL, INDUSTRIAL/OFFICE, NIGHT LEISURE, WEEKEND LEISURE, COMBINED}, the second step of the method assigns to each area of coverage Vi the set of labels from USE that identifies its land use using the information extracted from the corresponding activity signature. For each Ti, i=1, . . . , N:
  • Step 1: Rule for Assigning INDUSTRIAL/OFFICE Land Uses:
  • IF WDMi>WDAi AND 0.15 WDMi>WEMi AND WEMi<0.15 WDMi AND WEAi<0.15 WDMi
  • THEN ASSIGN INDUSTRIAL/OFFICE TO Vi
  • The rules captures the idea that INDUSTRIAL/OFFICE geographic areas have mainly activity during weekdays and the activity of weekends is non-relevant compared to weekday activity. Formally the rule specifies that the maximum level of activity during weekdays is in the 11 am-2 pm time period and that the maximum peaks during weekends represent less than 15% of the activity during weekdays. FIG. 2 graphically presents a typical representation of an INDUSTRIAL/OFFICE area.
  • Step 2: Rule for Assigning COMMERCIAL Land Uses:
  • IF WDMi>WDAi AND WEMi>WEAi AND WEMi>0.5 WDMi
  • THEN ASSIGN COMMERCIAL TO Vi
  • Commercial areas, from a BTS activity perspective, are characterized by the fact that they have relevant activities both during weekdays and weekends (the activity during weekends have to be at least 50% of the activity during weekdays), and also, both during weekdays and weekends, the activity in the morning is higher than in the afternoon. FIG. 3 graphically presents a typical representation of a COMMERCIAL area.
  • Step 3: Rule for Assigning NIGHLIFE Land Uses:
  • IF (WEMi>WEAi AND MWEi>0.7 WEMi AND (MWEi<10 AM OR MWEi>8 pm) OR WEAi>WEM AND MWEi>0.7 WEAi AND (MWEi<10 AM OR MWEi>8 pm))
  • THEN ASSIGN NIGHTLIFE TO Vi
  • This rule captures the idea that NIGHLIFE areas will have activity during weekends typically between 8 pm and 4 am. The rule specifies than the maximum activity outside 10 am-2 pm and 4 pm-8 pm time periods has at least 70% of the activity of those periods, indicating nightlife activities. The information during weekdays in this case is not relevant. FIG. 4 graphically presents a typical representation of a NIGHLIFE activity area (in this case in verifies the second cart of the OR).
  • Step 4: Rule for Assigning WEEKEND LEISURE Land Uses:
  • IF WEMi>WEAi AND 0.66 WEMi>WDMi AND 0.66 WEMi>WDAi
  • THEN ASSIGN WEEKEND LEISURE TO Vi
  • This rule represents the idea that WEEKEND LEISURE areas (such as parks) takes place during light hours in weekends (WEM>WEA) and that the activity is higher (at least 33% higher) during weekends than during weekdays. FIG. 5 graphically presents a typical representation of a WEEKEND LEISURE activity area.
  • Step 5: Rule for Assigning RESIDENTIAL Land Uses:
  • IF WDMi<WDAi AND WEMi>WDAi AND WEAi>WDAi
  • THEN ASSIGN RESIDENTIAL TO Vi
  • RESIDENTIAL behavior is characterized by the fact that there is more activity during weekends than during weekdays, and that during weekdays the activity is higher in the afternoon than in the morning (WDM<WDA) representing the idea that people use their cell phone more when day are not in working hours to contact their social network. FIG. 5 graphically presents a typical representation of a RESIDENTIAL activity area.
  • Step 6: Rule for Assigning COMBINED Land Uses:
  • IF Vi HAS NO LABELS
  • THEN ASSIGN COMBINED TO Vi
  • Step 7 assigns a COMBINED land use if none of the previous rules have assigned a label. This case can be typical when an area has more than one land uses, and as a result the signature obtained is a combination of the land uses involved. In general, when applying the method to an urban CDR dataset, close to 40% of the areas will be classified as COMBINED because in dense urban areas it is typical than more than one use occurs in the same geographical area.
  • The values used to generate the rules for this classification can be found in [8] and [9] where different clustering techniques were applied to BTS activity data to identify common signatures of behavior.
  • In general the previous rules have been design in an exclusive way, i.e. once an area is classified with one land use; no other antecedent of the rules will be true so no other label will be assigned. This is not the case of NIGHLIFE land use, which can be assigned in combination with any other label. Typically, considering only the areas that are not assigned a COMBINED land use; close to 50% correspond to RESIDENTIAL uses, 30% to COMMERCIAL uses, 10% to INDUSTRIA, 5% NIGHT ACTIVITIES and 5% to WEEKEND LEISURE. These values are just indicative of a typical urban environment and can vary not only between different cities but also between different moments of the year in the same city.
  • Advantages of the Invention
  • The invention here presented solves the limitations that previous approaches have when identifying land uses, mainly:
      • it does not need the use of questionnaires or on-site data collection for capturing the information, which lowers the cost for obtaining the results.
      • land uses can be identified as frequently as needed, which is especially useful for study the evolution of land uses over time.
      • land uses can be identified for different groups of individuals, i.e. elders, young, socio-economic divisions or tourists. The only difference in the present invention consists on filtering the original CDR database to consider only the entries that are done by the group in which the study focuses.
    Potential Uses of the Invention
  • The invention is relevant for a variety of urban planning applications, like urban zoning validation, resources allocation and tourism characterization.
  • In the context of urban planning, urban zoning is defined as the designation of permitted uses of land based on mapped zones which separate one set of land uses from another (for example residential areas from industrial areas). One of the main problems of zoning is to actually evaluate to which extent the areas are being used as required or planned, because the collection of data has to be done on site. The present invention approach allows comparing the planned used of a city with the actual use that citizens give to the different areas of the city without the need of on-site data collection.
  • One of the main problems of city halls is how to allocate resources over the city to control problems, being nightlife areas, one type of land uses that causes more disturbances. The problem is that the identification of nightlife areas changes over the year (nightlife areas move from winter to summer) and those new areas are continuously appearing and old ones disappearing. With the present invention it can be easily identified these areas to allocate resources and adapt to changes.
  • In any modern city, tourism is one of the main industries. The study of tourists is key for any city hall to cater to their needs and preferences. Questions such as where do tourists stay or where do they shop are very relevant for a city. The invention proposed can characterize how tourists use the city (the different land uses they give) by just considering a CDR database containing tourist information (which can be identified by the fact that they will be roaming in the network.).
  • ACRONYMS
    • CDR Call Detail Records
    • BTS Base Transceiver Station
    • MMS Multimedia Messaging System
    • SMS Short Message Service

Claims (16)

1. (canceled)
2. A method for automatic identification and classification of land uses, the method comprising:
using a computer device to receive as inputs: a geographical region R, a plurality of base stations giving coverage to said geographical region R, and a plurality of call records generated by individuals using said plurality of base stations,
wherein said identification and classification of land uses are performed automatically by making use of information during a given time period extracted directly from said plurality of call records, and
wherein said information extracted from said plurality of call records is contained in a call detail record (CDR) database.
3. The method according to claim 2, further comprising extracting said information from said plurality of call records during 30 days.
4. The method according to claim 2, further comprising characterizing the activity of each geographical region R.
5. The method according to claim 4, wherein said activity is characterized by measuring a number of interactions received by each one of said plurality of base stations during said given time period.
6. The method according to claim 5, further comprising differentiating said activity of each one of said plurality of base stations between weekdays and weekend days.
7. The method according to claim 5, further comprising measuring said number of interactions every 5 minutes.
8. The method according to claim 7, wherein said interactions measured comprises: voice calls, SMS, MMS or a combination thereof.
9. The method according to claim 4, further comprising assigning to each geographical region R a set of labels in order to identify the land use using said characterized activity.
10. The method according to claim 9, wherein said land use labels comprise any of: residential, commercial, night leisure, weekend leisure, office/industrial and/or combined.
11. The method according to claim 2, wherein each coverage region of each one of said plurality of base stations is approximated by a 2-dimensional non-overlapping polygon by using a Voronoi tessellation.
12. The method according to claim 2, further comprising comparing a planned use of an area with the actual use of said area.
13. A non-transitory computer readable medium storing a program causing a computer to execute a method for automatic identification and classification of land uses, the method comprising:
using the computer to receive as inputs: a geographical region R, a plurality of base stations giving coverage to said geographical region R, and a plurality of call records generated by individuals using said plurality of base stations,
wherein said identification and classification of land uses are performed automatically by making use of information during a given time period extracted directly from said plurality of call records, and
wherein said information extracted from said plurality of call records is contained in a call detail record (CDR) database.
14. The non-transitory computer readable medium according to claim 13, further comprising comparing a planned use of an area with the actual use of said area.
15. (canceled)
16. A system for automatically identifying and classifying land uses, the system comprising:
a receiving unit that receives a plurality of call records during a given time period, from a call detail record (CDR) database;
a determining unit, comprising a processor, that uses the plurality of call records received by the receiving unit to determine a number of interactions received by each of a plurality of base stations giving coverage to the geographical region R during the given period of time; and
a classification unit that uses the number of interactions determined by the determining unit to automatically identify and classify the land uses of the geographical region R.
US13/556,629 2012-07-24 2012-07-24 Method, computer programs and a use for automatic identification and classification of land uses Expired - Fee Related US8639213B1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
US13/556,629 US8639213B1 (en) 2012-07-24 2012-07-24 Method, computer programs and a use for automatic identification and classification of land uses

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
US13/556,629 US8639213B1 (en) 2012-07-24 2012-07-24 Method, computer programs and a use for automatic identification and classification of land uses

Publications (2)

Publication Number Publication Date
US8639213B1 US8639213B1 (en) 2014-01-28
US20140031004A1 true US20140031004A1 (en) 2014-01-30

Family

ID=49957998

Family Applications (1)

Application Number Title Priority Date Filing Date
US13/556,629 Expired - Fee Related US8639213B1 (en) 2012-07-24 2012-07-24 Method, computer programs and a use for automatic identification and classification of land uses

Country Status (1)

Country Link
US (1) US8639213B1 (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2017133110A1 (en) * 2016-02-02 2017-08-10 东南大学 Urban dynamic spatial structure circle layer definition method
US20210201186A1 (en) * 2019-12-27 2021-07-01 Paypal, Inc. Utilizing Machine Learning to Predict Information Corresponding to Merchant Offline Presence
CN117408438A (en) * 2023-12-14 2024-01-16 中国测绘科学研究院 Industrial park low-utility land extraction method and system based on multi-source big data

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20150039523A1 (en) * 2013-07-30 2015-02-05 Universitat De Girona Methods and devices for urban planning

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5345595A (en) * 1992-11-12 1994-09-06 Coral Systems, Inc. Apparatus and method for detecting fraudulent telecommunication activity

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2017133110A1 (en) * 2016-02-02 2017-08-10 东南大学 Urban dynamic spatial structure circle layer definition method
US20210201186A1 (en) * 2019-12-27 2021-07-01 Paypal, Inc. Utilizing Machine Learning to Predict Information Corresponding to Merchant Offline Presence
CN117408438A (en) * 2023-12-14 2024-01-16 中国测绘科学研究院 Industrial park low-utility land extraction method and system based on multi-source big data

Also Published As

Publication number Publication date
US8639213B1 (en) 2014-01-28

Similar Documents

Publication Publication Date Title
Becker et al. Human mobility characterization from cellular network data
Becker et al. A tale of one city: Using cellular network data for urban planning
Chon et al. Evaluating mobility models for temporal prediction with high-granularity mobility data
Isaacman et al. Identifying important places in people’s lives from cellular network data
US10395519B2 (en) Method and system for computing an O-D matrix obtained through radio mobile network data
US8504035B2 (en) System and method for population tracking, counting, and movement estimation using mobile operational data and/or geographic information in mobile network
CN102300220B (en) Method and device for determining deployment position of micro base station
Demissie et al. Exploring cellular network handover information for urban mobility analysis
Fekih et al. A data-driven approach for origin–destination matrix construction from cellular network signalling data: a case study of Lyon region (France)
US8838134B2 (en) Method and computer programs for the construction of communting matrices using call detail records and a use for providing user&#39;s mobility information
Demissie et al. Analysis of the pattern and intensity of urban activities through aggregate cellphone usage
Holleczek et al. Detecting weak public transport connections from cellphone and public transport data
US20180124566A1 (en) Method and system for a real-time counting of a number of persons in a crowd by means of aggregated data of a telecommunication network
US20120108261A1 (en) Dynamic travel behavior estimation in mobile network
Kalogianni et al. Passive WiFi monitoring of the rhythm of the campus
CN108427679B (en) People stream distribution processing method and equipment thereof
US8639213B1 (en) Method, computer programs and a use for automatic identification and classification of land uses
Zheng et al. Exploring both home-based and work-based jobs-housing balance by distance decay effect
Zin et al. Estimation of originating-destination trips in Yangon by using big data source
Imai et al. Origin-destination trips generated from operational data of a mobile network for urban transportation planning
Khan et al. A hierarchical approach for identifying user activity patterns from mobile phone call detail records
Woods et al. Exploring methods for mapping seasonal population changes using mobile phone data
Tsumura et al. Examining potentials and practical constraints of mobile phone data for improving transport planning in developing countries
CN111212381A (en) Mobile user behavior data analysis method and device, computer equipment and medium
Makita et al. Can mobile phone network data be used to estimate small area population? A comparison from Japan

Legal Events

Date Code Title Description
AS Assignment

Owner name: TELEFONICA S.A., SPAIN

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:MARTINEZ, ENRIQUE FRIAS;MARTINEZ, VANESSA FRIAS;REEL/FRAME:028901/0252

Effective date: 20120809

FEPP Fee payment procedure

Free format text: MAINTENANCE FEE REMINDER MAILED (ORIGINAL EVENT CODE: REM.)

LAPS Lapse for failure to pay maintenance fees

Free format text: PATENT EXPIRED FOR FAILURE TO PAY MAINTENANCE FEES (ORIGINAL EVENT CODE: EXP.)

STCH Information on status: patent discontinuation

Free format text: PATENT EXPIRED DUE TO NONPAYMENT OF MAINTENANCE FEES UNDER 37 CFR 1.362

FP Lapsed due to failure to pay maintenance fee

Effective date: 20180128