WO2017000817A1 - 获取数据之间的匹配关系的方法和装置 - Google Patents

获取数据之间的匹配关系的方法和装置 Download PDF

Info

Publication number
WO2017000817A1
WO2017000817A1 PCT/CN2016/086649 CN2016086649W WO2017000817A1 WO 2017000817 A1 WO2017000817 A1 WO 2017000817A1 CN 2016086649 W CN2016086649 W CN 2016086649W WO 2017000817 A1 WO2017000817 A1 WO 2017000817A1
Authority
WO
WIPO (PCT)
Prior art keywords
wireless router
target object
wireless
log
information
Prior art date
Application number
PCT/CN2016/086649
Other languages
English (en)
French (fr)
Inventor
范文
傅劲
Original Assignee
阿里巴巴集团控股有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 阿里巴巴集团控股有限公司 filed Critical 阿里巴巴集团控股有限公司
Publication of WO2017000817A1 publication Critical patent/WO2017000817A1/zh

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/29Geographical information databases
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/50Network services
    • H04L67/52Network services specially adapted for the location of the user terminal

Definitions

  • the present invention relates to the field of data processing, and in particular to a method and apparatus for acquiring a matching relationship between data.
  • the China Point of Interest (POI) database contains POI data from various regions of the country.
  • the data mainly includes four aspects: name, category, latitude and longitude, nearby hotels, restaurants, shops and other information.
  • Targets of hotels, restaurants, shops, etc. can obtain log information of the target object from the POI database, and the log information can cover the following aspects: the name of the target object, and the coordinate information of the target object (for example, the target object) Longitude and latitude information), the address of the target object (for example, the street where the target object is located), and the location information of the target object (for example, the city and administrative area where the target object is located).
  • the mobile terminal held by the consumer searches for and records the wireless network of the surrounding area, and the recorded log information of the wireless network basically includes the following aspects: the unique identifier of the mobile terminal, and the movement
  • the coordinate information of the terminal for example, the latitude and longitude information of the mobile terminal
  • the location information of the mobile terminal for example, the city and the administrative area where the mobile terminal is located
  • the identifier of the wireless network searched by the mobile terminal and the strength information of the wireless network signal.
  • Obtaining the correspondence between the target object and the wireless network, and performing statistical analysis, can obtain commercially valuable analytical data such as consumer consumption, consumption preferences, or business conditions of the store. For example, if the target object is used as the analysis object, if the wireless network corresponding to the target object is known, the log information of the wireless network recorded by the mobile terminal can be queried, and the flow of the target object in different time periods can be known, and the flow can be known. The information of the consumer who has connected to the wireless network, and based on the social network of the consumer, deeply analyzes the distribution of the consumer of the target object or automatically recommends the target object to the friend of the consumer. For example, if the consumer is the analysis object, the wireless network that the consumer has connected to can be obtained.
  • Target object you can analyze the consumer's stay time in these target objects or the frequency/number of times the consumers visit these target objects. You can also analyze the consumer's consumption preferences by integrating the target objects, and also based on the analysis results. Recommend similar target objects to consumers.
  • the target object is deployed to actively cooperate with the target object to deploy a wireless network (for example, Huawei WiFi, 360 WiFi, etc.), so as to obtain a relatively accurate correspondence between the target object and the wireless network.
  • a wireless network for example, Huawei WiFi, 360 WiFi, etc.
  • the acquisition method in the prior art has high economic cost and requires a lot of manpower, material resources and financial resources, and when the number of target objects to be acquired is huge, the time cost for obtaining the correspondence relationship is also quite high, and the acquisition is relatively high.
  • the matching relationship between a large number of target objects and a wireless network is extremely difficult.
  • An embodiment of the present invention provides a method and an apparatus for acquiring a matching relationship between data, so as to at least solve a method for acquiring a corresponding relationship between a target object and a mobile terminal by using a manual method in the prior art, thereby causing the acquired target object and Technical issues in which the relationship between wireless networks is inaccurate and costly.
  • a method for obtaining a matching relationship between data includes: obtaining log information of a target object included in a target object set and a positioning log of a wireless router included in the wireless routing device set. Reading location information of any one or more target objects from the log information, and reading location information of any one or more wireless routers from the location log; according to location information of any one or more target objects and any one Or the location information of the multiple wireless routers, determining a set of wireless routers corresponding to the target object, to obtain a matching relationship between the target object included in the target object set and the wireless router included in the wireless routing device set.
  • an apparatus for acquiring a matching relationship between data includes: an acquiring module, configured to acquire log information of a target object included in a target object set, and a wireless routing device set The location record of the included wireless router; the reading module, configured to read location information of any one or more target objects from the log information, and read location information of any one or more wireless routers from the location log; a module, configured to determine, according to location information of any one or more target objects and location information of any one or more wireless routers, a set of wireless routers corresponding to the target object, to obtain target objects and wireless objects included in the target object set The matching relationship between wireless routers included in the routing device set.
  • the target object and the target object are respectively read from the log information and the positioning log.
  • the location information of the wireless router achieves the purpose of determining the correspondence between the target object and a group of wireless routers according to the location information of the target object and the wireless router, thereby realizing the collection of the target object and the wireless routing device set included in the target object set.
  • the technical effect of the matching relationship between the wireless routers included in the method further solves the relationship between the acquired target object and the wireless network due to the method of manually acquiring the correspondence between the target object and the mobile terminal in the prior art. Inaccurate and costly technical issues.
  • FIG. 1 is a block diagram showing the hardware structure of a computer terminal for acquiring a matching relationship between data according to Embodiment 1 of the present application;
  • FIG. 2 is a schematic flowchart of a method for acquiring a matching relationship between data according to Embodiment 1 of the present application;
  • FIG. 3 is a flowchart of an optional method for acquiring a matching relationship between data according to Embodiment 1 of the present application;
  • FIG. 4 is a schematic structural diagram of an apparatus for acquiring a matching relationship between data according to Embodiment 2 of the present application;
  • FIG. 5 is a schematic structural diagram of an optional acquisition module according to the embodiment shown in FIG. 4 of the present application.
  • FIG. 6 is a schematic structural diagram of an apparatus for acquiring a matching relationship between data according to the embodiment shown in FIG. 5 of the present application;
  • FIG. 7 is a schematic structural diagram of an apparatus for acquiring a matching relationship between data according to the embodiment shown in FIG. 6 of the present application;
  • FIG. 8 is a schematic structural diagram of an optional second screening module according to the embodiment shown in FIG. 7 of the present application.
  • FIG. 9 is a schematic structural diagram of an optional processing module according to the embodiment shown in FIG. 4 of the present application.
  • FIG. 10 is a schematic structural diagram of an apparatus for acquiring a matching relationship between data according to the embodiment shown in FIG. 9 of the present application; FIG.
  • FIG. 11 is a schematic structural diagram of an apparatus for acquiring a matching relationship between data according to the embodiment shown in FIG. 4 of the present application;
  • FIG. 12 is a structural block diagram of a computer terminal according to an embodiment of the present application.
  • the POI database which is the China Point of Interest (POI) database, contains POI data from various regions of the country.
  • the data mainly includes four aspects: name, category, latitude and longitude, and nearby hotels/restaurants/shops.
  • This application is exemplified by the China Information Point Database. Those skilled in the art can use the POI data of various regions in foreign countries without the creative work.
  • the International Mobile Equipment Identity (IMEI) of the mobile terminal is an electronic serial number consisting of 15 digits. It corresponds to each mobile device and is the unique identifier of the mobile device in the world.
  • IMEI International Mobile Equipment Identity
  • Edit Distance also known as Levenshtein distance
  • Levenshtein distance is the minimum number of edit operations required between two strings, one from one to another.
  • Licensed editing operations include replacing one character with another, inserting one character, and deleting one character.
  • FIG. 1 is an example of acquiring data between embodiments of the present invention.
  • FIG. 1 is an example of acquiring data between embodiments of the present invention.
  • computer terminal 10 may include one or more (only one shown) processor 102 (processor 102 may include, but is not limited to, a processing device such as a microprocessor MCU or a programmable logic device FPGA)
  • a memory 104 for storing data
  • a transmission module 106 for communication functions.
  • FIG. 1 is merely illustrative and does not limit the structure of the above electronic device.
  • computer terminal 10 may also include more or fewer components than those shown in FIG. 1, or have a different configuration than that shown in FIG.
  • the memory 104 can be used to store software programs and modules of the application software, such as program instructions/modules corresponding to the method for obtaining a matching relationship between data in the embodiment of the present invention, and the processor 102 runs the software program stored in the memory 104 and Modules, which perform various functional applications and data processing, that is, implement the vulnerability detection method of the above application.
  • Memory 104 may include high speed random access memory, and may also include non-volatile memory such as one or more magnetic storage devices, flash memory, or other non-volatile solid state memory.
  • memory 104 may further include memory remotely located relative to processor 102, which may be coupled to computer terminal 10 via a network. Examples of such networks include, but are not limited to, the Internet, intranets, local area networks, mobile communication networks, and combinations thereof.
  • Transmission device 106 is for receiving or transmitting data via a network.
  • the network specific examples described above may include a wireless network provided by a communication provider of the computer terminal 10.
  • the transmission device 106 includes a Network Interface Controller (NIC) that can be connected to other network devices through a base station to communicate with the Internet.
  • the transmission device 106 can be a Radio Frequency (RF) module for communicating with the Internet wirelessly.
  • NIC Network Interface Controller
  • RF Radio Frequency
  • the present application provides a method for obtaining a matching relationship between data as shown in FIG. 2.
  • 2 is a flow chart of a method of acquiring a matching relationship between data according to a first embodiment of the present invention.
  • an optional method for obtaining a matching relationship between data includes the following implementation steps:
  • Step S202 Acquire log information of the target object included in the target object set and a positioning log of the wireless router included in the wireless routing device set;
  • the log information is a target unit, and the log information of any one of the target objects includes at least one type of data field.
  • the data field of the specified category of the target object included in the target object set is filtered out from the original database in which the target object information is recorded, and the log information of the target object included in the target object set is obtained. .
  • the original database that records a large amount of target object information may be a POI database, or The secondary processing of the integrated database (such as the store database of the high German map, the database of the public reviews).
  • the category of the data field may include at least one of the following: a name of the target object, a category to which the target object belongs, coordinate information of the target object, an address of the target object, and location information of the target object, wherein the address of the target object and the location of the target object Information can also be obtained indirectly from the coordinate information of the target object.
  • the wireless routing device set includes at least one wireless router.
  • the positioning log uses a wireless router as a recording unit, and the positioning log of any one of the wireless routers includes at least one type of data field.
  • the location log of the wireless router included in the wireless routing device set includes information of a specified data field of the wireless router included in the wireless routing device set.
  • the target object set includes a plurality of restaurants to be matched, and the specified data field includes, for example, a name and a latitude and longitude coordinate, and the designation of all the restaurants to be matched is extracted from the restaurant database of the Gaode map.
  • the information of the data field is sorted to obtain log information of several restaurants to be matched included in the target object set.
  • the four restaurants such as Jingweizhai, Yutoubu, Northeastern, and Qingfeng Baozi, are randomly selected from the restaurants to be matched, and the scheme of the present application is described in detail.
  • the wireless routing device set includes a plurality of wireless routers to be matched, and the positioning logs of the wireless routers included in the wireless routing device set can also be obtained.
  • the following wireless router is extracted from the wireless router to be matched as an example to describe the solution of the present application: Jwz, ytpb, dongbeicai, Q@fbzp, quan-ju-de.
  • Step S204 reading location information of any one or more target objects from the log information, and reading location information of any one or more wireless routers from the location log;
  • the target object may be an object carrying the mobile terminal or the mobile terminal itself, and the mobile terminal may use the international identity code as the unique identifier to identify.
  • the category of the data field included in the log information of the target object may include at least location information of the target object, and the category of the data field included in the location log of the wireless router includes at least location information of the wireless router.
  • the location information may include any one of the following: latitude and longitude information, street information, geographical area information, attribution of business circle information, and the like.
  • the location information is that the log information of the to-be-matched restaurant included in the acquired target object set includes the geographical area of the restaurant to be matched, for example, the above four restaurants are located in Beijing. Chaoyang District; the obtained wireless router device set contains a number of wireless routers to be matched.
  • the location log also contains the geographic area to which the router to be matched belongs. For example, in the above wireless router, quan-ju-de is located in Haidian District, Beijing. The rest is located in Chaoyang District, Beijing.
  • Step S206 Determine a group of wireless routers corresponding to the target object according to location information of any one or more target objects and location information of any one or more wireless routers, to obtain target objects and wireless routes included in the target object set. The matching relationship between wireless routers included in the device collection.
  • step S206 of the present application according to the location information of the target object and the location information of the wireless router, it can be determined whether the target object and the wireless router are in the same, extremely close, or specific relationship geographic location, to determine that the target object has Corresponding relationship between a group of wireless routers to further obtain the matching relationship between the target object and the wireless router.
  • the correspondence between the Beijing-style fast and the wireless router to be matched can be determined. Relationship, for example, can be determined that the wireless router quan-ju-de is located in Haidian District, Beijing, and does not have a corresponding relationship with Beijing Weizhai, located in Chaoyang District, Beijing.
  • step S202 to step S206 provided by the present application can automatically identify the wireless corresponding to each target object from a large amount of complicated data according to the acquired log information of the target object and the location information in the location log of the wireless router. router.
  • a bridge can be established between the target object information database (for example, POI data) and the database in which the wireless router specifies the data field, so that joint analysis of the two databases becomes possible.
  • the log information of the target object included in the target object set and the location log of the wireless router included in the wireless routing device set are obtained, and the log information is obtained from the log information.
  • the location information of the target object and the wireless router are respectively read in the location log, and the purpose of determining the correspondence between the target object and a group of wireless routers according to the location information of the target object and the wireless router is achieved, thereby realizing the acquisition of the target object set.
  • step S202 obtaining a positioning log of the wireless router included in the wireless routing device set includes the following specific implementation steps:
  • Step S2022 Obtain network log information of the mobile terminal included in the mobile terminal set, where the network log information includes at least the following data fields: location information of the mobile terminal and routing information of the wireless router accessed by the mobile terminal;
  • the mobile terminal can acquire the mobile terminal through the wireless communication module set thereon.
  • the routing information of the wireless router near the end, the mobile terminal accessing the wireless router means that the mobile terminal connects to the wireless router through a correct password or the mobile terminal obtains a nearby wireless router through detection.
  • the wireless routing information may include at least one of the following: a name of the wireless router, an identifier of the wireless router, and an intensity of a wireless signal sent by the wireless router.
  • the location information of the mobile terminal included in the network log information of the mobile terminal may be the GPS information collected by the positioning module of the mobile terminal, or may be connected to the surrounding WiFi through the mobile terminal, and the WiFi positioning technology is adopted.
  • the obtained location information of the mobile terminal may also be the second type of location information generated by the first type of location information conversion of the mobile terminal (for example, obtaining the area information of the mobile terminal according to the latitude and longitude coordinates of the mobile terminal), and It is a combination of any of the above various information.
  • the mobile terminal set includes at least one mobile terminal.
  • the more mobile terminals included in the mobile terminal set, the more wireless routers are covered in the collected network information of the mobile terminal. .
  • a WiFi enabled mobile phone automatically detects surrounding WiFi wireless router information and generates a WiFi log detected by a single mobile phone.
  • the WiFi log detected by a single mobile phone includes, for example, a mobile phone identifier, a mobile phone location, and WiFi information detected by the mobile phone, wherein the WiFi information detected by the mobile phone includes a WiFi identity and a WiFi signal strength.
  • the original log may further include data for further processing according to the mobile phone identifier, the location of the mobile phone, and the WiFi information detected by the mobile phone, for example, information about a city and a region where the mobile phone is located according to the latitude and longitude information.
  • Table 1 exemplarily shows the original log generated after statistics on the WiFi logs of several mobile phones.
  • the SSID Service Set Identifier
  • the network name of the WiFi network is the network name of the WiFi network.
  • IMEI User ID
  • WiFi SSID list and its signal strength 123456789012345 Chaoyang District, Beijing Jwz, -30; ytpb, -80; dongbeicai, -15; 123456789012346 Chaoyang District, Beijing Jwz,-70;Q@fbzp,-25; 123456789012347 Chaoyang District, Beijing Jwz, -25; 123456789012348 Haiding district, Beijing Quan-ju-de,-10
  • Step S2024 Formatting and converting the network log information to generate a positioning log of any one or more wireless routers.
  • the positioning log of the wireless router includes at least the following data fields: identification information and location information.
  • the identifier information may be the name of the wireless router acquired by the mobile terminal or
  • the other information that can be used to distinguish the wireless router the location information may be the location information of the mobile terminal when the wireless router is detected, or may be the location information of another data format converted according to the location information of the mobile terminal.
  • the format conversion is mainly to record the conversion of the unit, that is, the network log with the mobile terminal as the unit of the record is converted into the location log with the wireless router as the recording unit. Specifically, since the network log information of the mobile terminal records the routing information of the wireless router detected by each mobile terminal, the data format cannot directly obtain the routing information of the wireless router; and the positioning log record of the wireless router generated after the conversion Identification information and location information for each wireless router.
  • step S2024 of the present application it is solved that the log information of the target object and the log information of the wireless network recorded by the mobile terminal are independent of each other and difficult to be connected, and the log information of the target object and the log information of the wireless network cannot be directly obtained.
  • the correspondence between the target object and the wireless router is
  • the location log of the converted wireless router includes at least the WiFi SSID of the wireless router and the location information of the wireless router.
  • the location information of the wireless router is set. It is the city and administrative area to which the wireless router belongs (according to the latitude and longitude information).
  • Table 2 exemplarily shows the location log of the wireless router generated by the conversion. In Table 2, in addition to the WiFi SSID of the wireless router and the location information of the wireless router, the signal strength of the wireless network is also included.
  • Identification information location information Signal strength Jwz Chaoyang District, Beijing -30 Ytpb Chaoyang District, Beijing -80 Dongbeicai Chaoyang District, Beijing -15 Jwz Chaoyang District, Beijing -70 Q@fbzp Chaoyang District, Beijing -25 Jwz Chaoyang District, Beijing -25 Quan-ju-de Haiding district, Beijing -10
  • the above steps S2022 to S2024 of the present application provide an alternative for obtaining a location log of a wireless router included in a set of wireless routing devices.
  • the network log of the mobile terminal is obtained and integrated based on the above-mentioned step S2022, and the conversion from the common network log of the mobile terminal to the location log of the wireless router is implemented in step S2024, so that step S202 is performed.
  • the location information of the wireless router can be directly obtained from the location log of the wireless router, and the correspondence between the target object and the wireless router is obtained.
  • Step S2032 Perform aggregation processing on the wireless routers in the wireless routing device set according to the identification information of the wireless router, and generate an aggregation result of any one or more wireless routers in the wireless routing device set, where the aggregation result includes: a signal of the wireless router strength;
  • step S2032 of the present application since the same wireless router may be collected by multiple mobile terminals at multiple locations, multiple location information and multiple signal strength data may be corresponding to the same wireless router in the location log. Therefore, it is necessary to further determine which of the plurality of location information in the location log is closest to the real geographical location of the wireless router, or further calculate the most likely location of the wireless router based on the plurality of location information.
  • the aggregation process of the wireless router may be based on the identification information of the wireless router, and the data with the same wireless router identification information in the location log is aggregated to form the wireless router identifier, the wireless router location information, and The result of the aggregation of the wireless router signal strength.
  • each wireless router is aggregated by using the identification information of the wireless router as a key from the location log of the wireless router generated by the conversion.
  • Table 3 shows only a part of the data aggregation result by taking the wireless router whose identification information is jwz as an example.
  • Step S2034 The aggregation result is filtered by using a preset filtering threshold to determine a valid log in the positioning log of any one or more wireless routers.
  • the effective log is a positioning log of the wireless router whose signal strength is greater than or equal to the filtering threshold.
  • step S2034 of the present application when the signal strength is weak to a certain extent, the reliability of the entire piece of data (especially the position information) corresponding thereto is low.
  • the filtering threshold is set, and the entire data corresponding to the signal strength is determined to be reliable according to the relationship between the signal strength and the filtering threshold.
  • the aggregation result may be obtained from the positioning log. Exclude the piece of data to get a valid log.
  • the matching relationship between the restaurant and the wireless router is still taken.
  • the filtering threshold it is determined that the entire information corresponding to the signal strength is unreliable.
  • the piece of information is deleted from the aggregation result.
  • Table 4 shows only the valid log obtained by filtering the aggregation result by taking the wireless router whose identification information is jwz as an example.
  • the above steps S2032 to S2034 of the present application provide an alternative for filtering the positioning log.
  • the positioning logs are aggregated according to the identification information of the wireless router, and the aggregation result of each wireless router is generated, and then the aggregation result is filtered by the step S2034, the reliable data in the aggregation result is retained, and the effective log is obtained, and the implementation is performed.
  • the processing of the above steps S2032 to S2034 can simplify the data and ensure the reliability of the data.
  • the location log of the wireless router further includes: positioning coordinates of the wireless router.
  • the positioning coordinates may be coordinate data based on any pre-established coordinate system, such as latitude and longitude coordinate data.
  • the location information included in the location log of the wireless router may be coordinate information (for example, latitude and longitude coordinate information, coordinate information in other coordinate systems), or It is non-coordinate information (for example, the city and administrative area information of the wireless router converted according to the latitude and longitude coordinate information).
  • the positioning log of the wireless router in the alternative provided by the present application further includes the positioning coordinates of the wireless router.
  • the location log of the wireless router further includes: positioning coordinates of the wireless router, performing step S2034 above: determining the location log of any one or more wireless routers. After the valid log, you can also perform the following implementation steps:
  • Step S2036 Clustering the positioning coordinates of any one or more wireless routers by using preset conditions, and acquiring cluster clusters of any one or more wireless routers, wherein the wireless router generates at least one cluster cluster;
  • an algorithm may be used to cluster multiple positioning coordinates corresponding to each wireless router, for example, a density-based clustering algorithm may be selected, that is, when When the density of the positioning coordinates in a region exceeds the threshold, it can be divided into clusters.
  • the preset condition is a condition that needs to be preset when using the clustering algorithm, and different clustering algorithms require different preset conditions.
  • the preset conditions include: E domain and core object, E domain refers to a region with a given object radius of E, and the core object refers to a given object E domain.
  • Positioning coordinates of each wireless router screened in the effective log by the DBSCAN algorithm Columns are clustered. In this case, only the first step of the DBSCAN algorithm is executed, that is, the positioning coordinates are gathered into a "small circle” that satisfies our preset conditions, and the second step is not merged.
  • set the E field to 10 meters and the core object to 20, that is, we require that if a wifi is positioned by 20 different imei in a circle with a radius of 10 meters, a cluster of clusters is formed.
  • Step S2038 Filter the wireless routers in the wireless routing device set according to the number of clustering clusters of the wireless router.
  • step S2038 of the present application when a wireless router is in a state of stable performance and a fixed position, the location of the cluster cluster of the wireless router may exhibit a certain regionality, and the number of cluster clusters may also exhibit a certain regularity. Sex.
  • the foregoing step S2038 provided by the present application determines the running status of the wireless router by determining the number of clustering clusters of the wireless router by using a preset rule, and implements the wireless router in the wireless routing device set by clustering clusters. filter.
  • the above steps S2036 to S2038 of the present application provide an alternative for screening wireless routers in the wireless routing device set. Based on the cluster cluster of each wireless router generated in the above step S2036, the state of each wireless router is determined through step S2038, and the screening of the wireless routers in the wireless routing device set is completed.
  • step S2038 screening the wireless routers in the wireless routing device set according to the number of clustering clusters of the wireless router, including the following specific implementation steps:
  • Step S20380 Calculate a center point coordinate of each cluster of the wireless router
  • the calculation method of the center point coordinates of the cluster cluster may adopt a center point calculation method in the European space.
  • the calculation formula for calculating the coordinates of the center point of the cluster cluster is:
  • Center(cluster) [(lat1+lat2+...+latn)/n,(lng1+lng2+...+lngn)/n]
  • center (cluster) represents the coordinates of the center point of the cluster
  • lat is the abbreviation of latitude (latitude)
  • lng is the abbreviation of longitude (longitude) Lng1
  • lng2...lngn are the longitudes in each positioning coordinate in the cluster
  • n is the number of positioning coordinates contained in the cluster.
  • Step S20382 If the number of cluster clusters of the wireless router exceeds a preset threshold, calculate the center distance of any two cluster clusters of the wireless router by using the center point coordinates of any two cluster clusters of the wireless router;
  • the preset threshold is, for example, 2.
  • the center distance of the two clusters that are the farthest from the wireless router is limited because the signal coverage of the wireless router is limited. It also has an upper limit. Therefore, the center distance of any two cluster clusters of the wireless router can be calculated and compared with the distance threshold to determine whether there are two cluster clusters whose center distance is greater than the distance threshold. Through this In this way, the validity of the positioning coordinates in the cluster cluster can be judged.
  • the number of positioning coordinates in each cluster of the wireless router may be acquired first, and according to the cluster cluster.
  • the number of positioning coordinates is used to sort the cluster clusters, and then the distances of the center points of the two cluster clusters are sequentially determined according to the order of the cluster clusters in the sorting. It is also possible to calculate the center distance of any two clusters.
  • the calculation method of the spherical distance is used when calculating the center distance of the two clusters of the wireless router.
  • the wireless router can be directly determined to be an effective wireless router, and the coordinates of the center point of the cluster cluster are assigned to The effective wireless router can also determine that the wireless router is an invalid wireless router when the number of the clusters is less than the trusted threshold according to the number of clusters in the cluster, thereby avoiding the number of positioning coordinates of the wireless router. Not enough may cause the wireless router to have low reliability.
  • Step S20384 When the center distance of the wireless router is less than or equal to the distance threshold, determine that the wireless router is an effective wireless router;
  • the center distance of the wireless router is greater than the distance threshold, it is considered that the location log of the wireless router has an error, or the location of the wireless router has changed, and the location log of the wireless router needs to be re-acquired. Therefore, it can be determined that the wireless router is an invalid router.
  • N 150 meters
  • NO.1center (cluster), NO.2center (cluster) ⁇ N it is determined that the wireless router is an effective wireless router; wherein distance uses a spherical distance calculation method, and NO.1center (cluster) may be based on a cluster cluster Number of Positioning Coordinates After clustering the clusters by at least one, the clusters with the largest number of coordinates are located, and the coordinates of NO.2center (cluster) are numbered.
  • the wireless router is determined whether the wireless router is valid or not.
  • the positioning information of the wireless router can be discriminated by the above steps, and the correspondence between the target object and the wireless router caused by the error of the positioning information in the positioning log is avoided. mistake.
  • the wireless router may be acquired by a large number of wireless terminals during the mobile process. Through the above steps, the wireless router may be screened to avoid the mobile wireless router. An error in the correspondence between the target object and the wireless router caused by the positioning.
  • Step S20386 Retain the effective wireless router in the wireless routing device set, and read the cluster with the largest number of clusters in the effective wireless router;
  • the effective wireless router in the set of reserved wireless routing devices can be realized by deleting the invalid wireless router or extracting the effective wireless router.
  • Step S20388 Assign the center point coordinates of the cluster cluster with the largest number of clusters to the effective wireless router.
  • the cluster cluster with the largest number of clusters indicates that the wireless router is positioned most frequently in the region, and the probability of being closest to the real position of the wireless router is also the largest.
  • the optimal location coordinates of the wireless router are obtained according to the wireless router positioning log.
  • steps S20380 to S20388 of the present application provide an alternative for screening wireless routers in the set of wireless routing devices. Based on the foregoing steps S20380 to S20384, the determination of whether the wireless router is valid is implemented. Steps S20386 to S20388 are implemented to extract an effective wireless router and assign an optimal position coordinate to the effective wireless router, and finally achieve accurate screening of the wireless router device set. The wireless router gives the filtered wireless router the technical effect of optimal position coordinates.
  • the log information of the target object includes at least: coordinate information of the target object; and the location log of the wireless router includes at least: coordinate information of the wireless router.
  • the log information of the target object or the location information included in the location log of the wireless router may be coordinate information (for example, latitude and longitude coordinate information, coordinate information in other coordinate systems), or non-coordinate information (for example, converted according to latitude and longitude coordinate information)
  • the obtained target object or the city and administrative area information of the wireless router may be coordinate information (for example, latitude and longitude coordinate information, coordinate information in other coordinate systems), or non-coordinate information (for example, converted according to latitude and longitude coordinate information)
  • the obtained target object or the city and administrative area information of the wireless router may be coordinate information (for example, latitude and longitude coordinate information, coordinate information in other coordinate systems), or non-coordinate information (for example, converted according to latitude and longitude coordinate information)
  • the obtained target object or the city and administrative area information of the wireless router may be coordinate information (for example, latitude and longitude coordinate information, coordinate information in other coordinate systems), or non-coordinate information (for example, converted according to latitude and longitude coordinate information)
  • Table 5 exemplarily shows log information including restaurant coordinate information
  • Table 6 exemplarily shows the first screening through steps S2032 to S2034, and the steps.
  • the positioning log of the wireless router coordinate information is included; in Table 5 and Table 6, lng1 to lng4 respectively indicate the longitude information of the restaurant, and lng5 to lng8 respectively represent the clusters of the effective wireless router.
  • the longitude information of the center point of the largest cluster cluster; lat1 to lat4 respectively represent the latitude information indicating the restaurant, and lat5 to lat8 respectively represent the latitude information of the center point of the cluster cluster cluster having the largest number of clusters in the effective wireless router.
  • step S206 when the log information of the target object includes at least the coordinate information of the target object and the log information of the wireless router includes at least the coordinate information of the wireless router, step S206: according to any one or more The location information of the target object and the location information of any one or more wireless routers determine a set of wireless routers corresponding to the target object, including the following specific implementation steps:
  • Step S2062 Matching the location information of the target object and the location information of the wireless router as keywords, and acquiring at least one wireless router having a mapping relationship with the target object;
  • a mapping relationship is established between a target object having the same location information, adjacent to each other, or a specific relationship, and a wireless router is acquired, and then at least one mapping relationship with the target object may be acquired by using the target object as a unit.
  • the wireless router can also acquire at least one target object that has a mapping relationship with the wireless router in a wireless router unit.
  • the "Beijing Chaoyang District” is used as a key to obtain a wireless router that has a mapping relationship with the restaurant.
  • the target objects "Jingweizhai” and Jwz have mapping relationships.
  • Step S2064 Calculate a spherical distance between the target object and any one of the wireless routers having the mapping relationship according to the coordinate information of the target object and the coordinate information of the at least one wireless router having a mapping relationship with the target object;
  • the information (lat1, lng1) and the coordinate information (lat5, lng5), (lat6, lng6), (lat7, lng7) of the above three wireless routers respectively calculate the spherical distance between the target object and each wireless router.
  • Step S2066 extracting a target object whose spherical distance is less than or equal to the position threshold and a wireless router having a mapping relationship to acquire at least one wireless router having a matching relationship with the target object.
  • the matching relationship between the restaurant and the wireless router is still obtained.
  • the set position threshold is 20 meters, and the spherical distance between Jingweizhai and Jwz and Q@fbzp is less than 20 meters, so that the distance from the Jingweizhai spherical surface can be obtained less than or equal to
  • the wireless routers with location thresholds are Jwz, Q@fbzp.
  • steps S2062 to S2066 are based on the target object, to obtain a wireless router having a mapping relationship with a certain target object and having a spherical distance smaller than a position threshold;
  • the wireless router can be used as a unit to acquire a target object that has a mapping relationship with a wireless router and whose spherical distance is smaller than the position threshold.
  • the above steps S2062 to S2066 of the present application provide an alternative solution for obtaining a mapping relationship between a target object and a group of wireless routers.
  • the location log of the wireless router further includes: a wireless router.
  • the name is obtained, in step S2066: after acquiring at least one wireless router having a matching relationship with the target object, the following implementation steps may also be performed:
  • Step S2072 Perform a first pre-processing on the target object name to generate a new target object name that satisfies the first predetermined format and/or the first predetermined content;
  • the first predetermined format is used to specify a format of the target object name, such as a language used by the target object name, a character type included in the target object name, a capitalization format of the letter when the target object is English, and the like;
  • the first predetermined content is used to specify a specific content of the target object name and a display manner thereof, for example, the target object name is a full spell or a first letter pinyin, the target object name includes a shorthand manner when the English name is included, and the like.
  • the target object name is mostly the original business name that has not been modified, it may contain Chinese, English, numbers, pictures, special characters, etc., and the name of the wireless router. Usually letters, and the name of the wireless router is usually set according to the target object name, which has a high degree of recognition. Therefore, the first pre-processing of the name of the target object is required, so that the new target pair after processing The image name can correspond to the wireless router name to further ensure the accuracy of the matching relationship.
  • the first preprocessing can be done using the open source java project pinyin4j.
  • the first pre-processing is used to convert the Chinese name of the target object to the full spell or initials of the Chinese name.
  • the first preprocessing can be done using the open source java project pinyin4j.
  • Step S2074 performing a second pre-processing on the wireless router name of the at least one wireless router having a matching relationship with the target object, and generating a new wireless router name that satisfies the second predetermined format and/or the second predetermined content.
  • the second predetermined format and/or the second predetermined content may be consistent with the specification of the first predetermined format and/or the first predetermined content, or may be slightly adjusted.
  • the name of the wireless router is usually set for the consumer to identify based on the target object name, the same target object may set up multiple wireless routers or transmit multiple wireless networks through a wireless router. In this case, the wireless router The name will have numbers or special characters that are not associated with the original name of the target object and are only used to distinguish between wireless networks. Therefore, the first pre-processing of the name of the wireless router is also required, so that the processed new wireless router name can correspond to the target object name to further ensure the accuracy of the matching relationship.
  • the second pre-processing is used to identify the category of the wireless router name as the full object of the target object, the initial letter of Pinyin or the corresponding English, if it can be identified or can be pre-stored In the stored database, the category of the wireless router name is uniquely determined, and the wireless router name correspondence is converted into the same category as the first predetermined content.
  • the new target object name is P2 (jwz)
  • the wireless router names matching the Jingweizhai are Jwz and Q respectively.
  • the new wireless router names after the second pre-processing are jwz, qfbzp.
  • Step S2076 Perform filtering processing on at least one wireless router having a matching relationship with the target object according to the new target object name and the new wireless router name, to obtain a wireless router that matches the target object that meets the preset condition.
  • the preset condition may be that the new target object name is identical to the new wireless router name, and/or the new target object name and the new wireless router name are similar to the similarity threshold.
  • the new target object name is compared with the new wireless router name having a matching relationship, and if the new target object name is completely consistent with the new wireless router name, it is determined that the wireless router matches the target object.
  • the new target object name is compared with the new wireless router name with a matching relationship. If the new target object name is not completely consistent with the new wireless router name, the new target object name is calculated to be similar to the new wireless router name. Degree, if the new target object name and the new wireless router name similarity reach the similarity threshold, it is determined that the wireless router matches the target object.
  • determining the similarity between the name of the new target object and the name of the new wireless router may be performed by calculating a distance edited by the two strings.
  • the string edit distance between the new target object name and the new wireless router name is less than or equal to the edit distance threshold, If the number of characters of the new target object name is greater than the number of characters threshold, the new target object name is considered to be similar to the new wireless router name to a similarity threshold, and the wireless router is determined to match the target object.
  • the preset condition is that the new target object name is exactly the same as the new wireless router name, and the string edit distance of the new target object name and the new wireless router name is 1 and new.
  • the number of wireless router name characters is greater than or equal to 5.
  • the new target object name is jwz
  • the new wireless router name is jwz
  • qfbzp contrast screening the wireless router matching the target object that meets the preset condition is jwz.
  • the above steps S2072 to S2076 of the present application provide an alternative of acquiring at least one wireless router having a matching relationship with the target object. Based on the above steps S2072 and S2074, the processing of the target object name and the wireless router name is completed, and the wireless router matching the target object is obtained through the screening of step S2076, thereby further improving the matching accuracy of the target object and the wireless router. .
  • the acquiring target object set is included in step S206: After the matching relationship between the target object and the wireless router included in the wireless routing device set, the following implementation steps can also be performed:
  • Step S208 When the same wireless router has a matching relationship with multiple target objects, the target object closest to the wireless router is read.
  • the above step S208 provided by the present application determines whether the distance between the wireless router and the target object, for example, the spherical distance, avoids whether the wireless router matching the target object is acquired regardless of the target object, or the wireless router is used as the unit.
  • the wireless router may be attributed to multiple target objects when acquiring a target object that matches a wireless router.
  • FIG. 3 is a flowchart of an optional method for acquiring a matching relationship between data according to an embodiment of the present application. The following describes the functions implemented by the application scenario in the application scenario in conjunction with FIG. 3:
  • Step A Format the conversion to generate a location log of the wireless router.
  • the mobile terminal acquires the routing information of the surrounding wireless router, and combines the location information when the mobile terminal acquires the routing information to generate a network information log of the mobile terminal.
  • the network information log is formatted and converted into a positioning log of the wireless router with the wireless router as the recording unit, wherein the positioning log includes the identification information, location information and signal strength of the wireless router.
  • Step B A polymerization treatment is carried out to obtain an polymerization result including signal intensity.
  • step B of the present application based on the identification information of the wireless router, the data with the same wireless router identification information in the location log is aggregated to form an aggregation result including the wireless router identifier, the wireless router location information, and the wireless router signal strength. .
  • Step C Determine whether the signal strength is greater than or equal to the filtering threshold.
  • step C of the present application when the signal strength is weak to a certain extent, the reliability of the entire piece of data (especially the position information) corresponding thereto is low.
  • the filtering threshold By setting the filtering threshold, the relationship between the signal strength and the filtering threshold is determined, and then it is possible to determine whether the entire data corresponding to the signal strength is reliable.
  • Step D Discard the wireless router.
  • step D of the present application when the signal strength is less than the filtering threshold, it is determined that the entire information corresponding to the signal strength is unreliable, and the related log corresponding to the signal strength less than the filtering threshold in the wireless router is deleted from the aggregation result. .
  • Step E Get a valid log.
  • step E of the present application when the signal strength is greater than or equal to the filtering threshold, it is determined that the positioning log corresponding to the signal strength greater than or equal to the filtering threshold in the wireless router is a valid log.
  • Step F Perform clustering processing to generate cluster clusters.
  • a density-based clustering algorithm DBSCAN is selected, and the positioning coordinates of the wireless router are clustered to generate cluster clusters.
  • Step G Determine whether the number of cluster clusters does not exceed a preset threshold.
  • the running status of the wireless router can be inferred by determining the number of clusters of the wireless router.
  • Step H Determine whether the cluster cluster center distance is less than or equal to the distance threshold.
  • step H of the present application if the number of cluster clusters of one wireless router exceeds a preset threshold, it is further determined whether the center distance of any two cluster clusters is less than or equal to the distance threshold.
  • the number of positioning coordinates in each cluster of the wireless router may be obtained first, and the clusters are sorted according to the number of positioning coordinates in the cluster, and then the order of clusters in the sorting is followed. , in turn, determine the distance between the center points of the two cluster clusters. It is also possible to calculate the center distance of two clusters of arbitrary or random.
  • Step I Discard the wireless router.
  • step I of the present application when it is determined that the center distance of two cluster clusters is greater than the distance threshold, it is considered that the location log of the wireless router has an error, or the location of the wireless router has changed.
  • the location log of the wireless router needs to be re-acquired, so it can be determined that the wireless router is an invalid router.
  • Step J The coordinates of the center point of the cluster with the largest number of clusters are assigned to the effective wireless router.
  • step J of the present application there are two cases as follows:
  • Case 1 In the case that the number of cluster clusters of a wireless router does not exceed a preset threshold, the wireless router can be directly determined to be an effective wireless router, and the center point coordinates of the cluster cluster are assigned to the effective wireless router. Of course, according to the number of clusters in the cluster, when the number of the clusters is less than the trusted threshold, the wireless router is determined to be an invalid wireless router.
  • Case 2 When the number of cluster clusters of a wireless router exceeds a preset threshold and the center distance of any two cluster clusters is less than or equal to the distance threshold, determine that the wireless router is an effective wireless router and read the effective wireless The cluster cluster with the largest number of clusters in the router assigns the coordinates of the center point of the cluster with the largest number of clusters to the effective wireless router.
  • Step K Determine whether it is Chinese.
  • Step L Discard the target object name.
  • step L of the present application when the machine translation result is not ideal, the target object whose part or all of the name is non-Chinese may be directly discarded.
  • Step M The first pre-processing, obtaining new target object names P1 (full spell) and P2 (first letter pinyin).
  • steps K to M are the processing of the name of the target object. It should be noted that the step K to the step M may be performed before the step N as a pre-processing of the target object name; or after the step N Execute again.
  • Step N Determine a group of wireless routers corresponding to the target object according to the location information of the target object and the location information of the wireless router.
  • step N of the present application according to the location information of the target object and the location information of the wireless router, it can be determined whether the target object and the wireless router are in the same, extremely close, or specific relationship to determine the target object.
  • Step O The second pre-processing results in a new wireless router name S.
  • the name is set to S.
  • Step Q Assign the wireless router to the target object.
  • Step S Discard the wireless router.
  • step S of the present application if the conditions in step P and step R are not satisfied, the wireless path is discarded. By the device.
  • Step T Filtering when the same wireless router belongs to multiple target objects.
  • step T of the present application for the possibility that the same wireless router may be given multiple target objects, it is necessary to filter by which cell the wireless router is closer to, and the distance calculation function still uses the spherical distance.
  • the method according to the above embodiment can be implemented by means of software plus a necessary general hardware platform, and of course, by hardware, but in many cases, the former is A better implementation.
  • the technical solution of the present invention which is essential or contributes to the prior art, may be embodied in the form of a software product stored in a storage medium (such as ROM/RAM, disk,
  • the optical disc includes a number of instructions for causing a terminal device (which may be a cell phone, a computer, a server, or a network device, etc.) to perform the methods described in various embodiments of the present invention.
  • an apparatus for implementing the foregoing method for acquiring a matching relationship between data includes: an obtaining module 402, a first reading module 404, and a processing module. 406;
  • the obtaining module 402 is configured to obtain log information of the target object included in the target object set and a positioning log of the wireless router included in the wireless routing device set.
  • the first reading module 404 is configured to read location information of any one or more target objects from the log information, and read location information of any one or more wireless routers from the location log;
  • the processing module 406 is configured to determine, according to location information of any one or more target objects and location information of any one or more wireless routers, a set of wireless routers corresponding to the target object, to obtain target objects included in the target object set. A match relationship with a wireless router included in a wireless routing device set.
  • the foregoing obtaining module 402, the first reading module 404, and the processing module 406 correspond to the steps S202 to S206 in the first embodiment, and the examples and application scenarios implemented by the three modules and corresponding steps. The same, but not limited to, the content disclosed in the above embodiment 1. It should be noted that the foregoing module may be implemented in the computer terminal 10 provided in the first embodiment as a part of the device, and may be implemented by software or by hardware. achieve.
  • the foregoing obtaining module 402, the first reading module 404, and the processing module 406 provided by the present application can automatically generate information from a large amount of complicated data according to the obtained log information of the target object and the location information in the positioning log of the wireless router. Identify the wireless router corresponding to each target object.
  • the target object information database for example, POI data
  • the wireless router designation data field is described, making it possible to perform joint analysis of the two databases.
  • the log information of the target object included in the target object set and the location log of the wireless router included in the wireless routing device set are obtained, and the log information and the log information are used.
  • the location information of the target object and the wireless router are respectively read in the location log, and the purpose of determining the correspondence between the target object and a group of wireless routers according to the location information of the target object and the wireless router is achieved, thereby realizing the acquisition of the target object set.
  • FIG. 5 is a schematic structural diagram of an optional obtaining module according to the embodiment shown in FIG. 4 of the present application; as shown in FIG. 5, the obtaining module 402 includes: an obtaining unit 502 and a converting unit 504, where:
  • the obtaining unit 502 is configured to obtain network log information of the mobile terminal included in the mobile terminal set, where the network log information includes at least the following data fields: location information of the mobile terminal and routing information of the wireless router accessed by the mobile terminal;
  • the converting unit 504 is configured to format and convert the network log information to generate a positioning log of any one or more wireless routers.
  • the positioning log of the wireless router includes at least the following data fields: identification information and location information.
  • the foregoing obtaining unit 502 and the converting unit 504 correspond to steps S2022 to S2024 in the first embodiment, and the two modules are the same as the examples and application scenarios implemented by the corresponding steps, but are not limited to the above implementation.
  • the foregoing module may be implemented in the computer terminal 10 provided in the first embodiment as a part of the device, and may be implemented by software or by hardware.
  • the foregoing obtaining unit 502 and the converting unit 504 provided by the embodiment of the present application provide an optional solution for acquiring a positioning log of a wireless router included in a wireless routing device set.
  • the acquisition and integration of the network log of the mobile terminal is implemented based on the above-mentioned obtaining unit 502, and the conversion from the common network log of the mobile terminal unit to the positioning log of the wireless router unit is realized by the conversion unit 504, so that the application according to the present application is implemented.
  • the device for obtaining the matching relationship between the data in the embodiment may directly obtain the location information of the wireless router from the location log of the wireless router. Information, and obtain the correspondence between the target object and the wireless router.
  • FIG. 6 is a schematic structural diagram of an apparatus for acquiring a matching relationship between data according to the embodiment shown in FIG. 5 of the present application. As shown in FIG. 6, the data is obtained according to an embodiment of the present application.
  • the device for matching the relationship further includes: a first processing module 602 and a first screening module 604, wherein:
  • the first processing module 602 is configured to perform aggregation processing on the wireless routers in the wireless routing device set according to the identifier information of the wireless router, to generate an aggregation result of any one or more wireless routers in the wireless routing device set, where the aggregation result includes : the signal strength of the wireless router;
  • the first screening module 604 is configured to filter the aggregation result by using a preset filtering threshold to determine a valid log in the positioning log of any one or more wireless routers, where the effective log is a wireless router whose signal strength is greater than or equal to the filtering threshold. Log.
  • the foregoing first processing module 602 and the first screening module 604 correspond to steps S2032 to S2034 in the first embodiment, and the two modules are the same as the examples and application scenarios implemented by the corresponding steps, but It is not limited to the contents disclosed in the above embodiment 1. It should be noted that the foregoing module may be implemented in the computer terminal 10 provided in the first embodiment as a part of the device, and may be implemented by software or by hardware.
  • the foregoing first processing module 602 and the first screening module 604 provided by the embodiments of the present application provide an alternative for performing screening processing on the location log.
  • the positioning logs are aggregated according to the identifier information of the wireless router, and the aggregation result of each wireless router is generated, and then the aggregation result is filtered by the first screening module 604, and the reliable data in the aggregation result is retained.
  • a valid log is obtained, which further filters the location log, especially when the amount of information in the location log is large, which can simplify the data and ensure the reliability of the data.
  • the location log of the wireless router further includes: a positioning coordinate of the wireless router
  • FIG. 7 is a schematic structural diagram of an apparatus for acquiring a matching relationship between data according to the embodiment shown in FIG. 6 of the present application, such as As shown in FIG. 7, the apparatus for acquiring a matching relationship between data according to an embodiment of the present application further includes: a second processing module 702 and a second screening module 704, where:
  • the second processing module 702 is configured to cluster the positioning coordinates of any one or more wireless routers by using preset conditions, and acquire clusters of any one or more wireless routers, where the wireless router generates at least one cluster of clusters. ;
  • the second screening module 704 is configured to filter the wireless routers in the wireless routing device set according to the number of clustering clusters of the wireless router.
  • the foregoing second processing module 702 and the second screening module 704 correspond to the embodiment.
  • the two modules are the same as the examples and application scenarios implemented by the corresponding steps, but are not limited to the contents disclosed in the first embodiment.
  • the foregoing module may be implemented in the computer terminal 10 provided in the first embodiment as a part of the device, and may be implemented by software or by hardware.
  • the foregoing second processing module 702 and the second screening module 704 provided by the embodiments of the present application implement screening of the wireless routers in the wireless routing device set by determining the state of each wireless router.
  • FIG. 8 is a schematic structural diagram of an optional second screening module according to the embodiment shown in FIG. 7 of the present application; as shown in FIG. 8, the second screening module 704 includes: a first computing unit 800, Two computing unit 802, first processing unit 804, second processing unit 806, and third processing unit 808, wherein:
  • a first calculating unit 800 configured to calculate a center point coordinate of each cluster of the wireless router
  • the second calculating unit 802 is configured to calculate any two clusters of the wireless router by using the center point coordinates of any two cluster clusters of the wireless router if the number of cluster clusters of the wireless router exceeds a preset threshold.
  • the first processing unit 804 is configured to determine that the wireless router is an effective wireless router when a center distance of the wireless router is less than or equal to a distance threshold;
  • the second processing unit 806 is configured to reserve an effective wireless router in the set of wireless routing devices, and read a cluster of clusters having the largest number of clusters in the effective wireless router;
  • the third processing unit 808 is configured to assign a center point coordinate of the cluster cluster with the largest number of clusters to the effective wireless router.
  • first calculating unit 800, second calculating unit 802, first processing unit 804, second processing unit 806, and third processing unit 808 correspond to step S20380 to step S20388 in the first embodiment.
  • the five modules are the same as the examples and application scenarios implemented by the corresponding steps, but are not limited to the contents disclosed in the above embodiment 1. It should be noted that the foregoing module may be implemented in the computer terminal 10 provided in the first embodiment as a part of the device, and may be implemented by software or by hardware.
  • the foregoing second calculating unit 802 and the first processing unit 804 provided by the embodiment of the present application implement determining whether the wireless router is valid.
  • the positioning information of the wireless router can be discriminated by the above steps, and the correspondence between the target object and the wireless router caused by the error of the positioning information in the positioning log is avoided. mistake.
  • the wireless router may be acquired by a large number of wireless terminals during the mobile process. Through the above steps, the wireless router may be screened to avoid the mobile wireless router. An error in the correspondence between the target object and the wireless router caused by the positioning.
  • the first computing unit 800, the second computing unit 802, the first processing unit 804, the second processing unit 806, and the third processing unit 808 provided by the embodiments of the present application provide a wireless in the wireless routing device set.
  • An alternative for routers to filter It not only realizes the judgment of whether the wireless router is effective, but also extracts the effective wireless router and assigns the optimal position coordinates to the effective wireless router, and finally achieves the precise screening of the wireless router in the wireless router device set and assigns the filtered wireless router. The technical effect of the optimal position coordinates.
  • the log information of the target object includes at least: coordinate information of the target object
  • the location log of the wireless router includes at least: coordinate information of the wireless router
  • FIG. 9 is an optional according to the embodiment shown in FIG. 4 of the present application.
  • Schematic diagram of the processing module as shown in FIG. 9, the processing module 406 includes: a matching unit 902, a third calculating unit 904, and an extracting unit 906, where:
  • the matching unit 902 is configured to match the location information of the target object and the location information of the wireless router as keywords, and acquire at least one wireless router that has a mapping relationship with the target object;
  • a third calculating unit 904 configured to calculate, according to coordinate information of the target object, and coordinate information of the at least one wireless router that has a mapping relationship with the target object, a spherical distance between the target object and any one of the wireless routers having a mapping relationship;
  • the extracting unit 906 is configured to extract a target object whose spherical distance is less than or equal to the position threshold and a wireless router having a mapping relationship to acquire at least one wireless router having a matching relationship with the target object.
  • the foregoing matching unit 902, the third calculating unit 904, and the extracting unit 906 correspond to steps S2062 to S2066 in the first embodiment, and the three modules are the same as the examples and application scenarios implemented by the corresponding steps. However, it is not limited to the content disclosed in the first embodiment above. It should be noted that the foregoing module may be implemented in the computer terminal 10 provided in the first embodiment as a part of the device, and may be implemented by software or by hardware.
  • the foregoing matching unit 902, the third calculating unit 904, and the extracting unit 906 provided by the embodiments of the present application provide an alternative solution for acquiring a mapping relationship between a target object and a group of wireless routers.
  • the third computing unit 904 Based on the matching unit 902 acquiring the target object with the same location information and the wireless router, the third computing unit 904 calculates the spherical distance of each wireless router whose target object is the same as the location information, and via the spherical distance and the position threshold in the extracting unit 906. It is determined that the target object whose spherical distance is less than or equal to the position threshold and the wireless router with the mapping relationship are extracted to establish the matching relationship between the target object and the wireless router.
  • the log information of the target object includes at least the following data fields: a target object name, target object coordinate information, and target object location information
  • the location log of the wireless router further includes: a wireless router name.
  • FIG. 10 is a schematic structural diagram of an apparatus for acquiring a matching relationship between data according to the embodiment shown in FIG. 9 of the present application; As shown in FIG. 10, the apparatus for acquiring a matching relationship between data according to an embodiment of the present application further includes: a first pre-processing module 1002, a second pre-processing module 1004, and a third screening module 1006, wherein:
  • a first pre-processing module 1002 configured to perform a first pre-processing on the target object name, and generate a new target object name that satisfies the first predetermined format and/or the first predetermined content;
  • the second pre-processing module 1004 performs a second pre-processing on the wireless router name of the at least one wireless router that has a matching relationship with the target object, and generates a new wireless router name that satisfies the second predetermined format and/or the second predetermined content.
  • the third screening module 1006 is configured to perform screening processing on the at least one wireless router that has a matching relationship with the target object according to the new target object name and the new wireless router name, to obtain a wireless router that matches the target object that meets the preset condition.
  • the foregoing first pre-processing module 1002, the second pre-processing module 1004, and the third screening module 1006 correspond to steps S2072 to S2076 in the first embodiment, and the three modules and corresponding steps are implemented.
  • the example is the same as the application scenario, but is not limited to the content disclosed in the first embodiment.
  • the foregoing module may be implemented in the computer terminal 10 provided in the first embodiment as a part of the device, and may be implemented by software or by hardware.
  • the first pre-processing module 1002, the second pre-processing module 1004, and the third screening module 1006 provided by the embodiment of the present application complete the target object name and the wireless router based on the first pre-processing module 1002 and the second pre-processing module 1004.
  • the processing of the name, and through the screening of the third screening module 1006, obtains a wireless router that matches the target object, further improving the matching accuracy of the target object and the wireless router.
  • FIG. 11 is a schematic structural diagram of an apparatus for acquiring a matching relationship between data according to the embodiment shown in FIG. 4 of the present application. As shown in FIG. 11, the data is obtained according to an embodiment of the present application.
  • the device for matching the relationship further includes: a second reading module 1102, wherein the second reading module 1102 is configured to read the distance from the wireless router when the same wireless router has a matching relationship with multiple target objects. The closest target object.
  • the foregoing second reading module 1102 corresponds to the step S208 in the first embodiment, and the module is the same as the example and the application scenario implemented by the corresponding steps, but is not limited to the content disclosed in the first embodiment. . It should be noted that the foregoing module may be implemented in the computer terminal 10 provided in the first embodiment as a part of the device, and may be implemented by software or by hardware.
  • the foregoing second reading module 1102 avoids the distance between the wireless router and the target object, for example, the spherical distance, and avoids acquiring the wireless router matching the target object regardless of the target object.
  • Embodiment 2 of the present application is the same as the implementation solution and the application scenario provided in Embodiment 1, but is not limited to the solution provided in Embodiment 1.
  • Embodiments of the present invention may provide a computer terminal, which may be any one of computer terminal groups.
  • the foregoing computer terminal may also be replaced with a terminal device such as a mobile terminal.
  • the computer terminal may be located in at least one network device of the plurality of network devices of the computer network.
  • the computer terminal may execute the program code of the following steps in the vulnerability detection method of the application: acquiring log information of the target object included in the target object set and a positioning log of the wireless router included in the wireless routing device set; Reading location information of any one or more target objects from the log information, and reading location information of any one or more wireless routers from the location log; according to location information of any one or more target objects and any one or The location information of the plurality of wireless routers determines a group of wireless routers corresponding to the target object to obtain a matching relationship between the target object included in the target object set and the wireless router included in the wireless routing device set.
  • FIG. 12 is a structural block diagram of a computer terminal according to an embodiment of the present invention.
  • the computer terminal A may include one or more (only one shown in the figure) processor 51, memory 53, and transmission device 55.
  • the memory 53 can be used to store software programs and modules, such as the security vulnerability detection method and the program instruction/module corresponding to the device in the embodiment of the present invention, and the processor 51 executes by executing the software program and the module stored in the memory 53.
  • Software programs and modules such as the security vulnerability detection method and the program instruction/module corresponding to the device in the embodiment of the present invention
  • the processor 51 executes by executing the software program and the module stored in the memory 53.
  • Various functional applications and data processing that is, detection methods for implementing the aforementioned system vulnerability attacks.
  • Memory 53 may include high speed random access memory and may also include non-volatile memory such as one or more magnetic storage devices, flash memory, or other non-volatile solid state memory.
  • memory 53 may further include memory remotely located relative to processor 51, which may be connected to terminal A via a network. Examples of such networks include, but are not limited to, the Internet, intranets, local area networks, mobile communication networks, and combinations thereof.
  • the transmission device 55 described above is for receiving or transmitting data via a network.
  • Specific examples of the above network may include a wired network and a wireless network.
  • the transmission device 55 includes a Network Interface Controller (NIC) that can be connected to other network devices and routers via a network cable to communicate with the Internet or a local area network.
  • the transmission device 55 is a radio frequency (RF) module. It is used to communicate wirelessly with the Internet.
  • NIC Network Interface Controller
  • RF radio frequency
  • the memory 53 is configured to store preset action conditions and information of the preset rights user, and an application.
  • the processor 51 may call the information and the application stored by the memory 53 through the transmission device to perform the following steps: acquiring log information of the target object included in the target object set and a positioning log of the wireless router included in the wireless routing device set; Reading the location information of any one or more target objects in the log information, and reading the location information of any one or more wireless routers from the location log; according to the location information of any one or more target objects and any one or more The location information of the wireless routers determines a set of wireless routers corresponding to the target object to obtain a matching relationship between the target objects included in the target object set and the wireless routers included in the wireless routing device set.
  • the processor 51 may further execute the following steps: acquiring network log information of the mobile terminal included in the mobile terminal set, where the network log information includes at least the following data fields: location information of the mobile terminal and the mobile terminal. Routing information of the accessed wireless router; formatting and converting the network log information to generate a positioning log of any one or more wireless routers, and the positioning log of the wireless router includes at least the following data fields: identification information and location information.
  • the processor 51 may further execute the following program code: perform aggregation processing on the wireless routers in the wireless routing device set according to the identifier information of the wireless router, and generate any one or more wireless routers in the wireless routing device set.
  • the result of the aggregation wherein the aggregation result includes: the signal strength of the wireless router; the aggregation result is filtered by using a preset filtering threshold to determine a valid log in the location log of any one or more wireless routers, and the effective log is greater than the signal strength A positioning log of the wireless router equal to the filtering threshold.
  • the processor 51 may further execute the following steps: clustering the positioning coordinates of any one or more wireless routers by using preset conditions, and acquiring clusters of any one or more wireless routers, where The wireless router generates at least one cluster of clusters; and filters the wireless routers in the set of wireless routing devices according to the number of clusters of the wireless routers.
  • the processor 51 may further execute the following steps: calculating a center point coordinate of each cluster of the wireless router; and using wireless when the number of clusters of the wireless router exceeds a preset threshold The center point coordinates of any two clusters of the router are calculated, and the center distance of any two clusters of the wireless router is calculated; when the center distance of the wireless router is less than or equal to the distance threshold, the wireless router is determined to be an effective wireless router; Routing the effective wireless routers in the device set, and reading the clusters with the largest number of clusters in the effective wireless router; assigning the center point coordinates of the clusters with the largest number of clusters to the effective wireless router.
  • the processor 51 may further execute the following steps: matching the location information of the target object and the location information of the wireless router as keywords, and acquiring at least one wireless router that has a mapping relationship with the target object; The coordinate information of the object and the coordinate information of at least one wireless router having a mapping relationship with the target object, and calculating a spherical distance between the target object and any wireless router having a mapping relationship; extracting a target object whose spherical distance is less than or equal to the position threshold And a wireless router having a mapping relationship to obtain at least one wireless router having a matching relationship with the target object.
  • the processor 51 may further execute the following steps: performing a first pre-processing on the target object name, generating a new target object name that satisfies the first predetermined format and/or the first predetermined content; and the target object
  • the wireless router name of the at least one wireless router having the matching relationship performs a second pre-processing to generate a new wireless router name that satisfies the second predetermined format and/or the second predetermined content; according to the new target object name and the new wireless router name,
  • the at least one wireless router that has the matching relationship of the target object performs filtering processing to obtain a wireless router that matches the target object that meets the preset condition.
  • the processor 51 may further execute the following program code: when the same wireless router has a matching relationship with multiple target objects, read the target object that is closest to the wireless router.
  • a solution for obtaining a matching relationship between data is provided.
  • the purpose of the matching relationship between the target object included in the target object set and the wireless router included in the wireless routing device set thereby solving the method for manually acquiring the correspondence between the target object and the mobile terminal in the prior art, The technical problem of inaccurate and costly relationship between the acquired target object and the wireless network.
  • FIG. 10 is merely illustrative, and the computer terminal can also be a smart phone (such as an Android mobile phone, an iOS mobile phone, etc.), a tablet computer, an applause computer, and a mobile Internet device (Mobile Internet Devices, MID). ), PAD and other terminal devices.
  • FIG. 10 does not limit the structure of the above electronic device.
  • computer terminal 10 may also include more or fewer components (such as a network interface, display device, etc.) than shown in FIG. 10, or have a different configuration than that shown in FIG.
  • the storage medium may include a flash disk, a read-only memory (ROM), a random access memory (RAM), a magnetic disk or an optical disk, and the like.
  • Embodiments of the present invention also provide a storage medium.
  • the foregoing storage medium may be used to save the program code executed by the method for obtaining the matching relationship between the data provided in the foregoing Embodiment 1.
  • the foregoing storage medium may be located in any one of the computer terminal groups in the computer network, or in any one of the mobile terminal groups.
  • the storage medium is configured to store program code for performing the following steps: acquiring log information of the target object included in the target object set and positioning logs of the wireless router included in the wireless routing device set. Reading location information of any one or more target objects from the log information, and reading location information of any one or more wireless routers from the location log; according to location information of any one or more target objects and any one Or the location information of the multiple wireless routers, determining a set of wireless routers corresponding to the target object, to obtain a matching relationship between the target object included in the target object set and the wireless router included in the wireless routing device set.
  • the storage medium is further configured to store program code for performing the following steps: the processor 51 may further execute the following program code: acquire a network of the mobile terminal included in the mobile terminal set.
  • Log information where the network log information includes at least the following data fields: location information of the mobile terminal and routing information of the wireless router accessed by the mobile terminal; formatting and converting the network log information to generate positioning of any one or more wireless routers
  • the log, the location log of the wireless router includes at least the following data fields: identification information and location information.
  • the storage medium is further configured to store program code for performing the following steps: performing aggregation processing on the wireless routers in the wireless routing device set according to the identification information of the wireless router, and generating a wireless routing device set.
  • the valid log is the location log of the wireless router whose signal strength is greater than or equal to the filtering threshold.
  • the storage medium is further configured to store program code for performing the following steps: clustering the positioning coordinates of any one or more wireless routers using preset conditions, and acquiring any one or more A cluster of wireless routers, wherein the wireless router generates at least one cluster of clusters; and filters the wireless routers in the set of wireless routing devices according to the number of clusters of the wireless routers.
  • the storage medium is further configured to store program code for performing the following steps: calculating a center point coordinate of each cluster of the wireless router; the number of cluster clusters in the wireless router exceeds In the case of a preset threshold, the center distance of any two clusters of the wireless router is calculated using the coordinates of the center points of any two clusters of the wireless router; when the center distance of the wireless router is less than or equal to the distance threshold, it is determined
  • the wireless router is an effective wireless router; the effective wireless router in the set of wireless routing devices is reserved, and the clusters with the largest number of clusters in the effective wireless router are read; the coordinates of the center points of the clusters with the largest number of clusters are assigned Give an effective wireless router.
  • the storage medium is further configured to store program code for performing the following steps: matching location information of the target object and location information of the wireless router as keywords, and obtaining mapping with the target object At least one wireless router of the relationship; calculating a spherical distance between the target object and any one of the wireless routers having a mapping relationship according to the coordinate information of the target object and the coordinate information of the at least one wireless router having a mapping relationship with the target object; The target object whose spherical distance is less than or equal to the position threshold and the wireless router having the mapping relationship acquire at least one wireless router having a matching relationship with the target object.
  • the storage medium is further configured to store program code for performing the following steps: performing a first pre-processing on the target object name to generate a first predetermined format and/or a first predetermined content. a new target object name; performing a second pre-processing on the wireless router name of the at least one wireless router having a matching relationship with the target object, generating a new wireless router name that satisfies the second predetermined format and/or the second predetermined content; according to the new target object
  • the name and the new wireless router name are filtered by at least one wireless router having a matching relationship with the target object, and a wireless router matching the target object that satisfies the preset condition is obtained.
  • the storage medium is further configured to store program code for performing the following steps: when the same wireless router has a matching relationship with multiple target objects, the reading is closest to the wireless router. Target object.
  • any one of the above computer terminal groups can establish a communication relationship with the website server and the scanner, and the scanner can scan the value command of the web application executed by php on the computer terminal.
  • the disclosed client may be implemented in other manners.
  • the device embodiments described above are merely illustrative.
  • the division of the unit is only a logical function division.
  • there may be another division manner for example, multiple units or components may be combined.
  • the integration can be integrated into another system, or some features can be ignored or not executed.
  • the mutual coupling or direct coupling or communication connection shown or discussed may be an indirect coupling or communication connection through some interface, unit or module, and may be electrical or otherwise.
  • the units described as separate components may or may not be physically separated, and the components displayed as units may or may not be physical units, that is, may be located in one place, or may be distributed to multiple network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution of the embodiment.
  • each functional unit in each embodiment of the present invention may be integrated into one processing unit, or each unit may exist physically separately, or two or more units may be integrated into one unit.
  • the above integrated unit can be implemented in the form of hardware or in the form of a software functional unit.
  • the integrated unit if implemented in the form of a software functional unit and sold or used as a standalone product, may be stored in a computer readable storage medium.
  • the technical solution of the present invention which is essential or contributes to the prior art, or all or part of the technical solution, may be embodied in the form of a software product stored in a storage medium.
  • a number of instructions are included to cause a computer device (which may be a personal computer, server or network device, etc.) to perform all or part of the steps of the methods described in various embodiments of the present invention.
  • the foregoing storage medium includes: a U disk, a Read-Only Memory (ROM), a Random Access Memory (RAM), a removable hard disk, a magnetic disk, or an optical disk, and the like. .

Abstract

本发明公开一种获取数据之间的匹配关系的方法和装置。该方法包括:获取目标对象集合中包含的目标对象的日志信息和无线路由设备集合中包含的无线路由器的定位日志;从日志信息中读取任意一个或多个目标对象的位置信息,并从定位日志中读取任意一个或多个无线路由器的位置信息;根据任意一个或多个目标对象的位置信息和任意一个或多个无线路由器的位置信息,确定目标对象所对应的一组无线路由器,以获取目标对象集合中包含的目标对象与无线路由设备集合中包含的无线路由器之间的匹配关系。本发明解决了由于现有技术中采用人工方式获取目标对象与移动终端的对应关系的方法,导致获取到的目标对象和无线网络之间的关系不准确且成本高的技术问题。

Description

获取数据之间的匹配关系的方法和装置
本申请要求2015年06月29日递交的申请号为201510370088.0、发明名称为“获取数据之间的匹配关系的方法和装置”的中国专利申请的优先权,其全部内容通过引用结合在本申请中。
技术领域
本发明涉及数据处理领域,具体而言,涉及一种获取数据之间的匹配关系的方法和装置。
背景技术
中国信息点(Point of Interest,POI)数据库中包含全国各地区的POI数据,数据主要包含四方面内容:名称、类别、经纬度、附近的酒店、饭店、商铺等信息。以酒店、饭店、商铺等作为目标对象,可以从POI数据库中获取目标对象的日志信息,该日志信息可以涵盖如下几个方面的内容:目标对象的名称、目标对象的坐标信息(例如目标对象的经纬度信息)、目标对象的地址(例如目标对象所在街道)、目标对象的位置信息(例如目标对象所在的城市和行政区域)。
为了给消费者提供更好的消费体验,大多数酒店、饭店、商铺等目标对象会对外提供例如WiFi的无线网络,以满足消费者的网络需求。消费者在经过或者进入目标对象时,其所持有的移动终端会所搜索并记录周边区域的无线网络,其所记录的无线网络的日志信息基本包括如下几方面的内容:移动终端唯一标识、移动终端的坐标信息(例如移动终端的经纬度信息)、移动终端的位置信息(例如移动终端所在的城市和行政区域)、移动终端搜索到的无线网络的标识和该无线网络信号的强度信息等。
获取目标对象和无线网络的对应关系,并以此进行统计分析,可以获取例如消费者的消费情况、消费偏好、或商铺的经营状况等极具商业价值的分析数据。例如,以目标对象作为分析对象,如果得知该目标对象对应的无线网络,便可以通过查询移动终端记录的无线网络的日志信息,获知该目标对象在不同时间段内的人流情况,还可以获知曾连接过该无线网络的消费者的信息,并根据该消费者的社交关系网,深入分析该目标对象的消费人群分布或向该消费者的好友自动推荐该目标对象。又例如,以消费者作为分析对象,可以获取到消费者曾连接过的无线网络,此时,如果得知这些无线网络对应的 目标对象,就可以分析出该消费者在这些目标对象的逗留时间或消费者光临这些目标对象的频率/次数,还可以通过对目标对象进行整合,分析消费者的消费喜好,还可以根据分析结果向消费者推荐相似的目标对象。
现有技术中基本上是通过与目标对象主动合作的方式来协助目标对象部署无线网络(例如,小米WiFi、360WiFi等),以此获取比较准确的目标对象和无线网络的对应关系。然而,现有技术中的这种获取方式经济成本高,需要耗费大量人力、物力和财力,而且在要获取的目标对象的数量巨大时,为了获取对应关系所耗费的时间成本也相当高,获取大量目标对象与无线网络的匹配关系难度极大。
由于现有技术中采用人工方式获取目标对象与移动终端的对应关系的方法,导致获取到的目标对象和无线网络之间的关系不准确且成本高的问题,目前尚未提出有效的解决方案。
发明内容
本发明实施例提供了一种获取数据之间的匹配关系的方法和装置,以至少解决由于现有技术中采用人工方式获取目标对象与移动终端的对应关系的方法,导致获取到的目标对象和无线网络之间的关系不准确且成本高的技术问题。
根据本发明实施例的一个方面,提供了一种获取数据之间的匹配关系的方法,包括:获取目标对象集合中包含的目标对象的日志信息和无线路由设备集合中包含的无线路由器的定位日志;从日志信息中读取任意一个或多个目标对象的位置信息,并从定位日志中读取任意一个或多个无线路由器的位置信息;根据任意一个或多个目标对象的位置信息和任意一个或多个无线路由器的位置信息,确定目标对象所对应的一组无线路由器,以获取目标对象集合中包含的目标对象与无线路由设备集合中包含的无线路由器之间的匹配关系。
根据本发明实施例的另一方面,还提供了一种获取数据之间的匹配关系的装置,包括:获取模块,用于获取目标对象集合中包含的目标对象的日志信息和无线路由设备集合中包含的无线路由器的定位日志;读取模块,用于从日志信息中读取任意一个或多个目标对象的位置信息,并从定位日志中读取任意一个或多个无线路由器的位置信息;处理模块,用于根据任意一个或多个目标对象的位置信息和任意一个或多个无线路由器的位置信息,确定目标对象所对应的一组无线路由器,以获取目标对象集合中包含的目标对象与无线路由设备集合中包含的无线路由器之间的匹配关系。
在本发明实施例中,采用获取目标对象集合中包含的目标对象的日志信息和无线路由设备集合中包含的无线路由器的定位日志的方式,通过从日志信息和定位日志中分别读取目标对象和无线路由器的位置信息,达到了根据目标对象和无线路由器的位置信息确定目标对象与一组无线路由器之间的对应关系的目的,从而实现了获取目标对象集合中包含的目标对象与无线路由设备集合中包含的无线路由器之间的匹配关系的技术效果,进而解决了由于现有技术中采用人工方式获取目标对象与移动终端的对应关系的方法,导致获取到的目标对象和无线网络之间的关系不准确且成本高的技术问题。
附图说明
此处所说明的附图用来提供对本发明的进一步理解,构成本申请的一部分,本发明的示意性实施例及其说明用于解释本发明,并不构成对本发明的不当限定。在附图中:
图1是根据本申请实施例一的一种获取数据之间的匹配关系的方法的计算机终端的硬件结构框图;
图2是根据本申请实施例一的获取数据之间的匹配关系的方法的流程示意图;
图3是根据本申请实施例一的一种可选的获取数据之间的匹配关系的方法的流程图;
图4是根据本申请实施例二的获取数据之间的匹配关系的装置的结构示意图;
图5是根据本申请图4所示实施例的一种可选的获取模块的结构示意图;
图6是根据本申请图5所示实施例的一种可选的获取数据之间的匹配关系的装置的结构示意图;
图7是根据本申请图6所示实施例的一种可选的获取数据之间的匹配关系的装置的结构示意图;
图8是根据本申请图7所示实施例的一种可选的第二筛选模块的结构示意图;
图9是根据本申请图4所示实施例的一种可选的处理模块的结构示意图;
图10是根据本申请图9所示实施例的一种可选的获取数据之间的匹配关系的装置的结构示意图;
图11是根据本申请图4所示实施例的一种可选的获取数据之间的匹配关系的装置的结构示意图;以及
图12是根据本申请实施例的一种计算机终端的结构框图。
具体实施方式
为了使本技术领域的人员更好地理解本发明方案,下面将结合本发明实施例中的附图,对本发明实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例仅仅是本发明一部分的实施例,而不是全部的实施例。基于本发明中的实施例,本领域普通技术人员在没有做出创造性劳动前提下所获得的所有其他实施例,都应当属于本发明保护的范围。
需要说明的是,本发明的说明书和权利要求书及上述附图中的术语“第一”、“第二”等是用于区别类似的对象,而不必用于描述特定的顺序或先后次序。应该理解这样使用的数据在适当情况下可以互换,以便这里描述的本发明的实施例能够以除了在这里图示或描述的那些以外的顺序实施。此外,术语“包括”和“具有”以及他们的任何变形,意图在于覆盖不排他的包含,例如,包含了一系列步骤或单元的过程、方法、系统、产品或设备不必限于清楚地列出的那些步骤或单元,而是可包括没有清楚地列出的或对于这些过程、方法、产品或设备固有的其它步骤或单元。
下面对本申请涉及到的术语进行解释如下:
POI数据库,即中国信息点(Point of Interest,POI)数据库,该数据库中包含全国各地区的POI数据,数据主要包含四方面内容:名称、类别、经纬度、附近的酒店/饭店/商铺等信息。本申请以中国信息点数据库进行举例说明,本领域技术人员可以不经创造性劳动想到本申请也可以应用在国外各地区的POI数据。
移动终端的国际身份码(International Mobile Equipment Identity,IMEI)是由15位数字组成的电子串号,与每台移动设备一一对应,是移动设备在全世界的唯一识别码。
编辑距离(Edit Distance),又称Levenshtein距离,是指两个字符串之间,由一个转成另一个所需的最少编辑操作次数。许可的编辑操作包括将一个字符替换成另一个字符、插入一个字符、删除一个字符。
实施例1
根据本发明实施例,还提供了一种获取数据之间的匹配关系的方法实施例,需要说明的是,在附图的流程图示出的步骤可以在诸如一组计算机可执行指令的计算机系统中执行,并且,虽然在流程图中示出了逻辑顺序,但是在某些情况下,可以以不同于此处的顺序执行所示出或描述的步骤。
本申请实施例一所提供的方法实施例可以在移动终端、计算机终端或者类似的运算装置中执行。以运行在计算机终端上为例,图1是本发明实施例的一种获取数据之间的 匹配关系的方法的计算机终端的硬件结构框图。如图1所示,计算机终端10可以包括一个或多个(图中仅示出一个)处理器102(处理器102可以包括但不限于微处理器MCU或可编程逻辑器件FPGA等的处理装置)、用于存储数据的存储器104、以及用于通信功能的传输模块106。本领域普通技术人员可以理解,图1所示的结构仅为示意,其并不对上述电子装置的结构造成限定。例如,计算机终端10还可包括比图1中所示更多或者更少的组件,或者具有与图1所示不同的配置。
存储器104可用于存储应用软件的软件程序以及模块,如本发明实施例中的获取数据之间的匹配关系的方法对应的程序指令/模块,处理器102通过运行存储在存储器104内的软件程序以及模块,从而执行各种功能应用以及数据处理,即实现上述的应用程序的漏洞检测方法。存储器104可包括高速随机存储器,还可包括非易失性存储器,如一个或者多个磁性存储装置、闪存、或者其他非易失性固态存储器。在一些实例中,存储器104可进一步包括相对于处理器102远程设置的存储器,这些远程存储器可以通过网络连接至计算机终端10。上述网络的实例包括但不限于互联网、企业内部网、局域网、移动通信网及其组合。
传输装置106用于经由一个网络接收或者发送数据。上述的网络具体实例可包括计算机终端10的通信供应商提供的无线网络。在一个实例中,传输装置106包括一个网络适配器(Network Interface Controller,NIC),其可通过基站与其他网络设备相连从而可与互联网进行通讯。在一个实例中,传输装置106可以为射频(Radio Frequency,RF)模块,其用于通过无线方式与互联网进行通讯。
在上述运行环境下,本申请提供了如图2所示的获取数据之间的匹配关系的方法。图2是根据本发明实施例一的获取数据之间的匹配关系的方法的流程图。
如图2所示,一种可选的获取数据之间的匹配关系的方法包括如下实施步骤:
步骤S202:获取目标对象集合中包含的目标对象的日志信息和无线路由设备集合中包含的无线路由器的定位日志;
本申请上述步骤S202中,目标对象集合中包含至少一个目标对象。上述日志信息以目标对象为记录单元,其中任意一个目标对象的日志信息包含至少一种类别的数据字段。可选地,从记载着大量目标对象信息的原始数据库中,筛选出目标对象集合中包含的目标对象的指定类别的数据字段,加以整理后得到上述的目标对象集合中包含的目标对象的日志信息。
此处需要说明的是,记载着大量目标对象信息的原始数据库可以为POI数据库,或 者为二次处理整合后的数据库(例如高德地图的商铺数据库、大众点评的商铺数据库)。数据字段的类别可以至少包括如下任意一种:目标对象的名称、目标对象所属类别、目标对象的坐标信息、目标对象的地址、目标对象的位置信息,其中,目标对象的地址、目标对象的位置信息还可以由目标对象的坐标信息间接得到。
本申请上述步骤S202中,无线路由设备集合中包含至少一个无线路由器。上述定位日志以无线路由器为记录单元,其中任意一个无线路由器的定位日志包含至少一种类别的数据字段。上述无线路由设备集合中包含的无线路由器的定位日志中包含了无线路由设备集合中包含的无线路由器的指定数据字段的信息。
例如,以获取餐厅与无线路由器的匹配关系为例,目标对象集合中包含若干待匹配的餐厅,指定数据字段例如包括名称和经纬度坐标,从高德地图的餐厅数据库中提取所有待匹配餐厅的指定数据字段的信息,整理后得到了目标对象集合中包含的若干待匹配餐厅的日志信息。囿于篇幅限制,在本申请实施例中,从待匹配餐厅中随机抽取比如京味斋、鱼头泡饼、东北菜、庆丰包子铺这四家餐厅对本申请的方案予以详细说明。无线路由设备集合中包含若干待匹配的无线路由器,同样可以获取无线路由设备集合中包含的无线路由器的定位日志。囿于篇幅限制,在本申请实施例中,从待匹配无线路由器中抽取如下无线路由器为例对本申请的方案予以详细说明:Jwz、ytpb、dongbeicai、Q@fbzp、quan-ju-de。
步骤S204:从日志信息中读取任意一个或多个目标对象的位置信息,并从定位日志中读取任意一个或多个无线路由器的位置信息;
本申请上述步骤S204中,上述目标对象可以是携带了移动终端的对象,或者是移动终端本身,移动终端可以采用国际身份码作为唯一识别码进行标识。上述目标对象的日志信息中包含的数据字段的类别可以至少包括目标对象的位置信息,上述无线路由器的定位日志中包含的数据字段的类别至少包括无线路由器的位置信息。此处需要说明的是,位置信息可以包括如下任意一种:经纬度信息、街道信息、所处地理区域信息、所归属的商圈信息等。
仍旧以获取餐厅与无线路由器的匹配关系为例,位置信息为获取的目标对象集合中包含的若干待匹配餐厅的日志信息中包含待匹配餐厅的所属地理区域,比如上述四个餐厅均位处北京市朝阳区;获取的无线路由设备集合中包含若干待匹配的无线路由器的定位日志中也包含了待匹配路由器所属地理区域,比如,上述无线路由器中,quan-ju-de位处北京市海淀区,其余位处北京市朝阳区。
步骤S206:根据任意一个或多个目标对象的位置信息和任意一个或多个无线路由器的位置信息,确定目标对象所对应的一组无线路由器,以获取目标对象集合中包含的目标对象与无线路由设备集合中包含的无线路由器之间的匹配关系。
本申请上述步骤S206中,根据目标对象的位置信息和无线路由器的位置信息,可以判断目标对象和无线路由器是否处于相同的、极其接近的、或者有特定关系的地理位置,来确定与目标对象具有对应关系的一组无线路由器,以进一步的获取目标对象与无线路由器的匹配关系。
仍旧以获取餐厅与无线路由器的匹配关系为例,以待匹配餐厅中的京味斋为例,根据待匹配餐厅和待匹配无线路由器的所属地理区域信息,可以确定京味斋与待匹配无线路由器的对应关系,例如,可以确定无线路由器quan-ju-de位处北京市海淀区,与位处北京市朝阳区的京味斋不具有对应关系。
本申请提供的上述步骤S202至步骤S206,可以实现根据获取的目标对象的日志信息和无线路由器的定位日志中的位置信息,从大量且繁杂的数据中,自动识别每个目标对象所对应的无线路由器。通过上述步骤,能够在目标对象信息数据库(例如POI数据)和记载无线路由器指定数据字段的数据库之间建立桥梁,使得两个数据库的联合分析成为可能。
由上可知,本申请上述实施例一所提供的方案中,采用获取目标对象集合中包含的目标对象的日志信息和无线路由设备集合中包含的无线路由器的定位日志的方式,通过从日志信息和定位日志中分别读取目标对象和无线路由器的位置信息,达到了根据目标对象和无线路由器的位置信息确定目标对象与一组无线路由器之间的对应关系的目的,从而实现了获取目标对象集合中包含的目标对象与无线路由设备集合中包含的无线路由器之间的匹配关系的技术效果,进而解决了由于现有技术中采用人工方式获取目标对象与移动终端的对应关系的方法,导致获取到的目标对象和无线网络之间的关系不准确且成本高的技术问题。
本申请上述实施例提供的一种可选方案中,步骤S202中:获取无线路由设备集合中包含的无线路由器的定位日志,包括如下具体的实施步骤:
步骤S2022:获取移动终端集合中包含的移动终端的网络日志信息,其中,网络日志信息至少包括如下数据字段:移动终端的位置信息和移动终端接入的无线路由器的路由信息;
本申请上述步骤S2022中,移动终端可以通过其上设置的无线通信模块获取移动终 端附近的无线路由器的路由信息,移动终端接入无线路由器是指移动终端通过正确的密码连接该无线路由器或者移动终端通过检测获取到附近的无线路由器。无线路由信息可以包括如下至少一种:无线路由器的名称、无线路由器的标识、无线路由器发出的无线信号的强度。
此处需要说明的是,移动终端的网络日志信息中包含的移动终端的位置信息可以是通过移动终端的定位模块采集到的GPS信息,也可以是通过移动终端连接周边WiFi,并通过WiFi定位技术获取到的移动终端的位置信息,也可以是通过移动终端的第一类位置信息转换生成的第二类位置信息(例如,根据移动终端的经纬度坐标获取到移动终端所处区域信息),还可以是以上任意多种信息的组合。
此处还需要说明的是,移动终端集合中包含至少一个移动终端,理论上移动终端集合中包含的移动终端数量越多,则收集的移动终端的网络日志信息中覆盖的无线路由器数量也越多。
仍旧以获取餐厅与无线路由器的匹配关系为例,开启了WiFi功能的手机,会自动检测周边WiFi无线路由器信息,生成单个手机检测到的WiFi日志。通过对经常活动于北京市朝阳区的手机用户中的WiFi日志进行收集和统计,可生成关于WiFi信息的原始日志。单个手机检测到的WiFi日志例如包括:手机标识、手机位置、手机检测到的WiFi信息,其中,手机检测到的WiFi信息包括WiFi标识和WiFi信号强度。可选的,原始日志中还可以包含根据上述手机标识、手机位置、手机检测到的WiFi信息进行进一步处理的数据,例如,根据经纬度信息获取的手机所处城市和区域的信息。表1示例性的展现了对若干手机的WiFi日志进行统计后生成的原始日志。在表1中,SSID(Service Set Identifier)为WiFi网络的网络名称。
表1
用户标识(IMEI) 位置信息 WiFi SSID列表及其信号强度
123456789012345 北京市朝阳区 Jwz,-30;ytpb,-80;dongbeicai,-15;
123456789012346 北京市朝阳区 Jwz,-70;Q@fbzp,-25;
123456789012347 北京市朝阳区 Jwz,-25;
123456789012348 北京市海淀区 quan-ju-de,-10
步骤S2024:对网络日志信息进行格式化转换,生成任意一个或多个无线路由器的定位日志,无线路由器的定位日志至少包括如下数据字段:标识信息和位置信息。
本申请上述步骤S2024中,标识信息可以为移动终端获取的无线路由器的名称或者 其他能够用于区分无线路由器的数据,位置信息可以为检测到该无线路由器时移动终端的位置信息,也可以为以根据移动终端的位置信息转换后的另一种数据格式的位置信息。格式化转换主要是记载单元的转换,即将以移动终端为记载单元的网络日志,转换为以无线路由器为记载单元的定位日志。具体的,由于移动终端的网络日志信息记录每个移动终端所检测到的无线路由器的路由信息,这种数据格式并不能直接得到无线路由器的路由信息;而转换后生成的无线路由器的定位日志记录每个无线路由器的标识信息和位置信息。通过本申请上述步骤S2024,解决了由于目标对象的日志信息和移动终端记录的无线网络的日志信息相互独立、且难以对接,造成的不能直接根据目标对象的日志信息和无线网络的日志信息来获取目标对象与无线路由器的对应关系。
仍旧以获取餐厅与无线路由器的匹配关系为例,转换后的无线路由器的定位日志中至少包含了无线路由器的WiFi SSID和无线路由器的位置信息,在一种示例中,无线路由器的位置信息设定为无线路由器所归属的城市和行政区域(可根据经纬度信息获取)。表2示例性的展现了转换生成的无线路由器的定位日志。在表2中,除无线路由器的WiFi SSID和无线路由器的位置信息外,还包括了无线网络的信号强度。
表2
标识信息 位置信息 信号强度
Jwz 北京市朝阳区 -30
ytpb 北京市朝阳区 -80
dongbeicai 北京市朝阳区 -15
Jwz 北京市朝阳区 -70
Q@fbzp 北京市朝阳区 -25
Jwz 北京市朝阳区 -25
quan-ju-de 北京市海淀区 -10
本申请上述步骤S2022至步骤S2024提供了一种获取无线路由设备集合中包含的无线路由器的定位日志的可选方案。基于上述步骤S2022实现了移动终端的网络日志的获取和整合,通过步骤S2024实现了从常见的以移动终端为单元的网络日志向以无线路由器为单元的定位日志的转换,使得在执行步骤S202至S206时,可以直接从无线路由器的定位日志中获取无线路由器的位置信息,并获取目标对象与无线路由器的对应关系。
本申请上述实施例提供的一种可选方案中,在执行上述步骤S2024:对网络日志信息进行格式化转换,生成任意一个或多个无线路由器的定位日志之后,还可以执行如下 实施步骤:
步骤S2032:根据无线路由器的标识信息对无线路由设备集合中的无线路由器进行聚合处理,生成无线路由设备集合中的任意一个或多个无线路由器的聚合结果,其中,聚合结果包括:无线路由器的信号强度;
本申请上述步骤S2032中,由于同一个无线路由器可能会被多个移动终端在多个位置采集到,所以在定位日志中关于同一个无线路由器可能有对应的多个位置信息和多个信号强度数据,因而需要进一步判断定位日志中多个位置信息里的哪一个与无线路由器的真实地理位置最接近,或者进一步的根据上述多个位置信息计算无线路由器的最有可能的位置。本申请上述步骤S2032中对无线路由器进行聚合处理,可以是以无线路由器的标识信息为依据,将定位日志中无线路由器标识信息相同的数据进行聚合,形成包含了无线路由器标识、无线路由器位置信息和无线路由器信号强度的聚合结果。
仍旧以获取餐厅与无线路由器的匹配关系为例,从转换生成的无线路由器的定位日志中,以无线路由器的标识信息为关键字,对每个无线路由器进行聚合处理。表3仅展示了以其中标识信息为jwz的无线路由器为例的一部分数据聚合结果。
表3
标识信息 位置信息 信号强度
jwz 北京市朝阳区 -30
jwz 北京市朝阳区 -70
jwz 北京市朝阳区 -25
步骤S2034:使用预先设置的过滤阈值对聚合结果进行筛选,确定任意一个或多个无线路由器的定位日志中的有效日志,有效日志为信号强度大于等于过滤阈值的无线路由器的定位日志。
本申请上述步骤S2034中,当信号强度弱到一定程度时,其所对应的整条数据(尤其是位置信息)的可信度就较低。通过设置过滤阈值,并根据信号强度与过滤阈值的大小关系,来判断该信号强度对应的整条数据是否可靠,当该信号强度对应的整条数据不可靠时,可从定位日志的聚合结果中剔除该条数据,以最终获取有效日志。
仍旧以获取餐厅与无线路由器的匹配关系为例,可选的,可以设定过滤阈值D=-30,当信号强度小于该过滤阈值时,判定该信号强度所对应的整条信息不可靠,从聚合结果中删除该条信息。表4仅展示了以其中标识信息为jwz的无线路由器为例的、对聚合结果进行筛选后得到的有效日志。
表4
标识信息 位置信息 信号强度
jwz lat2,lng2 -30
jwz lat3,lng3 -25
本申请上述步骤S2032至步骤S2034提供了一种对于定位日志进行筛选处理的可选方案。基于上述步骤S2032,对定位日志按照无线路由器的标识信息进行聚合,生成每个无线路由器的聚合结果,再通过步骤S2034对聚合结果进行筛选,保留聚合结果中可靠的数据并得到有效日志,实现了对定位日志的进一步筛选处理,当定位日志的信息量庞大时,通过上述步骤S2032至步骤S2034的处理,可以简化数据并确保数据的可靠性。
本申请上述实施例提供的一种可选方案中,无线路由器的定位日志还包括:无线路由器的定位坐标。
具体的,定位坐标可以为基于任何预先建立的坐标系的坐标数据,例如经纬度坐标数据。此处需要说明的是,在执行步骤S2024以生成无线路由器的定位日志时,无线路由器的定位日志包含的位置信息可以为坐标信息(例如经纬度坐标信息、其他坐标系下的坐标信息),也可以为非坐标信息(例如根据经纬度坐标信息转换得到的无线路由器所属城市和行政区域信息)。在执行步骤S2024时生成的位置信息为非坐标信息的应用场景下,本申请提供的另一种可选方案中无线路由器的定位日志还需包括无线路由器的定位坐标。
本申请上述实施例提供的一种可选方案中,在无线路由器的定位日志还包括:无线路由器的定位坐标时,在执行上述步骤S2034:在确定任意一个或多个无线路由器的定位日志中的有效日志之后,还可以执行如下实施步骤:
步骤S2036:使用预设条件对任意一个或多个无线路由器的定位坐标进行聚类,获取任意一个或多个无线路由器的聚类簇,其中,无线路由器至少生成一个聚类簇;
本申请上述步骤S2036中,对有效日志中筛选出的每一个无线路由器,可以使用算法对每一个无线路由器所对应的多个定位坐标进行聚类,例如可以选择基于密度的聚类算法,即当在一个区域中定位坐标的密度超过阈值时,可以将其划分为聚类簇。预设条件为使用聚类算法时需要预先设定的条件,不同的聚类算法所需预设条件不同。
以使用基于密度的聚类算法中的DBSCAN算法为例,预设条件包括:E领域和核心对象,E领域是指给定对象半径为E内的区域,核心对象是指给定对象E领域内的样本点数的最小值。通过DBSCAN算法,对有效日志中筛选的每个无线路由器的定位坐标一 列进行聚类,此时,只需执行DBSCAN算法的第一步,即将定位坐标聚成满足我们预设条件的一个个“小圆”,而不进行第二步的再合并。在使用DBSCAN算法时,设置E领域为10米,核心对象为20个,即我们要求如果一个wifi在一个半径为10米的圆内被20个不同的imei定位过,则形成一个聚类簇。
步骤S2038:根据无线路由器的聚类簇的数量,对无线路由设备集合中的无线路由器进行筛选。
本申请上述步骤S2038中,当一个无线路由器处于性能稳定且位置固定的状态时,该无线路由器的聚类簇的位置可能会呈现一定的区域性,聚类簇的数量也可能会呈现一定的规律性。本申请提供的上述步骤S2038通过预设规则,通过判断无线路由器的聚类簇的数量,来推断无线路由器的运行状况,实现了通过聚类簇的情况来对无线路由设备集合中的无线路由器进行筛选。
本申请上述步骤S2036至步骤S2038提供了一种对无线路由设备集合中的无线路由器进行筛选的可选方案。基于上述步骤S2036生成的每个无线路由器的聚类簇,通过步骤S2038来判断每个无线路由器的状态,完成对无线路由设备集合中的无线路由器的筛选。
本申请上述实施例提供的一种可选方案中,步骤S2038:根据无线路由器的聚类簇的数量,对无线路由设备集合中的无线路由器进行筛选,包括如下具体的实施步骤:
步骤S20380:计算无线路由器的每一个聚类簇的中心点坐标;
本申请上述步骤S20380中,聚类簇的中心点坐标的计算方法可以采用欧式空间下的中心点计算方法。计算聚类簇的中心点坐标的计算公式为:
center(簇)=[(lat1+lat2+…+latn)/n,(lng1+lng2+…+lngn)/n]
其中:center(簇)表示聚类簇的中心点坐标,lat为latitude(纬度)的缩写,lat1、lat2…latn为该聚类簇内各个定位坐标中的纬度,lng为longitude(经度)的缩写,lng1、lng2…lngn为该聚类簇内各个定位坐标中的经度,n为该聚类簇内包含的定位坐标的个数。
步骤S20382:在无线路由器的聚类簇的数量超过预设阈值的情况下,使用无线路由器的任意两个聚类簇的中心点坐标,计算得到无线路由器的任意两个聚类簇的中心距离;
本申请上述步骤S20382中,预设阈值例如为2,当一个无线路由器的位置固定时,由于无线路由器的信号覆盖范围有限,即便是该无线路由器相距最远的两个聚类簇,其中心距离也具有上限。因此,可以通过计算无线路由器的任意两个聚类簇的中心距离,并与距离阈值进行比对,判断是否存在中心距离大于距离阈值的两个聚类簇。通过这种 方式,可以判断聚类簇中定位坐标的有效性。
此处需要说明的是,在一个无线路由器的聚类簇的数量超过预设阈值的情况下,可以先获取该无线路由器中每个聚类簇内定位坐标的个数,并根据聚类簇内定位坐标的个数对聚类簇进行排序,再按照该排序中聚类簇的顺序,依次判断两个聚类簇中心点的距离。也可以计算任意两个聚类簇的中心距离。可选地,计算无线路由器两个聚类簇的中心距离时采用球面距离的计算方法。
此处还需要说明的是,在一个无线路由器的聚类簇的数量没有超过预设阈值的情况下,可直接确定该无线路由器为有效无线路由器,并将该聚类簇的中心点坐标赋值给该有效无线路由器;也可根据该聚类簇的簇内个数,在判断簇内个数小于可信阈值时,认定该无线路由器为无效无线路由器,避免了因为该无线路由器的定位坐标的数量不足够而可能导致的无线路由器可信度低的问题。
步骤S20384:当无线路由器的中心距离小于等于距离阈值时,确定无线路由器为有效无线路由器;
此处需要说明的是,当无线路由器的中心距离大于距离阈值时,则认为该无线路由器的定位日志出现了错误,或者该无线路由器的位置发生了变化,需要重新获取该无线路由器的定位日志,因此可以确定该无线路由器为无效路由器。
例如,如果distance(NO.1center(簇),NO.2center(簇))>N,N=150米,则确定该无线路由器为无效无线路由器,或从定位日志中剔除该无线路由器;如果distance(NO.1center(簇),NO.2center(簇))≤N时,确定该无线路由器为有效无线路由器;其中,distance采用球面距离的计算方法,NO.1center(簇)可以为根据聚类簇内定位坐标的个数对聚类簇进行由多至少的排序后,定位坐标个数最多的聚类簇,NO.2center(簇)的坐标个数次之。
通过本申请上述步骤S20382和步骤S20384,实现了对无线路由器是否有效进行判定。在一种情况下,当定位日志中出现错误或较大误差时,通过上述步骤可以对无线路由器的定位信息进行甄别,避免定位日志中的定位信息的误差引起的目标对象与无线路由器的对应关系的错误。在另一种情况下,当无线路由器的位置并非固定,而是产生移动时,无线路由器在移动过程中可能被大量的无线终端获取,通过上述步骤还可以对无线路由器进行甄别,避免移动无线路由器被定位而引起的目标对象与无线路由器的对应关系的错误。
步骤S20386:保留无线路由设备集合中的有效无线路由器,并读取有效无线路由器的簇内个数最大的聚类簇;
本申请上述步骤S20386中,可以通过删除无效的无线路由器或者提取有效无线路由器的方式,实现保留无线路由设备集合中的有效无线路由器。
步骤S20388:将簇内个数最大的聚类簇的中心点坐标赋值给有效无线路由器。
本申请上述步骤S20388中,簇内个数最大的聚类簇表征着在该区域范围内,该无线路由器被定位的次数最多,其最接近该无线路由器真实位置的概率也最大。通过本申请提供的上述步骤S20388实现了根据无线路由器定位日志,获取无线路由器的最优位置坐标。
本申请上述步骤S20380至步骤S20388提供了一种对所述无线路由设备集合中的无线路由器进行筛选的可选方案。基于上述步骤S20380至步骤S20384,实现了对无线路由器是否有效的判断,通过步骤S20386至S20388实现了提取有效无线路由器并对有效无线路由器赋予最优位置坐标,最终达到了精确筛选无线路由器设备集合中的无线路由器并对筛选后的无线路由器赋予最优位置坐标的技术效果。
本申请上述实施例提供的一种可选方案中,目标对象的日志信息至少包括:目标对象的坐标信息;无线路由器的定位日志至少包括:无线路由器的坐标信息。
具体的,目标对象的日志信息或无线路由器的定位日志包含的位置信息可以为坐标信息(例如经纬度坐标信息、其他坐标系下的坐标信息),也可以为非坐标信息(例如根据经纬度坐标信息转换得到的目标对象或无线路由器所属城市和行政区域信息)。在目标对象的日志信息中和/或无线路由器的日志信息中包含的位置信息为非坐标信息的应用场景下,本申请提供的另一种可选方案中目标对象的日志信息或无线路由器的定位日志还需包括坐标信息,该坐标信息可以为任何坐标系下的坐标数据。
仍旧以获取餐厅与无线路由器的匹配关系为例,表5示例性的展现了包含餐厅坐标信息的日志信息,表6示例性的展现了经过步骤S2032至步骤S2034的第一次筛选、以及经过步骤S2036至步骤S2038的第二次筛选后包含无线路由器坐标信息的定位日志;在表5和表6中,lng1至lng4分别表示餐厅所在的经度信息,lng5至lng8分别表示有效无线路由器的簇内个数最大的聚类簇的中心点的经度信息;lat1至lat4分别表示表示餐厅所在的纬度信息,lat5至lat8分别表示有效无线路由器的簇内个数最大的聚类簇的中心点的纬度信息。
表5
名称 位置信息 坐标信息
京味斋 北京市朝阳区 lat1,lng1
鱼头泡饼 北京市朝阳区 lat2,lng2
东北菜 北京市朝阳区 lat3,lng3
庆丰包子铺 北京市朝阳区 lat4,lng4
表6
标识信息 位置信息 坐标信息
Jwz 北京市朝阳区 lat5,lng5
dongbeicai 北京市朝阳区 Lat6,lng6
Q@fbzp 北京市朝阳区 lat7,lng7
quan-ju-de 北京市海淀区 lat8,lng8
本申请上述实施例提供的一种可选方案中,当目标对象的日志信息至少包括目标对象的坐标信息、无线路由器的日志信息至少包括无线路由器的坐标信息时,步骤S206:根据任意一个或多个目标对象的位置信息和任意一个或多个无线路由器的位置信息,确定目标对象所对应的一组无线路由器,包括如下具体的实施步骤:
步骤S2062:将目标对象的位置信息和无线路由器的位置信息作为关键字进行匹配,获取与目标对象具有映射关系的至少一个无线路由器;
本申请上述步骤S2062中,在位置信息相同、相邻近或具有特定关系的目标对象与无线路由器之间建立映射关系,而后既可以以目标对象为单元,获取与目标对象具有映射关系的至少一个无线路由器,还可以以无线路由器为单元,获取与无线路由器具有映射关系的至少一个目标对象。
仍旧以获取餐厅与无线路由器的匹配关系为例,以“北京市朝阳区”作为关键字,获取与餐厅具有映射关系的无线路由器,结合上述表5和表6,目标对象“京味斋”与Jwz、dongbeicai、Q@fbzp三个无线路由器有映射关系。
步骤S2064:根据目标对象的坐标信息,以及与目标对象具有映射关系的至少一个无线路由器的坐标信息,计算得到目标对象与具有映射关系的任意一个无线路由器之间的球面距离;
仍旧以获取餐厅与无线路由器的匹配关系为例,根据目标对象“京味斋”的坐标信 息(lat1,lng1)和上述三个无线路由器的坐标信息(lat5,lng5)、(lat6,lng6)、(lat7,lng7),分别计算目标对象与每个无线路由器之间的球面距离。
步骤S2066:提取球面距离小于等于位置阈值的目标对象和具有映射关系的无线路由器,以获取与目标对象具有匹配关系的至少一个无线路由器。
仍旧以获取餐厅与无线路由器的匹配关系为例,例如设定位置阈值为20米,京味斋与Jwz、Q@fbzp之间的球面距离小于20米,以此可以获取与京味斋球面距离小于等于位置阈值的无线路由器为Jwz、Q@fbzp。
此处需要说明的是,上述步骤S2062至步骤S2066所示的方案是以目标对象为单元,来获取与某个目标对象具有映射关系的、且球面距离小于位置阈值的无线路由器;而对上述步骤做适应性修改后,还可以实现以无线路由器为单元,来获取与某个无线路由器具有映射关系的、且球面距离小于位置阈值的目标对象。
本申请上述步骤S2062至步骤S2066提供了一种获取目标对象与一组无线路由器的映射关系的可选方案。基于上述步骤S2062获取位置信息相同的目标对象与无线路由器,通过步骤S2064计算目标对象与位置信息相同的每一个无线路由器的球面距离,并经由步骤S2066中球面距离与位置阈值的判断,提取球面距离小于等于位置阈值的目标对象和具有映射关系的无线路由器,以实现目标对象与无线路由器匹配关系的建立。
本申请上述实施例提供的一种可选方案中,当目标对象的日志信息至少包括如下数据字段:目标对象名称、目标对象坐标信息和目标对象位置信息,无线路由器的定位日志还包括:无线路由器名称时,在步骤S2066:获取与目标对象具有匹配关系的至少一个无线路由器之后,还可以执行如下实施步骤:
步骤S2072:对目标对象名称进行第一预处理,生成满足第一预定格式和/或第一预定内容的新目标对象名称;
本申请上述步骤S2072中,第一预定格式用于规定目标对象名称的格式,例如目标对象名称所使用的语言、目标对象名称所包含的字符种类、目标对象为英文时字母的大小写格式等;第一预定内容用于规定目标对象名称的具体内容及其展示方式,例如,目标对象名称为全拼或首字母拼音、目标对象名称包含英文时的简写方式等。
此处需要说明的是,在可能的其中一个应用场景中,由于目标对象名称多为未经修改的原始商户名称,可能包含中文、英文、数字、图片以及特殊字符等内容,而无线路由器的名称多为字母,而且,无线路由器的名称通常会根据目标对象名称来设置,具有较高的辨识度,因此,需要对目标对象的名称进行第一预处理,使得处理后的新目标对 象名称能够与无线路由器名称对应,以进一步保证匹配关系的准确性。
例如,为生成满足第一预定格式的新目标对象名称,第一预处理用于判断目标对象的名称是否为中文,若否,则查询其对应的公认的中文或将其丢弃,和/或,第一预处理用于判断目标对象名称中是否包含特殊字符和数字,特殊字符例如~!#$%^&*()_+-=等,若是,则去掉目标对象名称中的特殊字符和数字。可以使用开源java项目pinyin4j来进行第一预处理。
又例如,为生成满足第一预定内容的新目标对象名称,第一预处理用于将目标对象的中文名称转为该中文名称的全拼或首字母拼音。可以使用开源java项目pinyin4j来进行第一预处理。
仍旧以获取餐厅与无线路由器的匹配关系为例,对于餐厅京味斋,首先判断出该名称中不包含特殊字符(包括~!#$%^&*()_+-=)与数字,且该名称为中文,则该名称满足第一预定格式;然后,将名称京味斋转换为满足第一预定内容的新名称,当第一预定内容规定使用目标对象名称的全拼时,转换后的新目标对象名称为jingweizhai,记为P1,当第一预定内容规定使用目标对象名称的首字母拼音时,转换后的新目标对象名称为jwz,记为P2。
步骤S2074:对与目标对象具有匹配关系的至少一个无线路由器的无线路由器名称进行第二预处理,生成满足第二预定格式和/或第二预定内容的新无线路由器名称;
此处需要说明的是,第二预定格式和/或第二预定内容可以与第一预定格式和/或第一预定内容的规定保持一致,也可以略作调整。虽然,无线路由器的名称通常会根据目标对象名称来设置以供消费者识别,然而同一个目标对象可能会设置多个无线路由器或通过一个无线路由器发射出多个无线网络,此时,无线路由器的名称中就会具有与目标对象的原始名称并无关联的、仅用以区分无线网络的数字或特殊字符。因此,也需要对无线路由器的名称进行第一预处理,使得处理后的新无线路由器名称能够与目标对象名称对应,以进一步保证匹配关系的准确性。
例如,为生成满足第二预定格式的新无线路由器名称,第二预处理用于判断无线路由器名称中是否包含特殊字符和数字,特殊字符例如~!#$%^&*()_+-=等,若是,则去掉无线路由器名称中的特殊字符和数字,和/或,第二预处理用于判断无线路由器名称中的字母是否均为小写,若否,则将无线路由器名称中的字母转化成小写字母。
又例如,为生成满足第二预定内容的新无线路由器名称,第二预处理用于识别无线路由器名称的类别为目标对象的全拼、首字母拼音或对应英文,若能识别或能从预先存 储的数据库中唯一确定该无线路由器名称的类别,则将无线路由器名称对应转换为与第一预定内容相同的类别。
仍旧以获取餐厅与无线路由器的匹配关系为例,对于餐厅京味斋,经过第一预处理后的新目标对象名称为P2(jwz),与京味斋具有匹配关系的无线路由器名称分别为Jwz、Q@fbzp,通过第二预处理后的新无线路由器名称分别为jwz,qfbzp。
步骤S2076:根据新目标对象名称和新无线路由器名称,对与目标对象具有匹配关系的至少一个无线路由器进行筛选处理,得到满足预设条件的与目标对象匹配的无线路由器。
本申请上述步骤S2076中,预设条件可以为新目标对象名称与新无线路由器名称完全相同,和/或,新目标对象名称与新无线路由器名称相似程度达到相似度阈值。可选的,将新目标对象名称与具有匹配关系的新无线路由器名称进行逐一比对,如果新目标对象名称与新无线路由器名称完全一致,则判定该无线路由器与该目标对象匹配。可选的,将新目标对象名称与具有匹配关系的新无线路由器名称进行逐一比对,如果新目标对象名称与新无线路由器名称并非完全一致,则计算新目标对象名称与新无线路由器名称的相似度,如果新目标对象名称与新无线路由器名称相似度达到相似度阈值,则判定该无线路由器与该目标对象匹配。
具体的,判断新目标对象名称与新无线路由器名称的相似度,可采用计算二者字符串编辑距离的方式,当新目标对象名称与新无线路由器名称的字符串编辑距离小于等于编辑距离阈值,且新目标对象名称的字符个数大于字符数阈值时,则认为新目标对象名称与新无线路由器名称相似程度达到相似度阈值,判定该无线路由器与该目标对象匹配。
仍旧以获取餐厅与无线路由器的匹配关系为例,设置预设条件为新目标对象名称与新无线路由器名称完全相同,和,新目标对象名称与新无线路由器名称的字符串编辑距离为1且新无线路由器名称字符个数大于等于5。对于餐厅京味斋,新目标对象名称为jwz,与新无线路由器名称分别为jwz,qfbzp对比筛选后,得到满足预设条件的目标对象匹配的无线路由器为jwz,
本申请上述步骤S2072至步骤S2076提供了一种获取与目标对象具有匹配关系的至少一个无线路由器的可选方案。基于上述步骤S2072和步骤S2074,完成了对目标对象名称和无线路由器名称的处理,并通过步骤S2076的筛选,得到了与目标对象匹配的无线路由器,进一步提高了目标对象和无线路由器的匹配准确性。
本申请上述实施例提供的一种可选方案中,在步骤S206:获取目标对象集合中包含 的目标对象与无线路由设备集合中包含的无线路由器之间的匹配关系之后,方还可以执行如下实施步骤:
步骤S208:当同一个无线路由器与多个目标对象具有匹配关系的情况下,读取与无线路由器距离最近的目标对象。
本申请提供的上述步骤S208通过判断这个无线路由器与目标对象的距离,例如球面距离,避免了不论以目标对象为单元,来获取与某个目标对象匹配的无线路由器,还是以无线路由器为单元,来获取与某个无线路由器匹配的目标对象时可能造成的一个无线路由器归属于多个目标对象的情况。
图3是根据本申请实施例的一种可选的获取数据之间的匹配关系的方法的流程图。下面就结合图3,将本申请的方案应用在应用场景所实现的功能进行详细描述:
步骤A:格式化转换,生成无线路由器的定位日志。
在本申请上述步骤A中,移动终端获取到周边的无线路由器的路由信息,并结合移动终端获取到路由信息时的位置信息,生成移动终端的网络信息日志。将网络信息日志进行格式化转换,转为以无线路由器为记载单元的无线路由器的定位日志,其中,定位日志中包含了无线路由器的标识信息、位置信息和信号强度。
步骤B:进行聚合处理,得到包含信号强度的聚合结果。
在本申请上述步骤B中,以无线路由器的标识信息为依据,将定位日志中无线路由器标识信息相同的数据进行聚合,形成包含了无线路由器标识、无线路由器位置信息和无线路由器信号强度的聚合结果。
步骤C:判断信号强度是否大于等于过滤阈值。
在本申请上述步骤C中,当信号强度弱到一定程度时,其所对应的整条数据(尤其是位置信息)的可信度就较低。通过设置过滤阈值,来判断信号强度与过滤阈值的大小关系,进而可以判断该信号强度对应的整条数据是否可靠。
步骤D:丢弃该无线路由器。
在本申请上述步骤D中,当信号强度小于该过滤阈值时,判定该信号强度所对应的整条信息不可靠,从聚合结果中删除该无线路由器中对应于该信号强度小于过滤阈值的相关日志。
步骤E:得到有效日志。
在本申请上述步骤E中,当信号强度大于等于该过滤阈值时,确定该无线路由器中对应于该信号强度大于等于过滤阈值的定位日志为有效日志。
步骤F:进行聚类处理,生成聚类簇。
在本申请上述步骤F中,选择基于密度的聚类算法:DBSCAN,对无线路由器的定位坐标进行聚类,生成聚类簇。
步骤G:判断聚类簇数量是否未超预设阈值。
在本申请上述步骤G中,可以通过判断无线路由器的聚类簇的数量,来推断无线路由器的运行状况。
步骤H:判断聚类簇中心距离是否小于等于距离阈值。
在本申请上述步骤H中,在一个无线路由器的聚类簇的数量超过预设阈值的情况下,进一步判断任意两个聚类簇的中心距离是否小于等于距离阈值。此时,可以先获取该无线路由器中每个聚类簇内定位坐标的个数,并根据聚类簇内定位坐标的个数对聚类簇进行排序,再按照该排序中聚类簇的顺序,依次判断两个聚类簇中心点的距离。也可以计算任意的或随机的两个聚类簇的中心距离。
步骤I:丢弃该无线路由器。
在本申请上述步骤I中,在判断出有其中两个聚类簇的中心距离大于距离阈值的情况下,则认为该无线路由器的定位日志出现了错误,或者该无线路由器的位置发生了变化,需要重新获取该无线路由器的定位日志,因此可以确定该无线路由器为无效路由器。
步骤J:簇内个数最大的聚类簇的中心点坐标赋值给有效无线路由器。
在本申请上述步骤J中,有如下两种情况:
情况一:在一个无线路由器的聚类簇的数量没有超过预设阈值的情况下,可直接确定该无线路由器为有效无线路由器,并将该聚类簇的中心点坐标赋值给该有效无线路由器。当然,也可根据该聚类簇的簇内个数,在判断簇内个数小于可信阈值时,认定该无线路由器为无效无线路由器。
情况二:在一个无线路由器的聚类簇的数量超过预设阈值、且任意两个聚类簇的中心距离均小于等于距离阈值的情况下,确定无线路由器为有效无线路由器,并读取有效无线路由器的簇内个数最大的聚类簇,将簇内个数最大的聚类簇的中心点坐标赋值给有效无线路由器。
步骤K:判断是否为中文。
步骤L:丢弃该目标对象名称。
在本申请上述步骤L中,当机器翻译结果不理想时,可以直接丢弃名称中部分或全部为非中文的目标对象。
步骤M:第一预处理,得到新目标对象名称P1(全拼)和P2(首字母拼音)。
在本申请上述步骤M中,第一预处理去掉名称中的特殊字符(包括~!#$%^&*()_+-=)与数字,然后使用开源java项目pinyin4j(可以将中文转化成拼音)将将剩下的目标对象名称转为两种内容,一种是目标对象名称的全拼,设为P1;一种是目标对象名称的首字母,设为P2。
上述步骤K至步骤M为对目标对象的名称进行的处理,需要说明的是,该步骤K至步骤M可以在步骤N之前来执行,作为对目标对象名称的预处理;也可以在步骤N之后再来执行。
步骤N:根据目标对象的位置信息和无线路由器的位置信息,确定目标对象所对应的一组无线路由器。
在本申请上述步骤N中,根据目标对象的位置信息和无线路由器的位置信息,可以判断目标对象和无线路由器是否处于相同的、极其接近的、或者有特定关系的地理位置,来确定与目标对象具有对应关系的一组无线路由器,以进一步的获取目标对象与无线路由器的匹配关系。
步骤O:第二预处理,得到新无线路由器名称S。
在本申请上述步骤O中,第二预处理将无线路由器名称转化成小写字母,并去掉特殊字符(包括~!#$%^&*()_+-=)与数字,得到的新无线路由器名称设为S。
步骤P:判断是否S=P1或者S=P2。
在本申请上述步骤P中,将新无线路由器名称S逐一与该目标对象的P1,P2计算,计算方法如下,如果S=P1,则认为该无线路由器属于该目标对象;否则,如果S=P2,则认为该无线路由器属于该目标对象。
步骤Q:将该无线路由器赋予该目标对象。
步骤R:判断是否Levenshtein(S,P1)=1或者Levenshtein(S,P2)=1且S长度≥5。
在本申请上述步骤R中,在S=P1或者S=P2都不满足的情况下,逐一判断S与P1、P2的字符串间距,其中,当levenshtein(S,P1)=1,认为该无线路由器属于该目标对象;否则,如果levenshtein(S,P2)=1,且S的字符个数>=5,则认为该无线路由器属于该目标对象。特别的,对于加上S的字符个数>=5这一条件,是为了降低由于首字母组合在较短的情况下,虽然levenshtein(S,P2)=1,但是依然存在较大误差的可能。
步骤S:丢弃该无线路由器。
在本申请上述步骤S中,如果步骤P和步骤R中的条件都不满足,则丢弃该无线路 由器。
步骤T:对于同一无线路由器属于多个目标对象的情况进行过滤。
在本申请上述步骤T中,对于可能造成的同一个无线路由器被赋予多个目标对象的可能,需要通过判断这个无线路由器与哪个商铺更近来进行过滤,距离计算函数依然使用球面距离。
需要说明的是,对于前述的各方法实施例,为了简单描述,故将其都表述为一系列的动作组合,但是本领域技术人员应该知悉,本发明并不受所描述的动作顺序的限制,因为依据本发明,某些步骤可以采用其他顺序或者同时进行。其次,本领域技术人员也应该知悉,说明书中所描述的实施例均属于优选实施例,所涉及的动作和模块并不一定是本发明所必须的。
通过以上的实施方式的描述,本领域的技术人员可以清楚地了解到根据上述实施例的方法可借助软件加必需的通用硬件平台的方式来实现,当然也可以通过硬件,但很多情况下前者是更佳的实施方式。基于这样的理解,本发明的技术方案本质上或者说对现有技术做出贡献的部分可以以软件产品的形式体现出来,该计算机软件产品存储在一个存储介质(如ROM/RAM、磁碟、光盘)中,包括若干指令用以使得一台终端设备(可以是手机,计算机,服务器,或者网络设备等)执行本发明各个实施例所述的方法。
实施例2
根据本发明实施例,还提供了一种用于实施上述获取数据之间的匹配关系的方法的装置,如图4所示,该装置包括:获取模块402、第一读取模块404以及处理模块406;
其中,获取模块402,用于获取目标对象集合中包含的目标对象的日志信息和无线路由设备集合中包含的无线路由器的定位日志;
第一读取模块404,用于从日志信息中读取任意一个或多个目标对象的位置信息,并从定位日志中读取任意一个或多个无线路由器的位置信息;
处理模块406,用于根据任意一个或多个目标对象的位置信息和任意一个或多个无线路由器的位置信息,确定目标对象所对应的一组无线路由器,以获取目标对象集合中包含的目标对象与无线路由设备集合中包含的无线路由器之间的匹配关系。
此处需要说明的是,上述获取模块402、第一读取模块404以及处理模块406,对应于实施例一中的步骤S202至步骤S206,三个模块与对应的步骤所实现的示例和应用场景相同,但不限于上述实施例一所公开的内容。需要说明的是,上述模块作为装置的一部分可以运行在实施例一提供的计算机终端10中,可以通过软件实现,也可以通过硬件 实现。
本申请提供的上述获取模块402、第一读取模块404以及处理模块406,可以实现根据获取的目标对象的日志信息和无线路由器的定位日志中的位置信息,从大量且繁杂的数据中,自动识别每个目标对象所对应的无线路由器。通过上述装置,能够在目标对象信息数据库(例如POI数据)和记载无线路由器指定数据字段的数据库之间建立桥梁,使得两个数据库的联合分析成为可能。
由上可知,本申请上述实施例二所提供的方案中,采用获取目标对象集合中包含的目标对象的日志信息和无线路由设备集合中包含的无线路由器的定位日志的方式,通过从日志信息和定位日志中分别读取目标对象和无线路由器的位置信息,达到了根据目标对象和无线路由器的位置信息确定目标对象与一组无线路由器之间的对应关系的目的,从而实现了获取目标对象集合中包含的目标对象与无线路由设备集合中包含的无线路由器之间的匹配关系的技术效果,进而解决了由于现有技术中采用人工方式获取目标对象与移动终端的对应关系的方法,导致获取到的目标对象和无线网络之间的关系不准确且成本高的技术问题。
可选地,图5是根据本申请图4所示实施例的一种可选的获取模块的结构示意图;如图5所示,获取模块402包括:获取单元502以及转换单元504,其中:
获取单元502,用于获取移动终端集合中包含的移动终端的网络日志信息,其中,网络日志信息至少包括如下数据字段:移动终端的位置信息和移动终端接入的无线路由器的路由信息;
转换单元504,用于对网络日志信息进行格式化转换,生成任意一个或多个无线路由器的定位日志,无线路由器的定位日志至少包括如下数据字段:标识信息和位置信息。
此处需要说明的是,上述获取单元502以及转换单元504,对应于实施例一中的步骤S2022至步骤S2024,两个模块与对应的步骤所实现的示例和应用场景相同,但不限于上述实施例一所公开的内容。需要说明的是,上述模块作为装置的一部分可以运行在实施例一提供的计算机终端10中,可以通过软件实现,也可以通过硬件实现。
本申请实施例提供的上述获取单元502以及转换单元504提供了一种获取无线路由设备集合中包含的无线路由器的定位日志的可选方案。基于上述获取单元502实现了移动终端的网络日志的获取和整合,通过转换单元504实现了从常见的以移动终端为单元的网络日志向以无线路由器为单元的定位日志的转换,使得根据本申请实施例的获取数据之间的匹配关系的装置可以直接从无线路由器的定位日志中获取无线路由器的位置信 息,并获取目标对象与无线路由器的对应关系。
可选地,图6是根据本申请图5所示实施例的一种可选的获取数据之间的匹配关系的装置的结构示意图,如图6所示,根据本申请实施例的获取数据之间的匹配关系的装置还包括:第一处理模块602以及第一筛选模块604,其中:
第一处理模块602,用于根据无线路由器的标识信息对无线路由设备集合中的无线路由器进行聚合处理,生成无线路由设备集合中的任意一个或多个无线路由器的聚合结果,其中,聚合结果包括:无线路由器的信号强度;
第一筛选模块604,用于使用预先设置的过滤阈值对聚合结果进行筛选,确定任意一个或多个无线路由器的定位日志中的有效日志,有效日志为信号强度大于等于过滤阈值的无线路由器的定位日志。
此处需要说明的是,上述第一处理模块602以及第一筛选模块604,对应于实施例一中的步骤S2032至步骤S2034,两个模块与对应的步骤所实现的示例和应用场景相同,但不限于上述实施例一所公开的内容。需要说明的是,上述模块作为装置的一部分可以运行在实施例一提供的计算机终端10中,可以通过软件实现,也可以通过硬件实现。
本申请实施例提供的上述第一处理模块602以及第一筛选模块604提供了一种对于定位日志进行筛选处理的可选方案。基于上述第一处理模块602,对定位日志按照无线路由器的标识信息进行聚合,生成每个无线路由器的聚合结果,再通过第一筛选模块604对聚合结果进行筛选,保留聚合结果中可靠的数据并得到有效日志,实现了对定位日志的进一步筛选处理,尤其是在定位日志的信息量庞大时可以简化数据并确保数据的可靠性。
可选地,无线路由器的定位日志还包括:无线路由器的定位坐标,图7是根据本申请图6所示实施例的一种可选的获取数据之间的匹配关系的装置的结构示意图,如图7所示,根据本申请实施例的获取数据之间的匹配关系的装置还包括:第二处理模块702以及第二筛选模块704,其中:
第二处理模块702,用于使用预设条件对任意一个或多个无线路由器的定位坐标进行聚类,获取任意一个或多个无线路由器的聚类簇,其中,无线路由器至少生成一个聚类簇;
第二筛选模块704,用于根据无线路由器的聚类簇的数量,对无线路由设备集合中的无线路由器进行筛选。
此处需要说明的是,上述第二处理模块702以及第二筛选模块704,对应于实施例 一中的步骤S2036至步骤S2038,两个模块与对应的步骤所实现的示例和应用场景相同,但不限于上述实施例一所公开的内容。需要说明的是,上述模块作为装置的一部分可以运行在实施例一提供的计算机终端10中,可以通过软件实现,也可以通过硬件实现。
本申请实施例提供的上述第二处理模块702以及第二筛选模块704实现了通过判断每个无线路由器的状态,完成对无线路由设备集合中的无线路由器的筛选。
可选地,图8是根据本申请图7所示实施例的一种可选的第二筛选模块的结构示意图;如图8所示,第二筛选模块704包括:第一计算单元800、第二计算单元802、第一处理单元804、第二处理单元806以及第三处理单元808,其中:
第一计算单元800,用于计算无线路由器的每一个聚类簇的中心点坐标;
第二计算单元802,用于在无线路由器的聚类簇的数量超过预设阈值的情况下,使用无线路由器的任意两个聚类簇的中心点坐标,计算得到无线路由器的任意两个聚类簇的中心距离;
第一处理单元804,用于当无线路由器的中心距离小于等于距离阈值时,确定无线路由器为有效无线路由器;
第二处理单元806,用于保留无线路由设备集合中的有效无线路由器,并读取有效无线路由器的簇内个数最大的聚类簇;
第三处理单元808,用于将簇内个数最大的聚类簇的中心点坐标赋值给有效无线路由器。
此处需要说明的是,上述第一计算单元800、第二计算单元802、第一处理单元804、第二处理单元806以及第三处理单元808,对应于实施例一中的步骤S20380至步骤S20388,五个模块与对应的步骤所实现的示例和应用场景相同,但不限于上述实施例一所公开的内容。需要说明的是,上述模块作为装置的一部分可以运行在实施例一提供的计算机终端10中,可以通过软件实现,也可以通过硬件实现。
本申请实施例提供的上述第二计算单元802以及第一处理单元804实现了对无线路由器是否有效进行判定。在一种情况下,当定位日志中出现错误或较大误差时,通过上述步骤可以对无线路由器的定位信息进行甄别,避免定位日志中的定位信息的误差引起的目标对象与无线路由器的对应关系的错误。在另一种情况下,当无线路由器的位置并非固定,而是产生移动时,无线路由器在移动过程中可能被大量的无线终端获取,通过上述步骤还可以对无线路由器进行甄别,避免移动无线路由器被定位而引起的目标对象与无线路由器的对应关系的错误。
本申请实施例提供的上述第一计算单元800、第二计算单元802、第一处理单元804、第二处理单元806以及第三处理单元808提供了一种对所述无线路由设备集合中的无线路由器进行筛选的可选方案。既实现了对无线路由器是否有效的判断,又实现了提取有效无线路由器并对有效无线路由器赋予最优位置坐标,最终达到了精确筛选无线路由器设备集合中的无线路由器并对筛选后的无线路由器赋予最优位置坐标的技术效果。
可选地,目标对象的日志信息至少包括:目标对象的坐标信息,无线路由器的定位日志至少包括:无线路由器的坐标信息,图9是根据本申请图4所示实施例的一种可选的处理模块的结构示意图;如图9所示,处理模块406包括:匹配单元902、第三计算单元904以及提取单元906,其中:
匹配单元902,用于将目标对象的位置信息和无线路由器的位置信息作为关键字进行匹配,获取与目标对象具有映射关系的至少一个无线路由器;
第三计算单元904,用于根据目标对象的坐标信息,以及与目标对象具有映射关系的至少一个无线路由器的坐标信息,计算得到目标对象与具有映射关系的任意一个无线路由器之间的球面距离;
提取单元906,用于提取球面距离小于等于位置阈值的目标对象和具有映射关系的无线路由器,以获取与目标对象具有匹配关系的至少一个无线路由器。
此处需要说明的是,上述匹配单元902、第三计算单元904以及提取单元906,对应于实施例一中的步骤S2062至步骤S2066,三个模块与对应的步骤所实现的示例和应用场景相同,但不限于上述实施例一所公开的内容。需要说明的是,上述模块作为装置的一部分可以运行在实施例一提供的计算机终端10中,可以通过软件实现,也可以通过硬件实现。
本申请实施例提供的上述匹配单元902、第三计算单元904以及提取单元906提供了一种获取目标对象与一组无线路由器的映射关系的可选方案。基于上述匹配单元902获取位置信息相同的目标对象与无线路由器,通过第三计算单元904计算目标对象与位置信息相同的每一个无线路由器的球面距离,并经由提取单元906中球面距离与位置阈值的判断,提取球面距离小于等于位置阈值的目标对象和具有映射关系的无线路由器,以实现目标对象与无线路由器匹配关系的建立。
可选地,目标对象的日志信息至少包括如下数据字段:目标对象名称、目标对象坐标信息和目标对象位置信息,无线路由器的定位日志还包括:无线路由器名称。图10是根据本申请图9所示实施例的一种可选的获取数据之间的匹配关系的装置的结构示意 图;如图10所示,根据本申请实施例的获取数据之间的匹配关系的装置还包括:第一预处理模块1002、第二预处理模块1004以及第三筛选模块1006,其中:
第一预处理模块1002,用于对目标对象名称进行第一预处理,生成满足第一预定格式和/或第一预定内容的新目标对象名称;
第二预处理模块1004,对与目标对象具有匹配关系的至少一个无线路由器的无线路由器名称进行第二预处理,生成满足第二预定格式和/或第二预定内容的新无线路由器名称;
第三筛选模块1006,用于根据新目标对象名称和新无线路由器名称,对与目标对象具有匹配关系的至少一个无线路由器进行筛选处理,得到满足预设条件的与目标对象匹配的无线路由器。
此处需要说明的是,上述第一预处理模块1002、第二预处理模块1004以及第三筛选模块1006,对应于实施例一中的步骤S2072至步骤S2076,三个模块与对应的步骤所实现的示例和应用场景相同,但不限于上述实施例一所公开的内容。需要说明的是,上述模块作为装置的一部分可以运行在实施例一提供的计算机终端10中,可以通过软件实现,也可以通过硬件实现。
本申请实施例提供的上述第一预处理模块1002、第二预处理模块1004以及第三筛选模块1006,基于第一预处理模块1002、第二预处理模块1004完成了对目标对象名称和无线路由器名称的处理,并通过第三筛选模块1006的筛选,得到了与目标对象匹配的无线路由器,进一步提高了目标对象和无线路由器的匹配准确性。
可选地,图11是根据本申请图4所示实施例的一种可选的获取数据之间的匹配关系的装置的结构示意图,如图11所示,根据本申请实施例的获取数据之间的匹配关系的装置还包括:第二读取模块1102,其中,第二读取模块1102,用于当同一个无线路由器与多个目标对象具有匹配关系的情况下,读取与无线路由器距离最近的目标对象。
此处需要说明的是,上述第二读取模块1102,对应于实施例一中的步骤S208,模块与对应的步骤所实现的示例和应用场景相同,但不限于上述实施例一所公开的内容。需要说明的是,上述模块作为装置的一部分可以运行在实施例一提供的计算机终端10中,可以通过软件实现,也可以通过硬件实现。
本申请实施例提供的上述第二读取模块1102通过判断这个无线路由器与目标对象的距离,例如球面距离,避免了不论以目标对象为单元,来获取与某个目标对象匹配的无线路由器,还是以无线路由器为单元,来获取与某个无线路由器匹配的目标对象时可 能造成的一个无线路由器归属于多个目标对象的情况。
此处需要说明的是,本申请上述实施例二所提供的优选实施方案与实施例一所提供的可选方案以及应用场景实施过程相同,但不限于实施例一所提供的方案。
实施例3
本发明的实施例可以提供一种计算机终端,该计算机终端可以是计算机终端群中的任意一个计算机终端设备。可选地,在本实施例中,上述计算机终端也可以替换为移动终端等终端设备。
可选地,在本实施例中,上述计算机终端可以位于计算机网络的多个网络设备中的至少一个网络设备。
在本实施例中,上述计算机终端可以执行应用程序的漏洞检测方法中以下步骤的程序代码:获取目标对象集合中包含的目标对象的日志信息和无线路由设备集合中包含的无线路由器的定位日志;从日志信息中读取任意一个或多个目标对象的位置信息,并从定位日志中读取任意一个或多个无线路由器的位置信息;根据任意一个或多个目标对象的位置信息和任意一个或多个无线路由器的位置信息,确定目标对象所对应的一组无线路由器,以获取目标对象集合中包含的目标对象与无线路由设备集合中包含的无线路由器之间的匹配关系。
可选地,图12是根据本发明实施例的一种计算机终端的结构框图。如图12所示,该计算机终端A可以包括:一个或多个(图中仅示出一个)处理器51、存储器53、以及传输装置55。
其中,存储器53可用于存储软件程序以及模块,如本发明实施例中的安全漏洞检测方法和装置对应的程序指令/模块,处理器51通过运行存储在存储器53内的软件程序以及模块,从而执行各种功能应用以及数据处理,即实现上述的系统漏洞攻击的检测方法。存储器53可包括高速随机存储器,还可以包括非易失性存储器,如一个或者多个磁性存储装置、闪存、或者其他非易失性固态存储器。在一些实例中,存储器53可进一步包括相对于处理器51远程设置的存储器,这些远程存储器可以通过网络连接至终端A。上述网络的实例包括但不限于互联网、企业内部网、局域网、移动通信网及其组合。
上述的传输装置55用于经由一个网络接收或者发送数据。上述的网络具体实例可包括有线网络及无线网络。在一个实例中,传输装置55包括一个网络适配器(Network Interface Controller,NIC),其可通过网线与其他网络设备与路由器相连从而可与互联网或局域网进行通讯。在一个实例中,传输装置55为射频(Radio Frequency,RF)模块, 其用于通过无线方式与互联网进行通讯。
其中,具体地,存储器53用于存储预设动作条件和预设权限用户的信息、以及应用程序。
处理器51可以通过传输装置调用存储器53存储的信息及应用程序,以执行下述步骤:获取目标对象集合中包含的目标对象的日志信息和无线路由设备集合中包含的无线路由器的定位日志;从日志信息中读取任意一个或多个目标对象的位置信息,并从定位日志中读取任意一个或多个无线路由器的位置信息;根据任意一个或多个目标对象的位置信息和任意一个或多个无线路由器的位置信息,确定目标对象所对应的一组无线路由器,以获取目标对象集合中包含的目标对象与无线路由设备集合中包含的无线路由器之间的匹配关系。
可选的,上述处理器51还可以执行如下步骤的程序代码:获取移动终端集合中包含的移动终端的网络日志信息,其中,网络日志信息至少包括如下数据字段:移动终端的位置信息和移动终端接入的无线路由器的路由信息;对网络日志信息进行格式化转换,生成任意一个或多个无线路由器的定位日志,无线路由器的定位日志至少包括如下数据字段:标识信息和位置信息。
可选的,上述处理器51还可以执行如下步骤的程序代码:根据无线路由器的标识信息对无线路由设备集合中的无线路由器进行聚合处理,生成无线路由设备集合中的任意一个或多个无线路由器的聚合结果,其中,聚合结果包括:无线路由器的信号强度;使用预先设置的过滤阈值对聚合结果进行筛选,确定任意一个或多个无线路由器的定位日志中的有效日志,有效日志为信号强度大于等于过滤阈值的无线路由器的定位日志。
可选的,上述处理器51还可以执行如下步骤的程序代码:使用预设条件对任意一个或多个无线路由器的定位坐标进行聚类,获取任意一个或多个无线路由器的聚类簇,其中,无线路由器至少生成一个聚类簇;根据无线路由器的聚类簇的数量,对无线路由设备集合中的无线路由器进行筛选。
可选的,上述处理器51还可以执行如下步骤的程序代码:计算无线路由器的每一个聚类簇的中心点坐标;在无线路由器的聚类簇的数量超过预设阈值的情况下,使用无线路由器的任意两个聚类簇的中心点坐标,计算得到无线路由器的任意两个聚类簇的中心距离;当无线路由器的中心距离小于等于距离阈值时,确定无线路由器为有效无线路由器;保留无线路由设备集合中的有效无线路由器,并读取有效无线路由器的簇内个数最大的聚类簇;将簇内个数最大的聚类簇的中心点坐标赋值给有效无线路由器。
可选的,上述处理器51还可以执行如下步骤的程序代码:将目标对象的位置信息和无线路由器的位置信息作为关键字进行匹配,获取与目标对象具有映射关系的至少一个无线路由器;根据目标对象的坐标信息,以及与目标对象具有映射关系的至少一个无线路由器的坐标信息,计算得到目标对象与具有映射关系的任意一个无线路由器之间的球面距离;提取球面距离小于等于位置阈值的目标对象和具有映射关系的无线路由器,以获取与目标对象具有匹配关系的至少一个无线路由器。
可选的,上述处理器51还可以执行如下步骤的程序代码:对目标对象名称进行第一预处理,生成满足第一预定格式和/或第一预定内容的新目标对象名称;对与目标对象具有匹配关系的至少一个无线路由器的无线路由器名称进行第二预处理,生成满足第二预定格式和/或第二预定内容的新无线路由器名称;根据新目标对象名称和新无线路由器名称,对与目标对象具有匹配关系的至少一个无线路由器进行筛选处理,得到满足预设条件的与目标对象匹配的无线路由器。
可选的,上述处理器51还可以执行如下步骤的程序代码:当同一个无线路由器与多个目标对象具有匹配关系的情况下,读取与无线路由器距离最近的目标对象。
采用本发明实施例,提供了一种获取数据之间的匹配关系的方案。通过获取目标对象集合中包含的目标对象的日志信息和无线路由设备集合中包含的无线路由器的定位日志;从日志信息中读取任意一个或多个目标对象的位置信息,并从定位日志中读取任意一个或多个无线路由器的位置信息;根据任意一个或多个目标对象的位置信息和任意一个或多个无线路由器的位置信息,确定目标对象所对应的一组无线路由器,从而达到了获取目标对象集合中包含的目标对象与无线路由设备集合中包含的无线路由器之间的匹配关系的目的,进而解决了由于现有技术中采用人工方式获取目标对象与移动终端的对应关系的方法,导致获取到的目标对象和无线网络之间的关系不准确且成本高的技术问题。
本领域普通技术人员可以理解,图10所示的结构仅为示意,计算机终端也可以是智能手机(如Android手机、iOS手机等)、平板电脑、掌声电脑以及移动互联网设备(Mobile Internet Devices,MID)、PAD等终端设备。图10其并不对上述电子装置的结构造成限定。例如,计算机终端10还可包括比图10中所示更多或者更少的组件(如网络接口、显示装置等),或者具有与图10所示不同的配置。
本领域普通技术人员可以理解上述实施例的各种方法中的全部或部分步骤是可以通过程序来指令终端设备相关的硬件来完成,该程序可以存储于一计算机可读存储介质中, 存储介质可以包括:闪存盘、只读存储器(Read-Only Memory,ROM)、随机存取器(Random Access Memory,RAM)、磁盘或光盘等。
实施例4
本发明的实施例还提供了一种存储介质。可选地,在本实施例中,上述存储介质可以用于保存上述实施例一所提供的获取数据之间的匹配关系的方法所执行的程序代码。
可选地,在本实施例中,上述存储介质可以位于计算机网络中计算机终端群中的任意一个计算机终端中,或者位于移动终端群中的任意一个移动终端中。
可选地,在本实施例中,存储介质被设置为存储用于执行以下步骤的程序代码:获取目标对象集合中包含的目标对象的日志信息和无线路由设备集合中包含的无线路由器的定位日志;从日志信息中读取任意一个或多个目标对象的位置信息,并从定位日志中读取任意一个或多个无线路由器的位置信息;根据任意一个或多个目标对象的位置信息和任意一个或多个无线路由器的位置信息,确定目标对象所对应的一组无线路由器,以获取目标对象集合中包含的目标对象与无线路由设备集合中包含的无线路由器之间的匹配关系。
可选地,在本实施例中,存储介质还被设置为存储用于执行以下步骤的程序代码:上述处理器51还可以执行如下步骤的程序代码:获取移动终端集合中包含的移动终端的网络日志信息,其中,网络日志信息至少包括如下数据字段:移动终端的位置信息和移动终端接入的无线路由器的路由信息;对网络日志信息进行格式化转换,生成任意一个或多个无线路由器的定位日志,无线路由器的定位日志至少包括如下数据字段:标识信息和位置信息。
可选地,在本实施例中,存储介质还被设置为存储用于执行以下步骤的程序代码:根据无线路由器的标识信息对无线路由设备集合中的无线路由器进行聚合处理,生成无线路由设备集合中的任意一个或多个无线路由器的聚合结果,其中,聚合结果包括:无线路由器的信号强度;使用预先设置的过滤阈值对聚合结果进行筛选,确定任意一个或多个无线路由器的定位日志中的有效日志,有效日志为信号强度大于等于过滤阈值的无线路由器的定位日志。
可选地,在本实施例中,存储介质还被设置为存储用于执行以下步骤的程序代码:使用预设条件对任意一个或多个无线路由器的定位坐标进行聚类,获取任意一个或多个无线路由器的聚类簇,其中,无线路由器至少生成一个聚类簇;根据无线路由器的聚类簇的数量,对无线路由设备集合中的无线路由器进行筛选。
可选地,在本实施例中,存储介质还被设置为存储用于执行以下步骤的程序代码:计算无线路由器的每一个聚类簇的中心点坐标;在无线路由器的聚类簇的数量超过预设阈值的情况下,使用无线路由器的任意两个聚类簇的中心点坐标,计算得到无线路由器的任意两个聚类簇的中心距离;当无线路由器的中心距离小于等于距离阈值时,确定无线路由器为有效无线路由器;保留无线路由设备集合中的有效无线路由器,并读取有效无线路由器的簇内个数最大的聚类簇;将簇内个数最大的聚类簇的中心点坐标赋值给有效无线路由器。
可选地,在本实施例中,存储介质还被设置为存储用于执行以下步骤的程序代码:将目标对象的位置信息和无线路由器的位置信息作为关键字进行匹配,获取与目标对象具有映射关系的至少一个无线路由器;根据目标对象的坐标信息,以及与目标对象具有映射关系的至少一个无线路由器的坐标信息,计算得到目标对象与具有映射关系的任意一个无线路由器之间的球面距离;提取球面距离小于等于位置阈值的目标对象和具有映射关系的无线路由器,以获取与目标对象具有匹配关系的至少一个无线路由器。
可选地,在本实施例中,存储介质还被设置为存储用于执行以下步骤的程序代码:对目标对象名称进行第一预处理,生成满足第一预定格式和/或第一预定内容的新目标对象名称;对与目标对象具有匹配关系的至少一个无线路由器的无线路由器名称进行第二预处理,生成满足第二预定格式和/或第二预定内容的新无线路由器名称;根据新目标对象名称和新无线路由器名称,对与目标对象具有匹配关系的至少一个无线路由器进行筛选处理,得到满足预设条件的与目标对象匹配的无线路由器。
可选地,在本实施例中,存储介质还被设置为存储用于执行以下步骤的程序代码:当同一个无线路由器与多个目标对象具有匹配关系的情况下,读取与无线路由器距离最近的目标对象。
此处需要说明的是,上述计算机终端群中的任意一个可以与网站服务器和扫描器建立通信关系,扫描器可以扫描计算机终端上php执行的web应用程序的值命令。
上述本发明实施例序号仅仅为了描述,不代表实施例的优劣。
在本发明的上述实施例中,对各个实施例的描述都各有侧重,某个实施例中没有详述的部分,可以参见其他实施例的相关描述。
在本申请所提供的几个实施例中,应该理解到,所揭露的客户端,可通过其它的方式实现。其中,以上所描述的装置实施例仅仅是示意性的,例如所述单元的划分,仅仅为一种逻辑功能划分,实际实现时可以有另外的划分方式,例如多个单元或组件可以结 合或者可以集成到另一个系统,或一些特征可以忽略,或不执行。另一点,所显示或讨论的相互之间的耦合或直接耦合或通信连接可以是通过一些接口,单元或模块的间接耦合或通信连接,可以是电性或其它的形式。
所述作为分离部件说明的单元可以是或者也可以不是物理上分开的,作为单元显示的部件可以是或者也可以不是物理单元,即可以位于一个地方,或者也可以分布到多个网络单元上。可以根据实际的需要选择其中的部分或者全部单元来实现本实施例方案的目的。
另外,在本发明各个实施例中的各功能单元可以集成在一个处理单元中,也可以是各个单元单独物理存在,也可以两个或两个以上单元集成在一个单元中。上述集成的单元既可以采用硬件的形式实现,也可以采用软件功能单元的形式实现。
所述集成的单元如果以软件功能单元的形式实现并作为独立的产品销售或使用时,可以存储在一个计算机可读取存储介质中。基于这样的理解,本发明的技术方案本质上或者说对现有技术做出贡献的部分或者该技术方案的全部或部分可以以软件产品的形式体现出来,该计算机软件产品存储在一个存储介质中,包括若干指令用以使得一台计算机设备(可为个人计算机、服务器或者网络设备等)执行本发明各个实施例所述方法的全部或部分步骤。而前述的存储介质包括:U盘、只读存储器(ROM,Read-Only Memory)、随机存取存储器(RAM,Random Access Memory)、移动硬盘、磁碟或者光盘等各种可以存储程序代码的介质。
以上所述仅是本发明的优选实施方式,应当指出,对于本技术领域的普通技术人员来说,在不脱离本发明原理的前提下,还可以做出若干改进和润饰,这些改进和润饰也应视为本发明的保护范围。

Claims (16)

  1. 一种获取数据之间的匹配关系的方法,其特征在于,包括:
    获取目标对象集合中包含的目标对象的日志信息和无线路由设备集合中包含的无线路由器的定位日志;
    从所述日志信息中读取任意一个或多个目标对象的位置信息,并从所述定位日志中读取任意一个或多个无线路由器的位置信息;
    根据所述任意一个或多个目标对象的位置信息和所述任意一个或多个无线路由器的位置信息,确定所述目标对象所对应的一组无线路由器,以获取所述目标对象集合中包含的目标对象与所述无线路由设备集合中包含的无线路由器之间的匹配关系。
  2. 根据权利要求1所述的方法,其特征在于,获取无线路由设备集合中包含的无线路由器的定位日志包括:
    获取移动终端集合中包含的移动终端的网络日志信息,其中,所述网络日志信息至少包括如下数据字段:所述移动终端的位置信息和所述移动终端接入的无线路由器的路由信息;
    对所述网络日志信息进行格式化转换,生成所述任意一个或多个无线路由器的定位日志,所述无线路由器的定位日志至少包括如下数据字段:标识信息和所述位置信息。
  3. 根据权利要求2所述的方法,其特征在于,在对所述网络日志信息进行格式化转换,生成所述任意一个或多个无线路由器的定位日志之后,所述方法还包括:
    根据所述无线路由器的标识信息对所述无线路由设备集合中的无线路由器进行聚合处理,生成所述无线路由设备集合中的任意一个或多个无线路由器的聚合结果,其中,所述聚合结果包括:所述无线路由器的信号强度;
    使用预先设置的过滤阈值对所述聚合结果进行筛选,确定所述任意一个或多个无线路由器的定位日志中的有效日志,所述有效日志为所述信号强度大于等于所述过滤阈值的无线路由器的定位日志。
  4. 根据权利要求3所述的方法,其特征在于,所述无线路由器的定位日志还包括:所述无线路由器的定位坐标,其中,在确定所述任意一个或多个无线路由器的定位日志中的有效日志之后,所述方法还包括:
    使用预设条件对所述任意一个或多个无线路由器的定位坐标进行聚类,获取所述任意一个或多个无线路由器的聚类簇,其中,所述无线路由器至少生成一个聚类簇;
    根据所述无线路由器的聚类簇的数量,对所述无线路由设备集合中的无线路由器进 行筛选。
  5. 根据权利要求4所述的方法,其特征在于,根据所述无线路由器的聚类簇的数量,对所述无线路由设备集合中的无线路由器进行筛选,包括:
    计算所述无线路由器的每一个聚类簇的中心点坐标;
    在所述无线路由器的聚类簇的数量超过预设阈值的情况下,使用所述无线路由器的任意两个聚类簇的中心点坐标,计算得到所述无线路由器的任意两个聚类簇的中心距离;
    当所述无线路由器的中心距离小于等于距离阈值时,确定所述无线路由器为有效无线路由器;
    保留所述无线路由设备集合中的有效无线路由器,并读取所述有效无线路由器的簇内个数最大的聚类簇;
    将所述簇内个数最大的聚类簇的中心点坐标赋值给所述有效无线路由器。
  6. 根据权利要求1至5中任意一项所述的方法,其特征在于,所述目标对象的日志信息至少包括:所述目标对象的坐标信息,所述无线路由器的定位日志至少包括:所述无线路由器的坐标信息,其中,
    根据所述任意一个或多个目标对象的位置信息和所述任意一个或多个无线路由器的位置信息,确定所述目标对象所对应的一组无线路由器,包括:
    将所述目标对象的位置信息和所述无线路由器的位置信息作为关键字进行匹配,获取与所述目标对象具有映射关系的至少一个无线路由器;
    根据所述目标对象的坐标信息,以及与所述目标对象具有映射关系的至少一个无线路由器的坐标信息,计算得到所述目标对象与具有所述映射关系的任意一个无线路由器之间的球面距离;
    提取所述球面距离小于等于位置阈值的目标对象和具有所述映射关系的所述无线路由器,以获取与所述目标对象具有所述匹配关系的至少一个无线路由器。
  7. 根据权利要求6所述的方法,其特征在于,所述目标对象的日志信息至少包括如下数据字段:目标对象名称、目标对象坐标信息和目标对象位置信息,所述无线路由器的定位日志还包括:无线路由器名称,其中,
    在获取与所述目标对象具有所述匹配关系的至少一个无线路由器之后,所述方法还包括:
    对所述目标对象名称进行第一预处理,生成满足第一预定格式和/或第一预定内容的 新目标对象名称;
    对与所述目标对象具有所述匹配关系的至少一个无线路由器的无线路由器名称进行第二预处理,生成满足第二预定格式和/或第二预定内容的新无线路由器名称;
    根据所述新目标对象名称和所述新无线路由器名称,对与所述目标对象具有所述匹配关系的至少一个无线路由器进行筛选处理,得到满足预设条件的与所述目标对象匹配的无线路由器。
  8. 根据权利要求1所述的方法,其特征在于,在获取所述目标对象集合中包含的目标对象与所述无线路由设备集合中包含的无线路由器之间的匹配关系之后,所述方法还包括:
    当同一个无线路由器与多个目标对象具有所述匹配关系的情况下,读取与所述无线路由器距离最近的目标对象。
  9. 一种获取数据之间的匹配关系的装置,其特征在于,包括:
    获取模块,用于获取目标对象集合中包含的目标对象的日志信息和无线路由设备集合中包含的无线路由器的定位日志;
    第一读取模块,用于从所述日志信息中读取任意一个或多个目标对象的位置信息,并从所述定位日志中读取任意一个或多个无线路由器的位置信息;
    处理模块,用于根据所述任意一个或多个目标对象的位置信息和所述任意一个或多个无线路由器的位置信息,确定所述目标对象所对应的一组无线路由器,以获取所述目标对象集合中包含的目标对象与所述无线路由设备集合中包含的无线路由器之间的匹配关系。
  10. 根据权利要求9所述的装置,其特征在于,获取模块包括:
    获取单元,用于获取移动终端集合中包含的移动终端的网络日志信息,其中,所述网络日志信息至少包括如下数据字段:所述移动终端的位置信息和所述移动终端接入的无线路由器的路由信息;
    转换单元,用于对所述网络日志信息进行格式化转换,生成所述任意一个或多个无线路由器的定位日志,所述无线路由器的定位日志至少包括如下数据字段:标识信息和所述位置信息。
  11. 根据权利要求10所述的装置,其特征在于,所述装置还包括:
    第一处理模块,用于根据所述无线路由器的标识信息对所述无线路由设备集合中的无线路由器进行聚合处理,生成所述无线路由设备集合中的任意一个或多个无线路由器 的聚合结果,其中,所述聚合结果包括:所述无线路由器的信号强度;
    第一筛选模块,用于使用预先设置的过滤阈值对所述聚合结果进行筛选,确定所述任意一个或多个无线路由器的定位日志中的有效日志,所述有效日志为所述信号强度大于等于所述过滤阈值的无线路由器的定位日志。
  12. 根据权利要求11所述的装置,其特征在于,所述无线路由器的定位日志还包括:所述无线路由器的定位坐标,所述装置还包括:
    第二处理模块,用于使用预设条件对所述任意一个或多个无线路由器的定位坐标进行聚类,获取所述任意一个或多个无线路由器的聚类簇,其中,所述无线路由器至少生成一个聚类簇;
    第二筛选模块,用于根据所述无线路由器的聚类簇的数量,对所述无线路由设备集合中的无线路由器进行筛选。
  13. 根据权利要求12所述的装置,其特征在于,第二筛选模块包括:
    第一计算单元,用于计算所述无线路由器的每一个聚类簇的中心点坐标;
    第二计算单元,用于在所述无线路由器的聚类簇的数量超过预设阈值的情况下,使用所述无线路由器的任意两个聚类簇的中心点坐标,计算得到所述无线路由器的任意两个聚类簇的中心距离;
    第一处理单元,用于当所述无线路由器的中心距离小于等于距离阈值时,确定所述无线路由器为有效无线路由器;
    第二处理单元,用于保留所述无线路由设备集合中的有效无线路由器,并读取所述有效无线路由器的簇内个数最大的聚类簇;
    第三处理单元,用于将所述簇内个数最大的聚类簇的中心点坐标赋值给所述有效无线路由器。
  14. 根据权利要求9至13中任意一项所述的装置,其特征在于,所述目标对象的日志信息至少包括:所述目标对象的坐标信息,所述无线路由器的定位日志至少包括:所述无线路由器的坐标信息,所述处理模块包括:
    匹配单元,用于将所述目标对象的位置信息和所述无线路由器的位置信息作为关键字进行匹配,获取与所述目标对象具有映射关系的至少一个无线路由器;
    第三计算单元,用于根据所述目标对象的坐标信息,以及与所述目标对象具有映射关系的至少一个无线路由器的坐标信息,计算得到所述目标对象与具有所述映射关系的任意一个无线路由器之间的球面距离;
    提取单元,用于提取所述球面距离小于等于位置阈值的目标对象和具有所述映射关系的所述无线路由器,以获取与所述目标对象具有所述匹配关系的至少一个无线路由器。
  15. 根据权利要求14所述的装置,其特征在于,所述目标对象的日志信息至少包括如下数据字段:目标对象名称、目标对象坐标信息和目标对象位置信息,所述无线路由器的定位日志还包括:无线路由器名称,所述装置还包括:
    第一预处理模块,用于对所述目标对象名称进行第一预处理,生成满足第一预定格式和/或第一预定内容的新目标对象名称;
    第二预处理模块,对与所述目标对象具有所述匹配关系的至少一个无线路由器的无线路由器名称进行第二预处理,生成满足第二预定格式和/或第二预定内容的新无线路由器名称;
    第三筛选模块,用于根据所述新目标对象名称和所述新无线路由器名称,对与所述目标对象具有所述匹配关系的至少一个无线路由器进行筛选处理,得到满足预设条件的与所述目标对象匹配的无线路由器。
  16. 根据权利要求9所述的装置,其特征在于,所述装置还包括:
    第二读取模块,用于当同一个无线路由器与多个目标对象具有所述匹配关系的情况下,读取与所述无线路由器距离最近的目标对象。
PCT/CN2016/086649 2015-06-29 2016-06-22 获取数据之间的匹配关系的方法和装置 WO2017000817A1 (zh)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201510370088.0A CN106326263B (zh) 2015-06-29 2015-06-29 获取数据之间的匹配关系的方法和装置
CN201510370088.0 2015-06-29

Publications (1)

Publication Number Publication Date
WO2017000817A1 true WO2017000817A1 (zh) 2017-01-05

Family

ID=57607714

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2016/086649 WO2017000817A1 (zh) 2015-06-29 2016-06-22 获取数据之间的匹配关系的方法和装置

Country Status (2)

Country Link
CN (1) CN106326263B (zh)
WO (1) WO2017000817A1 (zh)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112232639A (zh) * 2020-09-22 2021-01-15 支付宝(杭州)信息技术有限公司 统计方法、装置和电子设备

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110740418A (zh) * 2018-07-03 2020-01-31 百度在线网络技术(北京)有限公司 用于生成用户到访信息的方法和装置
CN110493848B (zh) * 2019-08-20 2021-04-16 赛尔网络有限公司 用户终端路由ip变化的监测方法、装置、系统及介质
CN111475562B (zh) * 2020-04-11 2021-01-29 上海星地通讯工程研究所 应用于业务处理系统的数据格式优化方法及业务服务器

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20100125407A1 (en) * 2008-11-17 2010-05-20 Cho Chae-Guk Method for providing poi information for mobile terminal and apparatus thereof
CN103152696A (zh) * 2013-03-19 2013-06-12 沈志松 基于WiFi的兴趣点定位系统
CN103607771A (zh) * 2013-11-15 2014-02-26 四川长虹电器股份有限公司 基于wifi的定位系统及方法
CN103945007A (zh) * 2014-05-08 2014-07-23 百度在线网络技术(北京)有限公司 信息推送方法和装置

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102063499A (zh) * 2011-01-04 2011-05-18 百度在线网络技术(北京)有限公司 构建电子地图定位数据库的方法及系统
CN102737048A (zh) * 2011-04-01 2012-10-17 北京千橡网景科技发展有限公司 用于修正社交网站中保存的poi的方法和设备
CN104501798A (zh) * 2014-12-18 2015-04-08 深圳先进技术研究院 一种基于增强现实ip地图的网络对象定位追踪方法

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20100125407A1 (en) * 2008-11-17 2010-05-20 Cho Chae-Guk Method for providing poi information for mobile terminal and apparatus thereof
CN103152696A (zh) * 2013-03-19 2013-06-12 沈志松 基于WiFi的兴趣点定位系统
CN103607771A (zh) * 2013-11-15 2014-02-26 四川长虹电器股份有限公司 基于wifi的定位系统及方法
CN103945007A (zh) * 2014-05-08 2014-07-23 百度在线网络技术(北京)有限公司 信息推送方法和装置

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112232639A (zh) * 2020-09-22 2021-01-15 支付宝(杭州)信息技术有限公司 统计方法、装置和电子设备
CN112232639B (zh) * 2020-09-22 2023-06-30 支付宝(杭州)信息技术有限公司 统计方法、装置和电子设备

Also Published As

Publication number Publication date
CN106326263A (zh) 2017-01-11
CN106326263B (zh) 2019-10-08

Similar Documents

Publication Publication Date Title
CN105808988B (zh) 一种识别异常账户的方法及装置
CN103795613B (zh) 一种在线社交网络中朋友关系预测的方法
CN104699835B (zh) 用于确定网页页面中包括兴趣点poi数据的方法及装置
CN105187395B (zh) 基于接入路由器进行恶意软件网络行为检测的方法及系统
WO2017000817A1 (zh) 获取数据之间的匹配关系的方法和装置
CN107341220B (zh) 一种多源数据融合方法和装置
WO2016127904A1 (zh) 文本地址处理方法及装置
CN109104688A (zh) 使用聚集技术生成无线网络接入点模型
CN105550583A (zh) 基于随机森林分类方法的Android平台恶意应用检测方法
CN111078818B (zh) 地址分析方法、装置、电子设备及存储介质
TW201537915A (zh) 確定ip位址段及其對應的經緯度的方法及裝置
CN105376223B (zh) 网络身份关系的可靠度计算方法
CN104486143B (zh) 一种深度报文检测方法、检测系统
CN113412608B (zh) 内容推送方法、装置、服务端及存储介质
CN107515915A (zh) 基于用户行为数据的用户标识关联方法
CN106843941B (zh) 信息处理方法、装置和计算机设备
CN108427679B (zh) 一种人流分布处理方法及其设备
US11368901B2 (en) Method for identifying a type of a wireless hotspot and a network device thereof
WO2018010693A1 (zh) 识别伪基站信息的方法及装置
CN105491444A (zh) 一种数据识别处理方法以及装置
WO2018113370A1 (zh) 扩展用户的方法、装置及系统
WO2017124881A1 (zh) 信息推送方法及装置
CN110648172A (zh) 一种融合多种移动设备的身份识别方法和系统
CN109963253B (zh) 一种用户居住地理位置的识别方法及装置
CN107133689B (zh) 一种位置标记方法

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 16817181

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 16817181

Country of ref document: EP

Kind code of ref document: A1