WO2024000519A1 - Trajectory data characterization method and apparatus - Google Patents

Trajectory data characterization method and apparatus Download PDF

Info

Publication number
WO2024000519A1
WO2024000519A1 PCT/CN2022/103140 CN2022103140W WO2024000519A1 WO 2024000519 A1 WO2024000519 A1 WO 2024000519A1 CN 2022103140 W CN2022103140 W CN 2022103140W WO 2024000519 A1 WO2024000519 A1 WO 2024000519A1
Authority
WO
WIPO (PCT)
Prior art keywords
type
signals
position point
characteristic signals
characteristic
Prior art date
Application number
PCT/CN2022/103140
Other languages
French (fr)
Chinese (zh)
Inventor
唐国斌
周才发
八华峰
曾丹丹
李亮
Original Assignee
华为技术有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 华为技术有限公司 filed Critical 华为技术有限公司
Priority to PCT/CN2022/103140 priority Critical patent/WO2024000519A1/en
Publication of WO2024000519A1 publication Critical patent/WO2024000519A1/en

Links

Images

Definitions

  • the present application relates to the field of indoor and outdoor positioning technologies, and in particular, to a method and device for representing trajectory data.
  • GNSS Global Navigation Satellite System
  • Existing indoor positioning technology can be divided into three types based on data sources and positioning methods: 1) redeployment indoor positioning: indoor positioning is achieved by deploying specific positioning base stations and communication servers in the indoor environment; 2) light deployment indoor positioning: through An indoor multi-floor fingerprint database is constructed through manual collection, and positioning is achieved by initiating a positioning request and matching the currently scanned fingerprint with the fingerprint database; 3) Zero-deployment indoor positioning: Collect sensors collected by users under specific trigger conditions through crowdsourcing. , WIFI, Global Positioning System (GPS) and other data are uploaded to the cloud. The cloud processes the algorithm to obtain an indoor positioning fingerprint database for subsequent indoor positioning. Among them, zero-deployment indoor positioning has gradually become a mainstream indoor positioning solution due to its many advantages such as low cost, high timeliness, and high degree of automation.
  • the embodiments of the present application provide a method and device for representing trajectory data, which can effectively save the storage space of trajectory data; at the same time, it can be compatible with the format of outdoor data, realize the integration of indoor and outdoor data, and thereby improve the use of trajectory data in the field of indoor and outdoor positioning. of practicality.
  • this application provides a method for characterizing trajectory data.
  • the method includes: acquiring multiple pieces of trajectory data, each piece of trajectory data in the multiple pieces of trajectory data including multiple time periods, and the multiple pieces of trajectory data.
  • Characteristic signals collected in each time period in the time period ; generating multiple location points and attributes of each of the multiple location points based on the multiple trajectory data; wherein the multiple location points include A first position point, the first position point corresponds to one or more time periods in the plurality of trajectory data, the combination identifier of each time period in the one or more time periods is the same, and each time period
  • the combined identification of the segment is generated by the identification ID of at least one characteristic signal corresponding to each time period, and the first position point is characterized by the combined identification;
  • the characteristic signals collected in one or more time periods include At least one type of characteristic signal, the attribute of the first position point includes the quantity of each type of characteristic signal in the at least one type of characteristic signal and/or the quantity distribution characteristics of each type of characteristic signal.
  • the first position point is the collection position of the characteristic signal in the one or more time periods.
  • At least one characteristic signal corresponding to each time period includes the characteristic signal collected during the time period.
  • the characteristic signal data in the one or more time periods is Collected at the same location. Furthermore, the characteristic signal data in the one or more time periods can be jointly counted and characterized to obtain data such as the quantity and quantity distribution characteristics of each type of characteristic signal in the one or more time periods as attributes of the location point. That is, this application uses position points and attributes of the position points to characterize multiple trajectory data to replace the existing method of directly storing the original trajectory data, thereby achieving the technical effect of saving trajectory data storage space. Furthermore, when the trajectory data reaches a certain amount, storing the newly collected trajectory data will hardly increase the storage overhead, making it possible to store massive trajectory data in the big data era, which can effectively improve the effectiveness of trajectory data for indoor and outdoor positioning. value.
  • the one or more time periods include a first time period, the first time period includes M characteristic signals, and the M characteristic signals include Class C characteristic signals, M and C are positive integers; the combination identification is generated by the ID of N characteristic signals among the M characteristic signals, the N characteristic signals include D-type characteristic signals, and N is a positive integer less than or equal to M. , D is a positive integer less than or equal to C.
  • the characteristic signals collected at different locations and the corresponding signal strengths are different, at least one of the characteristic signals collected in each time period can be used to generate a combined identification of each time period.
  • the collection location corresponding to each time period is identified, thereby achieving unified representation and storage of subsequent data collected at the same location point, thus replacing the method of storing trajectory data one by one in the existing technology and saving storage space for trajectory data.
  • the C-type characteristic signals include first-type characteristic signals
  • the M characteristic signals include A of the first-type characteristic signals
  • the N characteristic signals include B
  • the first type of characteristic signals A of the first type of characteristic signals include the B of the first type of characteristic signals
  • the signal strength of the B of the first type of characteristic signals is greater than that of the A of the first type of characteristic signals The signal strength of other characteristic signals; where B is a positive integer less than or equal to A.
  • the characteristic signal intensity can be used to filter out the characteristic signals used to generate the combined identification in each type of characteristic signal.
  • the characteristic signals collected at the same location can be
  • the combined identification generated by the collected characteristic signals that is, the characteristic signal data in one or more time periods is collected at the same location point
  • the combination identification generated by the characteristic signals collected at different location points is different; further, using The similarities and differences of the combined identifiers will uniformly represent and store the characteristic signal data of one or more time periods with the same combined identifier, thus replacing the way of storing trajectory data one by one in the existing technology and saving the storage space of trajectory data.
  • the attributes of the first location point include indoor and outdoor location identifiers, and the indoor and outdoor location identifiers are used to indicate that the first location point is an indoor location point or an outdoor location point; when the When the first position point is an indoor position point, the collection position of the characteristic signal in the one or more time periods is located indoors. When the first position point is an outdoor position point, the collection position of the characteristic signal in the one or more time periods is The signal collection location is located outdoors.
  • the positions of different position points are distinguished through indoor and outdoor position markers, so that the coordinates of the outdoor position points can be used to calculate the relevant indoor
  • the longitude and latitude of the location point so that indoor location points can also be represented and stored based on coordinates, and the compatibility of indoor and outdoor location point data can be achieved.
  • the plurality of position points also includes a second position point, and the combined identifier characterizing the second position point is generated by the ID of E characteristic signals; when the first position point and When the second position point satisfies one or more of the first condition, the second condition or the third condition, the first position point is adjacent to the second position point; wherein the first condition is :
  • the E characteristic signals and the N characteristic signals include common G characteristic signals, the ratio of G and E, and the ratio of G and N are both greater than or equal to the first threshold; and among the E characteristic signals The difference between the average signal strength of the first signal and the average signal strength of the first signal among the N characteristic signals is less than or equal to the second threshold, and the first signal is any one of the G characteristic signals;
  • the second condition is: the first location point corresponds to H time periods in the multiple trajectory data, the second location point corresponds to 1 time period in the multiple trajectory data, and the H time periods
  • the segment contains J time segments, and the ratio of J to H is greater than or equal to the third threshold.
  • the I time segment includes the third time segment, and the third time segment
  • the second time period and the third time period are located on the same trajectory data, and the interval between the second time period and the third time period is less than or equal to the fourth threshold, and the second time period is the J Any time period within the time period;
  • the third condition is: the distance between the first position point and the second position point is less than or equal to the fifth threshold, the first position point and the third position point are The distance between the two position points is calculated by the coordinates of the first position point and the coordinates of the second position point.
  • the coordinates of the first position point are calculated by comparing the coordinates collected in the one or more time periods. The position coordinates are summed and averaged.
  • the any two position points can be determined based on the spatial relationship (i.e., the first condition and the third condition) and the temporal relationship (i.e., the second condition). Whether the positions indicated by the position points are adjacent, and then this adjacent relationship can be used in the future, based on the approximation of the same attribute between adjacent position points, through the first attribute included in the attributes of some position points) to calculate this Some position points have the first attribute of other position points (whose attributes do not include the first attribute) with a certain adjacent relationship.
  • the first condition indicating the spatial relationship is specifically: for any two position points, when the signal intensity of the characteristic signal that generates their combined identification is close, it can be indicated that the two position points are adjacent or close;
  • the temporal relationship is specifically: For any one of the multiple time periods corresponding to a location point, among the multiple time periods corresponding to another location point, there is a time period with a collection time close to it, and these two The two time periods are located on the same trajectory data. It can be seen from the movement patterns of indoor people or objects that the collection positions corresponding to these two time periods are adjacent.
  • the attributes of the first position point include a first attribute
  • the method further includes: generating a first graph structure based on the adjacent relationship between the plurality of position points; wherein, The first graph structure is used to describe the connection relationship between the plurality of position points, and when the first position point is adjacent to the second position point, the first position point and the third position point Two position points are connected by an edge in the first graph structure.
  • the first graph structure includes a first type of position point and a second type of position point.
  • the position points of each position point in the first type of position point are The attributes include the first attribute, and the attributes of each position point in the second type of position points do not include the first attribute; the first graph structure is cut to obtain a second graph structure; where , the edge between any two first-type position points in the first graph structure is cut, and the edge between any two second-type position points connected to the first-type position point is cut Cutting, each position point in the second graph structure has a connection relationship with at least one position point.
  • the first attribute is any one of the attributes of the first position point.
  • the adjacent relationship of each position point is represented through the first graph structure, and then a second graph composed of the first type of position point and the second type of position point adjacent to the first type of position point is generated. structure, and use the second graph structure to calculate the missing or inaccurate first attributes among the attributes of the second type of location points, so as to remove the farther first type of location points that need to be included in the attributes of the second type of location points. The impact of the calculation of the first attribute.
  • the method further includes: obtaining an adjacency distance matrix based on the connection relationship between each position point in the second graph structure; wherein each element in the adjacency distance matrix is represented by In order to characterize the number of edges between a second type position point and a first type position point in the second graph structure; normalize the adjacency distance matrix to obtain a weight matrix; Multiply the weight matrix and the first matrix to obtain a second matrix; wherein, the elements in the first matrix are used to characterize the attributes of each first type position point in the second graph structure. An attribute, the elements in the second matrix are used to characterize the first attribute included in the attributes of each second type position point in the second graph structure.
  • the distance from the second type position point to the first type position point is represented by the number of edges, and then normalized to obtain the weight matrix of all second type position points in the second graph structure, and then Multiply with the first matrix to accurately obtain the first attribute required by each position point in the second type of position point.
  • the first attribute is the coordinates of the first location point or an indoor or outdoor location identifier
  • the indoor or outdoor location identifier is used to indicate that the first location point is an indoor location point or an outdoor location. location point.
  • the coordinates can be used to replace the combined identification of each indoor location point for indexing, achieving format compatibility and unification with outdoor data.
  • attribute completion in this way can better compensate for flaws and errors in the data collection process, so that each location point contains complete attributes, effectively improving the practicality of the solution.
  • the method further includes: performing location positioning, navigation, big data statistical analysis or service recommendation based on the coordinates of each location point in the plurality of location points.
  • the attributes of the first location point also include L first-type characteristic signals, and the L first-type characteristic signals conform to the quantity distribution characteristics of the first-type characteristic signals.
  • some original data that conforms to the distribution rules can also be stored in the attributes of the location points, so that the true characteristics of the original data can be better preserved while discarding most of the original trajectory data.
  • the characteristic signal includes one or more of GNSS signals, position coordinates, radio frequency signals, optical signals, acoustic signals, sensor signals or geomagnetic signals, wherein, Radio frequency signals include one or more of WIFI signals, Bluetooth signals, cell CELL signals or ultra-wideband UWB signals.
  • the quantity distribution characteristics include at least one of normal distribution, Poisson distribution, discrete distribution or interval distribution, and the interval distribution is used to describe the number of characteristic signals located in different intervals.
  • interval distribution and/or normal distribution can greatly save the storage space of trajectory data.
  • distribution-specific such as the mean and variance of normal distribution
  • partially consistent with distribution-specific original data it can be achieved Greatly retain the characteristics of the original data while saving storage space.
  • the present application provides a device for characterizing trajectory data.
  • the device includes: an acquisition unit configured to acquire multiple pieces of trajectory data. Each piece of trajectory data in the multiple pieces of trajectory data includes multiple time periods, and the characteristic signals collected in each of the plurality of time periods; a processing unit configured to generate a plurality of position points and attributes of each of the plurality of position points based on the plurality of trajectory data; Wherein, the plurality of position points include a first position point, the first position point corresponds to one or more time periods in the plurality of trajectory data, and each time period in the one or more time periods
  • the combination identification is the same, the combination identification of each time period is generated by the identification ID of at least one characteristic signal corresponding to each time period, the first position point is characterized by the combination identification; the one or The characteristic signals collected in multiple time periods include at least one type of characteristic signal, and the attributes of the first location point include the number of each type of characteristic signal in the at least one type of characteristic signal and/or the quantity distribution characteristics of each type of characteristic signal.
  • the one or more time periods include a first time period, the first time period includes M characteristic signals, and the M characteristic signals include Class C characteristic signals, M and C are positive integers; the combination identification is generated by the ID of N characteristic signals among the M characteristic signals, the N characteristic signals include D-type characteristic signals, and N is a positive integer less than or equal to M. , D is a positive integer less than or equal to C.
  • the C-type characteristic signals include first-type characteristic signals
  • the M characteristic signals include A of the first-type characteristic signals
  • the N characteristic signals include B
  • the first type of characteristic signals A of the first type of characteristic signals include the B of the first type of characteristic signals
  • the signal strength of the B of the first type of characteristic signals is greater than that of the A of the first type of characteristic signals The signal strength of other characteristic signals; where B is a positive integer less than or equal to A.
  • the attributes of the first location point also include L first-type characteristic signals, and the L first-type characteristic signals conform to the quantity distribution characteristics of the first-type characteristic signals.
  • the plurality of position points also includes a second position point, and the combined identifier characterizing the second position point is generated by the ID of E characteristic signals; when the first position point and When the second position point satisfies one or more of the first condition, the second condition or the third condition, the first position point is adjacent to the second position point; wherein the first condition is :
  • the E characteristic signals and the N characteristic signals include common G characteristic signals, the ratio of G and E, and the ratio of G and N are both greater than or equal to the first threshold; and among the E characteristic signals The difference between the average signal strength of the first signal and the average signal strength of the first signal among the N characteristic signals is less than or equal to the second threshold, and the first signal is any one of the G characteristic signals;
  • the second condition is: the first location point corresponds to H time periods in the multiple trajectory data, the second location point corresponds to 1 time period in the multiple trajectory data, and the H time periods
  • the segment contains J time segments, and the ratio of J to H is greater than or equal to the third threshold.
  • the I time segment includes the third time segment, and the third time segment
  • the second time period and the third time period are located on the same trajectory data, and the interval between the second time period and the third time period is less than or equal to the fourth threshold, and the second time period is the J Any time period within the time period;
  • the third condition is: the distance between the first position point and the second position point is less than or equal to the fifth threshold, the first position point and the third position point are The distance between the two position points is calculated by the coordinates of the first position point and the coordinates of the second position point.
  • the coordinates of the first position point are calculated by comparing the coordinates collected in the one or more time periods. The position coordinates are summed and averaged.
  • the attributes of the first location point include a first attribute
  • the processing unit is further configured to: generate a first graph structure based on the adjacent relationship between the multiple location points; Wherein, the first graph structure is used to describe the connection relationship between the plurality of position points, and when the first position point is adjacent to the second position point, the first position point and the The second position points are connected by an edge in the first graph structure.
  • the first graph structure includes first type position points and second type position points.
  • Each position in the first type of position point The attributes of the points include the first attribute, and the attributes of each position point in the second type of position points do not include the first attribute; cut the first graph structure to obtain the second graph structure ; Wherein, the edge between any two of the first type position points in the first graph structure is cut, and the edge between any two of the second type of position points connected to the first type of position point The edges are cut, and each position point in the second graph structure has a connection relationship with at least one position point.
  • the processing unit is further configured to: obtain an adjacency distance matrix based on the connection relationship between each location point in the second graph structure; wherein each of the adjacency distance matrix The element is used to represent the number of edges between a second type position point and a first type position point in the second graph structure; normalize the adjacency distance matrix to obtain a weight matrix; Multiply the weight matrix and the first matrix to obtain a second matrix; wherein the elements in the first matrix are used to represent the attributes contained in each of the first type position points in the second graph structure.
  • the first attribute of the second matrix is used to characterize the first attribute included in the attributes of each second type position point in the second graph structure.
  • the first attribute is the coordinates of the first location point or an indoor or outdoor location identifier
  • the indoor or outdoor location identifier is used to indicate that the first location point is an indoor location point or an outdoor location. location point.
  • the processing unit is further configured to perform location positioning, navigation, big data statistical analysis, or service recommendation based on the coordinates of each of the multiple location points.
  • the quantity distribution characteristics include at least one of normal distribution, Poisson distribution, discrete distribution or interval distribution, and the interval distribution is used to describe the number of characteristic signals located in different intervals.
  • the characteristic signal includes one or more of GNSS signals, position coordinates, radio frequency signals, optical signals, acoustic signals, sensor signals or geomagnetic signals, wherein, Radio frequency signals include one or more of WIFI signals, Bluetooth signals, cell CELL signals or ultra-wideband UWB signals.
  • inventions of the present application provide a chip system.
  • the chip system includes at least one processor, a memory, and an interface circuit.
  • the memory, the interface circuit, and the at least one processor are interconnected through lines. Instructions are stored in the at least one memory; when the instructions are executed by the processor, the method described in any one of the above first aspects is implemented.
  • inventions of the present application provide a computer device.
  • the computer device includes at least one processor, a memory, and an interface circuit.
  • the memory, the interface circuit, and the at least one processor are interconnected through lines. Instructions are stored in the at least one memory; when the instructions are executed by the processor, the method described in any one of the above first aspects is implemented.
  • the computer device is a server or a terminal device, wherein the terminal device includes a mobile phone, a computer, a car machine, or a tablet.
  • embodiments of the present application provide a computer-readable storage medium that stores a computer program.
  • the computer program When the computer program is executed, the method described in any one of the above-mentioned first aspects can be performed. accomplish.
  • embodiments of the present application provide a computer program product.
  • the computer program product includes program instructions. When the program instructions are run on a computer, the method described in any one of the above first aspects is implemented. .
  • Figure 1 is a schematic diagram of a system architecture provided by an embodiment of the present application.
  • Figure 2 is a schematic diagram of a trajectory data collection scenario provided by an embodiment of the present application.
  • Figure 3 is a schematic flow chart of a characterization method for trajectory data provided by an embodiment of the present application.
  • Figure 4 is a schematic structural diagram of trajectory data in an embodiment of the present application.
  • Figures 5A-5C are schematic diagrams of the generation process of a graph structure provided by the embodiment of the present application.
  • Figure 6 is a schematic diagram of the calculation process of a second matrix provided by an embodiment of the present application.
  • Figures 7A-7B are schematic diagrams of indoor and outdoor spatial topological structures extracted based on the trajectory data representation method of this application;
  • Figure 8 is a schematic structural diagram of a device for characterizing trajectory data provided by an embodiment of the present application.
  • Figure 9 is a schematic structural diagram of a computer device provided by this application.
  • Crowdsourcing refers to collecting available data such as sensors, radio frequency signals and network signals of users’ smart terminals without the user’s perception through a certain triggering mechanism. Crowdsourcing is a common way to collect indoor and outdoor trajectory data.
  • Trajectory data refers to the collection of characteristic signal data collected by the user's intelligent terminal within a continuous period of time. Each trajectory data represents the characteristic signal data collected at different locations by the intelligent terminal as the user moves, including indoor trajectories. data and outdoor trajectory data.
  • Cell CELL A wireless coverage area identified by a base station identification code or a Cell Global Identifier (CGI). When using an omnidirectional antenna structure, the cell is the base station area.
  • CGI Cell Global Identifier
  • IOD Indoor or Outdoor Location Identifier
  • Figure 1 is a schematic diagram of a system architecture provided by an embodiment of the present application.
  • the system architecture includes user equipment 101, user equipment 102, cloud 110 and data storage system 120, where cloud 110 and user equipment perform data transmission through a communication network.
  • cloud 110 and user equipment perform data transmission through a communication network.
  • the user equipment listed in Figure 1 does not constitute a limit to their number.
  • the user equipment collects data over a period of time to form trajectory data, and then uploads the trajectory data to the cloud 110 through the communication network.
  • the trajectory data collection method may be crowdsourcing, manual collection or other collection methods, which is not limited by this application.
  • the trajectory data includes indoor trajectory data and/or outdoor trajectory data.
  • the cloud 110 characterizes the trajectory data received in a specific area (such as a shopping mall-centered area) within a period of time (such as 3 days or a month) (i.e., the characterization method of trajectory data in this application) to form the Maps in a specific area (including indoor and outdoor), and store the generated position map and representation data of the trajectory data (ie, the position points and attributes of the position points found in the following embodiments) into the data storage system 120 .
  • the user device may send a positioning request to the cloud 110 .
  • the cloud 110 provides users with online positioning services based on the generated map, or sends the user a map of the specific area to provide users with offline low-power positioning services. Then, based on the positioning service and other personalized needs of the user (for example, navigation request and service recommendation request), the user is provided with navigation service and personalized recommendation service (for example, based on the user's specific request to recommend the user in the specific area) merchants, etc.).
  • the positioning service and other personalized needs of the user for example, navigation request and service recommendation request
  • personalized recommendation service for example, based on the user's specific request to recommend the user in the specific area
  • the cloud 110 may include at least one server, and the data storage system 120 may be a device independent of the server, or integrated within the server, which is not limited in this application.
  • the user in Figure 1
  • the device can independently execute the trajectory data characterization method in this application.
  • the trajectory data at this time may be collected by the user equipment and/or transmitted to the user equipment by other equipment, which is not limited in this application.
  • the user equipment is a terminal device with communication and data collection functions such as a mobile phone, computer, smart watch, bracelet, etc. This application is not limited to this.
  • Figure 2 is a schematic diagram of a trajectory data collection scenario provided by an embodiment of the present application. It is used to describe the collection of trajectory data in a specific area centered on a building.
  • the specific area includes the building and the surrounding area. Streets adjacent to buildings.
  • Figure 2 shows 6 trajectories (ie, 6 trajectories data) within this specific area.
  • the data collection locations on Track 1 and Track 2 are all located indoors
  • the data collection locations on Tracks 3 to 5 include both indoors and outdoors
  • the data collection locations on Track 6 are all located outdoors.
  • FIG. 3 is a schematic flowchart of a method for characterizing trajectory data provided by an embodiment of the present application. As shown in Figure 3, the method includes step S310 and step S320. in,
  • Step S310 Acquire multiple pieces of trajectory data, each of the multiple pieces of trajectory data includes multiple time periods, and characteristic signals collected in each of the multiple time periods.
  • Each track data is a characteristic signal collected within a continuous period of time.
  • the length of the continuous period of time is not limited, for example, it can be 30 minutes or 1 hour.
  • the characteristic signals collected within a continuous period of time include at least one type of characteristic signals.
  • the characteristic signal includes one or more of GNSS signals, position coordinates, radio frequency signals, optical signals, acoustic signals, sensor signals or geomagnetic signals, where the radio frequency signals include WIFI signals, Bluetooth signal, cell CELL signal or ultra-wideband UWB signal.
  • the above-mentioned position coordinates may be coordinates in different coordinate reference systems.
  • the coordinate reference system may be a geodetic coordinate system, a projected coordinate system, a custom coordinate system, etc. This application is not limited to this.
  • the position coordinates in the geodetic coordinate system can be characterized by longitude, latitude, elevation, etc.
  • Ultra Wide Band is a wireless carrier signal with a frequency bandwidth above 1GHz.
  • the above position coordinates may be calculated through collected GNSS signals, or may be obtained through manual collection or direct acquisition from other terminal devices, which is not limited in this application.
  • each type of characteristic signal can be collected at fixed or non-fixed time intervals, and the collection time intervals of different types of characteristic signals can be the same or different.
  • its collection content at a collection moment includes at least one identification ID of this type of characteristic signal and the corresponding signal strength.
  • the collection content at each collection moment includes the ID of the WIFI signal recognized by the smart terminal (for example, Media Access Control (MAC) ID) and the corresponding signal strength.
  • MAC Media Access Control
  • the collection content at each collection moment includes the ID of the CELL signal recognized by the smart terminal (such as the global cell identifier (Cell Global Identifier, CGI) and/or the physical cell identifier (Physical Cell Identifier, PCI) )) and the corresponding signal strength.
  • the smart terminal such as the global cell identifier (Cell Global Identifier, CGI) and/or the physical cell identifier (Physical Cell Identifier, PCI)
  • CGI Cell Global Identifier
  • PCI Physical Cell Identifier
  • the relationship between the multiple time periods included in the above-mentioned trajectory data and a continuous period of time during which each trajectory data is collected is: the multiple time periods included in each trajectory data are based on a fixed time interval for this continuous period of time. Or divided into non-fixed time intervals, and any two time periods among the multiple divided time periods do not overlap.
  • the method of dividing a continuous period of time for data collection of each trajectory data can be determined according to the specific scenario, which is not limited in this application.
  • each divided time period contains only one collection moment of the type of characteristic signal.
  • FIG 4 is a schematic structural diagram of trajectory data in an embodiment of the present application. Examples of collecting two types of characteristic signals on a trajectory data are listed.
  • the collection time of this trajectory data i.e., the above-mentioned continuous time
  • the collected characteristic signals include cell CELL signals and WIFI signals.
  • the collection times of cell signals are: 12:32, 12:37 and 12:43; the collection times of WIFI signals are 12:34, 12:39 and 12:44. That is, the collection time interval of both cell signal and WIFI signal is 5 minutes.
  • the above-mentioned cell signals collected at each moment include a master station signal and at least one neighbor station signal.
  • the WIFI signals collected at each moment include all WIFI signals recognized by the smart terminal. That is, at each collection moment within the above collection time, the number of collected characteristic signals is at least one.
  • Step S320 Generate multiple location points and attributes of each location point in the multiple location points based on the multiple trajectory data; wherein the multiple location points include a first location point, and the first location point Points in the plurality of trajectory data correspond to one or more time periods, the combination identifier of each time period in the one or more time periods is the same, and the combination identifier of each time period is determined by each time period.
  • the identification ID of at least one characteristic signal corresponding to the segment is generated, and the first location point is characterized by the combined identification; the characteristic signals collected in one or more time periods include at least one type of characteristic signal, and the first
  • the attributes of the location points include the number of each type of characteristic signals in the at least one type of characteristic signals and/or the quantity distribution characteristics of each type of characteristic signals.
  • this application describes the first position point among multiple position points as an object to represent the generation of each position point and its attributes, that is, the first position point is any one of the multiple position points.
  • Time periods with the same combination identifier correspond to the same location point, and the same location point is represented by the combination identifier.
  • At least one characteristic signal corresponding to each time period includes the characteristic signal collected during the time period.
  • the combined identifier of each time period is obtained by combining the identifiers of at least one characteristic signal collected within the time period. From a technical effect point of view, since the types and signal strengths of the characteristic signals collected at different locations are different, combination identifiers can be used to indicate the similarities and differences of the collection locations. That is, the time period with the same combination identifier corresponds to the same collection location, and different combinations The identified time periods correspond to different collection locations.
  • the following describes the construction process of the combined identifier of each time period, taking one or more time periods corresponding to the first position point as an object.
  • the one or more time periods include a first time period, the first time period contains M characteristic signals, the M characteristic signals include C-type characteristic signals, M and C are positive Integer; the combined identification is generated by the ID of N characteristic signals among the M characteristic signals, the N characteristic signals include D type characteristic signals, N is a positive integer less than or equal to M, and D is less than or equal to A positive integer equal to C.
  • the first time period is any time period among the one or more time periods mentioned above. That is, for each time period, its combination identification is generated by a combination of the IDs of at least one characteristic signal collected within the time period.
  • the at least one characteristic signal includes at least one type of characteristic signal.
  • the combined identification of each time period can be generated by a combination of IDs of multiple CELL signals (such as a master station signal + multiple neighbor station signals); or by a combination of MAC IDs of multiple WIFI signals; or by at least one It is generated by an ID combination of CELL signal, at least one WIFI signal and at least one Bluetooth signal; or it is generated by an ID combination of at least one WIFI signal, at least one geomagnetic signal and at least one sensor signal, which is not limited by this application.
  • a combination of IDs of multiple CELL signals such as a master station signal + multiple neighbor station signals
  • MAC IDs of multiple WIFI signals or by at least one It is generated by an ID combination of CELL signal, at least one WIFI signal and at least one Bluetooth signal; or it is generated by an ID combination of at least one WIFI signal, at least one geomagnetic signal and at least one sensor signal, which is not limited by this application.
  • the method of generating the combined identification from the ID combination of at least one characteristic signal can be directly using the serial combination of IDs, or can be calculated using a specific algorithm (such as a hash algorithm, etc.).
  • the screening method of multiple characteristic signals in the same type of characteristic signals is as follows:
  • the same type of characteristic signals in the first time period are sorted from high to low according to the signal strength, and then a preset number of characteristic signals are filtered out according to the signal strength from high to low, and are The filtered characteristic signals are subsequently used to generate a combined identification of the first time period.
  • the above-mentioned type C characteristic signals include first type characteristic signals.
  • the M characteristic signals include A first-type characteristic signals
  • the N characteristic signals include B first-type characteristic signals
  • the A first-type characteristic signals include the B first-type characteristic signals.
  • Class characteristic signals the signal strength of the B first class characteristic signals is greater than the signal intensity of other characteristic signals in the A first class characteristic signals; where B is a positive integer less than or equal to A.
  • the above-mentioned A first-type characteristic signals include the above-mentioned B first-type characteristic signals.
  • B first-type characteristic signals are selected from A first-type characteristic signals based on the signal strength of the characteristic signal. Specifically, the signal strength of the B first-type characteristic signals is greater than the signal strength of other characteristic signals among the A first-type characteristic signals.
  • the combined identifier in each time period is generated by a combination of the primary station ID and three neighbor station IDs. If the CELL signals collected in the first time period include a master station signal and 10 neighbor station signals, first filter out the top 3 neighbor station signals with higher signal strength according to the signal strength of the neighbor station signals, and then use the main station signal to The station ID (such as CGI) and the three filtered neighbor station IDs (such as PCI) are combined to generate a combined identification of the first time period.
  • the station ID such as CGI
  • the three filtered neighbor station IDs such as PCI
  • the combination identifier is generated by a combination of 5 WIFI signals in each time period. If the WIFI signals collected in the first time period include 10 WIFI signals, then the top 5 WIIF signals with higher signal strengths are screened out according to the signal strength of the WIFI signals, and then the ID combination of the top 5 WIFI signals is used to generate The combination identifier of the first time period.
  • the attributes of the first location point include indoor and outdoor location identifiers, and the indoor and outdoor location identifiers are used to indicate that the first location point is an indoor location point or an outdoor location point; when the first location point is When the first position point is an indoor position point, the collection position of the characteristic signal in the one or more time periods is located indoors. When the first position point is an outdoor position point, the collection position of the characteristic signal in the one or more time periods is located in outdoor.
  • the indoor and outdoor identification of the first location point can be determined by counting the number of time periods corresponding to the first location point and whether the characteristic signals collected in each time period include GNSS signals. When the number of time periods during which satellite positioning signals are collected is greater than or equal to the preset ratio, the indoor and outdoor location identifier of the first location point can be set to 1 to indicate one or more time periods corresponding to the first location point.
  • the collection location of the characteristic signal is located outdoors; when the number of time periods in which satellite positioning signals are collected is less than the preset ratio, the indoor and outdoor location identifier of the first location point can be set to 0 to indicate a location corresponding to the first location point. Or the collection location of characteristic signals in multiple time periods is located indoors.
  • the plurality of position points also include a second position point, and the combined identifier representing the second position point is generated from the IDs of the E characteristic signals.
  • the following takes the first position point and the second position point among the above multiple position points as an example to describe how to determine the adjacent relationship between any two position points among the multiple position points, that is, the first position point and the second position point. are any two of multiple location points.
  • the first position point and the second position point satisfy one or more of the first condition, the second condition or the third condition, the first position point and the second position point are Adjacent.
  • first position point and the second position point are adjacent (or have an adjacent relationship).
  • the above-mentioned first position point and the second position point are adjacent means that the distance between the first position point and the second position point is less than or equal to the preset distance.
  • the first condition is: the E characteristic signals and the N characteristic signals include common G characteristic signals, and the ratio of G and E and the ratio of G and N are both greater than or equal to the first threshold; and The difference between the average signal strength of the first signal among the E characteristic signals and the average signal strength of the first signal among the N characteristic signals is less than or equal to the second threshold, and the first signal is the G characteristic signal. any of the signals.
  • the average signal strength of each of the N characteristic signals used to generate the combined identification is first calculated. Specifically, for each of the N characteristic signals, the signal intensity of the characteristic signal in each time period corresponding to the first position point is summed, and then averaged to obtain each characteristic signal. the average signal strength. In the same way, the average signal strength of each of the E characteristic signals can also be obtained by referring to the above method. Finally, calculate the difference between the average signal intensity of the first characteristic signal among the E characteristic signals and the average signal intensity among the N characteristic signals.
  • the first threshold and the second threshold are set based on specific scenarios.
  • the first threshold is to ensure that the same characteristic signals contained in the E characteristic signals and N characteristic signals reach a certain proportion to prove that the first position point and the second position point are close to each other.
  • E, G and N are positive integers.
  • the second condition is: the first location point corresponds to H time periods in the multiple trajectory data, the second location point corresponds to 1 time period in the multiple trajectory data, and the H time periods
  • the segment contains J time segments, and the ratio of J to H is greater than or equal to the third threshold.
  • the I time segment includes the third time segment, and the third time segment
  • the second time period and the third time period are located on the same trajectory data, and the interval between the second time period and the third time period is less than or equal to the fourth threshold, and the second time period is the J Any time period within the time period.
  • the second condition is also called a timing condition.
  • the second condition For J time periods among the H time periods corresponding to the first position point (the ratio of J to H is greater than or equal to the third threshold), for any one of the J time periods (i.e., the above-mentioned second time period)
  • This third time period and the second time period are located on the same trajectory data, and the second time period and the third time period
  • the time interval is less than or equal to the fourth threshold, it is determined that the first position point and the second position point satisfy the above-mentioned second condition.
  • the collection time of the second time period is 15:35-15:45
  • the collection time of the third time period is 15:45-15:55.
  • the end time of the previous time period i.e., the second time period
  • Calculate the time interval with the start time of the subsequent time period ie, the third time period
  • the time interval between the second time period and the third time period is 0.
  • other methods can also be used to calculate the time interval between the above two time periods, and this application is not limited to this.
  • the third threshold and the fourth threshold are set based on specific scenarios.
  • H, I and J are positive integers.
  • the third condition is: the distance between the first position point and the second position point is less than or equal to a fifth threshold, and the distance between the first position point and the second position point passes through The coordinates of the first position point and the coordinates of the second position point are calculated. The coordinates of the first position point are obtained by summing and averaging the position coordinates collected in the one or more time periods. .
  • the distance between two position points can be directly calculated using the coordinates.
  • the coordinates of each position point are obtained by summing and averaging the position coordinates collected in one or more corresponding time periods.
  • the first position point corresponds to 100 time periods, and position coordinates are collected in 89 of the 100 time periods. Then the coordinates of the first position point can be obtained using the 89 data collected in the 89 time periods. The position coordinates are calculated by averaging. For the collection process of position coordinates in each time period, please refer to the description in the previous embodiment.
  • the fifth threshold is set according to specific scenarios, which is not limited in this application.
  • attribute completion of the location points can be performed based on the adjacent relationships.
  • the attributes of the first position point include the first attribute, and the first attribute is any attribute that needs attribute completion. Attribute completion of the first attribute means: for position points whose first attribute is missing or inaccurate, calculate the first attribute of these position points. The process is as follows:
  • the method further includes: generating a first graph structure based on adjacent relationships between the multiple location points; wherein the first graph structure is used to describe connections between the multiple location points. relationship, and when the first position point is adjacent to the second position point, the first position point and the second position point are connected by an edge in the first graph structure, and the third position point
  • a graph structure includes a first type of location point and a second type of location point.
  • the attributes of each location point in the first type of location point include the first attribute.
  • Each location point in the second type of location point includes the first attribute.
  • the attributes of the position points do not contain the first attribute; the first graph structure is cut to obtain a second graph structure; wherein, the distance between any two first-type position points in the first graph structure The edges are cut, and the edges between any two second-type position points connected to the first-type position points are cut, and each position point in the second graph structure has a connection relationship with at least one position point.
  • the first type of location points are location points whose attributes include the first attribute
  • the above-mentioned second type of location points are location points whose attributes do not include the first attribute.
  • not including the first attribute should be understood to mean that the first attribute does not exist in the attributes or the first attribute is inaccurate. Whether the first attribute is accurate depends on the specific scenario, and this application will not elaborate on it in detail.
  • two adjacent position points (having an adjacent relationship) are connected with an edge to generate a first graph structure representing the connection relationship between the plurality of position points. Then cut the edges between all first-type position points in the first graph structure, and cut the edges between any two second-type position points that have an edge relationship with the first-type position points (connected by an edge). Carry out cutting to obtain the second picture structure.
  • the second graph structure is used to represent the connection relationship between the second type of location points, and between the second type of location points and adjacent first type location points.
  • Figures 5A-5C are schematic diagrams of a generation process of a graph structure provided by an embodiment of the present application.
  • FIG. 5A is a specific example of a first graph structure, used to describe adjacent relationships between multiple position points generated from trajectory data collected in a specific area. As shown in Figure 5A, it contains a total of 14 position points (i.e. position points 0-13). Among them, the dotted circle represents the first type of location points (i.e., location points 4-6 and location points 8-13), and the solid line circle represents the second type of location points (i.e., location points 0-3 and location point 7), with There is an edge connecting the two position points of the adjacent relationship.
  • FIG. 5B is a schematic diagram describing the process of cutting the first image structure shown in FIG. 5A.
  • the dotted line segment indicates the edge to be cut.
  • the process of cutting the edges between all first-type position points in Figure 5B includes cutting between position points 6 and 8, between position points 6 and 9, between position points 5 and 10, and between position points 5 and 11. Cut the edges between, between position points 4 and 12, between position points 4 and 13, and between position points 12 and 13.
  • the process of cutting the edge between any two second-type position points that have an edge relationship with the first-type position point includes cutting the edge between position points 7 and 3.
  • FIG. 5C is used to describe an example of the second graph structure obtained after cutting. As shown in Figure 5C, it contains 4 second-type position points (ie, position points 0-3) and 3 first-type position points (ie, position points 4-6).
  • the second type of position can be calculated based on the first attribute contained in the attributes of each position point in the first type of position point in the second graph structure and the connection relationship between each position point.
  • the method further includes: obtaining an adjacency distance matrix based on the connection relationship between each position point in the second graph structure; wherein each element in the adjacency distance matrix is used to characterize the first The number of edges between a second type position point and a first type position point in the two-graph structure; normalize the adjacency distance matrix to obtain a weight matrix; multiply the weight matrix and the first matrix , obtain the second matrix; wherein, the elements in the first matrix are used to characterize the first attributes contained in the attributes of each first type position point in the second graph structure, and the elements in the second matrix The first attribute used to characterize the attributes of each second type position point in the second graph structure.
  • the elements in each row respectively represent the path from a second type location point to each first type location point (or a second type location point and a first type location point). the number of edges contained between). For example, if the second feature map contains three first-category position points, then the path between a second-category position point and these three first-category position points contains the number of edges (such as 2, 3, and 4), is a row element in the adjacency distance matrix.
  • each column element in the adjacency distance matrix can also be used to respectively represent the number of edges contained on the path from a second type location point to each first type location point, and this application is not limited to this. That is, each element in the adjacency distance matrix is used to represent the number of edges contained on the path from a second-type location point to a first-type location point.
  • the above-mentioned normalization processing refers to normalization processing in units of each row or column of elements in the adjacency distance matrix.
  • each row element in the adjacency distance matrix represents the number of edges contained on the path from a second-type position point to each first-type position point
  • normalization is performed in units of each row element;
  • the adjacency distance matrix When the elements in each column represent the number of edges contained on the path from a second-category position point to each first-category position point, normalization is performed in units of each column element.
  • each row of elements when normalizing each row of elements as a unit, if one of the row elements is 2, 3, and 3, then after normalization, the row elements will be 1/4, 3/8, and 3/8 respectively. .
  • each row or column element in the first matrix is used to represent the first attribute contained in the attributes of each first-type position point.
  • Each row or column element in the second matrix is used to represent the first attribute that needs to be included in the attributes of each second type position point. That is, after calculating the first attribute that needs to be included in the attributes of each second type position point, the calculated first attribute is added to the attributes of the position point and stored.
  • the above-mentioned first type of location point is the location point that contains coordinates in the attribute
  • the second type of location point is the location point that is not included in the attribute or contains inaccurate coordinates.
  • the adjacency distance matrix is obtained based on the second graph structure in the embodiment shown in Figures 5A-5C. It is a 4*3 matrix, in which each row of elements is used to represent a second type of location point.
  • the number of edges contained on the path to each first-type location point. 0, 1, 2 and 3 on the left side of the adjacency distance matrix represent the second type of position points corresponding to the elements in each row, and 4, 5 and 6 above the adjacency distance matrix represent the first type of position points corresponding to the elements in each column.
  • Each element is used to represent the number of edges contained on the path from a second-category location point to a first-category location point.
  • the elements in the first row and first column represent the second-category location.
  • the elements in the third row and second column represent the number of edges contained on the path from second-category point 2 to first-category point 5.
  • quantity which is 2.
  • Each row of elements in the first matrix is used to represent the coordinates contained in the attributes of each location point in a first-type location point (in this example, represented by longitude and latitude in the geodetic coordinate system, and the elevation is omitted in this example). That is, in this example, the coordinates included in the attributes of the three first-type position points are (21, 28), (42, 56), and (21, 14) respectively.
  • each row element in the second matrix represents the coordinates (ie, longitude and latitude) that will be included in the attributes of each location point in a second type location point.
  • the coordinates that need to be included in the attributes of the four second-type position points finally calculated are (30, 34), (24, 26), (28, 33), and (30, 42) respectively.
  • indoor and outdoor location maps can be generated through the above-mentioned trajectory data representation method in this application to provide users with indoor and outdoor positioning and other services based on indoor and outdoor positioning.
  • the first attribute is the coordinates of the first location point or an indoor or outdoor location identifier
  • the indoor or outdoor location identifier is used to indicate that the first location point is an indoor location point or an outdoor location point.
  • attributes that can be completed include but are not limited to coordinates of location points, indoor and outdoor location identifiers, and collected feature signals.
  • the above method further includes: performing location positioning, navigation, big data statistical analysis or service recommendation based on the coordinates of each location point in the plurality of location points.
  • a map (representing indoor and outdoor locations) of a specific area where the multiple trajectory data is located can be generated based on the coordinates of each location point. Then, after the user enters this specific area, the user can send a positioning request to the server, and the server can provide the user with positioning services based on the generated map of the specific area.
  • navigation and service recommendations can also be provided for users based on user requests (for example, recommending merchants closest to the user).
  • the above big data statistical analysis can be based on the characterized regional or city trajectory data to construct a city-level or regional-level heat map to analyze the regional or city's flow of people, etc.
  • This application is not limited to this.
  • the quantity distribution characteristics include at least one of normal distribution, Poisson distribution, discrete distribution or interval distribution, and the interval distribution is used to describe the quantity of characteristic signals located in different intervals.
  • the index based on counting the quantity distribution characteristics of each type of characteristic signal may be the signal strength of each type of characteristic signal, etc., which is not limited in this application.
  • interval distribution when interval distribution is used to characterize the quantity distribution characteristics of a type of characteristic signal, the number of characteristic signals of this type in different signal strength intervals can be counted, and then the range and corresponding number of the signal strength interval are added to the attributes of the location point for storage.
  • the normal distribution is used to characterize the quantitative distribution characteristics of a type of feature signal, the mean and variance of the normal distribution obtained based on signal strength statistics can be added to the location point attributes for storage.
  • the attributes of the first location point also include L first-type characteristic signals, and the L first-type characteristic signals conform to the quantity distribution characteristics of the first-type characteristic signals.
  • L is a positive integer.
  • the first type of characteristic signal is any type of characteristic signal among at least one type of characteristic signal mentioned above.
  • the attributes of the location point also include part of the original data of each type of characteristic signal, and this part of the original data conforms to the quantity distribution characteristics of the type of characteristic signal.
  • the storage method based on location points and attributes can retain the original data characteristics of multiple trajectory data to the greatest extent.
  • the first type of characteristic signal is a WIFI signal
  • the number of WIFI signals collected in one or more time periods corresponding to the first location point is 5,000.
  • the quantity distribution characteristics of the 5,000 WIFI signals are represented by a normal distribution, that is, the normal distribution is used to represent the normal distribution characteristics of the 5,000 signal strengths corresponding to the 5,000 signals.
  • 100 WIFI signals can be selected from 5,000 WIFI signals, and the signal strength distribution of these 100 WIFI signals conforms to the normal distribution of the signal strengths of the 5,000 WIFI signals, and these 100 WIFI signals The signal is stored together as an attribute of the first position point.
  • interval distribution when using interval distribution to describe the quantity distribution characteristics of the above 5000 WIIF signals, first count the number of WIFI signals in different signal strength intervals. Assume that the signal strength is between [-10, 0]dbm, [0, The quantity proportions in 10]dbm, [10,20]dbm, and [20,30]dbm are 10%, 20%, 40%, and 30% respectively. Then, 100 original data that are added to the attributes of the location point and stored together are selected from 5,000 WIFI signals, and their quantity distribution also conforms to the distribution rules of the above four signal strength intervals.
  • the first type of characteristic signal when the first type of characteristic signal is also represented by other quantity distribution characteristics, the first type of characteristic signal that is selected and stored as an attribute of the first position point must also conform to the corresponding other quantity distribution characteristics.
  • the attributes of the first position point may include at least one of the following three types.
  • Table 1 is used to represent the compression ratio of trajectory data in a certain area of Nanjing within one week and one month after adopting the trajectory data characterization method in this application.
  • the number of trajectory points in Table 1 is the number of time periods obtained after dividing the collected multiple trajectory data.
  • the compression ratio is obtained by dividing the number of trajectory points by the number of position points, and M represents the counting unit—million.
  • Position point the representation method of attributes. Trajectory data is prone to over-accumulation, especially for trajectory data collected through crowdsourcing. Crowdsourcing data itself has a large amount of data. When the data is accumulated to a certain extent, the accumulated data has fully reflected the capabilities of the crowdsourcing data itself. For all the data characteristics expressed, increasing the amount of data at this time has no impact on the data distribution and characteristics. This has reached the boundary of crowdsourcing data accumulation. At this time, the continuous accumulation of increasing data is redundant. In the characterization method of this application, attributes (quantity distribution characteristics and quantities, etc.) are used to characterize multiple types of characteristic signals in a local range, and limited storage of part of the original data not only balances the crowdsourcing data, but also avoids data crowdsourcing. Transition accumulation over data localities and signal categories. Thus achieving the compression effect.
  • Table 1 Compression ratio table of trajectory data in different collection time periods
  • Figures 7A-7B are schematic diagrams of indoor and outdoor spatial topological structures extracted based on the trajectory data representation method of the present application.
  • Figure 7A shows the extracted outdoor spatial topology.
  • the left side of Figure 7A shows the distribution of all trajectory points included in the collected trajectory data (that is, the position points corresponding to each time period included in the trajectory data) on the map. The denser the trajectory points are, the higher the density of human flow is. Usually, roads have the highest density of people flow, and the entire outdoor trajectory data will essentially tend to represent the road network.
  • the left image in Figure 7A shows the location information of outdoor track points, which is consistent with the road network.
  • the right side of Figure 7A shows the outdoor heat statistics map after rasterization of the left image, showing the overall road network situation.
  • Figure 7B shows the extracted indoor spatial topology. Based on the distribution of indoor trajectory points in each area, hotspot indoor areas with dense human flow can be found. These hotspot areas provide the basis for subsequent indoor positioning and floor identification.
  • Figure 8 is a schematic structural diagram of a trajectory data characterization device provided by an embodiment of the present application, which can be used to perform each step in the foregoing method embodiment.
  • the device includes an acquisition unit 810 and a processing unit 820. in,
  • the acquisition unit 810 is configured to acquire multiple pieces of trajectory data, each of the multiple pieces of trajectory data including multiple time periods, and the characteristic signals collected in each of the multiple time periods.
  • the processing unit 820 is configured to generate multiple location points and attributes of each location point in the multiple location points based on the multiple trajectory data.
  • the plurality of position points include a first position point, the first position point corresponds to one or more time periods in the plurality of trajectory data, and each time period in the one or more time periods
  • the combination identification is the same, the combination identification of each time period is generated by the identification ID of at least one characteristic signal corresponding to each time period, the first position point is characterized by the combination identification;
  • the one or The characteristic signals collected in multiple time periods include at least one type of characteristic signal, and the attributes of the first location point include the number of each type of characteristic signal in the at least one type of characteristic signal and/or the quantity distribution characteristics of each type of characteristic signal.
  • the one or more time periods include a first time period, the first time period includes M characteristic signals, and the M characteristic signals include Class C characteristic signals, M and C are positive integers; the combination identification is generated by the ID of N characteristic signals among the M characteristic signals, the N characteristic signals include D-type characteristic signals, and N is a positive integer less than or equal to M. , D is a positive integer less than or equal to C.
  • the C-type characteristic signals include first-type characteristic signals;
  • the M characteristic signals include A of the first-type characteristic signals, and the N characteristic signals include B
  • the first type of characteristic signals, A of the first type of characteristic signals include the B of the first type of characteristic signals, the signal strength of the B of the first type of characteristic signals is greater than that of the A of the first type of characteristic signals The signal strength of other characteristic signals; where B is a positive integer less than or equal to A.
  • the attributes of the first location point also include L first-type characteristic signals, and the L first-type characteristic signals conform to the quantity distribution characteristics of the first-type characteristic signals.
  • the plurality of position points also includes a second position point, and the combined identifier characterizing the second position point is generated by the ID of E characteristic signals; when the first position point and When the second position point satisfies one or more of the first condition, the second condition or the third condition, the first position point is adjacent to the second position point; wherein the first condition is :
  • the E characteristic signals and the N characteristic signals include common G characteristic signals, the ratio of G and E, and the ratio of G and N are both greater than or equal to the first threshold; and among the E characteristic signals The difference between the average signal strength of the first signal and the average signal strength of the first signal among the N characteristic signals is less than or equal to the second threshold, and the first signal is any one of the G characteristic signals;
  • the second condition is: the first location point corresponds to H time periods in the multiple trajectory data, the second location point corresponds to 1 time period in the multiple trajectory data, and the H time periods
  • the segment contains J time segments, and the ratio of J to H is greater than or equal to the third threshold.
  • the I time segment includes the third time segment, and the third time segment
  • the second time period and the third time period are located on the same trajectory data, and the interval between the second time period and the third time period is less than or equal to the fourth threshold, and the second time period is the J Any time period within the time period;
  • the third condition is: the distance between the first position point and the second position point is less than or equal to the fifth threshold, the first position point and the third position point are The distance between the two position points is calculated by the coordinates of the first position point and the coordinates of the second position point.
  • the coordinates of the first position point are calculated by comparing the coordinates collected in the one or more time periods. The position coordinates are summed and averaged.
  • the attributes of the first location point include a first attribute
  • the processing unit 820 is further configured to: generate a first graph structure based on the adjacent relationship between the multiple location points. ; Wherein, the first graph structure is used to describe the connection relationship between the plurality of position points, and when the first position point is adjacent to the second position point, the first position point and The second location points are connected by an edge in the first graph structure.
  • the first graph structure includes first type location points and second type location points.
  • Each of the first type location points The attributes of the position points include the first attribute, and the attributes of each position point in the second type of position points do not include the first attribute; cut the first graph structure to obtain the second graph Structure; wherein, the edge between any two of the first type position points in the first graph structure is cut, and the edge between any two of the second type of position points connected to the first type of position point is The edges are cut, and each position point in the second graph structure has a connection relationship with at least one position point.
  • the processing unit 820 is further configured to: obtain an adjacency distance matrix based on the connection relationship between each location point in the second graph structure; wherein each element in the adjacency distance matrix elements are used to represent the number of edges between a second type position point and a first type position point in the second graph structure; normalize the adjacency distance matrix to obtain a weight matrix ; Multiply the weight matrix and the first matrix to obtain a second matrix; wherein the elements in the first matrix are used to characterize the attributes of each first type position point in the second graph structure.
  • the elements in the second matrix are used to characterize the first attributes included in the attributes of each second type position point in the second graph structure.
  • the first attribute is the coordinates of the first location point or an indoor or outdoor location identifier
  • the indoor or outdoor location identifier is used to indicate that the first location point is an indoor location point or an outdoor location. location point.
  • the processing unit 820 is further configured to perform location positioning, navigation, big data statistical analysis, or service recommendation based on the coordinates of each location point in the plurality of location points.
  • the quantity distribution characteristics include at least one of normal distribution, Poisson distribution, discrete distribution or interval distribution, and the interval distribution is used to describe the number of characteristic signals located in different intervals.
  • the characteristic signal includes one or more of GNSS signals, position coordinates, radio frequency signals, optical signals, acoustic signals, sensor signals or geomagnetic signals, wherein, Radio frequency signals include one or more of WIFI signals, Bluetooth signals, cell CELL signals or ultra-wideband UWB signals.
  • each unit in the trajectory data characterization device can be referred to the specific steps in the method embodiment of FIG. 3, and will not be described again here.
  • FIG. 9 is a schematic structural diagram of a computer device provided by the present application. It may be a server or user device in the cloud 110 in the embodiment of FIG. 1 .
  • the computer device includes a memory 901, one or more (only one is shown in the figure) processors 902, an interface circuit 903 and a bus 904. Among them, the memory 901, the processor 902, and the interface circuit 903 implement communication connections between each other through the bus 904.
  • the processor 902 is configured to obtain multiple pieces of trajectory data through the interface circuit 903.
  • Each piece of trajectory data in the multiple pieces of trajectory data includes multiple time periods, and the characteristic signals collected in each of the multiple time periods. .
  • the multiple pieces of trajectory data correspond to one or more time periods, the combination identifier of each time period in the one or more time periods is the same, and the combination identifier of each time period is determined by the number of times in each time period.
  • the identification ID of at least one characteristic signal collected is generated, and the first location point is characterized by the combined identification; the characteristic signal collected in one or more time periods includes at least one type of characteristic signal, and the first location point is characterized by the combined identification.
  • the attributes of the location points include the number of each type of characteristic signals in the at least one type of characteristic signals and/or the quantity distribution characteristics of each type of characteristic signals.
  • the memory 901 may be used to store the above-mentioned plurality of location points and attributes of each of the plurality of location points.
  • the memory 901 can be any one of random access memory (Random Access Memory, RAM), read-only memory (Read-Only Memory, ROM) or flash memory (Flash Memory); wherein, RAM includes static random access memory (Static random access memory). RAM, SRAM) and dynamic random access memory (Dynamic RAM, DRAM), etc.
  • ROM includes erasable programmable ROM (Erasable Programmable ROM, EPROM) and electrically erasable programmable read-only memory (Electrically Erasable Programmable ROM, EEPROM), etc.
  • the memory 901 can store programs. When the program stored in the memory 901 is executed by the processor 902, the processor 902 and the interface circuit 903 are used to execute various steps of the trajectory data characterization method in the embodiment of the present application.
  • the processor 902 may be a general central processing unit (CPU), a microprocessor, an application specific integrated circuit (ASIC), a graphics processing unit (GPU), or one or more Integrated circuits, etc., are used to execute relevant programs to realize the functions required to be performed by each unit in the trajectory data characterization device according to the embodiment of the present application, or to execute the trajectory data characterization method according to the method embodiment of the present application.
  • CPU central processing unit
  • ASIC application specific integrated circuit
  • GPU graphics processing unit
  • Integrated circuits etc.
  • the interface circuit 903 uses a transceiver device such as, but not limited to, a transceiver to implement communication between the computer device and other devices or communication networks.
  • a transceiver device such as, but not limited to, a transceiver to implement communication between the computer device and other devices or communication networks.
  • trajectory data can be obtained from a smart terminal (user equipment) through the interface circuit 903.
  • Bus 904 may include a path for transmitting information between various components (eg, memory 901, processor 902, interface circuit 903) on the computer device shown in Figure 9.
  • various components eg, memory 901, processor 902, interface circuit 903
  • the embodiment of the present application provides a chip system.
  • the chip system includes at least one processor, a memory and an interface circuit.
  • the memory, the interface circuit and the at least one processor are interconnected through lines.
  • the at least one memory Instructions are stored in; when the instructions are executed by the processor, some or all of the steps described in any of the above method embodiments can be realized.
  • Embodiments of the present application provide a computer storage medium.
  • the computer storage medium stores a computer program.
  • the computer program is executed, some or all of the steps described in any of the above method embodiments can be realized.
  • Embodiments of the present application provide a computer program product.
  • the computer program product includes program instructions.
  • the program instructions When the program instructions are run on a computer, some or all of the steps described in any of the above method embodiments can be implemented.
  • the disclosed device can be implemented in other ways.
  • the device embodiments described above are only illustrative.
  • the division of the above units is only a logical function division. In actual implementation, there may be other division methods.
  • multiple units or components may be combined or integrated. to another system, or some features can be ignored, or not implemented.
  • the coupling or direct coupling or communication connection between each other shown or discussed may be through some interfaces, and the indirect coupling or communication connection of the devices or units may be in electrical or other forms.
  • the units described above as separate components may or may not be physically separated.
  • the components shown as units may or may not be physical units, that is, they may be located in one place, or they may be distributed to multiple network units. Some or all of the units can be selected according to actual needs to achieve the purpose of the solution of this embodiment.

Landscapes

  • Radar Systems Or Details Thereof (AREA)

Abstract

A trajectory data characterization method and apparatus. The method comprises: acquiring a plurality of pieces of trajectory data, wherein each of the plurality of pieces of trajectory data includes a plurality of time periods, and characteristic signals, which are acquired in all of the plurality of time periods (S310); and generating a plurality of location points and an attribute of each of the plurality of location points on the basis of the plurality of pieces of trajectory data, wherein the plurality of location points include a first location point, the first location point corresponds to one or more time periods in the plurality of pieces of trajectory data, combination identifiers (ID) of all of the one or more time periods are the same, the combination ID of each time period is generated from the ID of at least one characteristic signal corresponding to the time period, and the attribute of the first location point includes the number of characteristic signals of each class and/or a number distribution characteristic of characteristic signals of each class in the one or more time periods (S320). Storage space for trajectory data can be effectively saved on; moreover, the trajectory data can be compatible with the format of outdoor data, such that integration of indoor data and the outdoor data is achieved.

Description

轨迹数据的表征方法和装置Trajectory data representation methods and devices 技术领域Technical field
本申请涉及室内外定位技术领域,尤其涉及一种轨迹数据的表征方法和装置。The present application relates to the field of indoor and outdoor positioning technologies, and in particular, to a method and device for representing trajectory data.
背景技术Background technique
移动设备的普及与移动通信技术的发展为实现定位导航提供了不同的可能。以定位为例,移动设备中集成的全球导航卫星系统(Global Navigation Satellite System,GNSS)芯片可以用于为用户提供精准的室外定位服务。基于卫星的室外定位导航可以满足用户的许多室外出行需求,但由于建筑物等遮挡,基于卫星信号的定位技术无法在室内提供可用的定位服务。The popularity of mobile devices and the development of mobile communication technology provide different possibilities for realizing positioning and navigation. Taking positioning as an example, the Global Navigation Satellite System (GNSS) chip integrated in mobile devices can be used to provide users with accurate outdoor positioning services. Satellite-based outdoor positioning and navigation can meet many outdoor travel needs of users, but due to obstructions such as buildings, satellite signal-based positioning technology cannot provide usable positioning services indoors.
现有的室内定位技术从数据来源和定位方式可以分为三种:1)重部署室内定位:通过在室内环境中部署特定的定位基站及通讯服务器实现室内定位;2)轻部署室内定位:通过人工采集的方式构建室内多楼层指纹数据库,通过发起定位请求,匹配当前扫描的指纹和指纹数据库实现定位;3)零部署室内定位:通过众包的方式收集用户在特定触发条件下所收集的传感器、WIFI、全球定位系统(Global Positioning System,GPS)等数据,并上传到云端,云端通过算法处理,获得室内定位指纹库,用于后续室内定位。其中,零部署室内定位由于成本低、时效性高、自动化程度高等诸多优势,逐步成为主流的室内定位方案。Existing indoor positioning technology can be divided into three types based on data sources and positioning methods: 1) redeployment indoor positioning: indoor positioning is achieved by deploying specific positioning base stations and communication servers in the indoor environment; 2) light deployment indoor positioning: through An indoor multi-floor fingerprint database is constructed through manual collection, and positioning is achieved by initiating a positioning request and matching the currently scanned fingerprint with the fingerprint database; 3) Zero-deployment indoor positioning: Collect sensors collected by users under specific trigger conditions through crowdsourcing. , WIFI, Global Positioning System (GPS) and other data are uploaded to the cloud. The cloud processes the algorithm to obtain an indoor positioning fingerprint database for subsequent indoor positioning. Among them, zero-deployment indoor positioning has gradually become a mainstream indoor positioning solution due to its many advantages such as low cost, high timeliness, and high degree of automation.
然而,在上述零部署定位方案中,由于室内轨迹数据大部分不包含GNSS等位置信息,只能逐条存储,因而在数据量较大时会面临无法存储的问题;同时,还存在和室外数据无法兼容的问题。However, in the above zero-deployment positioning solution, since most of the indoor trajectory data does not contain location information such as GNSS and can only be stored one by one, it will face the problem of being unable to store when the amount of data is large; at the same time, there is also the inability to store outdoor data. Compatibility issues.
发明内容Contents of the invention
本申请实施例提供了一种轨迹数据的表征方法和装置,可以有效节省轨迹数据的存储空间;同时能与室外数据的格式兼容,实现室内外数据一体化,进而提升轨迹数据在室内外定位领域的实用性。The embodiments of the present application provide a method and device for representing trajectory data, which can effectively save the storage space of trajectory data; at the same time, it can be compatible with the format of outdoor data, realize the integration of indoor and outdoor data, and thereby improve the use of trajectory data in the field of indoor and outdoor positioning. of practicality.
第一方面,本申请提供了一种轨迹数据的表征方法,所述方法包括:获取多条轨迹数据,所述多条轨迹数据中的每条轨迹数据包含多个时间段,以及所述多个时间段中每个时间段内采集的特征信号;基于所述多条轨迹数据生成多个位置点以及所述多个位置点中每个位置点的属性;其中,所述多个位置点中包含第一位置点,所述第一位置点在所述多条轨迹数据中对应一个或多个时间段,所述一个或多个时间段中每个时间段的组合标识相同,所述每个时间段的组合标识由所述每个时间段对应的至少一个特征信号的标识ID生成,所述第一位置点由所述组合标识进行表征;所述一个或多个时间段内采集的特征信号包含至少一类特征信号,所述第一位置点的属性包含所述至少一类特征信号中每类特征信号的数量和/或每类特征信号的数量分布特征。In a first aspect, this application provides a method for characterizing trajectory data. The method includes: acquiring multiple pieces of trajectory data, each piece of trajectory data in the multiple pieces of trajectory data including multiple time periods, and the multiple pieces of trajectory data. Characteristic signals collected in each time period in the time period; generating multiple location points and attributes of each of the multiple location points based on the multiple trajectory data; wherein the multiple location points include A first position point, the first position point corresponds to one or more time periods in the plurality of trajectory data, the combination identifier of each time period in the one or more time periods is the same, and each time period The combined identification of the segment is generated by the identification ID of at least one characteristic signal corresponding to each time period, and the first position point is characterized by the combined identification; the characteristic signals collected in one or more time periods include At least one type of characteristic signal, the attribute of the first position point includes the quantity of each type of characteristic signal in the at least one type of characteristic signal and/or the quantity distribution characteristics of each type of characteristic signal.
其中,所述第一位置点即为所述一个或多个时间段内特征信号的采集位置。Wherein, the first position point is the collection position of the characteristic signal in the one or more time periods.
其中,每个时间段对应的至少一个特征信号包括该时间段内采集的特征信号。Wherein, at least one characteristic signal corresponding to each time period includes the characteristic signal collected during the time period.
从技术效果上看,由于不同位置点处特征信号组合标识的唯一性,对于轨迹数据中组合标识相同的一个或多个时间段而言,可知该一个或多个时间段内的特征信号数据是在同一位置点进行采集的。进而,可以将该一个或多个时间段中的特征信号数据共同统计和表征,得到该一个或多个时间段内每类特征信号的数量和数量分布特征等数据,以作为位置点的属性。 即本申请通过使用位置点和位置点的属性来表征多条轨迹数据,以取代现有技术中直接存储原始每条轨迹数据的方式,达到节省轨迹数据存储空间的技术效果。进一步地,在轨迹数据达到一定数量时,对于新采集的轨迹数据进行存储几乎不会增加存储开销,使得大数据时代的海量轨迹数据的存储成为可能,进而可以有效提升轨迹数据对于室内外定位的价值。From a technical effect point of view, due to the uniqueness of the characteristic signal combination identification at different location points, for one or more time periods with the same combination identification in the trajectory data, it can be known that the characteristic signal data in the one or more time periods is Collected at the same location. Furthermore, the characteristic signal data in the one or more time periods can be jointly counted and characterized to obtain data such as the quantity and quantity distribution characteristics of each type of characteristic signal in the one or more time periods as attributes of the location point. That is, this application uses position points and attributes of the position points to characterize multiple trajectory data to replace the existing method of directly storing the original trajectory data, thereby achieving the technical effect of saving trajectory data storage space. Furthermore, when the trajectory data reaches a certain amount, storing the newly collected trajectory data will hardly increase the storage overhead, making it possible to store massive trajectory data in the big data era, which can effectively improve the effectiveness of trajectory data for indoor and outdoor positioning. value.
在一种可行的实施方式中,所述一个或多个时间段中包含第一时间段,所述第一时间段内包含M个特征信号,所述M个特征信号中包含C类特征信号,M和C为正整数;所述组合标识由所述M个特征信号中的N个特征信号的ID生成,所述N个特征信号中包含D类特征信号,N为小于或等于M的正整数,D为小于或等于C的正整数。In a feasible implementation, the one or more time periods include a first time period, the first time period includes M characteristic signals, and the M characteristic signals include Class C characteristic signals, M and C are positive integers; the combination identification is generated by the ID of N characteristic signals among the M characteristic signals, the N characteristic signals include D-type characteristic signals, and N is a positive integer less than or equal to M. , D is a positive integer less than or equal to C.
从技术效果上看,由于不同位置点处采集到的特征信号以及对应的信号强度是不同的,因而可以利用每个时间段内采集特征信号中的至少一个来生成每个时间段的组合标识,以标识出每个时间段对应的采集位置,进而实现后续对于同一位置点所采集数据的统一表征和存储,从而替代现有技术中对轨迹数据逐条存储的方式,节省轨迹数据的存储空间。From a technical effect point of view, since the characteristic signals collected at different locations and the corresponding signal strengths are different, at least one of the characteristic signals collected in each time period can be used to generate a combined identification of each time period. The collection location corresponding to each time period is identified, thereby achieving unified representation and storage of subsequent data collected at the same location point, thus replacing the method of storing trajectory data one by one in the existing technology and saving storage space for trajectory data.
在一种可行的实施方式中,所述C类特征信号中包括第一类特征信号,所述M个特征信号中包含A个所述第一类特征信号,所述N个特征信号包含B个第一类特征信号,A个所述第一类特征信号中包含所述B个第一类特征信号,所述B个第一类特征信号的信号强度大于A个所述第一类特征信号中其它特征信号的信号强度;其中,B为小于或等于A的正整数。In a feasible implementation, the C-type characteristic signals include first-type characteristic signals, the M characteristic signals include A of the first-type characteristic signals, and the N characteristic signals include B The first type of characteristic signals, A of the first type of characteristic signals include the B of the first type of characteristic signals, the signal strength of the B of the first type of characteristic signals is greater than that of the A of the first type of characteristic signals The signal strength of other characteristic signals; where B is a positive integer less than or equal to A.
从技术效果上看,由于不同位置点采集的特征信号的信号强度不同,可以利用特征信号强度来筛选出每类特征信号中用于生成组合标识的特征信号,通过此种方式使得在同一位置点采集到的特征信号(即一个或多个时间段内的特征信号数据是在同一位置点采集到的)生成的组合标识唯一,不同位置点采集到的特征信号生成的组合标识不同;进而,利用组合标识的异同将组合标识相同的一个或多个时间段的特征信号数据统一表征和存储,从而替代现有技术中对轨迹数据逐条存储的方式,节省轨迹数据的存储空间。From a technical effect point of view, since the signal strength of the characteristic signals collected at different locations is different, the characteristic signal intensity can be used to filter out the characteristic signals used to generate the combined identification in each type of characteristic signal. In this way, the characteristic signals collected at the same location can be The combined identification generated by the collected characteristic signals (that is, the characteristic signal data in one or more time periods is collected at the same location point) is unique, and the combination identification generated by the characteristic signals collected at different location points is different; further, using The similarities and differences of the combined identifiers will uniformly represent and store the characteristic signal data of one or more time periods with the same combined identifier, thus replacing the way of storing trajectory data one by one in the existing technology and saving the storage space of trajectory data.
在一种可行的实施方式中,所述第一位置点的属性包含室内外位置标识,所述室内外位置标识用于指示所述第一位置点为室内位置点或室外位置点;当所述第一位置点为室内位置点时,所述一个或多个时间段内特征信号的采集位置位于室内,当所述第一位置点为室外位置点时,所述一个或多个时间段内特征信号的采集位置位于室外。In a feasible implementation, the attributes of the first location point include indoor and outdoor location identifiers, and the indoor and outdoor location identifiers are used to indicate that the first location point is an indoor location point or an outdoor location point; when the When the first position point is an indoor position point, the collection position of the characteristic signal in the one or more time periods is located indoors. When the first position point is an outdoor position point, the collection position of the characteristic signal in the one or more time periods is The signal collection location is located outdoors.
从技术效果上看,由于大部分室外位置点所指示位置的坐标已包含在轨迹数据中,通过室内外位置标识来区分不同位置点的位置,使得后续可以利用室外位置点的坐标来计算相关室内位置点的经纬度,从而使得室内位置点也可以基于坐标进行表征和存储,并实现室内外位置点数据的兼容。From a technical effect point of view, since the coordinates of the positions indicated by most outdoor position points are already included in the trajectory data, the positions of different position points are distinguished through indoor and outdoor position markers, so that the coordinates of the outdoor position points can be used to calculate the relevant indoor The longitude and latitude of the location point, so that indoor location points can also be represented and stored based on coordinates, and the compatibility of indoor and outdoor location point data can be achieved.
在一种可行的实施方式中,所述多个位置点中还包含第二位置点,表征所述第二位置点的组合标识由E个特征信号的ID生成;当所述第一位置点和所述第二位置点满足第一条件、第二条件或第三条件中的一个或多个时,所述第一位置点与所述第二位置点相邻;其中,所述第一条件为:所述E个特征信号和所述N个特征信号中包含共同的G个特征信号,G和E的比值、G和N的比值都大于或等于第一阈值;且所述E个特征信号中的第一信号的平均信号强度与所述N个特征信号中第一信号的平均信号强度的差值小于或等于第二阈值,所述第一信号为所述G个特征信号中的任意一个;所述第二条件为:所述第一位置点在所述多条轨迹数据中对应H个时间段,第二位置点在所述多条轨迹数据中对应I个时间段,所述H个时间段中包含J个时间段,J与H的比值大于或等于第三阈值,对于所述J个时间段中的第二时间段,所述I个时间段内包含第三时间段,所述第二时间段和所述第三时间段位于同一轨迹数据上,且所述第二时间段和所述第三时间段的间隔小于或等于第四阈值,所述第二时间段 为所述J个时间段内的任一时间段;所述第三条件为:所述第一位置点与所述第二位置点之间的距离小于或等于第五阈值,所述第一位置点和所述第二位置点之间的距离通过所述第一位置点的坐标和所述第二位置点的坐标计算得到,所述第一位置点的坐标通过对所述一个或多个时间段内采集到的位置坐标进行求和并取均值得到。In a feasible implementation, the plurality of position points also includes a second position point, and the combined identifier characterizing the second position point is generated by the ID of E characteristic signals; when the first position point and When the second position point satisfies one or more of the first condition, the second condition or the third condition, the first position point is adjacent to the second position point; wherein the first condition is : The E characteristic signals and the N characteristic signals include common G characteristic signals, the ratio of G and E, and the ratio of G and N are both greater than or equal to the first threshold; and among the E characteristic signals The difference between the average signal strength of the first signal and the average signal strength of the first signal among the N characteristic signals is less than or equal to the second threshold, and the first signal is any one of the G characteristic signals; The second condition is: the first location point corresponds to H time periods in the multiple trajectory data, the second location point corresponds to 1 time period in the multiple trajectory data, and the H time periods The segment contains J time segments, and the ratio of J to H is greater than or equal to the third threshold. For the second time segment among the J time segments, the I time segment includes the third time segment, and the third time segment The second time period and the third time period are located on the same trajectory data, and the interval between the second time period and the third time period is less than or equal to the fourth threshold, and the second time period is the J Any time period within the time period; the third condition is: the distance between the first position point and the second position point is less than or equal to the fifth threshold, the first position point and the third position point are The distance between the two position points is calculated by the coordinates of the first position point and the coordinates of the second position point. The coordinates of the first position point are calculated by comparing the coordinates collected in the one or more time periods. The position coordinates are summed and averaged.
从技术效果上看,对于上述多个位置点中的任意两个位置点而言,可以基于空间关系(即第一条件和第三条件)和时序关系(即第二条件)来确定该任意两个位置点所指示的位置是否相邻,进而后续可以利用此种相邻关系,基于相邻位置点间同一属性的近似性,通过部分位置点的属性中包含的第一属性)来计算与此部分位置点具有一定相邻关系的其它位置点(其属性中不包含第一属性)的第一属性。其中,表示空间关系的第一条件具体为:对于任意两个位置点,当生成其组合标识的特征信号的信号强度接近时,即可表明该两个位置点相邻或接近;时序关系具体为:对于一个位置点对应的多个时间段,针对其中的任一时间段而言,在另一个位置点对应的多个时间段中,都存在一个采集时间与之接近的时间段,且这两个时间段位于同一条轨迹数据上,通过室内人物或物体移动规律可知,这两个时间段所对应的采集位置相邻。From a technical effect point of view, for any two position points among the above multiple position points, the any two position points can be determined based on the spatial relationship (i.e., the first condition and the third condition) and the temporal relationship (i.e., the second condition). Whether the positions indicated by the position points are adjacent, and then this adjacent relationship can be used in the future, based on the approximation of the same attribute between adjacent position points, through the first attribute included in the attributes of some position points) to calculate this Some position points have the first attribute of other position points (whose attributes do not include the first attribute) with a certain adjacent relationship. Among them, the first condition indicating the spatial relationship is specifically: for any two position points, when the signal intensity of the characteristic signal that generates their combined identification is close, it can be indicated that the two position points are adjacent or close; the temporal relationship is specifically: : For any one of the multiple time periods corresponding to a location point, among the multiple time periods corresponding to another location point, there is a time period with a collection time close to it, and these two The two time periods are located on the same trajectory data. It can be seen from the movement patterns of indoor people or objects that the collection positions corresponding to these two time periods are adjacent.
在一种可行的实施方式中,所述第一位置点的属性中包含第一属性,所述方法还包括:基于所述多个位置点之间的相邻关系生成第一图结构;其中,所述第一图结构用于描述所述多个位置点之间的连接关系,且当所述第一位置点与所述第二位置点相邻时,所述第一位置点和所述第二位置点在所述第一图结构中通过一条边相连,所述第一图结构中包含第一类位置点和第二类位置点,所述第一类位置点中的每个位置点的属性中包含所述第一属性,所述第二类位置点中的每个位置点的属性中不包含所述第一属性;对所述第一图结构进行切割,得到第二图结构;其中,所述第一图结构中任意两个所述第一类位置点之间的边被切割,与所述第一类位置点相连的任意两个所述第二类位置点之间的边被切割,所述第二图结构中的每个位置点与至少一个位置点具有连接关系。In a feasible implementation, the attributes of the first position point include a first attribute, and the method further includes: generating a first graph structure based on the adjacent relationship between the plurality of position points; wherein, The first graph structure is used to describe the connection relationship between the plurality of position points, and when the first position point is adjacent to the second position point, the first position point and the third position point Two position points are connected by an edge in the first graph structure. The first graph structure includes a first type of position point and a second type of position point. The position points of each position point in the first type of position point are The attributes include the first attribute, and the attributes of each position point in the second type of position points do not include the first attribute; the first graph structure is cut to obtain a second graph structure; where , the edge between any two first-type position points in the first graph structure is cut, and the edge between any two second-type position points connected to the first-type position point is cut Cutting, each position point in the second graph structure has a connection relationship with at least one position point.
其中,第一属性为第一位置点的属性中的任意一个。Wherein, the first attribute is any one of the attributes of the first position point.
从技术效果上看,通过第一图结构来表征每个位置点的相邻关系,然后生成由第一类位置点和与第一类位置点相邻的第二类位置点构成的第二图结构,并利用第二图结构来计算其中第二类位置点的属性中所缺失或不准确的第一属性,以去除较远的第一类位置点对第二类位置点属性中需要包含的第一属性的计算带来的影响。From a technical effect point of view, the adjacent relationship of each position point is represented through the first graph structure, and then a second graph composed of the first type of position point and the second type of position point adjacent to the first type of position point is generated. structure, and use the second graph structure to calculate the missing or inaccurate first attributes among the attributes of the second type of location points, so as to remove the farther first type of location points that need to be included in the attributes of the second type of location points. The impact of the calculation of the first attribute.
在一种可行的实施方式中,所述方法还包括:基于所述第二图结构中各位置点之间的连接关系,得到邻接距离矩阵;其中,所述邻接距离矩阵中的每个元素用于表征所述第二图结构中一个所述第二类位置点到一个所述第一类位置点之间边的数量;对所述邻接距离矩阵进行归一化处理,得到权重矩阵;将所述权重矩阵和第一矩阵相乘,得到第二矩阵;其中,所述第一矩阵中的元素用于表征所述第二图结构中每个所述第一类位置点的属性中包含的第一属性,所述第二矩阵中的元素用于表征所述第二图结构中每个所述第二类位置点的属性中包含的第一属性。In a feasible implementation, the method further includes: obtaining an adjacency distance matrix based on the connection relationship between each position point in the second graph structure; wherein each element in the adjacency distance matrix is represented by In order to characterize the number of edges between a second type position point and a first type position point in the second graph structure; normalize the adjacency distance matrix to obtain a weight matrix; Multiply the weight matrix and the first matrix to obtain a second matrix; wherein, the elements in the first matrix are used to characterize the attributes of each first type position point in the second graph structure. An attribute, the elements in the second matrix are used to characterize the first attribute included in the attributes of each second type position point in the second graph structure.
从技术效果上看,通过边的数量来表征第二类位置点到第一类位置点的距离,然后进行归一化处理,得到第二图结构中所有第二类位置点的权重矩阵,再与第一矩阵相乘,从而准确得到第二类位置点中每个位置点所需要的包含的第一属性。From a technical effect point of view, the distance from the second type position point to the first type position point is represented by the number of edges, and then normalized to obtain the weight matrix of all second type position points in the second graph structure, and then Multiply with the first matrix to accurately obtain the first attribute required by each position point in the second type of position point.
在一种可行的实施方式中,所述第一属性为所述第一位置点的坐标或室内外位置标识,所述室内外位置标识用于指示所述第一位置点为室内位置点或室外位置点。In a feasible implementation, the first attribute is the coordinates of the first location point or an indoor or outdoor location identifier, and the indoor or outdoor location identifier is used to indicate that the first location point is an indoor location point or an outdoor location. location point.
从技术效果上看,通过将坐标进行补全,并存入位置点的属性中,可以使得后续利用坐 标来替代每个室内位置点的组合标识进行索引,实现与室外数据的格式兼容和统一。此外,通过此种方式进行属性补全,可以较好地弥补数据采集过程中的瑕疵和误差,使得每个位置点都包含完整的属性,有效提升方案的实用性。From a technical effect point of view, by completing the coordinates and storing them in the attributes of the location points, the coordinates can be used to replace the combined identification of each indoor location point for indexing, achieving format compatibility and unification with outdoor data. In addition, attribute completion in this way can better compensate for flaws and errors in the data collection process, so that each location point contains complete attributes, effectively improving the practicality of the solution.
在一种可行的实施方式中,所述方法还包括:基于所述多个位置点中每个位置点的坐标进行位置定位、导航、大数据统计分析或服务推荐。In a feasible implementation, the method further includes: performing location positioning, navigation, big data statistical analysis or service recommendation based on the coordinates of each location point in the plurality of location points.
从技术效果上看,在计算出每个位置点的坐标后,便可基于位置点的坐标来为各种上层的应用场景,应用前景广阔。From a technical effect point of view, after calculating the coordinates of each location point, various upper-layer application scenarios can be based on the coordinates of the location points, and the application prospects are broad.
在一种可行的实施方式中,所述第一位置点的属性还包含L个第一类特征信号,所述L个第一类特征信号符合所述第一类特征信号的数量分布特征。In a feasible implementation, the attributes of the first location point also include L first-type characteristic signals, and the L first-type characteristic signals conform to the quantity distribution characteristics of the first-type characteristic signals.
从技术效果上看,还可以在位置点的属性中存储一些符合分布规律的原始数据,实现在丢弃大部分原始轨迹数据时还可以较好保存原始数据的真实特性。From a technical perspective, some original data that conforms to the distribution rules can also be stored in the attributes of the location points, so that the true characteristics of the original data can be better preserved while discarding most of the original trajectory data.
在一种可行的实施方式中,所述特征信号包含全球导航卫星系统GNSS信号、位置坐标、射频信号、光学信号、声学信号、传感器信号或地磁信号中的一种或多种,其中,所述射频信号包括WIFI信号、蓝牙信号、小区CELL信号或超宽频UWB信号中的一种或多种。In a feasible implementation, the characteristic signal includes one or more of GNSS signals, position coordinates, radio frequency signals, optical signals, acoustic signals, sensor signals or geomagnetic signals, wherein, Radio frequency signals include one or more of WIFI signals, Bluetooth signals, cell CELL signals or ultra-wideband UWB signals.
在一种可行的实施方式中,所述数量分布特征包含正态分布、泊松分布、离散分布或区间分布中的至少一种,所述区间分布用于描述位于不同区间内特征信号的数量。In a feasible implementation, the quantity distribution characteristics include at least one of normal distribution, Poisson distribution, discrete distribution or interval distribution, and the interval distribution is used to describe the number of characteristic signals located in different intervals.
从技术效果上看,可以区间分布和/或正态分布可以极大节省轨迹数据的存储空间,通过保存分布特定(例如正态分布的均值和方差)和部分符合分布特定的原始数据,可实现在节省存储空间的条件下极大地保留原始数据的特性。From a technical effect point of view, interval distribution and/or normal distribution can greatly save the storage space of trajectory data. By saving distribution-specific (such as the mean and variance of normal distribution) and partially consistent with distribution-specific original data, it can be achieved Greatly retain the characteristics of the original data while saving storage space.
第二方面,本申请提供了一种轨迹数据的表征装置,所述装置包括:获取单元,用于获取多条轨迹数据,所述多条轨迹数据中的每条轨迹数据包含多个时间段,以及所述多个时间段中每个时间段内采集的特征信号;处理单元,用于基于所述多条轨迹数据生成多个位置点以及所述多个位置点中每个位置点的属性;其中,所述多个位置点中包含第一位置点,所述第一位置点在所述多条轨迹数据中对应一个或多个时间段,所述一个或多个时间段中每个时间段的组合标识相同,所述每个时间段的组合标识由所述每个时间段对应的至少一个特征信号的标识ID生成,所述第一位置点由所述组合标识进行表征;所述一个或多个时间段内采集的特征信号包含至少一类特征信号,所述第一位置点的属性包含所述至少一类特征信号中每类特征信号的数量和/或每类特征信号的数量分布特征。In a second aspect, the present application provides a device for characterizing trajectory data. The device includes: an acquisition unit configured to acquire multiple pieces of trajectory data. Each piece of trajectory data in the multiple pieces of trajectory data includes multiple time periods, and the characteristic signals collected in each of the plurality of time periods; a processing unit configured to generate a plurality of position points and attributes of each of the plurality of position points based on the plurality of trajectory data; Wherein, the plurality of position points include a first position point, the first position point corresponds to one or more time periods in the plurality of trajectory data, and each time period in the one or more time periods The combination identification is the same, the combination identification of each time period is generated by the identification ID of at least one characteristic signal corresponding to each time period, the first position point is characterized by the combination identification; the one or The characteristic signals collected in multiple time periods include at least one type of characteristic signal, and the attributes of the first location point include the number of each type of characteristic signal in the at least one type of characteristic signal and/or the quantity distribution characteristics of each type of characteristic signal. .
在一种可行的实施方式中,所述一个或多个时间段中包含第一时间段,所述第一时间段内包含M个特征信号,所述M个特征信号中包含C类特征信号,M和C为正整数;所述组合标识由所述M个特征信号中的N个特征信号的ID生成,所述N个特征信号中包含D类特征信号,N为小于或等于M的正整数,D为小于或等于C的正整数。In a feasible implementation, the one or more time periods include a first time period, the first time period includes M characteristic signals, and the M characteristic signals include Class C characteristic signals, M and C are positive integers; the combination identification is generated by the ID of N characteristic signals among the M characteristic signals, the N characteristic signals include D-type characteristic signals, and N is a positive integer less than or equal to M. , D is a positive integer less than or equal to C.
在一种可行的实施方式中,所述C类特征信号中包括第一类特征信号,所述M个特征信号中包含A个所述第一类特征信号,所述N个特征信号包含B个第一类特征信号,A个所述第一类特征信号中包含所述B个第一类特征信号,所述B个第一类特征信号的信号强度大于A个所述第一类特征信号中其它特征信号的信号强度;其中,B为小于或等于A的正整数。In a feasible implementation, the C-type characteristic signals include first-type characteristic signals, the M characteristic signals include A of the first-type characteristic signals, and the N characteristic signals include B The first type of characteristic signals, A of the first type of characteristic signals include the B of the first type of characteristic signals, the signal strength of the B of the first type of characteristic signals is greater than that of the A of the first type of characteristic signals The signal strength of other characteristic signals; where B is a positive integer less than or equal to A.
在一种可行的实施方式中,所述第一位置点的属性还包含L个第一类特征信号,所述L个第一类特征信号符合所述第一类特征信号的数量分布特征。In a feasible implementation, the attributes of the first location point also include L first-type characteristic signals, and the L first-type characteristic signals conform to the quantity distribution characteristics of the first-type characteristic signals.
在一种可行的实施方式中,所述多个位置点中还包含第二位置点,表征所述第二位置点的组合标识由E个特征信号的ID生成;当所述第一位置点和所述第二位置点满足第一条件、 第二条件或第三条件中的一个或多个时,所述第一位置点与所述第二位置点相邻;其中,所述第一条件为:所述E个特征信号和所述N个特征信号中包含共同的G个特征信号,G和E的比值、G和N的比值都大于或等于第一阈值;且所述E个特征信号中的第一信号的平均信号强度与所述N个特征信号中第一信号的平均信号强度的差值小于或等于第二阈值,所述第一信号为所述G个特征信号中的任意一个;所述第二条件为:所述第一位置点在所述多条轨迹数据中对应H个时间段,第二位置点在所述多条轨迹数据中对应I个时间段,所述H个时间段中包含J个时间段,J与H的比值大于或等于第三阈值,对于所述J个时间段中的第二时间段,所述I个时间段内包含第三时间段,所述第二时间段和所述第三时间段位于同一轨迹数据上,且所述第二时间段和所述第三时间段的间隔小于或等于第四阈值,所述第二时间段为所述J个时间段内的任一时间段;所述第三条件为:所述第一位置点与所述第二位置点之间的距离小于或等于第五阈值,所述第一位置点和所述第二位置点之间的距离通过所述第一位置点的坐标和所述第二位置点的坐标计算得到,所述第一位置点的坐标通过对所述一个或多个时间段内采集到的位置坐标进行求和并取均值得到。In a feasible implementation, the plurality of position points also includes a second position point, and the combined identifier characterizing the second position point is generated by the ID of E characteristic signals; when the first position point and When the second position point satisfies one or more of the first condition, the second condition or the third condition, the first position point is adjacent to the second position point; wherein the first condition is : The E characteristic signals and the N characteristic signals include common G characteristic signals, the ratio of G and E, and the ratio of G and N are both greater than or equal to the first threshold; and among the E characteristic signals The difference between the average signal strength of the first signal and the average signal strength of the first signal among the N characteristic signals is less than or equal to the second threshold, and the first signal is any one of the G characteristic signals; The second condition is: the first location point corresponds to H time periods in the multiple trajectory data, the second location point corresponds to 1 time period in the multiple trajectory data, and the H time periods The segment contains J time segments, and the ratio of J to H is greater than or equal to the third threshold. For the second time segment among the J time segments, the I time segment includes the third time segment, and the third time segment The second time period and the third time period are located on the same trajectory data, and the interval between the second time period and the third time period is less than or equal to the fourth threshold, and the second time period is the J Any time period within the time period; the third condition is: the distance between the first position point and the second position point is less than or equal to the fifth threshold, the first position point and the third position point are The distance between the two position points is calculated by the coordinates of the first position point and the coordinates of the second position point. The coordinates of the first position point are calculated by comparing the coordinates collected in the one or more time periods. The position coordinates are summed and averaged.
在一种可行的实施方式中,所述第一位置点的属性中包含第一属性,所述处理单元还用于:基于所述多个位置点之间的相邻关系生成第一图结构;其中,所述第一图结构用于描述所述多个位置点之间的连接关系,且当所述第一位置点与所述第二位置点相邻时,所述第一位置点和所述第二位置点在所述第一图结构中通过一条边相连,所述第一图结构中包含第一类位置点和第二类位置点,所述第一类位置点中的每个位置点的属性中包含所述第一属性,所述第二类位置点中的每个位置点的属性中不包含所述第一属性;对所述第一图结构进行切割,得到第二图结构;其中,所述第一图结构中任意两个所述第一类位置点之间的边被切割,与所述第一类位置点相连的任意两个所述第二类位置点之间的边被切割,所述第二图结构中的每个位置点与至少一个位置点具有连接关系。In a feasible implementation, the attributes of the first location point include a first attribute, and the processing unit is further configured to: generate a first graph structure based on the adjacent relationship between the multiple location points; Wherein, the first graph structure is used to describe the connection relationship between the plurality of position points, and when the first position point is adjacent to the second position point, the first position point and the The second position points are connected by an edge in the first graph structure. The first graph structure includes first type position points and second type position points. Each position in the first type of position point The attributes of the points include the first attribute, and the attributes of each position point in the second type of position points do not include the first attribute; cut the first graph structure to obtain the second graph structure ; Wherein, the edge between any two of the first type position points in the first graph structure is cut, and the edge between any two of the second type of position points connected to the first type of position point The edges are cut, and each position point in the second graph structure has a connection relationship with at least one position point.
在一种可行的实施方式中,所述处理单元还用于:基于所述第二图结构中各位置点之间的连接关系,得到邻接距离矩阵;其中,所述邻接距离矩阵中的每个元素用于表征所述第二图结构中一个所述第二类位置点到一个所述第一类位置点之间边的数量;对所述邻接距离矩阵进行归一化处理,得到权重矩阵;将所述权重矩阵和第一矩阵相乘,得到第二矩阵;其中,所述第一矩阵中的元素用于表征所述第二图结构中每个所述第一类位置点的属性中包含的第一属性,所述第二矩阵中的元素用于表征所述第二图结构中每个所述第二类位置点的属性中包含的第一属性。In a feasible implementation, the processing unit is further configured to: obtain an adjacency distance matrix based on the connection relationship between each location point in the second graph structure; wherein each of the adjacency distance matrix The element is used to represent the number of edges between a second type position point and a first type position point in the second graph structure; normalize the adjacency distance matrix to obtain a weight matrix; Multiply the weight matrix and the first matrix to obtain a second matrix; wherein the elements in the first matrix are used to represent the attributes contained in each of the first type position points in the second graph structure. The first attribute of the second matrix is used to characterize the first attribute included in the attributes of each second type position point in the second graph structure.
在一种可行的实施方式中,所述第一属性为所述第一位置点的坐标或室内外位置标识,所述室内外位置标识用于指示所述第一位置点为室内位置点或室外位置点。In a feasible implementation, the first attribute is the coordinates of the first location point or an indoor or outdoor location identifier, and the indoor or outdoor location identifier is used to indicate that the first location point is an indoor location point or an outdoor location. location point.
在一种可行的实施方式中,所述处理单元还用于:基于所述多个位置点中每个位置点的坐标进行位置定位、导航、大数据统计分析或服务推荐。In a feasible implementation, the processing unit is further configured to perform location positioning, navigation, big data statistical analysis, or service recommendation based on the coordinates of each of the multiple location points.
在一种可行的实施方式中,所述数量分布特征包含正态分布、泊松分布、离散分布或区间分布中的至少一种,所述区间分布用于描述位于不同区间内特征信号的数量。In a feasible implementation, the quantity distribution characteristics include at least one of normal distribution, Poisson distribution, discrete distribution or interval distribution, and the interval distribution is used to describe the number of characteristic signals located in different intervals.
在一种可行的实施方式中,所述特征信号包含全球导航卫星系统GNSS信号、位置坐标、射频信号、光学信号、声学信号、传感器信号或地磁信号中的一种或多种,其中,所述射频信号包括WIFI信号、蓝牙信号、小区CELL信号或超宽频UWB信号中的一种或多种。In a feasible implementation, the characteristic signal includes one or more of GNSS signals, position coordinates, radio frequency signals, optical signals, acoustic signals, sensor signals or geomagnetic signals, wherein, Radio frequency signals include one or more of WIFI signals, Bluetooth signals, cell CELL signals or ultra-wideband UWB signals.
第三方面,本申请实施例提供了一种芯片系统,所述芯片系统包括至少一个处理器,存储器和接口电路,所述存储器、所述接口电路和所述至少一个处理器通过线路互联,所述至 少一个存储器中存储有指令;所述指令被所述处理器执行时,上述第一方面中任意一项所述的方法得以实现。In a third aspect, embodiments of the present application provide a chip system. The chip system includes at least one processor, a memory, and an interface circuit. The memory, the interface circuit, and the at least one processor are interconnected through lines. Instructions are stored in the at least one memory; when the instructions are executed by the processor, the method described in any one of the above first aspects is implemented.
第四方面,本申请实施例提供了一种计算机设备,所述计算机设备包括至少一个处理器,存储器和接口电路,所述存储器、所述接口电路和所述至少一个处理器通过线路互联,所述至少一个存储器中存储有指令;所述指令被所述处理器执行时,上述第一方面中任意一项所述的方法得以实现。In a fourth aspect, embodiments of the present application provide a computer device. The computer device includes at least one processor, a memory, and an interface circuit. The memory, the interface circuit, and the at least one processor are interconnected through lines. Instructions are stored in the at least one memory; when the instructions are executed by the processor, the method described in any one of the above first aspects is implemented.
在一种可行的实施方式中,所述计算机设备为服务器或终端设备,其中,所述终端设备包括手机、电脑、车机或平板。In a feasible implementation, the computer device is a server or a terminal device, wherein the terminal device includes a mobile phone, a computer, a car machine, or a tablet.
第五方面,本申请实施例提供了一种计算机可读存储介质,所述计算机可读存储介质存储有计算机程序,该计算机程序被执行时,上述第一方面中任意一项所述的方法得以实现。In a fifth aspect, embodiments of the present application provide a computer-readable storage medium that stores a computer program. When the computer program is executed, the method described in any one of the above-mentioned first aspects can be performed. accomplish.
第六方面,本申请实施例提供了一种计算机程序产品,所述计算机程序产品包括程序指令,当所述程序指令在计算机上运行时,上述第一方面中任意一项所述的方法得以实现。In a sixth aspect, embodiments of the present application provide a computer program product. The computer program product includes program instructions. When the program instructions are run on a computer, the method described in any one of the above first aspects is implemented. .
附图说明Description of drawings
以下对本申请实施例用到的附图进行介绍。The drawings used in the embodiments of this application are introduced below.
图1为本申请实施例提供的一种系统架构示意图;Figure 1 is a schematic diagram of a system architecture provided by an embodiment of the present application;
图2为本申请实施例提供的一种轨迹数据采集的场景示意图;Figure 2 is a schematic diagram of a trajectory data collection scenario provided by an embodiment of the present application;
图3为本申请实施例提供的一种轨迹数据的表征方法流程示意图;Figure 3 is a schematic flow chart of a characterization method for trajectory data provided by an embodiment of the present application;
图4为本申请实施例中一种轨迹数据的结构示意图;Figure 4 is a schematic structural diagram of trajectory data in an embodiment of the present application;
图5A-图5C为本申请实施例提供的一种图结构的生成过程示意图;Figures 5A-5C are schematic diagrams of the generation process of a graph structure provided by the embodiment of the present application;
图6为本申请实施例提供的一种第二矩阵的计算过程示意图;Figure 6 is a schematic diagram of the calculation process of a second matrix provided by an embodiment of the present application;
图7A-图7B为基于本申请的轨迹数据表征方法提取出的室内外空间拓扑结构示意图;Figures 7A-7B are schematic diagrams of indoor and outdoor spatial topological structures extracted based on the trajectory data representation method of this application;
图8为本申请实施例提供的一种轨迹数据的表征装置结构示意图;Figure 8 is a schematic structural diagram of a device for characterizing trajectory data provided by an embodiment of the present application;
图9为本申请提供的一种计算机设备的结构示意图。Figure 9 is a schematic structural diagram of a computer device provided by this application.
具体实施方式Detailed ways
下面结合本申请实施例中的附图对本申请实施例进行描述。其中,在本申请实施例的描述中,除非另有说明,“/”表示或的意思,例如,A/B可以表示A或B;文本中的“和/或”仅仅是一种描述关联对象的关联关系,表示可以存在三种关系,例如,A和/或B,可以表示:单独存在A,同时存在A和B,单独存在B这三种情况,另外,在本申请实施例的描述中,“多个”是指两个或多于两个。The embodiments of the present application will be described below with reference to the drawings in the embodiments of the present application. Among them, in the description of the embodiments of this application, unless otherwise stated, "/" means or, for example, A/B can mean A or B; "and/or" in the text is only a way to describe related objects. The association relationship means that there can be three relationships. For example, A and/or B can mean: A exists alone, A and B exist simultaneously, and B exists alone. In addition, in the description of the embodiment of the present application , "plurality" means two or more than two.
本申请的说明书和权利要求书及所述附图中的术语“第一”、“第二”、“第三”和“第四”等是用于区别不同对象,而不是用于描述特定顺序。此外,术语“包括”和“具有”以及它们任何变形,意图在于覆盖不排他的包含。例如,包含了一系列步骤或单元的过程、方法、系统、产品或设备,没有限定于已列出的步骤或单元,而是可选地还包括没有列出的步骤或单元,或可选地还包括对于这些过程、方法、产品或设备固有的其它步骤或单元。在本文中提及“实施例”意味着,结合实施例描述的特定特征、结构或特性可以包含在本申请的至少一个实施例中。在说明书中的各个位置出现该短语并不一定均是指相同的实施例,也不是与其它实施例互斥的独立的或备选的实施例。本领域技术人员显式地和隐式地理解的是,本文所描述的实施例可以与其它实施例相结合。The terms “first”, “second”, “third” and “fourth” in the description, claims and drawings of this application are used to distinguish different objects, rather than to describe a specific sequence. . Furthermore, the terms "including" and "having" and any variations thereof are intended to cover non-exclusive inclusion. For example, a process, method, system, product or device that includes a series of steps or units is not limited to the listed steps or units, but optionally includes steps or units that are not listed, or alternatively Other steps or elements inherent to such processes, methods, products or devices are also included. Reference herein to "an embodiment" means that a particular feature, structure or characteristic described in connection with the embodiment can be included in at least one embodiment of the present application. The appearances of this phrase in various places in the specification are not necessarily all referring to the same embodiment, nor are separate or alternative embodiments mutually exclusive of other embodiments. Those skilled in the art understand, both explicitly and implicitly, that the embodiments described herein may be combined with other embodiments.
首先对本申请中的相关术语进行解释。First, the relevant terms in this application are explained.
(1)众包:指通过一定的触发机制,在用户无感知的情况下,收集用户智能终端的传感器、射频信号及网络信号等可用数据。众包是采集室内外轨迹数据的一种常用方式。(1) Crowdsourcing: refers to collecting available data such as sensors, radio frequency signals and network signals of users’ smart terminals without the user’s perception through a certain triggering mechanism. Crowdsourcing is a common way to collect indoor and outdoor trajectory data.
(2)轨迹数据:指收集一个连续时间段内用户智能终端所采集的特征信号数据,每个轨迹数据表示了智能终端随用户移动过程中在不同位置处采集到的特征信号数据,包括室内轨迹数据和室外轨迹数据。(2) Trajectory data: refers to the collection of characteristic signal data collected by the user's intelligent terminal within a continuous period of time. Each trajectory data represents the characteristic signal data collected at different locations by the intelligent terminal as the user moves, including indoor trajectories. data and outdoor trajectory data.
(3)小区CELL:采用基站识别码或全球小区标识(Cell Global Identifier,CGI)进行标识的无线覆盖的区域。在使用全向天线结构时,小区即为基站区。(3) Cell CELL: A wireless coverage area identified by a base station identification code or a Cell Global Identifier (CGI). When using an omnidirectional antenna structure, the cell is the base station area.
(4)室内外位置标识(Indoor or Outdoor Location Identifier,IOD):通过是否接收到GNSS信号来生成的位置标识,以标识当前位置位于室内还是室外。例如,当接收到上述信号时,将其设置为1,即该位置点位于室外,当未接收到上述信号时,将其设置为0,即该位置点位于室内。(4) Indoor or Outdoor Location Identifier (IOD): A location identifier generated by whether a GNSS signal is received to identify whether the current location is indoors or outdoors. For example, when the above signal is received, it is set to 1, that is, the location point is located outdoors. When the above signal is not received, it is set to 0, that is, the location point is located indoors.
下面将介绍本申请的系统架构和应用场景。The system architecture and application scenarios of this application will be introduced below.
请参见图1,图1为本申请实施例提供的一种系统架构示意图。如图1所示,系统架构包括用户设备101、用户设备102、云端110和数据存储系统120,其中,云端110和用户设备通过通信网络进行数据传输。应当注意,图1中所列举的用户设备并不构成对其数量的限定。Please refer to Figure 1, which is a schematic diagram of a system architecture provided by an embodiment of the present application. As shown in Figure 1, the system architecture includes user equipment 101, user equipment 102, cloud 110 and data storage system 120, where cloud 110 and user equipment perform data transmission through a communication network. It should be noted that the user equipment listed in Figure 1 does not constitute a limit to their number.
下面基于图1所示系统架构介绍本发明适用的两个具体应用场景。The following will introduce two specific application scenarios to which the present invention is applicable based on the system architecture shown in Figure 1.
在第一种场景下,用户设备采集一段时间内的数据形成轨迹数据,然后将轨迹数据通过通信网络上传到云端110。可选地,轨迹数据的采集方式可以是众包、人工采集或其它采集方式,本申请对此不限定。轨迹数据包括室内轨迹数据和/或室外轨迹数据。In the first scenario, the user equipment collects data over a period of time to form trajectory data, and then uploads the trajectory data to the cloud 110 through the communication network. Optionally, the trajectory data collection method may be crowdsourcing, manual collection or other collection methods, which is not limited by this application. The trajectory data includes indoor trajectory data and/or outdoor trajectory data.
云端110对一段时间内(如3天或一个月)所接收到的特定区域内(例如以商场为中心的区域)的轨迹数据进行表征(即本申请中的轨迹数据的表征方法),形成该特定区域内(包含室内和室外)的地图,并将生成的位置地图和轨迹数据的表征数据(即后文实施例找中的位置点和位置点的属性)存入数据存储系统120中。用户设备在后续进入该特定区域中的建筑物时,可以向云端110发送定位请求。The cloud 110 characterizes the trajectory data received in a specific area (such as a shopping mall-centered area) within a period of time (such as 3 days or a month) (i.e., the characterization method of trajectory data in this application) to form the Maps in a specific area (including indoor and outdoor), and store the generated position map and representation data of the trajectory data (ie, the position points and attributes of the position points found in the following embodiments) into the data storage system 120 . When the user device subsequently enters a building in the specific area, it may send a positioning request to the cloud 110 .
云端110基于已经生成的地图为用户提供在线定位服务,或者向用户发送该特定区域的地图,以为用户提供离线低功耗定位服务。然后在定位服务的基础上,基于用户其它个性化需求(例如,导航请求和服务推荐请求),为用户提供导航服务和个性化推荐服务(例如,基于用户特定请求向用户推荐该特定区域内的商家等)。The cloud 110 provides users with online positioning services based on the generated map, or sends the user a map of the specific area to provide users with offline low-power positioning services. Then, based on the positioning service and other personalized needs of the user (for example, navigation request and service recommendation request), the user is provided with navigation service and personalized recommendation service (for example, based on the user's specific request to recommend the user in the specific area) merchants, etc.).
可选地,云端110可以包含至少一台服务器,数据存储系统120可以是与服务器独立的设备,或集成在服务器内,本申请对此不限定。Optionally, the cloud 110 may include at least one server, and the data storage system 120 may be a device independent of the server, or integrated within the server, which is not limited in this application.
在第二种场景下,当轨迹数据是从一个比较小的范围内(区域半径小于预设半径时,具体可以是一个地铁站或公交站等场所)采集到的情况下,图1中的用户设备可以独立地执行本申请中的轨迹数据表征方法。此时的轨迹数据可以是通过用户设备采集和/或者其它设备传输给用户设备的,本申请对此不限定。In the second scenario, when the trajectory data is collected from a relatively small range (when the area radius is smaller than the preset radius, it can be a subway station or a bus station, etc.), the user in Figure 1 The device can independently execute the trajectory data characterization method in this application. The trajectory data at this time may be collected by the user equipment and/or transmitted to the user equipment by other equipment, which is not limited in this application.
可选地,用户设备为手机、电脑、智能手表、手环等具有通信和数据采集功能的终端设备,本申请对此不限定。Optionally, the user equipment is a terminal device with communication and data collection functions such as a mobile phone, computer, smart watch, bracelet, etc. This application is not limited to this.
请参见图2,图2为本申请实施例提供的一种轨迹数据采集的场景示意图,用于描述以 建筑物为中心的特定区域内轨迹数据的采集情况,该特定区域包括该建筑物以及与建筑物相邻的街道。Please refer to Figure 2. Figure 2 is a schematic diagram of a trajectory data collection scenario provided by an embodiment of the present application. It is used to describe the collection of trajectory data in a specific area centered on a building. The specific area includes the building and the surrounding area. Streets adjacent to buildings.
图2示出该特定区域内的6条轨迹(即6条轨迹数据)。其中,轨迹1和轨迹2上数据的采集位置都位于室内,轨迹3-轨迹5上数据的采集位置既包括室内,也包括室外,轨迹6上数据的采集位置都位于室外。Figure 2 shows 6 trajectories (ie, 6 trajectories data) within this specific area. Among them, the data collection locations on Track 1 and Track 2 are all located indoors, the data collection locations on Tracks 3 to 5 include both indoors and outdoors, and the data collection locations on Track 6 are all located outdoors.
请参见图3,图3为本申请实施例提供的一种轨迹数据的表征方法流程示意图。如图3所示,该方法包括步骤S310和步骤S320。其中,Please refer to FIG. 3 , which is a schematic flowchart of a method for characterizing trajectory data provided by an embodiment of the present application. As shown in Figure 3, the method includes step S310 and step S320. in,
步骤S310:获取多条轨迹数据,所述多条轨迹数据中的每条轨迹数据包含多个时间段,以及所述多个时间段中每个时间段内采集的特征信号。Step S310: Acquire multiple pieces of trajectory data, each of the multiple pieces of trajectory data includes multiple time periods, and characteristic signals collected in each of the multiple time periods.
其中,每条轨迹数据为一段连续时间内采集的特征信号,该一段连续时间的长度不限,例如,可以是30分钟或1小时等。其中,该一段连续时间内采集的特征信号包含至少一类特征信号。Each track data is a characteristic signal collected within a continuous period of time. The length of the continuous period of time is not limited, for example, it can be 30 minutes or 1 hour. Wherein, the characteristic signals collected within a continuous period of time include at least one type of characteristic signals.
其中,所述特征信号包含全球导航卫星系统GNSS信号、位置坐标、射频信号、光学信号、声学信号、传感器信号或地磁信号中的一种或多种,其中,所述射频信号包括WIFI信号、蓝牙信号、小区CELL信号或超宽频UWB信号中的一种或多种。Wherein, the characteristic signal includes one or more of GNSS signals, position coordinates, radio frequency signals, optical signals, acoustic signals, sensor signals or geomagnetic signals, where the radio frequency signals include WIFI signals, Bluetooth signal, cell CELL signal or ultra-wideband UWB signal.
可选地,上述位置坐标可以是不同的坐标参考系下的坐标。其中,坐标参考系可以是大地坐标系,投影坐标系或自定义坐标系等,本申请对此不限定。其中,大地坐标系下的位置坐标可以用经度、维度和高程等来进行表征。Optionally, the above-mentioned position coordinates may be coordinates in different coordinate reference systems. The coordinate reference system may be a geodetic coordinate system, a projected coordinate system, a custom coordinate system, etc. This application is not limited to this. Among them, the position coordinates in the geodetic coordinate system can be characterized by longitude, latitude, elevation, etc.
其中,超宽频(Ultra Wide Band,UWB)为频率带宽在1GHz以上无线载波信号。Among them, Ultra Wide Band (UWB) is a wireless carrier signal with a frequency bandwidth above 1GHz.
可选地,上述位置坐标可以是通过采集到的GNSS信号计算得到的、也可以是通过人工采集或从其它终端设备直接获取等方式得到的,本申请对此不限定。Optionally, the above position coordinates may be calculated through collected GNSS signals, or may be obtained through manual collection or direct acquisition from other terminal devices, which is not limited in this application.
其中,在上述一段连续时间内,每类特征信号可以按照固定或非固定的时间间隔进行采集,不同类别特征信号的采集时间间隔可以相同或者不同。Among them, within the above-mentioned continuous period of time, each type of characteristic signal can be collected at fixed or non-fixed time intervals, and the collection time intervals of different types of characteristic signals can be the same or different.
其中,对于具有信号强度的特征信号而言,其在一个采集时刻的采集内容包括至少一个该类特征信号的标识ID和对应的信号强度。Among them, for a characteristic signal with signal strength, its collection content at a collection moment includes at least one identification ID of this type of characteristic signal and the corresponding signal strength.
例如,对于WIFI信号而言,每个采集时刻的采集内容包括智能终端识别到的WIFI信号的ID(如,媒体存取控制(Media Access Control,MAC)ID)和对应的信号强度。For example, for WIFI signals, the collection content at each collection moment includes the ID of the WIFI signal recognized by the smart terminal (for example, Media Access Control (MAC) ID) and the corresponding signal strength.
再例如,对于CELL信号而言,每个采集时刻的采集内容包括智能终端识别到的CELL信号的ID(如全球小区标识(Cell Global Identifier,CGI)和/或物理小区标识(Physical Cell Identifier,PCI))和对应的信号强度。For another example, for CELL signals, the collection content at each collection moment includes the ID of the CELL signal recognized by the smart terminal (such as the global cell identifier (Cell Global Identifier, CGI) and/or the physical cell identifier (Physical Cell Identifier, PCI) )) and the corresponding signal strength.
其中,上述每条轨迹数据中包含的多个时间段与采集每条轨迹数据的一段连续时间的关系为:每条轨迹数据内包含的多个时间段是对该一段连续时间按照固定时间间隔,或非固定时间间隔划分得到的,且划分得到的该多个时间段中任意两个时间段无重叠。Among them, the relationship between the multiple time periods included in the above-mentioned trajectory data and a continuous period of time during which each trajectory data is collected is: the multiple time periods included in each trajectory data are based on a fixed time interval for this continuous period of time. Or divided into non-fixed time intervals, and any two time periods among the multiple divided time periods do not overlap.
可选地,对每条轨迹数据进行数据采集的一段连续时间的划分方式可以依据具体场景确定,本申请对此不限定。Optionally, the method of dividing a continuous period of time for data collection of each trajectory data can be determined according to the specific scenario, which is not limited in this application.
可选地,对于每类特征信号而言,划分好的每个时间段内只包含该类特征信号的一个采集时刻。Optionally, for each type of characteristic signal, each divided time period contains only one collection moment of the type of characteristic signal.
图4为本申请实施例中一种轨迹数据的结构示意图。列举了一条轨迹数据上两类特征信号的采集示例。如图4所示,该条轨迹数据的采集时间(即上述的一段连续时间)为12:30 到12:45,其上包含三个时间段:12:30-12:35、12:35-12:40、12:40-12:45,即是按照固定时间间隔(5分钟)对该一段连续时间(15分钟)划分得到的。所采集的特征信号包含小区CELL信号和WIFI信号。其中,小区信号的采集时刻为:12:32、12:37和12:43;WIFI信号的采集时刻为12:34、12:39和12:44。即小区信号和WIFI信号的采集时间间隔都为5分钟。Figure 4 is a schematic structural diagram of trajectory data in an embodiment of the present application. Examples of collecting two types of characteristic signals on a trajectory data are listed. As shown in Figure 4, the collection time of this trajectory data (i.e., the above-mentioned continuous time) is from 12:30 to 12:45, which contains three time periods: 12:30-12:35, 12:35- 12:40, 12:40-12:45 is obtained by dividing the continuous time (15 minutes) according to a fixed time interval (5 minutes). The collected characteristic signals include cell CELL signals and WIFI signals. Among them, the collection times of cell signals are: 12:32, 12:37 and 12:43; the collection times of WIFI signals are 12:34, 12:39 and 12:44. That is, the collection time interval of both cell signal and WIFI signal is 5 minutes.
可选地,上述每个时刻采集的小区信号包含一个主站信号和至少一个邻站信号。每个时刻采集到的WIFI信号包含智能终端所识别到的所有WIFI信号。即在上述采集时间内的每个采集时刻,采集到的特征信号的数量至少为一个。Optionally, the above-mentioned cell signals collected at each moment include a master station signal and at least one neighbor station signal. The WIFI signals collected at each moment include all WIFI signals recognized by the smart terminal. That is, at each collection moment within the above collection time, the number of collected characteristic signals is at least one.
应当理解,上述图4所示仅为一个轨迹数据的采集示例,其不构成对采集的特征信号的种类、每类特征信号的采集时间间隔等的限定。It should be understood that the above-mentioned figure 4 is only an example of collecting trajectory data, and it does not constitute a limitation on the types of characteristic signals collected, the collection time interval of each type of characteristic signal, etc.
步骤S320:基于所述多条轨迹数据生成多个位置点以及所述多个位置点中每个位置点的属性;其中,所述多个位置点中包含第一位置点,所述第一位置点在所述多条轨迹数据中对应一个或多个时间段,所述一个或多个时间段中每个时间段的组合标识相同,所述每个时间段的组合标识由所述每个时间段对应的至少一个特征信号的标识ID生成,所述第一位置点由所述组合标识进行表征;所述一个或多个时间段内采集的特征信号包含至少一类特征信号,所述第一位置点的属性包含所述至少一类特征信号中每类特征信号的数量和/或每类特征信号的数量分布特征。Step S320: Generate multiple location points and attributes of each location point in the multiple location points based on the multiple trajectory data; wherein the multiple location points include a first location point, and the first location point Points in the plurality of trajectory data correspond to one or more time periods, the combination identifier of each time period in the one or more time periods is the same, and the combination identifier of each time period is determined by each time period. The identification ID of at least one characteristic signal corresponding to the segment is generated, and the first location point is characterized by the combined identification; the characteristic signals collected in one or more time periods include at least one type of characteristic signal, and the first The attributes of the location points include the number of each type of characteristic signals in the at least one type of characteristic signals and/or the quantity distribution characteristics of each type of characteristic signals.
应当理解,本申请是以多个位置点中的第一位置点为对象进行描述,来一同表示每个位置点及其属性的生成,即第一位置点为多个位置点中的任意一个。It should be understood that this application describes the first position point among multiple position points as an object to represent the generation of each position point and its attributes, that is, the first position point is any one of the multiple position points.
在按照前述方法得到每条轨迹数据的多个时间段后,开始构建每个时间段的组合标识。组合标识相同的时间段对应同一个位置点,且该同一位置点由组合标识进行表示。After obtaining multiple time periods of each trajectory data according to the aforementioned method, start constructing the combined identifier of each time period. Time periods with the same combination identifier correspond to the same location point, and the same location point is represented by the combination identifier.
其中,每个时间段对应的至少一个特征信号包括该时间段内采集的特征信号。Wherein, at least one characteristic signal corresponding to each time period includes the characteristic signal collected during the time period.
具体地,每个时间段的组合标识是由该时间段内采集到的至少一个特征信号的标识组合得到的。从技术效果上看,由于不同位置采集到的特征信号的种类和信号强度不同,因而可以利用组合标识来表示采集位置的异同,也即具有相同组合标识的时间段对应相同的采集位置,不同组合标识的时间段对应不同的采集位置。Specifically, the combined identifier of each time period is obtained by combining the identifiers of at least one characteristic signal collected within the time period. From a technical effect point of view, since the types and signal strengths of the characteristic signals collected at different locations are different, combination identifiers can be used to indicate the similarities and differences of the collection locations. That is, the time period with the same combination identifier corresponds to the same collection location, and different combinations The identified time periods correspond to different collection locations.
下面以第一位置点对应的一个或多个时间段为对象描述每个时间段的组合标识的构建过程。The following describes the construction process of the combined identifier of each time period, taking one or more time periods corresponding to the first position point as an object.
可选地,所述一个或多个时间段中包含第一时间段,所述第一时间段内包含M个特征信号,所述M个特征信号中包含C类特征信号,M和C为正整数;所述组合标识由所述M个特征信号中的N个特征信号的ID生成,所述N个特征信号中包含D类特征信号,N为小于或等于M的正整数,D为小于或等于C的正整数。Optionally, the one or more time periods include a first time period, the first time period contains M characteristic signals, the M characteristic signals include C-type characteristic signals, M and C are positive Integer; the combined identification is generated by the ID of N characteristic signals among the M characteristic signals, the N characteristic signals include D type characteristic signals, N is a positive integer less than or equal to M, and D is less than or equal to A positive integer equal to C.
其中,第一时间段为上述一个或多个时间段中的任一时间段。即对于每个时间段而言,其组合标识是由该时间段内采集到的至少一个特征信号的ID组合生成。该至少一个特征信号包含至少一类特征信号。Wherein, the first time period is any time period among the one or more time periods mentioned above. That is, for each time period, its combination identification is generated by a combination of the IDs of at least one characteristic signal collected within the time period. The at least one characteristic signal includes at least one type of characteristic signal.
例如,每个时间段的组合标识可以由多个CELL信号(如,一个主站信号+多个邻站信号)的ID组合生成;或者由多个WIFI信号的MAC ID组合生成;或者由至少一个CELL信号、至少一个WIFI信号和至少一个蓝牙信号的ID组合生成;或者由至少一个WIFI信号、至少一个地磁信号和至少一个传感器信号的ID组合生成,本申请对此不限定。For example, the combined identification of each time period can be generated by a combination of IDs of multiple CELL signals (such as a master station signal + multiple neighbor station signals); or by a combination of MAC IDs of multiple WIFI signals; or by at least one It is generated by an ID combination of CELL signal, at least one WIFI signal and at least one Bluetooth signal; or it is generated by an ID combination of at least one WIFI signal, at least one geomagnetic signal and at least one sensor signal, which is not limited by this application.
其中,由至少一个特征信号的ID组合生成组合标识的方式,可以是直接利用ID串联组合,也可以利用特定算法(如哈希算法等)计算得到。Among them, the method of generating the combined identification from the ID combination of at least one characteristic signal can be directly using the serial combination of IDs, or can be calculated using a specific algorithm (such as a hash algorithm, etc.).
应当理解,用于生成组合标识的特征信号种类越多,组合标识的唯一性越强,即不同组合标识的时间段对应的采集位置不同。It should be understood that the more types of characteristic signals used to generate the combined identification, the stronger the uniqueness of the combined identification, that is, the collection locations corresponding to the time periods of different combined identifications are different.
进一步地,当利用到同一类特征信号中的多个特征信号来生成组合标识时,该同一类特征信号中多个特征信号的筛选方式具体如下:Further, when multiple characteristic signals in the same type of characteristic signals are used to generate a combined identifier, the screening method of multiple characteristic signals in the same type of characteristic signals is as follows:
对于有信号强度的特征信号而言,将第一时间段内同一类特征信号按照信号强度的大小从高到低进行排序,然后按照信号强度从高到低筛选出预设数量的特征信号,被筛选出的特征信号后续用于生成第一时间段的组合标识。For characteristic signals with signal strength, the same type of characteristic signals in the first time period are sorted from high to low according to the signal strength, and then a preset number of characteristic signals are filtered out according to the signal strength from high to low, and are The filtered characteristic signals are subsequently used to generate a combined identification of the first time period.
具体地,上述C类特征信号中包括第一类特征信号。所述M个特征信号中包含A个所述第一类特征信号,所述N个特征信号包含B个第一类特征信号,A个所述第一类特征信号中包含所述B个第一类特征信号,所述B个第一类特征信号的信号强度大于A个所述第一类特征信号中其它特征信号的信号强度;其中,B为小于或等于A的正整数。Specifically, the above-mentioned type C characteristic signals include first type characteristic signals. The M characteristic signals include A first-type characteristic signals, the N characteristic signals include B first-type characteristic signals, and the A first-type characteristic signals include the B first-type characteristic signals. Class characteristic signals, the signal strength of the B first class characteristic signals is greater than the signal intensity of other characteristic signals in the A first class characteristic signals; where B is a positive integer less than or equal to A.
其中,上述A个第一类特征信号中包含上述的B个第一类特征信号。B个第一类特征信号是基于特征信号的信号强度从A个第一类特征信号中筛选出来的。具体地,B个第一类特征信号的信号强度大于A个第一类特征信号中其它特征信号的信号强度。Among them, the above-mentioned A first-type characteristic signals include the above-mentioned B first-type characteristic signals. B first-type characteristic signals are selected from A first-type characteristic signals based on the signal strength of the characteristic signal. Specifically, the signal strength of the B first-type characteristic signals is greater than the signal strength of other characteristic signals among the A first-type characteristic signals.
举例来说,假设每个时间段内组合标识是由主站ID和3个邻站ID组合生成的。若第一时间段内采集到的CELL信号包含一个主站信号和10个邻站信号,此时先按照邻站信号的信号强度筛选出信号强度较高的前3个邻站信号,然后利用主站ID(如CGI)和筛选出的3个邻站ID(如PCI)来组合生成第一时间段的组合标识。For example, assume that the combined identifier in each time period is generated by a combination of the primary station ID and three neighbor station IDs. If the CELL signals collected in the first time period include a master station signal and 10 neighbor station signals, first filter out the top 3 neighbor station signals with higher signal strength according to the signal strength of the neighbor station signals, and then use the main station signal to The station ID (such as CGI) and the three filtered neighbor station IDs (such as PCI) are combined to generate a combined identification of the first time period.
举例来说,假设每个时间段内组合标识是由5个WIFI信号组合生成的。若第一时间段内采集到的WIFI信号包含10个WIFI信号,此时按照WIFI信号的信号强度筛选出信号强度较高的前5个WIIF信号,然后利用该前5个WIFI信号的ID组合生成第一时间段的组合标识。For example, assume that the combination identifier is generated by a combination of 5 WIFI signals in each time period. If the WIFI signals collected in the first time period include 10 WIFI signals, then the top 5 WIIF signals with higher signal strengths are screened out according to the signal strength of the WIFI signals, and then the ID combination of the top 5 WIFI signals is used to generate The combination identifier of the first time period.
可选地,所述第一位置点的属性包含室内外位置标识,所述室内外位置标识用于指示所述第一位置点为室内位置点或室外位置点;当所述第一位置点为室内位置点时,所述一个或多个时间段内特征信号的采集位置位于室内,当所述第一位置点为室外位置点时,所述一个或多个时间段内特征信号的采集位置位于室外。Optionally, the attributes of the first location point include indoor and outdoor location identifiers, and the indoor and outdoor location identifiers are used to indicate that the first location point is an indoor location point or an outdoor location point; when the first location point is When the first position point is an indoor position point, the collection position of the characteristic signal in the one or more time periods is located indoors. When the first position point is an outdoor position point, the collection position of the characteristic signal in the one or more time periods is located in outdoor.
具体地,第一位置点的室内外标识可通过以下方式确定:统计上述第一位置点对应的时间段的数量,以及每个时间段内采集的特征信号是否包含GNSS信号。当采集到卫星定位信号的时间段的数量大于或等于预设比例时,可将第一位置点的室内外位置标识设置为1,用于指示第一位置点对应的一个或多个时间段内特征信号的采集位置位于室外;当采集到卫星定位信号的时间段的数量小于预设比例时,可将第一位置点的室内外位置标识设置为0,用于指示第一位置点对应的一个或多个时间段内特征信号的采集位置位于室内。Specifically, the indoor and outdoor identification of the first location point can be determined by counting the number of time periods corresponding to the first location point and whether the characteristic signals collected in each time period include GNSS signals. When the number of time periods during which satellite positioning signals are collected is greater than or equal to the preset ratio, the indoor and outdoor location identifier of the first location point can be set to 1 to indicate one or more time periods corresponding to the first location point. The collection location of the characteristic signal is located outdoors; when the number of time periods in which satellite positioning signals are collected is less than the preset ratio, the indoor and outdoor location identifier of the first location point can be set to 0 to indicate a location corresponding to the first location point. Or the collection location of characteristic signals in multiple time periods is located indoors.
在利用获取到的多条轨迹数据生成了多个位置点后,开始确定该多个位置点之间的相邻关系。After multiple position points are generated using the acquired trajectory data, adjacent relationships between the multiple position points are determined.
其中,所述多个位置点中还包含第二位置点,表征所述第二位置点的组合标识由E个特征信号的ID生成。Wherein, the plurality of position points also include a second position point, and the combined identifier representing the second position point is generated from the IDs of the E characteristic signals.
下面以上述多个位置点中的第一位置点和第二位置点为例来描述如何确定多个位置点中的任意两个位置点的相邻关系,即第一位置点和第二位置点为多个位置点中的任意两个。The following takes the first position point and the second position point among the above multiple position points as an example to describe how to determine the adjacent relationship between any two position points among the multiple position points, that is, the first position point and the second position point. are any two of multiple location points.
具体地,当所述第一位置点和所述第二位置点满足第一条件、第二条件或第三条件中的 一个或多个时,所述第一位置点与所述第二位置点相邻。Specifically, when the first position point and the second position point satisfy one or more of the first condition, the second condition or the third condition, the first position point and the second position point are Adjacent.
即可以依据具体地应用场景选择不同的规则,来确定第一位置点和第二位置点是否相邻(或称具有相邻关系)。其中,上述第一位置点和第二位置点相邻指第一位置点和第二位置点之间的距离小于或等于预设距离。That is, different rules can be selected according to specific application scenarios to determine whether the first position point and the second position point are adjacent (or have an adjacent relationship). Wherein, the above-mentioned first position point and the second position point are adjacent means that the distance between the first position point and the second position point is less than or equal to the preset distance.
由上述描述可知,确定第一位置点和第二位置点是否相邻的规则包括7种:1)满足第一条件;2)满足第二条件;3)满足第三条件;4)同时满足第一条件和第二条件;5)同时满足第条件和第三条件;6)同时满足第一条件和第三条件;7)同时满足第一条件、第二条件和第三条件。It can be seen from the above description that there are seven rules for determining whether the first position point and the second position point are adjacent: 1) satisfy the first condition; 2) satisfy the second condition; 3) satisfy the third condition; 4) satisfy the third condition at the same time The first condition and the second condition; 5) The first condition and the third condition are met simultaneously; 6) The first condition and the third condition are met simultaneously; 7) The first condition, the second condition and the third condition are met simultaneously.
下面具体介绍上述第一条件、第二条件和第三条件。The above-mentioned first condition, second condition and third condition are introduced in detail below.
(1)第一条件(1)First condition
所述第一条件为:所述E个特征信号和所述N个特征信号中包含共同的G个特征信号,G和E的比值、G和N的比值都大于或等于第一阈值;且所述E个特征信号中的第一信号的平均信号强度与所述N个特征信号中第一信号的平均信号强度的差值小于或等于第二阈值,所述第一信号为所述G个特征信号中的任意一个。The first condition is: the E characteristic signals and the N characteristic signals include common G characteristic signals, and the ratio of G and E and the ratio of G and N are both greater than or equal to the first threshold; and The difference between the average signal strength of the first signal among the E characteristic signals and the average signal strength of the first signal among the N characteristic signals is less than or equal to the second threshold, and the first signal is the G characteristic signal. any of the signals.
具体地,对于第一位置点而言,首先计算出用于生成组合标识的N个特征信号中每个特征信号的平均信号强度。具体地,对于该N个特征信号中的每个特征信号而言,将第一位置点对应的每个时间段内的该特征信号的信号强度进行求和,然后取平均,得到每个特征信号的平均信号强度。同理,E个特征信号中每个特征信号的平均信号强度也可以参照上述方法得到。最后,计算E个特征信号中第一特征信号的平均信号强度和N个特征信号中的平均信号强度的差值,当该差值小于或等于第二阈值,且G个特征信号中的每个特征信号都满足第一特征信号的上述条件,且G和E的比值、G和N的比值都大于或等于第一阈值时,则认定第一顶点和第二顶点满足第一条件。Specifically, for the first position point, the average signal strength of each of the N characteristic signals used to generate the combined identification is first calculated. Specifically, for each of the N characteristic signals, the signal intensity of the characteristic signal in each time period corresponding to the first position point is summed, and then averaged to obtain each characteristic signal. the average signal strength. In the same way, the average signal strength of each of the E characteristic signals can also be obtained by referring to the above method. Finally, calculate the difference between the average signal intensity of the first characteristic signal among the E characteristic signals and the average signal intensity among the N characteristic signals. When the difference is less than or equal to the second threshold, and each of the G characteristic signals When the characteristic signals all meet the above conditions of the first characteristic signal, and the ratio of G and E and the ratio of G and N are both greater than or equal to the first threshold, then the first vertex and the second vertex are deemed to meet the first condition.
其中,其中,第一阈值和第二阈值基于具体场景设定。第一阈值是确保E个特征信号和N个特征信号中包含的相同特征信号达到一定比例,来证明第一位置点和第二位置点相距较近。E、G和N为正整数。Wherein, the first threshold and the second threshold are set based on specific scenarios. The first threshold is to ensure that the same characteristic signals contained in the E characteristic signals and N characteristic signals reach a certain proportion to prove that the first position point and the second position point are close to each other. E, G and N are positive integers.
(2)第二条件(2) Second condition
所述第二条件为:所述第一位置点在所述多条轨迹数据中对应H个时间段,第二位置点在所述多条轨迹数据中对应I个时间段,所述H个时间段中包含J个时间段,J与H的比值大于或等于第三阈值,对于所述J个时间段中的第二时间段,所述I个时间段内包含第三时间段,所述第二时间段和所述第三时间段位于同一轨迹数据上,且所述第二时间段和所述第三时间段的间隔小于或等于第四阈值,所述第二时间段为所述J个时间段内的任一时间段。The second condition is: the first location point corresponds to H time periods in the multiple trajectory data, the second location point corresponds to 1 time period in the multiple trajectory data, and the H time periods The segment contains J time segments, and the ratio of J to H is greater than or equal to the third threshold. For the second time segment among the J time segments, the I time segment includes the third time segment, and the third time segment The second time period and the third time period are located on the same trajectory data, and the interval between the second time period and the third time period is less than or equal to the fourth threshold, and the second time period is the J Any time period within the time period.
具体地,第二条件也称为时序条件。对于第一位置点对应的H个时间段中的J个时间段(J与H的比值大于或等于第三阈值),对于J个时间段中的任意一个时间段(即上述第二时间段)而言,在第二位置点对应的I个时间段中都存在一个第三时间段,此第三时间段与第二时间段位于同一条轨迹数据上,且第二时间段和第三时间段的时间间隔小于或等于第四阈值时,认定第一位置点和第二位置点满足上述第二条件。Specifically, the second condition is also called a timing condition. For J time periods among the H time periods corresponding to the first position point (the ratio of J to H is greater than or equal to the third threshold), for any one of the J time periods (i.e., the above-mentioned second time period) In terms of , there is a third time period in I time periods corresponding to the second position point. This third time period and the second time period are located on the same trajectory data, and the second time period and the third time period When the time interval is less than or equal to the fourth threshold, it is determined that the first position point and the second position point satisfy the above-mentioned second condition.
例如,第二时间段的采集时间为15:35-15:45,第三时间段的采集时间为15:45-15:55,可利用在先时间段(即第二时间段)的结束时间和在后时间段(即第三时间段)的开始时间计算其时间间隔,即第二时间段和第三时间段的时间间隔为0。当然,也可采用其它方式来计算上述两个时间段的时间间隔,本申请对此不限定。For example, the collection time of the second time period is 15:35-15:45, and the collection time of the third time period is 15:45-15:55. The end time of the previous time period (i.e., the second time period) can be used Calculate the time interval with the start time of the subsequent time period (ie, the third time period), that is, the time interval between the second time period and the third time period is 0. Of course, other methods can also be used to calculate the time interval between the above two time periods, and this application is not limited to this.
其中,第三阈值和第四阈值基于具体场景设定。H、I和J为正整数。The third threshold and the fourth threshold are set based on specific scenarios. H, I and J are positive integers.
(3)第三条件(3) The third condition
所述第三条件为:所述第一位置点与所述第二位置点之间的距离小于或等于第五阈值,所述第一位置点和所述第二位置点之间的距离通过所述第一位置点的坐标和所述第二位置点的坐标计算得到,所述第一位置点的坐标通过对所述一个或多个时间段内采集到的位置坐标进行求和并取均值得到。The third condition is: the distance between the first position point and the second position point is less than or equal to a fifth threshold, and the distance between the first position point and the second position point passes through The coordinates of the first position point and the coordinates of the second position point are calculated. The coordinates of the first position point are obtained by summing and averaging the position coordinates collected in the one or more time periods. .
具体地,对于具有坐标的位置点而言,可以直接利用坐标计算两个位置点之间的距离。每个位置点的坐标基于其对应的一个或多个时间段内采集到的位置坐标进行求和取平均得到。Specifically, for position points with coordinates, the distance between two position points can be directly calculated using the coordinates. The coordinates of each position point are obtained by summing and averaging the position coordinates collected in one or more corresponding time periods.
例如,第一位置点对应100个时间段,100个时间段中的89个时间段内采集有位置坐标,则第一位置点的坐标可以利用对该89个时间段内分别采集到的89个位置坐标计算平均值得到。其中,每个时间段内位置坐标的采集过程可以参见前述实施例中的描述。For example, the first position point corresponds to 100 time periods, and position coordinates are collected in 89 of the 100 time periods. Then the coordinates of the first position point can be obtained using the 89 data collected in the 89 time periods. The position coordinates are calculated by averaging. For the collection process of position coordinates in each time period, please refer to the description in the previous embodiment.
其中,第五阈值依据具体场景设定,本申请对此不限定。The fifth threshold is set according to specific scenarios, which is not limited in this application.
在依据上述实施例确定了多个位置点之间的相邻关系后,可以基于该相邻关系进行位置点的属性补全。第一位置点的属性中包含第一属性,第一属性为需要进行属性补全的任一属性。第一属性的属性补全即:对于第一属性缺失或者不准确的位置点而言,计算这些位置点的第一属性。过程如下:After the adjacent relationships between multiple location points are determined according to the above embodiment, attribute completion of the location points can be performed based on the adjacent relationships. The attributes of the first position point include the first attribute, and the first attribute is any attribute that needs attribute completion. Attribute completion of the first attribute means: for position points whose first attribute is missing or inaccurate, calculate the first attribute of these position points. The process is as follows:
可选地,所述方法还包括:基于所述多个位置点之间的相邻关系生成第一图结构;其中,所述第一图结构用于描述所述多个位置点之间的连接关系,且当所述第一位置点与所述第二位置点相邻时,所述第一位置点和所述第二位置点在所述第一图结构中通过一条边相连,所述第一图结构中包含第一类位置点和第二类位置点,所述第一类位置点中的每个位置点的属性中包含所述第一属性,所述第二类位置点中的每个位置点的属性中不包含所述第一属性;对所述第一图结构进行切割,得到第二图结构;其中,所述第一图结构中任意两个第一类位置点之间的边被切割,与第一类位置点相连的任意两个第二类位置点之间的边被切割,所述第二图结构中的每个位置点与至少一个位置点具有连接关系。Optionally, the method further includes: generating a first graph structure based on adjacent relationships between the multiple location points; wherein the first graph structure is used to describe connections between the multiple location points. relationship, and when the first position point is adjacent to the second position point, the first position point and the second position point are connected by an edge in the first graph structure, and the third position point A graph structure includes a first type of location point and a second type of location point. The attributes of each location point in the first type of location point include the first attribute. Each location point in the second type of location point includes the first attribute. The attributes of the position points do not contain the first attribute; the first graph structure is cut to obtain a second graph structure; wherein, the distance between any two first-type position points in the first graph structure The edges are cut, and the edges between any two second-type position points connected to the first-type position points are cut, and each position point in the second graph structure has a connection relationship with at least one position point.
其中,第一类位置点位属性中包含第一属性的位置点,上述第二类位置点为属性中不包含第一属性的位置点。其中,不包含第一属性应理解为属性中不存在第一属性或者第一属性不准确。第一属性的是否准确由具体场景设定,本申请不做具体展开。Among them, the first type of location points are location points whose attributes include the first attribute, and the above-mentioned second type of location points are location points whose attributes do not include the first attribute. Wherein, not including the first attribute should be understood to mean that the first attribute does not exist in the attributes or the first attribute is inaccurate. Whether the first attribute is accurate depends on the specific scenario, and this application will not elaborate on it in detail.
具体地,对于上述多个位置点而言,将相邻(具有相邻关系)的两个位置点之间用一条边相连,生成表征上述多个位置点之间连接关系的第一图结构。然后将第一图结构中所有第一类位置点之间的边进行切割,以及将与第一类位置点具有边关系(有一条边相连)的任意两个第二类位置点之间的边进行切割,得到第二图结构。Specifically, for the plurality of position points mentioned above, two adjacent position points (having an adjacent relationship) are connected with an edge to generate a first graph structure representing the connection relationship between the plurality of position points. Then cut the edges between all first-type position points in the first graph structure, and cut the edges between any two second-type position points that have an edge relationship with the first-type position points (connected by an edge). Carry out cutting to obtain the second picture structure.
可以看出,第二图结构是用于表征第二类位置点之间,以及第二类位置点和相邻的第一类位置点之间的连接关系。It can be seen that the second graph structure is used to represent the connection relationship between the second type of location points, and between the second type of location points and adjacent first type location points.
请参见图5A-图5C,图5A-图5C为本申请实施例提供的一种图结构的生成过程示意图。Please refer to Figures 5A-5C. Figures 5A-5C are schematic diagrams of a generation process of a graph structure provided by an embodiment of the present application.
图5A为一种第一图结构的具体示例,用于描述从特定区域内采集到的轨迹数据生成的多个位置点之间的相邻关系。如图5A所示,共包含14个位置点(即位置点0-13)。其中,虚线圆形表示第一类位置点(即位置点4-6、位置点8-13),实线圆形表示第二类位置点(即位置点0-3和位置点7),具有相邻关系的两个位置点之间有一条边相连。FIG. 5A is a specific example of a first graph structure, used to describe adjacent relationships between multiple position points generated from trajectory data collected in a specific area. As shown in Figure 5A, it contains a total of 14 position points (i.e. position points 0-13). Among them, the dotted circle represents the first type of location points (i.e., location points 4-6 and location points 8-13), and the solid line circle represents the second type of location points (i.e., location points 0-3 and location point 7), with There is an edge connecting the two position points of the adjacent relationship.
图5B用于描述对图5A所示的第一图结构进行切割的过程示意图。如图5B所示,依据前述切割方式的描述,其中虚线段指示将要进行切割的边。对图5B中所有第一类位置点之 间的边进行切割的过程,包括对位置点6和8之间、位置点6和9之间、位置点5和10之间、位置点5和11之间、位置点4和12之间、位置点4和13之间、位置点12和13之间的边进行切割。对与第一类位置点具有边关系的任意两个第二类位置点之间的边进行切割的过程,包括对位置点7和3之间的边进行切割。FIG. 5B is a schematic diagram describing the process of cutting the first image structure shown in FIG. 5A. As shown in FIG. 5B , according to the foregoing description of the cutting method, the dotted line segment indicates the edge to be cut. The process of cutting the edges between all first-type position points in Figure 5B includes cutting between position points 6 and 8, between position points 6 and 9, between position points 5 and 10, and between position points 5 and 11. Cut the edges between, between position points 4 and 12, between position points 4 and 13, and between position points 12 and 13. The process of cutting the edge between any two second-type position points that have an edge relationship with the first-type position point includes cutting the edge between position points 7 and 3.
图5C用于描述进行切割后得到的第二图结构的示例。如图5C所示,其中包含4个第二类位置点(即位置点0-3)和3个第一类位置点(即位置点4-6)。FIG. 5C is used to describe an example of the second graph structure obtained after cutting. As shown in Figure 5C, it contains 4 second-type position points (ie, position points 0-3) and 3 first-type position points (ie, position points 4-6).
在得到上述第二图结构后,可以基于第二图结构中的第一类位置点中每个位置点的属性中包含的第一属性和各位置点之间的连接关系来计算第二类位置点中每个位置点的属性中需要包含的第一属性。具体过程如下:After obtaining the above second graph structure, the second type of position can be calculated based on the first attribute contained in the attributes of each position point in the first type of position point in the second graph structure and the connection relationship between each position point. The first attribute that needs to be included in the attributes of each position point in the point. The specific process is as follows:
可选地,所述方法还包括:基于所述第二图结构中各位置点之间的连接关系,得到邻接距离矩阵;其中,所述邻接距离矩阵中的每个元素用于表征所述第二图结构中一个第二类位置点到一个第一类位置点之间边的数量;对所述邻接距离矩阵进行归一化处理,得到权重矩阵;将所述权重矩阵和第一矩阵相乘,得到第二矩阵;其中,所述第一矩阵中的元素用于表征所述第二图结构中每个第一类位置点的属性中包含的第一属性,所述第二矩阵中的元素用于表征所述第二图结构中每个第二类位置点的属性中包含的第一属性。Optionally, the method further includes: obtaining an adjacency distance matrix based on the connection relationship between each position point in the second graph structure; wherein each element in the adjacency distance matrix is used to characterize the first The number of edges between a second type position point and a first type position point in the two-graph structure; normalize the adjacency distance matrix to obtain a weight matrix; multiply the weight matrix and the first matrix , obtain the second matrix; wherein, the elements in the first matrix are used to characterize the first attributes contained in the attributes of each first type position point in the second graph structure, and the elements in the second matrix The first attribute used to characterize the attributes of each second type position point in the second graph structure.
具体地,在邻接距离矩阵中,每一行中的元素分别表示一个第二类位置点到每个第一类位置点的路径上(或称为一个第二类位置点与一个第一类位置点之间)所包含的边的数量。例如,第二特征图上包含3个第一类位置点,则一个第二类位置点分别与这3个第一类位置点的路径上包含边的数量(如2、3和4条),为邻接距离矩阵中的某一行元素。同理,也可以利用邻接距离矩阵中的每列元素来分别表示一个第二类位置点到每个第一类位置点的路径上所包含边的数量,本申请对此不限定。即,邻接距离矩阵中的每个元素用于表示一个第二类位置点到一个第一类位置点的路径上所包含边的数量。Specifically, in the adjacency distance matrix, the elements in each row respectively represent the path from a second type location point to each first type location point (or a second type location point and a first type location point). the number of edges contained between). For example, if the second feature map contains three first-category position points, then the path between a second-category position point and these three first-category position points contains the number of edges (such as 2, 3, and 4), is a row element in the adjacency distance matrix. In the same way, each column element in the adjacency distance matrix can also be used to respectively represent the number of edges contained on the path from a second type location point to each first type location point, and this application is not limited to this. That is, each element in the adjacency distance matrix is used to represent the number of edges contained on the path from a second-type location point to a first-type location point.
进一步,可选地,上述归一化处理指以邻接距离矩阵中的每行或每列元素为单位进行归一化处理。当邻接距离矩阵中每行元素表示一个第二类位置点到每个第一类位置点的路径上所包含的边的数量时,以每行元素为单位进行归一化处理;当邻接距离矩阵中每列元素表示一个第二类位置点到每个第一类位置点的路径上所包含的边的数量时,以每列元素为单位进行归一化处理。Further, optionally, the above-mentioned normalization processing refers to normalization processing in units of each row or column of elements in the adjacency distance matrix. When each row element in the adjacency distance matrix represents the number of edges contained on the path from a second-type position point to each first-type position point, normalization is performed in units of each row element; when the adjacency distance matrix When the elements in each column represent the number of edges contained on the path from a second-category position point to each first-category position point, normalization is performed in units of each column element.
例如,在以每行元素为单元进行归一化处理时,若其中一行元素为2、3、3,则归一化处理后,该行元素分别为1/4、3/8、3/8。For example, when normalizing each row of elements as a unit, if one of the row elements is 2, 3, and 3, then after normalization, the row elements will be 1/4, 3/8, and 3/8 respectively. .
其中,第一矩阵中的每行或每列元素用于表示每个第一类位置点的属性中包含的第一属性。第二矩阵中的每行或每列元素用于表示每个第二类位置点的属性中需要包含的第一属性。即,在计算出,每个第二类位置点的属性中需要包含的第一属性后,将计算出的第一属性加入位置点的属性中,进行存储。Wherein, each row or column element in the first matrix is used to represent the first attribute contained in the attributes of each first-type position point. Each row or column element in the second matrix is used to represent the first attribute that needs to be included in the attributes of each second type position point. That is, after calculating the first attribute that needs to be included in the attributes of each second type position point, the calculated first attribute is added to the attributes of the position point and stored.
下面基于图6,以第一属性为坐标进行举例,详细介绍第二矩阵的计算过程。此时,上述第一类位置点即为属性中包含坐标的位置点,第二类位置点即为属性中不包含或包含的坐标不准确的位置点。The following is an example based on Figure 6, taking the first attribute as the coordinate, and introduces the calculation process of the second matrix in detail. At this time, the above-mentioned first type of location point is the location point that contains coordinates in the attribute, and the second type of location point is the location point that is not included in the attribute or contains inaccurate coordinates.
如图6所示,邻接距离矩阵是基于图5A-图5C所示实施例中的第二图结构得到的,为一个4*3矩阵,其中的每行元素用于表示一个第二类位置点到每个第一类位置点的路径上所包含的边的数量。邻接距离矩阵左侧的0、1、2和3表示与每行元素对应的第二类位置点,邻 接距离矩阵上方的4、5和6表示与每列元素对应的第一类位置点。每个元素用于表示一个第二类位置点到一个第一类位置点的路径上所包含边的数量,例如,在该邻接距离矩阵中,第一行第一列的元素表示第二类位置点0到第一类位置点4的路径上所包含边的数量,即1,第三行第二列的元素表示第二类位置点2到第一类位置点5的路径上所包含边的数量,即2。As shown in Figure 6, the adjacency distance matrix is obtained based on the second graph structure in the embodiment shown in Figures 5A-5C. It is a 4*3 matrix, in which each row of elements is used to represent a second type of location point. The number of edges contained on the path to each first-type location point. 0, 1, 2 and 3 on the left side of the adjacency distance matrix represent the second type of position points corresponding to the elements in each row, and 4, 5 and 6 above the adjacency distance matrix represent the first type of position points corresponding to the elements in each column. Each element is used to represent the number of edges contained on the path from a second-category location point to a first-category location point. For example, in the adjacency distance matrix, the elements in the first row and first column represent the second-category location. The number of edges contained on the path from point 0 to first-category point 4, that is, 1. The elements in the third row and second column represent the number of edges contained on the path from second-category point 2 to first-category point 5. quantity, which is 2.
然后以邻接距离矩阵中的每行元素分别为单位,进行归一化处理,得到如图6所示的权重矩阵。Then, normalization is performed with each row element in the adjacency distance matrix as the unit, and the weight matrix shown in Figure 6 is obtained.
第一矩阵中每行元素用于表示一个第一类位置点中每个位置点的属性中包含的坐标(本例中以大地坐标系中经度和纬度表示,本例中省略高程)。即,本例中,3个第一类位置点的属性中包含的坐标分别为(21,28)、(42,56)、(21,14)。Each row of elements in the first matrix is used to represent the coordinates contained in the attributes of each location point in a first-type location point (in this example, represented by longitude and latitude in the geodetic coordinate system, and the elevation is omitted in this example). That is, in this example, the coordinates included in the attributes of the three first-type position points are (21, 28), (42, 56), and (21, 14) respectively.
最后,将权重矩阵与第一矩阵相乘,得到第二矩阵。其中,第二矩阵中的每行元素表示一个第二类位置点中每个位置点的属性中将要包含的坐标(即经度和纬度)。最后计算得到的4个第二类位置点的属性中需要包含的坐标分别为(30,34)、(24,26)、(28,33)、(30,42)。Finally, multiply the weight matrix with the first matrix to get the second matrix. Wherein, each row element in the second matrix represents the coordinates (ie, longitude and latitude) that will be included in the attributes of each location point in a second type location point. The coordinates that need to be included in the attributes of the four second-type position points finally calculated are (30, 34), (24, 26), (28, 33), and (30, 42) respectively.
通常,包含坐标的位置点为室外位置点,不包含坐标的位置点为室内位置点。因此,通过本申请中上述轨迹数据表征方法便可生成室内外位置地图,以为用户提供室内外定位和以室内外定位为基础的其它服务。Generally, location points that contain coordinates are outdoor location points, and location points that do not contain coordinates are indoor location points. Therefore, indoor and outdoor location maps can be generated through the above-mentioned trajectory data representation method in this application to provide users with indoor and outdoor positioning and other services based on indoor and outdoor positioning.
同理,可以参照上述坐标的补全过程,对其它属性进行补全,此处不再赘述。In the same way, you can refer to the above coordinate completion process to complete other attributes, which will not be described again here.
可选地,所述第一属性为所述第一位置点的坐标或室内外位置标识,所述室内外位置标识用于指示所述第一位置点为室内位置点或室外位置点。Optionally, the first attribute is the coordinates of the first location point or an indoor or outdoor location identifier, and the indoor or outdoor location identifier is used to indicate that the first location point is an indoor location point or an outdoor location point.
具体地,可以进行属性补全的属性包括但不限于位置点的坐标、室内外位置标识和采集到的特征信号等。Specifically, attributes that can be completed include but are not limited to coordinates of location points, indoor and outdoor location identifiers, and collected feature signals.
可选地,上述方法还包括:基于所述多个位置点中每个位置点的坐标进行位置定位、导航、大数据统计分析或服务推荐。Optionally, the above method further includes: performing location positioning, navigation, big data statistical analysis or service recommendation based on the coordinates of each location point in the plurality of location points.
具体地,在得到上述多个位置点中每个位置点的坐标后,便可基于每个位置点的坐标生成该多条轨迹数据所位于特定区域的地图(表征室内外位置)。进而在用户进入此特定区域后,可以向服务器发送定位请求,服务器即可基于生成的该特定区域地图为用户提供定位服务。Specifically, after obtaining the coordinates of each of the multiple location points, a map (representing indoor and outdoor locations) of a specific area where the multiple trajectory data is located can be generated based on the coordinates of each location point. Then, after the user enters this specific area, the user can send a positioning request to the server, and the server can provide the user with positioning services based on the generated map of the specific area.
进一步地,在提供定位服务的基础上,还可以基于用户请求,为用户提供导航和服务推荐(例如,推荐距离用户最近的商家)。Furthermore, in addition to providing positioning services, navigation and service recommendations can also be provided for users based on user requests (for example, recommending merchants closest to the user).
其中,上述大数据统计分析可以是基于表征后的区域或城市轨迹数据,构建城市级或区域级的热力图,以分析区域或城市的人流量等,本申请对此不限定。Among them, the above big data statistical analysis can be based on the characterized regional or city trajectory data to construct a city-level or regional-level heat map to analyze the regional or city's flow of people, etc. This application is not limited to this.
可选地,所述数量分布特征包含正态分布、泊松分布、离散分布或区间分布中的至少一种,所述区间分布用于描述位于不同区间内特征信号的数量。Optionally, the quantity distribution characteristics include at least one of normal distribution, Poisson distribution, discrete distribution or interval distribution, and the interval distribution is used to describe the quantity of characteristic signals located in different intervals.
进一步,可选地,统计每类特征信号的数量分布特征时所依据的指标可以是每类特征信号的信号强度等,本申请对此不限定。Furthermore, optionally, the index based on counting the quantity distribution characteristics of each type of characteristic signal may be the signal strength of each type of characteristic signal, etc., which is not limited in this application.
具体地,当采用区间分布表征一类特征信号的数量分布特征时,可以统计该类特征信号在不同信号强度区间内的数量,然后将信号强度区间的范围和对应数量加入位置点的属性进行存储。当采用正态分布表征一类特征信号的数量分布特征时,可以将依据信号强度统计得到的正态分布的均值和方差加入位置点属性中进行存储。Specifically, when interval distribution is used to characterize the quantity distribution characteristics of a type of characteristic signal, the number of characteristic signals of this type in different signal strength intervals can be counted, and then the range and corresponding number of the signal strength interval are added to the attributes of the location point for storage. . When the normal distribution is used to characterize the quantitative distribution characteristics of a type of feature signal, the mean and variance of the normal distribution obtained based on signal strength statistics can be added to the location point attributes for storage.
应当理解,还可以采用其它数学分布规律(如泊松分布、离散分布、伯努利分布等)来描述/表征每类特征信号的数量分布特征,本申请对此不限定。It should be understood that other mathematical distribution laws (such as Poisson distribution, discrete distribution, Bernoulli distribution, etc.) can also be used to describe/characterize the quantitative distribution characteristics of each type of characteristic signal, and this application is not limited to this.
可选地,所述第一位置点的属性还包含L个第一类特征信号,所述L个第一类特征信号符合所述第一类特征信号的数量分布特征。L为正整数。Optionally, the attributes of the first location point also include L first-type characteristic signals, and the L first-type characteristic signals conform to the quantity distribution characteristics of the first-type characteristic signals. L is a positive integer.
其中,第一类特征信号为上述至少一类特征信号中的任意一类特征信号。Wherein, the first type of characteristic signal is any type of characteristic signal among at least one type of characteristic signal mentioned above.
具体地,对于每类特征信号而言,位置点的属性中还会包含每类特征信号的部分原始数据,且该部分原始数据符合该类特征信号的数量分布特征。从而使得基于位置点和属性的存储方式可以最大程度地保留多条轨迹数据的原始数据特征。Specifically, for each type of characteristic signal, the attributes of the location point also include part of the original data of each type of characteristic signal, and this part of the original data conforms to the quantity distribution characteristics of the type of characteristic signal. As a result, the storage method based on location points and attributes can retain the original data characteristics of multiple trajectory data to the greatest extent.
例如,第一类特征信号为WIFI信号,第一位置点对应的一个或多个时间段内采集的WIFI信号的数量为5000个。该5000个WIFI信号的数量分布特征采用正态分布表示,即用正态分布表示5000个信号分别对应的5000个信号强度的正态分布特征。此时,可根据具体场景需求,从5000个WIFI信号中选出100个,且这100个WIFI信号的信号强度分布符合该5000个WIFI信号的信号强度的正态分布,并将这100个WIFI信号作为第一位置点的属性,一同存储。For example, the first type of characteristic signal is a WIFI signal, and the number of WIFI signals collected in one or more time periods corresponding to the first location point is 5,000. The quantity distribution characteristics of the 5,000 WIFI signals are represented by a normal distribution, that is, the normal distribution is used to represent the normal distribution characteristics of the 5,000 signal strengths corresponding to the 5,000 signals. At this time, according to the specific scene requirements, 100 WIFI signals can be selected from 5,000 WIFI signals, and the signal strength distribution of these 100 WIFI signals conforms to the normal distribution of the signal strengths of the 5,000 WIFI signals, and these 100 WIFI signals The signal is stored together as an attribute of the first position point.
再例如,当采用区间分布来描述上述5000个WIIF信号的数量分布特征时,首先统计不同信号强度区间范围内的WIFI信号的数量,假设,信号强度在[-10,0]dbm、[0,10]dbm、[10,20]dbm、[20,30]dbm中的数量占比分别为10%、20%、40%、30%。则从5000个WIFI信号中筛选出加入到位置点的属性一同存储的100个原始数据,其数量分布也符合上述四个信号强度区间的分布规律。For another example, when using interval distribution to describe the quantity distribution characteristics of the above 5000 WIIF signals, first count the number of WIFI signals in different signal strength intervals. Assume that the signal strength is between [-10, 0]dbm, [0, The quantity proportions in 10]dbm, [10,20]dbm, and [20,30]dbm are 10%, 20%, 40%, and 30% respectively. Then, 100 original data that are added to the attributes of the location point and stored together are selected from 5,000 WIFI signals, and their quantity distribution also conforms to the distribution rules of the above four signal strength intervals.
同理,当第一类特征信号还采其它数量分布特征进行表示时,筛选出作为第一位置点的属性进行存储的第一类特征信号,也须符合相应的其它数量分布特征。In the same way, when the first type of characteristic signal is also represented by other quantity distribution characteristics, the first type of characteristic signal that is selected and stored as an attribute of the first position point must also conform to the corresponding other quantity distribution characteristics.
综上,第一位置点的属性可以包含下列三种中的至少一种。To sum up, the attributes of the first position point may include at least one of the following three types.
(1)上述至少一类特征信号中每类特征信号的数量。(1) The number of each type of characteristic signal in at least one type of characteristic signal mentioned above.
(2)上述至少一类特征信号中每类特征信号的数量分布特征。(2) The quantity distribution characteristics of each type of characteristic signal in at least one type of characteristic signal mentioned above.
(3)上述至少一类特征信号中每类特征信号中的部分特征信号,该部分特征信号符合其所属类别特征信号的数量分布特征。(3) Part of the characteristic signals of each type of characteristic signal in at least one type of characteristic signal mentioned above, which part of the characteristic signal conforms to the quantitative distribution characteristics of the characteristic signal of the type to which it belongs.
下面基于表1描述采用本申请中轨迹数据表征方法进行数据表征和存储时,所带来的技术效果。表1用于表示一周内和一个月内,采用本申请中轨迹数据表征方法后,南京某地区内轨迹数据的压缩倍率。The technical effects brought about by using the trajectory data characterization method in this application for data characterization and storage are described below based on Table 1. Table 1 is used to represent the compression ratio of trajectory data in a certain area of Nanjing within one week and one month after adopting the trajectory data characterization method in this application.
其中,表1中的轨迹点数即为对采集到的的多条轨迹数据进行划分后,得到的时间段的数量。压缩倍率是由轨迹点数量除以位置点数量得到的,M表示计数单位—百万million。Among them, the number of trajectory points in Table 1 is the number of time periods obtained after dividing the collected multiple trajectory data. The compression ratio is obtained by dividing the number of trajectory points by the number of position points, and M represents the counting unit—million.
如表1所示,对于一周内采集的轨迹数据而言,移动运营商中,轨迹点数量为113.59M个,在使用本申请方法进行轨迹数据表征后,只需要存储0.94M个位置点,压缩倍率达到了120.29倍。同理,一周内,对于联通和电信运营商而言,进行轨迹数据表征后,压缩倍率也分别达到了79.55倍和128.44倍。As shown in Table 1, for the trajectory data collected within a week, the number of trajectory points among mobile operators is 113.59M. After using the method of this application to characterize the trajectory data, only 0.94M location points need to be stored and compressed. The magnification reached 120.29 times. Similarly, within a week, for China Unicom and telecom operators, after characterization of trajectory data, the compression ratio reached 79.55 times and 128.44 times respectively.
此外,从表1中还可以看出,随着采集时间的增加,压缩倍率也会不断提高。一个月内,移动、联通和电信运营商的采集的轨迹数据而言,在进行轨迹数据表征后,压缩倍率分别达到了238.64倍、142.44倍、215.02倍。In addition, it can be seen from Table 1 that as the acquisition time increases, the compression ratio will continue to increase. Within a month, for the trajectory data collected by China Mobile, China Unicom and telecom operators, after characterization of the trajectory data, the compression ratio reached 238.64 times, 142.44 times and 215.02 times respectively.
采用本申请中轨迹数据表征方法达到数据压缩技术效果的原因主要是:The main reasons for using the trajectory data representation method in this application to achieve data compression technical effects are:
(1)位置点—属性的表征方式。轨迹数据容易产生过积累,尤其是对于利用众包的方式采集到的轨迹数据,众包数据本身数据量大,当数据积累到一定程度之后,已经积累的数据 已经完全反映众包数据本身所能表达的全部数据特征,此时再增加数据量对数据分布、特征已无影响,这是就达到了众包数据积累的边界,此时不断积累增加的数据,是冗余的。本申请的表征方法中使用属性(数量分布特征和数量等)去表征局部范围内的多类特征信号,并有限制的存储部分原始数据,不但均衡了众包数据,也同时避免了数据众包数据局部和信号类别上的过渡积累。从而达到压缩效果。(1) Position point—the representation method of attributes. Trajectory data is prone to over-accumulation, especially for trajectory data collected through crowdsourcing. Crowdsourcing data itself has a large amount of data. When the data is accumulated to a certain extent, the accumulated data has fully reflected the capabilities of the crowdsourcing data itself. For all the data characteristics expressed, increasing the amount of data at this time has no impact on the data distribution and characteristics. This has reached the boundary of crowdsourcing data accumulation. At this time, the continuous accumulation of increasing data is redundant. In the characterization method of this application, attributes (quantity distribution characteristics and quantities, etc.) are used to characterize multiple types of characteristic signals in a local range, and limited storage of part of the original data not only balances the crowdsourcing data, but also avoids data crowdsourcing. Transition accumulation over data localities and signal categories. Thus achieving the compression effect.
(2)位置点的有限性。一个区域内位置点的数量是有限的,随着采集时间不断增加,数据也将线性增加。线性增加的数据会转换为数量分布特征和数量等属性,以压缩存储到有限的位置点内,从而压缩倍率也会不断增加。(2) Limitation of location points. The number of location points in an area is limited, and as the collection time continues to increase, the data will also increase linearly. The linearly increasing data will be converted into attributes such as quantity distribution characteristics and quantity, and then compressed and stored into limited location points, so the compression ratio will continue to increase.
表1:不同采集时间段内轨迹数据的压缩倍率表Table 1: Compression ratio table of trajectory data in different collection time periods
4G运营商4G operator 累计天数Cumulative days 位置点数量(M)Number of location points (M) 轨迹点数量(M)Number of track points (M) 压缩倍率Compression ratio
移动move 77 944292944292 113592680113592680 120.29120.29
联通 China Unicom 77 319097319097 2538551025385510 79.5579.55
电信 telecommunications 77 587243587243 7542682875426828 128.44128.44
移动 move 3030 23631362363136 508115826508115826 215.02215.02
联通 China Unicom 3030 835072835072 118943509118943509 142.44142.44
电信 telecommunications 3030 14146251414625 337584951337584951 238.64238.64
请参见图7A-图7B。图7A-图7B为基于本申请的轨迹数据表征方法提取出的室内外空间拓扑结构示意图。See Figures 7A-7B. Figures 7A-7B are schematic diagrams of indoor and outdoor spatial topological structures extracted based on the trajectory data representation method of the present application.
如图7A所示,图7A为提取出的室外的空间拓扑结构。图7A左侧表示采集的轨迹数据中所包含的所有轨迹点(即轨迹数据包含的每个时间段对应的位置点)在地图上的分布。轨迹点越密集,说明人流量密度越高。通常,人流密度最高的就是路,整个室外轨迹数据,实质上会倾向于对路网的表征。图7A中左图是室外轨迹点的位置信息展示,整体和路网相符,而图7A右侧是左图栅格化后的室外热度统计图,显示了整体路网情况。As shown in Figure 7A, Figure 7A shows the extracted outdoor spatial topology. The left side of Figure 7A shows the distribution of all trajectory points included in the collected trajectory data (that is, the position points corresponding to each time period included in the trajectory data) on the map. The denser the trajectory points are, the higher the density of human flow is. Usually, roads have the highest density of people flow, and the entire outdoor trajectory data will essentially tend to represent the road network. The left image in Figure 7A shows the location information of outdoor track points, which is consistent with the road network. The right side of Figure 7A shows the outdoor heat statistics map after rasterization of the left image, showing the overall road network situation.
如图7B所示,图7B为提取出的室内的空间拓扑结构。基于每个区域室内轨迹点的分布,可以找到人流密集的热点室内区域。这些热点区域后续室内定位、楼层识别提供基础。As shown in Figure 7B, Figure 7B shows the extracted indoor spatial topology. Based on the distribution of indoor trajectory points in each area, hotspot indoor areas with dense human flow can be found. These hotspot areas provide the basis for subsequent indoor positioning and floor identification.
请参见图8,图8为本申请实施例提供的一种轨迹数据的表征装置结构示意图,可以用于执行前述方法实施例中的各步骤。如图8所示,该装置包括获取单元810和处理单元820。其中,Please refer to Figure 8. Figure 8 is a schematic structural diagram of a trajectory data characterization device provided by an embodiment of the present application, which can be used to perform each step in the foregoing method embodiment. As shown in Figure 8, the device includes an acquisition unit 810 and a processing unit 820. in,
获取单元810,用于获取多条轨迹数据,所述多条轨迹数据中的每条轨迹数据包含多个时间段,以及所述多个时间段中每个时间段内采集的特征信号。The acquisition unit 810 is configured to acquire multiple pieces of trajectory data, each of the multiple pieces of trajectory data including multiple time periods, and the characteristic signals collected in each of the multiple time periods.
处理单元820,用于基于所述多条轨迹数据生成多个位置点以及所述多个位置点中每个位置点的属性。The processing unit 820 is configured to generate multiple location points and attributes of each location point in the multiple location points based on the multiple trajectory data.
其中,所述多个位置点中包含第一位置点,所述第一位置点在所述多条轨迹数据中对应一个或多个时间段,所述一个或多个时间段中每个时间段的组合标识相同,所述每个时间段的组合标识由所述每个时间段对应的至少一个特征信号的标识ID生成,所述第一位置点由所述组合标识进行表征;所述一个或多个时间段内采集的特征信号包含至少一类特征信号,所述第一位置点的属性包含所述至少一类特征信号中每类特征信号的数量和/或每类特征信号的数量分布特征。Wherein, the plurality of position points include a first position point, the first position point corresponds to one or more time periods in the plurality of trajectory data, and each time period in the one or more time periods The combination identification is the same, the combination identification of each time period is generated by the identification ID of at least one characteristic signal corresponding to each time period, the first position point is characterized by the combination identification; the one or The characteristic signals collected in multiple time periods include at least one type of characteristic signal, and the attributes of the first location point include the number of each type of characteristic signal in the at least one type of characteristic signal and/or the quantity distribution characteristics of each type of characteristic signal. .
在一种可行的实施方式中,所述一个或多个时间段中包含第一时间段,所述第一时间段内包含M个特征信号,所述M个特征信号中包含C类特征信号,M和C为正整数;所述组合标识由所述M个特征信号中的N个特征信号的ID生成,所述N个特征信号中包含D类特征信号,N为小于或等于M的正整数,D为小于或等于C的正整数。In a feasible implementation, the one or more time periods include a first time period, the first time period includes M characteristic signals, and the M characteristic signals include Class C characteristic signals, M and C are positive integers; the combination identification is generated by the ID of N characteristic signals among the M characteristic signals, the N characteristic signals include D-type characteristic signals, and N is a positive integer less than or equal to M. , D is a positive integer less than or equal to C.
在一种可行的实施方式中,所述C类特征信号中包括第一类特征信号;所述M个特征信号中包含A个所述第一类特征信号,所述N个特征信号包含B个第一类特征信号,A个所述第一类特征信号中包含所述B个第一类特征信号,所述B个第一类特征信号的信号强度大于A个所述第一类特征信号中其它特征信号的信号强度;其中,B为小于或等于A的正整数。In a feasible implementation, the C-type characteristic signals include first-type characteristic signals; the M characteristic signals include A of the first-type characteristic signals, and the N characteristic signals include B The first type of characteristic signals, A of the first type of characteristic signals include the B of the first type of characteristic signals, the signal strength of the B of the first type of characteristic signals is greater than that of the A of the first type of characteristic signals The signal strength of other characteristic signals; where B is a positive integer less than or equal to A.
在一种可行的实施方式中,所述第一位置点的属性还包含L个第一类特征信号,所述L个第一类特征信号符合所述第一类特征信号的数量分布特征。In a feasible implementation, the attributes of the first location point also include L first-type characteristic signals, and the L first-type characteristic signals conform to the quantity distribution characteristics of the first-type characteristic signals.
在一种可行的实施方式中,所述多个位置点中还包含第二位置点,表征所述第二位置点的组合标识由E个特征信号的ID生成;当所述第一位置点和所述第二位置点满足第一条件、第二条件或第三条件中的一个或多个时,所述第一位置点与所述第二位置点相邻;其中,所述第一条件为:所述E个特征信号和所述N个特征信号中包含共同的G个特征信号,G和E的比值、G和N的比值都大于或等于第一阈值;且所述E个特征信号中的第一信号的平均信号强度与所述N个特征信号中第一信号的平均信号强度的差值小于或等于第二阈值,所述第一信号为所述G个特征信号中的任意一个;所述第二条件为:所述第一位置点在所述多条轨迹数据中对应H个时间段,第二位置点在所述多条轨迹数据中对应I个时间段,所述H个时间段中包含J个时间段,J与H的比值大于或等于第三阈值,对于所述J个时间段中的第二时间段,所述I个时间段内包含第三时间段,所述第二时间段和所述第三时间段位于同一轨迹数据上,且所述第二时间段和所述第三时间段的间隔小于或等于第四阈值,所述第二时间段为所述J个时间段内的任一时间段;所述第三条件为:所述第一位置点与所述第二位置点之间的距离小于或等于第五阈值,所述第一位置点和所述第二位置点之间的距离通过所述第一位置点的坐标和所述第二位置点的坐标计算得到,所述第一位置点的坐标通过对所述一个或多个时间段内采集到的位置坐标进行求和并取均值得到。In a feasible implementation, the plurality of position points also includes a second position point, and the combined identifier characterizing the second position point is generated by the ID of E characteristic signals; when the first position point and When the second position point satisfies one or more of the first condition, the second condition or the third condition, the first position point is adjacent to the second position point; wherein the first condition is : The E characteristic signals and the N characteristic signals include common G characteristic signals, the ratio of G and E, and the ratio of G and N are both greater than or equal to the first threshold; and among the E characteristic signals The difference between the average signal strength of the first signal and the average signal strength of the first signal among the N characteristic signals is less than or equal to the second threshold, and the first signal is any one of the G characteristic signals; The second condition is: the first location point corresponds to H time periods in the multiple trajectory data, the second location point corresponds to 1 time period in the multiple trajectory data, and the H time periods The segment contains J time segments, and the ratio of J to H is greater than or equal to the third threshold. For the second time segment among the J time segments, the I time segment includes the third time segment, and the third time segment The second time period and the third time period are located on the same trajectory data, and the interval between the second time period and the third time period is less than or equal to the fourth threshold, and the second time period is the J Any time period within the time period; the third condition is: the distance between the first position point and the second position point is less than or equal to the fifth threshold, the first position point and the third position point are The distance between the two position points is calculated by the coordinates of the first position point and the coordinates of the second position point. The coordinates of the first position point are calculated by comparing the coordinates collected in the one or more time periods. The position coordinates are summed and averaged.
在一种可行的实施方式中,所述第一位置点的属性中包含第一属性,所述处理单元820还用于:基于所述多个位置点之间的相邻关系生成第一图结构;其中,所述第一图结构用于描述所述多个位置点之间的连接关系,且当所述第一位置点与所述第二位置点相邻时,所述第一位置点和所述第二位置点在所述第一图结构中通过一条边相连,所述第一图结构中包含第一类位置点和第二类位置点,所述第一类位置点中的每个位置点的属性中包含所述第一属性,所述第二类位置点中的每个位置点的属性中不包含所述第一属性;对所述第一图结构进行切割,得到第二图结构;其中,所述第一图结构中任意两个所述第一类位置点之间的边被切割,与所述第一类位置点相连的任意两个所述第二类位置点之间的边被切割,所述第二图结构中的每个位置点与至少一个位置点具有连接关系。In a feasible implementation, the attributes of the first location point include a first attribute, and the processing unit 820 is further configured to: generate a first graph structure based on the adjacent relationship between the multiple location points. ; Wherein, the first graph structure is used to describe the connection relationship between the plurality of position points, and when the first position point is adjacent to the second position point, the first position point and The second location points are connected by an edge in the first graph structure. The first graph structure includes first type location points and second type location points. Each of the first type location points The attributes of the position points include the first attribute, and the attributes of each position point in the second type of position points do not include the first attribute; cut the first graph structure to obtain the second graph Structure; wherein, the edge between any two of the first type position points in the first graph structure is cut, and the edge between any two of the second type of position points connected to the first type of position point is The edges are cut, and each position point in the second graph structure has a connection relationship with at least one position point.
在一种可行的实施方式中,所述处理单元820还用于:基于所述第二图结构中各位置点之间的连接关系,得到邻接距离矩阵;其中,所述邻接距离矩阵中的每个元素用于表征所述第二图结构中一个所述第二类位置点到一个所述第一类位置点之间边的数量;对所述邻接距离矩阵进行归一化处理,得到权重矩阵;将所述权重矩阵和第一矩阵相乘,得到第二矩阵;其中,所述第一矩阵中的元素用于表征所述第二图结构中每个所述第一类位置点的属性中包含的第一属性,所述第二矩阵中的元素用于表征所述第二图结构中每个所述第二类位置点的属性中包含的第一属性。In a feasible implementation, the processing unit 820 is further configured to: obtain an adjacency distance matrix based on the connection relationship between each location point in the second graph structure; wherein each element in the adjacency distance matrix elements are used to represent the number of edges between a second type position point and a first type position point in the second graph structure; normalize the adjacency distance matrix to obtain a weight matrix ; Multiply the weight matrix and the first matrix to obtain a second matrix; wherein the elements in the first matrix are used to characterize the attributes of each first type position point in the second graph structure. The elements in the second matrix are used to characterize the first attributes included in the attributes of each second type position point in the second graph structure.
在一种可行的实施方式中,所述第一属性为所述第一位置点的坐标或室内外位置标识,所述室内外位置标识用于指示所述第一位置点为室内位置点或室外位置点。In a feasible implementation, the first attribute is the coordinates of the first location point or an indoor or outdoor location identifier, and the indoor or outdoor location identifier is used to indicate that the first location point is an indoor location point or an outdoor location. location point.
在一种可行的实施方式中,所述处理单元820还用于:基于所述多个位置点中每个位置点的坐标进行位置定位、导航、大数据统计分析或服务推荐。In a feasible implementation, the processing unit 820 is further configured to perform location positioning, navigation, big data statistical analysis, or service recommendation based on the coordinates of each location point in the plurality of location points.
在一种可行的实施方式中,所述数量分布特征包含正态分布、泊松分布、离散分布或区间分布中的至少一种,所述区间分布用于描述位于不同区间内特征信号的数量。In a feasible implementation, the quantity distribution characteristics include at least one of normal distribution, Poisson distribution, discrete distribution or interval distribution, and the interval distribution is used to describe the number of characteristic signals located in different intervals.
在一种可行的实施方式中,所述特征信号包含全球导航卫星系统GNSS信号、位置坐标、射频信号、光学信号、声学信号、传感器信号或地磁信号中的一种或多种,其中,所述射频信号包括WIFI信号、蓝牙信号、小区CELL信号或超宽频UWB信号中的一种或多种。In a feasible implementation, the characteristic signal includes one or more of GNSS signals, position coordinates, radio frequency signals, optical signals, acoustic signals, sensor signals or geomagnetic signals, wherein, Radio frequency signals include one or more of WIFI signals, Bluetooth signals, cell CELL signals or ultra-wideband UWB signals.
具体地,轨迹数据的表征装置中各单元的具体执行过程可以参见前述图3方法实施例中的具体步骤,此处不再赘述。Specifically, the specific execution process of each unit in the trajectory data characterization device can be referred to the specific steps in the method embodiment of FIG. 3, and will not be described again here.
请参见图9,图9为本申请提供的一种计算机设备的结构示意图,其可以是图1实施例中云端110中的服务器或用户设备。该计算机设备包含存储器901、一个或多个(图中仅示出一个)处理器902、接口电路903以及总线904。其中,存储器901、处理器902、接口电路903通过总线904实现彼此之间的通信连接。Please refer to FIG. 9 . FIG. 9 is a schematic structural diagram of a computer device provided by the present application. It may be a server or user device in the cloud 110 in the embodiment of FIG. 1 . The computer device includes a memory 901, one or more (only one is shown in the figure) processors 902, an interface circuit 903 and a bus 904. Among them, the memory 901, the processor 902, and the interface circuit 903 implement communication connections between each other through the bus 904.
处理器902用于通过接口电路903获取多条轨迹数据,所述多条轨迹数据中的每条轨迹数据包含多个时间段,以及所述多个时间段中每个时间段内采集的特征信号。并基于所述多条轨迹数据生成多个位置点以及所述多个位置点中每个位置点的属性;其中,所述多个位置点中包含第一位置点,所述第一位置点在所述多条轨迹数据中对应一个或多个时间段,所述一个或多个时间段中每个时间段的组合标识相同,所述每个时间段的组合标识由所述每个时间段内所采集的至少一个特征信号的标识ID生成,所述第一位置点由所述组合标识进行表征;所述一个或多个时间段内采集的特征信号包含至少一类特征信号,所述第一位置点的属性包含所述至少一类特征信号中每类特征信号的数量和/或每类特征信号的数量分布特征。The processor 902 is configured to obtain multiple pieces of trajectory data through the interface circuit 903. Each piece of trajectory data in the multiple pieces of trajectory data includes multiple time periods, and the characteristic signals collected in each of the multiple time periods. . And generate a plurality of position points and attributes of each position point in the plurality of position points based on the plurality of trajectory data; wherein the plurality of position points include a first position point, and the first position point is at The multiple pieces of trajectory data correspond to one or more time periods, the combination identifier of each time period in the one or more time periods is the same, and the combination identifier of each time period is determined by the number of times in each time period. The identification ID of at least one characteristic signal collected is generated, and the first location point is characterized by the combined identification; the characteristic signal collected in one or more time periods includes at least one type of characteristic signal, and the first location point is characterized by the combined identification. The attributes of the location points include the number of each type of characteristic signals in the at least one type of characteristic signals and/or the quantity distribution characteristics of each type of characteristic signals.
存储器901可以用于存储上述多个位置点以及多个位置点中每个位置点的属性。The memory 901 may be used to store the above-mentioned plurality of location points and attributes of each of the plurality of location points.
具体地,计算机设备执行轨迹数据的表征方法的具体过程可以参见图3方法实施例中的各个步骤,此处不再赘述。Specifically, the specific process of the computer device performing the characterization method of trajectory data can be referred to the various steps in the method embodiment of FIG. 3, which will not be described again here.
存储器901可以是随机存取存储器(Random Access Memory,RAM)、只读存储器(Read-Only Memory,ROM)或闪存(Flash Memory)等存储器中的任意一种;其中,RAM包括静态随机存储器(Static RAM,SRAM)和动态随机存储器(Dynamic RAM,DRAM)等,ROM包括可擦除可编程ROM(Erasable Programmable ROM,EPROM)和电可擦可编程只读存储器(Electrically Erasable Programmable ROM,EEPROM)等。存储器901可以存储程序,当存储器901中存储的程序被处理器902执行时,处理器902和接口电路903用于执行本申请实施例的轨迹数据的表征方法的各个步骤。The memory 901 can be any one of random access memory (Random Access Memory, RAM), read-only memory (Read-Only Memory, ROM) or flash memory (Flash Memory); wherein, RAM includes static random access memory (Static random access memory). RAM, SRAM) and dynamic random access memory (Dynamic RAM, DRAM), etc. ROM includes erasable programmable ROM (Erasable Programmable ROM, EPROM) and electrically erasable programmable read-only memory (Electrically Erasable Programmable ROM, EEPROM), etc. The memory 901 can store programs. When the program stored in the memory 901 is executed by the processor 902, the processor 902 and the interface circuit 903 are used to execute various steps of the trajectory data characterization method in the embodiment of the present application.
处理器902可以是通用的中央处理器(central processing unit,CPU),微处理器,应用专用集成电路(application specific integrated circuit,ASIC),图形处理器(graphics processing unit,GPU)或者一个或多个集成电路等,用于执行相关程序,以实现本申请实施例的轨迹数据的表征装置中的各单元所需执行的功能,或者执行本申请方法实施例的轨迹数据的表征方法。The processor 902 may be a general central processing unit (CPU), a microprocessor, an application specific integrated circuit (ASIC), a graphics processing unit (GPU), or one or more Integrated circuits, etc., are used to execute relevant programs to realize the functions required to be performed by each unit in the trajectory data characterization device according to the embodiment of the present application, or to execute the trajectory data characterization method according to the method embodiment of the present application.
接口电路903使用例如但不限于收发器一类的收发装置,来实现计算机设备与其他设备或通信网络之间的通信。例如,可以通过接口电路903从智能终端(用户设备)获取轨迹数据。The interface circuit 903 uses a transceiver device such as, but not limited to, a transceiver to implement communication between the computer device and other devices or communication networks. For example, trajectory data can be obtained from a smart terminal (user equipment) through the interface circuit 903.
总线904可包括在图9所示的计算机设备上各个部件(例如,存储器901、处理器902、接口电路903)之间传送信息的通路。 Bus 904 may include a path for transmitting information between various components (eg, memory 901, processor 902, interface circuit 903) on the computer device shown in Figure 9.
本申请实施例提供了一种芯片系统,所述芯片系统包括至少一个处理器,存储器和接口电路,所述存储器、所述接口电路和所述至少一个处理器通过线路互联,所述至少一个存储器中存储有指令;所述指令被所述处理器执行时,上述方法实施例中记载的任意一种的部分或全部步骤得以实现。The embodiment of the present application provides a chip system. The chip system includes at least one processor, a memory and an interface circuit. The memory, the interface circuit and the at least one processor are interconnected through lines. The at least one memory Instructions are stored in; when the instructions are executed by the processor, some or all of the steps described in any of the above method embodiments can be realized.
本申请实施例提供了一种计算机存储介质,所述计算机存储介质存储有计算机程序,该计算机程序被执行时,使得上述方法实施例中记载的任意一种的部分或全部步骤得以实现。Embodiments of the present application provide a computer storage medium. The computer storage medium stores a computer program. When the computer program is executed, some or all of the steps described in any of the above method embodiments can be realized.
本申请实施例提供了一种计算机程序产品,所述计算机程序产品包括程序指令,当所述程序指令在计算机上运行时,上述方法实施例中记载的任意一种的部分或全部步骤得以实现。Embodiments of the present application provide a computer program product. The computer program product includes program instructions. When the program instructions are run on a computer, some or all of the steps described in any of the above method embodiments can be implemented.
在上述实施例中,对各个实施例的描述都各有侧重,某个实施例中没有详述的部分,可以参见其它实施例的相关描述。需要说明的是,对于前述的各方法实施例,为了简单描述,故将其都表述为一系列的动作组合,但是本领域技术人员应该知悉,本申请并不受所描述的动作顺序的限制,因为依据本申请,某些步骤可能可以采用其它顺序或者同时进行。其次,本领域技术人员也应该知悉,说明书中所描述的实施例均属于优选实施例,所涉及的动作和模块并不一定是本申请所必须的。In the above embodiments, each embodiment is described with its own emphasis. For parts that are not described in detail in a certain embodiment, please refer to the relevant descriptions of other embodiments. It should be noted that for the sake of simple description, the foregoing method embodiments are expressed as a series of action combinations. However, those skilled in the art should know that the present application is not limited by the described action sequence. Because according to this application, certain steps may be performed in other orders or simultaneously. Secondly, those skilled in the art should also know that the embodiments described in the specification are preferred embodiments, and the actions and modules involved are not necessarily necessary for this application.
在本申请所提供的几个实施例中,应该理解到,所揭露的装置,可通过其它的方式实现。例如,以上所描述的装置实施例仅仅是示意性的,例如上述单元的划分,仅仅为一种逻辑功能划分,实际实现时可以有另外的划分方式,例如多个单元或组件可以结合或者可以集成到另一个系统,或一些特征可以忽略,或不执行。另一点,所显示或讨论的相互之间的耦合或直接耦合或通信连接可以是通过一些接口,装置或单元的间接耦合或通信连接,可以是电性或其它的形式。In the several embodiments provided in this application, it should be understood that the disclosed device can be implemented in other ways. For example, the device embodiments described above are only illustrative. For example, the division of the above units is only a logical function division. In actual implementation, there may be other division methods. For example, multiple units or components may be combined or integrated. to another system, or some features can be ignored, or not implemented. On the other hand, the coupling or direct coupling or communication connection between each other shown or discussed may be through some interfaces, and the indirect coupling or communication connection of the devices or units may be in electrical or other forms.
上述作为分离部件说明的单元可以是或者也可以不是物理上分开的,作为单元显示的部件可以是或者也可以不是物理单元,即可以位于一个地方,或者也可以分布到多个网络单元上。可以根据实际的需要选择其中的部分或者全部单元来实现本实施例方案的目的。The units described above as separate components may or may not be physically separated. The components shown as units may or may not be physical units, that is, they may be located in one place, or they may be distributed to multiple network units. Some or all of the units can be selected according to actual needs to achieve the purpose of the solution of this embodiment.
以上所述,以上实施例仅用以说明本申请的技术方案,而非对其限制;尽管参照前述实施例对本申请进行了详细的说明,本领域的普通技术人员应当理解:其依然可以对前述各实施例所记载的技术方案进行修改,或者对其中部分技术特征进行等同替换;而这些修改或者替换,并不使相应技术方案的本质脱离本申请各实施例技术方案的精神和范围。As mentioned above, the above embodiments are only used to illustrate the technical solution of the present application, but not to limit it. Although the present application has been described in detail with reference to the foregoing embodiments, those of ordinary skill in the art should understand that they can still make the foregoing technical solutions. The technical solutions described in each embodiment may be modified, or some of the technical features may be equivalently replaced; however, these modifications or substitutions do not cause the essence of the corresponding technical solutions to deviate from the spirit and scope of the technical solutions in each embodiment of the present application.

Claims (27)

  1. 一种轨迹数据的表征方法,其特征在于,所述方法包括:A method for characterization of trajectory data, characterized in that the method includes:
    获取多条轨迹数据,所述多条轨迹数据中的每条轨迹数据包含多个时间段,以及所述多个时间段中每个时间段内采集的特征信号;Acquire multiple pieces of trajectory data, each of the multiple pieces of trajectory data includes multiple time periods, and characteristic signals collected in each of the multiple time periods;
    基于所述多条轨迹数据生成多个位置点以及所述多个位置点中每个位置点的属性;Generate multiple location points and attributes of each location point in the multiple location points based on the multiple trajectory data;
    其中,所述多个位置点中包含第一位置点,所述第一位置点在所述多条轨迹数据中对应一个或多个时间段,所述一个或多个时间段中每个时间段的组合标识相同,所述每个时间段的组合标识由所述每个时间段对应的至少一个特征信号的标识ID生成,所述第一位置点由所述组合标识进行表征;所述一个或多个时间段内采集的特征信号包含至少一类特征信号,所述第一位置点的属性包含所述至少一类特征信号中每类特征信号的数量和/或每类特征信号的数量分布特征。Wherein, the plurality of position points include a first position point, the first position point corresponds to one or more time periods in the plurality of trajectory data, and each time period in the one or more time periods The combination identification of each time period is the same, the combination identification of each time period is generated by the identification ID of at least one characteristic signal corresponding to each time period, the first position point is characterized by the combination identification; the one or The characteristic signals collected in multiple time periods include at least one type of characteristic signal, and the attributes of the first location point include the number of each type of characteristic signal in the at least one type of characteristic signal and/or the quantity distribution characteristics of each type of characteristic signal. .
  2. 根据权利要求1所述的方法,其特征在于,The method according to claim 1, characterized in that:
    所述一个或多个时间段中包含第一时间段,所述第一时间段内包含M个特征信号,所述M个特征信号中包含C类特征信号,M和C为正整数;The one or more time periods include a first time period, the first time period includes M characteristic signals, the M characteristic signals include C-type characteristic signals, and M and C are positive integers;
    所述组合标识由所述M个特征信号中的N个特征信号的ID生成,所述N个特征信号中包含D类特征信号,N为小于或等于M的正整数,D为小于或等于C的正整数。The combined identification is generated by the ID of N characteristic signals among the M characteristic signals. The N characteristic signals include D type characteristic signals. N is a positive integer less than or equal to M, and D is less than or equal to C. positive integer.
  3. 根据权利要求2所述的方法,其特征在于,所述C类特征信号中包括第一类特征信号,The method according to claim 2, characterized in that the type C characteristic signal includes a first type characteristic signal,
    所述M个特征信号中包含A个所述第一类特征信号,所述N个特征信号包含B个第一类特征信号,A个所述第一类特征信号中包含所述B个第一类特征信号,所述B个第一类特征信号的信号强度大于A个所述第一类特征信号中其它特征信号的信号强度;其中,B为小于或等于A的正整数。The M characteristic signals include A first-type characteristic signals, the N characteristic signals include B first-type characteristic signals, and the A first-type characteristic signals include the B first-type characteristic signals. Class characteristic signals, the signal strength of the B first class characteristic signals is greater than the signal intensity of other characteristic signals in the A first class characteristic signals; where B is a positive integer less than or equal to A.
  4. 根据权利要求3中所述的方法,其特征在于,The method according to claim 3, characterized in that:
    所述第一位置点的属性还包含L个第一类特征信号,所述L个第一类特征信号符合所述第一类特征信号的数量分布特征。The attributes of the first location point also include L first-type characteristic signals, and the L first-type characteristic signals conform to the quantity distribution characteristics of the first-type characteristic signals.
  5. 根据权利要求2-4中任一项所述的方法,其特征在于,The method according to any one of claims 2-4, characterized in that,
    所述多个位置点中还包含第二位置点,表征所述第二位置点的组合标识由E个特征信号的ID生成;The plurality of position points also includes a second position point, and the combined identifier characterizing the second position point is generated by the ID of the E characteristic signals;
    当所述第一位置点和所述第二位置点满足第一条件、第二条件或第三条件中的一个或多个时,所述第一位置点与所述第二位置点相邻;When the first position point and the second position point satisfy one or more of the first condition, the second condition or the third condition, the first position point is adjacent to the second position point;
    其中,所述第一条件为:所述E个特征信号和所述N个特征信号中包含共同的G个特征信号,G和E的比值、G和N的比值都大于或等于第一阈值;且所述E个特征信号中的第一信号的平均信号强度与所述N个特征信号中第一信号的平均信号强度的差值小于或等于第二阈值,所述第一信号为所述G个特征信号中的任意一个;Wherein, the first condition is: the E characteristic signals and the N characteristic signals include common G characteristic signals, and the ratio of G and E and the ratio of G and N are both greater than or equal to the first threshold; And the difference between the average signal strength of the first signal among the E characteristic signals and the average signal strength of the first signal among the N characteristic signals is less than or equal to the second threshold, and the first signal is the G Any one of the characteristic signals;
    所述第二条件为:所述第一位置点在所述多条轨迹数据中对应H个时间段,第二位置点在所述多条轨迹数据中对应I个时间段,所述H个时间段中包含J个时间段,J与H的比值大于或等于第三阈值,对于所述J个时间段中的第二时间段,所述I个时间段内包含第三时间段,所述第二时间段和所述第三时间段位于同一轨迹数据上,且所述第二时间段和所述第 三时间段的间隔小于或等于第四阈值,所述第二时间段为所述J个时间段内的任一时间段;The second condition is: the first location point corresponds to H time periods in the multiple trajectory data, the second location point corresponds to 1 time period in the multiple trajectory data, and the H time periods The segment contains J time segments, and the ratio of J to H is greater than or equal to the third threshold. For the second time segment among the J time segments, the I time segment includes the third time segment, and the third time segment is included in the segment. The second time period and the third time period are located on the same trajectory data, and the interval between the second time period and the third time period is less than or equal to the fourth threshold, and the second time period is the J Any time period within the time period;
    所述第三条件为:所述第一位置点与所述第二位置点之间的距离小于或等于第五阈值,所述第一位置点和所述第二位置点之间的距离通过所述第一位置点的坐标和所述第二位置点的坐标计算得到,所述第一位置点的坐标通过对所述一个或多个时间段内采集到的位置坐标进行求和并取均值得到。The third condition is: the distance between the first position point and the second position point is less than or equal to a fifth threshold, and the distance between the first position point and the second position point passes through The coordinates of the first position point and the coordinates of the second position point are calculated. The coordinates of the first position point are obtained by summing and averaging the position coordinates collected in the one or more time periods. .
  6. 根据权利要求5所述的方法,其特征在于,所述第一位置点的属性中包含第一属性,所述方法还包括:The method of claim 5, wherein the attributes of the first location point include a first attribute, and the method further includes:
    基于所述多个位置点之间的相邻关系生成第一图结构;其中,所述第一图结构用于描述所述多个位置点之间的连接关系,且当所述第一位置点与所述第二位置点相邻时,所述第一位置点和所述第二位置点在所述第一图结构中通过一条边相连,所述第一图结构中包含第一类位置点和第二类位置点,所述第一类位置点中的每个位置点的属性中包含所述第一属性,所述第二类位置点中的每个位置点的属性中不包含所述第一属性;A first graph structure is generated based on the adjacent relationship between the multiple location points; wherein the first graph structure is used to describe the connection relationship between the multiple location points, and when the first location point When adjacent to the second position point, the first position point and the second position point are connected by an edge in the first graph structure, and the first graph structure contains the first type of position point. and a second type of position point, the attributes of each position point in the first type of position point include the first attribute, and the attributes of each position point in the second type of position point do not include the first attribute;
    对所述第一图结构进行切割,得到第二图结构;其中,所述第一图结构中任意两个所述第一类位置点之间的边被切割,与所述第一类位置点相连的任意两个所述第二类位置点之间的边被切割,所述第二图结构中的每个位置点与至少一个位置点具有连接关系。Cut the first graph structure to obtain a second graph structure; wherein, the edge between any two first-type position points in the first graph structure is cut, and is connected to the first-type position point. The edges between any two connected second type position points are cut, and each position point in the second graph structure has a connection relationship with at least one position point.
  7. 根据权利要求6所述的方法,其特征在于,所述方法还包括:The method of claim 6, further comprising:
    基于所述第二图结构中各位置点之间的连接关系,得到邻接距离矩阵;其中,所述邻接距离矩阵中的每个元素用于表征所述第二图结构中一个所述第二类位置点到一个所述第一类位置点之间边的数量;Based on the connection relationship between each position point in the second graph structure, an adjacency distance matrix is obtained; wherein each element in the adjacency distance matrix is used to characterize one of the second categories in the second graph structure The number of edges between a location point and one of the first-type location points;
    对所述邻接距离矩阵进行归一化处理,得到权重矩阵;Normalize the adjacency distance matrix to obtain a weight matrix;
    将所述权重矩阵和第一矩阵相乘,得到第二矩阵;其中,所述第一矩阵中的元素用于表征所述第二图结构中每个所述第一类位置点的属性中包含的第一属性,所述第二矩阵中的元素用于表征所述第二图结构中每个所述第二类位置点的属性中包含的第一属性。Multiply the weight matrix and the first matrix to obtain a second matrix; wherein the elements in the first matrix are used to represent the attributes contained in each of the first type position points in the second graph structure. The first attribute of the second matrix is used to characterize the first attribute included in the attributes of each second type position point in the second graph structure.
  8. 根据权利要求6或7所述的方法,其特征在于,The method according to claim 6 or 7, characterized in that,
    所述第一属性为所述第一位置点的坐标或室内外位置标识,所述室内外位置标识用于指示所述第一位置点为室内位置点或室外位置点。The first attribute is the coordinates of the first location point or an indoor or outdoor location identifier, and the indoor or outdoor location identifier is used to indicate that the first location point is an indoor location point or an outdoor location point.
  9. 根据权利要求8所述的方法,其特征在于,所述方法还包括:The method of claim 8, further comprising:
    基于所述多个位置点中每个位置点的坐标进行位置定位、导航、大数据统计分析或服务推荐。Position positioning, navigation, big data statistical analysis or service recommendation are performed based on the coordinates of each of the plurality of position points.
  10. 根据权利要求1-9中任一项所述的方法,其特征在于,The method according to any one of claims 1-9, characterized in that,
    所述数量分布特征包含正态分布、泊松分布、离散分布或区间分布中的至少一种,所述区间分布用于描述位于不同区间内特征信号的数量。The quantity distribution characteristics include at least one of normal distribution, Poisson distribution, discrete distribution or interval distribution, and the interval distribution is used to describe the number of characteristic signals located in different intervals.
  11. 根据权利要求1-10中任一项所述的方法,其特征在于,The method according to any one of claims 1-10, characterized in that,
    所述特征信号包含全球导航卫星系统GNSS信号、位置坐标、射频信号、光学信号、声学信号、传感器信号或地磁信号中的一种或多种,其中,所述射频信号包括WIFI信号、蓝 牙信号、小区CELL信号或超宽频UWB信号中的一种或多种。The characteristic signal includes one or more of GNSS signals, position coordinates, radio frequency signals, optical signals, acoustic signals, sensor signals or geomagnetic signals, where the radio frequency signals include WIFI signals, Bluetooth signals, One or more of cell CELL signals or ultra-wideband UWB signals.
  12. 一种轨迹数据的表征装置,其特征在于,所述装置包括:A device for characterizing trajectory data, characterized in that the device includes:
    获取单元,用于获取多条轨迹数据,所述多条轨迹数据中的每条轨迹数据包含多个时间段,以及所述多个时间段中每个时间段内采集的特征信号;An acquisition unit, configured to acquire multiple pieces of trajectory data, each of the multiple pieces of trajectory data including multiple time periods, and characteristic signals collected in each of the multiple time periods;
    处理单元,用于基于所述多条轨迹数据生成多个位置点以及所述多个位置点中每个位置点的属性;A processing unit configured to generate a plurality of position points and attributes of each position point in the plurality of position points based on the plurality of trajectory data;
    其中,所述多个位置点中包含第一位置点,所述第一位置点在所述多条轨迹数据中对应一个或多个时间段,所述一个或多个时间段中每个时间段的组合标识相同,所述每个时间段的组合标识由所述每个时间段对应的至少一个特征信号的标识ID生成,所述第一位置点由所述组合标识进行表征;所述一个或多个时间段内采集的特征信号包含至少一类特征信号,所述第一位置点的属性包含所述至少一类特征信号中每类特征信号的数量和/或每类特征信号的数量分布特征。Wherein, the plurality of position points include a first position point, the first position point corresponds to one or more time periods in the plurality of trajectory data, and each time period in the one or more time periods The combination identification is the same, the combination identification of each time period is generated by the identification ID of at least one characteristic signal corresponding to each time period, the first position point is characterized by the combination identification; the one or The characteristic signals collected in multiple time periods include at least one type of characteristic signal, and the attributes of the first location point include the number of each type of characteristic signal in the at least one type of characteristic signal and/or the quantity distribution characteristics of each type of characteristic signal. .
  13. 根据权利要求12所述的装置,其特征在于,The device according to claim 12, characterized in that:
    所述一个或多个时间段中包含第一时间段,所述第一时间段内包含M个特征信号,所述M个特征信号中包含C类特征信号,M和C为正整数;The one or more time periods include a first time period, the first time period includes M characteristic signals, the M characteristic signals include C-type characteristic signals, and M and C are positive integers;
    所述组合标识由所述M个特征信号中的N个特征信号的ID生成,所述N个特征信号中包含D类特征信号,N为小于或等于M的正整数,D为小于或等于C的正整数。The combined identification is generated by the ID of N characteristic signals among the M characteristic signals. The N characteristic signals include D type characteristic signals. N is a positive integer less than or equal to M, and D is less than or equal to C. positive integer.
  14. 根据权利要求13所述的装置,其特征在于,所述C类特征信号中包括第一类特征信号;The device according to claim 13, characterized in that the type C characteristic signal includes a first type characteristic signal;
    所述M个特征信号中包含A个所述第一类特征信号,所述N个特征信号包含B个第一类特征信号,A个所述第一类特征信号中包含所述B个第一类特征信号,所述B个第一类特征信号的信号强度大于A个所述第一类特征信号中其它特征信号的信号强度;其中,B为小于或等于A的正整数。The M characteristic signals include A first-type characteristic signals, the N characteristic signals include B first-type characteristic signals, and the A first-type characteristic signals include the B first-type characteristic signals. Class characteristic signals, the signal strength of the B first class characteristic signals is greater than the signal intensity of other characteristic signals in the A first class characteristic signals; where B is a positive integer less than or equal to A.
  15. 根据权利要求14所述的装置,其特征在于,The device according to claim 14, characterized in that:
    所述第一位置点的属性还包含L个第一类特征信号,所述L个第一类特征信号符合所述第一类特征信号的数量分布特征。The attributes of the first location point also include L first-type characteristic signals, and the L first-type characteristic signals conform to the quantity distribution characteristics of the first-type characteristic signals.
  16. 根据权利要求12-15中任一项所述的装置,其特征在于,The device according to any one of claims 12-15, characterized in that,
    所述多个位置点中还包含第二位置点,表征所述第二位置点的组合标识由E个特征信号的ID生成;The plurality of position points also includes a second position point, and the combined identifier characterizing the second position point is generated by the ID of the E characteristic signals;
    当所述第一位置点和所述第二位置点满足第一条件、第二条件或第三条件中的一个或多个时,所述第一位置点与所述第二位置点相邻;When the first position point and the second position point satisfy one or more of the first condition, the second condition or the third condition, the first position point is adjacent to the second position point;
    其中,所述第一条件为:所述E个特征信号和所述N个特征信号中包含共同的G个特征信号,G和E的比值、G和N的比值都大于或等于第一阈值;且所述E个特征信号中的第一信号的平均信号强度与所述N个特征信号中第一信号的平均信号强度的差值小于或等于第二阈值,所述第一信号为所述G个特征信号中的任意一个;Wherein, the first condition is: the E characteristic signals and the N characteristic signals include common G characteristic signals, and the ratio of G and E and the ratio of G and N are both greater than or equal to the first threshold; And the difference between the average signal strength of the first signal among the E characteristic signals and the average signal strength of the first signal among the N characteristic signals is less than or equal to the second threshold, and the first signal is the G Any one of the characteristic signals;
    所述第二条件为:所述第一位置点在所述多条轨迹数据中对应H个时间段,第二位置点 在所述多条轨迹数据中对应I个时间段,所述H个时间段中包含J个时间段,J与H的比值大于或等于第三阈值,对于所述J个时间段中的第二时间段,所述I个时间段内包含第三时间段,所述第二时间段和所述第三时间段位于同一轨迹数据上,且所述第二时间段和所述第三时间段的间隔小于或等于第四阈值,所述第二时间段为所述J个时间段内的任一时间段;The second condition is: the first location point corresponds to H time periods in the multiple trajectory data, the second location point corresponds to 1 time period in the multiple trajectory data, and the H time periods The segment contains J time segments, and the ratio of J to H is greater than or equal to the third threshold. For the second time segment among the J time segments, the I time segment includes the third time segment, and the third time segment The second time period and the third time period are located on the same trajectory data, and the interval between the second time period and the third time period is less than or equal to the fourth threshold, and the second time period is the J Any time period within the time period;
    所述第三条件为:所述第一位置点与所述第二位置点之间的距离小于或等于第五阈值,所述第一位置点和所述第二位置点之间的距离通过所述第一位置点的坐标和所述第二位置点的坐标计算得到,所述第一位置点的坐标通过对所述一个或多个时间段内采集到的位置坐标进行求和并取均值得到。The third condition is: the distance between the first position point and the second position point is less than or equal to a fifth threshold, and the distance between the first position point and the second position point passes through The coordinates of the first position point and the coordinates of the second position point are calculated. The coordinates of the first position point are obtained by summing and averaging the position coordinates collected in the one or more time periods. .
  17. 根据权利要求16所述的装置,其特征在于,所述第一位置点的属性中包含第一属性,所述处理单元还用于:The device according to claim 16, wherein the attributes of the first location point include a first attribute, and the processing unit is further configured to:
    基于所述多个位置点之间的相邻关系生成第一图结构;其中,所述第一图结构用于描述所述多个位置点之间的连接关系,且当所述第一位置点与所述第二位置点相邻时,所述第一位置点和所述第二位置点在所述第一图结构中通过一条边相连,所述第一图结构中包含第一类位置点和第二类位置点,所述第一类位置点中的每个位置点的属性中包含所述第一属性,所述第二类位置点中的每个位置点的属性中不包含所述第一属性;A first graph structure is generated based on the adjacent relationship between the multiple location points; wherein the first graph structure is used to describe the connection relationship between the multiple location points, and when the first location point When adjacent to the second position point, the first position point and the second position point are connected by an edge in the first graph structure, and the first graph structure contains the first type of position point. and a second type of position point, the attributes of each position point in the first type of position point include the first attribute, and the attributes of each position point in the second type of position point do not include the first attribute;
    对所述第一图结构进行切割,得到第二图结构;其中,所述第一图结构中任意两个所述第一类位置点之间的边被切割,与所述第一类位置点相连的任意两个所述第二类位置点之间的边被切割,所述第二图结构中的每个位置点与至少一个位置点具有连接关系。Cut the first graph structure to obtain a second graph structure; wherein, the edge between any two first-type position points in the first graph structure is cut, and is connected to the first-type position point. The edges between any two connected second type position points are cut, and each position point in the second graph structure has a connection relationship with at least one position point.
  18. 根据权利要求17所述的装置,其特征在于,所述处理单元还用于:The device according to claim 17, characterized in that the processing unit is also used for:
    基于所述第二图结构中各位置点之间的连接关系,得到邻接距离矩阵;其中,所述邻接距离矩阵中的每个元素用于表征所述第二图结构中一个所述第二类位置点到一个所述第一类位置点之间边的数量;Based on the connection relationship between each position point in the second graph structure, an adjacency distance matrix is obtained; wherein each element in the adjacency distance matrix is used to characterize one of the second categories in the second graph structure The number of edges between a location point and one of the first-type location points;
    对所述邻接距离矩阵进行归一化处理,得到权重矩阵;Normalize the adjacency distance matrix to obtain a weight matrix;
    将所述权重矩阵和第一矩阵相乘,得到第二矩阵;其中,所述第一矩阵中的元素用于表征所述第二图结构中每个所述第一类位置点的属性中包含的第一属性,所述第二矩阵中的元素用于表征所述第二图结构中每个所述第二类位置点的属性中包含的第一属性。Multiply the weight matrix and the first matrix to obtain a second matrix; wherein the elements in the first matrix are used to represent the attributes contained in each of the first type position points in the second graph structure. The first attribute of the second matrix is used to characterize the first attribute included in the attributes of each second type position point in the second graph structure.
  19. 根据权利要求17或18所述的装置,其特征在于,The device according to claim 17 or 18, characterized in that,
    所述第一属性为所述第一位置点的坐标或室内外位置标识,所述室内外位置标识用于指示所述第一位置点为室内位置点或室外位置点。The first attribute is the coordinates of the first location point or an indoor or outdoor location identifier, and the indoor or outdoor location identifier is used to indicate that the first location point is an indoor location point or an outdoor location point.
  20. 根据权利要求19所述的装置,其特征在于,所述处理单元还用于:The device according to claim 19, characterized in that the processing unit is also used for:
    基于所述多个位置点中每个位置点的坐标进行位置定位、导航、大数据统计分析或服务推荐。Position positioning, navigation, big data statistical analysis or service recommendation are performed based on the coordinates of each of the plurality of position points.
  21. 根据权利要求12-20中任一项所述的装置,其特征在于,The device according to any one of claims 12-20, characterized in that:
    所述数量分布特征包含正态分布、泊松分布、离散分布或区间分布中的至少一种,所述区间分布用于描述位于不同区间内特征信号的数量。The quantity distribution characteristics include at least one of normal distribution, Poisson distribution, discrete distribution or interval distribution, and the interval distribution is used to describe the number of characteristic signals located in different intervals.
  22. 根据权利要求12-21中任一项所述的装置,其特征在于,The device according to any one of claims 12-21, characterized in that,
    所述特征信号包含全球导航卫星系统GNSS信号、位置坐标、射频信号、光学信号、声学信号、传感器信号或地磁信号中的一种或多种,其中,所述射频信号包括WIFI信号、蓝牙信号、小区CELL信号或超宽频UWB信号中的一种或多种。The characteristic signal includes one or more of GNSS signals, position coordinates, radio frequency signals, optical signals, acoustic signals, sensor signals or geomagnetic signals, where the radio frequency signals include WIFI signals, Bluetooth signals, One or more of cell CELL signals or ultra-wideband UWB signals.
  23. 一种计算机设备,其特征在于,所述计算机设备包括至少一个处理器,存储器和接口电路,所述存储器、所述接口电路和所述至少一个处理器通过线路互联,所述至少一个存储器中存储有指令;所述指令被所述处理器执行时,上述权利要求1-11中的任一项方法得以实现。A computer device, characterized in that the computer device includes at least one processor, a memory and an interface circuit, the memory, the interface circuit and the at least one processor are interconnected through lines, and the at least one memory stores There are instructions; when the instructions are executed by the processor, the method of any one of the above claims 1-11 is implemented.
  24. 根据权利要求23所述的设备,其特征在于,所述计算机设备为服务器或终端设备,其中,所述终端设备包括手机、电脑、车机或平板。The device according to claim 23, characterized in that the computer device is a server or a terminal device, wherein the terminal device includes a mobile phone, a computer, a car machine or a tablet.
  25. 一种芯片系统,所述芯片系统包括至少一个处理器,存储器和接口电路,所述存储器、所述接口电路和所述至少一个处理器通过线路互联,所述至少一个存储器中存储有指令;所述指令被所述处理器执行时,上述权利要求1-11中的任一项方法得以实现。A chip system, the chip system includes at least one processor, a memory and an interface circuit, the memory, the interface circuit and the at least one processor are interconnected through lines, and instructions are stored in the at least one memory; When the instructions are executed by the processor, the method of any one of the above claims 1-11 is implemented.
  26. 一种计算机可读存储介质,其特征在于,所述计算机可读存储介质存储有计算机程序,该计算机程序被执行时,上述权利要求1-11中的任一项方法得以实现。A computer-readable storage medium, characterized in that the computer-readable storage medium stores a computer program. When the computer program is executed, any one of the methods in claims 1-11 can be implemented.
  27. 一种计算机程序产品,其特征在于,所述计算机程序产品包括程序指令,当所述程序指令在计算机上运行时,上述权利要求1-11中的任一项方法得以实现。A computer program product, characterized in that the computer program product includes program instructions. When the program instructions are run on a computer, the method of any one of the above claims 1-11 is implemented.
PCT/CN2022/103140 2022-06-30 2022-06-30 Trajectory data characterization method and apparatus WO2024000519A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
PCT/CN2022/103140 WO2024000519A1 (en) 2022-06-30 2022-06-30 Trajectory data characterization method and apparatus

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2022/103140 WO2024000519A1 (en) 2022-06-30 2022-06-30 Trajectory data characterization method and apparatus

Publications (1)

Publication Number Publication Date
WO2024000519A1 true WO2024000519A1 (en) 2024-01-04

Family

ID=89383850

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2022/103140 WO2024000519A1 (en) 2022-06-30 2022-06-30 Trajectory data characterization method and apparatus

Country Status (1)

Country Link
WO (1) WO2024000519A1 (en)

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110852342A (en) * 2019-09-26 2020-02-28 京东城市(北京)数字科技有限公司 Road network data acquisition method, device, equipment and computer storage medium
CN112395512A (en) * 2020-11-06 2021-02-23 中山大学 Method for constructing complex attribute network representation model based on path aggregation
US20210248181A1 (en) * 2020-02-11 2021-08-12 Samsung Electronics Co., Ltd. Electronic device and control method thereof
CN113793295A (en) * 2021-08-05 2021-12-14 西人马帝言(北京)科技有限公司 Data processing method, device and equipment and readable storage medium
CN113807457A (en) * 2021-09-26 2021-12-17 北京市商汤科技开发有限公司 Method, device and equipment for determining road network characterization information and storage medium
CN114511740A (en) * 2022-01-25 2022-05-17 北京市商汤科技开发有限公司 Vehicle image classification method, vehicle track restoration method, device and equipment

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110852342A (en) * 2019-09-26 2020-02-28 京东城市(北京)数字科技有限公司 Road network data acquisition method, device, equipment and computer storage medium
US20210248181A1 (en) * 2020-02-11 2021-08-12 Samsung Electronics Co., Ltd. Electronic device and control method thereof
CN112395512A (en) * 2020-11-06 2021-02-23 中山大学 Method for constructing complex attribute network representation model based on path aggregation
CN113793295A (en) * 2021-08-05 2021-12-14 西人马帝言(北京)科技有限公司 Data processing method, device and equipment and readable storage medium
CN113807457A (en) * 2021-09-26 2021-12-17 北京市商汤科技开发有限公司 Method, device and equipment for determining road network characterization information and storage medium
CN114511740A (en) * 2022-01-25 2022-05-17 北京市商汤科技开发有限公司 Vehicle image classification method, vehicle track restoration method, device and equipment

Similar Documents

Publication Publication Date Title
EP2620024B1 (en) Generation and use of coverage area models
CN108601029B (en) Base station construction evaluation method and device
US9426044B2 (en) Radio access network geographic information system with multiple format
CN108450060B (en) Positioning method and device based on WI-FI access point
CN107846688B (en) Wireless network site planning method and device based on multiple operators
CN107110652B (en) Processing spatial features
CN112312301B (en) User terminal positioning method, device, equipment and computer storage medium
CN109068272B (en) Similar user identification method, device, equipment and readable storage medium
CN111107556B (en) Signal coverage quality evaluation method and device of mobile communication network
CN109936820B (en) User terminal positioning method and device
WO2020024597A1 (en) Indoor positioning method and apparatus
CN108281023B (en) Method and system for displaying real-time road conditions through mobile terminal
CN111311193B (en) Method and device for configuring public service resources
CN103167605B (en) A kind of WiFi outdoor positioning method that satellite auxiliary signal coverage diagram is set up/upgraded
CN110727752B (en) Position fingerprint database processing method, device and computer readable storage medium
WO2013189089A1 (en) A method and equipment for distinguishing indoor business data from outdoor business data
CN112526572B (en) Network switching method and positioning system for indoor and outdoor seamless navigation
WO2024000519A1 (en) Trajectory data characterization method and apparatus
Moradbeikie et al. A cost-effective LoRaWAN-based IoT localization method using fixed reference nodes and dual-slope path-loss modeling
CN114980138B (en) Planning method, system and terminal of 5G wireless base station
CN116133037A (en) Wireless network evaluation method and device
CN114786199A (en) Method, device, equipment and storage medium for determining network problem point
CN113593244A (en) Flow determination method and device, storage medium and electronic device
CN107979818B (en) Method for processing initial data of wireless fingerprint database
CN112929938B (en) AP topological relation graph generation method and system based on wireless log information