CN107766495A - Distributed temporal index method based on Voronoi diagram - Google Patents

Publication number
CN107766495A
Authority
CN
China
Prior art keywords
point
voronoi
data
cluster
subregion
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201710976133.6A
Other languages
Chinese (zh)
Inventor
季长清
汪祖民
刘艳
高杨
李泽宇
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Dalian University
Original Assignee
Dalian University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Dalian University
Priority to CN201710976133.6A
Publication of CN107766495A
Legal status: Pending

Abstract

A distributed temporal index method based on Voronoi diagrams, belonging to the field of data indexing, addresses the problem of improving the indexing efficiency of existing data query methods. The key technical points are: the distance between each object r in data set R, each object s in data set S, and the representative points p is computed, and each object r, s is assigned to its closest representative point P; objects r and s that share the same closest representative point are collected into one Voronoi cell, which produces m Voronoi cells that serve as partitions, and <VC_m, List(P_i)> pairs are output. The effect is that space cost is greatly reduced, so space efficiency is very high.

Description

Distributed temporal index method based on Voronoi diagram
Technical field
The invention belongs to the field of data indexing and relates to big data processing and the application of spatial query algorithms.
Background art
With the rapid development of mobile communication and location-based service technologies, technologies such as cloud computing, big data, the Internet of Things, mobile computing, and spatial positioning have gradually matured. Data from GPS, cameras, Bluetooth, and similar sources keeps growing, and large volumes of spatial data have emerged, which poses a major challenge for storing and processing spatial data and objects.
Big data processing frequently runs into long computation times and low spatio-temporal data query efficiency. Conventional computer systems support only a limited number of threads and have poor parallel and distributed performance, and the computing resources of a single machine are usually limited (for example, by the size of the disk or memory, or by weak CPU computing power), so they cannot be applied directly.
Indexing has an important influence on the efficiency of large-scale data access. New spatial index methods needed to be incorporated into traditional database processing engines, which led to the R-tree structure. An R-tree index is equivalent to an extension of the two-dimensional B+ tree to multidimensional data environments. Many algorithms perform nearest neighbor (NN) queries on R-tree indexes, but all of them execute as a single thread on a single computer. When the data scale grows rapidly, a distributed database system is needed for indexing, data querying, and related processing.
Summary of the invention
To improve the indexing efficiency of existing data query methods, the invention provides the following scheme:
A distributed temporal index method based on Voronoi diagrams, with the following steps. An inverted Voronoi index is built using Spark. Two data sets R and S in d-dimensional space are given. Spark splits the input by its default mechanism, several mappers run in parallel, and the default reducer is used in the Spark task. Before the map function starts, a pre-clustering algorithm is used to obtain the representative points p, which are loaded into the main memory of each map task.
In each map process, the input splits are read in turn using TextInputFormat, which reads data from files into Mapper instances. The distance between each object r in data set R, each object s in data set S, and the representative points p is computed, and each object r, s is assigned to its closest representative point P. Objects r and s that share the same closest representative point are collected into one Voronoi cell, which produces m Voronoi cells that serve as partitions, and <VC_m, List(P_i)> pairs are output. Given a query point p, its closest partition, or a set of nearby partitions, is identified. The mapper outputs each object r, s of the original data set together with the id of its closest partition (or closest set of partitions) VC_m. The mapper output is written to the Spark file system.
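The assignment step described above can be sketched in a few lines of plain Python. This is a minimal illustration under stated assumptions, not the patented implementation: the function names `nearest_representative` and `assign_to_cells`, the dictionary layout, and the sample coordinates are all hypothetical, and plain Euclidean distance is assumed.

```python
import math
from collections import defaultdict

def nearest_representative(coords, reps):
    """Return the index of the representative point closest to coords."""
    return min(range(len(reps)), key=lambda i: math.dist(coords, reps[i]))

def assign_to_cells(objects, reps):
    """Group object ids by the Voronoi cell of their closest representative.

    objects: dict mapping object id -> coordinate tuple (objects of R and S)
    reps:    list of representative points (coordinate tuples)
    Returns a dict cell_id -> list of object ids, i.e. <VC_m, List(P_i)> pairs.
    """
    cells = defaultdict(list)
    for obj_id, coords in objects.items():
        cells[nearest_representative(coords, reps)].append(obj_id)
    return dict(cells)

# Hypothetical sample data: two representative points, three objects.
reps = [(0.0, 0.0), (10.0, 0.0)]
objects = {"r1": (1.0, 1.0), "r2": (9.0, -1.0), "s1": (2.0, 0.5)}
cells = assign_to_cells(objects, reps)
print(cells)
```

Each key of `cells` plays the role of a partition id VC_m, and each value is the list of object ids collected in that cell.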
A Voronoi diagram divides a space into multiple disjoint polygons. For any point inside a polygon, its nearest neighbor among the generating points lies in the same Voronoi cell. Each polygon in the diagram is called the Voronoi cell associated with a point p, and p is the nearest neighbor of every point in the cell where p lies.
The inverted Voronoi index has two parts: a primary index, which contains all cluster centers, and a secondary index, which contains the queue of objects stored in each partition VC.
The representative points are obtained as follows. The internal cluster points and the adjacent points are determined, the data of the internal cluster points are clustered, and after clustering the cluster centers are selected for indexing. The required data are the adjacent points connected to the internal cluster points. A circle is drawn with an internal cluster point as its center so that it contains the adjacent cluster center points, and the triangle that has this circle as its circumscribed circle is taken as a Delaunay triangle. In this method, a Delaunay triangle is built for each of two different internal cluster points, and the two Delaunay triangles, with the adjacent points as common points, form a Delaunay triangulation network. The data objects are divided into several large partitions, and one cluster representative point in each is selected as the representative point. Each partitioned object is clustered into one Voronoi cell, and each Voronoi cell contains object ids.
The Voronoi diagram is given by $VD(P) = \{V(p_1), V(p_2), \ldots, V(p_m)\}$, where $VD(P)$ is the collection of Voronoi regions on $P$ and $V(p_1)$ is the Voronoi region of $p_1$. This set, associated with all the given points, is called the Voronoi diagram generated by $P$ under the distance function Dist(). The Voronoi region of each point $p$ necessarily contains all points that are closer to $p$ than to any other generating point, so the nearest neighbor of a query point $q$ is the generator of the closed Voronoi region that contains $q$;
A Voronoi cell is defined on the space $R$ by marking off from the $D$-dimensional space a region that contains $n$ points, $P = \{p_1, p_2, \ldots, p_n\}$. The region given by partition VC for point $p_i$ is $VC(p_i) = \{\, p \mid d(p, p_i) \le d(p, p_j) \,\}$; this region is called the Voronoi cell associated with $p_i$;
where $p$ is a given point or query point, $d(p, p_i)$ is the minimum Euclidean distance between $p$ and $p_i$, $i$ and $j$ are indices with $n \ge 2$, $p_i \ne p_j$ for $i \ne j$, and $i, j \in I_n = \{1, \ldots, n\}$; $i$ takes every value in $1, \ldots, n$, and for each value of $i$, $j$ ranges over all values in $1, \ldots, n$ except $i$.
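As a small worked instance of the cell condition $d(p, p_i) \le d(p, p_j)$, assume two generating points and one point $p$ in the plane. The coordinates are illustrative only and do not come from the source.

```latex
\begin{align*}
p_1 &= (0, 0), \qquad p_2 = (4, 0), \qquad p = (1, 1) \\
d(p, p_1) &= \sqrt{(1-0)^2 + (1-0)^2} = \sqrt{2} \approx 1.41 \\
d(p, p_2) &= \sqrt{(1-4)^2 + (1-0)^2} = \sqrt{10} \approx 3.16 \\
d(p, p_1) &\le d(p, p_2) \;\Longrightarrow\; p \in VC(p_1)
\end{align*}
```

With these two generators the boundary between the cells is the perpendicular bisector $x = 2$, so every point with $x \le 2$ belongs to $VC(p_1)$.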
Beneficial effects: The invention is an indexing method that uses Voronoi diagrams. Because it uses a multidimensional Voronoi index, the index supports spatial data integration, is suitable for indexing multidimensional data sets, and can support massive and multidimensional data sets. Ideal spatial object storage needs only a very small space: only the representative point information of each object has to be stored, which greatly reduces space cost and makes space efficiency very high. Inverted Thiessen polygons are used to index distributed medical spatio-temporal regions, and this solution has an important influence on the efficiency of large-scale data access.
Brief description of the drawings
Fig. 1: Voronoi diagram;
Fig. 2: Schematic of the inverted Voronoi diagram index;
Fig. 3: Illustrative example of the invention;
Fig. 4: Schematic of the construction of the Delaunay triangulation network.
Detailed description of the embodiments
Embodiment 1: A distributed temporal index method based on Voronoi diagrams, characterized by the following steps. An inverted Voronoi index is built using Spark. Two data sets R and S in d-dimensional space are given. Spark is an existing computing engine; it splits the input by its default mechanism, several mappers run in parallel, and the default reducer is used in the Spark task. Before the map function starts, a pre-clustering algorithm is used to obtain the representative points p, which are loaded into the main memory of each map task.
In each map process, the input splits are read in turn using TextInputFormat, which reads data from files into Mapper instances. The distance between each object r in data set R, each object s in data set S, and the representative points p is computed, and each object r, s is assigned to its closest representative point P. Objects r and s that share the same closest representative point are collected into one Voronoi cell, which produces m Voronoi cells that serve as partitions, and <VC_m, List(P_i)> pairs are output, where P_i is the series of closest representative points obtained and i is the position index of the point. Given a query point p, its closest partition, or a set of nearby partitions, is identified. The mapper outputs each object r, s of the original data set together with the id of its closest partition (or closest set of partitions) VC_m. The mapper output is written to the Spark file system.
Here the Voronoi diagram divides a space into multiple disjoint polygons. For any point inside a polygon, its nearest neighbor among the generating points lies in the same Voronoi cell. Each polygon in the diagram is called the Voronoi cell associated with a point p, and p is the nearest neighbor of every point in the cell where p lies.
The Voronoi diagram is given by $VD(P) = \{V(p_1), V(p_2), \ldots, V(p_m)\}$, where $VD(P)$ is the collection of Voronoi regions on $P$ and $V(p_1)$ is the Voronoi region of $p_1$. This set, associated with all the given points, is called the Voronoi diagram generated by $P$ under the distance function Dist(). The Voronoi region of each point $p$ necessarily contains all points that are closer to $p$ than to any other generating point, so the nearest neighbor of a query point $q$ is the generator of the closed Voronoi region that contains $q$;
A Voronoi cell is defined on the space $R$ by marking off from the $D$-dimensional space a region that contains $n$ points, $P = \{p_1, p_2, \ldots, p_n\}$. The region given by partition VC for point $p_i$ is $VC(p_i) = \{\, p \mid d(p, p_i) \le d(p, p_j) \,\}$; this region is called the Voronoi cell associated with $p_i$;
where $p$ is a given point or query point, $d(p, p_i)$ is the minimum Euclidean distance between $p$ and $p_i$, $i$ and $j$ are indices with $n \ge 2$, $p_i \ne p_j$ for $i \ne j$, and $i, j \in I_n = \{1, \ldots, n\}$; $i$ takes every value in $1, \ldots, n$, and for each value of $i$, $j$ ranges over all values in $1, \ldots, n$ except $i$.
The inverted Voronoi index has two parts: a primary index, which contains all cluster centers, and a secondary index, which contains the queue of objects stored in each partition VC.
The representative points are obtained as follows. The internal cluster points and the adjacent points are determined, the data of the internal cluster points are clustered, and after clustering the cluster centers are selected for indexing. The required data are the adjacent points connected to the internal cluster points. A circle is drawn with an internal cluster point as its center so that it contains the adjacent cluster center points, and the triangle that has this circle as its circumscribed circle is taken as a Delaunay triangle. In this method, a Delaunay triangle is built for each of two different internal cluster points, and the two Delaunay triangles, with the adjacent points as common points, form a Delaunay triangulation network. The data objects are divided into several large partitions, and one cluster representative point in each is selected as the representative point. Each partitioned object is clustered into one Voronoi cell, and each Voronoi cell contains object ids.
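The text ties Delaunay triangles to their circumscribed circles. A common way to check that a triangle qualifies is the standard empty-circumcircle criterion: no other point may lie strictly inside the triangle's circumcircle. The sketch below shows that check in two dimensions. It assumes this standard criterion is what the construction amounts to, which the source does not state explicitly, and the function names and sample points are hypothetical.

```python
def orientation(a, b, c):
    """Twice the signed area of triangle abc (positive if counter-clockwise)."""
    return (b[0] - a[0]) * (c[1] - a[1]) - (b[1] - a[1]) * (c[0] - a[0])

def in_circumcircle(a, b, c, d):
    """True if point d lies strictly inside the circumcircle of triangle abc."""
    ax, ay = a[0] - d[0], a[1] - d[1]
    bx, by = b[0] - d[0], b[1] - d[1]
    cx, cy = c[0] - d[0], c[1] - d[1]
    det = ((ax * ax + ay * ay) * (bx * cy - cx * by)
           - (bx * bx + by * by) * (ax * cy - cx * ay)
           + (cx * cx + cy * cy) * (ax * by - bx * ay))
    # The determinant is positive for interior points when abc is
    # counter-clockwise; multiplying by the orientation handles both orders.
    return det * orientation(a, b, c) > 0

def is_delaunay_triangle(tri, points):
    """True if no point other than the triangle's vertices is inside its circumcircle."""
    a, b, c = tri
    return not any(in_circumcircle(a, b, c, p) for p in points if p not in tri)

# Hypothetical cluster centers: the circumcircle of tri is centered at (2, 2).
tri = ((0, 0), (4, 0), (0, 4))
print(is_delaunay_triangle(tri, [(5, 5)]))  # (5, 5) is outside the circle
print(is_delaunay_triangle(tri, [(3, 3)]))  # (3, 3) is inside the circle
```

A degenerate (collinear) triangle has zero orientation, so this sketch never reports a point as inside its circumcircle; a production implementation would reject such triangles first.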
Embodiment 2: This embodiment supplements or further explains Embodiment 1. As shown in Fig. 1, a Voronoi diagram divides a space into multiple disjoint polygons. For any point inside a polygon, its nearest neighbor among the generating points lies in the same Voronoi cell. Each polygon in the diagram is called the Voronoi cell associated with a point p, so p is the nearest neighbor of every point in the cell where p lies. In a Voronoi-based nearest neighbor search, the data point p of each Voronoi cell can therefore be checked to verify whether it is a neighbor of some query point q. An inverted index, in turn, is commonly used for text similarity search, where the position of a record is determined by attribute values.
Voronoi diagram (VD): The set associated with all points that is given by $VD(P) = \{V(p_1), V(p_2), \ldots, V(p_m)\}$ is called the Voronoi diagram generated by $P$ under the distance function Dist(). The Voronoi region of each point $p$ necessarily contains all points that are closer to $p$ than to any other generating point, so the nearest neighbor of a query point $q$ is the generator of the closed Voronoi region that contains $q$. Fig. 1 shows 8 neighboring points of a Voronoi diagram in two-dimensional Euclidean space.
Voronoi cell (VC): On the space $R$, a region that contains $n$ points is marked off from the $D$-dimensional space, $P = \{p_1, p_2, \ldots, p_n\}$, where $n \ge 2$, $p_i \ne p_j$ for $i \ne j$, and $i, j \in I_n = \{1, \ldots, n\}$. The region given by VC is $VC(p_i) = \{\, p \mid d(p, p_i) \le d(p, p_j) \,\}$, where $d(p, p_i)$ is the minimum Euclidean distance between $p$ and $p_i$; this region is called the Voronoi cell associated with $p_i$.
Our inverted Voronoi index combines the inverted index with the Voronoi index to produce a new index that has the advantages of both. Specifically, the inverted Voronoi index is a large-scale spatial data structure that stores mapped data points. Given a large data set P that contains a set of data objects in Euclidean space, each object of the indexed data set is clustered into one Voronoi cell, and the Voronoi diagram can be expressed as $VC(P) = \{VC_1, VC_2, \ldots, VC_m\}$. We use VC(P) as the keys of the inverted index. The ids of all data objects $\{P_i\} \in VC_m$ are stored in a queue as the value. That is, each Voronoi cell contains a large number of object ids.
Such a system faces the following conditions:
S1. The data to be processed are very large;
S2. Query points occur at random and are not contained in the data set, and the data set may have a skewed distribution;
S3. The data model and the distances are defined in multidimensional Euclidean space.
The inverted Voronoi index (Inverted Voronoi Index, IVI) has two parts:
S1. The primary index, which contains all cluster centers;
S2. The secondary index, which contains the queue of objects stored in each VC.
The inverted index serves to efficiently index the data objects in the queues adjacent to the query object. Given a query, we can identify the closest VC or a set of nearby VCs. The queue elements of these VCs are then gathered to obtain the kNN query result.
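The two-part structure and the query procedure can be sketched as a small in-memory class. This is a hedged illustration, not the patent's implementation: the class name `InvertedVoronoiIndex`, its methods, the `n_cells` parameter, and the sample points are assumptions, and Euclidean distance is assumed. Scanning only the nearest cell can miss true neighbors that sit just across a cell boundary, which is why the number of cells to scan is left adjustable.

```python
import heapq
import math

class InvertedVoronoiIndex:
    """Primary index: cluster centers. Secondary index: one queue per cell."""

    def __init__(self, centers):
        self.centers = list(centers)
        self.queues = {i: [] for i in range(len(self.centers))}

    def insert(self, obj_id, coords):
        """Append the object to the queue of its closest cluster center."""
        cell = min(range(len(self.centers)),
                   key=lambda i: math.dist(coords, self.centers[i]))
        self.queues[cell].append((obj_id, coords))

    def knn(self, q, k, n_cells=1):
        """Return ids of the k objects nearest to q among the n_cells nearest cells."""
        cells = heapq.nsmallest(n_cells, range(len(self.centers)),
                                key=lambda i: math.dist(q, self.centers[i]))
        candidates = [item for c in cells for item in self.queues[c]]
        best = heapq.nsmallest(k, candidates, key=lambda item: math.dist(q, item[1]))
        return [obj_id for obj_id, _ in best]

# Hypothetical data: three cluster centers and four objects.
index = InvertedVoronoiIndex([(0, 0), (10, 0), (0, 10)])
for obj_id, coords in [("a", (1, 0)), ("b", (2, 2)), ("c", (9, 1)), ("d", (1, 9))]:
    index.insert(obj_id, coords)

print(index.knn((1.5, 0.5), k=2))          # scans only the closest cell
print(index.knn((5, 0), k=3, n_cells=3))   # scans all three cells
```

Raising `n_cells` trades query time for recall; with `n_cells` equal to the number of cells the search is exhaustive.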
Fig. 2 illustrates an IVI that contains objects in two-dimensional space. Based on a Voronoi division, we split the objects into 6 partitions. For simplicity, we select P as the representative points, so each object and its closest representative point are assigned to the same Voronoi cell. Intuitively, the inverted Voronoi diagram index partitioning method divides a multidimensional space into multiple Voronoi cells in inverted form.
Our IVI therefore has the following advantages:
S1. Support for massive data sets: because the inverted Voronoi diagram index structure inherits the form of the inverted index, it is intuitively clear that this index scheme is suited to distributed processing.
S2. Support for multiple dimensions: a multidimensional Voronoi index is used, which supports spatial data integration and is suitable for indexing multidimensional data sets.
S3. Space efficiency: ideal spatial object storage needs only a very small space. Because only the representative point information of each object has to be stored, space cost is greatly reduced.
Building the inverted Voronoi diagram index with Spark
This section describes how we use Spark to build the IVI. Because a Voronoi diagram can be obtained by splitting it into multiple sub-diagrams (VP) and merging them, the construction of the inverted Voronoi index fits the Spark model. Specifically, the sub-diagrams VP are merged to obtain the final Voronoi diagram.
As shown in Algorithm 1, two data sets R and S are given in d-dimensional space. Spark splits the input by its default mechanism. Several mappers run in parallel. In the Spark task we use the default reducer. Before the map function starts, we use a fast pre-clustering algorithm to obtain the representative points p and load them into the main memory of each map task.
Then, in each map process, the input splits are read in turn using TextInputFormat (according to the input format in the distributed file system), which reads data from files into Mapper instances. The distance between each object r, s and the points p is computed, and r and s are assigned to their closest representative point P. In lines 2-3 of the algorithm, each point is collected into a Voronoi cell, which produces m Voronoi cells. In lines 4-6 of the algorithm, <VC_m, List(P_i)> pairs are output: the mapper outputs each object r, s of the original data set (R or S) together with the id of its closest partition VC_m.
Finally, in lines 8-10 of the algorithm, we write the mapper output to the Spark file system through a custom MultipleOutputFormat function as our needs require. It determines how task results are written back to the underlying persistent storage. Algorithm 1 gives the detailed pseudocode for building the Spark-based Voronoi index structure. With the IVI, given a representative point, we can start a Spark task to partition the data and collect some data information about each partition.
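The split, map, and merge flow described above can be emulated in plain Python to make the data movement concrete. The sketch does not call Spark or any Hadoop input or output format; `map_split` and `merge` are hypothetical names. The per-partition statistics collected here (object count and the largest distance to the representative point) are assumptions, since the source only says that "some data information" of each partition is collected.

```python
import math
from collections import defaultdict

def map_split(split, reps):
    """Mapper: emit (cell_id, (obj_id, distance)) for every record in one split."""
    out = []
    for obj_id, coords in split:
        dists = [math.dist(coords, r) for r in reps]
        cell = dists.index(min(dists))          # closest representative point
        out.append((cell, (obj_id, dists[cell])))
    return out

def merge(mapped_splits):
    """Merge per-split mapper output into one partition table with statistics."""
    partitions = defaultdict(lambda: {"ids": [], "count": 0, "radius": 0.0})
    for records in mapped_splits:
        for cell, (obj_id, dist) in records:
            part = partitions[cell]
            part["ids"].append(obj_id)
            part["count"] += 1
            part["radius"] = max(part["radius"], dist)  # farthest member so far
    return dict(partitions)

# Hypothetical input: representative points loaded before the map phase,
# and two input splits holding records from R and S.
reps = [(0.0, 0.0), (10.0, 0.0)]
splits = [
    [("r1", (3.0, 4.0)), ("r2", (10.0, 2.0))],
    [("s1", (0.0, 1.0)), ("s2", (7.0, 0.0))],
]
partitions = merge(map_split(s, reps) for s in splits)
for cell_id, info in sorted(partitions.items()):
    print(cell_id, info)
```

Because each split is mapped independently and the merge only appends and takes maxima, the splits could be processed in any order or in parallel, which is the property the text relies on.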
Embodiment 3: Medical and social security services are developing rapidly today. As living standards rise, the demand for medical services is becoming more humanized and personalized, and more and more people need more convenient and more complete medical services. At the same time, with the rapid development of mobile communication and location-based service technologies, technologies such as cloud computing, big data, the Internet of Things, mobile computing, and spatial positioning have gradually matured. Data from GPS, cameras, Bluetooth, and similar sources keeps growing, and large volumes of spatial data have emerged, which poses a major challenge for storing and processing spatial data and objects. In the health care industry, applications such as electronic medical records, nursing call center systems, and large-scale medical databases are also developing rapidly, and mobile medical technologies play a growing role in improving work efficiency, improving medical services, and saving medical costs.
However, China's geographic environments differ greatly, economic development is uneven, and medical resources are unevenly distributed. Medical standards differ widely, especially between developed and remote areas. Meanwhile, with migration from rural areas to cities and the rapid growth of industries such as tourism, an already large floating population is growing exponentially. A patient who falls ill in an unfamiliar place often does not know where to seek treatment. Hospitals may require appointments months in advance and long queues for registration, and the patient shuttles between several hospitals, so that much labor and money is wasted on travel while the illness is not treated in time. In everyday emergencies, we also often do not know which hospitals are nearby, which of them can handle the condition, and which is closer to the patient or offers better service. The resulting delays lead to untimely treatment and can even cause deaths.
Although many hospitals now have their own websites that allow advance registration and inquiries, and online consultation has become easy, China has a great many hospitals, medical websites large and small are hard to tell apart as genuine or fake, and the qualifications of online doctors cannot be verified. In addition, PC equipment is not portable, so seeking medical consultation becomes extremely difficult when complex queries or emergency calls from home are needed.
In recent years, with the arrival of the medical big data era, more data related to medical resources have become available, and the concept of mobile medicine has emerged. Mobile medicine means using mobile communication technologies and devices to provide medical services and medical information suited to the public at any time and place. The rapid development of the Internet, mobile communication, and multimedia, and especially of 3G and 4G technologies, has brought substantial progress to mobile medical technology. However, we have found that big data processing of such mobile medical data frequently runs into long computation times and low spatio-temporal data query efficiency. Conventional computer systems support only a limited number of threads and have poor parallel and distributed performance, and the computing resources of a single machine are usually limited (for example, by the size of the disk or memory, or by weak CPU computing power), so they cannot be applied directly to processing large-scale mobile medical data. This poses a series of challenges for big data querying and processing in mobile medical systems.
It is well known that indexing has an important influence on the efficiency of large-scale data access. New spatial index methods needed to be incorporated into traditional database processing engines, which led to the R-tree structure. An R-tree index is equivalent to an extension of the two-dimensional B+ tree to multidimensional data environments. Many algorithms perform nearest neighbor (NN) queries on R-tree indexes, but all of them execute as a single thread on a single computer. When the data scale grows rapidly, a distributed database system is needed for indexing, data querying, and related processing.
This embodiment applies the distributed temporal index method based on Voronoi diagrams of Embodiment 1 or 2 to the field of mobile medical calling. Three kinds of medical call systems currently exist: bus-based medical intercom systems, IP-network semi-digital medical intercom systems, and IP-network medical information intercom systems. These systems are severely limited: they can only transmit information over short ranges, and they cannot operate if the patient is outside the transmission range. A medical call system that executes the described distributed temporal index method based on Voronoi diagrams is not subject to these limits, and it can effectively improve NN query efficiency over large ranges in a distributed environment. This makes the invention particularly important, especially for patients with sudden-onset diseases or patients who need closer attention and must be given better service. A device is also needed that better supports the services patients require and the communication between patients and medical staff, so as to provide a good medical environment.
A system that executes the distributed temporal index method based on Voronoi diagrams classifies patient information by attribute and builds the internal cluster points from it. When a patient uses the medical call system, the system analyzes the attributes of the patient information to determine what help the patient needs most at that moment: help that requires substantial medical knowledge, or help with difficulties in daily life. The patient information is then treated as discrete point data, and the point in the nearest Thiessen polygon is found. This yields the help the patient needs most, so that the patient receives the best help.
Because the invention, a system that executes the distributed temporal index method based on Voronoi diagrams, uses a multidimensional Voronoi index, the index supports spatial data integration, is suitable for indexing multidimensional data sets, and can support massive and multidimensional data sets. Ideal spatial object storage needs only a very small space: only the representative point information of each object has to be stored, which greatly reduces space cost and makes space efficiency very high, so patients can get help in time.
In another embodiment, the inverted Voronoi diagram index is built using Spark. Two medical data sets R and S are given in 3-dimensional space. R is the medical resource data set, which contains data reflecting medical resource information such as doctors, medical equipment, and locations. S is the patient data set, which contains data reflecting patient conditions such as case information and locations. Both data sets are uploaded to HDFS, and Spark splits them by its default mechanism. Several mappers run in parallel. In the Spark task we use the default reducer. Before the map function starts, we use a fast pre-clustering algorithm to obtain the representative points p of the medical resources in a region and load them into the main memory of each map task.
Then, in each map process, the input splits are read in turn using TextInputFormat (according to the input format in the distributed file system), which streams data from files into Mapper instances. The distance between each medical resource object r, each patient object s, and the points p is computed, and r and s are assigned to their closest representative point P. In the algorithm, each medical resource representative point is collected into a Voronoi cell, which produces m Voronoi cells. (In a real scenario, a large-scale collection of medical resources is divided into m medical areas of the same nature, such as a city medical center, and each area has one representative point for its medical resources, such as a Grade A hospital.) When executed, the program outputs <VC_m, List(P_i)> pairs: the mapper outputs each object r, s of the original data set (R or S) together with the id of its closest partition VC_m. We write the mapper output to the Spark file system through a custom MultipleOutputFormat function as our needs require. It determines how task results are written back to the underlying persistent storage. With the inverted medical IVI, given a query request from a patient, for example to find a hospital in nationwide medical data that meets the needs of diagnosing and treating a case, we can start a Spark task to partition the data and collect some data information about each partition. The key of the inverted index is used to find the medical resource representative point, that is, a representative hospital; the hospital's specific data are then used to find the relevant medical resources needed, which are fed back to the patient. In this way, the Spark data processing system can use thousands of computers to find the relevant data quickly and in a distributed manner from large-scale medical resources.

Claims (5)

1. a kind of distributed temporal index method based on Voronoi diagram, its characterization step are as follows:
Row's Voronoi indexes are made down using Spark structures, two datasets R and S, Spark in d dimension spaces is given and presses acquiescence machine System carries out burst, part mappers parallel operations simultaneously, using the reducer of acquiescence in Spark tasks, is starting map letters Before number, obtain representing point p using pre- clustering algorithm, and be loaded into each map main memory;
In each map treatment progress, the burst of input is read using TextInputFormat successively, TextInputFormat reads data into Mapper example from file, calculates each object r in data set R respectively, Each object s objects in data set S are with representing the distance between point p points, and by object r, s distributes to immediate representative Point P;The immediate representative point with m object r, an object r and any object s is all collected at a Voronoi in R In cell, m Voronoi cell is thus produced into as subregion, output<VCm, List (Pi)>It is right, query point p is given, is sentenced Its other closest subregion or most some neighbouring partition sets, mapper output initial data concentrate to closest subregion or Each object r, s and its subregion VC of closest partition setmId;Mapper is output to Spark file system.
2. the distributed temporal index method as claimed in claim 1 based on the row's of falling Thiessen polygon, it is characterised in that: One space is divided into multiple disjoint polygons by Voronoi diagram, the arest neighbors of some point in each polygon It is respectively positioned in the Voronoi cells where the point, each polygon in figure is referred to as the Voronoi unit associated with point p Lattice, any point in the cell where point p are all p arest neighbors.
3. the distributed temporal index method as claimed in claim 1 based on the row's of falling Thiessen polygon, it is characterised in that:Arrange Voronoi indexes include two parts:Master index, including all cluster centres;Second index, including it is stored in each subregion VC to as queue.
4. The distributed temporal index method based on the inverted Thiessen polygon as claimed in claim 1, characterised in that the representative points are obtained as follows: internal cluster points and adjacent points are determined; the data of the internal cluster points are clustered, and after clustering the cluster centres are selected for indexing; the required data are the adjacent points connected to an internal cluster point; a circle is drawn with that internal cluster point as its centre and containing the adjacent cluster centre points, and the triangle having this circle as its circumscribed circle is taken as a Delaunay triangle; in this method, Delaunay triangles are established for two different internal cluster points respectively, and the two Delaunay triangles, with the adjacent points as common points, form the Delaunay triangulation network; the data objects are divided into several large partitions, and one of the cluster representative points is selected as the representative point; each partitioned object is clustered into one Voronoi cell, and each Voronoi cell contains the object ids.
5. The distributed temporal index method based on the inverted Thiessen polygon as claimed in claim 4, characterised in that: the Voronoi diagram is given by VD(P) = {V(p1), V(p2), ..., V(pm)}, where VD(P) is the collection of Voronoi regions on P and V(p1) is the Voronoi region of p1; the set associated with all the given points is called the Voronoi diagram generated by P under the distance function Dist(); the Voronoi region of each point p necessarily contains all points q that are closer to p than to any other point, so the nearest neighbours of a query point q are found in a closed Voronoi region;
A Voronoi cell is defined over a region of the d-dimensional space R that contains n points, P = {p1, p2, ..., pn}; the region given by a partition VC, i.e. the region VC(pi) of the partition on point pi, is called the Voronoi cell associated with pi if it satisfies VC(pi) = {p | d(p, pi) ≤ d(p, pj)};
where p is a specified point or query point, d(p, pi) is the minimum Euclidean distance between p and pi, i and j are index variables, n ≥ 2, p1 ≠ p2, i ≠ j, and i, j ∈ In = {1, ..., n}; i takes every value in 1, ..., n, and for each value of i, j takes every value in 1, ..., n other than the current value of i.
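The cell condition in claim 5, d(p, pi) ≤ d(p, pj) for every j ≠ i, can be checked directly. The following is a small sketch under the assumption of Euclidean distance; the function names `in_voronoi_cell` and `cells_containing` and the sample sites are hypothetical and chosen only for illustration.

```python
import math

def in_voronoi_cell(p, i, sites):
    """True if p satisfies d(p, p_i) <= d(p, p_j) for every j != i."""
    d_i = math.dist(p, sites[i])
    return all(d_i <= math.dist(p, sites[j])
               for j in range(len(sites)) if j != i)

def cells_containing(p, sites):
    """Indices of every cell whose closed region contains p."""
    return [i for i in range(len(sites)) if in_voronoi_cell(p, i, sites)]

# Three hypothetical generator points p_1, p_2, p_3 (0-based in the code).
sites = [(0.0, 0.0), (4.0, 0.0), (0.0, 4.0)]

interior = cells_containing((1.0, 1.0), sites)   # strictly inside one cell
boundary = cells_containing((2.0, 0.0), sites)   # equidistant from two sites
```

Because the condition uses ≤ rather than <, each cell is closed and a point on a shared edge belongs to both neighbouring cells, which is why the second query returns two indices.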
CN201710976133.6A 2017-10-19 2017-10-19 Distributed temporal index method based on Voronoi diagram Pending CN107766495A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710976133.6A CN107766495A (en) 2017-10-19 2017-10-19 Distributed temporal index method based on Voronoi diagram

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201710976133.6A CN107766495A (en) 2017-10-19 2017-10-19 Distributed temporal index method based on Voronoi diagram

Publications (1)

Publication Number Publication Date
CN107766495A (en) 2018-03-06

Family

ID=61269756

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710976133.6A Pending CN107766495A (en) 2017-10-19 2017-10-19 Distributed temporal index method based on Voronoi diagram

Country Status (1)

Country Link
CN (1) CN107766495A (en)

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102253961A (en) * 2011-05-17 2011-11-23 复旦大学 Method for querying road network k aggregation nearest neighboring node based on Voronoi graph
CN102495877A (en) * 2011-12-05 2012-06-13 中国科学院地理科学与资源研究所 Technique method of lake nutrient zoology zone boundary identification
CN103164529A (en) * 2012-11-01 2013-06-19 沈阳建筑大学 Reverse k nearest neighbor query method based on Voronoi pictures
CN106203449A (en) * 2016-07-08 2016-12-07 大连大学 The approximation space clustering system of mobile cloud environment
CN106548141A (en) * 2016-11-01 2017-03-29 南京大学 A kind of object-oriented farmland information extraction method based on the triangulation network
CN107025685A (en) * 2017-04-11 2017-08-08 南京林业大学 Airborne building summit cloud modeling method under topology ambiguity

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
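Wu Xiaobing (吴晓兵): "Research on a distributed reverse nearest neighbor query method based on Voronoi diagrams" (基于Voronoi图的分布式反最近邻查询方法研究), China Master's Theses Full-text Database, Information Science and Technology *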

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113191158A (en) * 2021-05-21 2021-07-30 润联软件系统(深圳)有限公司 Voronoi diagram-based training sample masking method, device and related equipment
CN113191158B (en) * 2021-05-21 2021-10-26 润联软件系统(深圳)有限公司 Voronoi diagram-based training sample masking method, device and related equipment

Similar Documents

Publication Publication Date Title
JP2017523485A (en) Techniques for tiling location-based information with server control
CN106777093A (en) Skyline inquiry systems based on space time series data stream application
CN107818147A (en) Distributed temporal index system based on Voronoi diagram
CN107658010A (en) Portable medical querying method and application based on the inverted Thiessen polygon index
Ho et al. Distributed graph database for large-scale social computing
CN105117497A (en) Ocean big data master-slave index system and method based on Spark cloud network
CN103886508A (en) Mass farmland data monitoring method and system
Zeng et al. Data visualization for air quality analysis on bigdata platform
Li et al. Efficient subspace skyline query based on user preference using MapReduce
Shemshadi et al. Searching for the internet of things: where it is and what it looks like
CN116166191A (en) Integrated system of lake and storehouse
Xia et al. DAPR-tree: a distributed spatial data indexing scheme with data access patterns to support Digital Earth initiatives
CN107766495A (en) Distributed temporal index method based on Voronoi diagram
Khedr Decomposable algorithm for computing k-nearest neighbours across partitioned data
Akdogan et al. ToSS-it: A cloud-based throwaway spatial index structure for dynamic location data
Raghav et al. Bigdata fog based cyber physical system for classifying, identifying and prevention of SARS disease
US11836637B2 (en) Construction method of human-object-space interaction model based on knowledge graph
CN107679216A (en) The distributed temporal index method of the inverted Thiessen polygon of portable medical and application
CN116993555A (en) Partition method, system and storage medium for identifying territory space planning key region
CN107766496A (en) Based on MapReduce and the extensive NN Query system for arranging Thiessen polygon
CN107844532A (en) Based on MapReduce and the extensive nearest Neighbor for arranging Thiessen polygon
Wang et al. A novel visual analytics approach for clustering large-scale social data
CN108153910B (en) Establishing distributed space-time multidimensional indexing system for mobile medical service
Weipeng et al. Research on improved method of storage and query of large-scale remote sensing images
Soundararaj et al. Medium data toolkit-A case study on smart street Sensor project

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20180306
