CN108737185A - A kind of triangle count method and device in datagram stream based on random sampling - Google Patents
A kind of triangle count method and device in datagram stream based on random sampling Download PDFInfo
- Publication number
- CN108737185A CN108737185A CN201810499136.XA CN201810499136A CN108737185A CN 108737185 A CN108737185 A CN 108737185A CN 201810499136 A CN201810499136 A CN 201810499136A CN 108737185 A CN108737185 A CN 108737185A
- Authority
- CN
- China
- Prior art keywords
- subgraph
- sampling
- triangle
- intermediate cam
- raw
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L41/00—Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
- H04L41/14—Network analysis or design
- H04L41/142—Network analysis or design using statistical or mathematical methods
Landscapes
- Physics & Mathematics (AREA)
- Algebra (AREA)
- General Physics & Mathematics (AREA)
- Mathematical Analysis (AREA)
- Mathematical Optimization (AREA)
- Mathematical Physics (AREA)
- Probability & Statistics with Applications (AREA)
- Pure & Applied Mathematics (AREA)
- Engineering & Computer Science (AREA)
- Computer Networks & Wireless Communication (AREA)
- Signal Processing (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
The present invention relates to technical field of data processing, the triangle count method and device in a kind of datagram stream based on random sampling is provided, this method includes:Side in the raw-data map stream of reception is sampled to obtain subgraph, and calculates and retains ratio;The quantity of the subgraph intermediate cam shape of sampling acquisition is counted;The quantity of the subgraph intermediate cam shape obtained according to statistics and the raw-data map stream intermediate cam figurate number amount retained than calculating reception.The present invention obtains subgraph by flowing into line sampling to raw-data map, and the triangle count for restoring artwork according to the triangle count result of subgraph shortens the execution time of algorithm as a result, to reduce the data volume of processing.
Description
Technical field
The present invention relates to the triangles in technical field of data processing more particularly to a kind of datagram stream based on random sampling
Shape method of counting and device.
Background technology
It is increasingly permeating to internet in our life, the generation speed and data volume of data are also growing day by day, very
To the increase for being all not enough to description data volume with " magnanimity ", " explosive increase " etc., however the speed that we handle data does not have but
The growth rate for the data that can coincide, the value information implied in big data are far from fully being excavated and being utilized by us.Closely
Over a little years, the research tendency of internet is developed towards the direction that can handle mass data.Also, as the weight of big data
The data flow of one of Deta bearer form is wanted, the concern of researcher and enterprise is increasingly caused.
In actual life, the data packet in website visiting request data stream, the image data stream of satellite backhaul, network monitor
Stream, share certificate many data of real-time fluctuations etc. be all to exist in the form of data flow, and it is universal with Internet of Things,
The data magnitude of data flow will improve several orders of magnitude, this is greatly to test to the requirement of data handled in real time.So
The research of data Mining stream can find many application scenarios in reality, can greatly be pushed to the research of data flow
The development of related industry improves enterprises production efficiency, scientific data analysis efficiency and the quality of life etc. of numerous residents.
Trigometric analysis is an interesting problem in diagram data analysis, while being also a more traditional problem,
Application on social networks is particularly extensive.For example, in social networks, A recognizes mutually with B, C, and B and C is also to recognize mutually
Two people known, then A, B and C three just constitute a triangle.Similar, if in a social networks, possess more
More triangle, then the contact of personnel is also closer in this social networks.The targeted abstract problem of the present invention is
Calculate the triangle number in a datagram stream.
Triangle count problem has many solutions, but both for the exact algorithm of small-scale data, for sea
The solution for measuring data flow is now also fewer and fewer, and count accuracy is to be improved.
Invention content
The technical problem to be solved in the present invention is, for the existing triangle count method lacked for mass data flow
Defect, provide the triangle count method and device in a kind of datagram stream based on random sampling.
In order to solve the above technical problem, the present invention provides the triangle counts in the datagram stream based on random sampling
Method, including:
1) side in the raw-data map stream of reception is sampled to obtain subgraph, and calculates and retains ratio;
2) quantity of the subgraph intermediate cam shape obtained to sampling counts;
3) quantity of the subgraph intermediate cam shape obtained according to statistics and the raw-data map stream retained than calculating reception
Intermediate cam figurate number amount.
Optionally, the step 1) includes:
After being sampled to the side in the raw-data map stream of reception using the cistern methods of sampling, pass through following formula meter
Calculation, which retains, compares α:
Wherein m is by the end of the total quantity for receiving the side that window receives altogether, and k is the subgraph that the cistern methods of sampling extracts
The quantity on middle side.
Optionally, the raw-data map stream intermediate cam figurate number amount N of reception is calculated by the following formula in the step 3):
N=n α3
Wherein, n is the quantity for the subgraph intermediate cam shape that statistics obtains.
Optionally, included the following steps using the quantity of multiple knot method statistic subgraph intermediate cam shape in the step 2):
V is combined for arbitrary three vertex in the multiple knot set of subgraphi,vj,vk, judge whether it is triangle and
vi<vj<vk, it is that triangle count adds 1;
For the arbitrary a line (v in subgraphi,vj), if viIt is not multiple knot and vi<vj, then for viAny neighbour
Meet vertex uiIf triangle (vi,ui,vj) exist and vi<ui<vj, then triangle count add 1.
Optionally, described retain than α is 4.
Triangle count device in the present invention also provides a kind of datagram stream based on random sampling, includes at least:
Sampling unit, subgraph statistic unit and artwork evaluation unit;
The sampling unit is sampled to obtain subgraph for the side in the raw-data map stream to reception, and calculates and deposit
Stay ratio;
The quantity of the subgraph statistic unit, the subgraph intermediate cam shape for being obtained to sampling counts;
The artwork evaluation unit, the quantity of subgraph intermediate cam shape for being obtained according to statistics and described retains than calculating
The raw-data map stream intermediate cam figurate number amount of reception.
Optionally, the sampling unit takes out the side in the raw-data map stream of reception using the cistern methods of sampling
After sample, it is calculated by the following formula to retain and compares α:
Wherein m is by the end of the total quantity for receiving the side that window receives altogether, and k is the subgraph that the cistern methods of sampling extracts
The quantity on middle side.
Optionally, the raw-data map stream intermediate cam figurate number of reception is calculated by the following formula in the artwork evaluation unit
Measure N:
N=n α3
Wherein, n is the quantity for the subgraph intermediate cam shape that statistics obtains.
Optionally, the quantity of multiple knot method statistic subgraph intermediate cam shape is used in the subgraph statistic unit, wherein:
V is combined for arbitrary three vertex in the multiple knot set of subgraphi,vj,vk, judge whether it is triangle and
vi<vj<vk, it is that triangle count adds 1;
For the arbitrary a line (v in subgraphi,vj), if viIt is not multiple knot and vi<vj, then for viAny neighbour
Meet vertex uiIf triangle (vi,ui,vj) exist and vi<ui<vj, then triangle count add 1.
Optionally, described retain than α is 4.
Implement the triangle count method and device in the datagram stream provided in an embodiment of the present invention based on random sampling,
At least have the advantages that:
1, the present invention obtains subgraph by flowing into line sampling to raw-data map, and according to the triangle count result of subgraph
The triangle count of artwork is restored as a result, to reduce the data volume of processing, shortens the execution time of algorithm;
2, present invention selection is sampled the side of figure, rather than is sampled to the vertex of figure, avoids opposite vertexes
The integrally-built change for the figure that sampling is easy to cause;And the run time of algorithm is related to the quantity on side, is calculated after sampling
The execution time of method also greatly reduces;
3, the present invention calculates the number of triangles of subgraph using multiple knot method, first calculates the quantity of multiple knot triangle,
It counts the quantity of non-multiple knot triangle again, is the number of triangles of subgraph after the two, the computational methods efficiency and accurate
Property is relatively high.
Description of the drawings
Fig. 1 is the stream for the triangle count method that the embodiment of the present invention one is provided in the datagram stream based on random sampling
Cheng Tu;
Fig. 2 is that the embodiment of the present invention three provides showing for triangle count device in the datagram stream based on random sampling
It is intended to;
Fig. 3 a, 3b, 3c and 3d are respectively the quantity k on sampling side under different datagram stream size for triangle count standard
The influence of exactness;
Fig. 4 a, 4b and 4c are respectively approximation of the invention under different datagram stream size and the exact value that actually measures
Comparison figure.
In figure:201:Sampling unit;202:Subgraph statistic unit;203:Artwork evaluation unit.
Specific implementation mode
In order to make the object, technical scheme and advantages of the embodiment of the invention clearer, below in conjunction with the embodiment of the present invention
In attached drawing, technical scheme in the embodiment of the invention is clearly and completely described, it is clear that described embodiment is
A part of the embodiment of the present invention, instead of all the embodiments.Based on the embodiments of the present invention, ordinary skill people
The every other embodiment that member is obtained without making creative work, shall fall within the protection scope of the present invention.
Embodiment one
As shown in Figure 1, the triangle count method in the datagram stream provided in an embodiment of the present invention based on random sampling,
It may comprise steps of:
Step S101:Side in the raw-data map stream of reception is sampled to obtain subgraph, and calculates and retains ratio.It is preferred that
Ground passes through following public affairs after being sampled to the side in the raw-data map stream of reception using the cistern methods of sampling in the step
Formula calculating, which retains, compares α:
Wherein m is by the end of the total quantity for receiving the side that window receives altogether, and k is the subgraph that the cistern methods of sampling extracts
The quantity on middle side.The present invention is not limited to use the cistern methods of sampling, other this field basic technology personnel can also be used
The known and applicable methods of sampling.
Step S102:The quantity n of the subgraph intermediate cam shape of sampling acquisition is counted.
Step S103:The quantity of the subgraph intermediate cam shape obtained according to statistics and the original number retained than calculating reception
Intermediate cam figurate number amount is flowed according to figure.Preferably, it is calculated by the following formula three in the raw-data map stream of reception in step S103
Angular quantity N:
N=n α3
Wherein, n is the quantity for the subgraph intermediate cam shape that statistics obtains.
Due to huge datagram stream be likely to can not disposable loading enter memory, in order to ensure three for datagram stream
Angular method of counting normal operation, the present invention sacrifices accuracy to a certain extent, by the way of sampling, is estimated with subgraph
Whole number of triangles, and the precise results of subgraph triangle count can be calculated in a short time.
Due to the real-time arrival of data flow data, data flow total length can not be predicted in advance, but be known that by the end of
Until receiving window, the data total number that receives.Since the sampling of opposite vertexes can lead to the integrally-built change of figure, so
Selection is sampled the side in figure in the present invention.It will also realize that by above-mentioned analysis, the run time of the method for the present invention and side
Related, so, once reducing the quantity on side in figure, the execution time of algorithm can also greatly reduce.
Therefore, the classical cistern methods of sampling may be used in the present embodiment, it is assumed that want to preserve side in memory
Integrate as k, is m by the end of receiving window to have the total quantity on the side received altogether.After carrying out cistern sampling, it is known that in figure stream
Any a line, the probability that finally can be able to retain in memory are
Since each edge probability present in subgraph is in artworkAnd the selection on each side is random independent
, then probability of the three sides of a triangle all in subgraph is exactly in artworkSo, it is only necessary to by subgraph
In information carry out triangle statistics and calculate, so that it may to estimate artwork intermediate cam shape by the quantity for counting subgraph intermediate cam shape
Quantity.It enablesTo retain ratio, then the number of triangles in artwork can be multiplied by α with the number of triangles in subgraph3Come
Estimated.
Embodiment two
On the basis of embodiment one provides the triangle count method in the datagram stream based on random sampling, wherein
The process that the quantity n of the subgraph intermediate cam shape obtained to sampling in step S102 is counted, specifically can be in the following way
It realizes:
The triangle number purpose method of calculating subgraph employed in step S102 is multiple knot method, so-called multiple knot,
Refer to if the degree on a vertex is not less thanSo this vertex is multiple knot, can according to the quantitative relation on vertex and side
Multiple knot number is obtained to be no more thanAs soon as if three vertex of triangle are all multiple knots, then this triangle claims
Make multiple knot triangle.This method is divided into three steps:
1, preprocessing part:Firstly the need of the degree and preservation for calculating each vertex;And two vertex of each edge are established
Index so that can determine whether there is side between two vertex by index later;Each vertex is established and is indexed, i.e., given top
Point returns to the adjacent vertex on the vertex.
2, the statistics of multiple knot triangle:The degree that vertex is calculated from subgraph is not less thanVertex be multiple knot, and
Constitute the multiple knot set of the subgraph.V is combined for arbitrary three vertex in the multiple knot set of subgraphi,vj,vk, judgement is
No triangle and vi<vj<vk, it is that triangle count adds 1.The heavy triangle can energy level be O (m3/2).In the present invention
Vertex size relatively refer to first comparing the number of degrees, number is compared if the number of degrees are identical.
3, the statistics of non-multiple knot triangle:For the arbitrary a line (v in subgraphi,vj), if viIt is not multiple knot
And vi<vj, then for viAny adjacent vertex uiIf triangle (vi,ui,vj) exist and vi<ui<vj, then triangle count
Add 1.
In one embodiment of the invention, each edge (v can also be first considered in the stepi,vj), if vi,vjAll it is
Multiple knot, then ignoring this edge;If viIt is not multiple knot, and vi<vj, then, it is found using index before and viIt is adjacent
All vertex u1, u2..., uk, whereinTo each adjacent vertex ui, i=1,2 ..., k judge side using index
(ui,vj) whether there is, if in the presence of and vi<uiThen to triangle (vi,ui,vj)) carry out statistical counting.
The number of triangles in subgraph is calculated by multiple knot method, it, can be also by being multiplied by a proportionality coefficient
Original goes out the quantity survey of artwork intermediate cam shape.
The pseudocode of figure sampling algorithm is as follows:
Input:Scheme G (V, E), parameter k
Output:Figure G ' (V ', E ') after sampling, total number of edges m
The purpose of above-mentioned figure sampling algorithm is the triangle statistics after facilitating in order to sample artwork at a subgraph,
The input of algorithm is artwork and requires the quantity k on side finally retained, the side in subgraph and artwork after exporting as sampling it is total
Quantity.Counter is counter, and can reflect current screening is which side of data flow.
The pseudocode that the triangle approximation counts is as follows:
Input:Figure G ' after sampling, parameter k, m
Output:The estimation Δ G ' that figure G intermediate cam shapes are counted
The last output result of the algorithm is the finally quantity survey to artwork intermediate cam shape, and input is after sampling
Subgraph, first cyclic part are statistics multiple knot triangles, and the second part is the non-multiple knot triangle of statistics, Rule of judgment
In the comparison of node size refer to the number of degrees of node, if the number of degrees are identical, the number of comparison node, in order to last
The result of statistics does not repeat, some counting of also not leaking down.
Embodiment three
As shown in Fig. 2, the triangle count device in the datagram stream provided in an embodiment of the present invention based on random sampling,
May include:Sampling unit 201, subgraph statistic unit 202 and artwork evaluation unit 203;
Sampling unit 201 obtains subgraph for being sampled for the side in the raw-data map stream to reception, and calculates
Retain ratio.The operation that the sampling unit 201 executes is identical as abovementioned steps S101;
The quantity of subgraph statistic unit 202, the subgraph intermediate cam shape for being obtained to sampling counts.The subgraph counts
The operation that unit 202 executes is identical as abovementioned steps S102;
Artwork evaluation unit 203, the quantity of subgraph intermediate cam shape for being obtained according to statistics and described retains than calculating
The raw-data map stream intermediate cam figurate number amount of reception.The operation that the artwork evaluation unit 203 executes is identical as abovementioned steps S103.
Preferably, sampling unit 201 takes out the side in the raw-data map stream of reception using the cistern methods of sampling
After sample, it is calculated by the following formula to retain and compares α:
Wherein m is by the end of the total quantity for receiving the side that window receives altogether, and k is the subgraph that the cistern methods of sampling extracts
The quantity on middle side.
Preferably, the raw-data map stream intermediate cam figurate number of reception is calculated by the following formula in artwork evaluation unit 203
Measure N:
N=n α3
Wherein, n is the quantity for the subgraph intermediate cam shape that statistics obtains.
Preferably, the quantity of multiple knot method statistic subgraph intermediate cam shape is used in subgraph statistic unit 202, wherein:
V is combined for arbitrary three vertex in the multiple knot set of subgraphi,vj,vk, judge whether it is triangle and
vi<vj<vk, it is that triangle count adds 1;
For the arbitrary a line (v in subgraphi,vj), if viIt is not multiple knot and vi<vj, then for viAny neighbour
Meet vertex uiIf triangle (vi,ui,vj) exist and vi<ui<vj, then triangle count add 1.
It is further to note that the triangle meter in the datagram stream provided in an embodiment of the present invention based on random sampling
Counting apparatus can also be realized by software realization by way of hardware or software and hardware combining.It is implemented in software to be
Example, as shown in Fig. 2, as the device on a logical meaning, being will be in nonvolatile memory by the CPU of equipment where it
Corresponding computer program instructions read what operation in memory was formed.
The time complexity of the present invention is analyzed as follows:
For multiple knot method:First, it is that (m refers to the number of edges of subgraph to O (m) that preprocessing part, which establishes index time complexity,
Amount);Multiple knot triangle count, since multiple knot number is no more thanTherefore the combination of all multiple knot triangles is total
There are O (m3/2) kind, i.e. time complexity is O (m3/2);Other triangle count parts, number of edges are up to m, and Therefore judge that all possibilities need O (m3/2);Therefore total time complexity of multiple knot algorithm
For O (m3/2)。
The quantity on the side of subgraph is determined by sampling algorithm, and sampling process synchronous with the reading of data flow can carry out,
So being not counted in time complexity here, it is assumed that the quantity on the side retained in memory after sampling is k, then brings k into epimeres and answer
Miscellaneous degree, the final time complexity that can obtain this algorithm are O (k3/2)。
The present invention also verifies effect by experiment.In experiment, n indicates the vertex quantity of figure in datagram stream, m
Indicate the quantity on side in datagram stream.Experimental data is based on random self-generating data.If Fig. 3 a, 3b, 3c and 3d are respectively difference
Datagram stream size under sampling side influences of the quantity k for triangle count accuracy.Wherein horizontal axis is the number on sampling side
K is measured, the longitudinal axis is triangle count accuracy.It can be seen from upper curve in varied situations, as parameter k is to the approach of m
In the process, the accuracy rate of algorithm gradually rises, also, figure is denser, and curve climbing speed is faster.
Fig. 4 a, 4b and 4c are respectively approximation of the invention under different datagram stream size and the exact value that actually measures
Comparison.It is 4 that order, which is retained than α,.Horizontal axis is the quantity on the side of figure in figure, and the longitudinal axis is number of triangles.At each location point, the left side
For the exact value (also referred to as actual value) obtained by accurate metering algorithm counts, the right is three calculated according to the method for the present invention
Angular count value (also referred to as approximation).It is fine in the propinquity effect of small-scale data that algorithm is can be seen that from the above chart,
Accuracy rate can averagely reach 99% or more.
Finally it should be noted that:The above embodiments are merely illustrative of the technical solutions of the present invention, rather than its limitations;Although
Present invention has been described in detail with reference to the aforementioned embodiments, it will be understood by those of ordinary skill in the art that:It still may be used
With technical scheme described in the above embodiments is modified or equivalent replacement of some of the technical features;
And these modifications or replacements, various embodiments of the present invention technical solution that it does not separate the essence of the corresponding technical solution spirit and
Range.
Claims (10)
1. a kind of triangle count method in datagram stream based on random sampling, which is characterized in that including:
1) side in the raw-data map stream of reception is sampled to obtain subgraph, and calculates and retains ratio;
2) quantity of the subgraph intermediate cam shape obtained to sampling counts;
3) it the quantity of subgraph intermediate cam shape that is obtained according to statistics and described retains than calculating three in the raw-data map stream received
Angular quantity.
2. according to the method described in claim 1, it is characterized in that, the step 1) includes:
After being sampled to the side in the raw-data map stream of reception using the cistern methods of sampling, it is calculated by the following formula and deposits
It stays and compares α:
Wherein m is by the end of the total quantity for receiving the side that window receives altogether, and k is side in the subgraph that the cistern methods of sampling extracts
Quantity.
3. according to the method described in claim 2, it is characterized in that, being calculated by the following formula the original of reception in the step 3)
Beginning datagram stream intermediate cam figurate number amount N:
N=n α3
Wherein, n is the quantity for the subgraph intermediate cam shape that statistics obtains.
4. according to the method described in claim 1, it is characterized in that, using in multiple knot method statistic subgraph in the step 2)
The quantity of triangle, includes the following steps:
V is combined for arbitrary three vertex in the multiple knot set of subgraphi,vj,vk, judge whether triangle and vi<vj<
vk, it is that triangle count adds 1;
For the arbitrary a line (v in subgraphi,vj), if viIt is not multiple knot and vi<vj, then for viAny adjacent top
Point uiIf triangle (vi,ui,vj) exist and vi<ui<vj, then triangle count add 1.
5. according to the method described in claim 1, it is characterized in that, described retain than α is 4.
6. the triangle count device in a kind of datagram stream based on random sampling, which is characterized in that include at least:Sampling is single
Member, subgraph statistic unit and artwork evaluation unit;
The sampling unit is sampled to obtain subgraph for the side in the raw-data map stream to reception, and calculates and retain ratio;
The quantity of the subgraph statistic unit, the subgraph intermediate cam shape for being obtained to sampling counts;
The artwork evaluation unit, the quantity of the subgraph intermediate cam shape for being obtained according to statistics and described retain receive than calculating
Raw-data map stream intermediate cam figurate number amount.
7. device according to claim 6, which is characterized in that the sampling unit is using the cistern methods of sampling to receiving
Raw-data map stream in side be sampled after, be calculated by the following formula to retain and compare α:
Wherein m is by the end of the total quantity for receiving the side that window receives altogether, and k is side in the subgraph that the cistern methods of sampling extracts
Quantity.
8. device according to claim 7, which is characterized in that be calculated by the following formula and connect in the artwork evaluation unit
The raw-data map stream intermediate cam figurate number amount N of receipts:
N=n α3
Wherein, n is the quantity for the subgraph intermediate cam shape that statistics obtains.
9. device according to claim 6, which is characterized in that use multiple knot method statistic in the subgraph statistic unit
The quantity of subgraph intermediate cam shape, wherein:
V is combined for arbitrary three vertex in the multiple knot set of subgraphi,vj,vk, judge whether triangle and vi<vj<
vk, it is that triangle count adds 1;
For the arbitrary a line (v in subgraphi,vj), if viIt is not multiple knot and vi<vj, then for viAny adjacent top
Point uiIf triangle (vi,ui,vj) exist and vi<ui<vj, then triangle count add 1.
10. device according to claim 6, which is characterized in that described retain than α is 4.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810499136.XA CN108737185A (en) | 2018-05-23 | 2018-05-23 | A kind of triangle count method and device in datagram stream based on random sampling |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810499136.XA CN108737185A (en) | 2018-05-23 | 2018-05-23 | A kind of triangle count method and device in datagram stream based on random sampling |
Publications (1)
Publication Number | Publication Date |
---|---|
CN108737185A true CN108737185A (en) | 2018-11-02 |
Family
ID=63935161
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201810499136.XA Pending CN108737185A (en) | 2018-05-23 | 2018-05-23 | A kind of triangle count method and device in datagram stream based on random sampling |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN108737185A (en) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113448732A (en) * | 2021-07-09 | 2021-09-28 | 北京睿芯高通量科技有限公司 | Triangle counting method and device for large-scale flow graph online self-adaptive sampling space |
CN114389961A (en) * | 2022-01-14 | 2022-04-22 | 北京睿芯高通量科技有限公司 | Graph flow triangle counting method and device based on node heat sampling |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR20130066352A (en) * | 2011-12-12 | 2013-06-20 | 건국대학교 산학협력단 | System and method for triangle counting sampling by using map-reduce |
CN104268270A (en) * | 2014-10-13 | 2015-01-07 | 北京建筑大学 | Map Reduce based method for mining triangles in massive social network data |
CN107786388A (en) * | 2017-09-26 | 2018-03-09 | 西安交通大学 | A kind of abnormality detection system based on large scale network flow data |
-
2018
- 2018-05-23 CN CN201810499136.XA patent/CN108737185A/en active Pending
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR20130066352A (en) * | 2011-12-12 | 2013-06-20 | 건국대학교 산학협력단 | System and method for triangle counting sampling by using map-reduce |
CN104268270A (en) * | 2014-10-13 | 2015-01-07 | 北京建筑大学 | Map Reduce based method for mining triangles in massive social network data |
CN107786388A (en) * | 2017-09-26 | 2018-03-09 | 西安交通大学 | A kind of abnormality detection system based on large scale network flow data |
Non-Patent Citations (2)
Title |
---|
DURU TÜRKOGLU等: "Edge-Based Wedge Sampling to Estimate Triangle Counts in Very Large Graphs", 《2017 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM)》 * |
LIVEWAY6: "社交网络图挖掘4--三角形计数问题", 《HTTPS://BLOG.CSDN.NET/LIVEWAY6/ARTICLE/DETAILS/78345022》 * |
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113448732A (en) * | 2021-07-09 | 2021-09-28 | 北京睿芯高通量科技有限公司 | Triangle counting method and device for large-scale flow graph online self-adaptive sampling space |
CN113448732B (en) * | 2021-07-09 | 2022-07-01 | 北京睿芯高通量科技有限公司 | Triangle counting method and device for large-scale flow graph online self-adaptive sampling space |
CN114389961A (en) * | 2022-01-14 | 2022-04-22 | 北京睿芯高通量科技有限公司 | Graph flow triangle counting method and device based on node heat sampling |
CN114389961B (en) * | 2022-01-14 | 2024-03-08 | 北京中科通量科技有限公司 | Graph flow triangle counting method and device based on node heat sampling |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Ahmed et al. | Graph sample and hold: A framework for big-graph analytics | |
WO2020207214A1 (en) | Data processing method and apparatus, electronic device and storage medium | |
WO2019100967A1 (en) | Method and device for identifying social group having abnormal transaction activity | |
CN107247259B (en) | K distribution sea clutter shape parameter estimation method based on neural network | |
CN108446711A (en) | A kind of Software Defects Predict Methods based on transfer learning | |
CN110457486A (en) | The people entities alignment schemes and device of knowledge based map | |
CN110110529B (en) | Software network key node mining method based on complex network | |
Wang et al. | Approximately counting triangles in large graph streams including edge duplicates with a fixed memory usage | |
CN107786388A (en) | A kind of abnormality detection system based on large scale network flow data | |
WO2016090877A1 (en) | Generalized maximum-degree random walk graph sampling algorithm | |
CN108737185A (en) | A kind of triangle count method and device in datagram stream based on random sampling | |
CN110111113A (en) | A kind of detection method and device of exception transaction node | |
CN108647261B (en) | Global contour line drawing method based on meteorological data discrete point gridding processing | |
CN107316301A (en) | Concentration of suspended particles on-line detecting system in a kind of easily sewage | |
CN109347662B (en) | The quick digging system of distributed social network structure towards large-scale network traffic | |
CN103902798B (en) | Data preprocessing method | |
CN110689055B (en) | Cross-scale statistical index spatialization method considering grid unit attribute grading | |
CN106971107B (en) | Safety grading method for data transaction | |
CN113408808B (en) | Training method, data generation device, electronic equipment and storage medium | |
CN110907327A (en) | Complex fracture network equivalent method considering equivalent efficiency optimization | |
CN110083777B (en) | Social network user grouping method and device and server | |
CN109039797A (en) | Big stream detection method based on intensified learning | |
CN101834901B (en) | Network coordinate system input delay pre-treatment method based on t detection model | |
WO2016086634A1 (en) | Reject rate-controllable metropolis-hastings graph sampling algorithm | |
CN108830444A (en) | A kind of assessment of raob data and modification method and device |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20181102 |
|
RJ01 | Rejection of invention patent application after publication |