An interactive theme river visualization method
Technical field
The present invention relates to the field of information visualization, and in particular to an interactive theme river visualization method.
Background art
In the Internet era, people's work and daily life generate data continuously and at a rapid pace. The ability to produce data has grown far faster than the ability to analyze it. How to process and analyze these data, so as to find valuable information in them and support decision making, is an important problem today. Visualization is a data analysis technique that covers the processing, analysis and presentation of data. It is more than simply rendering data as charts: it incorporates human-computer interaction, taking the understanding that users gain from the images through their own professional analysis as feedback, and thereby helps users discover knowledge in the data. As the importance of data analysis has grown in recent years, visualization has been widely applied in many fields such as physics, biology and medicine. Time series data is a common form of data, characterized by an attribute in the time dimension; the key to visualizing time series data is to show how the data change over time.
Theme river is a visualization form widely used for time series data. It was proposed in 2000 by Susan Havre, Beth Hetzler, et al., and is a metaphor in which a group of themes is presented as individual rivers. The rivers are stacked on one another; each river is placed along the time axis, and its width changes over time. The width of a river at a given moment represents the value of that theme at that moment, and the change in river width is the theme's pattern of change along the time dimension. Theme river is an efficient visual form: it shows how the whole and the individual themes change over time, while also presenting to the user the numerical value of a theme at a given moment.
Although a theme river can show the variation trend of all themes together, it cannot intuitively show how a single theme changes over time, because a theme river is essentially a stacked graph: all themes are stacked in some order to form the overall view. Because the rivers are stacked, the shape and fluctuation trend of a single river are inevitably disturbed by the rivers beneath it. In fact, in the theme river model, the fluctuation trend of every river is the sum of its own fluctuation and the fluctuations of all rivers below it. To obtain the value of a single river, the user has to perform a subtraction, which increases the complexity of reading the graph and easily leads to misreading. Existing optimization methods only optimize the ordering of the rivers, or use an "inside-out" layout that expands outward from the center, to reduce the overall fluctuation; when the data volume is large, however, these approaches still cannot resolve the misreading caused by local fluctuations of the rivers. The present invention therefore proposes the following improvements to the traditional theme river model:
1. The degree of fluctuation of each river is evaluated with the coefficient of dispersion, and rivers with small fluctuation are placed in the lower layers to reduce their influence on the upper rivers.
2. The concept of an ambiguous point is proposed. Fluctuation intensity and contribution degree are offered to the user as parameters; the user sets them in the interactive interface as needed, and points likely to cause ambiguity are screened out as ambiguous points and dynamically marked in the graph.
3. An ambiguous point marker is designed, whose content includes the peak or trough information of the point and the theme with the highest contribution degree.
4. Personalized reordering of the theme river: based on the ambiguous point information, the user can change the order of the rivers by dragging, reaching a personalized, locally optimal ordering.
Summary of the invention
In view of the problems of existing theme rivers, the object of the present invention is to provide an interactive theme river visualization method that enables users to reorder the theme river as needed and achieve personalized graph reading.
The specific implementation steps are as follows:
An interactive theme river visualization method: for the given themes, calculate the coefficient of dispersion of each theme's time series data and initially sort the themes by coefficient of dispersion; calculate all extreme points of the theme river; calculate the fluctuation intensity and contribution degree of each extreme point; obtain the fluctuation intensity screening ratio and contribution degree screening ratio selected by the user, and screen the extreme points to obtain the ambiguous points; design a visual marker for the ambiguous points and display it in the interface; after the user has obtained the ambiguous point marker information, reorder the rivers in a personalized way through interaction. This optimizes the ordering of the theme river and provides the user with an interactive means of rearranging rivers. The method includes the following main steps:
Step 1: Read in the themes topic_i (i = 1, 2, ..., n) and the values y_{it} (t = 1, 2, ..., m), where topic_i denotes the i-th theme, n is the total number of themes, y_{it} is the value of topic_i at moment t, and m is the number of moments;
Step 2: Optimize the ordering of the themes;
Step 2.1: For each theme topic_i, calculate the coefficient of dispersion CV_i over all of its values y_{it};
Step 2.2: Sort the themes by coefficient of dispersion: the smaller CV_i is, the lower topic_i is placed, closer to the time axis;
Step 3: Obtain the set of extreme points;
Step 3.1: Calculate the value wy_t of Total at each moment t, where Total denotes the superposition of the heights of all themes in the set at the corresponding moment; in the visualization it can be regarded as the top contour of the theme river. wy_t is calculated as follows:
wy_t = \sum_{i=1}^{n} y_{it}
Step 3.2: Search for extreme points in the set wy_Set of wy_t values: when wy_t > wy_{t+1} and wy_t > wy_{t-1}, or wy_t < wy_{t+1} and wy_t < wy_{t-1}, t is an extreme point; let the set of extreme points be wExt_Set;
Step 4: Calculate the local fluctuation intensity of each extreme point; the local fluctuation intensity fluctuation of Total is the gradient between two adjacent extreme points;
Step 5: Calculate the maximum contribution degree of each extreme point;
Step 5.1: Calculate the contribution degree c_{ki} of topic_i at moment wExt_t_k, where c_{ki} is the ratio of the fluctuation intensity fluctuation_{ik} of topic_i at moment wExt_t_k to the fluctuation intensity fluctuation_k of Total at moment wExt_t_k;
Step 5.2: Sort the contribution degrees of all themes at moment wExt_t_k; the theme with the highest contribution degree is denoted topic_{max_k}, and its contribution degree, denoted c_{max_k}, is the maximum contribution degree of that extreme point;
Step 6: Screen the ambiguous points;
Step 6.1: Sort all extreme points in the set wExt_Set by fluctuation intensity and by maximum contribution degree;
Step 6.2: Design the interaction and provide the user with an interactive interface; the user sets the fluctuation intensity selection ratio p1% and the maximum contribution degree selection ratio p2%, and the ambiguous points are screened out of wExt_Set;
Step 7: Visual design of the ambiguous points. The ambiguous point marker carries the following information: 1. whether the ambiguous point is a peak or a trough; 2. the theme topic_{max_k} with the highest contribution degree at the ambiguous point;
Step 8: Interaction design;
Step 8.1: Design a pop-up box that shows the details of the theme river;
Step 8.2: Ambiguous point screening: provide the user with an interactive interface for setting the fluctuation intensity selection ratio and the maximum contribution degree selection ratio;
Step 8.3: River reordering: provide a drag interaction; based on the information given by the ambiguous points, the user reorders the ambiguous rivers locally, which reduces the influence of fluctuations in the lower rivers on the upper rivers and achieves personalized graph reading;
Step 8.4: Design the zoom interaction.
With this processing, the optimization of the initial ordering of the theme river reduces, to a certain extent and on the whole, the influence of fluctuations in the lower rivers on the upper rivers. The ambiguous point concept proposed by the invention, together with the visual design of the ambiguous points, effectively marks for the user the points in the theme river that are prone to ambiguity; combined with interaction, the user can set the screening ratios for ambiguous points according to actual needs. Based on the information shown by the ambiguous points, the user can reorder the rivers at an ambiguous point by dragging and, together with interactive functions such as the pop-up details box and zooming, carry out flexible, personalized graph reading.
Compared with the prior art, the positive effects of the present invention are:
First, theme rivers are currently ordered either by the importance of the themes or by the standard deviation of the themes. Ordering by theme importance is seriously unreliable: it does not take the readability of the visualization result into account and does not eliminate the influence of fluctuations in the lower rivers on the upper rivers. Ordering by standard deviation does consider the readability of the visualization result, but measuring the degree of fluctuation of a river by its standard deviation loses reliability when the magnitudes of the data in different rivers differ greatly. The present invention measures river fluctuation with the coefficient of dispersion, which still measures the degree of fluctuation effectively when the magnitudes of the river data differ greatly, so the optimization of the initial ordering remains reliable.
Second, unlike the traditional theme river, the present invention provides an interaction method for reordering the theme river. At present, all theme rivers address the problem of local graph-reading ambiguity only by optimizing the ordering of the rivers. Although this improves the fluctuation of the rivers as a whole, it cannot eliminate local graph-reading ambiguity. On the basis of optimizing the initial ordering, the present invention designs an interactive theme river that marks the ambiguous points for the user; the user can reorder the rivers by dragging as needed, which effectively eliminates local ambiguity and makes the theme river more customizable.
Brief description of the drawings
Fig. 1 is a schematic diagram of local fluctuation intensity.
Fig. 2 shows the visual design of the ambiguous points.
Fig. 3 is a schematic diagram of the ambiguous point screening interaction design.
Fig. 4 shows the effect of river reordering.
Fig. 5 shows the resulting system.
Fig. 6 shows the ambiguous points in the experimental data.
Fig. 7 shows the effect of river reordering on the experimental data.
Fig. 8 is the coefficient of dispersion table of the embodiment of the present invention.
Fig. 9 is the table of fluctuation coefficients of dispersion of each layer of the theme river before and after optimized ordering in the embodiment of the present invention.
Detailed description of the embodiments
The specific implementation steps are as follows:
Step 1: Read in the themes topic_i (i = 1, 2, ..., n) and the values y_{it} (t = 1, 2, ..., m), where topic_i denotes the i-th theme, n is the total number of themes, y_{it} is the value of topic_i at moment t, and m is the number of moments.
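As an illustration of the data read in Step 1, the following minimal sketch groups raw records into one value list per theme. The record layout `(theme, t, value)`, the function name `build_theme_series`, and the choice to fill missing moments with 0.0 are assumptions made for this example; the method itself does not prescribe an input format.

```python
def build_theme_series(records, m):
    """Group (theme, t, value) records into {theme: [y_i1, ..., y_im]}.

    Moments are numbered t = 1..m as in Step 1. Moments with no record
    are left at 0.0, which is an assumption of this sketch.
    """
    series = {}
    for theme, t, value in records:
        if not 1 <= t <= m:
            raise ValueError(f"moment {t} is outside 1..{m}")
        series.setdefault(theme, [0.0] * m)[t - 1] = float(value)
    return series


# Toy records: two themes observed over m = 2 moments.
records = [("fashion", 1, 5), ("fashion", 2, 7), ("film", 2, 3)]
series = build_theme_series(records, m=2)
print(series)
```

Here n is simply `len(series)` and each list has length m, so the later steps can index y_{it} as `series[theme][t - 1]`.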
Step 2: Optimize the ordering of the themes.
Step 2.1: For each theme topic_i, calculate the coefficient of dispersion CV_i (also known as the coefficient of variation) over all of its values y_{it}. CV_i is calculated as follows:
CV_i = \sigma_i / \mu_i,
where \sigma_i is the standard deviation of topic_i and \mu_i is the mean of topic_i.
Step 2.2: Sort the themes by coefficient of dispersion: the smaller CV_i is, the lower topic_i is placed, closer to the time axis.
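The initial ordering of Step 2 can be sketched as follows. This is a minimal illustration rather than a definitive implementation: the text does not say whether σ_i is the population or the sample standard deviation, so the population form (`statistics.pstdev`) is assumed here, and the theme names and values are invented toy data.

```python
from statistics import mean, pstdev


def coefficient_of_dispersion(values):
    """CV = sigma / mu for one theme's time series (population sigma assumed)."""
    mu = mean(values)
    if mu == 0:
        raise ValueError("mean is zero, so CV is undefined")
    return pstdev(values) / mu


def initial_order(themes):
    """Return theme names from bottom to top: smallest CV closest to the time axis."""
    return sorted(themes, key=lambda name: coefficient_of_dispersion(themes[name]))


# Toy data: "bursty" has the largest relative fluctuation, "steady" the smallest.
themes = {
    "bursty": [1, 9, 1, 9],
    "steady": [10, 11, 10, 11],
    "mild": [5, 7, 5, 7],
}
order = initial_order(themes)
print(order)  # ['steady', 'mild', 'bursty']
```

Because CV divides by the mean, a large theme with small relative swings is still placed low, which is the property that plain standard deviation lacks.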
Step 3: Obtain the set of extreme points.
Step 3.1: Calculate the value wy_t of Total at each moment t, where Total denotes the superposition of the heights of all themes in the set at the corresponding moment; in the visualization it can be regarded as the top contour of the theme river. wy_t is calculated as follows:
wy_t = \sum_{i=1}^{n} y_{it}
Step 3.2: Search for extreme points in the set wy_Set of wy_t values: when wy_t > wy_{t+1} and wy_t > wy_{t-1}, or wy_t < wy_{t+1} and wy_t < wy_{t-1}, t is an extreme point; let the set of extreme points be wExt_Set.
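Step 3 can be sketched in a few lines. The function names are hypothetical, and the code uses 0-based indices where the text numbers moments from 1; the first and last moments are never reported because the condition of Step 3.2 needs both neighbours.

```python
def total_profile(series):
    """wy_t: the sum of all themes' values at each moment t (top contour)."""
    return [sum(column) for column in zip(*series)]


def find_extrema(wy):
    """Indices t where wy_t is strictly above, or strictly below, both neighbours."""
    extrema = []
    for t in range(1, len(wy) - 1):
        is_peak = wy[t] > wy[t - 1] and wy[t] > wy[t + 1]
        is_trough = wy[t] < wy[t - 1] and wy[t] < wy[t + 1]
        if is_peak or is_trough:
            extrema.append(t)
    return extrema


# Toy data: two themes over five moments.
series = [
    [1, 3, 2, 2, 5],
    [2, 4, 1, 3, 1],
]
wy = total_profile(series)      # [3, 7, 3, 5, 6]
w_ext_set = find_extrema(wy)    # [1, 2]: a peak at t=1 and a trough at t=2
print(wy, w_ext_set)
```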
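Step 4: Calculate the local fluctuation intensity of each extreme point. The local fluctuation intensity fluctuation of Total is the gradient between two adjacent extreme points, as shown in Fig. 1. fluctuation is calculated as follows:
fluctuation = \frac{wy_{t_2} - wy_{t_1}}{t_2 - t_1}
where t_1 and t_2 are two adjacent extreme points. The fluctuation intensity fluctuation_k at moment wExt_t_k is then the gradient from the adjacent extreme point wExt_t_{k-1} to the point wExt_t_k, as shown by the following formula:
fluctuation_k = \frac{wy_{wExt\_t_k} - wy_{wExt\_t_{k-1}}}{wExt\_t_k - wExt\_t_{k-1}}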
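The local fluctuation intensity of Step 4 can be sketched as a slope between consecutive extreme points. Two points are assumptions of this example rather than statements of the method: the slope is taken from the preceding extreme point to the current one, and the first extreme point, which has no predecessor, is skipped.

```python
def local_fluctuations(wy, extrema):
    """Map each extreme point t_k to the slope of Total from t_{k-1} to t_k.

    The first extreme point has no predecessor and is skipped (assumption).
    """
    result = {}
    for prev_t, t in zip(extrema, extrema[1:]):
        result[t] = (wy[t] - wy[prev_t]) / (t - prev_t)
    return result


# Toy profile with a peak at t=1, a trough at t=2 and a peak at t=4.
wy = [3, 7, 3, 5, 6, 2]
extrema = [1, 2, 4]
fluct = local_fluctuations(wy, extrema)
print(fluct)  # {2: -4.0, 4: 1.5}
```

The sign shows the direction of the change (negative when falling into a trough); when points are ranked by strength, the magnitude `abs(slope)` is the natural quantity to compare.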
Step 5: Calculate the maximum contribution degree of each extreme point.
Step 5.1: Calculate the contribution degree c_{ki} of topic_i at moment wExt_t_k, where c_{ki} is the ratio of the fluctuation intensity fluctuation_{ik} of topic_i at moment wExt_t_k to the fluctuation intensity fluctuation_k of Total at moment wExt_t_k.
The fluctuation intensity fluctuation_{ik} of topic_i at moment wExt_t_k is calculated as follows, where wExt_t_{k-1} is the extreme point adjacent to wExt_t_k:
fluctuation_{ik} = \frac{y_{i,wExt\_t_k} - y_{i,wExt\_t_{k-1}}}{wExt\_t_k - wExt\_t_{k-1}}
The contribution degree c_{ki} of topic_i at moment wExt_t_k is then:
c_{ki} = \frac{fluctuation_{ik}}{fluctuation_k}
Step 5.2: Sort the contribution degrees of all themes at moment wExt_t_k; the theme with the highest contribution degree is denoted topic_{max_k}, and its contribution degree, denoted c_{max_k}, is the maximum contribution degree of that extreme point.
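A minimal sketch of Step 5, assuming that a theme's fluctuation is measured over the same pair of adjacent extreme points as Total's. The function names and toy values are hypothetical. The guard against a zero Total slope covers the case where two adjacent extreme points have the same height, which the text does not discuss.

```python
def slope(values, prev_t, t):
    """Gradient of one series between two moments."""
    return (values[t] - values[prev_t]) / (t - prev_t)


def contribution_degrees(series, prev_t, t):
    """c_ki for every theme i: the theme's slope divided by Total's slope."""
    wy = [sum(column) for column in zip(*series)]
    total = slope(wy, prev_t, t)
    if total == 0:
        raise ValueError("Total does not change between the two extreme points")
    return [slope(y, prev_t, t) / total for y in series]


def max_contribution(series, prev_t, t):
    """Return (index of topic_max_k, c_max_k) for the extreme point at t."""
    c = contribution_degrees(series, prev_t, t)
    i_max = max(range(len(c)), key=lambda i: c[i])
    return i_max, c[i_max]


# Toy data: Total is [3, 8, 2, 7], with a peak at t=1 and a trough at t=2.
series = [
    [1, 2, 1, 2],
    [2, 6, 1, 5],
]
c = contribution_degrees(series, 1, 2)
i_max, c_max = max_contribution(series, 1, 2)
print(c, i_max, c_max)
```

Since Total is the sum of the themes, the contribution degrees at one extreme point add up to 1; here the second theme accounts for five sixths of the drop.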
Step 6: Screen the ambiguous points.
Step 6.1: Sort all extreme points in the set wExt_Set by fluctuation intensity and by maximum contribution degree.
Step 6.2: Design the interaction and provide the user with an interactive interface; the user sets the fluctuation intensity selection ratio p1% and the maximum contribution degree selection ratio p2%, and the ambiguous points are screened out of wExt_Set.
The ambiguous points screened out in this way have the following characteristics:
1. the fluctuation intensity fluctuation_k of the point ranks in the top p1% of the set wExt_Set;
2. the maximum contribution degree c_{max_k} of the point ranks in the top p2% of the set wExt_Set.
Moreover, the smaller p1% is, the stronger the fluctuation of the ambiguous points screened out; the smaller p2% is, the more severe the ambiguity caused by the ambiguous points screened out.
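The screening of Step 6 can be sketched as the intersection of two top-percentage selections. Several details are assumptions of this example: extreme points are represented as dictionaries with the hypothetical keys `t`, `fluctuation` and `c_max`, fluctuation is ranked by magnitude, and the number of points kept is rounded up.

```python
import math


def top_fraction(points, key, percent):
    """Moments of the points ranked in the top `percent`% by `key`, largest first."""
    count = math.ceil(len(points) * percent / 100)
    ranked = sorted(points, key=key, reverse=True)
    return {p["t"] for p in ranked[:count]}


def screen_ambiguous(points, p1, p2):
    """Keep extreme points in the top p1% by fluctuation AND the top p2% by c_max."""
    by_fluct = top_fraction(points, lambda p: abs(p["fluctuation"]), p1)
    by_contrib = top_fraction(points, lambda p: p["c_max"], p2)
    return [p for p in points if p["t"] in by_fluct and p["t"] in by_contrib]


# Toy extreme points (values invented for illustration).
points = [
    {"t": 3, "fluctuation": -4.0, "c_max": 0.90},
    {"t": 7, "fluctuation": 1.5, "c_max": 0.40},
    {"t": 12, "fluctuation": 3.0, "c_max": 0.80},
    {"t": 20, "fluctuation": -0.5, "c_max": 0.95},
]
ambiguous = screen_ambiguous(points, p1=50, p2=50)
print([p["t"] for p in ambiguous])  # [3]
```

Raising either ratio admits more points, and at 100% for both every extreme point is kept, so the two sliders in the interface act as a dynamic filter.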
Step 7: Visual design of the ambiguous points. The ambiguous point marker carries the following information: 1. whether the ambiguous point is a peak or a trough; 2. the theme topic_{max_k} with the highest contribution degree at the ambiguous point.
The present invention marks ambiguous points with triangles, as shown in Fig. 2. An upright triangle denotes a peak, i.e. Total reaches a local maximum at that moment; an inverted triangle denotes a trough, i.e. Total reaches a local minimum at that moment. The fill color of the triangle is the color of topic_{max_k}, the theme with the largest contribution at that moment, which shows the user that the fluctuation of Total at that point is mainly caused by topic_{max_k}.
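The marker rule of Step 7 reduces to two lookups, sketched below. The dictionary layout, the shape names `triangle-up` and `triangle-down`, and the palette are hypothetical; a real renderer would map them to its own drawing primitives.

```python
def ambiguity_marker(wy, t, top_theme, palette):
    """Describe the marker for the ambiguous point at moment t.

    Shape encodes peak or trough of Total; fill is the colour of the
    theme with the highest contribution degree at that point.
    """
    if wy[t] > wy[t - 1] and wy[t] > wy[t + 1]:
        shape = "triangle-up"      # peak: Total reaches a local maximum
    elif wy[t] < wy[t - 1] and wy[t] < wy[t + 1]:
        shape = "triangle-down"    # trough: Total reaches a local minimum
    else:
        raise ValueError(f"moment {t} is not an extreme point")
    return {"t": t, "shape": shape, "fill": palette[top_theme]}


# Toy profile and a hypothetical colour palette.
wy = [3, 7, 3, 5, 6]
palette = {"fashion": "#e4572e", "film": "#4c78a8"}
peak_marker = ambiguity_marker(wy, 1, "fashion", palette)
trough_marker = ambiguity_marker(wy, 2, "film", palette)
print(peak_marker, trough_marker)
```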
Step 8: Interaction design.
Step 8.1: Design a pop-up box that shows the details of the theme river, giving the user the concrete value of each theme at moment t.
Step 8.2: Ambiguous point screening: provide the user with an interactive interface for setting the fluctuation intensity selection ratio and the maximum contribution degree selection ratio, as shown in Fig. 3.
Step 8.3: River reordering: provide a drag interaction. Based on the information given by the ambiguous points, the user reorders the ambiguous rivers locally. The user can drag topic_{max_k} to the topmost layer of the river to reduce its influence on the upper rivers; Fig. 4 shows the result of reordering at the ambiguous point of Fig. 2. The user can also drag any other river, achieving personalized graph reading.
Step 8.4: Design the zoom operation.
In the present invention, the ordering of the rivers is no longer fixed once the initial ordering has been made. Instead, ambiguous points are marked for the user, who can screen them dynamically according to the situation at hand. After obtaining the ambiguous points, the user can reorder the rivers and eliminate local ambiguity.
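On the data side, the drag interaction of Step 8.3 amounts to moving one theme to a new position in the bottom-to-top order and redrawing. The sketch below shows only that list operation; the function name `drag_river` and the theme names other than "fashion" are hypothetical, and event handling and rendering are left out.

```python
def drag_river(order, theme, new_index):
    """Return a new bottom-to-top order with `theme` moved to `new_index`.

    Index 0 is the layer closest to the time axis; the index is clamped
    to the valid range so that an overshooting drag lands on the top layer.
    """
    if theme not in order:
        raise ValueError(f"unknown theme: {theme}")
    remaining = [name for name in order if name != theme]
    new_index = max(0, min(new_index, len(remaining)))
    return remaining[:new_index] + [theme] + remaining[new_index:]


order = ["fashion", "film", "music"]
# Drag the strongly fluctuating bottom river to the outermost (top) layer.
reordered = drag_river(order, "fashion", len(order) - 1)
print(reordered)  # ['film', 'music', 'fashion']
```

Returning a new list instead of mutating the old one keeps the previous order available, which makes it easy to offer an undo for a drag.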
Embodiment 1: To verify the effectiveness of the present invention, 10 hot topics and their daily activity data over the 365 days from January 1, 2015 to December 31, 2015 were selected from the online question-and-answer community Zhihu (www.zhihu.com) as verification data.
The coefficient of dispersion is calculated for each theme; the results are shown in Fig. 8.
The themes are sorted by their coefficients of dispersion and arranged in the theme river from bottom to top in ascending order of coefficient of dispersion. The height of each layer of the theme river is calculated before and after the optimized ordering. The height height_{kt} of the k-th layer of the theme river on date t is calculated as follows, where the themes are indexed from the bottom layer upward:
height_{kt} = \sum_{i=1}^{k} y_{it}
The fluctuation of each layer of the theme river before and after the optimized ordering is evaluated with the coefficient of dispersion; the results are shown in Fig. 9. Fig. 9 shows that the mean coefficient of dispersion of the theme river is reduced by 40.4% after the optimized ordering, which indicates that the optimized ordering effectively reduces the fluctuation of the theme river.
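The evaluation described above can be sketched as follows: stack the themes in a given bottom-to-top order, compute each layer's height, and average the layers' coefficients of dispersion. The two series are invented toy data and do not reproduce the figures in Fig. 9; the population standard deviation is again an assumption.

```python
from statistics import mean, pstdev


def layer_heights(series_bottom_to_top):
    """height_kt: running sum of the themes from the bottom layer up to layer k."""
    heights, running = [], None
    for y in series_bottom_to_top:
        running = list(y) if running is None else [a + b for a, b in zip(running, y)]
        heights.append(running)
    return heights


def mean_layer_cv(series_bottom_to_top):
    """Average coefficient of dispersion over the upper contours of all layers."""
    return mean([pstdev(h) / mean(h) for h in layer_heights(series_bottom_to_top)])


bursty = [1, 9, 1, 9]
steady = [10, 10, 10, 10]
before = mean_layer_cv([bursty, steady])   # fluctuating theme at the bottom
after = mean_layer_cv([steady, bursty])    # ordered by ascending CV
print(round(before, 4), round(after, 4))
```

With the fluctuating theme at the bottom, its swings appear in every layer above it; placing it on top confines them to the outermost contour, which lowers the average.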
The result of optimizing the ordering of the above 10 themes and visualizing them is shown in Fig. 5. The theme river has 226 extreme points in total. The user can select the fluctuation intensity ratio and the contribution degree ratio as needed; the larger the selected ratios, the more ambiguous points are screened out.
As shown in Fig. 6, ambp_i is the ambiguous point dated November 25, 2015. The marker of this point shows that the ambiguous point is a peak and that the theme with the largest contribution degree, topic_{max_i}, is "fashion". Because "fashion" fluctuates strongly at this point, all the themes above it also appear to fluctuate strongly. The bottom river, i.e. "fashion", is selected with the mouse and dragged to the outermost layer of the theme river, which removes its influence on the other rivers. The result of the reordering is shown in Fig. 7.