CN106354838A - Data visualization method based on semantic resonance colors - Google Patents

Publication number
CN106354838A
Authority
CN
China
Prior art keywords
color
colors
palette
semantic
picture
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201610790092.7A
Other languages
Chinese (zh)
Other versions
CN106354838B (en)
Inventor
盛斌
熊浩
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shanghai Jiaotong University
Original Assignee
Shanghai Jiaotong University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shanghai Jiaotong University filed Critical Shanghai Jiaotong University
Priority to CN201610790092.7A priority Critical patent/CN106354838B/en
Publication of CN106354838A publication Critical patent/CN106354838A/en
Application granted granted Critical
Publication of CN106354838B publication Critical patent/CN106354838B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/50 Information retrieval; Database structures therefor; File system structures therefor of still image data
    • G06F16/58 Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • G06F16/583 Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
    • G06F16/5838 Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content using colour

Landscapes

  • Engineering & Computer Science (AREA)
  • Library & Information Science (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Image Analysis (AREA)
  • Processing Or Creating Images (AREA)

Abstract

The invention relates to a data visualization method based on semantically resonant colors. The method comprises the following steps: 1, setting a group of nouns naming the objects to be assigned colors as a word set, and obtaining a picture set related to the word set according to the keywords of the nouns; 2, selecting a suitable color palette that includes all colors of the objects to be assigned colors; 3, generating a color histogram of the pictures in the picture set and, according to the distribution of colors in the color histogram, obtaining the probability with which each palette color occurs in the color histogram; 4, according to the distinguishability of the palette colors and their probability of occurrence in the color histogram, assigning the palette colors to the objects in the word set, and forming and displaying keyword-color pairs. Compared with the prior art, the method has the advantages of retaining the object's own color and matching colors accurately.

Description

A data visualization method based on semantically resonant colors
Technical field
The present invention relates to the field of image processing, and in particular to a data visualization method based on semantically resonant colors.
Background
In data visualization, the choice of coloring method plays a significant role, and a good coloring scheme greatly improves the visual result. For example, data about "ocean" can be shown in blue, or the keyword "love" can be colored pink; different colors also help readers identify and distinguish different categories.
It is easy to observe that a color consistent with the meaning of a word speeds up cognition, whereas an inconsistent color hinders it. This phenomenon is known as the Stroop effect. In fact, the binding is not limited to the names of colors: people have long been accustomed to associating colors with all kinds of concepts. Colors that are naturally congruent with the semantics of a concept are called "semantically resonant colors", and using them effectively can improve the efficiency of visual cognition.
However, the existing presentation of text is overly monotonous: black type throughout does little to aid the semantic recognition of words and does nothing to raise the reader's interest. Existing methods for representing text with semantically resonant colors generally use a large number of related pictures as a training set, and the background-removal step performed before the color histogram is computed often discards black and white backgrounds. Because the intrinsic color of the objects named by some words is itself black or white, such methods also discard the object's own color, which makes the statistics inaccurate.
Summary of the invention
The purpose of the present invention is to overcome the above defects of the prior art by providing a data visualization method based on semantically resonant colors that retains the object's own color and matches colors accurately.
The purpose of the present invention can be achieved through the following technical solution:
A data visualization method based on semantically resonant colors, comprising the following steps:
1) setting a group of nouns naming the objects to be assigned colors as a word set, and obtaining a related picture set according to the keyword of each noun;
2) choosing a suitable palette that includes all colors of the objects to be assigned colors;
3) generating a color histogram for each picture in the picture set and, according to the distribution of colors in the color histogram, obtaining the probability with which each palette color occurs in the color histogram;
4) according to the distinguishability between the palette colors and their probability of occurrence in the color histogram, assigning the palette colors to the objects to be assigned colors in the word set, and forming and displaying keyword-color pairs.
In step 2), the palette is a 20-color palette.
Step 3) specifically comprises the following steps:
31) removing the background from the pictures in the picture set by means of pixel filtering;
32) counting, by number of pixels, how often each color occurs in a picture;
33) taking the color classes whose counts rank in the top three as majority classes and the remaining color classes as minority classes, and giving each picture a color label accordingly;
34) discarding the pictures belonging to minority classes and, based on the pictures belonging to majority classes, adding black and white to the color classes used for the histogram probability statistics and to the palette, and generating the corresponding color histogram.
The method further comprises the following step:
5) comparing the word-color semantics produced by this method with statistics of manual color assignment to obtain the color matching accuracy.
In step 32), the pixel count of each color in the picture is tallied in Lab space.
Compared with the prior art, the present invention has the following advantages:
First, it retains the object's own color. Existing methods remove the background using pixel statistics, which discards a large amount of useful information. The present invention changes background filtering to pixel filtering. The original palette contains no white or black, because these two colors are seldom used in data visualization displays; filtering is therefore performed per pixel while the pixel color histogram is being computed.
Second, it matches colors accurately. Using a 20-color palette together with the probability statistics of the color histogram makes the color matching of the method accurate.
Brief description of the drawings
Fig. 1 is a pixel statistics chart in Lab space.
Detailed description
The present invention is described in detail below with reference to the accompanying drawing and a specific embodiment.
Embodiment:
1. Obtaining data
A Google image search is run for each word in a group of a particular category, such as vegetables ("tomato", "potato", "pepper", etc.). The pictures found are collected, and the colors of all pixels in all pictures are counted to produce a color histogram.
For a keyword such as "apple", besides searching directly for "apple", the method can also append an additional word, for example "apple + material", to obtain richer image data. Each keyword thus yields two color histograms.
2. Candidate palette
Simply taking an average color and assigning it to a keyword is not adequate. In a visualization task, a color assignment is meaningful only if it balances distinguishability and semantics across all categories. To obtain a meaningful assignment, the algorithm uses the 20-color palette of the well-known visualization company Tableau. In each visualization task, categories are assigned colors from the 20 colors of this palette according to their semantics.
3. Color assignment
After all color histograms are obtained, kernel density estimation is applied to the histograms to model the color distribution, from which the probability of each color in the candidate palette is computed. Taking into account the distinguishability between colors, an "affinity score" is then obtained for every "color-word" pair. These affinity scores characterize the strength of the connection between a concept and a color. Based on all affinity scores, the Hungarian method assigns the candidate colors one by one to the words in the word set so that the total affinity score is maximized, that is, so that the color semantics are strongest.
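The assignment step can be illustrated with a minimal Python sketch. It assumes the affinity scores are already available as a small matrix and, to stay dependency-free, finds the best one-to-one assignment by exhaustive search rather than by the Hungarian method named above. Both reach the same optimum, but exhaustive search is practical only for a handful of words; a real implementation would more likely call a Hungarian-style solver such as `scipy.optimize.linear_sum_assignment`. The words, color names, score values, and the function name `assign_colors` are made-up illustrations, not data from the patent.

```python
from itertools import permutations

def assign_colors(words, colors, affinity):
    """Return the word -> color mapping with the highest total affinity.

    affinity[i][j] is the (hypothetical) affinity score of words[i] with
    colors[j]. Every word receives a distinct color, so this requires
    len(colors) >= len(words).
    """
    best_total, best_choice = float("-inf"), None
    # Try every ordered choice of len(words) distinct colors.
    for choice in permutations(range(len(colors)), len(words)):
        total = sum(affinity[i][j] for i, j in enumerate(choice))
        if total > best_total:
            best_total, best_choice = total, choice
    mapping = {w: colors[j] for w, j in zip(words, best_choice)}
    return mapping, best_total

words = ["tomato", "potato", "pepper"]
colors = ["red", "brown", "green", "blue"]
affinity = [
    [0.9, 0.2, 0.3, 0.0],  # tomato
    [0.1, 0.8, 0.1, 0.0],  # potato
    [0.7, 0.1, 0.6, 0.0],  # pepper
]
mapping, total = assign_colors(words, colors, affinity)
print(mapping, round(total, 2))
```

In this toy matrix both "tomato" and "pepper" score highest on red, so choosing colors word by word would collide. Maximizing the total instead gives red to "tomato" and green to "pepper".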
The core formulas of the algorithm are as follows:
$$p(c \mid v, T) \propto \sum_{b \in T} T(b)\, \exp\!\left(-\frac{\mathrm{dist}(b, c)}{\sigma}\right)$$
$$H(c \mid v, T) = -\sum_{c \in C} p(c \mid v, T)\, \ln p(c \mid v, T)$$
$$p(c \mid v, T) \propto \max(\mathrm{sat}(c), t) \cdot \left[ w_c\, \frac{p(c \mid v, T_c)}{H(c \mid v, T)} + (1 - w_c)\, \frac{p(c \mid v, T_c)}{H(c \mid v, T)} \right]$$
$$\mathrm{affinity}(c, v) = \frac{p(c \mid v)}{H(c \mid v)}$$
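One possible reading of the first, second, and last formulas is sketched below in Python. The sketch rests on several assumptions that the text does not confirm: `dist` is taken as Euclidean distance in Lab space, the kernel is taken to decay with distance, the affinity is read as probability divided by entropy, and the kernel width `sigma=20.0` is arbitrary. The Lab coordinates and pixel counts are rough illustrative values, and the function names are hypothetical.

```python
import math

def palette_probabilities(histogram, palette, sigma=20.0):
    """Kernel-smoothed probability of each palette color.

    histogram: dict mapping a Lab bin center (L, a, b) to its pixel count.
    palette:   dict mapping a color name to its Lab coordinates.
    Assumes p(c) is proportional to sum_b T(b) * exp(-dist(b, c) / sigma).
    """
    weights = {
        name: sum(count * math.exp(-math.dist(b, c) / sigma)
                  for b, count in histogram.items())
        for name, c in palette.items()
    }
    total = sum(weights.values())
    return {name: w / total for name, w in weights.items()}

def entropy(probs):
    """Shannon entropy (natural logarithm) of a probability table."""
    return -sum(p * math.log(p) for p in probs.values() if p > 0)

def affinity_scores(probs):
    """Affinity of each color, read here as probability divided by entropy.

    Needs at least two palette colors; otherwise the entropy is zero.
    """
    h = entropy(probs)
    return {name: p / h for name, p in probs.items()}

# Illustrative data: a picture set dominated by reddish pixels.
palette = {
    "red": (53.0, 80.0, 67.0),
    "green": (88.0, -86.0, 83.0),
    "blue": (32.0, 79.0, -108.0),
}
histogram = {(50.0, 75.0, 60.0): 900, (85.0, -80.0, 80.0): 100}

probs = palette_probabilities(histogram, palette)
scores = affinity_scores(probs)
print(max(scores, key=scores.get))
```

Dividing by the entropy rewards words whose color distribution is sharply peaked, which is consistent with the later observation that words with a relatively fixed color benefit most.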
Finally, crowdsourcing on Amazon Mechanical Turk is used: participants are asked to match the same word set with the same color set, and the parameters of the automatic algorithm are tuned accordingly so that the automatically assigned result is as close as possible to the manually selected scheme. The A and T columns show the color assignments produced by the algorithm and by people respectively, and the two are similar in many places.
4. Algorithm results and experiment
In the experimental stage, the effect of three different color assignments on the efficiency of reading histograms was compared: expert selection, algorithm generation, and random assignment. The expert scheme was provided by designers of the commercial visualization software Tableau and served as the upper bound and yardstick for the other schemes. In the experiment, subjects viewed the three kinds of histograms and answered questions based on the information in the figure; their response time was recorded and used as the measure of each histogram's effectiveness. The results show that for highly colorable words with a relatively fixed color, such as "sky", "strawberry", and "potato", a semantically resonant color assignment improves response time by about 10%, with the expert scheme slightly better than the scheme produced by the algorithm. For abstract words lacking a fixed color, such as "Coca-Cola", "Google", and "healthy", semantic resonance provides only a small improvement.
Three. Case analysis
Because the color assignment depends heavily on the image data obtained, the quality of the result is largely determined by the earlier data preprocessing; the later statistical method cannot provide much denoising or correction.
Consider first a set of test results, shown in the figure.
The color assignments of the first three keywords are unsatisfactory, and the following problems were found:
1. A large number of useful pictures are discarded during background removal in the original algorithm. (The original algorithm performs pixel statistics on the image border, counting the pixels that fall within a defined threshold of black or white; if they exceed 75% of the whole border, the picture is considered too hard to extract key color information from and is deleted.) As a result, much useful information is ignored, the overall signal is weakened, and noisy data carries greater weight. For "milk", for example, pictures with a blue background are retained and become a major source of interference, while useful pictures with a white background are ignored.
2. The most prominent feature of the raw data for "lily" and "peony" is that their colors are numerous and varied, so key feature data is hard to extract. The direct consequence in the statistics is that the affinity between a single keyword and the candidate colors is evenly spread, and the final result is scattered rather than concentrated.
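The border-based rejection rule described in item 1 can be sketched as follows, which also shows why a white object photographed on a white background is thrown away. The picture representation (rows of RGB tuples), the per-channel tolerance of 30, and all function names are assumptions made for illustration; only the 75% border ratio comes from the text.

```python
def near_black_or_white(pixel, tol=30):
    """True if an (r, g, b) pixel is within tol of pure black or pure white."""
    return all(v <= tol for v in pixel) or all(v >= 255 - tol for v in pixel)

def border_pixels(image):
    """Yield the outermost ring of pixels of an image given as rows of RGB tuples."""
    height, width = len(image), len(image[0])
    for y in range(height):
        for x in range(width):
            if y in (0, height - 1) or x in (0, width - 1):
                yield image[y][x]

def rejected_by_border_rule(image, ratio=0.75):
    """Original rule: drop the picture if black/white exceeds `ratio` of its border."""
    border = list(border_pixels(image))
    hits = sum(near_black_or_white(p) for p in border)
    return hits / len(border) > ratio

WHITE, BLUE = (255, 255, 255), (30, 60, 200)
# Toy 4x4 pictures of "milk": white content on a white or a blue background.
milk_on_white = [[WHITE] * 4 for _ in range(4)]
milk_on_blue = [
    [BLUE] * 4,
    [BLUE, WHITE, WHITE, BLUE],
    [BLUE, WHITE, WHITE, BLUE],
    [BLUE] * 4,
]
print(rejected_by_border_rule(milk_on_white), rejected_by_border_rule(milk_on_blue))
```

The white-background picture is rejected and the blue-background one is kept, which reproduces the "milk" problem described above.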
Four. Improvements
First, in the data acquisition stage, this method uses the latest Baidu image search API, which is more stable within China, and fixes support for the Chinese character set.
Second, the background removal algorithm is modified. Since a large amount of useful information was previously discarded, background filtering is changed to pixel filtering. The palette in use originally contains no white or black, because these two colors are rarely used in data visualization displays. Filtering is therefore performed per pixel while the pixel color histogram is being computed.
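Per-pixel filtering during histogram construction might look like the sketch below. It assumes pictures are rows of RGB tuples, quantizes each channel into coarse bins, and skips individual pixels close to pure black or white instead of discarding whole pictures. The tolerance, the bin size, and all names are illustrative assumptions rather than values from the patent.

```python
from collections import Counter

def is_black_or_white(pixel, tol=30):
    """True if an (r, g, b) pixel is within tol of pure black or pure white."""
    return all(v <= tol for v in pixel) or all(v >= 255 - tol for v in pixel)

def filtered_histogram(image, bin_size=64):
    """Count quantized colors, skipping near-black and near-white pixels."""
    hist = Counter()
    for row in image:
        for pixel in row:
            if is_black_or_white(pixel):
                continue  # drop this single pixel, keep the picture
            hist[tuple(v // bin_size for v in pixel)] += 1
    return hist

WHITE, RED = (255, 255, 255), (220, 30, 40)
tomato_on_white = [
    [WHITE, WHITE, WHITE],
    [WHITE, RED, RED],
    [WHITE, RED, WHITE],
]
hist = filtered_histogram(tomato_on_white)
print(hist)
```

The picture survives even though its border is entirely white, and only the three red pixels contribute to the histogram.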
Third, as shown in Fig. 1, an aggregation step is performed before the statistics. Lab space is divided into 6 classes according to the 6 directions given by the positive and negative L, a, and b axes. All pixels of a single picture are sorted into these classes by color, and the pixel count of each class is tallied. The class with the most pixels becomes the label of the picture.
After all pictures carry a color class label, a second tally is made: the three classes with the highest totals are taken as majority classes, the remaining three classes as minority classes, and the pictures in the minority classes are discarded.
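The labeling and majority-class steps can be sketched in Python under some assumptions that the text leaves open. Pixels are assumed to be already converted to Lab. A pixel is assigned to the axis direction along which it deviates most, with L measured from a midpoint of 50 so that all three axes have a sign. Classes are ranked by the number of pictures that carry each label. All names and sample values are hypothetical.

```python
from collections import Counter

def lab_class(pixel):
    """Assign a Lab pixel to one of six classes: L+, L-, a+, a-, b+, b-.

    Assumption: the class is the axis with the largest absolute offset,
    where L is centered at 50.
    """
    l, a, b = pixel
    offsets = {"L": l - 50.0, "a": a, "b": b}
    axis = max(offsets, key=lambda k: abs(offsets[k]))
    return axis + ("+" if offsets[axis] >= 0 else "-")

def picture_label(lab_pixels):
    """Label a picture with the class that holds the most pixels."""
    return Counter(lab_class(p) for p in lab_pixels).most_common(1)[0][0]

def keep_majority_pictures(pictures, top=3):
    """Keep pictures whose label is among the `top` most frequent labels."""
    labels = {name: picture_label(px) for name, px in pictures.items()}
    majority = {lab for lab, _ in Counter(labels.values()).most_common(top)}
    return {name: lab for name, lab in labels.items() if lab in majority}

pictures = {
    "img1": [(50, 70, 10)] * 3,                        # a+
    "img2": [(55, 60, 20)] * 3,                        # a+
    "img3": [(50, 65, 5), (50, 65, 5), (95, 0, 0)],    # a+ by majority of pixels
    "img4": [(60, 5, 60)] * 3,                         # b+
    "img5": [(60, 0, 70)] * 3,                         # b+
    "img6": [(95, 2, 3)] * 3,                          # L+
    "img7": [(90, 0, 5)] * 3,                          # L+
    "img8": [(40, 10, -80)] * 3,                       # b- (minority class)
}
kept = keep_majority_pictures(pictures)
print(sorted(kept))
```

Here "img8" is the only picture labeled b-, so it falls outside the top three classes and is discarded, while the other seven pictures are kept.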
Finally, the presentation is improved. Under present conditions, the result must be presented expressively enough to be evaluated; a handful of color codes cannot serve as the final output. The d3 library can be used to implement front-end data loading and display.

Claims (5)

1. A data visualization method based on semantically resonant colors, characterized by comprising the following steps:
1) setting a group of nouns naming the objects to be assigned colors as a word set, and obtaining a related picture set according to the keyword of each noun;
2) choosing a suitable palette that includes all colors of the objects to be assigned colors;
3) generating a color histogram for each picture in the picture set and, according to the distribution of colors in the color histogram, obtaining the probability with which each palette color occurs in the color histogram;
4) according to the distinguishability between the palette colors and their probability of occurrence in the color histogram, assigning the palette colors to the objects to be assigned colors in the word set, and forming and displaying keyword-color pairs.
2. The data visualization method based on semantically resonant colors according to claim 1, characterized in that, in step 2), the palette is a 20-color palette.
3. The data visualization method based on semantically resonant colors according to claim 1, characterized in that step 3) specifically comprises the following steps:
31) removing the background from the pictures in the picture set by means of pixel filtering;
32) counting, by number of pixels, how often each color occurs in a picture;
33) taking the color classes whose counts rank in the top three as majority classes and the remaining color classes as minority classes, and giving each picture a color label accordingly;
34) discarding the pictures belonging to minority classes and, based on the pictures belonging to majority classes, adding black and white to the color classes used for the histogram probability statistics and to the palette, and generating the corresponding color histogram.
4. The data visualization method based on semantically resonant colors according to claim 1, characterized in that the method further comprises the following step:
5) comparing the word-color semantics produced by this method with statistics of manual color assignment to obtain the color matching accuracy.
5. The data visualization method based on semantically resonant colors according to claim 1, characterized in that, in step 32), the pixel count of each color in the picture is tallied in Lab space.
CN201610790092.7A 2016-08-31 2016-08-31 Data visualization method based on semantic resonance color Active CN106354838B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201610790092.7A CN106354838B (en) 2016-08-31 2016-08-31 Data visualization method based on semantic resonance color

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201610790092.7A CN106354838B (en) 2016-08-31 2016-08-31 Data visualization method based on semantic resonance color

Publications (2)

Publication Number Publication Date
CN106354838A true CN106354838A (en) 2017-01-25
CN106354838B CN106354838B (en) 2019-12-10

Family

ID=57856513

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201610790092.7A Active CN106354838B (en) 2016-08-31 2016-08-31 Data visualization method based on semantic resonance color

Country Status (1)

Country Link
CN (1) CN106354838B (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110472083A (en) * 2018-05-08 2019-11-19 优酷网络技术(北京)有限公司 Colour gamut recommended method and device
CN115082703A (en) * 2022-07-19 2022-09-20 深圳大学 Concept-associated color extraction method, device, computer device and storage medium
CN115268665A (en) * 2022-08-02 2022-11-01 四川大学 Input method and device based on coloring and electronic equipment

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1975762A (en) * 2006-12-11 2007-06-06 浙江大学 Skin detecting method
CN105761115A (en) * 2003-11-06 2016-07-13 贝洱工艺有限公司 Data structure for method and system for coordinating colors

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105761115A (en) * 2003-11-06 2016-07-13 贝洱工艺有限公司 Data structure for method and system for coordinating colors
CN1975762A (en) * 2006-12-11 2007-06-06 浙江大学 Skin detecting method

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
Sharon Lin et al.: "Selecting Semantically-Resonant Colors", Eurographics Conference on Visualization (EuroVis) 2013 *

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110472083A (en) * 2018-05-08 2019-11-19 优酷网络技术(北京)有限公司 Colour gamut recommended method and device
CN115082703A (en) * 2022-07-19 2022-09-20 深圳大学 Concept-associated color extraction method, device, computer device and storage medium
CN115082703B (en) * 2022-07-19 2022-11-11 深圳大学 Concept-associated color extraction method, device, computer equipment and storage medium
CN115268665A (en) * 2022-08-02 2022-11-01 四川大学 Input method and device based on coloring and electronic equipment

Also Published As

Publication number Publication date
CN106354838B (en) 2019-12-10

Similar Documents

Publication Publication Date Title
CN101777060B (en) Webpage classification method and system based on webpage visual characteristics
CN102799635B (en) The image collection sort method that a kind of user drives
CN106354872A (en) Text clustering method and system
CN103577475B (en) A kind of picture mechanized classification method, image processing method and its device
CN109614508B (en) Garment image searching method based on deep learning
CN102314614B (en) Image semantics classification method based on class-shared multiple kernel learning (MKL)
CN101739428B (en) Method for establishing index for multimedia
CN106126585B (en) The unmanned plane image search method combined based on quality grading with perceived hash characteristics
CN101706780A (en) Image semantic retrieving method based on visual attention model
Sun et al. Specific comic character detection using local feature matching
CN106202285A (en) Search Results methods of exhibiting and device
CN114092700B (en) Ancient character recognition method based on target detection and knowledge graph
CN101901346A (en) Method for identifying unsuitable content in colour digital image
CN109213853A (en) A kind of Chinese community's question and answer cross-module state search method based on CCA algorithm
CN106354838A (en) Data visualization method based on semantic resonance colors
CN107861970A (en) A kind of commodity picture searching method and device
CN110377659A (en) A kind of intelligence chart recommender system and method
CN109213886A (en) Image search method and system based on image segmentation and Fuzzy Pattern Recognition
CN101594314A (en) A kind of spam image-recognizing method and device based on high-order autocorrelation characteristic
CN109741351A (en) A kind of classification responsive type edge detection method based on deep learning
Bairwa et al. Classification of Fruits Based on Shape, Color and Texture using Image Processing Techniques
CN111191716A (en) Method and device for classifying printed pictures
CN110363092A (en) Histogram recognition methods, device, equipment and computer readable storage medium
CN106066887A (en) A kind of sequence of advertisements image quick-searching and the method for analysis
Parikh et al. Interactive discovery of task-specific nameable attributes

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant