CN102436497B - Mainstream media report hot-spot analyzing system based on studying type web ontology language (OWL) modeling - Google Patents
Mainstream media report hot-spot analyzing system based on studying type web ontology language (OWL) modeling
- Publication number
- CN102436497B
- Authority
- CN
- China
- Prior art keywords
- owl
- model
- ontology
- hot
- module
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Abstract
The invention discloses a mainstream media report hot-spot analysis system based on learning-type Web Ontology Language (OWL) modeling. The system comprises the following functional modules: 1) an OWL ontology instance conversion module; 2) an inverted index module; 3) an ontology element frequency statistics module, which counts the frequency of the ontology elements in the OWL ontology inverted index table and sorts them by frequency; 4) an OWL seed model generation module; 5) a knowledge ontology model aggregation module; 6) an OWL hot-spot model statistics module; 7) a hot news frequency statistics module. By using OWL ontology model analysis, the invention raises internet news hot-spot analysis to the concept level: related hot-spot concepts are aggregated through the relationships between concepts to form hot news, which overcomes the incompleteness of hot-spot analysis in traditional methods.
Description
Technical field
The present invention relates to a mainstream media social hot-spot analysis system and belongs to the field of network computer technology.
Background art
Traditional internet news hot-spot analysis estimates how hot a topic is from how often a key phrase or article is quoted (reposted) by other websites and how often users search for it. More recently, the "heat" of a short post on a microblog can likewise be gauged from how often it is forwarded, followed, and commented on. Such estimates work well for news reports carried by those key phrases, short posts, and articles themselves, for example the Guo Meimei incident, or the two-year-old child who was hit by a car while many passers-by failed to help. However, this approach cannot extend concepts or analyze the associations between them, for example: Guo Meimei and the Red Cross Society of China; who benefits from Gaddafi's death; the European debt crisis and the world financial crisis; the impact of the U.S. economy on China; and so on. An article may discuss the same matter without containing a particular keyword, and traditional hot-spot analysis will miss such articles.
Summary of the invention
The technical problem to be solved by the invention is to provide a mainstream media social hot-spot analysis system that aggregates related hot-spot concepts into hot news, so that the hot-spot articles retrieved are more comprehensive.
To solve this technical problem, the invention provides a mainstream media social hot-spot analysis system based on learning-type OWL modeling, characterized in that it comprises the following functional modules:
1) OWL ontology instance conversion module: converts the massive news text obtained by a search engine into OWL ontology instances;
2) Inverted index module: builds an ontology element inverted index over the OWL ontology instance library, where an ontology element is the smallest indivisible ontology in the OWL ontology instance library;
3) Ontology element frequency statistics module: counts the frequency of the ontology elements in the OWL inverted index table and sorts them by frequency;
4) OWL seed model generation module: takes the specified number of top-ranked ontology elements as hot-spot seed model candidates and stores them, as knowledge models, in the internet news OWL model library; a seed model is the most primitive, and usually also the smallest, knowledge model;
5) Knowledge ontology model aggregation module: takes each seed model out of the internet news OWL model library in turn, performs concept aggregation between it and the other ontology elements in the inverted index table, and stores the modified hot news knowledge model back into the internet news OWL model library, repeating until every seed model has been aggregated;
6) OWL hot-spot model statistics module: traverses the internet news OWL model library, counts and ranks the knowledge models by their number of related concepts, and takes the specified number of top-ranked knowledge models as hot news knowledge models;
7) Hot news frequency statistics module: finds the ontology instances corresponding to the hot news knowledge models through the inverted index table, then finds the original articles corresponding to those ontology instances through the original document management system, counts how often the original articles are propagated, followed, and commented on, and outputs the resulting hot news analysis.
Beneficial effects of the invention:
By adopting OWL ontology model analysis, the invention raises internet news hot-spot analysis to the concept level. It uses the relationships between concepts to aggregate related hot-spot concepts into hot news, which makes up for the incompleteness of traditional hot-spot analysis methods.
Brief description of the drawings
Fig. 1 is a structural diagram of the mainstream media report hot-spot analysis system based on learning-type OWL modeling according to the invention.
Embodiment
Like traditional news hot-spot analysis, the hot news analysis system based on the OWL (Web Ontology Language) knowledge model first obtains massive news information through a search engine, and it likewise uses frequency statistics on propagation, attention, and comments as the main basis for estimating heat. The difference is that traditional methods take key phrases, short posts, or articles as the objects of statistics, whereas this patent uses combinations of related OWL concepts, that is, OWL knowledge models.
To obtain accurate OWL knowledge models, the massive news text obtained by the search engine is first converted into OWL ontology instances (see the patent "OWL-based internet text analysis and OWL converter"). An ontology element inverted index table is then built over the OWL ontology instance library that represents the massive news information (see the patent "An inverted index method based on OWL"). The knowledge models of hot news are implicit in this ontology element inverted index table.
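The inverted index step can be illustrated with a minimal Python sketch. It assumes each ontology instance has already been reduced to a set of ontology element names; the instance IDs, the sample data, and the function name `build_inverted_index` are hypothetical and are not taken from the referenced patents.

```python
from collections import defaultdict

# Hypothetical toy data: each ontology instance (one per news article) is
# reduced to the set of ontology elements (atomic concepts) it contains.
ontology_instances = {
    "inst-1": {"Guo Meimei", "Red Cross Society of China"},
    "inst-2": {"Red Cross Society of China", "donation"},
    "inst-3": {"European debt", "financial crisis"},
}


def build_inverted_index(instances):
    """Map each ontology element to the set of instance IDs that contain it."""
    index = defaultdict(set)
    for instance_id, elements in instances.items():
        for element in elements:
            index[element].add(instance_id)
    return dict(index)


inverted_index = build_inverted_index(ontology_instances)
```

Storing instance IDs rather than article text keeps the index small; the later steps only need to know which instances share an element.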
The key of this patent is to perform ontology element frequency statistics on the ontology element inverted index table and to set up a group of OWL hot-spot seed knowledge models according to the result. The underlying assumption is that the more frequently an ontology element (that is, a key concept) occurs in the massive news corpus, the more likely it is to be a carrier of hot news.
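A minimal sketch of the frequency statistics and seed selection follows. It assumes that the frequency of an ontology element is the number of ontology instances listed for it in the inverted index, and that a seed model is a knowledge model holding a single concept; the dictionary layout, the alphabetical tie-break, and the names `rank_elements` and `make_seed_models` are illustrative assumptions.

```python
# Hypothetical inverted index: ontology element -> IDs of instances containing it.
inverted_index = {
    "Red Cross Society of China": {"inst-1", "inst-2", "inst-4"},
    "Guo Meimei": {"inst-1", "inst-4"},
    "donation": {"inst-2"},
    "European debt": {"inst-3", "inst-5"},
}


def rank_elements(index):
    """Return (element, frequency) pairs sorted by descending frequency.

    Ties are broken alphabetically so the ranking is deterministic.
    """
    counts = {element: len(ids) for element, ids in index.items()}
    return sorted(counts.items(), key=lambda pair: (-pair[1], pair[0]))


def make_seed_models(index, top_n):
    """Turn the top_n most frequent elements into single-concept seed models."""
    ranked = rank_elements(index)[:top_n]
    return [{"seed": element, "concepts": {element}} for element, _ in ranked]


seed_models = make_seed_models(inverted_index, top_n=2)
```

The `top_n` parameter plays the role of the "specified number" of top-ranked elements; the source does not say how that number is chosen.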
This group of seed knowledge models is then used to perform concept aggregation over the whole ontology element inverted index table: related ontology elements are aggregated into one knowledge model, so each seed model "expands". Finally, the OWL knowledge models with the highest frequency of occurrence and the most related concepts are identified as hot news models.
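The source does not define how two ontology elements are judged to be related. The sketch below therefore makes a loud assumption: two elements are treated as related when they occur together in at least one ontology instance. Under that assumption it expands each seed once and ranks the expanded models by their number of concepts. The names `aggregate` and `rank_hot_models` are hypothetical.

```python
# Hypothetical inverted index: ontology element -> IDs of instances containing it.
inverted_index = {
    "Red Cross Society of China": {"inst-1", "inst-2"},
    "Guo Meimei": {"inst-1"},
    "donation": {"inst-2"},
    "European debt": {"inst-3"},
    "financial crisis": {"inst-3"},
}


def aggregate(seed, index):
    """Expand a seed with every element that shares an instance with it.

    Assumption: co-occurrence in one ontology instance stands in for the
    concept relationship, which the source leaves unspecified.
    """
    seed_instances = index[seed]
    concepts = {element for element, ids in index.items() if ids & seed_instances}
    return {"seed": seed, "concepts": concepts}


def rank_hot_models(seeds, index, top_n):
    """Aggregate every seed, then keep the top_n models with the most concepts."""
    models = [aggregate(seed, index) for seed in seeds]
    models.sort(key=lambda m: (-len(m["concepts"]), m["seed"]))
    return models[:top_n]


hot_models = rank_hot_models(
    ["Red Cross Society of China", "European debt"], inverted_index, top_n=1
)
```

With this toy data the seed "Red Cross Society of China" absorbs "Guo Meimei" and "donation", so an article about donations that never mentions Guo Meimei still lands in the same hot news model.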
Through the inverted index table, the ontology instances corresponding to each ontology element in a hot news model, together with the original documents bound to them, are located. The traditional frequency statistics for propagation, attention, and comments are then applied to the retrieved original articles, and the result serves as the final result of the hot news analysis. As shown in Fig. 1, the functional modules and the main flow are as follows:
1) The massive news text obtained by the search engine is converted into OWL ontology instances; see the patent "OWL-based internet text analysis and OWL converter", application number 2011102707850, filed on September 14, 2011;
2) An ontology element inverted index is built over the OWL ontology instance library;
3) The ontology element frequency statistics module counts the frequency of the ontology elements in the OWL inverted index table and sorts them by frequency;
4) The OWL seed model generation module takes the specified number of top-ranked ontology elements as hot-spot seed model candidates and stores them in the internet news OWL model library in the form of knowledge models;
5) The knowledge ontology model aggregation module takes each seed model out of the internet news OWL model library in turn, performs concept aggregation between it and the other ontology elements in the inverted index table, and stores the modified hot news knowledge model back into the internet news OWL model library, repeating until every seed model has been aggregated;
6) The OWL hot-spot model statistics module traverses the internet news OWL model library, counts and ranks the knowledge models by their number of related concepts, and takes the specified number of top-ranked knowledge models as hot news knowledge models;
7) The hot news frequency statistics module finds the ontology instances corresponding to the hot news knowledge models through the inverted index table, then finds the original articles corresponding to those ontology instances through the original document management system, and calls traditional statistical methods to count how often the articles containing the concepts in these hot news knowledge models are propagated, followed, and commented on. Finally, the hot news analysis result is output.
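Step 7 can be sketched as a lookup followed by a sum. The sketch assumes the original document management system can be represented as a mapping from instance ID to article ID, and that the "traditional statistical method" amounts to summing per-article repost, follow, and comment counts; the mappings, the sample numbers, and the function name `hot_news_report` are all hypothetical.

```python
# Hypothetical inverted index: ontology element -> IDs of instances containing it.
inverted_index = {
    "Red Cross Society of China": {"inst-1", "inst-2"},
    "Guo Meimei": {"inst-1"},
    "donation": {"inst-2"},
    "European debt": {"inst-3"},
}

# Stand-in for the original document management system.
instance_to_article = {
    "inst-1": "article-A",
    "inst-2": "article-B",
    "inst-3": "article-C",
}

# Made-up engagement counts, for illustration only.
article_stats = {
    "article-A": {"reposts": 120, "follows": 30, "comments": 45},
    "article-B": {"reposts": 80, "follows": 10, "comments": 5},
    "article-C": {"reposts": 7, "follows": 2, "comments": 1},
}


def hot_news_report(model_concepts, index, instance_to_article, stats):
    """Collect the articles behind a hot news model and sum their engagement."""
    instance_ids = set()
    for concept in model_concepts:
        instance_ids |= index.get(concept, set())
    articles = sorted({instance_to_article[i] for i in instance_ids})
    totals = {"reposts": 0, "follows": 0, "comments": 0}
    for article in articles:
        for key in totals:
            totals[key] += stats[article][key]
    return {"articles": articles, "totals": totals}


report = hot_news_report(
    {"Red Cross Society of China", "Guo Meimei", "donation"},
    inverted_index,
    instance_to_article,
    article_stats,
)
```

Collecting instance IDs into a set before the lookup ensures that an article reached through several concepts is counted only once.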
The invention has been disclosed above by way of a preferred embodiment, which is not intended to limit the invention. All technical solutions obtained by equivalent substitution or equivalent transformation fall within the scope of protection of the invention.
Claims (1)
1. A mainstream media social hot-spot analysis system based on learning-type OWL modeling, characterized in that it comprises the following functional modules:
1) OWL ontology instance conversion module: converts the massive news text obtained by a search engine into OWL ontology instances, wherein OWL is the Web Ontology Language;
2) Inverted index module: builds an ontology element inverted index over the OWL ontology instance library, wherein an ontology element is the smallest indivisible ontology in the OWL ontology instance library;
3) Ontology element frequency statistics module: counts the frequency of the ontology elements in the OWL inverted index table and sorts them by frequency;
4) OWL hot-spot seed model generation module: takes the specified number of top-ranked ontology elements as hot-spot seed model candidates and stores the hot-spot seed models, as knowledge models, in the internet news OWL model library;
5) Knowledge ontology model aggregation module: takes each hot-spot seed model out of the internet news OWL model library in turn, performs concept aggregation between it and the other ontology elements in the inverted index table, and stores the modified hot news knowledge model back into the internet news OWL model library, repeating until every hot-spot seed model has been aggregated;
6) OWL hot-spot model statistics module: traverses the internet news OWL model library, counts and ranks the knowledge models by their number of related concepts, and takes the specified number of top-ranked knowledge models as hot news knowledge models;
7) Hot news frequency statistics module: finds the ontology instances corresponding to the hot news knowledge models through the inverted index table, then finds the original articles corresponding to the ontology instances through the original document management system, counts how often the original articles are propagated, followed, and commented on, and outputs the resulting hot news analysis.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201110359688.9A CN102436497B (en) | 2011-11-14 | 2011-11-14 | Mainstream media report hot-spot analyzing system based on studying type web ontology language (OWL) modeling |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201110359688.9A CN102436497B (en) | 2011-11-14 | 2011-11-14 | Mainstream media report hot-spot analyzing system based on studying type web ontology language (OWL) modeling |
Publications (2)
Publication Number | Publication Date |
---|---|
CN102436497A (en) | 2012-05-02 |
CN102436497B (en) | 2014-12-31 |
Family
ID=45984559
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201110359688.9A Active CN102436497B (en) | 2011-11-14 | 2011-11-14 | Mainstream media report hot-spot analyzing system based on studying type web ontology language (OWL) modeling |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN102436497B (en) |
Families Citing this family (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103455758A (en) * | 2013-08-22 | 2013-12-18 | 北京奇虎科技有限公司 | Method and device for identifying malicious website |
CN105335888A (en) * | 2014-07-17 | 2016-02-17 | 南方科技大学 | Market monitoring system and method |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101923544A (en) * | 2009-06-15 | 2010-12-22 | 北京百分通联传媒技术有限公司 | Method for monitoring and displaying Internet hot spots |
CN102004792A (en) * | 2010-12-07 | 2011-04-06 | 百度在线网络技术(北京)有限公司 | Method and system for generating hot-searching word |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101441561B (en) * | 2007-11-23 | 2012-05-23 | 国际商业机器公司 | Method and device for generating service-oriented architecture strategy based on context model |
Also Published As
Publication number | Publication date |
---|---|
CN102436497A (en) | 2012-05-02 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Zuo et al. | Green building evaluation from a life-cycle perspective in Australia: A critical review | |
CN105260474B (en) | A kind of microblog users influence power computational methods based on information exchange network | |
CN103678670B (en) | Micro-blog hot word and hot topic mining system and method | |
CN103489045B (en) | Demand response load optimization potential evaluation method based on multi-scene design | |
CN106372072A (en) | Location-based recognition method for user relations in mobile social network | |
Zhang et al. | A system for tender price evaluation of construction project based on big data | |
CN105721279A (en) | Relationship circle excavation method and system of telecommunication network users | |
CN106570080A (en) | Multilevel semantic matching method for cloud manufacturing resource services | |
Hong et al. | Structural changes and growth factors of the ICT industry in Korea: 1995–2009 | |
Wang et al. | Integrated development of digital and energy industries: Paving the way for carbon emission reduction | |
CN107944755A (en) | A kind of business model design method and system calculated based on city | |
Kolosok et al. | Machine analysis of the UK electrical energy initiatives based on the e-petitions to the UK government and parliament | |
Tao et al. | Coupling coordination analysis and Spatiotemporal heterogeneity between data elements and green development in China | |
CN102436497B (en) | Mainstream media report hot-spot analyzing system based on studying type web ontology language (OWL) modeling | |
Li et al. | Digital development influencing mechanism on green innovation performance: a perspective of green innovation network | |
Tian et al. | The effect of resource abundance on Chinese urban green economic growth: A regional heterogeneity perspective | |
Pan et al. | Research on the status of e-commerce development based on big data and Internet technology | |
Fallah et al. | Forward patent citations as predictive measures for diffusion of emerging technologies | |
CN102004951B (en) | Role group dividing method based on role correlation | |
Li et al. | Mapping the scientific structure and evolution of renewable energy for sustainable development | |
Yoon et al. | Ontological functional modeling of technology for reusability | |
Szczepańczyk | Transformation towards circular economy in comparison with eco-innovation development on the example of EU member states | |
Liang | Allocation of multi-dimensional distance learning resource based on MOOC data | |
Wang et al. | Analysis of international competitive situation of key core technology in strategic emerging industries: New generation of information technology industry as an example | |
Liu et al. | Classification of China's county administrative units based on carbon emissions from energy consumption and economic indicators |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
C56 | Change in the name or address of the patentee | ||
CP01 | Change in the name or title of a patent holder |
Address after: 210006, 12 floor, Tong Tong Building, 501 South Zhongshan Road, Nanjing, Jiangsu; Patentee after: Jiangsu United Industrial Limited by Share Ltd; Address before: 210006, 12 floor, Tong Tong Building, 501 South Zhongshan Road, Nanjing, Jiangsu; Patentee before: Jiangsu Lianzhu Industrial Co.,Ltd. |